首页 | 本学科首页   官方微博 | 高级检索  
     

基于DBM-LSTM的多特征语音情感识别
引用本文:高帆,张雪英,黄丽霞,李宝芸.基于DBM-LSTM的多特征语音情感识别[J].计算机工程与设计,2020,41(2):465-470.
作者姓名:高帆  张雪英  黄丽霞  李宝芸
作者单位:太原理工大学 信息与计算机学院,山西 太原 030024;太原理工大学 信息与计算机学院,山西 太原 030024;太原理工大学 信息与计算机学院,山西 太原 030024;太原理工大学 信息与计算机学院,山西 太原 030024
基金项目:国家自然科学基金;研究生教育创新项目
摘    要:为增强不同情感特征的融合程度和语音情感识别模型的鲁棒性,提出一种神经网络结构DBM-LSTM用于语音情感识别。利用深度受限玻尔兹曼机的特征重构原理将不同的情感特征进行融合;利用长短时记忆单元对短时特征进行长时建模,增强语音情感识别模型的鲁棒性;在柏林情感语音数据库上进行分类实验。研究结果表明,与传统识别模型相比,DBM-LSTM网络结构更适用于多特征语音情感识别任务,最优识别结果提升11%。

关 键 词:语音情感识别  深度受限玻尔兹曼机  长短时记忆单元  柏林情感语音数据库  多特征

Multi-feature speech emotion recognition based on DBM-LSTM
GAO Fan,ZHANG Xue-ying,HUANG Li-xia,LI Bao-yun.Multi-feature speech emotion recognition based on DBM-LSTM[J].Computer Engineering and Design,2020,41(2):465-470.
Authors:GAO Fan  ZHANG Xue-ying  HUANG Li-xia  LI Bao-yun
Affiliation:(College of Information and Computer,Taiyuan University of Technology,Taiyuan 030024,China)
Abstract:To enhance the fusion degree of different emotional features and the robustness of speech emotion recognition model,a neural network structure of deep-restricted Boltzmann machine and long short term memory(DBM-LSTM)for speech emotion recognition was proposed.The principle of feature reconstruction based on DBM was used to fuse the different emotional features.LSTM was used to model short-term features for a long-term to enhance the robustness of speech emotion recognition model.The classification experiments were carried out on EMO-DB database.The results show that,compared with traditional recognition model,DBM-LSTM network structure is more suitable for multi-feature speech emotion recognition tasks.The optimal recognition results can be improved by 11%.
Keywords:speech emotion recognition  DBM  LSTM  EMO-DB  multi-feature
本文献已被 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号