基于DBM-LSTM的多特征语音情感识别 Multi-feature speech emotion recognition based on DBM-LSTM期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于DBM-LSTM的多特征语音情感识别

引用本文：	高帆,张雪英,黄丽霞,李宝芸.基于DBM-LSTM的多特征语音情感识别[J].计算机工程与设计,2020,41(2):465-470.

作者姓名：	高帆张雪英黄丽霞李宝芸

作者单位：	太原理工大学信息与计算机学院,山西太原 030024;太原理工大学信息与计算机学院,山西太原 030024;太原理工大学信息与计算机学院,山西太原 030024;太原理工大学信息与计算机学院,山西太原 030024

基金项目：	国家自然科学基金;研究生教育创新项目

摘要：	为增强不同情感特征的融合程度和语音情感识别模型的鲁棒性,提出一种神经网络结构DBM-LSTM用于语音情感识别。利用深度受限玻尔兹曼机的特征重构原理将不同的情感特征进行融合;利用长短时记忆单元对短时特征进行长时建模,增强语音情感识别模型的鲁棒性;在柏林情感语音数据库上进行分类实验。研究结果表明,与传统识别模型相比,DBM-LSTM网络结构更适用于多特征语音情感识别任务,最优识别结果提升11%。
关键词：	语音情感识别深度受限玻尔兹曼机长短时记忆单元柏林情感语音数据库多特征
Multi-feature speech emotion recognition based on DBM-LSTM

GAO Fan,ZHANG Xue-ying,HUANG Li-xia,LI Bao-yun.Multi-feature speech emotion recognition based on DBM-LSTM[J].Computer Engineering and Design,2020,41(2):465-470.

Authors:	GAO Fan ZHANG Xue-ying HUANG Li-xia LI Bao-yun

Affiliation:	(College of Information and Computer,Taiyuan University of Technology,Taiyuan 030024,China)

Abstract:	To enhance the fusion degree of different emotional features and the robustness of speech emotion recognition model,a neural network structure of deep-restricted Boltzmann machine and long short term memory(DBM-LSTM)for speech emotion recognition was proposed.The principle of feature reconstruction based on DBM was used to fuse the different emotional features.LSTM was used to model short-term features for a long-term to enhance the robustness of speech emotion recognition model.The classification experiments were carried out on EMO-DB database.The results show that,compared with traditional recognition model,DBM-LSTM network structure is more suitable for multi-feature speech emotion recognition tasks.The optimal recognition results can be improved by 11%.

Keywords:	speech emotion recognition DBM LSTM EMO-DB multi-feature
本文献已被维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏