Speech Emotion Recognition Method Based on Multiple Kernel Learning Feature Fusion


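Cite this article:	WANG Zhongmin, LIU Ge, SONG Hui. Speech Emotion Recognition Method Based on Multiple Kernel Learning Feature Fusion[J]. Computer Engineering, 2019, 45(8): 248-254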

Authors:	WANG Zhongmin, LIU Ge, SONG Hui

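Affiliations:	School of Computer Science and Technology, Xi’an University of Posts and Telecommunications, Xi’an 710121; Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi’an University of Posts and Telecommunications, Xi’an 710121; School of Computer Science and Technology, Xi’an University of Posts and Telecommunications, Xi’an 710121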

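Funding:	National Natural Science Foundation of China; Shaanxi Province Science and Technology Coordination and Innovation Project; Special Scientific Research Project of the Shaanxi Provincial Department of Education; Science and Technology Project of the Xi’an Science and Technology Bureau; Graduate Innovation and Entrepreneurship Fund of Xi’an University of Posts and Telecommunications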

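Abstract:	In speech emotion recognition, extracting Mel-Frequency Cepstral Coefficients (MFCC) loses spectral feature information, which lowers recognition accuracy. To address this, a speech emotion recognition method that combines MFCC and spectrogram features is proposed. MFCC features are extracted from the audio signal, the signal is converted into a spectrogram, and a Convolutional Neural Network (CNN) extracts image features from the spectrogram. On this basis, a Multiple Kernel Learning (MKL) algorithm fuses the audio features, and the resulting kernel function is applied to a Support Vector Machine (SVM) for emotion classification. Experimental results on two speech emotion datasets show that, compared with single-feature classifiers, the method achieves a speech emotion recognition accuracy of up to 96%.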
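Keywords:	speech emotion recognition; multiple kernel learning; convolutional neural network; Mel-frequency cepstral coefficients; spectrogram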
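
The kernel-fusion step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: the record does not say which MKL solver, kernels, or features were used, so the sketch assumes RBF kernels and weights each kernel by its kernel-target alignment, a simple heuristic stand-in for a full MKL optimizer. The feature vectors, the binary labels, and all function names (`rbf_gram`, `alignment`, `fuse_kernels`) are hypothetical and exist only for illustration.

```python
import math

def rbf_gram(X, gamma):
    """Gram matrix of the RBF kernel exp(-gamma * ||x - y||^2) over the rows of X."""
    return [[math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))
             for y in X] for x in X]

def frobenius(A, B):
    """Frobenius inner product of two equally sized matrices."""
    return sum(a * b for ra, rb in zip(A, B) for a, b in zip(ra, rb))

def alignment(K, labels):
    """Kernel-target alignment between K and the ideal kernel y y^T (labels in {-1, +1})."""
    Y = [[yi * yj for yj in labels] for yi in labels]
    return frobenius(K, Y) / math.sqrt(frobenius(K, K) * frobenius(Y, Y))

def fuse_kernels(grams, labels):
    """Convex combination of Gram matrices, weighted by non-negative alignment.

    Assumption: alignment-based weights replace a learned MKL solution.
    """
    scores = [max(alignment(K, labels), 0.0) for K in grams]
    total = sum(scores)
    if total > 0:
        weights = [s / total for s in scores]
    else:
        weights = [1.0 / len(grams)] * len(grams)  # fall back to uniform weights
    n = len(labels)
    fused = [[sum(w * K[i][j] for w, K in zip(weights, grams)) for j in range(n)]
             for i in range(n)]
    return weights, fused

# Toy stand-ins for the two feature views (hypothetical values, not real data):
# one row per utterance, MFCC-style vectors and CNN spectrogram embeddings.
mfcc_feats = [[0.1, 0.2], [0.2, 0.1], [0.9, 1.0], [1.0, 0.8]]
cnn_feats = [[0.5, 0.4, 0.1], [0.4, 0.5, 0.2], [0.1, 0.9, 0.8], [0.2, 1.0, 0.7]]
labels = [-1, -1, 1, 1]  # two emotion classes, encoded as -1 / +1

weights, fused = fuse_kernels(
    [rbf_gram(mfcc_feats, 1.0), rbf_gram(cnn_feats, 1.0)], labels
)
print("kernel weights:", weights)
```

A fused Gram matrix of this kind could then be handed to any SVM implementation that accepts a precomputed kernel, for example scikit-learn's `SVC(kernel="precomputed")`. Multi-class emotion labels would need a one-vs-rest or similar extension of the alignment target.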
Speech Emotion Recognition Method Based on Multiple Kernel Learning Feature Fusion

WANG Zhongmin, LIU Ge, SONG Hui. Speech Emotion Recognition Method Based on Multiple Kernel Learning Feature Fusion[J]. Computer Engineering, 2019, 45(8): 248-254

Authors:	WANG Zhongmin, LIU Ge, SONG Hui

Affiliation:	School of Computer Science and Technology, Xi’an University of Posts and Telecommunications, Xi’an 710121, China; Shaanxi Key Laboratory of Network Data Analysis and Intelligent Processing, Xi’an University of Posts and Telecommunications, Xi’an 710121, China


Keywords:	speech emotion recognition; Multiple Kernel Learning (MKL); Convolutional Neural Network (CNN); Mel-Frequency Cepstral Coefficients (MFCC); spectrogram
This article has been indexed by VIP, Wanfang Data, and other databases.
