首页 | 本学科首页   官方微博 | 高级检索  
     

实用语音情感的特征分析与识别的研究
引用本文:黄程韦,赵艳,金赟,于寅骅,赵力.实用语音情感的特征分析与识别的研究[J].电子与信息学报,2011,33(1):112-116.
作者姓名:黄程韦  赵艳  金赟  于寅骅  赵力
作者单位:东南大学水声信号处理教育部重点实验室 南京 210096 徐州师范大学物理与电子工程学院 徐州 221116
基金项目:国家自然科学基金(60472058,60975017,51075068); 江苏省自然科学基金(BK2008291)资助课题
摘    要: 该文针对语音情感识别在实际中的应用,研究了烦躁等实用语音情感的分析与识别。通过计算机游戏诱发的方式采集了高自然度的语音情感数据,提取了74种情感特征,分析了韵律特征、音质特征与情感维度之间的关系,对烦躁等实用语音情感的声学特征进行了评价与选择,提出了针对实际应用环境的可拒判的实用语音情感识别方法。实验结果表明,文中采用的语音情感特征,能较好识别烦躁等实用语音情感,平均识别率达到75%以上。可拒判的实用语音情感识别方法,对模糊的和未知的情感类别的分类进行了合理的决策,在语音情感的实际应用中具有重要的意义。

关 键 词:语音识别  实用语音情感  韵律特征  音质特征  拒判方法
收稿时间:2009-06-16

A Study on Feature Analysis and Recognition of Practical Speech Emotion
Huang Cheng-wei,Zhao Yan,Jin Yun,Yu Yin-hua,Zhao Li.A Study on Feature Analysis and Recognition of Practical Speech Emotion[J].Journal of Electronics & Information Technology,2011,33(1):112-116.
Authors:Huang Cheng-wei  Zhao Yan  Jin Yun  Yu Yin-hua  Zhao Li
Affiliation:Key Laboratory of Underwater Acoustic Signal Processing of Ministry of Education, Southeast University,
Nanjing 210096, China School of Physics and Electronics Engineering, Xuzhou Normal University, Xuzhou 221116, China
Abstract:Practical speech emotions as impatience and happiness are studied especially for evaluation of emotional well-being in real world applications. Induced natural speech emotion data is collected with a computer game, 74 emotion features are extracted, prosody features and voice quality features are analyzed according to dimensional emotion model, evaluation and selection of acoustic features are carried out for practical emotions in this paper, a method of practical speech emotion classification with rejection decision is proposed for real world occasions. The experiment results show, the speech features analyzed in this paper are suitable for classification of practical speech emotions like impatience and happiness, average recognition rate is above 75%, and the method of emotion classification with rejection decision is necessary for the proper recognition decision of ambiguous or unknown emotion samples, especially for the real world challenges.
Keywords:Speech recognition  Practical speech emotion  Prosody features  Voice quality features  Rejection decision
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《电子与信息学报》浏览原始摘要信息
点击此处可从《电子与信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号