首页 | 本学科首页   官方微博 | 高级检索  
     

基于邻居相似现象的情感说话人识别
引用本文:陈力,杨莹春.基于邻居相似现象的情感说话人识别[J].浙江大学学报(自然科学版 ),2012,46(10):1790-1795.
作者姓名:陈力  杨莹春
作者单位:浙江大学 计算机科学与技术学院,浙江 杭州 310027
基金项目:国家自然科学基金资助项目(60970080);核高基重大专项资助项目(2009ZX01039-002-001-04)
摘    要:根据语音学的研究,提出中性时发音相似的说话人,在情感状态下的发音人相似的假设--邻居相似现象,并通过定量和定性的分析验证了该假设,即在音素内容相同的情况下,同一说话人的中性模型和情感模型对应高斯分量的“邻居”基本类似.为了解决说话人情感变化时语音短时特征的分布与中性语音模型存在差异的问题,提出说话人情感模型合成的方法--将开发库中学习到的中性 情感变化规律移植到评测库中,根据说话人的中性模型合成出情感模型.从邻居相似现象的特性出发,根据KL距离选取该说话人中性下若干相似的邻居,根据基于邻居的方法和基于邻居变换的方法,合成出该说话人的情感模型.MASC库上的实验结果表明,该方法的识别准确率比传统的GMM-UBM算法提高了2.81%,与情感属性映射(EAP)方法相比识别率提高了1.3%.

关 键 词:情感说话人识别  邻居相似现象  情感模型合成

Emotional speaker recognition based on similar neighbor phenomenon
CHEN Li,YANG Ying-chun.Emotional speaker recognition based on similar neighbor phenomenon[J].Journal of Zhejiang University(Engineering Science),2012,46(10):1790-1795.
Authors:CHEN Li  YANG Ying-chun
Affiliation:(College of Computer Science and Technology,Zhejiang University,Hangzhou 310027,China)
Abstract:Based on the research on phonetics, the assumption that similar-sounding speakers in neutral condition also sound similar when they change their emotions was proposed, known as Similar Neighbor Phenomenon. Additionally, the qualitative and quantitative analysis was conducted to prove the assumption. The “neighbors” of neutral and emotional model of the similar speaker are almost the same under the identical phonetic event. The emotional model synthesis method was proposed in order to overcome the problem that the distribution of acoustic feature under emotional states was different from that of the neutral speaker model. The method can learn the neutral-emotion transformation rules from the development corpus, and apply them into the evaluation corpus to construct the emotional speaker model from his/her neutral one. From the view of Similar Neighbor Phenomenon, neighbors under neutral were selected by the KL distance. The emotional models were constructed by the neighbors-based transformation method and shift-based transformation method. The experiments carried on MASC showed an identification rate (IR) increase of 2.81% over the GMM-UBM algorithm and 1.3% over the emotional attribute projection (EAP) algorithm.
Keywords:
本文献已被 CNKI 等数据库收录!
点击此处可从《浙江大学学报(自然科学版 )》浏览原始摘要信息
点击此处可从《浙江大学学报(自然科学版 )》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号