首页 | 本学科首页   官方微博 | 高级检索  
     

基于模型自适应的声效鲁棒性语音识别算法
引用本文:晁 浩,宋 成,薛 霄,刘志中.基于模型自适应的声效鲁棒性语音识别算法[J].计算机工程与应用,2016,52(2):156-160.
作者姓名:晁 浩  宋 成  薛 霄  刘志中
作者单位:河南理工大学 计算机科学与技术学院,河南 焦作 454000
摘    要:针对声音效果变化引起的语音声学特性的改变,提出基于声学模型自适应的方法。分析了正常模式下训练的声学模型在识别其他声效模式下语音的表现;根据随机段模型的模型特性,将最大似然线性回归方法引入到随机段模型系统中,并利用自适应后的声学模型来识别对应的声效模式下的语音。在“863-test”测试集上进行的汉语连续语音识别实验显示,正常模式下训练的声学模型识别其他四种声效模式下的语音时,识别精度均有较大程度的下降;而自适应后的系统在识别对应的声效模式的语音时,识别精度有了明显的改观。表明了基于声学模型自适应的方法在解决语音识别中声音效果变化问题上的有效性。

关 键 词:语音识别  声音效果  自适应  最大似然线性回归  

Vocal effort related robust speech recognition based on adaptation method
CHAO Hao,SONG Cheng,XUE Xiao,LIU Zhizhong.Vocal effort related robust speech recognition based on adaptation method[J].Computer Engineering and Applications,2016,52(2):156-160.
Authors:CHAO Hao  SONG Cheng  XUE Xiao  LIU Zhizhong
Affiliation:School of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, Henan 454000, China
Abstract:Adaptation of acoustic models is presented to cope with the acoustic variability caused vocal effort variability in Mandarin speech recognition. Acoustic models trained on normal speech are applied to recognize sentences under the remaining four vocal effort modes. The maximum likelihood linear regression adaptation method is extended to the stochastic segment model, and the acoustic models after adaptation are used to recognize speech of corresponding vocal effort mode. Experiments conducted on “863-test” show that there is significant?decrease in recognition accuracy in case of mismatched speech models, and the recognition performance can be improved considerably by adaptation. This proves that adaptation of acoustic models is effective in solving the acoustic variability caused vocal effort.
Keywords:speech recognition  vocal effort  adaptation  maximum likelihood linear regression  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号