首页 | 本学科首页   官方微博 | 高级检索  
     

语音识别系统中上下文相关声学模型建模优化
引用本文:彭 荻,刘 刚,郭 军.语音识别系统中上下文相关声学模型建模优化[J].北京邮电大学学报,2006,29(22):188-191.
作者姓名:彭 荻  刘 刚  郭 军
作者单位:北京邮电大学 信息工程学院, 北京 100876
摘    要:在实验中发现,某些带调三音子的训练数据稀疏会引起识别错误率的上升,为了在一定程度上减少这种影响,提出了使用其无调三音子的模型参数对有调三音子进行初始化。此外还调整了决策树状态捆绑的停止门限,并且采用了混合度分量的自适应增长训练。在863语音库上的实验结果表明,所有这些获得了一定的音子识别性能提高,同时也一定程度上压缩了声学模型大小。

关 键 词:声学模型  语音识别  三音子
收稿时间:2006-10-18

Refining Context-Dependent Tonal Acoustic Modeling in Mandarin LVCSR
Affiliation:School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China
Abstract:In order to minimize the recognition errors caused by inaccurate model estimations from those toned triphones with limited training samples, we proposed to initialize toned triphones using their own toneless triphone model parameters. Besides, works concerning stopping criteria of decision tree state tying as well as mixture component adaptation are also explored to obtain better performance as well as reduce model scale. Experiments results have shown that, on the 863 corpus, along with all this improvements our system achieves certain increase of phone recognition rate, with much more trainable model scale as well.
Keywords:acoustic model  speech recognition  triphone  tone
点击此处可从《北京邮电大学学报》浏览原始摘要信息
点击此处可从《北京邮电大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号