语音识别系统中上下文相关声学模型建模优化 Refining Context-Dependent Tonal Acoustic Modeling in Mandarin LVCSR期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

语音识别系统中上下文相关声学模型建模优化

引用本文：	彭荻,刘刚,郭军.语音识别系统中上下文相关声学模型建模优化[J].北京邮电大学学报,2006,29(22):188-191.

作者姓名：	彭荻刘刚郭军

作者单位：	北京邮电大学信息工程学院, 北京 100876

摘要：	在实验中发现，某些带调三音子的训练数据稀疏会引起识别错误率的上升,为了在一定程度上减少这种影响，提出了使用其无调三音子的模型参数对有调三音子进行初始化。此外还调整了决策树状态捆绑的停止门限，并且采用了混合度分量的自适应增长训练。在863语音库上的实验结果表明，所有这些获得了一定的音子识别性能提高，同时也一定程度上压缩了声学模型大小。
关键词：	声学模型语音识别三音子
收稿时间：	2006-10-18
Refining Context-Dependent Tonal Acoustic Modeling in Mandarin LVCSR

Affiliation:	School of Information Engineering, Beijing University of Posts and Telecommunications, Beijing 100876, China

Abstract:	In order to minimize the recognition errors caused by inaccurate model estimations from those toned triphones with limited training samples, we proposed to initialize toned triphones using their own toneless triphone model parameters. Besides, works concerning stopping criteria of decision tree state tying as well as mixture component adaptation are also explored to obtain better performance as well as reduce model scale. Experiments results have shown that, on the 863 corpus, along with all this improvements our system achieves certain increase of phone recognition rate, with much more trainable model scale as well.

Keywords:	acoustic model speech recognition triphone tone

	点击此处可从《北京邮电大学学报》浏览原始摘要信息
	点击此处可从《北京邮电大学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏