首页 | 本学科首页   官方微博 | 高级检索  
     

多预测子融合实时连续语音识别输出词正误判别
引用本文:付跃文,杜利民.多预测子融合实时连续语音识别输出词正误判别[J].中文信息学报,2005,19(6):86-93.
作者姓名:付跃文  杜利民
作者单位:中国科学院声学研究所语音交互信息技术中心,北京 100080
摘    要:本文在采用堆栈译码词网重估输出作为识别最终输出的连续语音识别实时解码条件下,利用决策树方法将多个预测子融合,对识别输出词进行正确和错误的判别。本文首先构造了词后验概率、词长、相邻词的后验概率、词的声学和语言得分等共13 个预测子,然后利用决策树方法,通过选择不同的预测子组合方式和适当的决策树建树参数,筛选出预测子的最佳组合,建立优化的决策树进行输出词的正误判别。实验结果表明:利用局域词图计算的词后验概率与词长、相邻词的后验概率等几种实时预测子融合后,对识别输出词的正误判别能力得到提高,并且在实时性和分类效果两个方面优于n - best 输出的相应结果,相对于基线系统, 则分类错误率下降41. 4 %。实验结果也表明本文提出的相邻词的后验概率是相对重要的预测子。

关 键 词:计算机应用  中文信息处理  连续语音识别  预测子  决策树  
文章编号:1003-0077(2005)06-0084-08
收稿时间:2004-09-27
修稿时间:2004-11-30

Combination of Multiple Predictors for Correct - Incorrect Classification of Output Words in Real Time Continuous Speech Recognition
FU Yue-wen,DU Li-min.Combination of Multiple Predictors for Correct - Incorrect Classification of Output Words in Real Time Continuous Speech Recognition[J].Journal of Chinese Information Processing,2005,19(6):86-93.
Authors:FU Yue-wen  DU Li-min
Affiliation:Center for Speech Interaction Technology Research ,Institute of Acoustics , Chinese Academy of Sciences
Abstract:Under the decoding strategy of using stack decoding to rescore the word trellis to generate final output,this paper uses decision tree to combine multiple predictors to identify each of recognition output words as correct or incorrect.A series of predictors are constructed,including word posterior probability,word length,word posterior probability of neighboring words,13 in all.Optimal combination of predictors is found and best decision tree is constructed for correct-incorrect classification of output words by testing different combination of predictors and choosing appropriate tree parameters.The experimental results show that the combination of local word posterior probabilities(LWPP) with some of other predictors constructed by this paper,including mainly word length and LWPPs of neighboring words,can give a significant improvement in classification performance,and is better in time consumption and quality than the corresponding results from n-best list.Compared with baseline system,the classification error rate getsan improvement of 41.4%.The experimental results also show that posterior probabilities of neighboring words proposed by this paper are among relatively important predictors.
Keywords:computer application  Chinese information processing  continuous speech recognition  predictor  decision tree
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号