首页 | 本学科首页   官方微博 | 高级检索  
     

基于目标驱动的多层MLLR自适应算法
引用本文:穆向禹,贾磊,张树武,徐波. 基于目标驱动的多层MLLR自适应算法[J]. 中文信息学报, 2003, 17(6): 40-47
作者姓名:穆向禹  贾磊  张树武  徐波
作者单位:中科院自动化所高新技术创新中心
基金项目:973项目资助(G19980 30 0 5 0 4 ),教育部留学归国人员启动基金资助
摘    要:本文在对语音识别中基于自适应回归树的极大似然线性变换(MLLR)模型自适应算法深刻分析的基础上,提出了一种基于目标驱动的多层MLLR自适应(TMLLR)算法。这种算法基于目标驱动的原则,引入反馈机制,根据目标函数似然概率的增加来动态决定MLLR变换的变换类,大大提高了系统的识别率。并且由于这种算法的特殊多层结构,减少了许多中间的冗余计算,算法在具有较高的自适应精度的同时还具有较快的自适应速度。在有监督自适应实验中,经过此算法自适应后的系统识别率比基于自适应回归树的MLLR算法自适应后系统的误识率降低了10% ,自适应速度也比基于自适应回归树的MLLR算法快近一倍。

关 键 词:计算机应用  中文信息处理  语音识别  模型自适应  自适应回归树  极大似然线性变换  
文章编号:1003-0077(2003)06-0039-08
修稿时间:2003-05-19

Multi-Layer Structure MLLR Adaptation Algorithm Based on Target-Driven
MU Xiang yu,JIA Lei,ZHANG Shu wu,XU Bo. Multi-Layer Structure MLLR Adaptation Algorithm Based on Target-Driven[J]. Journal of Chinese Information Processing, 2003, 17(6): 40-47
Authors:MU Xiang yu  JIA Lei  ZHANG Shu wu  XU Bo
Affiliation:Hi-tech-Innovation Centre , Chinese Academy of Sciences
Abstract:In this paper, a new algorithm called Target Driven based multiple layer maximum likelihood linear regression (TMLLR) is proposed for model adaptation in speech recognition. The algorithm can be regarded as the improvement of maximum likelihood linear regression (MLLR) using the generation of regression class trees for model adaptation. Different from conventional MLLR, the regression classes of TMLLR are generated dynamically based on increment of target function and a multi layer feedback mechanism. Because of the special multi layer structure of TMLLR, some redundant computing cost can be reduced, which caused much faster adaptation speed. The target driven strategy is aimed at increasing the likelihood probability, which is same to measure of speech recognition, so a higher recognition accuracy of the system can be achieved. In comparison with the conventional MLLR using the generation of regression class tree, TMLLR achieved a further word error rate reduction by 10% and had only about half computational time consuming in supervised adaptation experiments.
Keywords:computer application  Chinese information processing  speech recognition  model adaptation  regression class trees  maximum likelihood linear regression (MLLR)
本文献已被 CNKI 维普 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号