首页 | 本学科首页   官方微博 | 高级检索  
     

一种针对区分性训练的受限线性搜索优化方法
引用本文:刘聪,胡郁,戴礼荣,王仁华.一种针对区分性训练的受限线性搜索优化方法[J].模式识别与人工智能,2010,23(4):450-455.
作者姓名:刘聪  胡郁  戴礼荣  王仁华
作者单位:中国科学技术大学 电子工程与信息科学系 讯飞语音实验室 合肥 230027
摘    要:提出一种称为“受限线性搜索”的优化方法,并用于语音识别中混合高斯的连续密度隐马尔科夫(CDHMM)模型的区分性训练。该方法可用于优化基于最大互信息(MMI)准则的区分性训练目标函数。在该方法中,首先把隐马尔科夫模型(HMM)的区分性训练问题看成一个受限的优化问题,并利用模型间的KL度量作为优化过程中的一个限制。再基于线性搜索的思想,指出通过限制更新前后模型间的KL度量,可将HMM的参数表示成一种简单的二次形式。该方法可用于优化混合高斯CDHMM模型中的任何参数,包括均值、协方差矩阵、高斯权重等。将该方法分别用于中英文两个标准语音识别任务上,包括英文TIDIGITS数据库和中文863数据库。实验结果表明,该方法相对传统的扩展Baum-Welch方法在识别性能和收敛特性上都取得一致提升。

关 键 词:自动语音识别  区分性训练  受限线性搜索(CLS)  
收稿时间:2009-06-25

A Constrained Line Search Optimization Method for Discriminative Training of HMMs
LIU Cong,HU Yu,DAI Li-Rong,WANG Ren-Hua.A Constrained Line Search Optimization Method for Discriminative Training of HMMs[J].Pattern Recognition and Artificial Intelligence,2010,23(4):450-455.
Authors:LIU Cong  HU Yu  DAI Li-Rong  WANG Ren-Hua
Affiliation:iFlytek Speech Laboratory,Department of Electronic Engineering and Information Science,University of Science and Technology of China,Hefei 230027
Abstract:In this paper a optimization algorithm called constrained line search (CLS) is proposed for discriminative training (DT) of Gaussian mixture Continuous Density Hidden Markov Model (CDHMM) in speech recognition. The CLS method is used to optimize the objective function of Maximum Mutual Information (MMI) criterion based discriminative training. In this method, discriminative training of HMM is firstly treated as a constrained optimization problem, and Kullback-Leibler divergence (KLD) between models is explicitly applied as a constraint during optimization. Based upon the idea of line search, it is pointed out that a simple formula of HMM parameters can be expressed as a simple quadratic formula by constraining the KLD between HMM of two successive iterations. The proposed CLS method can be applied to optimize all model parameters in Gaussian mixture CDHMMs, including means, covariances, and mixture weights. The proposed CLS approach is investigated on two benchmark speech recognition tasks of both English and Chinese, including TIDIGITS and Mandarin 863 database. Experimental results show that the CLS optimization method consistently outperforms the conventional extended Baum-Welch (EBW) method in both recognition performance and convergence characteristic.
Keywords:Automatic Speech Recognition  Discriminative Training  Constrained Line Search (CLS)  
本文献已被 万方数据 等数据库收录!
点击此处可从《模式识别与人工智能》浏览原始摘要信息
点击此处可从《模式识别与人工智能》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号