
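A Unified Compensation Algorithm for Additive and Convolutive Noise in Telephone Speech Recognition
Cite this article: HAN Zhao-Bing, ZHANG Shu-Wu, XU Bo, HUANG Tai-Yi. A Unified Compensation Algorithm for Additive and Convolutive Noise in Telephone Speech Recognition. ACTA AUTOMATICA SINICA, 2004, 30(2): 169-175.
Authors: HAN Zhao-Bing, ZHANG Shu-Wu, XU Bo, HUANG Tai-Yi
Affiliation: 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing
Funding: National Natural Science Foundation of P.R. China (69835003), the National "973" Plan (G19980300504)
Abstract: To compensate in a unified way for the effects of additive noise and convolutive channel response on telephone speech, this paper proposes a vector piecewise polynomial (VPP) approximation algorithm and applies it successfully to both stationary and non-stationary noise environments. For stationary noise, a batch EM (BEM) method is used in the log-spectral domain; for non-stationary noise, a recursive EM (REM) method is used in the cepstral domain. Both methods perform feature compensation based on the minimum mean squared error (MMSE) criterion. Experimental results show that, for large-vocabulary continuous speech recognition affected by background noise and telephone channels (both fixed-line and GSM), applying the algorithm reduces the recognition error rate by about 18%.

Keywords: speech recognition, piecewise polynomial approximation, environment compensation, recursive EM algorithm
Received: 2002-12-02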

An Additive and Convolutive Bias Compensation Algorithm for Telephone Speech Recognition
HAN Zhao-Bing, ZHANG Shu-Wu, XU Bo, HUANG Tai-Yi. An Additive and Convolutive Bias Compensation Algorithm for Telephone Speech Recognition. ACTA AUTOMATICA SINICA, 2004, 30(2): 169-175.
Authors: HAN Zhao-Bing, ZHANG Shu-Wu, XU Bo, HUANG Tai-Yi
Affiliation: 1. National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences, Beijing
Abstract: A vector piecewise polynomial (VPP) approximation algorithm is proposed for environment compensation of speech signals degraded by both additive and convolutive noise. Based on a model of the telephone environment, we propose a piecewise polynomial, consisting of two linear polynomials and one quadratic polynomial, to approximate the environment function precisely. The VPP is applied to both stationary and non-stationary noise. In the first case, batch EM is used in the log-spectral domain; in the second case, recursive EM with iterative stochastic approximation is developed in the cepstral domain. Both approaches are based on the minimum mean squared error (MMSE) criterion. Experimental results are presented on applying this approach to Mandarin large vocabulary continuous speech recognition (LVCSR) degraded by background noise and different transmission channels (such as fixed telephone lines and GSM). The method reduces the average character error rate (CER) by about 18%.
Keywords: speech recognition, piecewise polynomial approximation, environment compensation, recursive EM algorithm
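
The piecewise idea in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes the widely used log-spectral environment model y = x + h + log(1 + exp(n - x - h)), where x is clean speech, n additive noise, and h the channel, and it approximates the nonlinear term with two linear pieces joined by one quadratic. The breakpoint `A`, the function names, and the way the quadratic is fitted are hypothetical choices made for illustration; the authors' VPP coefficients and estimation procedure may differ.

```python
import math

# Hypothetical breakpoint (an assumption, not from the paper): chosen so
# that the quadratic piece equals log(2) at z = 0, like the exact function.
A = 4.0 * math.log(2.0)

def mismatch(z):
    """Exact nonlinear term log(1 + exp(z)), computed stably."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def piecewise_mismatch(z, a=A):
    """Two linear pieces joined by a quadratic that matches value and
    slope at z = -a and z = +a (illustrative fit only)."""
    if z <= -a:
        return 0.0                    # noise far below speech: y ~ x + h
    if z >= a:
        return z                      # noise dominates: y ~ n
    return (z + a) ** 2 / (4.0 * a)   # smooth transition region

def degrade(x, n, h, approx=False):
    """Noisy log-spectral value under the assumed environment model."""
    z = n - x - h
    g = piecewise_mismatch(z) if approx else mismatch(z)
    return x + h + g

# Largest gap between the exact term and its approximation on [-10, 10].
max_err = max(
    abs(mismatch(k / 100.0) - piecewise_mismatch(k / 100.0))
    for k in range(-1000, 1001)
)
```

With this particular breakpoint the approximation stays within 0.1 of the exact term over the sampled range. A polynomial form of this kind is convenient because expectations of polynomials under Gaussian models have closed forms, which is presumably what makes EM-based estimation tractable; that motivation is a general observation, not a claim about the paper's derivation.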
This article is indexed in CNKI, VIP (维普), and other databases.