首页 | 本学科首页   官方微博 | 高级检索  
     

一种基于改进CP网络与HMM相结合的混合音素识别方法
引用本文:邓伟,赵荣椿.一种基于改进CP网络与HMM相结合的混合音素识别方法[J].数据采集与处理,2000,15(1):6-11.
作者姓名:邓伟  赵荣椿
作者单位:1. 苏州大学计算机工程系,苏州,215006
2. 西北工业大学计算机科学与工程系,西安,710072
摘    要:提出了一种基于改进对偶传播(CP)神经网络与隐驰尔可夫模型(HMM)相结合的混合音素识别方法.这一方法的特点是用一个具有有指导学习矢量量化(LVQ)和动态节点分配等特性的改进的CP网络生成离散HMM音素识别系统中的码书。因此,用这一方法构造的混合音素识别系统中的码书实际上是一个由有指导LVQ算法训练的具有很强分类能力的高性能分类器,这就意味着在用HMM对语音信号进行建模之前,由码书产生的观测序列中

关 键 词:隐马尔可夫模型  音素识别  CP网络  语音识别

A Hybrid Approach for Phoneme Recognition Based on Combination of Improved CP Neural Network and HMM
Deng Wei,Zhao Rongchun.A Hybrid Approach for Phoneme Recognition Based on Combination of Improved CP Neural Network and HMM[J].Journal of Data Acquisition & Processing,2000,15(1):6-11.
Authors:Deng Wei  Zhao Rongchun
Abstract:Proposes a hybrid approach for phoneme recognition based on combination of improved counter propagation (CP) neural network and hidden Markov model (HMM). The characteristic of the approach is that the codebook in a discrete HMM based phoneme recognition system is generated by a modified CP neural network with a few improvements, such as supervised learning vector quantization (LVQ) and dynamic node allocation. Hence, in effect, the codebook in the hybrid phoneme recognition system created through the approach is a high performance classifier with much better discriminating power trained by the supervised LVQ algorithm. It means that before a HMM is used for modeling a speech signal, the observation sequence generated by such a codebook contains highly discriminating information. This will greatly improve the recognition performance of HMM at phoneme level. On the other hand, since the training is done for an improved CP neural network with several new designs, the LVQ learning in the training process can be automatically performed in a supervised mode; learning is accelerated; system convergence is improved; more accurate classification is developed; at the same time, size of codebook is effectively reduced, resulting in the additional advantage of making HMM parameter estimation easier. Finally, through two speaker dependent phoneme recognition experiments, the hybrid approach is compared with the traditional VQ HMM phoneme recognition approach, which uses K means generated codebook. The results show that a correct recognition rate of 98%~99% can be achieved by the hybrid recognition system, and the error rate is 4~6 times lower than that of VQ HMM recognition system using a K means generated codebook of the same size.
Keywords:neural network  hidden Markov model  hybrid  phoneme recognition  counter  propagation  learning vector quantization
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号