首页 | 本学科首页   官方微博 | 高级检索  
     


Duration-Distribution-Based HMM for Speech Recognition
Authors:Wang Zuo-ying and Xiao Xi
Affiliation:(1) Department of Electronic Engineering, Tsinghua University, Beijing, 100084, China
Abstract:To overcome the defects of the duration modeling in the homogeneous Hidden Markov Model (HMM) for speech recognition, a duration-distribution-based HMM (DDBHMM) is proposed in this paper based on a formalized definition of a left-to-right inhomogeneous Markov model. It has been demonstrated that it can be identically defined by either the state duration or the state transition probability. The speaker-independent continuous speech recognition experiments show that by only modeling the state duration in DDBHMM, a significant improvement (17.8% error rate reduction) can be achieved compared with the classical HMM. The ideal properties of DDBHMM give promise to many aspects of speech modeling, such as the modeling of the state duration, speed variation, speech discontinuity, and interframe correlation. Translated from Acta Electronica Sinica, 2004, 32(1): 46–49 (in Chinese)
Keywords:duration  speech recognition  DDBHMM
本文献已被 万方数据 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号