首页 | 本学科首页   官方微博 | 高级检索  
     


Comparison of Khasi Speech Representations with Different Spectral Features and Hidden Markov States
Authors:Bronson Syiem  Sushanta Kabir Dutta  Juwesh Binong  Lairenlakpam Joyprakash Singh
Affiliation:Department of Electronics and Communication Engineering, North-Eastern Hill University, Shillong, 793022, India
Abstract:In this paper, we present a comparison of Khasi speech representations with four different spectral features and novel extension towards the development of Khasi speech corpora. These four features include linear predictive coding (LPC), linear prediction cepstrum coefficient (LPCC), perceptual linear prediction (PLP), and Mel frequency cepstral coefficient (MFCC). The 10-hour speech data were used for training and 3-hour data for testing. For each spectral feature, different hidden Markov model (HMM) based recognizers with variations in HMM states and different Gaussian mixture models (GMMs) were built. The performance was evaluated by using the word error rate (WER). The experimental results show that MFCC provides a better representation for Khasi speech compared with the other three spectral features.
Keywords:Acoustic model (AM)  Gaussian mixture model (GMM)  hidden Markov model (HMM)  language model (LM)  linear predictive coding (LPC)  linear prediction cepstral coefficient (LPCC)  Mel frequency cepstral coefficient (MFCC)  perceptual linear prediction (PLP)
本文献已被 ScienceDirect 等数据库收录!
点击此处可从《电子科技学刊:英文版》浏览原始摘要信息
点击此处可从《电子科技学刊:英文版》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号