Comparison of Khasi Speech Representations with Different Spectral Features and Hidden Markov States期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Comparison of Khasi Speech Representations with Different Spectral Features and Hidden Markov States

Authors:	Bronson Syiem Sushanta Kabir Dutta Juwesh Binong Lairenlakpam Joyprakash Singh

Affiliation:	Department of Electronics and Communication Engineering, North-Eastern Hill University, Shillong, 793022, India

Abstract:	In this paper, we present a comparison of Khasi speech representations with four different spectral features and novel extension towards the development of Khasi speech corpora. These four features include linear predictive coding (LPC), linear prediction cepstrum coefficient (LPCC), perceptual linear prediction (PLP), and Mel frequency cepstral coefficient (MFCC). The 10-hour speech data were used for training and 3-hour data for testing. For each spectral feature, different hidden Markov model (HMM) based recognizers with variations in HMM states and different Gaussian mixture models (GMMs) were built. The performance was evaluated by using the word error rate (WER). The experimental results show that MFCC provides a better representation for Khasi speech compared with the other three spectral features.

Keywords:	Acoustic model (AM) Gaussian mixture model (GMM) hidden Markov model (HMM) language model (LM) linear predictive coding (LPC) linear prediction cepstral coefficient (LPCC) Mel frequency cepstral coefficient (MFCC) perceptual linear prediction (PLP)
本文献已被 ScienceDirect 等数据库收录！
	点击此处可从《电子科技学刊:英文版》浏览原始摘要信息
	点击此处可从《电子科技学刊:英文版》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏