Discriminative tonal feature extraction method in mandarin speech recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

Discriminative tonal feature extraction method in mandarin speech recognition

Authors:	HUANG Hao ZHU Jie

Affiliation:	Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China

Abstract:	To utilize the supra-segmental nature of Mandarin tones, this article proposes a feature extraction method for hidden markov model (HMM) based tone modeling. The method uses linear transforms to project F0 (fundamental frequency) features of neighboring syllables as compensations, and adds them to the original F0 features of the current syllable. The transforms are discriminatively trained by using an objective function termed as "minimum tone error", which is a smooth approximation of tone recognition accuracy. Experiments show that the new tonal features achieve 3.82% tone recognition rate improvement, compared with the baseline, using maximum likelihood trained HMM on the normal F0 features. Further experiments show that discriminative HMM training on the new features is 8.78% better than the baseline.

Keywords:	discriminative training tone recognition feature extraction Mandarin speech recognition
本文献已被万方数据 ScienceDirect 等数据库收录！