汉语普通话易混淆音素的识别 Recognition of Easily Confused Mandarin Phone期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

汉语普通话易混淆音素的识别

引用本文：	李晨冲,董滨,潘复平,曾兴雯,颜永红.汉语普通话易混淆音素的识别[J].计算机工程,2009,35(23):201-203.

作者姓名：	李晨冲董滨潘复平曾兴雯颜永红

作者单位：	1. 西安电子科技大学通信工程学院,西安,710071;中国科学院声学研究所中科信利语音实验室,北京,100190 2. 中国科学院声学研究所中科信利语音实验室,北京,100190 3. 西安电子科技大学通信工程学院,西安,710071

基金项目：	国家"863"计划基金资助项目，国家"973"计划基金资助项目，国家自然科学基金资助项目

摘要：	针对汉语普通话语音识别中易混淆音素的声学特征，把小波包分解理论应用在感觉加权线性预测（PLP）特征中，提出一种新的特征参数提取算法，可以更精确地描述易混淆音素的频谱特征。使甩高斯混合模型对新的声学特征进行分类，从而达到区分的目的。实验结果证明，新的特征参数识别结果优于使用传统PLP特征参数的识别结果，识别错误率下降30％以上。
关键词：	小波包分解感觉加权线性预测语音识别
修稿时间：
Recognition of Easily Confused Mandarin Phone

LI Chen-chong,DONG Bin,PAN Fu-ping,ZENG Xing-wen,YAN Yong-hong.Recognition of Easily Confused Mandarin Phone[J].Computer Engineering,2009,35(23):201-203.

Authors:	LI Chen-chong DONG Bin PAN Fu-ping ZENG Xing-wen YAN Yong-hong

Affiliation:	(1. School of Telecommunication Engineering, Xidian University, Xi’an 710071; 2. ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190)

Abstract:	Aiming at the acoustic features of some easily confused mandarin speech recognition, this paper directs towards revising the Perceptual Linear Predictive(PLP) acoustic feature of these consonants by applying wavelet packet decomposition theory, in which a new feature extraction algorithm is proposed. The new feature can describe frequency spectrum of the easily confused phones more accurately. It uses Gaussian Mixture Modeling(GMM) to classify the new feature for phone discrimination. Experimental results show that the distinguishing error rates of those easily confused consonants are decreased greatly more than 30% compared with traditional PLP feature.

Keywords:	wavelet packet decomposition Perceptual Linear Predictive(PLP) speech recognit
本文献已被维普万方数据等数据库收录！
	点击此处可从《计算机工程》浏览原始摘要信息
	点击此处可从《计算机工程》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏