首页 | 本学科首页   官方微博 | 高级检索  
     

汉语普通话易混淆音素的识别
引用本文:李晨冲,董滨,潘复平,曾兴雯,颜永红.汉语普通话易混淆音素的识别[J].计算机工程,2009,35(23):201-203.
作者姓名:李晨冲  董滨  潘复平  曾兴雯  颜永红
作者单位:1. 西安电子科技大学通信工程学院,西安,710071;中国科学院声学研究所中科信利语音实验室,北京,100190
2. 中国科学院声学研究所中科信利语音实验室,北京,100190
3. 西安电子科技大学通信工程学院,西安,710071
基金项目:国家"863"计划基金资助项目,国家"973"计划基金资助项目,国家自然科学基金资助项目 
摘    要:针对汉语普通话语音识别中易混淆音素的声学特征,把小波包分解理论应用在感觉加权线性预测(PLP)特征中,提出一种新的特征参数提取算法,可以更精确地描述易混淆音素的频谱特征。使甩高斯混合模型对新的声学特征进行分类,从而达到区分的目的。实验结果证明,新的特征参数识别结果优于使用传统PLP特征参数的识别结果,识别错误率下降30%以上。

关 键 词:小波包分解  感觉加权线性预测  语音识别
修稿时间: 

Recognition of Easily Confused Mandarin Phone
LI Chen-chong,DONG Bin,PAN Fu-ping,ZENG Xing-wen,YAN Yong-hong.Recognition of Easily Confused Mandarin Phone[J].Computer Engineering,2009,35(23):201-203.
Authors:LI Chen-chong  DONG Bin  PAN Fu-ping  ZENG Xing-wen  YAN Yong-hong
Affiliation:(1. School of Telecommunication Engineering, Xidian University, Xi’an 710071; 2. ThinkIT Speech Lab, Institute of Acoustics, Chinese Academy of Sciences, Beijing 100190)
Abstract:Aiming at the acoustic features of some easily confused mandarin speech recognition, this paper directs towards revising the Perceptual Linear Predictive(PLP) acoustic feature of these consonants by applying wavelet packet decomposition theory, in which a new feature extraction algorithm is proposed. The new feature can describe frequency spectrum of the easily confused phones more accurately. It uses Gaussian Mixture Modeling(GMM) to classify the new feature for phone discrimination. Experimental results show that the distinguishing error rates of those easily confused consonants are decreased greatly more than 30% compared with traditional PLP feature.
Keywords:wavelet packet decomposition  Perceptual Linear Predictive(PLP)  speech recognit
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程》浏览原始摘要信息
点击此处可从《计算机工程》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号