首页 | 本学科首页   官方微博 | 高级检索  
     

基于能量分布和共振峰结构的汉语鼻音检测
引用本文:陈斌,张连海,牛铜,王波.基于能量分布和共振峰结构的汉语鼻音检测[J].中文信息学报,2012,26(1):104-110.
作者姓名:陈斌  张连海  牛铜  王波
作者单位:解放军信息工程大学 信息工程学院,河南 郑州 450002
基金项目:国家高技术研究发展(863)计划资助项目(2006AA01Z146);国家自然科学基金资助项目(60872142)
摘    要:该文提出了一种基于能量分布和共振峰结构的汉语鼻音检测方法,该方法首先基于Seneff听觉谱提取了一组描述音段能量分布和共振峰结构的特征参数,然后采用支持向量机模型进行检测和分类,得到候选的鼻音,最后根据音段持续时间、前端韵母能量、高低频能量差、中低频能量比等特征对候选的鼻音进行后处理,去除插入错误,提高鼻音检测的准确率。实验结果表明,干净语音鼻音检测准确率可以达到90.4%,信噪比10dB的语音鼻音检测准确率可达到84.4%以上。

关 键 词:鼻音检测  能量分布  共振峰结构  Seneff听觉模型  

A Method for Chinese Nasal Detection Based on Energy Distribute and Formant Structure Characteristic
CHEN Bin , ZHANG Lianhai , NIU Tong , WANG Bo.A Method for Chinese Nasal Detection Based on Energy Distribute and Formant Structure Characteristic[J].Journal of Chinese Information Processing,2012,26(1):104-110.
Authors:CHEN Bin  ZHANG Lianhai  NIU Tong  WANG Bo
Affiliation:Institute of Information Engineering,PLA Information Engineering University, Zhengzhou, Henan 450002, China
Abstract:A Chinese nasal detection method based on energy distribute and formant structure characteristics is presented.According to this method,the energy distribute and formant structure features are first acquired by Seneff’s auditory spectrum,then SVM classifier is combined to realize candidate nasal detection.Finally,post processing is conducted to remove the insertion errors in accordance with parameters of segment duration,front vowel energy,energy difference of high and low frequency,energy ratio of middle and low frequency,etc.The experimental results show that the accuracy is 90.4% for clean speech,above 84.4% for noisy speech with the SNR of 10dB.
Keywords:nasal detection  energy distribute  formant structure  Seneff auditory model
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《中文信息学报》浏览原始摘要信息
点击此处可从《中文信息学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号