基于能量分布和共振峰结构的汉语鼻音检测 A Method for Chinese Nasal Detection Based on Energy Distribute and Formant Structure Characteristic期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于能量分布和共振峰结构的汉语鼻音检测

引用本文：	陈斌,张连海,牛铜,王波.基于能量分布和共振峰结构的汉语鼻音检测[J].中文信息学报,2012,26(1):104-110.

作者姓名：	陈斌张连海牛铜王波

作者单位：	解放军信息工程大学信息工程学院,河南郑州 450002

基金项目：	国家高技术研究发展(863)计划资助项目(2006AA01Z146);国家自然科学基金资助项目(60872142)

摘要：	该文提出了一种基于能量分布和共振峰结构的汉语鼻音检测方法,该方法首先基于Seneff听觉谱提取了一组描述音段能量分布和共振峰结构的特征参数,然后采用支持向量机模型进行检测和分类,得到候选的鼻音,最后根据音段持续时间、前端韵母能量、高低频能量差、中低频能量比等特征对候选的鼻音进行后处理,去除插入错误,提高鼻音检测的准确率。实验结果表明,干净语音鼻音检测准确率可以达到90.4%,信噪比10dB的语音鼻音检测准确率可达到84.4%以上。
关键词：	鼻音检测能量分布共振峰结构 Seneff听觉模型
A Method for Chinese Nasal Detection Based on Energy Distribute and Formant Structure Characteristic

CHEN Bin , ZHANG Lianhai , NIU Tong , WANG Bo.A Method for Chinese Nasal Detection Based on Energy Distribute and Formant Structure Characteristic[J].Journal of Chinese Information Processing,2012,26(1):104-110.

Authors:	CHEN Bin ZHANG Lianhai NIU Tong WANG Bo

Affiliation:	Institute of Information Engineering,PLA Information Engineering University, Zhengzhou, Henan 450002, China

Abstract:	A Chinese nasal detection method based on energy distribute and formant structure characteristics is presented.According to this method,the energy distribute and formant structure features are first acquired by Seneff’s auditory spectrum,then SVM classifier is combined to realize candidate nasal detection.Finally,post processing is conducted to remove the insertion errors in accordance with parameters of segment duration,front vowel energy,energy difference of high and low frequency,energy ratio of middle and low frequency,etc.The experimental results show that the accuracy is 90.4% for clean speech,above 84.4% for noisy speech with the SNR of 10dB.

Keywords:	nasal detection energy distribute formant structure Seneff auditory model
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《中文信息学报》浏览原始摘要信息
	点击此处可从《中文信息学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏