基于元音模板匹配的声效多级检测 Multi-Level Detection of Vocal Effort Based on Vowel Template Matching期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于元音模板匹配的声效多级检测

引用本文：	晁浩,宋成,刘志中.基于元音模板匹配的声效多级检测[J].北京邮电大学学报,2016,39(4):98-102.

作者姓名：	晁浩宋成刘志中

作者单位：	河南理工大学计算机科学与技术学院, 河南焦作 454000

基金项目：	国家自然科学基金项目(61300124;61502150)，河南省基础与前沿技术研究计划资助项目(132300410332)

摘要：	针对鲁棒语音识别中的声效模式检测问题，提出了一种分级检测方法. 首先使用整体谱特征训练高斯混合模型来判定语音信号是否耳语. 对于非耳语的语音信号，通过声学界标点检测来获取信号中的元音段，然后通过元音模板匹配来确定语音信号具体的声效模式. 在863-test测试集上进行的声效检测实验结果显示，除耳语识别精度略有下降外，其他4种声效模式的识别精度均有大幅度的提高. 实验结果表明了将语音信号整体特征与局部元音特征相结合在声效检测中的有效性.
关键词：	语音识别声效元音模板匹配高斯混合模型
收稿时间：	2015-05-28
Multi-Level Detection of Vocal Effort Based on Vowel Template Matching

CHAO Hao,SONG Ceng,LIU Zhi-zhong.Multi-Level Detection of Vocal Effort Based on Vowel Template Matching[J].Journal of Beijing University of Posts and Telecommunications,2016,39(4):98-102.

Authors:	CHAO Hao SONG Ceng LIU Zhi-zhong

Affiliation:	College of Computer Science and Technology, Henan Polytechnic University, Henan Jiaozuo 454000, China

Abstract:	A two-stage detection method was proposed for the identification of vocal effort modes in robust speech recognition. Firstly, whisper identification of speech signal is performed by using Gaussian mix-ture model ( GMMs) which are trained by global spectrum features. Secondly, vowels are acquired based on landmark detection for the speech signal which does not belong to the whisper mode, and the vocal ef-fort mode of the speech signal is determined by vowel template matching. Experiments conducted on 863-test show that, accompanied by a slight decline for whisper mode, the significant improvement of recogni-tion accuracy for the remaining four vocal effort modes can be achieved.

Keywords:	speech recognition vocal effort vowel adaptive modulation Gaussian mixture model
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《北京邮电大学学报》浏览原始摘要信息
	点击此处可从《北京邮电大学学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏