首页 | 本学科首页   官方微博 | 高级检索  
     

基于元音模板匹配的声效多级检测
引用本文:晁浩,宋成,刘志中.基于元音模板匹配的声效多级检测[J].北京邮电大学学报,2016,39(4):98-102.
作者姓名:晁浩  宋成  刘志中
作者单位:河南理工大学 计算机科学与技术学院, 河南 焦作 454000
基金项目:国家自然科学基金项目(61300124;61502150),河南省基础与前沿技术研究计划资助项目(132300410332)
摘    要:针对鲁棒语音识别中的声效模式检测问题,提出了一种分级检测方法. 首先使用整体谱特征训练高斯混合模型来判定语音信号是否耳语. 对于非耳语的语音信号,通过声学界标点检测来获取信号中的元音段,然后通过元音模板匹配来确定语音信号具体的声效模式. 在863-test测试集上进行的声效检测实验结果显示,除耳语识别精度略有下降外,其他4种声效模式的识别精度均有大幅度的提高. 实验结果表明了将语音信号整体特征与局部元音特征相结合在声效检测中的有效性.

关 键 词:语音识别  声效  元音  模板匹配  高斯混合模型  
收稿时间:2015-05-28

Multi-Level Detection of Vocal Effort Based on Vowel Template Matching
CHAO Hao,SONG Ceng,LIU Zhi-zhong.Multi-Level Detection of Vocal Effort Based on Vowel Template Matching[J].Journal of Beijing University of Posts and Telecommunications,2016,39(4):98-102.
Authors:CHAO Hao  SONG Ceng  LIU Zhi-zhong
Affiliation:College of Computer Science and Technology, Henan Polytechnic University, Henan Jiaozuo 454000, China
Abstract:A two-stage detection method was proposed for the identification of vocal effort modes in robust speech recognition. Firstly, whisper identification of speech signal is performed by using Gaussian mix-ture model ( GMMs) which are trained by global spectrum features. Secondly, vowels are acquired based on landmark detection for the speech signal which does not belong to the whisper mode, and the vocal ef-fort mode of the speech signal is determined by vowel template matching. Experiments conducted on 863-test show that, accompanied by a slight decline for whisper mode, the significant improvement of recogni-tion accuracy for the remaining four vocal effort modes can be achieved.
Keywords:speech recognition  vocal effort  vowel  adaptive modulation  Gaussian mixture model
本文献已被 CNKI 万方数据 等数据库收录!
点击此处可从《北京邮电大学学报》浏览原始摘要信息
点击此处可从《北京邮电大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号