首页 | 本学科首页   官方微博 | 高级检索  
     

基于DTW 和EMD的孤立词语音识别研究
引用本文:徐必伟,苏成利,杨微,曹江涛.基于DTW 和EMD的孤立词语音识别研究[J].辽宁石油化工大学学报,2018,38(1):74-78.
作者姓名:徐必伟  苏成利  杨微  曹江涛
作者单位:1.辽宁石油化工大学信息与控制工程学院,辽宁抚顺113001;2.辽宁装备制造职业技术学院,辽宁沈阳110000
基金项目::国家自然科学基金项目(61673199);辽宁省高校优秀人才支持计划项目(LJQ2015061)。
摘    要:针对语音识别过程中环境噪声干扰大的问题,提出一种基于经验模态分解(EMD)与动态时间规整 (DTW)相结合的孤立词识别算法。该方法利用EMD 算法,首先将提取的性能不好的语音信号分解成若干个基本模函数(IMF),去掉原始信号中的干扰和噪声。然后,基于DTW 算法,采用短时过零率和短时能量对语音信号进行端点检测,提取语音特征参数后与参考模板进行匹配。将参考模板与待测模板之间的最短路径作为识别结果。仿真结果表明,该算法能够提高语音的识别效率和识别的正确率。

关 键 词:语音识别  经验模态分解  动态时间规整  孤立词识别  
收稿时间:2017-05-04

Research on Isolated Word Speech Recognition Based on DTW and EMD
Xu Biwei,Su Chengli,Yang Wei,Cao Jiangtao.Research on Isolated Word Speech Recognition Based on DTW and EMD[J].Journal of Liaoning University of Petroleum & Chemical Technology,2018,38(1):74-78.
Authors:Xu Biwei  Su Chengli  Yang Wei  Cao Jiangtao
Affiliation:1.School of Information and Control Engineering, Liaoning Shihua University, Fushun Liaoning 113001, China;  2.Liaoning Equipment Manufacturing Vocational and Technical College, Shenyang Liaoning 110000, China
Abstract:In order to solve the problem of large interference of environmental noise during speech recognition, an isolated word recognition algorithm based on empirical mode decomposition (EMD) and dynamic time warping (DTW) is proposed. In this method, the EMD algorithm is used to decompose the speech signal with poor performance into several basic mode functions (IMF) and remove the interference and noise in the original signal. Then, based on the DTW algorithm, the short-time zero crossing rate and short-time energy are used to detect the endpoint detection of speech signal. After the speech feature parameters are extracted, the speech signal is matched with the reference template. Finally, the shortest path between the reference template and the template to be measured is used as the recognition result. The simulation results show that the proposed algorithm can improve the recognition efficiency of speech and the accuracy of recognition.
Keywords:Speech recognition  Empirical mode decomposition  Dynamic time warping  Isolated word recognition  
本文献已被 CNKI 等数据库收录!
点击此处可从《辽宁石油化工大学学报》浏览原始摘要信息
点击此处可从《辽宁石油化工大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号