首页 | 本学科首页   官方微博 | 高级检索  
     

基于MFCC与分形维数混合参数的语音识别
引用本文:雷涛,姚明海,马洪蕊.基于MFCC与分形维数混合参数的语音识别[J].控制工程,2005(Z1).
作者姓名:雷涛  姚明海  马洪蕊
作者单位:浙江工业大学信息工程学院 浙江杭州310032 (雷涛,姚明海),浙江工业大学信息工程学院 浙江杭州310032(马洪蕊)
基金项目:浙江省自然科学基金资助项目(602099)
摘    要:针对语音气流中具有混沌特征,而分形可以定量地分析混沌现象,故分形可用来分析语音信号。语音波形具有分形特征,将分形用于改善语音识别技术更好地表现语音的特征,避免传统的分段线性处理所产生的局限性。将传统特征参数MFCC与分形特征结合起来,组成混合参数用于语音识别。实验结论显示,基于MFCC与分形维数混合参数的语音识别方法要好于使用单一MFCC参数的语音识别方法。

关 键 词:分形维数  语音识别  MFCC  神经网络

Method for Speech Recognition Based on Mixed Parameter of MFCC and Fractal Dimension
LEI Tao,YAO Ming-hai,MA Hong-mi.Method for Speech Recognition Based on Mixed Parameter of MFCC and Fractal Dimension[J].Control Engineering of China,2005(Z1).
Authors:LEI Tao  YAO Ming-hai  MA Hong-mi
Abstract:Because of chaos characters in speech air flow, fractal can be used to quantify the chaotic phenomenon in speech signals. Speech wave appears in fractal feature. Fractal used to improve the technique of speech recognition will be more important. The traditional linear feature parameter, such as MFCC, can not represent nonlinear feature of speech. So in order to represent the feature of speech better and avoid the localization of using subsection linear method, MFCC is combined with fractal feature for speech recognition. The experiment result shows better effect is achieved by mixed parameter of MFCC and fractal dimension than only with MFCC.
Keywords:fractal dimension  speech recognition  MFCC  artificial neural network (ANN)
本文献已被 CNKI 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号