首页 | 本学科首页   官方微博 | 高级检索  
     

基于非线性方法的病理嗓音识别研究
引用本文:张晓俊,陶金,陆洵,曹毅,陶智.基于非线性方法的病理嗓音识别研究[J].信息安全与通信保密,2014(3):113-115.
作者姓名:张晓俊  陶金  陆洵  曹毅  陶智
作者单位:[1]苏州大学物理科学与技术学院能源学院,江苏苏州215006 [2] 苏州大学电子信息学院,江苏苏州215006
基金项目:国家自然科学基金资助项目(批准号:61271359).
摘    要:提出了一种基于非线性特性来提取声带小结类病理嗓音的特征参数的识别方法.首先通过滤波分割的方法,分两个通道处理语音信号,低频部分采用符合人耳听觉特性的巴克滤波器组进行信号重构并提取语音特征,高频部分采用非线性动力学的最大李雅普诺夫指数来描述,最后整合为语音特征序列并进行语音识别.采用美国MEEI公司的病理嗓音数据库进行识别实验.实验结果表明,这种方法能够有效地提高病理嗓音的识别率,达到99.4%的识别率.

关 键 词:非线性  病理嗓音  bark滤波器  最大李雅普诺夫指数

Pathological Voice Recognition based on Nonlinear Method
ZHANG Xiao-jun,TAO Jin,LU Xun,CAO Yi,TAO Zhi.Pathological Voice Recognition based on Nonlinear Method[J].China Information Security,2014(3):113-115.
Authors:ZHANG Xiao-jun  TAO Jin  LU Xun  CAO Yi  TAO Zhi
Affiliation:(a.Sehool of Physical Science and Technology & School of Energy, Sooehow University; b.Sehool of Electronics & Information, Sooehow University, Suzhou Jiangsu 215006, China)
Abstract:This paper proposes a special identification method based on nonlinear characteristics to extract the characteristic parameters of the vocal nodules class of pathological voice. Firstly, by using the method of filtering segmentation, these voice signals are divided into two-channel to process. The low-frequency voice signals use the Bark filter bank meeting the human auditory characteristics to reconstruct the signals and extract the speech features, while the high-frequency ones use the largest Lyapunov exponent based on the nonlinear dynamics. Finally, The two should be integrated into a sequence of speech features for voice recognition. Recognition experiment is based on the database of pathological voices of the U.S. MEEI's company. The experimental results indicate that this method can effectively improve the recognition rate of pathological voice, making the recognition rate up to 99.4%.
Keywords:nonlinear  Pathological voices  Bark filter  the largest Lyapunov exponent
本文献已被 CNKI 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号