首页 | 本学科首页   官方微博 | 高级检索  
     

基于听觉模型的说话人语音特征提取
引用本文:何朝霞,潘平. 基于听觉模型的说话人语音特征提取[J]. 微型机与应用, 2012, 31(1): 37-39
作者姓名:何朝霞  潘平
作者单位:贵州大学计算机科学与信息学院,贵州贵阳,550025
基金项目:国家科技计划基金资助项目,贵州省国际科技合作计划基金资助项目
摘    要:基于听觉模型的特性,仿照MFCC参数提取过程,提出了一种基于Gammatone滤波器组的说话人语音特征提取方法。该方法用Gammatone滤波器组代替三角滤波器组求得倒谱系数,并且可以调整Gammatone滤波器组的通道数和带宽。将该方法所求得的特征在高斯混合模型识别系统中进行仿真实验,实验结果表明,该特征在一定情况下优于MFCC特征在系统的识别率,同时在Gammatone滤波器组通道数较高或滤波器带宽较小的情况下,系统具有较高的识别率。

关 键 词:听觉模型  Gammatone滤波器组  MFCC  特征  识别率

Feature extraction for speaker recognition based on auditory model
He Zhaoxia,Pan Ping. Feature extraction for speaker recognition based on auditory model[J]. Microcomputer & its Applications, 2012, 31(1): 37-39
Authors:He Zhaoxia  Pan Ping
Affiliation:(College of Computer Science & Information,Guizhou University,Guiyang 550025,China)
Abstract:In this paper,a novel feature based on an auditory model and Gammatone filter band is proposed for speaker recognition,which imitates the parameters extraction process of MFCC.The frequency cepstrum coefficient features are calculated using a Gammatone filter band instead of commonly used triangle filter band.Moreover,the dimension and the equivalent rectangular bandwidth of Gammatone filter band could be adjusted.Simulation results with Gaussian mixture model indicate that the recognition rate is significantly improved compared with MFCC in some condition,and the correct recognition rate is higher by more dimensions or smaller equivalent rectangular bandwidth.
Keywords:auditory model  Gammatone filter band  MFCC  feature  recognition rate
本文献已被 CNKI 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号