基于听觉模型的说话人语音特征提取 Feature extraction for speaker recognition based on auditory model期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于听觉模型的说话人语音特征提取

引用本文：	何朝霞,潘平. 基于听觉模型的说话人语音特征提取[J]. 微型机与应用, 2012, 31(1): 37-39

作者姓名：	何朝霞潘平

作者单位：	贵州大学计算机科学与信息学院,贵州贵阳,550025

基金项目：	国家科技计划基金资助项目，贵州省国际科技合作计划基金资助项目

摘要：	基于听觉模型的特性，仿照MFCC参数提取过程，提出了一种基于Gammatone滤波器组的说话人语音特征提取方法。该方法用Gammatone滤波器组代替三角滤波器组求得倒谱系数，并且可以调整Gammatone滤波器组的通道数和带宽。将该方法所求得的特征在高斯混合模型识别系统中进行仿真实验，实验结果表明，该特征在一定情况下优于MFCC特征在系统的识别率，同时在Gammatone滤波器组通道数较高或滤波器带宽较小的情况下，系统具有较高的识别率。
关键词：	听觉模型 Gammatone滤波器组 MFCC 特征识别率
Feature extraction for speaker recognition based on auditory model

He Zhaoxia,Pan Ping. Feature extraction for speaker recognition based on auditory model[J]. Microcomputer & its Applications, 2012, 31(1): 37-39

Authors:	He Zhaoxia Pan Ping

Affiliation:	(College of Computer Science & Information,Guizhou University,Guiyang 550025,China)

Abstract:	In this paper,a novel feature based on an auditory model and Gammatone filter band is proposed for speaker recognition,which imitates the parameters extraction process of MFCC.The frequency cepstrum coefficient features are calculated using a Gammatone filter band instead of commonly used triangle filter band.Moreover,the dimension and the equivalent rectangular bandwidth of Gammatone filter band could be adjusted.Simulation results with Gaussian mixture model indicate that the recognition rate is significantly improved compared with MFCC in some condition,and the correct recognition rate is higher by more dimensions or smaller equivalent rectangular bandwidth.

Keywords:	auditory model Gammatone filter band MFCC feature recognition rate
本文献已被 CNKI 万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏