首页 | 本学科首页   官方微博 | 高级检索  
     

融合AP和GMM的说话人识别方法研究
引用本文:王波,钟映春,陈俊彬.融合AP和GMM的说话人识别方法研究[J].广东工业大学学报,2015,32(4):145-149.
作者姓名:王波  钟映春  陈俊彬
作者单位:广东工业大学 自动化学院, 广东 广州 510006
基金项目:广东省科技计划项目(2010A030500006 )
摘    要:针对在说话人识别过程中经典的高斯混合模型(Gaussian Mixture Model,GMM)阶数的确定具有很大随意性的问题,提出采用吸引子传播聚类方法(AP聚类)自动获取GMM的阶数,进而实现说话人识别的方法.首先,采用Mel频率倒谱系数法(MFCC)与差分倒谱相结合的方法,提取语音特征参数;其次,采用吸引子传播聚类方法(AP聚类)对语音特征参数进行聚类处理,从而自动获得GMM的阶数;在此基础上进行GMM模型的训练;最后,采用训练好的GMM模型对Timit标准语音库以及自制网络志愿者语音库进行说话人识别测试实验.实验结果为:使用了AP聚类算法获取GMM阶数的情况下,对Timit标准语音库的测试结果为100%;在自制网络志愿者语音库中,训练样本为168个,其中潮汕话样本10个,湖南话样本10个,测试样本为42个,测试结果为97.6%.实验结果表明,引入AP聚类自动获取GMM的阶数,可以显著提高说话人识别的精度和效率.

关 键 词:说话人识别    MFCC    AP聚类算法    高斯混合模型  
收稿时间:2014-11-26

Research on Speaker Recognition Based on Both AP and GMM
WANG Bo,ZHONG Ying-Chun,CHEN Jun-Bin.Research on Speaker Recognition Based on Both AP and GMM[J].Journal of Guangdong University of Technology,2015,32(4):145-149.
Authors:WANG Bo  ZHONG Ying-Chun  CHEN Jun-Bin
Affiliation:School of Automation, Guangdong University of Technology, Guangzhou 510006, China
Abstract:According to the randomness of determining the order of the classical Gaussian Mixture Model (GMM), affinity propagation(AP) clustering is recommended to get the order of GMM automatically. A method is proposed to recognize the speakers by applying both AP and GMM. Firstly, the speech feature parameters are extracted by combining the Mel frequency cepstrum coefficient (MFCC) with the differential cepstrum. Secondly, the affinity propagation clustering (AP clustering) method is used as the clustering of the speech feature parameters, and then the best steps of GMM are obtained automatically. On this basis, GMM model is trained. Finally, the trained GMM is used for recognizing experiment of speakers on Timit standard speech library and self-made network volunteers’ speech library. The experiment results are: the test results are 100% on Timit standard speech library and 97.6% on self-made network volunteers’ speech library in case of obtaining the order of GMM by AP clustering algorithm. There are 168 samples for training which contain 10 Chaoshan samples and 10 Hunan samples and 42 samples for testing on self-made network volunteers’ speech library. The experiment results show that the recommended AP clustering algorithm to get the order of GMM automatically can improve the accuracy and efficiency of speaker recognition significantly.
Keywords:speaker recognition  mel frequency cepstrum coefficient (MFCC)  affinity propagation (AP) clustering algorithm  Gaussian mixture model (GMM)  
本文献已被 万方数据 等数据库收录!
点击此处可从《广东工业大学学报》浏览原始摘要信息
点击此处可从《广东工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号