首页 | 本学科首页   官方微博 | 高级检索  
     

嵌入时延神经网络的高斯混合模型说话人辨认
引用本文:陈存宝,赵力.嵌入时延神经网络的高斯混合模型说话人辨认[J].声学技术,2010,29(3):292-296.
作者姓名:陈存宝  赵力
作者单位:东南大学信息科学与工程学院,南京,210096
基金项目:国家自然科学基金,江苏省自然科学基金 
摘    要:提出了一种在高斯混合模型中嵌入时延神经网络的方法。它集成了作为判别性方法的时延神经网络和作为生成性方法的高斯混合模型各自的优点。时延神经网络挖掘了特征向量集的时间信息,并且通过时延网络的变换使需要假设变量独立的最大似然概率(ML)方法更为合理。以最大似然概率为准则,把它们作为一个整体来进行训练。训练过程中,高斯混合模型和神经网络的参数交替更新。实验结果表明,采用所提出的模型在各种信噪比情况下的识别率都比基线系统有所提高,最高能达到21%。

关 键 词:说话人识别  高斯混合模型(GMM)  时延神经网络(TDNN)  嵌入
收稿时间:2009/5/12 0:00:00
修稿时间:2009/8/29 0:00:00

Speaker identification based on GMM with embedded TDNN
CHEN Cun-bao and ZHAO Li.Speaker identification based on GMM with embedded TDNN[J].Technical Acoustics,2010,29(3):292-296.
Authors:CHEN Cun-bao and ZHAO Li
Affiliation:(School of Information Science and Engineering,Southeast university,Nanjing 210096,China)
Abstract:This paper proposes a modified Gaussian Mixed Model(GMM) with an embedded Time Delay Neural Network(TDNN).It integrates the merits of GMM which is generative and TDNN as a Discriminative model.TDNN digests the time information of the feature sets,and through the transformation of the feature vector it makes the hy-pothesis of independence that maximum likelihood needs more reasonable.GMM and TDNN are trained as a whole by means of maximum likelihood.In the process of training,the parameter of GMM and TDNN are updated alternately.Experiments show that the proposed system improves accuracy rate against baseline GMM at all SNR with a maximum to 21%.
Keywords:speaker identification  gaussian mixed model  time delay neural network  embedded
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号