首页 | 本学科首页   官方微博 | 高级检索  
     

语音识别中基于i-vector的说话人归一化研究
引用本文:李亚琦,黄浩. 语音识别中基于i-vector的说话人归一化研究[J]. 现代计算机, 2014, 0(5): 3-7
作者姓名:李亚琦  黄浩
作者单位:新疆大学信息科学与工程学院,乌鲁木齐830046
基金项目:国家自然科学基金资助项目(No.61365005、No.60965002)
摘    要:i-vector是反映说话人声学差异的一种重要特征,在目前的说话人识别和说话人验证中显示了有效性。将i-vector应用于语音识别中的说话人的声学特征归一化,对训练数据提取i-vector并利用LBG算法进行无监督聚类.然后对各类分别训练最大似然线性变换并使用说话人自适应训练来实现说话人的归一化。将变换后的特征用于训练和识别.实验表明该方法能够提高语音识别的性能。

关 键 词:说话人识别  i-vector  最大似然线性变换  特征提取  说话人归一化  LBG算法

Research on Speaker Normalization Based on i-vector in Speech Recognition
LI Ya-qi,HUANG Hao. Research on Speaker Normalization Based on i-vector in Speech Recognition[J]. Modem Computer, 2014, 0(5): 3-7
Authors:LI Ya-qi  HUANG Hao
Affiliation:(Department of Information Science and Engineering, Xinjiang University, Urumqi 830046)
Abstract:i-vector is an important feature which reflects differences of acoustic characteristics between speakers, and has shown effectiveness in speaker identification and speaker verification. Applies the i-vector method to speaker normalization in speech recognition: extracts the i- vectors of training data and carries out unsupervised clustering using the LBG algorithm. Then performs speaker adaptive training using the cluster information. Speech recognition experiments show that this method can consistantly improve the performance.
Keywords:Speech Recognition  i-vector  Maximum Likelihood Linear Transforms  Feature Extractor  LBG Algorithm
本文献已被 维普 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号