语音识别中基于i-vector的说话人归一化研究 Research on Speaker Normalization Based on i-vector in Speech Recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

语音识别中基于i-vector的说话人归一化研究

引用本文：	李亚琦,黄浩. 语音识别中基于i-vector的说话人归一化研究[J]. 现代计算机, 2014, 0(5): 3-7

作者姓名：	李亚琦黄浩

作者单位：	新疆大学信息科学与工程学院,乌鲁木齐830046

基金项目：	国家自然科学基金资助项目（No.61365005、No.60965002）

摘要：	i-vector是反映说话人声学差异的一种重要特征，在目前的说话人识别和说话人验证中显示了有效性。将i-vector应用于语音识别中的说话人的声学特征归一化，对训练数据提取i-vector并利用LBG算法进行无监督聚类．然后对各类分别训练最大似然线性变换并使用说话人自适应训练来实现说话人的归一化。将变换后的特征用于训练和识别．实验表明该方法能够提高语音识别的性能。
关键词：	说话人识别 i-vector 最大似然线性变换特征提取说话人归一化 LBG算法
Research on Speaker Normalization Based on i-vector in Speech Recognition

LI Ya-qi,HUANG Hao. Research on Speaker Normalization Based on i-vector in Speech Recognition[J]. Modem Computer, 2014, 0(5): 3-7

Authors:	LI Ya-qi HUANG Hao

Affiliation:	(Department of Information Science and Engineering, Xinjiang University, Urumqi 830046)

Abstract:	i-vector is an important feature which reflects differences of acoustic characteristics between speakers, and has shown effectiveness in speaker identification and speaker verification. Applies the i-vector method to speaker normalization in speech recognition： extracts the i- vectors of training data and carries out unsupervised clustering using the LBG algorithm. Then performs speaker adaptive training using the cluster information. Speech recognition experiments show that this method can consistantly improve the performance.

Keywords:	Speech Recognition i-vector Maximum Likelihood Linear Transforms Feature Extractor LBG Algorithm
本文献已被维普等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏