Feature Space Eigenvoice Speaker Adaptation

Cite this article:	QU Dan, YANG Xu-Kui, ZHANG Wen-Lin. Feature Space Eigenvoice Speaker Adaptation[J]. Acta Automatica Sinica, 2015, 41(7): 1244-1252.

Authors:	QU Dan, YANG Xu-Kui, ZHANG Wen-Lin

Affiliation:	1. Institute of Information Systems Engineering, PLA Information Engineering University, Zhengzhou 450000

Funding:	Supported by the National Natural Science Foundation of China (61175017, 61403415, 61302107)

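Abstract:	A feature-space eigenvoice speaker adaptation algorithm is proposed. Borrowing the idea of the RATZ algorithm, the method first models the speaker information in the feature space with a Gaussian mixture model. It then estimates the feature compensation terms with a subspace method, which reduces the number of parameters to be estimated, so the feature space is modeled accurately while the amount of adaptation data the algorithm needs is lowered. Chinese continuous speech recognition experiments on the Microsoft corpus show that the algorithm still performs well when adaptation data is extremely scarce, that combining it with speaker adaptive training further reduces the word error rate, and that its real-time performance is better than that of the eigenvoice speaker adaptation algorithm.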
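Keywords:	continuous speech recognition; speaker adaptation; multivariate Gaussian-based cepstral normalization; eigenvoice
Received:	2014-09-12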
Feature Space Eigenvoice Speaker Adaptation

QU Dan, YANG Xu-Kui, ZHANG Wen-Lin. Feature Space Eigenvoice Speaker Adaptation[J]. Acta Automatica Sinica, 2015, 41(7): 1244-1252.

Authors:	QU Dan, YANG Xu-Kui, ZHANG Wen-Lin

Affiliation:	1. Institute of Information Systems Engineering, Information Engineering University, Zhengzhou 450000

Abstract:	A feature-level speaker adaptation method, feature-space eigenvoice adaptation, is proposed. As in RATZ, the speaker information in the feature space is modeled by a Gaussian mixture model. The number of parameters to be estimated is reduced by taking the dependency among these parameters into account, so an accurate feature-space model can be built from very little data. Continuous speech recognition experiments on the Microsoft speech database show that the method still achieves good performance when the adaptation data is limited. Speaker adaptive training based on this method further decreases the word error rate, and the method has better real-time performance than eigenvoice methods.

Keywords:	continuous speech recognition; speaker adaptation; multivariate Gaussian-based cepstral normalization; eigenvoice
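
The compensation idea summarized in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `gmm_posteriors` and `compensate`, the variables `basis` and `coords`, the diagonal-covariance assumption, and the sign convention (bias added to the frame) are all assumptions made for illustration. The sketch assumes that each Gaussian component carries a bias vector, that the stacked biases lie in a low-dimensional subspace spanned by `basis`, and that a speaker is described by a few coordinates `coords` in that subspace. Estimating `coords` from adaptation data is not shown.

```python
import numpy as np

def gmm_posteriors(X, weights, means, variances):
    """Posterior p(k | x_t) under a diagonal-covariance GMM.

    X: (T, D) frames; weights: (K,); means, variances: (K, D).
    """
    diff = X[:, None, :] - means[None, :, :]                     # (T, K, D)
    log_lik = -0.5 * np.sum(
        diff ** 2 / variances + np.log(2.0 * np.pi * variances), axis=2
    )                                                            # (T, K)
    log_post = np.log(weights) + log_lik
    log_post -= log_post.max(axis=1, keepdims=True)              # numerical stability
    post = np.exp(log_post)
    return post / post.sum(axis=1, keepdims=True)

def compensate(X, post, basis, coords):
    """Shift each frame by a posterior-weighted bias.

    basis: (K * D, M) hypothetical subspace basis for the stacked biases.
    coords: (M,) hypothetical speaker coordinates in that subspace.
    """
    K, D = post.shape[1], X.shape[1]
    biases = (basis @ coords).reshape(K, D)   # one bias vector per component
    return X + post @ biases                  # (T, D) compensated frames
```

Under these assumptions only the M entries of `coords` are speaker-specific, rather than K * D free bias values, which is the kind of parameter reduction the abstract attributes to the subspace method.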
