应用说话人聚类技术改善语言辨识系统识别率 Improving Automatic Language Identification by Speaker Clustering Methods期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

应用说话人聚类技术改善语言辨识系统识别率

引用本文：	张强,屈丹,侯风雷,王炳锡. 应用说话人聚类技术改善语言辨识系统识别率[J]. 电声技术, 2007, 31(3): 44-48

作者姓名：	张强屈丹侯风雷王炳锡

作者单位：	中国人民解放军信息工程大学,河南,郑州,450002;中国人民解放军信息工程大学,河南,郑州,450002;中国人民解放军信息工程大学,河南,郑州,450002;中国人民解放军信息工程大学,河南,郑州,450002

摘要：	说话人差异是影响语言辨识系统性能的一个重要方面.采用说话人聚类技术对训练数据进行了预分类,以此为基础训练各种语言模型,得到聚类后的语言辨识系统.高斯混合模型、高斯混合模型-全局背景模型和遍历隐马尔可夫模型3种语言模型的实验证明,这种方法是有效的,它普遍地减小了说话人差异对语言辨识的影响,提高了语言辨识率.
关键词：	说话人聚类高斯混合模型-全局背景模型遍历隐马尔可夫模型
文章编号：	1002-8684(2007)03-0044-05
修稿时间：	2006-11-27
Improving Automatic Language Identification by Speaker Clustering Methods

ZHANG Qiang,QU Dan,HOU Feng-lei,WANG Bing-xi. Improving Automatic Language Identification by Speaker Clustering Methods[J]. Audio Engineering, 2007, 31(3): 44-48

Authors:	ZHANG Qiang QU Dan HOU Feng-lei WANG Bing-xi

Affiliation:	PLA Information Engineering University, Zhengzhou 450002, China

Abstract:	The speaker difference is an important factor which affects the performance of the automatic language identification system.The speaker clustering method is applied to classify training-data in advance.By training the language model by these classified data,a new LID system is got.Experiments based on the reference system including GMM(Gaussian Mixed Model) ,GMM-UBM(Gaussian Mixture Model-Universal and Background Model) and HMM(Hidden Markov Model) show this method is effective,that the effect of speaker difference on LID performance is decreased,and that the language identification rate is improved in general.

Keywords:	speaker clustering GMM-UBM ergodic hidden Markov model
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏