基于子带GMM-UBM的广播语音多语种识别 Broadcast Speech Language Recognition Based on Sub-Band GMM-UBM期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于子带GMM-UBM的广播语音多语种识别

引用本文：	李思一,戴蓓蒨,王海祥. 基于子带GMM-UBM的广播语音多语种识别[J]. 数据采集与处理, 2007, 22(1): 14-18

作者姓名：	李思一戴蓓蒨王海祥

作者单位：	中国科学技术大学电子科学与技术系,合肥,230026;中国科学技术大学电子科学与技术系,合肥,230026;中国科学技术大学电子科学与技术系,合肥,230026

摘要：	提出了一种基于概率统计模型的与语言内容无关的语种识别方法,它不需要掌握各语种的专业语言学知识就可以实现几十种语言的语种识别;并针对广播语音噪声干扰大的特点,采用GMM-UBM模型作为语种模型,提高了系统的噪声鲁棒性;由于广播语音的背景噪声不是简单的全频带加性白噪声,因此本文构建了一种基于子带GMM-UBM模型的多子系统结构的语种识别系统,后端采用神经网络进行系统级融合。本文通过对37种语言及方言的识别实验,证明了子带GMM-UBM方法的有效性。
关键词：	语种识别语言内容无关广播语音子带GMM-UBM
文章编号：	1004-9037（2007）01-0014-05
收稿时间：	2006-01-03
修稿时间：	2006-04-12
Broadcast Speech Language Recognition Based on Sub-Band GMM-UBM

Li Siyi,Dai Beiqian,Wang Haixiang. Broadcast Speech Language Recognition Based on Sub-Band GMM-UBM[J]. Journal of Data Acquisition & Processing, 2007, 22(1): 14-18

Authors:	Li Siyi Dai Beiqian Wang Haixiang

Affiliation:	Department of Electronic Science and Technology, University of Science and Technology of China, Hefei, 230026, China

Abstract:	A language recognition method is proposed based on probability-statistical model.It can recognize several decade kinds of languages without professional linguistic knowledge.Aimed at the high noise of the broadcast speech,GMM-UBM is used as the language model to improve the system noise-robustness.And because the background noise of the broadcast speech is not the simply full-band Gaussian white noise,a language recognition system is built based on sub-band GMM-UBM model and subsystems structure by using neural network to fuse different subsystems.Experimental results for recognizing 37 languages and dialects verify the validity of the sub-band GMM-UBM method.

Keywords:	language recognition text-independent broadcast speech sub-band GMM-UBM
本文献已被 CNKI 维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏