首页 | 本学科首页   官方微博 | 高级检索  
     

基于子带GMM-UBM的广播语音多语种识别
引用本文:李思一,戴蓓蒨,王海祥. 基于子带GMM-UBM的广播语音多语种识别[J]. 数据采集与处理, 2007, 22(1): 14-18
作者姓名:李思一  戴蓓蒨  王海祥
作者单位:中国科学技术大学电子科学与技术系,合肥,230026;中国科学技术大学电子科学与技术系,合肥,230026;中国科学技术大学电子科学与技术系,合肥,230026
摘    要:提出了一种基于概率统计模型的与语言内容无关的语种识别方法,它不需要掌握各语种的专业语言学知识就可以实现几十种语言的语种识别;并针对广播语音噪声干扰大的特点,采用GMM-UBM模型作为语种模型,提高了系统的噪声鲁棒性;由于广播语音的背景噪声不是简单的全频带加性白噪声,因此本文构建了一种基于子带GMM-UBM模型的多子系统结构的语种识别系统,后端采用神经网络进行系统级融合。本文通过对37种语言及方言的识别实验,证明了子带GMM-UBM方法的有效性。

关 键 词:语种识别  语言内容无关  广播语音  子带GMM-UBM
文章编号:1004-9037(2007)01-0014-05
收稿时间:2006-01-03
修稿时间:2006-04-12

Broadcast Speech Language Recognition Based on Sub-Band GMM-UBM
Li Siyi,Dai Beiqian,Wang Haixiang. Broadcast Speech Language Recognition Based on Sub-Band GMM-UBM[J]. Journal of Data Acquisition & Processing, 2007, 22(1): 14-18
Authors:Li Siyi  Dai Beiqian  Wang Haixiang
Affiliation:Department of Electronic Science and Technology, University of Science and Technology of China, Hefei, 230026, China
Abstract:A language recognition method is proposed based on probability-statistical model.It can recognize several decade kinds of languages without professional linguistic knowledge.Aimed at the high noise of the broadcast speech,GMM-UBM is used as the language model to improve the system noise-robustness.And because the background noise of the broadcast speech is not the simply full-band Gaussian white noise,a language recognition system is built based on sub-band GMM-UBM model and subsystems structure by using neural network to fuse different subsystems.Experimental results for recognizing 37 languages and dialects verify the validity of the sub-band GMM-UBM method.
Keywords:language recognition  text-independent  broadcast speech  sub-band GMM-UBM
本文献已被 CNKI 维普 万方数据 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号