基于子字单元的维吾尔语语音识别研究 Research on Uyghur Speech Recognition Based on Subword Unit期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于子字单元的维吾尔语语音识别研究

引用本文：	薛化建,董兴华,周喜,吐尔洪·吾司曼,李晓. 基于子字单元的维吾尔语语音识别研究[J]. 计算机工程, 2011, 37(20): 208-210. DOI: 10.3969/j.issn.1000-3428.2011.20.072

作者姓名：	薛化建董兴华周喜吐尔洪·吾司曼李晓

作者单位：	1. 中国科学院新疆理化技术研究所,乌鲁木齐830011; 中国科学院研究生院,北京100190 2. 中国科学院新疆理化技术研究所,乌鲁木齐,830011

基金项目：	中国科学院西部行动计划高新技术基金资助项目(KGCX2-YW-507)

摘要：	为提高维吾尔语语音识别的识别率，在分析维吾尔语特点的基础上，设计一种基于子字单元的维吾尔语语音识别总体结构，指出维吾尔语单词的发音模型，给出构建子字发音字典的方法，及其以子字单元为基础构建语言模型与声学模型的方法。在一个语音库上进行实验，采用一种非监督的词切分方法对维吾尔语单词进行词切分，生成子字。实验结果表明，基于子字单元的维吾尔语语音识别可以获得更好的识别结果。
关键词：	维吾尔语词切分子字单元隐马尔科夫模型连续语音识别
收稿时间：	2011-03-30
Research on Uyghur Speech Recognition Based on Subword Unit

XUE Hua-jian,DONG Xing-hua,ZHOU Xi,Turghun Osman,LI Xiao. Research on Uyghur Speech Recognition Based on Subword Unit[J]. Computer Engineering, 2011, 37(20): 208-210. DOI: 10.3969/j.issn.1000-3428.2011.20.072

Authors:	XUE Hua-jian DONG Xing-hua ZHOU Xi Turghun Osman LI Xiao

Affiliation:	1(1.Xinjiang Technical Institute of Physics and Chemistry,Chinese Academy of Sciences,Urumqi 830011,China;2.Graduate University of Chinese Academy of Sciences,Beijing 100190,China)

Abstract:	To improve on accuracy of Uyghur speech recognition,based on analysis of Uyghur characteristics,the framework of Uyghur speech recognition based on subword is developed for the first time.Pronunciation model of Uyghur word is given.How to build subword pronouncing dictionary,subword language model and acoustic model is described.Experiments are completed on a speech corpus and an unsupervised Uyghur word segmentation method is utilized to produce subwords.Experimental results show that Uyghur speech recognition based on subword can gain better recognition results.

Keywords:	Uyghur word segmentation subword unit Hidden Markov Model（HMM） continuous speech recognition
本文献已被 CNKI 维普万方数据等数据库收录！
	点击此处可从《计算机工程》浏览原始摘要信息
	点击此处可从《计算机工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏