发音特征在维汉语音识别中的应用 Application of Articulatory Feature in Uygur and Mandarin Speech Recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

发音特征在维汉语音识别中的应用

引用本文：	秦春香,黄浩. 发音特征在维汉语音识别中的应用[J]. 计算机工程, 2012, 38(23): 177-180

作者姓名：	秦春香黄浩

作者单位：	新疆大学信息科学与工程学院,乌鲁木齐,830046

基金项目：	国家自然科学基金资助项目，新疆高校科研计划培育基金资助项目，新疆大学博士科研启动基金资助项目

摘要：	采用传统谱特征作为输入进行语音识别通常会受到声学环境差异的影响。为此，提出汉语和维语音素和音位的对应规则，并将这种规则应用于基于发音特征的语音识别系统。训练神经网络多层感知器，获取语音信号各类发音特征的后验概率，将其与美尔频率倒谱系数(MFCC)拼接后送入隐马尔科夫模型进行声学模型训练。将不同发音特征分别与传统MFCC特征进行组合并给出测试结果。实验结果表明，当汉语声带状况和送气发音特征与传统MFCC组合时，以及维语的发音方式和声带状况特征与MFCC组合之后，系统误识率较低。
关键词：	维汉语音识别多层感知器声学模型美尔频率倒谱系数特征组合
收稿时间：	2012-03-13
Application of Articulatory Feature in Uygur and Mandarin Speech Recognition

QIN Chun-xiang , HUANG Hao. Application of Articulatory Feature in Uygur and Mandarin Speech Recognition[J]. Computer Engineering, 2012, 38(23): 177-180

Authors:	QIN Chun-xiang HUANG Hao

Affiliation:	(College of Information Science and Engineering, Xinjiang University, Urumqi 830046, China)

Abstract:	Speech recognition based on traditional spectral feature is liable to be influenced by the acoustic conditions of the environment. Articulatory Feature(AF) is robust to such conditions. In this paper, the rules of phonetic mapping to AF of Mandarin and Uighur speech are derived. The neural networks are trained to obtain posterior probability of AF. The features are combined with Mel Frequency Cepstral Coefficient(MFCC) and are used to train the hidden Markov based acoustic model. Experimental results show that by combining the MFCC with the feature of voicing or aspiration in Mandarin, the feature of voicing or manner in Uighur, significant error reductions can be obtained.

Keywords:	Uygur and Mandarin speech recognition Multilayer Perceptron(MLP) acoustic model Mel Frequency Cepstral Coefficient(MFCC) feature combination
本文献已被 CNKI 万方数据等数据库收录！
	点击此处可从《计算机工程》浏览原始摘要信息
	点击此处可从《计算机工程》下载免费的PDF全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏