深层神经网络语音识别自适应方法研究 Adaptation Method for Deep Neural Network-based Speech Recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

深层神经网络语音识别自适应方法研究

引用本文：	邓侃,欧智坚.深层神经网络语音识别自适应方法研究[J].计算机应用研究,2016,33(7).

作者姓名：	邓侃欧智坚

作者单位：	清华大学电子工程系,清华大学电子工程系

基金项目：	国家自然科学基金资助项目(61075020,61473168)

摘要：	为了解决语音识别中深层神经网络的说话人与环境自适应问题，本文从语音信号中的说话人与环境因素的固有特点出发，提出了使用长时特征的自适应方案：首先基于高斯混合模型，建立说话人-环境联合补偿模型，对说话人与环境参数进行估计，将此参数作为长时特征；然后，将估计出来长时特征与短时特征一起送入深层神经网络，进行训练。Aurora4实验表明，这一方案可以有效地对说话人与环境因素进行分解，并提升自适应效果。
关键词：	语音识别声学模型自适应深层神经网络
收稿时间：	2015/4/10 0:00:00
修稿时间：	2015/5/22 0:00:00
Adaptation Method for Deep Neural Network-based Speech Recognition

DENG Kan and OU Zhi-jian.Adaptation Method for Deep Neural Network-based Speech Recognition[J].Application Research of Computers,2016,33(7).

Authors:	DENG Kan and OU Zhi-jian

Affiliation:	Department of Electronic Engineering,Tsinghua University,Department of Electronic Engineering,Tsinghua University

Abstract:	To handle the speaker and noise adaptation problem in deep neural network-based speech recognition system, this paper studied the inherent characters of speaker and noise random factors and proposed a new adaptation method using long term features. Firstly, a joint adaptation model was built based on Gaussian mixture models and the parameters of speaker and noise factors were estimated and used as long term features. Then, these long term features were used in deep neural network together with traditional short term features. Experiment results on Aurora4 database showed that this method could effectively factorize speaker and noise factors, and improve adaptation performance.

Keywords:	speech recognition acoustic model adaptation deep neural networks

	点击此处可从《计算机应用研究》浏览原始摘要信息
	点击此处可从《计算机应用研究》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏