首页 | 本学科首页   官方微博 | 高级检索  
     

深层神经网络语音识别自适应方法研究
引用本文:邓侃,欧智坚.深层神经网络语音识别自适应方法研究[J].计算机应用研究,2016,33(7).
作者姓名:邓侃  欧智坚
作者单位:清华大学 电子工程系,清华大学 电子工程系
基金项目:国家自然科学基金资助项目(61075020,61473168)
摘    要:为了解决语音识别中深层神经网络的说话人与环境自适应问题,本文从语音信号中的说话人与环境因素的固有特点出发,提出了使用长时特征的自适应方案:首先基于高斯混合模型,建立说话人-环境联合补偿模型,对说话人与环境参数进行估计,将此参数作为长时特征;然后,将估计出来长时特征与短时特征一起送入深层神经网络,进行训练。Aurora4实验表明,这一方案可以有效地对说话人与环境因素进行分解,并提升自适应效果。

关 键 词:语音识别  声学模型自适应  深层神经网络
收稿时间:2015/4/10 0:00:00
修稿时间:2015/5/22 0:00:00

Adaptation Method for Deep Neural Network-based Speech Recognition
DENG Kan and OU Zhi-jian.Adaptation Method for Deep Neural Network-based Speech Recognition[J].Application Research of Computers,2016,33(7).
Authors:DENG Kan and OU Zhi-jian
Affiliation:Department of Electronic Engineering,Tsinghua University,Department of Electronic Engineering,Tsinghua University
Abstract:To handle the speaker and noise adaptation problem in deep neural network-based speech recognition system, this paper studied the inherent characters of speaker and noise random factors and proposed a new adaptation method using long term features. Firstly, a joint adaptation model was built based on Gaussian mixture models and the parameters of speaker and noise factors were estimated and used as long term features. Then, these long term features were used in deep neural network together with traditional short term features. Experiment results on Aurora4 database showed that this method could effectively factorize speaker and noise factors, and improve adaptation performance.
Keywords:speech recognition  acoustic model adaptation  deep neural networks
点击此处可从《计算机应用研究》浏览原始摘要信息
点击此处可从《计算机应用研究》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号