首页 | 本学科首页   官方微博 | 高级检索  
     

连续语音识别中基于Dropout修正线性深度置信网络的声学模型
引用本文:陈雷,杨俊安,王龙,李晋徽.连续语音识别中基于Dropout修正线性深度置信网络的声学模型[J].声学技术,2016,35(2):146-154.
作者姓名:陈雷  杨俊安  王龙  李晋徽
作者单位:电子工程学院, 安徽合肥 230037;电子制约技术安徽省重点实验室, 安徽合肥 230037;电子工程学院, 安徽合肥 230037;电子制约技术安徽省重点实验室, 安徽合肥 230037;电子工程学院, 安徽合肥 230037;电子制约技术安徽省重点实验室, 安徽合肥 230037;电子工程学院, 安徽合肥 230037;电子制约技术安徽省重点实验室, 安徽合肥 230037
基金项目:国家自然科学基金资助项目(60872113)
摘    要:大词汇量连续语音识别系统中,为了增强现有声学模型的表征能力、防止模型过拟合,提出一种基于遗失策略(Dropout)修正线性深度置信网络的声学模型构建方法。该方法使用修正线性函数代替传统Logistic函数进行深度置信网络训练,修正线性函数更接近生物神经网络的工作方式,增强了模型的表征能力;同时引入Dropout策略对修正线性深度置信网络进行调整,避免节点之间的协同作用,防止网络出现过拟合。文章利用公开语音数据集进行了实验,实验结果证明了所提出的声学模型构建方法相对于传统方法的优越性。

关 键 词:连续语音识别|深度置信网络|修正线性|过拟合|Dropout
收稿时间:2015/3/8 0:00:00
修稿时间:2015/4/17 0:00:00

Acoustic model based on Dropout rectified deep belief network in large vocabulary continuous speech recognition system
CHEN Lei,YANG Jun-an,WANG Long and LI Jin-hui.Acoustic model based on Dropout rectified deep belief network in large vocabulary continuous speech recognition system[J].Technical Acoustics,2016,35(2):146-154.
Authors:CHEN Lei  YANG Jun-an  WANG Long and LI Jin-hui
Affiliation:Electronic Engineering Institute, Hefei 230037, Anhui, China;Key Laboratory of Electronic Restriction, Anhui Province, Hefei, 230037, Anhui, China;Electronic Engineering Institute, Hefei 230037, Anhui, China;Key Laboratory of Electronic Restriction, Anhui Province, Hefei, 230037, Anhui, China;Electronic Engineering Institute, Hefei 230037, Anhui, China;Key Laboratory of Electronic Restriction, Anhui Province, Hefei, 230037, Anhui, China;Electronic Engineering Institute, Hefei 230037, Anhui, China;Key Laboratory of Electronic Restriction, Anhui Province, Hefei, 230037, Anhui, China
Abstract:To improve representation ability of acoustic model and prevent over fitting in large vocabulary continuous speech recognition system, this article proposes a method of establishing the acoustic model based on Dropout rectified Deep Belief Network (DBN). This method uses rectified linear function instead of traditional Logistic function as the activation function for DBN training, and the rectified linear function that is closer to the working mode of biological neural network can improve acoustic representation ability of the model, simultaneously Dropout strategy is in-troduced to avoid the synergy between nodes and to prevent over fitting. The actual test certificate on public speech databases proves the superiority of the proposed method over the conventional one.
Keywords:large vocabulary continuous speech recognition|deep belief network|rectified linear function|over fitting|Dropout
本文献已被 CNKI 等数据库收录!
点击此处可从《声学技术》浏览原始摘要信息
点击此处可从《声学技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号