双向循环神经网络在语音识别中的应用 Application of Bidirectional Recurrent Neural Network in Speech Recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

双向循环神经网络在语音识别中的应用

引用本文：	更藏措毛,黄鹤鸣.双向循环神经网络在语音识别中的应用[J].计算机与现代化,2019,0(10):1.

作者姓名：	更藏措毛黄鹤鸣

作者单位：	青海师范大学计算机学院,青海西宁 810008;藏文信息处理教育部重点实验室,青海西宁 810008;青海师范大学计算机学院,青海西宁 810008;藏文信息处理教育部重点实验室,青海西宁 810008

基金项目：	青海省自然科学基金资助项目(2016-ZJ-904); 国家自然科学基金资助项目（61662062， 61462072）

摘要：	针对前馈神经网络难以处理时序数据的问题，提出将双向循环神经网络（BiRNN）应用在自动语音识别声学建模中。首先，应用梅尔频率倒谱系数进行特征提取；其次，采用双向循环神经网络作为声学模型；最后，测试不同参数对系统性能的影响。在TIMIT数据集上的实验结果表明，与基于卷积神经网络和深度神经网络的声学模型相比，识别率分别提升了1.3%和4.0%，说明基于双向循环神经网络的声学模型具有更好的性能。
关键词：	双向循环神经网络语音识别梅尔频率倒谱系数深度神经网络
收稿时间：	2019-10-29
Application of Bidirectional Recurrent Neural Network in Speech Recognition

Gengzang-Cuomao,HUANG He-ming.Application of Bidirectional Recurrent Neural Network in Speech Recognition[J].Computer and Modernization,2019,0(10):1.

Authors:	Gengzang-Cuomao HUANG He-ming

Abstract:	In order to solve the problem that feed-forward neural network is difficult to process time series data, bidirectional recurrent neural network (BiRNN) is applied in acoustic modeling of automatic speech recognition. Firstly, the Mel frequency cepstrum coefficients are used for feature extraction. Secondly, bidirectional recurrent neural network is used as acoustic model. And finally, the effects of different parameters on system performance are tested. Experimental results on TIMIT dataset show that, compared with convolutional neural network and deep neural network, the recognition rate of the proposed system is improved by 1.3% and 4.0% respectively, which indicates that BiRNN is more suitable for automatic speech recognition.

Keywords:	bidirectional recurrent neural network speech recognition Mel frequency cepstrum coefficient deep neural network
本文献已被万方数据等数据库收录！
	点击此处可从《计算机与现代化》浏览原始摘要信息
	点击此处可从《计算机与现代化》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏