基于改进的CNN语音识别研究 RESEARCH OF SPEECH RECOGNITION BASED ON IMPROVED CNN期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于改进的CNN语音识别研究

引用本文：	徐龙飞,张磊,郁进明. 基于改进的CNN语音识别研究[J]. 计算机应用与软件, 2022, 39(1): 119-125. DOI: 10.3969/j.issn.1000-386x.2022.01.018

作者姓名：	徐龙飞张磊郁进明

作者单位：	东华大学信息科学与技术学院上海 201620

基金项目：	国家自然科学青年基金项目(61901104)。

摘要：	针对卷积神经网络进行语音识别时识别率较低的问题,结合序列的最大子序列理论,把真实数据和预测数据看作两个序列并计算两者的最大子序列,再使用欧氏距离计算MSLoss损失函数.使用闵氏距离和神经网络反向更新时的参数,提出自适应卷积核ACKS算法,根据网络传播情况动态地改变卷积核大小,改善模型在不同阶段对数据特性的提取效果.设...
关键词：	卷积神经网络语音识别最大损失函数自适应卷积核
RESEARCH OF SPEECH RECOGNITION BASED ON IMPROVED CNN

Xu Longfei,Zhang Lei,Yu Jinming. RESEARCH OF SPEECH RECOGNITION BASED ON IMPROVED CNN[J]. Computer Applications and Software, 2022, 39(1): 119-125. DOI: 10.3969/j.issn.1000-386x.2022.01.018

Authors:	Xu Longfei Zhang Lei Yu Jinming

Affiliation:	(College of Information Science and Technology,Donghua University,Shanghai 201620,China)

Abstract:	Aiming at the problem of low recognition rate of speech recognition based on CNN,with the longest common subsequence theory,the real data and prediction data were regarded as two sequences,and the longest common subsequence of them was calculated.Then the MSLoss function was calculated by Euclidean distance.The ACKS algorithm was proposed combining the Minkowski distance and the parameters in the back updating of the neural network,which is able to dynamically change the size of the convolution kernel on the basis of network propagation conditions and improve the extraction of data features at different stages of the model.The improved network structure was designed,and the improved network was compared with the recurrent neural network and the long short-term memory neural network in the recognition rate and calculation time.The experimental results show that the improved model can reduce computing time by 2%and reduce error recognition rate by 3%.

Keywords:	Convolutional neural network Speech recognition Max similarity loss function Adaptive convolution kernel
本文献已被维普万方数据等数据库收录！

设为首页 | 免责声明 | 关于勤云 | 加入收藏