首页 | 官方网站   微博 | 高级检索  
     

基于注意力卷积模块的深度神经网络图像识别
引用本文:袁嘉杰,张灵,陈云华.基于注意力卷积模块的深度神经网络图像识别[J].计算机工程与应用,2019,55(8):9-16.
作者姓名:袁嘉杰  张灵  陈云华
作者单位:广东工业大学 计算机学院,广州,510006;广东工业大学 计算机学院,广州,510006;广东工业大学 计算机学院,广州,510006
基金项目:广东省自然科学基金;广东省交通运输厅科技项目;广东省科技计划项目;广东省自然科学基金;广东省科技计划产学研项目
摘    要:对于在深度神经网络的中间层分支进行深度融合,产生潜在可以共享有用信息的基础网络,从而优化信息流动,提升深度神经网络的性能,是近期的深度神经网络研究的挑战。对此提出一种基于注意力卷积模块的深度神经网络的图像识别方法。改进的模块主要分为树干分支与软分支两部分,在树干分支上,由两组残差模块组成,使该模块适用于其他深度神经网络;在软分支上,将给定的中间特征图沿着两个维度(空间与通道)获取注意力特征图,对输入中间特征图进行调整,强化有用信息抑制无用信息。改进的卷积残差模块既能解决输入与输出的尺寸不一致的问题,也能强化图像的关键信息与有效促进网络的信息流动。通过对cifar-10、cifar-100、ck+、AVEC2017数据集进行实验,实验结果表明了提出的方法应用于ResNet-50网络上对比Hu提出的方法在训练耗时相差不到0.3%的情况下,识别图像准确率有0.9%~1.2%的提高。

关 键 词:图像识别  残差模块  注意力  深度神经网络

Deep Neural Network Based on Attention Convolution Module for Image Recognition
YUAN Jiajie,ZHANG Ling,CHEN Yunhua.Deep Neural Network Based on Attention Convolution Module for Image Recognition[J].Computer Engineering and Applications,2019,55(8):9-16.
Authors:YUAN Jiajie  ZHANG Ling  CHEN Yunhua
Affiliation:College of Computer Science, Guangdong University of Technology, Guangzhou 510006, China
Abstract:For deep fusion in the middle layer branches of deep neural networks, it is a challenge for recent deep neural network research to generate a basic network that can share useful information, thereby optimizing information flow and improving the performance of deep neural networks. In this paper, the deep neural network based on attention convolution module is proposed. The proposed module is mainly divided into two parts:the trunk branch and the soft branch. On the trunk branch, it consists of two sets of residual modules, making the module suitable for other deep neural networks. On the soft branch, the given intermediate feature map is taken along two dimensions (space and channel) to obtain the attention feature map, and the input intermediate feature map is adjusted to strengthen useful information to suppress useless information. The proposed convolution residual module can solve the problem of inconsistent input and output size, strengthen the key information of the image and effectively promote the information flow of the network. Experiments on the cifar-10, cifar-100, ck+, AVEC2017 data sets show that the proposed method applied to the resnet-50 network has a higher recognition accuracy(0.9%~1.2%) than the method proposed by Hu when the training time difference is less than 0.3%.
Keywords:image identification  residual module  attention  deep neural network  
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《计算机工程与应用》浏览原始摘要信息
点击此处可从《计算机工程与应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司    京ICP备09084417号-23

京公网安备 11010802026262号