面向表情识别的重影非对称残差注意力网络模型 A ghost asymmetric residual attention network model for facial expression recognition期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

面向表情识别的重影非对称残差注意力网络模型

引用本文：	闫河,李梦雪,张宇宁,刘建骐.面向表情识别的重影非对称残差注意力网络模型[J].智能系统学报,2023,18(2):333-340.

作者姓名：	闫河李梦雪张宇宁刘建骐

作者单位：	重庆理工大学两江人工智能学院，重庆 401135

摘要：	针对ResNet50中的Bottleneck经过1×1卷积降维后主干分支丢失部分特征信息而导致在表情识别中准确率不高的问题，本文通过引入Ghost模块和深度可分离卷积分别替换Bottleneck中的1×1卷积和3×3卷积，保留更多原始特征信息，提升主干分支的特征提取能力；利用Mish激活函数替换Bottleneck中的ReLU激活函数，提高了表情识别的准确率；在此基础上，通过在改进的Bottleneck之间添加非对称残差注意力模块(asymmetric residual attention block, ARABlock)来提升模型对重要信息的表示能力，从而提出一种面向表情识别的重影非对称残差注意力网络（ghost asymmetric residual attention network, GARAN）模型。对比实验结果表明，本文方法在FER2013和CK+表情数据集上具有较高的识别准确率。
关键词：	表情识别特征提取 ResNet50 Ghost模块 Mish 非对称残差注意力深度可分离卷积深度学习
A ghost asymmetric residual attention network model for facial expression recognition

YAN He,LI Mengxue,ZHANG Yuning,LIU Jianqi.A ghost asymmetric residual attention network model for facial expression recognition[J].CAAL Transactions on Intelligent Systems,2023,18(2):333-340.

Authors:	YAN He LI Mengxue ZHANG Yuning LIU Jianqi

Affiliation:	School of Artificial Intelligence, Chongqing University of Technology, Chongqing 401135, China

Abstract:	In this paper, a solution is proposed to address the low accuracy in facial expression recognition that results from the 1×1 convolution dimensionality reduction of the Bottleneck in ResNet50. To do so, the authors introduce the Ghost module and depth separable convolution to replace the 1×1 and 3×3 convolutions in the Bottleneck, respectively, in order to preserve more of the original feature information and improve the feature extraction ability of the trunk branch. The Mish activation function is also used to replace the ReLU activation function in the Bottleneck, further enhancing the accuracy of facial expression recognition. To further improve the ability of the model to express important information, the authors also introduce an asymmetric residual attention block (ARABlock) between the improved Bottlenecks. The proposed method, which is referred to as the ghost asymmetric residual attention network (GARAN) model, shows high recognition accuracy on the FER2013 and CK+ facial expression datasets based on comparative experimental results.

Keywords:	expression recognition feature extraction ResNet50 Ghost module Mish asymmetric residual attention depthwise separable convolution deep learning

	点击此处可从《智能系统学报》浏览原始摘要信息
	点击此处可从《智能系统学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏