首页 | 本学科首页   官方微博 | 高级检索  
     

融合双注意力机制的人群计数算法
引用本文:徐晓晨,葛艳,杜军威,陈卓.融合双注意力机制的人群计数算法[J].计算机系统应用,2023,32(1):241-248.
作者姓名:徐晓晨  葛艳  杜军威  陈卓
作者单位:青岛科技大学 信息科学技术学院, 青岛 266061
基金项目:山东省自然科学基金(ZR2021MF092)
摘    要:针对背景复杂、遮挡、人群分布不均等人群计数常见问题,提出了一种结合联合损失的空间-通道双注意力机制卷积神经网络模型(joint loss-based space-channel dual attention network, JL-SCDANet).该网络前端进行图像粗粒度特征提取,中间加入空间注意力机制以及通道注意力机制突出图像重点区域,后端使用可加大感受野且不丢失图像分辨率的空洞卷积提取深层二维特征.此外,该模型结合联合损失函数进行训练,以增强模型的鲁棒性.为了验证模型的改进效果,在3个公共数据集(ShanghaiTech Part B、mall和UCF_CC_50)上分别进行了对比实验,在ShanghaiTech Part B数据集中平均绝对误差(MAE)和均方误差(MSE)分别达到了8.13和13.13;在mall数据集中MAE、MSE达到了1.78和2.28;在UCF_CC_50数据集中MAE、MSE分别达到了182.12和210.24,实验结果证明了该网络在提高人数统计准确率上的有效性.

关 键 词:人群计数  人群密度图  卷积神经网络(CNN)  注意力机制  空洞卷积  深度学习
收稿时间:2022/5/16 0:00:00
修稿时间:2022/6/15 0:00:00

Crowd Counting Algorithm Based on Dual Attention Mechanism
XU Xiao-Chen,GE Yan,DU Jun-Wei,CHEN Zhuo.Crowd Counting Algorithm Based on Dual Attention Mechanism[J].Computer Systems& Applications,2023,32(1):241-248.
Authors:XU Xiao-Chen  GE Yan  DU Jun-Wei  CHEN Zhuo
Affiliation:School of Information Science and Technology, Qingdao University of Science and Technology, Qingdao 266061, China
Abstract:Given the common problems of crowd counting with a complex background, occlusion, and uneven crowd distribution, a joint loss-based space-channel dual attention network (JL-SCDANet) is proposed. The front end of the network extracts coarse-grained features of an image, and the spatial attention mechanism and channel attention mechanism are added in the middle to highlight the key areas of the image, while the back end uses dilated convolution that can increase the receptive field without losing the image resolution to extract deep two-dimensional features. In addition, the model is trained with the joint loss function to enhance its robustness. Comparative experiments are carried out on three public data sets (i.e., ShanghaiTech Part B, mall, and UCF_CC_50) to verify the improvement effect of the model. In terms of the mean absolute error (MAE) and mean square error (MSE), the results on ShanghaiTech Part B, mall, and UCF_CC_50 reach 8.13 and 13.13, 1.78 and 2.28, and 182.12 and 210.24, respectively. The experimental results prove the effectiveness of the network in improving the accuracy of population statistics.
Keywords:crowd counting  crowd density map  convolutional neural network (CNN)  attention mechanism  dilated convolution  deep learning
点击此处可从《计算机系统应用》浏览原始摘要信息
点击此处可从《计算机系统应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号