首页 | 本学科首页   官方微博 | 高级检索  
     

基于空间维度循环感知网络的密集人群计数模型
引用本文:付倩慧,李庆奎,傅景楠,王羽. 基于空间维度循环感知网络的密集人群计数模型[J]. 计算机应用, 2021, 41(2): 544-549. DOI: 10.11772/j.issn.1001-9081.2020050623
作者姓名:付倩慧  李庆奎  傅景楠  王羽
作者单位:北京信息科技大学 自动化学院, 北京 100192
基金项目:促进高校内涵发展-研究生科技创新项目
摘    要:考虑目前对具有透视畸变的高密度人群图像进行特征提取的局限性,提出了一种融合全局特征感知网络(GFPNet)和局部关联性特征感知网络(LAFPNet)的人群计数模型LMCNN.GFPNet是LMCNN的主干网络,将其输出的特征图进一步序列化并作为LAFPNet的输入,再利用循环神经网络(RNN)在时序维度上对局部关联性特...

关 键 词:人群计数  人群密度估计  卷积神经网络  多列卷积神经网络  长短时记忆神经网络
收稿时间:2020-05-12
修稿时间:2020-09-18

Dense crowd counting model based on spatial dimensional recurrent perception network
FU Qianhui,LI Qingkui,FU Jingnan,WANG Yu. Dense crowd counting model based on spatial dimensional recurrent perception network[J]. Journal of Computer Applications, 2021, 41(2): 544-549. DOI: 10.11772/j.issn.1001-9081.2020050623
Authors:FU Qianhui  LI Qingkui  FU Jingnan  WANG Yu
Affiliation:School of Automation, Beijing Information Science and Technology University, Beijing 100192, China
Abstract:Considering the limitations of the feature extraction of high-density crowd images with perspective distortion, a crowd counting model, named LMCNN, that combines Global Feature Perception Network (GFPNet) and Local Association Feature Perception Network (LAFPNet) was proposed. GFPNet was the backbone network of LMCNN, its output feature map was serialized and used as the input of LAFPNet. And the characteristic that the Recurrent Neural Network (RNN) senses the local association features on the time-series dimension was used to map the single spatial static feature to the feature space with local sequence association features, thus effectively reducing the impact of perspective distortion on crowd density estimation. To verify the effectiveness of the proposed model, experiments were conducted on Shanghaitech Part A and UCF_CC_50 datasets. The results show that compared to Atrous Convolutions Spatial Pyramid Network (ACSPNet), the Mean Absolute Error (MAE) of LMCNN was decreased by 18.7% and 20.3% at least, respectively, and the Mean Square Error (MSE) was decreased by 22.3% and 22.6% at least, respectively. The focus of LMCNN is the association between the front and back features on the spatial dimension, and by fully integrating the spatial dimension features and the sequence features in a single image, the crowd counting error caused by perspective distortion is reduced, and the number of people in dense areas can be more accurately predicted, thereby improving the regression accuracy of crowd density.
Keywords:crowd counting  crowd density estimation  Convolutional Neural Network (CNN)  Multi-column Convolutional Neural Network (MCNN)  Long Short-Term Memory (LSTM) neural network  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机应用》浏览原始摘要信息
点击此处可从《计算机应用》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号