首页 | 本学科首页   官方微博 | 高级检索  
     

基于多模型深度卷积网络融合的人群计数算法
引用本文:雷翰林,张宝华.基于多模型深度卷积网络融合的人群计数算法[J].激光技术,2019,43(4):476-481.
作者姓名:雷翰林  张宝华
作者单位:内蒙古科技大学 信息工程学院,包头,014010;内蒙古科技大学 信息工程学院,包头,014010
基金项目:国家自然科学基金;国家自然科学基金;国家海洋局青年基金;内蒙古杰出青年培育资助项目;内蒙古自治区高等学校青年科技英才支持计划;内蒙古自治区自然科学基金;科技创新基金
摘    要:为了避免景深和遮挡的干扰,提高人群计数的准确性,采用了LeNet-5,AlexNet和VGG-163种模型,提取图像中不同景深目标的特性,调整上述模型的卷积核尺寸和网络结构,并进行了模型融合.构造出一种基于多模型融合的深度卷积神经网络结构,网络最后两层采用卷积核大小为1×1的卷积层取代传统的全连接层,对提取的特征图进行信息整合并输出密度图,极大地降低了网络参量且取得了一定提升的数据,兼顾了算法效率和精度,进行了理论分析和实验验证.结果表明,在公开人群计数数据集shanghaitech两个子集和UCF_CC_50子集上,本文中计数方法的平均绝对误差和均方误差分别是97.99和158.02,23.36和41.86,354.27和491.68,取得比现有传统人群计数方法更好的性能;通过迁移实验证明所提出的人群计数模型具有良好的泛化能力.该研究对人群计数精度的提高是有帮助的.

关 键 词:图像处理  人群计数  多模型融合  深度卷积神经网络
收稿时间:2018-09-18

Crowd counting algorithm based on multi-model deep convolution network integration
LEI Hanlin,ZHANG Baohua.Crowd counting algorithm based on multi-model deep convolution network integration[J].Laser Technology,2019,43(4):476-481.
Authors:LEI Hanlin  ZHANG Baohua
Affiliation:(School of Information Engineering, Inner Mongolia University of Science and Technology, Baotou 014010, China)
Abstract:To avoid the interference of depth of field and occlusion and improve the accuracy of crowd counting, three models of LeNet-5, AlexNet and VGG-16 were adopted and the characteristics of objects with different depth of field in the image were extracted. After adjusting the convolution core size and network structure of the above model, model fusion was carried out. A deep convolution neural network structure based on multi-model fusion was constructed. In the last two layers of the network, the convolution layer with convolution core size of 1×1 was used to replace the traditional full connection layer. The extracted feature maps were integrated with information and the density maps were output. The network parameters were greatly reduced and some improved data was obtained. The efficiency and accuracy of the algorithm were taken into account. The theoretical analysis and experimental verification were carried out. The results show that, in public population counting data set of two subsets of shanghaitech and UCF_CC_50, the mean absolute error and mean square error of this method are 97.99 and 158.02, 23.36 and 41.86, 354.27 and 491.68, respectively. It achieves better performance than the existing traditional crowd counting methods. At the same time, migration experiments are carried out. It proves that the population counting model proposed in this paper has good generalization ability. This study is helpful to improve the accuracy of population counting.
Keywords:image processing  crowd counting  multi-model integration  deep convolution neural network
本文献已被 维普 万方数据 等数据库收录!
点击此处可从《激光技术》浏览原始摘要信息
点击此处可从《激光技术》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号