首页 | 本学科首页   官方微博 | 高级检索  
     

基于多尺度特征融合的小目标行人检测
引用本文:张思宇,张轶. 基于多尺度特征融合的小目标行人检测[J]. 计算机工程与科学, 2019, 41(9): 1627-1634
作者姓名:张思宇  张轶
作者单位:四川大学计算机学院,四川 成都 610065;四川大学视觉合成图形图像技术国家重点学科实验室,四川 成都 610065;四川大学计算机学院,四川 成都 610065;四川大学视觉合成图形图像技术国家重点学科实验室,四川 成都 610065
摘    要:针对SSD当前存在的小目标漏检以及误检问题,结合反卷积与特征融合思想,提出hgSSD模型。将原SSD特征层反卷积后与较浅层特征结合,实现复杂场景下小目标行人检测。为了保留浅层网络特征,提高算法实时性,节省计算资源,hgSSD模型基础网络使用VGG16,而非更深层的ResNet101。为了加强对小目标的检测,将VGG16中的Conv3_3改进为特征层加入训练。融合后的网络相对于SSD较为复杂,但基本保证实时性,且成功检测到大部分SSD网络漏检的小目标,检测精度相比于SSD模型也有提升。在选择框置信度得分阈值为0.3的情况下,基本检测到SSD漏检小目标。在VOC2007+2012中相对于SSD行人检测的Average Precision值从0.765提升为0.83。

关 键 词:小目标行人检测  多尺度预测  特征融合  反卷积神经网络  深度学习
收稿时间:2019-01-25
修稿时间:2019-09-25

Small target pedestrian detectionbased on multi-scale feature fusion
ZHANG Si-yu,ZHANG Yi. Small target pedestrian detectionbased on multi-scale feature fusion[J]. Computer Engineering & Science, 2019, 41(9): 1627-1634
Authors:ZHANG Si-yu  ZHANG Yi
Affiliation:(1.College of Computer Science,Sichuan University,Chengdu 610065;2.National Key Laboratory of Fundamental Science on Synthetic Vision,Sichuan University,Chengdu 610065,China)
Abstract:Given the problems of missing detection and detection failure for small targets in the single shot multibox detector (SSD), we propose an hourglass SSD model based on the idea of deconvolution and feature fusion, called hgSSD model. It deconvolutes the conventional SSD feature, which is then combined with shallower features to detect small target pedestrians in complex scenes. In order to preserve shallow network characteristics, ensure real-time detection and save computing resources, we use the VGG-16 instead of the deeper RestNet-101 as the basic network. In order to enhance the detection of small targets, Conv3_3 in VGG16 is improved as the feature layer added into the training. The fused network is more complex than the conventional SSD, but the real-time performance is basically guaranteed. It can successfully detect most of the small targets that are missed by the conventional SSD network, and the network has a higher accuracy than the conventional SSD model. In the case where the default box confidence threshold of 0.3, it basically detects the small targets undetected by the conventional SSD. In VOC 2007+2012, the pedestrian average precision value is increased from 0.765 to 0.83 in comparison with the conventional SSD.
Keywords:small target pedestrian detection  multi-scale prediction  feature fusion  deconvolutional neural network  deep learning  
本文献已被 万方数据 等数据库收录!
点击此处可从《计算机工程与科学》浏览原始摘要信息
点击此处可从《计算机工程与科学》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号