首页 | 本学科首页   官方微博 | 高级检索  
     

基于ResNet34_D改进YOLOv3模型的行人检测算法
引用本文:钱惠敏,陈纬,马宜龙,施非,项文波.基于ResNet34_D改进YOLOv3模型的行人检测算法[J].控制与决策,2022,37(7):1713-1720.
作者姓名:钱惠敏  陈纬  马宜龙  施非  项文波
作者单位:河海大学 能源与电气学院,南京 211100;南京理工大学 自动化学院, 南京 210094
基金项目:中央高校基本科研业务费专项基金项目(2018B15514).
摘    要:针对自动驾驶场景下行人检测任务中对中、小尺寸目标和被遮挡目标的检测需求,以及现有深度学习模型的不足,提出基于ResNet34_D的改进YOLOv3模型:通过改进残差网络的卷积块结构提出ResNet34_D,并作为YOLOv3的主干网络以降低模型尺寸和训练难度;在ResNet34_D的3个尺度卷积特征图之后,增加SPP层和DropBlock模块以提高模型的泛化能力;基于K-means聚类算法确定自适应的多尺度锚框尺寸,提高对大、中、小3种尺寸行人目标的检测能力;引入DIoU损失函数,提高对被遮挡目标的识别能力.所提出模型的消融实验验证了各个改进部分在提高模型检测准确率上的有效性.实验结果表明,所提出的基于ResNet34_D的改进YOLOv3模型具有较好的准确率和实时性,在BDD100K-Person数据集上的AP50达到69.8%,检测速度达到130 FPS.由所提出方法与现有目标检测方法的对比实验可知,所提出方法对小目标和遮挡目标的误检率更低,速度更快,具有一定的实际应用价值.

关 键 词:行人检测  深度学习  YOLOv3  ResNet34_D  DIoU

Pedestrian detection based on developed YOLOv3 with ResNet34_D
QIAN Hui-min,CHEN Wei,MA Yi-long,SHI Fei,XIANG Wen-bo.Pedestrian detection based on developed YOLOv3 with ResNet34_D[J].Control and Decision,2022,37(7):1713-1720.
Authors:QIAN Hui-min  CHEN Wei  MA Yi-long  SHI Fei  XIANG Wen-bo
Affiliation:College of Energy and Electrical Engineering,Hohai University,Nanjing 211100,China; College of Automation,Nanjing University of Science and Technology,Nanjing 210094,China
Abstract:Pedestrian detection is one of the main tasks of autonomous driving. The existed deep neural network is lack of the ability to detect small-size or medium-size objects and occluded objects, which is the requirement of pedestrian detection since pedestrians in the images acquired by vehicle-equipped cameras are always small or medium or occluded. In this paper, an improved YOLOv3 model based on ResNet34_D is proposed for pedestrian detection. And the contributions of the improved model are as follows. Firstly, the developed residual network ResNet34_D by modifying the structure of convolutional block is proposed, and it is selected as the backbone of YOLOv3 to reduce the size of the model so as to decrease the training difficulty. Secondly, the SPP layer and the DropBlock module are introduced after the feature maps of three stages of ResNet34_D, which can improve the detection accuracy of pedestrian objects with different sizes. Thirdly, to further increase the detection accuracy, the multi-scale anchors are determined using the K-means. Finally, the DIoU loss function is used to improve the ability of detecting the occluded objects. Ablation experiments for the proposed model demonstrate the effectiveness of each developed technologies in improving detection accuracy. And more experimental results show that the AP50 of the proposed model on BDD100K-Person dataset reaches 69.8%, and the detection speed can achieve 130FPS. Comparison experiments between the proposed method and the other existed methods demonstrate that, using the proposed method, the false detection rate for small targets and occlusion targets is lower, and the speed is faster, therefore, the proposed improved YOLOv3 model based on Resnet34_D is valuable in practical applications.
Keywords:
点击此处可从《控制与决策》浏览原始摘要信息
点击此处可从《控制与决策》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号