首页 | 本学科首页   官方微博 | 高级检索  
     

融合注意力机制的高效率网络车型识别
引用本文:柳长源,何先平,毕晓君. 融合注意力机制的高效率网络车型识别[J]. 浙江大学学报(工学版), 2022, 56(4): 775-782. DOI: 10.3785/j.issn.1008-973X.2022.04.017
作者姓名:柳长源  何先平  毕晓君
作者单位:1. 哈尔滨理工大学 测控技术与通信工程学院,黑龙江 哈尔滨 1500802. 中央民族大学 信息工程学院,北京 100081
基金项目:国家自然科学基金资助项目(51779050); 黑龙江省自然科学基金资助项目(F2016022)
摘    要:为了解决现有的车型识别算法对车型特征描述不充分的情况,提出融合注意力机制的高效率网络车型识别算法. 利用高效率网络中的复合缩放方式来平衡网络的深度、宽度和分辨率,将深度可分离卷积集成到基础特征提取模块中来提高模型准确率. 增加双通道的残差注意力机制来关注图片中的关键信息,获得含有更加丰富语义信息的特征图. 在网络的末端添加单独的softmax分类器,使用标签平滑正则化对损失函数进行处理,减小模型过拟合的问题. 在BIT-Vehicles数据集上进行实验,结果表明,提出方法的平均分类准确率为96.83%,较改进前的模型提高了1.11%,优于现有DCNN、Faster-CNN的改进算法,较Faster R-CNN提升了7.16%.

关 键 词:车型识别  高效率网络  残差注意力机制  标签平滑正则化  深度可分离卷积  

Efficient network vehicle recognition combined with attention mechanism
Chang-yuan LIU,Xian-ping HE,Xiao-jun BI. Efficient network vehicle recognition combined with attention mechanism[J]. Journal of Zhejiang University(Engineering Science), 2022, 56(4): 775-782. DOI: 10.3785/j.issn.1008-973X.2022.04.017
Authors:Chang-yuan LIU  Xian-ping HE  Xiao-jun BI
Abstract:An efficient network vehicle recognition algorithm combined with attention mechanism was proposed in order to solve the problem that the existing vehicle type recognition algorithm does not adequately describe the vehicle type characteristics. The depth, width and resolution of the network were balanced by the compound scaling method in the efficient network, and the depth separable convolution was integrated into the basic feature extraction module in order to improve the accuracy of the model. The residual attention mechanism of two channels was added to pay attention to the key information in the picture, and the feature map with richer semantic information was obtained. A separate softmax classifier was added at the end of the network, and the label smoothing regularization was used to deal with the loss function in order to reduce the problem of model over-fitting. Experiments on BIT-Vehicles data set showed that the average classification precision of the proposed method was 96.83%, which was 1.11% higher than that of the original model, and was better than the existing improved algorithms of DCNN and Faster-CNN and 7.16% higher than Faster R-CNN.
Keywords:vehicle type identification  high efficiency network  residual attention mechanism  label smoothing regularization  depth separable convolution  
点击此处可从《浙江大学学报(工学版)》浏览原始摘要信息
点击此处可从《浙江大学学报(工学版)》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号