首页 | 本学科首页   官方微博 | 高级检索  
     

区域损失函数的孪生网络目标跟踪
引用本文:吴贵山,,林淑彬,,钟江华,杨文元,.区域损失函数的孪生网络目标跟踪[J].智能系统学报,2020,15(4):722-731.
作者姓名:吴贵山    林淑彬    钟江华  杨文元  
作者单位:1. 闽南师范大学 计算机学院,福建 漳州 363000;2. 闽南师范大学 福建省粒计算及其应用重点实验室,福建 漳州 363000;3. 闽南师范大学 信息与网络中心,福建 漳州 363000
摘    要:针对预训练卷积神经网络提取的深度特征空间分辨率低,快速运动造成运动目标空间细节信息丢失等问题,提出用区域损失函数构建孪生网络的目标跟踪,进一步降低深度特征通道之间的冗余性,并减少高层信息丢失。利用线下预训练的VGG-16卷积神经网络提取深度特征,构成初始深度特征空间。通过区域损失函数构建特征和尺度选择网络,根据反向传播的梯度大小进行特征选择。对筛选后的特征进行拼接,融入到孪生网络中匹配跟踪。在OTB-2013、OTB-2015、VOT2016、TempleColor数据集上与其他算法对比。实验结果表明,该算法在快速运动、低分辨率等场景中表现出较好的跟踪精度和鲁棒性。

关 键 词:计算机视觉  目标跟踪  区域损失  深度特征  孪生网络  卷积神经网络  反向传播  VGG网络

Regional loss function based siamese network for object tracking
WU Guishan,,LIN Shubin,,ZHONG Jianghua,YANG Wenyuan,.Regional loss function based siamese network for object tracking[J].CAAL Transactions on Intelligent Systems,2020,15(4):722-731.
Authors:WU Guishan    LIN Shubin    ZHONG Jianghua  YANG Wenyuan  
Affiliation:1. School of Computer Science, Minnan Normal University, Zhangzhou 363000, China;2. Fujian Key Laboratory of Granular Computing and Application, Minnan Normal University, Zhangzhou 363000, China;3. Information and Network Center, Minnan Normal University, Zhangzhou 363000, China
Abstract:Due to the low spatial resolution of deep features extracted by pre-trained convolutional neural network, fast motion causes loss of spatial details of a moving object. This paper proposes a method to construct a siamese network for object tracking, so as to reduce the redundancy between the deep feature channels and the loss of high-level information. First, the VGG-16 convolutional neural network is trained offline to extract deep features and form the initial deep feature space. And then, the regional loss function is used to construct the feature and scale selection network. The feature is selected according to the gradient size of back propagation. Further, the selected features are spliced and integrated into the siamese network for matching tracking. By comparing OTB-2013, OTB-2015, VOT2016 and TempleColor benchmark datasets with other algorithms, it shows that the algorithm has preferable precision and robustness in the challenging scenarios such as fast motion and low resolution.
Keywords:computer vision  object tracking  regional loss  depth features  siamese network  convolutional neural network  back propagation  VGG network
点击此处可从《智能系统学报》浏览原始摘要信息
点击此处可从《智能系统学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号