首页 | 本学科首页   官方微博 | 高级检索  
     

基于多注意力图的孪生网络视觉目标跟踪
引用本文:齐天卉,张辉,李嘉锋,卓力. 基于多注意力图的孪生网络视觉目标跟踪[J]. 信号处理, 2020, 36(9): 1557-1566. DOI: 10.16798/j.issn.1003-0530.2020.09.021
作者姓名:齐天卉  张辉  李嘉锋  卓力
作者单位:北京工业大学信号与信息处理研究室
基金项目:国家自然科学基金(61602018,61971016);北京市自然科学基金-市教委联合资助项目(KZ201810005002;KZ201910005007)
摘    要:在视觉跟踪应用中,目标外观通常由包含目标的矩形区域来建模,这种矩形化边框的描述方式不可避免地引入了背景干扰,并随着场景变化导致跟踪关注点的模糊及歧义,进而产生跟踪漂移。针对以上问题,提出了一种基于多注意力图的孪生网络视觉目标跟踪算法。首先,建立了一种关注于前景目标区域特征表达的孪生网络。该网络通过构建梯度注意力图损失函数项来引导网络训练,提升网络区分目标和干扰背景的能力。此外,嵌入通道注意力和空间注意力进一步强化目标的特征表达,自动发掘有区分的特征表示。在多个公共数据集上的实验验证了提出算法的有效性,以及算法可完成实时的视觉目标跟踪。 

关 键 词:视觉目标跟踪   孪生网络   梯度引导反向传播   注意力机制
收稿时间:2020-06-10

Siamese Network with Multi-Attention Map for Visual Object Tracking
Affiliation:Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing University of Technology
Abstract:In visual object tracking, the appearance of the target is usually modeled by a bounding-box containing the target, which inevitably introduces background interference. As the scene changes, the concerns become blurred and ambiguous, and then produces tracking drift. Considering the above problems, a Siamese network with multi-attention map for visual object tracking is proposed. Firstly, a Siamese network focusing on foreground feature representation of target is established. The gradient attention loss function is constructed to guide network training and improve the ability of distinguishing target and interference background. In addition, embedding channel attention and spatial attention further strengthens the feature expression of the target, and automatically discovers the distinguished feature expression. Extensive experiments on benchmark datasets demonstrate that the proposed tracker performs favorably, and its ability to achieve real-time visual object tracking. 
Keywords:
点击此处可从《信号处理》浏览原始摘要信息
点击此处可从《信号处理》下载免费的PDF全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号