首页 | 本学科首页   官方微博 | 高级检索  
     

基于孪生网络的快速视频目标分割
引用本文:付利华, 杨寒雪, 张博, 王俊翔, 吴会贤, 闫绍兴. 基于注意力修正的半监督视频目标分割[J]. 北京工业大学学报, 2022, 48(8): 822-829. DOI: 10.11936/bjutxb2020110025
作者姓名:付利华  杨寒雪  张博  王俊翔  吴会贤  闫绍兴
作者单位:1.北京工业大学信息学部,北京 100124
基金项目:北京市自然科学基金资助项目(4173072)
摘    要:

针对现有半监督视频目标分割方法不能同时满足分割精度和分割效率的问题,在传统半监督视频目标分割方法上引入注意力机制对分割结果进行修正. 首先,构建一个外观特征提取子网用于提取视频第1帧的特征图,并将其作为外观指导信息;然后,得到视频前一帧的分割结果,作为位置引导信息;最后,构建一个当前帧特征提取子网,以双分支的结构结合位置修正注意力与外观修正注意力,将位置信息和外观信息与当前帧特征图进行融合,实现目标分割. 实验结果表明,该目标分割方法可以纠正视频目标分割中的传播误差,并能有效提升分割精度.



关 键 词:视频目标分割  注意力机制  语义信息  通道注意力  空间注意力  半监督学习
收稿时间:2020-11-19
修稿时间:2021-02-02

Faster R-CNN: towards real-time object detection with region proposal networks
FU Lihua, YANG Hanxue, ZHANG Bo, WANG Junxiang, WU Huixian, YAN Shaoxing. Semi-supervised Video Target Segmentation Method Based on Attention Correction[J]. Journal of Beijing University of Technology, 2022, 48(8): 822-829. DOI: 10.11936/bjutxb2020110025
Authors:FU Lihua  YANG Hanxue  ZHANG Bo  WANG Junxiang  WU Huixian  YAN Shaoxing
Affiliation:1.Faculty of Information Technology, Beijing University of Technology, Beijing 100124, China
Abstract:To solve the problem that the existing semi-supervised video target segmentation methods cannot ensure segmentation accuracy and efficiency at the same time, an attention mechanism into the general semi-supervised video target segmentation method was introduced to modify segmentation results. First, an appearance feature extraction subnet was constructed to extract feature map of the first frame of video and it was used as appearance guidance information. Second, the segmentation result of the previous frame was obtained and used as position guidance information. Finally, a current frame feature extraction subnet was constructed, which combined position correction attention and appearance correction attention in a double branch structure, so as to integrate the position information and appearance information into the current frame feature map and accomplish the target segmentation. Experiments show that the target segmentation method can correct the propagation errors in video target segmentation and improve the segmentation accuracy.
Keywords:video object segmentation  attention mechanism  semantic information  channel attention  spatial attention  semi-supervised learning
点击此处可从《北京工业大学学报》浏览原始摘要信息
点击此处可从《北京工业大学学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号