首页 | 本学科首页   官方微博 | 高级检索  
     

基于反馈注意力机制和上下文融合的非模式实例分割
引用本文:董俊杰,刘华平,谢珺,续欣莹,孙富春.基于反馈注意力机制和上下文融合的非模式实例分割[J].智能系统学报,2021,16(4):801-810.
作者姓名:董俊杰  刘华平  谢珺  续欣莹  孙富春
作者单位:1. 太原理工大学 信息与计算机学院,山西 晋中 030600;2. 清华大学 智能技术与系统国家重点实验室,北京 100084;3. 太原理工大学 电气与动力工程学院,山西 太原 030024
摘    要:非模式实例分割是最近提出的对实例分割的扩展,其任务是对每个对象实例的可见区域和被遮挡区域都进行预测,感知完整的物理结构和语义概念。在预测对象被遮挡部分的形状和语义时,往往由于特征表示的识别能力不够和对上下文信息缺乏而导致对遮挡区域预测欠拟合甚至错误。针对这个问题,提出一个上下文注意模块和反馈注意力机制的特征金字塔结构,引入反馈连接进行再学习。该方法能够有效捕获全局语义信息和精细的空间细节,通过在COCO-amodal数据集训练和验证,非模式实例分割掩码平均精确率从8.4%提高到14.3%,平均召回率从16.6%提高到20.8%。实验结果表明,该方法能够显著提高对物体被遮挡部分预测的准确率,有效解决欠拟合问题。

关 键 词:非模式实例分割  遮挡预测  反馈连接  注意力机制  上下文信息  深度学习  神经网络  计算机视觉

Feedback attention mechanism and context fusion based amodal instance segmentation
DONG Junjie,LIU Huaping,XIE Jun,XU Xinying,SUN Fuchun.Feedback attention mechanism and context fusion based amodal instance segmentation[J].CAAL Transactions on Intelligent Systems,2021,16(4):801-810.
Authors:DONG Junjie  LIU Huaping  XIE Jun  XU Xinying  SUN Fuchun
Affiliation:1. College of Information and Computer, Taiyuan University of Technology, Jinzhong 030600, China;2. State Key Lab. of Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China;3. College of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan 030024, China
Abstract:Recently, model instance segmentation has been proposed as an extension of instance segmentation to predict the visible and occluded areas of each object instance and perceive the complete physical structure and semantic concepts. When the shapes and meanings of occluded objects are being predicted, underfitting or even wrong results are obtained in the occlusion prediction due to the insufficient recognition capability of feature representation and the lack of contextual information. To solve this problem, this paper proposes a contextual attention module and feature pyramid structure of feedback attention mechanism and introduces feedback connections for relearning. The proposed method can effectively capture global semantic information and fine spatial details. Through training and verification in the COCO-amodal dataset, the average precision of the amodal instance segmentation mask increases from 8.4% to 14.3%, and the average recall rate increases from 16.6% to 20.8%. Experimental results show that this method can significantly improve the accuracy of occlusion prediction and effectively end underfitting.
Keywords:amodal instance segmentation  occlusion prediction  feedback connection  attention mechanism  context information  deep learning  neural network  computer vision
点击此处可从《智能系统学报》浏览原始摘要信息
点击此处可从《智能系统学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号