基于反馈注意力机制和上下文融合的非模式实例分割 Feedback attention mechanism and context fusion based amodal instance segmentation期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

基于反馈注意力机制和上下文融合的非模式实例分割

引用本文：	董俊杰,刘华平,谢珺,续欣莹,孙富春.基于反馈注意力机制和上下文融合的非模式实例分割[J].智能系统学报,2021,16(4):801-810.

作者姓名：	董俊杰刘华平谢珺续欣莹孙富春

作者单位：	1. 太原理工大学信息与计算机学院，山西晋中 030600;2. 清华大学智能技术与系统国家重点实验室，北京 100084;3. 太原理工大学电气与动力工程学院，山西太原 030024

摘要：	非模式实例分割是最近提出的对实例分割的扩展，其任务是对每个对象实例的可见区域和被遮挡区域都进行预测，感知完整的物理结构和语义概念。在预测对象被遮挡部分的形状和语义时，往往由于特征表示的识别能力不够和对上下文信息缺乏而导致对遮挡区域预测欠拟合甚至错误。针对这个问题，提出一个上下文注意模块和反馈注意力机制的特征金字塔结构，引入反馈连接进行再学习。该方法能够有效捕获全局语义信息和精细的空间细节，通过在COCO-amodal数据集训练和验证，非模式实例分割掩码平均精确率从8.4%提高到14.3%，平均召回率从16.6%提高到20.8%。实验结果表明，该方法能够显著提高对物体被遮挡部分预测的准确率，有效解决欠拟合问题。
关键词：	非模式实例分割遮挡预测反馈连接注意力机制上下文信息深度学习神经网络计算机视觉
Feedback attention mechanism and context fusion based amodal instance segmentation

DONG Junjie,LIU Huaping,XIE Jun,XU Xinying,SUN Fuchun.Feedback attention mechanism and context fusion based amodal instance segmentation[J].CAAL Transactions on Intelligent Systems,2021,16(4):801-810.

Authors:	DONG Junjie LIU Huaping XIE Jun XU Xinying SUN Fuchun

Affiliation:	1. College of Information and Computer, Taiyuan University of Technology, Jinzhong 030600, China;2. State Key Lab. of Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China;3. College of Electrical and Power Engineering, Taiyuan University of Technology, Taiyuan 030024, China

Abstract:	Recently, model instance segmentation has been proposed as an extension of instance segmentation to predict the visible and occluded areas of each object instance and perceive the complete physical structure and semantic concepts. When the shapes and meanings of occluded objects are being predicted, underfitting or even wrong results are obtained in the occlusion prediction due to the insufficient recognition capability of feature representation and the lack of contextual information. To solve this problem, this paper proposes a contextual attention module and feature pyramid structure of feedback attention mechanism and introduces feedback connections for relearning. The proposed method can effectively capture global semantic information and fine spatial details. Through training and verification in the COCO-amodal dataset, the average precision of the amodal instance segmentation mask increases from 8.4% to 14.3%, and the average recall rate increases from 16.6% to 20.8%. Experimental results show that this method can significantly improve the accuracy of occlusion prediction and effectively end underfitting.

Keywords:	amodal instance segmentation occlusion prediction feedback connection attention mechanism context information deep learning neural network computer vision

	点击此处可从《智能系统学报》浏览原始摘要信息
	点击此处可从《智能系统学报》下载全文

设为首页 | 免责声明 | 关于勤云 | 加入收藏