首页 | 本学科首页   官方微博 | 高级检索  
     

基于网格切分的单阶段实例分割方法
引用本文:王文海,李志琦,路通.基于网格切分的单阶段实例分割方法[J].软件学报,2023,34(6):2906-2921.
作者姓名:王文海  李志琦  路通
作者单位:计算机软件新技术国家重点实验室(南京大学), 江苏 南京 210023
基金项目:国家自然科学基金(61672273,61832008)
摘    要:近年来,与二阶段实例分割方法相比,单阶段实例分割方法由于实时性强,已在实际应用中取得了初步进展,但目前仍然存在以下两个主要缺点.(1)精度较低:单阶段方法缺少多轮优化环节,因此其精度离实际应用仍存在差距;(2)不够灵活:目前大多数单阶段方法是独立设计的,难以兼容实际应用中不同类型的物体检测框架,因此适用范围相对有限.提出了一种精确且灵活的单阶段实例分割框架——网格实例分割方法(GridMask),其中两个关键步骤如下:(1)为了提高实例分割精度,提出了一种网格切分二值化算法,将物体边界框内的区域划分为多个独立的网格,然后在每个网格上进行实例分割.该步骤将物体分割任务简化成了多个网格切片的分割,有效降低了特征表示的复杂程度,进而提高了实例分割的精度;(2)为了兼容不同的物体检测方法,设计了一个可以即插即用的子网络模块.该模块可以无缝地接入到目前大多数主流物体检测框架中,以增强这些方法的分割性能.所提方法在公共数据集MS COCO上取得了出色的性能,优于现有的大部分单阶段方法,甚至一些二阶段方法.

关 键 词:实例分割  物体检测  卷积神经网络  网格切分  计算机视觉
收稿时间:2021/6/4 0:00:00
修稿时间:2021/7/23 0:00:00

Grid Dividing for Single-stage Instance Segmentation
WANG Wen-Hai,LI Zhi-Qi,LU Tong.Grid Dividing for Single-stage Instance Segmentation[J].Journal of Software,2023,34(6):2906-2921.
Authors:WANG Wen-Hai  LI Zhi-Qi  LU Tong
Affiliation:State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
Abstract:In recent years, single-stage instance segmentation methods have made preliminary progress in real-world applications due to their high efficiency, but there are still two drawbacks compared to two-stage counterparts. (1) Low accuracy: the single-stage method does not have multiple rounds of refinement, so its accuracy is some distance away from real-world applications; (2) Low flexibility: most existing single-stage methods are specifically designed models, which are not compatible with object detectors. This study presents an accurate and flexible framework for single-stage instance segmentation, which contains the following two key designs. (1) To improve the accuracy of instance segmentation, a grid dividing binarization algorithm is proposed, where the bounding box region is firstly divided into several grid cells and then instance segmentation is performed on each grid cell. In this way, the original full-object segmentation task is simplified into the sub-tasks of grid cells, which significantly reduces the complexity of feature representation and further improves the instance segmentation accuracy; (2) To be compatible with object detectors, a plug-and-play module is designed, which can be seamlessly plugged into most existing object detection methods, thus enabling them to perform instance segmentation. The proposed method achieves excellent performance on the public dataset, such as MS COCO. It outperforms most existing single-stage methods and even some two-stage methods.
Keywords:instance segmentation  object detection  convolutional neural network (CNN)  grid dividing  computer vision
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号