DCAM: Disturbed class activation maps for weakly supervised semantic segmentation
Affiliation: 1. Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai, 200237, PR China; 2. Department of Computer Science and Engineering, State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, Shanghai, 200237, PR China; 3. Business Intelligence and Visualization Research Center, National Engineering Laboratory for Big Data Distribution and Exchange Technologies, Shanghai, 200436, PR China; 4. Shanghai Engineering Research Center of Big Data & Internet Audience, Shanghai, 200072, PR China; 5. Innovation College North-Chiang Mai University, 169 Moo3, Nong Kaew, Hang Dong, Chiang Mai 50230, Thailand; 6. International College of Digital Innovation, Chiang Mai University, Chiang Mai, 50200, Thailand
Abstract: In the field of weakly supervised semantic segmentation (WSSS), Class Activation Maps (CAM) are typically adopted to generate pseudo masks. Yet, we find that the crux of the unsatisfactory pseudo masks is the incomplete CAM. Specifically, as convolutional neural networks tend to be dominated by the specific regions in the high-confidence channels of feature maps during prediction, the extracted CAM contains only parts of the object. To address this issue, we propose the Disturbed CAM (DCAM), a simple yet effective method for WSSS. Following CAM, we adopt a binary cross-entropy (BCE) loss to train a multi-label classification model. Then, we disturb the feature map with retraining to enhance the high-confidence channels. In addition, a softmax cross-entropy (SCE) loss branch is employed to increase the model's attention to the target classes. Once converged, we extract DCAM in the same way as in CAM. The evaluation on both PASCAL VOC and MS COCO shows that DCAM not only generates high-quality masks (6.2% and 1.4% higher than the benchmark models), but also enables more accurate activation in object regions. The code is available at https://github.com/gyyang23/DCAM.
Keywords: Weakly supervised semantic segmentation; Class activation map; Image-level class label; Disturbance injection
This article is indexed in ScienceDirect and other databases.
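
Illustrative sketch: the abstract builds on three standard ingredients, namely CAM extraction, a multi-label BCE loss, and a softmax cross-entropy (SCE) loss. The pure-Python code below is a minimal sketch of those generic building blocks only, assuming the usual CAM formulation (a classifier-weighted sum of feature-map channels). The function names are hypothetical, and the feature disturbance step and the way the SCE branch is wired into training are not specified in the abstract, so they are not reproduced here; the authors' repository linked above is the reference for the actual method.

```python
import math


def class_activation_map(features, class_weights):
    """Generic CAM for one class: weighted sum of feature-map channels.

    features: list of C channels, each an H x W nested list of floats.
    class_weights: list of C classifier weights for the target class.
    Returns an H x W map, clipped at zero and scaled to [0, 1].
    """
    height, width = len(features[0]), len(features[0][0])
    cam = [[0.0] * width for _ in range(height)]
    for channel, weight in zip(features, class_weights):
        for i in range(height):
            for j in range(width):
                cam[i][j] += weight * channel[i][j]
    # Keep positive evidence only, then normalize by the peak value.
    cam = [[max(value, 0.0) for value in row] for row in cam]
    peak = max(max(row) for row in cam)
    if peak > 0:
        cam = [[value / peak for value in row] for row in cam]
    return cam


def bce_loss(logits, targets):
    """Multi-label binary cross-entropy on raw logits, averaged over classes."""
    total = 0.0
    for z, y in zip(logits, targets):
        # Numerically stable form of -[y*log(s(z)) + (1-y)*log(1-s(z))].
        total += max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))
    return total / len(logits)


def sce_loss(logits, target_index):
    """Softmax cross-entropy for a single target class (log-sum-exp form)."""
    shift = max(logits)
    log_sum = shift + math.log(sum(math.exp(z - shift) for z in logits))
    return log_sum - logits[target_index]


if __name__ == "__main__":
    # Two 2x2 channels with equal weights (toy values, for illustration only).
    toy_features = [[[1.0, 0.0], [0.0, 0.0]], [[0.0, 2.0], [0.0, 0.0]]]
    print(class_activation_map(toy_features, [1.0, 1.0]))
    print(bce_loss([0.0, 0.0], [1, 0]), sce_loss([0.0, 0.0], 0))
```

In a WSSS pipeline, a map like the one returned by `class_activation_map` is typically thresholded to form a pseudo mask; any specific threshold would be a tuning choice and is not stated in the abstract.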