Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

Chunming He,Kai Li,Yachao Zhang,Guoxia Xu,Longxiang Tang,Yulun Zhang,Zhenhua Guo,Xiu Li

2023-05-18

Abstract:Weakly-Supervised Concealed Object Segmentation (WSCOS) aims to segment objects well blended with surrounding environments using sparsely-annotated data for model training. It remains a challenging task since (1) it is hard to distinguish concealed objects from the background due to the intrinsic similarity and (2) the sparsely-annotated training data only provide weak supervision for model learning. In this paper, we propose a new WSCOS method to address these two challenges. To tackle the intrinsic similarity challenge, we design a multi-scale feature grouping module that first groups features at different granularities and then aggregates these grouping results. By grouping similar features together, it encourages segmentation coherence, helping obtain complete segmentation results for both single and multiple-object images. For the weak supervision challenge, we utilize the recently-proposed vision foundation model, Segment Anything Model (SAM), and use the provided sparse annotations as prompts to generate segmentation masks, which are used to train the model. To alleviate the impact of low-quality segmentation masks, we further propose a series of strategies, including multi-augmentation result ensemble, entropy-based pixel-level weighting, and entropy-based image-level selection. These strategies help provide more reliable supervision to train the segmentation model. We verify the effectiveness of our method on various WSCOS tasks, and experiments demonstrate that our method achieves state-of-the-art performance on these tasks.

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The problem that this paper attempts to solve is Weakly - Supervised Concealed Object Segmentation (WSCOS). Specifically, the paper focuses on how to use sparsely - annotated data to train the model to identify and segment objects that are highly integrated with their surrounding environment. This task has two major challenges: 1. **Intrinsic Similarity**: There is a high degree of similarity between concealed objects and their backgrounds, which makes it difficult for the model to distinguish between the foreground and the background. 2. **Weak Supervision**: There are only sparse annotation points or lines in the training data, providing limited supervision information, which restricts the learning ability of the model. To solve these problems, the authors propose a new WSCOS method, which mainly includes the following aspects: - **Multi - scale Feature Grouping (MFG)**: By grouping features at different granularities and aggregating these grouping results, the consistency of segmentation is enhanced, so as to obtain more complete single - object or multi - object image segmentation results. - **SAM - based Pseudo - label Generation**: Utilize the recently proposed visual foundation model "Segment Anything Model (SAM)" to generate segmentation masks by using sparse annotations as prompts, which are used as pseudo - labels for training the model. - **Pseudo - label Improvement Strategies**: In order to improve the quality of pseudo - labels, the authors propose a series of strategies, including multi - enhanced result integration, entropy - based pixel - level weighting, and entropy - based image - level selection, to provide more reliable supervision information. Through these methods, the paper has been verified on multiple WSCOS tasks, and the experimental results show that this method has achieved state - of - the - art performance.

Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping

Weakly Supervised Instance Segmentation Using Multi-Prior Fusion.

Weakly Supervised Fine-Grained Semantic Segmentation Via Spatial Correlation-Guided Learning

WPS-SAM: Towards Weakly-Supervised Part Segmentation with Foundation Models

Weakly Supervised Semantic Segmentation with Consistency-Constrained Multi-Class Attention for Remote Sensing Scenes

Enhancing Weakly Supervised Semantic Segmentation with Multi-modal Foundation Models: An End-to-End Approach

Coupling Global Context and Local Contents for Weakly-Supervised Semantic Segmentation

Weakly Supervised Semantic Segmentation With Consistency-Constrained Multiclass Attention for Remote Sensing Scenes

Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation

Progressive Feature Self-reinforcement for Weakly Supervised Semantic Segmentation

Superpixel Consistency Saliency Map Generation for Weakly Supervised Semantic Segmentation of Remote Sensing Images

Attention Based Object Localization for Weakly Supervised Semantic Segmentation

Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling

WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

Group-Wise Learning for Weakly Supervised Semantic Segmentation

A Creative Weak Supervised Semantic Segmentation for Remote Sensing Images

A Weakly Supervised Semantic Segmentation Method Based on Local Superpixel Transformation

Foundation Model Assisted Weakly Supervised Semantic Segmentation

Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models

Looking Beyond Single Images for Weakly Supervised Semantic Segmentation Learning.

Boosting Weakly-Supervised Image Segmentation Via Representation, Transform, and Compensator