Abstract:Few-shot segmentation is an emerging and intriguing subfield within computer vision that tackles the challenging task of segmenting objects or regions in images when only a very limited number of annotated examples are available for each category. Although some existing few-shot segmentation techniques have made notable strides in performance, the task of few-shot segmentation continues to pose significant challenges. Particularly in the context of delving into the actual pixel relation between support and query images, the relation often suffers from the interference due to complicated appearance, shape difference, etc. However, the accuracy of pixel relation concerns the effectiveness of category information transmission, which is crucial for segmenting the target objects. In addition, most models are prone to overfitting observed base classes during the training process, resulting in the learned model being unable to be generalized to a wider range of invisible classes. To this end, a novel framework named Combining Hierarchical Sparse Representation with Adaptive Prompt for Few-shot Segmentation (HSRap) is proposed to mine more precise pixel relations between the support and query images, enhancing the transfer of category information and improving the generalization ability of the model. Specifically, HSRap begins with the design of a hierarchical sparse representation module, which uncovers latent dense relation representations between the support and query images through the reconstruction of dense relations with sparse regularization. Following this, a lightweight adaptive prompt meta-learner is designed to generate multiple prompts for each category, ensuring generalizability to a broader range of unseen classes within the same dataset and reducing sensitivity to class shift. HSRap undergoes extensive testing on three challenging benchmarks: PASCAL-5i, COCO-20i, and FSS-1000. The results demonstrate a significant enhancement in the performance of the baseline, achieving competitive results compared to state-of-the-art methods.

Visual Prompting for Generalized Few-shot Segmentation: A Multi-scale Approach

Prompt-and-Transfer: Dynamic Class-aware Enhancement for Few-shot Segmentation

Prompt-Matched Semantic Segmentation

Learning Visual Prompts for Guiding the Attention of Vision Transformers

Visual In-Context Prompting

Learnable Prompt for Few-Shot Semantic Segmentation in Remote Sensing Domain

Class-Prompting Transformer for Incremental Semantic Segmentation

Learning Common and Specific Visual Prompts for Domain Generalization.

Combining Hierarchical Sparse Representation with Adaptive Prompt for Few-Shot Segmentation

Self-Prompting Perceptual Edge Learning for Dense Prediction

PartSeg: Few-shot Part Segmentation via Part-aware Prompt Learning

Semantic Prompt for Few-Shot Image Recognition

Explicit Visual Prompting for Universal Foreground Segmentations

Prediction Calibration for Generalized Few-shot Semantic Segmentation.

Unleashing the Power of Visual Prompting At the Pixel Level

Self-Prompting Large Vision Models for Few-Shot Medical Image Segmentation

Semantic Prompt Based Multi-Scale Transformer for Few-Shot Classification.

MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping

One-shot and Partially-Supervised Cell Image Segmentation Using Small Visual Prompt

ICPC: Instance-Conditioned Prompting with Contrastive Learning for Semantic Segmentation