Abstract:Few-shot semantic segmentation (FSS) aims to segment unseen objects in a query image using a few pixel-wise annotated support images, thus expanding the capabilities of semantic segmentation. The main challenge lies in extracting sufficient information from the limited support images to guide the segmentation process. Conventional methods typically address this problem by generating single or multiple prototypes from the support images and calculating their cosine similarity to the query image. However, these methods often fail to capture meaningful information for modeling the de facto joint distribution of pixel and category. Consequently, they result in incomplete segmentation of foreground objects and mis-segmentation of the complex background. To overcome this issue, we propose the Cross Gaussian Mixture Generative Model (CGMGM), a novel Gaussian Mixture Models~(GMMs)-based FSS method, which establishes the joint distribution of pixel and category in both the support and query images. Specifically, our method initially matches the feature representations of the query image with those of the support images to generate and refine an initial segmentation mask. It then employs GMMs to accurately model the joint distribution of foreground and background using the support masks and the initial segmentation mask. Subsequently, a parametric decoder utilizes the posterior probability of pixels in the query image, by applying the Bayesian theorem, to the joint distribution, to generate the final segmentation mask. Experimental results on PASCAL-5i and COCO-20i datasets demonstrate our CGMGM's effectiveness and superior performance compared to the state-of-the-art methods.

Bi-aggregation-aggregation and Self-Merging Network for Few-Shot Image Semantic Segmentation

Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation

CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation

Iterative Few-shot Semantic Segmentation from Image Label Text

Few-Shot Semantic Segmentation via Mask Aggregation

Self-Enhanced Mixed Attention Network for Three-Modal Images Few-Shot Semantic Segmentation

Few-shot object segmentation with a new feature aggregation module

APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

Multi-scale Self-similarity Network for Few-Shot Segmentation

A Self-Supervised Few-Shot Semantic Segmentation Method Based on Multi-Task Learning and Dense Attention Computation

Multi-Layer Features Based Self-Support Network for Few-Shot Segmentation

Multi-Similarity Enhancement Network for Few-Shot Segmentation.

Self-Support Matching Networks with Multiscale Attention for Few-shot Semantic Segmentation

Progressively Dual Prior Guided Few-shot Semantic Segmentation

SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive Background Prototypes

Self-Support Few-Shot Semantic Segmentation

Dual Branch Multi-Level Semantic Learning for Few-Shot Segmentation

Cycle association prototype network for few-shot semantic segmentation

Learning self-target knowledge for few-shot segmentation

A Self-Distillation Embedded Supervised Affinity Attention Model for Few-Shot Segmentation

Objectness-Aware Few-Shot Semantic Segmentation