Abstract:With the rapid advancement of modern hardware technology, breakthroughs have been made in many areas of artificial intelligence research, leading to the direction of machine replacement or assistance in various fields. However, most artificial intelligence or deep learning techniques require large amounts of training data and are typically applicable to a single task objective. Acquiring such large training datasets can be particularly challenging, especially in domains like medical imaging. In the field of image processing, few-shot image segmentation is an area of active research. Recent studies have employed deep learning and meta-learning approaches to enable models to segment objects in images with only a small amount of training data, allowing them to quickly adapt to new task objectives. This paper proposes a network architecture for meta-learning few-shot image segmentation, utilizing a meta-learning classification weight transfer network to generate masks for few-shot image segmentation. The architecture leverages pre-trained classification weight transfers to generate informative prior masks and employs pre-trained feature extraction architecture for feature extraction of query and support images. Furthermore, it utilizes a Feature Enrichment Module to adaptively propagate information from finer features to coarser features in a top-down manner for query image feature extraction. Finally, a classification module is employed for query image segmentation prediction. Experimental results demonstrate that compared to the baseline using the mean Intersection over Union (mIOU) as the evaluation metric, the accuracy increases by 1.7% in the one-shot experiment and by 2.6% in the five-shot experiment. Thus, compared to the baseline, the proposed architecture with meta-learning classification weight transfer network for mask generation exhibits superior performance in few-shot image segmentation.

Simpler is Better: Few-shot Semantic Segmentation with Classifier Weight Transformer

Dense Cross-Query-and-Support Attention Weighted Mask Aggregation for Few-Shot Segmentation

SemiCVT: Semi-Supervised Convolutional Vision Transformer for Semantic Segmentation

Iterative Few-shot Semantic Segmentation from Image Label Text

A lightweight siamese transformer for few-shot semantic segmentation

Less is More: Towards Efficient Few-shot 3D Semantic Segmentation via Training-free Networks

Few-Shot 3D Point Cloud Semantic Segmentation via Stratified Class-Specific Attention Based Transformer Network

Few-Shot Segmentation Via Cycle-Consistent Transformer

MSDNet: Multi-Scale Decoder for Few-Shot Semantic Segmentation via Transformer-Guided Prototyping

Few-Shot Image Segmentation Using Generating Mask with Meta-Learning Classifier Weight Transformer Network

Transductive meta-learning with enhanced feature ensemble for few-shot semantic segmentation

[CLS] Token is All You Need for Zero-Shot Semantic Segmentation

Cycle association prototype network for few-shot semantic segmentation

A New Local Transformation Module for Few-shot Segmentation

Feature-Proxy Transformer for Few-Shot Segmentation

STC: A Simple to Complex Framework for Weakly-Supervised Semantic Segmentation

Label-Efficient Few-Shot Semantic Segmentation with Unsupervised Meta-Training

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Differentiable Meta-learning Model for Few-shot Semantic Segmentation

SCNet: Enhancing Few-Shot Semantic Segmentation by Self-Contrastive Background Prototypes