Domain-Rectifying Adapter for Cross-Domain Few-Shot Segmentation

Jiapeng Su,Qi Fan,Guangming Lu,Fanglin Chen,Wenjie Pei
2024-04-16
Abstract:Few-shot semantic segmentation (FSS) has achieved great success on segmenting objects of novel classes, supported by only a few annotated samples. However, existing FSS methods often underperform in the presence of domain shifts, especially when encountering new domain styles that are unseen during training. It is suboptimal to directly adapt or generalize the entire model to new domains in the few-shot scenario. Instead, our key idea is to adapt a small adapter for rectifying diverse target domain styles to the source domain. Consequently, the rectified target domain features can fittingly benefit from the well-optimized source domain segmentation model, which is intently trained on sufficient source domain data. Training domain-rectifying adapter requires sufficiently diverse target domains. We thus propose a novel local-global style perturbation method to simulate diverse potential target domains by perturbating the feature channel statistics of the individual images and collective statistics of the entire source domain, respectively. Additionally, we propose a cyclic domain alignment module to facilitate the adapter effectively rectifying domains using a reverse domain rectification supervision. The adapter is trained to rectify the image features from diverse synthesized target domains to align with the source domain. During testing on target domains, we start by rectifying the image features and then conduct few-shot segmentation on the domain-rectified features. Extensive experiments demonstrate the effectiveness of our method, achieving promising results on cross-domain few-shot semantic segmentation tasks. Our code is available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper aims to address the performance degradation issue caused by domain shift in Cross-Domain Few-Shot Segmentation (CD-FSS). Specifically, existing few-shot segmentation methods perform poorly when faced with new domains, especially those not seen during training. To overcome this challenge, the authors propose a novel approach that involves training a small adapter to adjust the diverse target domain features to align with the source domain, thereby leveraging the already optimized source domain segmentation model. ### Main Contributions 1. **Proposed a novel domain correction method**: Using a compact adapter to correct the diverse target domain features to the source domain, alleviating the overfitting problem in scenarios with limited training data. 2. **Proposed a unique local-global style perturbation module**: Enhancing the model's adaptability to different target domains by generating diverse target domain styles through perturbing feature channel statistics. 3. **Introduced a cyclic domain alignment loss**: Enhancing the domain correction effect, helping the adapter align the diverse domain styles to the source domain. ### Method Overview - **Local Domain Perturbation**: Synthesizing various domain styles by injecting Gaussian noise into the feature channel statistics of a single image. - **Global Domain Perturbation**: Perturbing the global style statistics of the entire dataset to improve the stability of domain style synthesis. - **Domain Correction Module**: Predicting two correction vectors through an adapter to restore the perturbed features to the source domain style. - **Cyclic Domain Alignment**: Constraining the adapter with cyclic alignment loss to ensure that features are corrected back to the source domain space from the target domain. ### Experimental Results - The proposed method significantly improves segmentation performance on multiple cross-domain few-shot segmentation benchmark datasets, such as FSS-1000, DeepGlobe, ISIC2018, and Chest X-ray. - Particularly on the Chest X-ray dataset, the 1-shot and 5-shot performances improved by 15.74% and 12.11% respectively compared to PATNet. - The experiments also demonstrated the effectiveness of each component and showed that the application of multiple adapters can further enhance performance.