Unsupervised Adaptation of Polyp Segmentation Models via Coarse-to-Fine Self-Supervision

Jiexiang Wang, Chaoqi Chen
DOI: https://doi.org/10.48550/arXiv.2308.06665
2023-08-13
Abstract: Unsupervised Domain Adaptation (UDA) has attracted a surge of interest over the past decade but remains difficult to use in real-world applications. Considering privacy-preservation issues and security concerns, in this work we study the practical problem of Source-Free Domain Adaptation (SFDA), which eliminates the reliance on annotated source data. Current SFDA methods focus on extracting domain knowledge from the source-trained model but neglect the intrinsic structure of the target domain. Moreover, they typically utilize pseudo labels for self-training in the target domain, but suffer from the notorious error accumulation problem. To address these issues, we propose a new SFDA framework, called Region-to-Pixel Adaptation Network (RPANet), which learns region-level and pixel-level discriminative representations through coarse-to-fine self-supervision. The proposed RPANet consists of two modules, Foreground-aware Contrastive Learning (FCL) and Confidence-Calibrated Pseudo-Labeling (CCPL), which explicitly address the key challenges of "how to distinguish" and "how to refine". Specifically, FCL introduces a supervised contrastive learning paradigm at the region level to contrast different region centroids across different target images, which efficiently involves all pseudo labels while remaining robust to noisy samples. CCPL designs a novel fusion strategy to reduce the overconfidence problem of pseudo labels by fusing two different target predictions without introducing any additional network modules. Extensive experiments on three cross-domain polyp segmentation tasks show that RPANet significantly outperforms state-of-the-art SFDA and UDA methods without access to source data, revealing the potential of SFDA in medical applications.
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses how to perform Source-Free Domain Adaptation (SFDA), that is, adaptation without access to source data, so that a colon polyp segmentation model performs better in the target domain. Existing unsupervised domain adaptation (UDA) methods typically require joint training on source and target data, and existing adaptation methods more broadly run into the following issues in practice:

1. **Privacy Protection and Security**: Medical data usually needs to be stored locally and cannot be shared across hospitals.
2. **Memory Overhead**: The source data is usually larger than the source-trained model, which increases the burden of storage and transmission.
3. **Model Adaptability**: Existing SFDA methods mainly rely on extracting domain knowledge from the source-trained model but ignore the intrinsic structure of the target domain, leading to poor performance in complex adaptation scenarios.

To address these issues, the authors propose a new SFDA framework called Region-to-Pixel Adaptation Network (RPANet). The framework improves the accuracy of colon polyp segmentation in the target domain through coarse-to-fine self-supervised learning, learning region-level and pixel-level discriminative representations together. RPANet consists of two modules:

1. **Foreground-aware Contrastive Learning (FCL)**: By contrasting region centroids across different target images, it learns region-level discriminative representations that separate foreground from background.
2. **Confidence-Calibrated Pseudo-Labeling (CCPL)**: By fusing two different target predictions, it reduces the overconfidence of pseudo-labels and thereby refines them gradually.

Experimental results show that RPANet significantly outperforms existing SFDA and UDA methods on three cross-domain colon polyp segmentation tasks, demonstrating its potential in practical medical applications.
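
The two modules can be illustrated with a rough NumPy sketch. This is not the authors' implementation, and the text above does not specify the exact formulas. The sketch assumes that region centroids come from masked average pooling over a pseudo-label mask, that the region-level contrast uses a standard supervised contrastive loss, and that the fusion is a plain average of two probability maps followed by a 0.5 threshold. All function names and parameters (`region_centroids`, `supcon_loss`, `fuse_predictions`, `temperature`, `threshold`) are hypothetical.

```python
import numpy as np


def region_centroids(features, pseudo_mask, eps=1e-8):
    """Masked average pooling (assumed): one centroid per region.

    features:    (C, H, W) feature map of one target image.
    pseudo_mask: (H, W) binary pseudo label (1 = foreground).
    Returns (foreground_centroid, background_centroid), each of shape (C,).
    """
    fg = pseudo_mask.astype(features.dtype)
    bg = 1.0 - fg
    fg_centroid = (features * fg).sum(axis=(1, 2)) / (fg.sum() + eps)
    bg_centroid = (features * bg).sum(axis=(1, 2)) / (bg.sum() + eps)
    return fg_centroid, bg_centroid


def supcon_loss(centroids, labels, temperature=0.1):
    """Supervised contrastive loss over centroids gathered from several images.

    centroids: (N, C) array; labels: (N,) array (1 = foreground, 0 = background).
    Centroids with the same label are pulled together, others pushed apart.
    """
    z = centroids / (np.linalg.norm(centroids, axis=1, keepdims=True) + 1e-8)
    sim = z @ z.T / temperature
    sim = sim - sim.max(axis=1, keepdims=True)  # shift for numerical stability
    not_self = ~np.eye(len(labels), dtype=bool)
    # Log-softmax over all other centroids (the anchor itself is excluded).
    denom = (np.exp(sim) * not_self).sum(axis=1, keepdims=True)
    log_prob = sim - np.log(denom)
    positives = (labels[:, None] == labels[None, :]) & not_self
    n_pos = np.maximum(positives.sum(axis=1), 1)  # avoid division by zero
    per_anchor = -(log_prob * positives).sum(axis=1) / n_pos
    return per_anchor.mean()


def fuse_predictions(prob_a, prob_b, threshold=0.5):
    """Assumed fusion rule: average two foreground-probability maps, then threshold."""
    fused = 0.5 * (prob_a + prob_b)
    pseudo_label = (fused > threshold).astype(np.uint8)
    return fused, pseudo_label
```

Pooling each region to a single centroid before contrasting is what makes the first step coarse: a few mislabeled pixels shift a centroid only slightly, which is one plausible reading of the robustness to noisy pseudo labels claimed above. Averaging two predictions pulls extreme probabilities toward the middle wherever the predictions disagree, which is the simplest way to temper overconfidence without adding network modules.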