Combating Medical Label Noise via Robust Semi-supervised Contrastive Learning

Bingzhi Chen,Zhanhao Ye,Yishu Liu,Zheng Zhang,Jiahui Pan,Biqing Zeng,Guangming Lu
DOI: https://doi.org/10.1007/978-3-031-43907-0_54
2023-01-01
Abstract:Deep learning-based AI diagnostic models rely heavily on high-quality exhaustive-annotated data for algorithm training but suffer from noisy label information. To enhance the model's robustness and prevent noisy label memorization, this paper proposes a robust Semisupervised Contrastive Learning paradigm called SSCL, which can efficiently merge semi-supervised learning and contrastive learning for combating medical label noise. Specifically, the proposed SSCL framework consists of three well-designed components: the Mixup Feature Embedding (MFE) module, the Semi-supervised Learning (SSL) module, and the Similarity Contrastive Learning (SCL) module. By taking the hybrid augmented images as inputs, the MFE module with momentum update mechanism is designed to mine abstract distributed feature representations. Meanwhile, a flexible pseudo-labeling promotion strategy is introduced into the SSL module, which can refine the supervised information of the noisy data with pseudo-labels based on initial categorical predictions. Benefitting from the measure of similarity between classification distributions, the SCL module can effectively capture more reliable confident pairs, further reducing the effects of label noise on contrastive learning. Furthermore, a noise-robust loss function is also leveraged to ensure the samples with correct labels dominate the learning process. Extensive experiments on multiple benchmark datasets demonstrate the superiority of SSCL over state-of-the-art baselines. The code and pretrained models are publicly available at https://github.com/Binz-Chen/MICCAI2023 SSCL.
What problem does this paper attempt to address?