Semi-supervised Medical Image Segmentation Method Based on Cross-pseudo Labeling Leveraging Strong and Weak Data Augmentation Strategies

Yifei Chen, Chenyan Zhang, Yifan Ke, Yiyu Huang, Xuezhou Dai, Feiwei Qin, Yongquan Zhang, Xiaodong Zhang, Changmiao Wang
2024-02-17
Abstract: Traditional supervised learning methods have historically encountered certain constraints in medical image segmentation due to the challenging collection process, high labeling cost, low signal-to-noise ratio, and complex features characterizing biomedical images. This paper proposes a semi-supervised model, DFCPS, which innovatively incorporates the Fixmatch concept. This significantly enhances the model's performance and generalizability through data augmentation processing, employing varied strategies for unlabeled data. Concurrently, the model design gives appropriate emphasis to the generation, filtration, and refinement processes of pseudo-labels. The novel concept of cross-pseudo-supervision is introduced, integrating consistency learning with self-training. This enables the model to fully leverage pseudo-labels from multiple perspectives, thereby enhancing training diversity. The DFCPS model is compared with both baseline and advanced models using the publicly accessible Kvasir-SEG dataset. Across all four subdivisions containing different proportions of unlabeled data, our model consistently exhibits superior performance. Our source code is available at
Computer Vision and Pattern Recognition, Artificial Intelligence
What problem does this paper attempt to address?
The paper addresses three challenges in medical image segmentation: scarce annotated data, high annotation costs, and the complex features of medical images. It proposes a semi-supervised neural network model named DFCPS (Dual Fixmatch Cross Pseudo Supervision), which builds on the Fixmatch concept and introduces a Cross Pseudo Supervision mechanism. By applying strong and weak data augmentation strategies to unlabeled data, DFCPS aims to improve segmentation performance and generalization ability when only a small amount of labeled data is available. The model is validated experimentally on the publicly available Kvasir-SEG dataset, where it outperforms baseline and other advanced models on all four subsets containing different proportions of unlabeled data. The paper also examines how different data augmentation strategies affect model performance and uses ablation experiments to validate the strategies it adopts.
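The interplay between weak/strong augmentation, pseudo-label filtering, and cross supervision can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes binary (foreground/background) segmentation, two networks A and B, a Fixmatch-style confidence threshold of 0.95, and a binary cross-entropy loss, none of which are specified in the text above. All function names are hypothetical, and per-pixel foreground probabilities are passed as flat Python lists to keep the example dependency-free.

```python
import math


def pseudo_labels(probs, threshold=0.95):
    """Turn weak-view foreground probabilities into hard pseudo-labels.

    Returns (labels, mask): labels[i] is 0 or 1, and mask[i] is True only
    when the prediction is confident enough to be kept (filtering step).
    The 0.95 threshold is an assumed value, not taken from the paper.
    """
    labels, mask = [], []
    for p in probs:
        labels.append(1 if p >= 0.5 else 0)
        mask.append(max(p, 1.0 - p) >= threshold)
    return labels, mask


def masked_bce(probs, labels, mask, eps=1e-7):
    """Mean binary cross-entropy over the pixels whose mask entry is True."""
    total, kept = 0.0, 0
    for p, y, keep in zip(probs, labels, mask):
        if not keep:
            continue
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1.0 - p))
        kept += 1
    return total / kept if kept else 0.0


def cross_pseudo_loss(weak_a, strong_a, weak_b, strong_b, threshold=0.95):
    """Assumed cross pseudo supervision loss on one unlabeled image.

    weak_a / weak_b:     predictions of networks A and B on the weakly
                         augmented view.
    strong_a / strong_b: predictions of A and B on the strongly
                         augmented view.
    Each network's strong-view prediction is supervised by the OTHER
    network's filtered weak-view pseudo-labels.
    """
    labels_a, mask_a = pseudo_labels(weak_a, threshold)
    labels_b, mask_b = pseudo_labels(weak_b, threshold)
    loss_a = masked_bce(strong_a, labels_b, mask_b)  # B teaches A
    loss_b = masked_bce(strong_b, labels_a, mask_a)  # A teaches B
    return loss_a + loss_b
```

In this sketch, pixels where the teaching network is uncertain contribute nothing to the loss, so noisy pseudo-labels are dropped rather than propagated. In a full training loop this term would typically be added, with some weighting, to an ordinary supervised loss on the labeled images.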