Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

Yunhao Bai,Duowen Chen,Qingli Li,Wei Shen,Yan Wang
2023-05-01
Abstract:In semi-supervised medical image segmentation, there exist empirical mismatch problems between labeled and unlabeled data distribution. The knowledge learned from the labeled data may be largely discarded if treating labeled and unlabeled data separately or in an inconsistent manner. We propose a straightforward method for alleviating the problem - copy-pasting labeled and unlabeled data bidirectionally, in a simple Mean Teacher architecture. The method encourages unlabeled data to learn comprehensive common semantics from the labeled data in both inward and outward directions. More importantly, the consistent learning procedure for labeled and unlabeled data can largely reduce the empirical distribution gap. In detail, we copy-paste a random crop from a labeled image (foreground) onto an unlabeled image (background) and an unlabeled image (foreground) onto a labeled image (background), respectively. The two mixed images are fed into a Student network and supervised by the mixed supervisory signals of pseudo-labels and ground-truth. We reveal that the simple mechanism of copy-pasting bidirectionally between labeled and unlabeled data is good enough and the experiments show solid gains (e.g., over 21% Dice improvement on ACDC dataset with 5% labeled data) compared with other state-of-the-arts on various semi-supervised medical image segmentation datasets. Code is available at <a class="link-external link-https" href="https://github.com/DeepMed-Lab-ECNU/BCP" rel="external noopener nofollow">this https URL</a>}.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the empirical mismatch between the distribution of labeled data and unlabeled data in semi - supervised medical image segmentation. Specifically, due to the small amount of labeled data and the high cost of acquisition, it is difficult to accurately estimate the distribution of the entire data set from a small amount of labeled data, resulting in a significant empirical distribution difference between a large amount of unlabeled data and a small amount of labeled data. This difference will cause the knowledge learned from the labeled data to be largely discarded when processing unlabeled data, thus affecting the performance of the model. To alleviate this problem, the paper proposes a Bidirectional Copy - Paste (BCP) method, which is implemented in a simple Mean Teacher architecture. By pasting randomly cropped regions (foregrounds) of labeled images onto unlabeled images (backgrounds), and pasting randomly cropped regions (foregrounds) of unlabeled images onto labeled images (backgrounds), mixed images are generated. These mixed images are input into the Student network and supervised by mixed supervision signals (pseudo - labels and real labels). This can encourage unlabeled data to learn comprehensive common semantics from labeled data and reduce the empirical distribution gap between labeled data and unlabeled data. The experimental results show that this method has achieved significant performance improvements on multiple semi - supervised medical image segmentation data sets. In particular, on the ACDC data set, when using 5% of the labeled data, the Dice coefficient has increased by more than 21%. This indicates that the BCP method can effectively alleviate the empirical distribution mismatch problem between labeled data and unlabeled data and improve the generalization ability and segmentation accuracy of the model.