OneSeg: Self-learning and One-shot Learning based Single-slice Annotation for 3D Medical Image Segmentation

Yixuan Wu,Bo Zheng,Jintai Chen,Danny Z. Chen,Jian Wu
2023-09-24
Abstract:As deep learning methods continue to improve medical image segmentation performance, data annotation is still a big bottleneck due to the labor-intensive and time-consuming burden on medical experts, especially for 3D images. To significantly reduce annotation efforts while attaining competitive segmentation accuracy, we propose a self-learning and one-shot learning based framework for 3D medical image segmentation by annotating only one slice of each 3D image. Our approach takes two steps: (1) self-learning of a reconstruction network to learn semantic correspondence among 2D slices within 3D images, and (2) representative selection of single slices for one-shot manual annotation and propagating the annotated data with the well-trained reconstruction network. Extensive experiments verify that our new framework achieves comparable performance with less than 1% annotated data compared with fully supervised methods and generalizes well on several out-of-distribution testing sets.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issue of reducing the need for annotated data in 3D medical image segmentation, particularly for 3D images, as annotating 3D images is very time-consuming and requires a lot of human resources. The paper proposes a method based on self-learning and one-shot learning, achieving efficient and accurate segmentation by annotating only one slice of each 3D image. This method aims to significantly reduce the annotation workload while maintaining segmentation accuracy competitive with fully supervised methods. Specifically, the paper proposes a two-step framework: 1. **Self-learning reconstruction network**: Learn the semantic correspondence between 2D slices within a 3D image. 2. **Representative slice selection**: Select the most representative single slice for manual annotation and use the trained reconstruction network to propagate the annotation data. Through this framework, the paper conducts experiments on multiple public datasets, verifying that the method can achieve performance comparable to fully supervised methods using less than 1% of the annotated data, and has good generalization ability across different distribution datasets.