Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation

Ryu Tadokoro,Ryosuke Yamada,Kodai Nakashima,Ryo Nakamura,Hirokatsu Kataoka
2024-01-08
Abstract:The construction of 3D medical image datasets presents several issues, including requiring significant financial costs in data collection and specialized expertise for annotation, as well as strict privacy concerns for patient confidentiality compared to natural image datasets. Therefore, it has become a pressing issue in 3D medical image segmentation to enable data-efficient learning with limited 3D medical data and supervision. A promising approach is pre-training, but improving its performance in 3D medical image segmentation is difficult due to the small size of existing 3D medical image datasets. We thus present the Primitive Geometry Segment Pre-training (PrimGeoSeg) method to enable the learning of 3D semantic features by pre-training segmentation tasks using only primitive geometric objects for 3D medical image segmentation. PrimGeoSeg performs more accurate and efficient 3D medical image segmentation without manual data collection and annotation. Further, experimental results show that PrimGeoSeg on SwinUNETR improves performance over learning from scratch on BTCV, MSD (Task06), and BraTS datasets by 3.7%, 4.4%, and 0.3%, respectively. Remarkably, the performance was equal to or better than state-of-the-art self-supervised learning despite the equal number of pre-training data. From experimental results, we conclude that effective pre-training can be achieved by looking at primitive geometric objects only. Code and dataset are available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Attempts to Solve This paper aims to address several key issues in the construction of 3D medical image datasets, including the high cost of data collection, the specialized knowledge required for annotation, and the strict requirements for patient privacy protection. These issues lead to the scarcity of training data and the difficulty of annotation in 3D medical image segmentation. Therefore, how to achieve efficient data utilization and model training under limited 3D medical data and supervision has become an urgent problem to solve. To tackle this challenge, the authors propose the Primitive Geometry Segment Pre-training (PrimGeoSeg) method. This method pre-trains using synthetic data containing only basic geometric objects, thereby learning effective 3D semantic features without the need for manual data collection and annotation. Specifically, PrimGeoSeg constructs a pre-training dataset by generating and arranging basic geometric objects and pre-trains the segmentation task using these datasets. Experimental results show that compared to training from scratch, PrimGeoSeg significantly improves performance on multiple 3D medical image segmentation benchmark datasets and outperforms existing self-supervised learning methods with the same amount of pre-training data. ### Main Contributions 1. **No Real Data and Manual Annotation Required**: The PrimGeoSeg method is proposed, which can pre-train without relying on real data and manual annotation. 2. **Performance Improvement**: Experimental results show that UNETR and SwinUNETR models pre-trained with PrimGeoSeg outperform existing self-supervised learning methods in 3D medical image segmentation tasks on BTCV and MSD datasets. 3. **Data Efficiency**: Even when using less training data (e.g., 30% of the BTCV dataset), PrimGeoSeg's performance can match the effect of training from scratch with 100% of the data. 4. **Privacy Protection**: By using synthetic data for pre-training, privacy issues in the use of 3D medical images can be effectively reduced. ### Experimental Validation The authors conducted experiments on multiple 3D medical image segmentation benchmark datasets (such as BTCV, MSD, and BraTS) to verify the effectiveness of PrimGeoSeg. The experimental results show that PrimGeoSeg not only improves performance but also excels in data efficiency and privacy protection. In summary, this paper proposes the PrimGeoSeg method, effectively solving the problems of data scarcity and annotation difficulty in 3D medical image segmentation, providing new ideas and methods for research in this field.