Abstract:The construction of 3D medical image datasets presents several issues, including requiring significant financial costs in data collection and specialized expertise for annotation, as well as strict privacy concerns for patient confidentiality compared to natural image datasets. Therefore, it has become a pressing issue in 3D medical image segmentation to enable data-efficient learning with limited 3D medical data and supervision. A promising approach is pre-training, but improving its performance in 3D medical image segmentation is difficult due to the small size of existing 3D medical image datasets. We thus present the Primitive Geometry Segment Pre-training (PrimGeoSeg) method to enable the learning of 3D semantic features by pre-training segmentation tasks using only primitive geometric objects for 3D medical image segmentation. PrimGeoSeg performs more accurate and efficient 3D medical image segmentation without manual data collection and annotation. Further, experimental results show that PrimGeoSeg on SwinUNETR improves performance over learning from scratch on BTCV, MSD (Task06), and BraTS datasets by 3.7%, 4.4%, and 0.3%, respectively. Remarkably, the performance was equal to or better than state-of-the-art self-supervised learning despite the equal number of pre-training data. From experimental results, we conclude that effective pre-training can be achieved by looking at primitive geometric objects only. Code and dataset are available at

What problem does this paper attempt to address?

### Problems the Paper Attempts to Solve This paper aims to address several key issues in the construction of 3D medical image datasets, including the high cost of data collection, the specialized knowledge required for annotation, and the strict requirements for patient privacy protection. These issues lead to the scarcity of training data and the difficulty of annotation in 3D medical image segmentation. Therefore, how to achieve efficient data utilization and model training under limited 3D medical data and supervision has become an urgent problem to solve. To tackle this challenge, the authors propose the Primitive Geometry Segment Pre-training (PrimGeoSeg) method. This method pre-trains using synthetic data containing only basic geometric objects, thereby learning effective 3D semantic features without the need for manual data collection and annotation. Specifically, PrimGeoSeg constructs a pre-training dataset by generating and arranging basic geometric objects and pre-trains the segmentation task using these datasets. Experimental results show that compared to training from scratch, PrimGeoSeg significantly improves performance on multiple 3D medical image segmentation benchmark datasets and outperforms existing self-supervised learning methods with the same amount of pre-training data. ### Main Contributions 1. **No Real Data and Manual Annotation Required**: The PrimGeoSeg method is proposed, which can pre-train without relying on real data and manual annotation. 2. **Performance Improvement**: Experimental results show that UNETR and SwinUNETR models pre-trained with PrimGeoSeg outperform existing self-supervised learning methods in 3D medical image segmentation tasks on BTCV and MSD datasets. 3. **Data Efficiency**: Even when using less training data (e.g., 30% of the BTCV dataset), PrimGeoSeg's performance can match the effect of training from scratch with 100% of the data. 4. **Privacy Protection**: By using synthetic data for pre-training, privacy issues in the use of 3D medical images can be effectively reduced. ### Experimental Validation The authors conducted experiments on multiple 3D medical image segmentation benchmark datasets (such as BTCV, MSD, and BraTS) to verify the effectiveness of PrimGeoSeg. The experimental results show that PrimGeoSeg not only improves performance but also excels in data efficiency and privacy protection. In summary, this paper proposes the PrimGeoSeg method, effectively solving the problems of data scarcity and annotation difficulty in 3D medical image segmentation, providing new ideas and methods for research in this field.

Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation

Patch-Free 3D Medical Image Segmentation Driven by Super-Resolution Technique and Self-Supervised Guidance

Super-Resolution Based Patch-Free 3D Medical Image Segmentation with Self-Supervised Guidance

3D Graph-S<SUP>2</SUP>Net: Shape-Aware Self-ensembling Network for Semi-supervised Segmentation with Bilateral Graph Convolution

3D Reconstruction Based on Image Segmentation

GMIM: Self-supervised pre-training for 3D medical image segmentation with adaptive and hierarchical masked image modeling

BrainSegFounder: Towards 3D foundation models for neuroimage segmentation

Self-Supervised Pretraining for 2D Medical Image Segmentation

3D Self-Supervised Methods for Medical Imaging

Geo-Net: Geometry-Guided Pretraining for Tooth Point Cloud Segmentation

One Network to Segment Them All: A General, Lightweight System for Accurate 3D Medical Image Segmentation

AMAES: Augmented Masked Autoencoder Pretraining on Public Brain MRI Data for 3D-Native Segmentation

PGL: Prior-Guided Local Self-supervised Learning for 3D Medical Image Segmentation

Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation without Manual Labels

OneSeg: Self-learning and One-shot Learning based Single-slice Annotation for 3D Medical Image Segmentation

Accelerating 3D Medical Image Segmentation by Adaptive Small-Scale Target Localization

Mesh Segmentation for High Resolution Medical Data

Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images

AutoML Segmentation for 3D Medical Image Data: Contribution to the MSD Challenge 2018

SAM3D: Zero-Shot Semi-Automatic Segmentation in 3D Medical Images with the Segment Anything Model

Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation