Abstract:Spinal surgery planning necessitates automatic segmentation of vertebrae in cone-beam computed tomography (CBCT), an intraoperative imaging modality that is widely used in intervention. However, CBCT images are of low-quality and artifact-laden due to noise, poor tissue contrast, and the presence of metallic objects, causing vertebra segmentation, even manually, a demanding task. In contrast, there exists a wealth of artifact-free, high quality CT images with vertebra annotations. This motivates us to build a CBCT vertebra segmentation model using unpaired CT images with annotations. To overcome the domain and artifact gaps between CBCT and CT, it is a must to address the three heterogeneous tasks of vertebra segmentation, artifact reduction and modality translation all together. To this, we propose a novel anatomy-aware artifact disentanglement and segmentation network (A$^3$DSegNet) that intensively leverages knowledge sharing of these three tasks to promote learning. Specifically, it takes a random pair of CBCT and CT images as the input and manipulates the synthesis and segmentation via different decoding combinations from the disentangled latent layers. Then, by proposing various forms of consistency among the synthesized images and among segmented vertebrae, the learning is achieved without paired (i.e., anatomically identical) data. Finally, we stack 2D slices together and build 3D networks on top to obtain final 3D segmentation result. Extensive experiments on a large number of clinical CBCT (21,364) and CT (17,089) images show that the proposed A$^3$DSegNet performs significantly better than state-of-the-art competing methods trained independently for each task and, remarkably, it achieves an average Dice coefficient of 0.926 for unpaired 3D CBCT vertebra segmentation.

Tube Masking-Based MAE Pre-Training for Three-Dimensional Lumbar Vertebrae Segmentation

DeU-Net: Deformable U-Net for 3D Cardiac MRI Video Segmentation

DeU-Net 2.0: Enhanced deformable U-Net for 3D cardiac cine MRI segmentation

C3D-UNET: A Comprehensive 3D Unet for Covid-19 Segmentation with Intact Encoding and Local Attention

Self Pre-training with Topology- and Spatiality-aware Masked Autoencoders for 3D Medical Image Segmentation

Revisiting MAE pre-training for 3D medical image segmentation

Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training

Medical supervised masked autoencoders: Crafting a better masking strategy and efficient fine-tuning schedule for medical image classification

Advancing Volumetric Medical Image Segmentation via Global-Local Masked Autoencoder

Prior-based 3D U-Net: A Model for Knee-Cartilage Segmentation in MRI Images

MDU-Net: A Convolutional Network for Clavicle and Rib Segmentation from a Chest Radiograph

Tissue-Contrastive Semi-Masked Autoencoders for Segmentation Pretraining on Chest CT

A$^3$DSegNet: Anatomy-aware artifact disentanglement and segmentation network for unpaired segmentation, artifact reduction, and modality translation

Multi-Scale Hybrid Attention Convolutional Neural Network for Automatic Segmentation of Lumbar Vertebrae From MRI

The role of mast cells, interleukin-13 and transient receptor potential channels in a mouse model of chemical-induced airway hyperresponsiveness

Dilated Densely Connected U-Net with Uncertainty Focus Loss for 3D ABUS Mass Segmentation.

Uncertainty Aware Temporal-Ensembling Model for Semi-Supervised ABUS Mass Segmentation

[Adrenalectomy under celioscopy. Experience of 25 operations].

Multi-Scale Masked 3-D U-Net For Brain Tumor Segmentation

Collaborative Learning for Annotation-Efficient Volumetric MR Image Segmentation

Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training