Conditional Diffusion Models for Semantic 3D Brain MRI Synthesis

Zolnamar Dorjsembe,Hsing-Kuo Pao,Sodtavilan Odonchimed,Furen Xiao
DOI: https://doi.org/10.1109/JBHI.2024.3385504
2024-04-19
Abstract:Artificial intelligence (AI) in healthcare, especially in medical imaging, faces challenges due to data scarcity and privacy concerns. Addressing these, we introduce Med-DDPM, a diffusion model designed for 3D semantic brain MRI synthesis. This model effectively tackles data scarcity and privacy issues by integrating semantic conditioning. This involves the channel-wise concatenation of a conditioning image to the model input, enabling control in image generation. Med-DDPM demonstrates superior stability and performance compared to existing 3D brain imaging synthesis methods. It generates diverse, anatomically coherent images with high visual fidelity. In terms of dice score accuracy in the tumor segmentation task, Med-DDPM achieves 0.6207, close to the 0.6531 accuracy of real images, and outperforms baseline models. Combined with real images, it further increases segmentation accuracy to 0.6675, showing the potential of our proposed method for data augmentation. This model represents the first use of a diffusion model in 3D semantic brain MRI synthesis, producing high-quality images. Its semantic conditioning feature also shows potential for image anonymization in biomedical imaging, addressing data and privacy issues. We provide the code and model weights for Med-DDPM on our GitHub repository (<a class="link-external link-https" href="https://github.com/mobaidoctor/med-ddpm/" rel="external noopener nofollow">this https URL</a>) to support reproducibility.
Image and Video Processing,Computer Vision and Pattern Recognition,Machine Learning
What problem does this paper attempt to address?
The paper primarily aims to address the issues of data scarcity and privacy protection in the field of medical imaging, particularly in the synthesis of 3D semantic brain MRI. Specifically: - **Data Scarcity**: In medical imaging, high-quality data is often very limited, which restricts the training effectiveness of deep learning models. - **Privacy Protection**: Medical data contains sensitive information, and generating useful medical images while protecting privacy is a significant challenge. - **Quality of 3D Image Synthesis**: Existing Generative Adversarial Network (GAN) methods face numerous challenges in generating high-resolution, three-dimensional medical images, such as unstable training and mode collapse. To address these issues, the research team proposed the Med-DDPM model, a diffusion model-based approach that integrates segmentation masks to guide the image generation process. Med-DDPM is capable of generating anatomically consistent and visually faithful images and demonstrates excellent performance in tumor segmentation tasks, approaching the quality of real images. Additionally, the model shows potential in data augmentation and image anonymization. Experimental validation indicates that Med-DDPM not only surpasses existing GAN methods in synthesized image quality but also significantly enhances model performance in tumor segmentation tasks.