Abstract:We propose a novel pipeline for the generation of synthetic ultrasound images via Denoising Diffusion Probabilistic Models (DDPMs) guided by cardiac semantic label maps. We show that these synthetic images can serve as a viable substitute for real data in the training of deep-learning models for ultrasound image analysis tasks such as cardiac segmentation. To demonstrate the effectiveness of this approach, we generated synthetic 2D echocardiograms and trained a neural network for segmenting the left ventricle and left atrium. The performance of the network trained on exclusively synthetic images was evaluated on an unseen dataset of real images and yielded mean Dice scores of 88.6 $\pm 4.91$ , 91.9 $\pm 4.22$, 85.2 $\pm 4.83$ \% for left ventricular endocardium, epicardium and left atrial segmentation respectively. This represents a relative increase of $9.2$, $3.3$ and $13.9$ \% in Dice scores compared to the previous state-of-the-art. The proposed pipeline has potential for application to a wide range of other tasks across various medical imaging modalities.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the insufficient training of deep - learning models in medical imaging, especially in the field of echocardiography (Echocardiography, echo), due to the lack of high - quality labeled data. Specifically, the authors propose a new pipeline based on Denoising Diffusion Probabilistic Models (DDPMs) for generating synthetic ultrasound images. These synthetic images can be used as effective substitutes for real data to train deep - learning models to perform ultrasound image analysis tasks, such as heart segmentation. ### Main problems and solutions 1. **Lack of high - quality labeled data**: - **Problem**: In medical image analysis, especially in echocardiography, it is very difficult and costly to obtain high - quality labeled data. This limits the training effect of deep - learning models. - **Solution**: By using DDPMs to generate synthetic ultrasound images, these synthetic images can be used to train deep - learning models, thereby reducing the dependence on real - labeled data. 2. **Limitations of existing generative models**: - **Problem**: Although existing Generative Adversarial Networks (GANs) and CycleGANs have achieved certain success in generating synthetic images, they have problems such as unstable training and mode collapse. - **Solution**: DDPMs are more stable than GANs, less prone to mode collapse, and can generate higher - quality synthetic images. ### Experimental results - **Performance evaluation**: The authors trained a neural network using the generated synthetic images to segment the left ventricle and left atrium. The performance of this model on unseen real images is as follows: - The average Dice score for left ventricular endocardial segmentation is 88.6 ± 4.91% - The average Dice score for left ventricular epicardial segmentation is 91.9 ± 4.22% - The average Dice score for left atrial segmentation is 85.2 ± 4.83% - **Relative improvement**: Compared with previous methods, these results respectively increase the Dice scores by 9.2%, 3.3% and 13.9%. ### Conclusion This study demonstrates the effectiveness of using DDPMs to generate synthetic ultrasound images and proves that these synthetic images can significantly improve the performance of deep - learning models on real data. This method not only solves the problem of lack of high - quality labeled data, but also provides potential application prospects for tasks in other medical imaging modalities.

Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation

Efficient Semantic Diffusion Architectures for Model Training on Synthetic Echocardiograms

EchoDFKD: Data-Free Knowledge Distillation for Cardiac Ultrasound Segmentation using Synthetic Data

An efficient annotated data generation method for echocardiographic image segmentation

Echocardiography Segmentation Based on Cross-modal Data Augmentation Method

Denoising Plane Wave Ultrasound Images Using Diffusion Probabilistic Models

Transesophageal Echocardiography Generation using Anatomical Models

Synthetic Boost: Leveraging Synthetic Data for Enhanced Vision-Language Segmentation in Echocardiography

A Domain Translation Framework with an Adversarial Denoising Diffusion Model to Generate Synthetic Datasets of Echocardiography Images

Report from the Finnish Cancer Registry.

A Data Augmentation Pipeline to Generate Synthetic Labeled Datasets of 3D Echocardiography Images using a GAN

Denoising transthoracic echocardiographic images in regional wall motion abnormality using deep learning techniques

Paired Diffusion: Generation of related, synthetic PET-CT-Segmentation scans using Linked Denoising Diffusion Probabilistic Models

Echocardiography video synthesis from end diastolic semantic map via diffusion model

Deep Ultrasound Denoising Using Diffusion Probabilistic Models

Cardiac ultrasound simulation for autonomous ultrasound navigation

Ultrasound Imaging based on the Variance of a Diffusion Restoration Model

EchoNet-Synthetic: Privacy-preserving Video Generation for Safe Medical Data Sharing

The Proteome Analyst Suite of Automated Function Prediction Tools

A Disentanglement and Fusion Data Augmentation Approach for Echocardiography Segmentation