Echo from noise: synthetic ultrasound image generation using diffusion models for real image segmentation

David Stojanovski, Uxio Hermida, Pablo Lamata, Arian Beqiri, Alberto Gomez
2023-08-15
Abstract: We propose a novel pipeline for the generation of synthetic ultrasound images via Denoising Diffusion Probabilistic Models (DDPMs) guided by cardiac semantic label maps. We show that these synthetic images can serve as a viable substitute for real data in the training of deep-learning models for ultrasound image analysis tasks such as cardiac segmentation. To demonstrate the effectiveness of this approach, we generated synthetic 2D echocardiograms and trained a neural network for segmenting the left ventricle and left atrium. The performance of the network trained on exclusively synthetic images was evaluated on an unseen dataset of real images and yielded mean Dice scores of $88.6 \pm 4.91\%$, $91.9 \pm 4.22\%$ and $85.2 \pm 4.83\%$ for left ventricular endocardium, epicardium and left atrial segmentation respectively. This represents a relative increase of $9.2\%$, $3.3\%$ and $13.9\%$ in Dice scores compared to the previous state-of-the-art. The proposed pipeline has potential for application to a wide range of other tasks across various medical imaging modalities.
Image and Video Processing, Computer Vision and Pattern Recognition, Machine Learning
What problem does this paper attempt to address?
The problem this paper addresses is that deep-learning models for medical imaging, particularly echocardiography (echo), are hard to train well because high-quality labeled data is scarce. The authors propose a pipeline based on Denoising Diffusion Probabilistic Models (DDPMs), guided by cardiac semantic label maps, for generating synthetic ultrasound images. These synthetic images can substitute for real data when training deep-learning models for ultrasound image analysis tasks such as cardiac segmentation.

### Main problems and solutions

1. **Lack of high-quality labeled data**:
   - **Problem**: In medical image analysis, and in echocardiography in particular, high-quality labeled data is difficult and costly to obtain, which limits how well deep-learning models can be trained.
   - **Solution**: Synthetic ultrasound images generated with DDPMs are used to train deep-learning models, reducing the dependence on real labeled data.
2. **Limitations of existing generative models**:
   - **Problem**: Generative Adversarial Networks (GANs) and CycleGANs have had some success in generating synthetic images, but they suffer from unstable training and mode collapse.
   - **Solution**: DDPMs are more stable to train than GANs, less prone to mode collapse, and can generate higher-quality synthetic images.

### Experimental results

- **Performance evaluation**: The authors trained a neural network exclusively on the generated synthetic images to segment the left ventricle and left atrium. On an unseen dataset of real images, the model achieved the following mean Dice scores:
  - Left ventricular endocardium: 88.6 ± 4.91%
  - Left ventricular epicardium: 91.9 ± 4.22%
  - Left atrium: 85.2 ± 4.83%
- **Relative improvement**: Compared with the previous state of the art, these scores represent relative increases of 9.2%, 3.3% and 13.9%, respectively.

### Conclusion

This study shows that DDPMs can generate synthetic ultrasound images that serve as a viable substitute for real data: a segmentation network trained only on synthetic images performs well on unseen real images. The approach reduces the dependence on scarce labeled data, and the authors suggest it could be applied to other tasks across various medical imaging modalities.
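
The Dice scores reported above measure the overlap between a predicted segmentation and a reference segmentation: for one structure, Dice = 2|A ∩ B| / (|A| + |B|), where A and B are the sets of pixels assigned to that structure in the two label maps. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code. The function names, the toy 4x4 label maps, the label value `1` standing in for the left ventricle, and the convention for empty masks are all assumptions made for illustration. A small helper also shows what a relative (as opposed to absolute) increase means, using hypothetical numbers rather than values from the paper.

```python
def dice_score(pred, target, label):
    """Dice coefficient for one class label over two equally sized 2D label maps."""
    pred_mask = [p == label for row in pred for p in row]
    target_mask = [t == label for row in target for t in row]
    if len(pred_mask) != len(target_mask):
        raise ValueError("label maps must contain the same number of pixels")
    # Pixels that carry the label in both maps.
    intersection = sum(p and t for p, t in zip(pred_mask, target_mask))
    total = sum(pred_mask) + sum(target_mask)
    # Assumed convention: if the label is absent from both maps, count it as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


def relative_increase(new, old):
    """Relative change of `new` over `old`, in percent."""
    return 100.0 * (new - old) / old


# Toy label maps: 0 = background, 1 = hypothetical "left ventricle" label.
target = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 1, 0],  # one left-ventricle pixel missed
    [0, 0, 0, 0],
]

lv_dice = dice_score(pred, target, label=1)
print(f"Dice for label 1: {lv_dice:.3f}")

# Hypothetical scores: going from 80.0 to 88.0 is an 8-point absolute gain
# but a 10% relative increase.
print(f"Relative increase: {relative_increase(88.0, 80.0):.1f}%")
```

In the toy example the reference mask has 6 pixels, the prediction has 5, and all 5 overlap, so the Dice score is 10/11. A multi-class evaluation like the one summarized above would compute this per structure and per image, then report the mean and standard deviation across the test set.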