Polyp-DDPM: Diffusion-Based Semantic Polyp Synthesis for Enhanced Segmentation

Zolnamar Dorjsembe, Hsing-Kuo Pao, Furen Xiao
2024-02-07
Abstract:This study introduces Polyp-DDPM, a diffusion-based method for generating realistic images of polyps conditioned on masks, aimed at enhancing the segmentation of gastrointestinal (GI) tract polyps. Our approach addresses the challenges of data limitations, high annotation costs, and privacy concerns associated with medical images. By conditioning the diffusion model on segmentation masks-binary masks that represent abnormal areas-Polyp-DDPM outperforms state-of-the-art methods in terms of image quality (achieving a Frechet Inception Distance (FID) score of 78.47, compared to scores above 83.79) and segmentation performance (achieving an Intersection over Union (IoU) of 0.7156, versus less than 0.6694 for synthetic images from baseline models and 0.7067 for real data). Our method generates a high-quality, diverse synthetic dataset for training, thereby enhancing polyp segmentation models to be comparable with real images and offering greater data augmentation capabilities to improve segmentation models. The source code and pretrained weights for Polyp-DDPM are made publicly available at https://github.com/mobaidoctor/polyp-ddpm.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address the issues of data limitations, high annotation costs, and privacy concerns in gastrointestinal (GI) tract polyp detection. Specifically, the authors introduce a diffusion model-based approach—Polyp-DDPM, which is used to generate realistic polyp images conditioned on segmentation masks, aiming to improve the segmentation accuracy of GI tract polyps. Through this method, researchers hope to generate high-quality, diverse synthetic datasets to enhance the training of polyp segmentation models, making their performance close to or even surpassing that of real images, and providing stronger data augmentation capabilities to improve segmentation models. The paper mentions that existing methods, such as those based on Generative Adversarial Networks (GANs), face the issue of mode collapse when generating polyp images, resulting in insufficient diversity and inaccurate details in the generated images. In contrast, the diffusion model-based approach overcomes these issues, generating more diverse and high-quality images. Therefore, Polyp-DDPM not only surpasses existing methods in image quality but also performs better in segmentation performance, particularly achieving significant improvements in metrics such as Intersection over Union (IoU). This provides a new approach to addressing the data scarcity problem in the medical imaging field, helping to improve the accuracy of polyp detection, which is crucial for the prevention and early diagnosis of colorectal cancer.