Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks
José Ángel Díaz-Francés,José David Fernández-Rodríguez,Karl Thurnhofer-Hemsi,Ezequiel López-Rubio
DOI: https://doi.org/10.1142/s0129065724500576
IF: 6.325
2024-08-20
International Journal of Neural Systems
Abstract:International Journal of Neural Systems, Ahead of Print. Typically, deep learning models for image segmentation tasks are trained using large datasets of images annotated at the pixel level, which can be expensive and highly time-consuming. A way to reduce the amount of annotated images required for training is to adopt a semi-supervised approach. In this regard, generative deep learning models, concretely Generative Adversarial Networks (GANs), have been adapted to semi-supervised training of segmentation tasks. This work proposes MaskGDM, a deep learning architecture combining some ideas from EditGAN, a GAN that jointly models images and their segmentations, together with a generative diffusion model. With careful integration, we find that using a generative diffusion model can improve EditGAN performance results in multiple segmentation datasets, both multi-class and with binary labels. According to the quantitative results obtained, the proposed model improves multi-class image segmentation when compared to the EditGAN and DatasetGAN models, respectively, by [math] and [math]. Moreover, using the ISIC dataset, our proposal improves the results from other models by up to [math] for the binary image segmentation approach.
computer science, artificial intelligence