Abstract:An important challenge and limiting factor in deep learning methods for medical imaging segmentation is the lack of available of annotated data to properly train models. For the specific task of tumor segmentation, the process entails clinicians labeling every slice of volumetric scans for every patient, which becomes prohibitive at the scale of datasets required to train neural networks to optimal performance. To address this, we propose a novel semi-supervised framework that allows training any segmentation (encoder–decoder) model using only information readily available in radiological data, namely the presence of a tumor in the image, in addition to a few annotated images. Specifically, we conjecture that a generative model performing domain translation on this weak label — healthy vs diseased scans — helps achieve tumor segmentation. The proposed GenSeg method first disentangles tumoral tissue from healthy “background” tissue. The latent representation is separated into (1) the common background information across both domains, and (2) the unique tumoral information. GenSeg then achieves diseased-to-healthy image translation by decoding a healthy version of the image from just the common representation, as well as a residual image that allows adding back the tumors. The same decoder that produces this residual tumor image, also outputs a tumor segmentation. Implicit data augmentation is achieved by re-using the same framework for healthy-to-diseased image translation, where a residual tumor image is produced from a prior distribution. By performing both image translation and segmentation simultaneously, GenSeg allows training on only partially annotated datasets. To test the framework, we trained U-Net-like architectures using GenSeg and evaluated their performance on 3 variants of a synthetic task, as well as on 2 benchmark datasets: brain tumor segmentation in MRI (derived from BraTS) and liver metastasis segmentation in CT (derived from LiTS). Our method outperforms the baseline semi-supervised (autoencoder and mean teacher) and supervised segmentation methods, with improvements ranging between 8–14% Dice score on the brain task and 5–8% on the liver task, when only 1% of the training images were annotated. These results show the proposed framework is ideal at addressing the problem of training deep segmentation models when a large portion of the available data is unlabeled and unpaired, a common issue in tumor segmentation.

LC-GAN: Image-to-image Translation Based on Generative Adversarial Network for Endoscopic Images

OA-GAN: Organ-Aware Generative Adversarial Network for Synthesizing Contrast-Enhanced Medical Images

Local Style Preservation in Improved GAN-Driven Synthetic Image Generation for Endoscopic Tool Segmentation

MedGAN: Medical Image Translation using GANs

Guided image generation for improved surgical image segmentation

AV-GAN: Attention-Based Varifocal Generative Adversarial Network for Uneven Medical Image Translation

EnGAN: Enhancement Generative Adversarial Network in Medical Image Segmentation

LAGAN: Lesion-Aware Generative Adversarial Networks for Edema Area Segmentation in SD-OCT Images.

SegAN: Adversarial Network with Multi-scale $L_1$ Loss for Medical Image Segmentation

Instance Segmentation of Unlabeled Modalities via Cyclic Segmentation GAN

ISGAN: Unsupervised Domain Adaptation with Improved Symmetric GAN for Cross-Modality Multi-organ Segmentation

GAN Augmentation: Augmenting Training Data using Generative Adversarial Networks

Accurate Colorectal Tumor Segmentation for CT Scans Based on the Label Assignment Generative Adversarial Network

SA-GAN: Structure-Aware GAN for Organ-Preserving Synthetic CT Generation

Towards cross-modal organ translation and segmentation: A cycle- and shape-consistent generative adversarial network

Applying Conditional Generative Adversarial Networks for Imaging Diagnosis

Towards annotation-efficient segmentation via image-to-image translation

Lesion Segmentation in Gastroscopic Images Using Generative Adversarial Networks.

Lesion-aware generative adversarial networks for color fundus image to fundus fluorescein angiography translation

LGAN: Lung segmentation in CT scans using generative adversarial network

SurgicaL-CD: Generating Surgical Images via Unpaired Image Translation with Latent Consistency Diffusion Models