Abstract: As a pragmatic data augmentation tool, data synthesis has generally returned dividends in performance for deep learning-based medical image analysis. However, generating corresponding segmentation masks for synthetic medical images is laborious and subjective. To obtain paired synthetic medical images and segmentations, conditional generative models that use segmentation masks as synthesis conditions were proposed. However, these segmentation mask-conditioned generative models still relied on large, varied, and labeled training datasets, and they could only provide limited constraints on human anatomical structures, leading to unrealistic image features. Moreover, the invariant pixel-level conditions could reduce the variety of synthetic lesions and thus reduce the efficacy of data augmentation. To address these issues, we propose a novel strategy for medical image synthesis, namely Unsupervised Mask (UM)-guided synthesis, to obtain both synthetic images and segmentations using limited manual segmentation labels. We first develop a superpixel-based algorithm to generate unsupervised structural guidance and then design a conditional generative model to synthesize images and annotations simultaneously from those unsupervised masks in a semi-supervised multi-task setting. In addition, we devise a multi-scale multi-task Fréchet Inception Distance (MM-FID) and a multi-scale multi-task standard deviation (MM-STD) to evaluate both the fidelity and the variety of synthetic CT images. With multiple analyses on different scales, we could produce stable image quality measurements with high reproducibility. Compared with segmentation mask-guided synthesis, our UM-guided synthesis provided high-quality synthetic images with significantly higher fidelity, variety, and utility ($p<0.05$ by Wilcoxon signed-rank test).
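
For readers unfamiliar with the Fréchet distance that underlies FID-style metrics, the following is a minimal sketch of the standard formula, $\lVert\mu_a-\mu_b\rVert^2 + \mathrm{Tr}(\Sigma_a + \Sigma_b - 2(\Sigma_a\Sigma_b)^{1/2})$, applied to two sets of pre-extracted feature vectors. It is not the MM-FID proposed in this work, whose multi-scale and multi-task aggregation is not specified in the abstract. The function name `frechet_distance` and the random features are illustrative assumptions; in practice the features would come from a pretrained network.

```python
import numpy as np

def frechet_distance(feats_a, feats_b):
    """Fréchet distance between Gaussians fitted to two feature sets.

    Each input is an array of shape (n_samples, n_features).
    This is the standard FID formula, not the paper's MM-FID.
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.atleast_2d(np.cov(feats_a, rowvar=False))
    cov_b = np.atleast_2d(np.cov(feats_b, rowvar=False))
    # Tr(sqrt(cov_a @ cov_b)) equals the sum of the square roots of the
    # eigenvalues of the product, which are real and non-negative for
    # positive semi-definite covariances (clipped to absorb round-off).
    eigvals = np.linalg.eigvals(cov_a @ cov_b)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    diff = mu_a - mu_b
    return float(diff @ diff + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)

# Hypothetical stand-in features: 200 samples, 4 dimensions.
rng = np.random.default_rng(0)
real_feats = rng.normal(size=(200, 4))
shifted_feats = real_feats + 3.0  # same covariance, mean shifted by 3 per dimension

same_score = frechet_distance(real_feats, real_feats)
shifted_score = frechet_distance(real_feats, shifted_feats)
```

A lower score indicates that the two feature distributions are closer. Shifting every feature by a constant leaves the covariance unchanged, so only the squared mean difference contributes to `shifted_score`.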

Data Augmentation in Class-Conditional Diffusion Model for Semi-Supervised Medical Image Segmentation

Boosting Unsupervised Contrastive Learning Using Diffusion-Based Data Augmentation from Scratch

Synthetic Augmentation with Large-scale Unconditional Pre-training

Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts

Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models

Using diffusion models to generate synthetic labeled data for medical image segmentation

Automatic Data Augmentation for 3D Medical Image Segmentation

Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models

Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation

Diffusion-based Data Augmentation for Skin Disease Classification: Impact Across Original Medical Datasets to Fully Synthetic Images

Semi-supervised Task-driven Data Augmentation for Medical Image Segmentation

Less is More: Unsupervised Mask-guided Annotated CT Image Synthesis with Minimum Manual Segmentations

Data Augmentation Using Learned Transformations for One-Shot Medical Image Segmentation

Semi-supervised medical image classification via increasing prediction diversity

SAG-GAN: Semi-Supervised Attention-Guided GANs for Data Augmentation on Medical Images

Conditional Diffusion Models for Weakly Supervised Medical Image Segmentation

Mixing Data Augmentation with Preserving Foreground Regions in Medical Image Segmentation

Transformation Consistent Self-ensembling Model for Semi-supervised Medical Image Segmentation

Multi-Level Global Context Cross Consistency Model for Semi-Supervised Ultrasound Image Segmentation with Diffusion Model

Transformation-Consistent Self-Ensembling Model for Semisupervised Medical Image Segmentation

A data augmentation approach that ensures the reliability of foregrounds in medical image segmentation