Abstract: Blind super-resolution is a key computer vision task that aims to generate high-resolution images with enhanced visual quality from low-resolution inputs affected by unknown degradations. Progress in this area has been driven mainly by self-supervised learning with GANs. Despite their prominence, GAN-based methods suffer from unstable training and limited diversity, and they require carefully configured degradation models to mimic various blur effects and noise types. Recently, denoising diffusion models have shown promising results in image restoration, but their slow sampling impedes deployment in real-time scenarios. This study introduces the Generation Diffusion Degradation (GDD) model, a novel and efficient technique that replicates image degradation by sequentially applying random Gaussian noise via a parametric Markov chain and then progressively reconstructing the initial image with a U-Net-based noise predictor. This method closely mirrors the degradation distribution observed in real degraded images. Furthermore, we present a training strategy that uses a composite loss function to train the GDD model, which stabilizes training, improves the authenticity of the generated degraded images, and accurately reflects the degradation patterns of target images. Extensive experiments demonstrate the superior performance of the proposed GDD model in both objective metrics and subjective visual quality. The code is available at https://github.com/lgylab/GDD.
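The sequential Gaussian noising via a Markov chain and the noise-prediction objective mentioned in the abstract can be illustrated with a minimal sketch. It assumes a standard DDPM-style formulation with a linear variance schedule, which may differ from the actual GDD design; the function names (`linear_beta_schedule`, `forward_diffuse`, `noise_prediction_loss`), the schedule endpoints, and the number of steps are all hypothetical choices for illustration, and the U-Net predictor itself is omitted.

```python
import numpy as np

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=2e-2):
    # Assumed linear variance schedule; the abstract does not specify one.
    return np.linspace(beta_start, beta_end, num_steps)

def forward_diffuse(x0, t, alpha_bar, rng):
    # Closed-form sample from the Markov chain after t steps:
    # x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * eps, eps ~ N(0, I)
    noise = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * noise
    return x_t, noise

def noise_prediction_loss(predicted_noise, true_noise):
    # Mean squared error between predicted and injected noise. This is only
    # one plausible term; the composite loss in the abstract is not detailed.
    return float(np.mean((predicted_noise - true_noise) ** 2))

# Illustrative setup: 1000 steps and a random stand-in for an image in [-1, 1].
betas = linear_beta_schedule(1000)
alpha_bar = np.cumprod(1.0 - betas)  # cumulative product of (1 - beta_t)
rng = np.random.default_rng(0)
x0 = rng.uniform(-1.0, 1.0, size=(3, 32, 32))
x_t, noise = forward_diffuse(x0, 500, alpha_bar, rng)
```

In a full pipeline, a trained noise predictor would take `x_t` and the step index and return an estimate of `noise`, which the loss above would compare against the injected noise.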

LaDiffGAN: Training GANs with Diffusion Supervision in Latent Spaces

Diffusion-GAN: Training GANs with Diffusion

Control3Diff: Learning Controllable 3D Diffusion Models from Single-view Images

Distilling Diffusion Models into Conditional GANs

Latent Denoising Diffusion GAN: Faster sampling, Higher image quality

Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance

SinDiffusion: Learning a Diffusion Model from a Single Natural Image

Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation

DifAugGAN: A Practical Diffusion-style Data Augmentation for GAN-based Single Image Super-resolution

NoiseCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions in Diffusion Models

Differentiable Augmentation for Data-Efficient GAN Training

FSRDiff: A Fast Diffusion-Based Super-Resolution Method Using GAN

Efficient Transfer Learning in Diffusion Models via Adversarial Noise

Generation diffusion degradation: Simple and efficient design for blind super-resolution

Dist-GAN: An Improved GAN using Distance Constraints

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Pixel-Space Post-Training of Latent Diffusion Models

Image Neural Field Diffusion Models

LaCon: Late-Constraint Diffusion for Steerable Guided Image Synthesis

Enhancing Unsupervised Speech Recognition with Diffusion GANs

MCGAN: Enhancing GAN Training with Regression-Based Generator Loss