Gradient Domain Diffusion Models for Image Synthesis

Yuanhao Gong

2023-09-05

Abstract: Diffusion models are becoming popular in generative image and video synthesis. However, because of the diffusion process, they require a large number of steps to converge. To tackle this issue, we propose performing the diffusion process in the gradient domain, where convergence is faster. There are two reasons. First, thanks to the Poisson equation, the gradient domain is mathematically equivalent to the original image domain, so each diffusion step in the image domain has a unique corresponding gradient domain representation. Second, the gradient domain is much sparser than the image domain. As a result, gradient domain diffusion models converge faster. Several numerical experiments confirm that gradient domain diffusion models are more efficient than the original diffusion models. The proposed method can be applied to a wide range of image processing, computer vision and machine learning tasks.

Computer Vision and Pattern Recognition, Machine Learning, Multimedia, Performance, Image and Video Processing

What problem does this paper attempt to address?

The paper addresses the inefficiency of traditional diffusion models, which need a large number of time steps to converge when generating images. To improve efficiency, it proposes running the diffusion process in the gradient domain instead of the original image domain. The method relies on the Poisson equation, which ties a gradient field back to its image, so nothing is lost by working on gradients. Because the gradient domain is sparser than the image domain, added random noise becomes the dominant component sooner, which accelerates convergence.

The main contributions of the paper are:

1. Proposing a new Gradient Domain Diffusion Model (GDDM).
2. Verifying through numerical experiments that the gradient-domain diffusion model converges more quickly than the traditional image-domain diffusion model.
3. Further optimizing the diffusion process using the Laplacian domain and proposing the Laplacian Domain Diffusion Model (LDDM).
4. Introducing a Poisson network module that recovers images from the gradient or Laplacian fields, enhancing the robustness and practicality of the model.

These improvements make training and sampling more efficient and provide new tools for image processing, computer vision and machine learning tasks.
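The two properties the summary relies on (the gradient field is sparse, and the image can be recovered from it through a Poisson equation) can be checked on a toy example. The following is only a minimal sketch, not the paper's implementation: the synthetic image, the periodic boundary conditions, the FFT-based solver, and the helper names `gradient` and `poisson_recover` are all assumptions made for illustration. In particular, the paper's Poisson network module is a learned component and is not reproduced here.

```python
import numpy as np

def gradient(u):
    """Forward differences with periodic wrap-around (an assumed boundary condition)."""
    gx = np.roll(u, -1, axis=1) - u
    gy = np.roll(u, -1, axis=0) - u
    return gx, gy

def poisson_recover(gx, gy, mean):
    """Solve laplace(u) = div(g) in the Fourier domain.

    The gradient field fixes u only up to an additive constant,
    so the image mean is passed in separately.
    """
    # Backward-difference divergence, the adjoint pairing of the forward gradient.
    div = (gx - np.roll(gx, 1, axis=1)) + (gy - np.roll(gy, 1, axis=0))
    h, w = div.shape
    # Eigenvalues of the periodic 5-point Laplacian.
    ky = 2.0 * np.cos(2.0 * np.pi * np.arange(h) / h) - 2.0
    kx = 2.0 * np.cos(2.0 * np.pi * np.arange(w) / w) - 2.0
    eig = ky[:, None] + kx[None, :]
    eig[0, 0] = 1.0                  # avoid dividing by zero at the DC term
    u_hat = np.fft.fft2(div) / eig
    u_hat[0, 0] = mean * h * w       # restore the constant the gradient cannot see
    return np.real(np.fft.ifft2(u_hat))

# Hypothetical piecewise-constant test image: a bright square on a gray background.
img = np.full((64, 64), 0.2)
img[16:48, 16:48] = 0.8

gx, gy = gradient(img)
recovered = poisson_recover(gx, gy, img.mean())

# Sparsity: fraction of pixels that carry any signal in each domain.
frac_img = np.mean(img != 0)
frac_grad = np.mean((np.abs(gx) + np.abs(gy)) > 0)

# Signal energy: noise of a fixed variance dominates a weaker signal sooner.
energy_img = np.mean(img ** 2)
energy_grad = np.mean(gx ** 2 + gy ** 2)

print("nonzero fraction  image:", frac_img, " gradient:", frac_grad)
print("mean energy       image:", energy_img, " gradient:", energy_grad)
print("max recovery error:", np.max(np.abs(recovered - img)))
```

On this toy image the gradient field is nonzero only along the edges of the square and carries much less energy than the image itself, which is the intuition behind noise taking over in fewer steps. Natural images are less extreme than this example, so the gap would be smaller in practice.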
