Gradient Domain Diffusion Models for Image Synthesis

Yuanhao Gong
2023-09-05
Abstract:Diffusion models are getting popular in generative image and video synthesis. However, due to the diffusion process, they require a large number of steps to converge. To tackle this issue, in this paper, we propose to perform the diffusion process in the gradient domain, where the convergence becomes faster. There are two reasons. First, thanks to the Poisson equation, the gradient domain is mathematically equivalent to the original image domain. Therefore, each diffusion step in the image domain has a unique corresponding gradient domain representation. Second, the gradient domain is much sparser than the image domain. As a result, gradient domain diffusion models converge faster. Several numerical experiments confirm that the gradient domain diffusion models are more efficient than the original diffusion models. The proposed method can be applied in a wide range of applications such as image processing, computer vision and machine learning tasks.
Computer Vision and Pattern Recognition,Machine Learning,Multimedia,Performance,Image and Video Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is that when generating images, traditional diffusion models require a large number of time steps to converge, which leads to inefficiency in the generation process. To improve efficiency, the paper proposes a new method, that is, to carry out the diffusion process in the gradient domain instead of the original image domain. This method utilizes the mathematical properties of the Poisson equation, enabling the diffusion model in the gradient domain to converge more quickly. Specifically, the gradient domain is sparser than the image domain, which means that when adding random noise, the noise will become the dominant part more quickly, thus accelerating the convergence process. The main contributions of the paper include: 1. Proposing a new Gradient Domain Diffusion Model (GDDM). 2. Verifying through numerical experiments that the gradient - domain diffusion model converges more quickly than the traditional image - domain diffusion model. 3. Further optimizing the diffusion process using the Laplacian domain and proposing the Laplacian Domain Diffusion Model (LDDM). 4. Introducing the Poisson network module to recover images from the gradient or Laplacian fields, enhancing the robustness and practicality of the model. These improvements not only improve the efficiency of the training and sampling processes, but also provide new tools and methods for image processing, computer vision and machine learning tasks.