Constrained Diffusion Implicit Models

Vivek Jayaram, Ira Kemelmacher-Shlizerman, Steven M. Seitz, John Thickstun
2024-11-01
Abstract: This paper describes an efficient algorithm for solving noisy linear inverse problems using pretrained diffusion models. Extending the paradigm of denoising diffusion implicit models (DDIM), we propose constrained diffusion implicit models (CDIM) that modify the diffusion updates to enforce a constraint upon the final output. For noiseless inverse problems, CDIM exactly satisfies the constraints; in the noisy case, we generalize CDIM to satisfy an exact constraint on the residual distribution of the noise. Experiments across a variety of tasks and metrics show strong performance of CDIM, with analogous inference acceleration to unconstrained DDIM: 10 to 50 times faster than previous conditional diffusion methods. We demonstrate the versatility of our approach on many problems including super-resolution, denoising, inpainting, deblurring, and 3D point cloud reconstruction.
Machine Learning, Image and Video Processing
What problem does this paper attempt to address?
### Problems Addressed by the Paper

This paper proposes an efficient algorithm that uses pretrained diffusion models to solve noisy linear inverse problems. It extends the paradigm of denoising diffusion implicit models (DDIM) and introduces constrained diffusion implicit models (CDIM), which modify the diffusion updates to enforce a constraint on the final output.

### Main Contributions

1. **Accelerated Inference**: CDIM reduces both the number of model evaluations and the actual runtime. It is 10 to 50 times faster than previous conditional diffusion methods while maintaining comparable quality.
2. **Exact Recovery with Noiseless Observations**: In the absence of noise, CDIM finds solutions that exactly match the observations.
3. **General Noise Model**: By minimizing a distributional divergence, the CDIM framework can handle arbitrary observation noise distributions, including non-Gaussian noise such as Poisson noise.

### Method Overview

- **Optimize \( \hat{x}_0 \) to Match Observations**: For a linear measurement operator \( A \) and observation \( y \), the DDIM update steps are modified so that the posterior mean estimate \( \hat{x}_0 \) at each step satisfies \( A\hat{x}_0 = y \).
- **Optimize Residual KL Divergence**: For noisy inverse problems, the KL divergence between the empirical distribution of the observation residuals and the known noise distribution is minimized, which avoids overfitting to the noise.
- **Early Stopping**: The optimization is stopped early to prevent overfitting to noisy observations, which also reduces inference cost.
- **Step Size Selection**: The step size is chosen based on the expected gradient over the training data, improving optimization stability and convergence speed.

### Experimental Results

The paper conducts experiments on multiple tasks and datasets, including super-resolution, denoising, inpainting, deblurring, and 3D point cloud reconstruction. The results show that CDIM achieves quality comparable to existing methods at a much lower inference cost, and that it performs well under different noise models.

### Application Examples

- **Time Travel Rephotography**: CDIM is used to convert old photographs into modern images, correcting artifacts such as exaggerated wrinkles caused by the insufficient red sensitivity of the original photographs.
- **Sparse Point Cloud Reconstruction**: By filling in the missing pixels in projections of a sparse point cloud, CDIM generates high-quality 3D views.

### Conclusion

CDIM solves noisy linear inverse problems by exploiting the structure of pretrained diffusion models, generating high-quality images while significantly improving inference speed. The method works well with linear constraints but has limitations when dealing with nonlinear constraints.
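
To make the two constraints from the Method Overview concrete, here is a minimal pure-Python sketch. It is an illustration under stated assumptions, not the authors' implementation. The function names (`enforce_constraint`, `residual_kl_to_gaussian`), the toy mask `A`, and the step size are hypothetical. The sketch applies gradient descent directly to an estimate \( \hat{x}_0 \), whereas the summary describes modifying the DDIM updates themselves, and it assumes Gaussian noise with the residuals summarized by their empirical mean and variance.

```python
import math


def matvec(A, x):
    """Compute A x for a matrix stored as a list of rows."""
    return [sum(a * v for a, v in zip(row, x)) for row in A]


def rmatvec(A, r):
    """Compute A^T r for a matrix stored as a list of rows."""
    return [sum(A[i][j] * r[i] for i in range(len(A))) for j in range(len(A[0]))]


def enforce_constraint(A, y, x0, step, iters=500, tol=1e-10):
    """Gradient descent on 0.5 * ||A x - y||^2, starting from x0.

    Stand-in for the noiseless constraint A x0_hat = y. The step size is
    supplied by the caller (assumption: no automatic selection here).
    """
    x = list(x0)
    for _ in range(iters):
        residual = [p - q for p, q in zip(matvec(A, x), y)]
        if sum(v * v for v in residual) < tol:
            break  # constraint satisfied to tolerance
        grad = rmatvec(A, residual)  # gradient is A^T (A x - y)
        x = [xi - step * gi for xi, gi in zip(x, grad)]
    return x


def residual_kl_to_gaussian(A, y, x, sigma):
    """KL( N(mu, s^2) || N(0, sigma^2) ) for the residuals y - A x.

    mu and s^2 are the empirical mean and variance of the residuals
    (assumption: a Gaussian summary of the residual distribution).
    """
    residual = [q - p for p, q in zip(matvec(A, x), y)]
    mu = sum(residual) / len(residual)
    var = sum((v - mu) ** 2 for v in residual) / len(residual)
    var = max(var, 1e-12)  # guard against log(0) when residuals are constant
    return math.log(sigma) - 0.5 * math.log(var) + (var + mu * mu) / (2 * sigma**2) - 0.5


if __name__ == "__main__":
    # Toy inpainting: observe coordinates 0 and 2 of a 4-vector.
    A = [[1.0, 0.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]]
    y = [1.0, -1.0]
    x = enforce_constraint(A, y, [0.1, 0.2, 0.3, 0.4], step=1.0)
    print("x:", x, "A x:", matvec(A, x))
    print("residual KL:", residual_kl_to_gaussian(A, [1.5, -0.5], x, sigma=0.5))
```

In the noiseless case the loop drives \( A\hat{x}_0 \) to \( y \) and leaves unobserved coordinates untouched. In the noisy case, the KL value is zero when the residuals have mean 0 and variance \( \sigma^2 \), so driving it down matches the residual statistics to the noise model rather than forcing the residuals to zero.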