Fast Sampling of Diffusion Models via Operator Learning

Hongkai Zheng, Weili Nie, Arash Vahdat, Kamyar Azizzadenesheli, Anima Anandkumar

2023-07-22

Abstract: Diffusion models have found widespread adoption in various areas. However, their sampling process is slow because it requires hundreds to thousands of network evaluations to emulate a continuous process defined by differential equations. In this work, we use neural operators, an efficient method to solve the probability flow differential equations, to accelerate the sampling process of diffusion models. Compared to other fast sampling methods that have a sequential nature, we are the first to propose a parallel decoding method that generates images with only one model forward pass. We propose diffusion model sampling with neural operator (DSNO) that maps the initial condition, i.e., Gaussian distribution, to the continuous-time solution trajectory of the reverse diffusion process. To model the temporal correlations along the trajectory, we introduce temporal convolution layers that are parameterized in the Fourier space into the given diffusion model backbone. We show our method achieves state-of-the-art FID of 3.78 for CIFAR-10 and 7.83 for ImageNet-64 in the one-model-evaluation setting.

Machine Learning, Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper addresses slow sample generation in diffusion models. Existing diffusion models require hundreds to thousands of network evaluations to simulate the continuous process defined by differential equations, which makes them much slower than other generative models such as Generative Adversarial Networks (GANs). The paper proposes DSNO (diffusion model sampling with neural operator), a neural-operator-based method that accelerates the sampling process. Unlike existing fast sampling methods, which are sequential, DSNO decodes in parallel and generates images in a single model forward pass. The main contributions are:

1. Proposing the DSNO model for fast sampling, which generates high-quality images with only one model evaluation.
2. Introducing a temporal convolution block parameterized in the Fourier space, which integrates easily with existing diffusion model architectures to form the DSNO backbone network and adds only a small number of model parameters (about 10%).
3. Proposing, for the first time, a parallel decoding method that represents the image trajectory as a continuous function of time, so the whole trajectory, including the final sample, is produced in one step.
4. Achieving new state-of-the-art FID scores of 3.78 on CIFAR-10 and 7.83 on ImageNet-64 in the one-model-evaluation setting.

In summary, the paper uses neural operators to solve the probability flow differential equations that govern diffusion model sampling, which significantly improves sampling efficiency.
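To make the idea of a temporal convolution parameterized in the Fourier space more concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: it assumes a spectral convolution in the style of Fourier neural operators, applied to a single scalar feature sampled at a few time steps along the trajectory. The names `dft`, `idft`, `fourier_temporal_conv`, and `weights` are hypothetical, and in a real model the weights would be learned and the operation applied to every channel and pixel of the backbone's feature maps.

```python
import cmath


def dft(x):
    """Naive discrete Fourier transform of a sequence of numbers."""
    n = len(x)
    return [
        sum(x[t] * cmath.exp(-2j * cmath.pi * k * t / n) for t in range(n))
        for k in range(n)
    ]


def idft(X):
    """Naive inverse discrete Fourier transform."""
    n = len(X)
    return [
        sum(X[k] * cmath.exp(2j * cmath.pi * k * t / n) for k in range(n)) / n
        for t in range(n)
    ]


def fourier_temporal_conv(u, weights):
    """Convolve a real trajectory u along time in Fourier space.

    u       : one real value per time step of the trajectory.
    weights : complex weights (assumed learnable) for the lowest
              len(weights) frequency modes; higher modes are dropped.
    """
    n = len(u)
    if len(weights) > n // 2 + 1:
        raise ValueError("at most n // 2 + 1 modes can be kept")
    U = dft(u)
    V = [0j] * n
    for k, w in enumerate(weights):
        V[k] = U[k] * w
        # Mirror onto the negative frequency so the output stays real.
        if k > 0 and 2 * k != n:
            V[n - k] = U[n - k] * w.conjugate()
    return [v.real for v in idft(V)]


# Hypothetical usage: four time steps, keep only the two lowest modes.
trajectory = [1.0, 2.0, 0.0, -1.0]
smoothed = fourier_temporal_conv(trajectory, [1 + 0j, 0.5 + 0j])
```

Because the weights act on frequency modes rather than on a fixed-size window, every output time step depends on every input time step. That is one plausible reading of how such a layer can model correlations along the whole trajectory in a single pass.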

Fast Diffusion Probabilistic Model Sampling through the lens of Backward Error Analysis

Accelerating Parallel Sampling of Diffusion Models

Fast constrained sampling in pre-trained diffusion models

Fast ODE-based Sampling for Diffusion Models in Around 5 Steps

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models

Non-Uniform Diffusion Models

Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training

Diffusion Model Based Posterior Sampling for Noisy Linear Inverse Problems

Improving Efficiency of Diffusion Models via Multi-Stage Framework and Tailored Multi-Decoder Architectures

Accelerating Diffusion Models with One-to-Many Knowledge Distillation

Diffusion with Forward Models: Solving Stochastic Inverse Problems Without Direct Supervision

Flexiffusion: Segment-wise Neural Architecture Search for Flexible Denoising Schedule

Elucidating the Solution Space of Extended Reverse-Time SDE for Diffusion Models

Learning to Discretize Denoising Diffusion ODEs

PFDiff: Training-free Acceleration of Diffusion Models through the Gradient Guidance of Past and Future

Fast Inference in Denoising Diffusion Models via MMD Finetuning

One Step Diffusion via Shortcut Models