Abstract: Discrete diffusion models have emerged as powerful tools for high-quality data generation. Despite their success in discrete spaces, such as text generation tasks, the acceleration of discrete diffusion models remains under-explored. In this paper, we propose discrete non-Markov diffusion models (DNDM), which naturally induce the predetermined transition time set. This enables a training-free sampling algorithm that significantly reduces the number of function evaluations (i.e., calls to the neural network), making the sampling process much faster. Furthermore, we study the transition from finite to infinite step sampling, offering new insights into bridging the gap between discrete and continuous-time processes for discrete diffusion models. Extensive experiments on natural language generation and machine translation tasks demonstrate the superior performance of our method in terms of both generation speed and sample quality compared to existing methods for discrete diffusion models.

What problem does this paper attempt to address?

### Problems the paper attempts to solve

The paper addresses the slow sampling speed of discrete diffusion models (DDMs) when generating high-quality data. Although discrete diffusion models have achieved remarkable success in tasks such as text generation, their acceleration has received relatively little study. The paper proposes the discrete non-Markov diffusion model (DNDM). By introducing a predetermined transition time set, it obtains a fast sampling algorithm that requires no additional training. The method significantly reduces the number of neural network function calls, speeding up sampling while maintaining sample quality.

### Specific problem descriptions

1. **Slow sampling speed**:
   - Discrete diffusion models must call the neural network many times during generation, which makes sampling slow.
   - Although some methods (such as DDIM) have achieved acceleration in continuous spaces, acceleration in discrete spaces has been studied less.
2. **Training dependence**:
   - Existing acceleration methods usually depend on the training process, which limits their flexibility and scope of application.
3. **Gap between discrete and continuous processes**:
   - It is unclear how to move smoothly from a finite number of steps to an infinite number of steps in discrete diffusion models, so as to better connect discrete and continuous-time processes.

### Solutions

1. **DNDM model**:
   - A discrete non-Markov diffusion model (DNDM) is proposed. By introducing a predetermined transition time set, the sampling process can be significantly accelerated.
   - DNDM retains the key properties of the original discrete diffusion model, such as its marginal and conditional distributions.
2. **Accelerated sampling algorithm**:
   - An accelerated sampling algorithm based on the predetermined transition time set is designed, which reduces the required number of neural network function calls.
   - For a standard $T$-step discrete diffusion process, DNDM requires only $|\mathcal{T}|$ function calls, where $|\mathcal{T}|$ is the cardinality of the transition time set $\mathcal{T}$ and is much smaller than $T$.
3. **Infinite-step sampling**:
   - The limiting case as $T$ tends to infinity is explored, and an infinite-step sampling algorithm is introduced to further improve sampling efficiency.

### Experimental results

- **Machine translation tasks**:
  - Experiments were carried out on the IWSLT14, WMT14, and WMT16 datasets. The results show that DNDM outperforms the baseline methods in both sampling speed and sample quality.
  - In particular, as the number of sampling steps increases, the generation time of DNDM increases only slightly, while the generation time of the baseline methods increases linearly.
- **Unconditional text generation**:
  - The experimental results show that DNDM also performs well on unconditional text generation, significantly improving generation quality and speed.

### Summary

By introducing the DNDM model and its accelerated sampling algorithm, the paper addresses the slow sampling speed of discrete diffusion models when generating high-quality data. The method significantly improves sampling efficiency and also offers new insights into bridging discrete and continuous-time processes.
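The saving in function calls described under Solutions can be illustrated with a small Python sketch. It is a toy under stated assumptions, not the authors' implementation: each token is given one transition time drawn uniformly from 1..T (the distribution is an assumption made here for illustration), `dummy_denoiser` is a hypothetical stand-in for the neural network, and all function and variable names are invented for this example.

```python
import random


def sample_transition_times(num_tokens, num_steps, rng):
    # Assumption: one transition time per token, drawn uniformly from 1..num_steps.
    return [rng.randint(1, num_steps) for _ in range(num_tokens)]


def dummy_denoiser(tokens, t):
    # Hypothetical stand-in for the neural network: returns a "clean" token per position.
    return [f"tok{i}" for i in range(len(tokens))]


def sample(num_tokens, num_steps, seed=0):
    rng = random.Random(seed)
    taus = sample_transition_times(num_tokens, num_steps, rng)
    transition_set = set(taus)  # the predetermined transition time set
    tokens = ["<noise>"] * num_tokens
    nfe = 0  # number of function evaluations (network calls)
    for t in range(num_steps, 0, -1):
        if t not in transition_set:
            continue  # no token changes at this step, so the network call is skipped
        prediction = dummy_denoiser(tokens, t)
        nfe += 1
        for n, tau in enumerate(taus):
            if tau == t:
                tokens[n] = prediction[n]  # token n leaves the noise state at its own time
    return tokens, nfe, transition_set


tokens, nfe, transition_set = sample(num_tokens=20, num_steps=1000)
print(f"steps T = 1000, network calls = {nfe}, set size = {len(transition_set)}")
```

In this sketch the number of network calls equals the size of the transition time set, which can never exceed the number of tokens, so raising the step count from 1000 to a much larger value does not raise the call count beyond 20.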

Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

Utilizing Latent Diffusion Model to Accelerate Sampling Speed and Enhance Text Generation Quality

Accelerating Parallel Sampling of Diffusion Models

$\textit{Jump Your Steps}$: Optimizing Sampling Schedule of Discrete Diffusion Models

Fast Diffusion Probabilistic Model Sampling through the lens of Backward Error Analysis

Non-uniform Timestep Sampling: Towards Faster Diffusion Model Training

Towards Faster Non-Asymptotic Convergence for Diffusion-Based Generative Models

Accelerating Convergence of Score-Based Diffusion Models, Provably

Adaptive Non-Uniform Timestep Sampling for Diffusion Model Training

Fast Sampling Through The Reuse Of Attention Maps In Diffusion Models

A Reparameterized Discrete Diffusion Model for Text Generation

Can Diffusion Model Achieve Better Performance in Text Generation? Bridging the Gap Between Training and Inference!

AdaDiff: Adaptive Step Selection for Fast Diffusion Models

One Step Diffusion via Shortcut Models

Simple and Fast Distillation of Diffusion Models

Fast Inference in Denoising Diffusion Models via MMD Finetuning

Fast Sampling of Diffusion Models via Operator Learning

Fast constrained sampling in pre-trained diffusion models

Accelerating Diffusion Models via Early Stop of the Diffusion Process

Masked Diffusion Models are Secretly Time-Agnostic Masked Models and Exploit Inaccurate Categorical Sampling

Diffusion-TS: Interpretable Diffusion for General Time Series Generation