Fast Sampling via Discrete Non-Markov Diffusion Models with Predetermined Transition Time

Zixiang Chen, Huizhuo Yuan, Yongqian Li, Yiwen Kou, Junkai Zhang, Quanquan Gu
2024-12-06
Abstract: Discrete diffusion models have emerged as powerful tools for high-quality data generation. Despite their success in discrete spaces, such as text generation tasks, the acceleration of discrete diffusion models remains under-explored. In this paper, we propose discrete non-Markov diffusion models (DNDM), which naturally induce the predetermined transition time set. This enables a training-free sampling algorithm that significantly reduces the number of function evaluations (i.e., calls to the neural network), making the sampling process much faster. Furthermore, we study the transition from finite to infinite step sampling, offering new insights into bridging the gap between discrete and continuous-time processes for discrete diffusion models. Extensive experiments on natural language generation and machine translation tasks demonstrate the superior performance of our method in terms of both generation speed and sample quality compared to existing methods for discrete diffusion models.
Machine Learning, Artificial Intelligence
What problem does this paper attempt to address?
### Problems the paper attempts to solve

The paper addresses the slow sampling speed of discrete diffusion models (DDMs) when generating high-quality data. Although discrete diffusion models have achieved remarkable success in tasks such as text generation, their acceleration has received relatively little study. The paper proposes the Discrete Non-Markov Diffusion Model (DNDM). By introducing a predetermined transition time set, it obtains a fast sampling algorithm that requires no additional training. The method significantly reduces the number of neural network function calls, speeding up sampling while maintaining sample quality.

### Specific problem descriptions

1. **Slow sampling speed**:
   - Discrete diffusion models call the neural network many times during generation, which makes sampling slow.
   - Some methods (such as DDIM) accelerate sampling in continuous spaces, but acceleration in discrete spaces has been studied far less.
2. **Training dependence**:
   - Existing acceleration methods usually depend on the training process, which limits their flexibility and scope of application.
3. **Gap between discrete and continuous processes**:
   - It is unclear how to move smoothly from a finite number of steps to an infinite number of steps in discrete diffusion models, so as to better connect discrete-time and continuous-time processes.

### Solutions

1. **DNDM model**:
   - A Discrete Non-Markov Diffusion Model (DNDM) is proposed. Its predetermined transition time set allows the sampling process to be significantly accelerated.
   - DNDM retains key properties of the original discrete diffusion model, such as its marginal and conditional distributions.
2. **Accelerated sampling algorithm**:
   - An accelerated sampling algorithm based on the predetermined transition time set is designed, which reduces the required number of neural network function calls.
   - For a standard T-step discrete diffusion process, DNDM requires only |T| function calls, where |T| here denotes the cardinality of the transition time set (not the step count T) and is much smaller than T.
3. **Infinite-step sampling**:
   - The limiting case as T tends to infinity is explored, and an infinite-step sampling algorithm is introduced to further improve sampling efficiency.

### Experimental results

- **Machine translation tasks**:
  - Experiments were carried out on the IWSLT14, WMT14 and WMT16 datasets. The results show that DNDM outperforms the baseline methods in both sampling speed and sample quality.
  - In particular, as the number of sampling steps increases, the generation time of DNDM increases only slightly, while the generation time of the baseline methods increases linearly.
- **Unconditional text generation**:
  - DNDM also performs well in unconditional text generation, significantly improving both generation quality and speed.

### Summary

By introducing the DNDM model and its accelerated sampling algorithm, the paper addresses the sampling speed problem of discrete diffusion models when generating high-quality data. The method significantly improves sampling efficiency and also provides new insights into bridging discrete-time and continuous-time processes.
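
The claim that sampling needs only as many network calls as there are distinct transition times can be illustrated with a small Python sketch. It is not the authors' implementation. It assumes each token is assigned one transition time drawn uniformly from the T steps (the real distribution would depend on the noise schedule), and `toy_denoiser`, `sample_transition_times`, and `accelerated_sample` are hypothetical names introduced here for illustration only.

```python
import random


def sample_transition_times(num_tokens, num_steps, rng):
    """Draw one transition time in {1, ..., num_steps} for each token.

    Assumption: uniform draws stand in for whatever distribution the
    noise schedule would actually induce.
    """
    return [rng.randint(1, num_steps) for _ in range(num_tokens)]


def toy_denoiser(tokens, t):
    """Hypothetical stand-in for the neural network.

    Returns a placeholder 'clean' prediction for every position.
    """
    return [f"tok{i}" for i in range(len(tokens))]


def accelerated_sample(num_tokens, num_steps, seed=0):
    """Reverse sampling that only visits the distinct transition times."""
    rng = random.Random(seed)
    taus = sample_transition_times(num_tokens, num_steps, rng)

    # The predetermined transition time set, visited from late to early.
    transition_set = sorted(set(taus), reverse=True)

    tokens = ["<noise>"] * num_tokens
    calls = 0
    for t in transition_set:
        prediction = toy_denoiser(tokens, t)  # one call per distinct time
        calls += 1
        # Only tokens whose transition time equals t change at this step.
        for i, tau in enumerate(taus):
            if tau == t:
                tokens[i] = prediction[i]
    return tokens, calls, transition_set


if __name__ == "__main__":
    tokens, calls, transition_set = accelerated_sample(num_tokens=20, num_steps=1000)
    print(f"network calls: {calls} (a step-by-step sampler would use 1000)")
```

Under these assumptions the number of calls can never exceed either the number of tokens or the number of steps, which is consistent with the summary's observation that generation time grows only slightly as the step count increases.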