Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance

Cunzheng Wang,Ziyuan Guo,Yuxuan Duan,Huaxia Li,Nemo Chen,Xu Tang,Yao Hu
2024-09-03
Abstract:Consistency distillation methods have demonstrated significant success in accelerating generative tasks of diffusion models. However, since previous consistency distillation methods use simple and straightforward strategies in selecting target timesteps, they usually struggle with blurs and detail losses in generated images. To address these limitations, we introduce Target-Driven Distillation (TDD), which (1) adopts a delicate selection strategy of target timesteps, increasing the training efficiency; (2) utilizes decoupled guidances during training, making TDD open to post-tuning on guidance scale during inference periods; (3) can be optionally equipped with non-equidistant sampling and x0 clipping, enabling a more flexible and accurate way for image sampling. Experiments verify that TDD achieves state-of-the-art performance in few-step generation, offering a better choice among consistency distillation models.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is the blurring and detail - loss problems existing in the existing consistency distillation methods when generating images. Specifically, because previous consistency distillation methods adopted simple and direct strategies when selecting the target time step, the generated images usually had problems of blurring and detail loss. To solve these problems, the author proposed the **Target - Driven Distillation (TDD)** method, which is mainly improved in the following aspects: 1. **Selection strategy of target time step**: TDD adopts a refined target time step selection strategy, which improves the training efficiency. 2. **Decoupled guidance**: Decoupled guidance is used during the training process, enabling TDD to adjust the guidance scale in the inference stage. 3. **Non - equidistant sampling and x0 cropping**: Optionally introduce non - equidistant sampling and x0 cropping to make image sampling more flexible and accurate. These improvements enable TDD to achieve state - of - the - art performance in the few - step generation task and provide a better choice of consistency distillation models.