SDXL-Lightning: Progressive Adversarial Diffusion Distillation

Shanchuan Lin,Anran Wang,Xiao Yang
2024-02-22
Abstract:We propose a diffusion distillation method that achieves new state-of-the-art in one-step/few-step 1024px text-to-image generation based on SDXL. Our method combines progressive and adversarial distillation to achieve a balance between quality and mode coverage. In this paper, we discuss the theoretical analysis, discriminator design, model formulation, and training techniques. We open-source our distilled SDXL-Lightning models both as LoRA and full UNet weights.
Artificial Intelligence,Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### Problems the Paper Aims to Solve This paper aims to address the issue of diffusion models requiring a large number of inference steps to generate high-quality images. Specifically: 1. **Background and Challenges**: - Diffusion models have achieved significant results in tasks such as text-to-image generation, but their iterative generation process is relatively slow and computationally expensive. - Existing methods can reduce the number of inference steps, but still require more than 20 steps to generate high-quality images. 2. **Research Objectives**: - Propose a new distillation method (SDXL-Lightning) that can achieve one-step or multi-step generation of high-quality images at 1024-pixel resolution. - Combine the advantages of progressive distillation and adversarial distillation to balance image quality and mode coverage. - Open-source the model, supporting both LoRA and full UNet weight formats. Through these improvements, the paper aims to reduce the number of inference steps while maintaining or enhancing the quality of generated images, and ensure the model is compatible with existing LoRA modules and control plugins.