Improved Noise Schedule for Diffusion Training

Tiankai Hang,Shuyang Gu
2024-07-04
Abstract:Diffusion models have emerged as the de facto choice for generating visual signals. However, training a single model to predict noise across various levels poses significant challenges, necessitating numerous iterations and incurring significant computational costs. Various approaches, such as loss weighting strategy design and architectural refinements, have been introduced to expedite convergence. In this study, we propose a novel approach to design the noise schedule for enhancing the training of diffusion models. Our key insight is that the importance sampling of the logarithm of the Signal-to-Noise ratio (logSNR), theoretically equivalent to a modified noise schedule, is particularly beneficial for training efficiency when increasing the sample frequency around $\log \text{SNR}=0$. We empirically demonstrate the superiority of our noise schedule over the standard cosine schedule. Furthermore, we highlight the advantages of our noise schedule design on the ImageNet benchmark, showing that the designed schedule consistently benefits different prediction targets.
Computer Vision and Pattern Recognition,Artificial Intelligence
What problem does this paper attempt to address?
The main problem that this paper attempts to solve is to improve the efficiency of diffusion model training. Specifically, the paper points out that although diffusion models have shown strong capabilities in generating visual signals, training a single model to predict noise at different noise levels still faces significant challenges and requires a large number of iterations and computing resources. To this end, the authors propose a new method to design a noise schedule. By sampling different noise intensities through importance sampling, especially increasing the sample frequency in the region where the log - signal - to - noise ratio (logSNR) is close to 0, the training efficiency is improved. The paper experimentally verifies the superiority of the proposed noise schedule over the standard cosine schedule and shows its advantages on different prediction targets in the ImageNet benchmark test.