The statistical thermodynamics of generative diffusion models: Phase transitions, symmetry breaking and critical instability

Luca Ambrogioni
2024-06-20
Abstract: Generative diffusion models have achieved spectacular performance in many areas of machine learning and generative modeling. While the fundamental ideas behind these models come from non-equilibrium physics, variational inference and stochastic calculus, in this paper we show that many aspects of these models can be understood using the tools of equilibrium statistical mechanics. Using this reformulation, we show that generative diffusion models undergo second-order phase transitions corresponding to symmetry breaking phenomena. We show that these phase transitions are always in a mean-field universality class, as they are the result of a self-consistency condition in the generative dynamics. We argue that the critical instability that arises from the phase transitions lies at the heart of their generative capabilities, which are characterized by a set of mean-field critical exponents. Finally, we show that the dynamic equation of the generative process can be interpreted as a stochastic adiabatic transformation that minimizes the free energy while keeping the system in thermal equilibrium.
Machine Learning
What problem does this paper attempt to address?
The paper examines the statistical thermodynamics of generative diffusion models, focusing on phase transitions, symmetry breaking, and critical instability. The authors reformulate these models in the language of equilibrium statistical mechanics and show that they undergo second-order phase transitions corresponding to symmetry breaking. These transitions belong to a mean-field universality class because they arise from a self-consistency condition in the generative dynamics. The paper argues that the critical instability arising from these transitions lies at the core of the models' generative capabilities, which are characterized by a set of mean-field critical exponents. The authors also show that the dynamic equation of the generative process can be interpreted as a stochastic adiabatic transformation that minimizes the free energy while keeping the system in thermal equilibrium. Finally, the paper proposes a generative bath model for a multi-site system as an extension of mean-field theory.
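To make the idea of a mean-field self-consistency condition concrete, here is a minimal sketch built on a toy case. The toy case is an assumption of this example, not something stated in the summary: the data are two points at +1 and -1, blurred by Gaussian noise of variance `sigma2`. The noised density is then proportional to exp(-x^2 / (2 sigma2)) * cosh(x / sigma2), and its stationary points satisfy x = tanh(x / sigma2). This has the same form as the Curie-Weiss mean-field equation, with 1 / sigma2 playing the role of inverse temperature. The function names `order_parameter` and `leading_order` are hypothetical.

```python
import math


def order_parameter(sigma2: float, iters: int = 200) -> float:
    """Largest non-negative solution of x = tanh(x / sigma2).

    Toy assumption: two data points at +1 and -1 with Gaussian noise of
    variance sigma2. For sigma2 >= 1 the only solution is x = 0 (symmetric
    phase); for sigma2 < 1 a pair of solutions +x* and -x* appears.
    """
    if sigma2 >= 1.0:
        return 0.0
    # g(x) = tanh(x / sigma2) - x is positive just above 0 and
    # non-positive at x = 1, so bisection brackets the positive root.
    lo, hi = 1e-12, 1.0
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if math.tanh(mid / sigma2) - mid > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def leading_order(sigma2: float) -> float:
    """Small-x expansion of the root: x* ~ sqrt(3 (beta - 1) / beta^3)."""
    beta = 1.0 / sigma2
    return math.sqrt(max(0.0, 3.0 * (beta - 1.0) / beta**3))


if __name__ == "__main__":
    for sigma2 in (2.0, 1.0, 0.99, 0.9, 0.5):
        x_star = order_parameter(sigma2)
        print(f"sigma2={sigma2:4.2f}  x*={x_star:.4f}  "
              f"leading order={leading_order(sigma2):.4f}")
```

In this toy case the nonzero solution appears continuously as `sigma2` drops below 1 and grows like the square root of the distance from the critical point, which is the mean-field value 1/2 for the order-parameter exponent. Near `sigma2 = 1` the bisection result and the leading-order formula should agree closely, and they drift apart further from the transition.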