Understanding Diffusion Models by Feynman's Path Integral

Yuji Hirono,Akinori Tanaka,Kenji Fukushima
2024-03-18
Abstract:Score-based diffusion models have proven effective in image generation and have gained widespread usage; however, the underlying factors contributing to the performance disparity between stochastic and deterministic (i.e., the probability flow ODEs) sampling schemes remain unclear. We introduce a novel formulation of diffusion models using Feynman's path integral, which is a formulation originally developed for quantum physics. We find this formulation providing comprehensive descriptions of score-based generative models, and demonstrate the derivation of backward stochastic differential equations and loss functions.The formulation accommodates an interpolating parameter connecting stochastic and deterministic sampling schemes, and we identify this parameter as a counterpart of Planck's constant in quantum physics. This analogy enables us to apply the Wentzel-Kramers-Brillouin (WKB) expansion, a well-established technique in quantum physics, for evaluating the negative log-likelihood to assess the performance disparity between stochastic and deterministic sampling schemes.
Machine Learning,Statistical Mechanics,Artificial Intelligence,High Energy Physics - Theory
What problem does this paper attempt to address?
This paper explores a diffusion model based on Feynman path integral, which is a theory in quantum physics that aims to address the ambiguity in performance between random and deterministic sampling schemes in diffusion modeling. The authors propose a new representation method for diffusion models using the Feynman path integral formula, originally developed for quantum physics. Through this approach, they are able to provide a more comprehensive description of fraction-based generative models and derive reverse stochastic differential equations and loss functions. The paper also introduces a parameter, similar to the Planck constant in quantum physics, to connect the two sampling schemes and evaluate the differences in the negative log-likelihood of different sampling schemes using the WKB (Wentzel–Kramers–Brillouin) expansion. The main contributions of the paper include: 1. The re-expression of diffusion models using path integral techniques, which deepens the understanding of the mathematical properties of these models and allows for the application of various techniques from quantum physics. 2. The introduction of an interpolation parameter h that connects random generation (h=1) and probabilistic flow ODE (h=0). The path integral framework reveals the role of h in diffusion models similar to the Planck constant in quantum physics. 3. The application of the WKB expansion method to quantitatively analyze the benefits of noise in the sampling process by conducting a first-order expansion of the negative log-likelihood calculation with respect to h. The paper also discusses the differences between random and deterministic sampling processes and the role of noise in the sampling process, but lacks in-depth quantitative analysis or a rigorous theoretical framework to explain this phenomenon. By drawing analogies with quantum physics, these contributions reveal a deep connection between diffusion models and physics, going beyond the perspectives of classical Brownian motion.