Parallel simulation for sampling under isoperimetry and score-based diffusion models

Huanjian Zhou, Masashi Sugiyama
2024-12-10
Abstract: In recent years, there has been a surge of interest in proving discretization bounds for sampling under isoperimetry and for diffusion models. As data size grows, reducing the iteration cost becomes an important goal. Inspired by the great success of the parallel simulation of the initial value problem in scientific computation, we propose parallel Picard methods for sampling tasks. Rigorous theoretical analysis reveals that our algorithm achieves better dependence on dimension $d$ than prior works in iteration complexity (i.e., reduced from $\widetilde{O}(\log^2 d)$ to $\widetilde{O}(\log d)$), which is even optimal for sampling under isoperimetry with specific iteration complexity. Our work highlights the potential advantages of simulation methods in scientific computation for dynamics-based sampling and diffusion models.
Data Structures and Algorithms; Distributed, Parallel, and Cluster Computing; Machine Learning; Numerical Analysis
What problem does this paper attempt to address?
This paper addresses how to sample efficiently, in high-dimensional spaces, from probability distributions that satisfy isoperimetric conditions (such as a log-Sobolev inequality), and how to improve the sampling efficiency of diffusion models. Specifically, it proposes the parallel Picard method to accelerate the iterative procedures behind both tasks, thereby reducing iteration complexity and time cost.

### Problem Background

1. **Sampling Task**:
   - Given a potential function \( f: D \to \mathbb{R} \), the goal is to draw samples from the density \( \pi_f = Z_f^{-1} \exp(-f) \), where \( Z_f := \int_D \exp(-f(x)) \, dx \) is the normalization constant.
   - When \( f \) is strongly convex, \( \pi_f \) is a strongly log-concave distribution; when \( f \) is twice differentiable and \( \nabla^2 f \preceq \beta I \), \( f \) is \( \beta \)-smooth and \( \pi_f \) is \( \beta \)-log-smooth.
2. **Sampling in Diffusion Models**:
   - In score-based generative models (SGMs), consider the forward process \( (x_t)_{t \in [0,T]} \) given by the classical Ornstein-Uhlenbeck (OU) process:
     \[ dx_t = -\frac{1}{2} x_t \, dt + dB_t, \quad x_0 \sim q_0, \quad t \in [0,T] \]
   - The corresponding backward process \( (\vec{x}_t)_{t \in [0,T]} \) satisfies the following stochastic differential equation (SDE):
     \[ d\vec{x}_t = \left( \frac{1}{2} \vec{x}_t + \nabla \log \vec{p}_t(\vec{x}_t) \right) dt + dB_t, \quad \vec{x}_0 \sim p_0 \approx N(0_d, I_d), \quad t \in [0,T] \]

### Main Contributions of the Paper

1. **Faster Parallel Sampling Method**:
   - An efficient parallel algorithm suitable for overdamped Langevin diffusion and SGMs is proposed.
   - For distributions that satisfy the log-Sobolev inequality, the iteration complexity is improved from \( \widetilde{O}(\log^2(d/\epsilon^2)) \) to \( \widetilde{O}(\log(d/\epsilon^2)) \), which matches the recent lower bound.
2. **Application of the Parallel Picard Method**:
   - The parallel Picard method avoids error accumulation by repeatedly performing Picard iterations within each time slice, and it significantly reduces the number of required iterations.
   - For diffusion models, a new discretization scheme is proposed that combines exponential integrators with gradually decreasing time steps to further improve the convergence speed.

### Technological Innovations

- **Parallel Update of Time Slices**:
  - A diagonal-style update strategy is adopted instead of updating all time slices simultaneously or strictly sequentially, which ensures the convergence of the Picard iterations.
- **Controlling Error Accumulation**:
  - By choosing an appropriate number of Picard iterations and an appropriate depth, the accumulation of truncation error and score estimation error is balanced, so that the total error remains bounded.

### Summary

By introducing the parallel Picard method, the paper improves sampling efficiency in high-dimensional spaces, both for distributions that satisfy isoperimetric conditions and for diffusion models. These improvements reduce the iteration complexity and also provide a basis for future research.
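
To make the idea of Picard iterations within one time slice concrete, here is a minimal Python sketch for the overdamped Langevin diffusion \( dX_t = -\nabla f(X_t)\,dt + \sqrt{2}\,dB_t \). It is an illustration under stated assumptions, not the paper's algorithm: the potential \( f(x) = x^2/2 \), the one-dimensional state, the uniform Euler-type grid, and the names `grad_f`, `euler_maruyama`, and `picard_slice` are all hypothetical choices, and the diagonal-style update across several slices is not reproduced.

```python
import math
import random


def grad_f(x):
    # Assumed potential f(x) = x^2 / 2 (standard Gaussian target), so grad f(x) = x.
    return x


def euler_maruyama(x0, h, noise):
    """Sequential baseline: each step needs the previous state, M steps in a row."""
    xs = [x0]
    for dB in noise:
        xs.append(xs[-1] - h * grad_f(xs[-1]) + math.sqrt(2.0) * dB)
    return xs


def picard_slice(x0, h, noise, num_iters):
    """Picard iterations on one time slice with the Brownian increments held fixed."""
    M = len(noise)
    # Cumulative Brownian path B_{t_m} - B_{t_0} on the grid.
    B = [0.0]
    for dB in noise:
        B.append(B[-1] + dB)
    xs = [x0] * (M + 1)  # initial guess: constant path
    for _ in range(num_iters):
        # All gradients use the previous iterate only, so these M
        # evaluations are independent and could run in parallel.
        grads = [grad_f(x) for x in xs[:M]]
        drift = [0.0]
        for g in grads:
            drift.append(drift[-1] - h * g)  # prefix sum of -h * grad
        xs = [x0 + drift[m] + math.sqrt(2.0) * B[m] for m in range(M + 1)]
    return xs


if __name__ == "__main__":
    rng = random.Random(0)
    h, M = 0.05, 8
    noise = [rng.gauss(0.0, math.sqrt(h)) for _ in range(M)]
    reference = euler_maruyama(1.0, h, noise)
    for k in (1, 2, 4, M):
        path = picard_slice(1.0, h, noise, k)
        gap = max(abs(a - b) for a, b in zip(reference, path))
        print(k, gap)
```

With `M` grid points, `M` Picard iterations reproduce the sequential Euler-Maruyama path up to rounding, because each iteration makes at least one more grid point exact. The appeal of the parallel approach is that, on a short enough slice, far fewer iterations may already be accurate, while each iteration's gradient evaluations can run concurrently.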
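
The forward OU process in the problem background has a closed-form Gaussian transition, \( x_t \mid x_0 \sim N(e^{-t/2} x_0, (1 - e^{-t}) I_d) \). This is a standard property of that SDE rather than a result specific to the paper. The sketch below samples \( x_t \) directly from it; the function names and the use of plain Python lists as vectors are illustrative assumptions.

```python
import math
import random


def ou_mean_scale(t):
    # Factor multiplying x_0 in the conditional mean of x_t.
    return math.exp(-0.5 * t)


def ou_variance(t):
    # Per-coordinate conditional variance of x_t given x_0.
    return 1.0 - math.exp(-t)


def sample_forward(x0, t, rng):
    """Draw x_t given x_0 for dx = -x/2 dt + dB, coordinate by coordinate."""
    scale = ou_mean_scale(t)
    std = math.sqrt(ou_variance(t))
    return [scale * xi + std * rng.gauss(0.0, 1.0) for xi in x0]


if __name__ == "__main__":
    rng = random.Random(0)
    print(sample_forward([1.0, -2.0], 0.5, rng))
```

As \( t \) grows, the mean factor decays to zero and the variance approaches one, which is consistent with initializing the backward process from an approximately standard Gaussian.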