Faster high-accuracy log-concave sampling via algorithmic warm starts

Jason M. Altschuler,Sinho Chewi
DOI: https://doi.org/10.1145/3653446
IF: 2.269
2024-03-20
Journal of the ACM
Abstract:It is a fundamental problem to understand the complexity of high-accuracy sampling from a strongly log-concave density π on \(\mathbb {R}^d \) . Indeed, in practice, high-accuracy samplers such as the Metropolis-adjusted Langevin algorithm (MALA) remain the de facto gold standard; and in theory, via the proximal sampler reduction, it is understood that such samplers are key for sampling even beyond log-concavity (in particular, for sampling under isoperimetric assumptions). This paper improves the dimension dependence of this sampling problem to \(\widetilde{O}(d^{1/2}) \) . The previous best result for MALA was \(\widetilde{O}(d) \) . This closes the long line of work on the complexity of MALA, and moreover leads to state-of-the-art guarantees for high-accuracy sampling under strong log-concavity and beyond (thanks to the aforementioned reduction). Our starting point is that the complexity of MALA improves to \(\widetilde{O}(d^{1/2}) \) , but only under a warm start (an initialization with constant Rényi divergence w.r.t. π ). Previous algorithms for finding a warm start took O ( d ) time and thus dominated the computational effort of sampling. Our main technical contribution resolves this gap by establishing the first \(\widetilde{O}(d^{1/2}) \) Rényi mixing rates for the discretized underdamped Langevin diffusion. For this, we develop new differential-privacy-inspired techniques based on Rényi divergences with Orlicz–Wasserstein shifts, which allow us to sidestep longstanding challenges for proving fast convergence of hypocoercive differential equations.
computer science, information systems, theory & methods, software engineering, hardware & architecture
What problem does this paper attempt to address?
### Problems the paper attempts to solve The paper aims to solve the problem of efficient sampling from strongly log - concave density, especially in terms of the complexity of high - accuracy sampling. Specifically, the paper focuses on how to improve the computational complexity of high - accuracy sampling from strongly log - concave distributions in high - dimensional spaces and proposes a new algorithmic framework to achieve this goal. #### Main problem descriptions 1. **Complexity of high - accuracy sampling**: - In practice, classical high - accuracy samplers such as the Metropolis - adjusted Langevin algorithm (MALA) are still the de facto gold standard. - Theoretically, through proximal sampler reduction, these samplers are also crucial for sampling beyond log - concavity (for example, under the isoperimetric assumption). - The goal of this paper is to improve the dimension - dependence of MALA from the previous \( \tilde{O}(\kappa) \) to \( \tilde{O}(\kappa^{1/2}) \), thus closing a long - standing research line on the complexity of MALA and providing the latest high - accuracy sampling guarantees. 2. **Warm Start problem**: - Warm Start refers to starting sampling from an initial distribution with a constant Rényi divergence from the target distribution. - Previous methods required \( O(\kappa) \) time to obtain a Warm Start, which dominated the computational cost of sampling. - The main technical contribution of this paper is to establish a \( \tilde{O}(\kappa^{1/2}) \) Rényi mixing rate for the discretized underdamped Langevin diffusion, solving the long - standing challenge of fast convergence in hypocoercive differential equations. 3. **Gap between low - accuracy and high - accuracy sampling**: - Low - accuracy samplers are usually implemented by discretizing random processes such as Langevin diffusion or underdamped Langevin diffusion, but they are biased. - High - accuracy samplers eliminate the bias by attaching a Metropolis - Hastings filter, but this de - biasing method complicates the analysis. - The paper attempts to narrow the complexity gap between these two types of samplers, especially by providing an effective Warm Start method to accelerate high - accuracy sampling. ### Overview of solutions 1. **Algorithmic Warm Start**: - By using the underdamped Langevin Monte Carlo (ULMC) algorithm, the paper achieves a \( \tilde{O}(\kappa^{1/2}) \) Rényi mixing rate, thus solving the Warm Start problem. - This result is applicable not only to strongly log - concave distributions but also to a wider range of non - log - concave distributions that satisfy the standard isoperimetric assumption. 2. **Faster high - accuracy log - concave sampling**: - Combining the Warm Start technique and the improved condition number dependence, the paper advances the state - of - the - art in high - accuracy log - concave sampling. - The proposed algorithm can output samples that meet a given performance metric with a query complexity of \( \tilde{O}(\kappa^{1/2}) \). 3. **High - accuracy sampling beyond log - concavity**: - Using the faster log - concave sampler, the paper improves the per - iteration complexity of the proximal sampler, thus obtaining the latest complexity results for other classes of distributions (such as weakly log - concave, distributions that satisfy the log - Sobolev inequality or the Poincaré inequality). ### Technical difficulties and solutions 1. **Rényi divergence and hypocoercivity**: - Rényi divergence is a strong performance metric. The paper overcomes this by introducing the Orlicz - Wasserstein translation technique.