Abstract:It is of significant interest in many applications to sample from a high-dimensional target distribution $\pi$ with the density $\pi(\text{d} x) \propto e^{-U(x)} (\text{d} x) $, based on the temporal discretization of the Langevin stochastic differential equations (SDEs). In this paper, we propose an explicit projected Langevin Monte Carlo (PLMC) algorithm with non-convex potential $U$ and super-linear gradient of $U$ and investigate the non-asymptotic analysis of its sampling error in total variation distance. Equipped with time-independent regularity estimates for the associated Kolmogorov equation, we derive the non-asymptotic bounds on the total variation distance between the target distribution of the Langevin SDEs and the law induced by the PLMC scheme with order $\mathcal{O}(d^{\max\{3\gamma/2 , 2\gamma-1 \}} h |\ln h|)$, where $d$ is the dimension of the target distribution and $\gamma \geq 1$ characterizes the growth of the gradient of $U$. In addition, if the gradient of $U$ is globally Lipschitz continuous, an improved convergence order of $\mathcal{O}(d^{3/2} h)$ for the classical Langevin Monte Carlo (LMC) scheme is derived with a refinement of the proof based on Malliavin calculus techniques. To achieve a given precision $\epsilon$, the smallest number of iterations of the PLMC algorithm is proved to be of order ${\mathcal{O}}\big(\tfrac{d^{\max\{3\gamma/2 , 2\gamma-1 \}}}{\epsilon} \ \cdot \ln (\tfrac{d}{\epsilon}) \cdot \ln (\tfrac{1}{\epsilon}) \big)$. In particular, the classical Langevin Monte Carlo (LMC) scheme with the non-convex potential $U$ and the globally Lipschitz gradient of $U$ can be guaranteed by order ${\mathcal{O}}\big(\tfrac{d^{3/2}}{\epsilon} \cdot \ln (\tfrac{1}{\epsilon}) \big)$. Numerical experiments are provided to confirm the theoretical findings.
What problem does this paper attempt to address?
This paper attempts to solve the problem of sampling from high - dimensional target distributions in non - convex and super - linear settings. Specifically, the authors propose an explicit Projected Langevin Monte Carlo (PLMC) algorithm and study its non - asymptotic sampling error under the total variation distance. The main focuses are as follows:
1. **Non - convex potential energy**: Traditional methods usually assume that the potential energy function \(U\) is strongly convex, but in many practical applications, \(U\) does not satisfy this condition. This paper considers the case of non - convex potential energy \(U\).
2. **Super - linear gradient**: When the gradient of the potential energy function \(U\) grows at a super - linear rate, the traditional Langevin Monte Carlo (LMC) algorithm may fail. In this paper, this situation is dealt with by introducing a projection operator \(P\).
3. **Non - asymptotic analysis**: The authors not only focus on the asymptotic behavior of the algorithm but also conduct non - asymptotic analysis, that is, explicitly give the dependence relationship between the error and the algorithm parameters (such as step size \(h\) and dimension \(d\)).
4. **Convergence**: For Langevin SDEs in non - convex and super - linear settings, the authors derive the non - asymptotic convergence bound of the PLMC algorithm under the total variation distance. In particular, when the gradient \(\nabla U\) is globally Lipschitz continuous, the convergence order of the classical LMC algorithm is improved.
5. **Number of iterations**: To achieve a given precision \(\epsilon\), the authors prove the minimum number of iterations required by the PLMC algorithm and give a specific expression.
### Main contributions
1. **Projected Langevin Monte Carlo algorithm**: A Projected Langevin Monte Carlo algorithm that can handle super - linear systems is proposed, which includes the classical Langevin Monte Carlo algorithm.
2. **Non - asymptotic bound**: The non - asymptotic bound between the law of the PLMC algorithm and the target distribution under the total variation distance in the case of non - convex potential energy is established.
3. **Minimum number of iterations**: The minimum number of iterations required for the Projected Langevin Monte Carlo algorithm to approach the target distribution under a given precision is shown.
### Mathematical formulas
- **Langevin SDE**:
\[
dX_t=-\nabla U(X_t)dt + \sqrt{2}dW_t,\quad X_0 = x_0,\quad t>0
\]
where \(W_t\) is a standard Brownian motion.
- **PLMC algorithm**:
\[
Y_{n + 1}=P(Y_n)-\nabla U(P(Y_n))h+\sqrt{2h}\xi_{n + 1},\quad Y_0=x_0
\]
where \(\xi_k\) are independent and identically distributed standard \(d\)-dimensional Gaussian vectors, and \(P\) is a projection operator.
- **Projection operator**:
\[
P(x)=\begin{cases}
\min\left\{1,\frac{\vartheta(d/h)^{1/2\gamma}}{\|x\|}\right\}x, & \text{if }\gamma>1,\\
x, & \text{if }\gamma = 1,\quad\forall x\in\mathbb{R}^d
\end{cases}
\]
where \(\gamma\) characterizes the growth rate of the gradient \(\nabla U\), and \(\vartheta\geq1\) is a constant independent of the step size \(h\) and dimension \(d\).
- **Non - asymptotic bound**:
\[
\|\Pi(Y_N^{x_0})-\pi\|_{\text{TV}}\leq C^\star