Bayesian sampling using interacting particles

Shi Chen,Zhiyan Ding,Qin Li
2024-05-13
Abstract:Bayesian sampling is an important task in statistics and machine learning. Over the past decade, many ensemble-type sampling methods have been proposed. In contrast to the classical Markov chain Monte Carlo methods, these new methods deploy a large number of interactive samples, and the communication between these samples is crucial in speeding up the convergence. To justify the validity of these sampling strategies, the concept of interacting particles naturally calls for the mean-field theory. The theory establishes a correspondence between particle interactions encoded in a set of coupled ODEs/SDEs and a PDE that characterizes the evolution of the underlying distribution. This bridges numerical algorithms with the PDE theory used to show convergence in time. Many mathematical machineries are developed to provide the mean-field analysis, and we showcase two such examples: The coupling method and the compactness argument built upon the martingale strategy. The former has been deployed to show the convergence of ensemble Kalman sampler and ensemble Kalman inversion, and the latter will be shown to be immensely powerful in proving the validity of the Vlasov-Boltzmann simulator.
Numerical Analysis,Analysis of PDEs
What problem does this paper attempt to address?
### Problems the paper attempts to solve This paper aims to explore the effectiveness and theoretical basis of Bayesian sampling methods based on interacting particles. Specifically, the paper focuses on how to accelerate the convergence rate of Bayesian sampling through multi - particle systems and prove the rationality of these methods through mean - field theory. ### Background and motivation Bayesian sampling is an important task in statistics and machine learning. Although the traditional Markov Chain Monte Carlo (MCMC) method is effective, when the problem dimension is high, the convergence rate is slow. In recent years, some ensemble - based sampling methods have emerged. These methods accelerate the sampling process through multiple interacting particles. However, the effectiveness of these new methods needs strict mathematical analysis to verify. ### Main research contents 1. **Mean - field Limit**: - The mean - field limit means that when the number of particles tends to infinity, the evolution of the multi - particle system can be approximated as the evolution of a continuous probability distribution. The paper proves the effectiveness of these ensemble - based methods through mean - field theory. - Specifically, the paper shows two main mean - field analysis methods: the coupling method and the compactness argument based on martingale strategies. 2. **Coupling method**: - The coupling method designs an auxiliary particle system so that the system can bridge the underlying distribution and the ensemble system. When the system has good properties (such as Lipschitz conditions), the coupling method is very effective. - The paper details the application of the coupling method in Ensemble Kalman Inversion (EKI) and Ensemble Kalman Sampler (EKS). 3. **Compactness argument based on martingale strategies**: - The martingale strategy deals with PDE problems by transforming them into martingale problems in path measure space. This method is suitable for more complex systems, especially those that do not satisfy the global Lipschitz condition. - The paper uses this method to prove the effectiveness of the Vlasov - Boltzmann simulator because the Boltzmann collision operator is very complex and difficult to handle by the coupling method. ### Main results 1. **Mean - field analysis of EKI**: - Through the coupling method, the paper proves that the mean - field limit of EKI holds in the sense of Wasserstein distance and weak convergence. Specifically, for any \(\epsilon>0\), there exists a constant \(C_\epsilon(t)\) independent of \(N\) such that: \[ E(W_2(\rho_N(t, x), \rho(t, x))) \leq C_\epsilon(t) \begin{cases} N^{-1/4}, & d \leq 4 \\ N^{-1/d}, & d > 4 \end{cases} \] - For any \(l\)-Lipschitz test function \(g\), for any \(\epsilon > 0\), there exists a constant \(C_\epsilon\) independent of \(N\) such that: \[ E\left( \left\| \int g(x) [\rho_N(t, x)dx - \rho(x, t)dx] \right\|^2 \right)^{1/2} \leq C_\epsilon N^{-1/2+\epsilon} \] 2. **Mean - field analysis of Vlasov - Boltzmann simulator**: - Through the martingale strategy, the paper proves that the mean - field limit of the Vlasov - Boltzmann simulator holds in the sense of weak convergence. This method is suitable for the more complex Boltzmann collision operator. ### Conclusion The paper through...