Abstract: Bayesian sampling is an important task in statistics and machine learning. Over the past decade, many ensemble-type sampling methods have been proposed. In contrast to classical Markov chain Monte Carlo methods, these new methods deploy a large number of interacting samples, and the communication between these samples is crucial in speeding up convergence. To justify the validity of these sampling strategies, the concept of interacting particles naturally calls for mean-field theory. The theory establishes a correspondence between particle interactions encoded in a set of coupled ODEs/SDEs and a PDE that characterizes the evolution of the underlying distribution. This bridges numerical algorithms with the PDE theory used to show convergence in time. Many mathematical tools have been developed to provide the mean-field analysis, and we showcase two such examples: the coupling method and the compactness argument built upon the martingale strategy. The former has been deployed to show the convergence of the ensemble Kalman sampler and ensemble Kalman inversion, and the latter will be shown to be immensely powerful in proving the validity of the Vlasov-Boltzmann simulator.

What problem does this paper attempt to address?

### Problems the paper attempts to solve

This paper examines the validity and theoretical basis of Bayesian sampling methods built on interacting particles. Specifically, it focuses on how multi-particle systems accelerate the convergence of Bayesian sampling, and on justifying these methods through mean-field theory.

### Background and motivation

Bayesian sampling is an important task in statistics and machine learning. Traditional Markov chain Monte Carlo (MCMC) methods are effective, but they converge slowly when the problem dimension is high. In recent years, ensemble-based sampling methods have emerged that accelerate sampling through multiple interacting particles. The validity of these newer methods, however, still requires rigorous mathematical analysis.

### Main research contents

1. **Mean-field limit**:
   - In the mean-field limit, as the number of particles tends to infinity, the evolution of the multi-particle system can be approximated by the evolution of a continuous probability distribution. The paper justifies the ensemble-based methods through mean-field theory.
   - Specifically, the paper presents two main techniques for mean-field analysis: the coupling method and the compactness argument based on the martingale strategy.
2. **Coupling method**:
   - The coupling method designs an auxiliary particle system that bridges the underlying distribution and the ensemble system. It is very effective when the system has good properties, such as Lipschitz conditions.
   - The paper details the application of the coupling method to Ensemble Kalman Inversion (EKI) and the Ensemble Kalman Sampler (EKS).
3. **Compactness argument based on the martingale strategy**:
   - The martingale strategy handles PDE problems by recasting them as martingale problems in the space of path measures. This method suits more complex systems, especially those that do not satisfy a global Lipschitz condition.
   - The paper uses this method to prove the validity of the Vlasov-Boltzmann simulator, because the Boltzmann collision operator is very complex and difficult to handle with the coupling method.

### Main results

1. **Mean-field analysis of EKI**:
   - Through the coupling method, the paper proves that the mean-field limit of EKI holds in the sense of Wasserstein distance and of weak convergence. Specifically, for any \(\epsilon > 0\), there exists a constant \(C_\epsilon(t)\) independent of \(N\) such that:
     \[ E\left( W_2(\rho_N(t, x), \rho(t, x)) \right) \leq C_\epsilon(t) \begin{cases} N^{-1/4}, & d \leq 4 \\ N^{-1/d}, & d > 4 \end{cases} \]
   - For any \(l\)-Lipschitz test function \(g\) and any \(\epsilon > 0\), there exists a constant \(C_\epsilon\) independent of \(N\) such that:
     \[ E\left( \left\| \int g(x) \left[ \rho_N(t, x)\,dx - \rho(t, x)\,dx \right] \right\|^2 \right)^{1/2} \leq C_\epsilon N^{-1/2+\epsilon} \]
2. **Mean-field analysis of the Vlasov-Boltzmann simulator**:
   - Through the martingale strategy, the paper proves that the mean-field limit of the Vlasov-Boltzmann simulator holds in the sense of weak convergence. This method accommodates the more complex Boltzmann collision operator.

### Conclusion

The paper through...
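To make the interacting-particle idea behind EKI more concrete, here is a minimal Python sketch. It is not the paper's implementation. It assumes a linear forward map \(G(u) = Au\), the noise-free (deterministic) continuous-time EKI flow \(\dot u^j = -C^{up}\Gamma^{-1}(G(u^j) - y)\) discretized with explicit Euler, and an identity noise covariance. The names `eki_step`, `mean_misfit`, `A`, `y`, and `gamma_inv` are hypothetical and chosen only for illustration.

```python
import numpy as np

def eki_step(U, A, y, gamma_inv, dt):
    """One explicit-Euler step of a deterministic EKI flow (assumed linear map G(u) = A u)."""
    G = U @ A.T                       # forward-map outputs, shape (N, k)
    dU = U - U.mean(axis=0)           # particles centered at the ensemble mean
    dG = G - G.mean(axis=0)           # outputs centered at their ensemble mean
    C_up = dU.T @ dG / U.shape[0]     # empirical cross-covariance of u and G(u), shape (d, k)
    residual = (G - y) @ gamma_inv    # row j is Gamma^{-1}(G(u_j) - y); gamma_inv assumed symmetric
    # Every particle is moved using the shared ensemble statistic C_up:
    # this is the interaction between particles.
    return U - dt * residual @ C_up.T

def mean_misfit(U, A, y):
    """Average squared data misfit |A u_j - y|^2 over the ensemble."""
    return float(np.mean(np.sum((U @ A.T - y) ** 2, axis=1)))

rng = np.random.default_rng(0)
d, k, N = 2, 2, 200                   # parameter dim, data dim, ensemble size (illustrative)
A = np.array([[1.0, 0.5], [0.0, 2.0]])
u_true = np.array([1.0, -1.0])
y = A @ u_true                        # noise-free synthetic data
gamma_inv = np.eye(k)                 # assumed identity noise precision

U = rng.normal(size=(N, d))           # initial ensemble drawn from a standard normal
misfit_start = mean_misfit(U, A, y)
for _ in range(2000):
    U = eki_step(U, A, y, gamma_inv, dt=0.01)
misfit_end = mean_misfit(U, A, y)
print(misfit_start, misfit_end)
```

Each particle's update depends on the others only through the empirical cross-covariance `C_up`. Replacing that empirical quantity with its value under the limiting distribution, as \(N \to \infty\), is the step a mean-field analysis makes rigorous.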

Bayesian sampling using interacting particles

Sampling from the Mean-Field Stationary Distribution

Metropolis-adjusted interacting particle sampling

Ensemble-Based Annealed Importance Sampling

A Unified Particle-Optimization Framework for Scalable Bayesian Sampling

A comparison of Bayesian sampling algorithms for high-dimensional particle physics and cosmology applications

Electrostatics-based particle sampling and approximate inference

Sampling algorithms in statistical physics: a guide for statistics and machine learning

The Bouncy Particle Sampler: A Nonreversible Rejection-Free Markov Chain Monte Carlo Method

Eigenfunction martingale estimators for interacting particle systems and their mean field limit

Solving stationary nonlinear Fokker-Planck equations via sampling

Single-ensemble multilevel Monte Carlo for discrete interacting-particle methods

Using Autodiff to Estimate Posterior Moments, Marginals and Samples

Sampling from Bayesian Neural Network Posteriors with Symmetric Minibatch Splitting Langevin Dynamics

HJ-sampler: A Bayesian sampler for inverse problems of a stochastic process by leveraging Hamilton-Jacobi PDEs and score-based generative models

On particle Gibbs sampling

Importance Sampling for the Empirical Measure of Weakly Interacting Diffusions

Sampling parameters of ordinary differential equations with Langevin dynamics that satisfy constraints

Dynamical computation of the density of states and Bayes factors using nonequilibrium importance sampling

Enhancing Score-Based Sampling Methods with Ensembles

Accelerate Langevin Sampling with Birth-Death process and Exploration Component