Path-Guided Particle-based Sampling

Mingzhou Fan,Ruida Zhou,Chao Tian,Xiaoning Qian
2024-12-04
Abstract:Particle-based Bayesian inference methods by sampling from a partition-free target (posterior) distribution, e.g., Stein variational gradient descent (SVGD), have attracted significant attention. We propose a path-guided particle-based sampling~(PGPS) method based on a novel Log-weighted Shrinkage (LwS) density path linking an initial distribution to the target distribution. We propose to utilize a Neural network to learn a vector field motivated by the Fokker-Planck equation of the designed density path. Particles, initiated from the initial distribution, evolve according to the ordinary differential equation defined by the vector field. The distribution of these particles is guided along a density path from the initial distribution to the target distribution. The proposed LwS density path allows for an efficient search of modes of the target distribution while canonical methods fail. We theoretically analyze the Wasserstein distance of the distribution of the PGPS-generated samples and the target distribution due to approximation and discretization errors. Practically, the proposed PGPS-LwS method demonstrates higher Bayesian inference accuracy and better calibration ability in experiments conducted on both synthetic and real-world Bayesian learning tasks, compared to baselines, such as SVGD and Langevin dynamics, etc.
Machine Learning,Artificial Intelligence
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the poor performance of sampling methods in Bayesian inference on complex distributions (such as multimodal distributions). Specifically, traditional particle sampling methods based on gradient flow (such as Stein Variational Gradient Descent (SVGD) and Langevin Dynamics (LD)) are prone to mode - missing or misplacement of mode weights when dealing with complex models (such as Bayesian Neural Networks (BNNs)), which affects their generalization ability, uncertainty quantification and calibration ability. To solve these problems, the authors propose the Path - Guided Particle - based Sampling (PGPS). PGPS introduces a continuous path from the initial distribution to the target distribution and uses a neural network to learn a vector field to guide the particles to evolve along this path, thus exploring different modes of the target distribution more effectively. This method not only improves the accuracy of Bayesian inference but also enhances the calibration ability of the model. The following are the main contributions of the paper: 1. Propose PGPS as a new flow - based sampling framework and derive an operable criterion applicable to any differentiable and unnormalized - constant path. 2. Theoretically prove that under the discretization error caused by the approximation error δ and the step size h, the Wasserstein distance between the sample distribution generated by PGPS and the target distribution is bounded by O(δ)+O(√h). 3. Experimental verification shows that PGPS is superior to existing benchmark methods in finding modes more quickly and estimating weights more accurately, and exhibits higher test accuracy and stronger calibration ability in Bayesian inference tasks. These improvements make PGPS have better performance when dealing with complex distributions, especially in the case of multimodal distributions, and can more effectively capture all important modes and avoid the mode - missing problem.