Abstract:Particle-based Bayesian inference methods by sampling from a partition-free target (posterior) distribution, e.g., Stein variational gradient descent (SVGD), have attracted significant attention. We propose a path-guided particle-based sampling~(PGPS) method based on a novel Log-weighted Shrinkage (LwS) density path linking an initial distribution to the target distribution. We propose to utilize a Neural network to learn a vector field motivated by the Fokker-Planck equation of the designed density path. Particles, initiated from the initial distribution, evolve according to the ordinary differential equation defined by the vector field. The distribution of these particles is guided along a density path from the initial distribution to the target distribution. The proposed LwS density path allows for an efficient search of modes of the target distribution while canonical methods fail. We theoretically analyze the Wasserstein distance of the distribution of the PGPS-generated samples and the target distribution due to approximation and discretization errors. Practically, the proposed PGPS-LwS method demonstrates higher Bayesian inference accuracy and better calibration ability in experiments conducted on both synthetic and real-world Bayesian learning tasks, compared to baselines, such as SVGD and Langevin dynamics, etc.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the poor performance of sampling methods in Bayesian inference on complex distributions (such as multimodal distributions). Specifically, traditional particle sampling methods based on gradient flow (such as Stein Variational Gradient Descent (SVGD) and Langevin Dynamics (LD)) are prone to mode - missing or misplacement of mode weights when dealing with complex models (such as Bayesian Neural Networks (BNNs)), which affects their generalization ability, uncertainty quantification and calibration ability. To solve these problems, the authors propose the Path - Guided Particle - based Sampling (PGPS). PGPS introduces a continuous path from the initial distribution to the target distribution and uses a neural network to learn a vector field to guide the particles to evolve along this path, thus exploring different modes of the target distribution more effectively. This method not only improves the accuracy of Bayesian inference but also enhances the calibration ability of the model. The following are the main contributions of the paper: 1. Propose PGPS as a new flow - based sampling framework and derive an operable criterion applicable to any differentiable and unnormalized - constant path. 2. Theoretically prove that under the discretization error caused by the approximation error δ and the step size h, the Wasserstein distance between the sample distribution generated by PGPS and the target distribution is bounded by O(δ)+O(√h). 3. Experimental verification shows that PGPS is superior to existing benchmark methods in finding modes more quickly and estimating weights more accurately, and exhibits higher test accuracy and stronger calibration ability in Bayesian inference tasks. These improvements make PGPS have better performance when dealing with complex distributions, especially in the case of multimodal distributions, and can more effectively capture all important modes and avoid the mode - missing problem.

Path-Guided Particle-based Sampling

A Unified Particle-Optimization Framework for Scalable Bayesian Sampling

A Stochastic Particle Variational Bayesian Inference Inspired Deep-Unfolding Network for Sensing over Wireless Networks

Particle Optimization in Stochastic Gradient MCMC

Electrostatics-based particle sampling and approximate inference

Relative Entropy Gradient Sampler for Unnormalized Distributions

Augmented Message Passing Stein Variational Gradient Descent

Ancestor Sampling for Particle Gibbs

Noise-Free Sampling Algorithms via Regularized Wasserstein Proximals

Stein Boltzmann Sampling: A Variational Approach for Global Optimization

Group Importance Sampling for Particle Filtering and MCMC

Diffusive Gibbs Sampling

Plug-and-Play Split Gibbs Sampler: Embedding Deep Generative Priors in Bayesian Inference

Iterative Construction of Gaussian Process Surrogate Models for Bayesian Inference

Variational Inference via Smoothed Particle Hydrodynamics

Path Guiding for Monte Carlo PDE Solvers

A Langevin-like Sampler for Discrete Distributions

Fused $L_{1/2}$ prior for large scale linear inverse problem with Gibbs bouncy particle sampler

Likelihood Scouting Via Map Inversion For A Posterior-Sampled Particle Filter

Transition Path Sampling with Improved Off-Policy Training of Diffusion Path Samplers

Neural Parametric Mixtures for Path Guiding