Accelerating Markov Chain Monte Carlo sampling with diffusion models

N. T. Hunt-Smith, W. Melnitchouk, F. Ringer, N. Sato, A. W. Thomas, M. J. White
2023-09-04
Abstract: Global fits of physics models require efficient methods for exploring high-dimensional and/or multimodal posterior functions. We introduce a novel method for accelerating Markov Chain Monte Carlo (MCMC) sampling by pairing a Metropolis-Hastings algorithm with a diffusion model that can draw global samples with the aim of approximating the posterior. We briefly review diffusion models in the context of image synthesis before providing a streamlined diffusion model tailored towards low-dimensional data arrays. We then present our adapted Metropolis-Hastings algorithm which combines local proposals with global proposals taken from a diffusion model that is regularly trained on the samples produced during the MCMC run. Our approach leads to a significant reduction in the number of likelihood evaluations required to obtain an accurate representation of the Bayesian posterior across several analytic functions, as well as for a physical example based on a global analysis of parton distribution functions. Our method is extensible to other MCMC techniques, and we briefly compare our method to similar approaches based on normalizing flows. A code implementation can be found at https://github.com/NickHunt-Smith/MCMC-diffusion.
High Energy Physics - Phenomenology, Machine Learning
What problem does this paper attempt to address?
This paper addresses the inefficiency of Markov Chain Monte Carlo (MCMC) sampling in global fits that involve high-dimensional and/or multimodal posterior functions. The authors accelerate sampling by pairing the Metropolis-Hastings algorithm with a diffusion model. The goal is to reduce the number of likelihood evaluations needed to obtain an accurate representation of the Bayesian posterior, which makes exploring large parameter spaces cheaper. The paper pays particular attention to multimodal posteriors, a known weakness of traditional MCMC: a chain driven only by local proposals tends to settle in one mode and rarely jumps to the others. Using the diffusion model, which is regularly retrained on the samples produced during the run, as a source of global proposals lets the chain move between modes and significantly improves sampling efficiency.
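To make the local-versus-global distinction concrete, here is a minimal sketch of a Metropolis-Hastings chain that mixes the two proposal types on a one-dimensional bimodal target. It is not the authors' implementation. In particular, the diffusion model is replaced by a fixed, broad Gaussian whose density is known in closed form, and no retraining takes place. The target, the names `run_chain`, `log_target`, `log_global_density`, and `fraction_right`, and all numerical settings are assumptions chosen for illustration only.

```python
import math
import random

GLOBAL_SIGMA = 5.0  # width of the stand-in global proposal (assumed value)


def log_target(x):
    """Unnormalized log-density of an equal mixture of N(-4, 0.5^2) and N(4, 0.5^2)."""
    a = -0.5 * ((x + 4.0) / 0.5) ** 2
    b = -0.5 * ((x - 4.0) / 0.5) ** 2
    m = max(a, b)  # log-sum-exp trick for numerical stability
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def log_global_density(x):
    """Unnormalized log-density of the stand-in global proposal N(0, GLOBAL_SIGMA^2)."""
    return -0.5 * (x / GLOBAL_SIGMA) ** 2


def run_chain(n_steps, p_global, seed=0, x0=-4.0, local_step=0.5):
    """Metropolis-Hastings chain mixing local random-walk and global proposals."""
    rng = random.Random(seed)
    x, lp = x0, log_target(x0)
    samples = []
    for _ in range(n_steps):
        if rng.random() < p_global:
            # Global (independence) proposal: the Hastings ratio must include
            # the proposal density because the proposal is not symmetric.
            x_new = rng.gauss(0.0, GLOBAL_SIGMA)
            lp_new = log_target(x_new)
            log_alpha = (lp_new - lp) + (log_global_density(x) - log_global_density(x_new))
        else:
            # Local symmetric random-walk proposal: proposal densities cancel.
            x_new = x + rng.gauss(0.0, local_step)
            lp_new = log_target(x_new)
            log_alpha = lp_new - lp
        # 1 - random() lies in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < log_alpha:
            x, lp = x_new, lp_new
        samples.append(x)
    return samples


def fraction_right(samples):
    """Fraction of samples in the right-hand mode (x > 0)."""
    return sum(1 for s in samples if s > 0) / len(samples)


if __name__ == "__main__":
    local_only = run_chain(20000, p_global=0.0)
    mixed = run_chain(20000, p_global=0.2)
    print("local only, fraction in right mode:", fraction_right(local_only))
    print("mixed,      fraction in right mode:", fraction_right(mixed))
```

Both modes carry equal weight, so a well-mixed chain should spend roughly half its time on each side. A chain started in the left mode with `p_global=0.0` is expected to stay there, while the mixed chain can cross the low-probability region through accepted global proposals. A broad Gaussian is a poor approximation of the posterior, so many of its proposals are rejected. The motivation for a learned global proposal, as described above, is to draw samples that approximate the posterior and thereby waste fewer likelihood evaluations.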