Accelerated Bayesian Inference for Molecular Simulations using Local Gaussian Process Surrogate Models

B. L. Shanks,H. W. Sullivan,A. R. Shazed,M. P. Hoepfner
DOI: https://doi.org/10.1021/acs.jctc.3c01358
2024-04-02
Abstract:While Bayesian inference is the gold standard for uncertainty quantification and propagation, its use within physical chemistry encounters formidable computational barriers. These bottlenecks are magnified for modeling data with many independent variables, such as X-ray/neutron scattering patterns and electromagnetic spectra. To address this challenge, we apply a Bayesian framework accelerated via local Gaussian process (LGP) surrogate models. We show that the time-complexity of LGPs scales linearly in the number of independent variables, in stark contrast to the computationally expensive cubic scaling of conventional Gaussian processes. To illustrate the method, we trained a LGP surrogate model on the experimental radial distribution function of liquid neon, and observed a remarkable 288,000-fold speed-up compared to molecular dynamics with insignificant loss in predictive accuracy. We conclude that LGPs are robust and efficient surrogate models, poised to expand the application of Bayesian inference in molecular simulations to a broad spectrum of ever-advancing experimental data.
Chemical Physics,Soft Condensed Matter,Statistical Mechanics
What problem does this paper attempt to address?
The paper aims to address the computational barriers faced in Bayesian inference for molecular simulations, particularly in dealing with data from multiple independent variables (such as X-ray/neutron scattering patterns and electromagnetic spectra). By using a local Gaussian process (LGP) surrogate model, the paper accelerates Bayesian optimization of these complex thermophysical properties to reduce time complexity and improve efficiency. The research shows that compared to traditional Gaussian processes, LGP achieves significant computational speed improvements, thereby extending the application scope of Bayesian inference in molecular simulations to handle large-scale complex experimental data.