Fusion of Gaussian Processes Predictions with Monte Carlo Sampling

Marzieh Ajirak,Daniel Waxman,Fernando Llorente,Petar M. Djuric
DOI: https://doi.org/10.1109/IEEECONF59524.2023.10476787
2024-03-03
Abstract:In science and engineering, we often work with models designed for accurate prediction of variables of interest. Recognizing that these models are approximations of reality, it becomes desirable to apply multiple models to the same data and integrate their outcomes. In this paper, we operate within the Bayesian paradigm, relying on Gaussian processes as our models. These models generate predictive probability density functions (pdfs), and the objective is to integrate them systematically, employing both linear and log-linear pooling. We introduce novel approaches for log-linear pooling, determining input-dependent weights for the predictive pdfs of the Gaussian processes. The aggregation of the pdfs is realized through Monte Carlo sampling, drawing samples of weights from their posterior. The performance of these methods, as well as those based on linear pooling, is demonstrated using a synthetic dataset.
Machine Learning
What problem does this paper attempt to address?
This paper discusses how to integrate Gaussian Processes (GPs) prediction to improve the accuracy of variable prediction in science and engineering. The authors propose a Bayesian framework-based method that utilizes probability density functions (pdfs) generated by Gaussian Processes and systematically integrates them. They introduce new approaches of linear and logarithmic pooling to determine the weights of the predictive pdfs dependent on the input and aggregate the pdfs by sampling from the posterior distribution of the weights using Monte Carlo sampling. The main problem addressed in the paper is how to effectively combine the predictions of multiple models, as a single model may not accurately represent reality. The study focuses on using Gaussian Processes as models and combining the predictions of different Gaussian Processes through linear or logarithmic pooling strategies, estimating input-dependent weights using Monte Carlo sampling. The paper also compares the performance of linear pooling and logarithmic pooling methods, demonstrating the effectiveness of these methods using synthetic datasets. Additionally, the paper mentions two existing fusion strategies: Bayesian Hierarchical Stacking (BHS) and Mixture of GP Experts (MoGPE), and proposes their variants - Product BHS (P-BHS) and Product of GP Experts (PoGPE). These new methods allow more flexible fusion of Gaussian Process predictions beyond simple linear combinations. Finally, through numerical experiments, the paper evaluates the performance of different methods with different numbers of experts (Gaussian Processes) and Random Fourier Features (RFF) frequencies, demonstrating that logarithmic pooling may outperform linear pooling in certain scenarios.