Multivariate Bayesian Last Layer for Regression: Uncertainty Quantification and Disentanglement

Han Wang,Eiji Kawasaki,Guillaume Damblin,Geoffrey Daniel
2024-05-03
Abstract:We present new Bayesian Last Layer models in the setting of multivariate regression under heteroscedastic noise, and propose an optimization algorithm for parameter learning. Bayesian Last Layer combines Bayesian modelling of the predictive distribution with neural networks for parameterization of the prior, and has the attractive property of uncertainty quantification with a single forward pass. The proposed framework is capable of disentangling the aleatoric and epistemic uncertainty, and can be used to transfer a canonically trained deep neural network to new data domains with uncertainty-aware capability.
Machine Learning
What problem does this paper attempt to address?
This paper proposes a new Multivariate Bayesian Last Layer (BLL) model for addressing regression problems under heteroscedastic noise. In deep neural networks (DNNs), uncertainty quantification (UQ) of predictions is becoming increasingly important as it involves two types of uncertainties: aleatoric uncertainty and epistemic uncertainty. The focus of the paper is to develop a framework that can differentiate between these two uncertainties and enable the trained DNN to have uncertainty-aware capability in new data domains. The paper first reviews traditional linear regression models and then extends them to the case of multivariate heteroscedasticity. It introduces a new BLL model where the neural network is used to parameterize the prior distribution, allowing for one-shot forward propagation to quantify uncertainty. Through theoretical analysis of matrix variable distributions, the paper demonstrates how to decouple aleatoric and epistemic uncertainties in high dimensions, similar to the Deep Evidential Regression (DER) approach. The paper also discusses the issue of parameter estimation, particularly in the optimization of evidence (or marginal likelihood), where gradient estimates may be inaccurate due to the non-independence of batch data. To address this problem, they propose an algorithm called ELBO-EM (Evidence Lower Bound Expectation Maximization), which combines variational inference and expectation maximization (EM) to efficiently handle batch training and transfer learning. The experimental section showcases the applications of BLL in toy models, transfer learning, and multivariate time series prediction, demonstrating the effectiveness of the approach, especially in uncertainty quantification and model generalization capability. In summary, the main contribution of this paper is the development of a new BLL framework for handling uncertainty in multivariate regression, with particular emphasis on heteroscedastic noise, and providing an efficient optimization algorithm for batch training.