Hyper-Diffusion: Estimating Epistemic and Aleatoric Uncertainty with a Single Model

Matthew A. Chan,Maria J. Molina,Christopher A. Metzler
2024-02-06
Abstract:Estimating and disentangling epistemic uncertainty (uncertainty that can be reduced with more training data) and aleatoric uncertainty (uncertainty that is inherent to the task at hand) is critically important when applying machine learning (ML) to high-stakes applications such as medical imaging and weather forecasting. Conditional diffusion models' breakthrough ability to accurately and efficiently sample from the posterior distribution of a dataset now makes uncertainty estimation conceptually straightforward: One need only train and sample from a large ensemble of diffusion models. Unfortunately, training such an ensemble becomes computationally intractable as the complexity of the model architecture grows.
Machine Learning,Computer Vision and Pattern Recognition
What problem does this paper attempt to address?