Abstract:Advances in deep neural network (DNN)-based molecular property prediction have recently led to the development of models of remarkable accuracy and generalization ability, with graph convolutional neural networks (GCNNs) reporting state-of-the-art performance for this task. However, some challenges remain, and one of the most important that needs to be fully addressed concerns uncertainty quantification. DNN performance is affected by the volume and the quality of the training samples. Therefore, establishing when and to what extent a prediction can be considered reliable is just as important as outputting accurate predictions, especially when out-of-domain molecules are targeted. Recently, several methods to account for uncertainty in DNNs have been proposed, most of which are based on approximate Bayesian inference. Among these, only a few scale to the large data sets required in applications. Evaluating and comparing these methods has recently attracted great interest, but results are generally fragmented and absent for molecular property prediction. In this paper, we quantitatively compare scalable techniques for uncertainty estimation in GCNNs. We introduce a set of quantitative criteria to capture different uncertainty aspects and then use these criteria to compare MC-dropout, Deep Ensembles, and bootstrapping, both theoretically in a unified framework that separates aleatoric/epistemic uncertainty and experimentally on public data sets. Our experiments quantify the performance of the different uncertainty estimation methods and their impact on uncertainty-related error reduction. Our findings indicate that Deep Ensembles and bootstrapping consistently outperform MC-dropout, with different context-specific pros and cons. Our analysis leads to a better understanding of the role of aleatoric/epistemic uncertainty, also in relation to the target data set features, and highlights the challenge posed by out-of-domain uncertainty.

Chemoinformatic regression methods and their applicability domain

A Statistical View of Some Chemometrics Regression Tools

Characterizing Uncertainty in Machine Learning for Chemistry

Methods for comparing uncertainty quantifications for material property predictions

Error Assessment of Computational Models in Chemistry

Determining Domain of Machine Learning Models using Kernel Density Estimates: Applications in Materials Property Prediction

Rethinking the applicability domain analysis in QSAR models

Application of a genomic model for high-dimensional chemometric analysis

Uncertainty Quantification Metrics for Deep Regression

Robust multivariate methods in Chemometrics

A hybrid framework for improving uncertainty quantification in deep learning-based QSAR regression modeling

Analysis of uncertainty of neural fingerprint-based models

Evolution of Support Vector Machine and Regression Modeling in Chemoinformatics and Drug Discovery

chemmodlab: A Cheminformatics Modeling Laboratory for Fitting and Assessing Machine Learning Models

A comparative study of conformal prediction methods for valid uncertainty quantification in machine learning

Experimental methods in chemical engineering: Monte Carlo

Beyond the Norms: Detecting Prediction Errors in Regression Models

Holistic chemical evaluation reveals pitfalls in reaction prediction models

Regression-based analysis of multivariate non-Gaussian datasets for diagnosing abnormal situations in chemical processes

Evaluating Scalable Uncertainty Estimation Methods for Deep Learning-Based Molecular Property Prediction

Distribution-free risk assessment of regression-based machine learning algorithms