Abstract:Background: The willingness to trust predictions formulated by automatic algorithms is key in a wide range of domains. However, a vast number of deep architectures are only able to formulate predictions without associated uncertainty. Purpose: In this study, we propose a method to convert a standard neural network into a Bayesian neural network and estimate the variability of predictions by sampling different networks similar to the original one at each forward pass. Methods: We combine our method with a tunable rejection-based approach that employs only the fraction of the data, i.e., the share that the model can classify with an uncertainty below a user-set threshold. We test our model in a large cohort of brain images from patients with Alzheimer's disease and healthy controls, discriminating the former and latter classes based on morphometric images exclusively. Results: We demonstrate how combining estimated uncertainty with a rejection-based approach increases classification accuracy from 0.86 to 0.95 while retaining 75% of the test set. In addition, the model can select the cases to be recommended for, e.g., expert human evaluation due to excessive uncertainty. Importantly, our framework circumvents additional workload during the training phase by using our network "turned into Bayesian" to implicitly investigate the loss landscape in the neighborhood of each test sample in order to determine the reliability of the predictions. Conclusion: We believe that being able to estimate the uncertainty of a prediction, along with tools that can modulate the behavior of the network to a degree of confidence that the user is informed about (and comfortable with), can represent a crucial step in the direction of user compliance and easier integration of deep learning tools into everyday tasks currently performed by human operators.

Self-Calibrating Neural-Probabilistic Model for Authorship Verification Under Covariate Shift

Deep Bayes Factor Scoring for Authorship Verification

Forging the Forger: An Attempt to Improve Authorship Verification via Data Augmentation

Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification

A model-independent redundancy measure for human versus ChatGPT authorship discrimination using a Bayesian probabilistic approach

Authorship Verification based on the Likelihood Ratio of Grammar Models

Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Rethinking the Authorship Verification Experimental Setups

A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution

Authorship attribution based on a probabilistic topic model

InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification

Authorship Verification - An Approach based on Random Forest

Robust and Accurate Authorship Attribution via Program Normalization

Neural Authorship Attribution: Stylometric Analysis on Large Language Models

Post-hoc Uncertainty Calibration for Domain Drift Scenarios

Mind the Uncertainty in Human Disagreement: Evaluating Discrepancies between Model Predictions and Human Responses in VQA

Enabling uncertainty estimation in neural networks through weight perturbation for improved Alzheimer's disease classification

A Comprehensive Study of Calibration and Uncertainty Quantification for Bayesian Convolutional Neural Networks - An Application to Seismic Data

Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

Calibration and Uncertainty Quantification of Bayesian Convolutional Neural Networks for Geophysical Applications

CAVE: Controllable Authorship Verification Explanations