Abstract:Trustworthiness in neural networks is crucial for their deployment in critical applications, where reliability, confidence, and uncertainty play pivotal roles in decision-making. Traditional performance metrics such as accuracy and precision fail to capture these aspects, particularly in cases where models exhibit overconfidence. To address these limitations, this paper introduces a novel framework for quantifying the trustworthiness of neural networks by incorporating subjective logic into the evaluation of Expected Calibration Error (ECE). This method provides a comprehensive measure of trust, disbelief, and uncertainty by clustering predicted probabilities and fusing opinions using appropriate fusion operators. We demonstrate the effectiveness of this approach through experiments on MNIST and CIFAR-10 datasets, where post-calibration results indicate improved trustworthiness. The proposed framework offers a more interpretable and nuanced assessment of AI models, with potential applications in sensitive domains such as healthcare and autonomous systems.

What problem does this paper attempt to address?

### What problems does this paper attempt to solve? This paper aims to address the challenges of trustworthiness assessment of neural networks in critical applications. Specifically, traditional performance metrics such as accuracy, precision, and recall cannot capture the uncertainty and subjective aspects in model predictions, especially when the model shows over - confidence. These issues are particularly important in sensitive areas such as healthcare, finance, and autonomous systems, as wrong decisions can lead to serious consequences. #### Main problems 1. **Limitations of trustworthiness assessment**: - Traditional performance metrics (such as accuracy, precision, and recall) only focus on the correctness of model outputs and fail to reflect the confidence and uncertainty in predictions. - Existing calibration methods (such as Expected Calibration Error, ECE) are helpful in aligning predicted probabilities with actual results, but are difficult to interpret and cannot comprehensively assess the trustworthiness of AI systems. 2. **Deficiencies of existing methods**: - Existing methods for assessing trustworthiness usually require ground - truth label data at the operational stage, which may be difficult or impossible to obtain in real - world scenarios. - Methods that rely on assumed "oracles" to provide ground - truth labels may themselves be unreliable or untrustworthy, further complicating the assessment process. 3. **Lack of a comprehensive trustworthiness assessment framework**: - Current methods do not fully consider the degrees of belief, disbelief, and uncertainty, which are crucial for more detailed trust assessment. #### Solutions To solve the above problems, the authors propose a new framework based on subjective logic to quantify the trustworthiness of neural networks. The main contributions of this framework include: - **Extension of traditional ECE**: By introducing subjective logic, belief, disbelief, and uncertainty are incorporated into trustworthiness assessment, providing a more detailed and interpretable measure of trustworthiness. - **Clustering of predicted probabilities**: The predicted probability values are clustered, and a trustworthiness opinion is calculated for each cluster, enabling a more fine - grained trustworthiness analysis. - **Fusion of trust opinions**: Appropriate fusion operators are used to combine the trust opinions of each cluster into a comprehensive trust opinion, reflecting the overall trustworthiness of the entire neural network prediction. #### Experimental verification The authors verified the effectiveness of this framework through experiments on the MNIST and CIFAR - 10 datasets. The experimental results show that after temperature - scaling calibration, the trustworthiness of the model is significantly improved, indicating that this method has potential application value in practical application scenarios. ### Summary By introducing subjective logic, this paper proposes a new framework for quantifying trustworthiness, which solves the limitations of traditional assessment methods in the trustworthiness assessment of neural networks. This method not only improves the interpretability of trustworthiness assessment but also provides important support for reliable and ethical deployment in critical applications.

Quantifying calibration error in modern neural networks through evidence based theory

Addressing Deep Learning Model Calibration Using Evidential Neural Networks And Uncertainty-Aware Training

Beyond Calibration: Assessing the Probabilistic Fit of Neural Regressors via Conditional Congruence

Revisiting the Calibration of Modern Neural Networks

On Calibration of Modern Quantized Efficient Neural Networks

Better Uncertainty Calibration via Proper Scores for Classification and Beyond

Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Two Sides of Miscalibration: Identifying Over and Under-Confidence Prediction for Network Calibration

Calibration Error Estimation Using Fuzzy Binning

Calibrating Deep Neural Network using Euclidean Distance

A Theoretical and Practical Framework for Evaluating Uncertainty Calibration in Object Detection

Accurate Uncertainties for Deep Learning Using Calibrated Regression

A Confidence Interval for the $\ell_2$ Expected Calibration Error

Expectation consistency for calibration of neural networks

Confidence Calibration for Object Detection and Segmentation

Improving Uncertainty Calibration of Deep Neural Networks via Truth Discovery and Geometric Optimization

Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

Effect of confidence and explanation on accuracy and trust calibration in AI-assisted decision making

Fidelity: Towards Measuring the Trustworthiness of Neural Network Classification

On the Calibration of Epistemic Uncertainty: Principles, Paradoxes and Conflictual Loss

On Deep Neural Network Calibration by Regularization and its Impact on Refinement