Abstract:Background and objective: Uncertainty quantification is a pivotal field that contributes to realizing reliable and robust systems. It becomes instrumental in fortifying safe decisions by providing complementary information, particularly within high-risk applications. existing studies have explored various methods that often operate under specific assumptions or necessitate substantial modifications to the network architecture to effectively account for uncertainties. The objective of this paper is to study Conformal Prediction, an emerging distribution-free uncertainty quantification technique, and provide a comprehensive understanding of the advantages and limitations inherent in various methods within the medical imaging field. Methods: In this study, we developed Conformal Prediction, Monte Carlo Dropout, and Evidential Deep Learning approaches to assess uncertainty quantification in deep neural networks. The effectiveness of these methods is evaluated using three public medical imaging datasets focused on detecting pigmented skin lesions and blood cell types. Results: The experimental results demonstrate a significant enhancement in uncertainty quantification with the utilization of the Conformal Prediction method, surpassing the performance of the other two methods. Furthermore, the results present insights into the effectiveness of each uncertainty method in handling Out-of-Distribution samples from domain-shifted datasets. Our code is available at: Conclusions: Our conclusion highlights a robust and consistent performance of conformal prediction across diverse testing conditions. This positions it as the preferred choice for decision-making in safety-critical applications.

What problem does this paper attempt to address?

### Problems the Paper Aims to Solve This paper aims to study and validate the application of **Conformal Prediction (CP)**, an emerging distribution-free uncertainty quantification technique, in the field of medical imaging, particularly for skin lesion classification tasks. Specifically, the goals of the paper include: 1. **Exploring and evaluating the advantages and limitations of different uncertainty quantification methods**: The paper compares three top uncertainty quantification methods—Conformal Prediction (CP), Monte Carlo Dropout (MCD), and Evidential Deep Learning (EDL)—and provides a detailed analysis of these methods when handling medical imaging data. 2. **Improving the effectiveness of uncertainty quantification**: Through experimental validation, the paper demonstrates the significant improvement of the CP method in uncertainty quantification, surpassing the other two methods. 3. **Handling Out-of-Distribution (OOD) samples**: The paper explores the effectiveness of each uncertainty quantification method when dealing with datasets from different domains, particularly in the presence of OOD samples. 4. **Providing safe and reliable decision support**: The paper emphasizes the robust performance of the CP method under various test conditions, making it the preferred method for decision support in high-risk applications. ### Background and Motivation In medical applications, deep learning models can achieve high accuracy in tasks such as image classification, tumor detection, and medical image segmentation. However, these models are typically trained on clean and well-curated datasets. In real-world applications, images may be affected by noise, artifacts, and other factors, leading to increased uncertainty in model predictions. Therefore, expressing the confidence of network predictions is crucial for assessing the reliability and robustness of model results, especially in critical applications like medical imaging. ### Methods The paper employs three different uncertainty quantification methods and conducts experiments on three public medical imaging datasets: 1. **Conformal Prediction (CP)**: A distribution-free uncertainty quantification technique that generates prediction sets by calibrating with a portion of unused data. 2. **Monte Carlo Dropout (MCD)**: Estimates model uncertainty by performing multiple forward passes during the inference phase. 3. **Evidential Deep Learning (EDL)**: Estimates prediction uncertainty by training the model to output evidence vectors. ### Results The experimental results show that the CP method exhibits significant advantages in uncertainty quantification, especially when handling OOD samples. Specifically: - **CP**: The average uncertainty value for correctly classified samples is 0.4, and for incorrectly classified samples, it is 0.79. - **MCD**: The average uncertainty value for correctly classified samples is 0.01, and for incorrectly classified samples, it is 0.09. - **EDL**: The average uncertainty value for correctly classified samples is 0.19, and for incorrectly classified samples, it is 0.51. ### Conclusion The paper concludes that the CP method demonstrates robust and consistent performance under various test conditions, making it particularly suitable for decision support in high-risk applications. This makes CP the preferred method for handling uncertainty in medical imaging classification tasks.

Empirical Validation of Conformal Prediction for Trustworthy Skin Lesions Classification

Joint Dermatological Lesion Classification and Confidence Modeling with Uncertainty Estimation

Fair Conformal Predictors for Applications in Medical Imaging

Deep Conformal Supervision: Leveraging Intermediate Features for Robust Uncertainty Quantification

Quantifying Deep Learning Model Uncertainty in Conformal Prediction

Conformal Prediction: A Gentle Introduction

Improving Trustworthiness of AI Disease Severity Rating in Medical Imaging with Ordinal Conformal Prediction Sets

A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification

A conformalized learning of a prediction set with applications to medical imaging classification

Three Applications of Conformal Prediction for Rating Breast Density in Mammography

Federated Conformal Predictors for Distributed Uncertainty Quantification

Adaptive Conformal Prediction Intervals Using Data-Dependent Weights With Application to Seismic Response Prediction

Conformal Risk Control for Pulmonary Nodule Detection

Empirically Validating Conformal Prediction on Modern Vision Architectures Under Distribution Shift and Long-tailed Data

Toward Clinically Trustworthy Deep Learning: Applying Conformal Prediction to Intracranial Hemorrhage Detection

Evaluating the Utility of Conformal Prediction Sets for AI-Advised Image Labeling

Conformal Prediction on Quantifying Uncertainty of Dynamic Systems

Deep Learning-Based Conformal Prediction of Toxicity

Adaptive Uncertainty Quantification for Generative AI

Conformal Depression Prediction

Uncertainty Quantification in Computer-Aided Diagnosis: Make Your Model say "I don't know" for Ambiguous Cases