Can I Trust You? Rethinking Calibration with Controllable Confidence Ranking

Wenshu Ge,Huan Ma,Changqing Zhang
DOI: https://doi.org/10.1109/ijcnn60899.2024.10651478
2024-01-01
Abstract:While modeling uncertainty is becoming one of the mainstays in machine learning, the problem of evaluating the estimated confidence remains challenging. For the confidence estimation in classification, the main difficulty lies in lacking sample-level confidence. Existing measures either conduct an evaluation based on a set of predictions (Expected Calibration Error: ECE) or use 0/1 as an approximation of confidence (Negative Log-Likelihood: NLL). In this study, we reveal that these measures are inadequate in evaluating reasonable calibration, and propose to evaluate the predictive confidence for modern neural networks using a complementary internal measure to exploit the sample-level confidence. Specifically, based on the principle "information is the eliminated uncertainty", we propose a general framework to investigate whether (even well-calibrated) models estimate the confidence of a sample that matches the information it contains with controllable-confidence-ranking (CCR), and then devise a novel regularization strategy stabilizing the sample-level confidence estimation, producing more reasonable and sharper confidence. We conduct extensive experiments to validate the effectiveness and potential of our method.
What problem does this paper attempt to address?