Abstract:Despite the great success of state-of-the-art deep neural networks, several studies have reported models to be over-confident in predictions, indicating miscalibration. Label Smoothing has been proposed as a solution to the over-confidence problem and works by softening hard targets during training, typically by distributing part of the probability mass from a `one-hot' label uniformly to all other labels. However, neither model nor human confidence in a label are likely to be uniformly distributed in this manner, with some labels more likely to be confused than others. In this paper we integrate notions of model confidence and human confidence with label smoothing, respectively \textit{Model Confidence LS} and \textit{Human Confidence LS}, to achieve better model calibration and generalization. To enhance model generalization, we show how our model and human confidence scores can be successfully applied to curriculum learning, a training strategy inspired by learning of `easier to harder' tasks. A higher model or human confidence score indicates a more recognisable and therefore easier sample, and can therefore be used as a scoring function to rank samples in curriculum learning. We evaluate our proposed methods with four state-of-the-art architectures for image and text classification task, using datasets with multi-rater label annotations by humans. We report that integrating model or human confidence information in label smoothing and curriculum learning improves both model performance and model calibration. The code are available at \url{<a class="link-external link-https" href="https://github.com/AoShuang92/Confidence_Calibration_CL" rel="external noopener nofollow">this https URL</a>}.

Trusting Language Models in Education

Large Language Model Confidence Estimation via Black-Box Access

Language Models (Mostly) Know What They Know

Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness

On the Intersection of Self-Correction and Trust in Language Models

The Calibration Gap between Model and Human Confidence in Large Language Models

Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty

FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation

A Diachronic Perspective on User Trust in AI under Uncertainty

Improving the Reliability of Large Language Models by Leveraging Uncertainty-Aware In-Context Learning

Confidence Under the Hood: An Investigation into the Confidence-Probability Alignment in Large Language Models

Calibrating the Confidence of Large Language Models by Eliciting Fidelity

Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Large Language Models Must Be Taught to Know What They Don't Know

Uncovering Name-Based Biases in Large Language Models Through Simulated Trust Game

Enhancing Trust in Large Language Models with Uncertainty-Aware Fine-Tuning

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

Confidence-Aware Calibration and Scoring Functions for Curriculum Learning

Calibrating Large Language Models Using Their Generations Only

Towards Trustworthy Large Language Models.