Abstract:With the widespread application of Large Language Models (LLMs) to various domains, concerns regarding the trustworthiness of LLMs in safety-critical scenarios have been raised, due to their unpredictable tendency to hallucinate and generate misinformation. Existing LLMs do not have an inherent functionality to provide the users with an uncertainty/confidence metric for each response it generates, making it difficult to evaluate trustworthiness. Although several studies aim to develop uncertainty quantification methods for LLMs, they have fundamental limitations, such as being restricted to classification tasks, requiring additional training and data, considering only lexical instead of semantic information, and being prompt-wise but not response-wise. A new framework is proposed in this paper to address these issues. Semantic density extracts uncertainty/confidence information for each response from a probability distribution perspective in semantic space. It has no restriction on task types and is "off-the-shelf" for new models and tasks. Experiments on seven state-of-the-art LLMs, including the latest Llama 3 and Mixtral-8x22B models, on four free-form question-answering benchmarks demonstrate the superior performance and robustness of semantic density compared to prior approaches.

What problem does this paper attempt to address?

The problem this paper attempts to address is: Current large language models (LLMs) lack quantifiable metrics for uncertainty and confidence when generating responses, making it difficult to assess their reliability in high-risk scenarios. Specifically, existing LLMs cannot provide uncertainty or confidence measures for each generated response, making it challenging for users to determine the reliability of these responses. ### Background and Motivation 1. **Trust Issues**: Although LLMs have made significant progress in various fields, they have an unpredictable tendency to hallucinate when generating responses, i.e., generating incorrect or misleading information. 2. **Limitations of Existing Methods**: - **Classification Task Limitation**: Many existing methods are only applicable to classification tasks and not to free-form natural language generation tasks. - **Additional Training Requirements**: Some methods require additional training data and specific task labels, limiting their applicability. - **Lexical Rather Than Semantic**: Existing methods mainly focus on lexical-level uncertainty, ignoring semantic-level information. ### Solution The paper proposes a new framework—**Semantic Density (SD)**—to extract uncertainty/confidence information for each response in the semantic space. Specific advantages include: 1. **Response-Level Metrics**: Semantic density can provide confidence metrics for each specific response, not just for the entire prompt. 2. **Fine-Grained Semantic Differences**: It considers fine-grained semantic differences between different responses, making uncertainty quantification more precise. 3. **No Additional Training Required**: Semantic density is a "plug-and-play" tool that can be directly applied to any pre-trained LLM without modifying the model or requiring additional training. 4. **No Task Type Restrictions**: Applicable to various task types, especially general free-form generation tasks. ### Experimental Results The paper validates the effectiveness of semantic density through experiments on seven state-of-the-art LLMs, including the latest Llama 3 and Mixtral-8x22B models, across four free-form question-answering benchmark datasets. The experimental results show that semantic density outperforms existing uncertainty/confidence quantification methods on multiple metrics, particularly excelling in AUROC and AUPR metrics. ### Conclusion Semantic density provides an effective method to quantify the confidence of LLM-generated responses, thereby enhancing the ability to assess reliability in high-risk scenarios. The widespread application of this method is expected to promote deeper applications of LLMs in more fields.

Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space

Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings

On Verbalized Confidence Scores for LLMs

A Survey on Uncertainty Quantification of Large Language Models: Taxonomy, Open Research Challenges, and Future Directions

Large Language Model Confidence Estimation via Black-Box Access

Shifting Attention to Relevance: Towards the Uncertainty Estimation of Large Language Models

Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models

Quantifying Uncertainty in Answers from any Language Model and Enhancing their Trustworthiness

Semantic Uncertainty: Linguistic Invariances for Uncertainty Estimation in Natural Language Generation

Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities

Uncertainty Quantification in Large Language Models Through Convex Hull Analysis

Can LLMs Express Their Uncertainty? An Empirical Evaluation of Confidence Elicitation in LLMs

Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?

Uncertainty Quantification for In-Context Learning of Large Language Models

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

Semantically Diverse Language Generation for Uncertainty Estimation in Language Models

Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models

A Survey of Uncertainty Estimation in LLMs: Theory Meets Practice

Uncertainty-Based Abstention in LLMs Improves Safety and Reduces Hallucinations

Benchmarking LLMs via Uncertainty Quantification