Abstract:Uncertainty estimation is a necessary component when implementing AI in high-risk settings, such as autonomous cars, medicine, or insurances. Large Language Models (LLMs) have seen a surge in popularity in recent years, but they are subject to hallucinations, which may cause serious harm in high-risk settings. Despite their success, LLMs are expensive to train and run: they need a large amount of computations and memory, preventing the use of ensembling methods in practice. In this work, we present a novel method that allows for fast and memory-friendly training of LLM ensembles. We show that the resulting ensembles can detect hallucinations and are a viable approach in practice as only one GPU is needed for training and inference.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is the problem of detecting hallucinations in large - language models (LLMs). Specifically, the paper focuses on two types of hallucinations: faithfulness hallucinations and factual hallucinations. These hallucinations may lead to serious consequences when using AI in high - risk scenarios (such as self - driving cars, the medical or insurance industries). Although existing methods have shown certain effectiveness in specific tasks, they are usually limited to a narrow range of tasks and lack broad applicability. In addition, although existing deep integration methods can provide more reliable uncertainty estimates, they require a large amount of computing resources, which limits their practical applications. To solve these problems, the paper proposes a fast and memory - efficient method to train LLM ensembles. This method utilizes low - rank adaptation (LoRA) matrices and component - specific rank - 1 matrix modifications to reduce computational overhead and make the effective use of the ensemble method possible. Through this method, the author shows how to use the uncertainty estimates generated by LLMs as features to train a binary classifier to distinguish between hallucinatory content and correct content, thereby achieving effective detection of both types of hallucinations. The main contributions of the paper include: 1. Proposing a fast and memory - efficient LLM fine - tuning method, using LoRA matrices and component - specific rank - 1 matrix modifications, reducing computational overhead and making the application of the ensemble method more feasible. 2. Proposing a new hallucination - detection method, redefining hallucination detection as a binary - classification task, and using the uncertainty estimates of LLMs as features to distinguish between hallucinatory content and correct content. 3. Demonstrating that this method can operate effectively under the minimum hardware configuration (only requiring one A40 GPU), proving its efficiency and scalability. Through these contributions, the paper not only improves the accuracy of hallucination detection in LLMs but also provides a practical solution for deploying these models in resource - constrained environments.

Hallucination Detection in LLMs: Fast and Memory-Efficient Fine-Tuned Models

Cost-Effective Hallucination Detection for LLMs

Unsupervised Real-Time Hallucination Detection based on the Internal States of Large Language Models

Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Hallucination Detection and Hallucination Mitigation: An Investigation

Hallucination Detection for Generative Large Language Models by Bayesian Sequential Estimation

Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection

SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection

Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus

Banishing LLM Hallucinations Requires Rethinking Generalization

Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks

LLM Internal States Reveal Hallucination Risk Faced With a Query

Fine-grained Hallucination Detection and Editing for Language Models

Zero-Resource Hallucination Prevention for Large Language Models

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback

Learning to Trust Your Feelings: Leveraging Self-awareness in LLMs for Hallucination Mitigation

Reference-free Hallucination Detection for Large Vision-Language Models

Beyond Fine-Tuning: Effective Strategies for Mitigating Hallucinations in Large Language Models for Data Analytics

FactCheckmate: Preemptively Detecting and Mitigating Hallucinations in LMs

The Two Sides of the Coin: Hallucination Generation and Detection with LLMs as Evaluators for LLMs

OPDAI at SemEval-2024 Task 6: Small LLMs can Accelerate Hallucination Detection with Weakly Supervised Data