Insights into Classifying and Mitigating LLMs' Hallucinations

Alessandro Bruno,Pier Luigi Mazzeo,Aladine Chetouani,Marouane Tliba,Mohamed Amine Kerkouri
DOI: https://doi.org/10.48550/arXiv.2311.08117
2023-11-14
Abstract:The widespread adoption of large language models (LLMs) across diverse AI applications is proof of the outstanding achievements obtained in several tasks, such as text mining, text generation, and question answering. However, LLMs are not exempt from drawbacks. One of the most concerning aspects regards the emerging problematic phenomena known as "Hallucinations". They manifest in text generation systems, particularly in question-answering systems reliant on LLMs, potentially resulting in false or misleading information propagation. This paper delves into the underlying causes of AI hallucination and elucidates its significance in artificial intelligence. In particular, Hallucination classification is tackled over several tasks (Machine Translation, Question and Answer, Dialog Systems, Summarisation Systems, Knowledge Graph with LLMs, and Visual Question Answer). Additionally, we explore potential strategies to mitigate hallucinations, aiming to enhance the overall reliability of LLMs. Our research addresses this critical issue within the HeReFaNMi (Health-Related Fake News Mitigation) project, generously supported by NGI Search, dedicated to combating Health-Related Fake News dissemination on the Internet. This endeavour represents a concerted effort to safeguard the integrity of information dissemination in an age of evolving AI technologies.
Computation and Language
What problem does this paper attempt to address?
The paper attempts to address the issue of "hallucinations" generated by large language models (LLMs) when producing text. Specifically, these hallucinations can lead to the generation of incorrect or misleading information, especially in tasks such as question answering systems, machine translation, dialogue systems, summarization, knowledge graph generation, and visual question answering. The main objectives of the paper are: 1. **In-depth analysis of hallucination phenomena**: Investigate the causes of hallucinations and their significance in artificial intelligence. 2. **Classification of hallucination types**: Categorize hallucinations in different tasks, including machine translation, question answering systems, dialogue systems, summarization systems, knowledge graph-based text generation, and visual question answering. 3. **Propose mitigation strategies**: Explore potential strategies to alleviate hallucinations to improve the overall reliability of LLMs. Through this research, the paper aims to provide a theoretical foundation and practical guidance for addressing the hallucination problem, particularly in the HeReFaNMi (Health-Related Fake News Mitigation) project, which is dedicated to reducing the spread of health-related fake news on the internet.