XAI meets LLMs: A Survey of the Relation between Explainable AI and Large Language Models

Erik Cambria,Lorenzo Malandri,Fabio Mercorio,Navid Nobani,Andrea Seveso
2024-07-22
Abstract:In this survey, we address the key challenges in Large Language Models (LLM) research, focusing on the importance of interpretability. Driven by increasing interest from AI and business sectors, we highlight the need for transparency in LLMs. We examine the dual paths in current LLM research and eXplainable Artificial Intelligence (XAI): enhancing performance through XAI and the emerging focus on model interpretability. Our paper advocates for a balanced approach that values interpretability equally with functional advancements. Recognizing the rapid development in LLM research, our survey includes both peer-reviewed and preprint (arXiv) papers, offering a comprehensive overview of XAI's role in LLM research. We conclude by urging the research community to advance both LLM and XAI fields together.
Computation and Language
What problem does this paper attempt to address?
The paper attempts to address issues primarily focused on the interpretability of large language models (LLMs). Specifically, the paper explores the following aspects: 1. **Complexity and Opacity of LLMs**: Due to their complex internal structure and black-box nature, LLMs are difficult to understand, which can lead to inappropriate content or misleading outputs in practical applications, especially in critical fields such as healthcare and finance. 2. **Trust and Transparency**: In fields like healthcare and finance, the opacity of LLMs results in low user trust, affecting the scope and acceptance of these models. Particularly under strict regulatory requirements, such as the EU's GDPR, the interpretability of models becomes especially important. 3. **Misuse and Impact on Critical Thinking**: The multifunctionality of LLMs may lead to their use for harmful purposes, such as generating inappropriate content to evade censorship. Additionally, over-reliance on LLMs may weaken users' critical thinking and independent analytical abilities, especially in the education sector. 4. **Ethical and Privacy Issues**: The use of LLMs also raises ethical and privacy concerns, such as fairness, hate speech, and sensitive data leakage, necessitating proactive measures and the development of ethical guidelines to address these issues. 5. **Inaccuracy and Hallucination**: LLMs may generate false information, posing risks in fields like education, journalism, and healthcare. Addressing these issues requires improving the accuracy of LLMs, educating users, and developing fact-checking systems. To address the above issues, the paper emphasizes the importance of explainable artificial intelligence (XAI) in LLMs, aiming to enhance the transparency, trustworthiness, and ethical use of LLMs through the development of XAI frameworks. The paper also proposes a systematic classification framework to evaluate current research on LLM interpretability and identifies research gaps and future directions through a comprehensive survey of existing peer-reviewed and preprint papers.