Towards Explainability in Retrieval-Augmented LLMs

Joel Rorseth, Lukasz Golab, P. Godfrey, Jaroslaw Szlichta, Divesh Srivastava
DOI: https://doi.org/10.1109/ICDE60146.2024.00466
2024-05-13
Abstract: In an era where artificial intelligence (AI) is re-shaping countless aspects of society, we present a forward-looking perspective for enhancing the explainability of large language models (LLMs), with a particular focus on the retrieval-augmented generation (RAG) prompting technique. We motivate the urgency for developing techniques to explain LLM decision-making behaviour, especially as these models are deployed in critical sectors. Central to this effort is RAGE, our novel explainability tool that can trace the provenance of an LLM's answer back to external knowledge sources provided via RAG. RAGE builds upon established explainability techniques to recover citations for LLM answers, identify context biases, and mine answer rules. Through our novel explainability formulations and practical use cases, we chart a course toward more transparent and trustworthy AI technologies.
Computer Science