Augmented non-hallucinating large language models as medical information curators

Stephen Gilbert,Jakob Nikolas Kather,Aidan Hogan
DOI: https://doi.org/10.1038/s41746-024-01081-0
IF: 15.2
2024-04-24
npj Digital Medicine
Abstract:Reliably processing and interlinking medical information has been recognized as a critical foundation to the digital transformation of medical workflows, and despite the development of medical ontologies, the optimization of these has been a major bottleneck to digital medicine. The advent of large language models has brought great excitement, and maybe a solution to the medicines' 'communication problem' is in sight, but how can the known weaknesses of these models, such as hallucination and non-determinism, be tempered? Retrieval Augmented Generation, particularly through knowledge graphs, is an automated approach that can deliver structured reasoning and a model of truth alongside LLMs, relevant to information structuring and therefore also to decision support.
health care sciences & services,medical informatics
What problem does this paper attempt to address?
The paper primarily explores the "semantic problem" (also known as the "communication problem") in medical information processing, which involves reliably recording medical information and achieving interoperability between different systems. Despite the development of medical ontologies, this issue remains a major bottleneck in the digital transformation of medicine. The emergence of large language models (LLMs) offers hope for addressing the communication problem in the medical field, but these models have known weaknesses such as hallucination and non-determinism. The paper proposes a Retrieval Augmented Generation (RAG) method enhanced by knowledge graphs (KGs) to overcome the limitations of LLMs and applies it to the structuring of medical information and decision support. Specifically, the paper discusses the following points: 1. **Challenges of Medical Information**: Medical information is often in natural language form, making it difficult for information systems to process. 2. **Advantages and Limitations of LLMs**: While LLMs perform well in information structuring, they have issues such as bias, hallucination, and inaccuracy. 3. **Complementary Role of Knowledge Graphs**: Combining LLMs with knowledge graphs can mitigate the shortcomings of LLMs, improving the accuracy and reliability of information processing. 4. **Future Directions**: The combined approach of LLMs and knowledge graphs is expected to solve the communication problem in medical information and support applications like precision medicine. In summary, the paper aims to explore how combining large language models and knowledge graphs can address key challenges in medical information processing.