KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models

Yingshu Li,Zhanyu Wang,Yunyi Liu,Lei Wang,Lingqiao Liu,Luping Zhou

DOI: https://doi.org/10.48550/arXiv.2409.05370

2024-09-09

Abstract:Harnessing the robust capabilities of Large Language Models (LLMs) for narrative generation, logical reasoning, and common-sense knowledge integration, this study delves into utilizing LLMs to enhance automated radiology report generation (R2Gen). Despite the wealth of knowledge within LLMs, efficiently triggering relevant knowledge within these large models for specific tasks like R2Gen poses a critical research challenge. This paper presents KARGEN, a Knowledge-enhanced Automated radiology Report GENeration framework based on LLMs. Utilizing a frozen LLM to generate reports, the framework integrates a knowledge graph to unlock chest disease-related knowledge within the LLM to enhance the clinical utility of generated reports. This is achieved by leveraging the knowledge graph to distill disease-related features in a designed way. Since a radiology report encompasses both normal and disease-related findings, the extracted graph-enhanced disease-related features are integrated with regional image features, attending to both aspects. We explore two fusion methods to automatically prioritize and select the most relevant features. The fused features are employed by LLM to generate reports that are more sensitive to diseases and of improved quality. Our approach demonstrates promising results on the MIMIC-CXR and IU-Xray datasets.

Computer Vision and Pattern Recognition,Artificial Intelligence

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how to more effectively utilize the knowledge of large - language models (LLMs) in automated radiology report generation (R2Gen) to improve the quality and clinical practicality of generated reports. Although LLMs possess strong capabilities in narrative generation, logical reasoning, and common - sense knowledge integration, in specific tasks such as R2Gen, how to efficiently trigger relevant knowledge in LLMs is an important research challenge. Specifically, the paper proposes a framework named KARGEN, which aims to solve this problem in the following ways: 1. **Integrating the medical domain knowledge graph**: Use a frozen LLM to generate reports, and unlock knowledge related to chest diseases by introducing a knowledge graph, thereby enhancing the clinical practicality of the generated reports. 2. **Extracting disease - related features**: Distill disease - related features through the knowledge graph and combine them with regional image features to focus on normal and disease - related findings in radiology reports. 3. **Feature fusion methods**: Explore two fusion methods (element - level fusion and modality - level fusion) to automatically prioritize the most relevant features, making the generated reports more sensitive to diseases and of higher quality. 4. **Verification and evaluation**: Conduct experiments on two public datasets, MIMIC - CXR and IU - Xray. The results show that KARGEN outperforms existing methods on multiple evaluation metrics, especially performing prominently on clinically - relevant evaluation metrics. Through these improvements, KARGEN can more accurately identify and describe diseases when generating radiology reports, thereby improving the clinical practicality and quality of the reports.

KARGEN: Knowledge-enhanced Automated Radiology Report Generation Using Large Language Models

Leveraging A Medical Knowledge Graph into Large Language Models for Diagnosis Prediction

Large Language Models and Medical Knowledge Grounding for Diagnosis Prediction

Rethinking Medical Report Generation: Disease Revealing Enhancement with Knowledge Graph

R2GenGPT: Radiology Report Generation with Frozen LLMs

Language Models and Retrieval Augmented Generation for Automated Structured Data Extraction from Diagnostic Reports

Automated Clinical Data Extraction with Knowledge Conditioned LLMs

Knowledge-Driven Encode, Retrieve, Paraphrase for Medical Image Report Generation

LLM-RadJudge: Achieving Radiologist-Level Evaluation for X-Ray Report Generation

Enhancing LLMs for Impression Generation in Radiology Reports through a Multi-Agent System

Integrating Knowledge Retrieval and Large Language Models for Clinical Report Correction

When Radiology Report Generation Meets Knowledge Graph

Enhanced Knowledge Injection for Radiology Report Generation

Learning to Generate Radiology Findings from Impressions Based on Large Language Model

DKA-RG: Disease-Knowledge-Enhanced Fine-Grained Image–Text Alignment for Automatic Radiology Report Generation

Development and Testing of Retrieval Augmented Generation in Large Language Models -- A Case Study Report

Radiology Report Generation with a Learned Knowledge Base and Multi-Modal Alignment

Large language models for structured reporting in radiology: past, present, and future

Think and Retrieval: A Hypothesis Knowledge Graph Enhanced Medical Large Language Models