The Impact of Auxiliary Patient Data on Automated Chest X-Ray Report Generation and How to Incorporate It

Aaron Nicolson, Shengyao Zhuang, Jason Dowling, Bevan Koopman
2024-06-19
Abstract: This study investigates the integration of diverse patient data sources into multimodal language models for automated chest X-ray (CXR) report generation. Traditionally, CXR report generation relies solely on CXR images and limited radiology data, overlooking valuable information from patient health records, particularly from emergency departments. Utilising the MIMIC-CXR and MIMIC-IV-ED datasets, we incorporate detailed patient information such as aperiodic vital signs, medications, and clinical history to enhance diagnostic accuracy. We introduce a novel approach to transform these heterogeneous data sources into embeddings that prompt a multimodal language model, significantly enhancing the diagnostic accuracy of generated radiology reports. Our comprehensive evaluation demonstrates the benefits of using a broader set of patient data, underscoring the potential for enhanced diagnostic capabilities and better patient outcomes through the integration of multimodal data in CXR report generation.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
This paper asks how to improve the diagnostic accuracy of automated chest X-ray (CXR) report generation by integrating multiple patient data sources, such as vital signs, medication records, and clinical history. Traditional methods rely mainly on CXR images and limited radiology data, ignoring valuable information in patient health records, especially data from the emergency department. The authors use the MIMIC-CXR and MIMIC-IV-ED datasets to incorporate detailed patient information into a multimodal language model. Specifically, the goals of the paper are:

1. **Study the influence of patient data on CXR report generation**: In particular, measure the effect of individual data sources, such as medications and vital signs.
2. **Empirical evaluation**: Demonstrate that combining multiple patient data sources, including the patient's CXR examination and emergency department records, significantly improves diagnostic accuracy.
3. **Introduce new methods**: Develop methods for converting multimodal patient data into embeddings, covering numerical, categorical, free-text, time-series, and image data.
4. **Publish dataset splits**: Based on the MIMIC-CXR and MIMIC-IV-ED datasets, link patient examinations to their related emergency department records, and provide a code repository and pre-trained models so that others can run experiments.

### Main contributions of the paper

- **Explore the influence of patient data**: Focus on the influence of specific data sources, such as medications and vital signs, on CXR report generation.
- **Empirical evaluation**: Demonstrate that using multiple patient data sources significantly improves diagnostic accuracy.
- **New methods**: Introduce methods for converting multimodal patient data into embeddings.
- **Dataset release**: Provide dataset splits that link patient examinations to emergency department records, as well as code and pre-trained models.

Through these efforts, the paper shows how integrating more comprehensive patient data can improve the diagnostic accuracy of generated CXR reports, with the potential to improve patient care.
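
The idea of turning heterogeneous patient data into embeddings that prompt a language model can be illustrated with a small, dependency-free sketch. It is not the authors' implementation: the class `PatientDataEmbedder`, the function `build_prompt`, the field names, the embedding width, and the random stand-ins for learned parameters are all assumptions made for illustration. It covers only numeric and categorical fields, and the image embeddings are placeholders for the output of an image encoder.

```python
import random

EMBED_DIM = 8  # hypothetical width; real language models use far larger embeddings


class PatientDataEmbedder:
    """Toy embedder mapping numeric and categorical patient fields to vectors."""

    def __init__(self, categories, numeric_fields, dim=EMBED_DIM, seed=0):
        rng = random.Random(seed)

        def rand_vec():
            # Stand-in for learned parameters: small random values.
            return [rng.gauss(0.0, 0.02) for _ in range(dim)]

        self.dim = dim
        # One lookup vector per categorical value (e.g. a medication name).
        self.cat_table = {c: rand_vec() for c in categories}
        # One direction vector per numeric field; the field value scales it.
        self.num_weight = {f: rand_vec() for f in numeric_fields}
        # One vector per source type so the model can tell the sources apart.
        self.source = {"categorical": rand_vec(), "numeric": rand_vec()}

    def embed_categorical(self, value):
        table_vec = self.cat_table[value]
        return [t + s for t, s in zip(table_vec, self.source["categorical"])]

    def embed_numeric(self, field, value):
        weight = self.num_weight[field]
        return [value * w + s for w, s in zip(weight, self.source["numeric"])]


def build_prompt(embedder, record, image_embeddings):
    """Concatenate patient-data embeddings and image embeddings into one sequence."""
    prompt = [embedder.embed_numeric(f, v) for f, v in record["vitals"].items()]
    prompt += [embedder.embed_categorical(m) for m in record["medications"]]
    return prompt + image_embeddings


# Hypothetical record; field names and values are illustrative only.
record = {
    "vitals": {"heart_rate": 0.4, "temperature": -0.1},  # assumed pre-normalised
    "medications": ["aspirin", "furosemide"],
}
embedder = PatientDataEmbedder(
    categories=["aspirin", "furosemide", "metformin"],
    numeric_fields=["heart_rate", "temperature"],
)
# Placeholder for the output of an image encoder (three visual tokens).
image_embeddings = [[0.0] * EMBED_DIM for _ in range(3)]

prompt = build_prompt(embedder, record, image_embeddings)
print(len(prompt), len(prompt[0]))  # prints "7 8"
```

Every source ends up as vectors of the same width, so the language model can consume vital signs, medications, and image features as a single prompt sequence. In a trained system the random vectors would be learned parameters, and free-text and time-series fields would need their own embedding paths.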