Abstract:We introduce ColaCare, a framework that enhances Electronic Health Record (EHR) modeling through multi-agent collaboration driven by Large Language Models (LLMs). Our approach seamlessly integrates domain-specific expert models with LLMs to bridge the gap between structured EHR data and text-based reasoning. Inspired by clinical consultations, ColaCare employs two types of agents: DoctorAgent and MetaAgent, which collaboratively analyze patient data. Expert models process and generate predictions from numerical EHR data, while LLM agents produce reasoning references and decision-making reports within the collaborative consultation framework. We additionally incorporate the Merck Manual of Diagnosis and Therapy (MSD) medical guideline within a retrieval-augmented generation (RAG) module for authoritative evidence support. Extensive experiments conducted on four distinct EHR datasets demonstrate ColaCare's superior performance in mortality prediction tasks, underscoring its potential to revolutionize clinical decision support systems and advance personalized precision medicine. The code, complete prompt templates, more case studies, etc. are publicly available at the anonymous link: <a class="link-external link-https" href="https://colacare.netlify.app" rel="external noopener nofollow">this https URL</a>.

What problem does this paper attempt to address?

### Problems the paper attempts to solve This paper aims to solve several key problems in electronic health record (EHR) modeling: 1. **Limitations of data - driven methods**: - Existing EHR modeling methods are mainly pure data - driven end - to - end methods. These methods are independent of external knowledge and cannot understand the clinical significance of record features, only regarding them as variables without semantic context. - These "black - box" methods have limitations in input data distribution sensitivity and over - fitting, especially when the number and diversity of training samples are limited, which is a common problem in real - world clinical practice. 2. **Insufficient interpretability of models**: - Existing methods have limited interpretability and usually rely on traditional feature importance analysis techniques, such as Attention mechanism, SHAP (SHapley Additive exPlanations) and activation level visualization. These techniques can only provide basic interpretability, which is not sufficient for meaningful communication with doctors. 3. **Challenges of knowledge embedding**: - Although some works attempt to embed knowledge through ICD codes and knowledge graphs, these methods face challenges in practical applications because they rely on manually constructed knowledge forms and slow knowledge updates, which are often inconsistent with the latest medical research, clinical reports or updated guidelines, and these factors are crucial for clinical prediction tasks. 4. **Limitations of large language models (LLM) in structured EHR data analysis**: - Although LLM performs well in handling natural language tasks and medical Q&A, its ability in structured EHR data analysis and prediction is limited. In particular, its reasoning ability in few - sample settings still has a significant gap compared with traditional methods. ### Solutions To solve the above problems, the paper proposes the ColaCare framework, which enhances EHR modeling through multi - agent collaboration and large - language - model - (LLM) - driven methods. Specifically, the main contributions of the ColaCare framework include: 1. **Combining external knowledge**: - External knowledge is introduced through the Retrieval - Augmented Generation (RAG) module, enabling the model to be not only EHR data - driven but also able to enrich external knowledge and have self - review capabilities. 2. **Multi - perspective clinical decision - making evidence**: - ColaCare can output multi - perspective clinical decision - making evidence from multiple doctor agents, enhancing model transparency and providing human - understandable decision - making bases, which is helpful for doctors' diagnostic thinking. 3. **Experimental verification**: - Extensive experimental results show that ColaCare performs excellently in the clinical mortality prediction tasks on four different EHR datasets. Case studies highlight the rationality and interpretability of its generated reports, providing a potentially revolutionary solution for the development of clinical decision - support systems and personalized precision medicine. Through these innovations, the ColaCare framework aims to construct a human - interpretable EHR modeling method that can provide individualized prediction reasons and specific patient evidence clues, and has the ability to identify and reflect on potential fatal errors in the prediction results and evidence - finding process.

ColaCare: Enhancing Electronic Health Record Modeling through Large Language Model-Driven Multi-Agent Collaboration

Critical Care Studies Using Large Language Models Based on Electronic Healthcare Records: A Technical Note

Improving Clinical Expertise in Large Language Models Using Electronic Medical Records

LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing

Large language models enabled multiagent ensemble method for efficient EHR data labeling

REALM: RAG-Driven Enhancement of Multimodal Electronic Health Records Analysis via Large Language Models

EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records

MedAgents: Large Language Models as Collaborators for Zero-shot Medical Reasoning

A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making

AI Hospital: Benchmarking Large Language Models in a Multi-agent Medical Interaction Simulator

MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making

MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation

Natural Language Programming in Medicine: Administering Evidence Based Clinical Workflows with Autonomous Agents Powered by Generative Large Language Models

IntelliCare: Improving Healthcare Analysis with Variance-Controlled Patient-Level Knowledge from Large Language Models

AI Hospital: Interactive Evaluation and Collaboration of LLMs As Intern Doctors for Clinical Diagnosis

AgentMD: Empowering Language Agents for Risk Prediction with Large-Scale Clinical Tool Learning

Large Language Models Illuminate a Progressive Pathway to Artificial Healthcare Assistant: A Review

Enhancing Early Detection of Cognitive Decline in the Elderly: A Comparative Study Utilizing Large Language Models in Clinical Notes

EHR Interaction Between Patients and AI: NoteAid EHR Interaction

Unlocking the Potential of Free Text in Electronic Health Records with Large Language Models (LLM): Enhancing Patient Safety and Consultation Interactions

A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics