IryoNLP at MEDIQA-CORR 2024: Tackling the Medical Error Detection & Correction Task On the Shoulders of Medical Agents

Jean-Philippe Corbeil

2024-04-24

Abstract: In natural language processing applied to the clinical domain, utilizing large language models has emerged as a promising avenue for error detection and correction on clinical notes, a knowledge-intensive task for which annotated data is scarce. This paper presents MedReAct'N'MedReFlex, which leverages a suite of four LLM-based medical agents. The MedReAct agent initiates the process by observing, analyzing, and taking action, generating trajectories to guide the search to target a potential error in the clinical notes. Subsequently, the MedEval agent employs five evaluators to assess the targeted error and the proposed correction. In cases where MedReAct's actions prove insufficient, the MedReFlex agent intervenes, engaging in reflective analysis and proposing alternative strategies. Finally, the MedFinalParser agent formats the final output, preserving the original style while ensuring the integrity of the error correction process. One core component of our method is our RAG pipeline based on our ClinicalCorp corpora. Among other well-known sources containing clinical guidelines and information, we preprocess and release the open-source MedWiki dataset for clinical RAG applications. Our results demonstrate the central role of our RAG approach with ClinicalCorp leveraged through the MedReAct'N'MedReFlex framework. It achieved the ninth rank on the MEDIQA-CORR 2024 final leaderboard.
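
The control flow among the four agents described in the abstract can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: every name here (`run_pipeline`, `Proposal`, the toy agents) is hypothetical, the agents are plain Python stubs instead of LLM calls, and the choice to average the five evaluator scores against a threshold of 0.8 is an assumption, since the abstract does not say how the scores are combined.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Proposal:
    """A targeted error sentence and its proposed correction (hypothetical type)."""
    error_sentence: str
    correction: str


def run_pipeline(
    note: str,
    react: Callable[[str, Optional[str]], Optional[Proposal]],
    evaluators: List[Callable[[str, Proposal], float]],
    reflect: Callable[[str, Proposal, List[float]], str],
    final_parser: Callable[[str, Optional[Proposal]], dict],
    threshold: float = 0.8,  # assumed value, not taken from the paper
    max_rounds: int = 3,     # assumed value, not taken from the paper
) -> dict:
    hint: Optional[str] = None
    for _ in range(max_rounds):
        # MedReAct-like step: search the note and propose an error + correction.
        proposal = react(note, hint)
        if proposal is None:
            break  # the agent found nothing to correct
        # MedEval-like step: each evaluator scores the proposal in [0, 1].
        scores = [evaluate(note, proposal) for evaluate in evaluators]
        if sum(scores) / len(scores) >= threshold:  # assumed aggregation: mean
            return final_parser(note, proposal)
        # MedReFlex-like step: reflect on the failure and suggest a new strategy.
        hint = reflect(note, proposal, scores)
    # No proposal passed evaluation: report the note as error-free.
    return final_parser(note, None)


# Toy stand-ins for the LLM-based agents.
def toy_react(note: str, hint: Optional[str]) -> Optional[Proposal]:
    if hint is None:
        return Proposal("Patient is afebrile.", "Patient is febrile.")
    return Proposal(
        "Start amoxicillin for viral infection.",
        "Start supportive care for viral infection.",
    )


def toy_evaluator(note: str, proposal: Proposal) -> float:
    return 1.0 if "amoxicillin" in proposal.error_sentence else 0.0


def toy_reflect(note: str, proposal: Proposal, scores: List[float]) -> str:
    return "Re-check the treatment sentence."


def toy_final_parser(note: str, proposal: Optional[Proposal]) -> dict:
    if proposal is None:
        return {"error_flag": 0, "error_sentence": None, "corrected_sentence": None}
    return {
        "error_flag": 1,
        "error_sentence": proposal.error_sentence,
        "corrected_sentence": proposal.correction,
    }


note = "Patient is afebrile. Start amoxicillin for viral infection."
result = run_pipeline(note, toy_react, [toy_evaluator] * 5, toy_reflect, toy_final_parser)
print(result)
```

In this toy run the first proposal is rejected by all five evaluators, the reflection step supplies a hint, and the second proposal is accepted and formatted.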

Computation and Language, Artificial Intelligence, Multiagent Systems

What problem does this paper attempt to address?

The paper addresses medical error detection and correction in clinical notes. Specifically, the authors propose a multi-agent framework named MedReAct'N'MedReFlex for the MEDIQA-CORR 2024 competition. The framework comprises four specialized medical agents: MedReAct, MedReFlex, MedEval, and MedFinalParser. These agents work together, supported by a Retrieval-Augmented Generation (RAG) pipeline, to identify and correct potential errors in clinical notes. The main contributions are:

1. Designing MedReAct'N'MedReFlex, a multi-agent framework based on four medical agents, to tackle medical error detection and correction in the MEDIQA-CORR 2024 competition.
2. Releasing the open-source MedWiki dataset, which contains approximately 1.3 million article fragments focused on the medical field.
3. Providing a method to build ClinicalCorp, a large corpus for RAG applications that includes over 2.3 million document fragments.
4. Releasing a RAG version of the pre-training guidelines dataset, containing over 710,000 fragments from eight open-source datasets.
5. Publishing the codebase on GitHub.

Through a series of experiments, the authors found that the best-performing configuration retrieves the top 50 documents, re-ranks the top 20 documents, and applies an evaluation threshold. These settings significantly improved the system's performance. Ultimately, the framework achieved 9th place in the MEDIQA-CORR 2024 competition.
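
The retrieve-then-re-rank setting mentioned above can be illustrated with a small, dependency-free sketch. It rests on several assumptions: "top 50" is read as the number of first-stage candidates and "top 20" as the number kept after re-ranking, and both scoring functions (`token_overlap` and `jaccard`) are toy stand-ins for whatever retriever and re-ranker the authors actually used. All function and parameter names are hypothetical.

```python
from typing import Callable, List, Sequence


def token_overlap(query: str, doc: str) -> float:
    """Toy first-stage score: number of distinct query tokens found in the document."""
    return float(len(set(query.lower().split()) & set(doc.lower().split())))


def jaccard(query: str, doc: str) -> float:
    """Toy re-ranking score: Jaccard similarity between the two token sets."""
    q, d = set(query.lower().split()), set(doc.lower().split())
    union = q | d
    return len(q & d) / len(union) if union else 0.0


def retrieve_then_rerank(
    query: str,
    corpus: Sequence[str],
    first_stage: Callable[[str, str], float] = token_overlap,
    reranker: Callable[[str, str], float] = jaccard,
    k_retrieve: int = 50,  # candidates kept after the first stage
    k_rerank: int = 20,    # documents kept after re-ranking
) -> List[str]:
    # Stage 1: cheap scoring over the whole corpus, keep the best k_retrieve.
    candidates = sorted(
        corpus, key=lambda doc: first_stage(query, doc), reverse=True
    )[:k_retrieve]
    # Stage 2: finer scoring over the candidates only, keep the best k_rerank.
    reranked = sorted(
        candidates, key=lambda doc: reranker(query, doc), reverse=True
    )
    return reranked[:k_rerank]


corpus = [f"filler passage number {i}" for i in range(98)] + [
    "amoxicillin treats bacterial infection",
    "amoxicillin treats bacterial infection and many other unrelated long things",
]
top_docs = retrieve_then_rerank("amoxicillin bacterial infection", corpus)
print(len(top_docs), top_docs[0])
```

The two-stage design keeps the expensive scorer off most of the corpus: only the 50 first-stage candidates are ever passed to the re-ranker.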
