PromptMind Team at MEDIQA-CORR 2024: Improving Clinical Text Correction with Error Categorization and LLM Ensembles

Satya Kesav Gundabathula,Sriram R Kolar
2024-05-14
Abstract:This paper describes our approach to the MEDIQA-CORR shared task, which involves error detection and correction in clinical notes curated by medical professionals. This task involves handling three subtasks: detecting the presence of errors, identifying the specific sentence containing the error, and correcting it. Through our work, we aim to assess the capabilities of Large Language Models (LLMs) trained on a vast corpora of internet data that contain both factual and unreliable information. We propose to comprehensively address all subtasks together, and suggest employing a unique prompt-based in-context learning strategy. We will evaluate its efficacy in this specialized task demanding a combination of general reasoning and medical knowledge. In medical systems where prediction errors can have grave consequences, we propose leveraging self-consistency and ensemble methods to enhance error correction and error detection performance.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The paper attempts to address the problem of detecting and correcting errors in clinical notes. Specifically, the paper focuses on the following three subtasks: 1. **Error Detection**: Determine whether there are errors in the clinical notes. 2. **Error Sentence Identification**: If errors exist, identify the specific sentences that contain errors. 3. **Error Correction**: Correct the identified erroneous sentences. The goal of the paper is to evaluate the performance of large language models (LLMs) on these tasks and propose a prompt-based contextual learning strategy to comprehensively handle all subtasks. Additionally, the paper explores how to improve the performance of error detection and correction through self-consistency and ensemble methods.