WangLab at MEDIQA-CORR 2024: Optimized LLM-based Programs for Medical Error Detection and Correction

Augustin Toma, Ronald Xie, Steven Palayew, Patrick R. Lawler, Bo Wang
2024-04-23
Abstract: Medical errors in clinical text pose significant risks to patient safety. The MEDIQA-CORR 2024 shared task focuses on detecting and correcting these errors across three subtasks: identifying the presence of an error, extracting the erroneous sentence, and generating a corrected sentence. In this paper, we present our approach that achieved top performance in all three subtasks. For the MS dataset, which contains subtle errors, we developed a retrieval-based system leveraging external medical question-answering datasets. For the UW dataset, reflecting more realistic clinical notes, we created a pipeline of modules to detect, localize, and correct errors. Both approaches utilized the DSPy framework for optimizing prompts and few-shot examples in large language model (LLM) based programs. Our results demonstrate the effectiveness of LLM-based programs for medical error correction. However, our approach has limitations in addressing the full diversity of potential errors in medical documentation. We discuss the implications of our work and highlight future research directions to advance the robustness and applicability of medical error detection and correction systems.
Computation and Language
What problem does this paper attempt to address?
The paper addresses the detection and correction of medical errors in clinical text. Specifically, it focuses on the MEDIQA-CORR 2024 shared task, which comprises three subtasks:

1. **Error Flag Prediction**: Determine whether the clinical text contains an error.
2. **Error Sentence Detection**: Identify the specific sentence that contains the error.
3. **Sentence Correction**: Generate a corrected version of the erroneous sentence.

These tasks aim to improve the accuracy and consistency of medical documentation, thereby supporting patient safety and healthcare quality. Medical errors in clinical text can pose serious risks, including increased patient morbidity, mortality, and healthcare costs. Automated detection and correction of these errors therefore matters for keeping medical records accurate and for supporting effective communication among healthcare providers.
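
The three subtasks map onto a detect, localize, correct pipeline, and the following is a minimal Python sketch of that shape only. It is not the authors' implementation: the paper builds its modules as LLM programs optimized with DSPy, whereas the `toy_*` functions and `TOY_RULES` table here are hypothetical rule-based stand-ins so the example runs without a model. The `CorrectionResult` fields, and the use of `-1` and `None` for "no error", are conventions assumed for this sketch.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class CorrectionResult:
    error_flag: int                    # subtask 1: 1 if an error is present, else 0
    error_sentence_id: int             # subtask 2: index of the erroneous sentence, -1 if none
    corrected_sentence: Optional[str]  # subtask 3: corrected sentence, None if no error


# Each stage is a plain callable, so an LLM-backed module could be swapped in.
Detector = Callable[[List[str]], bool]
Localizer = Callable[[List[str]], int]
Corrector = Callable[[List[str], int], str]


def run_pipeline(sentences: List[str], detect: Detector,
                 localize: Localizer, correct: Corrector) -> CorrectionResult:
    """Run detection first; localize and correct only if an error is flagged."""
    if not detect(sentences):
        return CorrectionResult(0, -1, None)
    idx = localize(sentences)
    return CorrectionResult(1, idx, correct(sentences, idx))


# Hypothetical toy rules standing in for model predictions (illustration only).
TOY_RULES = {"47 C": "37 C"}


def toy_detect(sentences: List[str]) -> bool:
    return any(bad in s for s in sentences for bad in TOY_RULES)


def toy_localize(sentences: List[str]) -> int:
    for i, s in enumerate(sentences):
        if any(bad in s for bad in TOY_RULES):
            return i
    return -1


def toy_correct(sentences: List[str], idx: int) -> str:
    fixed = sentences[idx]
    for bad, good in TOY_RULES.items():
        fixed = fixed.replace(bad, good)
    return fixed


note = ["Patient reports no fever.", "Temperature is 47 C.", "Plan: discharge."]
result = run_pipeline(note, toy_detect, toy_localize, toy_correct)
clean = run_pipeline(["Plan: discharge."], toy_detect, toy_localize, toy_correct)
```

Keeping the three stages as separate callables mirrors the task's own decomposition: each stage can be evaluated against its own subtask metric and replaced independently.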