Abstract:Developing technology to assist medical experts in their everyday decision-making is currently a hot topic in the field of Artificial Intelligence (AI). This is specially true within the framework of Evidence-Based Medicine (EBM), where the aim is to facilitate the extraction of relevant information using natural language as a tool for mediating in human-AI interaction. In this context, AI techniques can be beneficial in finding arguments for past decisions in evolution notes or patient journeys, especially when different doctors are involved in a patient's care. In those documents the decision-making process towards treating the patient is reported. Thus, applying Natural Language Processing (NLP) techniques has the potential to assist doctors in extracting arguments for a more comprehensive understanding of the decisions made. This work focuses on the explanatory argument identification step by setting up the task in a Question Answering (QA) scenario in which clinicians ask questions to the AI model to assist them in identifying those arguments. In order to explore the capabilities of current AI-based language models, we present a new dataset which, unlike previous work: (i) includes not only explanatory arguments for the correct hypothesis, but also arguments to reason on the incorrectness of other hypotheses; (ii) the explanations are written originally in Spanish by doctors to reason over cases from the Spanish Residency Medical Exams. Furthermore, this new benchmark allows us to set up a novel extractive task by identifying the explanation written by medical doctors that supports the correct answer within an argumentative text. An additional benefit of our approach lies in its ability to evaluate the extractive performance of language models using automatic metrics, which in the Antidote CasiMedicos dataset corresponds to a 74.47 F1 score. Comprehensive experimentation shows that our novel dataset and approach is an effective technique to help practitioners in identifying relevant evidence-based explanations for medical questions.

[Extracorporeal shock wave lithotripsy of a deep calculus of the ureter].

Development of an Extractive Clinical Question Answering Dataset with Multi-Answer and Multi-Focus Questions

A Question-Answering System over Traditional Chinese Medicine

A Joint Model For Question-Answering Over Traditional Chinese Medicine

Give me Some Hard Questions: Synthetic Data Generation for Clinical QA

RealMedQA: A pilot biomedical question answering dataset containing realistic clinical questions

XAIQA: Explainer-Based Data Augmentation for Extractive Question Answering

Annotating Electronic Medical Records for Question Answering

Explanatory argument extraction of correct answers in resident medical exams

Question Answering for Electronic Health Records: A Scoping Review of datasets and models

EHRXQA: A Multi-Modal Question Answering Dataset for Electronic Health Records with Chest X-ray Images

Experimental Design of Extractive Question-Answering Systems: Influence of Error Scores and Answer Length

A Question Answering Based Pipeline for Comprehensive Chinese EHR Information Extraction

What Disease Does This Patient Have? A Large-Scale Open Domain Question Answering Dataset from Medical Exams

K-QA: A Real-World Medical Q&A Benchmark

Medical Data Inquiry Using a Question Answering Model.

ECG-QA: A Comprehensive Question Answering Dataset Combined With Electrocardiogram

Question-Answering System Extracts Information on Injection Drug Use from Clinical Notes

AskHERMES: An online question answering system for complex clinical questions

PubMedQA: A Dataset for Biomedical Research Question Answering