Jointly Extracting Interventions, Outcomes, and Findings from RCT Reports with LLMs

Somin Wadhwa,Jay DeYoung,Benjamin Nye,Silvio Amir,Byron C. Wallace
2023-07-18
Abstract:Results from Randomized Controlled Trials (RCTs) establish the comparative effectiveness of interventions, and are in turn critical inputs for evidence-based care. However, results from RCTs are presented in (often unstructured) natural language articles describing the design, execution, and outcomes of trials; clinicians must manually extract findings pertaining to interventions and outcomes of interest from such articles. This onerous manual process has motivated work on (semi-)automating extraction of structured evidence from trial reports. In this work we propose and evaluate a text-to-text model built on instruction-tuned Large Language Models (LLMs) to jointly extract Interventions, Outcomes, and Comparators (ICO elements) from clinical abstracts, and infer the associated results reported. Manual (expert) and automated evaluations indicate that framing evidence extraction as a conditional generation task and fine-tuning LLMs for this purpose realizes considerable ($\sim$20 point absolute F1 score) gains over the previous SOTA. We perform ablations and error analyses to assess aspects that contribute to model performance, and to highlight potential directions for further improvements. We apply our model to a collection of published RCTs through mid-2022, and release a searchable database of structured findings: <a class="link-external link-http" href="http://ico-relations.ebm-nlp.com" rel="external noopener nofollow">this http URL</a>
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is the challenge of automatically extracting interventions, outcomes, and research findings from randomized controlled trial (RCT) reports. Specifically, the results of RCTs are crucial for evidence - based medical practice, but these results usually appear in the form of unstructured natural language in articles describing trial design, implementation, and results. Clinicians need to manually extract findings related to specific interventions and outcomes from these articles, which is a time - consuming and labor - intensive process. Therefore, this study aims to develop a text - to - text model capable of jointly extracting interventions, outcomes, and comparators (referred to as ICO elements) and inferring the relevant results reported by leveraging the capabilities of large - language models (LLMs). This method not only improves the extraction efficiency but also significantly enhances the extraction accuracy, with the F1 score increasing by approximately 20 percentage points compared to previous methods. The key contributions of the paper include: - **Method innovation**: Proposed an instruction - tuned large - language model to perform conditional generation tasks for jointly extracting ICO elements and related results from clinical abstracts. - **Performance improvement**: Achieved a significant performance improvement relative to the previous best method through fine - tuning of the model. - **Evaluation and analysis**: Conducted exhaustive manual and automated evaluations and explored factors affecting model performance and directions for future improvement through ablation experiments and error analysis. - **Application and resources**: Applied the model to the RCT collection published as of mid - 2022 and released a searchable database of structured findings for use by researchers and clinicians. In summary, this paper solves the problem of efficiently and accurately extracting key information from RCT reports, which is of great significance for promoting evidence - based medical practice.