Evidence Inference 2.0: More Data, Better Models

Jay DeYoung,Eric Lehman,Ben Nye,Iain J. Marshall,Byron C. Wallace
DOI: https://doi.org/10.48550/arXiv.2005.04177
2020-05-14
Abstract:How do we most effectively treat a disease or condition? Ideally, we could consult a database of evidence gleaned from clinical trials to answer such questions. Unfortunately, no such database exists; clinical trial results are instead disseminated primarily via lengthy natural language articles. Perusing all such articles would be prohibitively time-consuming for healthcare practitioners; they instead tend to depend on manually compiled systematic reviews of medical literature to inform care. NLP may speed this process up, and eventually facilitate immediate consult of published evidence. The Evidence Inference dataset was recently released to facilitate research toward this end. This task entails inferring the comparative performance of two treatments, with respect to a given outcome, from a particular article (describing a clinical trial) and identifying supporting evidence. For instance: Does this article report that chemotherapy performed better than surgery for five-year survival rates of operable cancers? In this paper, we collect additional annotations to expand the Evidence Inference dataset by 25\%, provide stronger baseline models, systematically inspect the errors that these make, and probe dataset quality. We also release an abstract only (as opposed to full-texts) version of the task for rapid model prototyping. The updated corpus, documentation, and code for new baselines and evaluations are available at <a class="link-external link-http" href="http://evidence-inference.ebm-nlp.com/" rel="external noopener nofollow">this http URL</a>.
Computation and Language
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to extract evidence from clinical trial reports more effectively to support medical decision - making. Specifically, the paper focuses on how to use natural language processing (NLP) techniques to automatically identify the relationships among specific treatment interventions, comparators and outcomes from the full text or abstract of clinical trials, and to judge the performance of these treatment interventions relative to the comparators in terms of given outcomes (such as significant increase, significant decrease or no significant difference). This helps to accelerate the process of systematic review, enabling healthcare practitioners to obtain the latest clinical evidence more quickly and thus make evidence - based medical decisions. The paper mentions that currently, the results of clinical trials are mainly published in the form of long natural - language articles, and it is very time - consuming for healthcare practitioners to manually consult all relevant articles. Therefore, the development of techniques that can automatically extract key information from these articles is of great significance for improving the quality and efficiency of medical decision - making.