Sebis at SemEval-2023 Task 7: A Joint System for Natural Language Inference and Evidence Retrieval from Clinical Trial Reports

Juraj Vladika, Florian Matthes
2023-05-03
Abstract: With the increasing number of clinical trial reports generated every day, it is becoming hard to keep up with novel discoveries that inform evidence-based healthcare recommendations. To help automate this process and assist medical experts, NLP solutions are being developed. This motivated the SemEval-2023 Task 7, where the goal was to develop an NLP system for two tasks: evidence retrieval and natural language inference from clinical trial data. In this paper, we describe our two developed systems. The first one is a pipeline system that models the two tasks separately, while the second one is a joint system that learns the two tasks simultaneously with a shared representation and a multi-task learning approach. The final system combines their outputs in an ensemble system. We formalize the models, present their characteristics and challenges, and provide an analysis of achieved results. Our system ranked 3rd out of 40 participants with a final submission.
Computation and Language
What problem does this paper attempt to address?
This paper addresses the problem of automating evidence retrieval and natural language inference over clinical trial reports. As the number of clinical trial reports generated every day grows, it is becoming increasingly difficult to keep up with the new findings that inform evidence-based medical decision-making. Researchers are therefore developing natural language processing (NLP) solutions to automate this process and assist medical experts. The goal of SemEval-2023 Task 7 was to develop an NLP system for two tasks: retrieving evidence from clinical trial data and performing natural language inference. The paper describes two systems developed by the authors. The first is a pipeline system that models the two tasks separately; the second is a joint system that learns the two tasks simultaneously through a shared representation and multi-task learning. The final system combines the outputs of these two systems in an ensemble. The authors formalize the models, present their characteristics and challenges, and analyze the results achieved. In the competition, their system ranked third among 40 participants.

**Summary of main problems**:

1. **Natural language inference**: Given a clinical trial document and a statement, develop a model to infer the logical relationship between them, that is, whether the statement is entailed or contradicted by the evidence in the document.
2. **Evidence retrieval**: Given all the sentences in a clinical trial document, develop a model to select those sentences that serve as evidence for judging the logical relationship between the statement and the document.

These two tasks are closely related because high-quality evidence selection is crucial for identifying textual entailment, and the final entailment/contradiction label in turn affects the selection of specific evidence.
By solving these problems, this system can help medical experts process a large number of clinical trial reports more efficiently and promote evidence-based medical decision-making.
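To make the relationship between the two tasks concrete, the following is a minimal Python sketch of the pipeline idea: select evidence sentences first, then classify the statement against them. It is not the authors' implementation. The names `Example`, `retrieve_evidence`, `pipeline`, and `ensemble_label` are hypothetical, the token-overlap retriever is a toy stand-in for a learned model, and averaging two probabilities is only one assumed way an ensemble could combine a pipeline system with a joint system.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Example:
    """One task instance: a statement plus the sentences of a trial report."""
    statement: str
    sentences: List[str]


def _tokens(text: str) -> set:
    # Naive whitespace tokenization; a real system would use a model tokenizer.
    return set(text.lower().split())


def retrieve_evidence(ex: Example, threshold: float = 0.2) -> List[int]:
    """Toy evidence retrieval (assumption): keep sentences that share at
    least `threshold` of the statement's tokens."""
    stmt = _tokens(ex.statement)
    selected = []
    for i, sent in enumerate(ex.sentences):
        overlap = len(stmt & _tokens(sent)) / max(len(stmt), 1)
        if overlap >= threshold:
            selected.append(i)
    return selected


def pipeline(
    ex: Example,
    classify: Callable[[str, List[str]], float],
) -> Tuple[List[int], str]:
    """Pipeline system: retrieval and inference are modeled separately.
    `classify` is any function returning P(entailment) for a statement
    given the selected evidence sentences."""
    idx = retrieve_evidence(ex)
    evidence = [ex.sentences[i] for i in idx]
    p_entail = classify(ex.statement, evidence)
    label = "Entailment" if p_entail >= 0.5 else "Contradiction"
    return idx, label


def ensemble_label(p_pipeline: float, p_joint: float) -> str:
    """Assumed ensembling rule: average the entailment probabilities
    produced by the pipeline system and the joint system."""
    avg = (p_pipeline + p_joint) / 2
    return "Entailment" if avg >= 0.5 else "Contradiction"


example = Example(
    statement="patients received aspirin daily",
    sentences=[
        "Patients received aspirin daily for 12 weeks.",
        "The primary outcome was overall survival.",
    ],
)
# A dummy classifier standing in for a trained NLI model.
evidence_idx, label = pipeline(example, lambda s, ev: 0.9 if ev else 0.1)
```

A joint system would instead share one encoder between the sentence-selection and label-prediction heads and train both objectives together, which this sketch does not attempt to show.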