THiFLY Research at SemEval-2023 Task 7: A Multi-granularity System for CTR-based Textual Entailment and Evidence Retrieval

Yuxuan Zhou, Ziyu Jin, Meiwei Li, Miao Li, Xien Liu, Xinxin You, Ji Wu

2023-06-02

Abstract: The NLI4CT task aims to determine whether hypotheses are entailed by Clinical Trial Reports (CTRs) and to retrieve the evidence that justifies each decision. The task is challenging because verifying a hypothesis requires integrating multiple pieces of evidence from one or two CTRs and applying several levels of reasoning, both textual and numerical. To address these problems, we present a multi-granularity system for CTR-based textual entailment and evidence retrieval. Specifically, we construct a Multi-granularity Inference Network (MGNet) that exploits sentence-level and token-level encoding to handle both the textual entailment and the evidence retrieval tasks. Moreover, we enhance the numerical inference capability of the system by leveraging SciFive, a T5-based model pre-trained on a medical corpus. Model ensembling and a joint inference method are further used to increase the stability and consistency of inference. The system achieves F1 scores of 0.856 on textual entailment and 0.853 on evidence retrieval, the best performance on both subtasks. The experimental results corroborate the effectiveness of our proposed method. Our code is publicly available at https://github.com/THUMLP/NLI4CT.

Computation and Language

What problem does this paper attempt to address?

This paper addresses textual entailment and evidence retrieval in natural language inference (NLI) over clinical trial reports (CTRs). Specifically, the NLI4CT task aims to build an interpretable multi-evidence NLI system based on CTRs. The task contains two subtasks:

1. **Textual Entailment (Task A)**: Determine whether the CTR supports or refutes a given hypothesis.
2. **Evidence Retrieval (Task B)**: Find the evidence in the CTR that supports the prediction made in Task A.

The main challenge is that verifying a hypothesis usually requires integrating multiple pieces of evidence, which may come from one or two CTRs. Verification also requires different levels of reasoning, including textual and numerical reasoning. To address these challenges, the paper proposes a multi-granularity system that handles both textual entailment and evidence retrieval through the Multi-granularity Inference Network (MGNet), and uses the pre-trained SciFive model to strengthen the system's numerical reasoning.
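To make the two subtasks concrete, the following is a minimal sketch of how one example and the two F1 scores might be represented. It is not the authors' code or the official scorer: the `Instance` class, its field names, the label strings, and the pooled (micro-averaged) evidence scoring are all assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Instance:
    """One hypothetical NLI4CT-style example (field names are illustrative)."""
    hypothesis: str
    ctr_sentences: list[str]  # candidate sentences drawn from one or two CTRs
    label: str                # Task A label, assumed "Entailment" or "Contradiction"
    evidence: set[int] = field(default_factory=set)  # Task B: indices of supporting sentences


def f1(tp: int, fp: int, fn: int) -> float:
    """F1 from raw counts; returns 0.0 when there are no true positives."""
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def entailment_f1(gold: list[str], pred: list[str], positive: str = "Entailment") -> float:
    """Task A: F1 of the positive class over hypothesis-level labels."""
    tp = sum(g == positive and p == positive for g, p in zip(gold, pred))
    fp = sum(g != positive and p == positive for g, p in zip(gold, pred))
    fn = sum(g == positive and p != positive for g, p in zip(gold, pred))
    return f1(tp, fp, fn)


def evidence_f1(gold: list[set[int]], pred: list[set[int]]) -> float:
    """Task B: F1 over selected sentence indices, pooled across all examples."""
    tp = sum(len(g & p) for g, p in zip(gold, pred))
    fp = sum(len(p - g) for g, p in zip(gold, pred))
    fn = sum(len(g - p) for g, p in zip(gold, pred))
    return f1(tp, fp, fn)
```

Task A is scored per hypothesis, while Task B is scored per candidate sentence, which is why the two reported F1 values are not directly comparable.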

NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports

SEME at SemEval-2024 Task 2: Comparing Masked and Generative Language Models on Natural Language Inference for Clinical Trials

Sebis at SemEval-2023 Task 7: A Joint System for Natural Language Inference and Evidence Retrieval from Clinical Trial Reports

R$^2$F: A General Retrieval, Reading and Fusion Framework for Document-level Natural Language Inference

A Multi-Modal Context Reasoning Approach for Conditional Inference on Joint Textual and Visual Clues

Using LLMs to label medical papers according to the CIViC evidence model

SemEval-2024 Task 2: Safe Biomedical Natural Language Inference for Clinical Trials

Predicting Clinical Trial Results by Implicit Evidence Integration

Natural Language Inference Using LSTM Model With Sentence Fusion

FZI-WIM at SemEval-2024 Task 2: Self-Consistent CoT for Complex NLI in Biomedical Domain

THUIR@COLIEE-2020: Leveraging Semantic Understanding and Exact Matching for Legal Case Retrieval and Entailment

Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024

Multilevel Image-Enhanced Sentence Representation Net for Natural Language Inference

FLIP: Fine-grained Alignment between ID-based Models and Pretrained Language Models for CTR Prediction

Exploring Lexical, Syntactic, and Semantic Features for Chinese Textual Entailment in NTCIR RITE Evaluation Tasks

Evidence Inference 2.0: More Data, Better Models

Research on judgment reasoning using natural language inference in Chinese medical texts

Divide & Conquer for Entailment-aware Multi-hop Evidence Retrieval

IITK at SemEval-2024 Task 2: Exploring the Capabilities of LLMs for Safe Biomedical Natural Language Inference for Clinical Trials

BERT-CNN based evidence retrieval and aggregation for Chinese legal multi-choice question answering