TLDR at SemEval-2024 Task 2: T5-generated clinical-Language summaries for DeBERTa Report Analysis

Spandan Das,Vinay Samuel,Shahriar Noroozizadeh
2024-04-14
Abstract:This paper introduces novel methodologies for the Natural Language Inference for Clinical Trials (NLI4CT) task. We present TLDR (T5-generated clinical-Language summaries for DeBERTa Report Analysis) which incorporates T5-model generated premise summaries for improved entailment and contradiction analysis in clinical NLI tasks. This approach overcomes the challenges posed by small context windows and lengthy premises, leading to a substantial improvement in Macro F1 scores: a 0.184 increase over truncated premises. Our comprehensive experimental evaluation, including detailed error analysis and ablations, confirms the superiority of TLDR in achieving consistency and faithfulness in predictions against semantically altered inputs.
Computation and Language,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?
The problem that this paper attempts to solve is how to effectively handle long premises in the Natural Language Inference (NLI) tasks in Clinical Trial Reports (CTRs). Specifically, the paper focuses on how to shorten the long texts in clinical trial reports without losing key information, so as to better input them together with statements into the natural language inference model for analysis. Traditional methods such as truncating premises or using extractive summarization methods may lead to the loss of important information when dealing with long texts, thus affecting the accuracy of inference. Therefore, the paper proposes a new framework, TLDR (T5 - generated clinical - Language summaries for DeBERTa Report Analysis), which overcomes these challenges by using the premise summaries generated by the T5 model, thereby improving the performance of NLI tasks. The main contributions of the paper are as follows: - **Introducing T5 - generated clinical language summaries**: Use the T5 model to generate summaries of long premises to adapt to the input length limitations of the DeBERTa model. - **Improving the performance of NLI tasks**: Through this method, the paper has achieved a significant improvement in the macro - F1 score, which is 0.184 higher than the method using truncated premises and 0.046 higher than the premises using extractive summaries. - **Enhancing the consistency and fidelity of the model**: Through detailed error analysis and ablation experiments, it is proved that the TLDR framework can maintain consistency and fidelity when facing semantically - changed inputs, thereby improving the robustness and effectiveness of the model. In conclusion, the paper aims to solve the long - text processing problems faced by natural language inference tasks in clinical trial reports through innovative summarization generation techniques, thereby improving the performance of the model in this specific field.