Revisiting Relation Extraction in the era of Large Language Models

Somin Wadhwa,Silvio Amir,Byron C. Wallace

2024-07-16

Abstract:Relation extraction (RE) is the core NLP task of inferring semantic relationships between entities from text. Standard supervised RE techniques entail training modules to tag tokens comprising entity spans and then predict the relationship between them. Recent work has instead treated the problem as a \emph{sequence-to-sequence} task, linearizing relations between entities as target strings to be generated conditioned on the input. Here we push the limits of this approach, using larger language models (GPT-3 and Flan-T5 large) than considered in prior work and evaluating their performance on standard RE tasks under varying levels of supervision. We address issues inherent to evaluating generative approaches to RE by doing human evaluations, in lieu of relying on exact matching. Under this refined evaluation, we find that: (1) Few-shot prompting with GPT-3 achieves near SOTA performance, i.e., roughly equivalent to existing fully supervised models; (2) Flan-T5 is not as capable in the few-shot setting, but supervising and fine-tuning it with Chain-of-Thought (CoT) style explanations (generated via GPT-3) yields SOTA results. We release this model as a new baseline for RE tasks.

Computation and Language

What problem does this paper attempt to address?

The paper primarily aims to address several key issues in the task of Relation Extraction (RE): 1. **Improvement of Evaluation Methods**: The paper discusses the evaluation challenges encountered when using generative models for relation extraction and proposes assessing the consistency between model outputs and reference answers through manual annotation to overcome the inaccuracies caused by strict matching. 2. **Effectiveness of Few-Shot Learning**: The study examines the performance of large-scale language models (such as GPT-3) in relation extraction with a few examples (few-shot) and finds that their performance is close to or even surpasses existing fully supervised models. 3. **Optimization of Flan-T5**: Although Flan-T5 does not perform as well as GPT-3 in few-shot learning, its performance is significantly improved by introducing Chain-of-Thought (CoT) explanations to enhance the supervision signal, achieving the current State-of-the-Art (SOTA) level. Through experimental analysis on different datasets (ADE, CoNLL, NYT, and DocRED), the paper demonstrates the effectiveness and applicability of these methods. Specifically, training Flan-T5 with CoT explanations generated by GPT-3 not only improves the model's performance but also reduces the dependency on large-scale pre-trained models, making it more practical and cost-effective.

Revisiting Relation Extraction in the era of Large Language Models

Revisiting Relation Extraction in the era of Large Language Models

GPT-RE: In-context Learning for Relation Extraction using Large Language Models

Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

A Comprehensive Survey on Relation Extraction: Recent Advances and New Frontiers

Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction

A Survey Deep Learning Based Relation Extraction

Empowering Few-Shot Relation Extraction with The Integration of Traditional RE Methods and Large Language Models

A survey on cutting-edge relation extraction techniques based on language models

Revisiting Large Language Models as Zero-shot Relation Extractors

AutoRE: Document-Level Relation Extraction with Large Language Models

How to Unleash the Power of Large Language Models for Few-shot Relation Extraction?

Fine-tuning Pre-Trained Transformer Language Models to Distantly Supervised Relation Extraction

More Data, More Relations, More Context and More Openness: A Review and Outlook for Relation Extraction.

GenRES: Rethinking Evaluation for Generative Relation Extraction in the Era of Large Language Models

Recall, Retrieve and Reason: Towards Better In-Context Relation Extraction

Towards Realistic Low-resource Relation Extraction: A Benchmark with Empirical Baseline Study

Relation of the Relations: A New Paradigm of the Relation Extraction Problem

Cross-Lingual Relation Extraction with Transformers