CARE: Co-Attention Network for Joint Entity and Relation Extraction

Wenjun Kong, Yamei Xia

2024-03-27

Abstract: Joint entity and relation extraction is the fundamental task of information extraction, consisting of two subtasks: named entity recognition and relation extraction. However, most existing joint extraction methods suffer from issues of feature confusion or inadequate interaction between the two subtasks. Addressing these challenges, in this work, we propose a Co-Attention network for joint entity and Relation Extraction (CARE). Our approach includes adopting a parallel encoding strategy to learn separate representations for each subtask, aiming to avoid feature overlap or confusion. At the core of our approach is the co-attention module that captures two-way interaction between the two subtasks, allowing the model to leverage entity information for relation prediction and vice versa, thus promoting mutual enhancement. Through extensive experiments on three benchmark datasets for joint entity and relation extraction (NYT, WebNLG, and SciERC), we demonstrate that our proposed model outperforms existing baseline models. Our code will be available at https://github.com/kwj0x7f/CARE.

Computation and Language

What problem does this paper attempt to address?

The paper addresses two problems in Joint Entity and Relation Extraction (JERE): feature confusion and insufficient interaction between subtasks. Specifically:

1. **Feature Confusion**: Most existing joint extraction methods use shared representations for the two subtasks, Named Entity Recognition (NER) and Relation Extraction (RE). Because the two tasks require different feature representations, sharing them confuses the features and can hurt the model's performance.
2. **Insufficient Interaction Between Subtasks**: Existing methods fail to effectively model the interaction between NER and RE. Entity information often contains important relation indicators, and relation information in turn carries valuable clues about entities. Without effective interaction between the two, performance can degrade.

To overcome these issues, the paper proposes CARE, a joint entity and relation extraction model based on a co-attention mechanism. It adopts a parallel encoding strategy to learn an independent representation for each subtask and introduces a co-attention module to capture the bidirectional interaction between the two subtasks. Experimental results show that CARE outperforms existing baseline models on three benchmark datasets (NYT, WebNLG, and SciERC).
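The two ideas above, parallel encoding and two-way co-attention, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the projection matrices `W_ent` and `W_rel`, the helper names `cross_attend` and `co_attention`, the residual connection, and all dimensions are assumptions chosen for illustration. The real model is built on a pretrained encoder and trained end to end.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def cross_attend(queries, context):
    """Scaled dot-product attention: each query row attends over all context rows."""
    d = queries.shape[-1]
    weights = softmax(queries @ context.T / np.sqrt(d), axis=-1)
    return weights @ context, weights

def co_attention(h_ent, h_rel):
    """Hypothetical co-attention step: each task's features attend to the other's."""
    ent_ctx, w_er = cross_attend(h_ent, h_rel)  # entity features read relation features
    rel_ctx, w_re = cross_attend(h_rel, h_ent)  # relation features read entity features
    # Residual connection (an assumption) keeps each task's own representation.
    return h_ent + ent_ctx, h_rel + rel_ctx, w_er, w_re

rng = np.random.default_rng(0)
n_tokens, d_in, d_model = 6, 16, 8

# Stand-in for shared token embeddings from an upstream encoder.
tokens = rng.normal(size=(n_tokens, d_in))

# Parallel encoding: a separate (here random, untrained) projection per subtask,
# so NER and RE do not share a single feature space.
W_ent = rng.normal(size=(d_in, d_model))
W_rel = rng.normal(size=(d_in, d_model))
h_ent = tokens @ W_ent
h_rel = tokens @ W_rel

ent_out, rel_out, w_er, w_re = co_attention(h_ent, h_rel)
```

In this sketch, `ent_out` would feed an entity classifier and `rel_out` a relation classifier. Each row of `w_er` and `w_re` is a probability distribution over tokens, which shows how much one task's representation draws on the other's.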

Unleashing the Power of Context: Contextual Association Network with Cross-Task Attention for Joint Relational Extraction

Rescue Implicit and Long-tail Cases: Nearest Neighbor Relation Extraction

Joint extraction of entities and relations via an entity correlated attention neural model

Integrated Extraction of Entities and Relations via Attentive Graph Convolutional Networks

RECA: Relation Extraction Based on Cross-Attention Neural Network

Attention As Relation: Learning Supervised Multi-head Self-Attention for Relation Extraction

Similarity-based Memory Enhanced Joint Entity and Relation Extraction

Entity relation joint extraction model combining pointer network and attention mechanism based on relative position embedding

Joint Model of Entity Recognition and Relation Extraction with Self-attention Mechanism

Construction and Application of Text Entity Relation Joint Extraction Model Based on Multi-Head Attention Neural Network

Joint Multimodal Entity-Relation Extraction Based on Edge-enhanced Graph Alignment Network and Word-pair Relation Tagging

Joint extraction of entities and relations based on character graph convolutional network and Multi-Head Self-Attention Mechanism

FSN: Joint Entity and Relation Extraction Based on Filter Separator Network

Causal-relationship representation enhanced joint extraction model for elements and relationships

Interactive Learning for Joint Event and Relation Extraction

Hierarchical Attention Cnn And Entity-Aware For Relation Extraction

Attention-Based Convolutional Neural Network for Semantic Relation Extraction

Modeling Dense Cross-Modal Interactions for Joint Entity-Relation Extraction

An Entity-Relation Joint Extraction Method Based on Two Independent Sub-Modules From Unstructured Text