CARE: Co-Attention Network for Joint Entity and Relation Extraction

Wenjun Kong, Yamei Xia
2024-03-27
Abstract: Joint entity and relation extraction is a fundamental task in information extraction, consisting of two subtasks: named entity recognition and relation extraction. However, most existing joint extraction methods suffer from feature confusion or inadequate interaction between the two subtasks. To address these challenges, we propose a Co-Attention network for joint entity and Relation Extraction (CARE). Our approach adopts a parallel encoding strategy to learn separate representations for each subtask, aiming to avoid feature overlap or confusion. At its core is a co-attention module that captures two-way interaction between the two subtasks, allowing the model to leverage entity information for relation prediction and vice versa, thus promoting mutual enhancement. Through extensive experiments on three benchmark datasets for joint entity and relation extraction (NYT, WebNLG, and SciERC), we demonstrate that our proposed model outperforms existing baseline models. Our code will be available at https://github.com/kwj0x7f/CARE.
Computation and Language
What problem does this paper attempt to address?
The paper addresses two problems in Joint Entity and Relation Extraction (JERE): feature confusion and insufficient interaction between the subtasks. Specifically:

1. **Feature Confusion**: Most existing joint extraction methods share one representation across the two subtasks, Named Entity Recognition (NER) and Relation Extraction (RE). Because the two tasks require different features, the shared representation can hurt performance.
2. **Insufficient Interaction Between Subtasks**: Existing methods fail to model the interaction between NER and RE effectively. Entity information often contains important indicators of relations, and relation information in turn offers valuable clues about entities. Without effective interaction, performance can degrade.

To overcome these issues, the paper proposes CARE, a joint entity and relation extraction model based on a co-attention mechanism. It adopts a parallel encoding strategy to learn independent representations for each subtask and introduces a co-attention module to capture the bidirectional interaction between them. Experimental results show that CARE outperforms existing baseline models on three benchmark datasets (NYT, WebNLG, and SciERC).
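To make the idea of parallel encoding followed by two-way interaction concrete, here is a minimal NumPy sketch. It is not the authors' architecture: the text above does not specify the shape of the representations, the attention form, or how the outputs are combined, so everything below is an assumption chosen for illustration. It uses one per-token matrix per subtask, single-head scaled dot-product attention, and a residual sum. The names `cross_attend`, `co_attention`, `W_ent`, and `W_rel` are hypothetical.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    shifted = x - x.max(axis=axis, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=axis, keepdims=True)


def cross_attend(queries, keys_values):
    """Scaled dot-product attention from one representation to another.

    queries:     (n_q, d) array
    keys_values: (n_k, d) array, used as both keys and values
    Returns the attended context (n_q, d) and the weights (n_q, n_k).
    """
    d = queries.shape[-1]
    scores = queries @ keys_values.T / np.sqrt(d)
    weights = softmax(scores, axis=-1)
    return weights @ keys_values, weights


def co_attention(entity_repr, relation_repr):
    """Two-way interaction: each representation attends to the other.

    Assumption: the attended context is added back residually. The source
    does not say how the two streams are fused.
    """
    ent_ctx, ent_w = cross_attend(entity_repr, relation_repr)
    rel_ctx, rel_w = cross_attend(relation_repr, entity_repr)
    return entity_repr + ent_ctx, relation_repr + rel_ctx, ent_w, rel_w


# Toy input: 6 tokens with 8-dimensional features (random, illustration only).
rng = np.random.default_rng(0)
tokens = rng.normal(size=(6, 8))

# Parallel encoding: two separate (hypothetical) projections, so the
# subtasks do not share a single feature space.
W_ent = rng.normal(size=(8, 8))
W_rel = rng.normal(size=(8, 8))
entity_repr = np.tanh(tokens @ W_ent)
relation_repr = np.tanh(tokens @ W_rel)

ent_out, rel_out, ent_w, rel_w = co_attention(entity_repr, relation_repr)
print(ent_out.shape, rel_out.shape)
```

Each row of `ent_w` shows how strongly one token's entity representation draws on the relation representations, and `rel_w` shows the reverse direction. Keeping the two projections separate mirrors the parallel-encoding idea, while the pair of cross-attention calls mirrors the bidirectional interaction described above.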