Genre: generative multi-turn question answering with contrastive learning for entity–relation extraction

Lulu Wang,Kai Yu,Aishan Wumaier,Peng Zhang,Tuergen Yibulayin,Xi Wu,Jibing Gong,Maihemuti Maimaiti
DOI: https://doi.org/10.1007/s40747-023-01321-y
IF: 6.7
2024-02-08
Complex & Intelligent Systems
Abstract:Abstract Extractive approaches have been the mainstream paradigm for identifying overlapping entity–relation extraction. However, limited by their inherently methodological flaws, which hardly deal with three issues: hierarchical dependent entity–relations, implicit entity–relations, and entity normalization. Recent advances have proposed an effective solution based on generative language models, which cast entity–relation extraction as a sequence-to-sequence text generation task. Inspired by the observation that humans learn by getting to the bottom of things, we propose a novel framework, namely GenRE, Generative multi-turn question answering with contrastive learning for entity–relation extraction. Specifically, a template-based question prompt generation first is designed to answer in different turns. We then formulate entity–relation extraction as a generative question answering task based on the general language model instead of span-based machine reading comprehension. Meanwhile, the contrastive learning strategy in fine-tuning is introduced to add negative samples to mitigate the exposure bias inherent in generative models. Our extensive experiments demonstrate that GenRE performs competitively on two public datasets and a custom dataset, highlighting its superiority in entity normalization and implicit entity–relation extraction. (The code is available at https://github.com/lovelyllwang/GenRE ).
computer science, artificial intelligence
What problem does this paper attempt to address?
The paper aims to address several key issues in the task of Relation Extraction (RE), which have not been adequately handled by existing methods. Specifically, the paper proposes new solutions for the following three main challenges: 1. **Hierarchical Dependency of Entity Relations**: In real-world scenarios, entity relations between different labels may have hierarchical dependencies, which traditional relation extraction methods may fail to fully capture in terms of complexity and richness. For example, there is a hierarchical relationship between the entity "Harvard University" and the dependent entity "PhD degree," but existing models can only extract partial information. 2. **Implicit Entity Relations**: The context may contain implicit entity relations, and most methods can only extract entities that appear in the text, but they struggle to identify relations that can only be inferred through logical reasoning. For instance, from the sentence "Robert M. Metcalfe graduated from MIT in 1969," it can be inferred that he started his studies at MIT around 1964. 3. **Entity Normalization**: Entities may have different expressions and need to be normalized. For example, "MIT" and "Massachusetts Institute of Technology" refer to the same entity, but without a reference set, entity extractors find it difficult to unify them. To address these challenges, the authors propose a new framework called GenRE, which transforms the relation extraction problem into a task based on generative multi-turn question answering, combined with a contrastive learning strategy to improve the accuracy of the answers. Specifically, GenRE uses a template-driven question generation mechanism to adapt to different domains and generates entity relations step by step through multi-turn question answering. Additionally, to enhance the credibility of the generated answers, the framework introduces contrastive learning to distinguish between positive and negative samples. In summary, the main contributions of the paper include: - Proposing a novel framework that transforms the relation extraction problem into a generative multi-turn question answering problem, considering the rich semantic information between entity relations. - Significantly improving the model's ability to distinguish between positive and negative answers by integrating contrastive learning. - Conducting extensive experiments on three datasets (including different versions of two public datasets and a custom dataset) to validate the effectiveness and flexibility of the method, especially in terms of entity normalization and reasoning.