Leveraging Knowledge Graph Embeddings to Enhance Contextual Representations for Relation Extraction

Fréjus A. A. Laleye,Loïc Rakotoson,Sylvain Massip
DOI: https://doi.org/10.1007/978-3-031-41501-2_2
2023-06-07
Abstract:Relation extraction task is a crucial and challenging aspect of Natural Language Processing. Several methods have surfaced as of late, exhibiting notable performance in addressing the task; however, most of these approaches rely on vast amounts of data from large-scale knowledge graphs or language models pretrained on voluminous corpora. In this paper, we hone in on the effective utilization of solely the knowledge supplied by a corpus to create a high-performing model. Our objective is to showcase that by leveraging the hierarchical structure and relational distribution of entities within a corpus without introducing external knowledge, a relation extraction model can achieve significantly enhanced performance. We therefore proposed a relation extraction approach based on the incorporation of pretrained knowledge graph embeddings at the corpus scale into the sentence-level contextual representation. We conducted a series of experiments which revealed promising and very interesting results for our proposed approach.The obtained results demonstrated an outperformance of our method compared to context-based relation extraction models.
Computation and Language,Machine Learning
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the task of relation extraction in natural language processing (NLP), particularly improving model performance in scenarios with limited data in specialized domains. Specifically, the goal of the paper is to demonstrate that significant improvements in relation extraction models can be achieved by leveraging the knowledge provided solely by the training corpus itself, without introducing external knowledge graphs or large-scale pre-trained language models. To achieve this goal, the researchers propose a novel method that integrates pre-trained knowledge graph embeddings into sentence-level contextual representations. The key aspect of this method is the construction of a local knowledge graph based on the training data, from which embeddings of relationships between entities are generated. These embeddings are then used to enrich the sentence-level contextual representations, enhancing the model's ability to understand and predict relationships between entities. By conducting experiments on several standard datasets in the biomedical domain, the researchers validated that the proposed model shows significant improvements compared to relation extraction models that rely solely on contextual information. Notably, it also demonstrated better generalization capabilities in predicting relationships that were not present during the training phase. The advantage of this method lies in its independence from large-scale external knowledge graphs, making it highly suitable for specialized domains with limited external resources.