Two Heads Are Better Than One: Integrating Knowledge from Knowledge Graphs and Large Language Models for Entity Alignment

Linyao Yang, Hongyang Chen, Xiao Wang, Jing Yang, Fei-Yue Wang, Han Liu
2024-01-30
Abstract: Entity alignment, which is a prerequisite for creating a more comprehensive Knowledge Graph (KG), involves pinpointing equivalent entities across disparate KGs. Contemporary methods for entity alignment have predominantly utilized knowledge embedding models to procure entity embeddings that encapsulate various similarities: structural, relational, and attributive. These embeddings are then integrated through attention-based information fusion mechanisms. Despite this progress, effectively harnessing multifaceted information remains challenging due to inherent heterogeneity. Moreover, while Large Language Models (LLMs) have exhibited exceptional performance across diverse downstream tasks by implicitly capturing entity semantics, this implicit knowledge has yet to be exploited for entity alignment. In this study, we propose a Large Language Model-enhanced Entity Alignment framework (LLMEA), integrating structural knowledge from KGs with semantic knowledge from LLMs to enhance entity alignment. Specifically, LLMEA identifies candidate alignments for a given entity by considering both embedding similarities between entities across KGs and edit distances to a virtual equivalent entity. It then engages an LLM iteratively, posing multiple multi-choice questions to draw upon the LLM's inference capability. The final prediction of the equivalent entity is derived from the LLM's output. Experiments conducted on three public datasets reveal that LLMEA surpasses leading baseline models. Additional ablation studies underscore the efficacy of our proposed framework.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses entity alignment: identifying entities in different knowledge graphs (KGs) that represent the same concept, a key step in building a more comprehensive KG. Current methods mainly rely on knowledge embedding models to obtain entity embeddings that capture several kinds of similarity, and then fuse this information through attention mechanisms. Despite this progress, making effective use of such multifaceted information remains challenging because the information is inherently heterogeneous. In addition, although large language models (LLMs) perform well on a wide range of downstream tasks, the entity semantics they capture implicitly have not yet been used for entity alignment.

To address this, the paper proposes LLMEA (Large Language Model-enhanced Entity Alignment), a framework that combines the structural knowledge of KGs with the semantic knowledge of LLMs. LLMEA first identifies candidate alignments for a given entity using two signals: the embedding similarity between entities across KGs, and the edit distance to a virtual equivalent entity. It then draws on the LLM's inference ability by iteratively posing multiple-choice questions, and the final prediction of the equivalent entity is taken from the LLM's output. Experiments on three public datasets show that LLMEA outperforms existing baseline models, and ablation studies further support the effectiveness of the framework. The paper also points out that existing methods often overlook the rich knowledge of LLMs, and that input token limits become a constraint in large-scale entity alignment. LLMEA therefore filters candidate entities using both the structural features of KGs and the internal knowledge of LLMs.
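The candidate-identification step can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the function names, the plain cosine-similarity and Levenshtein routines, the top-k union, and the assumption that the virtual equivalent entity is already available as a name string are all hypothetical choices made for this example.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def edit_distance(s, t):
    """Levenshtein distance between two strings, computed row by row."""
    prev = list(range(len(t) + 1))
    for i, cs in enumerate(s, 1):
        curr = [i]
        for j, ct in enumerate(t, 1):
            cost = 0 if cs == ct else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def select_candidates(source_vec, virtual_name, target_vecs, target_names, k=3):
    """Return the union of two top-k lists of target entity ids.

    Assumption: `target_vecs` maps entity id -> embedding and
    `target_names` maps entity id -> name; `virtual_name` is the name of
    the virtual equivalent entity, however it was obtained.
    """
    by_embedding = sorted(
        target_vecs,
        key=lambda e: cosine_similarity(source_vec, target_vecs[e]),
        reverse=True,
    )[:k]
    by_name = sorted(
        target_names,
        key=lambda e: edit_distance(virtual_name, target_names[e]),
    )[:k]
    # Merge both lists, keeping first-seen order and dropping duplicates.
    candidates = []
    for entity in by_embedding + by_name:
        if entity not in candidates:
            candidates.append(entity)
    return candidates
```

Taking the union of the two lists keeps an entity that either signal ranks highly; the candidate list stays short enough to present to an LLM as answer options.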
LLMEA learns embeddings that combine the structural and relational similarities of entities, determines candidate equivalent entities through embedding similarity and edit distance, and passes these candidates to the LLM as answer options for predicting the final alignment. In summary, the paper addresses how to integrate the knowledge of KGs and LLMs effectively to improve entity alignment, and proposes LLMEA, a novel framework that improves performance by drawing on the inference ability of LLMs.
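The iterative multiple-choice step could look roughly like the sketch below. The summary does not specify the prompt wording or how the rounds are organized, so the tournament-style elimination, the prompt text, and the `ask_llm` callable (a stand-in for whatever LLM client is used) are assumptions made for illustration. `fake_llm` is only a stub so that the example runs without a model.

```python
from typing import Callable, List

LETTERS = "ABCDEFGHIJKLMNOPQRSTUVWXYZ"


def build_question(entity: str, options: List[str]) -> str:
    """Format one multiple-choice question (hypothetical prompt wording)."""
    lines = [f"Which of the following entities is equivalent to '{entity}'?"]
    lines += [f"{LETTERS[i]}. {opt}" for i, opt in enumerate(options)]
    lines.append("Answer with a single letter.")
    return "\n".join(lines)


def pick_equivalent(entity: str,
                    candidates: List[str],
                    ask_llm: Callable[[str], str],
                    options_per_question: int = 4) -> str:
    """Narrow the candidates round by round until one remains.

    Assumption: each round splits the remaining candidates into groups,
    asks one question per group, and keeps the chosen option.
    """
    if not candidates:
        raise ValueError("at least one candidate is required")
    if not 2 <= options_per_question <= len(LETTERS):
        raise ValueError("options_per_question must be between 2 and 26")
    remaining = list(candidates)
    while len(remaining) > 1:
        winners = []
        for start in range(0, len(remaining), options_per_question):
            group = remaining[start:start + options_per_question]
            if len(group) == 1:
                # A lone leftover option advances without a question.
                winners.append(group[0])
                continue
            answer = ask_llm(build_question(entity, group)).strip().upper()
            index = LETTERS.find(answer[0]) if answer else -1
            if not 0 <= index < len(group):
                index = 0  # unparsable answer: fall back to the first option
            winners.append(group[index])
        remaining = winners
    return remaining[0]


def fake_llm(prompt: str) -> str:
    """Demonstration stub: picks the option named 'Paris', else 'A'."""
    for line in prompt.splitlines():
        if line[1:3] == ". " and line[3:] == "Paris":
            return line[0]
    return "A"
```

Splitting the candidates across several short questions keeps each prompt small, which is one way to work within the input token limits mentioned above.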