ASGEA: Exploiting Logic Rules from Align-Subgraphs for Entity Alignment

Yangyifei Luo, Zhuo Chen, Lingbing Guo, Qian Li, Wenxuan Zeng, Zhixin Cai, Jianxin Li
2024-03-05
Abstract: Entity alignment (EA) aims to identify entities across different knowledge graphs that represent the same real-world objects. Recent embedding-based EA methods have achieved state-of-the-art performance in EA yet faced interpretability challenges as they purely rely on the embedding distance and neglect the logic rules behind a pair of aligned entities. In this paper, we propose the Align-Subgraph Entity Alignment (ASGEA) framework to exploit logic rules from Align-Subgraphs. ASGEA uses anchor links as bridges to construct Align-Subgraphs and spreads along the paths across KGs, which distinguishes it from the embedding-based methods. Furthermore, we design an interpretable Path-based Graph Neural Network, ASGNN, to effectively identify and integrate the logic rules across KGs. We also introduce a node-level multi-modal attention mechanism coupled with multi-modal enriched anchors to augment the Align-Subgraph. Our experimental results demonstrate the superior performance of ASGEA over the existing embedding-based methods in both EA and Multi-Modal EA (MMEA) tasks.
Computation and Language, Artificial Intelligence
What problem does this paper attempt to address?
This paper addresses the interpretability problem in entity alignment (EA), in particular the weakness of existing embedding-based methods on entities that have similar neighborhood structures but different alignment correlations. Current methods rely mainly on embedding distances and ignore the logic rules behind entity alignment, so they struggle to separate relevant from irrelevant neighbor information.

### Main problems of the paper

1. **Interpretability challenges**:
   - Existing embedding-based EA methods rely entirely on embedding distances and cannot explain why some entities are aligned while others are not.
   - These methods ignore the logic rules behind entity alignment. For example, if two entities have the same spouse, the two entities may also be the same.
2. **Similar neighborhood structures but different alignment correlations**:
   - When entities in different knowledge graphs (KGs) have similar neighborhood structures, existing methods have difficulty deciding whether they should be aligned.
   - In the paper's Figure 1, "Maurice Sendak" and "Bill Evans" are structurally similar but should not be aligned because they represent different entities.
3. **Utilization of multi-modal information**:
   - Existing methods perform poorly on multi-modal EA tasks. In particular, without sufficient anchor links it is difficult to obtain a wide range of potential paths.

### Proposed solutions

To solve these problems, the paper proposes a new framework, **Align-Subgraph Entity Alignment (ASGEA)**. Its main contributions are:

1. **Align-Subgraph extraction algorithm**:
   - Constructs subgraphs that contain all possible alignment-rule paths, which reduces the noise from irrelevant neighbor information.
   - The ASG extraction algorithm identifies and filters out irrelevant neighbor information, focusing on key alignment details.
2. **Interpretable Path-based Graph Neural Network (ASGNN)**:
   - ASGNN uses an interpretable attention mechanism to prioritize important paths and adopts a one-way path message-passing strategy to preserve path-specific information.
   - The network ensures that the generated entity representations focus only on key alignment details, improving the interpretability and accuracy of the model.
3. **Node-level multi-modal attention mechanism**:
   - Introduces a node-level multi-modal attention mechanism to improve the model's performance on multi-modal EA tasks.
   - Creates new multi-modal anchor links and integrates auxiliary anchor data to support ASG extraction.

### Experimental results

The experimental results show that ASGEA significantly outperforms existing embedding-based methods on multiple benchmark datasets and performs especially well on multi-modal EA tasks. Specifically:

- **FBDB15K and FBYG15K datasets**: ASGEA-MM improves by 14.7% and 29.0% in the H@1 and MRR metrics respectively, indicating a stable advantage under different training data proportions.
- **DBP15K dataset**: ASGEA-MM improves by 9.3% and 6.1% in the H@1 and MRR metrics respectively on three subsets, showing the importance of multi-modal information.
- **Multi-OpenEA dataset**: ASGEA-MM improves by 6.3% and 4.1% on the EN-FR-15K and EN-DE-15K datasets respectively, further supporting its performance advantage.

In summary, ASGEA significantly improves the accuracy and interpretability of entity alignment by introducing logic rules and multi-modal information.
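
The idea of using anchor links as bridges between two KGs can be illustrated with a small sketch. This is not the paper's ASG extraction algorithm or ASGNN. It is a minimal, hypothetical illustration that enumerates two-hop paths of the form "source entity, relation, anchored neighbor, anchor link, relation, candidate entity". The toy triples, the entity names, and the helper functions `neighbors` and `anchor_paths` are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical toy data: two tiny KGs as (head, relation, tail) triples.
KG1 = [
    ("kg1:Alice", "spouse", "kg1:Bob"),
    ("kg1:Alice", "born_in", "kg1:Paris"),
    ("kg1:Carol", "born_in", "kg1:Paris"),
]
KG2 = [
    ("kg2:Alicia", "spouse", "kg2:Robert"),
    ("kg2:Alicia", "birthplace", "kg2:Paris"),
    ("kg2:Caroline", "birthplace", "kg2:Paris"),
]
# Hypothetical anchor links: entity pairs already known to be aligned.
ANCHORS = {"kg1:Bob": "kg2:Robert", "kg1:Paris": "kg2:Paris"}


def neighbors(triples):
    """Build an adjacency map: entity -> list of (relation, neighbor).

    Each triple is also stored in the reverse direction, with the
    relation name suffixed by '^-1', so paths can be walked both ways.
    """
    adj = defaultdict(list)
    for head, rel, tail in triples:
        adj[head].append((rel, tail))
        adj[tail].append((rel + "^-1", head))
    return adj


def anchor_paths(source, adj1, adj2, anchors):
    """Enumerate two-hop paths from `source` (in KG1) to candidates in KG2.

    A path is (r1, mid1, mid2, r2): source -r1-> mid1 in KG1, then the
    anchor link mid1 = mid2, then mid2 -r2-> candidate in KG2.
    Neighbors without an anchor link contribute no path, so they are
    left out of the result entirely.
    """
    paths = defaultdict(list)
    for r1, mid1 in adj1[source]:
        mid2 = anchors.get(mid1)
        if mid2 is None:
            continue
        for r2, candidate in adj2[mid2]:
            paths[candidate].append((r1, mid1, mid2, r2))
    return dict(paths)


adj1, adj2 = neighbors(KG1), neighbors(KG2)
for candidate, found in anchor_paths("kg1:Alice", adj1, adj2, ANCHORS).items():
    print(candidate, found)
```

In this toy setting, `kg2:Alicia` is reached from `kg1:Alice` by both a spouse path and a birthplace path, while `kg2:Caroline` is reached only through the shared birthplace. Each path can be read as a candidate rule, such as the "same spouse" example mentioned in the summary. According to the summary, ASGNN prioritizes paths with a learned attention mechanism rather than by counting them as one might do here.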