Abstract:The increasing popularity of multi-modal knowledge graphs (MMKGs) has led to a need for efficient entity alignment techniques that can exploit multi-modal information to integrate knowledge from different sources. GNN-based multi-modal entity alignment (MMEA) methods have achieved significant progress in entity alignment (EA) areas. However, these methods only rely on Graph Neural Networks (GNNs) to encode structural information, while ignoring visual and semantic modalities, which may lead to incomplete representation, thus how to integrate the visual and semantic information into GNN-based EA methods remains unexplored. In light of our insight that incorporating the message-passing mechanism of Graph Neural Networks to integrate multi-modal information is essential for fully exploiting the graph representation capability of GNN, we propose a novel Cross-modal Graph attention network for Entity Alignment (XGEA) that enables visual knowledge to interact with other views of the entity, including structural and literal information. We leverage the information from one modality as complementary relation information to compute the attention of another modality in the graph attention layers, enabling the learning of entity embedding by integrating multiple modalities. Moreover, the quantity of labeled data plays a crucial role in model performance, yet obtaining sufficient training data is expensive. To mitigate this issue, we use visual and semantic information to generate pseudo-pairs and introduce a soft pseudo-labeling method for entity alignment to assign weights to the augmented training data to balance its quantity and quality. Extensive experiments show that our XGEA achieves superior performance consistently over the state-of-the-art MMEA baselines.

Cross-Modal Graph Attention Network for Entity Alignment

Position-Aware Active Learning for Multi-Modal Entity Alignment

Multi-modal Graph Convolutional Network for Knowledge Graph Entity Alignment

An Entity Alignment Method Based on Graph Attention Network with Pre-classification

Rethinking Uncertainly Missing and Ambiguous Visual Modality in Multi-Modal Entity Alignment

Multi-modal Siamese Network for Entity Alignment

A Contextual Alignment Enhanced Cross Graph Attention Network for Cross-lingual Entity Alignment.

MMEA: Entity Alignment for Multi-modal Knowledge Graph.

LoginMEA: Local-to-Global Interaction Network for Multi-modal Entity Alignment

Multi-modal Entity Alignment Via Position-enhanced Multi-label Propagation

Subgraph-aware Virtual Node Matching Graph Attention Network for Entity Alignment

Multi-Channel Graph Neural Network for Entity Alignment.

Attribute-Consistent Knowledge Graph Representation Learning for Multi-Modal Entity Alignment

Enhanced Entity Interaction Modeling for Multi-Modal Entity Alignment.

MRAEA: An Efficient and Robust Entity Alignment Approach for Cross-lingual Knowledge Graph

Chinese Cross-modal Entity Alignment Method Based on Multi-modal Knowledge Graph

MEAformer: Multi-modal Entity Alignment Transformer for Meta Modality Hybrid

PSNEA: Pseudo-Siamese Network for Entity Alignment between Multi-modal Knowledge Graphs

Multi-Modal Entity Alignment Method Based on Feature Enhancement

A Multi -Role Graph Attention Network for Knowledge Graph Alignment.

Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment