Multi-Modal Siamese Network for Few-Shot Knowledge Graph Completion

Yuyang Wei,Wei Chen,Xiaofang Zhang,Pengpeng Zhao,Jianfeng Qu,Lei Zhao
DOI: https://doi.org/10.1109/icde60146.2024.00061
2024-01-01
Abstract:Multi-modal data have recently been utilized to improve the performance of knowledge graph completion (KGC), attracting widespread research interest. However, they have been ignored in few-shot knowledge graph completion (FKGC), which aims to discover potential facts involving unseen relations that only appear in few-shot triples. The most relevant FKGC study simply concatenates various modal features, but the performance is still limited due to the following problems: (1) lack of exploiting significant multi-modal features in neighborhoods, and (2) ineffectively modeling inter-modal interactions in a few-shot setting. To tackle these problems, we propose a novel relational learning model entitled MMSN (Multi-Modal Siamese Network) for few-shot knowledge graph completion, which is composed of the following two primary modules: the Siamese multi-modal neighbor encoder (SMNE) and the meta-learning multi-modal knowledge representation decoder (MKRD). The module SMNE is developed to encode diverse modalities of neighbors by a Siamese attention network, fuse multi-modal information through a gating fusion network, and learn effective relational embeddings using an aggregator. The module MKRD is introduced to handle inter-modal interactions between multiple modalities and train the proposed model in a few-shot scenario. Extensive experiments demonstrate that our proposed model MMSN outperforms the state-of-the-art FKGC models, including uni-modal and multi-modal models, on two real-world few-shot multi-modal datasets.
What problem does this paper attempt to address?