Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning

Ruijie Wang,Zheng Li,Jingfeng Yang,Tianyu Cao,Chao Zhang,Bing Yin,Tarek Abdelzaher

DOI: https://doi.org/10.48550/arXiv.2303.14898

2023-03-27

Abstract:This paper investigates cross-lingual temporal knowledge graph reasoning problem, which aims to facilitate reasoning on Temporal Knowledge Graphs (TKGs) in low-resource languages by transfering knowledge from TKGs in high-resource ones. The cross-lingual distillation ability across TKGs becomes increasingly crucial, in light of the unsatisfying performance of existing reasoning methods on those severely incomplete TKGs, especially in low-resource languages. However, it poses tremendous challenges in two aspects. First, the cross-lingual alignments, which serve as bridges for knowledge transfer, are usually too scarce to transfer sufficient knowledge between two TKGs. Second, temporal knowledge discrepancy of the aligned entities, especially when alignments are unreliable, can mislead the knowledge distillation process. We correspondingly propose a mutually-paced knowledge distillation model MP-KD, where a teacher network trained on a source TKG can guide the training of a student network on target TKGs with an alignment module. Concretely, to deal with the scarcity issue, MP-KD generates pseudo alignments between TKGs based on the temporal information extracted by our representation module. To maximize the efficacy of knowledge transfer and control the noise caused by the temporal knowledge discrepancy, we enhance MP-KD with a temporal cross-lingual attention mechanism to dynamically estimate the alignment strength. The two procedures are mutually paced along with model training. Extensive experiments on twelve cross-lingual TKG transfer tasks in the EventKG benchmark demonstrate the effectiveness of the proposed MP-KD method.

Machine Learning,Artificial Intelligence,Computation and Language,Social and Information Networks

What problem does this paper attempt to address?

The paper aims to address the problem of Cross-lingual Temporal Knowledge Graph Reasoning. Specifically, the main objective of the research is to enhance reasoning performance in low-resource language environments by transferring knowledge from Temporal Knowledge Graphs (TKGs) in high-resource languages to TKGs in low-resource languages. The paper points out that existing methods perform poorly when dealing with severely incomplete TKGs, especially in low-resource languages. To this end, the authors propose a method called "Mutually-paced Knowledge Distillation" (MP-KD). MP-KD primarily addresses two key challenges: 1. **Scarcity of Cross-lingual Alignments**: To alleviate this issue, MP-KD extends the upper limit of knowledge transfer by generating pseudo alignments, and this process is synchronized with model training. 2. **Temporal Knowledge Discrepancies**: To mitigate the negative impact of these discrepancies, MP-KD employs a method of dynamically adjusting alignment strength, estimating the temporal alignment strength distribution of the graph structure through a spatiotemporal attention mechanism. Through the aforementioned methods, MP-KD is able to achieve more efficient knowledge transfer in cross-lingual temporal knowledge graphs, and its effectiveness is validated through experiments.

Mutually-paced Knowledge Distillation for Cross-lingual Temporal Knowledge Graph Reasoning

DistilE: Distiling Knowledge Graph Embeddings for Faster and Cheaper Reasoning

Learning Long- and Short-term Representations for Temporal Knowledge Graph Reasoning

Collaborative Knowledge Distillation Via Multiknowledge Transfer.

Temporal Knowledge Graph Reasoning Based on Dynamic Fusion Representation Learning

MetaTKG: Learning Evolutionary Meta-Knowledge for Temporal Knowledge Graph Reasoning

Learning dual disentangled representation with self-supervision for temporal knowledge graph reasoning

Cross-Task Knowledge Distillation in Multi-Task Recommendation

Learning multi-graph structure for Temporal Knowledge Graph reasoning

Knowledge Representing: Efficient, Sparse Representation of Prior Knowledge for Knowledge Distillation

RLGNet: Repeating-Local-Global History Network for Temporal Knowledge Graph Reasoning

RLAT: Multi-hop temporal knowledge graph reasoning based on reinforcement learning and attention mechanism

Temporal Knowledge Graph Reasoning Based on Evolutional Representation and Contrastive Learning

Learning Latent Relations for Temporal Knowledge Graph Reasoning

Temporal knowledge graph reasoning triggered by memories

Attention and feature transfer based knowledge distillation

Adapting Distilled Knowledge for Few-shot Relation Reasoning over Knowledge Graphs

Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment

TimeTraveler: Reinforcement Learning for Temporal Knowledge Graph Forecasting

Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation