Wasserstein Topology Transfer for Joint Distilling Embeddings of Knowledge Graph Entities and Relations.

Jiachen Yu, Yuehong Wu, Shangsong Liang
DOI: https://doi.org/10.1145/3639631.3639662
2023-01-01
Abstract: A high-dimensional knowledge graph embedding (KGE) space is usually required to represent entities and relations with good reasoning capability. However, the memory overhead of a high-dimensional KGE grows with the number of entities and the diversity of relations, which is a problem on mobile or edge devices where memory is limited. Recently, researchers have focused on reducing this memory burden by distilling critical information from a high-dimensional KGE teacher into a low-dimensional KGE student. However, these methods still face two main issues: (1) they neglect the distillation of graph topology from teachers; (2) they distil entities and relations independently instead of in a shared semantic space. To address these issues, we propose Wasserstein Topology Transfer (WTT), which discovers graph structure and transfers graph topology from teachers to students. In particular, we propose two novel optimal transport (OT) regularizers, the Reversing Wasserstein Regularizer and the Blending Wasserstein Regularizer, to constrain the student's behaviours and enable efficient distillation of entity and relation embeddings from teachers to students. The regularizers distil entities and relations in the same OT procedure and use the adjacency matrices of KGs to reduce redundant transport procedures. Extensive experimental results demonstrate the effectiveness of our WTT method.
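The abstract does not specify how the two regularizers are computed, so the following is only a minimal sketch of the general idea of an adjacency-restricted optimal transport loss between teacher and student embeddings. It is not the authors' Reversing or Blending Wasserstein Regularizer. Everything in it is an assumption made for illustration: the entropic (Sinkhorn) solver, the squared Euclidean cost, the random projection `proj` that brings the teacher into the student's dimension, the uniform marginals, and the names `sinkhorn_plan` and `masked_ot_loss`. It covers entity embeddings only, whereas the paper distils entities and relations jointly.

```python
import numpy as np


def sinkhorn_plan(cost, a, b, mask, eps=0.1, n_iters=200):
    """Entropic OT plan between histograms a and b (Sinkhorn iterations).

    mask is a 0/1 matrix of allowed transport pairs. Disallowed pairs get a
    zero kernel entry, so no mass is ever moved across them.
    """
    K = np.exp(-cost / eps) * mask  # Gibbs kernel restricted to allowed pairs
    u = np.ones_like(a)
    for _ in range(n_iters):
        v = b / (K.T @ u)  # rescale columns towards marginal b
        u = a / (K @ v)    # rescale rows towards marginal a
    return u[:, None] * K * v[None, :]


def masked_ot_loss(teacher, student, adjacency, proj, eps=0.1):
    """Transport cost between teacher and student entity embeddings.

    Assumption: transport is only allowed between an entity and itself or
    its graph neighbours, which is one possible reading of "reducing
    redundant transport procedures by adjacency matrices".
    """
    n = teacher.shape[0]
    # Symmetrise and add self-loops so every row and column has support.
    mask = ((adjacency + adjacency.T + np.eye(n)) > 0).astype(float)
    t_low = teacher @ proj  # hypothetical projection to the student dimension
    diff = t_low[:, None, :] - student[None, :, :]
    cost = (diff ** 2).sum(axis=-1)
    cost = cost / cost.max()  # keep exp(-cost / eps) away from underflow
    a = np.full(n, 1.0 / n)   # uniform mass on teacher entities
    b = np.full(n, 1.0 / n)   # uniform mass on student entities
    plan = sinkhorn_plan(cost, a, b, mask, eps=eps)
    return float((plan * cost).sum()), plan, mask


# Toy data: 5 entities on a chain graph, 16-d teacher, 4-d student.
rng = np.random.default_rng(0)
n, d_t, d_s = 5, 16, 4
teacher = rng.normal(size=(n, d_t))
student = rng.normal(size=(n, d_s))
proj = rng.normal(size=(d_t, d_s)) / np.sqrt(d_t)
adjacency = np.zeros((n, n))
for i, j in [(0, 1), (1, 2), (2, 3), (3, 4)]:
    adjacency[i, j] = 1.0

loss, plan, mask = masked_ot_loss(teacher, student, adjacency, proj)
```

In a distillation setting, a term like `loss` would be added to the student's training objective and differentiated with respect to the student embeddings, which requires an autodiff framework rather than plain NumPy. The mask keeps the transport plan sparse, so mass only moves along edges of the knowledge graph.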