COSINE: Compressive Network Embedding on Large-scale Information Networks

Zhengyan Zhang,Cheng Yang,Zhiyuan Liu,Maosong Sun,Zhichong Fang,Bo Zhang,Leyu Lin
DOI: https://doi.org/10.1109/tkde.2020.3030539
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:There is recently a surge in approaches that learn low-dimensional embeddings of nodes in networks. However, for large-scale real-world networks, it’s inefficient for existing approaches to store amounts of parameters in memory and update them edge by edge. With the knowledge that nodes having similar neighborhoods will be close to each other in the embedding space, we propose COSINE (COmpresSIve Network Embedding) algorithm, which reduces the memory footprint and accelerates the training process by parameter sharing among similar nodes. COSINE applies graph partitioning algorithms to networks and builds parameter sharing dependency of nodes based on the results of partitioning. In this way, COSINE injects prior knowledge about high-order structural information into models, which makes network embedding more efficient and effective. COSINE can be applied to any embedding lookup method and learn high-quality embeddings with limited memory and less training time. We conduct experiments on multi-label classification and link prediction, where baselines and our model have the same memory usage. Experimental results show that COSINE improves baselines by up to 23 percent on classification and 25 percent on link prediction. Moreover, the training time of all representation learning methods using COSINE decreases by 30 to 70 percent.
What problem does this paper attempt to address?