Ripple2vec: Node Embedding with Ripple Distance of Structures

Luo, Jizhou; Xiao, Song; Jiang, Shouxu; Gao, Hong; Xiao, Yinuo
DOI: https://doi.org/10.1007/s41019-022-00184-6
2022-01-01
Data Science and Engineering
Abstract: Graphs are a generic model of the various networks found in real-world applications, and graph embedding aims to represent nodes (or edges, or whole graphs) as low-dimensional vectors that can be fed into machine learning algorithms for downstream graph analysis tasks. However, existing random walk-based node embedding methods often map nodes with similar local structures to distant vectors and nodes with dissimilar local structures to nearby vectors. To overcome this issue, this paper proposes to perform node embedding by constructing a context graph via a newly defined ripple distance over ripple vectors, whose components are the hitting times of fully condensed neighborhoods and thus characterize their structures as pure quantities. The distance captures the (dis)similarities of nodes' local neighborhood structures and satisfies the triangle inequality. The neighbors of each node in the context graph are defined via the ripple distance, so that short random walks from a given node over the context graph visit only nodes that are similar to it in the original graph. This property guarantees that the proposed method, named $\mathsf{ripple2vec}$, maps similar nodes to nearby vectors and dissimilar nodes to distant vectors. Experimental results on real datasets, where labels are mainly related to nodes' local structures, show that $\mathsf{ripple2vec}$ outperforms state-of-the-art methods in node clustering and node classification and is competitive with other methods in link prediction.