Semantic-aware network embedding via optimized random walk and paragaraph2vec
Lei Chen,Yuan Li,Xingye Deng,Zhaohua Liu,Mingyang Lv,Tingqin He
DOI: https://doi.org/10.1016/j.jocs.2022.101825
IF: 3.817
2022-09-01
Journal of Computational Science
Abstract:The Deepwalk-based network embedding algorithm, which was inspired by the word2vec model, can quickly and efficiently learn a low-dimensional vector representation for each node in a complex network. However, the success of word2vec in natural language environments relies heavily on ubiquitous contextual semantics, and the Deepwalk model is less accurate than other deep models due to ignoring contextual semantics in complex networks. To solve this challenge, a semantic-aware network embedding algorithm is proposed in this paper, simply called SANE, via optimized random walk and paragaraph2vec. In the SANE, a semantic-aware random walk strategy is designed to unsupervised generate many semantic-bearing node sequences from a complex network as a training set. Then, based on the training set, the paragraph2vec model is optimized by considering both global semantics and local semantics for learning more accurate low-dimensional node feature representations. Experiments on multiple tasks and multiple networks show the advantages of our SANE algorithm.
computer science, theory & methods, interdisciplinary applications