Abstract:Information networks, such as social and citation networks, are ubiquitous in the real world so that network analysis plays an important role in data mining and knowledge discovery. To alleviate the sparsity problem of network analysis, it is common to capture the network semantics by projecting nodes onto a vector space as network embeddings. Moreover, random walks are usually exploited to efficiently learn node embeddings and preserve network proximity. In addition to proximity structure, heterogeneous networks have more knowledge about the types of nodes. However, to profit from heterogeneous knowledge, most of the existing approaches guide the random walks through predefined meta-paths or specific strategies, which can distort the understanding of network structures. Furthermore, traditional random walk-based approaches much favor the nodes with higher degrees while other nodes are equivalently important for the downstream applications. In this paper, we propose Meta-context Aware Random Walks (MARU) to overcome these challenges, thereby learning richer and more unbiased representations for heterogeneous networks. To reduce the bias in classical random walks, the algorithm of bidirectional extended random walks is introduced to improve the fairness of representation learning. Based on the enhanced random walks, the meta-context aware skip-gram model is then presented to learn robust network embeddings with dynamic meta-contexts. Therefore, MARU can not only fairly understand the overall network structures but also leverage the sophisticated heterogeneous knowledge in the networks. Extensive experiments have been conducted on three real-world large-scale publicly available datasets. The experimental results demonstrate that MARU significantly outperforms state-of-the-art heterogeneous network embedding methods across three general machine learning tasks, including multi-label node classification, node clustering, and link prediction.

UniNet: Scalable Network Representation Learning with Metropolis-Hastings Sampling

Universal Network Representation for Heterogeneous Information Networks.

A Unified Framework for Community Detection and Network Representation Learning

MARU: Meta-context Aware Random Walks for Heterogeneous Network Representation Learning

Network Representation Learning Guided by Partial Community Structure

Common Neighbors Matter: Fast Random Walk Sampling with Common Neighbor Awareness

Walking with Perception: Efficient Random Walk Sampling via Common Neighbor Awareness

A Unified Generative Adversarial Learning Framework for Improvement of Skip-Gram Network Representation Learning Methods

Community-enhanced Network Representation Learning for Network Analysis.

Network Representation Learning: From Preprocessing, Feature Extraction to Node Embedding

Network Representation Learning: From Traditional Feature Learning to Deep Learning

Context-aware Sampling of Large Networks via Graph Representation Learning

NetRL: Task-aware Network Denoising via Deep Reinforcement Learning

Sampling Online Social Networks by Random Walk with Indirect Jumps

Random Walk on Multiple Networks

A Multi-Semantic Metapath Model for Large Scale Heterogeneous Network Representation Learning

TransNet: Translation-Based Network Representation Learning for Social Relation Extraction.

A United Approach to Learning Sparse Attributed Network Embedding

Representation Learning for Heterogeneous Information Networks via Embedding Events

UniWalk: Unidirectional Random Walk Based Scalable SimRank Computation over Large Graph

AttrHIN: Network Representation Learning Method for Heterogeneous Information Network