WMPEClus: Clustering Via Weighted Meta-Path Embedding for Heterogeneous Information Networks.

Yongjun Zhang,Xiaoping Yang,Liang Wang,Kede Li
DOI: https://doi.org/10.1109/ictai50040.2020.00127
2020-01-01
Abstract:A low-dimensional embedding of multiple nodes is great convenient for clustering, which is one of the most fundamental tasks for heterogeneous information networks (HINs). In the meantime, the random walk-based network embedding is proved to be equivalent to the method of matrix factorization whose computational cost is very expensive. Moreover, mapping different types of nodes into one metric space may result in incompatibility. To cope with the two challenges above, a weighted meta-path embedding based clustering method (called WMPEClus) is proposed in this paper. On the one hand, in order to solve the incompatibility problem, the original network is transformed into several subnetworks with independent semantics specified by meta-paths which are automatically generated by our method. On the other hand, an approximate commute embedding approach, avoiding eigen-decomposition to reduce computational cost, is leveraged to the representation learning of the nodes in each subnetwork. At last, a unified probabilistic generation model is designed to aggregate the vectorized representations learned in different metric spaces for clustering. Experiment results show that WMPEClus is effective in HIN clustering and outperforms the state-of-the-art baselines on two real-world datasets.
What problem does this paper attempt to address?