Extending LINE for Network Embedding with Completely Imbalanced Labels

Zheng Wang,Qiao Wang,Tanjie Zhu,Xiaojun Ye
DOI: https://doi.org/10.4018/ijdwm.2020070102
2020-01-01
International Journal of Data Warehousing and Mining
Abstract:Network embedding is a fundamental problem in network research. Semi-supervised network embedding, which benefits from labeled data, has recently attracted considerable interest. However, existing semi-supervised methods would get biased results in the completely-imbalanced label setting where labeled data cannot cover all classes. This article proposes a novel network embedding method which could benefit from completely-imbalanced labels by approximately guaranteeing both intra-class similarity and inter-class dissimilarity. In addition, the authors prove and adopt the matrix factorization form of LINE (a famous network embedding method) as the network structure preserving model. Extensive experiments demonstrate the superiority and robustness of this method.
What problem does this paper attempt to address?