Multi-Source Bayesian Embeddings for Learning Social Knowledge Graphs.

Zhilin Yang,Jie Tang,William Cohen
2016-01-01
Abstract:Understanding the semantics of social networks is important for social mining. However, few works have studied how to bridge the gap between large-scale social networks and abundant collective knowledge. In this work, we study the problem of learning social knowledge graphs, which aims to accurately connect social network vertices to knowledge concepts. We propose a multi-source Bayesian embedding model, GenVector, to jointly incorporate topic models and word/network embeddings. The model leverages largescale unlabeled data by incorporating the embeddings, and co-represents social network vertices and knowledge concepts in a shared latent topic space. Experiments on three datasets show that our method outperforms state-of-the-art methods. We deploy our algorithm on a large-scale academic social network by linking 39 million researchers to 35 million knowledge concepts, and decrease the error rate by 67% according to online test.
What problem does this paper attempt to address?