Stochastic Relational Models for Large-scale Dyadic Data using MCMC

Shenghuo Zhu,Kai Yu,Yihong Gong
2008-01-01
Abstract:Stochastic relational models (SRMs) (15) provide a rich family of choices for learning and predicting dyadic data between two sets of entities. The models gen- eralize matrix factorization to a supervised learning problem that utilizes attributes of entities in a hierarchical Bayesian framework. Previously variational Bayes in- ference was applied for SRMs, which is, however, not scalable when the size of either entity set grows to tens of thousands. In this paper, we introduce a Markov chain Monte Carlo (MCMC) algorithm for equivalent models of SRMs in order to scale the computation to very large dyadic data sets. Both superior scalability and predictive accuracy are demonstrated on a collaborative filtering problem, which involves tens of thousands users and half million items.
What problem does this paper attempt to address?