Parallelized Research on Collaborative Filtering Algorithm Based on Matrix Factorization

Quan-min WANG, Yu MIAO, Ming HE, Shuang ZHENG
DOI: https://doi.org/10.3969/j.issn.1673-629X.2015.02.013
2015-01-01
Abstract: Collaborative filtering based on matrix factorization is a recommendation technique proposed in recent years. During recommendation, each prediction depends on the entire set of known ratings, and the feature matrices require a large amount of storage, so a single-node implementation runs into time and resource bottlenecks. Through an in-depth study of the principles and characteristics of the current parallel implementation of collaborative filtering based on ALS (Alternating Least Squares), this paper identifies why the traditional iterative implementation on Hadoop has very low computing efficiency. Following the idea of iterative MapReduce, it proposes several methods: loop-aware scheduling, static data caching, job loop control, and fixed-point detection. Experiments on the Netflix data set show that iterative MapReduce improves the parallel computing efficiency of the ALS-based collaborative filtering algorithm.
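To make the iterative structure concrete, the following is a minimal single-process sketch of ALS, not the paper's Hadoop or iterative MapReduce implementation. It assumes plain L2 regularization with a hypothetical weight `lam`, and all function names (`solve`, `update_side`, `objective`, `als`) are illustrative. The ratings are indexed once before the loop, which loosely mirrors the idea of caching static data across iterations, and the loop stops when the objective stops changing, which loosely mirrors fixed-point detection.

```python
import random


def solve(a, b):
    """Solve the small dense system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(m[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def update_side(ratings_by_row, fixed, k, lam):
    """One ALS half-step: recompute one factor matrix while the other is held fixed."""
    updated = {}
    for row_id, observed in ratings_by_row.items():
        # Normal equations: (sum v v^T + lam * I) x = sum r * v
        a = [[lam if i == j else 0.0 for j in range(k)] for i in range(k)]
        b = [0.0] * k
        for col_id, r in observed:
            v = fixed[col_id]
            for i in range(k):
                b[i] += r * v[i]
                for j in range(k):
                    a[i][j] += v[i] * v[j]
        updated[row_id] = solve(a, b)
    return updated


def objective(ratings, users, items, lam):
    """Regularized squared error over the known ratings."""
    err = 0.0
    for u, i, r in ratings:
        pred = sum(x * y for x, y in zip(users[u], items[i]))
        err += (r - pred) ** 2
    reg = sum(x * x for vec in users.values() for x in vec)
    reg += sum(x * x for vec in items.values() for x in vec)
    return err + lam * reg


def als(ratings, k=2, lam=0.1, max_iters=50, tol=1e-4, seed=0):
    """Alternate user and item updates until the objective change falls below tol."""
    rng = random.Random(seed)
    # Static data: index the ratings once and reuse the indexes in every iteration.
    by_user, by_item = {}, {}
    for u, i, r in ratings:
        by_user.setdefault(u, []).append((i, r))
        by_item.setdefault(i, []).append((u, r))
    items = {i: [rng.random() for _ in range(k)] for i in by_item}
    users = {}
    history = []
    for _ in range(max_iters):
        users = update_side(by_user, items, k, lam)
        items = update_side(by_item, users, k, lam)
        history.append(objective(ratings, users, items, lam))
        # Fixed-point check: stop once an iteration no longer changes the objective.
        if len(history) > 1 and abs(history[-2] - history[-1]) < tol:
            break
    return users, items, history


if __name__ == "__main__":
    # Toy (user, item, rating) triples, invented for illustration only.
    toy = [(0, 0, 5.0), (0, 1, 3.0), (1, 0, 4.0), (1, 2, 1.0), (2, 1, 2.0), (2, 2, 5.0)]
    users, items, history = als(toy)
    print(len(history), "iterations, final objective", history[-1])
```

Each pass of the loop needs the full rating set and both factor matrices. In this sketch they simply stay in memory between passes, whereas a chain of independent jobs has no such shared state to reuse.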