Research on Collaborative Filtering Book Recommendation Based on Hadoop and Mahout

Feng Guohe, Huang Jiaxing
DOI: https://doi.org/10.7536/j.issn.0252-3116.2013.18.020
2013-01-01
Abstract: First, this paper builds a book recommendation engine based on the Hadoop open-source distributed computing framework and the Mahout collaborative filtering recommendation engine. Second, it uses the cloud model and the Pearson coefficient to improve the traditional collaborative filtering algorithm, addressing the poor performance and inaccurate recommendations of traditional stand-alone algorithms on high-dimensional sparse matrices. Third, it experimentally evaluates the overall performance of the distributed recommendation platform and of the improved collaborative filtering algorithm. The findings are: (1) as the number of virtual machine nodes increases, the computation time of the collaborative filtering recommendation engine decreases, which shows that overall system performance improves; (2) when Mahout's original collaborative filtering engine is improved with the Pearson coefficient and evaluated by mean absolute error (MAE) against the original algorithm, recommendation accuracy increases by 13.1%, and subjective differences in user ratings have a large impact on recommendation accuracy.
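The two quantities named in the abstract, the Pearson coefficient and MAE, can be illustrated with a minimal stand-alone sketch. This is not the paper's implementation and does not use Hadoop or Mahout; the cloud-model improvement is not shown. The function names `pearson` and `mae`, the dictionary layout of the ratings, and the sample data are all assumptions made for illustration.

```python
from math import sqrt


def pearson(ratings_a, ratings_b):
    """Pearson correlation between two users over the items both have rated.

    Each argument is a dict mapping item id -> rating (an assumed layout).
    Returns 0.0 when fewer than two items overlap or a user's ratings
    on the shared items do not vary.
    """
    common = set(ratings_a) & set(ratings_b)
    n = len(common)
    if n < 2:
        return 0.0
    mean_a = sum(ratings_a[i] for i in common) / n
    mean_b = sum(ratings_b[i] for i in common) / n
    cov = sum((ratings_a[i] - mean_a) * (ratings_b[i] - mean_b) for i in common)
    var_a = sum((ratings_a[i] - mean_a) ** 2 for i in common)
    var_b = sum((ratings_b[i] - mean_b) ** 2 for i in common)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / sqrt(var_a * var_b)


def mae(predicted, actual):
    """Mean absolute error between paired predicted and actual ratings."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("predicted and actual must be non-empty and equal length")
    return sum(abs(p - a) for p, a in zip(predicted, actual)) / len(actual)


# Hypothetical sample data: bob rates every shared book one point lower.
alice = {"book1": 5, "book2": 3, "book3": 4}
bob = {"book1": 4, "book2": 2, "book3": 3}

similarity = pearson(alice, bob)
error = mae([4.0, 3.0], [5, 3])
```

In this sample, `bob` rates each book exactly one point below `alice`, yet the Pearson coefficient treats the two users as perfectly correlated because it centers each user's ratings on that user's own mean. This is one common reason for preferring Pearson when users differ in how generously they rate; whether it matches the paper's reasoning is an assumption.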