Collaborative Filtering Algorithm Combining the Feature of Controversy

ZHANG Xue-sheng,CHEN Chao,ZHAGN Ying-feng,YU Neng-hai
DOI: https://doi.org/10.3969/j.issn.1000-1220.2012.04.005
2012-01-01
Abstract:Item-based Collaborative Filtering(CF) algorithm has been widely used in e-commerce.The most critical component of the algorithm is how to measure the similarity between items.Traditional calculations of similarities relied on the scores of the items that two users both rated,which suffers from data sparsity and poor prediction quality problems.In this paper,we consider the whole ratings between items and propose the conception of Item Controversy Similarity(ICS),which measures the items′ similarity by calculating the divergence of variance of the rating values between items.Combing the ICS to the traditional similarity calculation algorithm,we propose a new CF algorithm,which could reduce the inaccurate similarity in data sparsity.Empirical studies on dataset MovieLens show that algorithm outperforms other state-of-the-art CF algorithms and it is more robust against data sparsity.
What problem does this paper attempt to address?