A Fast Two-Stage Algorithm for Computing SimRank and Its Extensions

Xu Jia, Hongyan Liu, Li Zou, Jun He, Xiaoyong Du
DOI: https://doi.org/10.1007/978-3-642-16720-1_6
2010-01-01
Abstract: Similarity estimation is used in many applications, such as recommender systems, cluster analysis, information retrieval, and link prediction. SimRank is a well-known algorithm that measures the similarity of objects based on link structure. We observe that if a node has no in-links, the similarity score between it and any other node is always zero. Based on this observation, we propose a new algorithm, fast two-stage SimRank (F2S-SimRank), which avoids storing these unnecessary zeros and accelerates the computation without any loss of accuracy, using less computation time and less main memory. Experiments on real and synthetic data sets demonstrate the effectiveness and efficiency of F2S-SimRank.
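
The observation in the abstract can be illustrated with a minimal sketch. This is not the authors' F2S-SimRank implementation, only a plain iterative SimRank that first filters out nodes without in-links and then stores scores only for the remaining pairs. The function name `simrank_skip_sources`, the decay factor `C=0.8`, the iteration count, and the dictionary-based graph format are all assumptions made for illustration.

```python
def simrank_skip_sources(in_links, C=0.8, iterations=10):
    """Iterative SimRank that never stores pairs involving nodes with no in-links.

    in_links: dict mapping each node to the list of its in-neighbours
              (hypothetical input format chosen for this sketch).
    Returns a dict {(a, b): score} for a < b, holding non-zero scores only.
    """
    # Collect every node, including those that appear only as in-neighbours.
    nodes = set(in_links)
    for preds in in_links.values():
        nodes.update(preds)

    # Stage 1: keep only nodes with at least one in-link. Any pair that
    # involves a node outside this set has similarity zero by definition.
    active = sorted(n for n in nodes if in_links.get(n))

    scores = {}  # a missing key means a similarity of zero

    def sim(x, y):
        # A node is always fully similar to itself, even with no in-links.
        if x == y:
            return 1.0
        key = (x, y) if x < y else (y, x)
        return scores.get(key, 0.0)

    # Stage 2: standard SimRank iteration restricted to the active nodes.
    for _ in range(iterations):
        new_scores = {}
        for i, a in enumerate(active):
            for b in active[i + 1:]:
                total = sum(sim(x, y) for x in in_links[a] for y in in_links[b])
                if total:
                    norm = len(in_links[a]) * len(in_links[b])
                    new_scores[(a, b)] = C * total / norm
        scores = new_scores
    return scores


# Node "c" has no in-links, so no pair involving "c" is ever stored.
example = {"a": ["c"], "b": ["c"], "c": []}
result = simrank_skip_sources(example)  # {('a', 'b'): 0.8}
```

Note that a node without in-links still contributes through its self-similarity of 1 when it is a shared in-neighbour, which is why `sim` handles the `x == y` case before looking up stored scores.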