Efficient Top-K SimRank-based Similarity Join.

Wenbo Tao, Minghe Yu, Guoliang Li
DOI: https://doi.org/10.14778/2735508.2735520
IF: 2.5
2014-01-01
Proceedings of the VLDB Endowment
Abstract: SimRank is an effective and widely adopted measure for quantifying the structural similarity between pairs of nodes in a graph. In this paper, we study the problem of top-k SimRank-based similarity join, which finds the k pairs of nodes with the largest SimRank values. To the best of our knowledge, this is the first attempt to address this problem. We propose a random-walk-based method to efficiently identify the top-k pairs. Experimental results on real datasets show that our method significantly outperforms baseline approaches.
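To make the problem statement concrete, the following is a minimal sketch of a naive baseline: it computes all-pairs SimRank with the standard iterative definition and then keeps the k highest-scoring pairs. It is not the random-walk-based method proposed in the paper. The function names `simrank` and `top_k_pairs`, the decay value 0.8, the iteration count, and the toy graph are all assumptions chosen for illustration.

```python
import heapq
from itertools import combinations


def simrank(in_neighbors, decay=0.8, iterations=10):
    """Naive all-pairs SimRank (illustrative baseline, not the paper's method).

    in_neighbors maps every node to the list of its in-neighbors;
    each in-neighbor must itself be a key of the mapping.
    """
    nodes = sorted(in_neighbors)
    # Start from the identity: a node is fully similar only to itself.
    sim = {(a, b): 1.0 if a == b else 0.0 for a in nodes for b in nodes}
    for _ in range(iterations):
        new_sim = {}
        for a in nodes:
            for b in nodes:
                if a == b:
                    new_sim[(a, b)] = 1.0
                    continue
                ia, ib = in_neighbors[a], in_neighbors[b]
                if not ia or not ib:
                    # By definition the score is 0 if either node has no in-neighbors.
                    new_sim[(a, b)] = 0.0
                    continue
                # Average similarity over all pairs of in-neighbors, scaled by the decay.
                total = sum(sim[(i, j)] for i in ia for j in ib)
                new_sim[(a, b)] = decay * total / (len(ia) * len(ib))
        sim = new_sim
    return sim


def top_k_pairs(in_neighbors, k, decay=0.8, iterations=10):
    """Return the k distinct-node pairs with the largest scores as (score, pair)."""
    sim = simrank(in_neighbors, decay, iterations)
    pairs = combinations(sorted(in_neighbors), 2)
    return heapq.nlargest(k, ((sim[p], p) for p in pairs))


# Hypothetical toy graph: "a" points to "b" and "c"; both point to "d".
graph = {"a": [], "b": ["a"], "c": ["a"], "d": ["b", "c"]}
print(top_k_pairs(graph, k=1))
```

This baseline scores every pair of nodes in each iteration, so its cost grows at least quadratically with the number of nodes. That cost is the kind of overhead a dedicated top-k method would aim to avoid.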