Strongly Connected Components Based Efficient PPR Algorithms

Hong-Guo YANG,De-Rong SHEN,Yue KOU,Tie-Zheng NIE,Ge YU
DOI: https://doi.org/10.11897/SP.J.1016.2017.00584
2017-01-01
Abstract:Personalized PageRank (PPR) is usually employed to rank the nodes of graphs.Due to the ever increasing volume of graph, how to improve the efficiency of PPR computations has become a research focus.Thus the purpose of this paper is to improve the efficiency of PPR computations.The existing optimization algorithms can be generally classified into two categories: distributed algorithm and serial algorithm, and the general approach of them is mainly through partitioning the computations on the big graph into computations on multiple smaller sub-graphs, but the communication between sub-graphs usually involve a large amount of data and is of a high frequency.The SCC (Strongly Connected Component) based algorithms proposed in this paper can resolve these problems effectively.The main computation steps of them is: first identify and remove volumes of unrelated nodes and edges quickly before PPR computation, then transform the PPR computations on big graph into that on multiple SCC sub-graphs, which make the multiple data communications between sub-graphs turn into one time communication.The SCC based algorithm in this paper can reduce not only the communication amount between SCCs but also the storage I/O frequency, while keeping high algorithm accuracy.The experiments demonstrate that the algorithms proposed in this paper can make obvious improvements for the PPR computation efficiency.
What problem does this paper attempt to address?