Combination of In-Memory Graph Computation with MapReduce: A Subgraph-Centric Method of PageRank

Qiuhong Li, Wei Wang, Peng Wang, Ke Dai, Zhihui Wang, Yang Wang, Weiwei Sun
DOI: https://doi.org/10.1007/978-3-642-38562-9_18
2013-01-01
Abstract: To improve the efficiency of the PageRank algorithm, parallelization methods, especially those based on MapReduce, have attracted many researchers over the past several years. Previous implementations of PageRank on MapReduce ignore locality in distributed systems, which is very important for reducing I/O and network costs. In this paper, we explore the locality property and propose a new method for fast PageRank computation that supports a subgraph as the input record for map functions. Graph partitioning techniques and a message grouping method are employed to ensure efficient communication among different subgraphs. Experiments show that our method is significantly more efficient than previous approaches without loss of accuracy. The key idea of changing the granularity of basic processing units from edges to subgraphs can benefit many other parallel graph-processing algorithms.
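The subgraph-as-record idea can be illustrated with a small single-process sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so the function names (`map_subgraph`, `pagerank`), the damping factor of 0.85, the dangling-vertex handling, and the one-iteration-per-round schedule are all assumptions made for illustration. Each map call receives a whole subgraph rather than a single edge, and contributions are summed per destination vertex and grouped by destination partition before being emitted, which stands in for the message grouping step.

```python
from collections import defaultdict

DAMPING = 0.85  # assumed value; the abstract does not state a damping factor


def map_subgraph(subgraph, ranks, owner):
    """Map over one subgraph record (hypothetical helper).

    subgraph: dict mapping each vertex in this partition to its out-neighbours
    ranks:    dict mapping vertex -> current rank
    owner:    dict mapping vertex -> id of the partition that holds it
    Returns {destination partition: {destination vertex: summed contribution}}.
    """
    grouped = defaultdict(lambda: defaultdict(float))
    for v, outs in subgraph.items():
        if not outs:
            continue  # dangling vertices are handled in the driver loop
        share = ranks[v] / len(outs)
        for u in outs:
            # Contributions to the same vertex are combined in memory,
            # so one message per destination vertex leaves the subgraph.
            grouped[owner[u]][u] += share
    return grouped


def pagerank(partitions, iterations=50):
    """Run synchronous PageRank rounds over partitioned subgraphs.

    partitions: dict mapping partition id -> subgraph (see map_subgraph).
    Every edge target must appear as a vertex in some partition.
    """
    owner = {v: pid for pid, sub in partitions.items() for v in sub}
    n = len(owner)
    ranks = {v: 1.0 / n for v in owner}
    for _ in range(iterations):
        # Rank held by vertices without out-edges is spread uniformly.
        dangling = sum(
            ranks[v]
            for sub in partitions.values()
            for v, outs in sub.items()
            if not outs
        )
        incoming = defaultdict(float)
        for sub in partitions.values():            # "map" phase, one record per subgraph
            for msgs in map_subgraph(sub, ranks, owner).values():
                for u, contribution in msgs.items():  # "reduce" phase
                    incoming[u] += contribution
        ranks = {
            v: (1 - DAMPING) / n + DAMPING * (incoming[v] + dangling / n)
            for v in owner
        }
    return ranks


if __name__ == "__main__":
    # Toy graph split into two subgraphs (partition ids 0 and 1).
    toy = {0: {"a": ["b", "c"], "b": ["c"]}, 1: {"c": ["a"], "d": ["c"]}}
    for vertex, rank in sorted(pagerank(toy).items()):
        print(vertex, round(rank, 4))
```

Because every round in this sketch performs the same update as the standard power iteration, the grouping changes only how many messages cross partition boundaries, not the resulting ranks. A real deployment would additionally need a graph partitioner and a custom input format that delivers one subgraph per map call.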