Pregel Algorithms for Graph Connectivity Problems with Performance Guarantees

Da Yan, James Cheng, Kai Xing, Yi Lu, Wilfred Ng, Yingyi Bu
DOI: https://doi.org/10.14778/2733085.2733089
IF: 2.5
2014-01-01
Proceedings of the VLDB Endowment
Abstract: Graphs in real-life applications, such as the Web graph and various social networks, are often huge. These massive graphs are typically stored and processed across distributed sites. In this paper, we study graph algorithms that adopt Google's Pregel, an iterative vertex-centric framework for graph processing in the Cloud. We first identify a set of desirable properties of an efficient Pregel algorithm, such as linear space, communication and computation cost per iteration, and a logarithmic number of iterations. We define such an algorithm as a practical Pregel algorithm (PPA). We then propose PPAs for computing connected components (CCs), biconnected components (BCCs) and strongly connected components (SCCs). The PPAs for computing BCCs and SCCs use the PPAs of many fundamental graph problems as building blocks, which are of independent interest. Extensive experiments over large real graphs verify the efficiency of our algorithms.
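To make the vertex-centric model concrete, the following is a minimal single-machine sketch of Pregel-style supersteps, used here to label connected components by minimum-label propagation. It is an illustration only and is not one of the paper's PPAs: this simple scheme can need a number of supersteps proportional to the graph diameter rather than a logarithmic number. The function name `pregel_min_label_cc` and the in-memory message "inbox" are assumptions made for the example, not part of any Pregel API.

```python
from collections import defaultdict


def pregel_min_label_cc(edges, vertices):
    """Simulate Pregel supersteps (hypothetical helper, not a real Pregel API).

    Each vertex keeps the smallest vertex ID it has seen so far. Vertices in
    the same connected component end up with the same label.
    """
    vertices = list(vertices)

    # Build an undirected adjacency list.
    neighbors = defaultdict(set)
    for u, v in edges:
        neighbors[u].add(v)
        neighbors[v].add(u)

    # Initially every vertex is labeled with its own ID.
    label = {v: v for v in vertices}

    # Superstep 0: every vertex sends its ID to all of its neighbors.
    inbox = defaultdict(list)
    for v in vertices:
        for w in neighbors[v]:
            inbox[w].append(label[v])
    supersteps = 1

    # Later supersteps: only vertices that received messages are active.
    while inbox:
        outbox = defaultdict(list)
        for v, messages in inbox.items():
            smallest = min(messages)
            if smallest < label[v]:
                # The label improved, so tell the neighbors about it.
                label[v] = smallest
                for w in neighbors[v]:
                    outbox[w].append(smallest)
            # Otherwise the vertex sends nothing (it "votes to halt").
        inbox = outbox
        supersteps += 1

    return label, supersteps


if __name__ == "__main__":
    labels, steps = pregel_min_label_cc(
        edges=[(1, 2), (2, 3), (4, 5)],
        vertices=[1, 2, 3, 4, 5, 6],
    )
    print(labels, steps)
```

In this sketch, messages sent during one superstep are only read in the next, which mirrors the synchronous iteration the abstract refers to. Per-superstep communication is bounded by the number of edges, but the superstep count is not logarithmic in general, which is the kind of gap the PPA properties are meant to rule out.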