OnFlyP:An Online Distributed Partition Algorithm for Large Scale Graphs Based on Edge-Exchange Model

Zhi-Gang WANG,Yu GU,Yu-Bin BAO,Ge YU
DOI: https://doi.org/10.11897/SP.J.1016.2015.01838
2015-01-01
Chinese Journal of Computers
Abstract:With the arrival of the big data era,the iterative computation of large graphs on cloud computing environments has attracted a lot of attention as a new hot topic.However,the overall computation performance greatly depends on graph partition methods in terms of improving the executing efficiency and reducing the number of communication edges among different subgraphs. Although a large of efforts have been made to tackle this issue,such as offline or online partition methods, the requirements of the two aforementioned aspects are hard to be satisfied simultaneously.This paper proposes the concept of “cluster coefficient”and then analyzes the locality of vertex distribution for real-world graphs.Accordingly,an online distributed partition algorithm (OnFlyP )based on a directional edge-exchange model is presented to efficiently support iterative computations.For a specific iterative algorithm,OnFlyP can be executed with high efficiency during the phase of loading data.Meanwhile,it greatly reduces the communication edge scale by exchanging edges.OnFlyP employs the real-time control policy or the minimum symmetric matrix policy,to implement the load balance among subgraphs.They respectively focus on the high-efficiency and the effect of reducing the communication edge scale,and are suitable for different real applications.Finally,extensive experiments on various real-world graphs validate the effectiveness of OnFlyP.
What problem does this paper attempt to address?