Abstract:Large scale iterative graph computation presents an interesting systems challenge due to two well known problems: (1) the lack of access locality and (2) the lack of storage efficiency. This paper presents PathGraph, a system for improving iterative graph computation on graphs with billions of edges. First, we improve the memory and disk access locality for iterative computation algorithms on large graphs by modeling a large graph using a collection of tree-based partitions. This enables us to use path-centric computation rather than vertex-centric or edge-centric computation. For each tree partition, we re-label vertices using DFS in order to preserve consistency between the order of vertex ids and vertex order in the paths. Second, a compact storage that is optimized for iterative graph parallel computation is developed in the PathGraph system. Concretely, we employ delta-compression and store tree-based partitions in a DFS order. By clustering highly correlated paths together as tree based partitions, we maximize sequential access and minimize random access on storage media. Third but not the least, our path-centric computation model is implemented using a scatter/gather programming model. We parallel the iterative computation at partition tree level and perform sequential local updates for vertices in each tree partition to improve the convergence speed. To provide well balanced workloads among parallel threads at tree partition level, we introduce the concept of multiple stealing points based task queue to allow work stealings from multiple points in the task queue. We evaluate the effectiveness of PathGraph by comparing with recent representative graph processing systems such as GraphChi and X-Stream etc. Our experimental results show that our approach outperforms the two systems on a number of graph algorithms for both in-memory and out-of-core graphs. While our approach achieves better data balance and load balance, it also shows better speedup than the two systems with the growth of threads.

XTree: Traversal-Based Partitioning for Extreme-Scale Graph Processing on Supercomputers

FT-topo: Architecture-Driven Folded-Triangle Partitioning for Communication-efficient Graph Processing

Scaling Graph Traversal to 281 Trillion Edges with 40 Million Cores

Superblock: An Application-Aware Dynamic Partition Strategy for Large-Scale Graph

Scalable Graph Traversal on Sunway TaihuLight with Ten Million Cores

Divide & Conquer: I/O Efficient Depth-First Search

A Feasible Graph Partition Framework for Parallel Computing of Big Graph

PathGraph: A Path Centric Graph Processing System

How to Partition a Billion-Node Graph

GridGraph: Large-Scale Graph Processing on a Single Machine Using 2-Level Hierarchical Partitioning

GraphCube: Interconnection Hierarchy-aware Graph Processing.

Reducing Communication in Parallel Breadth-First Search on Distributed Memory Systems

VPC: Pruning Connected Components Using Vector-Based Path Compression for Graph500

3-D Partitioning for Large-Scale Graph Processing.

Understanding Parallelism in Graph Traversal on Multi-Core Clusters

Practical and high-quality partitioning algorithm for large-scale and time-evolving graphs

K-Core Decomposition on Super Large Graphs with Limited Resources

Edge Cluster Based Large Graph Partitioning and Iterative Processing in BSP

A parallel graph partitioning algorithm to speed up the large-scale distributed graph mining.

A Feasible Graph Partition Framework for Random Walks Implemented by Parallel Computing in Big Graph

Optimal Representation of Large-Scale Graph Data Based on K2-Tree