Fast Triangle Counting on GPU

Chuangyi Gui,Long Zheng,Pengcheng Yao,Hai Jin
DOI: https://doi.org/10.1109/hpec.2019.8916216
2019-01-01
Abstract:Triangle counting is one of the most basic graph applications to solve many real-world problems in a wide variety of domains. Exploring the massive parallelism of the Graphics Processing Unit (GPU) to accelerate the triangle counting is prevail. We identify that the stat-of-the-art GPU-based studies that focus on improving the load balancing still exhibit inherently a large number of random accesses in degrading the performance. In this paper, we design a prefetching scheme that buffers the neighbor list of the processed vertex in advance in the fast shared memory to avoid high latency of random global memory access. Also, we adopt the degree-based graph reordering technique and design a simple heuristic to evenly distribute the workload. Compared to the state-of-the-art HEPC Graph Challenge Champion in the last year, we advance to improve the performance of triangle counting by up to 5.9× speedup with> 109 TEPS on a single GPU for many large real graphs from graph challenge datasets.
What problem does this paper attempt to address?