Accelerating Biclique Counting on GPU

Linshan Qiu,Zhonggen Li,Xiangyu Ke,Lu Chen,Yunjun Gao
DOI: https://doi.org/10.1109/icde60146.2024.00247
2024-01-01
Abstract:Counting (p,q)-bicliques in bipartite graphs poses a foundational challengewith broad applications, from densest subgraph discovery in algorithmicresearch to personalized content recommendation in practical scenarios. Despiteits significance, current leading (p,q)-biclique counting algorithms fallshort, particularly when faced with larger graph sizes and clique scales.Fortunately, the problem's inherent structure, allowing for the independentcounting of each biclique starting from every vertex, combined with asubstantial set intersections, makes it highly amenable to parallelization.Recent successes in GPU-accelerated algorithms across various domains motivateour exploration into harnessing the parallelism power of GPUs to efficientlyaddress the (p,q)-biclique counting challenge. We introduce GBC (GPU-basedBiclique Counting), a novel approach designed to enable efficient and scalable(p,q)-biclique counting on GPUs. To address major bottleneck arising fromredundant comparisons in set intersections (occupying an average of 90runtime), we introduce a novel data structure that hashes adjacency lists intotruncated bitmaps to enable efficient set intersection on GPUs via bit-wise ANDoperations. Our innovative hybrid DFS-BFS exploration strategy further enhancesthread utilization and effectively manages memory constraints. A composite loadbalancing strategy, integrating pre-runtime and runtime workload allocation,ensures equitable distribution among threads. Additionally, we employ vertexreordering and graph partitioning strategies for improved compactness andscalability. Experimental evaluations on eight real-life and two syntheticdatasets demonstrate that GBC outperforms state-of-the-art algorithms by asubstantial margin. In particular, GBC achieves an average speedup of 497.8x,with the largest instance achieving a remarkable 1217.7x speedup when p = q =8.
What problem does this paper attempt to address?