Accelerating Graph Convolutional Networks Through a PIM-Accelerated Approach

Hai Jin,Dan Chen,Long Zheng,Yu Huang,Pengcheng Yao,Jin Zhao,Xiaofei Liao,Wenbin Jiang
DOI: https://doi.org/10.1109/tc.2023.3257514
IF: 3.183
2023-01-01
IEEE Transactions on Computers
Abstract:Graph convolutional networks (GCNs) are promising to enable machine learning on graph data. GCNs show potential vertex-level and intra-vertex parallelism for GPU acceleration, but their irregular memory accesses arising in aggregation operations and the inherent sparsity for vertex features of graphs cause inefficiencies on the GPU. In this paper, we present gPIM, which aims to accelerate GCNs inference through a processing-in-memory (PIM) enabled architecture. gPIM is expected to perform compute-intensive combination on the GPU while aggregation and memory-bound combination are offloaded to the PIM-featured hybrid memory cubes (HMCs). To maximize the efficiency of such GPU-HMC architecture, gPIM is novel with two key designs: 1) A GCN-induced graph partitioning that minimizes communication overheads between cubes, 2) A programmer-transparent performance estimation mechanism that predicts the performance bound of operations accurately for workload offloading. Experimental results show that gPIM significantly outperforms Intel Xeon E5-2680v3 CPU (8,979.52×), NVIDIA Tesla V100 GPU (96.01×), and a state-of-the-art GCN accelerator AWB-GCN (4.18×).
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?