Eliminating Data Processing Bottlenecks in GNN Training over Large Graphs via Two-level Feature Compression

Yuxin Ma,Ping Gong,Tianming Wu,Jiawei Yi,Chengru Yang,Cheng Li,Qirong Peng,Guiming Xie,Yongcheng Bao,Haifeng Liu,Yinlong Xu
DOI: https://doi.org/10.14778/3681954.3681968
IF: 2.5
2024-07-01
Proceedings of the VLDB Endowment
Abstract:Training GNNs over large graphs faces a severe data processing bottleneck, involving both sampling and feature loading. To tackle this issue, we introduce F 2 CGT, a fast GNN training system incorporating feature compression. To avoid potential accuracy degradation, we propose a two-level, hybrid feature compression approach that applies different compression methods to various graph nodes. This differentiated choice strikes a balance between rounding errors, compression ratios, model accuracy loss, and preprocessing costs. Our theoretical analysis proves that this approach offers convergence and comparable model accuracy as the conventional training without feature compression. Additionally, we also co-design the on-GPU cache sub-system with compression-enabled training within F 2 CGT. The new cache sub-system, driven by a cost model, runs new cache policies to carefully choose graph nodes with high access frequencies, and well partitions the spare GPU memory for various types of graph data, for improving cache hit rates. Finally, extensive evaluation of F 2 CGT on two popular GNN models and four datasets, including three large public datasets, demonstrates that F 2 CGT achieves a compression ratio of up to 128 and provides GNN training speedups of 1.23-2.56× and 3.58--71.46× for single-machine and distributed training, respectively, with up to 32 GPUs and marginal accuracy loss.
computer science, information systems, theory & methods
What problem does this paper attempt to address?