EGraph: Efficient Concurrent GPU-based Dynamic Graph Processing

Yu Zhang,Yuxuan Liang,Jin Zhao,Fubing Mao,Lin Gu,Xiaofei Liao,Hai Jin,Haikun Liu,Song Guo,Yangqing Zeng,Hang Hu,Chen Li,Ji Zhang,Biao Wang
DOI: https://doi.org/10.1109/tkde.2022.3171588
IF: 9.235
2022-01-01
IEEE Transactions on Knowledge and Data Engineering
Abstract:In many applications of the analysis of dynamic graph, many Timing iterative Graph Processing (TGP) jobs usually need to be generated for the processing of the corresponding snapshots of the dynamic graph to obtain the results at different points of time. For high throughput of such applications, it is expected to run the TGP jobs on the GPU concurrently. Although many GPU-based systems have been recently developed, for out-of-GPU-memory dynamic graph processing, this concurrent way suffers from significant data access overhead due to a large volume of data transfer between CPU and GPU and the interference between these concurrently running jobs, which eventually incurs low GPU utilization ratio. In this work, we observed that the TGP jobs have strong temporal and spatial similarity when they access different snapshots for their own processing as most parts of the snapshots are the same and only a few parts are changing with time. It creates ideal opportunities for efficient concurrent execution of the TGP jobs by dramatically reducing CPU-GPU graph data transfer cost. Based on this observation, we develop the first GPU-based dynamic graph processing system EGraph, which can be integrated into the existing out-of-GPU-memory static graph processing systems to enable them to efficiently support concurrent execution of TGP jobs on dynamic graphs with the help of GPU accelerators. Different from the existing approaches, we propose in EGraph an effective Loading-Processing-Switching (LPS) execution model. It is able to effectively reduce the overhead of CPU-GPU data transfer and ensures a higher GPU utilization ratio for efficient execution of the TGP jobs by fully utilizing the data access similarity between the TGP jobs. Experimental results show that the existing GPU-accelerated systems achieve performance improvements of 2.3-3.5 times after being integrated with EGraph.
computer science, information systems, artificial intelligence,engineering, electrical & electronic
What problem does this paper attempt to address?