Allok: a machine learning approach for efficient graph execution on CPU–GPU clusters

Marcelo Koji Moori,Hiago Mayk G. de A. Rocha,Arthur F. Lorenzon,Antonio Carlos S. Beck
DOI: https://doi.org/10.1007/s11227-024-06079-9
IF: 3.3
2024-05-25
The Journal of Supercomputing
Abstract:The unprecedented increase in interconnected data has driven the development of efficient graph analytics for extensive data analysis, resulting in improvements across various domains. Prior work has focused on optimizing graph execution for both CPUs and GPUs while overlooking the scalability of graph applications and the selection of an ideal architecture. Thus, we propose Allok, a flexible graph processing framework that aids in selecting the optimal processing architecture (CPU or GPU) for a batch of graph applications while also optimizing number of threads on CPUs. Allok relies solely on high-level graph features to make decisions without the need for further application execution. Our experiments on an HPC system with 4 CPUs and 3 GPUs running 5 algorithms over 25 input graphs show that Allok reduces application execution time by an average of 67.26 and reduces energy consumption and energy-delay product by an average of 18.06 and 1237.96 , respectively.
computer science, theory & methods,engineering, electrical & electronic, hardware & architecture
What problem does this paper attempt to address?