Enhancing Graph Random Walk Acceleration via Efficient Dataflow and Hybrid Memory Architecture

Yingxue Gao,Teng Wang,Lei Gong,Chao Wang,Yiqing Hu,Yi Yang,Zhongming Liu,Xi Li,Xuehai Zhou
DOI: https://doi.org/10.1109/tc.2023.3347674
IF: 3.183
2024-02-13
IEEE Transactions on Computers
Abstract:Graph random walk sampling is becoming increasingly important with the widespread popularity of graph applications. It aims to capture the desirable graph properties by launching multiple walkers to collect feature paths. However, previous research suffers long sampling latency and severe memory access bottlenecks due to intrinsic data dependency and skewed vertex distribution. Thus, in this paper, we propose FastRW, a dedicated accelerator to boost graph random walk operation on FPGAs. Specifically, FastRW first integrates multiple parallel processing engines to achieve data-level parallelism, where each processing engine also leverages dataflow scheduling to resolve data dependency and hide long sampling latency. Secondly, FastRW leverages a combination of multiple storage resources to implement a hybrid memory architecture adapted to skewed vertex distribution. By integrating the above optimizations, FastRW develops a performance model to take advantage of the balance between computation parallelism and bandwidth demand. We evaluate FastRW with two classic sampling algorithms on a wide range of real-world graph datasets. The experimental results show that FastRW achieves a speedup of 37.52 × on average over the system running on two 8-core Intel CPUs. FastRW also achieves an average of 28.04 × speedup over the architecture implemented on V100 GPU.
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?