Accelerated Spark Streaming Based on FPGA

Weiren Song, Jia You, Xisheng Li
DOI: https://doi.org/10.1109/cac57257.2022.10055892
2022-01-01
Abstract: Most big data computing frameworks run on CPUs. For compute-intensive tasks, CPUs incur high latency and cannot meet real-time demands, so using hardware accelerators to improve the performance of big data computing frameworks has attracted considerable attention. This paper proposes an FPGA-based accelerated Spark Streaming system (ASSF). It uses PCIe and the OpenCL standard for heterogeneous systems to achieve lightweight integration, optimizes data transmission through pointer mapping and batch transmission, and optimizes the OpenCL programming framework and kernel functions to increase parallelism on the FPGA. Experimental results for matrix multiplication show that latency is greatly reduced, throughput is improved, and CPU occupancy is reduced. The proposed method can therefore effectively improve the overall performance of a Spark Streaming system and has strong generality.