ConvCloud: an Adaptive Convolutional Neural Network Accelerator on Cloud FPGAs.

Yang,Chao Wang,Lei Gong,Xuehai Zhou
DOI: https://doi.org/10.1145/3373087.3375387
2020-01-01
Abstract:Among all the neural network specialized hardware accelerators like the Application-Specific-Integrate-Circuit(ASIC), an FPGA accelerator stands out for its flexibility, short time-to-market, and energy efficiency. However, when it comes to multitasking and high-speed requirements or realtime and power-efficient scenarios (e.g., UAVs, self-driving cars, and IoT devices), a single-board FPGA accelerator has difficulties in achieving excellent performance. Therefore, Cloud FPGAs(Multi-FPGAs) will be a significant role in high-performance and energy-efficient computation of CNNs for both mobile and cloud computing domains. In this work, we propose an adaptive neural network accelerator on Cloud FPGAs, using multi-FPGA design to satisfy multitasking and high-speed requirements or realtime and power-efficient scenarios. We adopt the roofline model to figure out the optimal configuration of each CNN layer. And a layer clustering algorithm and a layer sequence detection method are proposed to transform CNN models into layer sequences for mapping the CNN model layers efficiently to different FPGA boards. Then, we built an adaptive CNN mapping method of Multi-FPGA chips for CNN models. Preliminary results on the Multi-FPGAs platform demonstrate that our accelerator can improve the performance significantly due to the adaptive mapping method.
What problem does this paper attempt to address?