Design Exploration of Multi-FPGAs for Accelerating Deep Learning

Teng Wang, Lei Gong, Chao Wang, Xuehai Zhou, Huaping Chen
DOI: https://doi.org/10.1109/cluster.2019.8891044
2019-01-01
Abstract: Owing to their low power consumption and reconfigurability, FPGAs are increasingly used to accelerate computation, including deep learning. However, because hardware resources are limited, a single-FPGA accelerator cannot configure optimal parameters for each layer, and its performance is further constrained by memory bandwidth. To accelerate neural network computation, this work designs a computation module and, building on that module, optimizes the data transmission path between multiple FPGAs, achieving nearly linear performance growth with the number of FPGAs.