An Efficient Mapping Approach To Large-Scale Dnns On Multi-Fpga Architectures

Wentai Zhang,Jiaxi Zhang,Minghua Shen,Guojie Luo,Nong Xiao
DOI: https://doi.org/10.23919/DATE.2019.8715174
2019-01-01
Abstract:FPGAs are very attractive to accelerate the deep neural networks (DNNs). While single FPGA can provide good performance for small-scale DNNs, support for large-scale DNNs is limited due to higher resource demand. In this paper, we propose an efficient mapping approach for accelerating largescale DNNs on asymmetric multi-FPGA architectures. In this approach, the neural network mapping can be formulated as a resource allocation problem. We design a dynamic programming-based partitioning to solve this problem optimally. Experimental results using the large-scale ResNet-152 demonstrate that our deploys sixteen FPGAs to provide an advantage of 16.4x GOPS over the state-of-the-art work.
What problem does this paper attempt to address?