Bit-Width Based Resource Partitioning for CNN Acceleration on FPGA.

Jianxin Guo,Shouyi Yin,Peng Ouyang,Leibo Liu,Shaojun Wei
DOI: https://doi.org/10.1109/fccm.2017.13
2017-01-01
Abstract:Convolutional neural networks (CNNs) have achieved great success in many applications. Recently, various FPGA-based accelerators have been proposed to improve the performance of CNNs. However, current most FPGA-based methods use single bit-width selection for all CNN layers, which lead to very low resource utilization efficiency and difficulty in further performance improvement. In this paper, we propose a new approach utilizing bit-width partitioning of FPGA DSP resources to improve the performance and resource utilization efficiency of CNN accelerator. Moreover, we use optimization approach to find the optimal allocation plan for DSP resources. On a Xilinx Virtex-7 FPGA, our design approach achieves performance over the state-of-the-art FPGA-based CNN accelerators from 5.48x to 7.25x and by 6.21x on average, when we evaluate the popular CNNs.
What problem does this paper attempt to address?