Design and Implementation of a FPGA-based Accelerator for Convolutional Neural Networks

Bang ZHANG,Jinmei LAI
DOI: https://doi.org/10.15943/j.cnki.fdxb-jns.2018.02.012
2018-01-01
Abstract:In this paper,a convolutional neural network(CNN)acceleration method based on field programmable gate array(FPGA)is proposed,which aims to accelerate the calculation of CNN in resource and power limited platform.Firstly,we used the data quantization to convert parameters from the floating-point into fixed-point ones,which improves the hardware efficiency;Secondly,we proposed a system architecture that initiates data transaction from the FPGA side,which avoids the performance degradation caused by the processor's frequent configuration.At last,we proposed an efficient processing element and data buffer for the CNN calculation,which improves the computational efficiency.In this paper,we implement all the method proposed above based on a CNN targeting at traffic sign recognition(TSR).Test result shows that the hardware implementation introduces a 0.6% accuracy loss with 49ms recognition delay,at which a single multiplier contributes 0.081GOPS throughput and the performance power ratio reaches 6.81GOPS/W.Compared with other works related in recent years,it can be seen that the proposed method can provide higher performance in the case of limited resources and power.
What problem does this paper attempt to address?