A FPGA-based Hardware Accelerator for Multiple Convolutional Neural Networks

Yuchen Yao,Qinghua Duan,Zhiqian Zhang,Jiabao Gao,Jian Wang,Meng Yang,Xinxuan Tao,Jinmei Lai
DOI: https://doi.org/10.1109/icsict.2018.8565657
2018-01-01
Abstract:Convolution Neural Network (CNN) has been widely used in many computer vision tasks. Due to the rapid growth of CNN, the accelerator that only supports single network could not meet the requirement of application. Based on the work of ZynqNet, which is a dedicated CNN accelerator, in this paper, we propose a FPGA-based CNN accelerator which supports the acceleration of multiple networks, and present an automatic mapping flow in which users only need to provide network description files and test image to accelerate a specified network. And we adopt a dynamic fixed-point quantization strategy to reduce resource consumption. Experimental results shows the performance density and power efficiency of our design can reach 0.054GOPS/DSP and 5.24GOPS/W respectively when accelerating SqueezeNet.
What problem does this paper attempt to address?