Research on Dynamic Reconfiguration Technology of Neural Network Accelerator Based on Zynq

Hao Lv,Shengbing Zhang,Xiaojian Liu,Shuo Liu,Yongqiang Liu,Wei Han,Shaowei Xu
DOI: https://doi.org/10.1088/1742-6596/1650/3/032093
2020-01-01
Journal of Physics Conference Series
Abstract:Abstract Target detection based on convolutional neural network is a research hotspot in the field of computer vision. Conventional neural network (CNN) accelerators use the time division multiplexing method, and different network layers use the same accelerator, and their adaptability and resource utilization are not high. How to combine the dynamic reconfigurable characteristics of FPGA so that the calculation of each layer can be matched with the corresponding accelerator architecture at the cost of a certain configuration delay, and to improve the utilization efficiency of computing resources is a research hotspot. This article takes the YOLOv2 target detection algorithm widely used in the industry as an example, and uses Xilinx’s Zynq as the platform to describe the process of mapping the CNN model to the FPGA. Combined with the dynamic reconfigurability of FPGA, the calculation of each layer can be matched with the reconstructed accelerator architecture at the cost of a certain configuration delay, and the reconstruction delay can be shared by batch data multiplexing accelerator architecture, which effectively improves In order to improve the accelerator performance, the convolutional layer and the cascaded maximum pooling layer are merged to reduce the memory access delay. Experiments and evaluations were carried out on the accelerator architecture combined with dynamic reconfigurable characteristics, and the performance of 30.35GOP/s was obtained on the Zynq platform. Provide a reference for the application and optimization of CNN on embedded platforms.
What problem does this paper attempt to address?