A High Performance FPGA-based Accelerator for MobileNet

Xiao Zhang,Jun Yang
DOI: https://doi.org/10.1109/icinc58035.2022.00024
2022-01-01
Abstract:There are many problems in the existing FPGA implementation version of the current lightweight convolutional neural network model MobileNet. For example, the computational latency between depthwise convolution and point convolution is too long, and the frequent communication between the processor and the FPGA leads to low computational efficiency and low resource utilization. This paper proposes an FPGA optimization design for MobileNet, which effectively improves the real-time performance of the system and the resource utilization of the hardware acceleration unit.
What problem does this paper attempt to address?