Real-Time Fixed-Point Hardware Accelerator of Convolutional Neural Network on FPGA Based

Bahadir Özkilbaç,Ibrahim Yücel Ozbek,Tevhit Karacali,Bahadir Ozkilbac,Ibrahim Yucel Ozbek
DOI: https://doi.org/10.1109/icci54321.2022.9756093
2022-03-09
Abstract:Convolutional neural networks (CNN), which have the advantage of automatically detecting the important features of the input data without any human interfere, are widely used in many applications such as face recognition, speech recognition, image classification and object detection. In real-time CNN applications, computation speed is very important as well as accuracy. However, in some applications with high computational complexity, available systems are insufficient to meet the high-speed performance demand at low power consumption. In this study, the design of the CNN accelerator hardware in FPGA is presented to meet the speed demand. In this design, CNN is considered as a streaming interface application. Thus, temporary storage amount and memory latency are reduced. Each layer is designed with maximum parallelism, taking advantage of the FPGA. Because fixed-point number representation has the advantage of low latency, it is preferred in design with negligible sacrifice of accuracy. Thus, forward propagation of a CNN can be executed at high speed in FPGA. In order to compare real-time performance, digit classification application is executed in this hardware designed in FPGA and ARM processor on the same chip. The real-time results show that the application in the hardware designed in the FPGA is 30x faster than the ARM processor.
What problem does this paper attempt to address?