Research on Embedded Deployment Optimization Method of Neural Network.

Jin Liu, Changbo Hou, Hao Meng, Keyong Wang, Qianyi Zhao, Ruiqi Wang, Mohan Ma, Bin Wang
DOI: https://doi.org/10.1145/3594300.3594313
2023-01-01
Abstract: Neural networks are increasingly used in industrial production. Traditional deployment platforms based on central processing units (CPUs) and graphics processing units (GPUs) are bulky and consume a lot of power. Platforms based on Advanced RISC Machines (ARM) processors are easy to deploy but offer limited computing power. To address these problems, this paper builds on the Zynq embedded platform and exploits the parallel computing capability of its Field Programmable Gate Array (FPGA) side to design a fully connected neural network (FCNN) and a convolutional neural network (CNN). Efficient handwritten digit recognition on the Zynq embedded platform is achieved through High-Level Synthesis (HLS) optimization. Finally, the relationship between performance and hardware resource consumption is compared and analyzed for the two networks.