An Efficient Parallel CNN Inference Framework for Multi-zone Processor

Jie Chen,Zhong Liu
DOI: https://doi.org/10.1109/HPCC-DSS-SmartCity-DependSys57074.2022.00210
2022-01-01
Abstract:Deep convolutional neural networks have been widely used in various fields, and how to accelerate and optimize the performance of neural network models according to the architectural characteristics of accelerator hardware has become the research hot spot. We propose and design an efficient parallel convolutional neural network inference framework named PCIF for multi-zone processor. Its core is a multi-core parallel and vectorized C++ operator library. PCIF has designed optimized data layout and transmission strategies, layer fusion and multi-image fusion methods, which can automatically parse and efficiently map existing neural network models to multi-core vector accelerators to accelerate inference computation. Users can also build new neural network models through the C++ and Python APIs provided by PCIF to achieve high performance inference computing. The results of the experiment show that the AlexNet, VGG-16, GoogleNet, and ResNet-50 network models achieve efficient computing performance using PCIF.
What problem does this paper attempt to address?