A Compressed Data Partition and Loop Scheduling Scheme for Neural Networks

Dejian Li,Rongqiang Fang,Jing Wang,Dongyan Zhao,Ting Chong,Zengmin Ren,Jun Ma
DOI: https://doi.org/10.1109/access.2022.3204038
IF: 3.9
2022-09-21
IEEE Access
Abstract:Neural networks (NNs) have been widely adopted in various application domains. Deeper NNs greatly enhance the output accuracy, but complex NNs with more parameters incur intensive memory accesses, and the data usually need to be partitioned since it may exceed the on-chip storage. However, there is no research considering the partition and scheduling co-design of the NNs. In this paper, we propose a sparse NN data partition and loop scheduling scheme. We establish the compression efficiency model of the matrix sparse algorithm and design a partition selection method based on sparsity characteristics analyzed by the compression efficiency model. Further, we design a loop scheduling scheme based on the proper partition size. The experiment results show that the average memory access of each layer can be compressed to 68% of the original, and the throughput of the AlexNet, VGG and VGG19 is increased to an average of 1.66 times.
computer science, information systems,telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?