Automatic Generation and Optimization Framework of NoC-Based Neural Network Accelerator Through Reinforcement Learning

Yongqi Xue,Jinlun Ji,Xinming Yu,Shize Zhou,Siyue Li,Xinyi Li,Tong Cheng,Shiping Li,Kai Chen,Zhonghai Lu,Li,Yuxiang Fu
DOI: https://doi.org/10.1109/tc.2024.3441822
IF: 3.183
2024-01-01
IEEE Transactions on Computers
Abstract:Choices of dataflows, which are known as intra-core neural network (NN) computation loop nest scheduling and inter-core hardware mapping strategies, play a critical role in the performance and energy efficiency of NoC-based neural network accelerators. Confronted with an enormous dataflow exploration space, this paper proposes an automatic framework for generating and optimizing the full-layer-mappings based on two reinforcement learning algorithms including A2C and PPO. Combining soft and hard constraints, this work transforms the mapping configuration into a sequential decision problem and aims to explore the performance and energy efficient hardware mapping for NoC systems. We evaluate the performance of the proposed framework on 10 experimental neural networks. The results show that compared with the direct-X mapping, the direct-Y mapping, GA-base mapping, and NN-aware mapping, our optimization framework reduces the average execution time of 10 experimental NNs by 9.09%, improves the throughput by 11.27%, reduces the energy by 12.62%, and reduces the time-energy-product (TEP) by 14.49%. The results also show that the performance enhancement is related to the coefficient of variation of the neural network to be computed.
What problem does this paper attempt to address?