TTNNM: Thermal- and Traffic-Aware Neural Network Mapping on 3D-Noc-based Accelerator
Xinyi Li,Wenjie Fan,Heng Zhang,Jinlun Ji,Tong Cheng,Shiping Li,Li,Yuxiang Fu
DOI: https://doi.org/10.1145/3649476.3658703
2024-01-01
Abstract:3D Network on Chips (3D-NoCs) have ample on-chip wiring resources and high bandwidth, yet face numerous hotspots and higher temperature gradients due to increased integration and power density. This could lead to device failure, impacting system stability. Our paper introduces a thermal- and traffic-aware mapping method for 3D-NoC-based neural network accelerators. Firstly, based on the average load of different neural network layer, we determine their mapping sequences and suitable dies. Secondly, to minimize delay and alleviate hotspot temperatures, we allocate groups to appropriate nodes. Compared with previous works, TTNNM reduces the average temperature by 3.0 degrees C, 2.2 degrees C, 2.4 degrees C, temperature variance by 58.4%, 64.8%, 73.0%, maximum temperature by 9.3 degrees C, 7.9 degrees C, 12.0 degrees C, and packet latency by 31.7%, 17.2%, 25.1%.