Dynamic Precision-Scalable Thermal Mapping Algorithm for Three Dimensional Systolic-Array Based Neural Network Accelerator

Shu-Yen Lin,Chun-Kuan Tsai,Wen-Chun Kao
DOI: https://doi.org/10.1109/tce.2024.3378706
2024-01-01
IEEE Transactions on Consumer Electronics
Abstract:nowadays, the systolic-array based accelerator has been used widely for the neural-network applications. Multiple systolic-array based accelerator chips can be stacked by the 3D IC technology to improve the performance of the neural-network applications. However, the 3D accelerator increases the power density and causes the overheating. To avoid the overheating, the sacrifice of the performance for the 3D accelerator under the thermal limitations is important. In this work, a dynamic precision-scalable thermal mapping algorithm (DPSTM) is proposed to change the active processing elements with different data precisions in the 3D accelerators dynamically. The goal is to control the power density and peak temperature of the 3D accelerator. Compared with the related works, DPSTM can reduce 29%-77% and 7%-73% latencies in AlexNet and ResNet-18 with 92-95C thermal limitations.
telecommunications,engineering, electrical & electronic
What problem does this paper attempt to address?