Thermally Constrained Codesign of Heterogeneous 3-D Integration of Compute-in-Memory, Digital ML Accelerator, and RISC-V Cores for Mixed ML and Non-ML Workloads

Yuan-Chun Luo,Anni Lu,Janak Sharda,Moritz Scherer,Jorge Tomas Gomez,Syed Shakib Sarwar,Ziyun Li,Reid Frederick Pinkham,Barbara De Salvo,Shimeng Yu
DOI: https://doi.org/10.1109/tvlsi.2024.3415481
2024-08-28
IEEE Transactions on Very Large Scale Integration (VLSI) Systems
Abstract:Heterogeneous 3-D (H3D) integration not only reduces the chip form factor and fabrication cost but also allows the merging of diverse compute paradigms that suit different applications. This is especially attractive when modern algorithms, such as the augmented reality/virtual reality (AR/VR) workloads, consist of mixed machine learning (ML) and non-ML workloads. To date, codesign that considers the thermal, latency, and power constraints of H3D hardware is largely unexplored. In this work, a thermally aware framework for H3D hardware design is developed to evaluate the thermal, latency, and power trade-offs for a heterogeneous system with compute-in-memory (CIM), digital ML cores, and RISC-V cores. The framework solves for runtime tunable operating points described as the optimal speedup factor, the number of activated RISC-V cores, the cooling coefficient, and the activity rate based on user-defined criteria, achieving up to 135 TOPS and 215 TOPS/W under C for the AR/VR workloads.
engineering, electrical & electronic,computer science, hardware & architecture
What problem does this paper attempt to address?