Analyzing and Improving Hardware Modeling of Accel-Sim

Rodrigo Huerta,Mojtaba Abaie Shoushtary,Antonio González
2024-01-18
Abstract:GPU architectures have become popular for executing general-purpose programs. Their many-core architecture supports a large number of threads that run concurrently to hide the latency among dependent instructions. In modern GPU architectures, each SM/core is typically composed of several sub-cores, where each sub-core has its own independent pipeline.
Hardware Architecture
What problem does this paper attempt to address?