NVIDIA A100 Tensor Core GPU: Performance and Innovation

Jack Choquette,Wishwesh Gandhi,Olivier Giroux,Nick Stam,Ronny Krashinsky
DOI: https://doi.org/10.1109/mm.2021.3061394
IF: 2.8212
2021-03-01
IEEE Micro
Abstract:NVIDIA A100 Tensor Core GPU is NVIDIA's latest flagship GPU. It has been designed with many new innovative features to provide performance and capabilities for HPC, AI, and data analytics workloads. Feature enhancements include a Third-Generation Tensor Core, new asynchronous data movement and programming model, enhanced L2 cache, HBM2 DRAM, and third-generation NVIDIA NVLink I/O.
computer science, software engineering, hardware & architecture
What problem does this paper attempt to address?