LRAMM -- Low precision approximates GEMM via RSVD

Hongyaoxing Gu
2024-05-27
Abstract:Matrix multiplication computation acceleration has been a research hotspot across various domains. Due to the characteristics of some applications, approximate matrix multiplication can achieve significant performance improvements without losing much precision. In this paper, we propose LRAMM - a high-performance matrix multiplication approximation algorithm that combines mixed-precision quantized matrix multiplication with RSVD techniques, further enhancing efficiency within the error range of low-precision matrix multiplication by utilizing matrix low-rank decomposition technology.
Numerical Analysis,Performance
What problem does this paper attempt to address?