Implementing Communication-Optimal Parallel and Sequential QR Factorizations

James Demmel,Laura Grigori,Mark Hoemmen,Julien Langou
DOI: https://doi.org/10.48550/arXiv.0809.2407
2008-09-15
Abstract:We present parallel and sequential dense QR factorization algorithms for tall and skinny matrices and general rectangular matrices that both minimize communication, and are as stable as Householder QR. The sequential and parallel algorithms for tall and skinny matrices lead to significant speedups in practice over some of the existing algorithms, including LAPACK and ScaLAPACK, for example up to 6.7x over ScaLAPACK. The parallel algorithm for general rectangular matrices is estimated to show significant speedups over ScaLAPACK, up to 22x over ScaLAPACK.
Numerical Analysis
What problem does this paper attempt to address?