An efficient hybrid tridiagonal divide-and-conquer algorithm on distributed memory architectures

Shengguo Li,Francois-Henry Rouet,Jie Liu,Chun Huang,Xingyu Gao,Xuebin Chi
DOI: https://doi.org/10.48550/arXiv.1612.07526
2016-12-22
Abstract:In this paper, an efficient divide-and-conquer (DC) algorithm is proposed for the symmetric tridiagonal matrices based on ScaLAPACK and the hierarchically semiseparable (HSS) matrices. HSS is an important type of rank-structured <a class="link-external link-http" href="http://matrices.Most" rel="external noopener nofollow">this http URL</a> time of the DC algorithm is cost by computing the eigenvectors via the matrix-matrix multiplications (MMM). In our parallel hybrid DC (PHDC) algorithm, MMM is accelerated by using the HSS matrix techniques when the intermediate matrix is large. All the HSS algorithms are done via the package STRUMPACK. PHDC has been tested by using many different matrices. Compared with the DC implementation in MKL, PHDC can be faster for some matrices with few deflations when using hundreds of processes. However, the gains decrease as the number of processes increases. The comparisons of PHDC with ELPA (the Eigenvalue soLvers for Petascale Applications library) are similar. PHDC is usually slower than MKL and ELPA when using 300 or more processes on Tianhe-2 supercomputer.
Mathematical Software,Distributed, Parallel, and Cluster Computing,Numerical Analysis
What problem does this paper attempt to address?