Abstract:This thesis is concerned with two classical topics in matrix computations : The QR algorithm for solving nonsymmetric eigenvalue problems and the computation of matrix exponentials for two types of structured matrices. We focus on the performance in the former topic and on accuracy in the latter one. For computing all eigenvalues of a non-Hermitian matrix, the QR algorithm which iteratively computes a Schur decomposition of the matrix is the method of choice. We present a new parallel implementation of the multishift QR algorithm targeting distributed memory architectures. Starting from recent developments of the parallel multishift QR algorithm, we propose a number of algorithmic and implementation improvements. Guidelines concerning several important tunable algorithmic parameters are also provided. Numerous computational experiments confirm that our new implementation significantly outperforms previous parallel implementations of the QR algorithm. The computation of the exponential of a square matrix is also an important task in matrix computations. For a general dense matrix, the scaling and squaring method coupled with Pade approximation is the most popular approach. However, for an essentially nonnegative matrix (a real square matrix with nonnegative off-diagonal entries), truncated Taylor series rather than Pade approximation is preferred to achieve componentwise accuracy in the matrix exponential. We propose a method which efficiently computes all entries of the exponential of an essentially nonnegative matrix to high relative accuracy. Truncation and rounding error bounds, as well as numerical experiments demonstrate the efficiency and accuracy of our method. When the matrix is banded, the entries of its matrix exponential decay exponentially away from the main diagonal. We analyze the decay property for the exponentials of several classes of doubly-infinite skew-Hermitian matrices. Then finite section methods based on the decay property are established. We also propose a repeated doubling strategy which works well even when a priori error estimates are pessimistic or not easy to compute. Finally, numerical experiments are presented to illustrate the effectiveness of the finite section method.

High-Performance Computation of the Exponential of a Large Sparse Matrix

Efficient scaling and squaring method for the matrix exponential

On the finite section method for computing exponentials of doubly-infinite skew-Hermitian matrices

Dense and Structured Matrix Computations —the Parallel QR Algorithm and Matrix Exponentials

Low-synchronization Arnoldi Methods for the Matrix Exponential with Application to Exponential Integrators

Computing exponentials of essentially non-negative matrices entrywise to high relative accuracy.

Computation of the exponential function of matrices by a formula without oscillatory integrals on infinite intervals

Performance Optimization for Sparse A(T)Ax in Parallel on Multicore Cpu

Orthogonal layers of parallelism in large-scale eigenvalue computations

A New Sparse Matrix Vector Multiplication GPU Algorithm Designed for Finite Element Problems

Aggressively Truncated Taylor Series Method For Accurate Computation Of Exponentials Of Essentially Nonnegative Matrices

Increasing the Efficiency of Sparse Matrix-Matrix Multiplication with a 2.5D Algorithm and One-Sided MPI

Avoiding communication in sparse matrix computations

A Communication-Avoiding Parallel Algorithm for the Symmetric Eigenvalue Problem

Computing Functions of Symmetric Hierarchically Semiseparable Matrices

Parallel Sparse Matrix Multiplication for Preconditioning and SSTA on a Many-Core Architecture

Approximating Element-Wise Functions of Matrix with Improved Streaming Randomized SVD

Accelerating Convergence by Augmented Rayleigh-Ritz Projections for Large-Scale Eigenpair Computation

An Algorithmic Framework for Efficient Large-Scale Circuit Simulation Using Exponential Integrators

An efficient sparse stiffness matrix vector multiplication using compressed sparse row storage format on AMD GPU

On Parallelizing Matrix Multiplication by the Column-Row Method