GPU Accelerated Parallel Cholesky Factorization

Liang Wang,Yi Sheng Zhang,Bin Zhu,Chi Xu,Xiao Wei Tian,Chao Wang,Jian Hua Mo,Jian Li
DOI: https://doi.org/10.4028/www.scientific.net/amm.148-149.1370
2011-01-01
Applied Mechanics and Materials
Abstract:One of the fundamental problems in scientific computing is to find solutions for linear equation systems. For finite element problem, Cholesky factorization is often used to solve symmetric positive definite matrices. In this paper, Cholesky factorization is massively parallelized and three different optimization methods - highly parallel factorization, tile strategy and memory scheduling are used to accelerate Cholesky factorization effectively. A novel algorithm using OpenCL is implemented. Testing on GPU shows that performance of the algorithm increases with the dimension of matrix, reaching 785.41GFlops, about 50x times speedup. Cholesky factorization is remarkably improved with OpenCL on GPU.
What problem does this paper attempt to address?