CUDA-based PCG algorithm optimization for a large sparse matrix

Jingwei Zheng,Xuehui An,Miansong Huang
DOI: https://doi.org/10.16511/j.cnki.qhdxxb.2014.08.001
2014-01-01
Abstract:The GPU (graphics processing unit) has many advantages such as high bandwidth,low cost and fast throughput.The PCG (preconditioned conjugate gradient) algorithm solver was optimized for large sparse matrixes using the CSR (compress spare row)storage format for calculations using CUDA (compute unified device architecture) to solve sparse matrixes.Two parallel optimization strategies were used with memory optimization or data flow optimization.This paper describes the optimized process to calculate the vector inner product of a sparse matrix and a vector on a GPU.Solutions of a sparse matrix on a practical model of a hydraulic tunnel on a CPU and a GPU showed that the GPU was more than 13 times faster than that CPU.Thus,this optimized algorithm provides fast solutions to large sparse matrix problems.
What problem does this paper attempt to address?