A Parallel Computing Method Using Blocked Format with Optimal Partitioning for SpMV on GPU

Wangdong Yang, Kenli Li, Keqin Li
DOI: https://doi.org/10.1016/j.jcss.2017.09.010
IF: 1.043
2018-01-01
Journal of Computer and System Sciences
Abstract: For large-scale sparse matrices, SpMV cannot be processed on a GPU using common storage formats because of memory limitations. In addition, general formats parallelize poorly for sparse matrices whose non-zero elements are extremely unevenly distributed, which leads to performance deterioration. This paper presents an optimal partitioning strategy, based on the distribution of non-zero elements in a sparse matrix, to improve the performance of SpMV, and uses a hybrid format, which mixes the CSR and ELL formats, to store the blocks partitioned from the sparse matrix. The hybrid blocked format compresses better and gives a more uniform distribution of non-zero elements, which makes it suitable for more types of sparse matrices. Our partitioning strategy is proven to be optimal, yielding the minimum parallel execution time on GPU. (C) 2017 Elsevier Inc. All rights reserved.
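To make the idea of a hybrid blocked format concrete, the following is a minimal CPU-side sketch in plain Python, not the paper's GPU implementation. It splits a matrix into fixed-size row blocks and stores each block in ELL when little padding would be wasted and in CSR otherwise. The fixed block height, the `max_padding` threshold, and all function names are illustrative assumptions; the paper's optimal partitioning strategy is not reproduced here.

```python
def to_csr(rows):
    """Pack rows (lists of (col, val) pairs) into CSR arrays."""
    indptr, indices, data = [0], [], []
    for row in rows:
        for col, val in row:
            indices.append(col)
            data.append(val)
        indptr.append(len(indices))
    return ("csr", indptr, indices, data)


def to_ell(rows):
    """Pack rows into ELL: every row is padded to the longest row's width."""
    width = max((len(r) for r in rows), default=0)
    # Padding slots use column 0 with value 0.0, so they add nothing to y.
    cols = [[c for c, _ in r] + [0] * (width - len(r)) for r in rows]
    vals = [[v for _, v in r] + [0.0] * (width - len(r)) for r in rows]
    return ("ell", width, cols, vals)


def build_blocks(rows, block_rows, max_padding=0.25):
    """Split into row blocks; pick ELL or CSR per block (assumed heuristic)."""
    blocks = []
    for start in range(0, len(rows), block_rows):
        chunk = rows[start:start + block_rows]
        nnz = sum(len(r) for r in chunk)
        slots = max(len(r) for r in chunk) * len(chunk)
        padding = 1.0 - nnz / slots if slots else 0.0
        block = to_ell(chunk) if padding <= max_padding else to_csr(chunk)
        blocks.append((start, block))
    return blocks


def spmv(blocks, x, n_rows):
    """Compute y = A @ x block by block."""
    y = [0.0] * n_rows
    for start, block in blocks:
        if block[0] == "csr":
            _, indptr, indices, data = block
            for i in range(len(indptr) - 1):
                y[start + i] = sum(
                    data[k] * x[indices[k]]
                    for k in range(indptr[i], indptr[i + 1])
                )
        else:
            _, width, cols, vals = block
            for i in range(len(cols)):
                y[start + i] = sum(
                    vals[i][j] * x[cols[i][j]] for j in range(width)
                )
    return y


# A 4x4 example: rows 0-1 are uniform, rows 2-3 are uneven.
rows = [
    [(0, 1.0), (1, 2.0)],
    [(1, 3.0), (2, 4.0)],
    [(0, 5.0), (1, 6.0), (2, 7.0), (3, 8.0)],
    [(3, 9.0)],
]
x = [1.0, 1.0, 1.0, 1.0]
blocks = build_blocks(rows, block_rows=2)
y = spmv(blocks, x, n_rows=4)
```

In this example the first block has equal row lengths and is stored as ELL, while the second block would need 37.5% padding in ELL and is stored as CSR instead. On a GPU each block would typically be handled by its own kernel launch or thread group, which this sequential sketch does not model.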