Quantitative GPGPU Performance Model Targeting OpenCL Architecture

ZHU Jun-feng,CHEN Gang,ZHANG Ke-liang,WU Bai-feng
DOI: https://doi.org/10.3969/j.issn.1000-1220.2013.05.033
2013-01-01
Abstract:For the sake of evaluating the potential execution performance of a data-level parallel application that will be parallelized onto GPU architecture,a quantitative GPGPU performance model targeting OpenCL architecture is proposed.The present model embodies various features of the GPU architecture which affect the performance of a GPGPU program such as global memory access,local memory access,overlapping memory access with useful computation,conditional branch divergence and synchronization.By statically analyzing a DLP application with considering of the specific OpenCL execution configuration,the present model can approximately estimate the execution time of a DLP application without the need of writing the actual GPGPU program.Analytical and experimental results for matrix multiplication and parallel prefix-sum on AMD RadeonTM HD 5870 GPU and NVIDIA GeForceTM GTX 280 GPU show that the present model can estimate the execution time of DLP applications relative accurately.
What problem does this paper attempt to address?