An Experimental GPU Global Memory Performance Estimation and Optimization

Junfeng Zhu,Gang Chen,Keliang Zhang,Baifeng Wu
DOI: https://doi.org/10.1109/icsai.2012.6223155
2012-01-01
Abstract:The enormous computational power available in modern graphics processing units (GPUs) has enabled the widely use of them for general-purpose applications. However, manual development of high-performance parallel codes for GPUs is still very challenging. In order for improving GPGPU application performance by efficiently using GPU global memory, we extend the polyhedral model to capture memory access patterns inside the source programs. We determine the global memory accesses are coalesced or not. We also estimate the memory performance of a GPGPU kernel, with the purpose of eliminating the uncoalesced global memory accesses. Experimental results show that that the present global memory performance model can estimate the global memory performance of these two applications relative accurately and the present global memory optimization methods can significantly improve performance.
What problem does this paper attempt to address?