Performance Optimization Strategies of High Performance Computing on GPU

Anguo Ma,Jing Cai,Yu Cheng,Xiaoqiang Ni,Yuxing Tang,Zuocheng Xing
DOI: https://doi.org/10.1007/978-3-642-03644-6_12
2009-01-01
Abstract:Recently GPU is widely utilized in scientific computing and engineering applications, owing primarily to the evolution of GPU architecture. Firstly, we analyze some key performance characters of GPU in detail, and the relationships among GPU architecture, programming model and memory hierarchy. Secondly, we present three performance optimization strategies: Prefetching, Streamlizing, and Task Division. Adequate experiments have been done to abstract the relationships among different factors and efficiency. Finally, we map the HPL benchmark to testify our strategies and achieve certain speedup.
What problem does this paper attempt to address?