Performance Tuning and Analysis for Stencil-Based Applications on POWER8 Processor.

Jingheng Xu,Haohuan Fu,Wen Shi,Lin Gan,Yuxuan Li,Wayne Luk,Guangwen Yang
DOI: https://doi.org/10.1145/3264422
2019-01-01
Abstract:This article demonstrates an approach for combining general tuning techniques with the POWER8 hardware architecture through optimizing three representative stencil benchmarks. Two typical real-world applications, with kernels similar to those of the winning programs of the Gordon Bell Prize 2016 and 2017, are employed to illustrate algorithm modifications and a combination of hardware-oriented tuning strategies with the application algorithms. This work fills the gap between hardware capability and software performance of the POWER8 processor, and provides useful guidance for optimizing stencil-based scientific applications on POWER systems.
What problem does this paper attempt to address?