Two Methods for Combining Original Memory Access Coalescing and Equivalent Memory Access Coalescing on GPGPU.

Yulong Pei,Licheng Yu,Minghui Wu,Tianzhou Chen,Xueqing Lou,Tiefei Zhang
DOI: https://doi.org/10.1109/icess.2016.15
2016-01-01
Abstract:The modern GPU has powerful parallel processing unit and programmable pipeline, therefore GPU has some advantages for non-graphic computing. However, GPU needs a lot of memory access bandwidth. GPU uses memory access coalescing method to reduce memory access requests that have good locality. Paper [1] proposes equivalent memory access coalescing to improve memory access performance when the program's memory access requests have bad locality. That means original memory access coalescing and equivalent memory access coalescing can complement to each other. In this paper, we propose two methods to combine these two memory access coalescing methods. In the experiment, we choose 30 benchmarks from two suites. By using the first method, 20 benchmarks' memory access performance can be improved. The average speed up is 141.4%. By using the second method, 27 benchmarks can choose the better memory access coalescing method.
What problem does this paper attempt to address?