Efficient Method for Histogram Generation on GPU

DI Peng, HU Chang-jun, LI Jian-jiang
2012-01-01
Computer Science
Abstract: Histogram generation is an inherently sequential loop computation with irregular data dependences, and it is widely used across diverse fields. However, the irregular memory accesses in the histogram loop nest obstruct parallel execution with a massive number of fine-grained threads, because bank conflicts increase access latency. Accelerating histogram generation on parallel platforms is therefore non-trivial, particularly on a state-of-the-art platform such as the graphics processing unit (GPU). To reduce bank conflicts, a padding technique is used to distribute the shared-memory accesses of multiple threads evenly across different banks, which exploits much more of the GPU's potential for accelerating histogram generation. Moreover, an efficient near-optimal configuration search model guides programmers in choosing appropriate GPU execution parameters for higher performance. Experimental results demonstrate that the improved histogram generation algorithm achieves speedups of approximately 42% to 88% over the traditional histogram generation algorithm on the GPU.
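The abstract does not spell out the padding scheme, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes a common layout in which each thread of a warp owns a private sub-histogram in shared memory, and it assumes 32 banks of one word each and a warp of 32 threads. The names `NUM_BANKS`, `NUM_BINS`, and `max_conflict` are hypothetical. The sketch models only the address-to-bank mapping and counts how many threads of one warp land on the same bank, with and without one word of padding per sub-histogram.

```python
from collections import Counter

NUM_BANKS = 32   # assumption: 32 shared-memory banks, one word wide
WARP_SIZE = 32   # assumption: 32 threads issue their accesses together
NUM_BINS = 64    # hypothetical histogram size, a multiple of NUM_BANKS


def max_conflict(bins, stride):
    """Return the largest number of threads in one warp that hit the same bank.

    Thread t increments counter bins[t] of its private sub-histogram, which
    starts at word offset t * stride. A result of 1 means no bank conflict.
    """
    banks = Counter((t * stride + b) % NUM_BANKS for t, b in enumerate(bins))
    return max(banks.values())


# Worst case for the unpadded layout: every thread updates the same bin.
same_bin = [5] * WARP_SIZE

# Unpadded: stride is a multiple of NUM_BANKS, so all threads share one bank.
unpadded = max_conflict(same_bin, stride=NUM_BINS)

# Padded: one extra word per sub-histogram shifts each thread by one bank.
padded = max_conflict(same_bin, stride=NUM_BINS + 1)

print(unpadded, padded)  # prints "32 1"
```

In this model the padded stride is congruent to 1 modulo the bank count, so thread t's access to bin b maps to bank (t + b) mod 32. Padding therefore removes the conflict when threads in a warp update the same bin, but it does not remove conflicts for every access pattern, so the benefit depends on the input data.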