Data Level Parallelism Implementation of Odd-even Merge Sort

ZHANG Ke-liang, LI Jia-jia, CHEN Gang, WU Bai-feng
DOI: https://doi.org/10.3969/j.issn.1000-1220.2012.06.038
2012-01-01
Abstract: The odd-even merge sort algorithm exhibits significant data-level parallelism, so we implement it on the GPU, whose large computational capacity promises much better performance. However, work-items in different work-groups have no mechanism for synchronizing with one another. We present two methods to solve this problem. The first uses the host program to control the iteration process, which completely removes the need for synchronization among all work-items. The second uses a bucket-partition preprocessing technique to confine the synchronization requirement to individual work-groups, where it can be handled correctly by the synchronization mechanism available within each work-group. Experimental results show that our implementation achieves a clear performance improvement over the sort algorithm in the C++ STL.
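The two approaches named in the abstract can be illustrated with a small CPU-side simulation. This is a minimal sketch under stated assumptions, not the authors' GPU code: the function names are hypothetical, each "kernel launch" is simulated by a plain Python loop, and the equal-width value-range bucketing is an assumed stand-in, since the abstract does not describe the actual partitioning scheme. The comparator schedule is the standard iterative form of Batcher's odd-even merge sort network.

```python
def odd_even_merge_passes(n):
    """Yield one list of (lo, hi) index pairs per pass of Batcher's network.

    The pairs within a pass touch disjoint indices, so on a GPU each pair
    could be handled by its own work-item with no ordering between them.
    """
    p = 1
    while p < n:
        k = p
        while k >= 1:
            pairs = []
            for j in range(k % p, n - k, 2 * k):
                for i in range(min(k, n - j - k)):
                    # Only compare elements that lie in the same 2p-sized block.
                    if (i + j) // (2 * p) == (i + j + k) // (2 * p):
                        pairs.append((i + j, i + j + k))
            yield pairs
            k //= 2
        p *= 2


def compare_exchange_pass(data, pairs):
    """Simulated kernel: one independent compare-exchange per pair."""
    for lo, hi in pairs:
        if data[lo] > data[hi]:
            data[lo], data[hi] = data[hi], data[lo]


def host_controlled_sort(values):
    """Method 1 (sketch): the host loop issues one pass at a time.

    Returning to the host between passes acts as the global
    synchronization point that work-items in different groups lack.
    """
    data = list(values)
    for pairs in odd_even_merge_passes(len(data)):
        compare_exchange_pass(data, pairs)  # stands in for one kernel launch
    return data


def bucket_partition_sort(values, num_buckets=4):
    """Method 2 (sketch): partition by value range, then sort each bucket.

    ASSUMPTION: equal-width value ranges. Each bucket stands in for the
    data of a single work-group, which could synchronize internally.
    """
    data = list(values)
    if len(data) < 2:
        return data
    lo, hi = min(data), max(data)
    if lo == hi:
        return data
    buckets = [[] for _ in range(num_buckets)]
    for v in data:
        idx = min(num_buckets - 1, int((v - lo) * num_buckets / (hi - lo)))
        buckets[idx].append(v)
    result = []
    for bucket in buckets:  # buckets are already ordered by value range
        result.extend(host_controlled_sort(bucket))
    return result
```

In this sketch the host-controlled variant needs log2(n) * (log2(n) + 1) / 2 passes for a power-of-two n (6 passes for n = 8), while the bucketed variant needs no cross-bucket coordination after the partition step because every value in one bucket is no larger than every value in the next.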