Techniques For Efficient Dct/Idct Implementation On Generic Gpu

B Fang,Gb Shen,Sp Li,Hf Chen
DOI: https://doi.org/10.1109/ISCAS.2005.1464791
2005-01-01
Abstract:Emergence of programmable graphics processing unit has led to increasing interest in offloading numerically intensive computations on graphics hardware. DCT/IDCT is widely adopted in modern image/video compression standards and is usually one of the most computational expensive parts. In this paper, we present several techniques for efficient implementation of DCT/IDCT on generic programmable GPU, using direct matrix multiplication. Our experimental results demonstrate that the speed of IDCT on GPU with the proposed techniques can well exceed that on CPU with MM optimization.
What problem does this paper attempt to address?