Algorithm of H.264 Fast Deblocking Filter on CUDA

LIU Hu,SUN Zhao-min,CHEN Qi-mei
DOI: https://doi.org/10.3724/sp.j.1087.2010.03252
2011-01-01
Abstract:In H.264/AVC video coding standard,deblocking filter was used for enhancing the coding efficiency.The filter was very complicated and cost a lot of time.A fast algorithm and efficient implementation of H.264 deblocking filter based on NVIDIA Compute Unified Device Architecture(CUDA) was proposed.The parallel hardware architecture and software development process of Graphic Processing Unit(GPU) were introduced firstly.On the basis of the parallel architecture and hardware characteristic of GPU,some algorithms were focused on BS computation and optimization of deblocking filter to reduce complexity and improve the computing speed,and the shared memory was used to improve the data access efficiency.The experimental results clearly show that,in the same image quality,the average acceleration rate is about 20,and the algorithm on CPU can achieve better performance.
What problem does this paper attempt to address?