Parallelization Of Computing-Intensive Tasks Of The H.264 High Profile Decoding Algorithm On A Reconfigurable Multimedia System

Tongsheng Geng, Leibo Liu, Shouyi Yin, Min Zhu, Shaojun Wei
DOI: https://doi.org/10.1587/transinf.E93.D.3223
2010-01-01
IEICE Transactions on Information and Systems
Abstract: This paper proposes approaches to perform HW/SW (Hardware/Software) partition and parallelization of computing-intensive tasks of the H.264 HiP (High Profile) decoding algorithm on an embedded coarse-grained reconfigurable multimedia system called REMUS (REconfigurable MUltimedia System). Several techniques, such as MB (Macro Block) based parallelization, unfixed sub-block operation, etc., are utilized to speed up the decoding process, satisfying the requirements of real-time and high-quality H.264 applications. Tests show that the execution performance of MC (Motion Compensation), deblocking, and IDCT_IQ (Inverse Discrete Cosine Transform/Inverse Quantization) on REMUS is improved by 60%, 73%, and 88.5% in the typical case and by 60%, 69%, and 88.5% in the worst case, respectively, compared with that on XPP PACT (a commercial reconfigurable processor). Compared with ASIC solutions, the performance of MC is improved by 70% and 74% in the typical and worst cases, respectively, while that of deblocking remains the same. As for IDCT_IQ, the performance is improved by 17% in both the typical and worst cases. Relying on the proposed techniques, H.264 HiP@Level 4 decoding at 1080p@30 fps could be achieved on REMUS when utilizing a 200 MHz working frequency.