Fast Algorithm for HEVC Using Frequency-based Convolutional Neural Networks

Yi Wu, Lei Chen
DOI: https://doi.org/10.1109/eiect60552.2023.10442731
2023-01-01
Abstract: Compared to the previous H.264 video encoding standard, High Efficiency Video Coding (HEVC) offers a 50% reduction in bitrate while maintaining perceptual video quality. However, this improvement comes at the cost of significantly increased encoding complexity. In HEVC, the quad-tree partitioning of coding units (CUs) is a major contributor to this complexity, because it requires an exhaustive rate-distortion optimization (RDO) search. Many existing approaches address this issue by using neural networks to predict block partition decisions, working primarily in the spatial domain. In our work, we instead introduce a frequency-based network that tackles the problem from a different angle. Compared with the original HM16.5 encoder, the proposed frequency-domain learning method reduces encoding time by about 64.74%, 67.21%, 70.8% and 79.52% on average at the four tested QPs, respectively.
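To illustrate why the exhaustive quad-tree search is expensive, the following minimal Python sketch recursively compares "no split" against "split into four" for one 64x64 block down to 8x8 CUs. It is not the HM16.5 implementation and not the paper's method: `toy_rd_cost`, `best_partition`, and the `lam` value are hypothetical names and settings chosen for illustration, and the cost is a crude stand-in for a real rate-distortion cost.

```python
MIN_CU = 8  # smallest CU size considered in this sketch


def toy_rd_cost(block, lam=10.0):
    """Hypothetical stand-in for an RD cost J = D + lam * R (not HM's RDO)."""
    pixels = [p for row in block for p in row]
    mean = sum(pixels) / len(pixels)
    distortion = sum((p - mean) ** 2 for p in pixels)  # flat blocks are cheap
    rate = 1.0  # assumption: every coded CU costs one unit of rate
    return distortion + lam * rate


def sub_block(block, y, x, size):
    """Return the size x size square whose top-left corner is (y, x)."""
    return [row[x:x + size] for row in block[y:y + size]]


def best_partition(block, counter):
    """Exhaustively evaluate both options at every depth of the quad-tree.

    Returns (best_cost, tree), where tree is the CU size if the block is
    left unsplit, or a list of four sub-trees if it is split.
    counter[0] counts how many CUs had their cost evaluated.
    """
    size = len(block)
    counter[0] += 1
    cost_whole = toy_rd_cost(block)
    if size <= MIN_CU:
        return cost_whole, size
    half = size // 2
    children = [
        best_partition(sub_block(block, y, x, half), counter)
        for y in (0, half)
        for x in (0, half)
    ]
    cost_split = sum(cost for cost, _ in children)
    if cost_split < cost_whole:
        return cost_split, [tree for _, tree in children]
    return cost_whole, size


# A perfectly flat 64x64 block: the search still visits every candidate CU.
flat = [[128] * 64 for _ in range(64)]
counter = [0]
cost, tree = best_partition(flat, counter)
print(counter[0], tree)  # 1 + 4 + 16 + 64 = 85 evaluations, block left unsplit
```

Even for a block that is never split, the search evaluates all 85 candidate CUs. A learned predictor of split decisions, as described in the abstract, aims to avoid most of these evaluations.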