Cnn-Based Depth Map Prediction for Fast Block Partitioning in HEVC Intra Coding.

Aolin Feng,Changsheng Gao,Li Li,Dong Liu,Feng Wu
DOI: https://doi.org/10.1109/ICME51207.2021.9428069
2021-01-01
Abstract:High Efficiency Video Coding (HEVC) achieves significant improvement in compression efficiency by introducing quadtree-based block partition. However, in the HEVC reference software–HM, the optimal partition is found by a recursive rate-distortion optimization (RDO) process, which is computationally expensive and not friendly to hardware implementation. We propose a fast block partitioning algorithm using convolutional neural network (CNN) based depth map prediction for HEVC intra coding. We use the depth map to represent the block partition of a coding tree unit (CTU). Then, we design a CNN to predict the depth map for a CTU, and we construct a large-scale dataset to train the CNN. Through the depth map prediction, we obtain a block partitioning structure for the entire CTU, and then we could directly compress each coding unit, getting rid of the recursive RDO process for partitioning. Experimental results show that our proposed method reduces 65.55% encoding time of HM at the cost of 2.02% Bjøntegaard Delta rate (BD-rate) increase on the common test sequences. For 4K sequences, our method achieves 76.97% time saving with 2.89% BD-rate increase.
What problem does this paper attempt to address?