Width-Adaptive CNN: Fast CU Partition Prediction for VVC Screen Content Coding
Chao Jiao, Huanqiang Zeng, Jing Chen, Chih-Hsien Hsia, Tianlei Wang, Kai-Kuang Ma
DOI: https://doi.org/10.1109/tmm.2024.3410116
IF: 7.3
2024-08-28
IEEE Transactions on Multimedia
Abstract: Screen content coding (SCC) in Versatile Video Coding (VVC) significantly improves the coding efficiency of screen content videos (SCVs), but it incurs high computational complexity because of the quad-tree plus multi-type tree (QTMT) structure used for coding unit (CU) partitioning. We therefore make the first attempt to reduce the encoding complexity of SCC in VVC from the perspective of CU partitioning, and develop a fast CU partition prediction method for VVC-SCC. First, to address the lack of sufficient SCC training data, SCVs are collected to build a database containing CUs of various sizes and their corresponding partition labels. Second, to determine the partition decision in advance, a novel width-adaptive CNN (WA-CNN) model is proposed, which is capable of predicting two large CUs for VVC-SCC by adjusting the feature channels based on the size of the input CU block. Finally, because the different partition decisions occur in imbalanced proportions, a weighted loss function that equalizes the contribution of the imbalanced data is formulated to train the proposed WA-CNN model. Experimental results show that the proposed model reduces the SCC intra-encoding time by 35.65% to 38.31%, with an average BDBR increase of 1.84% to 2.42%.
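The abstract does not give the form of the weighted loss, so the following is only an illustrative sketch of one common way to equalize imbalanced classes: cross-entropy with inverse-frequency class weights. The function names, the integer class ids standing in for partition decisions, and the weighting rule itself are assumptions made for illustration, not the formulation used in the paper.

```python
import math
from collections import Counter


def inverse_frequency_weights(labels):
    """Assumed weighting rule: weight = N / (K * count), so rare classes weigh more."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {c: n_samples / (n_classes * counts[c]) for c in counts}


def weighted_cross_entropy(probs, labels, weights):
    """Weighted mean of -log p(true class), normalized by the sum of sample weights."""
    total = 0.0
    weight_sum = 0.0
    for p, y in zip(probs, labels):
        w = weights[y]
        # Clamp the probability to avoid log(0).
        total += -w * math.log(max(p[y], 1e-12))
        weight_sum += w
    return total / weight_sum


# Hypothetical toy data: class 0 (e.g. "no split") is three times as common as class 1.
labels = [0, 0, 0, 1]
probs = [[0.5, 0.5]] * 4  # an uninformative predictor
weights = inverse_frequency_weights(labels)
loss = weighted_cross_entropy(probs, labels, weights)
```

With these weights, the single minority sample contributes as much to the loss as the three majority samples combined, which is the balancing effect the abstract describes in general terms.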
computer science, information systems; telecommunications; software engineering