Decoder-side Secondary Transform Derivation for Video Coding Beyond AVS3

Yuhuai Zhang,Huiwen Ren,Xin Liu,Lu Zhao,Shiqi Wang,Siwei Ma
DOI: https://doi.org/10.1109/dcc58796.2024.00049
2024-01-01
Abstract:Secondary transform was adopted into the third generation Audio Video coding Standard (AVS3) to improve the intra-coded residual coding by applying a 4x4 secondary transform kernel. However, the adaptability of the single 4x4 transform kernel is limited for various residual data. In order to achieve higher residual coding gains, we propose a Decoder-side Secondary Transform Derivation (DSTD) method. Specifically, DSTD expands the maximum range of secondary transform from 4x4 to 8x8, where an 8x8 size transform kernel is introduced to further enhance the capability of compacting residuals. In particularly, three flipped secondary transform types are employed to extend transform candidates, including horizontal, vertical and diagonal flipping types. The boundary continuity is utilized to derive the transform type. Experimental results show that the proposed method can achieve 0.51% and 0.18% BD-rate savings on average under All Intra (AI) and Random Access (RA) configurations, respectively. DSTD has been adopted into the Exploration Video Model (EVM) for AVS4.
What problem does this paper attempt to address?