Neural network-based cross-channel chroma prediction for versatile video coding

Fang Liang,Jingde Zhang
DOI: https://doi.org/10.1007/s11227-023-05868-y
IF: 3.3
2024-02-09
The Journal of Supercomputing
Abstract:Despite linear models being introduced in the latest versatile video coding (VVC) standard to exploit the correlation among luma and chroma channels for removing redundancy, these models cannot take into account the nonlinearity of components, resulting in degraded intraprediction precision. In this paper, a neural network-based method is proposed for cross-channel chroma intraprediction to enhance the coding efficiency. Specifically, the neighboring reference and co-located samples are separately input into the proposed network to exploit spatial and cross-channel correlations fully. Furthermore, in order to acquire a more compact representation of residual signals, a transform-based loss is employed to enhance the effectiveness of the compression. The proposed method is integrated into VVC, competing with the intrinsic chroma prediction regarding rate-distortion optimization to enhance coding performance further. The extensive experimental results demonstrate the superiority of the proposed method over the VVC test model (VTM) 18.0, achieving average bitrate savings of 0.28%, 2.44%, and 1.89% for Y, U, and V components, respectively.
computer science, theory & methods,engineering, electrical & electronic, hardware & architecture
What problem does this paper attempt to address?