Improving Learned Video Compression by Exploring Spatial Redundancy

Jiayu Yang,Chunhui Yang,Yongqi Zhai,Qi Wang,Xinghao Pan,Ronggang Wang
DOI: https://doi.org/10.1109/icassp48485.2024.10448496
2024-01-01
Abstract:Learned video compression has developed rapidly and shown promising rate-distortion performance recently. Existing works have made great progress on removing temporal redundancy between inter-frames, while neglecting spatial redundancy within a frame. In this paper, we propose to explore spatial redundancy to further improve compression efficiency and reduce computation complexity by extending inter-frame prediction from temporal prediction to spatial-temporal prediction. Specifically, the current frame is first down-sampled to reduce spatial redundancy, which is then temporally predicted to compensate motions and spatially predicted to restore resolutions. The spatial prediction module is designed in a reference-based manner, which improves prediction performance without additional bit rate cost by mining high-frequency details from a neighboring high-resolution decoded frame. Experimental results demonstrate that our proposed module can improve compression efficiency and reduce encoding/decoding time. Our framework achieves competitive compression efficiency compared with recent learned methods and traditional codecs.
What problem does this paper attempt to address?