Coordinate Attention-Based Convolution Neural Network for In-loop Filter of AVS3

Ruotong Wu,Songlin Sun,Jiaqi Zou,Shaokang Wang,Zhilei Ling
DOI: https://doi.org/10.1109/iscit57293.2023.10376069
2023-01-01
Abstract:Employing deep learning is a promising solution for reducing encoding bit rate in future video encoding systems. This paper proposes a neural network-based in-loop filter for the third generation of Audio Video Coding Standard (AVS3). The proposed network introduces a coordinate attention mechanism-based convolutional neural network in-loop filter (AMCNNLF) with a flexible attention module and a residual feature aggregation (RFA) module. Specifically, the attention module focuses on salient features to capture the visual structure, while the RFA module takes full advantage of the local refinement features. By leveraging the encoding parameters, we introduce the Quantizer Parameter (QP) values as auxiliary features to enable the proposed network suitable for processing encoded videos with multiple QPs. Experimental results indicate that the proposed network reduces the average Bjøntegaard-Delta rate (BD-Rate) of luma component about 0.4% under all intra configuration compared with the benchmark.
What problem does this paper attempt to address?