Lightweight macro-pixel quality enhancement network for light field images compressed by versatile video coding

Hongyue Huang,Chen Cui,Chuanmin Jia,Xinfeng Zhang,Siwei Ma
DOI: https://doi.org/10.1016/j.jvcir.2024.104329
IF: 2.887
2024-11-01
Journal of Visual Communication and Image Representation
Abstract:Previous research demonstrated that filtering Macro-Pixels (MPs) in a decoded Light Field Image (LFI) sequence can effectively enhances the quality of the corresponding Sub-Aperture Images (SAIs). In this paper, we propose a deep-learning-based quality enhancement model following the MP-wise processing approach tailored to LFIs encoded by the Versatile Video Coding (VVC) standard. The proposed novel Res2Net Quality Enhancement Convolutional Neural Network (R2NQE-CNN) architecture is both lightweight and powerful, in which the Res2Net modules are employed to perform LFI filtering for the first time, and are implemented with a novel improved 3D-feature-processing structure. The proposed method incorporates only 205K model parameters and achieves significant Y-BD-rate reductions over VVC of up to 32%, representing a relative improvement of up to 33% compared to the state-of-the-art method, which has more than three times the number of parameters of our proposed model.
computer science, information systems, software engineering
What problem does this paper attempt to address?