Abstract:Learned image compression methods have achieved satisfactory results in recent years. However, existing methods are typically designed for RGB format, which are not suitable for YUV420 format due to the variance of different formats. In this paper, we propose an information-guided compression framework using cross-component attention mechanism, which can achieve efficient image compression in YUV420 format. Specifically, we design a dual-branch advanced information-preserving module (AIPM) based on the information-guided unit (IGU) and attention mechanism. On the one hand, the dual-branch architecture can prevent changes in original data distribution and avoid information disturbance between different components. The feature attention block (FAB) can preserve the important information. On the other hand, IGU can efficiently utilize the correlations between Y and UV components, which can further preserve the information of UV by the guidance of Y. Furthermore, we design an adaptive cross-channel enhancement module (ACEM) to reconstruct the details by utilizing the relations from different components, which makes use of the reconstructed Y as the textural and structural guidance for UV components. Extensive experiments show that the proposed framework can achieve the state-of-the-art performance in image compression for YUV420 format. More importantly, the proposed framework outperforms Versatile Video Coding (VVC) with 8.37% BD-rate reduction on common test conditions (CTC) sequences on average. In addition, we propose a quantization scheme for context model without model retraining, which can overcome the cross-platform decoding error caused by the floating-point operations in context model and provide a reference approach for the application of neural codec on different platforms.

A channel-wise contextual module for learned intra video compression

Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement

Learned Video Compression With Efficient Temporal Context Learning

Learned Block-based Hybrid Image Compression

Neural-Network-Based Cross-Channel Intra Prediction

Temporal Context Mining for Learned Video Compression

Optimized Spatial Recurrent Network for Intra Prediction in Video Coding

Multiscale Motion-Aware and Spatial-Temporal-Channel Contextual Coding Network for Learned Video Compression

Learned Image Compression Using Cross-Component Attention Mechanism

Bi-Directional Deep Contextual Video Compression

ECVC: Exploiting Non-Local Correlations in Multiple Frames for Contextual Video Compression

Temporal context video compression with flow-guided feature prediction

Improved Deep Image Compression with Joint Optimization of Cross Channel Context Model and Generalized Loop Filter

Intra Prediction Using Fully Connected Network for Video Coding

Channel-Wise Feature Decorrelation for Enhanced Learned Image Compression

Unified Intra Mode Coding Based on Short and Long Range Correlations.

Long-term Temporal Context Gathering for Neural Video Compression

Enhancing Temporal Context for Learned Video Compression

Multi-Scale Convolutional Neural Network-Based Intra Prediction for Video Coding.

Learned Video Compression via Heterogeneous Deformable Compensation Network