Abstract:End-to-end image and video compression using auto-encoders (AE) offers new appealing perspectives in terms of rate-distortion gains and applications. While most complex models are on par with the latest compression standard like VVC/H.266 on objective metrics, practical implementation and complexity remain strong issues for real-world applications. In this paper, we propose a practical implementation suitable for realistic applications, leading to a low-complexity model. We demonstrate that some gains can be achieved on top of a state-of-the-art low-complexity AE, even when using simpler implementation. Improvements include off-training entropy coding improvement and encoder side Rate Distortion Optimized Quantization. Results show a 19% improvement in BDrate on basic implementation of fully-factorized model, and 15.3% improvement compared to the original implementation. The proposed implementation also allows a direct integration of such approaches on a variety of platforms.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to improve the performance of end - to - end image and video compression techniques based on auto - encoders (AE) while maintaining low complexity. Specifically, although existing complex models are comparable to the latest compression standards (such as VVC/H.266) in objective metrics, their actual implementation and complexity remain major obstacles in practical applications. Therefore, the paper proposes a low - complexity model implementation method suitable for practical applications, aiming to further improve performance on the basis of simplified implementation. The main contributions of the paper include: 1. **Entropy Coding Improvement**: An entropy coding improvement method after offline training is proposed. By learning a new conditional entropy model based on context switching, the coding efficiency of latent variables is optimized. 2. **Rate - Distortion Optimized Quantization (RDOQ) on the Encoding Side**: A new rate - distortion optimized process during encoding is introduced to reduce the amortization gap without the support of an external framework, which can be achieved only using the distilled decoder. 3. **Hardware - Friendly Implementation**: By replacing the GDN/IGDN activation functions with ReLU/ReLU, and using 16 - bit integer networks and simplified integer operations, the model is made more suitable for hardware implementation. 4. **Performance Verification**: Experimental results show that, compared with the basic full - factor model, the proposed implementation method improves the BD - rate by 19%, and by 15.3% compared with the original implementation. In addition, this method also allows direct integration into various platforms. In conclusion, through a series of innovative technical means, this paper significantly improves the performance of auto - encoder - based image and video compression techniques while maintaining low complexity, making it more suitable for practical applications.

Entropy Coding Improvement for Low-complexity Compressive Auto-encoders

Channel-wise Autoregressive Entropy Models for Learned Image Compression

Conditional Entropy Coding for Efficient Video Compression

Slimmable Compressive Autoencoders for Practical Neural Image Compression

An End-to-End Face Compression and Recognition Framework Based on Entropy Coding Model

Learned Image Compression with Dual-Branch Encoder and Conditional Information Coding

Optimized learned entropy coding parameters for practical neural-based image and video compression

Low-complexity Overfitted Neural Image Codec

Feedback Recurrent Autoencoder for Video Compression

Exploiting Latent Properties to Optimize Neural Codecs

Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding

Video Compression With Rate-Distortion Autoencoders

On the Rate-Distortion-Complexity Trade-offs of Neural Video Coding

Evaluation of Beyond-Hevc Entropy Coding Methods for Dct Transform Coefficients

An Improved Upper Bound on the Rate-Distortion Function of Images

Deep Convolutional AutoEncoder-based Lossy Image Compression

A Universal Encoder Rate Distortion Optimization Framework for Learned Compression

Practical Learned Image Compression with Online Encoder Optimization

Multi-rate Adaptive Transform Coding for Video Compression

Reducing The Amortization Gap of Entropy Bottleneck In End-to-End Image Compression

An Extended Context-Based Entropy Hybrid Modeling for Image Compression.