Abstract:This paper addresses the problem of lossy image compression, a fundamental problem in image processing and information theory that is involved in many real-world applications. We start by reviewing the framework of variational autoencoders (VAEs), a powerful class of generative probabilistic models that has a deep connection to lossy compression. Based on VAEs, we develop a novel scheme for lossy image compression, which we name quantization-aware ResNet VAE (QARV). Our method incorporates a hierarchical VAE architecture integrated with test-time quantization and quantization-aware training, without which efficient entropy coding would not be possible. In addition, we design the neural network architecture of QARV specifically for fast decoding and propose an adaptive normalization operation for variable-rate compression. Extensive experiments are conducted, and results show that QARV achieves variable-rate compression, high-speed decoding, and a better rate-distortion performance than existing baseline methods. The code of our method is publicly accessible at <a class="link-external link-https" href="https://github.com/duanzhiihao/lossy-vae" rel="external noopener nofollow">this https URL</a>

What problem does this paper attempt to address?

The problem that this paper attempts to solve is lossy image compression, which is a fundamental problem in image processing and information theory and is widely used in many real - world scenarios. Specifically, the paper proposes a new lossy image compression scheme based on Variational Autoencoders (VAEs), called Quantization - Aware ResNet VAE (QARV). This method aims to improve the performance of lossy image compression through the following improvements: 1. **Quantization - Aware Training and Test - Time Quantization**: QARV combines test - time quantization and quantization - aware training, making efficient entropy coding possible. This solves the problem in traditional methods that existing entropy coding algorithms cannot be directly applied due to continuous - valued latent variables. 2. **Fast Decoding**: QARV designs a neural network architecture, especially for achieving fast decoding. The paper introduces a new block architecture that can transfer more computations from the decoder to the encoder, thus achieving a faster decoding speed than most previous image compression methods. 3. **Variable - Rate Compression**: The paper introduces a new variable - rate compression method - Adaptive Layer Normalization (AdaLN), which can be used in a plug - and - play manner in modern neural network architectures. This method allows QARV to achieve continuously adjustable compression rates while maintaining a single model. 4. **No Need for Context Models**: Unlike most existing methods, QARV avoids using spatial/channel autoregressive context models, which are not only complex in design but may also be computationally infeasible in practical applications. QARV achieves higher computational efficiency through its hierarchical VAE architecture while still achieving better compression performance than existing methods. In summary, the main objective of this paper is to provide a more efficient, more flexible, and more powerful lossy image compression method through QARV, with particular emphasis on the ability to achieve fast decoding and variable - rate compression while maintaining high - quality image reconstruction.

QARV: Quantization-Aware ResNet VAE for Lossy Image Compression

Lossy Image Compression with Quantized Hierarchical VAEs

QVRF: A Quantization-error-aware Variable Rate Framework for Learned Image Compression

LVQAC: Lattice Vector Quantization Coupled with Spatially Adaptive Companding for Efficient Learned Image Compression

Rate Controllable Learned Image Compression Based on RFL Model

A Predictive VQ Based Video Compression Scheme

Universal End-to-End Neural Network for Lossy Image Compression

Neural Image Compression with Quantization Rectifier

A New Image Distortionless Compression Scheme Based on Wavelet

Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression

An Improved Upper Bound on the Rate-Distortion Function of Images

Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding

Extreme Image Compression using Fine-tuned VQGANs

Learning a Deep Vector Quantization Network for Image Compression

Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image

Channel-Level Variable Quantization Network for Deep Image Compression

Learning Scalable ℓ∞-Constrained Near-lossless Image Compression Via Joint Lossy Image and Residual Compression

A multi-layer image representation using Regularized Residual Quantization: application to compression and denoising

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

Semantic-oriented learning-based image compression by Only-Train-Once quantized autoencoders

Variable-rate Learned Image Compression with Adaptive Quantization Step Size