Neural Image Compression with Quantization Rectifier

Wei Luo, Bo Chen
2024-03-26
Abstract: Neural image compression has been shown to outperform traditional image codecs in terms of rate-distortion performance. However, quantization introduces errors in the compression process, which can degrade the quality of the compressed image. While existing approaches address the train-test mismatch problem incurred during quantization, the random impact of quantization on the expressiveness of image features remains unsolved. This paper presents a novel quantization rectifier (QR) method for image compression that leverages image feature correlation to mitigate the impact of quantization. Our method designs a neural network architecture that predicts unquantized features from the quantized ones, preserving feature expressiveness for better image reconstruction quality. We develop a soft-to-predictive training technique to integrate QR into existing neural image codecs. In evaluation, we integrate QR into state-of-the-art neural image codecs and compare the enhanced models with their baselines on the widely used Kodak benchmark. The results show that QR consistently improves coding efficiency with a negligible increase in running time.
Machine Learning
What problem does this paper attempt to address?
The paper addresses the loss of reconstructed image quality caused by quantization error in neural image compression. Although existing methods have addressed the mismatch between training and testing, the random impact of quantization on feature expressiveness remains unresolved. Quantization maps continuous values to a limited set of discrete values, which introduces varying levels of noise. For example, quantizing values in the range \([-0.5, 0.5)\) to zero introduces noise in the range \((-0.5, 0.5]\). More importantly, quantization changes the expressiveness of latent features in an unpredictable manner.

To address this challenge, the paper proposes a Quantization Rectifier (QR) method, which uses the spatial correlation of image features to mitigate the impact of quantization. The core idea of QR is a neural network architecture that predicts unquantized features from quantized ones, thereby preserving feature expressiveness and improving image reconstruction quality. The paper also develops a Soft-to-Predictive (STP) training technique that allows QR to be integrated seamlessly into existing neural image codecs.

The main contributions of the paper are:

1. **Proposing the QR method**: Correcting quantization errors by predicting unquantized features from quantized ones, which preserves feature expressiveness and improves coding efficiency.
2. **Developing the STP training procedure**: Together with a hyper-parameter exploration algorithm, this procedure allows QR to be integrated seamlessly into existing neural image codecs.
3. **Extensively evaluating QR**: Conducting experiments on state-of-the-art neural image codecs that demonstrate the benefit of QR over the baselines.
With these methods, the paper reports improved image quality over the baseline models without an increase in bit rate, with the gains most pronounced for more complex models.
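The rounding noise and the idea of rectifying it from neighboring features can be illustrated with a small toy sketch. This is not the paper's architecture or training procedure: the 1-D signal, the `neighborhood` helper, and the least-squares linear filter are all assumptions introduced here for illustration, with the linear filter standing in for the learned QR network. The filter is also fitted and evaluated on the same signal, so the numbers only show that correlated neighbors carry information about the unquantized value.

```python
import numpy as np


def quantize(x):
    """Hard quantization: round to the nearest integer (NumPy rounds halves to even)."""
    return np.round(x)


def neighborhood(q, radius=2):
    """Stack each quantized sample with its neighbors (edges are repeated).

    Hypothetical helper: returns an array of shape (len(q), 2 * radius + 1)
    whose middle column is q itself.
    """
    padded = np.pad(q, radius, mode="edge")
    return np.stack([padded[i:i + len(q)] for i in range(2 * radius + 1)], axis=1)


# Hypothetical smooth 1-D "latent feature" with strong spatial correlation.
t = np.linspace(0.0, 1.0, 512)
x = 3.0 * np.sin(2 * np.pi * 2 * t) + 1.5 * np.cos(2 * np.pi * 5 * t)

q = quantize(x)
noise = q - x  # rounding noise, bounded by 0.5 in magnitude

# Toy linear "rectifier": predict the unquantized value from a window of
# quantized neighbors. A least-squares fit replaces the neural network here.
A = neighborhood(q)
weights, *_ = np.linalg.lstsq(A, x, rcond=None)
x_hat = A @ weights

mse_quantized = float(np.mean((q - x) ** 2))
mse_rectified = float(np.mean((x_hat - x) ** 2))

print(f"max |noise|        = {np.abs(noise).max():.3f}")
print(f"MSE quantized      = {mse_quantized:.5f}")
print(f"MSE toy rectifier  = {mse_rectified:.5f}")
```

Because the window includes the quantized sample itself, passing it through unchanged is one of the candidate filters, so on the fitting signal the least-squares solution cannot do worse than the plain quantized values. A learned rectifier, as described in the paper, would instead be trained on image features and evaluated on held-out images.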