Abstract: Recently, deep learning-based image compression has made significant progress and has achieved better rate-distortion (R-D) performance than the latest traditional method, H.266/VVC, in both the MS-SSIM metric and the more challenging PSNR metric. However, a major problem is that the complexity of many leading learned schemes is too high. In this paper, we propose an efficient and effective image coding framework that achieves R-D performance similar to the state of the art with lower complexity. First, we develop an improved multi-scale residual block (MSRB) that can expand the receptive field and capture global information more efficiently, which further reduces the spatial correlation of the latent representations. Second, we introduce an importance scaling network that directly scales the latents to achieve content-adaptive bit allocation without sending side information, which is more flexible than previous importance map methods. Third, we apply a post-quantization filter (PQF) to reduce the quantization error, motivated by the Sample Adaptive Offset (SAO) filter in video coding. Moreover, our experiments show that the performance of the system is less sensitive to the complexity of the decoder. Therefore, we design an asymmetric paradigm in which the encoder employs three stages of MSRBs to improve the learning capacity, whereas the decoder uses only one stage of MSRB, which reduces the decoder complexity and still yields satisfactory performance. Experimental results show that, compared to the state-of-the-art method, encoding and decoding with the proposed method are about 17 times faster, and the R-D performance is reduced by only about 1% on both the Kodak and Tecnick-40 datasets, which is still better than H.266/VVC (4:4:4) and other leading learning-based methods. Our source code is publicly available at https://github.com/fengyurenpingsheng.
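
As a rough illustration of why scaling latents before quantization can steer bit allocation, the following minimal sketch rounds a set of toy latents under two different scale factors and compares the resulting error. It is not the authors' importance scaling network: the function names and the constant scale values are hypothetical, and the decoder here simply reuses the encoder's scales, whereas the abstract states that no side information is sent and does not describe how the decoder handles this.

```python
import random


def quantize_with_scale(latents, scales):
    """Scale each latent, round to the nearest integer, then undo the scaling.

    Assumption for illustration only: the same scales are available when
    undoing the scaling. The abstract does not say how the decoder does this.
    """
    symbols = [round(y * s) for y, s in zip(latents, scales)]
    reconstructed = [q / s for q, s in zip(symbols, scales)]
    return symbols, reconstructed


def mean_squared_error(a, b):
    """Average squared difference between two equally long sequences."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


random.seed(0)
# Toy latents standing in for an encoder output (hypothetical data).
latents = [random.uniform(-4.0, 4.0) for _ in range(1000)]

# Scale 1.0 is plain rounding; scale 4.0 gives a four times finer effective step.
symbols_coarse, rec_coarse = quantize_with_scale(latents, [1.0] * len(latents))
symbols_fine, rec_fine = quantize_with_scale(latents, [4.0] * len(latents))

mse_coarse = mean_squared_error(latents, rec_coarse)
mse_fine = mean_squared_error(latents, rec_fine)

print(f"MSE with scale 1.0: {mse_coarse:.4f}, distinct symbols: {len(set(symbols_coarse))}")
print(f"MSE with scale 4.0: {mse_fine:.4f}, distinct symbols: {len(set(symbols_fine))}")
```

A larger scale lowers the quantization error in the original latent domain but spreads the values over more integer symbols, which generally costs more bits. A content-adaptive scheme would choose larger scales where the content matters more and smaller ones elsewhere.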

Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks

A GAN-based Tunable Image Compression System

Deep Image Compression via End-to-End Learning

Learning Better Lossless Compression Using Lossy Compression

Spatially adaptive image compression using a tiled deep network

Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression

Improved deep learning image compression model: performance optimization based on convolutional modules and local attention mechanism

Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

Efficient Learned Lossless JPEG Recompression

Deep learning-based Edge-aware pre and post-processing methods for JPEG compressed images

Practical Full Resolution Learned Lossless Image Compression

Real-Time Adaptive Image Compression

On the Impact of Lossy Image and Video Compression on the Performance of Deep Convolutional Neural Network Architectures

Improving Inference for Neural Image Compression

Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Joint Hierarchical Priors and Adaptive Spatial Resolution for Efficient Neural Image Compression

Sibling Neural Estimators: Improving Iterative Image Decoding with Gradient Communication

End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling

Fast and High-Performance Learned Image Compression With Improved Checkerboard Context Model, Deformable Residual Module, and Knowledge Distillation