Abstract: Recently, deep learning-based image compression has made significant progress and has achieved better rate-distortion (R-D) performance than the latest traditional method, H.266/VVC, in both the MS-SSIM metric and the more challenging PSNR metric. However, a major problem is that the complexity of many leading learned schemes is too high. In this paper, we propose an efficient and effective image coding framework that achieves R-D performance similar to the state of the art with lower complexity. First, we develop an improved multi-scale residual block (MSRB) that expands the receptive field and captures global information more efficiently, which further reduces the spatial correlation of the latent representations. Second, we introduce an importance scaling network that directly scales the latents to achieve content-adaptive bit allocation without sending side information, which is more flexible than previous importance map methods. Third, we apply a post-quantization filter (PQF) to reduce the quantization error, motivated by the Sample Adaptive Offset (SAO) filter in video coding. Moreover, our experiments show that the performance of the system is less sensitive to the complexity of the decoder. Therefore, we design an asymmetric paradigm in which the encoder employs three stages of MSRBs to improve the learning capacity, whereas the decoder uses only one MSRB stage, which reduces the decoder complexity while still yielding satisfactory performance. Experimental results show that, compared to the state-of-the-art method, encoding and decoding with the proposed method are about 17 times faster, and the R-D performance is reduced by only about 1% on both the Kodak and Tecnick-40 datasets, which is still better than H.266/VVC (4:4:4) and other leading learning-based methods. Our source code is publicly available at https://github.com/fengyurenpingsheng.
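The idea that scaling latents before quantization changes how many bits (and how much error) a region receives can be illustrated with a toy example. The sketch below is not the paper's importance scaling network: it assumes a single hand-picked scale factor that both encoder and decoder already know, and the names `quantize_with_scale` and `max_abs_error` are hypothetical. It only shows the general principle that multiplying a value by `scale` before rounding, and dividing afterwards, bounds the reconstruction error by `0.5 / scale`.

```python
import random


def quantize_with_scale(latents, scale):
    """Scale each latent, round to the nearest integer, then undo the scaling.

    Assumption: `scale` is a fixed positive number known to both sides.
    The abstract does not say how the paper derives it without side information.
    """
    return [round(x * scale) / scale for x in latents]


def max_abs_error(latents, scale):
    """Largest absolute reconstruction error for a given scale."""
    recon = quantize_with_scale(latents, scale)
    return max(abs(x - r) for x, r in zip(latents, recon))


random.seed(0)
# Synthetic stand-in for latent values; not real codec data.
latents = [random.uniform(-4.0, 4.0) for _ in range(1000)]

coarse_error = max_abs_error(latents, 0.5)  # effective step 2.0: fewer symbols, larger error
fine_error = max_abs_error(latents, 2.0)    # effective step 0.5: more symbols, smaller error

print(f"coarse max error: {coarse_error:.3f}")
print(f"fine max error:   {fine_error:.3f}")
```

A larger scale gives a finer effective quantization step and a smaller error at the cost of more distinct symbols to code, which is the trade-off a content-adaptive scheme would adjust per region.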

BSSIC: Stereo Image Compression Based on Block Shift

Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model

Disparity-based Stereo Image Compression with Aligned Cross-View Priors

ECSIC: Epipolar Cross Attention for Stereo Image Compression

FFCA-Net: Stereo Image Compression via Fast Cascade Alignment of Side Information

Fast Stereoscopic Frame Estimation and Interpolation Algorithm in Compressed Stereo Video Streams

Deep Homography for Efficient Stereo Image Compression

Efficient Hybrid Feature Interaction Network for Stereo Image Super-Resolution

Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Dual-branch spectral–spatial feature extraction network for multispectral image compression

Learned Block-based Hybrid Image Compression

SSSIC: Semantics-to-Signal Scalable Image Coding with Learned Structural Representations

SDI-Net: Toward Sufficient Dual-View Interaction for Low-light Stereo Image Enhancement

A Convolutional Neural Network-Based Quantization Method for Block Compressed Sensing of Images

Low-Latency Neural Stereo Streaming

A New Robust Multiple Description Coding Method for Image Based on Block Compressed Sensing

A stereo matching network with a cascade spatial pyramid pooling (CSPP) substructure

Low-light Stereo Image Enhancement and De-noising in the Low-frequency Information Enhanced Image Space

Region-of-interest and channel attention-based joint optimization of image compression and computer vision

Line-Based Distributed Coding Scheme For Onboard Lossless Compression Of High-Resolution Stereo Images