Abstract: Among recent deep image compression frameworks, transform coding combined with a context-adaptive entropy model is the most representative approach for achieving the best coding performance. For the entropy model, 2D masked convolution is widely used to capture the spatial context, but it omits the correlations along the channel dimension. To complement the spatial context, a cross channel context model is proposed. For the transform, we investigate how additional network layers, added to improve its representation ability, should be allocated between the forward and inverse transforms. After analyzing the scheme of deep image compression connected with a loop filter, we find that this investigation can be regarded as a more generalized loop filter. The proposed cross channel context model and generalized loop filter (CCCMGLF) are integrated into the deep image compression framework and jointly optimized to improve the coding performance. Experimental results demonstrate that, using PSNR as the distortion metric, the proposed CCCMGLF outperforms VTM-11.0 with BD-rate reductions of 1.20%, 10.82% and 5.38% for the Y, U and V components, respectively, on the Kodak dataset. On the JVET CTC sequences, the proposed method outperforms VTM-11.0 by 1.44% for Y but incurs a coding performance loss of 24.74% and 11.91% for U and V, respectively. Over the baseline deep compression framework, the proposed method provides performance improvements of 7.80%, 12.66% and 11.15% for Y, U and V, respectively, on the Kodak dataset, and of 9.10%, 12.27% and 12.68% for Y, U and V, respectively, on the JVET CTC sequences. The proposed approaches are applicable to both image compression and intra coding in video compression.
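
As an illustration of the 2D masked convolution mentioned in the abstract, the following minimal sketch builds the kind of causal spatial mask such a layer typically uses. It assumes raster-scan decoding order and a 5x5 kernel; the function name `causal_mask_2d` and these choices are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def causal_mask_2d(kernel_size: int) -> np.ndarray:
    """Binary k x k mask keeping only positions that precede the centre
    in raster-scan order (assumed decoding order, not from the paper)."""
    if kernel_size % 2 == 0:
        raise ValueError("kernel_size must be odd")
    mask = np.zeros((kernel_size, kernel_size), dtype=np.float32)
    c = kernel_size // 2
    mask[:c, :] = 1.0   # every row above the centre row
    mask[c, :c] = 1.0   # positions to the left of the centre in its own row
    return mask


mask = causal_mask_2d(5)
print(mask)
print(int(mask.sum()))  # 12 visible neighbours out of 25 positions
```

In a masked convolution, a mask of this shape multiplies the spatial kernel weights, so the centre position contributes nothing to the context. Under that assumption, the values of other channels at the position being coded are unavailable to the context model, which is the gap a channel-wise context is meant to fill.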

Bilateral Context Modeling for Residual Coding in Lossless 3D Medical Image Compression

Learning Lossless Compression for High Bit-Depth Volumetric Medical Image

Deep Lossy Plus Residual Coding for Lossless and Near-lossless Image Compression

Context-based, adaptive, lossless image coding

DBVC: an End-to-End 3-D Deep Biomedical Video Coding Framework

Context-Based Lossless Compression of Mosaic Image with Bayer Pattern

Improved Deep Image Compression with Joint Optimization of Cross Channel Context Model and Generalized Loop Filter

Corner-to-Center Long-range Context Model for Efficient Learned Image Compression

Gated Context Model with Embedded Priors for Deep Image Compression

A Cross Channel Context Model for Latents in Deep Image Compression

Content Adaptive Checkerboard Context Model for Learned Image Compression

Progressive Deep Image Compression for Hybrid Contexts of Image Classification and Reconstruction

Lightweight Context Model Equipped aiWave in Response to the AVS Call for Evidence on Volumetric Medical Image Coding

Learning Context-Based Non-local Entropy Modeling for Image Compression

Diagnosis-oriented Medical Image Compression with Efficient Transfer Learning

Learning Context-Based Nonlocal Entropy Modeling for Image Compression

Streaming Lossless Volumetric Compression of Medical Images Using Gated Recurrent Convolutional Neural Network

Learned Image Compression with Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

Model-based compression for 3D medical images stored in the DICOM format

End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling

Object-Based Image Coding: A Learning-Driven Revisit