Abstract: Image compression is a widely used technique for reducing the spatial redundancy in images. Recently, learning-based image compression has made significant progress by exploiting the powerful representation ability of neural networks. However, current state-of-the-art learning-based image compression methods suffer from high computational cost, which limits their practical applicability. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) built on three new components: a channel attention module, a Gaussian mixture model, and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning-based image compression. To improve coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. In addition, the Gaussian mixture model is introduced as the entropy model and improves the accuracy of bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated into the Deep Video Compression (DVC) framework to further improve video compression performance. At the same time, EDIC significantly boosts coding performance at only a slight increase in computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms current state-of-the-art image compression methods and can be more than 150 times faster in decoding speed than Minnen's method. The proposed framework also successfully improves the performance of the recent deep video compression system DVC. Our code will be released at https://github.com/liujiaheng/compression.
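
To make the role of the Gaussian mixture entropy model more concrete, the following is a minimal sketch of how a bitrate estimate can be computed from mixture parameters. It is not the EDIC implementation: the function names (`gmm_bits`, `estimate_bitrate`), the integer rounding of latents, the unit-width quantization bin, and the example parameters are all assumptions made for illustration. In a learned codec these parameters would be predicted by a network for each latent element.

```python
import math

def _normal_cdf(x, mean, scale):
    """CDF of a Gaussian with the given mean and standard deviation."""
    return 0.5 * (1.0 + math.erf((x - mean) / (scale * math.sqrt(2.0))))

def gmm_bits(y_hat, weights, means, scales, eps=1e-9):
    """Estimated bits for one integer-quantized latent value (hypothetical helper).

    The probability of y_hat is the mixture mass falling inside the
    unit-width bin [y_hat - 0.5, y_hat + 0.5]; the cost is -log2 of it.
    """
    p = 0.0
    for w, m, s in zip(weights, means, scales):
        p += w * (_normal_cdf(y_hat + 0.5, m, s) - _normal_cdf(y_hat - 0.5, m, s))
    # Clamp to avoid log(0) for values far out in the tails.
    return -math.log2(max(p, eps))

def estimate_bitrate(latents, params):
    """Sum the per-element bit estimates (assumes independent elements)."""
    return sum(gmm_bits(y, *p) for y, p in zip(latents, params))

# Illustrative two-component mixture (assumed values, not from the paper).
mixture = ((0.6, 0.4), (0.0, 3.0), (1.0, 2.0))
total_bits = estimate_bitrate([0, 1, 3], [mixture] * 3)
```

A value close to a mixture mode receives high probability and therefore costs few bits, while a value far in the tails costs many. A closer fit between the modeled and the actual latent distribution gives a more accurate bitrate estimate, which is the motivation the abstract gives for using a mixture.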

Consistency Guided Diffusion Model with Neural Syntax for Perceptual Image Compression

Lossy Image Compression with Conditional Diffusion Models

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder

A Residual Diffusion Model for High Perceptual Quality Codec Augmentation

Extreme Video Compression with Pre-trained Diffusion Models

Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

High Frequency Matters: Uncertainty Guided Image Compression with Wavelet Diffusion

Lossy Image Compression with Foundation Diffusion Models

Extreme Generative Image Compression by Learning Text Embedding from Diffusion Models

Content-aware Deep Perceptual Image Compression

Diffusion-based Extreme Image Compression with Compressed Feature Initialization

Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds

Map-Assisted Remote-Sensing Image Compression at Extremely Low Bitrates

A Unified End-to-End Framework for Efficient Deep Image Compression

Semantic Ensemble Loss and Latent Refinement for High-Fidelity Neural Image Compression

Substitutional Neural Image Compression

Diffusion-Driven Semantic Communication for Generative Models with Bandwidth Constraints