Abstract:Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current state-of-the-art learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current state-of-the-art image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method. The proposed framework also successfully improves the performance of the recent deep video compression system DVC. Our code will be released at <a class="link-external link-https" href="https://github.com/liujiaheng/compression" rel="external noopener nofollow">this https URL</a>.

Learned Image Compression for Both Humans and Machines Via Dynamic Adaptation

Dynamic Low-Rank Instance Adaptation for Universal Neural Image Compression

Learned Image Coding for Machines: A Content-Adaptive Approach

Unveiling the Future of Human and Machine Coding: A Survey of End-to-End Learned Image Compression

Machine Perception-Driven Image Compression: A Layered Generative Approach

Learned Image Compression for Machine Perception

DNN-Compressed Domain Visual Recognition with Feature Adaptation

Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Unified and Scalable Deep Image Compression Framework for Human and Machine

Toward Scalable Image Feature Compression: A Content-Adaptive and Diffusion-Based Approach

Deep Image Compression Toward Machine Vision: A Unified Optimization Framework

Deep Image Compression Towards Machine Vision: A Unified Optimization Framework

A Unified Image Compression Method for Human Perception and Multiple Vision Tasks

Real-Time Adaptive Image Compression

Rate-Distortion-Cognition Controllable Versatile Neural Image Compression

Learned Image Compression Using Adaptive Block-Wise Encoding and Reconstruction Network

Multi-Modality Deep Network for Extreme Learned Image Compression

Interpretable Learned Image Compression: A Frequency Transform Decomposition Perspective

Task-Driven Video Compression for Humans and Machines: Framework Design and Optimization

A Universal Optimization Framework for Learning-based Image Codec

A Unified End-to-End Framework for Efficient Deep Image Compression