Abstract:Image compression is a widely used technique to reduce the spatial redundancy in images. Recently, learning based image compression has achieved significant progress by using the powerful representation ability from neural networks. However, the current state-of-the-art learning based image compression methods suffer from the huge computational cost, which limits their capacity for practical applications. In this paper, we propose a unified framework called Efficient Deep Image Compression (EDIC) based on three new technologies, including a channel attention module, a Gaussian mixture model and a decoder-side enhancement module. Specifically, we design an auto-encoder style network for learning based image compression. To improve the coding efficiency, we exploit the channel relationship between latent representations by using the channel attention module. Besides, the Gaussian mixture model is introduced for the entropy model and improves the accuracy for bitrate estimation. Furthermore, we introduce the decoder-side enhancement module to further improve image compression performance. Our EDIC method can also be readily incorporated with the Deep Video Compression (DVC) framework to further improve the video compression performance. Simultaneously, our EDIC method boosts the coding performance significantly while bringing slightly increased computational cost. More importantly, experimental results demonstrate that the proposed approach outperforms the current state-of-the-art image compression methods and is up to more than 150 times faster in terms of decoding speed when compared with Minnen's method. The proposed framework also successfully improves the performance of the recent deep video compression system DVC. Our code will be released at <a class="link-external link-https" href="https://github.com/liujiaheng/compression" rel="external noopener nofollow">this https URL</a>.

Learning-Based Video Coding with Joint Deep Compression and Enhancement

Deep Video Coding with Dual-Path Generative Adversarial Network

A Neural-network Enhanced Video Coding Framework beyond ECM

High Efficiency Deep-learning Based Video Compression

A Unified End-to-End Framework for Efficient Deep Image Compression

FVC: An End-to-End Framework Towards Deep Video Compression in Feature Space

Learned Video Compression with Adaptive Temporal Prior and Decoded Motion-aided Quality Enhancement

Deep Learning-Based Video Coding

<Emphasis Type="Italic">CodedVision</Emphasis>: Towards Joint Image Understanding and Compression via End-to-End Learning

DeepCoder: A Deep Neural Network Based Video Compression

Accelerating Learnt Video Codecs with Gradient Decay and Layer-wise Distillation

Low-complexity Deep Video Compression with A Distributed Coding Architecture

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Scaling, and Post-Quantization Filtering

Deep Learning-Based Video Coding: A Review and A Case Study

Decomposition, Compression, and Synthesis (DCS)-based Video Coding: A Neural Exploration via Resolution-Adaptive Learning

M-LVC: Multiple Frames Prediction for Learned Video Compression

Neural Video Coding Using Multiscale Motion Compensation and Spatiotemporal Context Model

Deep Predictive Video Compression Using Mode-Selective Uni- and Bi-Directional Predictions Based on Multi-Frame Hypothesis

Enhanced Motion-Compensated Video Coding with Deep Virtual Reference Frame Generation

Content Adaptive and Error Propagation Aware Deep Video Compression