Abstract: Recent deep image compression methods have made notable progress by exploiting the nonlinear modeling and powerful representation capabilities of neural networks. However, most existing learning-based image compression approaches employ customized convolutional neural networks (CNNs) that treat all pixels equally when extracting visual features, neglecting the effect of local key features. Meanwhile, the convolutional filters in a CNN usually express the local spatial relationships within the receptive field and seldom consider long-range dependencies between distant locations. As a result, the long-range dependencies of latent representations are not fully compressed. To address these issues, an end-to-end image compression method is proposed that integrates graph attention and an asymmetric convolutional neural network (ACNN). Specifically, the ACNN is used to strengthen the effect of local key features and reduce the cost of model training. Graph attention is introduced into image compression to address the bottleneck of CNNs in modeling long-range dependencies. Furthermore, because existing attention mechanisms for image compression hardly share information, we propose a self-attention approach that allows information flow in order to achieve reasonable bit allocation. The proposed self-attention approach is consistent with the perceptual characteristics of the human visual system, as information can interact across attention modules. Moreover, it takes into account channel-level relationships and positional information to improve the compression of richly textured regions. Experimental results demonstrate that the proposed method, when optimized for MS-SSIM, achieves state-of-the-art rate-distortion performance compared with recent deep compression models on the Kodak and Tecnick benchmark datasets. The project page with the source code can be found at https://mic.tongji.edu.cn.

Non-local Attention Optimized Deep Image Compression

Neural Image Compression via Non-Local Attention Optimization and Improved Context Modeling

End-to-End Learnt Image Compression via Non-Local Attention Optimization and Improved Context Modeling

Subjective Quality Optimized Efficient Image Compression

Optimized Decoupled Structure with Non-Local Attention for Deep Image Compression

Learned Image Compression Using A Long and Short Attention Module

Enhancing High-Resolution Image Compression Through Local-Global Joint Attention Mechanism

Enhancing Learned Image Compression via Cross Window-based Attention

Joint Graph Attention and Asymmetric Convolutional Neural Network for Deep Image Compression

Region-of-interest and channel attention-based joint optimization of image compression and computer vision

Neural Image Compression Via Attentional Multi-scale Back Projection and Frequency Decomposition

Improved deep learning image compression model: performance optimization based on convolutional modules and local attention mechanism

Learned Image Compression With Gaussian-Laplacian-Logistic Mixture Model and Concatenated Residual Modules

Learned Image Compression with Inception Residual Blocks and Multi-Scale Attention Module

Improving Inference for Neural Image Compression

High-Efficiency Lossy Image Coding Through Adaptive Neighborhood Information Aggregation

Learned image compression via neighborhood-based attention optimization and context modeling with multi-scale guiding

A Unified End-to-End Framework for Efficient Deep Image Compression

Learning Context-Based Nonlocal Entropy Modeling for Image Compression

Noise-to-Compression Variational Autoencoder for Efficient End-to-End Optimized Image Coding