Abstract:Image compression distortion can cause performance degradation of machine analysis tasks, therefore recent years have witnessed fast progress in developing deep image compression methods optimized for machine perception. However, the investigation still lacks for saliency segmentation. First, in this paper we propose a deep compression network increasing local signal fidelity of important image pixels for saliency segmentation, which is different from existing methods utilizing the analysis network loss for backward propagation. By this means, these two types of networks can be decoupled to improve the compatibility of proposed compression method for diverse saliency segmentation networks. Second, pixel-level bit weights are modeled with probability distribution in the proposed bit allocation method. The ascending cosine roll-down (ACRD) function allocates bits to those important pixels, which fits the essence that saliency segmentation can be regarded as the pixel-level bi-classification task. Third, the compression network is trained without the help of saliency segmentation, where latent representations are decomposed into base and enhancement channels. Base channels are retained in the whole image, while enhancement channels are utilized only for important pixels, and therefore more bits can benefit saliency segmentation via enhancement channels. Extensive experimental results demonstrate that the proposed method can save an average of 10.34% bitrate compared with the state-of-the-art deep image compression method, where the rate-accuracy (R-A) performances are evaluated on sixteen downstream saliency segmentation networks with five conventional SOD datasets.

Saliency Map-Guided End-to-End Image Coding for Machines

Towards Efficient Learned Image Coding for Machines Via Saliency-Driven Rate Allocation.

Bridging the gap between image coding for machines and humans

Image Coding for Machines with Edge Information Learning Using Segment Anything

Learnt Mutual Feature Compression for Machine Vision

Image Coding for Machines based on Non-Uniform Importance Allocation.

Image Coding for Machines with Object Region Learning

Remote Sensing Image Coding for Machines on Semantic Segmentation via Contrastive Learning

Improving Image Coding for Machines through Optimizing Encoder via Auxiliary Loss

Perceptual Video Coding for Machines via Satisfied Machine Ratio Modeling

Activation Map Saliency Guided Filtering for Efficient Image Compression for Vision Tasks

End-to-End Learned Scalable Multilayer Feature Compression for Machine Vision Tasks

Composable Image Coding for Machine Via Task-oriented Internal Adaptor and External Prior

Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs

Image Coding for Machines with Omnipotent Feature Learning

LL-ICM: Image Compression for Low-level Machine Vision via Large Vision-Language Model

Deep Image Compression Towards Machine Vision: A Unified Optimization Framework

Deep Image Compression Toward Machine Vision: A Unified Optimization Framework

Learned Image Coding for Human-Machine Collaborative Optimization

Saliency Segmentation Oriented Deep Image Compression with Novel Bit Allocation

Hybrid Single Input and Multiple Output Method for Compressing Features Towards Machine Vision Tasks