Abstract:Neural compression has the potential to revolutionize lossy image compression. Based on generative models, recent schemes achieve unprecedented compression rates at high perceptual quality but compromise semantic fidelity. Details of decompressed images may appear optically flawless but semantically different from the originals, making compression errors difficult or impossible to detect. We explore the problem space and propose a provisional taxonomy of miscompressions. It defines three types of 'what happens' and has a binary 'high impact' flag indicating miscompressions that alter symbols. We discuss how the taxonomy can facilitate risk communication and research into mitigations.

What problem does this paper attempt to address?

This paper attempts to solve the problem of semantic errors introduced by neural compression techniques in image compression. Specifically, the paper focuses on: 1. **Advantages and Disadvantages of Neural Compression Techniques**: Neural compression schemes based on generative models can achieve unprecedented compression rates while maintaining high - perceptual quality. However, this technique sacrifices semantic fidelity, that is, although the details of the decompressed image may look optically flawless, there may be semantic differences from the original image, making compression errors difficult or impossible to detect. 2. **Proposing the Concept of "Miscompression"**: In order to describe the semantic changes caused by lossy compression, the author proposes the term "miscompression". Miscompression refers to the situation where the semantic meaning of the reconstructed image is inconsistent with that of the original image during the neural compression process. These changes may be subtle, but may have a significant impact for certain application scenarios (such as forensic investigations). 3. **Constructing a Miscompression Classification System**: In order to solve and study the miscompression problem, the author has developed a preliminary classification system (taxonomy). This classification system is based on an exploratory visual inspection of three benchmark datasets and examines the performance of five different neural compression schemes under different quality settings. According to the obvious characteristics of signal transformation, three main types of miscompression are defined: - **Amplitude**: Refers to the change in the amplitude of spatial frequency in the image signal, such as the change in brightness, color saturation or the intensity of high - frequency components. - **Geometry**: Refers to geometric transformations, such as translation, rotation, scaling and shearing. - **Shape**: Refers to the change in the shape of an object, which may be caused by the deviation in the retrieval - enhancement process. 4. **Emphasizing the Influence of Symbols**: In order to further classify the potential semantic impacts of miscompression, the author introduces the "Symbol" modifier. When the affected object is a symbol (such as a letter, number, logo, etc.), even a slight change may completely change its semantic meaning. 5. **Exploring the Practical Applications and Risks of Miscompression**: The paper discusses the extensive impacts of miscompression on forensic science and society, and proposes directions for future research, including how to reduce the occurrence of miscompression by improving the compression algorithm, and how to detect and deal with the existing miscompression risks. In summary, this paper aims to reveal the new challenges brought by neural compression techniques and provide guidance for subsequent research and practical applications through systematic classification and analysis.

A Taxonomy of Miscompressions: Preparing Image Forensics for Neural Compression

Machine Perceptual Quality: Evaluating the Impact of Severe Lossy Compression on Audio and Image Models

An Introduction to Neural Data Compression

Towards improved lossy image compression: Human image reconstruction with public-domain images

Context-Based Lossless Compression of Mosaic Image with Bayer Pattern

Towards Robust Neural Image Compression: Adversarial Attack and Model Finetuning

Toward Robust Neural Image Compression: Adversarial Attack and Model Finetuning

Can Image Compression Rely on CLIP?

Better Compression With Deep Pre-Editing

Neural Image Compression: Generalization, Robustness, and Spectral Biases

Learned Image Compression for Machine Perception

Controlling Rate, Distortion, and Realism: Towards a Single Comprehensive Neural Image Compression Model

Advancing The Rate-Distortion-Computation Frontier For Neural Image Compression

Substitutional Neural Image Compression

Facial Image Compression via Neural Image Manifold Compression

Computationally-Efficient Neural Image Compression with Shallow Decoders

Recompression Based JPEG Tamper Detection and Localization Using Deep Neural Network Eliminating Compression Factor Dependency

Lossy Image Compression with Conditional Diffusion Models

Conditional Hallucinations for Image Compression

Improving Inference for Neural Image Compression

Machine Perception-Driven Image Compression: A Layered Generative Approach