Abstract: While raw images possess distinct advantages over sRGB images, e.g., linearity and fine-grained quantization levels, they are not widely adopted by general users due to their substantial storage requirements. Very recent studies propose to compress raw images by designing sampling masks within the pixel space of the raw image. However, these approaches leave room for more effective image representations and more compact metadata. In this work, we propose a novel framework that learns a compact representation in the latent space, serving as metadata, in an end-to-end manner. We analyze how the raw image reconstruction task differs intrinsically from lossy image compression, a difference caused by the rich information available from the sRGB image. Based on this analysis, we propose a novel backbone design with asymmetric and hybrid spatial feature resolutions, which significantly improves the rate-distortion performance. In addition, we propose a novel sRGB-guided context model, which better predicts the order masks of encoding/decoding based on both the sRGB image and the masks of already processed features. Because the correlation between order masks is modeled better, the already processed information can be utilized more effectively. Moreover, a novel sRGB-guided adaptive quantization precision strategy, which dynamically assigns varying levels of quantization precision to different regions, further enhances the representation ability of the model. Finally, based on the iterative properties of the proposed context model, we propose a novel strategy to achieve variable bit rates using a single model. This strategy allows for the continuous convergence of a wide range of bit rates. We demonstrate how our raw image compression scheme effectively allocates more bits to image regions that hold greater global importance. Extensive experimental results validate the superior performance of the proposed method, which achieves high-quality raw image reconstruction with a smaller metadata size than existing SOTA methods.
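To make the idea of region-dependent quantization precision concrete, the following is a minimal sketch and not the method proposed in the paper. It assumes a hypothetical per-element importance score in [0, 1] (standing in for whatever guidance is derived from the sRGB image) and a hypothetical linear rule that maps importance to a quantization step; the names `step_sizes`, `quantize`, and `dequantize` are illustrative only.

```python
from typing import List


def step_sizes(importance: List[float],
               coarse: float = 1.0,
               fine: float = 0.25) -> List[float]:
    """Map importance in [0, 1] to a quantization step.

    Hypothetical rule: importance 0 gets the coarse step and
    importance 1 gets the fine step, interpolated linearly.
    """
    clipped = [min(max(w, 0.0), 1.0) for w in importance]
    return [coarse + (fine - coarse) * w for w in clipped]


def quantize(latent: List[float], steps: List[float]) -> List[int]:
    """Uniform scalar quantization with a per-element step."""
    return [round(y / s) for y, s in zip(latent, steps)]


def dequantize(symbols: List[int], steps: List[float]) -> List[float]:
    """Reconstruct latent values from the integer symbols."""
    return [q * s for q, s in zip(symbols, steps)]


# Toy latent vector: the last two elements are marked as important.
latent = [0.37, -1.62, 2.05, 0.90]
importance = [0.0, 0.0, 1.0, 1.0]

steps = step_sizes(importance)
symbols = quantize(latent, steps)
recon = dequantize(symbols, steps)
errors = [abs(a - b) for a, b in zip(latent, recon)]
```

In this toy setting the reconstruction error of each element is bounded by half its step, so elements with higher importance are reconstructed more precisely at the cost of a larger symbol range to encode.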

RAW Image Reconstruction Using a Self-contained sRGB–JPEG Image with Small Memory Overhead

Metadata-Based RAW Reconstruction Via Implicit Neural Functions

Beyond Learned Metadata-Based Raw Image Reconstruction

Learning sRGB-to-Raw-RGB De-rendering with Content-Aware Metadata

Raw Image Reconstruction with Learned Compact Metadata

Invertible Image Signal Processing

Efficient Visual Computing with Camera RAW Snapshots

In-Camera Raw Compression: A New Paradigm from Image Acquisition to Display

Efficient HDR Reconstruction from Real-World Raw Images

A Learnable Color Correction Matrix for RAW Reconstruction

Raw Instinct: Trust Your Classifiers and Skip the Conversion

High Dynamic Range and Super-Resolution from Raw Image Bursts

RawHDR: High Dynamic Range Image Reconstruction from a Single Raw Image

RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation

Towards Low Light Enhancement With RAW Images

Anti-Shake HDR Imaging Using RAW Image Data

Efficient Image Details Preservation of Image Processing Pipeline Based on Two-Stage Tone Mapping