Efficient Progressive Image Compression with Variance-aware Masking

Alberto Presta,Enzo Tartaglione,Attilio Fiandrotti,Marco Grangetto,Pamela Cosman
2024-11-15
Abstract:Learned progressive image compression is gaining momentum as it allows improved image reconstruction as more bits are decoded at the receiver. We propose a progressive image compression method in which an image is first represented as a pair of base-quality and top-quality latent representations. Next, a residual latent representation is encoded as the element-wise difference between the top and base representations. Our scheme enables progressive image compression with element-wise granularity by introducing a masking system that ranks each element of the residual latent representation from most to least important, dividing it into complementary components, which can be transmitted separately to the decoder in order to obtain different reconstruction quality. The masking system does not add further parameters nor complexity. At the receiver, any elements of the top latent representation excluded from the transmitted components can be independently replaced with the mean predicted by the hyperprior architecture, ensuring reliable reconstructions at any intermediate quality level. We also introduced Rate Enhancement Modules (REMs), which refine the estimation of entropy parameters using already decoded components. We obtain results competitive with state-of-the-art competitors, while significantly reducing computational complexity, decoding time, and number of parameters.
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
### What problems does this paper attempt to solve? This paper aims to solve several key problems in image compression, especially in the field of Progressive Image Compression. Specifically, the authors propose a new progressive image compression method, which mainly solves the following problems: 1. **Progressive Image Quality Improvement**: - In progressive image compression, as the receiving end decodes more bits, the reconstructed image quality should gradually improve. Existing methods face challenges in achieving this feature. For example, different bitstreams need to be encoded and transmitted for each different bit - rate target. The method proposed in this paper achieves Fine - Grained Scalability (FGS) by introducing the residual latent representation and its element - level differences, thus allowing image reconstruction at different quality levels within the same bitstream. 2. **Computational Complexity and Resource Consumption**: - Some existing progressive image compression methods, although having good performance, have large problems in terms of computational complexity, decoding time, and the number of parameters. The scheme proposed in this paper significantly reduces the overhead in these aspects, making progressive image compression more efficient and practical. 3. **Flexible Quality Control**: - The proposed method can flexibly select and transmit important elements in the residual latent representation by introducing a masking system, so as to obtain high - quality image reconstruction at different bit - rates. This allows users to adjust the image quality according to actual needs without re - encoding the entire image. 4. **Robustness of Progressive Coding**: - In order to ensure reliable reconstruction at any intermediate quality level, the method proposed in this paper replaces the un - received top - level latent representation elements at the receiving end and fills them with the mean predicted by the hyperprior architecture, ensuring the robustness and stability of progressive coding. 5. **Entropy Parameter Estimation Optimization**: - To further improve the accuracy of entropy parameter estimation, this paper introduces Rate Enhancement Modules (REMs). These modules use the decoded components to improve the estimation of entropy parameters, thereby improving the overall compression efficiency. ### Summary In general, this paper proposes an efficient progressive image compression method. By introducing the residual latent representation, the masking system, and the rate enhancement modules, it solves the problems of high computational complexity and poor flexibility in existing methods for progressive image compression, and at the same time, it reaches a level comparable to existing advanced methods in terms of Rate - Distortion Performance.