Self-Asymmetric Invertible Network for Compression-Aware Image Rescaling

Jinhai Yang,Mengxi Guo,Shijie Zhao,Junlin Li,Li Zhang
2023-03-12
Abstract:High-resolution (HR) images are usually downscaled to low-resolution (LR) ones for better display and afterward upscaled back to the original size to recover details. Recent work in image rescaling formulates downscaling and upscaling as a unified task and learns a bijective mapping between HR and LR via invertible networks. However, in real-world applications (e.g., social media), most images are compressed for transmission. Lossy compression will lead to irreversible information loss on LR images, hence damaging the inverse upscaling procedure and degrading the reconstruction accuracy. In this paper, we propose the Self-Asymmetric Invertible Network (SAIN) for compression-aware image rescaling. To tackle the distribution shift, we first develop an end-to-end asymmetric framework with two separate bijective mappings for high-quality and compressed LR images, respectively. Then, based on empirical analysis of this framework, we model the distribution of the lost information (including downscaling and compression) using isotropic Gaussian mixtures and propose the Enhanced Invertible Block to derive high-quality/compressed LR images in one forward pass. Besides, we design a set of losses to regularize the learned LR images and enhance the invertibility. Extensive experiments demonstrate the consistent improvements of SAIN across various image rescaling datasets in terms of both quantitative and qualitative evaluation under standard image compression formats (i.e., JPEG and WebP).
Computer Vision and Pattern Recognition,Multimedia,Image and Video Processing
What problem does this paper attempt to address?
The problem that this paper attempts to solve is to perform high - quality image rescaling in the case of image compression distortion. Specifically, the existing image rescaling methods perform poorly when dealing with compressed low - resolution (LR) images, because lossy compression (such as JPEG and WebP) will lead to irreversible information loss, thereby harming the up - sampling process and reducing the reconstruction accuracy. To solve this problem, the author proposes a Self - Asymmetric Invertible Network (SAIN), aiming to improve the rescaling quality of compression - distorted images by introducing a new framework. ### Main contributions: 1. **First study on image rescaling under compression distortion**: A method of integrating rescaling and compression into a reversible process is proposed, and the robustness to compression distortion is enhanced through decoupled modeling. 2. **Self - asymmetric framework**: A self - asymmetric framework containing Enhanced Invertible Blocks is designed, and a series of loss functions are introduced to improve the reconstruction quality and regularize LR features. 3. **Experimental results**: Quantitative and qualitative experimental results show that SAIN significantly outperforms existing methods under standard image codecs (such as JPEG and WebP). ### Method overview: - **Preliminary concept**: SAIN is based on Normalizing Flow Models. It decomposes high - resolution (HR) images into low - frequency and high - frequency parts through reversible transformation and learns the reversible mapping between these parts. - **Self - asymmetric framework**: This framework contains two independent bijective mappings, which are respectively used to generate high - quality LR images and simulate compressed LR images. In this way, SAIN can simultaneously simulate the down - scaling and compression processes in the forward pass and recover the compression distortion in the reverse pass. - **Enhanced Invertible Block**: To enhance the capacity of the model, SAIN introduces the Enhanced Invertible Block, which separates the generation of high - quality LR images and compressed - simulated LR images by adding additional calculations in the low - frequency branch. - **Isotropic Gaussian Mixture Model**: The Isotropic Gaussian Mixture Model is used to model the information loss caused by down - scaling and compression, so as to better recover details in the up - sampling process. ### Experimental results: - **Performance under JPEG distortion**: The performance of SAIN under different JPEG quality factors (QF) is significantly better than that of existing methods. Especially at low QF, SAIN can better recover image details and generate clear edges. - **Cross - dataset verification**: In addition to the tests on the DIV2K validation set, the performance of SAIN on four other standard benchmark datasets (Set5, Set14, BSD100, Urban100) is also better than that of existing methods. - **Performance under WebP distortion**: The performance of SAIN under WebP compression distortion is also excellent, further verifying its robustness under different compression formats. ### Conclusion: By introducing the self - asymmetric invertible network framework and the enhanced invertible block, SAIN successfully solves the rescaling problem of compression - distorted images and significantly improves the quality of image reconstruction. This method has important value in practical applications, especially on social media and websites that need to transmit a large number of images.