Soft-IntroVAE for Continuous Latent space Image Super-Resolution

Zhi-Song Liu, Zijia Wang, Zhen Jia
2023-07-18
Abstract: Continuous image super-resolution (SR) has recently received a lot of attention from researchers for its practical and flexible image scaling across various displays. Local implicit image representation is one of the methods that can map the coordinates and 2D features for latent space interpolation. Inspired by the Variational AutoEncoder, we propose a Soft-IntroVAE for continuous latent space image super-resolution (SVAE-SR). A novel latent space adversarial training is achieved for photo-realistic image restoration. To further improve the quality, a positional encoding scheme is used to extend the original pixel coordinates by aggregating frequency information over the pixel areas. We show the effectiveness of the proposed SVAE-SR through quantitative and qualitative comparisons, and further illustrate its generalization in denoising and real-image super-resolution.
Image and Video Processing, Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper addresses three challenges in continuous image super-resolution (SR):

1. **Arbitrary-Scale Image Enlargement**: Most existing image super-resolution methods handle only fixed scale factors (such as 2x or 4x) and cannot adapt to other enlargement needs. The goal of continuous image super-resolution is to enlarge images at arbitrary scales to meet the requirements of different display devices.
2. **Generating High-Quality Super-Resolution Images**: Continuous image super-resolution tends to produce overly smooth images and is sensitive to noise. Generating high-quality, realistic super-resolution images at arbitrary scales is therefore an important issue.
3. **Reducing Training Costs**: Existing super-resolution methods usually require a separately trained model for each fixed scale factor, which leads to high training costs. Continuous image super-resolution aims to reduce these costs by covering multiple scale factors with a single model.

To address these issues, the authors propose Soft-IntroVAE for Continuous Latent Space Image Super-Resolution (SVAE-SR). The method combines the advantages of Variational Autoencoders (VAE) and Local Implicit Image Functions (LIIF), achieving arbitrary-scale image super-resolution through interpolation in a continuous latent space. It also introduces positional encoding to extend the frequency information of the pixel coordinates, which improves the reconstruction of high-frequency signals and reduces over-smoothing. According to the reported experiments, SVAE-SR performs well across datasets and scale factors, and its advantage over existing methods is most pronounced on scale factors not seen during training.
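To make the coordinate-based idea concrete, here is a minimal sketch of how an arbitrary-scale query grid and a frequency-based positional encoding might be built. It is an illustration under stated assumptions, not the authors' implementation: the function names `make_coords` and `positional_encoding`, the `[-1, 1]` coordinate convention, and the choice of a generic sinusoidal (Fourier-feature) encoding with `num_freqs=4` are all hypothetical. The paper's own scheme aggregates frequency information over pixel areas, which this sketch does not attempt to reproduce.

```python
import math


def make_coords(height, width):
    """Return pixel-center coordinates in [-1, 1] for an output grid of any size.

    Assumption: coordinates are normalized to [-1, 1], a common convention
    for implicit image functions; the paper may use a different one.
    """
    ys = [-1 + (2 * i + 1) / height for i in range(height)]
    xs = [-1 + (2 * j + 1) / width for j in range(width)]
    return [(y, x) for y in ys for x in xs]


def positional_encoding(coord, num_freqs=4):
    """Generic sinusoidal encoding of a 2D coordinate (a stand-in, see lead-in).

    Each coordinate value is expanded into sin/cos pairs at frequencies
    pi, 2*pi, 4*pi, ..., giving 2 * 2 * num_freqs features in total.
    """
    features = []
    for value in coord:
        for k in range(num_freqs):
            freq = (2 ** k) * math.pi
            features.append(math.sin(freq * value))
            features.append(math.cos(freq * value))
    return features


# A 48x48 input upscaled by a non-integer factor of 2.5 needs a 120x120 query grid.
scale = 2.5
out_h, out_w = round(48 * scale), round(48 * scale)
coords = make_coords(out_h, out_w)

# Encode the first query coordinate; a decoder would consume these features
# together with the interpolated latent code at that location.
encoded = positional_encoding(coords[0])
```

Because the output grid is defined only by its height and width, the same code serves any scale factor, which is what lets a single model cover scales it was not trained on. The encoding step gives the decoder access to higher-frequency functions of the coordinate than the raw `(y, x)` pair alone.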