Soft-IntroVAE for Continuous Latent space Image Super-Resolution

Zhi-Song Liu,Zijia Wang,Zhen Jia

2023-07-18

Abstract:Continuous image super-resolution (SR) recently receives a lot of attention from researchers, for its practical and flexible image scaling for various displays. Local implicit image representation is one of the methods that can map the coordinates and 2D features for latent space interpolation. Inspired by Variational AutoEncoder, we propose a Soft-introVAE for continuous latent space image super-resolution (SVAE-SR). A novel latent space adversarial training is achieved for photo-realistic image restoration. To further improve the quality, a positional encoding scheme is used to extend the original pixel coordinates by aggregating frequency information over the pixel areas. We show the effectiveness of the proposed SVAE-SR through quantitative and qualitative comparisons, and further, illustrate its generalization in denoising and real-image super-resolution.

Image and Video Processing,Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper attempts to address several key challenges in Continuous Image Super-Resolution (SR): 1. **Arbitrary Scale Image Enlargement**: Most existing image super-resolution methods can only handle fixed scale enlargements (such as 2x, 4x) and cannot flexibly adapt to different enlargement needs. The goal of continuous image super-resolution is to enable image enlargement at arbitrary scales to meet the requirements of different display devices. 2. **Generating High-Quality Super-Resolution Images**: Continuous image super-resolution tends to produce overly smooth images and is sensitive to noise. Therefore, generating high-quality, realistic super-resolution images at arbitrary scales is an important issue. 3. **Reducing Training Costs**: Existing super-resolution methods usually require separate training models for each fixed enlargement scale, leading to high training costs. Continuous image super-resolution aims to reduce these training costs by achieving multiple enlargement scale tasks with a single model. To address these issues, the authors propose Soft-IntroVAE for Continuous Latent Space Image Super-Resolution (SV AE-SR). This method combines the advantages of Variational Autoencoders (VAE) and Local Implicit Image Functions (LIIF), achieving arbitrary scale image super-resolution through interpolation in continuous latent space. Additionally, the method introduces positional encoding to extend frequency information, thereby improving the reconstruction quality of high-frequency signals and reducing over-smoothing. Experimental results show that SV AE-SR performs excellently across different datasets and enlargement scales, especially outperforming existing methods when dealing with unseen enlargement scales.

Soft-IntroVAE for Continuous Latent space Image Super-Resolution

Super-resolution Variational Auto-Encoders

UltraSR: Spatial Encoding is a Missing Key for Implicit Image Function-based Arbitrary-Scale Super-Resolution

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Deformable Convolution Alignment and Dynamic Scale-Aware Network for Continuous-Scale Satellite Video Super-Resolution

SC-VAE: Sparse Coding-based Variational Autoencoder with Learned ISTA

VSpSR: Explorable Super-Resolution Via Variational Sparse Representation

Advancing Super-Resolution in Neural Radiance Fields via Variational Diffusion Strategies

Acute mitral regurgitation after acute myocardial infarction in a patient with a patent foramen ovale: review of the diagnosis and management of acute ischemic mitral regurgitation.

Hyperspectral Image Joint Super-Resolution via Local Implicit Spatial-Spectral Function Learning

Video Super-Resolution Via a Spatio-Temporal Alignment Network.

SuperVAE: Superpixelwise Variational Autoencoder for Salient Object Detection

IAA-VSR: an Iterative Alignment Algorithm for Video Super-Resolution.

Image Super-resolution Via Latent Diffusion: A Sampling-space Mixture Of Experts And Frequency-augmented Decoder Approach

Deep Feature Consistent Variational Autoencoder

SR-FEINR: Continuous Remote Sensing Image Super-Resolution Using Feature-Enhanced Implicit Neural Representation

AS-IntroVAE: Adversarial Similarity Distance Makes Robust IntroVAE

SSIF: Learning Continuous Image Representation for Spatial-Spectral Super-Resolution

VAE Learning via Stein Variational Gradient Descent

Towards Interpretable Video Super-Resolution Via Alternating Optimization.