Abstract:To address the problems in the existing video super-resolution methods, such as noise, over smooth and visual artifacts, which are caused by the reliance on limited external training or mismatch of internal similarity patch instances, this study proposes a novel video super-resolution reconstruction algorithm based on deep learning and spatio-temporal feature similarity (DLSS-VSR). The video super-resolution reconstruction mechanism with the joint internal and external constraints is established utilizing the complementary advantages of both external deep correlation mapping learning and internal spatio-temporal nonlocal self-similarity prior constraint. A deep learning model based on deep convolutional neural network is constructed to learn the nonlinear correlation mapping between low-resolution and high-resolution video frame patches. A novel spatio-temporal feature similarity calculation method is proposed, which considers both internal video spatio-temporal self-similarity and external clean nonlocal similarity. For the internal spatio-temporal feature self-similarity, we improve the accuracy and robustness of similarity matching by proposing a similarity measure strategy based on spatio-temporal moment feature similarity and structural similarity. The external nonlocal similarity prior constraint is learned by the patch group-based Gaussian mixture model. The time efficiency for spatio-temporal similarity matching is further improved based on saliency detection and region correlation judgment strategy, which achieves a better tradeoff between super-resolution accuracy and speed. Experimental results demonstrate that the DLSS-VSR algorithm achieves competitive super-resolution quality compared to other state-of-the-art algorithms in both subjective and objective evaluations.

Attention-guided dual spatial-temporal non-local network for video super-resolution

Lightweight Multi-Attention Fusion Network for Image Super-Resolution

Video super-resolution with phase-aided deformable alignment network

Video Super-Resolution Based on Multiple Networks Merging

Video super-resolution via dense non-local spatial-temporal convolutional network

Video super-resolution via mixed spatial-temporal convolution and selective fusion

Video Super-Resolution Via a Spatio-Temporal Alignment Network.

Non-Local Nested Residual Attention Network for Stereo Image Super-Resolution.

A Lightweight Recurrent Grouping Attention Network for Video Super-Resolution

Video Super-Resolution With Temporal Group Attention

You Only Align Once: Bidirectional Interaction for Spatial-Temporal Video Super-Resolution

Multi-Stage Feature Fusion Network for Video Super-Resolution

Attention-guided video super-resolution with recurrent multi-scale spatial–temporal transformer

Dual feature enhanced video super-resolution network based on low-light scenarios

S2A: Scale-Attention-Aware Networks for Video Super-Resolution.

Local-Global Dynamic Filtering Network for Video Super-Resolution

Video Super-Resolution Reconstruction Based on Deep Learning and Spatio-Temporal Feature Self-similarity

STDAN: Deformable Attention Network for Space-Time Video Super-Resolution

Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution

Multiframe Video Satellite Image Super-Resolution via Attention-Based Residual Learning

Fast Spatio-Temporal Residual Network For Video Super-Resolution