Abstract: Existing video super-resolution (VSR) methods generally adopt a recurrent propagation network to extract spatio-temporal information from entire video sequences, exhibiting impressive performance. However, the key components of recurrent VSR networks significantly impact model efficiency: the alignment module accounts for a substantial portion of the model parameters, while the bidirectional propagation mechanism considerably increases inference time. Consequently, developing a compact and efficient VSR method that can be deployed on resource-constrained devices, e.g., smartphones, remains challenging. To this end, we propose a cascaded temporal updating network (CTUN) for efficient VSR. We first develop an implicit cascaded alignment module to explore spatio-temporal correspondences between adjacent frames. Moreover, we propose a unidirectional propagation updating network to efficiently explore long-range temporal information, which is crucial for high-quality video reconstruction. Specifically, we develop a simple yet effective hidden updater that leverages future information to update hidden features during forward propagation, significantly reducing inference time while maintaining performance. Finally, we formulate all of these components into an end-to-end trainable VSR network. Extensive experimental results show that our CTUN achieves a favorable trade-off between efficiency and performance compared to existing methods. Notably, compared with BasicVSR, our method obtains better results while using only about 30% of the parameters and running time. The source code and pre-trained models will be available at https://github.com/House-Leo/CTUN.

Cascaded Temporal Updating Network for Efficient Video Super-Resolution
