Abstract: Single image super-resolution has made great strides with the development of deep learning. However, most existing studies focus on building more complex neural networks with a massive number of layers, which brings heavy computational cost and memory consumption. Recently, as the Transformer has yielded brilliant results in NLP tasks, more and more researchers have started to explore its application in computer vision tasks. However, because of the heavy computational cost and high GPU memory occupation of vision Transformers, such networks cannot be designed too deep. To address this problem, we propose a novel Efficient Super-Resolution Transformer (ESRT) for fast and accurate image super-resolution. ESRT is a hybrid Transformer in which a CNN-based SR network is placed at the front to extract deep features. Specifically, ESRT consists of two backbones: a lightweight CNN backbone (LCB) and a lightweight Transformer backbone (LTB). The LCB is a lightweight SR network that extracts deep SR features at a low computational cost by dynamically adjusting the size of the feature map. The LTB is made up of an efficient Transformer (ET) with small GPU memory occupation, which benefits from the novel efficient multi-head attention (EMHA). In EMHA, a feature split module (FSM) is proposed to split the long sequence into sub-segments, and the attention operation is then applied to these sub-segments. This module significantly decreases the GPU memory occupation. Extensive experiments show that ESRT achieves competitive results. Compared with the original Transformer, which occupies 16057M of GPU memory, the proposed ET occupies only 4191M of GPU memory with better performance.
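
The feature split idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes that queries, keys, and values are split into equal, non-overlapping segments along the token dimension, that plain scaled dot-product attention is applied within each segment, and that the segment outputs are concatenated. The names `attention`, `split_attention`, and `score_entries` are hypothetical, and the entry count covers only the attention score matrices, so it does not model the GPU memory figures reported above.

```python
import math


def attention(q, k, v):
    """Plain scaled dot-product attention on lists of token vectors."""
    d = len(q[0])
    out = []
    for qi in q:
        # Similarity of this query to every key, scaled by sqrt(d).
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        # Numerically stable softmax over the scores.
        m = max(scores)
        weights = [math.exp(s - m) for s in scores]
        total = sum(weights)
        # Weighted average of the value vectors.
        out.append([
            sum(w * vj[c] for w, vj in zip(weights, v)) / total
            for c in range(len(v[0]))
        ])
    return out


def split_attention(q, k, v, num_segments):
    """Attend within each of num_segments equal chunks, then concatenate.

    Assumption: segments are contiguous and do not attend to each other.
    """
    n = len(q)
    if n % num_segments != 0:
        raise ValueError("sequence length must be divisible by num_segments")
    step = n // num_segments
    out = []
    for start in range(0, n, step):
        end = start + step
        out.extend(attention(q[start:end], k[start:end], v[start:end]))
    return out


def score_entries(n, num_segments=1):
    """Total number of attention-score entries across all segments."""
    return num_segments * (n // num_segments) ** 2
```

Under these assumptions, a sequence of N tokens split into s segments needs s score matrices of size (N/s) x (N/s), that is N^2/s entries instead of N^2. For example, `score_entries(1024, 4)` is a quarter of `score_entries(1024)`.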

RSHAN: Image Super-Resolution Network Based on Residual Separation Hybrid Attention Module

Parallel-Connected Residual Channel Attention Network for Remote Sensing Image Super-Resolution

A Residual Network with Efficient Transformer for Lightweight Image Super-Resolution

An Efficient Hybrid CNN-Transformer Approach for Remote Sensing Super-Resolution

Super-Resolution Algorithm Based on Transformer+CNN

Efficient Transformer for Single Image Super-Resolution

Remote Sensing Image Super-Resolution via Residual-Dense Hybrid Attention Network

HMANet: Hybrid Multi-Axis Aggregation Network for Image Super-Resolution

Image Super-Resolution Using Very Deep Residual Channel Attention Networks

Remote Sensing Image Super-Resolution Using Enriched Spatial-Channel Feature Aggregation Networks

RISK ASSESSMENT OF OIL SPILL ACCIDENTS

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

Hybrid-Scale Hierarchical Transformer for Remote Sensing Image Super-Resolution

Attention-guided hybrid transformer-convolutional neural network for underwater image super-resolution

Efficient Adaptive Feature Fusion Network for Remote-Sensing Image Super-Resolution

PCCFormer: Parallel coupled convolutional transformer for image super-resolution

Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based Transformer Network for Remote Sensing Image Super-Resolution

Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation

Transformer-based image super-resolution and its lightweight

Hybrid Residual Attention Network for Single Image Super Resolution