Abstract:Remote sensing images usually contain abundant targets and complex information distributions. Consequently, networks are required to model both global and local information in the super-resolution (SR) reconstruction of remote sensing images. The existing SR reconstruction algorithms generally focus on only local or global features, neglecting effective feedback for reconstruction errors. Therefore, a Global Residual Multi-attention Fusion Back-projection Network (SRBPSwin) is introduced by combining the back-projection mechanism with the Swin Transformer. We incorporate a concatenated Channel and Spatial Attention Block (CSAB) into the Swin Transformer Block (STB) to design a Multi-attention Hybrid Swin Transformer Block (MAHSTB). SRBPSwin develops dense back-projection units to provide bidirectional feedback for reconstruction errors, enhancing the network's feature extraction capabilities and improving reconstruction performance. SRBPSwin consists of the following four main stages: shallow feature extraction, shallow feature refinement, dense back projection, and image reconstruction. Firstly, for the input low-resolution (LR) image, shallow features are extracted and refined through the shallow feature extraction and shallow feature refinement stages. Secondly, multiple up-projection and down-projection units are designed to alternately process features between high-resolution (HR) and LR spaces, obtaining more accurate and detailed feature representations. Finally, global residual connections are utilized to transfer shallow features during the image reconstruction stage. We propose a perceptual loss function based on the Swin Transformer to enhance the detail of the reconstructed image. Extensive experiments demonstrate the significant reconstruction advantages of SRBPSwin in quantitative evaluation and visual quality.

CSwT-SR: Conv-Swin Transformer for Blind Remote Sensing Image Super-Resolution with Amplitude-Phase Learning and Structural Detail Alternating Learning

SRBPSwin: Single-Image Super-Resolution for Remote Sensing Images Using a Global Residual Multi-Attention Hybrid Back-Projection Network Based on the Swin Transformer

Dual Self-Attention Swin Transformer for Hyperspectral Image Super-Resolution

An Efficient Hybrid CNN-Transformer Approach for Remote Sensing Super-Resolution

MSWAGAN: Multispectral Remote Sensing Image Super-Resolution Based on Multiscale Window Attention Transformer

Residual SwinV2 transformer coordinate attention network for image super resolution

HiT-SR: Hierarchical Transformer for Efficient Image Super-Resolution

Single Remote Sensing Image Super-Resolution Via a Generative Adversarial Network with Stratified Dense Sampling and Chain Training

Efficient Swin Transformer for Remote Sensing Image Super-Resolution

Blind Super-Resolution for Single Remote Sensing Image via Sparse Representation and Transformed Self-Similarity

Two-Stage Spatial-Frequency Joint Learning for Large-Factor Remote Sensing Image Super-Resolution

Cross-Spatial Pixel Integration and Cross-Stage Feature Fusion Based Transformer Network for Remote Sensing Image Super-Resolution

Enhanced Window-Based Self-Attention with Global and Multi-Scale Representations for Remote Sensing Image Super-Resolution

Hybrid-Scale Hierarchical Transformer for Remote Sensing Image Super-Resolution

Super-resolution Method based on CS and Structural Self-similarity for Remote Sensing Images

A Swin Transformer-Based Fusion Approach for Hyperspectral Image Super-Resolution

Robust Remote Sensing Super-Resolution With Frequency Domain Decoupling for Multiscenarios

Remote Sensing Image Super-Resolution Using Enriched Spatial-Channel Feature Aggregation Networks

Various Degradation: Dual Cross-Refinement Transformer for Blind Sonar Image Super-Resolution

A Super-Resolution Algorithm Based on Hybrid Network for Multi-Channel Remote Sensing Images

Scale-Aware Backprojection Transformer for Single Remote Sensing Image Super-Resolution