Abstract:In remote sensing image fusion, the conventional Convolutional Neural Networks (CNNs) extract local features of the image through layered convolution, which is limited by the receptive field and struggles to capture global features. Transformer utilizes self-attention to capture long-distance dependencies in images, which has a global receptive field, but the computational cost for high-resolution images is excessively high. In response to the above issues, this paper draws inspiration from the FusionNet network, harnessing the local detail acquisition capability of CNNs and the global data procuring capacity of Transformer. It presents a novel method for remote sensing image sharpening named Guided Filtering-Cross Stage Partial Network-Transformer, abbreviated as GF-CSTNet. This solution unifies the strengths of Guided Filtering (GF), Cross Stage Partial Network (CSPNet), and Transformer. Firstly, this method utilizes GF to enhance the acquired remote sensing image data. The CSPNet and Transformer structures are then combined to further enhance fusion performance by leveraging their respective advantages. Subsequently, a Rep-Conv2Former method is designed to streamline attention and extract diverse receptive field features through a multi-scale convolution modulator block. Simultaneously, a reparameterization module is constructed to integrate the multiple branches generated during training into a unified branch during inference, thereby optimizing the model's inference speed. Finally, a residual learning module incorporating attention has been devised to augment the modeling and feature extraction capabilities of images. Experimental results obtained from the GaoFen-2 and WorldView-3 datasets demonstrate the effectiveness of the proposed GF-CSTNet approach. It effectively extracts detailed information from images while avoiding the problem of spectral distortion.

An efficient parallel fusion structure of distilled and transformer-enhanced modules for lightweight image super-resolution

Multi-Scale Cross-Attention Fusion Network Based on Image Super-Resolution

Lightweight Multi-Attention Fusion Network for Image Super-Resolution

Efficient multi-branch dynamic fusion network for super-resolution of industrial component image

Efficient Adaptive Feature Fusion Network for Remote-Sensing Image Super-Resolution

Multi-Modal Image Fusion Via Deep Laplacian Pyramid Hybrid Network

Attention-guided hybrid transformer-convolutional neural network for underwater image super-resolution

A Multi-Attention Feature Distillation Neural Network for Lightweight Single Image Super-Resolution

Image Super-resolution via Efficient Transformer Embedding Frequency Decomposition with Restart

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

A Lightweight Pyramid Feature Fusion Network for Single Image Super-Resolution Reconstruction

DTCNet: Transformer-CNN Distillation for Super-Resolution of Remote Sensing Image

Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution

A novel pansharpening method based on cross stage partial network and transformer

Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution

Pyramid Fusion Attention Network for Single Image Super-Resolution

Fusformer: A Transformer-based Fusion Approach for Hyperspectral Image Super-resolution

Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution

Deeply Recursive Low- and High-Frequency Fusing Networks for Single Image Super-Resolution

Dynamic feature distillation and pyramid split large kernel attention network for lightweight image super-resolution