Abstract:Recent progress in single-image super-resolution (SISR) has achieved remarkable performance, yet the computational costs of these methods remain a challenge for deployment on resource-constrained devices. In particular, transformer-based methods, which leverage self-attention mechanisms, have led to significant breakthroughs but also introduce substantial computational costs. To tackle this issue, we introduce the Convolutional Transformer layer (ConvFormer) and propose a ConvFormer-based Super-Resolution network (CFSR), offering an effective and efficient solution for lightweight image super-resolution. The proposed method inherits the advantages of both convolution-based and transformer-based approaches. Specifically, CFSR utilizes large kernel convolutions as a feature mixer to replace the self-attention module, efficiently modeling long-range dependencies and extensive receptive fields with minimal computational overhead. Furthermore, we propose an edge-preserving feed-forward network (EFN) designed to achieve local feature aggregation while effectively preserving high-frequency information. Extensive experiments demonstrate that CFSR strikes an optimal balance between computational cost and performance compared to existing lightweight SR methods. When benchmarked against state-of-the-art methods such as ShuffleMixer, the proposed CFSR achieves a gain of 0.39 dB on the Urban100 dataset for the x2 super-resolution task while requiring 26\% and 31\% fewer parameters and FLOPs, respectively. The code and pre-trained models are available at <a class="link-external link-https" href="https://github.com/Aitical/CFSR" rel="external noopener nofollow">this https URL</a>.

Lightweight image super-resolution network based on extended convolution mixer

Lightweight Image Super-Resolution Network Using 3D Convolutional Neural Networks

Lightweight Multi-Attention Fusion Network for Image Super-Resolution

Epistemic-Uncertainty-Based Divide-and-Conquer Network for Single-Image Super-Resolution

MixerSR: A New Feature Extraction Paradigm for Single Image Super-Resolution

A very lightweight and efficient image super-resolution network

Transforming Image Super-Resolution: A ConvFormer-based Efficient Approach

ESKN: Enhanced Selective Kernel Network for Single Image Super-Resolution

Multi-scale strip-shaped convolution attention network for lightweight image super-resolution

ShuffleMixer: An Efficient ConvNet for Image Super-Resolution

CAMixerSR: Only Details Need More "Attention"

Single-image Super-Resolution Via Selective Multi-Scale Network

A Lightweight Multi-Scale Channel Attention Network for Image Super-Resolution.

Lightweight image super-resolution based on stepwise feedback mechanism and multi-feature maps fusion

Spatial and Channel Aggregation Network for Lightweight Image Super-Resolution

Incorporating Transformer Designs into Convolutions for Lightweight Image Super-Resolution

Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution

Single Image Super Resolution based on a Modified U-net with Mixed Gradient Loss

Lightweight single-image super-resolution via multi-scale feature fusion CNN and multiple attention block

Lightweight Image Super-resolution with Local Attention Enhancement