Abstract:Given the impressive achievement of convolutional neural networks (CNNs) in grasping image priors from extensive datasets, they have been widely utilized for tasks related to image restoration. Recently, there is been significant progress in another category of neural architectures—Transformers. These models have demonstrated remarkable performance in natural language tasks and higher‐level vision applications. Despite their ability to address some of CNNs limitations, such as restricted receptive fields and adaptability issues, Transformer models often face difficulties when processing images with a high level of detail. This is because the complexity of the computations required increases significantly with the image's spatial resolution. As a result, their application to most high‐resolution image restoration tasks becomes impractical. In our research, we introduce a novel Transformer model, named DehFormer, by implementing specific design modifications in its fundamental components, for example, the multi‐head attention and feed‐forward network. Specifically, the proposed architecture consists of the three modules, that is, (a) multi‐scale feature aggregation network (MSFAN), (b) the gated‐Dconv feed‐forward network (GFFN), (c) and the multi‐Dconv head transposed attention (MDHTA). For the MDHTA module, our objective is to scrutinize the mechanics of scaled dot‐product attention through the utilization of per‐element product operations, thereby bypassing the need for matrix multiplications and operating directly in the frequency domain for enhanced efficiency. For the GFFN module, which enables only the relevant and valuable information to advance through the network hierarchy, thereby enhancing the efficiency of information flow within the model. Extensive experiments are conducted on the SateHazelk, RS‐Haze, and RSID datasets, resulting in performance that significantly exceeds that of existing methods.

U²-Former: Nested U-Shaped Transformer for Image Restoration Via Multi-View Contrastive Learning

U2-Former: A Nested U-shaped Transformer for Image Restoration

Uformer: A General U-Shaped Transformer for Image Restoration

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.

Dual-former: Hybrid Self-attention Transformer for Efficient Image Restoration

Cascaded Transformer U-net for Image Restoration

Joint multi-dimensional dynamic attention and transformer for general image restoration

Comprehensive and Delicate: an Efficient Transformer for Image Restoration

iiTransformer: A Unified Approach to Exploiting Local and Non-local Information for Image Restoration

Segmentation Guided Sparse Transformer for Under-Display Camera Image Restoration

Look-Around Before You Leap: High-Frequency Injected Transformer for Image Restoration

Correlation Matching Transformation Transformers for UHD Image Restoration

Unformer: A Transformer-Based Approach for Adaptive Multi-Scale Feature Aggregation in Underwater Image Enhancement

Decomformer: Decompose Self-Attention of Transformer for Efficient Image Restoration

An efficient multi‐scale transformer for satellite image dehazing

An Illumination-Guided Dual Attention Vision Transformer for Low-Light Image Enhancement

Convolution-Enhanced Transformer with Frequency Domain Contrastive Learning for Image Deraining

Convformer: Dual-Stream Vision Transformers and Convolutional Networks for Image Restoration

Progressive Convolutional Transformer for Image Restoration

Low-light image enhancement using transformer with color fusion and channel attention

Adapt or Perish: Adaptive Sparse Transformer with Attentive Feature Refinement for Image Restoration