Abstract:Given the impressive achievement of convolutional neural networks (CNNs) in grasping image priors from extensive datasets, they have been widely utilized for tasks related to image restoration. Recently, there is been significant progress in another category of neural architectures—Transformers. These models have demonstrated remarkable performance in natural language tasks and higher‐level vision applications. Despite their ability to address some of CNNs limitations, such as restricted receptive fields and adaptability issues, Transformer models often face difficulties when processing images with a high level of detail. This is because the complexity of the computations required increases significantly with the image's spatial resolution. As a result, their application to most high‐resolution image restoration tasks becomes impractical. In our research, we introduce a novel Transformer model, named DehFormer, by implementing specific design modifications in its fundamental components, for example, the multi‐head attention and feed‐forward network. Specifically, the proposed architecture consists of the three modules, that is, (a) multi‐scale feature aggregation network (MSFAN), (b) the gated‐Dconv feed‐forward network (GFFN), (c) and the multi‐Dconv head transposed attention (MDHTA). For the MDHTA module, our objective is to scrutinize the mechanics of scaled dot‐product attention through the utilization of per‐element product operations, thereby bypassing the need for matrix multiplications and operating directly in the frequency domain for enhanced efficiency. For the GFFN module, which enables only the relevant and valuable information to advance through the network hierarchy, thereby enhancing the efficiency of information flow within the model. Extensive experiments are conducted on the SateHazelk, RS‐Haze, and RSID datasets, resulting in performance that significantly exceeds that of existing methods.

Visual transformer with stable prior and patch-level attention for single image dehazing

Vision Transformers for Single Image Dehazing

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.

TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze

Transformer-based progressive residual network for single image dehazing

DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing

Parallel Cross Strip Attention Network for Single Image Dehazing

CTHD-Net: CNN-Transformer hybrid dehazing network via residual global attention and gated boosting strategy

Contrastive Multiscale Transformer for Image Dehazing

DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer

TransRA: transformer and residual attention fusion for single remote sensing image dehazing

Adaptive haze pixel intensity perception transformer structure for image dehazing networks

An efficient multi‐scale transformer for satellite image dehazing

Towards Domain Invariant Single Image Dehazing

Haze-Aware Attention Network for Single-Image Dehazing

Complementary Feature Enhanced Network with Vision Transformer for Image Dehazing

Adaptive feature fusion network based on boosted attention mechanism for single image dehazing

Image Deblurring by Exploring In-Depth Properties of Transformer

Transformer-Driven Inverse Problem Transform for Fast Blind Hyperspectral Image Dehazing

MB-TaylorFormer: Multi-branch Efficient Transformer Expanded by Taylor Formula for Image Dehazing

Hierarchical Patch Aggregation Transformer for Motion Deblurring