DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing

Abdul Wasi, O. Jeba Shiney

2023-12-16

Abstract: Images acquired in hazy conditions have degradations induced in them. Dehazing such images is a vexed and ill-posed problem. Scores of prior-based and learning-based approaches have been proposed to mitigate the effect of haze and generate haze-free images. Many conventional methods are constrained by their lack of awareness regarding scene depth and their incapacity to capture long-range dependencies. In this paper, a method that uses residual learning and vision transformers in an attention module is proposed. It essentially comprises two networks: in the first one, the network takes the ratio of a hazy image and the approximated transmission matrix to estimate a residual map. The second network takes this residual image as input and passes it through convolution layers before superposing it on the generated feature maps. It is then passed through global context and depth-aware transformer encoders to obtain channel attention. The attention module then infers the spatial attention map before generating the final haze-free image. Experimental results, including several quantitative metrics, demonstrate the efficiency and scalability of the suggested methodology.

Computer Vision and Pattern Recognition, Image and Video Processing

What problem does this paper attempt to address?

The paper addresses image degradation in hazy conditions, where light absorption and scattering by suspended particles reduce brightness, distort contrast, and shift hue and saturation. Dehazing (removing haze from an image) is a complex, ill-posed problem, and many traditional methods are limited by their lack of scene-depth awareness and their inability to capture long-range dependencies. The paper proposes an attention module that combines residual learning with Vision Transformers. The method comprises two networks. The first takes the ratio of the hazy image to the approximated transmission matrix as input and estimates a residual map. The second takes this residual map as input, passes it through convolutional layers, superposes it on the generated feature maps, and then obtains channel attention through global-context and depth-aware transformer encoders. Finally, the attention module infers the spatial attention map and generates the final dehazed image. According to the authors, experimental results on several quantitative metrics demonstrate the efficiency and scalability of the method.
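To make the first network's input concrete, here is a minimal NumPy sketch of the ratio "hazy image / transmission". It assumes the standard atmospheric scattering model I = J*t + A*(1 - t), which this summary does not state explicitly. The function names (`synthesize_haze`, `ratio_input`), the clipping floor `t_min`, and the scalar airlight value are illustrative assumptions, not details taken from the paper. Under that model the ratio equals the clear image plus a haze-dependent term, which is one way to see why a residual map is a natural estimation target.

```python
import numpy as np

def synthesize_haze(clear, transmission, airlight):
    """Apply the assumed scattering model I = J*t + A*(1 - t)."""
    t = transmission[..., None]  # broadcast the per-pixel map over colour channels
    return clear * t + airlight * (1.0 - t)

def ratio_input(hazy, transmission, t_min=0.1):
    """Divide the hazy image by the transmission estimate.

    t_min is a hypothetical floor that avoids division by near-zero values.
    """
    t = np.clip(transmission, t_min, 1.0)[..., None]
    return hazy / t

# Toy data: a random 4x4 RGB "clear" image and a random transmission map.
rng = np.random.default_rng(0)
clear = rng.uniform(0.0, 1.0, size=(4, 4, 3))
transmission = rng.uniform(0.3, 0.9, size=(4, 4))
airlight = 0.8  # assumed global atmospheric light (scalar for simplicity)

hazy = synthesize_haze(clear, transmission, airlight)
ratio = ratio_input(hazy, transmission)

# Under the assumed model, ratio = J + A*(1 - t)/t, so the residual
# (ratio minus clear image) depends only on the haze terms A and t.
residual = ratio - clear
expected = airlight * (1.0 - transmission) / transmission
```

In this toy setting the transmission map is known exactly; in the method described above it is only approximated, and the residual is estimated by a network rather than computed in closed form.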

Vision Transformers for Single Image Dehazing

Visual transformer with stable prior and patch-level attention for single image dehazing

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.

Haze-Aware Attention Network for Single-Image Dehazing

DEHRFormer: Real-time Transformer for Depth Estimation and Haze Removal from Varicolored Haze Scenes

Contrastive Multiscale Transformer for Image Dehazing

Residual Spatial and Channel Attention Networks for Single Image Dehazing

TransDehaze: transformer-enhanced texture attention for end-to-end single image dehaze

A new end-to-end image dehazing algorithm based on residual attention mechanism

An Enhancement in Single-Image Dehazing Employing Contrastive Attention over Variational Auto-Encoder (CA-VAE) Method

CTHD-Net: CNN-Transformer hybrid dehazing network via residual global attention and gated boosting strategy

Adaptive feature fusion network based on boosted attention mechanism for single image dehazing

Haze Relevant Feature Attention Network for Single Image Dehazing

Adaptive haze pixel intensity perception transformer structure for image dehazing networks

Lightweight single image dehazing network with residual feature attention

TransRA: transformer and residual attention fusion for single remote sensing image dehazing

Towards Domain Invariant Single Image Dehazing

DehazeDCT: Towards Effective Non-Homogeneous Dehazing via Deformable Convolutional Transformer

USCFormer: Unified Transformer With Semantically Contrastive Learning for Image Dehazing

Transformer-based progressive residual network for single image dehazing