Abstract:In the realm of image deraining, traditional CNN-based deep learning deraining systems exhibit efficient expression of local features and strong generalization capabilities. However, their limited local receptive fields and independence from input content hinder their ability to model global features, rendering them less effective in mitigating complex and dynamic long rain streak scenarios. On the other hand, deraining systems based on the Transformer architecture possess robust global feature aggregation capabilities. Yet, their computational complexity increases quadratically with the expansion of the image spatial scale, making them less suitable for high-quality image deraining tasks. However, for ill-posed problems like image deraining, the precise representation of both local and global features has become increasingly pivotal to addressing the multifaceted challenges of rain streak removal. Consequently, we introduce an innovative solution, the CNN and Gated Multi Axial-Sparse Transformer Feature Fusion Network, referred to as CGMAformer. This approach optimizes both architectural paradigms jointly, effectively harnessing their respective strengths for image deraining. Specifically, in the local feature extraction phase based on CNN, we employ the Degradation-aware Mixture of Experts Feature Compensator (DMEFC) for adaptive representation of local spatial rain streak features. In the global feature extraction phase based on the Transformer, we introduce a dual-branch adaptive Gated Multi-Axis Sparse Transformer (GAST) attention mechanism to complement global background spatial features in rainy images. This approach ensures the preservation of global feature integrity while effectively reducing model complexity. Ultimately, through a feature fusion network, we fully exploit the local characteristics of CNN and the self-attention-based global aggregation capabilities of the Transformer for efficient image deraining.

Multi-Scale Dilated Convolution Transformer for Single Image Deraining

Dual-Path Multi-Scale Transformer for High-Quality Image Deraining

CTFCD: Channel Transformer Based on Full Convolutional Decoder for Single Image Deraining

A Hybrid Transformer-Mamba Network for Single Image Deraining

Hybrid CNN-Transformer Feature Fusion for Single Image Deraining

Dbswin: Transformer Based Dual Branch Network for Single Image Deraining

Multi-Scale Fusion and Decomposition Network for Single Image Deraining

DR-DiT: Image Deraining Using Diffusion Model with Transformer

Combined with Pyramid Split Attention and Multi-Scale Feature Learning Network for Single Image Deraining

Combining multiscale learning and attention mechanism densely connected network for single image deraining

CGMAformer: CNN and gated multi axial-sparse transformer feature fusion network for image deraining

Gabor-guided transformer for single image deraining

Poxvirus protein N1L targets the I-kappaB kinase complex, inhibits signaling to NF-kappaB by the tumor necrosis factor superfamily of receptors, and inhibits NF-kappaB and IRF3 signaling by toll-like receptors.

Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining

Learning A Sparse Transformer Network for Effective Image Deraining

Global–local transformer for single-image rain removal

Multi-Scale Hybrid Fusion Network for Single Image Deraining

Progressive network based on detail scaling and texture extraction: A more general framework for image deraining

Exploiting Regional Information Transformer for Single Image Deraining

Frequency domain-enhanced transformer for single image deraining

An Efficient Dehazing Algorithm Based on the Fusion of Transformer and Convolutional Neural Network.