Multi-patch de-raindrop Transformer for UAV images

Yufeng Li,Qianhui Zhou,Chuanlong Xie,Shuang Wu
DOI: https://doi.org/10.1007/s11760-024-03586-3
IF: 1.583
2024-12-06
Signal Image and Video Processing
Abstract:Due to the intricacies inherent in aerial photography environments during precipitation, rain droplets randomly adhere to the lens, significantly degrading image quality. Therefore, this paper proposes a de-raindrop method for UAV aerial images based on a fusion network of multi-patch and frequency Transformer. The architecture of the proposed network comprises a three-stage image restoration network that utilizes a multi-patch segmentation strategy to optimize image patches of different sizes and positions. The proposed method leverages the strengths of Transformer algorithms by introducing a Frequency Attention Transformer Block (FATB). This block introduces a frequency attention mechanism that effectively decouples high-frequency and low-frequency components within the self-attention layers. By concurrently concentrating on both local and global information within the image, FATB achieves high-quality image reconstruction. Furthermore, to augment feature fusion during image restoration, we introduce an Adaptive Feature Enhancement Module (AFEM). This module enhances the representational capacity of features across different stages, thereby further boosting the quality of image restoration. Experimental results illustrate that the proposed method outperforms the state-of-the-art algorithms in raindrop removal, achieving an improvement of 0.41dB over the best existing algorithm while demonstrating superior efficiency. Additionally, the method exhibits strong performance across other public benchmark raindrop removal datasets, indicating its broad applicability. In summary, this research not only advances the field of UAV image de-raindrop but also provides clearer and more reliable images for subsequent visual tasks.
engineering, electrical & electronic,imaging science & photographic technology
What problem does this paper attempt to address?