SCAFNet: Semantic-Guided Cascade Adaptive Fusion Network for Infrared Small Targets Detection

Shizhou Zhang, Zhang Wang, Yinghui Xing, Liangkui Lin, Xiaoting Su, Yanning Zhang
DOI: https://doi.org/10.1109/tgrs.2024.3492256
IF: 8.2
2024-01-01
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Infrared small target detection is a crucial component of infrared target tracking and search. It is challenging because of complex backgrounds, low contrast between targets and backgrounds, and the small, dim nature of the targets. Effectively representing the targets and enhancing the distinction between targets and background is therefore essential. Existing deep-learning-based methods struggle to capture the subtle details of weak targets and neglect the complementary characteristics of multi-level features, which leads to inaccurate target localization. In this paper, we propose a Semantic-Guided Cascade Adaptive Fusion Network (SCAFNet) to address these challenges. To improve the representation of small targets in the deeper layers, we introduce a Multi-resolution Auxiliary Enhancement (MAE) encoder that progressively enhances detailed information within the deep features. After multi-scale features are extracted, an adaptive fusion (AdaFus) decoder fuses them. The decoder contains a Semantic-Guided Cascade Fusion (SGCF) module that integrates feature maps at three different resolutions. Specifically, SGCF first uses the rich semantic features of the high-level feature map to guide the spatial distribution of the low-level feature maps, thereby improving the distinction between target and background. Adaptive fusion weights are then generated to guide the fusion process, ensuring that the final feature map combines rich semantic information with precise spatial details. Furthermore, we perform long-distance modeling on the feature map to achieve detailed reconstruction, which helps restore the shape information of the target. The effectiveness of our method is validated through experiments on various public infrared small target detection datasets.
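The two SGCF steps described in the abstract (semantic guidance, then adaptively weighted fusion of three resolutions) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract gives no layer details, so the nearest-neighbour upsampling, the sigmoid gate built from a channel mean, the per-pixel softmax weights, and all function names here are assumptions chosen only to make the idea concrete.

```python
import numpy as np

def upsample_nearest(x, factor):
    """Nearest-neighbour upsampling of a (C, H, W) map by an integer factor."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def semantic_guided_fusion(low, mid, high):
    """Hypothetical stand-in for SGCF-style fusion of three feature maps.

    low:  (C, H, W)      highest resolution, fine spatial detail
    mid:  (C, H/2, W/2)
    high: (C, H/4, W/4)  lowest resolution, richest semantics
    Returns the fused (C, H, W) map and the (3, 1, H, W) fusion weights.
    """
    h = low.shape[1]
    mid_up = upsample_nearest(mid, h // mid.shape[1])
    high_up = upsample_nearest(high, h // high.shape[1])

    # Step 1 (assumed form): a spatial gate derived from the high-level map
    # re-weights the lower-level maps, emphasising likely target regions.
    gate = sigmoid(high_up.mean(axis=0, keepdims=True))  # (1, H, W)
    low_g = low * gate
    mid_g = mid_up * gate

    # Step 2 (assumed form): per-pixel softmax over the three branches
    # yields fusion weights that sum to 1 at every location.
    stack = np.stack([low_g, mid_g, high_up])            # (3, C, H, W)
    scores = stack.mean(axis=1, keepdims=True)           # (3, 1, H, W)
    scores = scores - scores.max(axis=0, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=0, keepdims=True)

    fused = (weights * stack).sum(axis=0)                # (C, H, W)
    return fused, weights
```

In a trained network the gate and the fusion weights would come from learned layers rather than fixed channel means; the sketch only shows how data flows between the three resolutions.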