DSDANet: Infrared Dim Small Target Detection Via Attention Enhanced Feature Fusion Network

Fei Chen,Hao Wang,Yuan Zhou,Tingting Ye,Zunlin Fan
DOI: https://doi.org/10.1007/978-981-97-5594-3_19
2024-01-01
Abstract:Single-frame infrared small target (SIRST) detection is a challenging problem, especially in complex environments. Multi-level feature fusion schemes are widely studied in the literature, however, high false alarm and missed detection problems remain unsolved. To handle this problem, we propose an attention-enhanced feature fusion network to integrate features from different layers. This framework employs a deep convolutional neural network to extract low-level texture features and high-level semantic features. To explore the relationships between low-level and high-level features, we propose two feature fusion modules that are based on two different cross-attention mechanisms (CAM-I and CAM-II). The self-attention and CAM-I are utilized to capture long-range and global dependencies among the feature positions, enhancing the contextual feature representations of the target. CAM-II is used to further refine the features in both the local and global manner. Finally, a segmentation head is employed to classify each pixel in the feature maps generated by the feature fusion modules. To demonstrate the effectiveness of the proposed approach, the experiments are conducted on four infrared small target datasets, including SIRST, SIRST-AUG, MDFA, and NUDT-SIRST. Our proposed framework achieves state-of-the-art performance when compared to existing methods, with an F1 score of 0.8514 on the SIRST dataset, 0.8482 on the SIRST-AUG dataset, 0.6883 on the MDFA dataset, and 0.9462 on NUDT-SIRST dataset, respectively. Specifically, it outperforms the representative methods AGPCNet and LW-IRST by 3.58% and 3.44% on the MDFA dataset.
What problem does this paper attempt to address?