CSAN-UNet: Channel Spatial Attention Nested UNet for Infrared Small Target Detection

Yuhan Zhong,Zhiguang Shi,Yan Zhang,Yong Zhang,Hanyu Li
DOI: https://doi.org/10.3390/rs16111894
IF: 5
2024-05-25
Remote Sensing
Abstract:Segmenting small infrared targets presents a significant challenge for traditional image processing architectures due to the inherent lack of texture, minimal shape information, and their sparse pixel representation within images. The conventional UNet architecture, while proficient in general segmentation tasks, inadequately addresses the nuances of small infrared target segmentation due to its reliance on downsampling operations, such as pooling, which often results in the loss of critical target information. This paper introduces the Channel Spatial Attention Nested UNet (CSAN-UNet), an innovative architecture designed specifically to enhance the detection and segmentation of small infrared targets. Central to CSAN-UNet's design is the Cascaded Channel and Spatial Convolutional Attention Module (CSCAM), a novel component that adaptively enhances multi-level features and mitigates the loss of target information attributable to downsampling processes. Additionally, the Channel-priority and Spatial Attention Cascade Module (CPSAM) represents another pivotal advancement within CSAN-UNet, prioritizing channel-level adjustments alongside spatial attention mechanisms to efficiently extract deep semantic information pertinent to small infrared targets. Empirical validation conducted on two public datasets confirms that CSAN-UNet surpasses existing state-of-the-art algorithms in segmentation performance, while simultaneously reducing computational overhead.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper primarily addresses the challenges in infrared small target detection, particularly the limitations of traditional image processing architectures when dealing with such targets, such as lack of texture, insufficient shape information, and sparse pixel representation. The authors propose a new network structure—Channel and Spatial Attention Nested UNet (CSAN-UNet), aimed at improving the detection and segmentation performance of small infrared targets. Specifically, the study addresses the following key issues: 1. **Improving Detection Accuracy**: Traditional UNet architectures have deficiencies when handling small infrared targets because they rely on downsampling operations (e.g., pooling), which often lead to the loss of important target information. Therefore, the paper proposes CSAN-UNet, which enhances multi-level features and mitigates the loss of target information due to downsampling by introducing a Cascade Channel and Spatial Convolutional Attention Module (CSCAM). 2. **Optimizing Feature Extraction**: To efficiently extract key semantic information related to small infrared targets, the paper also develops a lightweight Channel-Priority and Spatial Attention Cascade Module (CPSAM). This module can effectively map spatial relationships at a low computational cost, ensuring precise extraction of target information. 3. **Balancing Performance and Efficiency**: By combining the advantages of CSCAM and CPSAM, CSAN-UNet can effectively retain detailed target features while maintaining computational resource efficiency. This addresses the issue in existing methods where either the computational demand is too high or small object features may be lost. 4. **Empirical Validation**: The authors demonstrate through empirical validation on two public datasets that CSAN-UNet has significant advantages in segmentation performance compared to existing state-of-the-art algorithms, while also reducing computational overhead. In summary, the main contribution of this study is the proposal of an innovative network structure aimed at improving the accuracy of infrared small target detection while considering computational efficiency. Through careful design of the network structure, especially the introduction of CSCAM and CPSAM, CSAN-UNet has made significant progress in addressing the complex problem of infrared small target detection.