Dim2Clear Network for Infrared Small Target Detection

Xinbo Gao, Jing Zhang, Mingjin Zhang, Jie Guo, Yunsong Li, Rui Zhang
DOI: https://doi.org/10.1109/TGRS.2023.3263848
IF: 8.2
IEEE Transactions on Geoscience and Remote Sensing
Abstract: Infrared small target detection (IRSTD) is important for many practical applications such as hazardous aircraft warning, especially when the target cannot be seen in visible-light images because of atmospheric conditions such as fog and cloud. However, IRSTD is challenging because the images are noisy and the targets are small and dim. To address this challenge, we propose a novel Dim2Clear network (Dim2Clear) for IRSTD in this article. Specifically, Dim2Clear consists of a U-Net backbone encoder, a context mixer decoder (CMD) based on spatial and frequency attention (SFA), and an eyeball-shaped enhancement module (EEM). The CMD is composed of cascaded regular residual blocks into which two SFA modules are inserted. Each SFA module receives features from different residual blocks and generates a spatial attention map from them to modulate the low-level features, which are then decomposed into low and high frequencies using the discrete cosine transform. The features are then further modulated according to the generated frequency attention maps. In this way, SFA extracts both spatial context and frequency context to improve the feature representation capacity. In addition, we design an EEM to suppress noise and enhance the signal-to-noise ratio (SNR) in the segmentation results from the perspective of image super-resolution. Experiments on the SIRST dataset and our newly constructed IRSTD-1k dataset show that the proposed Dim2Clear outperforms the state-of-the-art (SOTA) methods.
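The frequency step described in the abstract (decompose features into low and high frequencies with the discrete cosine transform, then reweight them) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `cutoff` rule for separating low from high frequencies, and the fixed scalar weights `w_low` and `w_high` are all assumptions made for illustration, whereas the paper uses generated frequency attention maps.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II basis matrix of size n x n."""
    k = np.arange(n)[:, None]   # frequency index (rows)
    i = np.arange(n)[None, :]   # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] = np.sqrt(1.0 / n)  # DC row uses a different scale factor
    return m

def split_frequencies(feature, cutoff):
    """Split a 2-D feature map into low- and high-frequency parts.

    Assumption: a coefficient counts as "low" when its row index plus
    column index is below `cutoff`; the paper does not state its rule here.
    """
    h, w = feature.shape
    dh, dw = dct_matrix(h), dct_matrix(w)
    coeffs = dh @ feature @ dw.T                  # forward 2-D DCT
    rows = np.arange(h)[:, None]
    cols = np.arange(w)[None, :]
    low_mask = (rows + cols) < cutoff
    low = dh.T @ (coeffs * low_mask) @ dw         # inverse DCT of low band
    high = dh.T @ (coeffs * ~low_mask) @ dw       # inverse DCT of high band
    return low, high

def frequency_modulate(feature, cutoff=4, w_low=0.5, w_high=1.5):
    """Reweight the two bands with fixed scalars.

    Assumption: scalars stand in for the learned frequency attention maps.
    """
    low, high = split_frequencies(feature, cutoff)
    return w_low * low + w_high * high
```

Because the two masks partition the coefficient grid, the low and high parts sum back to the original feature map, so setting both weights to 1 leaves the input unchanged. Weighting the high band more strongly, as in the defaults here, is one plausible way to emphasize small, point-like targets over smooth background.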
Engineering, Computer Science