ST-Trans: Spatial-Temporal Transformer for Infrared Small Target Detection in Sequential Images

Xiaozhong Tong,Zhen Zuo,Shaojing Su,Junyu Wei,Xiaoyong Sun,Peng Wu,Zongqing Zhao
DOI: https://doi.org/10.1109/tgrs.2024.3355947
IF: 8.2
2024-02-16
IEEE Transactions on Geoscience and Remote Sensing
Abstract:The detection of small infrared targets with low signal-to-noise ratio (SNR) and low contrast in high-noise backgrounds is challenging due to the lack of spatial features of the targets and the scarcity of real-world datasets. Most existing methods are based on single-frame images, which are prone to numerous false alarms and missed detections. This article proposes spatial-temporal (ST)-Trans that provides an efficient end-to-end solution for the detection of small infrared targets in the complex context of sequential images. First, the detection of small infrared targets in complex backgrounds relying only on a single image has been significantly difficult due to the lack of available spatial features. The temporal and motion information of the sequence image was found to effectively improve target detection performance. Therefore, we used the C2FDark backbone to learn the spatial features associated with small targets, and the spatial-temporal transformer module (STTM) to learn the spatiotemporal dependencies between successive frames of small infrared targets. This improved the detection performance in challenging scenes. Second, due to the lack of publicly available infrared small target sequence datasets for training, we annotated a set of small infrared targets for challenging scenes and published them as the sequential infrared small target detection (SIRSTD) dataset. Finally, we performed extensive ablation experiments on the SIRSTD dataset and compared its performance with that of the other state-of-the-art (SOTA) methods to demonstrate the superiority of the proposed method. The results revealed that ST-Trans outperformed other models and can effectively improve the detection performance for small infrared targets. The SIRSTD dataset is available at https://github.com/aurora-sea/SIRSTD.
imaging science & photographic technology,remote sensing,engineering, electrical & electronic,geochemistry & geophysics
What problem does this paper attempt to address?