Multiscale Feature Extraction U-Net for Infrared Dim- and Small-Target Detection

Xiaozhen Wang,Chengshan Han,Jiaqi Li,Ting Nie,Mingxuan Li,Xiaofeng Wang,Liang Huang
DOI: https://doi.org/10.3390/rs16040643
IF: 5
2024-02-09
Remote Sensing
Abstract:The technology of infrared dim- and small-target detection is irreplaceable in many fields, such as those of missile early warning systems and forest fire prevention, among others. However, numerous components interfere with infrared imaging, presenting challenges for achieving successful detection of infrared dim and small targets with a low rate of false alarms. Hence, we propose a new infrared dim- and small-target detection network, Multiscale Feature Extraction U-Net for Infrared Dim- and Small-Target Detection (MFEU-Net), which can accurately detect targets in complex backgrounds. It uses the U-Net structure, and the encoders and decoders consist of ReSidual U-block and Inception, allowing rich multiscale feature information to be extracted. Thus, the effectiveness of algorithms in detecting very small-sized targets can be improved. In addition, through the multidimensional channel and spatial attention mechanism, the model can be adjusted to focus more on the target area in the image, improving its extraction of target information and detection performance in different scenarios. The experimental results show that our proposed algorithm outperforms other advanced algorithms in detection performance. On the MFIRST, SIRST, and IRSTD-1k datasets, we achieved detection rates of 0.864, 0.962, and 0.965; IoU values of 0.514, 0.671, and 0.630; and false alarm rates of 3.08 × 10−5, 2.61 × 10−6, and 1.81 × 10−5, respectively.
environmental sciences,imaging science & photographic technology,remote sensing,geosciences, multidisciplinary
What problem does this paper attempt to address?
The paper is primarily dedicated to addressing the technical challenges in infrared dim- and small-target detection. Specifically, the research team proposed a new network architecture named "Multi-Scale Feature Extraction U-Net (MFEU-Net)" aimed at improving the detection performance of Infrared Dim- and Small-Targets (IDSTs) in complex backgrounds. The main issues mentioned in the paper include: 1. **Target Characteristics**: IDSTs appear as very small and low-brightness targets in images, making them difficult to distinguish from the background. 2. **Background Interference**: Complex background information (such as noise and small edges) interferes with target detection, especially in ground backgrounds. 3. **Limitations of Existing Methods**: Traditional filters, Local Contrast Measure (LCM), and data structure-based methods can solve some problems but perform poorly in complex scenes or have issues such as high computational cost and limited real-time application. 4. **Shortcomings of Deep Learning Algorithms**: Although existing deep learning algorithms have achieved good results, most cannot balance detection rate and false alarm rate well. Additionally, some algorithms have weak generalization capabilities, limiting their application beyond specific datasets. To overcome the above issues, the authors designed the MFEU-Net network, with its main contributions including: - Combining ReSidual U-block (RSU) and Inception modules to extract multi-scale features, adapting to targets of different sizes. - Introducing Multi-Dimensional Channel and Spatial Attention Mechanism (MCSAM), enabling the network to better focus on target areas, thereby improving detection performance. - Compared to existing advanced algorithms, this algorithm achieved better detection results on different datasets, with lower miss rates and false alarm rates.