Abstract:Infrared small-target detection has widespread influences on anti-missile warning, precise weapon guidance, infrared stealth and anti-stealth, military reconnaissance, and other national defense fields. However, small targets are easily submerged in background clutter noise and have fewer pixels and shape features. Furthermore, random target positions and irregular motion can lead to target detection being carried out in the whole space–time domain. This could result in a large amount of calculation, and the accuracy and real-time performance are difficult to be guaranteed. Therefore, infrared small-target detection is still a challenging and far-reaching research hotspot. To solve the above problem, a novel multimodal feature fusion network (MFFN) is proposed, based on morphological characteristics, infrared radiation, and motion characteristics, which could compensate for the deficiency in the description of single modal characteristics of small targets and improve the recognition precision. Our innovations introduced in the paper are addressed in the following three aspects: Firstly, in the morphological domain, we propose a network with the skip-connected feature pyramid network (SCFPN) and dilated convolutional block attention module integrated with Resblock (DAMR) introduced to the backbone, which is designed to improve the feature extraction ability for infrared small targets. Secondly, in the radiation characteristic domain, we propose a prediction model of atmospheric transmittance based on deep neural networks (DNNs), which predicts the atmospheric transmittance effectively without being limited by the complex environment to improve the measurement accuracy of radiation characteristics. Finally, the dilated convolutional-network-based bidirectional encoder representation from a transformers (DC-BERT) structure combined with an attention mechanism is proposed for the feature extraction of radiation and motion characteristics. Finally, experiments on our self-established optoelectronic equipment detected dataset (OEDD) show that our method is superior to eight state-of-the-art algorithms in terms of the accuracy and robustness of infrared small-target detection. The comparative experimental results of four kinds of target sequences indicate that the average recognition rate Pavg is 92.64%, the mean average precision (mAP) is 92.01%, and the F1 score is 90.52%.

FTC-Net: Fusion of Transformer and CNN Features for Infrared Small Target Detection

MRF3Net: Infrared Small Target Detection Using Multi-Receptive Field Perception and Effective Feature Fusion

Multi-scale feature fusion attention network for infrared small target detection

SFFNet: Shallow Feature Fusion Network Based on Detection Framework for Infrared Small Target Detection

Multi-Stage Multi-Scale Local Feature Fusion for Infrared Small Target Detection

Infrared Small-Dim Target Detection with Transformer under Complex Backgrounds

DC-Net: A Dual-Channel and Cross-Scale Feature Fusion Infrared Small Target Detection Network

Multiscale Progressive Fusion Filter Network for Infrared Small Target Detection

SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection

An infrared small target detection method using coordinate attention and feature fusion

A Nonlocal Enhanced Feature Pyramid Network for Infrared Small Target Detection

HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

AFFPN: Attention Fusion Feature Pyramid Network for Small Infrared Target Detection

Infrared Small Target Detection Based on a Temporally-Aware Fully Convolutional Neural Network

HDCTfusion: Hybrid Dual-Branch Network Based on CNN and Transformer for Infrared and Visible Image Fusion

Exploring Feature Compensation and Cross-level Correlation for Infrared Small Target Detection

Improved Dense Nested Attention Network Based on Transformer for Infrared Small Target Detection

ABC: Attention with Bilinear Correlation for Infrared Small Target Detection

An Improved U-Net Infrared Small Target Detection Algorithm Based on Multi-Scale Feature Decomposition and Fusion and Attention Mechanism

IRSTFormer: A Hierarchical Vision Transformer for Infrared Small Target Detection

Infrared Small-Target Detection Based on Radiation Characteristics with a Multimodal Feature Fusion Network