MDIGCNet: Multi-Directional Information-Guided Contextual Network for Infrared Small Target Detection

Luping Zhang,Junhai Luo,Yian Huang,Fengyi Wu,Xingye Cui,Zhenming Peng
DOI: https://doi.org/10.1109/jstars.2024.3508255
IF: 4.715
2024-01-01
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
Abstract:Infrared Small Target Detection (ISTD) technology has extensive applications in the military field. Due to the quality of imaging equipment and environmental interference, infrared small target images lack texture and structural information. Deep learning-based algorithms have achieved superior accuracy in this field compared to traditional algorithms; however, these methods are often not designed with domain knowledge integration. In this paper, we propose a Multi-Directional Information-Guided Contextual Network (MDIGCNet) for ISTD. The primary structure of this network adopts the U-Net architecture. To address the issue of lacking texture and structural information in the target images, we employ an Integrated Differential Convolution (IDConv) module to extract richer image features during both the encoding and decoding stages. Skip connections in the network utilize a Multi-directional Gradient Information Extraction Block (MGIEB) to obtain gradient features of infrared small targets. Our domain-inspired Multi-directional Gaussian Differential Convolution (MGDC) module is employed to extract features of Gaussian-distributed small targets, enhancing the distinction between targets and backgrounds. Additionally, we designed a Local-Global Feature Fusion (LGFF) module incorporating an attention mechanism to merge shallow and deep features, thereby improving the efficiency of feature utilization within the model. Furthermore, since both IDConv and MGDC are parallel multi-convolutional kernel structures, reparameterization techniques are used to avoid excessive parameters and computational load. Experimental results on public datasets NUDT-SIRST, IRSTD-1k, and SIRST-Aug demonstrate that our algorithm outperforms other state-of-the-art methods in detection performance.
What problem does this paper attempt to address?