CS-ViG-UNet: Infrared small and dim target detection based on cycle shift vision graph convolution network

Jian Lin,Shaoyi Li,Xi Yang,Saisai Niu,Binbin Yan,Zhongjie Meng
DOI: https://doi.org/10.1016/j.eswa.2024.124385
IF: 8.5
2024-06-07
Expert Systems with Applications
Abstract:Infrared small and dim target detection benefits from the exploration of correlations among targets, neighboring regions, and the background. However, existing methods that rely on convolutional neural networks and vision transformers cannot effectively capture long-range information correlations within images. To overcome this limitation, this paper proposes CS-ViG-UNet, a framework that introduces vision graph convolution for infrared small and dim target detection. Our framework employs a cyclic shift sparse graph attention mechanism to address the issue of reduced expressive power. Meanwhile, the CS-ViG module is designed to construct an effective graph structure using image patches, thereby capturing feature information relevant to target recognition. On the public datasets Sirst AUG and IRSTD-1K, our method obtained F1 scores of 0.8561 and 0.745, respectively, showing an improvement of 3.15 % and 4.1 % compared to the state-of-the-art methods. On the RTX3090 with TensorRT acceleration, CS-ViG-UNet can process approximately 357 images of size 256 × 256 pixels per second at FP16 precision. For detailed information, please visit our homepage: https://linaom1214.github.io/CSViG-UNet .
computer science, artificial intelligence,engineering, electrical & electronic,operations research & management science
What problem does this paper attempt to address?