HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

Shibiao Xu,ShuChen Zheng,Wenhao Xu,Rongtao Xu,Changwei Wang,Jiguang Zhang,Xiaoqiang Teng,Ao Li,Li Guo
2024-03-16
Abstract:Infrared small object detection is an important computer vision task involving the recognition and localization of tiny objects in infrared images, which usually contain only a few pixels. However, it encounters difficulties due to the diminutive size of the objects and the generally complex backgrounds in infrared images. In this paper, we propose a deep learning method, HCF-Net, that significantly improves infrared small object detection performance through multiple practical modules. Specifically, it includes the parallelized patch-aware attention (PPA) module, dimension-aware selective integration (DASI) module, and multi-dilated channel refiner (MDCR) module. The PPA module uses a multi-branch feature extraction strategy to capture feature information at different scales and levels. The DASI module enables adaptive channel selection and fusion. The MDCR module captures spatial features of different receptive field ranges through multiple depth-separable convolutional layers. Extensive experimental results on the SIRST infrared single-frame image dataset show that the proposed HCF-Net performs well, surpassing other traditional and deep learning models. Code is available at
Computer Vision and Pattern Recognition
What problem does this paper attempt to address?
The paper attempts to address two main challenges encountered in Infrared Small Object Detection (ISOD): 1. **Information loss of small objects**: Due to the typically small size and weak thermal signals of infrared small objects, their contours are not clear, leading to information loss during multi-level downsampling. 2. **Background complexity**: Compared to visible light images, infrared images lack physical information and have lower contrast, making small objects easily overwhelmed by complex backgrounds. To solve these problems, the authors propose a deep learning method named HCF-Net (Hierarchical Context Fusion Network), which significantly improves the performance of infrared small object detection by introducing several practical modules. These modules include: - **Parallelized Patch-Aware Attention (PPA)**: Captures feature information at different scales and levels through a multi-branch feature extraction strategy, ensuring the retention of key information during multi-level downsampling. - **Dimension-Aware Selective Integration (DASI)**: Enhances skip connections in U-Net, focusing on the adaptive selection and fine integration of high-dimensional and low-dimensional features to enhance the saliency of small objects. - **Multi-Dilated Channel Refiner (MDCR)**: Captures spatial features with different receptive field ranges through multiple depthwise separable convolution layers, modeling the differences between objects and backgrounds more finely, and enhancing the localization ability of small objects. Through the organic combination of these modules, HCF-Net can more effectively address the challenges in small object detection, improving detection performance and robustness. Experimental results show that HCF-Net outperforms other traditional and deep learning models on the SIRST infrared single-frame image dataset.