HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

Shibiao Xu,ShuChen Zheng,Wenhao Xu,Rongtao Xu,Changwei Wang,Jiguang Zhang,Xiaoqiang Teng,Ao Li,Li Guo

2024-03-16

Abstract:Infrared small object detection is an important computer vision task involving the recognition and localization of tiny objects in infrared images, which usually contain only a few pixels. However, it encounters difficulties due to the diminutive size of the objects and the generally complex backgrounds in infrared images. In this paper, we propose a deep learning method, HCF-Net, that significantly improves infrared small object detection performance through multiple practical modules. Specifically, it includes the parallelized patch-aware attention (PPA) module, dimension-aware selective integration (DASI) module, and multi-dilated channel refiner (MDCR) module. The PPA module uses a multi-branch feature extraction strategy to capture feature information at different scales and levels. The DASI module enables adaptive channel selection and fusion. The MDCR module captures spatial features of different receptive field ranges through multiple depth-separable convolutional layers. Extensive experimental results on the SIRST infrared single-frame image dataset show that the proposed HCF-Net performs well, surpassing other traditional and deep learning models. Code is available at

Computer Vision and Pattern Recognition

What problem does this paper attempt to address?

The paper attempts to address two main challenges encountered in Infrared Small Object Detection (ISOD): 1. **Information loss of small objects**: Due to the typically small size and weak thermal signals of infrared small objects, their contours are not clear, leading to information loss during multi-level downsampling. 2. **Background complexity**: Compared to visible light images, infrared images lack physical information and have lower contrast, making small objects easily overwhelmed by complex backgrounds. To solve these problems, the authors propose a deep learning method named HCF-Net (Hierarchical Context Fusion Network), which significantly improves the performance of infrared small object detection by introducing several practical modules. These modules include: - **Parallelized Patch-Aware Attention (PPA)**: Captures feature information at different scales and levels through a multi-branch feature extraction strategy, ensuring the retention of key information during multi-level downsampling. - **Dimension-Aware Selective Integration (DASI)**: Enhances skip connections in U-Net, focusing on the adaptive selection and fine integration of high-dimensional and low-dimensional features to enhance the saliency of small objects. - **Multi-Dilated Channel Refiner (MDCR)**: Captures spatial features with different receptive field ranges through multiple depthwise separable convolution layers, modeling the differences between objects and backgrounds more finely, and enhancing the localization ability of small objects. Through the organic combination of these modules, HCF-Net can more effectively address the challenges in small object detection, improving detection performance and robustness. Experimental results show that HCF-Net outperforms other traditional and deep learning models on the SIRST infrared single-frame image dataset.

HCF-Net: Hierarchical Context Fusion Network for Infrared Small Object Detection

MRF3Net: Infrared Small Target Detection Using Multi-Receptive Field Perception and Effective Feature Fusion

An infrared small target detection method using coordinate attention and feature fusion

Local Information Guided Global Integration for Infrared Small Target Detection.

SFFNet: Shallow Feature Fusion Network Based on Detection Framework for Infrared Small Target Detection

ℱ3-Net: Feature Fusion and Filtration Network for Object Detection in Optical Remote Sensing Images

Infrared Small Target Detection Using Focally Multi-Patch Network

DSDANet: Infrared Dim Small Target Detection Via Attention Enhanced Feature Fusion Network

Context-aware Cross-Level Attention Fusion Network for Infrared Small Target Detection

High-Resolution Feature Representation Driven Infrared Small-Dim Object Detection.

Multi-scale feature fusion attention network for infrared small target detection

Multiscale Multilevel Residual Feature Fusion for Real-Time Infrared Small Target Detection.

Cellular Interactive Attention Network for Infrared Small Target Detection

IMD-Net: Interpretable multi-scale detection network for infrared dim and small objects

Multiscale Progressive Fusion Filter Network for Infrared Small Target Detection

Research on Single Object Detection Technology Based on Infrared Multi-spectrum Fusion

Diffusion-Based Continuous Feature Representation for Infrared Small-Dim Target Detection

CMF Net: Detecting Objects in Infrared Traffic Image with Combination of Multiscale Features

Dense Nested Attention Network for Infrared Small Target Detection

SCAFNet: Semantic-Guided Cascade Adaptive Fusion Network for Infrared Small Targets Detection

Lightweight Spatial Sliced-Concatenate-Multireceptive-Field Enhance and Joint Channel Attention Mechanism for Infrared Object Detection