A YOLO Network Based on Depthwise Convolution Attention, Feature Fusion, and KL Divergence (DFK-YOLO): A Deep Learning Method for Infrared Small Target Detection Based on YOLOv7

Peng Ji,Changhao Wu,Xiangyue Zhang,Hean Liu,Dongsheng He
DOI: https://doi.org/10.3390/electronics13234820
IF: 2.9
2024-12-07
Electronics
Abstract:Infrared imaging technology has a wide range of applications across various fields, with one of its most critical uses being the detection of small infrared targets. However, model-driven approaches often lack robustness in identifying these small targets, while current deep learning-based methods face challenges in effectively extracting and integrating features. Additionally, appropriate labeling strategies for small infrared targets remain underdeveloped. To address these limitations, this paper proposes a novel detection method based on YOLOv7. Specifically, an attention module leveraging Depthwise Convolution is incorporated into the backbone of YOLOv7. Furthermore, a new Feature Fusion Neck is designed to replace the original neck component of YOLOv7. Lastly, a novel label assignment strategy is introduced. The proposed method achieves a mAP@0.5 of 99.5% and a mAP@0.75 of 71.6% on a public dataset, surpassing the baseline YOLOv7 by 1% and 4.6%, respectively. Compared to state-of-the-art deep learning object detection methods, the proposed approach demonstrates superior performance.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?