Abstract:Aerial image target detection technology has essential application value in navigation security, traffic control and environmental monitoring. Compared with natural scene images, the background of aerial images is more complex, and there are more small targets, which puts higher requirements on the detection accuracy and real-time performance of the algorithm. To further improve the detection accuracy of lightweight networks for small targets in aerial images, we propose a cross-scale multi-feature fusion target detection method (CMF-YOLOv5s) for aerial images. Based on the original YOLOv5s, a bidirectional cross-scale feature fusion sub-network (BsNet) is constructed, using a newly designed multi-scale fusion module (MFF) and cross-scale feature fusion strategy to enhance the algorithm's ability, that fuses multi-scale feature information and reduces the loss of small target feature information. To improve the problem of the high leakage detection rate of small targets in aerial images, we constructed a multi-scale detection head containing four outputs to improve the network's ability to perceive small targets. To enhance the network's recognition rate of small target samples, we improve the K-means algorithm by introducing a genetic algorithm to optimize the prediction frame size to generate anchor boxes more suitable for aerial images. The experimental results show that on the aerial image small target dataset VisDrone-2019, the proposed method can detect more small targets in aerial images with complex backgrounds. With a detection speed of 116 FPS, compared with the original algorithm, the detection accuracy metrics mAP0.5 and mAP0.5:0.95 for small targets are improved by 5.5% and 3.6%, respectively. Meanwhile, compared with eight advanced lightweight networks such as YOLOv7-Tiny and PP-PicoDet-s, mAP0.5 improves by more than 3.3%, and mAP0.5:0.95 improves by more than 1.9%.

Improving Object Detection in YOLOv8n with the C2f-f Module and Multi-Scale Fusion Reconstruction

An improved YOLOv5 method for large objects detection with multi-scale feature cross-layer fusion network

Improved YOLO model with multi-feature fully convolutional network for object detection

M2YOLOF: Based on effective receptive fields and multiple-in-single-out encoder for object detection

An Object Detection Method Based on Improved YOLOX

MSF-YOLO: A multi-scale features fusion-based method for small object detection

FA-YOLO: Research On Efficient Feature Selection YOLO Improved Algorithm Based On FMDS and AGMF Modules

A Lightweight YOLO Object Detection Algorithm Based on Bidirectional Multi‐Scale Feature Enhancement

FE-YOLOv5: Feature enhancement network based on YOLOv5 for small object detection

MCF-YOLOv5: A Small Target Detection Algorithm Based on Multi-Scale Feature Fusion Improved YOLOv5

MFIL-FCOS: A Multi-Scale Fusion and Interactive Learning Method for 2D Object Detection and Remote Sensing Image Detection

An advanced YOLOv3 method for small object detection

Improvement of Yolov5 Target Detection Algorithm Combined with Multi-Scale Feature Fusion

CF-YOLOX: An Autonomous Driving Detection Model for Multi-Scale Object Detection

Multi-Branch Auxiliary Fusion YOLO with Re-parameterization Heterogeneous Convolutional for accurate object detection

A YOLOX Object Detection Algorithm Based on Bidirectional Cross-scale Path Aggregation

Aerial images object detection method based on cross-scale multi-feature fusion

AIE-YOLO: Auxiliary Information Enhanced YOLO for Small Object Detection

MRT-YOLO: A Fine-Grained Feature-Based Method for Object Detection

YOLOv8-CAS: An improvement of multi-class target detection algorithm based on YOLO

Object Detection for Remote Sensing Based on the Enhanced YOLOv8 With WBiFPN