Abstract:Aerial image target detection technology has essential application value in navigation security, traffic control and environmental monitoring. Compared with natural scene images, the background of aerial images is more complex, and there are more small targets, which puts higher requirements on the detection accuracy and real-time performance of the algorithm. To further improve the detection accuracy of lightweight networks for small targets in aerial images, we propose a cross-scale multi-feature fusion target detection method (CMF-YOLOv5s) for aerial images. Based on the original YOLOv5s, a bidirectional cross-scale feature fusion sub-network (BsNet) is constructed, using a newly designed multi-scale fusion module (MFF) and cross-scale feature fusion strategy to enhance the algorithm's ability, that fuses multi-scale feature information and reduces the loss of small target feature information. To improve the problem of the high leakage detection rate of small targets in aerial images, we constructed a multi-scale detection head containing four outputs to improve the network's ability to perceive small targets. To enhance the network's recognition rate of small target samples, we improve the K-means algorithm by introducing a genetic algorithm to optimize the prediction frame size to generate anchor boxes more suitable for aerial images. The experimental results show that on the aerial image small target dataset VisDrone-2019, the proposed method can detect more small targets in aerial images with complex backgrounds. With a detection speed of 116 FPS, compared with the original algorithm, the detection accuracy metrics mAP0.5 and mAP0.5:0.95 for small targets are improved by 5.5% and 3.6%, respectively. Meanwhile, compared with eight advanced lightweight networks such as YOLOv7-Tiny and PP-PicoDet-s, mAP0.5 improves by more than 3.3%, and mAP0.5:0.95 improves by more than 1.9%.

Exploiting Cross-scale Consistency for Object Detection in Aerial Images

Scale Enhancement Network for Object Detection in Aerial Images

Extended Feature Pyramid Network with Adaptive Scale Training Strategy and Anchors for Object Detection in Aerial Images

SODCNN: A Convolutional Neural Network Model for Small Object Detection in Drone-Captured Images

Small object detection leveraging density‐aware scale adaptation

Scale Decoupled Pyramid for Object Detection in Aerial Images

Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection

Scale Optimization Using Evolutionary Reinforcement Learning for Object Detection on Drone Imagery

Delving into the Scale Variance Problem in Object Detection

SSN: Scale Selection Network for Multi-Scale Object Detection in Remote Sensing Images

ZoomInNet: A Novel Small Object Detector in Drone Images with Cross-Scale Knowledge Distillation

Object Detection in Aerial Remote Sensing Images with Multi-scale Feature Enhancement

Small Object Detection with Multiscale Features

Scale Enhancement Pyramid Network for Small Object Detection from UAV Images

Scale-Adaptive Salience Supervision and Dynamic Token Filtering for Small Object Detection in Remote Sensing Images

An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

Aerial images object detection method based on cross-scale multi-feature fusion

SCLNet: A Scale-Robust Complementary Learning Network for Object Detection in UAV Images

A Deep CNN-Based Detection Method for Multi-Scale Fine-Grained Objects in Remote Sensing Images

CSSDet: small object detection via cross-scale feature enhancement on drone-view images

Multi-scale Fusion Based Multi-stage Small Object Detection in Aerial Images ∗