Abstract: Unmanned aerial vehicles (UAVs) equipped with remote-sensing object-detection devices are increasingly employed across diverse domains. However, detecting small, densely packed objects at various scales against complex backgrounds is a formidable challenge for conventional detection algorithms, and the computational constraints of UAV-embedded systems further require a careful balance between detection speed and accuracy. To address these issues, this paper proposes the Efficient Multidimensional Global Feature Adaptive Fusion Network (MGFAFNET), a detection method for UAV platforms. Our approach makes three contributions. First, we introduce the Dual-Branch Multidimensional Aggregation Backbone Network (DBMA), an efficient architecture that captures multidimensional global spatial interactions, significantly enhancing feature distinguishability for complex and occluded targets while reducing the computational burden typically associated with processing high-resolution imagery. Second, we construct the Dynamic Spatial Perception Feature Fusion Network (DSPF), which is tailored to the pronounced scale variation encountered during UAV operation. By combining multi-layer dynamic spatial fusion with feature-refinement modules, the network reduces information redundancy, leading to more efficient feature representation. Finally, we devise the Localized Compensation Dual-Mask Distillation (LCDD) strategy to transfer the rich local and global features of the higher-capacity teacher network to the more resource-constrained student network, capturing both low-level spatial details and high-level semantic cues.
The practicality and performance of MGFAFNET are validated on a dedicated UAV detection platform, and evaluations on the VisDrone2021 benchmark and a self-assembled proprietary dataset show improvements over state-of-the-art object-detection methods.

Fast Detection and Obstacle Avoidance on UAVs Using Lightweight Convolutional Neural Network Based on the Fusion of Radar and Camera

A Small UAV Detection Method Based on Optical Flow and Visual Feature Fusion

Fast Detection and Recognition Method of UAV in Sky Background

Detection and Recognition Method of Fast Low-Altitude Unmanned Aerial Vehicle Based on Dual Channel

Millimeter-Wave Radar and Vision Fusion Target Detection Algorithm Based on an Extended Network

Radar-Optical Fusion Detection of UAV Based On Improved YOLOv7-tiny

Lightweight UAV Object-Detection Method Based on Efficient Multidimensional Global Feature Adaptive Fusion and Knowledge Distillation

SLBAF-Net: Super-Lightweight bimodal adaptive fusion network for UAV detection in low recognition environment

Multi-scale object detection in UAV images based on adaptive feature fusion

Small Object Detection in UAV Images Based on YOLOv8n

Fully Convolutional Network-Based Fast UAV Detection in Pulse Doppler Radar

Lightweight Detection Network Based on Sub-Pixel Convolution and Objectness-Aware Structure for UAV Images

Real-Time Multi-Modal Active Vision for Object Detection on UAVs Equipped With Limited Field of View LiDAR and Camera

MS-YOLO: Object Detection Based on YOLOv5 Optimized Fusion Millimeter-wave Radar and Machine Vision

High-precision real-time UAV target recognition based on improved YOLOv4

A Lightweight and Accurate UAV Detection Method Based on YOLOv4

Implementation of Lightweight Convolutional Neural Networks with an Early Exit Mechanism Utilizing 40 nm CMOS Process for Fire Detection in Unmanned Aerial Vehicles

An Efficient UAV Image Object Detection Algorithm Based on Global Attention and Multi-Scale Feature Fusion

Optical Flow-Guided Deep Convolutional Neural Networks for UAV Detection in Infrared Videos

Multi-Sensor Fusion for UAV Classification Based on Feature Maps of Image and Radar Data

SCCMDet: Adaptive Sparse Convolutional Networks Based on Class Maps for Real-Time Onboard Detection in Unmanned Aerial Vehicle Remote Sensing Images