Abstract:Geolocating multiple targets of interest on the ground from an aerial platform is an important activity in many applications, such as visual surveillance. However, due to the limited measurement accuracy of commonly used airborne sensors (including altimeters, accelerometer, gyroscopes, and so on) and the small size, complex motion, and a large number of ground targets in aerial images, most of the current unmanned aerial vehicle (UAV)-based ground target geolocation algorithms have difficulty in obtaining accurate geographic location coordinates online, especially at middle and high altitudes. To solve these problems, in this article, a novel online ground multitarget geolocation framework using a UAV platform is proposed, which minimizes the introduction of sensor error sources and uses only monocular aerial image sequences and global positioning system (GPS) data to perform parallel processing of target detection and rapid 3-D sparse geographic map construction and target geographic location estimations, thereby improving the accuracy and speed of ground multitarget online geolocation. In this framework, a detection algorithm based on deep learning is first adopted to improve the accuracy and robustness of small target detection in aerial images by constructing an aerial image dataset. Then, we propose a novel target geolocation algorithm based on 3-D map construction, which combines continuous images and GPS data collected online using a UAV platform to generate a 3-D geographic map and accurately estimates the GPS location of the target center pixel through projection and triangulation of the map points on the image. Finally, we design a data transmission architecture that selects multiple processes to perform image acquisition, target detection, and target geolocation tasks in parallel and utilizes database communication between the processes to achieve accurate online geolocation of ground targets in aerial images, regardless of whether the target is in a st-tic or moving state. To evaluate the effectiveness of the proposed framework, we build an online ground multitarget geolocation system using a quad-rotor UAV and carry out a large number of experiments in simulations and real environments. Qualitative and quantitative experimental results proved that the framework can accurately locate ground targets in various complex environments, such as parks, highways, schools, cities, different flight altitudes (50–2000 m), and different attitude angles, and the average positioning error is approximately 1 m at 2000 m for cities with rich 3-D structures.

Monocular-GPS Fusion 3D Object Detection for UAVs

LODM: Large-scale Online Dense Mapping for UAV

Aerial Monocular 3D Object Detection

Learnable Cross-Scale Sparse Attention Guided Feature Fusion for UAV Object Detection

Real-Time Multi-Modal Active Vision for Object Detection on UAVs Equipped With Limited Field of View LiDAR and Camera

Self-Attention Guidance and Multiscale Feature Fusion-Based UAV Image Object Detection

Multi-Sensory Data Fusion in Terms of UAV Detection in 3D Space

MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection.

UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation

Multi-UAV Collaboration and IMU Fusion Localization Method in Partial GNSS-Denied Scenarios

Onboard Real-Time Dense Reconstruction in Large Terrain Scene Using Embedded UAV Platform

Leveraging Map Retrieval and Alignment for Robust UAV Visual Geo-Localization

Online Ground Multitarget Geolocation Based on 3-D Map Construction Using a UAV Platform

Dense and Small Object Detection in UAV-Vision Based on a Global-Local Feature Enhanced Network

Vision-Based UAV Self-Positioning in Low-Altitude Urban Environments

UCDNet: Multi-UAV Collaborative 3D Object Detection Network by Reliable Feature Mapping

Long-Tailed 3D Detection via Multi-Modal Fusion

Robust Visual-Lidar Simultaneous Localization and Mapping System for UAV

MFEFNet: A Multi-Scale Feature Information Extraction and Fusion Network for Multi-Scale Object Detection in UAV Aerial Images

Lightweight UAV Object-Detection Method Based on Efficient Multidimensional Global Feature Adaptive Fusion and Knowledge Distillation

UVCPNet: A UAV-Vehicle Collaborative Perception Network for 3D Object Detection