Abstract: Precise positioning of oblique aerial images has been widely studied in recent years, but existing methods still fall short in highly time-sensitive engineering applications. For the real-time positioning of oblique images in Unmanned Aerial Vehicle (UAV) patrolling applications, existing photogrammetry methods cannot meet real-time requirements, binocular vision methods cannot meet dynamic and precise positioning requirements, optical flow methods cannot meet absolute positioning requirements, and multi-source feature matching methods cannot meet robust positioning requirements. To meet the real-time, dynamic, precise, absolute, and robust positioning requirements of UAV patrolling images, a real-time positioning model for UAV patrolling images based on airborne LiDAR point cloud fusion is proposed. First, a precise Digital Surface Model (DSM) is generated by rasterizing the raw airborne LiDAR point cloud into an image in which each pixel's grayscale value equals the elevation of the local area covered by that pixel. Second, the generated DSM and the UAV patrolling image are fused under specific geometric constraints, so that the UAV patrolling image can be positioned in real time, pixel by pixel. Finally, selected key points on the UAV patrolling image are positioned more precisely by performing Principal Component Analysis (PCA) on the raw airborne LiDAR points surrounding them. The methods are analyzed and verified in three groups of practical experiments. The results indicate that the proposed model can position a single UAV patrolling image (4000 × 6000 pixels) in real time with an accuracy of 0.5 m within 0.38 seconds in arbitrary areas, and can further position any selected key point on the image with an accuracy of 0.2 m in 0.001 seconds.
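The first and third steps of the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation, and it rests on several assumptions that the abstract does not state: the function names `rasterize_dsm` and `local_plane_pca` are hypothetical, each DSM cell is assumed to hold the highest elevation among the points falling in it, and PCA is assumed to be used for fitting a local plane around a key point. The geometric fusion of the DSM with the image is not covered.

```python
import numpy as np


def rasterize_dsm(points, cell_size):
    """Rasterize an (N, 3) array of x, y, z points into a DSM grid.

    Assumption: each cell stores the highest z among its points.
    Cells that contain no points are set to NaN.
    """
    points = np.asarray(points, dtype=float)
    xy_min = points[:, :2].min(axis=0)
    # Integer (column, row) index of the cell each point falls into.
    cells = np.floor((points[:, :2] - xy_min) / cell_size).astype(int)
    n_cols, n_rows = cells.max(axis=0) + 1
    dsm = np.full((n_rows, n_cols), -np.inf)
    # Unbuffered per-cell maximum, so repeated indices are handled correctly.
    np.maximum.at(dsm, (cells[:, 1], cells[:, 0]), points[:, 2])
    dsm[dsm == -np.inf] = np.nan
    return dsm, xy_min


def local_plane_pca(points, center_xy, radius):
    """Fit a plane by PCA to the points within `radius` (in XY) of `center_xy`.

    Returns the neighbourhood centroid and the unit normal, taken as the
    eigenvector of the covariance matrix with the smallest eigenvalue.
    """
    points = np.asarray(points, dtype=float)
    center_xy = np.asarray(center_xy, dtype=float)
    dist = np.linalg.norm(points[:, :2] - center_xy, axis=1)
    nbrs = points[dist <= radius]
    if len(nbrs) < 3:
        raise ValueError("need at least 3 neighbouring points")
    centroid = nbrs.mean(axis=0)
    cov = np.cov((nbrs - centroid).T)
    _, eigvecs = np.linalg.eigh(cov)  # eigenvalues come back in ascending order
    normal = eigvecs[:, 0]
    if normal[2] < 0:  # orient the normal upwards for consistency
        normal = -normal
    return centroid, normal
```

Under these assumptions, `rasterize_dsm(cloud, 0.5)` would produce a grid with 0.5 m cells, and `local_plane_pca(cloud, (x, y), 1.0)` would return a local surface estimate around a key point at `(x, y)`; both parameter values are illustrative only.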

Patrol Agent: an Autonomous UAV Framework for Urban Patrol Using on Board Vision Language Model and on Cloud Large Language Model

Traffic Collisions Early Warning Aided by Small Unmanned Aerial Vehicle Companion

NavAgent: Multi-scale Urban Street View Fusion For UAV Embodied Vision-and-Language Navigation

A Vision-based UAV Tracker Aiming at Aerial Targets

Towards Realistic UAV Vision-Language Navigation: Platform, Benchmark, and Methodology

Real-Time Multi-Modal Active Vision for Object Detection on UAVs Equipped With Limited Field of View LiDAR and Camera

Target Detection for Multi-UAVs via Digital Pheromones and Navigation Algorithm in Unknown Environments

Automated and Connected Unmanned Aerial Vehicles (AC-UAV) for Service Patrol: System Design and Field Experiments

Multi-Agent Reinforcement Learning Aided Intelligent UAV Swarm for Target Tracking

Urban traffic tiny object detection via attention and multi-scale feature driven in UAV-vision

Demo Abstract: Embodied Aerial Agent for City-level Visual Language Navigation Using Large Language Model

Dense Multi-Agent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

Development of Real-Time Unmanned Aerial Vehicle Urban Object Detection System with Federated Learning

A Vision-Based Target Detection, Tracking, and Positioning Algorithm for Unmanned Aerial Vehicle

UAV Multi-Dynamic Target Interception: A Hybrid Intelligent Method Using Deep Reinforcement Learning and Fuzzy Logic

AerialVLN: Vision-and-Language Navigation for UAVs

A Real-time Positioning Model for UAV's Patrolling Images Based on Airborne LiDAR Point Cloud Fusion

Aerial Vision-and-Language Navigation via Semantic-Topo-Metric Representation Guided LLM Reasoning

Poster Abstract: Emergency Networking Using UAVs: A Reinforcement Learning Approach with Large Language Model

The Programming Model of Air-Ground Cooperative Patrol Between Multi-UAV and Police Car

A Language Agent for Autonomous Driving