Robust UAV Policy Learning for Urban Infrastructure Surface Screening*

Bingqing Du,Uddin Md. Borhan,Tianxing Chen,Jianyong Chen,Jianqiang Li,Jie Chen
DOI: https://doi.org/10.1109/icarm62033.2024.10715841
2024-01-01
Abstract:Unmanned Aerial Vehicles (UAVs) have emerged as pivotal tools for infrastructure inspection, mitigating risks to human life and enhancing operational efficiency. Despite advancements, achieving autonomous surface coverage inspection in GPS-denied environments remains a significant challenge. This paper presents a novel approach to enhance UAV policy learning for surface screening of urban infrastructure in environments lacking GPS. Utilizing Deep Reinforcement Learning (DRL), the methodology enables UAVs to autonomously conduct surface coverage inspections more robustly and efficiently. A key innovation is the Optimal Speed Decision (OSD) model, which allows UAVs to adjust flight speed dynamically based on scene perception, enhancing efficiency and safety. Additionally, an Off-Policy Critic-Based Reward Shaping (OPCBRS) method is introduced to improve learning efficiency and performance by addressing the issue of sparse rewards using offline data. Simulation results confirm the effectiveness of these methods, with the OSD model demonstrating significant improvements in navigating complex scenarios and maintaining a balance between efficiency and robustness over traditional uniform speed policies. This research offers a robust framework for UAV-based infrastructure inspection in GPS-denied environments, advancing autonomous UAV applications.
What problem does this paper attempt to address?