Abstract:Unmanned aerial vehicles (UAVs) are recognized as promising technologies for area coverage due to the flexibility and adaptability. However, the ability of a single UAV is limited, and as for the large-scale three-dimensional (3D) scenario, UAV swarms can establish seamless wireless communication services. Hence, in this work, we consider a scenario of UAV swarm deployment and trajectory to satisfy 3D coverage considering the effects of obstacles. In detail, we propose a hierarchical swarm framework to efficiently serve the large-area users. Then, the problem is formulated to minimize the total trajectory loss of the UAV swarm. However, the problem is intractable due to the non-convex property, and we decompose it into smaller issues of users clustering, UAV swarm hovering points selection, and swarm trajectory determination. Moreover, we design a Q-learning based algorithm to accelerate the solution efficiency. Finally, we conduct extensive simulations to verify the proposed mechanisms, and the designed algorithm outperforms other referred methods.

What problem does this paper attempt to address?

This paper aims to solve the problems of UAV swarm deployment and trajectory planning in large - scale three - dimensional area coverage scenarios. Specifically, the paper focuses on how to achieve efficient large - scale three - dimensional area coverage by optimizing the deployment positions and flight trajectories of the UAV swarm (UAV swarm), while considering the influence of obstacles. The main objective of the research is to minimize the total flight trajectory loss of the UAV swarm while meeting the coverage requirements, and improve energy efficiency and service quality. ### Main Problems and Challenges: 1. **Efficient Cooperation**: Multiple UAVs need to cooperate effectively to serve multiple ground users (GUs). 2. **Deployment - Energy Efficiency Balance**: Find the optimal balance point between coverage range and energy efficiency. 3. **Trajectory Planning**: Plan the flight trajectories of the UAV swarm considering the influence of obstacles. ### Solutions: 1. **Hierarchical Framework**: A hierarchical UAV swarm framework is proposed, including a cluster - head UAV (H - UAV) and multiple trailing UAVs (T - UAVs), to efficiently complete the coverage task. 2. **Problem Decomposition**: Decompose the complex problem into smaller sub - problems, including the clustering of ground users, the selection of UAV swarm hovering points and trajectory planning. 3. **K - means Clustering**: Use the K - means algorithm to cluster ground users and select the optimal hovering points as deployment positions. 4. **Markov Decision Process (MDP) Modeling**: Model the trajectory planning problem as MDP to describe the complex environment. 5. **Q - learning Algorithm**: Design a Q - learning - based UAV swarm trajectory planning algorithm (QLUTP) to accelerate the efficiency of the solution. ### Experimental Verification: The paper verifies the effectiveness of the proposed method through extensive simulations. The experimental results show that the designed algorithm is superior to other benchmark methods in terms of coverage performance and trajectory planning. ### Conclusion: This paper proposes a hierarchical UAV swarm framework, combined with K - means clustering and Q - learning algorithm, which effectively solves the problems of UAV swarm deployment and trajectory planning in large - scale three - dimensional area coverage scenarios. The experimental results verify the superior performance of this method in improving the coverage rate and reducing the flight trajectory loss.

UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning

Deep Reinforcement Learning Based Three-Dimensional Area Coverage With UAV Swarm

Three-Dimensional Area Coverage with UAV Swarm Based on Deep Reinforcement Learning

Towards 3D Deployment of UAV Base Stations in Uneven Terrain.

Three-Dimension Trajectory Design for Multi-UAV Wireless Network With Deep Reinforcement Learning

Three-dimensional deep reinforcement learning for trajectory and resource optimization in UAV communication systems

Learning-Based UAV Coverage-Aware Path Planning in Large-scale Urban Environments

3D-Trajectory and Phase-Shift Design for RIS-Assisted UAV Systems Using Deep Reinforcement Learning

Collaborative Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Trajectory Design for 3D UAV Tracking

Multi-UAV Cooperative Target Tracking Based on Swarm Intelligence

Computationally-Efficient Distributed Algorithms of Navigation of Teams of Autonomous UAVs for 3D Coverage and Flocking

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Dynamic Decentralized 3D Urban Coverage and Patrol with UAVs

Control-Aware Trajectory Predictions for Communication-Efficient Drone Swarm Coordination in Cluttered Environments

A deep reinforcement learning based distributed multi-UAV dynamic area coverage algorithm for complex environment

Collaborative Coverage Path Planning of UAV Cluster based on Deep Reinforcement Learning

UAV Swarm Path Planning with Reinforcement Learning for Field prospecting

UAV Obstacle Avoidance by Human-in-the-Loop Reinforcement in Arbitrary 3D Environment

Multi-UAV Trajectory Planning for Energy-Efficient Content Coverage: A Decentralized Learning-Based Approach

3D UAV Trajectory and Data Collection Optimisation via Deep Reinforcement Learning