UAV Swarm Deployment and Trajectory for 3D Area Coverage via Reinforcement Learning

Jia He,Ziye Jia,Chao Dong,Junyu Liu,Qihui Wu,Jingxian Liu
2023-09-21
Abstract:Unmanned aerial vehicles (UAVs) are recognized as promising technologies for area coverage due to the flexibility and adaptability. However, the ability of a single UAV is limited, and as for the large-scale three-dimensional (3D) scenario, UAV swarms can establish seamless wireless communication services. Hence, in this work, we consider a scenario of UAV swarm deployment and trajectory to satisfy 3D coverage considering the effects of obstacles. In detail, we propose a hierarchical swarm framework to efficiently serve the large-area users. Then, the problem is formulated to minimize the total trajectory loss of the UAV swarm. However, the problem is intractable due to the non-convex property, and we decompose it into smaller issues of users clustering, UAV swarm hovering points selection, and swarm trajectory determination. Moreover, we design a Q-learning based algorithm to accelerate the solution efficiency. Finally, we conduct extensive simulations to verify the proposed mechanisms, and the designed algorithm outperforms other referred methods.
Signal Processing,Networking and Internet Architecture
What problem does this paper attempt to address?
This paper aims to solve the problems of UAV swarm deployment and trajectory planning in large - scale three - dimensional area coverage scenarios. Specifically, the paper focuses on how to achieve efficient large - scale three - dimensional area coverage by optimizing the deployment positions and flight trajectories of the UAV swarm (UAV swarm), while considering the influence of obstacles. The main objective of the research is to minimize the total flight trajectory loss of the UAV swarm while meeting the coverage requirements, and improve energy efficiency and service quality. ### Main Problems and Challenges: 1. **Efficient Cooperation**: Multiple UAVs need to cooperate effectively to serve multiple ground users (GUs). 2. **Deployment - Energy Efficiency Balance**: Find the optimal balance point between coverage range and energy efficiency. 3. **Trajectory Planning**: Plan the flight trajectories of the UAV swarm considering the influence of obstacles. ### Solutions: 1. **Hierarchical Framework**: A hierarchical UAV swarm framework is proposed, including a cluster - head UAV (H - UAV) and multiple trailing UAVs (T - UAVs), to efficiently complete the coverage task. 2. **Problem Decomposition**: Decompose the complex problem into smaller sub - problems, including the clustering of ground users, the selection of UAV swarm hovering points and trajectory planning. 3. **K - means Clustering**: Use the K - means algorithm to cluster ground users and select the optimal hovering points as deployment positions. 4. **Markov Decision Process (MDP) Modeling**: Model the trajectory planning problem as MDP to describe the complex environment. 5. **Q - learning Algorithm**: Design a Q - learning - based UAV swarm trajectory planning algorithm (QLUTP) to accelerate the efficiency of the solution. ### Experimental Verification: The paper verifies the effectiveness of the proposed method through extensive simulations. The experimental results show that the designed algorithm is superior to other benchmark methods in terms of coverage performance and trajectory planning. ### Conclusion: This paper proposes a hierarchical UAV swarm framework, combined with K - means clustering and Q - learning algorithm, which effectively solves the problems of UAV swarm deployment and trajectory planning in large - scale three - dimensional area coverage scenarios. The experimental results verify the superior performance of this method in improving the coverage rate and reducing the flight trajectory loss.