Abstract:The deployment of UAVs as aerial base stations (BSs) has been considered as a promising supplement to the ground networks, which can quickly build an emergency communication network in a disaster area or significantly relief the communication burden imposed by hot-spots. However, the application of UAVs as aerial BSs is constrained by the limited onboard energy and communication coverage of UAVs. In particular, for a large target area, multiple UAVs should be deployed to meet the communication requirements. Therefore, designing the optimal trajectories of multiple UAVs is crucial to boost the UAV network performance. Inspired by the promising future of UAV BSs, this paper aims at proposing a distributed 3-dimensional (3D) trajectory design algorithm for multiple UAVs to optimize the system performance. We formulate the trajectory design problem as a multi-objective optimization problem to improve the user equipment (UE) access rate, ensure fair access opportunities, increase transmitted data volume and reduce energy consumption. Further inspired by the decision-making ability of deep reinforcement learning (DRL) in complex environments, we propose a DRL based trajectory design algorithm for multiple UAVs, namely DMTD, in which UAVs can explore both the optimal flight altitude and the potential UE distribution area in the iterative interactions with the environment, and then select the optimal flight trajectories to boost the network performance from multiple aspects. Extensive experimental results under different UE distributions have demonstrated that the proposed DMTD algorithm can find the optimal altitude to provide maximum coverage. Moreover, DMTD beats existing algorithms by providing high UE access rate, ensuring fair network service and increasing total transmitted data volume at the cost of a relatively low energy consumption. Especially in the scenes with dense and randomly distributed UEs, DMTD provides a UE access rate close to 0.9 and transmits 6 times of data volume than existing algorithms.

Trajectory Design for UAV-Based Inspection System: A Deep Reinforcement Learning Approach.

Trajectory Design for UAV-Based Internet of Things Data Collection: A Deep Reinforcement Learning Approach

Three-Dimensional Trajectory Design for Multi-User MISO UAV Communications: A Deep Reinforcement Learning Approach

Cellular UAV-to-Device Communications: Trajectory Design and Mode Selection by Multi-agent Deep Reinforcement Learning

3D-Trajectory and Phase-Shift Design for RIS-Assisted UAV Systems Using Deep Reinforcement Learning

Multi-UAV Trajectory Design and Power Control Based on Deep Reinforcement Learning.

Deep Reinforcement Learning-Based 3D Trajectory Planning for Cellular Connected UAV

Deep Reinforcement Learning Based Distributed 3D UAV Trajectory Design

Three-dimensional deep reinforcement learning for trajectory and resource optimization in UAV communication systems

Deep Reinforcement Learning Based Trajectory Design and Resource Allocation for UAV-Assisted Communications

Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System.

Distributed Trajectory Design for Cooperative Internet of UAVs Using Deep Reinforcement Learning

Autonomous Navigation for Cellular-Connected UAV in Highly Dynamic Environments: A Deep Reinforcement Learning Approach

Three-Dimension Trajectory Design for Multi-UAV Wireless Network With Deep Reinforcement Learning

Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches

Simultaneous Navigation and Radio Mapping for Cellular-Connected UAV With Deep Reinforcement Learning

Continuous Transfer Learning for UAV Communication-aware Trajectory Design

Federated deep reinforcement learning based trajectory design for UAV-assisted networks with mobile ground devices

Mobility-Aware Trajectory Design For Aerial Base Station Using Deep Reinforcement Learning

Deep Reinforcement Learning Multi-UAV Trajectory Control for Target Tracking

Memory-Enhanced Twin Delayed Deep Deterministic Policy Gradient (ME-TD3)-Based Unmanned Combat Aerial Vehicle Trajectory Planning for Avoiding Radar Detection Threats in Dynamic and Unknown Environments