Abstract: As an important part of fifth-generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios owing to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure fairness, we maximize the minimum average user rate over the observed time by jointly optimizing the UAVs' trajectories, transmission power, and user association. Because UAVs can cover a large area for communications, they do not need to move every time the users move. We therefore propose a two-timescale structure for the considered scenario, in which the UAVs' trajectories are optimized based on the channel state information (CSI) on a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI on a short timescale. To tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, in which a deep reinforcement learning based Pointer Network, named advantage pointer-critic (APC), optimizes the discrete variables and a deep-unfolding NN optimizes the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network, trained by the advantage actor-critic algorithm, to address it. The APC network consists of a Pointer Network and a multilayer perceptron. For the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. The two networks are jointly trained in an unsupervised fashion. Simulation results show that the proposed joint NN significantly outperforms the optimization algorithm while requiring much lower complexity, and that it scales and generalizes well.
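The max-min objective stated in the abstract can be illustrated with a small sketch. It evaluates the minimum time-averaged user rate under co-channel interference for given trajectories, powers, and associations. The abstract does not specify the channel model, so the free-space line-of-sight gain `beta0 / d^2`, the fixed UAV `height`, the `noise` power, and the rule that each UAV serves at most one user per slot are all assumptions made here for illustration. The function name `min_average_rate` and its parameters are hypothetical and are not taken from the paper.

```python
import math

def min_average_rate(uav_pos, user_pos, power, assoc,
                     height=100.0, beta0=1e-3, noise=1e-9):
    """Minimum over users of the time-averaged rate (bit/s/Hz).

    uav_pos[t][m]  : (x, y) of UAV m in slot t
    user_pos[t][k] : (x, y) of user k in slot t
    power[t][m]    : transmit power of UAV m in slot t (W)
    assoc[t][k]    : index of the UAV serving user k in slot t, or None
    Assumed channel model: free-space gain beta0 / distance^2.
    """
    num_slots = len(user_pos)
    num_users = len(user_pos[0])
    totals = [0.0] * num_users
    for t in range(num_slots):
        for k in range(num_users):
            m = assoc[t][k]
            if m is None:
                continue  # user k is not served in this slot: rate 0
            kx, ky = user_pos[t][k]
            # Received power at user k from every UAV in this slot.
            rx = []
            for j, (ux, uy) in enumerate(uav_pos[t]):
                d2 = (ux - kx) ** 2 + (uy - ky) ** 2 + height ** 2
                rx.append(power[t][j] * beta0 / d2)
            # Co-channel interference: all UAVs except the serving one.
            interference = sum(rx) - rx[m]
            sinr = rx[m] / (interference + noise)
            totals[k] += math.log2(1.0 + sinr)
    # Fairness metric: the worst user's average rate over all slots.
    return min(total / num_slots for total in totals)
```

In this sketch, an optimizer would adjust `uav_pos` on the long timescale and `power` and `assoc` on the short timescale in order to raise the returned value.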

Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks

Deep reinforcement learning in NOMA-assisted UAV networks for path selection and resource offloading

Multi-Agent Reinforcement Learning in NOMA-aided UAV Networks for Cellular Offloading

Deep Reinforcement Learning and NOMA-Based Multi-Objective RIS-Assisted IS-UAV-TNs: Trajectory Optimization and Beamforming Design

Energy-efficient UAV communication: A NOMA scheme with resource allocation and trajectory optimization

Joint Resource Allocation and Trajectory Optimization with QoS in UAV-Based NOMA Wireless Networks

UAV Trajectory and Resource Optimization for NOMA-VLC Systems Via HA-DRL Algorithm

Deep Reinforcement Learning for Aerial Data Collection in Hybrid-Powered NOMA-IoT Networks

Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

Deep Reinforcement Learning-Based UAV Data Collection and Offloading in NOMA-Enabled Marine IoT Systems

Resource Allocation and 3D Trajectory Design for Power-Efficient IRS-Assisted UAV-NOMA Communications

Joint Resource Allocation and Trajectory Optimization with QoS in NOMA UAV Networks

A deep reinforcement learning framework and its implementation for UAV-aided covert communication

Resource Allocation in UAV-Assisted Networks: A Clustering-Aided Reinforcement Learning Approach

Intelligent Resource Allocation for UAV-Based Cognitive NOMA Networks: An Active Inference Approach

RIS-Aided Ground-Aerial NOMA Communications: A Distributionally Robust DRL Approach

Online Maneuver Design for UAV-Enabled NOMA Systems via Reinforcement Learning

AI-based Radio Resource Management and Trajectory Design for PD-NOMA Communication in IRS-UAV Assisted Networks

Joint Trajectory Design and Power Allocation for UAV-Enabled Non-Orthogonal Multiple Access Systems

Multiple Access for Mobile-UAV Enabled Networks: Joint Trajectory Design and Resource Allocation

Three-dimensional deep reinforcement learning for trajectory and resource optimization in UAV communication systems