Air-Ground Coordination Communication by Multi-Agent Deep Reinforcement Learning

Ruijin Ding, Feifei Gao, Guanghua Yang, Xuemin Sherman Shen
DOI: https://doi.org/10.1109/icc42927.2021.9500477
2021-01-01
Abstract: In this paper, we investigate an air-ground coordination communication system in which ground users (GUs) access suitable UAV base stations (UAV-BSs) to maximize their own throughput, while the UAV-BSs design their trajectories to maximize the total throughput and maintain fairness among GUs. Note that the action space of the GUs is discrete, whereas the action space of the UAV-BSs is continuous. To deal with this hybrid action space, we propose a multi-agent deep reinforcement learning (MADRL) approach, named AG-PMADDPG (air-ground probabilistic multi-agent deep deterministic policy gradient), in which GUs transform the discrete actions into continuous action probabilities and then sample actions according to those probabilities. The proposed method enables the users to make decisions based on their local information, which is beneficial for user privacy. Simulation results demonstrate that AG-PMADDPG can outperform the benchmark algorithms in terms of fairness and throughput.
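The probabilistic handling of the GUs' discrete actions described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the probabilities are produced, so the softmax over per-UAV-BS scores is an assumption made here for illustration, and the names `softmax`, `sample_association`, and `logits` are hypothetical rather than taken from the paper.

```python
import math
import random


def softmax(logits):
    """Map raw scores to a probability vector (assumed transform, not from the paper)."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_association(probs, rng):
    """Sample the index of the UAV-BS that a GU accesses, according to probs."""
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


# Hypothetical scores that one GU's actor network might output for 3 UAV-BSs.
logits = [0.2, 1.5, -0.3]

# Continuous action: the probability vector, usable by a DDPG-style critic.
probs = softmax(logits)

# Discrete action: the UAV-BS actually accessed, sampled from the probabilities.
rng = random.Random(0)  # seeded only to make the sketch reproducible
choice = sample_association(probs, rng)
```

In this sketch the probability vector plays the role of the continuous action, while the sampled index is the discrete association decision executed in the environment.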