A Graph-Based PPO Approach in Multi-UAV Navigation for Communication Coverage

Zhiling Jiang, Yining Chen, Ke Wang, Bowei Yang, Guanghua Song
DOI: https://doi.org/10.15837/ijccc.2023.6.5505
IF: 2.635
2023-01-01
International Journal of Computers Communications & Control
Abstract: Multi-Agent Reinforcement Learning (MARL) is widely used to solve real-world problems in which multiple agents act in a shared environment. The existing Proximal Policy Optimization (PPO) algorithm can be applied to multi-agent reinforcement learning, but it cannot deal with the communication problem between agents. To resolve this issue, we propose a Graph-based PPO algorithm. This approach addresses the communication problem between agents, enhances the exploration efficiency of agents in the environment, and speeds up the learning process. We apply the proposed approach to the task of multi-UAV navigation for communication coverage to verify its functionality and performance.