HA-MARL: Heuristic and APF Assisted Multi-Agent Reinforcement Learning for Wireless Data Sharing in AUV Swarms

Zonglin Li, Jun Du, Chunxiao Jiang, Weishi Mi, Yong Ren
DOI: https://doi.org/10.1109/icc51166.2024.10622437
2024-01-01
Abstract: This paper focuses on the design of an intelligent game strategy for multi-autonomous underwater vehicle (multi-AUV) underwater network systems. The challenge lies in ensuring coordination and stability among AUVs in complex underwater environments. To meet underwater data sharing requirements, we formulate an intelligent game strategy that incorporates communication delays by modeling the problem as a partially observable Markov decision process (POMDP). Additionally, to address sparse rewards during exploration in multi-agent reinforcement learning (MARL) models and to improve coordination among AUVs, we propose a heuristic and artificial potential field (APF)-assisted multi-agent proximal policy optimization (HA-MAPPO) algorithm. The proposed scheme mitigates sparse rewards by using APF as a path planner and then applies a heuristic algorithm for task scheduling to achieve optimal goal allocation. Simulation results demonstrate that the proposed HA-MAPPO algorithm outperforms current mainstream MARL algorithms in convergence speed while maximizing the winning rate.
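To make the role of APF as a path planner concrete, the following is a minimal sketch of one step of a textbook artificial potential field planner. It is not the paper's implementation: the 2D setting, the function name `apf_step`, the gains `k_att` and `k_rep`, the influence radius `rho0`, and the fixed step size are all assumptions chosen for illustration. The agent is pulled toward its goal by an attractive force and pushed away from any obstacle closer than `rho0`.

```python
import math


def apf_step(pos, goal, obstacles, k_att=1.0, k_rep=1.0, rho0=2.0, step=0.1):
    """Move one fixed-length step along the combined APF force (2D sketch).

    All parameters are hypothetical illustration values, not taken from the paper.
    """
    # Attractive force: proportional to the vector from the agent to the goal.
    fx = k_att * (goal[0] - pos[0])
    fy = k_att * (goal[1] - pos[1])

    # Repulsive force: only obstacles inside the influence radius rho0 contribute.
    for ox, oy in obstacles:
        dx, dy = pos[0] - ox, pos[1] - oy
        rho = math.hypot(dx, dy)
        if 0.0 < rho < rho0:
            mag = k_rep * (1.0 / rho - 1.0 / rho0) / rho ** 2
            fx += mag * dx / rho
            fy += mag * dy / rho

    # Normalize so that each call moves the agent by exactly `step`.
    norm = math.hypot(fx, fy)
    if norm == 0.0:
        return pos  # zero net force: at the goal or in a local minimum
    return (pos[0] + step * fx / norm, pos[1] + step * fy / norm)


# Usage: an obstacle above the straight-line path deflects the agent downward.
next_pos = apf_step((0.0, 0.0), (10.0, 0.0), [(0.5, 0.5)])
```

Because the step length is fixed, the agent can oscillate around the goal instead of settling on it. A practical planner would typically stop once the agent is within a tolerance of the goal, and would need some mechanism for escaping local minima.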