Adaptive Meta-Reinforcement Learning for AUVs 3D Guidance and Control under Unknown Ocean Currents

Yu Jiang, Kaixin Zhang, Minghao Zhao, Hongde Qin
DOI: https://doi.org/10.1016/j.oceaneng.2024.118498
IF: 5
2024-01-01
Ocean Engineering
Abstract: Given the challenges posed by underwater communication constraints, Autonomous Underwater Vehicles (AUVs) must operate with enhanced autonomy while balancing operational safety and efficiency within complex marine settings. AUVs face a 3D guidance and control challenge, requiring complex adjustments to unknown ocean currents that affect their stability, maneuverability, and Guidance-Navigation-Control system efficiency. This study introduces and implements an off-policy meta-reinforcement learning framework to address AUVs’ 3D guidance and control challenges under unknown ocean currents. Our method, APE-SAC, utilizes an inference network to identify latent variables, streamlining the adaptation to unknown ocean currents by optimizing their strength and direction. Consequently, this framework redefines the issue of guidance and control under unknown current disturbances into a manageable learning task across multiple predefined scenarios. Utilizing the 3D dynamics of the REMUS AUV, we construct meta-Markov Decision Processes to detail the comprehensive state space, action space, and robust reward function tailored to the task’s specific demands. Furthermore, we develop a highly realistic simulation environment specifically designed for comprehensive experiments. Extensive results demonstrate that, compared to deep reinforcement learning baselines, APE-SAC exhibits outstanding performance in guiding and controlling AUVs under unknown ocean currents.
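The idea of treating each unknown current as one task in a family of predefined scenarios can be illustrated with a minimal sketch. This is not the paper's implementation: the `CurrentTask` class, the `sample_tasks` helper, the speed and pitch ranges, and the units are all assumptions chosen for illustration. Each task fixes a current strength and direction, which a meta-RL agent would then have to infer from experience rather than observe directly.

```python
import math
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class CurrentTask:
    """One hypothetical ocean-current scenario (illustrative only)."""
    speed: float    # current strength, assumed to be in m/s
    heading: float  # horizontal direction, radians
    pitch: float    # vertical direction, radians

    def velocity(self):
        """Return the current as an (x, y, z) velocity vector."""
        horizontal = self.speed * math.cos(self.pitch)
        return (
            horizontal * math.cos(self.heading),
            horizontal * math.sin(self.heading),
            self.speed * math.sin(self.pitch),
        )


def sample_tasks(n, max_speed=0.5, max_pitch=math.radians(10), seed=None):
    """Draw n current scenarios; the ranges are assumed, not from the paper."""
    rng = random.Random(seed)
    return [
        CurrentTask(
            speed=rng.uniform(0.0, max_speed),
            heading=rng.uniform(-math.pi, math.pi),
            pitch=rng.uniform(-max_pitch, max_pitch),
        )
        for _ in range(n)
    ]


# A small set of training scenarios, fixed by the seed for repeatability.
train_tasks = sample_tasks(8, seed=0)
```

In a setup like this, the simulator would add `task.velocity()` to the vehicle's motion while hiding the task parameters from the policy, so that adaptation has to come from an inferred latent variable.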