Underwater Multi-Target Node Path Planning in Hybrid Action Space: A Deep Reinforcement Learning Approach

Guangjie Han, Zixiao Feng, Hao Wang, Yun Hou, Fan Zhang
DOI: https://doi.org/10.1109/tmc.2024.3421541
IF: 6.075
2024-01-01
IEEE Transactions on Mobile Computing
Abstract: Path planning is a basic requirement for Autonomous Underwater Vehicles (AUVs) to accomplish underwater missions. However, previous studies often have limitations, such as ignoring the basic condition that the AUV operates in an ocean current environment and discretizing its actions without considering the action space, which leaves the simulation far from the actual situation. To address these problems, this paper proposes using a Parametrized Deep Q-Network (PDQN) for path planning. Based on the current local observation, the PDQN outputs hybrid actions, which lets the AUV flexibly avoid obstacles under limited sensor observations and control its motion at a fine granularity. In the simulation environment, the AUV must visit multiple underwater target nodes and decelerate within each node's communication range so that it has enough time to communicate with the node. The PDQN enables the AUV to learn the connection between the current state and discrete actions, and it outputs the continuous action that corresponds to the chosen discrete action, which yields a time-saving strategy of accelerating and then decelerating between nodes. The simulation environment is also built from real ocean current data and terrain data to reproduce actual conditions as accurately as possible, and the simulation results demonstrate the superiority and robustness of the algorithm.
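The hybrid action described in the abstract pairs a discrete choice with a continuous value. The following is a minimal sketch of that selection step, written in the style of a generic parametrized DQN rather than the authors' implementation. The action names, the state dimension, the single-layer linear "networks" with random weights, and the class name `TinyPDQN` are all assumptions made for illustration; the abstract does not specify any of them, and no training is shown.

```python
import numpy as np

# Hypothetical discrete actions; the paper's actual action set is not given in the abstract.
DISCRETE_ACTIONS = ("accelerate", "decelerate", "turn")
STATE_DIM = 4  # assumed size of the local observation vector


class TinyPDQN:
    """Illustrative hybrid-action selector with untrained, random linear layers."""

    def __init__(self, state_dim, n_actions, seed=0):
        rng = np.random.default_rng(seed)
        self.n_actions = n_actions
        # Parameter network: state -> one continuous parameter per discrete action.
        self.w_param = rng.normal(scale=0.5, size=(state_dim, n_actions))
        # Q network: [state, all parameters] -> one Q-value per discrete action.
        self.w_q = rng.normal(scale=0.5, size=(state_dim + n_actions, n_actions))

    def parameters(self, state):
        # tanh keeps every continuous parameter in [-1, 1].
        return np.tanh(state @ self.w_param)

    def q_values(self, state, params):
        return np.concatenate([state, params]) @ self.w_q

    def select_action(self, state, epsilon=0.0, rng=None):
        """Return (discrete action index, continuous parameter of that action)."""
        params = self.parameters(state)
        q = self.q_values(state, params)
        if rng is not None and rng.random() < epsilon:
            k = int(rng.integers(self.n_actions))  # explore: random discrete action
        else:
            k = int(np.argmax(q))  # exploit: discrete action with the highest Q-value
        return k, float(params[k])


agent = TinyPDQN(STATE_DIM, len(DISCRETE_ACTIONS))
state = np.array([0.2, -0.1, 0.5, 0.0])  # made-up observation
k, x_k = agent.select_action(state)
print(DISCRETE_ACTIONS[k], x_k)
```

In this sketch the continuous parameters are computed for every discrete action first, and the Q-values are then evaluated on the state together with those parameters, so the discrete choice is made with its continuous counterpart already known. In a setting like the one the abstract describes, the continuous value could stand for a speed or heading magnitude, but that mapping is an assumption here.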