Research on Underwater Gliders Path Tracking Based on Reinforcement Learning Algorithm

SHI Qingqing, ZHANG Runfeng, ZHANG Lianhong, LAN Shiquan
DOI: https://doi.org/10.3969/j.issn.1004-132x.2023.09.011
2023-01-01
Abstract: Ocean currents cause large deviations between the actual and planned paths of underwater gliders. To address this, a neural network model for ocean current prediction that combines long short-term memory (LSTM) with an attention mechanism was established on the basis of the traditional LSTM network. A dynamic Q-table of underwater glider motions was generated by a deep neural network, and the optimal motion attitude was selected by a reinforcement learning algorithm. Taking the influence of ocean currents into account, an underwater glider path tracking algorithm was constructed based on deep reinforcement learning. The results show that the attention-based LSTM network achieves lower mean square error and root mean square error in ocean current prediction than the traditional autoregressive integrated moving average (ARIMA) model and the plain LSTM network. Compared with traditional PID control, the deep reinforcement learning model can reduce the root mean square error of the underwater glider trajectory by 50.9% and significantly improve path tracking accuracy.
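The two mechanisms named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the dot-product scoring, the function names `attention_pool` and `select_action`, the `epsilon` exploration parameter, and all numeric inputs are assumptions chosen for illustration. The first function shows how an attention mechanism might weight a sequence of LSTM hidden states; the second shows how an action (here, a candidate motion attitude) might be picked from Q-values produced by a network.

```python
import math
import random


def attention_pool(hidden_states, query):
    """Softmax-weighted average of hidden states.

    Each hidden state is scored by its dot product with `query`
    (an assumed scoring rule; the paper's scoring may differ).
    Returns the context vector and the attention weights.
    """
    scores = [sum(h_i * q_i for h_i, q_i in zip(h, query)) for h in hidden_states]
    max_score = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(hidden_states[0])
    context = [
        sum(w * h[d] for w, h in zip(weights, hidden_states)) for d in range(dim)
    ]
    return context, weights


def select_action(q_values, epsilon=0.0, rng=random):
    """Epsilon-greedy choice over a list of Q-values.

    With probability `epsilon` a random action index is returned;
    otherwise the index of the largest Q-value is returned.
    """
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])


# Hypothetical hidden states for three time steps of a current-speed series.
states = [[0.1, 0.2], [0.4, 0.1], [0.9, 0.7]]
context, weights = attention_pool(states, query=[1.0, 1.0])

# Hypothetical Q-values for three candidate motion attitudes.
best = select_action([0.1, 0.9, 0.3], epsilon=0.0)
```

With `epsilon=0.0` the selection is purely greedy, which corresponds to choosing the attitude with the highest estimated value; a nonzero `epsilon` would typically be used only during training.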