A Real-time Algorithm for USV Navigation Based on Deep Reinforcement Learning

Zhiguo Zhou,Yipeng Zheng,Kaiyuan Liu,Xu He,Chong Qu
DOI: https://doi.org/10.1109/icsidp47821.2019.9173280
2019-01-01
Abstract:Aiming at the demand of flexibility and real-time performance in unknown aquatorium, a path planning algorithm based on Deep Reinforcement Learning (DRL) is proposed. According to plan-avoid-acclimate request, the proposed algorithm involves optimization of net structure and navigation data enrichment based on $A$ 3C, re-regulation of action space of the agent, and is trained with specific tasks in three kinds of maps to improve flexibility. The algorithm is integrated with GPU, which helps achieve high training efficiency and real-time performance by creating a neural network to collect pre-training data. Experimental results show that obstacle ability is confirmed. In comparison with current algorithm, training time reduces by 59.3% and efficiency rises by more than 71.7%. Meanwhile, performance of trained model in unknown environment is validated.
What problem does this paper attempt to address?