Performance-Guaranteed Adaptive Optimized Control of Intelligent Surface Vehicle Using Reinforcement Learning

Chao Dong,Lin Chen,Shi-Lu Dai
DOI: https://doi.org/10.1109/tiv.2023.3338486
IF: 8.2
2024-01-01
IEEE Transactions on Intelligent Vehicles
Abstract:This article develops a reinforcement learning (RL) strategy to address the robust optimal tracking control problem for an intelligent surface vehicle (ISV) with modeling uncertainties and unknown ocean disturbances. A neural network (NN) identifier is constructed to learn uncertain nonlinear dynamics. Unlike the typical architecture of actor-critic networks, a single-critic network is employed to obtain the approximate solution of Hamilton-Jacobi-Bellman (HJB) equation. By introducing an additional stability term and experience replay (ER) technique, we present a novel critic weight update rule, such that (i) the traditional persistent excitation (PE) condition is relaxed, and (ii) the request of an initial admissible control is alleviated. Subsequently, a performance-guaranteed adaptive optimal tracking control algorithm is developed to guarantee the prescribed transient behavior of tracking errors and minimize the cost function simultaneously. A rigorous theoretical analysis indicates that the developed controller guarantees semi-globally uniformly ultimate boundedness of the closed-loop adaptive system with prescribed performance. Simulation studies demonstrate the effectiveness and robustness of the presented control algorithm.
What problem does this paper attempt to address?