Abstract:We present a novel approach for achieving high-precision trajectory tracking control in an unmanned surface vehicle (USV) through utilization of receding horizon reinforcement learning (RHRL). The control architecture for the USV involves a composite of feedforward and feedback components. The feedforward control component is derived directly from the curvature of the reference path and the dynamic model. Feedback control is acquired through application of the RHRL algorithm, effectively addressing the problem of achieving optimal tracking control. The methodology introduced in this paper synergizes with the rolling time domain optimization mechanism, converting the perpetual time domain optimal control predicament into a succession of finite time domain control problems amenable to resolution. In contrast to Lyapunov model predictive control (LMPC) and sliding mode control (SMC), our proposed method employs the RHRL controller, which yields an explicit state feedback control law. This characteristic endows the controller with the dual capabilities of direct offline and online learning deployment. Within each prediction time domain, we employ a time-independent executive–evaluator network structure to glean insights into the optimal value function and control strategy. Furthermore, we substantiate the convergence of the RHRL algorithm in each prediction time domain through rigorous theoretical proof, with concurrent analysis to verify the stability of the closed-loop system. To conclude, USV trajectory control tests are carried out within a simulated environment.

Hypersonic Vehicle Attitude-Tracking Control Using Model-Free Deep Reinforcement Learning

Model-free Maneuvering Control of Fixed-Wing UAVs Based on Deep Reinforcement Learning

Reinforcement Learning Control of Hypersonic Vehicles and Performance Evaluations

Attitude Control of Hypersonic Vehicle based on Reinforcement Learning

Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

Deep Reinforcement Learning-Based Backstepping Control of Air-Breathing Hypersonic Vehicles with Actuator Constraints

Learning with Training Wheels: Speeding up Training with a Simple Controller for Deep Reinforcement Learning

Reinforcement Learning for UAV Attitude Control

Robust adaptive dynamic programming based attitude tracking control for hypersonic vehicle

USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning

Learning Hierarchical Behavior and Motion Planning for Autonomous Driving.

Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization

Reinforcement learning strategy for spacecraft attitude hyperagile tracking control with uncertainties

Online policy iteration ADP-based attitude-tracking control for hypersonic vehicles

Continuous‐time receding‐horizon reinforcement learning and its application to path‐tracking control of autonomous ground vehicles

Adaptive Nonlinear Model Predictive Horizon Using Deep Reinforcement Learning for Optimal Trajectory Planning

Quadrotor motion control using deep reinforcement learning

A deep reinforcement learning-based approach to onboard trajectory generation for hypersonic vehicles

Trajectory Planning for Hypersonic Vehicles with Reinforcement Learning

Autonomous trajectory planning method for hypersonic vehicles in glide phase based on DDPG algorithm

Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments