Abstract: This study proposes a reinforcement learning‐based finite‐time cross‐media tracking control approach for a slender‐body cross‐media vehicle subject to unknown hydrodynamics and to wind and wave disturbances. First, a reinforcement learning framework consisting of an actor neural network and a critic neural network is constructed. The critic neural network evaluates the actions of the actor neural network and approximates the cost function, while the actor neural network estimates the unknown hydrodynamics and disturbances and minimises the cost function to optimise performance. Next, a command filter with finite‐time convergence is formulated, and the resulting filter error is handled by a proposed error‐compensating signal. Integrating these techniques yields a reinforcement learning‐based finite‐time control strategy that avoids the singularity issue inherent in traditional finite‐time backstepping strategies. Compared with existing methods, the proposed scheme is robust against unknown hydrodynamics and disturbances, ensures finite‐time convergence of the system's states, and optimises controller performance. Finally, simulations confirm the effectiveness and superiority of the presented approach.

Reinforcement learning-based finite-time tracking control of an unknown unmanned surface vehicle with input constraints

Self‐learning‐based optimal tracking control of an unmanned surface vehicle with pose and velocity constraints

Robust Trajectory Tracking Control of Underactuated Unmanned Surface Vehicles with Exponential Stability: Theory and Experimental Validation

USV Trajectory Tracking Control Based on Receding Horizon Reinforcement Learning

Adaptive Security Tracking Control of Constrained Unmanned Surface Vehicles

Adaptive Robust Trajectory-Tracking Control for Underactuated USVs with Model Uncertainty and Environmental Disturbance

Adaptive Optimal Tracking Control of an Underactuated Surface Vessel Using Actor–Critic Reinforcement Learning

Data-Driven Performance-Prescribed Reinforcement Learning Control of an Unmanned Surface Vehicle

Adaptive Neural Network-Quantized Tracking Control of Uncertain Unmanned Surface Vehicles with Output Constraints

Event-triggered output-feedback adaptive tracking control of autonomous underwater vehicles using reinforcement learning

Adaptive Sliding Mode Control Design for Nonlinear Unmanned Surface Vessel Using RBFNN and Disturbance-Observer

Collision Avoidance and Path Point Tracking Control for Underactuated Unmanned Surface Vehicles with Unknown Model Nonlinearity

Fixed-Time Trajectory Tracking Control of Fully Actuated Unmanned Surface Vessels with Error Constraints

Reinforcement learning‐based finite‐time cross‐media tracking control for a cross‐media vehicle under unknown dynamics and disturbances

Active Vision-Based Finite-Time Trajectory-Tracking Control of an Unmanned Surface Vehicle Without Direct Position Measurements

Attention-Based Meta-Reinforcement Learning for Tracking Control of AUV with Time-Varying Dynamics

State constrained control strategy for unmanned surface vehicle trajectory tracking based on improved barrier Lyapunov function

Learning-based Robust Optimal Tracking Controller Design for Unmanned Underwater Vehicles with Full-State and Input Constraints

Realizing asynchronous finite-time robust tracking control of switched flight vehicles by using nonfragile deep reinforcement learning

Robust adaptive finite-time course tracking control of vessel under actuator attacks