Abstract:Reinforcement learning is a compelling area of research within machine learning because it enables the improvement of control strategies for future manipulation of dynamic systems, leveraging previous data even without a precise model of the system. It usually makes complex, model-free predictions from data alone, which is actually consistent with the purpose of control in that they both aim to design systems using richly structured perceptions to execute planning and control strategies that adequately adapt to changing environments. The robust trajectory tracking control of intricate mechanical systems presents a challenging problem that necessitates effective control methods. In this paper, we propose a novel nonlinear control strategy based on deep reinforcement learning to solve the trajectory tracking problem of a 3-degree-of-freedom (3-DOF) control moment gyroscope (CMG). First, dynamic modeling of the 3-DOF CMG is used as a policy solver for the reinforcement learning training environment, and transfer learning is employed to bridge the reality gap. Then, the hyperparameters and reward functions of the neural network are optimized using the asynchronous successive halving algorithm. Ultimately, the twin delay depth determination policy gradient algorithm is trained in simulation to yield an agent capable of tracking user-defined trajectory routes as a nonlinear controller for the system. Both simulation and experimental results show that the proposed method works well for both high-frequency and low-frequency varying trajectory tracking control, and that the proposed method has better response speed and robustness than classic linear parameter-varying control methods and the state-of-the-art nonlinear parameter-varying method and the neural network-based feedback linearization adaptive control method.

A Novel Reinforcement Learning Control for a Class of Strict-feedback Discrete-time Systems Via Multi-Gradient Recursive.

Reinforcement Learning-Based Control for Nonlinear Discrete-Time Systems with Unknown Control Directions and Control Constraints

Robust Iterative Learning Control Design Based on Gradient Method

Reinforcement Learning-Based Control for a Class of Nonlinear Systems with Unknown Control Directions

Robust Adaptive Repetitive Learning Control for a Class of Time-Varying Nonlinear Systems with Unknown Control Direction

Discrete-Time Adaptive Iterative Learning Control for High-Order Nonlinear Systems with Unknown Control Directions

Online Reinforcement Learning-based Neural Network Controller Design for Affine Nonlinear Discrete-time Systems.

Control of Nonaffine Nonlinear Discrete-Time Systems Using Reinforcement-Learning-Based Linearly Parameterized Neural Networks

Discrete-time adaptive iterative learning control with unknown control directions

Generalized regression neural networks-based data-driven iterative learning control for nonlinear non-affine discrete-time systems

Output-feedback adaptive learning control with unknown control direction

Learning-Based Neural Dynamic Surface Predictive Control for MMC

Robust Adaptive Iterative Learning Control for Discrete-Time Nonlinear Systems With Time-Iteration-Varying Parameters.

Repetitive learning output-feedback control with unknown high-frequency gain sign

Predictive Control of Voltage Source Inverter: an Online Reinforcement Learning Solution

Algibacter agarivorans sp. nov. and Algibacter agarilyticus sp. nov., isolated from seawater, reclassification of Marinivirga aestuarii as Algibacter aestuarii comb. nov. and emended description of the genus Algibacter.

Adaptive output feedback reinforcement learning control for continuous time switched stochastic nonlinear systems with unknown control coefficients and full-state constraints

Nonlinear control strategies for 3-DOF control moment gyroscope using deep reinforcement learning

Gradient-variation Iterative Learning Control for Nonlinear Systems

Robust adaptive neural network control for strict-feedback nonlinear systems via small-gain approaches

Real-Time Progressive Learning: Mutually Reinforcing Learning and Control with Neural-Network-Based Selective Memory