Abstract:The paper introduces an interactive machine learning mechanism to process the measurements of an uncertain, nonlinear dynamic process and hence advise an actuation strategy in real-time. For concept demonstration, a trajectory-following optimization problem of a Kinova robotic arm is solved using an integral reinforcement learning approach with guaranteed stability for slowly varying dynamics. The solution is implemented using a model-free value iteration process to solve the integral temporal difference equations of the problem. The performance of the proposed technique is benchmarked against that of another model-free high-order approach and is validated for dynamic payload and disturbances. Unlike its benchmark, the proposed adaptive strategy is capable of handling extreme process variations. This is experimentally demonstrated by introducing static and time-varying payloads close to the rated maximum payload capacity of the manipulator arm. The comparison algorithm exhibited up to a seven-fold percent overshoot compared to the proposed integral reinforcement learning solution. The robustness of the algorithm is further validated by disturbing the real-time adapted strategy gains with a white noise of a standard deviation as high as 5%.

What problem does this paper attempt to address?

The problem that this paper attempts to solve is to implement a real - time measurement - driven reinforcement learning control method in uncertain nonlinear systems. Specifically, the paper proposes an adaptive learning mechanism based on Integral Reinforcement Learning (IRL) to process the measurement data of uncertain nonlinear dynamic processes and provide action policy suggestions in real - time. To demonstrate the concept, the paper uses the trajectory tracking optimization problem of a Kinova robotic arm as a case study, and solves the integral time - difference equation through a model - free value iteration process to achieve a stable control strategy. ### Main Problems and Goals 1. **Real - time measurement - driven control**: - The paper aims to develop a method that can process measurement data in real - time and adjust the control strategy accordingly, which is applicable to uncertain nonlinear systems. 2. **Model - free control**: - The method does not rely on the precise dynamic model of the system, but adapts to the dynamic changes of the system through online learning, thus avoiding the dependence on complex models. 3. **Robustness**: - The proposed method needs to be able to maintain good performance in the presence of dynamic loads and disturbances, and verify its robustness under extreme process changes. 4. **Optimized trajectory tracking**: - Specifically for the trajectory tracking problem of the Kinova robotic arm, an optimized control scheme is proposed. Through the adaptive learning mechanism, the joint action signals are adjusted in real - time, enabling the robotic arm to accurately follow the predetermined trajectory. ### Key Points of the Solution - **Integral Reinforcement Learning (IRL)**: - Use the IRL method to solve the optimal control problem, and update the control strategy through the value iteration process to ensure the stability of the system. - **Adaptive critic structure**: - Adopt two neural network structures, namely the Actor Network and the Critic Network, to approximate the optimal policy and the value function respectively, and adjust the network weights in real - time through the gradient descent method. - **High - order error dynamics**: - Consider high - order error dynamics, and improve the control accuracy by increasing the number of error samples, especially in complex high - order systems. - **Experimental verification**: - Verify the effectiveness and robustness of the proposed method through experiments. Especially under dynamic load and disturbance conditions, it is compared with another model - free high - order method, showing better performance. ### Summary The main contribution of the paper is to propose a real - time measurement - driven adaptive learning control method, which is especially suitable for uncertain nonlinear systems. Through Integral Reinforcement Learning and the adaptive critic structure, this method can achieve robust trajectory tracking control without relying on the precise model of the system. The experimental results show that this method performs well in dealing with dynamic loads and disturbances and has high practical value.

Real-Time Measurement-Driven Reinforcement Learning Control Approach for Uncertain Nonlinear Systems

Realtime Brain-Inspired Adaptive Learning Control for Nonlinear Systems with Configuration Uncertainties (I)

Realtime Brain-Inspired Adaptive Learning Control for Nonlinear Systems with Configuration Uncertainties

Robust Adaptive Iterative Learning Control for Discrete‐time Nonlinear Systems with Both Parametric and Nonparametric Uncertainties

Robust Tracking Control of Robot Manipulators with Actuator Faults and Joint Velocity Measurement Uncertainty

Direct Optimization Based Compensation Adaptive Robust Control of Nonlinear Systems with State and Input Constraints

Adaptive control of a mechatronic system using constrained residual reinforcement learning

Active Learning of Discrete-Time Dynamics for Uncertainty-Aware Model Predictive Control

Robust Learning-Based Control for Uncertain Nonlinear Systems With Validation on a Soft Robot

Online Multi-Objective Model-Independent Adaptive Tracking Mechanism for Dynamical Systems

A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning

Output Feedback Adaptive Optimal Control of Affine Nonlinear systems with a Linear Measurement Model

Model-free tracking control of complex dynamical trajectories with machine learning

Trajectory Tracking of Underactuated Sea Vessels With Uncertain Dynamics: An Integral Reinforcement Learning Approach

Adaptive neural network control of robotic manipulators with input constraints and without velocity measurements

Reinforcement Learning Based Adaptive Control for Uncertain Mechanical Systems with Asymptotic Tracking

Experience Recommendation for Long Term Safe Learning-based Model Predictive Control in Changing Operating Conditions

Adaptive Robust Controller for handling Unknown Uncertainty of Robotic Manipulators

Bridging Reinforcement Learning and Iterative Learning Control: Autonomous Motion Learning for Unknown, Nonlinear Dynamics

Robust Robot Arm Control with Nonlinear Feedback