A Data-Driven Model-Reference Adaptive Control Approach Based on Reinforcement Learning

Mohammed Abouheaf,Wail Gueaieb,Davide Spinello,Salah Al-Sharhan
DOI: https://doi.org/10.1109/ROSE52750.2021.9611772
2023-03-17
Abstract:Model-reference adaptive systems refer to a consortium of techniques that guide plants to track desired reference trajectories. Approaches based on theories like Lyapunov, sliding surfaces, and backstepping are typically employed to advise adaptive control strategies. The resulting solutions are often challenged by the complexity of the reference model and those of the derived control strategies. Additionally, the explicit dependence of the control strategies on the process dynamics and reference dynamical models may contribute in degrading their efficiency in the face of uncertain or unknown dynamics. A model-reference adaptive solution is developed here for autonomous systems where it solves the Hamilton-Jacobi-Bellman equation of an error-based structure. The proposed approach describes the process with an integral temporal difference equation and solves it using an integral reinforcement learning mechanism. This is done in real-time without knowing or employing the dynamics of either the process or reference model in the control strategies. A class of aircraft is adopted to validate the proposed technique.
Systems and Control,Machine Learning,Robotics
What problem does this paper attempt to address?
The paper attempts to address the problem of how to design a Model Reference Adaptive Control (MRAC) system under unknown or uncertain dynamic conditions to achieve precise tracking control of dynamic systems. Specifically, the paper proposes an integral model reference adaptive control method based on reinforcement learning, which can solve the integral Bellman equation in real-time without relying on the process dynamics model, thereby achieving efficient tracking of the target trajectory. ### Main Contributions of the Paper: 1. **Model Independence**: The proposed control strategy does not rely on the specific dynamics model of the controlled object, making it applicable to a wide range of systems. 2. **Real-time Adaptability**: Through the online integral reinforcement learning (IRL) mechanism, the control gains are adjusted in real-time, ensuring system stability and tracking performance. 3. **Flexibility**: The algorithm does not impose strict limitations on the order of the system, making it suitable for high-order systems. 4. **Stability Guarantee**: The asymptotic stability of the proposed control strategy is proven using the Lyapunov function. ### Application Cases: The paper validates the effectiveness of the proposed method through simulations, particularly in the application of controlling the longitudinal motion of an aircraft. The experimental results show that even in the presence of dynamic disturbances and unknown dynamics, the proposed method can effectively reduce tracking errors and enable rapid convergence of control gains. ### Summary: The paper proposes a novel reinforcement learning-based model reference adaptive control method, which performs excellently in handling unknown or uncertain dynamic systems, demonstrating high practical value and theoretical significance.