Reinforcement Learning for Input Constrained Sub-optimal Tracking Control in Discrete-time Two-time-scale Systems

Xuejie Que,Zhenlei Wang,Xin Wang
DOI: https://doi.org/10.1007/s12555-022-0355-6
2023-07-30
Abstract:Two-time-scale (TTS) systems were proposed to describe accurately complex systems that include multiple variables running on two-time scales. Different response speeds of variables and incomplete model information affect tracking performance of TTS systems. For tracking control of unknown model, practicability of reinforcement learning (RL) has been subject to criticism, as the method requires stable initial policy. Based on singular perturbation theory (SPT), a composite sub-optimal tracking policy is investigated combining model information with measured data. Besides, a selection criterion of initial stabilizing policy is presented by considering the policy as an input constraint. The proposed method integrating RL technique with convex optimization improves the tracking performance and practicability effectively. Finally, an emulation experiment in F-8 aircraft is given to demonstrate the validity of the developed method.
automation & control systems
What problem does this paper attempt to address?