Hybrid Q-learning for Data-Based Optimal Control of Non-Linear Switching System

Li Xiaofeng, Dong Lu, Sun Changyin
DOI: https://doi.org/10.23919/jsee.2022.000114
2022-01-01
Abstract: In this paper, the optimal control of a non-linear switching system is investigated without knowledge of the system dynamics. First, the Hamilton-Jacobi-Bellman (HJB) equation is derived with consideration of the hybrid action space. Then, a novel data-based hybrid Q-learning (HQL) algorithm is proposed to find the optimal solution iteratively. In addition, a theoretical analysis is provided to show the convergence and optimality of the proposed algorithm. Finally, the algorithm is implemented with an actor-critic (AC) structure, and two linear-in-parameter neural networks are utilized to approximate the functions. Simulation results validate the effectiveness of the data-driven method.
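To make the idea of Q-iteration over a hybrid action space concrete, the following is a minimal sketch and not the authors' HQL algorithm. It assumes a hypothetical two-mode scalar plant, a quadratic stage cost, and a quadratic basis `[x^2, x*u, u^2]` as the linear-in-parameter approximator. All names and numeric values (`A_MODE`, `B_MODE`, `Q_COST`, `R_COST`, `hybrid_q_iteration`, `greedy_hybrid_action`) are illustrative assumptions. The paper uses an actor network for the control policy; here the minimization over the continuous input is done in closed form instead, which is only possible because the assumed Q-function is quadratic in `u`.

```python
import numpy as np

# Hypothetical two-mode scalar plant x' = a[v]*x + b[v]*u (an assumption).
# It is used ONLY to generate data; the learning loop never reads a or b.
A_MODE = np.array([1.2, 0.8])
B_MODE = np.array([1.0, 0.2])
Q_COST, R_COST = 1.0, 1.0  # assumed stage cost: q*x^2 + r*u^2


def features(x, u):
    """Linear-in-parameter basis: Q_v(x, u) = w_v . [x^2, x*u, u^2]."""
    return np.stack([x * x, x * u, u * u], axis=1)


def collect_data(n=400, seed=0):
    """Sample transitions (x, u, v, x') with random states, inputs, and modes."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(-1.0, 1.0, n)
    u = rng.uniform(-1.0, 1.0, n)
    v = rng.integers(0, 2, n)  # discrete part of the hybrid action
    x_next = A_MODE[v] * x + B_MODE[v] * u
    return x, u, v, x_next


def mode_values(weights):
    """Coefficient of x^2 after minimizing each mode's quadratic Q over u."""
    return weights[:, 0] - weights[:, 1] ** 2 / (4.0 * weights[:, 2])


def hybrid_q_iteration(x, u, v, x_next, n_modes=2, iters=60):
    """Value-iteration style Q-learning from data, starting at V_0 = 0."""
    p = 0.0  # current value function is V(x) = p * x^2
    weights = np.zeros((n_modes, 3))
    stage_cost = Q_COST * x**2 + R_COST * u**2
    for _ in range(iters):
        # Bellman target: stage cost plus min over (v', u') of Q at x'.
        target = stage_cost + p * x_next**2
        for m in range(n_modes):
            mask = v == m
            phi = features(x[mask], u[mask])
            weights[m], *_ = np.linalg.lstsq(phi, target[mask], rcond=None)
        per_mode = mode_values(weights)
        p = float(per_mode.min())  # min over the discrete mode
    return p, weights, int(per_mode.argmin())


def greedy_hybrid_action(x, weights):
    """Return the (mode, input) pair that minimizes the learned Q at state x."""
    mode = int(mode_values(weights).argmin())
    u = -weights[mode, 1] / (2.0 * weights[mode, 2]) * x
    return mode, u


data = collect_data()
p_star, w_star, best_mode = hybrid_q_iteration(*data)
mode, u = greedy_hybrid_action(0.5, w_star)
print(p_star, best_mode, mode, u)
```

In this scalar toy problem the best mode turns out to be the same for every state, because each mode's minimized Q-function is a multiple of `x^2`. In a genuinely non-linear or multi-dimensional setting the preferred mode generally depends on the state, which is where a learned actor becomes useful.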