Leader-Follower Optimal Consensus of Discrete-Time Linear Multi-agent Systems based on Q-Learning

Ye Li,Fuyong Wang,Zhongxin Liu,Zengqiang Chen
DOI: https://doi.org/10.1007/978-981-16-6328-4_52
2021-01-01
Abstract:This paper proposes a Q-learning solution to achieve optimal consensus of discrete-time linear leader-follower multi-agent systems. Model-free linear dynamic systems are considered herein and it is one of the topics that the control system community pays high attention to. A value function based on state error is designed for directed graph, and the weights of it can be estimated through a data-driven method. Then the consensus policies will be calculated iteratively by minimizing energy consumption. Besides, the proof of stability is given to ensure the theoretical convergence. Finally, a simple simulation is shown to verify the validity of the algorithm.
What problem does this paper attempt to address?