Reinforcement Learning Consensus Control for Discrete-Time Multi-Agent Systems

Xiaoxia Zhu, Xin Yuan, Yuanda Wang, Changyin Sun
DOI: https://doi.org/10.23919/chicc.2019.8865975
2019-01-01
Abstract: This paper investigates the consensus control of leader-follower multi-agent systems. To achieve consensus in discrete-time multi-agent systems, a data-driven iterative neighbor and target Q-learning algorithm is proposed. To implement the proposed method, an actor-critic architecture with neighbor and target networks is employed to approximate the Q-function and the control signal. A suitable reinforcement signal and cost function are chosen from the environment. The method does not depend on an accurate system model, which matters because most practical systems are too complicated to model accurately. Finally, a simulation example is given to demonstrate the effectiveness of the proposed approach.
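As a rough illustration of the quantities such a method works with, the sketch below computes a local neighborhood consensus error for each follower and a quadratic one-step cost that could serve as a reinforcement signal. This is not the paper's algorithm: the abstract does not give the error definition, the cost function, or the network structure, so the error form e_i = sum_j a_ij (x_i - x_j) + b_i (x_i - x_leader), the quadratic cost, and all names (`local_consensus_errors`, `quadratic_cost`, `adjacency`, `pinning`, `q`, `r`) are assumptions taken from common practice in leader-follower consensus.

```python
import numpy as np


def local_consensus_errors(x, x_leader, adjacency, pinning):
    """Assumed error form: e_i = sum_j a_ij (x_i - x_j) + b_i (x_i - x_leader)."""
    x = np.asarray(x, dtype=float)            # follower states, shape (N, n)
    a = np.asarray(adjacency, dtype=float)    # follower graph weights, shape (N, N)
    b = np.asarray(pinning, dtype=float)      # leader-to-follower gains, shape (N,)
    leader = np.asarray(x_leader, dtype=float)  # leader state, shape (n,)

    # sum_j a_ij (x_i - x_j) = d_i * x_i - sum_j a_ij x_j, with d_i = sum_j a_ij
    degree = a.sum(axis=1)
    neighbor_term = degree[:, None] * x - a @ x
    leader_term = b[:, None] * (x - leader)
    return neighbor_term + leader_term


def quadratic_cost(e, u, q=1.0, r=1.0):
    """Assumed one-step cost per agent: q * ||e_i||^2 + r * ||u_i||^2."""
    e = np.asarray(e, dtype=float)
    u = np.asarray(u, dtype=float)
    return q * np.sum(e**2, axis=1) + r * np.sum(u**2, axis=1)


# Hypothetical two-follower example with scalar states.
states = [[1.0], [3.0]]
leader_state = [2.0]
adjacency = [[0.0, 1.0], [1.0, 0.0]]  # followers exchange information
pinning = [1.0, 0.0]                  # only follower 0 observes the leader

errors = local_consensus_errors(states, leader_state, adjacency, pinning)
costs = quadratic_cost(errors, np.zeros((2, 1)))
```

Under these assumptions the errors vanish exactly when every follower matches the leader, so a Q-function built on this cost penalizes disagreement without requiring a model of the agent dynamics.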