Output-feedback Optimized Consensus for Directed Graph Multi-Agent Systems Based on Reinforcement Learning and Subsystem Error Derivatives.

Dongdong Li,Jiuxiang Dong
DOI: https://doi.org/10.1016/j.ins.2023.119577
IF: 8.1
2023-01-01
Information Sciences
Abstract:A distributed output-feedback optimal tracking control (OTC) method based on reinforcement learning (RL) is proposed for state-unmeasured multi-agent systems (MASs) under a directed graph. Firstly, the state observers are designed to estimate the states of MASs using the output signals, and the gains of the observers are not required to satisfy the Hurwitz matrix inequality. Then, a class of value functions based on error derivatives is proposed to solve the unbounded problem of traditional value functions for strict-feedback MASs. By using the value functions, the Hamilton-Jacobi Bellman (HJB) equations and the optimal control inputs are derived. The traditional RL-based backstepping control (OBC) methods are difficult to deal with the optimized consensus problem under a directed digraph, it is solved by using the observer-actor-critic structures and the new value functions in this paper, and online learning is implemented. Furthermore, there is no "dimensional pressure" to approximate the optimal control inputs and value functions using actor-critic neural networks (NNs), and the observed error and the consensus error are shown to be bounded. Finally, the effectiveness and advantages of the algorithm are verified by simulation.
What problem does this paper attempt to address?