On Passivity and Reinforcement Learning in Finite Games.

Bolin Gao,Lacra Pavel
DOI: https://doi.org/10.1109/cdc.2018.8619157
2018-01-01
Abstract:We use a passivity-based methodology for the analysis and design of reinforcement learning in multi-agent games. We consider an exponentially-discounted reinforcement learning scheme, and show that convergence can be guaranteed for the class of games characterized by the monotonicity property of their (negative) payoff. We further exploit passivity properties to propose a class of higher-order schemes that preserve convergence properties.
What problem does this paper attempt to address?