Meta-game equilibrium for multi-agent reinforcement learning

Yang Gao,Joshua Zhexue Huang,Hongqiang Rong,Zhi-Hua Zhou
DOI: https://doi.org/10.1007/978-3-540-30549-1_81
2004-01-01
Abstract:This paper proposes a multi-agent Q-learning algorithm called meta-game-Q learning that is developed from the meta-game equilibrium concept Different from Nash equilibrium, meta-game equilibrium can achieve the optimal joint action game through deliberating its preference and predicting others' policies in the general-sum game A distributed negotiation algorithm is used to solve the meta-game equilibrium problem instead of using centralized linear programming algorithms We use the repeated prisoner's dilemma example to empirically demonstrate that the algorithm converges to meta-game equilibrium.
What problem does this paper attempt to address?