Multi-agent Learning in Cooperative General-Sum Games

Hai-Bo Liu
2007-01-01
Abstract: Rationality and convergence are two key topics in research on multi-agent learning. A new method called Pareto-Q is proposed, based on the concept of the Pareto optimum, which is more rational than the Nash equilibrium for cooperative systems. Social conventions are also introduced to guarantee that learning converges. When tested on a two-person grid game, the algorithm performs better than single-agent Q-learning and Nash-Q learning.
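The abstract does not give the Pareto-Q update rule, so the following is only a minimal sketch of the two ideas it names: picking Pareto-optimal joint actions from two agents' Q-values in one state, then breaking ties with a shared social convention. The function names, the Q-value tables, and the lexicographic tie-break are all illustrative assumptions, not the paper's actual algorithm.

```python
# Illustrative sketch only: names, values, and the convention are assumptions.

def pareto_optimal(q1, q2):
    """Return joint actions whose (q1, q2) value pair is not Pareto-dominated.

    q1 and q2 map each joint action (a1, a2) to the Q-value of agent 1 and
    agent 2 respectively, for a single fixed state.
    """
    joint_actions = list(q1.keys())

    def dominated(a):
        # a is dominated if some b is at least as good for both agents
        # and strictly better for at least one of them.
        return any(
            q1[b] >= q1[a] and q2[b] >= q2[a] and (q1[b] > q1[a] or q2[b] > q2[a])
            for b in joint_actions
        )

    return [a for a in joint_actions if not dominated(a)]


def select_by_convention(candidates):
    """Hypothetical social convention: every agent picks the smallest joint
    action in lexicographic order, so all agents agree without communicating."""
    return min(candidates)


# Hypothetical Q-values for one state of a two-agent game (actions 0 and 1).
q1 = {(0, 0): 3, (0, 1): 0, (1, 0): 1, (1, 1): 3}
q2 = {(0, 0): 3, (0, 1): 1, (1, 0): 0, (1, 1): 3}

candidates = pareto_optimal(q1, q2)        # [(0, 0), (1, 1)]
chosen = select_by_convention(candidates)  # (0, 0)
print(candidates, chosen)
```

In this example two joint actions are equally good for both agents, which is exactly the situation where independent learners could miscoordinate; a fixed ordering shared by all agents is one simple way to make them settle on the same choice.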