Abstract:It is well-known that acting in an individually rational manner, according to the principles of classical game theory, may lead to sub-optimal solutions in a class of problems named social dilemmas. In contrast, humans generally do not have much difficulty with social dilemmas, as they are able to balance personal benefit and group benefit. As agents in multi-agent systems are regularly confronted with social dilemmas, for instance in tasks such as resource allocation, these agents may benefit from the inclusion of mechanisms thought to facilitate human fairness. Although many of such mechanisms have already been implemented in a multi-agent systems context, their application is usually limited to rather abstract social dilemmas with a discrete set of available strategies (usually two). Given that many real-world examples of social dilemmas are actually continuous in nature, we extend this previous work to more general dilemmas, in which agents operate in a continuous strategy space. The social dilemma under study here is the well-known Ultimatum Game, in which an optimal solution is achieved if agents agree on a common strategy. We investigate whether a scale-free interaction network facilitates agents to reach agreement, especially in the presence of fixed-strategy agents that represent a desired (e.g. human) outcome. Moreover, we study the influence of rewiring in the interaction network. The agents are equipped with continuous-action learning automata and play a large number of random pairwise games in order to establish a common strategy. From our experiments, we may conclude that results obtained in discrete-strategy games can be generalized to continuous-strategy games to a certain extent: a scale-free interaction network structure allows agents to achieve agreement on a common strategy, and rewiring in the interaction network greatly enhances the agents ability to reach agreement. However, it also becomes clear that some alternative mechanisms, such as reputation and volunteering, have many subtleties involved and do not have convincing beneficial effects in the continuous case.

Optimal Evolution Strategy for Continuous Strategy Games on Complex Networks via Reinforcement Learning

Maximum Entropy Reinforcement Learning with Evolution Strategies

Evolution of cooperation on reinforcement-learning driven-adaptive networks

An efficient model‐free adaptive optimal control of continuous‐time nonlinear non‐zero‐sum games based on integral reinforcement learning with exploration

Evolutionary Dynamics On Stochastic Evolving Networks For Multiple-Strategy Games

Adaptive Co-Evolution of Strategies and Network Leading to Optimal Cooperation Level in Spatial Prisoner’s Dilemma Game

Balance of exploration and exploitation: Non-cooperative game-driven evolutionary reinforcement learning

Exploring Dominant Strategies in Iterated and Evolutionary Games: a Multi-Agent Reinforcement Learning Approach

Continuous-time Distributed Nash Strategy over Switching Topologies with Gain Adaptation

Evolutionary Game on a Stochastic Growth Network

Time scales in evolutionary game on adaptive networks

Evolutionary hypergame dynamics

Finding Mixed Strategy Nash Equilibrium for Continuous Games through Deep Learning

The evolutionary prisoner's dilemma game in continuous signed networks

Learning to Reach Agreement in a Continuous Ultimatum Game

Optimal coordination in Minority Game: A solution from reinforcement learning

Evolutionary Multi-agent Reinforcement Learning in Group Social Dilemmas

Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks

Local Minority Game with Evolutionary Strategies

Modeling and optimization of networked evolutionary game based on incomplete information with switched topologies

Learning and innovative elements of strategy adoption rules expand cooperative network topologies