Abstract:A multi-agent system (MAS) that composed of multiple interacting intelligent agents can be used to solve problems which are difficult or impossible for an individual agent or monolithic system to solve. Since the agent is autonomous and intelligent, it is reasonable to assume that it choice the behavior to bring itself the maximal benefit. Thus, the cooperation and coordination can be achieved successfully if we can wisely design the utility function for every agent so that every agent can get the maximal reward from the cooperation to accomplish a given task. However, the utility function of one agent usually involves those of others for most “real-world” cooperation needed tasks. Moreover, it is not uncommon that the conflicts between the gains of these agents arise. In other words, the individual optimality is not always consistent with collective optimality in MAS. These conflicts will reduce the collective utility if there is no coordination among these decentralized, autonomous agents. This paper addresses the essential that in MAS the action of one agent may influence the action of others and there usually be conflicts among the payoff of one another. We investigated the optimal coordination approach for multi-agent foraging, a typical MAS task, from the point view of game theory. After introduced several concepts, we built the equivalence between the optimal solution of MAS and the equilibrium of the game corresponding to that situation, and then we introduced evolutionarily stable strategy into the approach hope that it maybe be of service in addressing the equilibrium selection problem of traditional game theory. Finally, based on the hawk-dove game model, an evolutionarily cooperation foraging algorithm (ECFA) is proposed to evolve a stable evolutionarily stable strategy (ESS) and bring the maximal reward for the group. If there be some change in the configuration of the environment, ECFA can, then, evolve to the new ESS automatically. And we also proposed a reinforcement factor to accelerate the convergence process of ECFA and thus make a new algorithm Accelerated ECFA (AECFA). These techniques were shown to be successful by the multi-agent foraging simulations.

Balance of exploration and exploitation: Non-cooperative game-driven evolutionary reinforcement learning

Ponder-reinforcement Cooperative Algorithm for Multi-Agent Foraging Task Based on Evolutionary Stable Equilibrium

Evolutionary Game Theory Based Cooperation Algorithm in Multi-Agent System

Adaptive algorithm for multi-agent learning optimal cooperative pursuit strategy based on Markov game

Evolutionary Reinforcement Learning via Cooperative Coevolution

Non-local Policy Optimization via Diversity-regularized Collaborative Exploration

Evolutionary reinforcement learning via cooperative coevolutionary negatively correlated search

Dynamics of heuristics selection for cooperative behaviour

Improved cooperation by balancing exploration and exploitation in intertemporal social dilemma tasks

Effects of Different Optimization Formulations in Evolutionary Reinforcement Learning on Diverse Behavior Generation

Evolutionary Game Dynamics of Multi-Agent Cooperation Driven by Self-Learning

Exploring Dominant Strategies in Iterated and Evolutionary Games: a Multi-Agent Reinforcement Learning Approach

Optimal Evolution Strategy for Continuous Strategy Games on Complex Networks via Reinforcement Learning

Enhancing cooperative evolution in spatial public goods game by particle swarm optimization based on exploration and q-learning

Advances in co-evolutionary algorithms

Long-Term Progress and Behavior Complexification in Competitive Co-Evolution

A novelty-search-based evolutionary reinforcement learning algorithm for continuous optimization problems

A Single-Task and Multi-Decision Evolutionary Game Model Based on Multi-Agent Reinforcement Learning

Competitive Coevolutionary Multi-Agent Systems: The Application to Mapping and Scheduling Problems

Evolving Constrained Reinforcement Learning Policy

Aspiration-driven co-evolution of cooperation with individual behavioral diversity