Abstract:Coevolutionary learning provides a framework for modeling more realistic iterated prisoner's dilemma (IPD) interactions and to study conditions of how and why certain behaviors (e.g., cooperation) in a complex environment can be learned through an adaptation process guided by strategic interactions. The coevolutionary learning of cooperative behaviors can be attributed to the mechanism of direct reciprocity (e.g., repeated encounters). However, for the more complex IPD game with more choices, it is unknown precisely why the mechanism of direct reciprocity is less effective in promoting the learning of cooperative behaviors. Here, our study suggests that the evolution of defection may be a result of strategies effectively having more opportunities to exploit others when there are more choices. We note that strategies are less able to resolve the intention of an intermediate choice, e.g., whether it is a signal to engender further cooperation or a subtle exploitation. A likely consequence is strategies adapting to lower cooperation plays that offer higher payoffs in the short-term view when they cannot resolve the intention of opponents. However, cooperation in complex human interactions may also involve indirect interactions rather than direct interactions only. Following this, we study the coevolutionary learning of IPD with more choices and reputation. Here, current behavioral interactions depend not only on choices made in previous moves (direct interactions), but also choices made in past interactions that are reflected by their reputation scores (indirect interactions). The coevolutionary learning of cooperative behaviors is possible in the IPD with more choices when strategies use reputation as a mechanism to estimate behaviors of future partners and to elicit mutual cooperation play right from the start of interactions. In addition, we study the impact of the accuracy of reputation estimation in reflecting strategy behaviors of different implementations and why it is important for the evolution of cooperation. We show that the accuracy is related to how memory of games from previous generations is incorporated to calculate reputation scores and how frequently reputation scores are updated.

The Impact of Noise on Iterated Prisoner's Dilemma with Multiple Levels of Cooperation

Behavioral Diversity, Choices and Noise in the Iterated Prisoner's Dilemma

Effects of Expectation and Noise on Evolutionary Games

Risk Consideration and Cooperation in the Iterated Prisoner’s Dilemma

Lévy noise promotes cooperation in the prisoner’s dilemma game with reinforcement learning

USING PARTICLE SWARM OPTIMIZATION TO EVOLVE COOPERATION IN MULTIPLE CHOICES ITERATED PRISONER'S DILEMMA GAME

Effect of Spatial Structure on the Evolution of Cooperation in the N-Choice Iterated Prisoner's Dilemma.

The influence of experienced guider on cooperative behavior in the Prisoner's dilemma game

Effects of Strategy-Migration Direction and Noise in the Evolutionary Spatial Prisoner's Dilemma.

Using Social Network to Evolve Cooperation in Multiple Choices Iterated Prisoner’s Dilemma Game

Noise-induced sustainability of cooperation in Prisoner's Dilemma game

An Experimental Study of N-Person Iterated Prisoner's Dilemma Games

Selection of noise level in strategy adoption for spatial social dilemmas

How Important is Your Reputation in a Multi-Agent Environment

Multiple Choices and Reputation in Multiagent Interactions

Evolutionary Game Dynamics of Combining the Payoff-Driven and Conformity-Driven Update Rules

Evolution of Cooperation in Prisoner's Dilemma within Changeable External Environments

Noise-induced Enhancement of Network Reciprocity in Social Dilemmas

Repeated Thinking Promotes Cooperation in Spatial Prisoner's Dilemma Game

Adaptive Co-Evolution of Strategies and Network Leading to Optimal Cooperation Level in Spatial Prisoner’s Dilemma Game

Exploring Social Influence on Evolutionary Prisoner'S Dilemma Games in Networks