Abstract: Researchers have explored the performance of Iterated Prisoner's Dilemma strategies for decades, from the celebrated performance of Tit For Tat to the introduction of zero-determinant strategies and the use of sophisticated learning structures such as neural networks. Many new strategies have been introduced and tested in a variety of tournaments and population dynamics. Typical results in the literature, however, rely on performance against a small number of somewhat arbitrarily selected strategies in a small number of tournaments, casting doubt on the generalizability of their conclusions. In this work, we analyze a large collection of 195 strategies in thousands of computer tournaments, present the top-performing strategies across multiple tournament types, and distill their salient features. The results show that there is not yet a single strategy that performs well across diverse Iterated Prisoner's Dilemma scenarios. Nevertheless, several properties heavily influence the best-performing strategies. This refines the properties described by Axelrod, in light of recent and more diverse opponent populations, to: be nice, be provocable and generous, be a little envious, be clever, and adapt to the environment. More precisely, we find that strategies perform best when their probability of cooperation matches the aggregate cooperation probability of the total tournament population. The features of high-performing strategies help explain why strategies such as Tit For Tat historically performed well in tournaments and why zero-determinant strategies typically do not fare well in tournament settings. Furthermore, our findings have implications for the future training of autonomous agents, since it becomes essential to understand which features should be incorporated into these agents.
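To make the tournament setting concrete, the following is a minimal sketch of a round-robin Iterated Prisoner's Dilemma tournament that also reports each strategy's cooperation rate. It is an illustration only, not the procedure used in this work: the payoff values (R=3, S=0, T=5, P=1), the 10-turn match length, the three-strategy population, and all function names are assumptions chosen for brevity.

```python
from itertools import combinations

# Assumed payoff values (R=3, S=0, T=5, P=1); the abstract does not state them.
PAYOFFS = {
    ("C", "C"): (3, 3),
    ("C", "D"): (0, 5),
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),
}


def tit_for_tat(own_history, opp_history):
    # Cooperate first, then copy the opponent's previous move.
    return "C" if not opp_history else opp_history[-1]


def always_cooperate(own_history, opp_history):
    return "C"


def always_defect(own_history, opp_history):
    return "D"


def play_match(strategy_1, strategy_2, turns):
    # Play one repeated game and return both scores and both move histories.
    history_1, history_2 = [], []
    score_1 = score_2 = 0
    for _ in range(turns):
        move_1 = strategy_1(history_1, history_2)
        move_2 = strategy_2(history_2, history_1)
        payoff_1, payoff_2 = PAYOFFS[(move_1, move_2)]
        score_1 += payoff_1
        score_2 += payoff_2
        history_1.append(move_1)
        history_2.append(move_2)
    return score_1, score_2, history_1, history_2


def round_robin(strategies, turns=10):
    # Every strategy meets every other strategy once (no self-play).
    totals = {name: 0 for name in strategies}
    cooperations = {name: 0 for name in strategies}
    for name_a, name_b in combinations(strategies, 2):
        score_a, score_b, hist_a, hist_b = play_match(
            strategies[name_a], strategies[name_b], turns
        )
        totals[name_a] += score_a
        totals[name_b] += score_b
        cooperations[name_a] += hist_a.count("C")
        cooperations[name_b] += hist_b.count("C")
    moves_per_strategy = turns * (len(strategies) - 1)
    rates = {name: cooperations[name] / moves_per_strategy for name in strategies}
    return totals, rates


# Hypothetical three-strategy population, far smaller than the 195 analyzed.
STRATEGIES = {
    "Tit For Tat": tit_for_tat,
    "Cooperator": always_cooperate,
    "Defector": always_defect,
}

totals, rates = round_robin(STRATEGIES, turns=10)
for name in STRATEGIES:
    print(f"{name}: score={totals[name]}, cooperation rate={rates[name]:.2f}")
```

In a population this small the unconditional defector scores highest, which illustrates the concern raised above: rankings obtained from a handful of arbitrarily chosen opponents may not generalize to larger and more diverse populations.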

Reinforcement Learning Produces Dominant Strategies for the Iterated Prisoner's Dilemma

Properties of Winning Iterated Prisoner's Dilemma Strategies

Exploring Dominant Strategies in Iterated and Evolutionary Games: a Multi-Agent Reinforcement Learning Approach

Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent

More Effective Choice in the Prisoner's Dilemma

A Reinforcement Learning Based Strategy for the Double-Game Prisoner's Dilemma

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

No Strategy Can Win in the Repeated Prisoner's Dilemma: Linking Game Theory and Computer Simulations

Recognising and evaluating the effectiveness of extortion in the Iterated Prisoner's Dilemma

Win-Stay-Lose-Shift as a self-confirming equilibrium in the iterated Prisoner’s Dilemma

The Iterated Prisoner's Dilemma: Good Strategies and Their Dynamics

Learning to Play No-Press Diplomacy with Best Response Policy Iteration

Symmetric equilibrium of multi-agent reinforcement learning in repeated prisoner's dilemma

Inverse Reinforcement Learning for Strategy Identification

Reinforcement Learning: Playing Tic-Tac-Toe

Multiple strategy competition among structured populations

Invincible Strategies of Iterated Prisoner's Dilemma

Learning multiagent coordination in the absence of communication channels

Win-stay-lose-learn Promotes Cooperation in the Spatial Prisoner's Dilemma Game

Resolution of the stochastic strategy spatial prisoner's dilemma by means of particle swarm optimization