Control of Asynchronous Imitation Dynamics on Networks

James Riehl,Pouria Ramazi,Ming Cao
DOI: https://doi.org/10.48550/arXiv.1704.04416
2017-04-13
Abstract:Imitation is widely observed in populations of decision-making agents. Using our recent convergence results for asynchronous imitation dynamics on networks, we consider how such networks can be efficiently driven to a desired equilibrium state by offering payoff incentives for using a certain strategy, either uniformly or targeted to individuals. In particular, if for each available strategy, agents playing that strategy receive maximum payoff when their neighbors play that same strategy, we show that providing incentives to agents in a network that is at equilibrium will result in convergence to a unique new equilibrium. For the case when a uniform incentive can be offered to all agents, this result allows the computation of the optimal incentive using a binary search algorithm. When incentives can be targeted to individual agents, we propose an algorithm to select which agents should be chosen based on iteratively maximizing a ratio of the number of agents who adopt the desired strategy to the payoff incentive required to get those agents to do so. Simulations demonstrate that the proposed algorithm computes near-optimal targeted payoff incentives for a range of networks and payoff distributions in coordination games.
Computer Science and Game Theory,Adaptation and Self-Organizing Systems
What problem does this paper attempt to address?
This paper aims to solve the control problem of asynchronous imitation dynamics in networks. Specifically, it focuses on how to efficiently guide the network to reach a desired equilibrium state by providing payoff incentives (i.e., rewards). The paper is concerned with how to use payoff incentives to make agents in networks adopt a specific strategy, especially strategy A, in networks where decision - makers (or agents) make decisions by imitating their most successful neighbors. The paper mainly explores two incentive methods: one is to provide uniform incentives to all agents, and the other is to provide customized incentives for individual agents. ### Core Problems of the Paper 1. **Uniform Incentive Control**: When uniform incentives can be provided to all agents, the paper proposes a binary - search algorithm to calculate the optimal incentive value to ensure that all agents will eventually adopt strategy A. This algorithm is based on the following assumptions: if the network is in a certain equilibrium state, then providing incentives should not cause any agent to switch from strategy A to strategy B, and the network will converge to a unique new equilibrium state. 2. **Targeted Incentive Control**: When different incentives can be provided to individual agents, the paper proposes an Iterative Potential - to - Reward Optimization (IPRO) algorithm to select agents that can maximize the ratio of the number of agents adopting strategy A to the required incentives for incentives. This algorithm can calculate near - optimal targeted incentives under various network structures and payoff distributions. ### Theoretical Contributions - **A - Coordinated Network Games**: The paper defines the concept of A - coordinated network games, that is, if an agent updates to strategy A, even if more neighbors also adopt strategy A, this agent will still choose strategy A. This property ensures that after providing incentives, no agent will switch from strategy A to strategy B, and the network will converge to a unique equilibrium state. - **Opponent - Coordinated Agents**: The paper further proves that in networks of opponent - coordinated agents, that is, when the maximum payoff of each agent occurs when its neighbors also adopt the same strategy, the network has A - coordination. This provides a theoretical basis for designing effective control algorithms. ### Experimental Verification The paper shows the performance of the IPRO algorithm under different types of networks and payoff distributions through simulation. The results show that this algorithm can achieve near - optimal targeted incentive allocation, which is superior to incentive allocation methods based on degree, payoff, and other criteria. In conclusion, through theoretical analysis and experimental verification, this paper proposes a series of effective methods to control the imitation dynamics in networks to reach the desired equilibrium state, especially having important application values in social, economic, and engineering fields.