Adaptive Incentive Design with Learning Agents

Chinmay Maheshwari,Kshitij Kulkarni,Manxi Wu,Shankar Sastry

2024-09-03

Abstract:How can the system operator learn an incentive mechanism that achieves social optimality based on limited information about the agents' behavior, who are dynamically updating their strategies? To answer this question, we propose an \emph{adaptive} incentive mechanism. This mechanism updates the incentives of agents based on the feedback of each agent's externality, evaluated as the difference between the player's marginal cost and society's marginal cost at each time step. The proposed mechanism updates the incentives on a slower timescale compared to the agents' learning dynamics, resulting in a two-timescale coupled dynamical system. Notably, this mechanism is agnostic to the specific learning dynamics used by agents to update their strategies. We show that any fixed point of this adaptive incentive mechanism corresponds to the optimal incentive mechanism, ensuring that the Nash equilibrium coincides with the socially optimal strategy. Additionally, we provide sufficient conditions that guarantee the convergence of the adaptive incentive mechanism to a fixed point. Our results apply to both atomic and non-atomic games. To demonstrate the effectiveness of our proposed mechanism, we verify the convergence conditions in two practically relevant games: atomic networked quadratic aggregative games and non-atomic network routing games.

Computer Science and Game Theory,Multiagent Systems,Systems and Control,Dynamical Systems

What problem does this paper attempt to address?

The problem that this paper attempts to solve is how system operators can design an incentive mechanism to achieve the socially optimal solution while basing the design on limited information about agent behavior. Specifically, the paper focuses on how, in the case where agents dynamically update their strategies, system operators can adjust the incentive mechanism through learning to ensure that the ultimately reached state is socially optimal. The paper proposes an adaptive incentive mechanism that updates agents' incentives according to the externality of each agent (i.e., the difference between the individual marginal cost and the social marginal cost). This mechanism operates on two time scales: a slower time scale for updating incentives and a faster time scale for the learning dynamics of agents. In this way, even when agents continuously update their strategies, system operators can gradually adjust the incentive mechanism and ultimately achieve the socially optimal solution. The key contribution of the paper lies in proposing an incentive mechanism that can adapt to changes in agent behavior and proving that under certain conditions, this mechanism can converge to a fixed point, which corresponds to the socially optimal incentive mechanism. In addition, the paper also provides sufficient conditions to ensure the convergence of the adaptive incentive mechanism to a fixed point and applies these results to two practically relevant games: atomic network quadratic aggregation games and non - atomic network routing games, verifying the effectiveness of the proposed method.

Adaptive Incentive Design with Learning Agents

Inducing Social Optimality in Games via Adaptive Incentive Design

Distributed Mechanism Design with Learning Guarantees

Incentive Designs for Learning Agents to Stabilize Coupled Exogenous Systems

Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning

Coordinating the Crowd: Inducing Desirable Equilibria in Non-Cooperative Systems

Socially-Optimal Mechanism Design for Incentivized Online Learning

Social Optimum Equilibrium Selection for Distributed Multi-Agent Optimization

Incentive Engineering for Concurrent Games

Personalized incentives as feedback design in generalized Nash equilibrium problems

Control of Asynchronous Imitation Dynamics on Networks

Learning to Control Unknown Strongly Monotone Games

Nash Incentive-compatible Online Mechanism Learning via Weakly Differentially Private Online Learning

Adaptive Mechanism Design using Multi-Agent Revealed Preferences

Paying to Do Better: Games with Payments between Learning Agents

Penalty-Regulated Dynamics and Robust Learning Procedures in Games

Learning to Incentivize Other Learning Agents

State-Dependent Optimal Incentive Allocation Protocols for Cooperation in Public Goods Games on Regular Networks

Incentive Compatible Active Learning

Relationship Design for Socially-Aware Behavior in Static Games

Estimating and Incentivizing Imperfect-Knowledge Agents with Hidden Rewards