Abstract: We study zero-sum differential games with state constraints and one-sided information, where the informed player (Player 1) has a categorical payoff type unknown to the uninformed player (Player 2). Player 1 aims to minimize his payoff without violating the constraints, while Player 2 aims to violate the state constraints if possible, or to maximize the payoff otherwise. One example of such a game is a man-to-man matchup in football. Without state constraints, Cardaliaguet (2007) showed that the value of such a game exists and is convex in the players' common belief. Our theoretical contribution is an extension of this result to games with state constraints and the derivation of the primal and dual subdynamic principles necessary for computing behavioral strategies. Unlike existing works that are concerned with the scalability of no-regret learning in games with discrete dynamics, our study reveals the underlying structure of strategies for belief manipulation resulting from information asymmetry and state constraints. This structure will be necessary for scalable learning on games with continuous actions and long time windows. We use a simplified football game to demonstrate the utility of this work, where we reveal player positions and belief states in which the attacker should (or should not) play specific random deceptive moves to take advantage of information asymmetry, and compute how the defender should respond.

Inverse Reinforcement Learning for Identification of Linear-Quadratic Zero-Sum Differential Games

Inverse linear-quadratic nonzero-sum differential games

Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games

Reinforcement Learning for Inverse Linear-quadratic Dynamic Non-cooperative Games

Inverse linear quadratic dynamic games using partial state observations

Inverse reinforcement learning methods for linear differential games

Network Learning from Best-Response Dynamics in LQ Games

A kind of linear quadratic non-zero sum differential game of backward stochastic differential equation with asymmetric information

Two person non-zero-sum linear-quadratic differential game with Markovian jumps in infinite horizon

Long-Time Behavior of Zero-Sum Linear-Quadratic Stochastic Differential Games

Nash Equilibria for Linear Quadratic Discrete-time Dynamic Games via Iterative and Data-driven Algorithms

Linear Quadratic Nonzero-Sum Mean-Field Stochastic Differential Games with Regime Switching

The Equivalence Conditions of Optimal Feedback Control-Strategy Operators for Zero-Sum Linear Quadratic Stochastic Differential Game with Random Coefficients

Linear-Quadratic Non-Zero Sum Backward Stochastic Differential Game With Overlapping Information

Primal-Dual Reinforcement Learning for Zero-Sum Games in the Optimal Tracking Control

Optimal Mixed Strategies to the Zero-sum Linear Differential Game

Discrete-Time LQ Stochastic Two-Person Nonzero-Sum Difference Games with Random Coefficients: Open-Loop Nash Equilibrium

State-Constrained Zero-Sum Differential Games with One-Sided Information

Nash Equilibrium Sequence in a Singular Two-Person Linear-Quadratic Differential Game

Multidimensional indefinite stochastic Riccati equations and zero-sum linear-quadratic stochastic differential games with non-markovian regime switching

Optimal Asymptotic Tracking Control for Nonzero-Sum Differential Game Systems with Unknown Drift Dynamics via Integral Reinforcement Learning