Abstract:$ $This paper addresses the inverse problem for Linear-Quadratic (LQ) nonzero-sum $N$-player differential games, where the goal is to learn parameters of an unknown cost function for the game, called observed, given the demonstrated trajectories that are known to be generated by stationary linear feedback Nash equilibrium laws. Towards this end, using the demonstrated data, a synthesized game needs to be constructed, which is required to be equivalent to the observed game in the sense that the trajectories generated by the equilibrium feedback laws of the $N$ players in the synthesized game are the same as those demonstrated trajectories. We show a model-based algorithm that can accomplish this task using the given trajectories. We then extend this model-based algorithm to a model-free setting to solve the same problem in the case when the system's matrices are unknown. The algorithms combine both inverse optimal control and reinforcement learning methods making extensive use of gradient descent optimization for the latter. The analysis of the algorithm focuses on the proof of its convergence and stability. To further illustrate possible solution characterization, we show how to generate an infinite number of equivalent games, not requiring to run repeatedly the complete algorithm. Simulation results validate the effectiveness of the proposed algorithms.

Inverse linear-quadratic nonzero-sum differential games

Inverse Reinforcement Learning for Identification of Linear-Quadratic Zero-Sum Differential Games

Reinforcement Learning for Inverse Non-Cooperative Linear-Quadratic Output-feedback Differential Games

Reinforcement Learning for Inverse Linear-quadratic Dynamic Non-cooperative Games

Two person non-zero-sum linear-quadratic differential game with Markovian jumps in infinite horizon

A kind of linear quadratic non-zero sum differential game of backward stochastic differential equation with asymmetric information

Inverse reinforcement learning methods for linear differential games

Discrete-Time LQ Stochastic Two-Person Nonzero-Sum Difference Games with Random Coefficients:~Open-Loop Nash Equilibrium

Inverse linear quadratic dynamic games using partial state observations

Linear-Quadratic Non-Zero Sum Backward Stochastic Differential Game With Overlapping Information

Linear Quadratic Nonzero-Sum Mean-Field Stochastic Differential Games with Regime Switching

Stochastic linear-quadratic differential game with Markovian jumps in an infinite horizon

Long-Time Behavior of Zero-Sum Linear-Quadratic Stochastic Differential Games

Distributed-Observer-Based Nash Equilibrium Seeking Algorithm for Quadratic Games With Nonlinear Dynamics

The Equivalence Conditions of Optimal Feedback Control-Strategy Operators for Zero-Sum Linear Quadratic Stochastic Differential Game with Random Coefficients

Nash Equilibria for Linear Quadratic Discrete-time Dynamic Games via Iterative and Data-driven Algorithms

Zero-sum and nonzero-sum differential games without Isaacs condition

Min–max adaptive dynamic programming for zero-sum differential games

Multidimensional indefinite stochastic Riccati equations and zero-sum linear-quadratic stochastic differential games with non-markovian regime switching

Non‐zero‐sum games of discrete‐time Markov jump systems with unknown dynamics: An off‐policy reinforcement learning method