Inverse reinforcement learning methods for linear differential games

Hamed Jabbari Asl,Eiji Uchibe
DOI: https://doi.org/10.1016/j.sysconle.2024.105936
IF: 2.742
2024-10-06
Systems & Control Letters
Abstract:In this study, we considered the problem of inverse reinforcement learning or estimating the cost function of expert players in multi-player differential games. We proposed two online data-driven solutions for linear–quadratic games that are applicable to systems that fulfill a specific dimension criterion or whose unknown matrices in the cost function conform to a diagonal condition. The first method, which is partially model-free, utilizes the trajectories of expert agents to solve the problem. The second method is entirely model-free and employs the trajectories of both expert and learner agents. We determined the conditions under which the solutions are applicable and identified the necessary requirements for the collected data. We conducted numerical simulations to establish the effectiveness of the proposed methods.
automation & control systems,operations research & management science
What problem does this paper attempt to address?