Agent Probing Interaction Policies

Siddharth Ghiya,Oluwafemi Azeez,Brendan Miller
DOI: https://doi.org/10.48550/arXiv.1911.09535
2019-12-14
Abstract:Reinforcement learning in a multi agent system is difficult because these systems are inherently non-stationary in nature. In such a case, identifying the type of the opposite agent is crucial and can help us address this non-stationary environment. We have investigated if we can employ some probing policies which help us better identify the type of the other agent in the environment. We've made a simplifying assumption that the other agent has a stationary policy that our probing policy is trying to approximate. Our work extends Environmental Probing Interaction Policy framework to handle multi agent environments.
Multiagent Systems,Artificial Intelligence,Machine Learning
What problem does this paper attempt to address?