Abstract:Understanding the intentions and beliefs of others, a phenomenon known as "theory of mind", is a crucial element in social behavior. These beliefs and perceptions are inherently subjective and latent, making them often unobservable for investigation. Social interactions further complicate the matter, as multiple agents can engage in recursive reasoning about each other's strategies with increasing levels of cognitive hierarchy. While previous research has shown promise in understanding a single agent's belief of values through inverse reinforcement learning, extending this to model interactions among multiple agents remains an open challenge due to the computational complexity. In this work, we adopted a probabilistic recursive modeling of cognitive levels and joint value decomposition to achieve efficient multi-agent inverse reinforcement learning (MAIRL). We validated our method using simulations of a cooperative foraging task. Our algorithm revealed both the ground truth goal-directed value function and agents' beliefs about their counter-parts' strategies. When applied to human behavior in a cooperative hallway task, our method identified meaningful goal maps that evolved with task proficiency and an interaction map that is related to key states in the task without accessing to the task rules. Similarly, in a non-cooperative task performed by monkeys, we identified mutual predictions that correlated with the animals' social hierarchy, highlighting the behavioral relevance of the latent beliefs we uncovered. Together, our findings demonstrate that MAIRL offers a new framework for uncovering human or animal beliefs in social behavior, thereby illuminating previously opaque aspects of social cognition.

Probabilistic Recursive Reasoning for Multi-Agent Reinforcement Learning.

Modelling Bounded Rationality in Multi-Agent Interactions by Generalized Recursive Reasoning

Neural Recursive Belief States in Multi-Agent Reinforcement Learning

Multiagent Inverse Reinforcement Learning via Theory of Mind Reasoning

Unveiling the latent dynamics in social cognition with multi-agent inverse reinforcement learning

Abstract Spatial-Temporal Reasoning Via Probabilistic Abduction and Execution

Multi-Agent Cooperation Via Reasoning About The Behavior Of Others

Nested Reasoning About Autonomous Agents Using Probabilistic Programs

Learning to Reason in Round-based Games: Multi-task Sequence Generation for Purchasing Decision Making in First-person Shooters

Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning

Competitive Multi-agent Deep Reinforcement Learning with Counterfactual Thinking

A Game-Theoretic Approach to Multi-agent Trust Region Optimization.

Negotiated Reasoning: On Provably Addressing Relative Over-Generalization

Learning to Play General-Sum Games against Multiple Boundedly Rational Agents

Structural relational inference actor-critic for multi-agent reinforcement learning

A Deep Bayesian Policy Reuse Approach Against Non-Stationary Agents.

Resolving Implicit Coordination in Multi-Agent Deep Reinforcement Learning with Deep Q-Networks & Game Theory

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

K-Level Reasoning: Establishing Higher Order Beliefs in Large Language Models for Strategic Reasoning

Theory of Mind as Intrinsic Motivation for Multi-Agent Reinforcement Learning

Incorporating Pragmatic Reasoning Communication into Emergent Language