Abstract:Existing value-factorized based Multi-Agent deep Reinforce-ment Learning (MARL) approaches are well-performing invarious multi-agent cooperative environment under thecen-tralized training and decentralized execution(CTDE) scheme,where all agents are trained together by the centralized valuenetwork and each agent execute its policy independently. How-ever, an issue remains open: in the centralized training process,when the environment for the team is partially observable ornon-stationary, i.e., the observation and action informationof all the agents cannot represent the global states, existingmethods perform poorly and sample inefficiently. Regret Min-imization (RM) can be a promising approach as it performswell in partially observable and fully competitive <a class="link-external link-http" href="http://settings.However" rel="external noopener nofollow">this http URL</a>, it tends to model others as opponents and thus can-not work well under the CTDE scheme. In this work, wepropose a novel team RM based Bayesian MARL with threekey contributions: (a) we design a novel RM method to traincooperative agents as a team and obtain a team regret-basedpolicy for that team; (b) we introduce a novel method to de-compose the team regret to generate the policy for each agentfor decentralized execution; (c) to further improve the perfor-mance, we leverage a differential particle filter (a SequentialMonte Carlo method) network to get an accurate estimation ofthe state for each agent. Experimental results on two-step ma-trix games (cooperative game) and battle games (large-scalemixed cooperative-competitive games) demonstrate that ouralgorithm significantly outperforms state-of-the-art methods.

Research on the Combihation of Bayesian Learning and Reinforcement Learning

The Multi-Agent System Based on Reinforcement Learning

Optimal Exploration Algorithm of Multi-Agent Reinforcement Learning Methods (Student Abstract)

A multiagent reinforcement learning approach based on different states

A Two-Layered Multi-Agent Reinforcement Learning Model and Algorithm

Fully Decentralized Cooperative Multi-Agent Reinforcement Learning: A Survey

LMRL: a Multi-Agent Reinforcement Learning Model and Algorithm

Model-Based Bayesian Reinforcement Learning in Large Structured Domains

Towards Uncertainty in Decision: A Survey on Recent Advances and Challenges in Bayesian Reinforcement Learning

Cooperative Learning of Multi-Agent Systems Via Reinforcement Learning

HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism

Multi-agent Cooperative Combat Simulation in Naval Battlefield with Reinforcement Learning

Two Heads Are Better Than One: A Simple Exploration Framework for Efficient Multi-Agent Reinforcement Learning.

Multi-Agent Reinforcement Learning with Optimal Equivalent Action of Neighborhood

Bayesian Reinforcement Learning: A Survey

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

Intelligent Model Learning Based on Variance for Bayesian Reinforcement Learning

A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications

Intention Propagation for Multi-agent Reinforcement Learning

Hierarchical Method for Cooperative Multiagent Reinforcement Learning in Markov Decision Processes

Context-Aware Bayesian Network Actor-Critic Methods for Cooperative Multi-Agent Reinforcement Learning