Abstract:Many current large-scale multiagent team implementations can be characterized as following the belief-desire-intention (BDI) paradigm, with explicit representation of team plans. Despite their promise, current BDI team approaches lack tools for quantitative performance analysis under uncertainty. Distributed partially observable Markov decision problems (POMDPs) are well suited for such analysis, but the complexity of finding optimal policies in such models is highly intractable. The key contribution of this article is a hybrid BDI-POMDP approach, where BDI team plans are exploited to improve POMDP tractability and POMDP analysis improves BDI team plan performance. Concretely, we focus on role allocation, a fundamental problem in BDI teams: which agents to allocate to the different roles in the team. The article provides three key contributions. First, we describe a role allocation technique that takes into account future uncertainties in the domain; prior work in multiagent role allocation has failed to address such uncertainties. To that end, we introduce RMTDP (Role-based Markov Team Decision Problem), a new distributed POMDP model for analysis of role allocations. Our technique gains in tractability by significantly curtailing RMTDP policy search; in particular, BDI team plans provide incomplete RMTDP policies, and the RMTDP policy search fills the gaps in such incomplete policies by searching for the best role allocation. Our second key contribution is a novel decomposition technique to further improve RMTDP policy search efficiency. Even though limited to searching role allocations, there are still combinatorially many role allocations, and evaluating each in RMTDP to identify the best is extremely difficult. Our decomposition technique exploits the structure in the BDI team plans to significantly prune the search space of role allocations. Our third key contribution is a significantly faster policy evaluation algorithm suited for our BDI-POMDP hybrid approach. Finally, we also present experimental results from two domains: mission rehearsal simulation and RoboCupRescue disaster rescue simulation.

Efficient Multiagent Planning via Shared Action Suggestions

Integrating Decision Sharing with Prediction in Decentralized Planning for Multi-Agent Coordination under Uncertainty.

Planning for Decentralized Control of Multiple Robots Under Uncertainty

Optimally Solving Simultaneous-Move Dec-POMDPs: The Sequential Central Planning Approach

A Role-Based POMDPs Approach for Decentralized Implicit Cooperation of Multiple Agents.

Factored Online Planning in Many-Agent POMDPs

A Framework for Sequential Planning in Multi-Agent Settings

Decentralized Control of Partially Observable Markov Decision Processes using Belief Space Macro-actions

Learning for Decentralized Control of Multiagent Systems in Large, Partially-Observable Stochastic Environments

Optimizing Agent Collaboration through Heuristic Multi-Agent Planning

Scalable Planning and Learning for Multiagent POMDPs: Extended Version

Bridging the Gap between Partially Observable Stochastic Games and Sparse POMDP Methods

Decentralized control of multi-robot partially observable Markov decision processes using belief space macro-actions

Scalable Decision-Theoretic Planning in Open and Typed Multiagent Systems

Multi-Agent Planning under Uncertainty with Monte Carlo Q-Value Function

Communication Decision in Decentralized Control of Coordinated System

Interactive POMDP Lite: Towards Practical Planning to Predict and Exploit Intentions for Interacting with Self-Interested Agents

Scalable Anytime Planning for Multi-Agent MDPs

Intention-Aware Navigation in Crowds with Extended-Space POMDP Planning

Hybrid BDI-POMDP Framework for Multiagent Teaming

Modeling Communication of Collaborative Multi-Agent System under Epistemic Planning