The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications

Tim Tsz-Kit Lau,Biswa Sengupta
DOI: https://doi.org/10.48550/arXiv.2203.07092
2022-03-14
Abstract:We study two state-of-the-art solutions to the multi-agent pickup and delivery (MAPD) problem based on different principles -- multi-agent path-finding (MAPF) and multi-agent reinforcement learning (MARL). Specifically, a recent MAPF algorithm called conflict-based search (CBS) and a current MARL algorithm called shared experience actor-critic (SEAC) are studied. While the performance of these algorithms is measured using quite different metrics in their separate lines of work, we aim to benchmark these two methods comprehensively in a simulated warehouse automation environment.
Machine Learning,Multiagent Systems
What problem does this paper attempt to address?