Abstract:The Robotic Mobile Fulfillment System (RMFS) is a "goods-to-worker" system that utilizes pods to store goods and employs robots for pods movement. In the RMFS, the multi-agent pickup and delivery (MAPD) problem which robots complete pickup and delivery tasks and ensure that there are no conflicts has been extensively studied. Several studies have made varying assumptions and layouts to address the MAPD problem, making it challenging to compare their proposed algorithms. This study presents a benchmark for MAPD based on eight factors that influence robot conflicts such as layout scale, pillar, cross-aisle, direction, storage strategy, the number of robots, the number of tasks, and dynamic events. The 12800 instances with 256 different combinations are designed based on 8 parameters with 2 levels that affect the number of conflicts. Identical task-set was used for 8 combinations with different numbers of robots, directions, and with or without cross-aisle. The robot and layout configurations ensure scalability for subsequent research. The objective of MAPD is to minimize the total completion time to demonstrate the robotic efficiency. Three different rules and algorithms were used to determine the lower bound and upper bound. The selection method based on hardness is proposed to obtain a more discriminant benchmark. The 2560 instances are selected to constitute the benchmark considering hardness, exhaustiveness, scalability, and amenity of statistical analysis. This benchmark can be utilized by researchers and practitioners for comparing different methods, rules, and algorithms for the MAPD problem in RMFS, and can be extended according to research problems, objectives, and actual system requirements, such as increasing conflicts for more challenging instances or decreasing conflicts for enhanced safety in the actual system. In conclusion, this paper proposes a benchmark for MAPD in RMFS to be utilized by researchers and practitioners through the analysis of conflicts, robots, and layout.

The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks

Double-Deck Multi-Agent Pickup and Delivery: Multi-Robot Rearrangement in Large-Scale Warehouses

Dynamic Path Finding for Multi-Load Agent Pickup and Delivery Problem

Integrated Task Assignment and Path Planning for Capacitated Multi-Agent Pickup and Delivery

Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses

Multi-Agent Path Finding with Heterogeneous Geometric and Kinematic Constraints in Continuous Space

Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

Multi-Agent Path Finding Method Based on Evolutionary Reinforcement Learning

MAPF and MAPD: Recent Developments and Future Directions (short paper)

Multi-agent Pathfinding with Local and Global Guidance

MAPDP: Cooperative Multi-Agent Reinforcement Learning to Solve Pickup and Delivery Problems

Lifelong Path Planning with Kinematic Constraints for Multi-Agent Pickup and Delivery

Lifelong Multi-Agent Path Finding in Large-Scale Warehouses

Collaborative optimization of task scheduling and multi-agent path planning in automated warehouses

PCE: Multi-Agent Path Finding Via Priority-Aware Communication & Experience Learning

Benchmark for multi-agent pickup and delivery problem in a robotic mobile fulfillment system

Multi-Agent Target Assignment and Path Finding for Intelligent Warehouse: A Cooperative Multi-Agent Deep Reinforcement Learning Perspective

Robust Multi-Agent Pickup and Delivery with Delays