Lifelong Multi-Agent Path Finding in Large-Scale Warehouses

Jiaoyang Li, Andrew Tinka, Scott Kiesel, Joseph W. Durham, T. K. Satish Kumar, Sven Koenig
DOI: https://doi.org/10.48550/arXiv.2005.07371
2021-03-13
Abstract: Multi-Agent Path Finding (MAPF) is the problem of moving a team of agents to their goal locations without collisions. In this paper, we study the lifelong variant of MAPF, where agents are constantly engaged with new goal locations, such as in large-scale automated warehouses. We propose a new framework Rolling-Horizon Collision Resolution (RHCR) for solving lifelong MAPF by decomposing the problem into a sequence of Windowed MAPF instances, where a Windowed MAPF solver resolves collisions among the paths of the agents only within a bounded time horizon and ignores collisions beyond it. RHCR is particularly well suited to generating pliable plans that adapt to continually arriving new goal locations. We empirically evaluate RHCR with a variety of MAPF solvers and show that it can produce high-quality solutions for up to 1,000 agents (= 38.9% of the empty cells on the map) for simulated warehouse instances, significantly outperforming existing work.
Artificial Intelligence, Multiagent Systems, Robotics
What problem does this paper attempt to address?
The problem this paper addresses is how to efficiently plan collision-free paths for multiple agents (such as mobile robots) in large-scale automated warehouses, especially when the agents are continuously assigned new target positions. Specifically, the paper focuses on **Lifelong Multi-Agent Path Finding (lifelong MAPF)**.

### Problem Background

In Multi-Agent Path Finding (MAPF), the goal is to move a group of agents from their starting positions to their target positions while avoiding collisions among them. The quality of a MAPF solution is usually measured by **flowtime** (the sum of the agents' arrival times) or **makespan** (the arrival time of the last agent). However, in many practical applications, agents do not stop after reaching their target positions but are assigned new ones instead, which is known as **lifelong MAPF**. For example, in large-scale automated warehouses, drive units continuously move inventory pallets or packages from one location to another, and these locations change dynamically.

### Limitations of Existing Methods

Existing methods for solving lifelong MAPF include:

1. **Solving the Problem as a Whole**: Plan for all target positions at once. However, this requires prior knowledge of all target positions and has limited scalability.
2. **Re-planning at Each Step**: Re-plan paths for all agents at each time step. This incurs a large computational cost and also scales poorly.
3. **Partial Re-planning**: Re-plan paths only for agents that have just been assigned new target positions. This requires the map to have a specific structure and may lead to sub-optimal solutions or a failure to find any solution.

### New Method Proposed in the Paper

To address these limitations, the paper proposes a new framework, **Rolling-Horizon Collision Resolution (RHCR)**.
The main features of RHCR are as follows:

- **Decomposition into Windowed MAPF Instances**: Decompose the lifelong MAPF problem into a sequence of Windowed MAPF instances, each of which resolves conflicts only within a bounded time window and ignores conflicts beyond it.
- **Periodic Re-planning**: Re-plan every \( h \) time steps (the replanning period), rather than at every time step.
- **Adaptation to New Target Positions**: An agent can be assigned multiple target positions in the same Windowed MAPF instance, and only conflicts that fall within the time window need to be resolved.

### Main Contributions

1. **Improved Efficiency**: By limiting the time window for conflict resolution, RHCR significantly reduces the computational cost while maintaining high throughput.
2. **Adaptation to Dynamic Environments**: Plans generated by RHCR can flexibly adapt to newly arriving target positions.
3. **High Scalability**: Experiments show that RHCR can handle up to 1,000 agents (38.9% of the empty cells on the map) in simulated warehouse instances, significantly outperforming existing methods.

### Conclusion

In conclusion, this paper proposes a new framework, RHCR, for solving the lifelong multi-agent path finding problem in large-scale automated warehouses. By combining rolling-horizon collision resolution with Windowed MAPF instances, RHCR improves computational efficiency and adapts better to dynamically changing target positions.
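
### Illustrative Sketch

The two mechanisms summarized above, windowed conflict checking and periodic re-planning, can be illustrated with a short Python sketch. This is not the authors' implementation: the names `first_conflict`, `rolling_horizon`, and the `solve_windowed` callback are hypothetical, the window length is called `w` here, and the sketch assumes `w >= h` so that every executed step lies inside the collision-checked window. Any Windowed MAPF solver could stand in for `solve_windowed`.

```python
from typing import Callable, List, Optional, Tuple

Cell = Tuple[int, int]
Path = List[Cell]  # path[t] is the agent's cell at timestep t


def cell_at(path: Path, t: int) -> Cell:
    """Assumption of this sketch: an agent waits at its last cell once its path ends."""
    return path[min(t, len(path) - 1)]


def first_conflict(paths: List[Path], w: int) -> Optional[Tuple[int, int, int]]:
    """Return (agent_i, agent_j, t) for the earliest vertex or swap conflict
    with 1 <= t <= w, or None. Conflicts after timestep w are ignored."""
    for t in range(1, w + 1):
        for i in range(len(paths)):
            for j in range(i + 1, len(paths)):
                same_cell = cell_at(paths[i], t) == cell_at(paths[j], t)
                swapped = (cell_at(paths[i], t) == cell_at(paths[j], t - 1)
                           and cell_at(paths[i], t - 1) == cell_at(paths[j], t))
                if same_cell or swapped:
                    return (i, j, t)
    return None


def rolling_horizon(
    starts: List[Cell],
    solve_windowed: Callable[[List[Cell], int], List[Path]],
    w: int,
    h: int,
    total_steps: int,
) -> List[List[Cell]]:
    """Re-plan every h timesteps; execute only the first h steps of each plan.

    solve_windowed(positions, w) is a hypothetical callback that returns one
    path per agent, starting at the given positions and collision-free for
    the first w timesteps. Goal assignment is left to the callback.
    """
    if w < h:
        raise ValueError("the window w must cover the replanning period h")
    positions = list(starts)
    trace = [list(positions)]
    for _ in range(0, total_steps, h):
        paths = solve_windowed(positions, w)
        if first_conflict(paths, w) is not None:
            raise RuntimeError("solver returned a conflict inside the window")
        for t in range(1, h + 1):
            positions = [cell_at(p, t) for p in paths]
            trace.append(list(positions))
    return trace
```

In this sketch, two paths that meet at timestep 3 count as conflict-free for `w = 2` but not for `w = 3`. A smaller window therefore leaves the solver fewer collisions to resolve, at the cost of planning less far ahead.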