Abstract: Multi-Agent Path Finding (MAPF) is the problem of moving a team of agents to their goal locations without collisions. In this paper, we study the lifelong variant of MAPF, where agents are constantly engaged with new goal locations, such as in large-scale automated warehouses. We propose a new framework Rolling-Horizon Collision Resolution (RHCR) for solving lifelong MAPF by decomposing the problem into a sequence of Windowed MAPF instances, where a Windowed MAPF solver resolves collisions among the paths of the agents only within a bounded time horizon and ignores collisions beyond it. RHCR is particularly well suited to generating pliable plans that adapt to continually arriving new goal locations. We empirically evaluate RHCR with a variety of MAPF solvers and show that it can produce high-quality solutions for up to 1,000 agents (= 38.9% of the empty cells on the map) for simulated warehouse instances, significantly outperforming existing work.

What problem does this paper attempt to address?

The problem this paper addresses is how to efficiently plan collision-free paths for multiple agents (such as mobile robots) in large-scale automated warehouses, especially when the agents are continuously assigned new target positions. Specifically, the paper focuses on **lifelong Multi-Agent Path Finding (lifelong MAPF)**.

### Problem Background

In Multi-Agent Path Finding (MAPF), the goal is to move a group of agents from their starting positions to their target positions while avoiding collisions among them. The quality of a MAPF solution is usually measured by **flowtime** or **makespan**. In many practical applications, however, agents do not stop after reaching their target positions but are assigned new ones instead; this variant is known as **lifelong MAPF**. For example, in large-scale automated warehouses, drive units continuously move inventory pallets or packages from one location to another, and these locations change dynamically.

### Limitations of Existing Methods

Existing methods for solving lifelong MAPF include:

1. **Solving the Problem as a Whole**: Plan for all target positions at once. This requires prior knowledge of all target positions and has limited scalability.
2. **Re-planning at Each Step**: Re-plan paths for all agents at each time step. This has a large computational cost and limited scalability.
3. **Partial Re-planning**: Re-plan paths only for agents that have been newly assigned target positions. This requires the map to have a specific structure and may lead to sub-optimal solutions or a failure to find any solution.

### New Method Proposed in the Paper

To address these limitations, the paper proposes a new framework: **Rolling-Horizon Collision Resolution (RHCR)**.

The main features of RHCR are as follows:

- **Decomposition into Windowed MAPF Instances**: Decompose the lifelong MAPF problem into a sequence of Windowed MAPF instances, each of which resolves collisions only within a bounded time window and ignores collisions beyond it.
- **Periodic Re-planning**: Re-plan once every fixed number of time steps (the replanning period \( h \)), rather than at every time step.
- **Adaptation to New Target Positions**: Allow an agent to be assigned multiple target positions in the same Windowed MAPF instance, with collisions resolved only within the time window.

### Main Contributions

1. **Improved Efficiency**: By limiting the time window for collision resolution, RHCR significantly reduces the computational cost while maintaining a high throughput.
2. **Adaptation to Dynamic Environments**: Plans generated by RHCR can flexibly adapt to newly arriving target positions.
3. **High Scalability**: Experiments show that RHCR can handle up to 1,000 agents (38.9% of the empty cells on the map) in a simulated warehouse environment, significantly outperforming existing methods.

### Conclusion

This paper proposes a new framework, RHCR, for solving the lifelong multi-agent path finding problem in large-scale automated warehouses. By combining rolling-horizon collision resolution with Windowed MAPF instances, RHCR improves computational efficiency and adapts better to dynamically changing target positions.
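
The windowing and periodic replanning described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the names `cell_at`, `collisions_in_window`, `run_rhcr`, the window length `w`, and the `episodes` count are all hypothetical, agents are assumed to wait at the last cell of their path, and the Windowed MAPF solver is passed in as a callable rather than implemented. The check that `w` is at least `h` is likewise an assumption made here so that every executed step lies inside the collision-checked window.

```python
from itertools import combinations
from typing import Callable, Dict, List, Tuple

Cell = Tuple[int, int]
Path = List[Cell]  # path[t] is the agent's cell at timestep t
Solver = Callable[[Dict[str, Cell], int], Dict[str, Path]]


def cell_at(path: Path, t: int) -> Cell:
    """Cell occupied at timestep t (assumption: wait at the last cell)."""
    return path[min(t, len(path) - 1)]


def collisions_in_window(paths: Dict[str, Path], w: int) -> List[Tuple[str, str, int]]:
    """Return (agent_a, agent_b, t) for collisions at timesteps 1..w only.

    Anything after timestep w is deliberately ignored; that is the
    windowing idea. Both same-cell and position-swap collisions count.
    """
    found = []
    for a, b in combinations(sorted(paths), 2):
        for t in range(1, w + 1):
            same_cell = cell_at(paths[a], t) == cell_at(paths[b], t)
            swapped = (cell_at(paths[a], t) == cell_at(paths[b], t - 1)
                       and cell_at(paths[b], t) == cell_at(paths[a], t - 1))
            if same_cell or swapped:
                found.append((a, b, t))
    return found


def run_rhcr(starts: Dict[str, Cell], solver: Solver, h: int, w: int,
             episodes: int) -> Dict[str, Path]:
    """Call the windowed solver once per episode and execute h steps of its plan."""
    if w < h:
        raise ValueError("window w must cover the replanning period h")
    positions = dict(starts)
    history = {agent: [cell] for agent, cell in starts.items()}
    for _ in range(episodes):
        plans = solver(positions, w)  # hypothetical Windowed MAPF solver
        if collisions_in_window(plans, w):
            raise ValueError("solver returned colliding paths inside the window")
        for agent, plan in plans.items():
            executed = [cell_at(plan, t) for t in range(1, h + 1)]
            history[agent].extend(executed)
            positions[agent] = executed[-1]
    return history
```

Any function that maps current positions and a window length to per-agent paths can be passed as `solver`. The loop only checks that the returned paths are collision-free for the first `w` timesteps, executes the first `h` of them, and then replans from the new positions.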

Lifelong Multi-Agent Path Finding in Large-Scale Warehouses

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding

Moving Forward in Formation: A Decentralized Hierarchical Learning Approach to Multi-Agent Moving Together

Multi-Agent Path Finding with Real Robot Dynamics and Interdependent Tasks for Automated Warehouses

Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks

Multi-Agent Path Finding with Heterogeneous Geometric and Kinematic Constraints in Continuous Space

Caching-Augmented Lifelong Multi-Agent Path Finding

Multi-Agent Path Finding Method Based on Evolutionary Reinforcement Learning

Multi-agent Pathfinding with Local and Global Guidance

Traffic Flow Optimisation for Lifelong Multi-Agent Path Finding

The Study of Highway for Lifelong Multi-Agent Path Finding

Lifelong Multi-Agent Path Finding in A Dynamic Environment

Scaling Lifelong Multi-Agent Path Finding to More Realistic Settings: Research Challenges and Opportunities

Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments

Lifelong Path Planning with Kinematic Constraints for Multi-Agent Pickup and Delivery

ALPHA: Attention-based Long-horizon Pathfinding in Highly-structured Areas

Multi-Agent Path Finding in Continuous Spaces with Projected Diffusion Models

Dynamic Path Finding for Multi-Load Agent Pickup and Delivery Problem

MAPF in 3D Warehouses: Dataset and Analysis

Anytime Multi-Agent Path Finding via Machine Learning-Guided Large Neighborhood Search

Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning