Abstract: Multi-Agent Path Finding (MAPF) deals with finding conflict-free paths for a set of agents from an initial configuration to a given target configuration. The Lifelong MAPF (LMAPF) problem is a well-studied online version of MAPF in which an agent receives a new target when it reaches its current target. The common approach for solving LMAPF is to treat it as a sequence of MAPF problems, periodically replanning from the agents' current configurations to their current targets. A significant drawback in this approach is that in MAPF the agents must reach a configuration in which all agents are at their targets simultaneously, which is needlessly restrictive for LMAPF. Techniques have been proposed to indirectly mitigate this drawback. We describe cases where these mitigation techniques fail. As an alternative, we propose to solve LMAPF problems by solving a sequence of modified MAPF problems, in which the objective is for each agent to eventually visit its target, but not necessarily for all agents to do so simultaneously. We refer to this MAPF variant as Transient MAPF (TMAPF) and propose several algorithms for solving it based on existing MAPF algorithms. A limited experimental evaluation identifies some cases where using a TMAPF algorithm instead of a MAPF algorithm with an LMAPF framework can improve the system throughput significantly.

What problem does this paper attempt to address?

This paper addresses the **mismatch between Multi-Agent Path Finding (MAPF) and Lifelong Multi-Agent Path Finding (LMAPF)**. Specifically:

1. **Background and problem description**:
   - **MAPF problem**: Find conflict-free paths that take a group of agents from an initial configuration to a target configuration.
   - **LMAPF problem**: An online version of MAPF. When an agent reaches its current target, it receives a new one. The agents therefore never need to be at their targets at the same time; each agent simply keeps receiving new targets.
2. **Limitations of existing methods**:
   - The common approach to LMAPF treats it as a sequence of MAPF problems, replanning periodically. Its main drawback is that a MAPF solution must end with all agents at their targets simultaneously, which is needlessly restrictive for LMAPF.
   - Because of this synchronization requirement, no good solution can be found in some scenarios, such as the one shown in Figure 1, where two agents cannot exchange positions.
3. **Proposed solution**:
   - The paper proposes a new variant, **Transient Multi-Agent Path Finding (TMAPF)**, in which each agent only has to visit its target at some point. The agents are not required to be at their targets simultaneously.
   - By dropping the simultaneous-arrival requirement, TMAPF admits solutions in situations where MAPF is more restrictive.
4. **Improvements and experimental results**:
   - A limited experimental evaluation shows that, in some cases, using a TMAPF algorithm instead of a MAPF algorithm within an LMAPF framework can significantly improve system throughput, especially in environments where target locations are few and densely packed.
   - When the number of target locations is limited, the TMAPF algorithms outperform the corresponding MAPF algorithms.

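The difference between the two goal conditions can be made concrete with a small sketch. The function names, the three-vertex line graph `0 - 1 - 2`, and the two agents sharing target `1` are all hypothetical and chosen for illustration; this is not the paper's Figure 1 or its algorithms. The sketch assumes a path is a list of vertices, one per time step, that an agent waits at its last vertex once its path ends, and it does not check that consecutive vertices are adjacent in the graph.

```python
from itertools import combinations


def pad(paths):
    """Extend every path to the longest one by waiting at its last vertex."""
    horizon = max(len(p) for p in paths)
    return [p + [p[-1]] * (horizon - len(p)) for p in paths]


def conflict_free(paths):
    """True if no two agents share a vertex or swap along an edge."""
    for p, q in combinations(pad(paths), 2):
        for t in range(len(p)):
            if p[t] == q[t]:
                return False  # vertex conflict
            if t > 0 and p[t] == q[t - 1] and p[t - 1] == q[t]:
                return False  # swapping conflict
    return True


def satisfies_mapf_goal(paths, targets):
    """MAPF: every agent ends (and stays) at its target."""
    return all(p[-1] == g for p, g in zip(paths, targets))


def satisfies_tmapf_goal(paths, targets):
    """TMAPF: every agent visits its target at some time step."""
    return all(g in p for p, g in zip(paths, targets))


# Hypothetical instance: line graph 0 - 1 - 2, both agents have target 1.
targets = [1, 1]
paths = [
    [0, 1, 0],  # agent 1 visits vertex 1, then steps back to vertex 0
    [2, 2, 1],  # agent 2 waits one step, then moves to vertex 1
]

print(conflict_free(paths))                  # no collisions between the agents
print(satisfies_tmapf_goal(paths, targets))  # each agent visits its target
print(satisfies_mapf_goal(paths, targets))   # agent 1 does not end at its target
```

In this instance no MAPF solution exists at all, because both agents would have to occupy vertex `1` at the same time, while the plan above satisfies the TMAPF condition.
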
### Formula representation

The key formulas in the paper are:

- **Path length definition**:
  \[ \text{len}(\pi_i) = |\pi_i| - 1 \]
  where \(\pi_i\) is the path of agent \(a_i\) and \(|\pi_i|\) is the number of vertices in the path, counting repeated vertices. Subtracting 1 gives the number of time steps the path takes.
- **Sum of Costs (SOC)**:
  \[ \text{SOC}(\pi) = \sum_{\pi_i \in \pi} \text{len}(\pi_i) = \sum_{\pi_i \in \pi} (|\pi_i| - 1) \]
- **Makespan**:
  \[ \text{Makespan}(\pi) = \max_{\pi_i \in \pi} \text{len}(\pi_i) = \max_{\pi_i \in \pi} (|\pi_i| - 1) \]

These measures are used to evaluate the solutions returned by different path planning algorithms.

### Summary

The paper addresses the mismatch between the MAPF goal condition and what LMAPF actually requires. By introducing TMAPF, it provides a more flexible planning formulation that can improve system throughput in dense environments.
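
As a small illustration of the cost measures in the formula section, the sketch below computes them for a set of paths. It assumes the cost of a path is its number of time steps, \(|\pi_i| - 1\), and that a path is a plain list of vertices with repeats counted. The function names and the example paths are hypothetical.

```python
def path_len(path):
    """Number of time steps in a path: vertices (with repeats) minus one."""
    return len(path) - 1


def sum_of_costs(paths):
    """Sum of the individual path lengths over all agents."""
    return sum(path_len(p) for p in paths)


def makespan(paths):
    """Length of the longest individual path."""
    return max(path_len(p) for p in paths)


# Hypothetical plan for two agents; "D" appears twice because the agent waits.
paths = [
    ["A", "B", "C"],            # 2 time steps
    ["D", "D", "E", "F", "G"],  # 4 time steps
]

print(sum_of_costs(paths))  # 2 + 4 = 6
print(makespan(paths))      # max(2, 4) = 4
```
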

Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Traffic Flow Optimisation for Lifelong Multi-Agent Path Finding

Caching-Augmented Lifelong Multi-Agent Path Finding

Scaling Lifelong Multi-Agent Path Finding to More Realistic Settings: Research Challenges and Opportunities

Learn to Follow: Decentralized Lifelong Multi-agent Pathfinding via Planning and Learning

Anytime Multi-Agent Path Finding via Machine Learning-Guided Large Neighborhood Search

Lifelong Multi-Agent Path Finding in Large-Scale Warehouses

Lifelong Multi-Agent Path Finding in A Dynamic Environment.

Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks

The Study of Highway for Lifelong Multi-Agent Path Finding

Safe Multi-Agent Pathfinding with Time Uncertainty

Multi-Agent Path Finding with Heterogeneous Geometric and Kinematic Constraints in Continuous Space

ALPHA: Attention-based Long-horizon Pathfinding in Highly-structured Areas

Anytime Multi-Agent Path Finding using Operation Parallelism in Large Neighborhood Search

MAPF-LNS2: Fast Repairing for Multi-Agent Path Finding via Large Neighborhood Search

Leveraging Experience in Lifelong Multi-Agent Pathfinding

Anytime Multi-Agent Path Finding with an Adaptive Delay-Based Heuristic

Multi-agent Path Finding with Continuous Time Viewed Through Satisfiability Modulo Theories (SMT)

Overview: Generalizations of Multi-Agent Path Finding to Real-World Scenarios

Cooperative Multi-Agent Path Finding: Beyond Path Planning and Collision Avoidance