Transient Multi-Agent Path Finding for Lifelong Navigation in Dense Environments

Jonathan Morag, Noy Gabay, Daniel Koyfman, Roni Stern
2024-12-05
Abstract: Multi-Agent Path Finding (MAPF) deals with finding conflict-free paths for a set of agents from an initial configuration to a given target configuration. The Lifelong MAPF (LMAPF) problem is a well-studied online version of MAPF in which an agent receives a new target when it reaches its current target. The common approach for solving LMAPF is to treat it as a sequence of MAPF problems, periodically replanning from the agents' current configurations to their current targets. A significant drawback in this approach is that in MAPF the agents must reach a configuration in which all agents are at their targets simultaneously, which is needlessly restrictive for LMAPF. Techniques have been proposed to indirectly mitigate this drawback. We describe cases where these mitigation techniques fail. As an alternative, we propose to solve LMAPF problems by solving a sequence of modified MAPF problems, in which the objective is for each agent to eventually visit its target, but not necessarily for all agents to do so simultaneously. We refer to this MAPF variant as Transient MAPF (TMAPF) and propose several algorithms for solving it based on existing MAPF algorithms. A limited experimental evaluation identifies some cases where using a TMAPF algorithm instead of a MAPF algorithm with an LMAPF framework can improve the system throughput significantly.
Multiagent Systems, Artificial Intelligence, Robotics
### What problem does this paper attempt to solve?

This paper addresses the **mismatch between Multi-Agent Path Finding (MAPF) and Lifelong Multi-Agent Path Finding (LMAPF)**. Specifically:

1. **Background and problem description**:
   - **MAPF problem**: Find conflict-free paths for a set of agents from an initial configuration to a given target configuration.
   - **LMAPF problem**: An online version of MAPF in which an agent receives a new target when it reaches its current one. Because agents keep receiving new tasks, LMAPF never requires all agents to be at their targets at the same time.

2. **Limitations of existing methods**:
   - The common approach to LMAPF treats it as a sequence of MAPF problems, periodically replanning from the agents' current configurations to their current targets. Each MAPF problem requires all agents to be at their targets simultaneously, which is needlessly restrictive for LMAPF.
   - This synchronization requirement can leave some scenarios without an effective solution, such as the situation shown in Figure 1, where two agents cannot exchange positions. Techniques proposed to mitigate the drawback indirectly fail in cases the paper describes.

3. **Proposed solution**:
   - The paper proposes a new variant, **Transient Multi-Agent Path Finding (TMAPF)**, in which each agent must eventually visit its target, but the agents need not all be at their targets simultaneously.
   - Relaxing the simultaneous-arrival requirement admits plans in which an agent visits its target and then moves away to make room for others.

4. **Improvements and experimental results**:
   - A limited experimental evaluation identifies cases where using a TMAPF algorithm instead of a MAPF algorithm within an LMAPF framework significantly improves system throughput, especially in dense environments.
   - In the reported experiments, the TMAPF algorithms achieve higher throughput than their MAPF counterparts when the number of target locations is limited.
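
The difference between the two solution conditions can be sketched in a few lines of Python. This is a minimal illustration, not the paper's algorithm: the function names, the path representation (one list of vertices per agent, one vertex per time step), and the four-vertex line graph `0-1-2-3` are all assumptions made for the example, and the instance is not the one in Figure 1. On a line graph agents cannot pass each other, so this instance has no MAPF solution, yet the plan below lets each agent visit its target.

```python
from itertools import combinations


def position(path, t):
    """Vertex occupied at time t; an agent stays at its last vertex after its path ends."""
    return path[min(t, len(path) - 1)]


def is_conflict_free(paths):
    """Check for vertex conflicts and swapping conflicts between every pair of agents."""
    horizon = max(len(p) for p in paths)
    for t in range(horizon):
        for p, q in combinations(paths, 2):
            if position(p, t) == position(q, t):
                return False  # vertex conflict: same vertex at the same time
            if (position(p, t) == position(q, t + 1)
                    and position(p, t + 1) == position(q, t)):
                return False  # swapping conflict: agents exchange vertices
    return True


def solves_mapf(paths, targets):
    """MAPF condition: every agent ends (and stays) at its target."""
    return is_conflict_free(paths) and all(
        p[-1] == g for p, g in zip(paths, targets))


def solves_tmapf(paths, targets):
    """TMAPF condition: every agent visits its target at some time step."""
    return is_conflict_free(paths) and all(
        g in p for p, g in zip(paths, targets))


# Hypothetical instance on the line graph 0-1-2-3.
# Agent 1 starts at 0 with target 2; agent 2 starts at 3 with target 1.
targets = [2, 1]
paths = [
    [0, 1, 2, 1, 0, 0],  # agent 1 visits vertex 2, then retreats
    [3, 3, 3, 2, 1, 1],  # agent 2 waits, then advances to vertex 1
]

print(solves_tmapf(paths, targets))  # True
print(solves_mapf(paths, targets))   # False: agent 1 does not end at vertex 2
```

The only change between the two checks is the target condition (`p[-1] == g` versus `g in p`); the conflict rules are identical.
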
### Formula representation

The key formulas involved in the paper are:

- **Path length definition**:
  \[ \text{len}(\pi_i) = |\pi_i| - 1 \]
  where \(\pi_i\) is the path of agent \(a_i\) and \(|\pi_i|\) is the number of vertices in the path, counted with repetition (a vertex may appear more than once). Subtracting 1 gives the number of time steps in the path.
- **Sum of Costs (SOC)**:
  \[ \text{SOC}(\pi) = \sum_{\pi_i \in \pi} \text{len}(\pi_i) \]
- **Makespan**:
  \[ \text{Makespan}(\pi) = \max_{\pi_i \in \pi} \text{len}(\pi_i) \]

These formulas are used to evaluate the performance of different path planning algorithms.

### Summary

The core problem of the paper is the mismatch between MAPF and LMAPF. By introducing TMAPF, which requires each agent only to visit its target, the paper provides a more flexible planning scheme that can improve system throughput in some cases, notably dense environments with few target locations.
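
As a worked check of the cost definitions, the sketch below computes path length, SOC, and makespan for a small made-up plan. It assumes the standard reading in which \(\text{len}(\pi_i) = |\pi_i| - 1\) and both SOC and makespan are taken over \(\text{len}(\pi_i)\) directly; the function names and the example paths are illustrative and do not come from the paper.

```python
def path_len(path):
    """len(pi_i) = |pi_i| - 1: number of time steps in a path given as a vertex list."""
    return len(path) - 1


def sum_of_costs(paths):
    """SOC: sum of the individual path lengths."""
    return sum(path_len(p) for p in paths)


def makespan(paths):
    """Makespan: length of the longest individual path."""
    return max(path_len(p) for p in paths)


# Hypothetical plan for two agents; repeated vertices are wait actions.
paths = [
    [0, 1, 2],     # 3 vertices, length 2
    [3, 3, 2, 1],  # 4 vertices, length 3 (one wait at vertex 3)
]

print(sum_of_costs(paths))  # 5
print(makespan(paths))      # 3
```
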