LayeredMAPF: A Decomposition of Mapf Instance to Reduce Solving Costs

Zhuo Yao

doi:10.20944/preprints202503.0298.v1

Submitted:

04 March 2025

Posted:

05 March 2025

You are already at the latest version

Abstract

Multi-agent pathfinding (MAPF) holds significant utility within autonomous systems, however, the calculation and memory space required for multi-agent path finding (MAPF) grows exponentially as the number of agents increases. This often results in some MAPF instances being unsolvable under limited computational resources and memory space, thereby limiting the application of MAPF in complex scenarios. Hence, we propose a decomposition approach for MAPF instances, which breaks down instances involving a large number of agents into multiple isolated subproblems involving fewer agents. Moreover, we present a framework to enable general MAPF algorithms to solve each subproblem independently and merge their solutions into one conflict-free final solution, and avoid loss of solvability as much as possible. Unlike existing works that propose isolated methods aimed at reducing the time cost of MAPF, our method is applicable to all MAPF methods. In our results, we apply decomposition to multiple state-of-the-art MAPF methods using a classic MAPF benchmark\footnote{https://movingai.com/benchmarks/mapf.html}. The decomposition of MAPF instances is completed on average within 1s, and its application to seven MAPF methods reduces the memory usage or time cost significantly, particularly for serial methods. Based on massive experiments, we speculate the possibilty about loss of solvability caused by our method is $<$ 1\%. To facilitate further research within the community, we have made the source code of the proposed algorithm publicly available\footnote{https://github.com/JoeYao-bit/LayeredMAPF/tree/minimize\_dependence}.

Keywords:

Multi-Agent Path Finding (MAPF)

;

graph theory

Subject:

Computer Science and Mathematics - Robotics

1. Introduction

Multi-agent pathfinding (MAPF), as its name suggests, computes a set of collision-free paths for multiple agents from their respective starting locations to target locations. MAPF is widely utilized in autonomous systems, such as automated warehouses [1] and UAV traffic management [2].

Existing methods for MAPF are capable of determining optimal or bounded suboptimal solutions, but efficiency remains a key factor limiting its application. Researchers have proposed novel methods to address this issue, such as trading off solution quality to reduce runtime, reducing search branch factors, and solving agents independently based on their priorities. However, these techniques are often not applicable to all MAPF methods, as they are limited to specific types of MAPF problems.

Motivated by the phenomenon that the cost of solving MAPF instances grows exponentially as the number of agents increases [3], we propose a novel approach to reduce the cost of MAPF methods by decomposing a MAPF instance into multiple smaller subproblems. These subproblems are solved independently, while considering the solutions of other subproblems as dynamic obstacles.

This idea bears resemblance to Priority-Based Search (PBS) [4], which assigns a unique priority to each agent and solves agents separately in decreasing priority order. PBS can be viewed as decomposing a MAPF instance into subproblems, each involving only one agent. While PBS is efficient, it lacks a guarantee of completeness. Compring to PBS, our method try to reduce possibility of loss of solvability by make each subproblem contain more than one agent. We formulate the decomposition of MAPF instances as a progressive optimization problem. Initially, we decompose the MAPF instance into multiple subproblems without restricting the order of solving, and then further split them into smaller subproblems with a limited solving order. To minimize loss of solvability and minimize the size of each subproblem, we evaluate the solvability of each step of decomposition, allowing only decompositions that pass a solvability check. As a result, we demonstrate the performance of our method across various maps and illustrate its improvement on multiple cutting-edge MAPF methods.

The contributions of this manuscript are listed as follows:

1. We propose a novel method that decomposes a MAPF instance into multiple subproblems with minimizing loss of solvability. An overview of our method is depicted in Figure 1.

2. We present a framework that enables general MAPF methods to solve subproblems independently and merge their solutions to obtain the solution of the original problem.

3. We conduct extensive testing to evaluate how the decomposition of MAPF instances influences various MAPF methods using classic MAPF datasets. We evaluate the impact in terms of time cost, memory usage, success rate, and solution quality.

The remainder of this article is organized as follows. In Section 2, we review related problems and methods aimed at reducing the solving cost of MAPF instances. We then define basic terminology in Section 3 and introduce our method in Section 4. Our test results regarding the performance of decomposition under various maps are presented in Section 5, followed by an examination of how decomposition benefits multiple cutting-edge MAPF methods in Section 6. Finally, Section 7 concludes this article.

2. Related Works

In this section, we provide an overview of existing works aimed at reducing the time cost of MAPF methods. MAPF methods typically involve searching for paths connecting the starting and target locations for multiple agents. Broadly, there are two approaches to determining the paths of agents:

1, plan one agent’s path from its start to target while keeping other agents static and attempting to avoid conflicts with the paths of other agents. This approach is adopted by CBS-based methods, HCA*, LNS, and PBS.

2, plan for all agents simultaneously by avoiding conflicts with each other at each time step. This approach is employed by LaCAM, PIBT, and Push and Swap.

We refer to the first approach as serial MAPF methods and the second approach as parallel MAPF methods. This distinction is crucial in understanding how the decomposition of a MAPF instance applies to them. Further details about this difference can be found in Section 4.4.

2.1. Conflict Based Search

Confict-Based Search (CBS) [5] is a two-level complete and optimal multi-agent pathfinding algorithm. At the high level, CBS employs a constraint tree in which nodes represent conflicts between paths of two agents. By splitting a conflict, CBS obtains constraints for the related two agents, respectively. The low level utilizes heuristic single-agent searches to find new paths that satisfy all constraints imposed by the high-level constraint tree node. Therefore, CBS operates as a serial MAPF method. If CBS finds a new path, it checks for conflicts between the new path and previous paths. If no new conflicts are found, CBS exits with a conflict-free solution; otherwise, it inserts the new constraints generated from a conflict as nodes into the conflict tree and repeats the process of splitting conflicts and searching for new paths. If no new conflicts are found, CBS exits with a conflict-free solution. If conflicts exist in the current high-level node, the algorithm first selects one conflict, then generates a pair of constraints to resolve the conflict. Finally, it generates two child nodes, each adding a new constraint (together with existing constraints inherited from the current node). Then in each child node, paths are then replanned to satisfy new constraints and conflicts are updated.

While CBS can find optimal solutions, it can be time-consuming in certain cases. To accelerate CBS, various improvements have been proposed.

• Trade off solution quality for efficency

Bounded CBS (BCBS) [6] incorporates focal search into the low-level search of CBS, enabling the low-level search to consider avoiding conflicts with other agents’ paths and generate bounded suboptimal solutions. Enhanced CBS (ECBS) [6] utilizes the same low-level search as BCBS and applies focal search to the high-level search of CBS, aiming to minimize the number of nodes from the current constraint tree (CT) node to the CT node representing the conflict-free solution. Similar to BCBS, ECBS also produces bounded suboptimal solutions.

Explicit Estimation CBS (EECBS) [7] employs online learning to estimate the cost of the solution beneath each high-level node. It utilizes Explicit Estimation Search (EES) [8] to select high-level nodes for expansion, mitigating two drawbacks of ECBS: (1) the cost of the solution beneath a CT node N might exceed the cost of N and thus could surpass the suboptimality bound; (2) the lower bound of ECBS seldom increases, leading to difficulties in finding a solution within a reasonable time if the optimal sum of costs is not within the initial suboptimality bound.

• Bypassing conflicts

Bypassing Conflicts [9] is a conflict resolution technique that alters the paths of agents involved in a chosen conflict instead of splitting a constraint tree (CT) node and searching for paths to avoid related constraints. When expanding a CT node N and generating child CT nodes, if the cost of a child CT node equals the cost of N and the number of conflicts of the paths in the child node is smaller than the number of conflicts of N, then the child node is used to replace N, and all generated child CT nodes are discarded. Otherwise, N is split as before. It has been demonstrated that bypassing conflicts often results in smaller CTs and reduces the runtime of CBS.

• Prioritizing conflicts

Prioritizing Conflicts [9] is a conflict-selection technique. A conflict is considered cardinal if, when CBS uses the conflict to split a constraint tree (CT) node N, the costs of both child CT nodes are larger than the cost of N. It is semi-cardinal if the cost of one child CT node is larger than the cost of N, and the cost of the other node is not. It is non-cardinal if the costs of both child CT nodes are equal to the cost of N. CBS can significantly enhance its efficiency by resolving cardinal conflicts first, then semi-cardinal conflicts, and finally non-cardinal conflicts. This prioritization is effective because generating CT nodes with larger costs first typically improves the lower bound of the CT (i.e., the minimum cost of the CT nodes in the open list) faster, thereby producing a smaller constraint tree and accelerating CBS.

• Symmetry reasoning

Symmetry Reasoning [10,11] is a technique aimed at avoiding the repeated resolution of conflicts between the same pair of agents due to symmetric paths and conflicts. It efficiently identifies each symmetry and resolves it through a single splitting action with specialized constraints. This process results in a smaller constraint tree and reduces the time cost of CBS.

• Weighted Dependency Graph (WDG)

The Weighted Dependency Graph (WDG) heuristic [12] is an admissible heuristic employed in the high-level search of CBS. It operates by constructing a weighted dependency graph for each constraint tree (CT) node N. In these graphs, vertices represent agents, and edges denote dependency between agents. This dependency is defined as follows: the minimum sum of costs of their conflict-free paths satisfying N’s constraints (computed via solving a 2-agent MAPF instance using CBS) is greater than the sum of costs of their paths in N’s paths (the shortest paths satisfying N’s constraints but not necessarily conflict-free). The edge weights signify the difference between the minimum sum of costs of conflict-free paths satisfying N’s constraints and the sum of costs of their paths in N’s paths.

The value of the edge-weighted minimum vertex cover of the graph serves as an admissible heuristic for the high-level search of CBS. Despite the runtime overhead associated with building the weighted dependency graphs and determining their edge-weighted minimum vertex cover, incorporating the WDG heuristic often leads to a reduction in constraint tree size and a decrease in the runtime of CBS.

2.2. Large Neighborhood Search

Large Neighborhood Search (LNS) is a classic algorithm for finding good solutions to challenging discrete optimization problems. Li et al. [13] proposed the first (to our best knowledge) LNS approaches for MAPF, MAPF-LNS.

MAPF-LNS starts with a given solution and deletes a portion of it, referred to as a neighborhood, while treating the remaining part as fixed. This simplifies the original problem, making it easier to solve. If the newly found solution is superior to the current one, it replaces the existing solution. Consequently, MAPF-LNS is capable of iteratively enhancing solution quality towards near-optimality. Its completeness depends on the algorithm used to generate initial paths. During each iteration, LNS selects a subset of agents and updates their paths without modifying those of other agents. Thus, MAPF-LNS operates as a serial MAPF method. MAPF-LNS can employ any desired approach to solve the simplified problem, provided it can account for the fixed information.

Building on this work, Li et al. introduced MAPF-LNS2 [3], which efficiently finds a solution (rather than improving a given solution) for a MAPF instance. Initially, MAPF-LNS2 invokes a MAPF algorithm to solve the instance and acquire a (partial or complete) plan. For agents lacking a path, MAPF-LNS2 devises a path that minimizes collisions with existing paths.

2.3. Priority Based Search

Priority-Based Search (PBS) [4] is an incomplete, suboptimal algorithm designed for prioritized planning. It employs a depth-first search at the high level to dynamically construct a priority ordering, thereby forming a priority tree (PT). When confronted with a collision, PBS greedily determines which agent should be assigned a higher priority. It efficiently backtracks and explores alternative branches only if no solution is found in the current branch. Consequently, PBS incrementally constructs a single partial priority ordering until all collisions are resolved.

Once each agent is assigned a unique priority, PBS computes a minimum-cost path (in priority order) from its starting vertex to its target vertex without colliding with the paths of agents with higher priorities that have already been planned. Therefore, PBS operates as a serial MAPF method.

PBS ranks among the most efficient methods for solving MAPF. However, prioritized planning with an arbitrary priority ordering does not guarantee completeness or optimality in general [4].

• Greedy PBS

Given that PBS becomes less effective for MAPF instances with high densities of agents and obstacles, [14] introduced Greedy PBS (GPBS). GPBS utilizes greedy strategies to enhance PBS by minimizing the number of collisions between agents. In essence, GPBS employs the numbers of conflicts and conflicting pairs as heuristics to guide the search on its low and high levels, respectively.

PBS can be regarded as a special decomposition of MAPF instances, where each subproblem involves only one agent. Unlike PBS, our decomposition does not necessitate that every subproblem contain only one agent, so our decomposition has less loss of solvability. Additionally, our decomposition serves as an auxiliary component rather than an isolated method, making it applicable to a wide range of MAPF algorithms.

2.4. PIBT and LaCAM

2.4.1. Priority Inheritance with Backtracking (PIBT)

PIBT [15] is an incomplete and suboptimal MAPF method that assigns a unique priority to each agent at every timestep. This prioritization ensures that all movements are ordered, and all agents act within a single timestep according to their priority, making it a parallel MAPF method. Priority inheritance is utilized to effectively handle priority inversion in path adjustment within a small time window, and a backtracking protocol prevents agents from becoming stuck.

Based on priorities and assuming only local interactions, PIBT is easily adaptable to decentralized contexts. Intuitively, PIBT is well-suited for large environments where intensive communication is prevalent, and path efficiency improves with a lower density of agents.

In PIBT, for each timestep, each agent must evaluate distances from surrounding nodes to its goal. While this operation could be implemented by calling A* on demand, it may also become a bottleneck. To address this issue, the authors of PIBT proposed PIBT+ [16], which saves computation time by preparing distance tables from each agent’s goal.

2.4.2. Lazy Constraints Addition for MAPF (LaCAM)

LaCAM [17] is a complete and suboptimal two-level MAPF method. At the high level, it explores a sequence of configurations, where each search node corresponds to one configuration. At the low level, it searches for constraints specifying which agents go where in the next configuration, making it a parallel MAPF method.

For each high-level node, LaCAM performs a low-level search that generates constraints determining the agents’ positions in the next configuration. Successors at the high level (i.e., configurations) are lazily generated while adhering to constraints from the low level, resulting in a significant reduction in search effort. The successor generator of LaCAM is implemented using PIBT.

It is noteworthy that when LaCAM encounters a known configuration, it reinserts the corresponding high-level node into the open set (which stores nodes for high-level search). This action prevents repeated appearance of configurations, thereby preventing LaCAM from avoiding conflicts with external paths (external path: path not belonging to current MAPF instance), as all agents may need to wait at times to avoid conflicts with external paths.

Okumura et al. proposed LaCAM2 [18], which introduces two enhancements to LaCAM. The first enhancement is its anytime version, called LaCAM*, which converges to optimality eventually, provided that solution costs accumulate transition costs. The second enhancement adds a swap action to the successor generator of LaCAM, enabling the quick generation of initial solutions.

2.5. Independence Detection

The idea of splitting a MAPF problem into multiple smaller subproblems has been explored by researchers in recent years. Standley et al.[19,20] proposed that if the optimal paths of two agents have no conflicts, they can be solved independently. Standley[20] introduced an independence detection (ID) algorithm to decompose a group of agents into the smallest possible groups. ID begins by assigning each agent to its own group and finds an initial path for each group independently. It then attempts to find alternative paths to avoid conflicts, via classic MAPF methods. If attempts to find conflict-free paths fail, ID merges the conflicting groups. The process continues until there are no conflicts between agents from different groups.

Sharon et al.[21] proposed a continuum between CBS and ID called Meta-Agent CBS (MA-CBS). MA-CBS introduces a predefined parameter B, where conflicting agents are merged into a meta-agent and treated as a joint composite agent if the number of conflicts exceeds B. The original CBS algorithm corresponds to the extreme case where B = ∞ (never merge agents), while the ID framework[19,20] represents the other extreme, where B = 0 (always merge agents when conflicts occur).

Compared to ID, our method considers the order of solving subproblems, thus our method have more possibility to decomposeMAPF instance into smaller subproblems. And our decomposition is decoupled from MAPF methods, while ID may need run multiple times of MAPF methods for an agent to find conflict-free paths. So in terms of time cost, our method has significant advantages. Since ID only considers making a group avoid conflict with one group at one time, a group may have conflict with a group it avoided before. For example, group A avoid conflicts wth group B, but A may have conflicts with B after A tries to avoid conflicts with group C. So it may be trapped into infinite loops. This phenomenon is not frequent when there are a few agents but more likely to happen if there are dense agents. In results, we compare our method with ID. And our method demonstrates significant advantages comparing to ID, in terms of max subproblem size, time cost and success rate to find conflict-free solution.

2.6. SEQ(Sequence) and DSP(Delayed Shortest Path)

Online Multi-Agent Path Finding (MAPF)[22] is the online version of MAPF where new agents appear over time. One kind of Online MAPF assumes when an agent reaches its goal it stays there, this results in a setting similar to Lifelong MAPF[23]. Another kind of Online MAPF assumes an agent disappears when reaching its goal.

For the second kind of online MAPF, non-conflicting paths can be generated by considering the agents sequentially according to a given order, taking their shortest paths from start to goal location and adding delays at the beginning of the paths to avoid conflicts. The algorithm SEQ (SEQUENCE)[24] does that but, for each agent, naively chooses a relative delay that is equal to the length of the shortest path of the agent that precedes it in the order. Hence, in SEQ, one agent at a time follows its shortest path to destination while the other agents wait. Delayed Shortest Path (DSP)[25] improve SEQ by calculating safe delays, i.e. relative delays that are possibly shorter than the shortest path of the previous agent in the order but are long enough to safely avoid conflicts.

DSP and SEQ use a given priority order for the agents. There are several types of priority order:

SH (Shortest path first) gives a higher priority to agents with shorter paths.
LH (Longer path first) is the opposite of SH and gives a higher priority to agents with longer paths. RND prioritizes the agents randomly.
LD (Lowest delays first) is a greedy method that prioritizes agents according to the lowest safe delay.

The similarity between our method and SEQ and DSP is that we also decompose a MAPF instance to subproblems with a priority order and introduce delays to get conflict-free solution. However, our method focuses on classic MAPF (i.e., offline MAPF), each agent stays at its start position before move and stay at its target position after finish its path. The biggest difference between offline MAPF and online MAPF is, agents staying in start or target positions may block other agents’ path. So essentially, our method focus on different problem comparing to SEQ and DSP.

3. Preliminaries

3.1. Basic Definitions

In this section, we present fundamental definitions of multi-agent pathfinding. Since our method is dimension-independent, the provided definitions apply to any-dimensional cell space

C_{N}

.

• Grid space

Let

C_{N}

(

N \geq 2

) denote a finite

N

-dimensional integer Euclidean space, where the size of the space is defined as

D

, with

D = {d_{1}, d_{2}, \dots, d_{i}, \dots, d_{N}}

and

d_{i} \in N

. The coordinates of an element

g

in this space are defined as a vector

(x_{1}, x_{2}, \dots, x_{i}, \dots, x_{N})

, where

x_{i} \in ([0, d_{i}) \cap N)

.

• Cell states

There are only two possible states for a cell/element in

C_{N}

: passable or unpassable. The set of all passable cells in

C_{N}

is denoted as

F

, while the set of all unpassable cells is denoted as

O

. Therefore,

F \cup O = C_{N}

,

F \cap O = \emptyset

.

• Agents

Assuming there are k agents

A = {a_{1}, a_{2}, \dots, a_{k}}

in the grid space

C_{N}

, where each agent always occupies a passable cell. For convenience, we denote starts or target state of an agent

a_{i}

as

S [a_{i}]

or

T [a_{i}]

.

Each agent has a unique starting cell and a targeting cell. Specifically, for all

i \in {1, 2, \dots, k}

and

j \in {1, 2, \dots, k}

with

i \neq j

, we have

S [a_{i}] \neq S [a_{j}]

,

T [a_{i}] \neq T [a_{j}]

,

S [a_{i}] \neq T [a_{j}]

, and

T [a_{i}] \neq S [a_{j}]

.

• Timestep

Time is discretized into timesteps. At each timestep, every agent can either move to an adjacent cell or wait at its current cell.

• Path

A path

p_{i}

for agent

a_{i}

is a sequence of cells that every pair of subsequent cells is adjacent or identical (indicating a wait action), starting at the start cell

s_{i}

and ending at the target cell

t_{i}

. The cost of a path is the number of time steps it takes, i.e., the size of the path sequence

| p_{i} |

. We assume that the agents remain at their targets forever after completing their paths.

• Conflict

We say that two agents have a conflict if they are at the same cell or exchange their positions at the same timestep (e.g.,

p_{i} [t] = = p_{j} [t + 1]

and

p_{i} [t + 1] = = p_{j} [t]

, where

p_{i}

and

p_{j}

means path of agent

a_{i}

and

a_{j}

,

p_{i} [t]

means

a_{i}

’s location at timestep t).

• Solution

A solution of a MAPF instance is a set of conflict-free paths

{p_{1}, p_{2}, \dots, p_{k}}

, one for each agent. Generally, an optimal solution is one with minimal sum of costs (SOC)

\sum_{i = 1}^{k} | p_{i} |

or makespan

\underset{i \in {1, 2, \dots, k}}{argmax} | p_{i} |

. In this manuscript, we consider the effects of decomposition on both SOC and makespan.

• Solvability

If there exists a solution for a MAPF instance, it is solvable. In the manuscript, we only consider the decomposition of solvable MAPF instances.

A MAPF method is considered complete if it can always find a solution for a solvable problem; otherwise, it is incomplete. For example, EECBS is complete while PBS is incomplete.

A quick method to determine whether a MAPF instance is solvable is whether every agent can find a collision-free path (i.e., pass no unpassable cells) from the grid space. It is noteworthy that passing through other agents’ starting or target cells is acceptable.

Here, we define a search path function:

Definition 1.

s e a r c h_p a t h (a_{i}, C_{N}, a v o i d_n o d e_s e t)

: a complete method to search a path in

C_{N}

that connects the start state and target state of agent

a_{i}

. The parameter

a v o i d_n o d e_s e t

refers to the set of nodes that cannot be part of the path. If a solution exists, we denote this as

s e a r c h_p a t h (a_{i}, C_{N}, a v o i d_n o d e_s e t) \neq \emptyset

.

s e a r c h_p a t h

could be implemented using Best-First Search or Breadth-First Search. If a LA-MAPF instance is solvable, then for all agents

a_{i} \in A

, it holds that

s e a r c h_p a t h (a_{i}, C_{N}, \emptyset) \neq \emptyset

.

Here is an example about how to check a MAPF instance is solvable, as shown in Figure 2. We notice if a MAPF instance failed to meet this condition, it must be unsolvable; however, if a instance meet this condition, it might be unsolvable, an example is shown in Figure 3. So our decomposition minimizing every subproblem’s possibility of unsolvable by making every subproblem meet this condition.

3.2. Decomposition of MAPF Instance

Given a MAPF instance’s graph have N nodes and k agents, and the maximum length of path are T, the total number of solutions (including solution that have conflicts) is

N^{T k}

, so the state space grows exponentially with agent size increases. In the worst cases, a complete MAPF algorithm need to travelsal all solutions to get a conflict-free solution. So the time cost of solve MAPF instance grows nearly exponentially as the number of agents increases[3].

To reduce the cost of solving MAPF instance, we proposed a new technique: splitting a MAPF instance into multiple subproblems, each with fewer agents but under the same map, and solving these subproblems in a specific order. Here, we define the number of agents in each subproblem as the size of the subproblem. When solving a subproblem, it is crucial to ensure that the current paths do not conflict with the paths from previous subproblems. If every subproblem is solvable, and every subproblem’s solution doesn’t pass previous subproblem’s target and later subproblem’s start, we can combining all subproblems’ results (by adding wait action at the begining of each subproblem’s solution, related proof and examples can be found in Section 4) yields the solution of the original MAPF instance. If we can’t split the instance into subproblems that pass the mentioned solvability check, we don’t split it.

As the time cost of solving a MAPF instance grows nearly exponentially with the number of agents, we need to minimize the size of each subproblem to reduce the total time cost of solving all subproblems. So the criteria for judging whether one decomposition is better than another involves sorting the subproblems’ sizes in decreasing order and comparing the sizes of the subproblems from the largest to the smallest. We define a decomposition that not worse than any other decomposition of this instance as a optimal decomposition. There might more than one optimal decomposition for a MAPF instace, when two decompositions have the same size of subproblems but subproblem contain different agents. The first smaller subproblem encountered indicates the better decomposition.

For example, suppose there are two decompositions of a MAPF instance with 100 agents. The sizes of the subproblems for the first decomposition are 40, 20, 15, 14, and 11, while for the second decomposition, they are 40, 20, 19, 13, and 8. The first decomposition is considered better than the second one because the third subproblem’s size (15) in the first decomposition is smaller than the second decomposition’s third subproblem size (19).

Considering all agent plans together can get better solution via coordination all agent’s path while solving them separately can’t coordinate agent of different subproblems, solving all agents together tends to finding a shorter solution compared to solving them separately. Therefore, MAPF methods with decomposition of MAPF instances tend to produce worse solutions than raw methods. However, we provide no theoretical analysis regarding the extent of this degradation in this manuscript; hence, we conduct empirical comparisons through extensive testing in Section 6.

Here we define the decomposition of a MAPF instance:

Definition 2.

Decomposition of a MAPF instance: Splitting a set of k agents,

A = {a_{1}, a_{2}, \dots, a_{k}}

, where

k \geq 1

, into m subsets of agents,

A_{1}, A_{2}, \dots, A_{m}

, where

m \geq 1

and

A_{1} \cup A_{2} \cup \dots \cup A_{m} = A

. Each subset (i.e., subproblem) should be solvable independently without updating the solutions of the other subsets. A decomposition of a MAPF instance is considered legal if each subproblem passes the solvability check.

Before discussing how to decompose a MAPF instance, we first introduce the criteria for determining whether a decomposition is legal.

We simplify this by allowing the agents in each subproblem to start moving only after all agents in the previous subproblems have reached their target states, while avoiding the start states of subproblems that have not yet been solved. In other words, in this simplified scenario, each subproblem treats the agents from other subproblems as static obstacles (remaining at their start state if solved later than the current subproblem, or staying at their target state if solved earlier). An example is shown in Figure 4.

If all subproblems of a decomposition pass the mentioned solvability check under this simplified scenario, we say that we have found a legal decomposition of the MAPF instance.

If every subproblem passes the solvability check under this simplified scenario, and every subproblem is solved, we can obtain a conflict-free total solution by combining the solutions of each subproblem. Specifically, making an agent from a subproblem wait at the start state until the agents from previous subproblems have reached their target states in the spatial-time map ensures a conflict-free total solution. An example is shown in Figure 5.

However, it is noteworthy that the mentioned solvability check is not always accurate; it may misjudge unsolvable subproblems as solvable, resulting in a loss of solvability. An example is shown in Figure 6. Based on empirical observations, this phenomenon becomes more frequent in instances with dense agents. However, the decomposition of instances also becomes less effective in cases with many dense agents (which means the size of the largest subproblem will be close to the total number of agents, or the instance may not be split). In such cases, the side effect of loss of solvability is sometimes offset.

3.2.1. Cluster and Level

Definition 3.

Here, we define two types of subproblems: cluster, a type of subproblem with no restrictions on the order of solving; and level, a type of subproblem that must be solved in a specific order.

For convenience, we denote all start or target states of a cluster c and a level l as

S [c]

or

T [c]

, and

S [l]

or

T [l]

, respectively.

A brief example of clusters and levels is shown in Figure 7. A MAPF instance may be decomposed into multiple clusters,

C = {c_{1}, c_{2}, \dots, c_{m}}

, or multiple levels,

L = {l_{1}, l_{2}, \dots, l_{n}}

, where

1 \leq m \leq k

and

1 \leq n \leq k

, with k representing the total number of agents in the current instance.

To ensure that the decomposition of a MAPF instance into clusters or levels is legal, both the clusters and levels must satisfy certain conditions.

Theorem 1.

Legal condition for decomposition into clusters:

If a MAPF instance is decomposed into multiple clusters

C = {c_{1}, c_{2}, \dots, c_{m}}

, then

\forall c_{i} \in C

, the

a v o i d_n o d e_s e t

is defined as

a v o i d_n o d e_s e t = {s ∣ s \in {⋃_{j = 0}^{m} S [c_{j}] \cup ⋃_{j = 0}^{m} T [c_{j}]}, j \neq i} .

If

\forall a \in c_{i}, s e a r c h_p a t h (a, C_{N}, a v o i d_n o d e_s e t) \neq \emptyset

, then the decomposition of the MAPF instance into clusters is considered legal.

Figure 7. Figure A and B show an example about two kinds of subproblem (cluster and level) in MAPF that can be solved independently.

For MAPF instance in Figure A, we can solve

a_{1}

and

a_{2}

separately and have no limitation in order of solve. If we solve

a_{1}

first, a conflict-free solution might be

a_{1}

=

{(1, 2) \to (1, 1) \to (1, 0)}

, and

a_{2}

=

{(0, 1) \to (0, 1) \to (1, 1) \to (2, 1)}

. If we solve

a_{2}

first, a conflict-free solution might be

a_{1}

=

{(1, 2) \to (1, 2) \to (1, 1) \to (1, 0)}

, and

a_{2}

=

{(0, 1) \to (1, 1) \to (2, 1)}

. So we say MAPF instance in Figure A can be decompose into two clusters, cluster 1 =

{a_{1}}

, cluster 2 =

{a_{2}}

.

For MAPF instance in Figure B, we can solve

a_{1}

and

a_{2}

separately but there is a limitation in order of solve. If we solve

a_{1}

first, the solution of

a_{1}

=

{(1, 2) \to (1, 1)}

, then

a_{2}

find no conflict free solution while considering

a_{1}

as dynamic obstacles. If we solve

a_{2}

first, a conflict-free solution might be

a_{1}

=

{(1, 2) \to (1, 2) \to (1, 1)}

, and

a_{2}

=

{(0, 1) \to (1, 1) \to (2, 1)}

. So we say MAPF instance in Figure B can only be decompose into two levels, level 1 =

{a_{2}}

, level 2 =

{a_{1}}

. Agents in level 1 must be solved before level 2 to pass the solvability check condition and ensure the solvability of subproblems.

In other words, if the agents in each cluster have paths that do not pass through the start or target states of agents in other clusters, this subproblem passes the mentioned solvability check, and the decomposition of the MAPF instance into clusters is considered legal.

Proof.

If the agents in each cluster have paths that do not pass through the start or target states of agents in other clusters, then the decomposition is legal under the mentioned simplified scenario, and the decomposition of the MAPF instance into clusters is considered legal.

□

Theorem 2.

Legal condition for decomposition into levels:

If a MAPF instance is decomposed into multiple levels

L = {l_{1}, l_{2}, \dots, l_{n}}

, then for each

l_{i} \in L

, the

a v o i d_n o d e_s e t

is defined as

a v o i d_n o d e_s e t = {s ∣ s \in {⋃_{j = 0}^{i - 1} T [l_{j}] \cup ⋃_{j = i + 1}^{m} S [l_{j}]}} .

If

\forall a \in l_{i}, s e a r c h_{p} a t h (a, C_{N}, a v o i d_{n} o d e_{s} e t) \neq \emptyset

, then the decomposition of the MAPF instance into levels is considered legal.

In other words, if the agents in each level have paths that do not pass through the target states of agents in previous levels or the start states of agents in subsequent levels, then the decomposition into levels is considered legal.

Proof.

If the agents in each level have paths that do not pass through the target states of agents in previous levels or the start states of agents in subsequent levels, then the decomposition is legal under the mentioned simplified scenario, and the decomposition of the MAPF instance into levels is considered legal. □

It is noteworthy that the legal requirements for decomposing into clusters and levels do not guarantee the solvability of subproblems. The legal requirements are used to avoid unsolvable subproblems that cannot pass the mentioned solvability check.

3.2.2. Connectivity graph

As our main focus is on whether an agent’s path passes through the start or target states of other agents, and to reduce the time cost of searching for paths, we divide the passable cells into two types: one being the agent’s start or target state, and the other being the remaining cells, which we refer to as free cells.

We then use connected component detection to split all free cells into subsets (which we call free cell groups), where every cell in a group has a path to another cell in the same group without passing through any agent’s start or target state, or cells from other groups.

We then propose a connectivity graph,

G_{c}

, whose nodes consist of free cell groups and the start and target states of agents, with edges representing the connectivity between these nodes. Since

G c

has a smaller size than

C N

, the time cost of searching for paths can be significantly reduced, especially for instances with sparse agents.

In the implementation, since free cell groups do not influence whether an agent’s dependent path passes through other agents’ states, we simplify

G_{c}

by directly connecting the nodes representing agent states if they can be connected via free cell groups. After simplification,

G c

has

2 k

nodes, where k is the number of agents. Compared to

C N

, the simplified

G_{c}

has a significantly smaller size, which accelerates the process of determining whether two agents can be in different subproblems.

An example of the connectivity graph is shown in Figure 8.

3.2.3. Variants of Search Path

To reduce the time cost of searching for paths, we search for paths in

G c

rather than in

C N

. We denote the paths in

G_{c}

as dependence paths.

Definition 4.

The dependence path of an agent

a_{i}

is defined as the path from

s_{i}

to

t_{i}

, consisting of nodes in

G_{c}

.

As an example, the dependence path for agent 1 in the instance shown in Figure 8 is “

S_{1} \to T_{2} \to F_{1} \to T_{3} \to T_{1}

"; for agent 2, its dependence path is “

S_{2} \to F_{2} \to T_{3} \to F_{1} \to T_{2}

"; and for agent 3, the dependence path is “

S_{3} \to F_{1} \to T_{3}

".

The dependence path of an agent reflects its relationship with other agents. For a valid decomposition, each agent’s path must not intersect with the targets of previous subproblems or the starts of later subproblems. The dependence paths of agents determine whether two agents must be solved in the same subproblem or the order in which they are solved across different subproblems. Specifically, there are four cases:

1, if the dependence path of agent

a_{i}

contains only the start of agent

a_{j}

, it indicates that

a_{i}

should be solved after

a_{j}

to enable they could be in different subproblems;

2, if the dependence path of agent

a_{i}

contains only the target of agent

a_{j}

, it implies that

a_{i}

should be solved before

a_{j}

to enable they could be in different subproblems;

3, if the dependence path of agent

a_{i}

contains both the start and target of agent

a_{j}

, it signifies that

a_{i}

and

a_{j}

must be solved in the same subproblem;

4, if the dependence path of agent

a_{i}

contains neither the start nor the target of agent

a_{j}

, it suggests that

a_{i}

and

a_{j}

can be solved in different subproblems, and the order of solving them does not matter.

It is worth noting that an agent’s dependence path is not unique. By updating their dependence paths, we can alter the relationship between two agents (in implementation, we check for the existence of such paths rather than explicitly maintaining all agents’ dependence paths). By changing the relationship between two agents, we can update the subproblems of a MAPF instance’s decomposition. Considering that making an agent’s dependence path avoid another agent might cause the path to pass through a third agent, and thus fail to generate smaller subproblems, we have no direct method to ensure that agents’ dependence paths correspond to smaller subproblems, we have proposed several approaches to encourage the generation of smaller subproblems; their details can be found in Section 4.

Here, we define two variants of the search path function used for decomposing the LA-MAPF instance into clusters:

Definition 5.

s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, a v a i l a b l e_a g e n t s, a v o i d_a g e n t s)

, where

a v a i l a b l e_a g e n t s

represents agents whose start or target states the dependence path of

a_{i}

can pass through, and

a v o i d_a g e n t s

represents agents whose start or target states the dependence path of

a_{i}

cannot pass through. If

a v a i l a b l e_a g e n t s = \emptyset

, it means the path can pass through all nodes in

G_{c}

, except those that belong to the start or target states of agents in

a v o i d_a g e n t s

.

s e a r c h_p a t h_a g e n t

performs a complete graph search on

G_{c}

and returns the agents whose dependence paths involve

a_{i}

. If no such dependence path exists,

s e a r c h_p a t h_a g e n t

returns ∅.

Similarly, we define a variant of

s e a r c h_p a t h

used in decomposing the instance into levels:

Definition 6.

s e a r c h_p a t h_S A T (a_{i}, G_{c}, a v a i l a b l e_S A T s, a v o i d_S A T s)

, where

a v a i l a b l e_S A T s

represents the start or target states that the dependence path of

a_{i}

can pass through, and

a v o i d_S A T s

represents the start or target states that the dependence path of

a_{i}

cannot pass through. If

a v a i l a b l e_S A T s = \emptyset

, it means the path can pass through all nodes in

G_{c}

, except those that belong to the start or target states of agents in

a v o i d_{S} A T s

.

s e a r c h_p a t h_S A T

also performs a complete graph search on

G_{c}

and returns the start or target states of agents whose dependence paths are involved with

a_{i}

. If no such dependence path exists,

s e a r c h_p a t h_S A T

returns ∅.

Here are some examples of the variants of the search path for the instance in Figure 8:

s e a r c h_p a t h_a g e n t (a_{1}, G_{c}, {a_{1}, a_{2}, a_{3}}, \emptyset) = {a_{1}, a_{2}, a_{3}}

;

s e a r c h_p a t h_a g e n t (a_{1}, G_{c}, \emptyset, {a_{2}}) = \emptyset

(since

a_{1}

’s dependence path must pass through

T [a_{2}]

);

s e a r c h_p a t h_a g e n t (a_{1}, G_{c}, {a_{1}, a_{3}}, \emptyset) = \emptyset

(since

a_{1}

’s dependence path must pass through

T [a_{2}]

);

s e a r c h_p a t h_S A T (a_{1}, G_{c}, \emptyset, \emptyset) = {S [a_{1}], T [a_{1}], T [a_{2}], T [a_{3}]}

;

s e a r c h_p a t h_S A T (a_{2}, G_{c}, \emptyset, \emptyset) = {S [a_{2}], T [a_{2}], T [a_{3}]}

;

s e a r c h_p a t h_S A T (a_{2}, G_{c}, \emptyset, {T [a_{3}]}) = \emptyset

(since

a_{2}

’s dependence path must pass through

T [a_{3}]

);

s e a r c h_p a t h_S A T (a_{2}, G_{c}, {S [a_{2}], T [a_{2}], T [a_{3}]}, \emptyset) = {S [a_{2}], T [a_{2}], T [a_{3}]}

;

There are two main advantages of

s e a r c h_p a t h_a g e n t

and

s e a r c h_p a t h_S A T

compared to

s e a r c h_p a t h

:

$G_{c}$ has a smaller size compared to $C_{N}$ (e.g., $C_{N}$ in Figure 8 has 4*4=16 nodes, while the related simplified $G_{c}$ has only 2*3=6 nodes), so $s e a r c h_p a t h_a g e n t$ and $s e a r c h_p a t h_S A T$ are faster compared to $s e a r c h_p a t h$ .
$s e a r c h_p a t h_a g e n t$ and $s e a r c h_p a t h_S A T$ ’s return values provide direct information about the current agent’s relationship with other agents, which is very useful in determining whether two agents must be in the same subproblem.

4. Methodology

As mentioned previously, we have not identified a direct method to decompose a MAPF instance into multiple subproblems. Thus, we initiate the process with an initial decomposition, progressively refining it to generate smaller subproblems while ensuring that each step pass the mentioned solvability check. We define two types of subproblems encountered in various stages: clusters, which have no restrictions on the order of solving, and levels, which impose limitations on the order of solving.

The decomposition of a MAPF instance involves three steps:

1, identifying all agents’ dependence path and establishing initial clusters based on them;

2, refining agent dependencies to bipartition clusters until they cannot be further divided into smaller clusters;

3, further refining agent dependencies to decompose clusters into levels and sorting.

Step 1 and 2 are described in Section 4.1, and step 3 is described in Section 4.3.

An overview of the decomposition process of a MAPF instance is depicted in Figure 1.

Following decomposition, we explore methods to solve subproblems independently and then combine their results to obtain a conflict-free solution for the original MAPF instance, as detailed in Section 4.4.

4.1. Decomposetion to clusters

The first step is to determine the connectivity graph

G_{c}

from a grid space

C_{N}

and the related agents

A

. We then determine initial clusters based on the dependence paths derived from the connectivity graph.

Definition 7.

Relevance of two agents: If an agent

a_{i}

’s dependence path contains another agent

a_{j}

’s start or target, we consider those two agents as relevant, regardless of whether

a_{j}

’s dependence path contains

a_{i}

’s start or target.

It is noteworthy that the relevance of two agents is determined by their dependence paths, so whether two agents are relevant changes as their dependence paths change.

Definition 8.

Graph of agents’ relevance

G_{a}

: an undirected graph where nodes represent agents and edges represent whether two agents are relevant.

Example about

G_{a}

can be found in Figure 10.

It is important to note that the relevance of two agents is determined by

a v a i l_a g e n t s

,

a v o i d_a g e n t s

, and

G_{c}

. While the subgraph

G_{c}

is constant,

a v a i l_a g e n t s

and

a v o i d_a g e n t s

are variables, meaning that the relevance between two agents may change as these sets are modified.

Essentially, based on the definition of a cluster, a cluster is a maximal connected component in

G_{a}

. Intuitively, a cluster is a set of agents from the raw MAPF instance, where any agents within it are only relevant to agents within the same set (based on their dependence paths). Additionally, we refer to an agent set that constitutes a cluster as an independent agent set.

An intuitive way to obtain clusters is by generating clusters from the graph of agent’s relavance determined by the dependence paths that visit the fewest number of agents for each agent. We refer to such clusters as initial clusters(Line 1 ∼ 8 in Algorithm 1). An example about initialize of clusters are shown in Figure 9.

However, it is evident that selecting dependence paths containing the fewest agents may not always result in the smallest possible clusters, as it does not consider how to guide dependence paths to avoid agents from other clusters.

There is room for decomposing these initial clusters into smaller ones by updating the dependence paths of agents. Thus, we propose a method to iteratively bipartition initial clusters until further subdivision is not possible, aiming to minimize the size of subproblems. More details about bipartitioning can be found in the following section. This process ensures the legality of decomposition, and the more bipartitioning steps, the better the results obtained, although it does not guarantee the discovery of the optimal decomposition. An overview of the process of decomposing an instance into clusters is outlined in Algorithm 1.

Figure 9. These figures show the initialization of clusters in a simple instance. Figure A illustrates the instance, where white cells represent passable cells and grey cells represent unpassable cells. The notations “

S_{i}

” and “

T_{i}

” denote the start and target states of agent

a_{i}

. Figure B displays the simplified connectivity graph of agents. Figure C shows the relevant agents for each agent, and Figure D presents the graph of agent relevance (

G_{a}

). Figure E shows the initial clusters determined by the graph of agent relevance, i.e., all connected components in

G_{a}

.

Figure 9. These figures show the initialization of clusters in a simple instance. Figure A illustrates the instance, where white cells represent passable cells and grey cells represent unpassable cells. The notations “

S_{i}

” and “

T_{i}

” denote the start and target states of agent

a_{i}

. Figure B displays the simplified connectivity graph of agents. Figure C shows the relevant agents for each agent, and Figure D presents the graph of agent relevance (

G_{a}

). Figure E shows the initial clusters determined by the graph of agent relevance, i.e., all connected components in

G_{a}

.

4.2. Bipartition of Clusters

This section is dedicated to decomposing a cluster into two smaller clusters. However, before delving into the decomposition process, it’s essential to introduce some necessary concepts.

Definition 9.

Unavoidable agents of an agent a in a cluster r refer to those agents within r that must be traversed by a’s dependence path. In other words, these agents must belong to the same cluster and cannot be further divided.

For example, for the instance in Figure 8,

a_{2}

and

a_{3}

are unavoidable agents of

a_{1}

, as

a_{1}

’s dependence path must pass through

T [a_{2}]

and

T [a_{3}]

.

Definition 10.

Unavoidance graph

G_{u}

of a cluster: an undirected representation of whether one agent is unavoidable to another agent within the cluster. Assume there are two agent

a_{i}

and

a_{j}

,

a_{i}

is unavoidable to

a_{j}

if

a_{j}

’s dependence path must pass

a_{i}

’s start or target, meanwhile all dependence paths cannot pass a agent’s start or target if the agent is not in current cluster. It is important to note that for a given cluster r, its unavoidance graph is unique.

Figure 10. These figures demonstrate the process of bipartitioning cluster

C_{1}

, as mentioned in Figure 9. Figure A illustrates the unavoidable agents for each agent. Figure B shows the related unavoidance graph

G_{u}

. Figures C, D, and E depict steps 2, 3, and 4 in the bipartition process of the cluster.

Figure 10. These figures demonstrate the process of bipartitioning cluster

C_{1}

, as mentioned in Figure 9. Figure A illustrates the unavoidable agents for each agent. Figure B shows the related unavoidance graph

G_{u}

. Figures C, D, and E depict steps 2, 3, and 4 in the bipartition process of the cluster.

An example of

G_{u}

can be found in Figure 10.

Both the unavoidance graph and the relevance graph depict relationships between agents within the cluster.

Definition 11.

Maximum unavoidable agents of cluster: the largest connected component of the unavoidance graph

G_{u}

associated with the cluster.

Intuitively, these maximum unavoidable agents represent the largest undividable subset within the current cluster, are referred to as “major set" during the cluster bipartition process. Agents within the cluster, excluding those belonging to the maximum unavoidable agents, are referred to as “remaining set" during the cluster bipartition process. An example of the maximum unavoidable agents of a cluster can be found in Figure 10.

It is noteworthy that during the bipartition process, agents in the remaining set will be moved to the major set, but agents in the remaining set will not move into the major set until both the major set and the remaining set are independent.

The bipartition of a cluster comprises four steps:

1, identify the maximum unavoidable agents of the cluster, referred to as the major set (Line 1 ∼ 12 in Algorithm 2);

2, examine each agent in the remaining set to determine if its dependence path must pass through agents of the major set. Move agents meeting this criterion to the major set, until the remain set is independent or empty(Line 16 ∼ 27 in Algorithm 2);

3, check any newly added agents to the major set to ascertain if their dependence paths must pass through agents in the remaining set. Transfer such agents from agents the remaining set to the major set, until the major set is independent(Line 29 ∼ 40 in Algorithm 2);

4, verify whether the major set and the remaining set both meet the legality requirements of the cluster (for every agent in a cluster, its dependence path does not pass through the start or target states of agents in other clusters). If they do (or if the remaining set is empty), exit and return the major set and the remaining set as the result of the bipartition. (Line 42 ∼ 44 in Algorithm 2) Otherwise, proceed to step 2;

The bipartition process of a cluster concludes when both sets satisfy the cluster’s requirements, or when there are no remaining agents, indicating that the cluster cannot be further decomposed into smaller clusters.

The pseudocode for bipartitioning a cluster is outlined in Algorithm 2. An example of the bipartition of a cluster is shown in Figure 10. The verification of the legality of these clusters is shown in Figure 11.

4.3. Decomposing to Levels and Sorting

Clusters can be solved irrespective of the order of solving, thus providing an opportunity to decompose MAPF into smaller problems by considering the order of solving. By considering the order of solving, we can divide two agents into different subproblems even if one agent’s dependence path contains the start or target of another agent. We refer to these smaller problems decomposed from clusters as levels. In this section, we focus on how to decompose clusters into levels and determine the order of solving. To facilitate this discussion, we introduce some new concepts.

Definition 12.

Order of solving agents: If an agent

a_{i}

must be solved before another agent

a_{j}

, we denote it as

a_{i} > a_{j}

; if

a_{i}

must be solved later than

a_{j}

, we denote it as

a_{i} < a_{j}

.

It is important to note that both

a_{i} > a_{j}

and

a_{i} < a_{j}

can coexist (indicating that both agents must be solved in the same problem) or neither can exist (indicating no order limitation between solving

a_{i}

and

a_{j}

).

The order of solving agents is determined by whether an agent’s dependence path contains another agent’s start or target. If

a_{i}

’s dependence path contains

a_{j}

’s start, it implies that

a_{i}

must be solved later than

a_{j}

to ensure that

a_{j}

’s start is not occupied (to enable

a_{i}

and

a_{j}

could be in different subproblem). Conversely, if

a_{i}

’s dependence path contains

a_{j}

’s target,

a_{i}

must be solved before

a_{j}

to ensure that

a_{j}

’s target is not occupied. Furthermore, the order of solving levels is determined by the order of solving agents within them.

Algorithm 1 Decomposition instance to clusters

Input:: $C_{N}, G_{c}$
Output: $R$
1:: $/ /$ determine initial clusters
2:: $P = {p_{1}, p_{2}, . . ., p_{k}}$ ; $/ /$ dependence path of all agents
3:: for i = 1, 2,...,k do
4:: $p_{i} = s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, \emptyset, \emptyset)$ ;
5:: end for
6:: get $G_{a}$ from P;
7:: $/ /$ get initial clusters
8:: $R_{i n i t}$ = all connected components of $G_{a}$ ;
9:: $R = \emptyset$ ; $/ /$ final clusters
10:: for $r \in R_{i n i t}$ do
11:: $t e m p_c l u s t e r = r$ ;
12:: while $t e m p_c l u s t e r \neq \emptyset$ do
13:: ${r_{m a j o r}, r_{r e m a i n}}$ = Bipartition of cluster ( $t e m p_c l u s t e r, G_{c}$ );
14:: add $r_{m a j o r}$ to $R$ ;
15:: $t e m p_c l u s t e r = r_{r e m a i n}$ ;
16:: end while
17:: end for
18:: return $R$ ;

Definition 13.

Solving order graph

G_{s}

of a cluster: a directed graph representing whether an agent must be solved before another agent. Its nodes are agents, and edges indicate the order of solving between agents. The structure of the solving order graph is determined by each agent’s dependence path. Specifically, if agent

a_{i}

must be solved before agent

a_{j}

, the corresponding edge in

G_{s}

is denoted as

a_{i} \to a_{j}

.

An example of the solving order graph can be found in Figure 12.

Essentially, a level is a strongly connected component of

G_{s}

. Intuitively, a level is a group of agents that form a loop in

G_{s}

, meaning that the agents within the loop must be solved simultaneously. For example, if three agents, A, B, and C, are in the same loop such that

A \to B \to C \to A

, it implies that A must be solved before B, B before C, and C before A. Therefore, A, B, and C must be solved together in parallel. Unlike clusters, where the order of solving is arbitrary, the order of solving levels is strictly determined.

The order of solve levels is determine by the edge in

G_{s}

that connect them.

The order of solving levels is determined by the edges in

G_{s}

that connect them.

Definition 14.

Order of solving levels: if a level

l_{i}

must be solved before another level

l_{j}

, we denote it as

l_{i} > l_{j}

; if

l_{i}

must be solved later than

l_{j}

, we denote it as

l_{i} < l_{j}

.

Definition 15.

Level ordering graph

G_{l}

is a directed graph whose nodes are levels, and edges indicate whether a level must be solved earlier than another level. If there is no edge connecting two levels, this implies that there is no explicit order to solve them, although there may be an implicit order. For example, if level A has no edge connecting to level C, but level A is connected to level B (A > B) and level B is connected to level C (B > C), it implies that level A must be solved earlier than level C.

Essentially,

G_{l}

serves as a condensed version of

G_{s}

.

There are five steps involved in decomposing a cluster into multiple levels and determining the order of solving:

Algorithm 2 Bipartition of cluster

Input: $r, G_{c}$
Output: ${r_{m a j o r}, r_{r e m a i n}}$
1:: $/ /$ step 1
2:: $G_{u} = \emptyset$ ;
3:: for $a_{i} \in r$ do
4:: for $a_{j} \in r$ do
5:: if $a_{i} \neq a_{j}$ then
6:: if $s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, r, {a_{j}}) = \emptyset$ then
7:: add $(a_{i} \to a_{j})$ as an edge to $G_{u}$ ;
8:: end if
9:: end if
10:: end for
11:: end for
12:: $r_{m a j o r}$ = largest connected component of $G_{u}$ ;
13:: $r_{r e m a i n} = r - r_{m a j o r}$ ;
14:: while $T r u e$ do
15:: $/ /$ step 2
16:: $n e w_a g e n t s = \emptyset$ ;
17:: while $T r u e$ do
18:: for $a_{i} \in r_{r e m a i n}$ do
19:: if $s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, r_{r e m a i n}$ , $r_{m a j o r}$ $) = \emptyset$ then
20:: move $a_{i}$ from $r_{r e m a i n}$ to $r_{m a j o r}$ ;
21:: add $a_{i}$ to $n e w_a g e n t s$ ;
22:: end if
23:: end for
24:: if $r_{r e m a i n}$ is independent $o r$ $r_{r e m a i n}$ is empty then
25:: break;
26:: end if
27:: end while
28:: $/ /$ step 3
29:: while $T r u e$ do
30:: for $a_{j} \in n e w_a g e n t s$ do
31:: if $s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, r_{m a j o r}, r_{r e m a i n}) = \emptyset$ then
32:: $/ /$ consider $a_{i}$ belong to cluster r, following search path always success
33:: $p_{i} = s e a r c h_p a t h_a g e n t (a_{i}, G_{c}, r, \emptyset$ );
34:: move agents in ( $p_{i} \cap r_{r e m a i n}$ ) from $r_{r e m a i n}$ to $r_{m a j o r}$ ;
35:: end if
36:: end for
37:: if $r_{m a j o r}$ is independent then
38:: break;
39:: end if
40:: end while
41:: $/ /$ step 4
42:: if both $r_{m a j o r}$ and $r_{r e m a i n}$ are independent then
43:: break;
44:: end if
45:: end while
46:: return ${r_{m a j o r}, r_{r e m a i n}}$ ;

Algorithm 3 Decomposing to levels and sorting

Input: $r, G_{c}$
Output: $L$
1:: $/ /$ determine initial dependence paths
2:: $P = \emptyset$ ; $/ /$ $P [a]$ means agent a’s denpendence path
3:: for agent a in r do
4:: $p_{i} = s e a r c h_p a t h_S A T (a_{i}, G_{c}, r, \emptyset)$ ;
5:: add $p_{i}$ to P;
6:: end for
7:: get $G_{s}$ from P;
8:: determine all loop in $G_{s}$ via strong component detect;
9:: $/ /$ sort levels;
10:: construct graph $G_{l}$ via $G_{s}$ ;
11:: $r o o t_l e v e l s$ = level in $G_{l}$ that not later than other level;
12:: set order of all level to 0;
13:: $l e v e l s$ = $r o o t_l e v e l s$ ;
14:: while levels is not empty do
15:: $n e x t_l e v e l s$ = ∅;
16:: for level in levels do
17:: for another_level that later than level in $G_{l}$ do
18:: if $a n o t h e r_l e v e l$ ’s order ≤ level’s order then
19:: $a n o t h e r_l e v e l$ ’s order = level’s order + 1;
20:: add $a n o t h e r_l e v e l$ to $n e x t_l e v e l s$ ;
21:: end if
22:: end for
23:: end for
24:: $l e v e l s$ = $n e x t_l e v e l s$ ;
25:: end while
26:: return all levels in the order of last visit to them;

Algorithm 4 Merge results

Input: ${P_{1}, . . ., P_{m}}$ $/ /$ assume there are m subproblems and their solution is $P_{i}$
Output: ${P_{1}, . . ., P_{m}}$
1:: set last occupied time of all cells to 0;
2:: for each solution P in ${P_{1}, . . ., P_{m}}$ do
3:: while True do
4:: t=1; $/ /$ current time index
5:: $/ /$ check whether insert wait action to P
6:: $n e e d_w a i t$ = False;
7:: $a l l_f i n i s h e d$ = True;
8:: for path p in P do
9:: if t > size of $p - 1$ then
10:: $/ /$ don’t check path that have reach target
11:: continue;
12:: else
13:: $a l l_f i n i s h e d$ = False;
14:: end if
15:: $/ /$ $p [t]$ means path p’s location at time t
16:: if last occupied time of $p [t] \geq t$ then
17:: $n e e d_w a i t$ = True;
18:: break;
19:: end if
20:: end for
21:: if all_finished then
22:: break;
23:: end if
24:: $/ /$ insert wait action to all path in P
25:: if need_wait then
26:: for path p in P do
27:: if t > size of $p - 1$ then
28:: continue;
29:: end if
30:: repeat $p [t]$ in p at time index t;
31:: end for
32:: else
33:: $/ /$ update last occupied time of cells
34:: for path p in P do
35:: if t > size of $p - 1$ then
36:: continue;
37:: end if
38:: set last occupied time of $p [t]$ to t;
39:: end for
40:: end if
41:: t = t + 1;
42:: end while
43:: end for
44:: return ${P_{1}, . . ., P_{m}}$ ;

Algorithm 5 Layered MAPF

Input: $C_{N}, A$ , MAPF
Output: P
1:: $/ /$ 1, decomposition of MAPF instance
2:: construct $G_{c}$ from $C_{N}, A$ ;
3:: R = Decomposing to cluster( $G_{c}$ );
4:: $L = \emptyset$ ;
5:: for cluster r in R do
6:: $L^{'}$ = Decomposing to leve and sorting(r, $G_{c}$ );
7:: merge $L^{'}$ to L;
8:: end for
9:: $a l l_s u b p r o b l e m_p a t h s$ = ∅;
10:: $/ /$ 2, solve subproblems
11:: forl in L do
12:: if MAPF is serial then
13:: set later subproblem’s start in $C_{N}$ to unpassable;
14:: $c u r r e n t_s o l u t i o n$ = MAPF( $C_{N}, l, a l l_s u b p r o b l e m_p a t h s$ );
15:: add $c u r r e n t_s o l u t i o n$ to $a l l_s u b p r o b l e m_p a t h s$ ;
16:: reset $C_{N}$ ;
17:: else
18:: set previous subproblem’s target and later subproblem’s start in $C_{N}$ to unpassable;
19:: $c u r r e n t_s o l u t i o n$ = MAPF( $C_{N}, l$ );
20:: add $c u r r e n t_s o l u t i o n$ to $a l l_s u b p r o b l e m_p a t h s$ ;
21:: reset $C_{N}$ ;
22:: end if
23:: end for
24:: $/ /$ 3, merge solutions of subproblems
25:: if MAPF is serial then
26:: P = $a l l_s u b p r o b l e m_p a t h s$ ;
27:: else
28:: P = Merge results( $a l l_s u b p r o b l e m_p a t h s$ );
29:: end if
30:: returnP;

1, determine each agent’s dependence path, and available agents constrained to the agents of the current cluster (Line 1 ∼ 6 in Algorithm 3);

2, obtain the solving order graph

G_{s}

from the dependence paths of the current cluster(Line 7 in Algorithm 3);

3, identify all strong components of

G_{s}

as levels (Line 8 in Algorithm 3);

4, determine the relationships between levels by examining the edges that connect them. Construct the level ordering graph

G_{l}

, which represents the order of solving between level (Line 9 ∼ 10 in Algorithm 3);

5, set levels that are not later than any other levels as root levels. Then, using Breadth First Search, traverse all levels in

G_{l}

, where the order of the last visit to each level represents the order of solving them (Line 11 ∼ 25 in Algorithm 3).

In implementation, Tarjan’s algorithm [26] is utilized to determine strongly connected components from a directed graph.

It is important to note that after completing all steps of decomposition, we ensure that the final subproblems do not compromise legality. The entire process of decomposing into levels and sorting them is depicted in Algorithm 3. An example of decomposing a cluster into levels can be found in Figure 12.

4.4. Solving and Combining

After completing the decomposition of a MAPF instance, the next step involves considering how to solve the subproblems separately and combine their results without conflicts. There are two main types of MAPF methods: serial MAPF methods and parallel MAPF methods. Serial MAPF methods, such as CBS-based methods, can use external paths as constraints to avoid conflicts, as they search for an agent’s path while the paths of other agents remain static. On the other hand, parallel MAPF methods, such as LaCAM and PIBT, cannot always treat external paths as dynamic obstacles to avoid, as they may avoid the same state (all agents’ locations) occur multiple times. For simplicity in this article, we do not set external paths as dynamic obstacles to avoid for all parallel MAPF methods.

In serial MAPF methods, we only need to merge separate results into a single set. However, for the second type of MAPF method, we need to add wait actions in the resulting paths to avoid conflicts between paths from different subproblems, as shown in Algorithm 4.

In detail, this algorithm maintains a table that records the last visited time step for each cell, in order to determine whether a subproblem’s solution collides with the solution of a previous subproblem. Starting from

t = 1

, it checks each agent’s solution to see if it conflicts with the solution of a previous subproblem (Lines 1 to 20 in Algorithm 4). If a solution collides with another solution at time t, a wait action is added at t to resolve the conflict (Lines 24 to 32 in Algorithm 4). Otherwise, the table is updated, and the algorithm checks whether there is a conflict with other solutions at time

t + 1

(Lines 33 to 39 in Algorithm 4), continuing until the last state is reached (i.e., the process is finished). Once all solutions are completed, the algorithm exits (Lines 21 to 23 in Algorithm 4).

In the following, we define serial MAPF as MAPF

(C_{N}, A, P)

, and parallel MAPF methods as MAPF

(C_{N}, A)

, where

A

represents agents and P represents external paths that need to be avoided.

The overall process of decomposing a MAPF instance into multiple subproblems (Line 1 ∼ 9 in Algorithm 5), solving the subproblems (Line 10 ∼ 23 in Algorithm 5), and merging their results (Line 24 ∼ 29 in Algorithm 5) is illustrated in Algorithm 5.

As mentioned earlier, our algorithm aims to minimize the loss of solvability in subproblems by introducing a legality check. Although we have not provided a theoretical analysis regarding the percentage of solvability loss, we offer empirical analysis based on extensive testing. This is detailed in the comparison between Layered LaCAM2 and raw LaCAM2 in Section 6.

4.5. Time Complexity Analysis

• Construction of connectivity graph

G_{c}

As mentioned in “Data Structures and Network Algorithms”[27], since Tarjan’s algorithm visits each node and edge only once, the time complexity of detecting connected components in a graph

G (V, E)

is

O (| V | + | E |)

.

Assuming a MAPF instance (

N

= 2), the map’s width and height are w and h, respectively. It contains at most

w \times h

cells and

4 \times w \times h

edges (when all cells are passable). Therefore, the time complexity to determine the

G_{c}

of an instance is

O (w \times h)

. By extension, for a MAPF instance with

N > 2

, the time complexity to determine

G_{c}

is

O (n)

, where n represents the total number of states (e.g., for

N

= 2, n is the area of the map; for

N

= 3, n is the volume of the map).

• Variants of path search

As mentioned earlier, the number of nodes in

G_{c}

is

2 k

, where k is the number of agents. In the worst case, the number of edges in

G_{c}

is

\frac{(2 k) (2 k - 1)}{2}

. According to Introduction to Algorithms [28], the time complexity of BFS is

O (| V | + | E |)

(where

| V |

and

| E |

represent the sizes of the nodes and edges, respectively). Therefore, the time complexity of

s e a r c h_{p} a t h_{a} g e n t

and

s e a r c h_{p} a t h_{S} A T

is

O (k^{2})

.

• Decomposing to clusters

In initialization of clusters, every agents need using

s e a r c h_p a t h_a g e n t

one time. So the time complexity of

s e a r c h_p a t h_a g e n t

are

k * O (k^{2}) = O (k^{3})

.

Considering there are k nodes and at most

\frac{k (k - 1)}{2}

edges in the graph of an agent’s relevance,

G_{a}

, and that the time complexity of detecting the connected components of a graph

G (V, E)

is

O (k^{2})

.

In the best case, there are k clusters, and each cluster has only one agent after detecting the connected components of

G_{a}

, so no further decomposition is needed. Therefore, the total time complexity of the decomposition in the best case is

O (k^{3})

(this usually happens in maps with sparse agents, such as the “warehouse-20-40-10-2-2" map and “Boston_0_256" in Section 5). However, in the worst case, further decomposition is needed, such as the bipartition of clusters and decomposition into levels (like the “random-32-32-20" map and “room-32-32-4" in Section 5).

In the bipartition of clusters, we first determine the complexity of decomposing a cluster into two clusters. Assuming there are m agents in a cluster, in the worst case, it takes

m - 1

calls to

s e a r c h_{p} a t h_{a} g e n t

to determine that one agent needs to be moved from the remaining set to the major set, and

m - 1

agents need to be moved from the remaining set to the major set. Therefore, the time complexity of splitting one cluster into two clusters is

{(m - 1)}^{2} \times O (k^{2}) = O (m^{2} k^{2})

.

Considering that the larger the cluster, the greater the time cost of splitting it, the worst case in the bipartition of clusters occurs when the major set always contains only one agent (i.e., maximizing the number of agents in the remaining set) until the remaining set is empty (as illustrated in Figure 14). This requires

m - 1

splits of a cluster into two clusters, where the number of agents in the clusters is

1, 2, \dots, k

. Therefore, the time complexity of the bipartition of a cluster in the worst case is:

\sum_{m = 1}^{k} O (m^{2} k^{2}) = O (\frac{k (k + 1) (2 k + 1)}{6} (k^{2})) = O (k^{5})

.

• Decomposition to level and sorting

In the initialization of levels, each agent needs to use

s e a r c h_{p} a t h_{S} A T

once. For a cluster with m agents, the time complexity of

s e a r c h_{p} a t h_{S} A T

is

m \times O (k^{2}) = O (m k^{2})

.

Considering there are m nodes and at most

m (m - 1)

edges in

G_{s}

, and that in the construction of

G_{l}

, we need to run Tarjan’s algorithm on

G_{s}

, whose time complexity is

O (| V | + | E |)

. Therefore, the time complexity for constructing

G_{l}

is

O (m^{2})

.

The time complexity of level sorting is determined by a BFS search, so its complexity depends on the size of the level-ordering graph. In the worst case, each level contains only one agent, and there are m levels in

G_{l}

, resulting in at most

m (m - 1)

edges in

G_{l}

. Since the time complexity of both DFS and BFS is

O (| V | + | E |)

, the time complexity of level sorting in the worst case is

O (m^{2})

.

Considering that

m \leq k

, the time complexity for decomposing a cluster with m agents into levels is

O (m k^{2}) + O (m^{2}) + O (m^{2}) = O (m k^{2})

.

So, in summary, considering the total number of agents in all clusters is k, the worst-case time complexity of decomposing all clusters into levels is

O (k^{3})

.

• Combining results

As mentioned before, serial MAPF methods can take extra paths as constraints to avoid (e.g., EECBS), so no extra actions are needed to merge subproblem solutions. However, parallel MAPF methods (e.g., LaCAM) need to add wait actions to avoid conflicts between the solutions of different subproblems.

For a MAPF instance with k agents, assuming there are m subproblems, let the number of agents in the i-th subproblem be denoted as

c_{i}

, and the makespan of their solution as

s_{i}

for

0 < i < m

. In the worst case, every subproblem needs to add wait actions equal to the sum of the previous subproblem’s makespan to ensure collision-free execution (i.e., waiting until all agents from previous subproblems have arrived at their targets, except for the first subproblem, as described in the simplified scenario). Thus, the total number of added wait actions in the worst case is

\sum_{i = 1}^{m - 1} c_{i} \cdot s_{i - 1}

. Assuming there is an upper bound on the makespan, T, we have:

\sum_{i = 1}^{m - 1} c_{i} * s_{i - 1} < = \sum_{i = 1}^{m - 1} c_{i} * T < \sum_{i = 1}^{m - 1} k * T = (m - 1) k * T < k^{2} T

. Therefore, the time complexity in the worst case is

O (k^{2} T)

.

It is noteworthy that the makespan of solutions varies across different MAPF methods, so the time cost of merging results also differs depending on the method used.

5. Results of Decomposition

In theory, the decomposition of a MAPF instance reduces the cost of solving the instance, but it also incurs its own time and memory costs. Therefore, before analyzing how the decomposition of a MAPF instance contributes to solving the instance, it is essential to examine the time and memory usage of the decomposition process.

Specifically, we measure the peak memory usage as the memory usage, and record the memory usage of the program at every millisecond during implementation. However, due to the resolution of memory recording being 1KB, memory usage less than 1 KB is recorded as 0 MB. This phenomenon may occur under certain maps.

Furthermore, we analyze how each step of the decomposition process influences the overall decomposition by evaluating changes in the decomposition rate (maximum subproblem size / total number of agents) and the number of subproblems after each step. For simplicity, the initialization of clusters is denoted as “IC", the bipartition of clusters as “BC", and decomposing to levels and sorting as “LS".

We employ a classic MAPF dataset [29] as MAPF instances for our analysis, comprising 24 typical maps from the dataset. These maps feature an increasing number of agents, and each number of agents is randomly selected 100 times, resulting in a total of 22,300 MAPF instances. The same instances are utilized to evaluate how decomposition influences typical MAPF methods.

Experiments were conducted on a laptop running Ubuntu 20.04, equipped with a Ryzen 7 5800h (3.2GHz) CPU and 16GB of memory. All code is implemented in C++.

The results of our experiments are presented in Figure 15 and Figure 16.

5.1. Decomposition rate

The decomposition rate is closely correlated with the time cost and memory usage in solving a MAPF instance, as both time cost and memory usage are primarily determined by the size of the largest subproblem.

5.1.1. Performance under different maps

As depicted in Figure 15, the final decomposition rate (i.e., the decomposition rate after step 3) demonstrates a consistent increase across almost all maps, with the exception of maps containing a high number of passable cells (e.g., den520, Berlin_1_256). In these maps, the number of passable cells closely aligns with the number of agents, resulting in fewer opportunities to alter dependence paths and decompose the instance. Consequently, the size of the largest subproblem approaches that of the original MAPF instance. This limitation becomes increasingly significant as the number of agents grows, ultimately leading to a decomposition rate close to 1, indicating that decomposition becomes ineffective when agents are densely packed.

Conversely, in maps with a surplus of passable cells compared to the number of agents, such as den520 and Berlin_1_256, decomposition remains highly effective despite increases in the number of agents. In these scenarios, the abundance of passable cells results in numerous free cell groups, facilitating the generation of dependence paths that no need to traverse multiple agents’ start and target locations.

5.1.2. How each step contribute to decomposition

As shown in Figure 15, we observe that each step of the decomposition process contributes to a decrease in the decomposition rate (i.e., a reduction in the maximum size of subproblems) overall. However, the extent of their contributions varies across different maps.

In maps characterized by a surplus of passable cells, exceeding the number of agents (such as Berlin_1_256, Paris_1_256, and Den520d), the maximum size of clusters is reduced to merely 1 after IC, even with hundreds of agents. This is due to the high likelihood of an agent’s initial dependence path avoiding passing through other agents’ start or target locations. Consequently, subsequent BC and LS have no further room for decomposition, resulting in similar time costs after each step in these maps.

Conversely, in maps where the number of passable cells is close to the number of agents (e.g., empty_16_16, empty_32_32, random-32-32-20, and maze-32-32-4), BC and LS contribute more to decomposition than IC as the number of agents increases. This is because there is a lower likelihood of an agent’s initial dependence path avoiding passing through other agents’ start or target locations. Consequently, BC and LS have the opportunity to decompose subproblems by updating dependence paths.

5.2. Number of subproblems

The count of subproblems generally follows a pattern of initial increase followed by a decrease as the number of agents increases. Initially, decomposition easily generates small subproblems, leading to an increase in their count as the number of agents grows. However, as agents become denser, decomposing becomes more challenging, resulting in an increase in the decomposition rate and a subsequent decrease in the number of subproblems until it equals 1, indicating the raw MAPF instance.

Special cases arise in maps with numerous passable cells, exceeding the number of agents, such as den520d and Berlin_1_256. In these instances, decomposition divides the raw instance into subproblems, each containing only one agent, resulting in the number of subproblems equaling the number of agents in the raw MAPF instance. However, it is predictable that as the number of agents increases, decomposition will become increasingly ineffective, ultimately reducing the number of subproblems to 1.

5.3. Time cost and memory usage

Time cost and memory usage are crucial factors in the application of decomposition for MAPF. While theoretically, decomposing a MAPF instance should reduce the time cost and memory usage of solving the problem, if the decomposition process itself consumes too much time or memory space, it may not effectively reduce the total cost of solving the MAPF instance. So we analysis how many resources decomposition cost under various of maps.

Generally, as depicted in the Figure 15 and Figure 16, the time cost of decomposition is less than 1 second in most maps and less than 3 seconds in the worst-case scenario (such as 800 to 1000 dense agents), which is deemed acceptable for practical application. In the majority of cases, all three steps contribute to the time cost. However, there are differences in the time cost of steps across different maps. In maps with an abundance of passable cells exceeding the number of agents, such as den512d, Berlin_1_256, and Paris_1_256, IC consumes almost all of the time cost, resulting in overlapping time costs after IC, BC, and LS. Conversely, in maps with few passable cells close to the number of agents, all steps contribute to the time cost, as these scenarios require BC and LS to decompose initial subproblems into smaller subproblems.

Regarding memory usage, decomposition for almost all maps consumes less than 1 MB, which is considered acceptable for common platforms such as laptops or industrial process computers.

5.4. Comparison with Independence Detection

Similar to Layered MAPF, Independence Detection (ID) [20] is a technique used to decompose a group of agents into the smallest possible groups. Unlike Layered MAPF, ID assigns each agent to its own unique group and then merges two groups if one group cannot find a solution (using MAPF methods) that avoids conflict with the solution of another group. ID is limited to methods that can take external paths as constraints to avoid conflicts, so not every MAPF method is applicable to ID. In this section, we compare Layered MAPF and ID using CBS and EECBS. The related results are shown in Figure 17 and Figure 19. For the implementation, we use the code from [30]1 for ID.

In this section, we compare Layered MAPF with ID in terms of maximum subproblem size, number of subproblems, time cost, and success rate in finding a solution. Since MAPF methods are embedded in ID, we compare the total cost of Layered MAPF (including the time cost of decomposition and solving subproblems) with the time cost of ID. To maintain consistency, we use the same MAPF instances for the decomposition of each instance.

As a common rule in solving MAPF instances, we set an upper bound on the time cost (30 seconds). Methods that are complete but fail to find a solution within the allotted time are considered to have failed. When ID fails to find collision-free solutions for an instance, we use the number of agents in the instance as its maximum subproblem size. To maintain consistency, this rule is also applied to Layered MAPF, even though our method decomposes the instance into multiple subproblems.

Figure 17. These figures show how Layered CBS and ID using CBS perform across various maps. Their performance is evaluated in terms of time cost, success rate, maximum subproblem size, and total number of subproblems. The data for Layered CBS is shown as solid lines, while the data for ID (using CBS) is shown as dotted lines. More details about the maps can be found in Figure 15 and Figure 16. The last row contains the legend for the upper figures; figures in the same column share the same legend, and the same applies hereinafter.

Figure 18. These figures show the average time cost, success rate, maximum subproblem size, and number of subproblems for Layered CBS and ID (using CBS) across various maps. These figures summarize the data from Figure 17. The map indices correspond to those in Figure 15 and Figure 16, and the same applies hereinafter.

Figure 19. These figures show how Layered EECBS and ID using EECBS perform across various maps. Their performance is evaluated in terms of time cost, success rate, maximum subproblem size, and total number of subproblems. The data for Layered EECBS is shown as solid lines, while the data for ID (using EECBS) is shown as dotted lines. More details about the maps can be found in Figure 15 and Figure 16.

As shown in Figure 17 and Figure 19, for both CBS and EECBS, Layered MAPF’s maximum subproblem size increases significantly slower than ID as the number of agents increases. Additionally, Layered MAPF decomposes the MAPF instance into more subproblems compared to ID. As a result, Layered MAPF has a lower time cost (for CBS, 14.5s < 27.4s; for EECBS, 14.4s < 26.0s), a higher success rate (for CBS, 0.64 > 0.12; for EECBS, 0.65 > 0.18), a smaller maximum subproblem size (for CBS, 206.8 < 426.5; for EECBS, 206.9 < 426.9), and more subproblems (for CBS, 218.7 > 1.9; for EECBS, 218.7 > 2.31), on average, compared to ID, as shown in Figure 18 and Figure 20.

There are several reasons that cause this phenomenon:

1, ID does not consider the order of solving subproblems, while Layered MAPF does. Therefore, Layered MAPF has a higher probability of decomposing a MAPF instance into smaller subproblems;

2, when ID tries to check whether a group (i.e., a subproblem) can avoid conflicts with another group, it does not consider avoiding groups that it has already avoided in the past. As a result, ID can sometimes fall into infinite loops. For example, suppose group A avoids group B. Then, ID detects conflicts between group A and group C and finds a new solution for group A that avoids conflicts with group C. However, this new solution for group A conflicts with group B. ID will then repeat the process of making group A avoid group B and group C until it runs out of time. This phenomenon is less frequent when there are a few sparse agents but becomes more common when there are dense agents;

3, in Layered MAPF, each agent participates in a multi-agent pathfinding process only once. In contrast, in ID, an agent may participate multiple times, as it can be merged into different groups at different stages. This causes ID to have a higher time cost than Layered MAPF.

5.5. Summary

Figure 21. This figure illustrates how the maximum subproblem size changes as the density of agents and the number of agents in the instance change. The data source is the same as in Figure 15 and Figure 16.

Our method demonstrates high effectiveness for MAPF instances with cells exceeding two times the number of agents, while its effectiveness diminishes as the number of passable cells approaches two times the number of agents. This is because a higher number of passable cells increases the likelihood of decomposing the instance into smaller subproblems. Conversely, for the same map, the effectiveness of decomposition decreases with an increase in the number of agents in the MAPF instance.

Regarding subproblems, in most maps, the number of subproblems initially increases and then decreases as the number of agents increases. Initially, with only a few agents, decomposition is effective, but as the number of agents increases, decomposition becomes less effective.

In terms of costs, the average time cost is less than 1 second in average, with a maximum of less than 3 seconds in the worst cases (such as 800 to 1000 dense agents), and memory usage remains below 1 MB. Comparatively, maps with more passable cells than the number of agents require fewer computations and less memory space.

We also studied how the decomposition of an instance changes as the number of agents and the agent density change (defined as the ratio between the number of agents and the number of passable cells), as shown in Figure 21. We can observe that the maximum subproblem size is large only when both the agent density and the number of agents are high. In other words, instance decomposition can be effective even with a large number of agents, as long as the agent density is low.

6. Results of decomposition’s application

Figure 22. These figures serve as a detailed version of Figure 23, offering insights into the performance of both raw MAPF methods and Layered MAPF methods under specific maps.

Figure 23. These figures provide a summary comparison between the raw MAPF method and the Layered MAPF method, combining the results across all maps. In the visualizations, crossed bars represent the Layered version of the MAPF method, while empty bars represent the raw MAPF method.

In this section, we evaluate the influence of decomposing MAPF instances on various MAPF methods, including EECBS, PBS, LNS, HCA∗ (serial MAPF method), and Push and Swap, PIBT+, LaCAM2 (parallel MAPF method). We refer to the methods with decomposition of MAPF instances as the “Layered" versions of the raw MAPF methods. To maintain consistency, we utilize the same MAPF instances as in Section 5 to analyze how decomposition affects their application. We assess decomposition’s impact in terms of time cost, memory usage, success rate, sum of cost, and makespan. The default configurations of these MAPF methods are used in our experiments. In particular, due to the memory usage not being immediately released after the code is executed on an Ubuntu system, the statistics of memory usage (in GB, MB, or kB) experience a delay, causing imprecision when comparing Layered MAPF and raw MAPF. Therefore, we use the count of key data elements that play a major role in memory usage as an indicator of memory usage, such as the number of high-level nodes for LaCAM2 or the size of the constraint tree for EECBS.

As a common rule in solving MAPF instances, we set an upper bound on the time cost (30 seconds). Methods that are complete but fail to find a solution within the limited time are considered as failed. Additionally, methods that run out of memory space are also considered as failed.

In this section, experiments were conducted on a computer running Ubuntu 20.04, equipped with an Intel Xeon(R) CPU (2.1 GHz) and 64 GB of memory. All code was implemented in C++.

6.1. Explicit Estimation CBS

Explicit Estimation CBS (EECBS) [7] stands out as one of the state-of-the-art CBS-based MAPF methods, characterized by its completeness and bounded suboptimality. Further details can be found in Section 2. EECBS’s code 2 operates by searching each agent’s path separately and provides interfaces to avoid conflicts with external paths, rendering it a serial MAPF method. In its Layered version, which we refer to as “Layered EECBS", previous subproblems’ paths and subsequent subproblems’ starts are set as constraints. We use the number of constraint nodes as an estimate of the memory usage of EECBS, as it is the only variable that increases during its execution.

The average time cost, success rate, memory usage, makespan, and total cost of Layered EECBS and raw EECBS are shown in Figure 23. How Layered EECBS and raw EECBS perform under various maps as the number of agents increases is shown in Figure A1 in the Appendix. As shown in Figure A1, Layered EECBS has lower time cost and thus higher success rates compared to raw EECBS when there are few agents. However, these advantages slowly diminish as the number of agents increases, as does Layered EECBS’s advantage in memory usage. Despite this, Layered EECBS still has an advantage in most cases. In terms of path quality, the solutions of Layered EECBS are slightly worse than those of raw EECBS. A summary of the performance of Layered EECBS and raw EECBS can be found in Figure 23.

As illustrated in Figure 23, on average, Layered EECBS exhibits lower time costs (

1.2 \times 10^{4}

ms <

1.7 \times 10^{4}

ms), leading to a higher success rate (0.68 > 0.57). The time cost of decomposition is relatively small compared to the time cost of solving the MAPF instance. Regarding memory usage, Layered EECBS consumes less memory space compared to raw EECBS (4839.0 constraint nodes < 7542.8 constraint nodes). Decomposition of the MAPF instance significantly reduces the growth of the constraint tree’s size by decreasing the number of agents solved simultaneously.

In terms of path quality, Layered EECBS yields a larger makespan (Layered EECBS:

2.3 \times 10^{2}

, raw EECBS:

1.9 \times 10^{2}

), resulting in a larger sum of costs (Layered EECBS:

4.7 \times 10^{4}

, raw EECBS:

3.2 \times 10^{4}

). In maps where both raw EECBS and Layered EECBS achieve a success rate close to 1 (e.g., map 1, 8, 10, 19, 21 and 22), both methods exhibit similar makespan and sum of costs. However, in maps where raw EECBS has a lower success rate compared to Layered EECBS (e.g., map 4, 16, 17 and 24), Layered EECBS demonstrates a larger makespan and sum of costs, as depicted in Figure 22.

This phenomenon can be attributed to two factors: Firstly, solving all agents together provides more opportunities to find shorter solutions than solving them separately. Secondly, raw EECBS struggles to find longer solutions, particularly in maps where its success rate is lower.

In summary, Layered EECBS boasts explicit advantages in terms of time cost, memory usage, and success rate compared to raw EECBS. Additionally, in maps where both methods achieve a high success rate, their solutions tend to be similar. However, Layered EECBS outperforms raw EECBS in finding longer solutions when the latter fails to do so.

6.2. Priority Based Search

Priority Based Search (PBS) [4] is an incomplete, suboptimal, and priority-based MAPF method. PBS searches each agent’s path separately and it’s code3 provides interfaces to avoid conflicts with external paths, making it a serial MAPF method. In its Layered version, which we refer to as “Layered PBS", we set previous subproblems’ paths and subsequent subproblems’ starts as constraints. We use the number of priority nodes as an estimate of the memory usage of PBS, as it is the only variable that increases during its execution.

The average time cost, success rate, memory usage, makespan, and total cost of Layered PBS and raw PBS are shown in Figure 23. How Layered PBS and raw PBS perform under various maps as the number of agents increases is shown in Figure A2 in the Appendix. As shown in Figure A2, Layered PBS has lower time cost and thus higher success rates compared to raw PBS when there are few agents. However, these advantages slowly diminish as the number of agents increases, as does Layered PBS’s advantage in memory usage. Despite this, Layered PBS still has an advantage in most cases. In terms of path quality, the solutions of Layered PBS are slightly worse than those of raw PBS. A summary of the performance of Layered PBS and raw PBS can be found in Figure 23.

As shown in Figure 23, on average, Layered PBS has a lower time cost (

1.3 \times 10^{4}

ms <

2.0 \times 10^{4}

ms) and thus a higher success rate (0.68 > 0.37). The time cost of decomposition is relatively small compared to the time cost of solving the MAPF instance. In terms of memory usage, Layered PBS has lower memory usage compared to raw PBS (234.4 priority nodes < 706.8 priority nodes) because decomposition of the MAPF instance reduces the complexity of the priority tree significantly.

When it comes to path quality, Layered PBS has a larger makespan (

2.3 \times 10^{2}

>

1.0 \times 10^{2}

) and thus a larger SOC (

4.5 \times 10^{4}

>

1.3 \times 10^{4}

). Similar to EECBS, raw PBS finds it difficult to find longer solutions and can only finds shorter solutions, resulting in lower success rates for maps where raw PBS struggles (e.g., map 4, 6, 16 and 17). For maps where both methods have high success rates (e.g., map 1, 9 and 10), they have similar makespan and SOC, as shown in Figure 22.

In summary, Layered PBS has explicit advantages in time cost, memory usage, and success rate compared to raw PBS. Additionally, Layered PBS produces solutions similar to raw PBS’s solutions on maps where both methods have success rates close to 1. Furthermore, Layered PBS achieves a higher success rate in finding long solutions when raw PBS fails.

6.3. Large Neighborhood Search 2

Large Neighborhood Search 2 (LNS2) [3] is a suboptimal MAPF method that starts from a set of paths with collisions and repeatedly replans subsets of paths to reduce the overall number of collisions until the paths become collision-free. LNS2 updates each agent’s path separately, and its code4 provides interfaces to avoid conflicts with external paths, making it a serial MAPF method. In LNS2’s implementation, LNS2 considers all agents in the initial planning phase, so it generally has the highest memory usage during this phase. LNS2 provides several classic MAPF methods to generate the initial solution, including EECBS, PP, PPS, CBS, PIBT, and winPIBT. To obtain a conflict-free solution as quickly as possible, we choose EECBS to generate the initial solutions. Therefore, we use the number of constraint nodes in EECBS during the initial planning phase as an estimate of the memory usage of LNS2, as this is the variable that contributes most to memory usage during its execution.

The average time cost, success rate, memory usage, makespan, and total cost of Layered LNS2 and raw LNS2 are shown in Figure 23. How Layered LNS2 and raw LNS2 perform under various maps as the number of agents increases is shown in Figure A3 in the Appendix. As shown in Figure A3, Layered LNS2 has lower time cost and thus a higher success rate when there are few agents compared to raw LNS2. However, these advantages slowly diminish as the number of agents increases, as does Layered LNS2’s advantage in memory usage. Despite this, Layered LNS2 still has an advantage in most cases. In terms of path quality, the solutions from Layered LNS2 are slightly worse than those from raw LNS2. A summary of the performance of Layered LNS2 and raw LNS2 can be found in Figure 23.

As shown in Figure 23, on average, Layered LNS2 has a lower time cost (

1.2 \times 10^{4}

ms <

1.6 \times 10^{4}

ms) and thus a higher success rate (0.72 > 0.52). In terms of memory usage, Layered LNS2 has lower memory usage compared to raw LNS2 (24.9 constraint nodes < 228.8 constraint nodes). Due to the decomposition of the MAPF instance, other agents’ paths are considered as dynamic obstacles and do not join conflict resolve, resulting in Layered LNS2 requiring fewer repair actions to fix conflicts between agents’ paths.

Regarding path quality, Layered LNS2 has a larger makespan (

2.4 \times 10^{2}

>

1.9 \times 10^{2}

) and SOC (

4.7 \times 10^{4}

>

3.0 \times 10^{4}

). Similar to EECBS and PBS, raw LNS2 struggles to find longer solutions and can only discovers shorter solutions, particularly in maps where raw LNS2 has a lower success rate (e.g., map 12 and 16). For maps where both methods have similar success rates (e.g., map 1, 2, 8, 21 and 22), raw LNS2 and Layered LNS2 have similar makespan and SOC, as shown in Figure 22.

In summary, Layered LNS2 exhibits explicit advantages in time cost, memory usage, and success rate compared to raw LNS2. Additionally, Layered LNS2 produces solutions similar to raw LNS2’s solutions in maps where both methods have success rates close to 1. Furthermore, Layered LNS2 achieves a higher success rate in finding long solutions when raw LNS2 cannot.

6.4. Push and Swap

Push and Swap (PAS) [31] is a suboptimal and complete rule-based MAPF method. Push and Swap updates agents’ paths simultaneously (via an action called “MULTIPUSH”, which moves a set of adjacent agents simultaneously), and its code5 provides no interfaces to avoid conflicts with external paths and ensures that the same state of agents does not occur twice in the solution, making it a parallel MAPF method. In its Layered version, referred to as “Layered Push and Swap" or “Layered PAS," we set previous subproblems’ targets and subsequent subproblems’ starts as static obstacles. Subsequently, we merge subproblems’ solutions by adding wait actions after solving all subproblems. In the implementation of Push and Swap, the variable that occupies the most memory is a two-dimensional matrix that stores each agent’s location at all times. Therefore, we use the number of elements in the matrix as an estimate of the memory usage of Push and Swap, which is calculated as the makespan of the solution multiplied by the number of agents.

The average time cost, success rate, memory usage, makespan, and sum of cost of Layered Push and Swap (PAS) and raw PAS are shown in Figure 23. How Layered PAS and raw PAS perform under various maps as the number of agents increases is shown in Figure A4 in the Appendix.

As shown in Figure A4, in some cases, Layered PAS takes more time than raw PAS, while in other cases, it does not, as PAS’s time cost is close to the time cost of instance decomposition. However, when there are many agents, Layered PAS takes less time than raw PAS. Consequently, Layered PAS achieves a higher success rate than raw PAS as the number of agents increases in most cases.

In terms of memory usage, Layered PAS consistently requires less memory than raw PAS, even as the number of agents increases.

Regarding path quality, Layered PAS’s solutions are slightly worse than raw PAS’s solutions. A summary of Layered PAS and raw PAS’s performance can be found in Figure 23.

As shown in Figure 23, on average, Layered PAS consumes less time (

6.8 \times 10^{3}

ms <

1.15 \times 10^{4}

ms) and memory space (

6.0 \times 10^{6}

constraint nodes <

4.0 \times 10^{7}

constraint nodes) than raw PAS, resulting in a higher success rate (0.92 > 0.69). Due to the number of “push” and “swap” actions grows exponentially as the number of agents increases, decomposing the MAPF instance enables Layered Push and Swap to solve subproblems separately, requiring fewer “push” and “swap” actions.

Regarding path quality, as PAS is a parallel MAPF method, the addition of wait actions to merge subproblems’ solutions, Layered PAS has a larger makespan (

1.0 \times 10^{4}

>

4.0 \times 10^{3}

) and SOC (

3.5 \times 10^{6}

>

1.2 \times 10^{6}

) compared to raw PAS.

In summary, Layered Push and Swap consume less time and memory space than raw Push and Swap, resulting in a higher success rate. However, the introduction of wait actions to merge subproblems’ solutions leads to a larger makespan and SOC compared to raw Push and Swap.

6.5. Hierarchical Cooperative A∗

Hierarchical Cooperative A* (HCA*) [32] is a suboptimal and incomplete prioritized MAPF algorithm that decouples into a series of single-agent searches. HCA* calculates a distance table for each agent to determine their priority. The distance table stores the distance from every location on the map to the target state. Since HCA* searches one path at a time, and after reviewing its code, we found that there are no other variables consuming significant memory space aside from the distance table, we conclude that HCA*’s memory usage is primarily determined by the size of these distance tables. Specifically, it depends on the number of agents multiplied by the size of the map. Thus, we use the product of the number of agents and the size of the map as an estimate for HCA*’s memory usage.

HCA* searches each agent’s path separately, making it a serial MAPF method. But the code6 for HCA* provides no interfaces to set external paths as dynamic obstacles to avoid. We updated its code to enable it to set external paths as dynamic obstacles.

The average time cost, success rate, memory usage, makespan, and sum of costs for Layered HCA* and raw HCA* are shown in Figure 23. A comparison of how Layered HCA* and raw HCA* perform under various maps as the number of agents increases is shown in Figure A5 in the Appendix. As illustrated in Figure A5, the time cost of Layered HCA* is very close to that of raw HCA*, as HCA* searches each agent’s path separately. However, Layered HCA* has lower memory usage compared to raw HCA*, as it does not need to store the heuristic table for all agents.

In terms of success rate, Layered HCA* has a slightly higher success rate, as HCA* is incomplete, and solving the instance separately may increase its chances of finding a solution (e.g., instances are decomposed into subproblems that involve only one agent).

Regarding path quality, the solution of Layered HCA* is slightly worse than that of raw HCA*. A summary of the performance of Layered HCA* and raw HCA* can be found in Figure 23.

On average, Layered HCA* has a slightly higher time cost than raw HCA* (

1.06 \times 10^{4}

ms >

1.05 \times 10^{4}

ms) and a similar success rate (0.71 > 0.66) as HCA* solves each agent separately, and decomposition does not improve the efficiency of HCA*. However, Layered HCA* requires less memory space than raw HCA* (

9.1 \times 10^{5}

<

2.6 \times 10^{7}

), as Layered HCA* only stores the distance table of the current subproblem’s agents, while raw HCA* needs to store the distance table of all agents.

Regarding path quality, Layered HCA* has a slightly larger makespan (

2.5 \times 10^{2}

>

2.2 \times 10^{2}

) and SOC (

5.2 \times 10^{4}

>

4.2 \times 10^{4}

). Because HCA* solves agents in order of decreasing distance to the target, raw HCA* solves agents with longer paths first, limiting the makespan of these agents’ solutions, while Layered HCA* breaks this order as agents with long paths may not be in the same subproblem and may not be solved first.

In summary, Layered HCA* shows little improvement compared to raw HCA* and has slightly worse solutions because HCA* has a fixed priority and solves each agent separately. However, Layered HCA* has an advantage in memory usage, as it only needs to store the distance table of the current subproblem’s agents rather than all agents.

6.6. Priority Inheritance with Backtracking+

Priority Inheritance with Backtracking+ (PIBT+) [16] is an incomplete and suboptimal MAPF algorithm. PIBT+ avoids the occurrence of the same state of agents twice in the solution. And the code7 for PIBT+ provides no interfaces to avoid conflicts with external paths, making it a parallel MAPF method. In the implementation of PIBT+, a function called

f u n c P I B T

is used recursively to move all agents one step at a time without any collisions. The function is called repeatedly until all agents reach their targets. In computer systems (e.g., Windows or Linux), function recursion relies on the stack to manage the context of function calls. Each recursive call allocates a new stack frame, and the depth of recursion is limited by the size of the stack. Therefore, the memory usage of PIBT+ is primarily determined by the recursion depth of

f u n c P I B T

. We use the maximum recursion depth as an estimate of the memory usage for PIBT+.

The average time cost, success rate, memory usage, makespan, and sum of costs for Layered PIBT+ and raw PIBT+ are shown in Figure 23. The performance of Layered PIBT+ and raw PIBT+ under various maps as the number of agents increases is presented in Figure A6 in the Appendix.

As shown in Figure A6, in some cases, Layered PIBT+ takes more time compared to raw PIBT+, while in other cases, it does not. This is because the time cost of PIBT+ is closely tied to the time cost of instance decomposition. Both Layered PIBT+ and raw PIBT+ have a high success rate, but neither achieves a 100% success rate, as PIBT+ is an incomplete MAPF method. In terms of memory usage, Layered PIBT+ consistently requires less memory than raw PIBT+, even as the number of agents increases.

Regarding path quality, the solutions found by Layered PIBT+ are generally worse than those found by raw PIBT+, though in some cases, both Layered PIBT+ and raw PIBT+ solutions see a decrease in SOC and makespan as the number of agents increases. This is because, with more agents, there are fewer opportunities for PIBT+ to find longer paths. A summary of Layered PIBT+ and raw PIBT+ performance can be found in Figure 23.

In general, Layered PIBT+ takes more time than raw PIBT+ (

2.2 \times 10^{3}

ms >

1.9 \times 10^{3}

ms). As mentioned in Section 5, the decomposition of MAPF instances with 800 to 1000 agents may cost 1s to 3s, and these instances play a major role in contributing to the mean time cost of the MAPF method. Although decomposition reduces the time cost of PIBT+, the time cost of decomposition offsets the reduced time cost and thus increases the overall time cost.

Layered PIBT+ has a similar success rate compared to raw PIBT+ (Layered PIBT+: 0.98, raw PIBT+: 0.98), as both Layered PIBT+ and raw PIBT+ typically take less than 30s in most cases. In terms of memory usage, Layered PIBT+ requires less memory space than raw PIBT+ (

6.5 \times 10^{5}

<

1.1 \times 10^{6}

), as decomposition reduces the number of intermediate states by reducing the number of agents solved simultaneously.

Regarding solution quality, as PIBT+ is a parallel MAPF method, Layered PIBT+ introduces extra wait actions, resulting in solutions with higher makespan (

9.4 \times 10^{3}

>

4.3 \times 10^{2}

) and SOC (

3.2 \times 10^{6}

>

1.3 \times 10^{5}

) compared to raw PIBT+’s solutions.

In summary, Layered PIBT+ shows little improvement compared to raw PIBT+ and has worse solutions due to the introduction of wait actions when merging solutions. However, Layered PIBT+ has an advantage in memory usage, as it reduces the number of intermediate states by reducing the number of agents solved simultaneously.

6.7. Lazy Constraints Addition for MAPF 2

Lazy Constraints Addition for MAPF 2 (LaCAM2) [18] is a complete and suboptimal method. More details about LaCAM2 can be found in Section 2. LaCAM avoids the occurrence of the same state of agents twice in the solution. Compared to LaCAM, LaCAM2 [18] introduces a swap action to the successor generator, allowing for the quick generation of solutions. The code8 for LaCAM2 provides no interfaces to avoid conflicts with external paths, making it a parallel MAPF method. In the implementation, LaCAM2 stores all high-level nodes during planning, which increases rapidly as the number of agents grows. Therefore, LaCAM2’s memory usage is primarily determined by the maximum number of high-level nodes. We use this maximum number as an estimation of LaCAM2’s memory usage.

The average time cost, success rate, memory usage, makespan, and sum of costs for both Layered LaCAM2 and raw LaCAM2 are shown in Figure 23. The performance of Layered LaCAM2 and raw LaCAM2 across various maps, as the number of agents increases, is presented in Figure A7 in the Appendix.

As shown in Figure A7, in some cases, Layered LaCAM2 takes more time than raw LaCAM2, while in other cases it does not, as LaCAM2’s time cost is close to the decomposition time cost of the instance. Both Layered LaCAM2 and raw LaCAM2 achieve a 100% success rate, as LaCAM2 is a complete and highly efficient MAPF method.

In terms of memory usage, Layered LaCAM2 consistently requires less memory than raw LaCAM2, even as the number of agents increases.

Regarding path quality, the solutions generated by Layered LaCAM2 are slightly worse than those generated by raw LaCAM2. A summary of the performance of both Layered LaCAM2 and raw LaCAM2 can be found in Figure 23.

In average, Layered LaCAM2 has larger time cost than raw LaCAM2 (

9.6 \times 10^{2}

ms >

7.3 \times 10^{2}

ms). Similar to PIBT+, the decomposition of MAPF instances with 800 to 1000 agents may cost 1s to 3s, and these instances play a major role in determining the time cost of the MAPF method. Although decomposition reduces the time cost of LaCAM2, the time cost of decomposition offsets the reduced time cost, resulting in no significant difference in the overall time cost.

Both Layered LaCAM2 and raw LaCAM2 achieve the same success rate (Layered LaCAM2: 1.0, raw LaCAM2: 1.0), indicating that both methods successfully find all solutions within 30 seconds. This suggests that the decomposition of the instance does not sacrifice its solvability during the experiments. If there had been any loss of solvability, Layered LaCAM2’s success rate would have been lower than 1.0. Based on this data, we speculate that the likelihood of our decomposition causing a loss of solvability is less than 1%. In terms of memory usage, Layered LaCAM2 requires significantly less memory space than raw LaCAM2 (

3.4 \times 10^{4}

configs <

1.7 \times 10^{5}

configs), as decomposition reduces the number of intermediate states by reducing the number of agents solved simultaneously.

Regarding solution quality, Layered LaCAM2 introduces extra wait actions, resulting in solutions with higher makespan (

9.5 \times 10^{3}

>

3.4 \times 10^{2}

) and SOC (

3.2 \times 10^{6}

>

9.5 \times 10^{4}

) compared to raw LaCAM2’s solutions.

LaCAM2 is the only algorithm among the seven that achieves a 100% success rate, making it a useful benchmark for estimating the loss of solvability caused by our decomposition of the instance. Since Layered LaCAM2 also maintains a 100% success rate, we speculate that the loss of solvability due to our decomposition is less than 1%.

In summary, Layered LaCAM2 shows little improvement compared to raw LaCAM2 and has worse solutions due to the introduction of wait actions when merging solutions. However, Layered LaCAM2 has an advantage in memory usage, as it reduces the number of intermediate states by reducing the number of agents solved simultaneously.

6.8. Summary

On average, decomposition of MAPF instances significantly reduces the memory usage of all seven methods and also decreases the time cost for EECBS, PBS, LNS2, Push and Swap. Layered MAPF methods exhibit higher success rates compared to raw MAPF methods, particularly for serial MAPF methods, with the exception of HCA*, an incomplete prioritized method that solves agents one-by-one.

Regarding solution quality, the Layered version of serial MAPF methods generally exhibits similar quality compared to the raw version. However, the Layered version of parallel MAPF methods tends to produce worse solutions due to the insertion of numerous wait actions during solution merging. This disparity arises from the fact that serial MAPF methods treat external paths as dynamic obstacles to avoid, while parallel MAPF methods do not.

In conclusion, decomposition of MAPF instances is most advantageous for serial MAPF methods, resulting in reduced time cost and memory usage without significantly sacrificing solution quality. For parallel MAPF methods, decomposition reduce memory usage significantly but often leads to worsened solutions without notable improvements in time cost.

Moreover, serial MAPF methods (e.g., EECBS, LNS2) generally demand more time compared to parallel MAPF methods (e.g., LaCAM and PIBT+), yet they yield higher-quality solutions. Notably, serial MAPF methods frequently fail due to time constraints, while parallel MAPF methods tend to fail due to memory limitations.

The observed discrepancy can be attributed to the differing search algorithms utilized: serial methods repeatedly search for complete paths. However, they employ memory-efficient data structures such as priority trees or constraint trees to store knowledge about resolving conflicts. In contrast, parallel methods store previous conflict-solving knowledge as intermediate states to avoid searching for the complete path of agents repeatedly. However, they may exhaust memory resources under complex MAPF instances.

7. Conclusion

MAPF is extensively utilized in autonomous systems, such as automated warehouses, UAV traffic management and multi-agent exploration. Motivated by the exponential growth in the cost of solving MAPF instances (in terms of time and memory usage) as the number of agents increases, we proposed Layered MAPF as a solution to reduce the computational burden. This approach decomposes a MAPF instance into multiple smaller subproblemsprogressively, and minimize the possibility of loss of solvability by introducing legality check during each step of decomposition. In terms of time complexity, in the worst case, time complexity of decomposition is

O (k^{5})

; while in the best case, its time complexity is

O (k^{3})

, where k represents the number of agents.

In the results of our decomposition of MAPF instances (Section 5), we observed that our method is highly effective for MAPF instances with passable cells exceeding twice the number of agents. On average, the time cost is around 1s and never exceeds 3s, even for dense instances with 800 to 1000 agents. Memory usage remains below 1MB, with fewer computations and memory space required for maps with more passable cells than agents.

When applied to the state-of-the-art methods (EECBS, PBS, LNS2, HCA*, Push and Swap, PIBT+, LaCAM2), Layered MAPF significantly reduces memory usage and time cost, particularly for serial MAPF methods. Consequently, Layered MAPF methods achieve higher success rates than raw MAPF methods, especially for serial MAPF. And the quality of solution for the Layered version of serial MAPF methods is similar to the raw version, while the Layered version of parallel MAPF methods produces inferior solutions due to the introduction of numerous wait actions during solution merging.

In conclusion, decomposition of MAPF instances is most beneficial for serial MAPF methods, resulting in reduced time cost and memory usage without sacrificing solution quality significantly. However, for parallel MAPF methods, decomposition may reduce memory usage but often worsens the solution without notable improvements in time cost.

Despite its effectiveness, Layered MAPF has limitations: it becomes less effective as the number of agents increases in dense instances, and its application to parallel MAPF methods introduces numerous wait actions during solution merging.

In the future, we plan to propose new merging solution techniques for parallel methods without compromising solution quality. Essentially, the decomposition of the instance is a trade-off between solution quality and efficiency, and a systematic exploration of this trade-off between runtime and solution quality is necessary. Given that the initial state of the cluster plays a crucial role in the decomposition process, a randomized restart approach could benefit the decomposition. We plan to generate different initial clusters during the bipartition phase and select the one that results in the best decomposition. While this process introduces some extra time cost, it may lead to better decomposition outcomes. Additionally, we aim to generalize the idea of decomposing MAPF instances to address extensions of MAPF problems, such as considering the shape of agents.

Appendix A Comparison between Layered MAPF and raw MAPF as the number of agent increasing

Figure A1. These figures illustrate how Layered EECBS and raw EECBS work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered EECBS are shown with a solid line, and the data for raw EECBS are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

Figure A2. These figures illustrate how Layered PBS and raw PBS work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered PBS are shown with a solid line, and the data for raw PBS are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

Figure A3. These figures illustrate how Layered LNS2 and raw LNS2 works under various of maps as number of agents increasing, they are compared in term of time cost, success rate, memory usage and solution quality (sum of cost and makespan). Layered LNS2’s data are shown in solid line and raw LNS2’s data are shown in dotted line. More details about maps can be found in Figure 15 and Figure 16.

Figure A4. These figures illustrate how Layered Push and Swap and raw Push and Swap work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered Push and Swap are shown with a solid line, and the data for raw Push and Swap are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

Figure A5. These figures illustrate how Layered HCA and raw HCA work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered HCA are shown with a solid line, and the data for raw HCA are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

Figure A6. These figures illustrate how Layered PIBT+ and raw PIBT+ work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered PIBT+ are shown with a solid line, and the data for raw PIBT+ are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

Figure A7. These figures illustrate how Layered LaCAM2 and raw LaCAM2 work under various maps as the number of agents increases. They are compared in terms of time cost, success rate, memory usage, and solution quality (sum of cost and makespan). The data for Layered LaCAM2 are shown with a solid line, and the data for raw LaCAM2 are shown with a dotted line. More details about the maps can be found in Figure 15 and Figure 16.

References

Hönig, W.; Kiesel, S.; Tinka, A.; Durham, J.W.; Ayanian, N. Persistent and robust execution of MAPF schedules in warehouses. IEEE Robotics and Automation Letters 2019, 4, 1125–1131. [Google Scholar]
Ho, F.; Geraldes, R.; Gonçalves, A.; Rigault, B.; Sportich, B.; Kubo, D.; Cavazza, M.; Prendinger, H. Decentralized multi-agent path finding for UAV traffic management. IEEE Transactions on Intelligent Transportation Systems 2020, 23, 997–1008. [Google Scholar] [CrossRef]
Li, J.; Chen, Z.; Harabor, D.; Stuckey, P.J.; Koenig, S. MAPF-LNS2: Fast repairing for multi-agent path finding via large neighborhood search. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2022, Vol. 36, pp. 10256–10265.
Ma, H.; Harabor, D.; Stuckey, P.J.; Li, J.; Koenig, S. Searching with consistent prioritization for multi-agent path finding. In Proceedings of the Proceedings of the AAAI conference on artificial intelligence, 2019, Vol. 33, pp. 7643–7650.
Sharon, G.; Stern, R.; Felner, A.; Sturtevant, N.R. Conflict-based search for optimal multi-agent pathfinding. Artificial intelligence 2015, 219, 40–66. [Google Scholar] [CrossRef]
Hatem, M.; Stern, R.; Ruml, W. Bounded suboptimal heuristic search in linear space. In Proceedings of the Proceedings of the International Symposium on Combinatorial Search, 2013, Vol. 4, pp. 98–104.
Li, J.; Ruml, W.; Koenig, S. Eecbs: A bounded-suboptimal search for multi-agent path finding. In Proceedings of the Proceedings of the AAAI conference on artificial intelligence, 2021, Vol. 35, pp. 12353–12362.
Thayer, J.T.; Ruml, W. Bounded suboptimal search: A direct approach using inadmissible estimates. In Proceedings of the IJCAI, 2011, Vol. 2011, pp. 674–679.
Boyarski, E.; Felner, A.; Sharon, G.; Stern, R. Don’t split, try to work it out: Bypassing conflicts in multi-agent pathfinding. In Proceedings of the Proceedings of the International Conference on Automated Planning and Scheduling, 2015, Vol. 25, pp. 47–51.
Li, J.; Harabor, D.; Stuckey, P.J.; Ma, H.; Koenig, S. Symmetry-breaking constraints for grid-based multi-agent path finding. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2019, Vol. 33, pp. 6087–6095.
Li, J.; Gange, G.; Harabor, D.; Stuckey, P.J.; Ma, H.; Koenig, S. New techniques for pairwise symmetry breaking in multi-agent path finding. In Proceedings of the Proceedings of the International Conference on Automated Planning and Scheduling, 2020, Vol. 30, pp. 193–201.
Li, J.; Felner, A.; Boyarski, E.; Ma, H.; Koenig, S. Improved Heuristics for Multi-Agent Path Finding with Conflict-Based Search. In Proceedings of the IJCAI, 2019, Vol. 2019, pp. 442–449.
Li, J.; Chen, Z.; Harabor, D.; Stuckey, P.J.; Koenig, S. Anytime multi-agent path finding via large neighborhood search. In Proceedings of the International Joint Conference on Artificial Intelligence 2021. Association for the Advancement of Artificial Intelligence (AAAI); 2021; pp. 4127–4135. [Google Scholar]
Chan, S.H.; Stern, R.; Felner, A.; Koenig, S. Greedy priority-based search for suboptimal multi-agent path finding. In Proceedings of the Proceedings of the International Symposium on Combinatorial Search, 2023, Vol. 16, pp. 11–19.
Okumura, K.; Machida, M.; Défago, X.; Tamura, Y. Priority Inheritance with Backtracking for Iterative Multi-agent Path Finding. In Proceedings of the Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19. International Joint Conferences on Artificial Intelligence Organization, 7 2019, pp. 535–542. [CrossRef]
Okumura, K.; Machida, M.; Défago, X.; Tamura, Y. Priority Inheritance with Backtracking for Iterative Multi-agent Path Finding. Artificial Intelligence 2022, p. 103752. [CrossRef]
Okumura, K. Lacam: Search-based algorithm for quick multi-agent pathfinding. In Proceedings of the Proceedings of the AAAI Conference on Artificial Intelligence, 2023, Vol. 37, pp. 11655–11662.
Okumura, K. Improving lacam for scalable eventually optimal multi-agent pathfinding. arXiv, 2023; arXiv:2305.03632. [Google Scholar]
Standley, T.S. Finding Optimal Solutions to Cooperative Pathfinding Problems. Proceedings of the AAAI Conference on Artificial Intelligence 2010.
Standley, T.S.; Korf, R.E. Complete Algorithms for Cooperative Pathfinding Problems. In Proceedings of the International Joint Conference on Artificial Intelligence, 2011.
Sharon, G.; Stern, R.; Felner, A.; Sturtevant, N.R. Meta-Agent Conflict-Based Search For Optimal Multi-Agent Path Finding. In Proceedings of the Symposium on Combinatorial Search, 2021.
vancara, J.; Vlk, M.; Stern, R.; Atzmon, D.; Barták, R. Online Multi-Agent Pathfinding. Proceedings of the AAAI Conference on Artificial Intelligence 2019, 33, 7732–7739. [Google Scholar]
Ma, H.; Li, J.; Kumar, T.K.S.; Koenig, S. Lifelong Multi-Agent Path Finding for Online Pickup and Delivery Tasks. International Foundation for Autonomous Agents and Multiagent Systems 2017. [Google Scholar]
Ma, H. A competitive analysis of online multi-agent path finding. In Proceedings of the Proceedings of the International Conference on Automated Planning and Scheduling, 2021, Vol. 31, pp. 234–242.
Atzmon, D.; Bernardini, S.; Fagnani, F.; Fairbairn, D. Exploiting geometric constraints in multi-agent pathfinding. In Proceedings of the Proceedings of the Thirty-Third International Conference on Automated Planning and Scheduling. AAAI Press, 2023, ICAPS ’23. [CrossRef]
Tarjan, R. Depth-first search and linear graph algorithms. SIAM journal on computing 1972, 1, 146–160. [Google Scholar]
Tarjan, R.E. Data structures and network algorithms; SIAM, 1983.
Cormen, T.H.; Leiserson, C.E.; Rivest, R.L.; Stein, C. Introduction to algorithms; MIT press, 2022.
Stern, R.; Sturtevant, N.R.; Felner, A.; Koenig, S.; Ma, H.; Walker, T.T.; Li, J.; Atzmon, D.; Cohen, L.; Kumar, T.K.S.; et al. Multi-Agent Pathfinding: Definitions, Variants, and Benchmarks. Symposium on Combinatorial Search (SoCS) 2019, pp. 151–158.
Surynek, P.; Švancara, J.; Felner, A.; Boyarski, E. Variants of independence detection in sat-based optimal multi-agent path finding. In Proceedings of the Agents and Artificial Intelligence: 9th International Conference, ICAART 2017, Porto, Portugal, February 24–26, 2017, Revised Selected Papers 9. Springer, 2018, pp. 116–136.
Luna, R.; Bekris, K.E. Push and swap: Fast cooperative path-finding with completeness guarantees. In Proceedings of the IJCAI, 2011, Vol. 11, pp. 294–300.
Silver, D. Cooperative pathfinding. In Proceedings of the Proceedings of the aaai conference on artificial intelligence and interactive digital entertainment, 2005, Vol. 1, pp. 117–122.

1	https://github.com/svancaj/HybridMAPF
2	https://github.com/Jiaoyang-Li/EECBS
3	https://github.com/Jiaoyang-Li/PBS
4	https://github.com/Jiaoyang-Li/MAPF-LNS2
5	https://github.com/Kei18/pibt2
6	https://github.com/Kei18/pibt2
7	https://github.com/Kei18/pibt2
8	https://github.com/Kei18/lacam2

Figure 1. These figures illustrate the complete process of decomposing a MAPF instance, which includes decomposing agents into initial clusters (a type of subproblem), further decomposing initial clusters into smaller clusters, and decomposing clusters into levels (another type of subproblem). For more details, please refer to Section 4.

Figure 2. These figures show a solvable MAPF instance (Fig. A) and an unsolvable MAPF instance (Fig. C).

S_{1}, S_{2}, S_{3}

and

T_{1}, T_{2}, T_{3}

represent the start and target cells of the three agents in the instance, as follows. The paths of each agent in the instance are shown in Figures B and D, respectively. The instance in Fig. A passes the solvability check, as all its agents have a path to their target. However, the instance in Fig. C does not pass the solvability check and is unsolvable because only the 3rd agent has a path to its target, while the 1st and 2nd agents have no path to their target. Grey cells represent unpassable cells, while white cells represent passable cells, as follows.

Figure 2. These figures show a solvable MAPF instance (Fig. A) and an unsolvable MAPF instance (Fig. C).

S_{1}, S_{2}, S_{3}

and

T_{1}, T_{2}, T_{3}

represent the start and target cells of the three agents in the instance, as follows. The paths of each agent in the instance are shown in Figures B and D, respectively. The instance in Fig. A passes the solvability check, as all its agents have a path to their target. However, the instance in Fig. C does not pass the solvability check and is unsolvable because only the 3rd agent has a path to its target, while the 1st and 2nd agents have no path to their target. Grey cells represent unpassable cells, while white cells represent passable cells, as follows.

Figure 3. This figure shows a MAPF instance that passes the mentioned solvability check but is unsolvable.

a_{1}

’s start

S_{1}

have path to its target

T_{1}

and

a_{2}

’s start

S_{2}

have path to its target

T_{2}

, if another agent’s start and target are passable. However, due to

a_{2}

always block the path from

S_{1}

to

T_{1}

, this instance is unsolvable.

Figure 3. This figure shows a MAPF instance that passes the mentioned solvability check but is unsolvable.

a_{1}

’s start

S_{1}

have path to its target

T_{1}

and

a_{2}

’s start

S_{2}

have path to its target

T_{2}

, if another agent’s start and target are passable. However, due to

a_{2}

always block the path from

S_{1}

to

T_{1}

, this instance is unsolvable.

Figure 4. This figure illustrates three subproblems (

1, {a_{1}}; 2, {a_{2}}; 3, {a_{3}}

) of the MAPF instance shown in Figure 2 A and checks whether these three subproblems pass the solvability check under the simplified scenario. As shown in the figure, the agent in each subproblem has a solution that avoids the target state of the previous subproblem and the start state of the next subproblem. Therefore, since every subproblem passes the solvability check, the instance can be decomposed into these three subproblems.

Figure 4. This figure illustrates three subproblems (

1, {a_{1}}; 2, {a_{2}}; 3, {a_{3}}

) of the MAPF instance shown in Figure 2 A and checks whether these three subproblems pass the solvability check under the simplified scenario. As shown in the figure, the agent in each subproblem has a solution that avoids the target state of the previous subproblem and the start state of the next subproblem. Therefore, since every subproblem passes the solvability check, the instance can be decomposed into these three subproblems.

Figure 5. This figure shows the total solution generated by adding wait actions at the start for the MAPF instance in Figure 4. In the total solution, only one agent from each subproblem (denoted as

a_{i}

) is moving (marked with a dashed-line box), while the agents from the other subproblems stay at their start or target states. Specifically,

a_{1}

(from subproblem 1) moves when

0 \leq t \leq 4

;

a_{2}

(from subproblem 2) moves when

5 \leq t \leq 9

; and

a_{3}

(from subproblem 3) moves when

10 \leq t \leq 13

.

Figure 5. This figure shows the total solution generated by adding wait actions at the start for the MAPF instance in Figure 4. In the total solution, only one agent from each subproblem (denoted as

a_{i}

) is moving (marked with a dashed-line box), while the agents from the other subproblems stay at their start or target states. Specifically,

a_{1}

(from subproblem 1) moves when

0 \leq t \leq 4

;

a_{2}

(from subproblem 2) moves when

5 \leq t \leq 9

; and

a_{3}

(from subproblem 3) moves when

10 \leq t \leq 13

.

Figure 6. This figure shows an instance where our decomposition may cause a loss of solvability. Our decomposition may result in the following three subproblems: 1, {

a_{1}

}; 2, {

a_{2}

,

a_{3}

}; 3, {

a_{4}

}. However, the second subproblem is unsolvable in the simplified scenario, as

T_{1}

and

S_{4}

are occupied.

Figure 6. This figure shows an instance where our decomposition may cause a loss of solvability. Our decomposition may result in the following three subproblems: 1, {

a_{1}

}; 2, {

a_{2}

,

a_{3}

}; 3, {

a_{4}

}. However, the second subproblem is unsolvable in the simplified scenario, as

T_{1}

and

S_{4}

are occupied.

Figure 8. These figures show the free cell groups (marked as

F_{1}, F_{2}, \dots

, in Figure A), the connectivity graph

G_{c}

(in Figure B), and the simplified connectivity graph (in Figure C) of the instance in Figure 2 A.

Figure 8. These figures show the free cell groups (marked as

F_{1}, F_{2}, \dots

, in Figure A), the connectivity graph

G_{c}

(in Figure B), and the simplified connectivity graph (in Figure C) of the instance in Figure 2 A.

Figure 11. These figures show the verification of three clusters from the instance in Figure 10. Figure A illustrates the process of decomposing the instance into three clusters, similar to Figure 1. Figures B, C, and D show the verification of the legality of the three clusters, i.e., checking whether there is a solution for each subproblem’s agents while assuming the other clusters’ agents occupy both their start and target states. Since we ensure that each cluster’s agents are only related to the agents in the current cluster, all clusters are legal. The solutions for each agent are shown as dotted arrowed lines in different colors.

Figure 12. These figures illustrate the decomposition of a MAPF instance into levels in a simple scenario. Figure A shows the instance, where “

S_{i}

” and “

T_{i}

” represent the start state and target state of agent

a_{i}

. Figure B presents the simplified connectivity graph of agents. Figure C depicts the relevant SATs for each agent, while Figure D shows the solving order graph (

G_{s}

). Figure E displays the initial levels determined by the solving order graph, i.e., all strongly connected components in

G_{s}

, along with the related level ordering graph

G_{l}

.

Figure 12. These figures illustrate the decomposition of a MAPF instance into levels in a simple scenario. Figure A shows the instance, where “

S_{i}

” and “

T_{i}

” represent the start state and target state of agent

a_{i}

. Figure B presents the simplified connectivity graph of agents. Figure C depicts the relevant SATs for each agent, while Figure D shows the solving order graph (

G_{s}

). Figure E displays the initial levels determined by the solving order graph, i.e., all strongly connected components in

G_{s}

, along with the related level ordering graph

G_{l}

.

Figure 13. These figures show the verification of four levels from the instance in Figure 9. We check whether there is a solution for each level’s agents while assuming that the previous level’s target states and the next level’s start states are occupied. Since we ensure that each level’s agents are not related to the previous level’s target state and the next level’s start state, all levels are considered legal. The corresponding solutions for each agent are shown as dotted arrowed lines in different colors.

Figure 14. This figure illustrates the case that maximize the time cost of biparition of cluster.

Figure 15. These figures show our decomposition’s performance under various of maps, as number of agents increases, in terms of decomposition rate, number of subproblems time cost and memory usage. We also present scale and number of passable cells (in brackets). The map index is also used in the subsequent figures.

Figure 16. These figures illustrate the performance of our decomposition method across various maps, as number of agents increases, focusing on decomposition rate, number of subproblems, time cost, and memory usage. Additionally, we provide information on the scale and number of passable cells (in brackets). The map index is also used in the subsequent figures.

Figure 20. These figures show the average time cost, success rate, maximum subproblem size, and number of subproblems for Layered EECBS and ID (using EECBS) across various maps (please refer to Figure 16 for the maps corresponding to the map indices). These figures summarize the data from Figure 19.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

LayeredMAPF: A Decomposition of Mapf Instance to Reduce Solving Costs

Abstract

Keywords:

Subject:

1. Introduction

2. Related Works

2.1. Conflict Based Search

2.2. Large Neighborhood Search

2.3. Priority Based Search

2.4. PIBT and LaCAM

2.4.1. Priority Inheritance with Backtracking (PIBT)

2.4.2. Lazy Constraints Addition for MAPF (LaCAM)

2.5. Independence Detection

2.6. SEQ(Sequence) and DSP(Delayed Shortest Path)

3. Preliminaries

3.1. Basic Definitions

3.2. Decomposition of MAPF Instance

3.2.1. Cluster and Level

3.2.2. Connectivity graph

3.2.3. Variants of Search Path

4. Methodology

4.1. Decomposetion to clusters

4.2. Bipartition of Clusters

4.3. Decomposing to Levels and Sorting

4.4. Solving and Combining

4.5. Time Complexity Analysis

5. Results of Decomposition

5.1. Decomposition rate

5.1.1. Performance under different maps

5.1.2. How each step contribute to decomposition

5.2. Number of subproblems

5.3. Time cost and memory usage

5.4. Comparison with Independence Detection

5.5. Summary

6. Results of decomposition’s application

6.1. Explicit Estimation CBS

6.2. Priority Based Search

6.3. Large Neighborhood Search 2

6.4. Push and Swap

6.5. Hierarchical Cooperative A∗

6.6. Priority Inheritance with Backtracking+

6.7. Lazy Constraints Addition for MAPF 2

6.8. Summary

7. Conclusion

Appendix A Comparison between Layered MAPF and raw MAPF as the number of agent increasing

References

MDPI Initiatives

Important Links

Subscribe