Scalable Multi-Robot Task Allocation Using Graph Deep Reinforcement Learning with Graph Normalization

Zhenqiang Zhang,Xiangyuan Jiang,Zhenfa Yang,Sile Ma,Jiyang Chen,Wenxu Sun
DOI: https://doi.org/10.3390/electronics13081561
IF: 2.9
2024-04-20
Electronics
Abstract:Task allocation plays an important role in multi-robot systems regarding team efficiency. Conventional heuristic or meta-heuristic methods face difficulties in generating satisfactory solutions in a reasonable computational time, particularly for large-scale multi-robot task allocation problems. This paper proposes a novel graph deep-reinforcement-learning-based approach, which solves the problem through learning. The framework leverages the graph sample and aggregate concept as the encoder to extract the node features in the context of the graph, followed by a cross-attention decoder to output the probability that each task is allocated to each robot. A graph normalization technique is also proposed prior to the input, enabling an easy adaption to real-world applications, and a deterministic solution can be guaranteed. The most important advantage of this architecture is the scalability and quick feed-forward character; regardless of whether cases have a varying number of robots or tasks, single depots, multiple depots, or even mixed single and multiple depots, solutions can be output with little computational effort. The high efficiency and robustness of the proposed method are confirmed by extensive experiments in this paper, and various multi-robot task allocation scenarios demonstrate its advantage.
engineering, electrical & electronic,computer science, information systems,physics, applied
What problem does this paper attempt to address?
### Problems Addressed by the Paper The paper aims to address the efficiency and scalability issues in large-scale Multi-Robot Task Allocation (MRTA). Traditional heuristic or meta-heuristic methods face difficulties in generating satisfactory solutions, especially when dealing with large-scale MRTA problems, as they require excessive computation time. This paper proposes a novel approach based on Graph Deep Reinforcement Learning (DRL) to solve this problem through learning. Specifically, the paper focuses on Single-Task Robots (ST), Single-Robot Tasks (SR), and Time-Extended Allocation (TA) problems (ST-SR-TA), which involve task allocation and scheduling. This scenario is more realistic in mobile robots, as they are typically designed to perform one task at a time, and in most cases, tasks require only one robot to complete or can be decomposed into task elements that a single robot can handle. ### Main Contributions 1. **Scalability**: The proposed network structure can adapt to changes in the number of robots and tasks, as well as single warehouse, multiple warehouses, or mixed warehouse scenarios. 2. **Lightweight**: The method significantly reduces the computational burden. 3. **Robustness**: The method maintains performance regardless of how the input data is scaled, translated, or rotated. ### Method Overview 1. **Graph Normalization**: Normalizes input features before the encoder to eliminate the effects of translation, scaling, and rotation of the configuration. 2. **Encoder**: Uses the GraphSAGE concept to extract node features, considering the context information of the graph. 3. **Decoder**: Outputs the probability of each task being assigned to each robot through a cross-attention mechanism. 4. **Route Planner**: Uses the Lin–Kernighan–Helsgaun (LKH) solver to generate the optimal route for each robot. ### Experimental Validation The efficiency and robustness of the method are validated through extensive experiments, demonstrating its advantages in various multi-robot task allocation scenarios. Experimental results show that the method has significant advantages in handling large-scale problems, generating high-quality solutions in a shorter time.