Abstract:Intelligent decision-making systems that can solve task allocation problems are critical for multi-robot systems to conduct industrial applications in a collaborative and automated way, such as warehouse inspection using mobile robots, hydrographic surveying using unmanned surface vehicles, etc. This paper, therefore, aims to address the task allocation problem for multi-agent autonomous mobile systems to autonomously and intelligently allocate multiple tasks to a fleet of robots. Such a problem is normally regarded as an independent decision-making process decoupled from the following task planning for the member robots. To avoid the sub-optimal allocation caused by the decoupling, an end-to-end task allocation framework is proposed to tackle this combinatorial optimisation problem while taking the succeeding task planning into account during the optimisation process. The problem is formulated as a special variant of the multi-depot multiple travelling salesmen problem (mTSP). The proposed end-to-end task allocation framework employs deep reinforcement learning methods to replace the handcrafted heuristics used in previous works. The proposed framework features a modular design of the reinforcement learning agent which can be customised for various applications. Moreover, a real-robot implementation setup based on the Robot Operating System 2 is presented to fulfil the simulation-to-reality gap. A warehouse inspection mission is executed to validate the training outcome of the proposed framework. The framework has been cross-validated via both simulated and real-robot tests with various parameter settings, where adaptability and performance are well demonstrated. Note to Practitioners—This paper is motivated by the problem of dispatching a fleet of autonomous mobile robots to tackle a mission that can be resolved into multiple waypoint-following tasks. An end-to-end modular framework is proposed, making task allocation decisions based on the given waypoint information. By using the reinforcement learning technique, the deep neural network could learn sophisticated policies for allocating tasks. The policies are trained in a specific pattern which ensures their joint optimisation for a solver that outputs the near optimal task execution sequences in an efficient way. This leads to a multiple travelling salesmen problem (mTSP) solution. Pre-trained policies are tested in several industrial scenarios reflecting the applications of search and rescue, maritime surveying, and warehouse automation, among others. A hardware implementation configuration based on the Robot Operating System 2 is also presented to support the practical deployment the framework.

A deep reinforcement learning hyper-heuristic to solve order batching problem with mobile robots

Optimizing Robotic Mobile Fulfillment Systems for Order Picking Based on Deep Reinforcement Learning

A Deep Reinforcement Learning Hyper-Heuristic with Feature Fusion for Online Packing Problems

Towards reliable robot packing system based on deep reinforcement learning

Robot Online 3D Bin Packing Strategy Based on Deep Reinforcement Learning and 3D Vision

Genetic Scheduling and Reinforcement Learning in Multirobot Systems for Intelligent Warehouses

A Hierarchical Reinforcement Learning Based Optimization Framework for Large-scale Dynamic Pickup and Delivery Problems

Integrating Heuristic Methods with Deep Reinforcement Learning for Online 3D Bin-Packing Optimization

Multi-Objective Order Scheduling via Reinforcement Learning

A rule-based heuristic algorithm for joint order batching and delivery planning of online retailers with multiple order pickers

How to Deploy Robotic Mobile Fulfillment Systems

Deep Reinforcement Learning for Task Assignment and Shelf Reallocation in Smart Warehouses

An End-to-End Deep Reinforcement Learning Based Modular Task Allocation Framework for Autonomous Mobile Systems

An Anticipative Order Reservation and Online Order Batching Algorithm Based on Machine Learning

An Efficient Deep Reinforcement Learning Model for Online 3D Bin Packing Combining Object Rearrangement and Stable Placement

Deep Reinforcement Learning for Picker Routing Problem in Warehousing

Bin Packing Optimization via Deep Reinforcement Learning

Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking

Solving a New 3D Bin Packing Problem with Deep Reinforcement Learning Method

Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations

A Deep Reinforcement Learning Approach for Online Parcel Assignment