Multi-agent Reinforcement Learning for Dynamic Dispatching in Material Handling Systems

Xian Yeow Lee, Haiyan Wang, Daisuke Katsumata, Takaharu Matsui, Chetan Gupta

2024-09-27

Abstract: This paper proposes a multi-agent reinforcement learning (MARL) approach to learn dynamic dispatching strategies, which is crucial for optimizing throughput in material handling systems across diverse industries. To benchmark our method, we developed a material handling environment that reflects the complexities of an actual system, such as various activities at different locations, physical constraints, and inherent uncertainties. To enhance exploration during learning, we propose a method to integrate domain knowledge in the form of existing dynamic dispatching heuristics. Our experimental results show that our method can outperform heuristics by up to 7.4 percent in terms of median throughput. Additionally, we analyze the effect of different architectures on MARL performance when training multiple agents with different functions. We also demonstrate that the MARL agents' performance can be further improved by using the first iteration of MARL agents as heuristics to train a second iteration of MARL agents. This work demonstrates the potential of applying MARL to learn effective dynamic dispatching strategies that may be deployed in real-world systems to improve business outcomes.

Machine Learning, Artificial Intelligence, Multiagent Systems

What problem does this paper attempt to address?

The paper addresses the problem of optimizing throughput in material handling systems through dynamic dispatching. Specifically, the authors propose a multi-agent reinforcement learning (MARL) approach to learn dynamic dispatching policies that overcome the limitations of traditional heuristic dispatching rules in complex material handling systems. The difficulties these rules face include inherent system uncertainties, complex interactions between subprocesses, and system changes due to business expansion or contraction. To benchmark the proposed method, the authors developed a material handling environment that simulates a real-world system, reflecting various activities at different locations, physical constraints, and inherent uncertainties. To enhance exploration during learning, they also propose a method for integrating domain knowledge, in the form of existing dynamic dispatching heuristics, into the learning process. Experimental results show that the proposed method can outperform heuristics by up to 7.4% in median throughput. The authors further analyze the effect of different architectures on MARL performance when training multiple agents with different functions, and show that using the first iteration of MARL agents as heuristics to train a second iteration of MARL agents can improve performance further. This work demonstrates the potential of applying MARL to learn effective dynamic dispatching policies that may be deployed in real-world systems to improve business outcomes.
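The idea of using an existing dispatching heuristic to guide exploration can be illustrated with a small sketch. The summary above does not specify how the authors integrate the heuristic, so the mixing rule below is an assumption made for illustration only: during an exploration step, the agent follows the heuristic's suggestion with some probability instead of always sampling a uniformly random action. The names `select_action`, `shortest_queue_heuristic`, `explore_prob`, and `heuristic_prob` are hypothetical and do not come from the paper.

```python
import random
from typing import Sequence


def shortest_queue_heuristic(queue_lengths: Sequence[int]) -> int:
    """Hypothetical dispatching rule: send the next item to the shortest queue."""
    return min(range(len(queue_lengths)), key=lambda i: queue_lengths[i])


def select_action(
    q_values: Sequence[float],
    heuristic_action: int,
    explore_prob: float,
    heuristic_prob: float,
    rng: random.Random,
) -> int:
    """Choose an action index with heuristic-guided exploration.

    Assumed rule (not taken from the paper):
    - with probability explore_prob the agent explores;
      - while exploring, with probability heuristic_prob it copies the
        heuristic's action, otherwise it samples uniformly at random;
    - otherwise it acts greedily on its own value estimates.
    """
    if rng.random() < explore_prob:
        if rng.random() < heuristic_prob:
            return heuristic_action
        return rng.randrange(len(q_values))
    # Greedy choice: index of the largest estimated value.
    return max(range(len(q_values)), key=lambda a: q_values[a])


if __name__ == "__main__":
    rng = random.Random(0)
    queues = [3, 1, 2]             # illustrative queue lengths per destination
    q_values = [0.1, 0.9, 0.3]     # illustrative value estimates per destination
    suggestion = shortest_queue_heuristic(queues)
    action = select_action(q_values, suggestion, 0.2, 0.5, rng)
    print("heuristic suggests", suggestion, "agent picks", action)
```

Under this assumed scheme, setting `heuristic_prob` to 0 recovers ordinary epsilon-greedy exploration, and passing a trained agent's policy in place of the heuristic would correspond loosely to the second-iteration training described above.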

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding

Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

Scalable Multi-agent Reinforcement Learning for Factory-wide Dynamic Scheduling

Multi-Agent Decision Transformers for Dynamic Dispatching in Material Handling Systems Leveraging Enterprise Big Data

Efficient Ridesharing Order Dispatching with Mean Field Multi-Agent Reinforcement Learning

Can Sophisticated Dispatching Strategy Acquired by Reinforcement Learning? - A Case Study in Dynamic Courier Dispatching System

Dynamic Dispatching for Large-Scale Heterogeneous Fleet via Multi-agent Deep Reinforcement Learning

Real-Time Multi-Vehicle Scheduling in Tasks With Dependency Relationships Using Multi-Agent Reinforcement Learning

Toward Energy-Efficient Routing of Multiple AGVs with Multi-Agent Reinforcement Learning

Multi-Agent Reinforcement Learning for Real-Time Dynamic Production Scheduling in a Robot Assembly Cell

Multi-agent-based deep reinforcement learning for dynamic flexible job shop scheduling

Scalable Multi-Agent Reinforcement Learning for Residential Load Scheduling under Data Governance

Towards Efficient Multi-Agent Learning Systems

Multi-Agent Reinforcement Learning for Network Load Balancing in Data Center

Research on Multi-AGVs dynamic scheduling based on deep reinforcement learning

Constrained Reinforcement Learning for Dynamic Material Handling

Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning

Hierarchical multi-agent reinforcement learning for repair crews dispatch control towards multi-energy microgrid resilience

A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems