Abstract: The Flexible Job Shop Scheduling Problem (FJSP), a classic NP-hard optimization problem, directly affects the efficiency of manufacturing systems. Because the FJSP requires both job selection and machine selection, it is more complex than the Job Shop Scheduling Problem (JSSP). To tackle it, we introduce a collaborative agent reinforcement learning (CARL) architecture, applied to this problem for the first time. To strengthen the Co-Markov decision process, we use disjunctive graphs to represent state features. However, existing representations of states and actions often lead to suboptimal solutions because of the intricate variability of the problem, so we refined how states and actions are represented. During the solving process, we employed a Graph Attention Network (GAT) to extract global state information from the disjunctive graph and a Transformer Encoder to quantitatively capture the competitive relationships among machines. We configured two independent encoder–decoder components, one for the job agent and one for the machine agent, so that two distinct action strategies can be generated. Finally, we employed the Soft Actor–Critic (SAC) algorithm and an integrated Deep Q Network (DQN), known as D5QN, to train the decision network parameters of the job and machine agents. Our experiments showed that after just one training session, the collaborative agents acquired effective scheduling strategies. These strategies produce better solutions than traditional Priority Dispatching Rules (PDR) and outperform the results of some metaheuristic and reinforcement learning algorithms. They also run faster than OR-Tools. Moreover, the empirical results on both randomized and benchmark instances indicate that the learned policies remain robust in practical, large-scale scenarios. Notably, on the DPpaulli dataset, which is characterized by a considerable imbalance between the number of operations and the number of machines, our approach achieved optimality in 11 out of 18 FJSP instances.
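
As a rough illustration of the disjunctive-graph state representation mentioned in the abstract, the following sketch builds such a graph for a toy FJSP instance. It is not the authors' implementation: the instance data, the function name `build_disjunctive_graph`, and the choice to link two operations whenever they share at least one eligible machine are all assumptions made for this example.

```python
from itertools import combinations

# Hypothetical toy instance: jobs[j][k] maps each eligible machine
# of operation k of job j to its processing time.
jobs = [
    [{0: 3, 1: 5}, {1: 2}],   # job 0: two operations
    [{0: 4}, {0: 3, 1: 6}],   # job 1: two operations
]

def build_disjunctive_graph(jobs):
    """Return operation nodes, conjunctive arcs, and disjunctive edges."""
    ops = [(j, k) for j, job in enumerate(jobs) for k in range(len(job))]

    # Conjunctive arcs: fixed precedence within each job,
    # framed by dummy "source" and "sink" nodes.
    conjunctive = []
    for j, job in enumerate(jobs):
        chain = ["source"] + [(j, k) for k in range(len(job))] + ["sink"]
        conjunctive.extend(zip(chain, chain[1:]))

    # Disjunctive edges (assumption): undirected links between operations
    # of different jobs that could compete for at least one machine.
    disjunctive = []
    for a, b in combinations(ops, 2):
        if a[0] == b[0]:
            continue  # same job: order is already fixed by a conjunctive arc
        shared = set(jobs[a[0]][a[1]]) & set(jobs[b[0]][b[1]])
        if shared:
            disjunctive.append((a, b, sorted(shared)))
    return ops, conjunctive, disjunctive

ops, conjunctive, disjunctive = build_disjunctive_graph(jobs)
for a, b, machines in disjunctive:
    print(f"{a} <-> {b} may compete on machines {machines}")
```

In a learning pipeline, a graph of this kind would typically be turned into node features and an adjacency structure before being passed to a graph network; a schedule corresponds to assigning each operation a machine and orienting the remaining disjunctive edges.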

Cooperative multi-agent reinforcement learning for multi-area integrated scheduling in wafer fabs

Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling

A reinforcement learning-based approach for solving multi-agent job shop scheduling problem

Dynamic flexible scheduling with transportation constraints by multi-agent reinforcement learning

Multi-Agent Reinforcement Learning for Real-Time Dynamic Production Scheduling in a Robot Assembly Cell

Multi-Task Multi-Agent Reinforcement Learning for Real-Time Scheduling of a Dual-Resource Flexible Job Shop with Robots

A Deep Multi-Agent Reinforcement Learning Approach to Solve Dynamic Job Shop Scheduling Problem

End-to-end Multi-Target Flexible Job Shop Scheduling with Deep Reinforcement Learning

Reinforcement Learning Approach for Multi-Agent Flexible Scheduling Problems

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

Multi-objective reinforcement learning framework for dynamic flexible job shop scheduling problem with uncertain events

Integrated scheduling optimization of U-shaped automated container terminal under loading and unloading mode

Solving job scheduling problems in a resource preemption environment with multi-agent reinforcement learning

A Reinforcement Learning Based Approach For Multi-Projects Scheduling In Cloud Manufacturing

Multi agent reinforcement learning for online layout planning and scheduling in flexible assembly systems

A novel collaborative agent reinforcement learning framework based on an attention mechanism and disjunctive graph embedding for flexible job shop scheduling problem

Multi-agent reinforcement learning framework for real-time scheduling of pump and valve in water distribution networks

Multi-agent-based deep reinforcement learning for dynamic flexible job shop scheduling

Multi-Agent Reinforcement Learning for Job Shop Scheduling in Dynamic Environments

Research on Multi-AGVs dynamic scheduling based on deep reinforcement learning

Evolutionary Computation and Reinforcement Learning Integrated Algorithm for Distributed Heterogeneous Flowshop Scheduling