Abstract:Previous research focuses on approaches of deep reinforcement learning (DRL) to optimize diverse types of the single-objective dynamic flexible job shop scheduling problem (DFJSP), e.g., energy consumption, earliness and tardiness penalty and machine utilization rate, which gain many improvements in terms of objective metrics in comparison with metaheuristic algorithms such as GA (genetic algorithm) and dispatching rules such as MRT (most remaining time first). However, single-objective optimization in the job shop floor cannot satisfy the requirements of modern smart manufacturing systems, and the multiple-objective DFJSP has become mainstream and the core of intelligent workshops. A complex production environment in a real-world factory causes scheduling entities to have sophisticated characteristics, e.g., a job's non-uniform processing time, uncertainty of the operation number and restraint of the due time, avoidance of the single machine's prolonged slack time as well as overweight load, which make a method of the combination of dispatching rules in DRL brought up to adapt to the manufacturing environment at different rescheduling points and accumulate maximum rewards for a global optimum. In our work, we apply the structure of a dual layer DDQN (DLDDQN) to solve the DFJSP in real time with new job arrivals, and two objectives are optimized simultaneously, i.e., the minimization of the delay time sum and makespan. The framework includes two layers (agents): the higher one is named as a goal selector, which utilizes DDQN as a function approximator for selecting one reward form from six proposed ones that embody the two optimization objectives, while the lower one, called an actuator, utilizes DDQN to decide on an optimal rule that has a maximum Q value. The generated benchmark instances trained in our framework converged perfectly, and the comparative experiments validated the superiority and generality of the proposed DLDDQN.

Deep Reinforcement Learning Based Optimization Algorithm for Permutation Flow-Shop Scheduling

A Reinforcement Learning Approach to Robust Scheduling of Permutation Flow Shop

Intelligent Decision-Making of Scheduling for Dynamic Permutation Flowshop via Deep Reinforcement Learning

A Deep Reinforcement Learning Approach to the Flexible Flowshop Scheduling Problem with Makespan Minimization

A Knowledge-Guided End-to-End Optimization Framework Based on Reinforcement Learning for Flow Shop Scheduling

Solving non-permutation flow-shop scheduling problem via a novel deep reinforcement learning approach

Learning to Optimize Permutation Flow Shop Scheduling via Graph-based Imitation Learning

An Optimization Method for Green Permutation Flow Shop Scheduling Based on Deep Reinforcement Learning and MOEA/D

Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance

The marriage of operations research and reinforcement learning: Integration of NEH into Q-learning algorithm for the permutation flowshop scheduling problem

A Reinforcement Learning Environment For Job-Shop Scheduling

Efficient Multi-Objective Optimization on Dynamic Flexible Job Shop Scheduling Using Deep Reinforcement Learning Approach

Accelerating Exact Combinatorial Optimization via RL-based Initialization -- A Case Study in Scheduling

Parallel machine scheduling minimizing the mean weighted flow time

Evolutionary Computation and Reinforcement Learning Integrated Algorithm for Distributed Heterogeneous Flowshop Scheduling

An improved simulated annealing algorithm based on residual network for permutation flow shop scheduling

An Improved Artificial Bee Colony Algorithm With Q-Learning for Solving Permutation Flow-Shop Scheduling Problems

Reinforcement learning for robotic flow shop scheduling with processing time variations

A Cooperative Scatter Search with Reinforcement Learning Mechanism for the Distributed Permutation Flowshop Scheduling Problem with Sequence-Dependent Setup Times

Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling

Solving the flow-shop scheduling problem with human factors and two competing agents with deep reinforcement learning