Abstract:The high-quality development of the manufacturing industry necessitates accelerating its transformation towards high-end, intelligent, and green development. Considering logistics resource constraints, the impact of dynamic disturbance events on production, and the need for energy-efficient production, the integrated scheduling of production equipment and automated guided vehicles (AGVs) in a flexible job shop environment is investigated in this study. Firstly, a static model for the integrated scheduling of production equipment and AGVs (ISPEA) is developed based on mixed-integer programming, which aims to optimize the maximum completion time and total production energy consumption (EC). In recent years, reinforcement learning, including deep reinforcement learning (DRL), has demonstrated significant advantages in handling workshop scheduling issues with sequential decision-making characteristics, which can fully utilize the vast quantity of historical data accumulated in the workshop and adjust production plans in a timely manner based on changes in production conditions and demand. Accordingly, a DRL-based approach is introduced to address the common production disturbances in emergency order insertions. Combined with the characteristics of the ISPEA problem and an event-driven strategy for handling dynamic events, four types of agents, namely workpiece selection, machine selection, AGV selection, and target selection agents, are set up, which refine workshop production status characteristics as observation inputs and generate rules for selecting workpieces, machines, AGVs, and targets. These agents are trained offline using the QMIX multi-agent reinforcement learning framework, and the trained agents are utilized to solve the dynamic ISPEA problem. Finally, the effectiveness of the proposed model and method is validated through a comparison of the solution performance with other typical optimization algorithms for various cases.

Flow-Shop Scheduling Problem with Batch Processing Machines Via Deep Reinforcement Learning for Industrial Internet of Things

Deep-Reinforcement-Learning-Based Production Scheduling in Industrial Internet of Things

Intelligent Decision-Making of Scheduling for Dynamic Permutation Flowshop via Deep Reinforcement Learning

Solving non-permutation flow-shop scheduling problem via a novel deep reinforcement learning approach

Solving the flow-shop scheduling problem with human factors and two competing agents with deep reinforcement learning

A Deep Reinforcement Learning Method Based on a Transformer Model for the Flexible Job Shop Scheduling Problem

A Deep Reinforcement Learning Approach to the Flexible Flowshop Scheduling Problem with Makespan Minimization

Integration of deep reinforcement learning and multi-agent system for dynamic scheduling of re-entrant hybrid flow shop considering worker fatigue and skill levels

Solving flexible job shop scheduling problems via deep reinforcement learning

Hierarchical Reinforcement Learning for Multi-Objective Real-Time Flexible Scheduling in a Smart Shop Floor

Dynamic job-shop scheduling in smart manufacturing using deep reinforcement learning

Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival

Deep Reinforcement Learning for Distributed Flow Shop Scheduling with Flexible Maintenance

Deep Reinforcement Learning for Scheduling in an Edge Computing-Based Industrial Internet of Things

A Framework For Scheduling In Cloud Manufacturing With Deep Reinforcement Learning

Reinforcement Learning with Composite Rewards for Production Scheduling in a Smart Factory.

Bilevel learning for large-scale flexible flow shop scheduling

Logistics-involved task scheduling in cloud manufacturing with offline deep reinforcement learning

Dynamic Integrated Scheduling of Production Equipment and Automated Guided Vehicles in a Flexible Job Shop Based on Deep Reinforcement Learning

Reinforcement learning for robotic flow shop scheduling with processing time variations

Intelligent Scheduling of Discrete Automated Production Line Via Deep Reinforcement Learning