Abstract:Actual manufacturing process scheduling in enterprise alliances are multi-task scheduling problems involving dynamic factors, and the competition and conflict for manufacturing resources also exist between multi-tasks. How to perform adaptive multi-objective scheduling of multi-tasks based on the real-time state of the manufacturing environment becomes critical. Therefore, this paper constructs an adaptive multi-task multi-objective scheduling considering resource competition and conflict among tasks (AMMS-RCCT) model based on the enterprise alliance value net, and adopts a hybrid strategy of “parallel+serial” to resolve conflicts while reducing the waiting time of tasks. With the objective of optimizing the total manufacturing time and total manufacturing cost, an adaptive multi-objective deep Q network (AMDQN) is proposed to solve the AMMS-RCCT model. AMDQN is based on a two-hierarchy deep reinforcement learning architecture containing a front controller deep Q network (C-DQN) and a back actuator deep Q network (A-DQN), which performs hierarchical decision-making on optimization objectives and scheduling rules to achieve compromise between multiple objectives while reducing the complexity for optimal selection scheduling rules. For the two optimization objectives of time and cost, two reward algorithms are proposed by introducing two metrics, the estimated tardiness rate and the estimated overspend rate, which guide the A-DQN to learn and adjust the scheduling rules according to the state changes. Besides, nine composite scheduling rules are designed to adapt to the dynamic manufacturing environment from multiple dimensions such as task urgency and completion rate as well as manufacturing resource utilization and cost. Finally, AMDQN is experimentally compared with the proposed nine composite scheduling rules, scheduling rules in existing research, and other scheduling methods based on reinforcement learning in simulated manufacturing environments with different numbers of tasks, subtasks, and manufacturing cells. The experimental results verify the effectiveness and superiority of AMDQN for multi-objective adaptive scheduling in multi-task scheduling problems.

A New Multi-Domain Cooperative Resource Scheduling Method Using Proximal Policy Optimization

Multi-task Deep Reinforcement Learning for Scalable Parallel Task Scheduling.

Dynamic scheduling of decentralized high-end equipment R&D projects via deep reinforcement learning

Hybrid Edge-Cloud Collaborator Resource Scheduling Approach Based on Deep Reinforcement Learning and Multiobjective Optimization

A Deep Reinforcement Learning Approach for Resource-Constrained Project Scheduling

A Multi-Policy Deep Reinforcement Learning Approach for Multi-Objective Joint Routing and Scheduling in Deterministic Networks

A Cooperative Hierarchical Deep Reinforcement Learning based Multi-agent Method for Distributed Job Shop Scheduling Problem with Random Job Arrivals

A Multi-Objective Reinforcement Learning Algorithm for Deadline Constrained Scientific Workflow Scheduling in Clouds

Efficient Multi-Objective Optimization on Dynamic Flexible Job Shop Scheduling Using Deep Reinforcement Learning Approach

Multi-Dimensional Resource Allocation in Distributed Data Centers Using Deep Reinforcement Learning

Hybrid Task Scheduling in Cloud Manufacturing with Sparse-Reward Deep Reinforcement Learning

Multi-Objective Deep Reinforcement Learning Assisted Resource Allocation for MEC-Caching-coexist System

Collision-aware Multi-robot Motion Coordination Deep-RL with Dynamic Priority Strategy

DL-DRL: A double-level deep reinforcement learning approach for large-scale task scheduling of multi-UAV

Multi-resource constrained dynamic workshop scheduling based on proximal policy optimisation

Distributed Resource Scheduling for Large-Scale MEC Systems: A Multiagent Ensemble Deep Reinforcement Learning With Imitation Acceleration

A2C-DRL: Dynamic Scheduling for Stochastic Edge-Cloud Environments Using A2C and Deep Reinforcement Learning

Multi-Resource Scheduling for Multiple Service Function Chains with Deep Reinforcement Learning

Energy-Efficient Collaborative Multi-Access Edge Computing Via Deep Reinforcement Learning

Distributed Resource Scheduling for Large-Scale MEC Systems: A Multi-Agent Ensemble Deep Reinforcement Learning with Imitation Acceleration

An Adaptive Multi-Objective Multi-Task Scheduling Method by Hierarchical Deep Reinforcement Learning