Abstract:Personalized orders bring challenges to the production paradigm, and there is an urgent need for the dynamic responsiveness and self-adjustment ability of the workshop. Traditional dispatching rules and heuristic algorithms solve the production planning and control problems by making schedules. However, the previous methods cannot work well in a changeable workshop environment when encountering a large number of stochastic disturbances of orders and resources. Recently, the potential of artificial intelligence (AI) algorithms in solving the dynamic scheduling problem has attracted researchers' attention. Therefore, this paper presents a multi-agent manufacturing system based on deep reinforcement learning (DRL), which integrates the self-organization mechanism and self-learning strategy. Firstly, the manufacturing equipment in the workshop is constructed as an equipment agent with the support of edge computing node, and an improved contract network protocol (CNP) is applied to guide the cooperation and competition among multiple agents, so as to complete personalized orders efficiently. Secondly, a multi-layer perceptron is employed to establish the decision-making module called AI scheduler inside the equipment agent. According to the perceived workshop state information, AI scheduler intelligently generates an optimal production strategy to perform task allocation. Then, based on the collected sample trajectories of scheduling process, AI scheduler is periodically trained and updated through the proximal policy optimization (PPO) algorithm to improve its decision-making performance. Finally, in the multi-agent manufacturing system testbed, dynamic events such as stochastic job insertions and unpredictable machine failures are considered in the verification experiments. The experimental results show that the proposed method is capable of obtaining the scheduling solutions that meet various performance metrics, as well as dealing with resource or task disturbances efficiently and autonomously.

A stable method for task priority adaptation in quadratic programming via reinforcement learning

Iteratively Successive Projection: A Novel Continuous Approach for the Task-Based Control of Redundant Robots.

Task-Priority Control of Redundant Robotic Systems using Control Lyapunov and Control Barrier Function based Quadratic Programs

An Overview of Multi-task Control for Redundant Robot Based on Quadratic Programming

Robust Task-Space Quadratic Programming for Kinematic-Controlled Robots

A Dual-System Reinforcement Learning Method for Flexible Job Shop Dynamic Scheduling

Integrating Robot Assignment and Maintenance Management: A Multi-Agent Reinforcement Learning Approach for Holistic Control

Parallel and Proximal Constrained Linear-Quadratic Methods for Real-Time Nonlinear MPC

A Dynamic Architecture for Task Assignment and Scheduling for Collaborative Robotic Cells

Q-CP: Learning Action Values for Cooperative Planning

Prioritized Optimal Control

Impact-Aware Task-Space Quadratic-Programming Control

A multi-level action coupling reinforcement learning approach for online two-stage flexible assembly flow shop scheduling

Deep reinforcement learning on variable stiffness compliant control for programming-free robotic assembly in smart manufacturing

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

Hierarchical Reinforcement Learning for Multi-Objective Real-Time Flexible Scheduling in a Smart Shop Floor

Adaptive scheduling for assembly job shop with uncertain assembly times based on dual Q-learning

Direction-Constrained Control for Efficient Physical Human-Robot Interaction under Hierarchical Tasks

Constraint Handling in Continuous-Time DDP-Based Model Predictive Control

Accelerating reinforcement learning with case-based model-assisted experience augmentation for process control

Multi-Objective Optimization of AGV Real-Time Scheduling Based on Deep Reinforcement Learning