Abstract:Personalized orders bring challenges to the production paradigm, and there is an urgent need for the dynamic responsiveness and self-adjustment ability of the workshop. Traditional dispatching rules and heuristic algorithms solve the production planning and control problems by making schedules. However, the previous methods cannot work well in a changeable workshop environment when encountering a large number of stochastic disturbances of orders and resources. Recently, the potential of artificial intelligence (AI) algorithms in solving the dynamic scheduling problem has attracted researchers' attention. Therefore, this paper presents a multi-agent manufacturing system based on deep reinforcement learning (DRL), which integrates the self-organization mechanism and self-learning strategy. Firstly, the manufacturing equipment in the workshop is constructed as an equipment agent with the support of edge computing node, and an improved contract network protocol (CNP) is applied to guide the cooperation and competition among multiple agents, so as to complete personalized orders efficiently. Secondly, a multi-layer perceptron is employed to establish the decision-making module called AI scheduler inside the equipment agent. According to the perceived workshop state information, AI scheduler intelligently generates an optimal production strategy to perform task allocation. Then, based on the collected sample trajectories of scheduling process, AI scheduler is periodically trained and updated through the proximal policy optimization (PPO) algorithm to improve its decision-making performance. Finally, in the multi-agent manufacturing system testbed, dynamic events such as stochastic job insertions and unpredictable machine failures are considered in the verification experiments. The experimental results show that the proposed method is capable of obtaining the scheduling solutions that meet various performance metrics, as well as dealing with resource or task disturbances efficiently and autonomously.

A deep reinforcement learning method for multi-stage equipment development planning in uncertain environments

Adaptive Disassembly Sequence Planning for VR Maintenance Training Via Deep Reinforcement Learning

Spacecraft Attitude Maneuver Planning Based on Deep Reinforcement Learning under Complex Constraints

Hand-in-Hand Guidance: an Explore-Exploit Based Reinforcement Learning Method for Performance Driven Assembly-Adjustment

Dynamic scheduling of decentralized high-end equipment R&D projects via deep reinforcement learning

Dynamic Integrated Scheduling of Production Equipment and Automated Guided Vehicles in a Flexible Job Shop Based on Deep Reinforcement Learning

Adaptive Deep Reinforcement Learning for Non-Stationary Environments

Deep Reinforcement Learning Approach for Capacitated Supply Chain optimization under Demand Uncertainty

Deep reinforcement learning for dynamic distributed job shop scheduling problem with transfers

A Cooperative Hierarchical Deep Reinforcement Learning based Multi-agent Method for Distributed Job Shop Scheduling Problem with Random Job Arrivals

A Dual-System Reinforcement Learning Method for Flexible Job Shop Dynamic Scheduling

Dynamic job shop scheduling based on deep reinforcement learning for multi-agent manufacturing systems

A novel method based on deep reinforcement learning for machining process route planning

Deep Reinforcement Learning for Dynamic Flexible Job Shop Scheduling with Random Job Arrival

Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Dynamic scheduling for flexible job shop using a deep reinforcement learning approach

Real-time dispatch of an integrated energy system based on multi-stage reinforcement learning with an improved action-choosing strategy

Deep Reinforcement Learning in Nonstationary Environments With Unknown Change Points

Deep Reinforcement Learning Based Trajectory Planning Under Uncertain Constraints

Digital twin and deep reinforcement learning enabled real-time scheduling for complex product flexible shop-floor

Deep reinforcement learning applied to an assembly sequence planning problem with user preferences