Abstract:The rapid development of cloud computing and the Internet of Things (IoT) have facilitated near real-time optimization of the group distributed manufacturing systems. Currently, the most common technique to accomplish near-real-time optimization is cloud–edge cooperation for offloading optimization tasks. The tasks are partially offloaded to the cloud to be completed, and the remaining are kept at the edge. Due to the complexity of task offloading, such as capacity restrictions of cloud and edge computing resources, or task deadlines, unbalanced or insufficient tasks are offloaded to cloud and edge, causing time delay. To address the imbalance and insufficiency in the task offloading process, a mixed-integer programming model was developed to reduce the latency of task calculation. The task offloading problem is decomposed into two sub-problems: 1) Defining priorities for the tasks in near real-time. 2) Determining if the task is offloaded to the cloud. A multi-agent deep reinforcement learning with attention mechanism (MaDRLAM) framework is proposed to solve the two-step decision problem. The MaDRLAM framework consists of two agents, and each agent corresponds to a sub-problem. Each agent comprises an encoder and a decoder, and the two agents cooperate in devising an offloading strategy for the tasks. The Encoder and Decoder built for each agent are based on the Transformer structure. Unlike the traditional Transformer, we added the Pointer networks to the Transformer to solve the proposed decision problem. Besides, an improved multi-actor and single-critic strategy based on the REINFORCE algorithm is designed to train the proposed MaDRLAM. Finally, Extensive computational experiments are conducted on instances with a varying number of tasks, different task data sizes, and different cloud computing capacities. Computational results show that the proposed framework can find a solution with a GAP value of less than 1% within 1 s for each instance. The proposed framework is competitive in both solution accuracy and solution time compared with other offloading strategies.

Multi-Agent Deep Reinforcement Learning for Multi-Echelon Inventory Management

Learning to Cooperate: Application of Deep Reinforcement Learning for Online AGV Path Finding.

Cooperative Multi-Agent Reinforcement Learning for Inventory Management

An Analysis of Multi-Agent Reinforcement Learning for Decentralized Inventory Control Systems

Solving Inventory Management Problems Through Deep Reinforcement Learning

Deep Reinforcement Learning Approach for Capacitated Supply Chain optimization under Demand Uncertainty

Deep Reinforcement Learning for Large-Scale Inventory Management

A multi-agent deep reinforcement learning approach for solving the multi-depot vehicle routing problem

Multi-echelon inventory optimization using deep reinforcement learning

Can Deep Reinforcement Learning Improve Inventory Management? Performance on Dual Sourcing, Lost Sales and Multi-Echelon Problems

Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

Edge-conditioned vector basis functions for the analysis and optimization of rectangular waveguide dual-mode filters

Case-based reinforcement learning for dynamic inventory control in a multi-agent supply-chain system

Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management

Improving Sample Efficiency in Multi-Agent Actor-Critic Methods

Robust Multi-Agent Reinforcement Learning via Minimax Deep Deterministic Policy Gradient

Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management

Multi-agent deep reinforcement learning for task offloading in group distributed manufacturing systems

Multi-Agent Deep Reinforcement Learning for Liquidation Strategy Analysis

Deep reinforcement learning for demand fulfillment in online retail