Abstract:Driven by the trend away from mass production towards more customization and individualism, manufacturing is under massive price pressure. Therefore, continuously improving production efficiency and reducing costs have always been the focus of manufacturing companies. With recent advances in Industry 4.0 and industrial Artificial Intelligence (AI), Automated Guided Vehicles (AGVs) have become a very promising technology to support this trend. Nowadays, they are widely used in job shop environments for material handling. However, this promising technology goes along with various new challenges, such as the high dynamics, complexity, and uncertainty of the job shop environment for AGV scheduling. To address these challenges, an adaptive Deep Reinforcement Learning (DRL) based AGV real-time scheduling approach is approached to optimize several efficiency parameters of the overall job shop system. Firstly, the DRL optimization problem is formulated as Markov Decision Process (MDP). The state and action representation, reward, and optimal policy function are described in detail. Then a novel DRL method is further developed to achieve the optimal mixed rule policy. For that, a novel RL algorithm, Proximal Policy Optimization (PPO), is applied to the Deep Neural Network (DNN) and implemented in TensorForce, a reinforcement learning package in Python based on TensorFlow. This algorithm is compared to conventional heuristic rules. Furthermore, SimPy, a relatively new discrete-event simulation package in Python, is used to implement the job shop environment. The job shop environment is based on a real-world scenario of a semiconductor factory, which is implemented in a simplified manner and applied to the DRL agent. The results are presented afterward, followed by a feasibility and effectiveness analysis of this approach.

Battery Management for Warehouse Robots Via Average-Reward Reinforcement Learning

Research on Cooperative Scheduling of AGV Transportation and Charging in Intelligent Warehouse System Based on Dynamic Task Chain

Real-Time Battery Thermal Management for Electric Vehicles Based on Deep Reinforcement Learning

Real-Time Charging Scheduling of Automated Guided Vehicles in Cyber-Physical Smart Factories Using Feature-Based Reinforcement Learning

Reward Mechanism Design for Deep Reinforcement Learning-Based Microgrid Energy Management

Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations

Dynamic Balancing-Charging Management for Shared Autonomous Electric Vehicle Systems: A Two-Stage Learning-Based Approach

Multi-Objective Optimization of AGV Real-Time Scheduling Based on Deep Reinforcement Learning

Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management

Battery Health-Aware and Deep Reinforcement Learning-Based Energy Management for Naturalistic Data-Driven Driving Scenarios

RTAW: An Attention Inspired Reinforcement Learning Method for Multi-Robot Task Allocation in Warehouse Environments

An Intelligent Energy Management Strategy for Hybrid Vehicle with Irrational Actions Using Twin Delayed Deep Deterministic Policy Gradient

Toward Energy-Efficient Routing of Multiple AGVs with Multi-Agent Reinforcement Learning

Enhancing Battery Storage Energy Arbitrage with Deep Reinforcement Learning and Time-Series Forecasting

Probabilistic Automata-Based Method for Enhancing Performance of Deep Reinforcement Learning Systems

Average-Reward Reinforcement Learning with Trust Region Methods

Can Deep Reinforcement Learning Improve Inventory Management? Performance on Dual Sourcing, Lost Sales and Multi-Echelon Problems

Study on an Average Reward Reinforcement Learning Algorithm

Intelligent Path Planning for AGV-UAV Transportation in 6G Smart Warehouse

Energy management strategy via maximum entropy reinforcement learning for an extended range logistics vehicle

Optimizing Robotic Mobile Fulfillment Systems for Order Picking Based on Deep Reinforcement Learning