Abstract:Determining optimum inventory replenishment decisions are critical for retail businesses with uncertain demand. The problem becomes particularly challenging when multiple products with different lead times and cross-product constraints are considered. This paper addresses the aforementioned challenges in multi-product, multi-period inventory management using deep reinforcement learning (deep RL). The proposed approach improves upon existing methods for inventory control on three fronts: (1) concurrent inventory management of a large number (hundreds) of products under realistic constraints, (2) minimal retraining requirements on the RL agent under system changes through the definition of an individual product meta-model, (3) efficient handling of multi-period constraints that stem from different lead times of different products. We approach the inventory problem as a special class of dynamical system control, and explain why the generic problem cannot be satisfactorily solved using classical optimisation techniques. Subsequently, we formulate the problem in a general framework that can be used for parallelised decision-making using off-the-shelf RL algorithms. We also benchmark the formulation against the theoretical optimum achieved by linear programming under the assumptions that the demands are deterministic and known apriori. Experiments on scales between 100 and 220 products show that the proposed RL-based approaches perform better than the baseline heuristics, and quite close to the theoretical optimum. Furthermore, they are also able to transfer learning without retraining to inventory control problems involving different number of products.

Combining deep reinforcement learning and multi-stage stochastic programming to address the supply chain inventory management problem

Performance of deep reinforcement learning algorithms in two-echelon inventory control systems

Deep Reinforcement Learning Approach for Capacitated Supply Chain optimization under Demand Uncertainty

Deep Reinforcement Learning for Large-Scale Inventory Management

Can Deep Reinforcement Learning Improve Inventory Management? Performance on Dual Sourcing, Lost Sales and Multi-Echelon Problems

Multi-echelon inventory optimization using deep reinforcement learning

An application of deep reinforcement learning and vendor-managed inventory in perishable supply chain management

Deep Inventory Management

Deep Reinforcement Learning for inventory optimization with non-stationary uncertain demand

Simultaneous Decision Making for Stochastic Multi-echelon Inventory Optimization with Deep Neural Networks as Decision Makers

Solving Inventory Management Problems Through Deep Reinforcement Learning

Reinforcement Learning for Multi-Product Multi-Node Inventory Management in Supply Chains

Modeling and Optimization of the Multiobjective Stochastic Joint Replenishment and Delivery Problem under Supply Chain Environment.

A Stochastic Dynamic Programming Based Heuristic for the Inventory Decision-Making Encountered in a Two-Echelon Warehouse System

Mixed-Integer Nonlinear Programming Models and Algorithms for Large-Scale Supply Chain Design with Stochastic Inventory Management

Integrating storage allocation with manual order picking and replenishment operations in a distribution centre

Scalable multi-product inventory control with lead time constraints using reinforcement learning

Algorithmic Approaches to Inventory Management Optimization

Spatial-temporal deep learning method for solving data-driven multi-echelon stochastic lot sizing problems with intermediate demands

Two-stage stochastic programming for the inventory routing problem with stochastic demands in fuel delivery

Applying machine learning to the dynamic selection of replenishment policies in fast-changing supply chain environments