Deep Reinforcement Learning for Large-Scale Inventory Management

Xiaotian Liu,Christos Alexopoulos,Hao Hu,Shuyu Han,Yijie Peng,Yongzhi Qi
DOI: https://doi.org/10.2139/ssrn.4490327
2023-01-01
SSRN Electronic Journal
Abstract:The boom of the e-commerce industry in recent years prompts the focus of inventory management into large-scale problems with multiple products and multi-echelon supply chains. This work introduces a simulation-driven solution for large-scale inventory problems, where deep reinforcement learning (DRL) is used as the central technique and deep learning (DL) is exploited to assist the training of the associated neural network. We first investigate a single-echelon multi-product problem as a representative of relatively simple inventory models with ex-post optimal or sub-optimal policies. Using training samples generated by simulation, a DL model is first trained by imitating a target policy, after which a DRL procedure is applied to fine-tune the DL model for further improvement. The numerical results on real-life data from a leading e-commerce company show that our method outperforms conventional base-stock policies and an existing DL method with regard to average operational cost. Then, we formulate a multi-echelon multi-product problem with a practical two-level warehouse network and shared storage resources as a representative of hard inventory models without available heuristic solutions. In this case, a DRL model is trained based on feedback from simulation. The numerical results on real-life data show that our method is capable of constructing intelligent ordering policies that involve coordination among stages and outperforms three combined heuristics adapted to this problem in operational cost.
What problem does this paper attempt to address?