Deep reinforcement learning for solving resource constrained project scheduling problems with resource disruptions

Hongxia Cai,Yunqi Bian,Lilan Liu
DOI: https://doi.org/10.1016/j.rcim.2023.102628
IF: 10.103
2024-02-01
Robotics and Computer-Integrated Manufacturing
Abstract:The resource-constrained project scheduling problem (RCPSP) is encountered in many fields, including manufacturing, supply chain, and construction. Nowadays, with the rapidly changing external environment and the emergence of new models such as smart manufacturing, it is more and more necessary to study RCPSP considering resource disruptions. A framework based on reinforcement learning (RL) and graph neural network (GNN) is proposed to solve RCPSP and further solve the RCPSP with resource disruptions (RCPSP-RD) on this basis. The scheduling process is formulated as sequential decision-making problems. Based on that, Markov decision process (MDP) models are developed for RL to learn scheduling policies. A GNN-based structure is proposed to extract features from problems and map them to action probability distributions by policy network. To optimize the scheduling policy, proximal policy optimization (PPO) is applied to train the model end-to-end. Computational results on benchmark instances show that the RL-GNN algorithm achieves competitive performance compared with some widely used methods.
robotics,computer science, interdisciplinary applications,engineering, manufacturing
What problem does this paper attempt to address?
### Problems Addressed by the Paper This paper primarily aims to address the Resource-Constrained Project Scheduling Problem (RCPSP) and its extended version under resource disruptions (RCPSP-RD). Specifically: 1. **Resource-Constrained Project Scheduling Problem (RCPSP)**: - **Problem Description**: The paper first focuses on the traditional RCPSP, which aims to minimize the project duration while satisfying precedence constraints and resource availability. - **Methodology**: The problem is tackled by combining Reinforcement Learning (RL) with Graph Neural Networks (GNN). The scheduling process is modeled as a sequential decision problem, constructing a Markov Decision Process (MDP). GNN is used to extract problem features, which are then mapped to action probability distributions through a policy network. 2. **Resource-Constrained Project Scheduling Problem with Resource Disruptions (RCPSP-RD)**: - **Problem Description**: The problem is further extended to consider resource disruptions, i.e., scheduling projects when resource capacities vary over time. - **Methodology**: The paper proposes a framework based on RL and GNN to handle uncertainties brought by resource disruptions. The MDP model is adjusted to accommodate resource changes and activity interruptions, and the Proximal Policy Optimization (PPO) algorithm is used for end-to-end training. ### Main Contributions 1. **Framework Design**: A framework combining RL and GNN is proposed to solve RCPSP and further address the RCPSP-RD problem. The MDP model and GNN structure are used to extract useful features from the problem and map them to action probability distributions. 2. **Experimental Results**: Experiments conducted on the Project Scheduling Problem Library (PSPLIB) show that the algorithm performs well not only on traditional RCPSP but also on RCPSP-RD, demonstrating competitive performance and good generalization capabilities. ### Summary This paper aims to solve the traditional Resource-Constrained Project Scheduling Problem and its extended version under resource disruptions by combining Reinforcement Learning and Graph Neural Networks, thereby improving the robustness and adaptability of scheduling strategies.