A Dynamic Maintenance Policy for Degradation System by State Monitoring and Deep Reinforcement Learning

Deming Xu, Yan Wang, Xiang Liu, Zhicheng Ji
DOI: https://doi.org/10.1016/j.compeleceng.2024.109603
IF: 4.152
2024-01-01
Computers & Electrical Engineering
Abstract: This paper studies how to model and solve the dynamic maintenance problem for continuously degrading systems. The main objective is to minimize the maintenance cost and determine the optimal maintenance action for a degrading system in real time. First, a deterioration model based on the Wiener process with Brownian motion is used to simulate the deteriorating system and estimate its deterioration level. Second, a data-processing method is applied to discretize the continuous degradation into discrete degradation states, so that a discrete-time finite Markov decision process (MDP) model can be established to describe the maintenance decision process. Third, a dynamic maintenance decision framework is constructed from periodic state monitoring and a customized deep reinforcement learning method. The framework determines the maintenance action from the specific degradation state observed at each inspection stage. Fourth, a maintenance decision agent is developed by employing a customized proximal policy optimization method with entropy regularization to solve the dynamic maintenance problem. In this agent, deep actor-critic networks, advantage function estimation, the truncated surrogate approach, and entropy regularization are integrated to facilitate policy optimization. The customized maintenance decision approach is verified by simulation analysis.
To illustrate the effectiveness of the approach, comparison studies are conducted. The results demonstrate that: (i) running the proposed maintenance decision algorithm decreases the cumulative maintenance cost, which approaches an approximate equilibrium state; (ii) the proposed customized deep reinforcement learning approach to the maintenance decision problem is easy to implement and markedly improves efficiency and stability; (iii) the proposed maintenance framework can serve as a useful decision tool for engineering managers.
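
The first two steps of the abstract (a Wiener-process deterioration model observed at periodic inspections, then discretization into a finite set of degradation states) can be illustrated with a minimal sketch. This is not the authors' implementation: the drift and diffusion values, the failure threshold, the equal-width binning rule, and the function names `simulate_wiener_degradation` and `discretize` are all assumptions chosen for illustration.

```python
import bisect
import math
import random


def simulate_wiener_degradation(n_steps, dt, drift, sigma, seed=0):
    """Simulate degradation levels at inspection times 0, dt, ..., n_steps*dt.

    Assumed model (not taken from the paper): a Wiener process with linear
    drift, X(t + dt) = X(t) + drift*dt + sigma*sqrt(dt)*Z, with Z ~ N(0, 1).
    """
    rng = random.Random(seed)
    levels = [0.0]  # the system starts as good as new
    for _ in range(n_steps):
        increment = drift * dt + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        levels.append(levels[-1] + increment)
    return levels


def discretize(level, failure_threshold, n_states):
    """Map a continuous level to a state index in 0..n_states-1.

    Assumed rule: n_states-1 equal-width working states below the failure
    threshold, plus one final state for levels at or above the threshold.
    Negative levels (possible under Brownian noise) fall into state 0.
    """
    width = failure_threshold / (n_states - 1)
    edges = [width * k for k in range(1, n_states)]
    return bisect.bisect_right(edges, level)


# Hypothetical parameters: 50 inspections, unit inspection interval.
path = simulate_wiener_degradation(n_steps=50, dt=1.0, drift=0.2, sigma=0.5)
states = [discretize(x, failure_threshold=10.0, n_states=5) for x in path]
```

The resulting `states` sequence is the kind of discrete observation that a finite MDP agent would receive at each inspection stage. The maintenance actions, cost structure, and the policy optimization itself are outside this sketch.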