Abstract:Selective maintenance, which aims to choose a subset of feasible maintenance actions to be performed for a repairable system with limited maintenance resources, has been extensively studied over the past decade. Most of the reported works on selective maintenance have been dedicated to maximizing the success of a single future mission. Cases of multiple consecutive missions, which are oftentimes encountered in engineering practices, have been rarely investigated to date. In this paper, a new selective maintenance optimization for multi-state systems that can execute multiple consecutive missions over a finite horizon is developed. The selective maintenance strategy can be dynamically optimized to maximize the expected number of future mission successes whenever the states and effective ages of the components become known at the end of the last mission. The dynamic optimization problem, which accounts for imperfect maintenance, is formulated as a discrete-time finite-horizon Markov decision process with a mixed integer-discrete-continuous state space. Based on the framework of actor-critic algorithms, a customized deep reinforcement learning method is put forth to overcome the "curse of dimensionality" and mitigate the uncountable state space. In our proposed method, a postprocess is developed for the actor to search the optimal maintenance actions in a large-scale discrete action space, whereas the techniques of the experience replay and the target network are utilized to facilitate the agent training. The performance of the proposed method is examined by an illustrative example and an engineering example of a coal transportation system. (C) 2019 Elsevier B.V. All rights reserved.

Deep Reinforce Learning for Joint Optimization of Condition-Based Maintenance and Spare Ordering.

An Optimum Condition-Based Replacement And Spare Provisioning Policy Based On Markov Chains

Maintenance Optimization of Multi-Unit Balanced Systems Using Deep Reinforcement Learning

A Deep Reinforcement Learning Approach for Maintenance Planning of Multi-Component Systems with Complex Structure

Maintenance optimisation of multi-unit balanced systems using deep reinforcement learning

Dynamic Selective Maintenance Optimization for Multi-State Systems over a Finite Horizon: A Deep Reinforcement Learning Approach.

Deep Reinforcement Learning for Dynamic Opportunistic Maintenance of Multi-Component Systems With Load Sharing

Counterfactual-attention multi-agent reinforcement learning for joint condition-based maintenance and production scheduling

Condition-based Maintenance for Multi-state Systems with Prognostic and Deep Reinforcement Learning

Joint optimization of maintenance and quality inspection for manufacturing networks based on deep reinforcement learning

Integrated Scheduling and Flexible Maintenance in Deteriorating Multi-State Single Machine System Using a Reinforcement Learning Approach.

Joint Optimization of Multi-Stage Component Reassignment and Preventive Maintenance for Balanced Systems Considering Imperfect Maintenance

Joint Maintenance and Spare Part Ordering from Multiple Suppliers for Multicomponent Systems Using a Deep Reinforcement Learning Algorithm

A Condition-Based Maintenance Policy for Multi-Component Systems Subject to Stochastic and Economic Dependencies

Imperfect Maintenance Optimization of Multi-State Rolling Stocks Based on Deep Reinforcement Learning

Deep reinforcement learning for cost-optimal condition-based maintenance policy of offshore wind turbine components

Generalized Condition-Based Maintenance Optimization for Multi-Component Systems Considering Stochastic Dependency and Imperfect Maintenance.

Deep Reinforcement Learning for Maintenance Optimization of Multi-Component Production Systems Considering Quality and Production Plan

Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

Condition-based Maintenance for Multi-component Systems:Modeling, Structural Properties, and Algorithms

Joint Condition-Based Maintenance and Spare Provisioning Policy for a K-out-of-N System with Failures During Inspection Intervals