A dynamic mission abort policy for the swarm executing missions and its solution method by tailored deep reinforcement learning
Lujie Liu, Jun Yang
DOI: https://doi.org/10.1016/j.ress.2023.109149
IF: 7.247
2023-01-01
Reliability Engineering & System Safety
Abstract:
• A dynamic mission abort policy is proposed for a swarm with changing states.
• The proposed policy maximizes the reward by specifying mission abort actions.
• The sequential decision problem is formulated as a Markov decision process.
• A tailored deep reinforcement learning approach is proposed to solve the problem.
• A case study on a UAV swarm is given to show the superiority of the proposed method.
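To make the Markov decision process framing concrete, the following is a minimal toy sketch, not the paper's model or solution method. It assumes a state (t, n) of elapsed time steps and surviving units, independent unit failures at each step, and a mission that succeeds only if at least `k_required` units survive to the horizon. Because this toy state space is tiny, it is solved exactly by backward induction rather than by the deep reinforcement learning approach the paper proposes. The function name and all parameters (`p_fail`, `mission_reward`, `unit_value`, and so on) are hypothetical and chosen only for illustration.

```python
from math import comb


def solve_abort_policy(n_units, horizon, k_required, p_fail,
                       mission_reward, unit_value):
    """Backward induction for a toy abort-or-continue MDP (illustrative only).

    State: (t, n) = (time step, number of surviving units).
    Actions: "abort" recovers all survivors now; "continue" risks one more step.
    """
    # Terminal values at t = horizon: survivors are recovered, and the
    # mission reward is earned only if enough units are still alive.
    value = [unit_value * n + (mission_reward if n >= k_required else 0.0)
             for n in range(n_units + 1)]
    policy = {}
    for t in range(horizon - 1, -1, -1):
        new_value = []
        for n in range(n_units + 1):
            abort_value = unit_value * n
            # Assumed dynamics: each unit independently survives one step
            # with probability 1 - p_fail (binomial transition).
            continue_value = sum(
                comb(n, m) * (1 - p_fail) ** m * p_fail ** (n - m) * value[m]
                for m in range(n + 1)
            )
            if continue_value > abort_value:
                policy[(t, n)] = "continue"
                new_value.append(continue_value)
            else:
                policy[(t, n)] = "abort"
                new_value.append(abort_value)
        value = new_value
    return policy, value


policy, value = solve_abort_policy(
    n_units=5, horizon=3, k_required=3, p_fail=0.1,
    mission_reward=10.0, unit_value=1.0,
)
print(policy[(0, 5)], policy[(0, 2)])
```

In this sketch the abort decision depends on both the time step and the number of survivors, which is the sense in which the policy is dynamic. A realistic swarm model would have a far larger state space, which is where an approximate method such as deep reinforcement learning becomes relevant.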
What problem does this paper attempt to address?