Multi-objective Sensor Management Method Based on Twin Delayed Deep Deterministic Policy Gradient Algorithm

Hui Chen, Xindi Zhang, Tao Li, Wenxu Zhang, Lei Xi, Jianrong Liu, Yangming Fan
DOI: https://doi.org/10.23919/CCC63176.2024.10661904
2024-07-28
Abstract: To address sensor management in multi-target tracking optimization, this paper proposes a multi-target sensor management strategy based on deep reinforcement learning within the random finite set (RFS) framework. First, a basic method for RFS-based multi-target sensor management is presented, built on partially observable Markov decision process (POMDP) theory. Second, a deep reinforcement learning framework for sensor management is proposed, aimed at optimizing multi-objective state estimation. Finally, information gain is used as the reward function to solve for the optimal sensor management strategy. Simulations evaluated with the OSPA distance indicate that the proposed method estimates multi-target motion states more accurately than the other algorithms compared.
Computer Science, Engineering
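
The abstract evaluates tracking quality with the OSPA (optimal sub-pattern assignment) distance but does not give its parameters. As a rough illustration only, the sketch below computes the standard OSPA metric between a set of true target positions and a set of estimates. The function name `ospa`, the cutoff `c`, the order `p`, and the brute-force assignment search are assumptions chosen for clarity, not details taken from the paper; the brute-force search is only practical for small target counts.

```python
import itertools
import math


def ospa(X, Y, c=100.0, p=1):
    """OSPA distance between two finite sets of points (sequences of tuples).

    c is the cutoff (penalty per missed or false target), p is the order.
    Both values here are illustrative defaults, not the paper's settings.
    """
    m, n = len(X), len(Y)
    if m == 0 and n == 0:
        return 0.0
    # Ensure X is the smaller set so every point in X gets a partner in Y.
    if m > n:
        X, Y, m, n = Y, X, n, m
    # Brute-force search over all assignments of X's points to distinct
    # points of Y; each pairwise distance is capped at the cutoff c.
    best = min(
        sum(min(c, math.dist(x, Y[j])) ** p for x, j in zip(X, perm))
        for perm in itertools.permutations(range(n), m)
    )
    # Unassigned points in the larger set each cost c**p (cardinality error).
    return ((best + (c ** p) * (n - m)) / n) ** (1.0 / p)


if __name__ == "__main__":
    truth = [(0.0, 0.0), (10.0, 10.0)]
    estimate = [(0.0, 1.0)]
    print(ospa(truth, estimate, c=10.0, p=1))
```

A lower value means the estimated set is closer to the truth in both localization and cardinality. For larger target counts, a polynomial-time assignment solver would replace the permutation search.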