Train Safety Control Based on Deep Deterministic Policy Gradient with Control Barrier Function
Guoteng Wang, Jidong Lv, Hongjie Liu, Ming Chai, Tao Tang, Wentao Zhang
DOI: https://doi.org/10.1109/icnsc62968.2024.10760112
2024-01-01
Abstract: The train Relative Distance Braking Mode (RDBM) has great potential to increase line capacity by enabling shorter following distances based on train-to-train communication information. However, traditional train control methods struggle to avoid potential collision risks in RDBM while also handling the nonlinearity and uncertainty of train dynamics. In this paper, a novel train safety control method combining Deep Deterministic Policy Gradient (DDPG) and Control Barrier Function (CBF) is proposed. Firstly, the safety constraint set for adjacent trains, which accounts for their braking capabilities, is analyzed, and the uncertainty of train dynamics is modeled using a Gaussian process. Secondly, DDPG is used to explore the optimal control strategy under uncertain train dynamics, while the CBF provides a compensating control input to ensure safety during the learning process. Then, a DDPG-CBF algorithm is presented to achieve efficient safety controller design and implementation. Finally, extensive numerical simulations demonstrate that, compared with the DDPG algorithm alone, the DDPG-CBF algorithm not only ensures the safe and stable operation of trains, but also learns train control strategies quickly and accurately under different operating environments, with an average efficiency increase of 38.94%.
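The idea of a CBF supplying a compensating input on top of a learned action can be illustrated with a minimal sketch. This is not the paper's formulation: the barrier function, the point-mass dynamics, and every name and parameter below (`barrier`, `cbf_filter`, `b_f`, `b_l`, `d_min`, `alpha`, `u_max`) are assumptions chosen for illustration, and the Gaussian process model of uncertainty is omitted. The assumed barrier compares the stopping points of the two trains, and the filter lowers the commanded acceleration only when the CBF condition would otherwise be violated.

```python
def barrier(gap, v_f, v_l, b_f, b_l, d_min):
    """Assumed barrier h: leader stopping point minus follower stopping point.

    gap   : distance from follower head to leader tail [m]
    v_f   : follower speed [m/s], v_l: leader speed [m/s]
    b_f   : follower braking deceleration [m/s^2], b_l: leader's
    d_min : required safety margin [m]
    The safe set is h >= 0.
    """
    return gap - d_min + v_l**2 / (2.0 * b_l) - v_f**2 / (2.0 * b_f)


def cbf_filter(u_rl, gap, v_f, v_l, a_l,
               b_f=1.0, b_l=1.0, d_min=10.0, alpha=0.5, u_max=1.0):
    """Return a filtered follower acceleration close to the learned action u_rl.

    Assumes point-mass dynamics (follower acceleration = u), so that
        dh/dt = v_l - v_f + v_l * a_l / b_l - v_f * u / b_f.
    The CBF condition dh/dt + alpha * h >= 0 is then an upper bound on u,
    and the one-dimensional minimum-deviation problem has a closed form.
    """
    h = barrier(gap, v_f, v_l, b_f, b_l, d_min)
    if v_f <= 1e-6:
        # A stationary follower: u does not enter dh/dt, so nothing to correct.
        u_safe = u_rl
    else:
        u_bound = (b_f / v_f) * (v_l - v_f + v_l * a_l / b_l + alpha * h)
        u_safe = min(u_rl, u_bound)
    # Actuator limits; if u_bound < -b_f the CBF condition cannot be met
    # and full braking is the best available action.
    return max(-b_f, min(u_safe, u_max))


# Hypothetical usage: the compensation term is the difference between the
# filtered action and the action proposed by the learned policy.
u_rl = 0.5
u_safe = cbf_filter(u_rl, gap=10.0, v_f=20.0, v_l=20.0, a_l=0.0)
u_cbf = u_safe - u_rl
```

In a training loop, `u_safe` rather than `u_rl` would be applied to the environment, so that exploration stays inside the assumed safe set whenever the actuator limits allow it.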