An Automated Driving Strategy Generating Method Based on WGAIL–DDPG

Mingheng Zhang,Xing Wan,Longhui Gang,Xinfei Lv,Zengwen Wu,Zhaoyang Liu
DOI: https://doi.org/10.34768/amcs-2021-0031
IF: 2.157
2021-01-01
International Journal of Applied Mathematics and Computer Science
Abstract:Reliability, efficiency and generalization are basic evaluation criteria for a vehicle automated driving system. This paper proposes an automated driving decision-making method based on the Wasserstein generative adversarial imitation learning–deep deterministic policy gradient (WGAIL–DDPG(λ)). Here the exact reward function is designed based on the requirements of a vehicle’s driving performance, i.e., safety, dynamic and ride comfort performance. The model’s training efficiency is improved through the proposed imitation learning strategy, and a gain regulator is designed to smooth the transition from imitation to reinforcement phases. Test results show that the proposed decision-making model can generate actions quickly and accurately according to the surrounding environment. Meanwhile, the imitation learning strategy based on expert experience and the gain regulator can effectively improve the training efficiency for the reinforcement learning model. Additionally, an extended test also proves its good adaptability for different driving conditions.
What problem does this paper attempt to address?