End-to-End Autonomous Driving Decision-Making Solution Based on Pri-TD3

Jiaxu Meng,Yindong Wang,Tao Xu,Qiang Wang,Yongyi Yang
DOI: https://doi.org/10.1109/RICAI60863.2023.10489418
2023-12-01
Abstract:In order to address the issue of information loss in traditional hierarchical autonomous driving systems and enhance the decision-making capabilities of autonomous vehicles in complex environments, a deep reinforcement learning-based end-to-end autonomous driving solution is proposed. The Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm is capable of achieving superior policies for continuous action control compared to algorithms such as Deep Deterministic Policy Gradient (DDPG) and Deep Q-Network (DQN). However, it suffers from the problem of low sample efficiency. To address this issue, this paper introduces an improved Twin Delayed Deep Deterministic Policy Gradient (Pri-TD3) algorithm based on the prioritized experience replay technique. Additionally, the algorithm is validated through simulations in a scenario involving multiple vehicles making unprotected left turns using the Carla simulator. Results demonstrate that the proposed Pri-TD3 algorithm outperforms DDPG and TD3 in terms of convergence speed, reward acquisition, and policy stability, etc.
Computer Science,Engineering
What problem does this paper attempt to address?