A Deep Reinforcement Learning Algorithm with Expert Demonstrations and Supervised Loss and Its Application in Autonomous Driving

Kai Liu,Qin Wan,Yanjie Li
DOI: https://doi.org/10.23919/chicc.2018.8482790
2018-01-01
Abstract:In this paper, we propose a deep reinforcement learning(DRL) algorithm which combines Deep Deterministic Policy Gradient (DDPG) with expert demonstrations and supervised loss for decision making for autonomous driving. Training DRL agent with supervised learning is adopted to accelerate the exploration process and increase the stability. A supervised loss function is introduced in the algorithm to update the actor networks. In addition, reward construction is combined to make the training process more stable and efficient. The proposed algorithm is applied to a popular autonomous driving simulator called TORCS. The experimental results show that the training efficiency and stability are improved by utilizing our algorithm in autonomous driving.
What problem does this paper attempt to address?