Abstract: Reinforcement learning (RL) is a promising way to achieve human-like autonomous driving (HAD) in complex and dynamic traffic, but it faces challenges such as low sample efficiency, partial observability, and sim2real transfer. To address these challenges, we establish a comprehensive solution for RL-driven HAD. First, an efficient training scheme called Deep Recurrent Q-learning from Demonstration (DRQfD) is proposed for lane-changing decision-making to address both the low sample efficiency of RL and the poor generalization capability of Imitation Learning (IL). Its LSTM structure can potentially learn to predict the future states of surrounding vehicles, which helps to address the partial observability problem in autonomous driving (AD). Second, to reduce the sim2real gap, a high-fidelity twin simulator is built on ROS-Gazebo for LiDAR sensing simulation, model training, and evaluation. Domain randomization is used to improve robustness and generalization, making the model easier to transfer to real-world scenarios. In addition, to handle the multi-objective optimization and imbalanced data issues in this scenario, a hierarchical decision-making framework is proposed that decomposes the complex decision-making problem into several subtasks, so that the driving policies converge more easily. To avoid excessive dependence of the decision-making module on the output of the perception module, as occurs in modular systems, we train each modularized skill in an end-to-end manner. Moreover, we compare our method with a vanilla RL method to show the improvement in learning efficiency. Comparisons between the RL-based model and an IL baseline in terms of safety, travel efficiency, and human-likeness are also given. To further validate the generalization ability of our model, we test it on a real traffic dataset. Finally, we deploy the RL model on physical cars to demonstrate the performance of sim2real transfer.
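
To make the idea of Q-learning from demonstration concrete, the following is a minimal sketch of how a demonstration-guided loss is commonly formed: a one-step temporal-difference term plus a large-margin supervised term that pushes the Q-value of the demonstrated action above the others. The abstract does not specify the DRQfD loss, so the function names, the margin of 0.8, the weight `lambda_e`, and the three-action lane-changing set (left, keep, right) are all assumptions made for illustration; the recurrent (LSTM) part is omitted and Q-values are passed in as plain lists.

```python
from typing import Sequence


def margin_loss(q_values: Sequence[float], expert_action: int,
                margin: float = 0.8) -> float:
    """Large-margin supervised loss for one demonstration state.

    Adds `margin` to every non-expert action before taking the max, so the
    loss is zero only when the expert action leads by at least `margin`.
    The margin value is an illustrative assumption.
    """
    augmented = [q + (0.0 if a == expert_action else margin)
                 for a, q in enumerate(q_values)]
    return max(augmented) - q_values[expert_action]


def td_loss(q_sa: float, reward: float, next_q_values: Sequence[float],
            gamma: float = 0.99, done: bool = False) -> float:
    """Squared one-step TD error for a single transition."""
    target = reward if done else reward + gamma * max(next_q_values)
    return (target - q_sa) ** 2


def combined_loss(q_values: Sequence[float], action: int, reward: float,
                  next_q_values: Sequence[float], is_demo: bool,
                  lambda_e: float = 1.0, gamma: float = 0.99) -> float:
    """TD loss, plus the margin loss when the transition is a demonstration.

    `lambda_e` (hypothetical) weights the supervised term.
    """
    loss = td_loss(q_values[action], reward, next_q_values, gamma)
    if is_demo:
        loss += lambda_e * margin_loss(q_values, action)
    return loss


# Hypothetical action indices: 0 = change left, 1 = keep lane, 2 = change right.
q = [1.0, 0.5, 0.2]
demo_loss = combined_loss(q, action=0, reward=1.0,
                          next_q_values=[0.0, 2.0, 0.0],
                          is_demo=True, gamma=0.5)
```

In this sketch the supervised term applies only to demonstration transitions, so self-generated experience is trained with the TD term alone.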
