Abstract:Reinforcement learning (RL) is a promising way to achieve human-like autonomous driving (HAD) in complex and dynamic traffic, but faces challenges such as low sample efficiency, partial observability, and sim2real transfer. In light of this, a comprehensive solution for RL-driven HAD is established. First, an efficient training scheme called Deep Recurrent Q-learning from demonstration algorithm (DRQfD) is proposed for lane-changing decision-making to address the low sample efficiency in RL and the poor generalization capability in Imitation Learning (IL). The inherent LSTM structure potentially learns to predict future states of surrounding vehicles, helping to address the partially observable problem in autonomous driving (AD). Second, to reduce the sim2real gap, a twin high-fidelity simulator is built based on ROS-Gazebo for simulating LiDAR sensing, model training, and evaluations. Domain randomization is used to improve the robustness and generalization ability, making it easier for the model to be transferred to real-world scenarios. In addition, for the multi-objective optimization and imbalanced data issues in this scenario, a hierarchical decision-making framework is proposed to decompose the complex decision-making problem into several subtasks, making the driving policies easier to converge. To avoid the excessive dependence of the decision-making module on the output of perception module in modular systems, we train each modularized skill in an end-to-end manner. Moreover, we compare our method with a vanilla RL method to show improvement in learning efficiency. Comparisons between RL-based model and IL baseline in terms of safety, travel efficiency, and human-likeness are also given. To further validate the generalization ability of our model, we test the model on real traffic dataset. Finally, we implement the RL model on physical cars to demonstrate the performance of sim2real transfer.

Confidence-Aware Reinforcement Learning for Self-Driving Cars

Continuous Improvement of Self-Driving Cars Using Dynamic Confidence-Aware Reinforcement Learning

Trustworthy safety improvement for autonomous driving using reinforcement learning

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Autonomous Highway Driving using Deep Reinforcement Learning

Applications of Distributional Soft Actor-Critic in Real-world Autonomous Driving

Identify, Estimate and Bound the Uncertainty of Reinforcement Learning for Autonomous Driving

A Safe and Efficient Lane Change Decision-Making Strategy of Autonomous Driving Based on Deep Reinforcement Learning

Multi-objective Optimization Based Deep Reinforcement Learning for Autonomous Driving Policy

Learning to Drive Using Sparse Imitation Reinforcement Learning

Self-Awareness Safety of Deep Reinforcement Learning in Road Traffic Junction Driving

From Naturalistic Traffic Data to Learning-Based Driving Policy: A Sim-to-Real Study

Towards Robust Decision-Making for Autonomous Driving on Highway

Towards Robust Decision-Making for Autonomous Highway Driving Based on Safe Reinforcement Learning

Safe Autonomous Driving with Latent Dynamics and State-Wise Constraints

Imitation Is Not Enough: Robustifying Imitation with Reinforcement Learning for Challenging Driving Scenarios

Automated Driving Maneuvers under Interactive Environment based on Deep Reinforcement Learning

Reinforcement Learning based Control of Imitative Policies for Near-Accident Driving

SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees

Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving

Safe Reinforcement Learning for Autonomous Vehicles through Parallel Constrained Policy Optimization