Abstract:Reinforcement learning (RL) is a promising way to achieve human-like autonomous driving (HAD) in complex and dynamic traffic, but faces challenges such as low sample efficiency, partial observability, and sim2real transfer. In light of this, a comprehensive solution for RL-driven HAD is established. First, an efficient training scheme called Deep Recurrent Q-learning from demonstration algorithm (DRQfD) is proposed for lane-changing decision-making to address the low sample efficiency in RL and the poor generalization capability in Imitation Learning (IL). The inherent LSTM structure potentially learns to predict future states of surrounding vehicles, helping to address the partially observable problem in autonomous driving (AD). Second, to reduce the sim2real gap, a twin high-fidelity simulator is built based on ROS-Gazebo for simulating LiDAR sensing, model training, and evaluations. Domain randomization is used to improve the robustness and generalization ability, making it easier for the model to be transferred to real-world scenarios. In addition, for the multi-objective optimization and imbalanced data issues in this scenario, a hierarchical decision-making framework is proposed to decompose the complex decision-making problem into several subtasks, making the driving policies easier to converge. To avoid the excessive dependence of the decision-making module on the output of perception module in modular systems, we train each modularized skill in an end-to-end manner. Moreover, we compare our method with a vanilla RL method to show improvement in learning efficiency. Comparisons between RL-based model and IL baseline in terms of safety, travel efficiency, and human-likeness are also given. To further validate the generalization ability of our model, we test the model on real traffic dataset. Finally, we implement the RL model on physical cars to demonstrate the performance of sim2real transfer.

Applications of Distributional Soft Actor-Critic in Real-world Autonomous Driving

Encoding Distributional Soft Actor-Critic for Autonomous Driving in Multi-lane Scenarios

Deep Reinforcement Learning on Autonomous Driving Policy With Auxiliary Critic Network

Automated Driving Maneuvers under Interactive Environment based on Deep Reinforcement Learning

Autonomous Highway Driving using Deep Reinforcement Learning

Improved Deep Reinforcement Learning with Expert Demonstrations for Urban Autonomous Driving

Distributional Soft Actor-Critic for Decision-Making in On-Ramp Merge Scenarios

From Naturalistic Traffic Data to Learning-Based Driving Policy: A Sim-to-Real Study

A Comparative Analysis of Deep Reinforcement Learning-Enabled Freeway Decision-Making for Automated Vehicles

Improving Generalization of Reinforcement Learning with Minimax Distributional Soft Actor-Critic

Human-in-the-Loop Deep Reinforcement Learning with Application to Autonomous Driving

Confidence-Aware Reinforcement Learning for Self-Driving Cars

Exploring applications of deep reinforcement learning for real-world autonomous driving systems

Autonomous Driving with Deep Reinforcement Learning in CARLA Simulation

Efficient Deep Reinforcement Learning with Imitative Expert Priors for Autonomous Driving

Think2Drive: Efficient Reinforcement Learning by Thinking in Latent World Model for Quasi-Realistic Autonomous Driving (in CARLA-v2)

Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-v2)

Towards Robust Decision-Making for Autonomous Driving on Highway

Multi-objective Optimization Based Deep Reinforcement Learning for Autonomous Driving Policy

Self-Driving Car Racing: Application of Deep Reinforcement Learning

Reinforcement Learning Based Oscillation Dampening: Scaling up Single-Agent RL algorithms to a 100 AV highway field operational test