Abstract:A safe and efficient decision-making system is crucial for autonomous vehicles. However, the complexity of driving environments limits the effectiveness of many rule-based and machine learning approaches. Reinforcement Learning, with its robust self-learning capabilities and environmental adaptability, offers a promising solution to these challenges. Nevertheless, safety and efficiency concerns during training hinder its widespread application. To address these concerns, we propose a novel RL framework, Simple to Complex Collaborative Decision (S2CD). First, we rapidly train the teacher model in a lightweight simulation environment. In the more complex and realistic environment, the teacher intervenes when the student agent exhibits suboptimal behavior by assessing actions' value to avert dangers. We also introduce an RL algorithm called Adaptive Clipping Proximal Policy Optimization (ACPPO), which combines samples from both teacher and student policies and employs dynamic clipping strategies based on sample importance. This approach improves sample efficiency while effectively alleviating data imbalance. Additionally, we employ the Kullback-Leibler divergence as a policy constraint, transforming it into an unconstrained problem with the Lagrangian method to accelerate the student's learning. Finally, a gradual weaning strategy ensures that the student learns to explore independently over time, overcoming the teacher's limitations and maximizing performance. Simulation experiments in highway lane-change scenarios show that the S2CD framework enhances learning efficiency, reduces training costs, and significantly improves safety compared to state-of-the-art algorithms. This framework also ensures effective knowledge transfer between teacher and student models, even with a suboptimal teacher, the student achieves superior performance, demonstrating the robustness and effectiveness of S2CD.

Runtime Safety Assurance for Learning-enabled Control of Autonomous Driving Vehicles

A Shared Control Approach for Autonomous Vehicles via Driver Behaviors Learning

Enhancing System-Level Safety in Mixed-Autonomy Platoon via Safe Reinforcement Learning

Safe Autonomous Driving with Latent Dynamics and State-Wise Constraints

The Black-Box Simplex Architecture for Runtime Assurance of Autonomous CPS

Enhancing High-Speed Cruising Performance of Autonomous Vehicles through Integrated Deep Reinforcement Learning Framework

Safe and efficient collision avoidance control for autonomous vehicles

Knowledge Transfer from Simple to Complex: A Safe and Efficient Reinforcement Learning Framework for Autonomous Driving Decision-Making

a ) mo a ) IL 0 E ° VERF ' Solving Schr 6 dinger ' s equation on the Intel iPSC by the Alternating Direction Method

Longitudinal control of automated vehicles: A novel approach by integrating deep reinforcement learning with intelligent driver model

Combining Deep Reinforcement Learning and Safety Based Control for Autonomous Driving

Explainable and Safe Reinforcement Learning for Autonomous Air Mobility

Safe-State Enhancement Method for Autonomous Driving Via Direct Hierarchical Reinforcement Learning.

Deep-Reinforcement-Learning-Based Collision Avoidance of Autonomous Driving System for Vulnerable Road User Safety

SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees

Towards Safe and Robust Autonomous Vehicle Platooning: A Self-Organizing Cooperative Control Framework

A Safe and Efficient Lane Change Decision-Making Strategy of Autonomous Driving Based on Deep Reinforcement Learning

A Finite-Time Safety Filter for Learning-Based Autonomous Driving

A Safe Hierarchical Planning Framework for Complex Driving Scenarios based on Reinforcement Learning

Safe by Design Autonomous Driving Systems