Abstract:Given the current limitations in intelligence and processing capabilities, machine learning systems are yet unable to fully tackle diverse scenarios, thereby restricting their potential to completely substitute for human roles in practical applications. Recognizing the robustness and adaptability demonstrated by human drivers in complex environments, autonomous driving training has incorporated driving intervention mechanisms. By integrating these interventions into Proximal Policy Optimization (PPO) algorithms, it becomes possible for drivers to intervene and rectify vehicles' irrational behaviors when necessary, during the training process, thereby significantly accelerating the enhancement of model performance. A human-centric experiential replay mechanism has been developed to increase the efficiency of utilizing driving intervention data. To evaluate the impact of driving intervention on the performance of intelligent agents, experiments were conducted across four distinct intervention frequencies within scenarios involving lane changes and navigation through congested roads. The results demonstrate that the bespoke intervention mechanism markedly improves the model's performance in the initial stages of training, enabling it to overcome local optima through timely driving interventions. Although an increase in intervention frequency typically results in improved model performance, an excessively high intervention rate can detrimentally affect the model's efficiency. To assess the practical applicability of the algorithm, a comprehensive testing scenario that includes lane changes, traffic signals, and congested road sections was devised. The performance of the trained model was evaluated under various traffic conditions. The outcomes reveal that the model can adapt to different traffic flows, successfully and safely navigate the testing segment, and maintain speeds close to the target. These findings highlight the model's robustness and its potential for real-world application, emphasizing the critical role of human intervention in enhancing the safety and reliability of autonomous driving systems.

An Improved Proximal Policy Optimization Algorithm for Autonomous Driving Decision-Making

Decision Planning for Autonomous Driving Based on Proximal Policy Optimization

Efficient and Balanced Exploration-driven Decision Making for Autonomous Racing Using Local Information

Proximal Policy Optimization with Future Rewards

Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning

Proximal Policy Optimization Algorithms

Proximal Policy Optimization Based on Self-directed Action Selection

Truly Proximal Policy Optimization

Proximal policy optimization via enhanced exploration efficiency

Self-Driving Via Improved DDPG Algorithm

An End-to-End Deep Reinforcement Learning Model Based on Proximal Policy Optimization Algorithm for Autonomous Driving of Off-Road Vehicle

Proximal Policy Optimization with Policy Feedback

Intersection decision making for autonomous vehicles based on improved PPO algorithm

Autonomous driving system using proximal policy optimization in deep reinforcement learning

Vehicle-Following Control Based on Deep Reinforcement Learning

Deep Deterministic Policy Gradient Algorithm Based on Convolutional Block Attention for Autonomous Driving.

Proximal policy optimization guidance algorithm for intercepting near-space maneuvering targets

Fast-PPO: Proximal Policy Optimization with Optimal Baseline Method

Research on reinforcement learning based on PPO algorithm for human-machine intervention in autonomous driving

Safe Driving Via Expert Guided Policy Optimization

Optimizing Autonomous Vehicle Navigation with DQN and PPO: A Reinforcement Learning Approach