Abstract: Machine learning systems remain limited in intelligence and processing capability, so they cannot yet handle the full diversity of real-world scenarios or fully replace humans in practical applications. Because human drivers are robust and adaptable in complex environments, driving intervention mechanisms have been incorporated into autonomous driving training. Integrating these interventions into the Proximal Policy Optimization (PPO) algorithm lets a driver step in during training to correct irrational vehicle behavior when necessary, which significantly accelerates the improvement of model performance. A human-centric experience replay mechanism was developed to use driving intervention data more efficiently. To evaluate how driving intervention affects agent performance, experiments were conducted at four intervention frequencies in lane-change and congested-road scenarios. The results show that the intervention mechanism markedly improves model performance in the early stages of training, and that timely interventions help the model escape local optima. Although a higher intervention frequency generally improves model performance, an excessively high intervention rate can reduce the model's efficiency. To assess the practical applicability of the algorithm, a comprehensive test scenario combining lane changes, traffic signals, and congested road sections was designed, and the trained model was evaluated under various traffic conditions. The model adapts to different traffic flows, navigates the test segment successfully and safely, and maintains speeds close to the target. These findings highlight the model's robustness and its potential for real-world application, and they emphasize the critical role of human intervention in improving the safety and reliability of autonomous driving systems.
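The abstract names a human-centric experience replay mechanism but does not describe its internals. The sketch below is one plausible reading, not the authors' implementation: it assumes intervention steps are stored separately from agent steps and that each batch reserves a fixed share for human data. The names `Transition`, `HumanCentricReplay`, and `human_fraction` are hypothetical. How replayed data is reconciled with PPO's on-policy updates is also not stated in the abstract, so the sketch covers storage and sampling only.

```python
import random
from collections import deque
from dataclasses import dataclass
from typing import Any, Deque, List


@dataclass
class Transition:
    state: Any
    action: Any
    reward: float
    next_state: Any
    done: bool
    human: bool  # True when a human driver overrode the policy on this step


class HumanCentricReplay:
    """Two-pool buffer (assumed design): human steps are kept apart from agent steps."""

    def __init__(self, capacity: int, human_fraction: float = 0.5, seed: int = 0):
        if not 0.0 <= human_fraction <= 1.0:
            raise ValueError("human_fraction must lie in [0, 1]")
        # Each pool evicts its own oldest entries, so scarce intervention
        # data is never pushed out by the much larger stream of agent steps.
        self.human: Deque[Transition] = deque(maxlen=capacity)
        self.agent: Deque[Transition] = deque(maxlen=capacity)
        self.human_fraction = human_fraction
        self.rng = random.Random(seed)

    def add(self, transition: Transition) -> None:
        pool = self.human if transition.human else self.agent
        pool.append(transition)

    def __len__(self) -> int:
        return len(self.human) + len(self.agent)

    def sample(self, batch_size: int) -> List[Transition]:
        # Reserve the requested share for human data, capped by availability.
        n_human = min(len(self.human), round(batch_size * self.human_fraction))
        n_agent = min(len(self.agent), batch_size - n_human)
        batch = self.rng.sample(list(self.human), n_human)
        batch += self.rng.sample(list(self.agent), n_agent)
        self.rng.shuffle(batch)
        return batch
```

Under this assumed design, `human_fraction` controls how heavily intervention data is reused. It is a different quantity from the intervention frequency varied in the experiments, which governs how often the driver takes over during training.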

Karting racing: A revisit to PPO and SAC algorithm

The Surprising Effectiveness of PPO in Cooperative, Multi-Agent Games

Continuous-action Reinforcement Learning for Playing Racing Games: Comparing SPG to PPO

Authentic Boundary Proximal Policy Optimization

Learning Effective Communication for Cooperative Pursuit with Multi-Agent Reinforcement Learning

Proximal Policy Optimization Smoothed Algorithm

Multiple-UAV Reinforcement Learning Algorithm Based on Improved PPO in Ray Framework

Policy Optimization with Model-based Explorations

Proximal Policy Optimization Algorithms

A Graph-Based PPO Approach in Multi-UAV Navigation for Communication Coverage

CIM-PPO: Proximal Policy Optimization with Liu-Correntropy Induced Metric

Optimizing Autonomous Vehicle Navigation with DQN and PPO: A Reinforcement Learning Approach

Beyond the Boundaries of Proximal Policy Optimization

Coordinated Proximal Policy Optimization

Research on reinforcement learning based on PPO algorithm for human-machine intervention in autonomous driving

Truly Proximal Policy Optimization

Simulation of Robotic Arm Grasping Control Based on Proximal Policy Optimization Algorithm

Proximal policy optimization via enhanced exploration efficiency

Proximal Policy Optimization with Mixed Distributed Training

PPO-CMA: Proximal Policy Optimization with Covariance Matrix Adaptation