Abstract:Deep deterministic policy gradient (DDPG)-based car-following strategy can break through the constraints of the differential equation model due to the ability of exploration on complex environments. However, the car-following performance of DDPG is usually degraded by unreasonable reward function design, insufficient training, and low sampling efficiency. In order to solve this kind of problem, a hybrid car-following strategy based on DDPG and cooperative adaptive cruise control (CACC) is proposed. First, the car-following process is modeled as the Markov decision process to calculate CACC and DDPG simultaneously at each frame. Given a current state, two actions are obtained from CACC and DDPG, respectively. Then, an optimal action, corresponding to the one offering a larger reward, is chosen as the output of the hybrid strategy. Meanwhile, a rule is designed to ensure that the change rate of acceleration is smaller than the desired value. Therefore, the proposed strategy not only guarantees the basic performance of car-following through CACC but also makes full use of the advantages of exploration on complex environments via DDPG. Finally, simulation results show that the car-following performance of the proposed strategy is improved compared with that of DDPG and CACC. Note to Practitioners—This article presents a new car-following strategy, which avoids the impact of deep deterministic policy gradient (DDPG) performance degradation on the system. In the proposed strategy, DDPG is replaced with cooperative adaptive cruise control (CACC) when the performance of DDPG is worse than that of CACC. Meanwhile, a switching rule is designed to guarantee that the change rate of acceleration is smaller than the threshold. Simulation results show that the performance of hybrid car-following strategy has been improved compared with that of only using CACC or DDPG. Moreover, the proposed strategy has the advantages of low computational burden, high real-time-performance, and good scalability.

Personalized Car-Following Control Based on a Hybrid of Reinforcement Learning and Supervised Learning

Human-Like Autonomous Car-Following Model with Deep Reinforcement Learning

A Study on Learning and Simulating Personalized Car-Following Driving Style

Deep Reinforcement Learning Car-Following Control Based on Multivehicle Motion Prediction

Bilateral Deep Reinforcement Learning Approach for Better-than-human Car Following Model

Research on a Personalized Decision Control Algorithm for Autonomous Vehicles Based on the Reinforcement Learning from Human Feedback Strategy

Learning Hierarchical Behavior and Motion Planning for Autonomous Driving.

EnsembleFollower: A Hybrid Car-Following Framework Based On Reinforcement Learning and Hierarchical Planning

A Combined Reinforcement Learning and Model Predictive Control for Car-Following Maneuver of Autonomous Vehicles

Hybrid car following control for CAVs: Integrating linear feedback and deep reinforcement learning to stabilize mixed traffic

Personalized Adaptive Cruise Control with Deep Reinforcement Learning

Safe, efficient, and comfortable velocity control based on reinforcement learning for autonomous driving

From Naturalistic Traffic Data to Learning-Based Driving Policy: A Sim-to-Real Study

Learning to Drive Like Human Beings: A Method Based on Deep Reinforcement Learning

Hybrid Car-Following Strategy Based on Deep Deterministic Policy Gradient and Cooperative Adaptive Cruise Control

Reinforcement Learning-Based High-Speed Path Following Control for Autonomous Vehicles

An intelligent human-machine interaction-based longitudinal control strategy for autonomous vehicles

Optimization of Safety and Comfort in Car-following Scene Based on Reinforcement Learning

Coordinated Decision Control of Lane-Change and Car-Following for Intelligent Vehicle Based on Time Series Prediction and Deep Reinforcement Learning

Joint Optimization of Sensing, Decision-making and Motion-controlling for Autonomous Vehicles: A Deep Reinforcement Learning Approach