Research on Adaptive Cruise Control Based on Soft Actor-Critic Algorithm

Zhao Kegang,Shi Cuiduo,Liang Zhihao,Li Ziqi,Wang Yulong
DOI: https://doi.org/10.19620/j.cnki.1000-3703.20220500
2023-01-01
Abstract:For the problems of adaptive cruise control technology, including insufficient environmental adaptability of control algorithm for Deep Reinforcement Learning(DRL), poor model mitigation and generalization ability, this paper proposed the Soft Actor-Critic(SAC) control algorithm based on the principle of maximum entropy and stochastic off-line policy. SAC network was built to fit action value function and action policy function, and auto-adjusting temperature coefficient was used to improve the environmental exploration ability of intelligent agent. For the problem of sparse reward, the reward function was designed by using the idea of reward shaping. In addition, a new experience replay mechanism was proposed to improve the utilization rate of samples. The proposed control algorithm was simulated and tested in different scenes, and compared with Deep Deterministic Policy Gradient(DDPG). The results show that the algorithm has better model generalization ability and migration effect on real vehicles.
What problem does this paper attempt to address?