High-Speed Ramp Merging Behavior Decision for Autonomous Vehicles Based on Multi-Agent Reinforcement Learning
Xinfeng Zhang, Lin Wu, Huan Liu, Yajun Wang, Hao Li, Bin Xu
DOI: https://doi.org/10.1109/jiot.2023.3304890
IF: 10.6
2023-01-01
IEEE Internet of Things Journal
Abstract: To improve the decision success rate of multi-agent reinforcement learning for autonomous vehicles merging from high-speed ramps, an independent proximal policy optimization (IPPO) method is presented. A Markov decision process (MDP) model of autonomous vehicle behavioral decision-making is developed, and its state space, action space, and reward function are designed. The IPPO method builds on the PPO algorithm and uses independent learning and parameter-sharing strategies; a decision-making model for autonomous driving behavior is then built on it. A highway ramp scenario is set up for simulation experiments. The experimental results indicate that the IPPO algorithm significantly increases the decision success rate of autonomous vehicles in the ramp merging task. Compared with the MAACKTR and GPPO algorithms, the IPPO algorithm also achieves a higher average reward and completes the ramp merge more quickly.
computer science, information systems; telecommunications; engineering, electrical & electronic
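
The abstract describes IPPO as PPO combined with independent learning and parameter sharing. As a rough illustration only, the sketch below computes the standard PPO clipped surrogate over samples pooled from several vehicles that are assumed to share one set of policy parameters. The function names, the agent labels, and the sample values are hypothetical, and the abstract does not specify the authors' exact loss, state space, action space, or reward function, so this should not be read as their implementation.

```python
import math


def clipped_surrogate(new_logp, old_logp, advantage, clip_eps=0.2):
    """Standard PPO clipped surrogate for a single (state, action) sample."""
    # Probability ratio between the updated policy and the data-collecting policy.
    ratio = math.exp(new_logp - old_logp)
    # Restrict the ratio to [1 - eps, 1 + eps] to limit the policy update size.
    clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
    # Take the pessimistic (smaller) of the unclipped and clipped objectives.
    return min(ratio * advantage, clipped * advantage)


def shared_policy_loss(agent_batches, clip_eps=0.2):
    """Negative mean surrogate over samples pooled from all agents.

    agent_batches maps a (hypothetical) agent id to a list of
    (new_logp, old_logp, advantage) tuples. Parameter sharing is modeled by
    assuming every agent's new_logp comes from the same policy parameters;
    independent learning is modeled by each agent using only its own
    observations and advantages, with other vehicles treated as part of
    the environment.
    """
    terms = [
        clipped_surrogate(new_logp, old_logp, adv, clip_eps)
        for batch in agent_batches.values()
        for (new_logp, old_logp, adv) in batch
    ]
    return -sum(terms) / len(terms)


# Hypothetical samples: one ramp vehicle and one main-lane vehicle.
batches = {
    "ramp_vehicle": [(math.log(0.6), math.log(0.4), 1.0)],
    "main_vehicle": [(math.log(0.2), math.log(0.4), -1.0)],
}
loss = shared_policy_loss(batches)  # approximately -0.2
```

In this toy batch the ramp vehicle's ratio of 1.5 is clipped to 1.2 and the main-lane vehicle's ratio of 0.5 is clipped to 0.8, so neither sample can push the shared policy beyond the clipping range in a single update. Pooling the samples into one loss is one common way to realize parameter sharing; whether the paper pools gradients in exactly this manner is an assumption.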