Decision-Making Models Based on Meta-Reinforcement Learning for Intelligent Vehicles at Urban Intersections

Xuemei Chen,Jiahe Liu,Zijia Wang,Xintong Han,Yufan Sun,Xuelong Zheng
DOI: https://doi.org/10.15918/j.jbit1004-0579.2022.056
2022-01-01
Journal of Beijing Institute of Technology
Abstract:Behavioral decision-making at urban intersections is one of the primary difficulties cur-rently impeding the development of intelligent vehicle technology. The problem is that existing decision-making algorithms cannot effectively deal with complex random scenarios at urban inter-sections. To deal with this, a deep deterministic policy gradient (DDPG) decision-making algo-rithm (T-DDPG) based on a time-series Markov decision process (T-MDP) was developed, where the state was extended to collect observations from several consecutive frames. Experiments found that T-DDPG performed better in terms of convergence and generalizability in complex intersec-tion scenarios than a traditional DDPG algorithm. Furthermore, model-agnostic meta-learning (MAML) was incorporated into the T-DDPG algorithm to improve the training method, leading to a decision algorithm (T-MAML-DDPG) based on a secondary gradient. Simulation experiments of intersection scenarios were carried out on the Gym-Carla platform to verify and compare the deci-sion models. The results showed that T-MAML-DDPG was able to easily deal with the random states of complex intersection scenarios, which could improve traffic safety and efficiency. The above decision-making models based on meta-reinforcement learning are significant for enhancing the decision-making ability of intelligent vehicles at urban intersections.
What problem does this paper attempt to address?