Abstract:In this article, we improve the proximal policy optimisation (PPO) algorithm in deep reinforcement learning and propose an MSA‐PPO algorithm, which adopts the self‐attention mechanism for input data processing, effectively identifies and focuses on the most important information in the interaction between vehicles to improve the overall performance of the system, in addition to applying the ineffective action masking mechanism to select the effective actions under specific conditions and narrow the decision space, greatly improve the learning efficiency, and thus improve the overall performance of the system. The deployment of autonomous vehicles (AVs) in complex urban environments faces numerous challenges, especially at intersections where they coexist with human‐driven vehicles (HVs), resulting in increased safety risks. In response, this study proposes an improved control strategy based on the Proximal Policy Optimization (PPO) algorithm, specifically designed for hybrid intersections, known as MSA‐PPO. First, the Self‐Attention Mechanism (SAM) is introduced into the algorithmic framework to quickly identify the surrounding vehicles with a greater impact on the ego vehicle from different perspectives, accelerating data processing and improving decision quality. Second, an invalid action masking mechanism is adopted to reduce the action space, ensuring actions are only selected from feasible sets, thereby enhancing decision efficiency. Finally, comparative and ablation experiments in hybrid intersection simulation environments of varying complexity are conducted to validate the algorithm's effectiveness. The results show that the improved algorithm converges faster, achieves higher decision accuracy, and demonstrates the highest speed levels during driving compared to other baseline algorithms.

VN-MADDPG: A Variable-Noise-Based Multi-Agent Reinforcement Learning Algorithm for Autonomous Vehicles at Unsignalized Intersections

AF-DQN: A Large-Scale Decision-Making Method at Unsignalized Intersections with Safe Action Filter and Efficient Exploratory Training Strategy

Deep Reinforcement Learning Enabled Decision-Making for Autonomous Driving at Intersections

Autonomous Vehicle Decision-Making Framework for Considering Malicious Behavior at Unsignalized Intersections

Proximal Policy Optimization Through a Deep Reinforcement Learning Framework for Multiple Autonomous Vehicles at a Non-Signalized Intersection

High-Speed Ramp Merging Behavior Decision for Autonomous Vehicles Based on Multi-Agent Reinforcement Learning

Autonomous Driving at Unsignalized Intersections: A Review of Decision-Making Challenges and Reinforcement Learning-Based Solutions

Autonomous Intersection Management with Heterogeneous Vehicles: A Multi-Agent Reinforcement Learning Approach

Combining multi-agent deep deterministic policy gradient and rerouting technique to improve traffic network performance under mixed traffic conditions

Uncertainty-Aware Decision-Making for Autonomous Driving at Uncontrolled Intersections

Decision-Making for Autonomous Vehicles with Interaction-Aware Behavioral Prediction and Social-Attention Neural Network

Prediction Failure Risk-Aware Decision-Making for Autonomous Vehicles on Signalized Intersections

Research on Autonomous Driving Decision-making Strategies based Deep Reinforcement Learning

Research on robust decision making for intelligent connected vehicle at highway on-ramp

Self-Learned Autonomous Driving at Unsignalized Intersections: A Hierarchical Reinforced Learning Approach for Feasible Decision-Making

Multi-Vehicles Decision-Making in Interactive Highway Exit: A Graph Reinforcement Learning Approach

Roadside Units Assisted Localized Automated Vehicle Maneuvering: An Offline Reinforcement Learning Approach

Decision-making Strategy on Highway for Autonomous Vehicles using Deep Reinforcement Learning

Multi-objective Longitudinal Decision-making for Autonomous Electric Vehicle: A Entropy-constrained Reinforcement Learning Approach.

Reward-Driven Automated Curriculum Learning for Interaction-Aware Self-Driving at Unsignalized Intersections

Intersection decision making for autonomous vehicles based on improved PPO algorithm