Deep Reinforcement Learning for Advanced Longitudinal Control and Collision Avoidance in High-Risk Driving Scenarios

Dianwei Chen, Yaobang Gong, Xianfeng Yang
2024-04-30
Abstract: Existing Advanced Driver Assistance Systems primarily focus on the vehicle directly ahead, often overlooking potential risks from following vehicles. This oversight can lead to ineffective handling of high-risk situations, such as high-speed, closely spaced, multi-vehicle scenarios where emergency braking by one vehicle might trigger a pile-up collision. To overcome these limitations, this study introduces a novel deep reinforcement learning-based algorithm for longitudinal control and collision avoidance. The proposed algorithm considers the behavior of both leading and following vehicles. Its implementation in simulated high-risk scenarios, which involve emergency braking in dense traffic where traditional systems typically fail, has demonstrated the algorithm's ability to prevent potential pile-up collisions, including those involving heavy-duty vehicles.
Robotics, Artificial Intelligence, Machine Learning, Systems and Control
What problem does this paper attempt to address?
The problem this paper addresses is that existing Advanced Driver-Assistance Systems (ADAS) focus mainly on the vehicle ahead and ignore the risks posed by vehicles behind. In high-speed, closely spaced, multi-vehicle scenarios, this oversight can lead to chain collisions when the vehicle ahead brakes hard, and the accident becomes more severe when heavy-duty vehicles are involved. To overcome these limitations, the paper proposes a longitudinal control and collision avoidance algorithm based on deep reinforcement learning. The algorithm accounts for the behavior of both the front and rear vehicles, and it prevents potential chain collisions in simulated high-risk scenarios, including those involving heavy-duty vehicles.

### Main contributions of the paper:

1. **Developed a vehicle braking and acceleration strategy**: By exploring edge-case collision scenarios, safety is enhanced, with special attention to potential safety risks from the vehicles behind.
2. **Developed a general-purpose algorithm**: Aimed at reducing severe chain-collision events.
3. **Simulation studies show**: The algorithm based on Deep Deterministic Policy Gradient (DDPG) significantly reduces the occurrence of collisions in situations where traditional methods cannot avoid them.

### Method overview:

- **Markov Decision Process (MDP)**: Defines the state set \( S \), the action set \( A \), the transition probability \( P_a \), and the reward function \( r \).
- **State \( S \)**: Includes the distance between the front vehicle and the ego vehicle \( d_{fm} \), the distance between the ego vehicle and the rear vehicle \( d_{mr} \), the speed of the front vehicle \( v_f \), the speed of the ego vehicle \( v_m \), the speed of the rear vehicle \( v_r \), the acceleration of the front vehicle \( a_f \), the acceleration of the ego vehicle \( a_m \), and the acceleration of the rear vehicle \( a_r \).
- **Action \( A \)**: The acceleration \( a_m' \) of the ego vehicle at the next time step of the current episode.
- **Reward function \( r \)**: If a collision occurs, the reward is -3000; if there is no collision, the reward is 15. The reward function is designed to discourage collisions.

### Experimental results:

- **DDPG training results**: After about 400 episodes of training, the DDPG model converges and avoids all collisions, obtaining a reward of 22500.
- **Baseline ADAS model results**: In high-risk scenarios, the baseline ADAS algorithm helps the ego vehicle avoid colliding with the front vehicle but cannot prevent a collision with the rear vehicle, resulting in chain collisions.
- **Results of the proposed RL algorithm**: In all three high-risk scenarios, the proposed RL algorithm avoids all collisions by dynamically selecting different accelerations, demonstrating its effectiveness in complex driving environments.

### Conclusion:

This paper proposes a longitudinal control and collision avoidance algorithm based on deep reinforcement learning that accounts for the behavior of both the front and rear vehicles in high-risk driving scenarios and avoids chain collisions, including those involving heavy-duty vehicles. The research points to a new direction for ADAS development, shifting from passive, reactive systems to active safety mechanisms.
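
To make the MDP from the method overview concrete, the following is a minimal Python sketch of the eight-dimensional state, the reward values quoted above (-3000 on collision, 15 otherwise), and a one-step transition. Only the state variables and the two reward values come from the summary. Everything else is an assumption made for illustration: the collision test (a non-positive gap), the constant-acceleration kinematics in `step`, the 0.1 s time step, and the names `State`, `collided`, `reward`, `step`, and `episode_return`. It is not the authors' simulator, and it omits the DDPG agent itself.

```python
from dataclasses import dataclass

COLLISION_REWARD = -3000.0  # value quoted in the summary
STEP_REWARD = 15.0          # value quoted in the summary


@dataclass(frozen=True)
class State:
    d_fm: float  # gap between front vehicle and ego vehicle (m)
    d_mr: float  # gap between ego vehicle and rear vehicle (m)
    v_f: float   # front vehicle speed (m/s)
    v_m: float   # ego vehicle speed (m/s)
    v_r: float   # rear vehicle speed (m/s)
    a_f: float   # front vehicle acceleration (m/s^2)
    a_m: float   # ego vehicle acceleration (m/s^2)
    a_r: float   # rear vehicle acceleration (m/s^2)


def collided(s: State) -> bool:
    # Assumption: a collision is any non-positive gap to either neighbour.
    return s.d_fm <= 0.0 or s.d_mr <= 0.0


def reward(s: State) -> float:
    # Reward values from the summary: -3000 on collision, 15 otherwise.
    return COLLISION_REWARD if collided(s) else STEP_REWARD


def step(s: State, a_m_next: float, dt: float = 0.1) -> State:
    # Hypothetical transition: constant acceleration over dt, with the
    # front and rear accelerations held fixed. Vehicles coming to a full
    # stop are not modelled (speeds are not clamped at zero).
    d_fm = s.d_fm + (s.v_f - s.v_m) * dt + 0.5 * (s.a_f - a_m_next) * dt**2
    d_mr = s.d_mr + (s.v_m - s.v_r) * dt + 0.5 * (a_m_next - s.a_r) * dt**2
    return State(
        d_fm=d_fm,
        d_mr=d_mr,
        v_f=s.v_f + s.a_f * dt,
        v_m=s.v_m + a_m_next * dt,
        v_r=s.v_r + s.a_r * dt,
        a_f=s.a_f,
        a_m=a_m_next,  # the action becomes the ego acceleration
        a_r=s.a_r,
    )


def episode_return(states: list[State]) -> float:
    # Undiscounted sum of per-step rewards over the visited states.
    return sum(reward(s) for s in states)


# Example: three vehicles at 25 m/s, ego vehicle brakes gently for one step.
s0 = State(d_fm=20.0, d_mr=15.0, v_f=25.0, v_m=25.0, v_r=25.0,
           a_f=0.0, a_m=0.0, a_r=0.0)
s1 = step(s0, a_m_next=-2.0)
print(reward(s1))  # 15.0, since both gaps are still positive
```

If the reward of 15 is granted once per time step (an assumption; the summary does not state the episode length), the reported return of 22500 would correspond to 22500 / 15 = 1500 collision-free steps, which `episode_return([s0] * 1500)` reproduces.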