Traffic Conflict-Based Crash Risk Estimation: Machine Learning Meets Extreme Value Theory

Lai Zheng,Wei
DOI: https://doi.org/10.2139/ssrn.4657166
2023-01-01
Abstract: Hybrid modeling that combines machine learning with statistical models is a promising direction in road safety prediction. Along this line, this study proposed a hybrid framework that combines machine learning and extreme value theory (EVT) for traffic conflict-based crash risk estimation. First, the threshold that defines conflict extremes was determined through peak over threshold plots. Second, a backpropagation neural network (BPNN) was employed to predict the frequencies of conflict extremes at different severity levels from dynamic traffic parameters. Third, three sampling methods, namely uniform distribution sampling, normal distribution sampling, and arithmetic sampling, were proposed to transform the predicted frequencies of conflict extremes into corresponding extreme values. Lastly, the extreme values were fitted with a generalized Pareto distribution for crash risk estimation. The proposed framework was applied to traffic conflicts measured by the modified time to collision (MTTC) and collected at the signal cycle level from one approach of a signalized intersection. The results show that 0.95 s is an appropriate threshold for defining conflict extremes. Both the BPNN prediction and the histogram fitting of the three sampling methods show high accuracy, and the crash risks estimated from the predicted conflict extremes are close to those estimated from the observed conflict extremes. Moreover, the estimated crash risks are relatively robust to prediction errors in the BPNN model. It is also found that higher prediction accuracy can be obtained when 4-cycle data are used, and that the best the hybrid modeling approach can achieve is to reproduce the crash risks estimated from the observed conflict extremes.
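The last two steps of the framework (turning predicted frequencies into extreme values, then fitting a generalized Pareto distribution) can be illustrated with a minimal sketch. Everything beyond the 0.95 s threshold is an assumption made for illustration, not the authors' implementation: the severity levels are taken to be bins of the exceedance 0.95 - MTTC, the three sampling functions are one plausible reading of the method names, the fit uses method-of-moments estimators in place of whatever estimator the paper uses, and crash risk is taken as the probability that an exceedance reaches 0.95 s (MTTC at or below zero). All function names, bin edges, and counts are hypothetical.

```python
import math
import random
import statistics

THRESHOLD = 0.95  # seconds; the threshold reported in the abstract


def exceedances(mttc_values, threshold=THRESHOLD):
    """Return threshold - MTTC for every conflict with MTTC below the threshold."""
    return [threshold - v for v in mttc_values if v < threshold]


def uniform_sampling(counts, edges, rng):
    """Assumed reading: draw counts[i] values uniformly between edges[i] and edges[i+1]."""
    return [rng.uniform(edges[i], edges[i + 1])
            for i, n in enumerate(counts) for _ in range(n)]


def normal_sampling(counts, edges, rng):
    """Assumed reading: draw from a normal centred on each bin, rejecting values outside it."""
    out = []
    for i, n in enumerate(counts):
        lo, hi = edges[i], edges[i + 1]
        mu, sd = (lo + hi) / 2, (hi - lo) / 6  # sd choice is arbitrary
        while n > 0:
            x = rng.gauss(mu, sd)
            if lo <= x <= hi:
                out.append(x)
                n -= 1
    return out


def arithmetic_sampling(counts, edges):
    """Assumed reading: place counts[i] evenly spaced values inside bin i."""
    out = []
    for i, n in enumerate(counts):
        lo, hi = edges[i], edges[i + 1]
        step = (hi - lo) / n if n else 0.0
        out.extend(lo + (k + 0.5) * step for k in range(n))
    return out


def fit_gpd_moments(y):
    """Method-of-moments GPD estimates (shape, scale); a stand-in for the paper's fit."""
    m = statistics.fmean(y)
    v = statistics.variance(y)
    ratio = m * m / v
    return 0.5 * (1.0 - ratio), 0.5 * m * (1.0 + ratio)


def gpd_survival(y, shape, scale):
    """P(exceedance >= y) under a GPD with the given shape and scale."""
    if abs(shape) < 1e-12:
        return math.exp(-y / scale)
    base = 1.0 + shape * y / scale
    if base <= 0.0:  # beyond the finite upper endpoint when shape < 0
        return 0.0
    return base ** (-1.0 / shape)


if __name__ == "__main__":
    rng = random.Random(42)
    edges = [0.0, 0.2, 0.4, 0.6, 0.8, 0.95]  # hypothetical severity bins
    counts = [40, 25, 12, 5, 2]              # hypothetical predicted frequencies
    sample = uniform_sampling(counts, edges, rng)
    shape, scale = fit_gpd_moments(sample)
    # Probability that an extreme conflict reaches MTTC <= 0 under the fitted GPD.
    print(gpd_survival(THRESHOLD, shape, scale))
```

Swapping `uniform_sampling` for `normal_sampling` or `arithmetic_sampling` in the usage block gives a rough way to see how sensitive the fitted risk is to the sampling scheme, which is the comparison the abstract describes.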