Prediction of Potential Accident Severity for Class-Imbalanced Dataset

yuan,Lan Zhao,Xuelian Zheng,Xiansheng Li,Jianfeng Xi,Lei Shi,Yanhui Fan
DOI: https://doi.org/10.2139/ssrn.4148188
2022-01-01
SSRN Electronic Journal
Abstract:The most common research on accident severity assesses post-accident severity, which is not conducive to accident prevention. In this paper, a potential accident severity prediction method is proposed, which predicts the severity of possible consequences when an accident occurs, by establishing a model. In establishing the model, two key problems are solved: how to characterize the severity of potential accidents and how to deal with the class-imbalanced dataset caused by the scarcity of severe accidents. For the first problem, we propose a systematic method to determine the representative features, the relative speed of the two vehicles (Relative-v) and the speed change of the main vehicle caused by the expected collision (Delta-V1), of the severity of potential accidents. For the second problem, we propose a data-level resampling method to deal with class-imbalanced dataset. The method includes the majority class’s undersampling: Remove Redundant Under Sampling (RRUS) and the minority class’s oversampling: Core Seed-based Synthetic Minority Oversampling TEchnique (CS-SMOTE), which transform the unbalanced dataset into a balanced dataset without affecting the distribution of the original dataset. Finally, based on the National Highway Traffic Safety Administration (NHTSA) database and the XGBoost algorithm, a potential accident severity prediction model is established. The results show that the prediction performance of the model is more than 97.7%. Also, the potential accident severity predicted by this method can be used to quantify the driving risk, which is helpful for assessing the safety of the driving environment in real time.
What problem does this paper attempt to address?