Preventing Traffic Accidents Through Machine Learning Predictive Models

Tarikwa Tesfa Bedane,Beakal Gizachew Assefa,Sudhir Kumar Mohapatra
DOI: https://doi.org/10.1109/ict4da53266.2021.9672249
2021-11-22
Abstract:Road Traffic Accidents (RTA) are a serious issue of societies resulting in huge losses at the economic and social levels and responsible for millions of deaths and injuries every year in the world. For instance, in Ethiopia, the number of deaths due to traffic accidents is increasing from one year to another. Addis Ababa is one of the popular and known cities that encounter a high number of RTAs due to the increasing number of vehicles and population. The main objective of this paper is to apply machine learning algorithms to predict the accident severity and identify the major causes of accidents in crowded cities (application of Addis Ababa city). The required data are collected from Addis Ababa city police departments and 12316 records of the accident are used for data analysis. We applied seven machine learning classification algorithms (Logistic Regression, Naive Bayes, Decision Tree, Support Vector Machine, K Nearest Neighbor, Random Forest, and AdaBoost) for predicting accident severity and compared the performance to choose the best model. We applied random undersampling and SMOTE oversampling techniques to handle the class imbalance nature of the dependent features and Principal Component Analysis (PCA) for dimension reduction. The experimental result shows that Random Forest achieved a 93.76% F1 score with SMOTE over-sampled data set and about 18% feature size reduction. Moreover, light condition, driving experience, age band of the driver, type of road lane, and types of junctions are identified as major determinant factors of the accident. According to this study, these are major factors to RTA and need to be considered in the design of infrastructure, regulations and policies to reduce accidents.
What problem does this paper attempt to address?