Abstract:The Active Traffic Management (ATM) system has been widely used in the United States and the European countries to improve the traffic safety of urban expressways. The accurate real-time crash risk prediction is fundamental to the system running well. Crash data are characterized by small probability, which poses a typical Imbalanced Data Classification problem. Most previous studies mainly improved the prediction methods only in data level or algorithm level, which may be inadequate to predict the crash risk accurately especially in a continuous real-time traffic data environment. The comprehensive imbalanced classification algorithm was examined in this research to build more accurate real-time traffic crash risk prediction model. At the output level, the Youden index method has been proved to be of the best ability to divide the prediction results and Probability Calibration Method was proposed to optimize the prediction results in further. At the data level, Under-sampling and Synthetic Minority Oversampling Technique(SMOTE) methods were compared to solve the imbalanced data classification problem by changing the data distribution. At the algorithm level, the cost-sensitive MLP algorithm and Adaboost algorithm were examined and finally the random sampling cost-sensitive MLP model(RCSMLP) and Rusboost model were constructed by synthesizing the optimization methods from three levels. The sensitivity of the RCSMLP model reached 78.10 % and the specificity of the model reached 81.44 %. The AUC and sensitivity of the Rusboost model reached 0.892 and 0.842 while the specificity of the model reached 0.816, which shows the better performance in dealing with the imbalanced traffic crash risk prediction problem compared to existed prediction models. The proposed method of improving prediction accuracy in this study is universal and can be applied to many other prediction models to predict real-time traffic crash risk.

On Evaluating Multi-class Network Traffic Classifiers Based on AUC

Multiclass ROC

A Comparison of Improving Multi-Class Imbalance for Internet Traffic Classification

A Cost-Sensitive Deep Learning-Based Approach for Network Traffic Classification

An efficient SVM-based method for multi-class network traffic classification

A New Network Traffic Classification Method Based on Classifier Integration

Learning with Multiclass AUC: Theory and Algorithms

A New Performance Evaluation Method For Imbalanced Data Learning

New Performance Evaluation Method for Classifier

A novel weighted combination technique for traffic classification

Multitask Learning for Network Traffic Classification

SmoteAdaNL: a learning method for network traffic classification

Examining imbalanced classification algorithms in predicting real-time traffic crash risk

Traffic Classification - Towards Accurate Real Time Network Applications

Machine Learned Real-Time Traffic Classifiers

BalancedBoost: A hybrid approach for real-time network traffic classification

A Unified Framework Against Topology and Class Imbalance

Network traffic grant classification based on 1DCNN-TCN-GRU hybrid model

Feature selection for optimizing traffic classification

Network Traffic Classification Techniques and Comparative Analysis Using Machine Learning Algorithms

Supervised Learning Real-time Traffic Classifiers