Liquefaction prediction with robust machine learning algorithms (SVM, RF, and XGBoost) supported by genetic algorithm-based feature selection and parameter optimization from the perspective of data processing

Selçuk Demir,Emrehan Kutluğ Şahin
DOI: https://doi.org/10.1007/s12665-022-10578-4
IF: 2.8
2022-09-23
Environmental Earth Sciences
Abstract:Liquefaction prediction is an important issue in the seismic design of engineering structures, and research on this topic has been continuing in current literature using different methods, including experimental, numerical, or soft computing. In this paper, three robust machine learning (ML) algorithms are applied to predict soil liquefaction using a set of 411 shear wave velocity case records. The Genetic Algorithm (GA) based feature selection (FS) and parameter optimization of Random Forest (RF), Support Vector Machines (SVM), and eXtreme Gradient Boosting (XGBoost) algorithms are utilized to improve the accuracy of the liquefaction prediction models. Simple Random Sampling (SRS) and Stratified Random Sampling (StrRS) are used for data sampling, and also SMOTE algorithm are applied to prepare the balanced training sets. The results of robust ML algorithms are assessed based on well-known five performance matrices, namely Accuracy ( Acc ), Kappa, Precision, Recall, and F-Measure. Evaluation of the results is made separately for each ML algorithm considering sampling data generated from SRS, StrRS, and SMOTE. As a result, the XGBoost model is more accurate ( Acc = 96%) than RF ( Acc = 93%) and SVM ( Acc = 91%) in the case of the SMOTE algorithm. This study reveals the superiority of the XGBoost algorithm in the liquefaction prediction and shows how the accuracy measures tend to improve when the predictive models are trained using balanced samples with StrRS and SMOTE sampling strategies.
environmental sciences,water resources,geosciences, multidisciplinary
What problem does this paper attempt to address?