Intelligent modelling of clay compressibility using hybrid meta-heuristic and machine learning algorithms
Pin Zhang,Zhen-Yu Yin,Yin-Fu Jin,Tommy H.T. Chan,Fu-Ping Gao
DOI: https://doi.org/10.1016/j.gsf.2020.02.014
IF: 8.9
2021-01-01
Geoscience Frontiers
Abstract:Novel machine learning based models are proposed for predicting compression index of clays.The performance of five commonly used machine learning algorithms in predicting Cc is comprehensively investigated.BPNN and RF models are recommended to predict the compression index of clays.Compression index <em>C</em><sub>c</sub> is an essential parameter in geotechnical design for which the effectiveness of correlation is still a challenge. This paper suggests a novel modelling approach using machine learning (ML) technique. The performance of five commonly used machine learning (ML) algorithms, i.e. back-propagation neural network (BPNN), extreme learning machine (ELM), support vector machine (SVM), random forest (RF) and evolutionary polynomial regression (EPR) in predicting <em>C</em><sub>c</sub> is comprehensively investigated. A database with a total number of 311 datasets including three input variables, i.e. initial void ratio <em>e</em><sub>0</sub>, liquid limit water content <em>w</em><sub>L</sub>, plasticity index <em>I</em><sub>p</sub>, and one output variable <em>C</em><sub>c</sub> is first established. Genetic algorithm (GA) is used to optimize the hyper-parameters in five ML algorithms, and the average prediction error for the 10-fold cross-validation (CV) sets is set as the fitness function in the GA for enhancing the robustness of ML models. The results indicate that ML models outperform empirical prediction formulations with lower prediction error. RF yields the lowest error followed by BPNN, ELM, EPR and SVM. If the ranges of input variables in the database are large enough, BPNN and RF models are recommended to predict <em>C</em><sub>c</sub>. Furthermore, if the distribution of input variables is continuous, RF model is the best one. Otherwise, EPR model is recommended if the ranges of input variables are small. The predicted correlations between input and output variables using five ML models show great agreement with the physical explanation.This paper suggests a novel modelling approach using machine learning (ML) technique. The performance of five commonly used machine learning (ML) algorithms, i.e. back-propagation neural network (BPNN), extreme learning machine (ELM), support vector machine (SVM), random forest (RF) and evolutionary polynomial regression (EPR) in predicting Cc is comprehensively investigated. The results indicate that ML models outperform empirical prediction formulations with lower prediction error. RF yields the lowest error followed by BPNN, ELM, EPR and SVM. If the ranges of input variables in the database are large enough, BPNN and RF models are recommended to predict Cc. Furthermore, if the distribution of input variables is continuous, RF model is the best one.<span class="display"><span><ol class="links-for-figure"><li><a class="anchor download-link u-font-sans" href="https://ars.els-cdn.com/content/image/1-s2.0-S1674987120300566-fx1_lrg.jpg"><span class="anchor-text">Download : <span class="download-link-title">Download high-res image (304KB)</span></span></a></li><li><a class="anchor download-link u-font-sans" href="https://ars.els-cdn.com/content/image/1-s2.0-S1674987120300566-fx1.jpg"><span class="anchor-text">Download : <span class="download-link-title">Download full-size image</span></span></a></li></ol></span></span>
geosciences, multidisciplinary