Data-driven Atmospheric Corrosion Prediction Model for Alloys Based on a Two-Stage Machine Learning Approach

Qian Chen,Han Wang,Haodi Ji,Xiaobing Ma,Yikun Cai
DOI: https://doi.org/10.1016/j.psep.2024.06.028
IF: 7.8
2024-01-01
Process Safety and Environmental Protection
Abstract:Alloy material corrosion is a comprehensive problem affected by multi-dimensional factors, including the time, environment, and material element. Its corrosion prediction has been facing the challenges of complex mechanism and small sample size. However, parametric methods are limited in terms of modeling accuracy and generalization performance for high-dimensional feature problems. Non-parametric machine learning algorithms, while having excellent modeling capabilities, need to be optimized in conjunction with the structure of the corrosion problem. Therefore, we develop a two-stage hybrid intelligent machine learning model for improving alloy corrosion prediction accuracy. The model takes the time, environment, and material information of the alloy corrosion process as inputs and the corrosion rate as output. The first stage of the model utilizes tree ensemble learning (TEL) models for initial prediction modeling and feature importance knowledge mining. In the second stage, the neural network (NN) is used to enhance the fusion of the multi-model prediction outputs, and it is optimized using feature importance modification (FIM). The proposed TEL-FIMNN model shows considerable performance improvement over the TEL and ANN models on real low-alloy steel corrosion dataset. Compared to other advanced multi-model fusion methods such as linear regression (LR), support vector regression (SVR), random forest (RF), and Bayesian model averaging (BMA), the proposed model also shows superior results. The TEL-FIMNN model provide a new structure from the perspectives of knowledge mining and fusion enhancement, which has good potential for multi-dimensional corrosion prediction.
What problem does this paper attempt to address?