Data-driven prediction of room temperature density for multicomponent silicate-based glasses

Kai Gong,Elsa Olivetti
DOI: https://doi.org/10.48550/arXiv.2209.02046
2022-09-06
Abstract:Density is one of the most commonly measured or estimated materials properties, especially for glasses and melts that are of significant interest to many fields, including metallurgy, geology, materials science and sustainable cements. Here, two types of machine learning (ML) models (i.e., random forest (RF) and artificial neural network (ANN)) have been developed to predict the room-temperature density of glasses in the compositional space of CaO-MgO-Al2O3-SiO2-TiO2-FeO-Fe2O3-Na2O-K2O-MnO (CMASTFNKM), based on ~2100 data points mined from ~140 literature studies. The results show that the RF and ANN models give accurate predictions of glass density with R2 values, RMSE, and MAPE of ~0.96-0.98, ~0.02-0.03 g/cm3 and ~0.59-0.79%, respectively, for the 15% testing set, which are more accurate compared with empirical density models based on ionic packing ratio (with R2 values, RMSE, and MAPE of ~0.28-0.91, ~0.05-0.15 g/cm3, and ~1.40-4.61%, respectively). Furthermore, glass density is shown to be a reliable reactivity indicator for a range of CaO-Al2O3-SiO2 (CAS) and volcanic glasses due to its strong correlation (R2 values above ~0.90) with the average metal-oxygen dissociation energy (a structural descriptor) of these glasses. Analysis of the predicted density-composition relationships from these models (for selected compositional subspaces) suggests that the ANN model exhibits a certain level of transferability (i.e., ability to extrapolate to compositional space not (or less) covered in the database) and captures known features including the mixed alkaline earth effects for (CaO-MgO)0.5-(Al2O3-SiO2)0.5 glasses.
Materials Science,Disordered Systems and Neural Networks
What problem does this paper attempt to address?
The problem that this paper attempts to solve is: **Develop machine - learning models to predict the density of multi - component silicate glasses at room temperature**. Specifically, the authors aim to predict the glass density in the CaO - MgO - Al₂O₃ - SiO₂ - TiO₂ - FeO - Fe₂O₃ - Na₂O - K₂O - MnO (CMASTFNKM) composition space through data - driven methods (i.e., random forest and artificial neural network). The solution to this problem is of great significance for many fields, including metallurgy, geology, materials science, and sustainable cement, etc. ### Problem Background 1. **Importance of Density**: - Density is one of the most frequently measured or estimated properties in materials, especially for glasses and melts. - Density is crucial in calculating other important glass properties (such as thermal conductivity, refractive index, elasticity, and optical properties). 2. **Limitations of Existing Methods**: - Traditional empirical models (such as models based on ion - packing ratios) can provide density predictions, but their errors are relatively large, usually within 10%. - Data - driven machine - learning models are excellent at capturing hidden trends between components and properties, but currently, there is a lack of ML research on silicate glasses related to cement and concrete applications. ### Research Objectives 1. **Develop Machine - Learning Models**: - Use two types of machine - learning models, random forest (RF) and artificial neural network (ANN), to predict the density of CMASTFNKM glasses based on approximately 2,100 room - temperature glass density data points extracted from about 140 articles. 2. **Evaluate Model Performance**: - Compare the prediction performance of RF and ANN models with that of the empirical model based on ion - packing ratios. - Evaluate the feasibility of density as a reactivity indicator, especially for synthetic CaO - Al₂O₃ - SiO₂ (CAS) glasses and natural volcanic glasses. 3. **Explore Composition - Density Relationships**: - Use ML models to explore the composition - density relationships of CAS, MgO - Al₂O₃ - SiO₂ (MAS), and CaO - MgO - Al₂O₃ - SiO₂ (CMAS) glasses, especially in the composition space not covered or less covered by the original database. ### Method Overview 1. **Data Collection and Pre - processing**: - Extract approximately 2,100 room - temperature glass density records and their corresponding chemical compositions from the literature. - Stratify and split the data to ensure that the training set and the test set contain the same proportion of density values. 2. **Machine - Learning Model Construction**: - Use random forest and artificial neural network models and optimize hyper - parameters to improve prediction performance. - Calculate and compare the performance metrics of different models, such as root - mean - square error (RMSE), mean absolute percentage error (MAPE), and coefficient of determination (R²). 3. **Molecular Dynamics Simulation**: - Conduct molecular dynamics simulations on six CAS glasses to generate detailed atomic structure representations. - Calculate the average metal - oxygen dissociation energy (AMODE) and compare it with density to evaluate the feasibility of density as a reactivity indicator. Through these methods, the paper aims to provide a more accurate and reliable tool for predicting the density of multi - component silicate glasses at room temperature, thereby providing support for research and applications in related fields.