Deriving Equation from Data Via Knowledge Discovery and Machine Learning: A Study of Young’s Modulus of Ti-Nb Alloys

Huiran Zhang,Xi Liu,Guangjie Zhang,Yuquan Zhu,Shengzhou Li,Quan Qian,Dongbo Dai,Renchao Che,Tao Xu
DOI: https://doi.org/10.1016/j.commatsci.2023.112349
IF: 3.572
2023-01-01
Computational Materials Science
Abstract:Acquiring knowledge and assisting materials design from computing and experimental data is very interesting and important at the intersection of materials and data science. In this work, we construct a whole framework to find an easy-to-interpret equation by taking a study of Young's modulus of Ti-Nb alloys as an example. Here, we used Young's modulus-targeted regression model (decision tree) to find a potential rule for classifying the dataset. The explicit equation was constructed through machine learning (ML) and validated based on physical laws. The transferability of the equation stood out after comparing it with five ML models including support vector machine (SVR), linear regression (LR), k-nearest neighbor (KNN), decision tree (DT), and random forest (RF). The results prove that our method indeed helps us to discover the contained knowledge hidden in the data and to uncover the relationship between features and property.
What problem does this paper attempt to address?