Large Language Models for Material Property Predictions: elastic constant tensor prediction and materials design

Siyu Liu,Tongqi Wen,Beilin Ye,Zhuoyuan Li,David J. Srolovitz
2024-11-19
Abstract:Efficient and accurate prediction of material properties is critical for advancing materials design and applications. The rapid-evolution of large language models (LLMs) presents a new opportunity for material property predictions, complementing experimental measurements and multi-scale computational methods. We focus on predicting the elastic constant tensor, as a case study, and develop domain-specific LLMs for predicting elastic constants and for materials discovery. The proposed ElaTBot LLM enables simultaneous prediction of elastic constant tensors, bulk modulus at finite temperatures, and the generation of new materials with targeted properties. Moreover, the capabilities of ElaTBot are further enhanced by integrating with general LLMs (GPT-4o) and Retrieval-Augmented Generation (RAG) for prediction. A specialized variant, ElaTBot-DFT, designed for 0 K elastic constant tensor prediction, reduces the prediction errors by 33.1% compared with domain-specific, material science LLMs (Darwin) trained on the same dataset. This natural language-based approach lowers the barriers to computational materials science and highlights the broader potential of LLMs for material property predictions and inverse design.
Materials Science,Computational Physics
What problem does this paper attempt to address?
This paper attempts to solve several key problems in material property prediction, especially the prediction of elastic constant tensors. Specifically: 1. **Improving the Efficiency and Accuracy of Material Property Prediction**: The paper points out that efficiently and accurately predicting material properties is crucial for promoting material design and application. Although traditional methods such as experimental measurement and multi - scale computational methods are effective, they are often limited by high cost, long time - consuming, and possible inconsistent or inaccurate results. Therefore, this research explores a new method of using large - language models (LLMs) to predict material properties to make up for these deficiencies. 2. **Using Large - Language Models for Material Property Prediction**: With the rapid development of large - language models, they have shown new opportunities in material property prediction. The paper specifically focuses on the prediction of elastic constant tensors and develops a domain - specific LLM - ElaTBot for predicting elastic constants and discovering new materials. ElaTBot can not only predict elastic constant tensors and bulk moduli at finite temperatures simultaneously but also generate new materials with target properties. 3. **Reducing Prediction Error**: Through the combination with general LLMs (such as GPT - 4o) and retrieval - augmented generation (RAG) techniques, the capabilities of ElaTBot are further enhanced. Especially in the 0 K elastic constant tensor prediction, the ElaTBot - DFT variant reduces the prediction error by 33.1% compared with other domain - specific materials science LLMs (such as Darwin) when trained on the same data set. 4. **Lowering the Entry Barrier**: This natural - language - based method lowers the entry barrier to computational materials science, enabling researchers without a strong computer science background to participate in materials science research. 5. **Expanding to Multi - task Knowledge - Fusion Training**: By integrating finite - temperature data and designing multi - task training text inputs, ElaTBot has multiple capabilities, including predicting elastic constant tensors at finite temperatures. In addition, through collaboration with general LLMs, ElaTBot can complete tasks such as material prediction, generation, and RAG - enhanced prediction in natural - language conversations without complex programming work. In summary, this paper aims to improve the efficiency and accuracy of material property prediction by developing and optimizing domain - specific large - language models, while lowering the entry barrier to the field of materials science and promoting the discovery and design of new materials.