Northeast Materials Database (NEMAD): Enabling Discovery of High Transition Temperature Magnetic Compounds

Suman Itani,Yibo Zhang,Jiadong Zang
2024-09-24
Abstract:The discovery of novel magnetic materials with greater operating temperature ranges and optimized performance is essential for advanced applications. Current data-driven approaches are challenging and limited due to the lack of accurate, comprehensive, and feature-rich databases. This study aims to address this challenge by introducing a new approach that uses Large Language Models (LLMs) to create a comprehensive, experiment-based, magnetic materials database named the Northeast Materials Database (NEMAD), which consists of 26,706 magnetic materials (<a class="link-external link-http" href="http://www.nemad.org" rel="external noopener nofollow">this http URL</a>). The database incorporates chemical composition, magnetic phase transition temperatures, structural details, and magnetic properties. Enabled by NEMAD, machine learning models were developed to classify materials and predict transition temperatures. Our classification model achieved an accuracy of 90% in categorizing materials as ferromagnetic (FM), antiferromagnetic (AFM), and non-magnetic (NM). The regression models predict Curie (Néel) temperature with a coefficient of determination (R2) of 0.86 (0.85) and a mean absolute error (MAE) of 62K (32K). These models identified 62 (19) FM (AFM) candidates with a predicted Curie (Néel) temperature above 500K (100K) from the Materials Project. This work shows the feasibility of combining LLMs for automated data extraction and machine learning models in accelerating the discovery of magnetic materials.
Materials Science,Machine Learning,Computational Physics
What problem does this paper attempt to address?
The problem this paper attempts to address is: how to accelerate the discovery of new high-performance magnetic materials, especially those with higher operating temperature ranges, by constructing a comprehensive, accurate, and feature-rich magnetic materials database. Current data-driven methods face challenges due to the lack of high-quality databases, which limits the effectiveness of machine learning models in classifying and predicting the performance of magnetic materials. Specifically, the paper introduces the Northeast Materials Database (NEMAD), which contains information on 26,706 magnetic materials, including their chemical composition, magnetic phase transition temperature, structural details, and magnetic properties. By using large language models (LLMs) to automatically extract data and combining them with machine learning models for classification and prediction, the paper demonstrates how these technologies can accelerate the discovery process of magnetic materials. The main objectives include: 1. **Constructing a comprehensive magnetic materials database**: NEMAD includes a large amount of experimentally validated magnetic materials data, covering chemical composition, structural details, and magnetic properties. 2. **Developing high-precision machine learning models**: Using the NEMAD database to train classification and regression models to accurately classify material types (ferromagnetic, antiferromagnetic, and non-magnetic) and predict Curie (Néel) temperatures. 3. **Screening potential high-performance magnetic materials**: Using model predictions to screen ferromagnetic materials with high Curie temperatures and antiferromagnetic materials with high Néel temperatures from the Materials Project database, providing directions for future experimental research. These efforts aim to overcome the limitations of existing methods and advance the science of magnetic materials, particularly in the field of high-temperature applications.