From electrons to phase diagrams with classical and machine learning potentials: automated workflows for materials science with pyiron

Sarath Menon, Yury Lysogorskiy, Alexander L. M. Knoll, Niklas Leimeroth, Marvin Poul, Minaam Qamar, Jan Janssen, Matous Mrovec, Jochen Rohrer, Karsten Albe, Jörg Behler, Ralf Drautz, Jörg Neugebauer
2024-03-09
Abstract: We present a comprehensive and user-friendly framework built upon the pyiron integrated development environment (IDE), enabling researchers to perform the entire Machine Learning Potential (MLP) development cycle consisting of (i) creating systematic DFT databases, (ii) fitting the Density Functional Theory (DFT) data to empirical potentials or MLPs, and (iii) validating the potentials in a largely automatic approach. The power and performance of this framework are demonstrated for three conceptually very different classes of interatomic potentials: an empirical potential (embedded atom method - EAM), neural networks (high-dimensional neural network potentials - HDNNP) and expansions in basis sets (atomic cluster expansion - ACE). As an advanced example for validation and application, we show the computation of a binary composition-temperature phase diagram for Al-Li, a technologically important lightweight alloy system with applications in the aerospace industry.
Materials Science
What problem does this paper attempt to address?
This paper presents a comprehensive, user-friendly framework, built on the pyiron integrated development environment (IDE), that covers the entire development cycle of machine learning potentials (MLPs). The cycle consists of (1) creating a systematic density functional theory (DFT) database, (2) fitting the DFT data to empirical potentials or MLPs, and (3) validating the potentials in a largely automated way. The study demonstrates the power and performance of the framework on three conceptually different classes of interatomic potentials: the embedded atom method (EAM), high-dimensional neural network potentials (HDNNP), and the atomic cluster expansion (ACE).
The paper emphasizes the importance of automated, reliable workflows in MLP development, covering the generation of reference databases, the fitting of model parameters, and validation. These tasks currently lack standardized workflows and computational parameters, which leads to inconsistent data between research groups and limits data sharing and the construction of larger databases. In the validation stage, simply comparing model predictions with DFT data is often insufficient: validation also requires evaluating fundamental physical properties and running dynamic simulations at finite temperatures to examine how the model behaves outside its training domain.
The paper provides a standardized workflow covering every step from DFT data generation to MLP fitting and validation, using the calculation of the phase diagram of the Al-Li binary alloy as an advanced application example. In the study, pyiron acts as a workflow manager that connects different software tools and packages, automating the process from structure generation to validation. Using the Al-Li alloy system, the authors demonstrate the development and validation of different types of potentials, emphasizing the reproducibility, repeatability, and automation of these workflows.
The aim is to promote best-practice guidelines for modern MLP development and advanced thermodynamic applications while moving toward the FAIR principles for data and software.
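The three-stage cycle described above (reference database, fitting, validation) can be illustrated with a deliberately tiny, self-contained sketch. It does not use pyiron or any DFT code: as an assumption for illustration only, a Lennard-Jones pair energy stands in for the DFT reference, the "potential" is a two-parameter model fitted by least squares, and validation is an RMSE on held-out configurations. All function names (`reference_energy`, `build_database`, `fit_potential`, `validate`) are hypothetical and are not part of the paper's framework.

```python
import math


def reference_energy(r):
    """Stand-in for a DFT calculation (assumption): Lennard-Jones pair
    energy with epsilon = sigma = 1."""
    return 4.0 * (r ** -12 - r ** -6)


def build_database(distances):
    """Stage 1: collect (configuration, reference energy) pairs."""
    return [(r, reference_energy(r)) for r in distances]


def fit_potential(database):
    """Stage 2: least-squares fit of E(r) = a * r**-12 + b * r**-6,
    solved through the 2x2 normal equations."""
    s11 = s12 = s22 = t1 = t2 = 0.0
    for r, energy in database:
        x1, x2 = r ** -12, r ** -6
        s11 += x1 * x1
        s12 += x1 * x2
        s22 += x2 * x2
        t1 += x1 * energy
        t2 += x2 * energy
    det = s11 * s22 - s12 * s12
    a = (t1 * s22 - t2 * s12) / det
    b = (s11 * t2 - s12 * t1) / det
    return a, b


def validate(params, database):
    """Stage 3: root-mean-square error of the fitted model on a database."""
    a, b = params
    squared_errors = [
        (a * r ** -12 + b * r ** -6 - energy) ** 2 for r, energy in database
    ]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Training and held-out configurations are interleaved pair distances.
train_db = build_database([0.95 + 0.05 * i for i in range(20)])
test_db = build_database([0.975 + 0.05 * i for i in range(20)])

params = fit_potential(train_db)
rmse = validate(params, test_db)
print(f"fitted a = {params[0]:.6f}, b = {params[1]:.6f}, held-out RMSE = {rmse:.2e}")
```

Because the model form matches the reference exactly, the held-out error here is essentially zero. As the summary notes, a pointwise comparison of this kind is only a first check; a realistic validation would also evaluate physical properties and finite-temperature behavior.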