Atom-centered machine-learning force field package
Lei Li,Ryan A. Ciufo,Jiyoung Lee,Chuan Zhou,Bo Lin,Jaeyoung Cho,Naman Katyal,Graeme Henkelman
DOI: https://doi.org/10.1016/j.cpc.2023.108883
IF: 4.717
2023-09-01
Computer Physics Communications
Abstract:In recent years, machine learning algorithms have been widely used for constructing force fields with an accuracy of ab initio methods and the efficiency of classical force fields. Here, we developed a python-based atom-centered machine-learning force field (PyAMFF) package to provide a simple and efficient platform for fitting and using machine learning force fields by implementing an atom-centered neural-network algorithm with Behler-Parrinello symmetry functions as structural fingerprints. The following three features are included in PyAMFF: (1) integrated Fortran modules for fast fingerprint calculations and Python modules for user-friendly integration through scripts and facile extension of future algorithms; (2) a pure Fortran backend to interface with the software, including the long-timescale dynamic simulation package EON, enabling both molecular dynamic simulations and adaptive kinetic Monte Carlo simulations with machine-learning force fields; and (3) integration with the Atomic Simulation Environment package for active learning and ML-based algorithm development. Here, we demonstrate an efficient parallelization of PyAMFF in terms of CPU and memory usage and show that the Fortran-based PyAMFF calculator exhibits a linear scaling relationship with the number of symmetry functions and the system size. Program summary Program title: python-based atom-centered machine-learning force field (PyAMFF) CPC Library link to program files: https://doi.org/10.17632/fsn6dkcvrv.1 Developer's repository link: https://gitlab.com/pyamff/pyamff Licensing provisions: Apache License, 2.0 Nature of problem: Determine an approximate (surrogate) model based upon atomic forces and energies from density functional theory (DFT). With a surrogate model that is less computationally expensive to evaluate than DFT, there can be a rapid exploration of the potential energy surface, accelerated optimization to minima and saddle points, and ultimately, accelerated design of active materials where the kinetics are key to the material function. Solution method: The atomic environments of training data are calculated in terms of Behler-Parrinello fingerprints. These fingerprints are passed to a neural network which is trained to reproduce the energy and force of the training data. A parallel implementation and Fortran backend allow for efficient training and calculation of the resulting surrogate model. Examples of long-time simulations of materials on the surrogate model surfaces are provided.
physics, mathematical,computer science, interdisciplinary applications