In operando active learning of interatomic interaction during large-scale simulations

Max Hodapp, Alexander Shapeev
DOI: https://doi.org/10.1088/2632-2153/aba373
2020-09-21
Abstract: A well-known drawback of state-of-the-art machine-learning interatomic potentials is their poor ability to extrapolate beyond the training domain. For small-scale problems with tens to hundreds of atoms this can be solved by using active learning which is able to select atomic configurations on which a potential attempts extrapolation and add them to the ab initio-computed training set. In this sense an active learning algorithm can be viewed as an on-the-fly interpolation of an ab initio model. For large-scale problems, possibly involving tens of thousands of atoms, this is not feasible because one cannot afford even a single density functional theory computation with such a large number of atoms. This work marks a new milestone toward fully automatic ab initio-accurate large-scale atomistic simulations. We develop an active learning algorithm that identifies local subregions of the simulation region where the potential extrapolates. Then the algorithm constructs periodic configurations out of these local, non-periodic subregions, sufficiently small to be computable with plane-wave density functional theory codes, in order to obtain accurate ab initio energies. We benchmark our algorithm on the problem of the screw dislocation motion in bcc tungsten and show that our algorithm reaches ab initio accuracy, down to typical magnitudes of numerical noise in DFT codes. We show that our algorithm reproduces material properties such as core structure, Peierls barrier, and Peierls stress. This unleashes new capabilities for computational materials science toward applications which have currently been out of scope if approached solely by ab initio methods.
Computational Physics, Materials Science
What problem does this paper attempt to address?
This paper addresses the poor extrapolation ability of machine-learning interatomic potentials (MLIPs) outside their training data in large-scale atomistic simulations. For simulations that may involve tens of thousands of atoms, conventional active learning is not applicable, because even a single density functional theory (DFT) calculation on the full configuration is unaffordable. The paper therefore proposes an active learning algorithm that identifies local subregions of the simulation region where the MLIP extrapolates and converts them into small periodic configurations whose energies can be computed with plane-wave DFT codes. The main contributions are:

1. **An active learning algorithm for large-scale problems**: The algorithm identifies the local subregions in which the potential extrapolates and builds periodic configurations from these non-periodic subregions, small enough to make DFT calculations feasible.
2. **Ab initio accuracy for material properties**: On the benchmark of screw dislocation motion in bcc tungsten, the algorithm reaches ab initio accuracy, down to the typical numerical noise of DFT codes, and reproduces the dislocation core structure, the Peierls barrier, and the Peierls stress.
3. **A wider application range for computational materials science**: The method brings large-scale atomistic problems such as dislocation motion within reach, which have so far been out of scope when approached solely by ab initio methods.

The paper thus offers computational materials science a tool that retains ab initio accuracy while avoiding DFT calculations on the full system, which supports simulations of complex material systems at larger scales.
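The two steps described above (flagging atoms where the potential extrapolates, then cutting a small periodic configuration around them) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The per-atom `grades` array, the `threshold`, the cubic cut-out of half-width `radius`, and the `padding` added to the box are all assumptions made for illustration, and the function names are hypothetical. How the paper scores extrapolation and how it completes a non-periodic subregion into a periodic one is not specified in this summary.

```python
import numpy as np

def select_extrapolating_atoms(grades, threshold):
    """Return indices of atoms whose extrapolation score exceeds the threshold.

    `grades` is a hypothetical per-atom score supplied by the caller;
    this summary does not say how the paper computes it.
    """
    return np.flatnonzero(np.asarray(grades) > threshold)

def extract_periodic_cell(positions, center_index, radius, padding):
    """Cut a cubic neighborhood around one atom and place it in a cubic box.

    Naive assumption: the cut-out is simply re-centered in a box of edge
    2 * radius + padding and treated as periodic. A realistic construction
    would also have to handle the artificial interfaces this creates.
    """
    positions = np.asarray(positions, dtype=float)
    offsets = positions - positions[center_index]
    # Keep atoms inside the cube of half-width `radius` around the center atom.
    inside = np.all(np.abs(offsets) <= radius, axis=1)
    box_length = 2.0 * radius + padding
    # Shift so that the selected atoms sit in the middle of the new box.
    local_positions = offsets[inside] + box_length / 2.0
    cell = np.eye(3) * box_length
    return local_positions, cell

if __name__ == "__main__":
    # Toy simple-cubic lattice with one artificially flagged atom.
    lattice = np.array(
        [[i, j, k] for i in range(5) for j in range(5) for k in range(5)],
        dtype=float,
    )
    scores = np.zeros(len(lattice))
    scores[62] = 5.0  # atom at (2, 2, 2)
    for index in select_extrapolating_atoms(scores, threshold=2.0):
        local, cell = extract_periodic_cell(lattice, index, radius=1.0, padding=1.0)
        print(index, len(local), cell[0, 0])
```

In a real workflow the small configuration returned here would be passed to a plane-wave DFT code and the result added to the training set. That part is omitted because it depends on the specific codes used.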