Scalable Training of Neural Network Potentials for Complex Interfaces Through Data Augmentation

In Won Yeu, Annika Stuke, Jon López-Zorrilla, James M. Stevenson, David R. Reichman, Richard A. Friesner, Alexander Urban, Nongnuch Artrith
2024-12-08
Abstract: Artificial neural network (ANN) potentials enable highly accurate atomistic simulations of complex materials at unprecedented scales. Despite their promise, training ANN potentials to represent intricate potential energy surfaces (PES) with transferability to diverse chemical environments remains computationally intensive, especially when atomic force data are incorporated to improve PES gradients. Here, we present an efficient ANN potential training methodology that uses Gaussian process regression (GPR) to incorporate atomic forces into ANN training, leading to accurate PES models with fewer additional first-principles calculations and a reduced computational effort for training. Our GPR-ANN approach generates synthetic energy data from force information in the reference dataset, thus augmenting the training datasets and bypassing direct force training. Benchmark tests on hybrid density-functional theory data for ethylene carbonate (EC) molecules and Li metal-EC interfaces, relevant for lithium metal battery applications, demonstrate that GPR-ANN potentials achieve accuracies comparable to fully force-trained ANNs with a significantly reduced computational overhead. Detailed comparisons show that the method improves both data efficiency and scalability for complex interfaces and heterogeneous environments. This work establishes the GPR-ANN method as a powerful and scalable framework for constructing high-fidelity machine learning interatomic potentials, offering the computational and memory efficiency critical for the large-scale simulations needed for the simulation of materials interfaces.
Disordered Systems and Neural Networks, Materials Science, Computational Physics
What problem does this paper attempt to address?
This paper addresses the computational cost and data requirements of constructing neural network potentials (NNPs) for complex interfaces. Specifically:

1. **Improve training efficiency**: Representing complex potential energy surfaces (PESs) with conventional NNP training requires a large number of first-principles calculations. Including atomic force data to improve the PES gradients makes training even more computationally intensive and time-consuming.
2. **Reduce data requirements**: Accurately capturing the PES across different chemical environments usually requires many reference data points. The requirement is especially large for surfaces and interfaces, where atomic environments change drastically.
3. **Enhance model transferability**: NNPs trained with existing methods may perform poorly in new environments, which limits their ability to generalize. A method that adapts better to different structural domains is therefore needed.

To address these problems, the authors propose a new training method, **GPR-ANN** (Gaussian process regression combined with an artificial neural network). The method improves NNP training in the following ways:

- **Indirect force training**: Gaussian process regression (GPR) generates synthetic energy data from the force information in the reference dataset, which augments the training set without direct force training.
- **Data augmentation**: A GPR model is fitted to the energies and forces of local subsystems and then used to generate additional synthetic data, effectively expanding the training dataset.
- **Efficient data utilization**: GPR-ANN potentials reach accuracies comparable to fully force-trained ANNs at reduced computational overhead, including for complex interfaces and heterogeneous environments.

Through these improvements, the GPR-ANN method improves training efficiency and reduces the number of additional first-principles reference calculations required, providing a more efficient route to large-scale simulations of materials interfaces.
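
The indirect force training idea can be illustrated on a toy problem. The following is a minimal sketch under stated assumptions, not the authors' implementation: it uses a one-dimensional model surface E(x) = sin(x), a radial basis function kernel with a fixed length scale, and displacements of ±0.1, none of which come from the paper. The function names (`rbf`, `joint_kernel`, `fit`, `predict_energy`) are hypothetical. A GPR is fitted to energies and forces at a few reference points and then queried for energies at displaced points, yielding an enlarged energy-only dataset of the kind an ANN could be trained on.

```python
import numpy as np

LENGTH = 1.0    # RBF length scale (assumed value, not from the paper)
JITTER = 1e-8   # small diagonal term for numerical stability


def rbf(xa, xb):
    """Return the RBF kernel k(xa, xb) and the pairwise differences xa - xb."""
    diff = xa[:, None] - xb[None, :]
    return np.exp(-0.5 * diff**2 / LENGTH**2), diff


def joint_kernel(x):
    """Covariance matrix of [E(x), dE/dx(x)] at the training points."""
    k, d = rbf(x, x)
    k_ed = k * d / LENGTH**2                         # cov(E(xi), E'(xj))
    k_dd = k * (1.0 / LENGTH**2 - d**2 / LENGTH**4)  # cov(E'(xi), E'(xj))
    return np.block([[k, k_ed], [k_ed.T, k_dd]])


def fit(x, energy, force):
    """Solve for GPR weights from energies and forces (F = -dE/dx)."""
    y = np.concatenate([energy, -force])
    big_k = joint_kernel(x) + JITTER * np.eye(2 * len(x))
    return np.linalg.solve(big_k, y)


def predict_energy(x_new, x, weights):
    """Posterior mean energy at new positions."""
    k, d = rbf(x_new, x)
    return np.hstack([k, k * d / LENGTH**2]) @ weights


# Toy 1D "potential energy surface": E(x) = sin(x), F(x) = -cos(x).
x_ref = np.array([0.0, 1.5, 3.0, 4.5, 6.0])
e_ref = np.sin(x_ref)
f_ref = -np.cos(x_ref)

weights = fit(x_ref, e_ref, f_ref)

# Synthetic structures: each reference point displaced by +/- 0.1.
x_syn = np.concatenate([x_ref - 0.1, x_ref + 0.1])
e_syn = predict_energy(x_syn, x_ref, weights)

# Augmented energy-only dataset that an ANN could then be trained on.
x_aug = np.concatenate([x_ref, x_syn])
e_aug = np.concatenate([e_ref, e_syn])
```

In this sketch the force information enters only through the GPR fit. The downstream model sees energies alone, which is the sense in which direct force training is bypassed. A real application would work with atomic structures and local subsystems rather than a single scalar coordinate.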