Enhancing High-Fidelity Neural Network Potentials through Low-Fidelity Sampling

Gang Seob Jung
DOI: https://doi.org/10.26434/chemrxiv-2024-1s25h
2024-05-23
Abstract: The efficacy of neural network potentials (NNPs) critically depends on the quality of the configurational datasets used for training. Prior research using empirical potentials has shown that well-selected liquid-solid transitional configurations of a metallic system can be translated to other metallic systems. This study demonstrates that such validated configurations can be relabeled using density functional theory (DFT) calculations, thereby enhancing the development of high-fidelity NNPs. Training strategies and sampling approaches are efficiently assessed using empirical potentials and subsequently relabeled via DFT in a highly parallelized fashion for high-fidelity NNP training. Our results reveal that relying solely on energy and force for NNP training is inadequate to prevent overfitting, highlighting the necessity of incorporating stress terms into the loss functions. To optimize training involving force and stress terms, we propose employing transfer learning to fine-tune the weights, ensuring the potential surface is smooth for these quantities composed of energy derivatives. This approach markedly improves the accuracy of elastic constants derived from simulations in both empirical potential-based NNP and relabeled DFT-based NNP. Overall, this study offers significant insights into leveraging empirical potentials to expedite the development of reliable and robust NNPs at the DFT level.
Chemistry
What problem does this paper attempt to address?
This paper examines how sampling with low-fidelity empirical potentials can improve high-fidelity neural network potentials (NNPs). The author generates configurations with empirical potentials and relabels them with density functional theory (DFT) calculations to build DFT-level NNPs for metallic systems. The study finds that training on energies and forces alone is not enough to prevent overfitting, so stress terms must be included in the loss function. For training with force and stress terms, the paper proposes transfer learning to fine-tune the weights, which keeps the potential energy surface smooth for these energy-derivative quantities. This approach markedly improves the accuracy of the elastic constants obtained from simulations, for both the empirical-potential-based NNP and the relabeled DFT-based NNP.

The workflow uses empirical potentials for large-scale configuration sampling, enhances that sampling with multiple order multiple temperature (MOMT) molecular dynamics, and then trains NNPs with active learning and data distillation. The selected configurations are relabeled with DFT to construct DFT-based NNPs. Including stress terms in training proves crucial for predicting mechanical properties such as elastic constants. Overall, the results indicate that combining low-fidelity sampling with high-fidelity labels can accelerate the development of reliable and robust NNPs at the DFT level. The approach is presented as applicable not only to single-metal systems such as nickel but also to more complex materials such as binary and ternary alloys and metal oxides.
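The role of the stress term can be illustrated with a small sketch. The function below is not the paper's loss function; it is a generic weighted sum of mean-squared errors on energy per atom, forces, and stress, and the names `combined_loss`, `w_energy`, `w_force`, and `w_stress`, the Voigt-ordered six-component stress, and all numbers are assumptions made for illustration. It shows that a prediction can match the reference energy and forces exactly and still carry a stress error that an energy-and-force loss cannot see.

```python
import numpy as np


def combined_loss(pred, ref, n_atoms, w_energy=1.0, w_force=1.0, w_stress=0.0):
    """Weighted sum of mean-squared errors (hypothetical form, not the paper's).

    pred, ref: dicts with "energy" (float), "forces" (n_atoms x 3 array),
    and "stress" (6 components, assumed Voigt order).
    """
    e_term = np.mean(((pred["energy"] - ref["energy"]) / n_atoms) ** 2)
    f_term = np.mean((pred["forces"] - ref["forces"]) ** 2)
    s_term = np.mean((pred["stress"] - ref["stress"]) ** 2)
    return w_energy * e_term + w_force * f_term + w_stress * s_term


n_atoms = 4

# Made-up reference labels for one configuration.
ref = {
    "energy": -20.0,
    "forces": np.zeros((n_atoms, 3)),
    "stress": np.zeros(6),
}

# Made-up prediction: energy and forces match, the normal stresses do not.
pred = {
    "energy": -20.0,
    "forces": np.zeros((n_atoms, 3)),
    "stress": np.array([0.5, 0.5, 0.5, 0.0, 0.0, 0.0]),
}

# Energy + force only: the stress error is invisible to the loss.
loss_ef = combined_loss(pred, ref, n_atoms, w_stress=0.0)

# Energy + force + stress: the same prediction is now penalized.
loss_efs = combined_loss(pred, ref, n_atoms, w_stress=1.0)

print(loss_ef, loss_efs)
```

In practice the three terms have different units and magnitudes, so the weights would need tuning. A fine-tuning stage of the kind the paper proposes could, for example, start from weights trained under one setting and continue training with the force and stress weights switched on, but the exact schedule is not specified in this summary.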