Overcoming the chemical complexity bottleneck in on-the-fly machine learned molecular dynamics simulations

Lucas R. Timmerman, Shashikant Kumar, Phanish Suryanarayana, Andrew J. Medford
2024-06-12
Abstract: We develop a framework for on-the-fly machine learned force field molecular dynamics simulations based on the multipole featurization scheme that overcomes the bottleneck with the number of chemical elements. Considering bulk systems with up to 6 elements, we demonstrate that the number of density functional theory calls remains approximately independent of the number of chemical elements, in contrast to the increase in the smooth overlap of atomic positions scheme.
Computational Physics, Materials Science, Chemical Physics
What problem does this paper attempt to address?
This paper addresses the growth in computational cost that on-the-fly machine learned force field (MLFF) molecular dynamics simulations incur as the number of chemical elements increases. Existing schemes, such as MLFFs based on the Smooth Overlap of Atomic Positions (SOAP), scale poorly for multi-element systems: their cost rises sharply with the number of elements. The paper proposes an on-the-fly MLFF framework based on multipole featurization that overcomes this bottleneck. For bulk systems containing up to 6 elements, the number of density functional theory (DFT) calls remains approximately independent of the number of elements, whereas it increases with the SOAP scheme.
The method uses normalized Gaussian Multipole (GMP) descriptors, whose feature vector size is independent of the number of chemical elements. This reduces the cost of training and inference and makes on-the-fly training more robust for chemically complex systems. The researchers implement GMP-based on-the-fly potentials in the SPARC electronic structure software package and apply them to a range of metal and alloy systems, showing that the GMP model achieves accuracy and stability comparable to the SOAP model at lower computational cost.
The paper also discusses the challenges of hyperparameter optimization, noting that poor hyperparameter choices can yield unstable models and even unphysical results. Despite these challenges, the results indicate the potential of the GMP featurization scheme for on-the-fly force fields in multi-element systems and its scalability to larger simulations, particularly for computing complex properties such as the free energy of high-entropy alloys.
Future work will focus on improving the efficiency of machine learning operations, automating hyperparameter selection, and developing more reliable uncertainty quantification and active learning methods.
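The scaling argument in the summary above (a SOAP feature vector that grows with the number of elements versus a GMP feature vector of fixed size) can be illustrated with a simple count. The sketch below is not taken from the paper: it assumes a SOAP power spectrum that keeps all element-pair cross terms, and a GMP-style vector with one entry per combination of Gaussian width and multipole order. The function names and the parameter values (`n_max=8`, `l_max=6`, `n_sigmas=8`, `max_order=3`) are hypothetical and chosen only for illustration.

```python
def soap_power_spectrum_size(n_species: int, n_max: int, l_max: int) -> int:
    """Length of a SOAP power spectrum vector, assuming all element-pair
    cross terms are kept (an assumption; truncated variants are shorter)."""
    n_channels = n_species * n_max  # one set of radial channels per element
    # Unique unordered channel pairs, times the number of angular channels.
    return n_channels * (n_channels + 1) // 2 * (l_max + 1)


def gmp_size(n_sigmas: int, max_order: int) -> int:
    """Length of a GMP-style vector, assuming one feature per
    (Gaussian width, multipole order) pair; no dependence on element count."""
    return n_sigmas * (max_order + 1)


if __name__ == "__main__":
    # Hypothetical hyperparameters, for illustration only.
    print("elements  SOAP  GMP")
    for n_species in range(1, 7):
        soap = soap_power_spectrum_size(n_species, n_max=8, l_max=6)
        gmp = gmp_size(n_sigmas=8, max_order=3)
        print(f"{n_species:8d}  {soap:4d}  {gmp:3d}")
```

Under these assumptions the SOAP vector length grows roughly quadratically with the number of elements, while the GMP-style length stays constant, which is the property the summary credits for the lower training and inference cost.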