Electronic Structure Prediction of Multi-million Atom Systems Through Uncertainty Quantification Enabled Transfer Learning

Shashank Pathrudkar,Ponkrshnan Thiagarajan,Shivang Agarwal,Amartya S. Banerjee,Susanta Ghosh
2024-05-01
Abstract:The ground state electron density -- obtainable using Kohn-Sham Density Functional Theory (KS-DFT) simulations -- contains a wealth of material information, making its prediction via machine learning (ML) models attractive. However, the computational expense of KS-DFT scales cubically with system size which tends to stymie training data generation, making it difficult to develop quantifiably accurate ML models that are applicable across many scales and system configurations. Here, we address this fundamental challenge by employing transfer learning to leverage the multi-scale nature of the training data, while comprehensively sampling system configurations using thermalization. Our ML models are less reliant on heuristics, and being based on Bayesian neural networks, enable uncertainty quantification. We show that our models incur significantly lower data generation costs while allowing confident -- and when verifiable, accurate -- predictions for a wide variety of bulk systems well beyond training, including systems with defects, different alloy compositions, and at unprecedented, multi-million-atom scales. Moreover, such predictions can be carried out using only modest computational resources.
Materials Science,Disordered Systems and Neural Networks,Computational Physics,Quantum Physics
What problem does this paper attempt to address?
This paper aims to address the challenges of predicting the ground state electron density of large-scale atomic systems, especially when using machine learning (ML) models to predict millions of atomic systems. Traditional Kohn-Sham density functional theory (KS-DFT) calculations become computationally expensive as the system size increases, limiting the accuracy of training data generation and models. The paper proposes a method that combines transfer learning and uncertainty quantification, using multiscale training data and thermalization techniques to comprehensively sample system configurations, in order to build ML models based on Bayesian neural networks. This method reduces the cost of data generation and provides reliable predictions, especially for systems with defects, different alloy compositions, and large-scale systems. Additionally, this approach allows predictions to be made with moderate computational resources, overcoming the cubic complexity of KS-DFT calculations.