Reducing the Cost of Quantum Chemical Data By Backpropagating Through Density Functional Theory

Alexander Mathiasen, Hatem Helal, Paul Balanca, Adam Krzywaniak, Ali Parviz, Frederik Hvilshøj, Blazej Banaszewski, Carlo Luschi, Andrew William Fitzgibbon

2024-02-06

Abstract: Density Functional Theory (DFT) accurately predicts the quantum chemical properties of molecules, but scales as $O(N_{\text{electrons}}^3)$. Schütt et al. (2019) successfully approximate DFT 1000x faster with Neural Networks (NN). Arguably, the biggest problem one faces when scaling to larger molecules is the cost of DFT labels. For example, it took years to create the PCQ dataset (Nakata & Shimazaki, 2017) on which subsequent NNs are trained within a week. DFT labels molecules by minimizing energy $E(\cdot )$ as a "loss function." We bypass dataset creation by directly training NNs with $E(\cdot )$ as a loss function. For comparison, Schütt et al. (2019) spent 626 hours creating a dataset on which they trained their NN for 160h, for a total of 786h; our method achieves comparable performance within 31h.
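The figures quoted in the abstract can be checked with a few lines of arithmetic. The sketch below uses only the hours stated above and the stated $O(N_{\text{electrons}}^3)$ scaling; the helper name `cubic_cost_ratio` is hypothetical, and the scaling estimate ignores constant factors and hardware differences.

```python
def cubic_cost_ratio(scale):
    """Relative DFT cost when the electron count grows by `scale`,
    assuming cost ~ N^3 (constant factors ignored)."""
    return scale ** 3

dataset_hours = 626   # DFT dataset creation by Schütt et al. (2019), per the abstract
training_hours = 160  # NN training on that dataset, per the abstract
ours_hours = 31       # proposed method, per the abstract

baseline_hours = dataset_hours + training_hours
speedup = baseline_hours / ours_hours

print(baseline_hours)       # 786
print(round(speedup, 1))    # 25.4
print(cubic_cost_ratio(2))  # 8: doubling the electron count costs about 8x
```

Under these numbers, the combined dataset-plus-training pipeline takes roughly 25 times longer than the proposed method.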

Machine Learning

What problem does this paper attempt to address?

This paper explores how to reduce the cost of acquiring quantum chemical data. Density functional theory (DFT) predicts the quantum chemical properties of molecules accurately, but it is computationally expensive: its cost scales as $O(N_{\text{electrons}}^3)$, so the required time grows cubically with molecule size. The traditional approach is to first label a large number of molecules with DFT and then train a neural network (NN) on those labels. In that pipeline, generating the dataset takes considerably longer than training the network.

To avoid this cost, the authors propose a pre-training technique that uses the DFT energy function E(·) directly as the loss function for the neural network, so no DFT labels have to be generated beforehand. The method, called Quantum Pretraining Transformer (QPT), backpropagates through E(·) during training and can therefore draw new data samples at every training iteration. This helps prevent overfitting and gives the model the potential to scale arbitrarily.

The paper also highlights several supporting techniques: using initial DFT guesses to accelerate optimization, quantum bias attention, and density mixing to improve performance. Experimental results show that QPT reaches prediction accuracy comparable to previous methods without using precomputed DFT labels, while greatly reducing the total time for data creation and training. This approach opens up new avenues for pretraining neural network models on larger molecules, with potential future applications such as predicting protein-ligand interactions at a larger scale.
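The idea of training with an energy as the loss, with no precomputed labels, can be illustrated on a toy problem. The sketch below is not DFT and not the paper's QPT model. It assumes a hypothetical 2x2 Hamiltonian H(x) = [[x, 1], [1, -x]], a hypothetical two-parameter "model" theta = a + b·x that outputs a normalized trial vector (cos theta, sin theta), and a hand-written gradient in place of automatic differentiation. Each step draws fresh unlabeled inputs x and descends on the energy itself; the exact ground-state energy is used only afterwards, for evaluation.

```python
import math
import random

def energy(theta, x):
    """c^T H(x) c for H(x) = [[x, 1], [1, -x]] and c = (cos theta, sin theta)."""
    return x * math.cos(2 * theta) + math.sin(2 * theta)

def energy_grad(theta, x):
    """Analytic dE/dtheta (stands in for backpropagation in this toy)."""
    return -2 * x * math.sin(2 * theta) + 2 * math.cos(2 * theta)

def exact_ground_energy(x):
    """Lowest eigenvalue of H(x); used for evaluation only, never for training."""
    return -math.sqrt(1.0 + x * x)

def train(steps=2000, batch=16, lr=0.05, seed=0):
    rng = random.Random(seed)
    a, b = 0.0, 0.0  # hypothetical model parameters: theta = a + b * x
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for _ in range(batch):
            x = rng.uniform(-1.0, 1.0)     # fresh, unlabeled sample each step
            g = energy_grad(a + b * x, x)  # the loss is the energy itself
            grad_a += g / batch
            grad_b += g * x / batch        # chain rule through theta = a + b * x
        a -= lr * grad_a
        b -= lr * grad_b
    return a, b

def mean_energy_gap(a, b, n=201):
    """Average of E(model) - E(exact) over a grid of inputs in [-1, 1]."""
    xs = [-1.0 + 2.0 * i / (n - 1) for i in range(n)]
    return sum(energy(a + b * x, x) - exact_ground_energy(x) for x in xs) / n

a, b = train()
gap = mean_energy_gap(a, b)
print(f"mean gap to exact ground-state energy: {gap:.4f}")
```

Because the trial vector is normalized, the energy can never fall below the true ground-state energy, so the gap is non-negative and shrinking it is a valid training signal even though no label is ever shown to the model.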

Deep Neural Network Computes Electron Densities and Energies of a Large Set of Organic Molecules Faster than Density Functional Theory (DFT)

Machine Learning Quantum Reaction Rate Constants

Quantum-Enhanced Neural Exchange-Correlation Functionals

Low-data deep quantum chemical learning for accurate MP2 and coupled-cluster correlations

Pushing the frontiers of density functionals by solving the fractional electron problem

$\nabla^2$DFT: A Universal Quantum Chemistry Dataset of Drug-Like Molecules and a Benchmark for Neural Network Potentials

Neural network backflow for ab-initio quantum chemistry

Ab Initio Molecular Dynamics Simulations of Atmospheric Molecular Clusters Boosted by Neural Networks

Neural Quantum States and Peaked Molecular Wave Functions: Curse or Blessing?

Quantum deep field: data-driven wave function, electron density generation, and atomization energy prediction and extrapolation with machine learning

Implementation of the Density-functional Theory on Quantum Computers with Linear Scaling with respect to the Number of Atoms

NNQS-Transformer: an Efficient and Scalable Neural Network Quantum States Approach for Ab initio Quantum Chemistry

Ab initio quantum chemistry with neural-network wavefunctions

Accelerating finite-temperature Kohn-Sham density functional theory with deep neural networks

Reducing the cost of neural network potential generation for reactive molecular systems

Quantum-chemical insights from deep tensor neural networks

Training Neural Nets To Learn Reactive Potential Energy Surfaces Using Interactive Quantum Chemistry in Virtual Reality

Neural network distillation of orbital dependent density functional theory

Efficient quantum computation of molecular forces and other energy gradients

Reducing Numerical Precision Requirements in Quantum Chemistry Calculations