Density Fitted Fragment Embedding – Principles and Applications

Yi Sun
DOI: https://doi.org/10.26434/chemrxiv-2024-0cb84-v3
2024-04-23
Abstract: We demonstrate how deterministic, stochastic and semistochastic density fitted fragment embedding can be constructed using both non-overlapping fragments and overlapping fragments to perform energy evaluations. We then implement the frameworks to first perform energy calculations on water clusters using a polarised 3-zeta basis set to demonstrate the observed scaling of the algorithms, which is $\mathcal{O}(N_{AO,\text{mol}}^{2.62})$ and stochastic algorithms can help to reduce the pre-factor. We then perform numerical structural optimisations on cyclic water and hydrogen fluoride clusters using the deterministic algorithm. The optimised structures' hydrogen bonding energies are then compared with results using the corresponding correlated all-electron solver, which is CCSD in this work. It turns out that if each interacting molecule is chosen as a fragment, d-BE1-DF-CCSD is able to recover almost all of the hydrogen bonding energy in both water and hydrogen fluoride clusters calculated using all-electron DF-CCSD. This work therefore provides foundations for the efficient generation and quality of fragment embedding energy data in large, weakly interacting systems.
Chemistry
What problem does this paper attempt to address?
The paper addresses how to evaluate energies efficiently in large molecular systems using deterministic, stochastic, and semi-stochastic density-fitted fragment embedding (DF-FE), and tests how well these methods recover hydrogen-bond interactions in weakly interacting systems such as water clusters and hydrogen fluoride clusters.

### Specific problems include:

1. **Reducing computational cost**:
   - Conventional coupled-cluster theory with singles and doubles (CCSD) formally scales as \(O(N^6)\) with system size, which severely limits its use on large systems. The paper combines density fitting (DF), which replaces four-index two-electron integrals with contractions of three-index quantities, with fragment embedding, in which the correlated solver is applied only to small fragment problems. The observed overall scaling is \(O(N^{2.62}_{\text{AO,mol}})\).
2. **Improving the accuracy for weakly interacting systems**:
   - A key question for fragment embedding is whether it can accurately recover weak interactions such as hydrogen bonds. The paper compares fragment-embedding results (d-BE1-DF-CCSD) with all-electron DF-CCSD results to assess how much of the hydrogen-bonding energy is recovered in water clusters and hydrogen fluoride clusters.
3. **Optimizing structures and evaluating energies**:
   - The paper performs numerical structural optimizations to test the fragment-embedding method on molecular geometries. Specifically, it uses the deterministic algorithm to optimize cyclic water clusters and cyclic hydrogen fluoride clusters and compares the hydrogen-bonding energies of the optimized structures.
4. **Introducing stochastic and semi-stochastic methods**:
   - Stochastic and semi-stochastic variants (s-DF and ss-DF) are introduced to reduce the computational cost further. In particular, the ss-DF method reduces random errors by limiting the number of random orbitals, thereby maintaining high accuracy in large systems.

### Main conclusions:

- The deterministic DF-FE method (d-BE1-DF-CCSD) recovers approximately 97% of the hydrogen-bond interaction energy in weakly interacting systems.
- With DF, the computational cost is significantly reduced: the iteration time grows nearly linearly, and the total computation time scales as \(O(N^{2.62}_{\text{AO,mol}})\).
- Although the semi-stochastic method (ss-DF) introduces random errors, these can be reduced by averaging over multiple calculations, while the computational cost is still significantly lowered.

In conclusion, the paper provides an efficient fragment-embedding method that significantly reduces the cost of energy calculations for weakly interacting large molecular systems while maintaining high accuracy.
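An observed scaling such as \(O(N^{2.62}_{\text{AO,mol}})\) is typically extracted by fitting a power law to timings in log-log space. The sketch below shows one way this could be done with NumPy. The function name `fit_scaling_exponent` and the timing values are illustrative assumptions: the timings are synthetic, generated from an exact power law, and are not data from the paper.

```python
import numpy as np

def fit_scaling_exponent(n_ao, wall_time):
    """Fit t = c * N**p by linear least squares in log-log space.

    Returns the exponent p and the prefactor c.
    """
    log_n = np.log(np.asarray(n_ao, dtype=float))
    log_t = np.log(np.asarray(wall_time, dtype=float))
    # Slope of the log-log line is the exponent; intercept is log(prefactor).
    p, log_c = np.polyfit(log_n, log_t, 1)
    return p, np.exp(log_c)

# Synthetic timings (assumption, NOT taken from the paper): exact power law.
n_ao = np.array([100, 200, 400, 800, 1600])
wall_time = 1.0e-4 * n_ao ** 2.62

exponent, prefactor = fit_scaling_exponent(n_ao, wall_time)
print(f"fitted exponent: {exponent:.2f}")
```

In this picture, a lower pre-factor (as the abstract attributes to the stochastic algorithms) shifts the fitted line down without changing its slope.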
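The statement that random errors shrink when more stochastic samples are averaged can be illustrated with a generic stochastic resolution of the identity. This is a minimal sketch of the statistical idea only, assuming random vectors with independent ±1 entries; it is not the paper's s-DF or ss-DF algorithm, and the names `stochastic_identity` and `rms_error` are hypothetical.

```python
import numpy as np

def stochastic_identity(n_aux, n_samples, rng):
    """Estimate the n_aux x n_aux identity as the sample mean of
    theta @ theta.T over random vectors theta with +/-1 entries."""
    theta = rng.choice([-1.0, 1.0], size=(n_aux, n_samples))
    return theta @ theta.T / n_samples

def rms_error(n_aux, n_samples, rng):
    """Root-mean-square deviation of the estimate from the exact identity."""
    approx = stochastic_identity(n_aux, n_samples, rng)
    return np.sqrt(np.mean((approx - np.eye(n_aux)) ** 2))

rng = np.random.default_rng(0)
n_aux = 50  # illustrative size of the (hypothetical) auxiliary space

# The error is expected to fall roughly as 1 / sqrt(n_samples).
errors = {n: rms_error(n_aux, n, rng) for n in (100, 400, 1600, 6400)}
for n, err in errors.items():
    print(f"n_samples = {n:5d}  rms error = {err:.4f}")
```

Because the error falls only as the inverse square root of the number of samples, halving it costs four times as many samples, which is one reason a semi-stochastic scheme that treats only part of the space stochastically can be attractive.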