Ab initio Accuracy Neural Network Potential for Drug-like Molecules

Manyi Yang,Duo Zhang,Xinyan Wang,Lingfeng Zhang,Tong Zhu,Han Wang
DOI: https://doi.org/10.26434/chemrxiv-2024-sq8nh
2024-05-20
Abstract:The advent of machine learning (ML) in computational chemistry heralds a transformative approach to one of the quintessential challenges in computer-aided drug design (CADD): the accurate and cost-effective calculation of atomic interactions. By leveraging a neural network (NN) potential, we address this balance and push the boundaries of the NN potential's representational capacity. Our work details the development of a robust general-purpose NN potential, architected on the framework of DPA-2, a deep learning potential with attention, which demonstrates remarkable fidelity in replicating the interatomic potential energy surface for drug-like molecules comprising eight critical chemical elements: H, C, N, O, F, S, Cl, and P. We employed state-of-the-art molecular dynamic techniques, including temperature acceleration and enhanced sampling, to construct a comprehensive dataset to ensure exhaustive coverage of relevant configurational spaces. Our rigorous testing protocols, including torsion scanning, global minimum searches, and high-temperature MD simulations across various organic molecules, have culminated in an NN model that achieves chemical precision commensurate with the highly regarded DFT model, while significantly outstripping the accuracy of prevalent semi-empirical methods. This study presents a leap forward in the predictive modelling of molecular interactions, offering extensive applications in drug development and beyond.
Chemistry
What problem does this paper attempt to address?
This paper aims to address a key challenge in computer-aided drug design (CADD), which is how to reduce computational costs while maintaining computational accuracy in accurately simulating molecular interactions. The research team used neural network (NN) potentials to balance this requirement and extended the capability of NN potentials, specifically targeting drug-like molecules containing eight key chemical elements: H, C, N, O, F, S, Cl, and P. They developed a robust and universal NN potential called DPA-2-Drug, based on the deep learning architecture DPA-2, which can replicate the atomic potentials of these molecules. To construct the training dataset, the researchers employed advanced molecular dynamics techniques such as temperature acceleration and enhanced sampling to ensure comprehensive coverage of relevant configuration space. Through concurrent learning algorithms, they reduced the number of configurations required for training while maintaining or expanding the representative chemical and conformational spaces of drug-like molecules. The DPA-2-Drug model was tested on various organic molecules, including torsional scans, global minimum searches, and high-temperature MD simulations, demonstrating its chemical accuracy on par with density functional theory (DFT) methods and superiority over popular semi-empirical methods. The challenges mentioned in the paper include creating a universal machine learning model capable of accurately predicting interactions among a wide range of drug molecules and reducing the size of the training dataset without sacrificing accuracy. Through their approach, the DPA-2-Drug model achieves quantum mechanical level accuracy while significantly improving computational efficiency, providing a powerful tool for drug development and other molecular interaction predictions.