Structure-Kinetic Relationship for Drug Design Revealed by a PLS Model with Retrosynthesis-Based Pre-Trained Molecular Representation and Molecular Dynamics Simulation

Feng Zhou,Shiqiu Yin,Yi Xiao,Zaiyun Lin,Weiqiang Fu,Yingsheng J. Zhang
DOI: https://doi.org/10.1021/acsomega.3c02294
IF: 4.1
2023-01-01
ACS Omega
Abstract:Drug design based on kinetic properties is growing inapplication.Here, we applied retrosynthesis-based pre-trained molecular representation(RPM) in machine learning (ML) to train 501 inhibitors of 55 proteinsand successfully predicted the dissociation rate constant (k (off)) values of 38 inhibitors from an independentdataset for the N-terminal domain of heat shock protein 90 alpha(N-HSP90). Our RPM molecular representation outperforms other pre-trainedmolecular representations such as GEM, MPG, and general moleculardescriptors from RDKit. Furthermore, we optimized the acceleratedmolecular dynamics to calculate the relative retention time (RT) forthe 128 inhibitors of N-HSP90 and obtained the protein-ligandinteraction fingerprints (IFPs) on their dissociation pathways andtheir influencing weights on the k (off) value.We observed a high correlation among the simulated, predicted, andexperimental -log-(k (off)) values.Combining ML, molecular dynamics (MD) simulation, and IFPs derivedfrom accelerated MD helps design a drug for specific kinetic propertiesand selectivity profiles to the target of interest. To further validateour k (off) predictive ML model, we testedour model on two new N-HSP90 inhibitors, which have experimental k (off) valuesand are not in our ML training dataset. The predicted k (off) values are consistent with experimental data, andthe mechanism of their kinetic properties can be explained by IFPs,which shed light on the nature of their selectivity against N-HSP90protein. We believe that the ML model described here is transferableto predict k (off) of other proteins andwill enhance the kinetics-based drug design endeavor.
What problem does this paper attempt to address?