Integrating Multiscale and Machine Learning Approaches towards the SAMPL9 LogP Challenge

Michael Roy Draper,Asa Waterman,Jonathan E Dannatt,Prajay M Patel
DOI: https://doi.org/10.1039/d3cp04140a
IF: 3.3
2024-02-16
Physical Chemistry Chemical Physics
Abstract:Three techniques intertwining and integrating quantum mechanics (QM), molecular mechanics (MM), and unsupervised machine learning were utilized in the prediction of the toluene-water partition coefficient (logP tol/w ) for sixteen drug molecules as part of the ninth iteration of the Statistical Assessment of the Modeling of Proteins and Ligands (SAMPL) series of blind prediction challenges. The three blind submissions yielded mean unsigned errors (MUE) ranging from 1.53-2.93 logP tol/w units. Out of all submissions (ranked and unranked), one of these methods yielded the third lowest MUE of 1.53 indicating an overall increase in errors with respect to predicting octanol-water partition coefficients (logP o/w ) for similar drug-like molecules. After applying numerous QM and MM approaches into multiscale and data-driven approaches to supplement the initial analysis, MUEs were reduced to 1.00 logP tol/w units when using density functional theory (DFT) on a single conformation, while generating an ensemble of rotamer structures elucidates subtle electronic and structural effects that are not considered in the analysis of a single conformation. Computational approaches developed for these SAMPL challenges will continue to serve as standard predictive tools for rational drug design.
chemistry, physical,physics, atomic, molecular & chemical
What problem does this paper attempt to address?