In Silico Prediction of Human Intravenous Pharmacokinetic Parameters with Improved Accuracy
Yuchen Wang,Haichun Liu,Yuanrong Fan,Xingye Chen,Yan Yang,Lu Zhu,Junnan Zhao,Yadong Chen,Yanmin Zhang
DOI: https://doi.org/10.1021/acs.jcim.9b00300
IF: 6.162
2019-08-12
Journal of Chemical Information and Modeling
Abstract:Human pharmacokinetics is of great significance in the selection of drug candidates, and in silico estimation of pharmacokinetic parameters in the early stage of drug development has become the trend of drug research owing to its time- and cost-saving advantages. Herein, quantitative structure–property relationship studies were carried out to predict four human pharmacokinetic parameters including volume of distribution at steady state (VD<sub>ss</sub>), clearance (CL), terminal half-life (<i>t</i><sub>1/2</sub>), and fraction unbound in plasma (<i>f</i><sub>u</sub>), using a data set consisting of 1352 drugs. A series of regression models were built using the most suitable features selected by Boruta algorithm and four machine learning methods including support vector machine (SVM), random forest (RF), gradient boosting machine (GBM), and XGBoost (XGB). For VD<sub>ss</sub>, SVM showed the best performance with <i>R</i><sup>2</sup><sub>test</sub> = 0.870 and RMSE<sub>test</sub> = 0.208. For the other three pharmacokinetic parameters, the RF models produced the superior prediction accuracy (for CL, <i>R</i><sup>2</sup><sub>test</sub> = 0.875 and RMSE<sub>test</sub> = 0.103; for <i>t</i><sub>1/2</sub>, <i>R</i><sup>2</sup><sub>test</sub> = 0.832 and RMSE<sub>test</sub> = 0.154; for <i>f</i><sub>u</sub>, <i>R</i><sup>2</sup><sub>test</sub> = 0.818 and RMSE<sub>test</sub> = 0.291). Assessed by 10-fold cross validation, leave-one-out cross validation, Y-randomization test and applicability domain evaluation, these models demonstrated excellent stability and predictive ability. Compared with other published models for human pharmacokinetic parameters estimation, it was further confirmed that our models obtained better predictive ability and could be used in the selection of preclinical candidates.
chemistry, multidisciplinary, medicinal,computer science, interdisciplinary applications, information systems