Using in vitro ADME data for lead compound selection: An emphasis on PAMPA pH 5 permeability and oral bioavailability
Jordan Williams,Vishal Siramshetty,Ðắc-Trung Nguyễn,Elias Carvalho Padilha,Md Kabir,Kyeong-Ri Yu,Amy Q Wang,Tongan Zhao,Misha Itkin,Paul Shinn,Ewy A Mathé,Xin Xu,Pranav Shah,Md. Kabir,Amy Q. Wang,Ewy A. Mathé
DOI: https://doi.org/10.1016/j.bmc.2021.116588
2022-02-01
Abstract:Membrane permeability plays an important role in oral drug absorption. Caco-2 and Madin-Darby Canine Kidney (MDCK) cell culture systems have been widely used for assessing intestinal permeability. Since most drugs are absorbed passively, Parallel Artificial Membrane Permeability Assay (PAMPA) has gained popularity as a low-cost and high-throughput method in early drug discovery when compared to high-cost, labor intensive cell-based assays. At the National Center for Advancing Translational Sciences (NCATS), PAMPA pH 5 is employed as one of the Tier I absorption, distribution, metabolism, and elimination (ADME) assays. In this study, we have developed a quantitative structure activity relationship (QSAR) model using our ∼6500 compound PAMPA pH 5 permeability dataset. Along with ensemble decision tree-based methods such as Random Forest and eXtreme Gradient Boosting, we employed deep neural network and a graph convolutional neural network to model PAMPA pH 5 permeability. The classification models trained on a balanced training set provided accuracies ranging from 71% to 78% on the external set. Of the four classifiers, the graph convolutional neural network that directly operates on molecular graphs offered the best classification performance. Additionally, an ∼85% correlation was obtained between PAMPA pH 5 permeability and in vivo oral bioavailability in mice and rats. These results suggest that data from this assay (experimental or predicted) can be used to rank-order compounds for preclinical in vivo testing with a high degree of confidence, reducing cost and attrition as well as accelerating the drug discovery process. Additionally, experimental data for 486 compounds (PubChem AID: 1645871) and the best models have been made publicly available (https://opendata.ncats.nih.gov/adme/).
biochemistry & molecular biology,chemistry, medicinal, organic