Abstract:Modeling of human intestinal absorption (HIA) data of 175 diverse drugs and 336 calculated descriptors is performed to develop global predictive models that are applicable to the whole medicinal chemistry space. With this aim, we employed two automated procedures, (a) Sphere Exclusion Algorithm (SEA) to select members of the training and test sets based on structural dissimilarity and (b) k‐Nearest Neighbors (kNN) method along with Genetic Algorithms (kNN‐QSAR‐GA) to select significant and independent descriptors. This methodology helped us to derive optimal Quantitative Structure–Property Relationship (QSPR) models based on three and four descriptors. The best three descriptor model is based on Delta Chi Index of order 3 (Cluster), Hydrogen type E‐State index ShsOH, AlogP99 ($\rm{ q_{{\rm{LOO}}}^2 }$=0.7401 and $\rm{ q_{{\rm{ext}}}^2 }$=0.7989); the best four variable model is based on auto‐correlation descriptor (Moran) weighted by atomic weights – order 7, AI‐State_Indices_AISssssC, number of hydrogen bond acceptors, AlogP99 ($\rm{ q_{{\rm{LOO}}}^2 }$=0.8196 and $\rm{ q_{{\rm{ext}}}^2 }$=0.6999). Based on extensive validation tests of the models M1–M4, comparison of their overall performance and $\rm{ q_{{\rm{ext}}}^2 }$ statistics with reported models using other approaches, it is shown that: (a) the models have high stability and are robust and (b) for the first time in HIA modeling, the combination of an automated training set selection (SEA) followed by variable selection (kNN‐QSAR_GA) is shown to be a promising methodology to build multiple stable models that are useful in consensus prediction. From the analysis of the physical meaning of the selected descriptors, it is inferred that the HIA of small organic compounds can be accurately predicted using calculated descriptors that code for the following fundamental properties: (1) lipophilicity, (2) hydrogen bonding capacity, (3) size, and (4) shape and further, the role of new calculated descriptors on the HIA profile of small organic compounds is uncovered. Finally, as the models reported herein are based on computed properties, they appear to be a valuable tool in virtual screening, where selection and prioritization of candidates is required.

In Silico ADME Modeling 3: Computational Models to Predict Human Intestinal Absorption Using Sphere Exclusion and kNN QSAR Methods

ADME Evaluation in Drug Discovery. 8. the Prediction of Human Intestinal Absorption by a Support Vector Machine

In Silico Modeling on ADME Properties of Natural Products: Classification Models for Blood-Brain Barrier Permeability, Its Application to Traditional Chinese Medicine and in Vitro Experimental Validation

Quantitative Structure–activity Relationship: Promising Advances in Drug Discovery Platforms

Predicting Human Intestinal Absorption from Three Dimensional Molecular Structure of Drugs

Ionization-specific analysis of human intestinal absorption

ADME evaluation in drug discovery. 7. Prediction of oral absorption by correlation and classification.

Qsar Study and Volsurf Characterization of Human Intestinal Absorption of Drugs

Artificial Intelligence-Based Quantitative Structure–Property Relationship Model for Predicting Human Intestinal Absorption of Compounds with Serotonergic Activity

Modeling of small molecule's affinity to phospholipids using IAM-HPLC and QSRR approach enhanced by similarity-based machine algorithms

ADME evaluation in drug discovery. 1. Applications of genetic algorithms to the prediction of blood-brain partitioning of a large set of drugs.

Structure – ADME Relationship: Still a Long Way to Go?

Structure-ADME relationship: still a long way to go?

ADME Properties Evaluation in Drug Discovery: Prediction of Caco-2 Cell Permeability Using a Combination of NSGA-II and Boosting

Validating ADME QSAR Models Using Marketed Drugs

Predicting ADME properties in silico: methods and models

Recent Advances On In Silico Adme Modeling

Computational modeling of human oral bioavailability: what will be next?

Advances in Computationally Modeling Human Oral Bioavailability.

Comparative Analysis of Chemical Descriptors by Machine Learning Reveals Atomistic Insights into Solute-Lipid Interactions

How Precise Are Our Quantitative Structure-Activity Relationship Derived Predictions for New Query Chemicals?