Computational prediction for the metabolism of human UDP-glucuronosyltransferase 1A1 substrates
Ya-Bian Luo,Yan-Yao Hou,Zhen Wang,Xin-Man Hu,Wei Li,Yan Li,Yong Liu,Tong-Jiang Li,Chun-Zhi Ai
DOI: https://doi.org/10.1016/j.compbiomed.2022.105959
IF: 7.7
2022-10-01
Computers in Biology and Medicine
Abstract:UDP-glucuronosyltransferase (UGT) 1A1, one of the most important isoforms in UGTs superfamily, has attracted increasing concerns for its special role in the clearance and detoxification of endogenous and exogenous substances. To avoid the clinical drug-drug interactions, it is of great importance to have the knowledge of the metabolic profile of UGT1A1 substrates early. Herein, we purposed to establish machine learning models to predict the metabolic propeties of UGT1A1 substrates. On the basis of the literature-derived substrates database of UGT1A1, automatic metabolism prediction models for the aromatic hydroxyl (ArOH) and carboxyl (COOH) groups were developed with eight machine learning methods, among which, three methods, i.e. Random Forest, Random Subspace and J48, illustrated the best performance either for the aromatic hydroxyl and the carboxyl model. The models illustrated good robustness when they were evaluated with functions like “Precision”, “Recall”, “F-Measure”, “AUC”, “MCC”, etc. Nice accuracy was observed for the aromatic hydroxyl and carboxyl model of these methods, whose AUCs ranged from 0.901 to 0.997. Additionally, the ArOH model was applied to predict the UGT1A1-mediated metabolism of an external set. Two new unknown substrates, cytochrome P450 (CYPs)-mediated metabolites of gefitinib, were predicted and identified, which were validated by in vitro assays. In summary, this study provides a reliable and robust strategy to predict UGT1A1 metabolites, which will be helpful either in rational-optimization of drug metabolism or in avoiding drug-drug interactions in clinic.
engineering, biomedical,computer science, interdisciplinary applications,mathematical & computational biology,biology