Abstract:BACKGROUND AND OBJECTIVE:Due to the synergistic effects of drugs, drug combination is one of the effective approaches for treating complex diseases. However, the identification of drug combinations by dose-response methods is still costly. It is promising to develop supervised learning-based approaches to predict potential drug combinations on a large scale. Nevertheless, these approaches have the inadequate utilization of heterogeneous features, which causes the loss of information useful to classification. Moreover, they have an intrinsic bias, because they assume unknown drug pairs as non-combinations, of which some could be real drug combinations in practice.METHODS:To address above issues, this work first designs a two-layer multiple classifier system (TLMCS) to effectively integrate heterogeneous features involving anatomical therapeutic chemical codes of drugs, drug-drug interactions, drug-target interactions, gene ontology of drug targets, and side effects. To avoid the bias caused by labelling unknown samples as negative, it then utilizes the one-class support vector machines, (which requires no negative instance and only labels approved drug combinations as positive instances), as the member classifiers in TLMCS. Last, both a 10-fold cross validation (10-CV) and a novel prediction are performed to validate the performance of TLMCS.RESULTS:The comparison with three state-of-the-art approaches under 10-CV exhibits the superiority of TLMCS, which achieves the area under the receiver operating characteristic curve = 0.824 and the area under the precision-recall curve = 0.372. Moreover, the experiment under the novel prediction demonstrates its ability, where 9 out of the top-20 predicted combinative drug pairs are validated by checking the published literature. Furthermore, for each of the newly-validated drug combinations, this work analyses the combining mode of the member drugs and investigates their relationship in terms of drug targeting pathways.CONCLUSIONS:The proposed TLMCS provides an effective framework to integrate those heterogeneous features and is trained by only positive samples such that the bias of taking unknown drug pairs as negative samples can be avoided. Furthermore, its results in the novel prediction reveal five types of drug combinations and three types of drug relationships in terms of pathways.

Prediction of CYP450 Enzyme-Substrate Selectivity Based on the Network-Based Label Space Division Method

Classification models for predicting cytochrome p450 enzyme-substrate selectivity

STS-NLSP: A Network-Based Label Space Partition Method for Predicting the Specificity of Membrane Transporter Substrates Using a Hybrid Feature of Structural and Semantic Similarity

Prediction of Cytochrome P450 Substrates Using the Explainable Multitask Deep Learning Models

Survey of Machine Learning Techniques for Prediction of the Isoform Specificity of Cytochrome P450 Substrates.

Improved Prediction Of Drug-Target Interactions Based On Ensemble Learning With Fuzzy Local Ternary Pattern

Computational Prediction of the Isoform Specificity of Cytochrome P450 Substrates by an Improved Bayesian Method

Classification Of Cytochrome P450 1a2 Inhibitors And Noninhibitors Based On Deep Belief Network

ATC-NLSP: Prediction of the Classes of Anatomical Therapeutic Chemicals Using a Network-Based Label Space Partition Method.

Predicting Combinative Drug Pairs Via Multiple Classifier System with Positive Samples Only

ADMET Evaluation in Drug Discovery. 19. Reliable Prediction of Human Cytochrome P450 Inhibition Using Artificial Intelligence Approaches

Prediction of Interaction Between Enzymes and Small Molecules in Metabolic Pathways Through Integrating Multiple Classifiers.

Machine learning to predict metabolic drug interactions related to cytochrome P450 isozymes

Prediction of human cytochrome P450 inhibition using a multi-task deep autoencoder neural network.

Machine Learning Models to Predict Cytochrome P450 2B6 Inhibitors and Substrates

Improved Prediction of Michaelis Constants in CYP450-Mediated Reactions by Resilient Back Propagation Algorithm.

DeepP450: Predicting Human P450 Activities of Small Molecules by Integrating Pretrained Protein Language Model and Molecular Representation

Prediction of Cytochrome P450 Inhibition Using a Deep Learning Approach and Substructure Pattern Recognition

Deep Learning Based Drug Metabolites Prediction

Predicting Drug-Target Interaction Networks Based on Functional Groups and Biological Features.

Computational Prediction of Site of Metabolism for UGT-catalyzed Reactions.