Abstract:BACKGROUND AND OBJECTIVE:Due to the synergistic effects of drugs, drug combination is one of the effective approaches for treating complex diseases. However, the identification of drug combinations by dose-response methods is still costly. It is promising to develop supervised learning-based approaches to predict potential drug combinations on a large scale. Nevertheless, these approaches have the inadequate utilization of heterogeneous features, which causes the loss of information useful to classification. Moreover, they have an intrinsic bias, because they assume unknown drug pairs as non-combinations, of which some could be real drug combinations in practice.METHODS:To address above issues, this work first designs a two-layer multiple classifier system (TLMCS) to effectively integrate heterogeneous features involving anatomical therapeutic chemical codes of drugs, drug-drug interactions, drug-target interactions, gene ontology of drug targets, and side effects. To avoid the bias caused by labelling unknown samples as negative, it then utilizes the one-class support vector machines, (which requires no negative instance and only labels approved drug combinations as positive instances), as the member classifiers in TLMCS. Last, both a 10-fold cross validation (10-CV) and a novel prediction are performed to validate the performance of TLMCS.RESULTS:The comparison with three state-of-the-art approaches under 10-CV exhibits the superiority of TLMCS, which achieves the area under the receiver operating characteristic curve = 0.824 and the area under the precision-recall curve = 0.372. Moreover, the experiment under the novel prediction demonstrates its ability, where 9 out of the top-20 predicted combinative drug pairs are validated by checking the published literature. Furthermore, for each of the newly-validated drug combinations, this work analyses the combining mode of the member drugs and investigates their relationship in terms of drug targeting pathways.CONCLUSIONS:The proposed TLMCS provides an effective framework to integrate those heterogeneous features and is trained by only positive samples such that the bias of taking unknown drug pairs as negative samples can be avoided. Furthermore, its results in the novel prediction reveal five types of drug combinations and three types of drug relationships in terms of pathways.

Predicting drug side effects by multi-label learning and ensemble learning

Learning important features from multi-view data to predict drug side effects

Predicting potential side effects of drugs by recommender methods and ensemble learning.

Drug Side Effect Prediction Through Linear Neighborhoods and Multiple Data Source Integration

Improved Prediction Of Drug-Target Interactions Based On Ensemble Learning With Fuzzy Local Ternary Pattern

Drug Side-Effect Prediction Using Heterogeneous Features and Bipartite Local Models

Quantitative Prediction of Drug Side Effects Based on Drug-Related Features

MSDSE: Predicting drug-side effects based on multi-scale features and deep multi-structure neural network

Prediction of drug side effects with transductive matrix co-completion

Identification of Drug-Side Effect Association via Semisupervised Model and Multiple Kernel Learning

Predicting Drug Side Effects Using Data Analytics and the Integration of Multiple Data Sources

An Explainable Framework for Predicting Drug-Side Effect Associations Via Meta-Path-Based Feature Learning in Heterogeneous Information Network.

A unified frame of predicting side effects of drugs by using linear neighborhood similarity

Predicting Combinative Drug Pairs Via Multiple Classifier System with Positive Samples Only

Feature-derived Graph Regularized Matrix Factorization for Predicting Drug Side Effects.

Identification of Drug-Side Effect Association Via Restricted Boltzmann Machines with Penalized Term.

Multiple Kronecker RLS fusion-based link propagation for drug-side effect prediction

Dual Representation Learning for Predicting Drug-side Effect Frequency using Protein Target Information

Identification of drug-side effect association via multiple information integration with centered kernel alignment

Predicting Drugs Side Effects Based on Chemical-Chemical Interactions and Protein-Chemical Interactions

Drug Side-effect Prediction Based on Comprehensive Drug Similarity