Abstract:In recent years, increasing associations between microRNAs (miRNAs) and human diseases have been identified. Based on accumulating biological data, many computational models for potential miRNA-disease associations inference have been developed, which saves time and expenditure on experimental studies, making great contributions to researching molecular mechanism of human diseases and developing new drugs for disease treatment. In this paper, we proposed a novel computational method named Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA), which innovatively built a computational framework integrating ensemble learning and dimensionality reduction. For each miRNA-disease pair, the feature vector was extracted by calculating the statistical measures, graph theoretical measures, and matrix factorization results for the miRNA and disease, respectively. Then multiple base learnings were built to yield many decision trees (DTs) based on random selection of negative samples and miRNA/disease features. Particularly, Principal Components Analysis was applied to each base learning to reduce feature dimensionality and hence remove the noise or redundancy. Average strategy was adopted for these DTs to get final association scores between miRNAs and diseases. In model performance evaluation, EDTMDA showed AUC of 0.9309 in global leave-one-out cross validation (LOOCV) and AUC of 0.8524 in local LOOCV. Additionally, AUC of 0.9192+/-0.0009 in 5-fold cross validation proved the model's reliability and stability. Furthermore, three types of case studies for four human diseases were implemented. As a result, 94% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 96% (Breast Neoplasms) and 88% (Carcinoma Hepatocellular) of top 50 predicted miRNAs were confirmed by experimental evidences in literature.MiRNAs are known as gene regulators and play critical roles in various biological processes. Many associations between miRNAs and human diseases have been identified, which promotes the understanding towards the molecular mechanisms of diseases and contributes to prevention and treatment of diseases. Computational methods of predicting potential miRNA-disease associations make the discovery more efficient and experiments more productive. We developed EDTMDA by constructing a computational framework integrating ensemble learning and dimensionality reduction. We performed global LOOCV, local LOOCV and 5-fold cross validation to evaluate performance of EDTMDA, which outperformed many classic methods. In addition, we carried out three types of case studies on important diseases, which were used to evaluate performance of model based on known associations in HMDD v2.0, for new diseases without known associations and based on known associations in HMDD v1.0. As a result, most predicted miRNAs in top 50 predictions were confirmed by experimental evidences in literature. So, we believe that EDTMDA can make reliable predictions and guide experiments to uncover more miRNA-disease associations.

Benchmark of Computational Methods for Predicting Microrna-Disease Associations

BHCMDA: A New Biased Heat Conduction Based Method for Potential MiRNA-Disease Association Prediction

Identification of miRNA–disease associations via multiple information integration with Bayesian ranking

Ensemble of decision tree reveals potential miRNA-disease associations

EMCMDA: predicting miRNA-disease associations via efficient matrix completion

MIMRDA: A Method Incorporating the miRNA and mRNA Expression Profiles for Predicting miRNA-Disease Associations to Identify Key miRNAs (microRNAs)

A Computational Model to Predict the Causal Mirnas for Diseases

Improved Prediction of miRNA-Disease Associations Based on Matrix Completion with Network Regularization

Identifying Potential Mirnas–disease Associations with Probability Matrix Factorization

Improved Inductive Matrix Completion Method for Predicting MicroRNA-Disease Associations

MDSCMF: Matrix Decomposition and Similarity-Constrained Matrix Factorization for miRNA–Disease Association Prediction

Identifying and Exploiting Potential Mirna-Disease Associations with Neighborhood Regularized Logistic Matrix Factorization

PRMDA: personalized recommendation-based MiRNA-disease association prediction

DNRLMF-MDA: Predicting Microrna-Disease Associations Based on Similarities of Micrornas and Diseases.

WBSMDA: Within and Between Score for MiRNA-Disease Association Prediction.

Deep-belief network for predicting potential miRNA-disease associations

BNPMDA: Bipartite Network Projection for MiRNA–Disease Association prediction

Predicting miRNA-disease association based on inductive matrix completion.

MDHGI: Matrix Decomposition and Heterogeneous Graph Inference for Mirna-Disease Association Prediction

A Survey of Deep Learning for Detecting miRNA- Disease Associations: Databases, Computational Methods, Challenges, and Future Directions

Benchmarking of computational methods for predicting circRNA-disease associations