Abstract

Background: MicroRNAs (miRs) are small noncoding RNAs that bind to complementary or partially complementary sites in the 3' untranslated regions of target genes to regulate protein production of the target transcript and to induce mRNA degradation or mRNA cleavage. The ability to perform accurate, high-throughput identification of physiologically active miR targets would enable functional characterization of individual miRs. Current target prediction methods fall into two groups: traditional approaches, which are based on specific base-pairing rules in the miR's seed region and on cross-species conservation of the target site, and machine learning (ML) methods, which explore patterns that contrast true and false miR-mRNA duplexes. However, research on the traditional methods shows that some conserved seed region matches are false positives and that some experimentally validated target sites are not conserved.

Results: We present HuMiTar, a computational method for identifying common targets of miRs, which is based on a scoring function that considers base-pairing at both seed and non-seed positions of human miR-mRNA duplexes. Our design shows that certain non-seed miR nucleotides, such as 14, 18, 13, 11, and 17, are characterized by a strong bias towards formation of Watson-Crick pairing. We contrasted HuMiTar with several representative competing methods on two sets of human miR targets and a set of ten glioblastoma oncogenes. Comparison with the two best performing traditional methods, PicTar and TargetScanS, and with NBmiRTar, a representative ML method that considers the non-seed positions, shows that HuMiTar predictions include the majority of the predictions of the other three methods. At the same time, the proposed method finds more true positive targets, at the cost of an increased number of predictions. Genome-wide predictions show that the proposed method is characterized by a signal-to-noise ratio of 1.99 and by computational complexity that is linear in the length of the mRNA sequence. The ROC analysis shows that HuMiTar obtains results comparable with PicTar, which are characterized by high true positive rates coupled with moderate false positive rates.

Conclusion: The proposed HuMiTar method constitutes a step towards providing an efficient model for studying translational gene regulation by miRs.
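The idea of a scoring function that weighs base-pairing at both seed and non-seed positions can be illustrated with a minimal Python sketch. It is not the HuMiTar scoring function: the numeric weights, the pair scores, the seed definition as positions 2-8, the ungapped alignment, and all function names are assumptions chosen for illustration. Only the list of non-seed positions (11, 13, 14, 17, 18) is taken from the abstract.

```python
# Illustrative sketch only: weights and pair scores are hypothetical,
# not the values used by HuMiTar.
WC_PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}
WOBBLE_PAIRS = {("G", "U"), ("U", "G")}

SEED_POSITIONS = range(2, 9)           # assumed seed: miR positions 2-8 (1-based)
BIASED_NONSEED = {11, 13, 14, 17, 18}  # non-seed positions named in the abstract


def position_weight(pos: int) -> float:
    """Hypothetical weight for a 1-based miR position."""
    if pos in SEED_POSITIONS:
        return 3.0
    if pos in BIASED_NONSEED:
        return 1.5
    return 1.0


def pair_score(mir_nt: str, target_nt: str) -> float:
    """Hypothetical score: Watson-Crick > G:U wobble > mismatch."""
    if (mir_nt, target_nt) in WC_PAIRS:
        return 1.0
    if (mir_nt, target_nt) in WOBBLE_PAIRS:
        return 0.5
    return -1.0


def duplex_score(mir: str, site: str) -> float:
    """Score an ungapped duplex.

    mir  : miR sequence written 5'->3'
    site : target site written 3'->5', so site[i] faces mir[i]
    """
    mir = mir.upper().replace("T", "U")
    site = site.upper().replace("T", "U")
    return sum(
        position_weight(pos) * pair_score(m, s)
        for pos, (m, s) in enumerate(zip(mir, site), start=1)
    )


def complement(seq: str) -> str:
    """Base-by-base RNA complement (no reversal)."""
    return seq.translate(str.maketrans("ACGU", "UGCA"))


if __name__ == "__main__":
    mir = "UGAGGUAGUAGGUUGUAUAGUU"  # example 22-nt sequence
    perfect = complement(mir)
    seed_mismatch = perfect[:1] + "A" + perfect[2:]    # break pairing at position 2
    tail_mismatch = perfect[:19] + "A" + perfect[20:]  # break pairing at position 20
    for label, site in [("perfect", perfect),
                        ("seed mismatch", seed_mismatch),
                        ("tail mismatch", tail_mismatch)]:
        print(label, duplex_score(mir, site))
```

With these assumed weights, a single mismatch in the seed lowers the score more than a mismatch at an unweighted 3' position, which is the qualitative behavior a position-aware scoring function is meant to capture.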

HuMiTar: A sequence-based method for prediction of human microRNA targets
