PRIMITI: a computational approach for accurate prediction of miRNA-target mRNA interaction

Korawich Uthayopas, Alex G. C. de Sá, Azadeh Alavi, Douglas E. V. Pires, David B. Ascher
DOI: https://doi.org/10.1101/2024.04.26.591419
2024-04-29
Abstract: Current medical research has demonstrated the roles of miRNAs in a variety of cellular mechanisms, lending credence to the association between miRNA dysregulation and multiple diseases. Understanding the mechanisms of miRNA is critical for developing effective diagnostic and therapeutic strategies. miRNA-mRNA interactions emerge as the most important mechanism to understand, although their experimental validation remains constrained. Accordingly, several computational models have been developed to predict miRNA-mRNA interactions, but they offer limited predictive capability, poor characterisation of miRNA-mRNA interactions and low usability. To address these drawbacks, we developed PRIMITI, a PRedictive model for the Identification of novel MIRNA-Target mRNA Interactions. PRIMITI is a novel machine learning model that utilises CLIP-seq and expression data to characterise functional target sites in 3’-untranslated regions (3’-UTRs) and predict miRNA-target mRNA repression activity. The model was trained using a reliable negative sample selection approach and the robust extreme gradient boosting (XGBoost) model, coupled with newly introduced features, including sequence and genetic variation information. PRIMITI achieved an area under the receiver operating characteristic (ROC) curve (AUC) of up to 0.96 for predicting functional miRNA-target site binding and 0.96 for predicting miRNA-target mRNA repression activity on cross-validation and an independent blind test. Additionally, the model outperformed state-of-the-art methods in recovering miRNA-target repressions in an unseen microarray dataset and in a collection of validated miRNA-mRNA interactions, highlighting its utility for preliminary screening. PRIMITI is available on a reliable, scalable and user-friendly web server at .
Bioinformatics
What problem does this paper attempt to address?
This paper addresses how to predict interactions between microRNAs (miRNAs) and their target mRNAs more accurately, overcoming the limited predictive capability, poor characterization of interactions, and low usability of existing methods. To this end, the researchers developed PRIMITI, a machine-learning model built on XGBoost that uses CLIP-seq and expression data to characterize functional target sites in 3' untranslated regions (3'-UTRs) and to predict miRNA-target mRNA repression activity.
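
For readers less familiar with the AUC figures quoted in the abstract, the sketch below shows one common way to compute the area under the ROC curve: as the fraction of positive-negative pairs in which the positive sample receives the higher score, with ties counted as half. This is a generic illustration and not the authors' evaluation code; the function name `roc_auc`, the 0/1 label encoding, and the toy scores are all assumptions made up for the example.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U).

    labels: sequence of 0/1 values (1 = positive class; hypothetical encoding)
    scores: sequence of model scores, where higher means "more likely positive"
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")

    # Count positive-negative pairs that are ranked correctly; ties count half.
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy, made-up scores for illustration only (not data from the paper).
labels = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.5, 0.3, 0.1]
auc = roc_auc(labels, scores)
print(f"AUC = {auc:.3f}")  # prints "AUC = 0.889" (8 of 9 pairs ranked correctly)
```

Under this reading, an AUC of 0.96 would mean that a randomly chosen positive example outscores a randomly chosen negative one about 96% of the time. The pairwise loop is quadratic in the number of samples, which is fine for a demonstration; a rank-based computation would be preferable on large datasets.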