QuickMIRSeq: a Pipeline for Quick and Accurate Quantification of Both Known Mirnas and Isomirs by Jointly Processing Multiple Samples from Microrna Sequencing.

Shanrong Zhao,William Gordon,Sarah Du,Chi Zhang,Wen He,Li Xi,Sachin Mathur,Michael Agostino,Theresa Paradis,David von Schack,Michael Vincent,Baohong Zhang
DOI: https://doi.org/10.1186/s12859-017-1601-4
IF: 3.307
2017-01-01
BMC Bioinformatics
Abstract:BACKGROUND:Genome-wide miRNA expression data can be used to study miRNA dysregulation comprehensively. Although many open-source tools for microRNA (miRNA)-seq data analyses are available, challenges remain in accurate miRNA quantification from large-scale miRNA-seq dataset. We implemented a pipeline called QuickMIRSeq for accurate quantification of known miRNAs and miRNA isoforms (isomiRs) from multiple samples simultaneously.RESULTS:QuickMIRSeq considers the unique nature of miRNAs and combines many important features into its implementation. First, it takes advantage of high redundancy of miRNA reads and introduces joint mapping of multiple samples to reduce computational time. Second, it incorporates the strand information in the alignment step for more accurate quantification. Third, reads potentially arising from background noise are filtered out to improve the reliability of miRNA detection. Fourth, sequences aligned to miRNAs with mismatches are remapped to a reference genome to further reduce false positives. Finally, QuickMIRSeq generates a rich set of QC metrics and publication-ready plots.CONCLUSIONS:The rich visualization features implemented allow end users to interactively explore the results and gain more insights into miRNA-seq data analyses. The high degree of automation and interactivity in QuickMIRSeq leads to a substantial reduction in the time and effort required for miRNA-seq data analysis.
What problem does this paper attempt to address?