Abstract:BACKGROUND:Bacterial non-coding small RNAs (sRNAs) have attracted considerable attention due to their ubiquitous nature and contribution to numerous cellular processes including survival, adaptation and pathogenesis. Existing computational approaches for identifying bacterial sRNAs demonstrate varying levels of success and there remains considerable room for improvement.METHODOLOGY/PRINCIPAL FINDINGS:Here we have proposed a transcriptional signal-based computational method to identify intergenic sRNA transcriptional units (TUs) in completely sequenced bacterial genomes. Our sRNAscanner tool uses position weight matrices derived from experimentally defined E. coli K-12 MG1655 sRNA promoter and rho-independent terminator signals to identify intergenic sRNA TUs through sliding window based genome scans. Analysis of genomes representative of twelve species suggested that sRNAscanner demonstrated equivalent sensitivity to sRNAPredict2, the best performing bioinformatics tool available presently. However, each algorithm yielded substantial numbers of known and uncharacterized hits that were unique to one or the other tool only. sRNAscanner identified 118 novel putative intergenic sRNA genes in Salmonella enterica Typhimurium LT2, none of which were flagged by sRNAPredict2. Candidate sRNA locations were compared with available deep sequencing libraries derived from Hfq-co-immunoprecipitated RNA purified from a second Typhimurium strain (Sittka et al. (2008) PLoS Genetics 4: e1000163). Sixteen potential novel sRNAs computationally predicted and detected in deep sequencing libraries were selected for experimental validation by Northern analysis using total RNA isolated from bacteria grown under eleven different growth conditions. RNA bands of expected sizes were detected in Northern blots for six of the examined candidates. Furthermore, the 5'-ends of these six Northern-supported sRNA candidates were successfully mapped using 5'-RACE analysis.CONCLUSIONS/SIGNIFICANCE:We have developed, computationally examined and experimentally validated the sRNAscanner algorithm. Data derived from this study has successfully identified six novel S. Typhimurium sRNA genes. In addition, the computational specificity analysis we have undertaken suggests that approximately 40% of sRNAscanner hits with high cumulative sum of scores represent genuine, undiscovered sRNA genes. Collectively, these data strongly support the utility of sRNAscanner and offer a glimpse of its potential to reveal large numbers of sRNA genes that have to date defied identification. sRNAscanner is available from: http://bicmku.in:8081/sRNAscanner or http://cluster.physics.iisc.ernet.in/sRNAscanner/.

31 Discovery of Novel Ncrna by Scanning Multiple Genome Alignments

Discovery of Novel Ncrna Sequences in Multiple Genome Alignments on the Basis of Conserved and Stable Secondary Structures

RNAdetect: efficient computational detection of novel non-coding RNAs

An algorithm for rapid noncoding RNA sequence-structure alignment

DRAGoM: Classification and Quantification of Noncoding RNA in Metagenomic Data.

Versatile Interactions and Bioinformatics Analysis of Noncoding RNAs

Uncovering DCL1-dependent Small RNA Loci on Plant Genomes: a Structure-Based Approach.

The Detection And Assessment Of Possible Rna Secondary Structure Using Multiple Sequence Alignment

DecoyFinder: Identification of Contaminants in Sets of Homologous RNA Sequences

A Machine Learning Approach for Accurate Annotation of Noncoding RNAs.

Computational Approaches in Detecting Non- Coding RNA.

A Common Set of Distinct Features That Characterize Noncoding Rnas Across Multiple Species

A novel ncRNA gene finding model based on pair-wise alignment

Rsite2: an efficient computational method to predict the functional sites of noncoding RNAs

Sc-ncDNAPred: A Sequence-Based Predictor for Identifying Non-coding DNA in Saccharomyces Cerevisiae

Srnascanner: a Computational Tool for Intergenic Small RNA Detection in Bacterial Genomes.

Identification of multiple RNAs using feature fusion

Inferring Noncoding RNA Families and Classes by Means of Genome-Scale Structure-Based Clustering

Rsite: a Computational Method to Identify the Functional Sites of Noncoding RNAs

ncRNAInter: a novel strategy based on graph neural network to discover interactions between lncRNA and miRNA

Comparative analysis of RNA-Seq alignment algorithms and the RNA-Seq unified mapper (RUM)