Abstract: The widespread use of high-throughput sequencing technologies has demonstrated that eukaryotic transcriptomes consist primarily (>98%) of non-coding RNA (ncRNA) transcripts that are significantly more diverse than their protein-coding counterparts. ncRNAs are typically divided into two categories based on their length. (1) ncRNAs shorter than 200 nucleotides (nt) are referred to as small non-coding RNAs (sncRNAs) and include microRNAs (miRNAs), piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), transfer RNAs (tRNAs), etc., and the majority of these are thought to function primarily in controlling gene expression. That said, the full repertoire of sncRNAs remains poorly defined, as evidenced by two entirely new classes of sncRNAs only recently being reported, i.e., snoRNA-derived RNAs (sdRNAs) and tRNA-derived fragments (tRFs). (2) ncRNAs longer than 200 nt are known as long ncRNAs (lncRNAs). lncRNAs represent the second largest transcriptional output of the cell (behind only ribosomal RNAs), and although functional roles for several lncRNAs have been reported, most lncRNAs remain largely uncharacterized due to a lack of predictive tools aimed at guiding functional characterization. Importantly, whereas high-throughput transcriptome sequencing is now affordable for most active research programs, the tools needed to interpret the resulting data typically require significant computational expertise and resources, markedly hindering widespread use of these datasets. In light of this, we have developed a powerful new ncRNA transcriptomics suite, SALTS, which is highly accurate, markedly efficient, and extremely user-friendly. 
SALTS stands for SURFR (sncRNA) And LAGOOn (lncRNA) Transcriptomics Suite and offers platforms for comprehensive sncRNA and lncRNA profiling and discovery, ncRNA functional prediction, and the identification of significant differential expression among datasets. Notably, SALTS is accessed through an intuitive Web-based interface, can analyze either user-uploaded standard next-generation sequencing (NGS) output files (e.g., FASTQ) or existing NCBI Sequence Read Archive (SRA) data, and requires no dataset pre-processing or knowledge of library adapters/oligonucleotides. SALTS constitutes the first publicly available, Web-based, comprehensive ncRNA transcriptomic NGS analysis platform designed specifically for users with no computational background, providing a much-needed, powerful new resource capable of enabling more widespread ncRNA transcriptomic analyses. The SALTS WebServer is freely available online at http://salts.soc.southalabama.edu.
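
The length-based division described above can be illustrated with a minimal sketch. This is not part of SALTS and does not reflect how SURFR or LAGOOn process reads; the function name `classify_ncrna` and the constant `LENGTH_CUTOFF_NT` are hypothetical names chosen for illustration. The abstract defines sncRNAs as shorter than 200 nt and lncRNAs as longer than 200 nt and does not say how a transcript of exactly 200 nt is treated, so the sketch assumes such a transcript is left unclassified.

```python
from typing import Optional

# Length threshold (in nucleotides) stated in the abstract.
LENGTH_CUTOFF_NT = 200


def classify_ncrna(sequence: str) -> Optional[str]:
    """Classify a non-coding transcript by length alone.

    Returns "sncRNA" for sequences shorter than 200 nt and "lncRNA" for
    sequences longer than 200 nt. A sequence of exactly 200 nt returns
    None, because the source text does not assign that case (assumption).
    """
    length = len(sequence)
    if length < LENGTH_CUTOFF_NT:
        return "sncRNA"
    if length > LENGTH_CUTOFF_NT:
        return "lncRNA"
    return None


if __name__ == "__main__":
    # Placeholder sequences; only their lengths matter here.
    print(classify_ncrna("A" * 22))    # typical miRNA-sized transcript
    print(classify_ncrna("A" * 1500))  # typical lncRNA-sized transcript
```

Length is only the first criterion: assigning a short transcript to a specific class such as miRNA, sdRNA, or tRF requires annotation or alignment information that this sketch does not attempt to model.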

A technology-agnostic long-read analysis pipeline for transcriptome discovery and quantification

Illuminating the dark side of the human transcriptome with long read transcript sequencing

Transcriptome variation in human tissues revealed by long-read sequencing

High-Resolution Transcriptome Analysis with Long-Read RNA Sequencing

Enhancing novel isoform discovery: leveraging nanopore long-read sequencing and machine learning approaches

Comprehensive characterization of single-cell full-length isoforms in human and mouse with long-read sequencing

Long-read sequencing transcriptome quantification with lr-kallisto

Systematic assessment of long-read RNA-seq methods for transcript identification and quantification

L-RAPiT: A Cloud-Based Computing Pipeline for the Analysis of Long-Read RNA Sequencing Data

A long-read sequencing strategy with overlapping linkers on adjacent fragments (OLAF-Seq) for targeted resequencing and enrichment

Single-Cell Omics for Transcriptome CHaracterization (SCOTCH): isoform-level characterization of gene expression through long-read single-cell RNA sequencing

Fine mapping of RNA isoform diversity using an innovative targeted long-read RNA sequencing protocol with novel dedicated bioinformatics pipeline

SQANTI: extensive characterization of long-read transcript sequences for quality control in full-length transcriptome identification and quantification

TrAnnoScope: A Modular Snakemake Pipeline for Full-Length Transcriptome Analysis and Functional Annotation

Systematic evaluation of single-cell RNA-seq analyses performance based on long-read sequencing platforms

Contrasting and combining transcriptome complexity captured by short and long RNA sequencing reads

A mapping-free NLP-based technique for sequence search in Nanopore long-reads

Nanopore native RNA sequencing of a human poly(A) transcriptome

SALTS – SURFR (sncRNA) And LAGOOn (lncRNA) Transcriptomics Suite

FLAME: long-read bioinformatics tool for comprehensive spliceome characterization

UTAP: User-friendly Transcriptome Analysis Pipeline