Abstract:BACKGROUND:The advent of the NGS technologies has permitted profiling of whole-genome transcriptomes (i.e., RNA-Seq) at unprecedented speed and very low cost. RNA-Seq provides a far more precise measurement of transcript levels and their isoforms compared to other methods such as microarrays. A fundamental goal of RNA-Seq is to better identify expression changes between different biological or disease conditions. However, existing methods for detecting differential expression from RNA-Seq count data have not been comprehensively evaluated in large-scale RNA-Seq datasets. Many of them suffer from inflation of type I error and failure in controlling false discovery rate especially in the presence of abnormal high sequence read counts in RNA-Seq experiments.RESULTS:To address these challenges, we propose a powerful and robust tool, termed deGPS, for detecting differential expression in RNA-Seq data. This framework contains new normalization methods based on generalized Poisson distribution modeling sequence count data, followed by permutation-based differential expression tests. We systematically evaluated our new tool in simulated datasets from several large-scale TCGA RNA-Seq projects, unbiased benchmark data from compcodeR package, and real RNA-Seq data from the development transcriptome of Drosophila. deGPS can precisely control type I error and false discovery rate for the detection of differential expression and is robust in the presence of abnormal high sequence read counts in RNA-Seq experiments.CONCLUSIONS:Software implementing our deGPS was released within an R package with parallel computations ( https://github.com/LL-LAB-MCW/deGPS ). deGPS is a powerful and robust tool for data normalization and detecting different expression in RNA-Seq experiments. Beyond RNA-Seq, deGPS has the potential to significantly enhance future data analysis efforts from many other high-throughput platforms such as ChIP-Seq, MBD-Seq and RIP-Seq.

Unit-Free and Robust Detection of Differential Expression from RNA-Seq Data

A Unified Model for Joint Normalization and Differential Gene Expression Detection in RNA-Seq Data.

A Unified Model for Differential Expression Analysis of RNA-seq Data Via L1-Penalized Linear Regression

Joint Between-Sample Normalization and Differential Expression Detection Through ℓ0-Regularized Regression

A two-step strategy for detecting differential gene expression in cDNA microarray data

A Novel Approach to Detect Differentially Expressed Genes from Count-Based Digital Databases by Normalizing with Housekeeping Genes.

Degps is a Powerful Tool for Detecting Differential Expression in RNA-sequencing Studies

Rseqdiff: Detecting Differential Isoform Expression from RNA-Seq Data Using Hierarchical Likelihood Ratio Test.

Differential expression analysis for paired RNA-seq data

A Two-Part Mixed Model for Differential Expression Analysis in Single-Cell High-Throughput Gene Expression Data.

Identifying stably expressed genes from multiple RNA-Seq data sets

Differential expression analysis of multifactor RNA-Seq experiments with respect to biological variation

Robustly detecting differential expression in RNA sequencing data using observation weights

A balanced method detecting differentially expressed genes for RNA-sequencing data

A comparison of methods for differential expression analysis of RNA-seq data

Differential analyses for RNA-seq: transcript-level estimates improve gene-level inferences

Modeling expression ranks for noise-tolerant differential expression analysis of scRNA-seq data

UMI-count modeling and differential expression analysis for single-cell RNA sequencing

A statistical normalization method and differential expression analysis for RNA-seq data between different species

Statistical Modeling of RNA-Seq Data

Robust estimation of isoform expression with RNA-Seq data