Storm: Incorporating transient stochastic dynamics to infer the RNA velocity with metabolic labeling information

Qiangwei Peng, Xiaojie Qiu, Tiejun Li
DOI: https://doi.org/10.1371/journal.pcbi.1012606
2024-11-24
PLoS Computational Biology
Abstract: Time-resolved scRNA-seq (tscRNA-seq) makes it possible to infer physically meaningful kinetic parameters, e.g., transcription, splicing, and RNA degradation rate constants with correct magnitudes, as well as RNA velocities, by incorporating temporal information. Previous approaches that rely on deterministic dynamics and a steady-state assumption on gene expression states are insufficient for data involving transient processes. We present a dynamical approach, Storm (Stochastic models of RNA metabolic-labeling), that overcomes these limitations by solving stochastic differential equations of gene expression dynamics. The derivation reveals that new mRNA sequencing data obey different types of cell-specific Poisson distributions when biological and cell-specific technical noise are considered jointly. Storm works directly with measured count data and extends the RNA velocity methodology based on metabolic labeling scRNA-seq data to transient stochastic systems. Furthermore, we relax the assumption of constant parameters over genes/cells to obtain gene-cell-specific transcription/splicing rates and gene-specific degradation rates, thus revealing time-dependent and cell-state-specific transcriptional regulation. Storm will facilitate the study of the statistical properties of tscRNA-seq data, eventually advancing our understanding of dynamic transcription regulation during development and disease.

Intricate regulation of RNA biogenesis, such as RNA transcription, splicing, and degradation, plays a critical role in most biological processes. Previous approaches have leveraged a deterministic model of spliced and unspliced RNAs to estimate kinetic parameters and quantify RNA velocity, the rate of change of gene expression states in single cells. However, accurate estimation of meaningful kinetic parameters and RNA velocity is hindered by the biased capture of unspliced RNA and the absence of temporal information in conventional scRNA-seq. Significant advances have been made in measuring RNA kinetics with metabolic-labeling-enabled scRNA-seq; however, the computational tools to analyze such data lag far behind. Prior work on Dynamo provides one of the early solutions for properly modeling RNA metabolic labeling data, but its method still largely relies on a deterministic model that uses only a subset of extreme cells and cannot analyze datasets with significant transient dynamics. To address these challenges, we developed Storm, which explicitly models transient stochastic RNA dynamics. Importantly, Storm models RNA kinetics with stochastic differential equations that explicitly account for biological and cell-specific noise. Storm is generally applicable to many metabolic labeling scRNA-seq datasets, and we demonstrate its excellent performance in fitting data and capturing transient dynamics under various noise models.
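
The claim that new (labeled) mRNA counts follow a cell-specific Poisson distribution can be illustrated with a toy simulation. The sketch below is not the Storm implementation: it assumes the simplest possible model (constant transcription rate `alpha`, constant degradation rate `gamma`, no splicing, labeled RNA starting from zero) and treats technical noise as independent capture of each molecule with a cell-specific probability `p`. All names and parameter values are hypothetical. Under these assumptions the observed count after labeling time `t_label` is Poisson with mean `p * alpha / gamma * (1 - exp(-gamma * t_label))`, so its sample mean and variance should both be close to that value.

```python
import math
import random


def simulate_new_rna(alpha, gamma, t_label, rng):
    """Gillespie simulation of labeled mRNA, starting from zero molecules.

    alpha: transcription rate (molecules per unit time), assumed constant
    gamma: per-molecule degradation rate, assumed constant
    """
    t, n = 0.0, 0
    while True:
        total_rate = alpha + gamma * n
        t += rng.expovariate(total_rate)  # waiting time to the next reaction
        if t > t_label:
            return n
        if rng.random() < alpha / total_rate:
            n += 1  # transcription event
        else:
            n -= 1  # degradation event


def capture(n, p, rng):
    """Technical noise: each molecule is captured independently with probability p."""
    return sum(1 for _ in range(n) if rng.random() < p)


def poisson_mean(alpha, gamma, t_label, p):
    """Theoretical Poisson mean of observed labeled counts under the toy model."""
    return p * alpha / gamma * (1.0 - math.exp(-gamma * t_label))


rng = random.Random(0)
alpha, gamma, t_label, p = 20.0, 0.5, 2.0, 0.3  # hypothetical values

counts = [
    capture(simulate_new_rna(alpha, gamma, t_label, rng), p, rng)
    for _ in range(5000)
]
mean = sum(counts) / len(counts)
var = sum((c - mean) ** 2 for c in counts) / (len(counts) - 1)
expected = poisson_mean(alpha, gamma, t_label, p)

# For a Poisson distribution, mean and variance coincide.
print(f"expected={expected:.3f} mean={mean:.3f} var={var:.3f}")
```

Because the mean depends on `p` and `t_label`, cells with different capture efficiencies or labeling durations follow Poisson distributions with different parameters, which is one way to read "cell-specific" in this simplified setting.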
biochemical research methods, mathematical & computational biology