Abstract:In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability for a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data for E. coli and mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise. Over the past decade, significant progress has been made in the theory and experiments of single-cell stochastic gene expression dynamics. The most well studied and widely used stochastic gene expression model is the two-state telegraph model. However, the conventional telegraph model is too simple and limited in its predictive power because it lacks a description of some important biological mechanisms, such as feedback regulation, multiple gene activation steps, and multiple gene activation pathways. This raises a important question: what can we learn when fitting a complex gene expression model to a simple telegraph model? In this paper, we investigate four complex gene expression models by fitting their steady-state mRNA or protein number distributions to the telegraph model and then obtain estimates of the effective parameters. We show that while the estimated values of the parameters in the "artificial" telegraph model are not always accurate, they are still sometimes reliable and can also reveal important dynamical properties of the complex models such as the ability for a complex model to produce bimodality. Moreover, we provide an effective method of distinguishing the complex models from the telegraph model by using additional information such as gene expression data at multiple time points. Finally, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanism of a complex model by using measurements under varying experimental conditions. The effectiveness of these methods is well confirmed by analysis of single-cell gene expression data for E. coli and mammalian cells.

Transient power-law behaviour following induction distinguishes between competing models of stochastic gene expression

Parameterizing Cell-to-cell Regulatory Heterogeneities Via Stochastic Transcriptional Profiles

Single-cell Analysis of Transcription Kinetics Across the Cell Cycle

Uncovering the effect of RNA polymerase steric interactions on gene expression noise: analytical distributions of nascent and mature RNA numbers

Interpretable and tractable models of transcriptional noise for the rational design of single-molecule quantification experiments

Analytical Results for A Multistate Gene Model

Using steady-state formula to estimate time-dependent parameters of stochastic gene transcription models

Quantifying how post-transcriptional noise and gene copy number variation bias transcriptional parameter inference from mRNA distributions

Induction level determines signature of gene expression noise in cellular systems

Stochastic Simulation And Statistical Inference Platform For Visualization And Estimation Of Transcriptional Kinetics

Exactly solvable models of stochastic gene expression

What can we learn when fitting a simple telegraph model to a complex gene expression model?

Genome-wide modelling of transcription kinetics reveals patterns of RNA production delays

Deciphering transcriptional dynamics in vivo by counting nascent RNA molecules

Efficient approximations of transcriptional bursting effects on the dynamics of a gene regulatory network

A mechanistic model for the negative binomial distribution of single-cell mRNA counts

Stochastic model of transcription factor-regulated gene expression

Distinguishing the Rates of Gene Activation from Phenotypic Variations

Global transcription regulation revealed from dynamical correlations in time-resolved single-cell RNA-sequencing

Transcriptional pulsing and consequent stochasticity in gene expression