What can we learn when fitting a simple telegraph model to a complex gene expression model?
Feng Jiao, Jing Li, Ting Liu, Yifeng Zhu, Wenhao Che, Leonidas Bleris, Chen Jia
DOI: https://doi.org/10.1371/journal.pcbi.1012118
2024-05-15
PLoS Computational Biology
Abstract: In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model, which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability of a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data for E. coli and mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that a faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise.
Author summary: Over the past decade, significant progress has been made in the theory and experiments of single-cell stochastic gene expression dynamics. The most well-studied and widely used stochastic gene expression model is the two-state telegraph model. However, the conventional telegraph model is too simple and limited in its predictive power because it lacks a description of some important biological mechanisms, such as feedback regulation, multiple gene activation steps, and multiple gene activation pathways. This raises an important question: what can we learn when fitting a complex gene expression model to a simple telegraph model? In this paper, we investigate four complex gene expression models by fitting their steady-state mRNA or protein number distributions to the telegraph model and then obtain estimates of the effective parameters. We show that while the estimated values of the parameters in the "artificial" telegraph model are not always accurate, they are still sometimes reliable and can also reveal important dynamical properties of the complex models, such as the ability of a complex model to produce bimodality. Moreover, we provide an effective method of distinguishing the complex models from the telegraph model by using additional information such as gene expression data at multiple time points. Finally, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanism of a complex model by using measurements under varying experimental conditions. The effectiveness of these methods is confirmed by analysis of single-cell gene expression data for E. coli and mammalian cells.
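To make the telegraph model described above concrete, the following is a minimal sketch (not the authors' code) of how its steady-state gene product distribution could be computed numerically. It assumes a gene that switches on at rate `k_on` and off at rate `k_off`, synthesizes product at rate `rho` while active, and loses product at rate `d` per molecule. These parameter names, the function name `telegraph_steady_state`, and the truncation level `n_max` are illustrative assumptions and do not come from the paper. The stationary distribution is obtained by solving the master equation on a truncated state space.

```python
import numpy as np


def telegraph_steady_state(k_on, k_off, rho, d, n_max):
    """Steady-state copy-number distribution of a two-state telegraph model.

    Hypothetical helper: the state is (g, n), with gene state g in {0, 1}
    and copy number n in 0..n_max (truncation is an approximation).
    """
    size = 2 * (n_max + 1)
    Q = np.zeros((size, size))  # generator matrix, rows = source states

    def idx(g, n):
        return g * (n_max + 1) + n

    for n in range(n_max + 1):
        Q[idx(0, n), idx(1, n)] += k_on       # gene activation
        Q[idx(1, n), idx(0, n)] += k_off      # gene inactivation
        if n < n_max:
            Q[idx(1, n), idx(1, n + 1)] += rho  # synthesis (active state only)
        if n > 0:
            for g in (0, 1):
                Q[idx(g, n), idx(g, n - 1)] += d * n  # first-order decay

    # Diagonal entries make every row sum to zero.
    Q -= np.diag(Q.sum(axis=1))

    # Solve pi Q = 0 with sum(pi) = 1 by replacing one balance equation
    # with the normalization condition.
    A = Q.T.copy()
    A[-1, :] = 1.0
    b = np.zeros(size)
    b[-1] = 1.0
    pi = np.linalg.solve(A, b)

    # Marginalize over the gene state.
    return pi[: n_max + 1] + pi[n_max + 1:]


if __name__ == "__main__":
    p = telegraph_steady_state(k_on=0.5, k_off=1.0, rho=20.0, d=1.0, n_max=100)
    n = np.arange(p.size)
    print("mean copy number:", float((n * p).sum()))
```

For a sufficiently large `n_max`, the mean of the returned distribution should agree with the standard telegraph-model result `rho / d * k_on / (k_on + k_off)`. A distribution computed this way could serve as the likelihood when fitting effective parameters to single-cell count data, although the fitting procedure used in the paper is not specified here.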
biochemical research methods, mathematical & computational biology