1
|
Sukys A, Grima R. Cell-cycle dependence of bursty gene expression: insights from fitting mechanistic models to single-cell RNA-seq data. Nucleic Acids Res 2025; 53:gkaf295. [PMID: 40240003 PMCID: PMC12000877 DOI: 10.1093/nar/gkaf295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/22/2025] [Accepted: 03/28/2025] [Indexed: 04/18/2025] Open
Abstract
Bursty gene expression is characterized by two intuitive parameters, burst frequency and burst size, the cell-cycle dependence of which has not been extensively profiled at the transcriptome level. In this study, we estimate the burst parameters per allele in the G1 and G2/M cell-cycle phases for thousands of mouse genes by fitting mechanistic models of gene expression to messenger RNA count data, obtained by sequencing of single cells whose cell-cycle position has been inferred using a deep-learning method. We find that upon DNA replication, the median burst frequency approximately halves, while the burst size remains mostly unchanged. Genome-wide distributions of the burst parameter ratios between the G2/M and G1 phases are broad, indicating substantial heterogeneity in transcriptional regulation. We also observe a significant negative correlation between the burst frequency and size ratios, suggesting that regulatory processes do not independently control the burst parameters. We show that to accurately estimate the burst parameter ratios, mechanistic models must explicitly account for gene copy number variation and extrinsic noise due to the coupling of transcription to cell age across the cell cycle, but corrections for technical noise due to imperfect capture of RNA molecules in sequencing experiments are less critical.
Collapse
Affiliation(s)
- Augustinas Sukys
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
- The Alan Turing Institute, London NW1 2DB, United Kingdom
- School of BioSciences, University of Melbourne, Parkville, Victoria 3052, Australia
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| |
Collapse
|
2
|
Miles CE. Incorporating spatial diffusion into models of bursty stochastic transcription. J R Soc Interface 2025; 22:20240739. [PMID: 40199347 PMCID: PMC11978452 DOI: 10.1098/rsif.2024.0739] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2024] [Revised: 12/21/2024] [Accepted: 01/16/2025] [Indexed: 04/10/2025] Open
Abstract
The dynamics of gene expression are stochastic and spatial at the molecular scale, with messenger RNA (mRNA) transcribed at specific nuclear locations and then transported to the nuclear boundary for export. Consequently, the spatial distributions of these molecules encode their underlying dynamics. While mechanistic models for molecular counts have revealed numerous insights into gene expression, they have largely neglected now-available subcellular spatial resolution down to individual molecules. Owing to the technical challenges inherent in spatial stochastic processes, tools for studying these subcellular spatial patterns are still limited. Here, we introduce a spatial stochastic model of nuclear mRNA with two-state (telegraph) transcriptional dynamics. Observations of the model can be concisely described as following a spatial Cox process driven by a stochastically switching partial differential equation. We derive analytical solutions for spatial and demographic moments and validate them with simulations. We show that the distribution of mRNA counts can be accurately approximated by a Poisson-beta distribution with tractable parameters, even with complex spatial dynamics. This observation allows for efficient parameter inference demonstrated on synthetic data. Altogether, our work adds progress towards a new frontier of subcellular spatial resolution in inferring the dynamics of gene expression from static snapshot data.
Collapse
Affiliation(s)
- Christopher E. Miles
- Department of Mathematics, Center for Complex Biological Systems, University of California, Irvine, CA, USA
| |
Collapse
|
3
|
Jin G, Porello EAL, Zhang J, Lim B. Heterogeneous Sox2 transcriptional dynamics mediate pluripotency maintenance in mESCs in response to LIF signaling perturbations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.17.643751. [PMID: 40166162 PMCID: PMC11957043 DOI: 10.1101/2025.03.17.643751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
The LIF signaling pathway and its regulation of internal factors like Sox2 is crucial for maintaining self-renewal and pluripotency in mESCs. However, the direct impact of LIF signaling on Sox2 transcriptional dynamics at the single-cell level remains elusive. Here, we employ PP7/PCP-mediated live imaging to analyze the transcriptional dynamics of Sox2 under perturbation of the LIF signaling pathway at single-cell resolution. Removal of the LIF ligand or addition of a JAK inhibitor heterogeneously affects the cell population, reducing the number of Sox2-active cells, rather than completely abolishing Sox2 expression. Moreover, Sox2-active cells under LIF perturbation exhibit significant reductions in mRNA production per cell. This reduction is characterized by decreased size and frequency of transcriptional bursting, resulting in shorter duration of Sox2 activity. Notably, cells with reduced or absent Sox2 expression demonstrate a significant loss in pluripotency, indicating that a reduction in Sox2 transcription (rather than a complete loss) is sufficient to trigger the transition from embryonic to an early differentiated state. In LIF-perturbed cells with Sox2 expression reduced to about 50% of non-perturbed levels, we observe a binary behavior, with cells either retaining or losing pluripotency-associated traits. Lastly, we find Sox2 expression is transcriptionally inherited across cell cycles, with Sox2-active mother cells more likely to reactivate Sox2 after mitosis compared to Sox2-inactive cells. This robust transcriptional memory is observed independent of LIF signaling perturbation. Our findings provide new insights into the transcriptional regulation of Sox2, advancing our understanding of the quantitative thresholds of gene expression required for pluripotency maintenance and highlighting the power of single-cell approaches to unravel dynamic regulatory mechanisms.
Collapse
|
4
|
Bokes P, Singh A. Optimisation of gene expression noise for cellular persistence against lethal events. J Theor Biol 2025; 598:111996. [PMID: 39603338 DOI: 10.1016/j.jtbi.2024.111996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2024] [Revised: 10/02/2024] [Accepted: 11/09/2024] [Indexed: 11/29/2024]
Abstract
Bacterial cell persistence, crucial for survival under adverse conditions like antibiotic exposure, is intrinsically linked to stochastic fluctuations in gene expression. Certain genes, while inhibiting growth under normal circumstances, confer tolerance to antibiotics at elevated expression levels. The occurrence of antibiotic events lead to instantaneous cellular responses with varied survival probabilities correlated with gene expression levels. Notably, cells with lower protein concentrations face higher mortality rates. This study aims to elucidate an optimal strategy for protein expression conducive to cellular survival. Through comprehensive mathematical analysis, we determine the optimal burst size and frequency that maximise cell proliferation. Furthermore, we explore how the optimal expression distribution changes as the cost of protein expression to growth escalates. Our model reveals a hysteresis phenomenon, characterised by discontinuous transitions between deterministic and stochastic optima. Intriguingly, stochastic optima possess a noise floor, representing the minimal level of fluctuations essential for optimal cellular resilience.
Collapse
Affiliation(s)
- Pavol Bokes
- Department of Applied Mathematics and Statistics, Comenius University, Bratislava 84248, Slovakia.
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA.
| |
Collapse
|
5
|
Doe C, Brown D, Li H. Dynamics of two feed forward genetic motifs in the presence of molecular noise. Biosystems 2024; 246:105352. [PMID: 39433119 DOI: 10.1016/j.biosystems.2024.105352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2024] [Revised: 09/24/2024] [Accepted: 10/11/2024] [Indexed: 10/23/2024]
Abstract
Understanding the function of common motifs in gene regulatory networks is an important goal of systems biology. Feed forward loops (FFLs) are an example of such a motif. In FFLs, a gene (X) regulates another gene (Z) both directly and via an intermediary gene (Y). Previous theoretical studies have suggested several possible functions for FFLs, based on their transient responses to changes in input signals (using deterministic models) and their fluctuations around steady state (using stochastic models). In this paper we study stochastic models of the two most common FFLs, "coherent type 1" and "incoherent type 1". We incorporate molecular noise by treating DNA binding, transcription, translation, and decay as stochastic processes. By comparing the dynamics of these loops with models of alternative networks (in which X does not regulate Y), we explore how FFLs act to process information in the presence of noise. This work highlights the importance of incorporating realistic molecular noise in studying both the transient and steady-state behavior of gene regulatory networks.
Collapse
Affiliation(s)
- Cooper Doe
- Colorado College, United States of America.
| | | | - Hanqing Li
- Colorado College, United States of America
| |
Collapse
|
6
|
Zhang Q, Cao W, Wang J, Yin Y, Sun R, Tian Z, Hu Y, Tan Y, Zhang BG. Transcriptional bursting dynamics in gene expression. Front Genet 2024; 15:1451461. [PMID: 39346775 PMCID: PMC11437526 DOI: 10.3389/fgene.2024.1451461] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Accepted: 08/30/2024] [Indexed: 10/01/2024] Open
Abstract
Gene transcription is a stochastic process that occurs in all organisms. Transcriptional bursting, a critical molecular dynamics mechanism, creates significant heterogeneity in mRNA and protein levels. This heterogeneity drives cellular phenotypic diversity. Currently, the lack of a comprehensive quantitative model limits the research on transcriptional bursting. This review examines various gene expression models and compares their strengths and weaknesses to guide researchers in selecting the most suitable model for their research context. We also provide a detailed summary of the key metrics related to transcriptional bursting. We compared the temporal dynamics of transcriptional bursting across species and the molecular mechanisms influencing these bursts, and highlighted the spatiotemporal patterns of gene expression differences by utilizing metrics such as burst size and burst frequency. We summarized the strategies for modeling gene expression from both biostatistical and biochemical reaction network perspectives. Single-cell sequencing data and integrated multiomics approaches drive our exploration of cutting-edge trends in transcriptional bursting mechanisms. Moreover, we examined classical methods for parameter estimation that help capture dynamic parameters in gene expression data, assessing their merits and limitations to facilitate optimal parameter estimation. Our comprehensive summary and review of the current transcriptional burst dynamics theories provide deeper insights for promoting research on the nature of cell processes, cell fate determination, and cancer diagnosis.
Collapse
Affiliation(s)
- Qiuyu Zhang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Wenjie Cao
- School of Mathematics, Sun Yat-sen University, Guangzhou, China
| | - Jiaqi Wang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yihao Yin
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Rui Sun
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Zunyi Tian
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yuhan Hu
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| | - Yalan Tan
- School of Bioengineering & Health, Wuhan Textile University, Wu Han, China
| | - Ben-Gong Zhang
- Research Center of Nonlinear Sciences, School of Mathematical & Physical Sciences, Wuhan Textile University, Wu Han, China
| |
Collapse
|
7
|
Volteras D, Shahrezaei V, Thomas P. Global transcription regulation revealed from dynamical correlations in time-resolved single-cell RNA sequencing. Cell Syst 2024; 15:694-708.e12. [PMID: 39121860 DOI: 10.1016/j.cels.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 02/29/2024] [Accepted: 07/11/2024] [Indexed: 08/12/2024]
Abstract
Single-cell transcriptomics reveals significant variations in transcriptional activity across cells. Yet, it remains challenging to identify mechanisms of transcription dynamics from static snapshots. It is thus still unknown what drives global transcription dynamics in single cells. We present a stochastic model of gene expression with cell size- and cell cycle-dependent rates in growing and dividing cells that harnesses temporal dimensions of single-cell RNA sequencing through metabolic labeling protocols and cel lcycle reporters. We develop a parallel and highly scalable approximate Bayesian computation method that corrects for technical variation and accurately quantifies absolute burst frequency, burst size, and degradation rate along the cell cycle at a transcriptome-wide scale. Using Bayesian model selection, we reveal scaling between transcription rates and cell size and unveil waves of gene regulation across the cell cycle-dependent transcriptome. Our study shows that stochastic modeling of dynamical correlations identifies global mechanisms of transcription regulation. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Dimitris Volteras
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK
| | - Vahid Shahrezaei
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| | - Philipp Thomas
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
8
|
Wang Q, Lin J. Homeostasis of mRNA concentrations through coupling transcription, export, and degradation. iScience 2024; 27:110531. [PMID: 39175768 PMCID: PMC11338957 DOI: 10.1016/j.isci.2024.110531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 04/16/2024] [Accepted: 07/15/2024] [Indexed: 08/24/2024] Open
Abstract
Many experiments showed that eukaryotic cells maintain a constant mRNA concentration upon various perturbations by actively regulating mRNA production and degradation rates, known as mRNA buffering. However, the underlying mechanism is still unknown. In this work, we unveil a mechanistic model of mRNA buffering: the releasing-shuttling (RS) model. The model incorporates two crucial proteins, X and Y, which play several roles, including transcription, decay, and export factors, in the different stages of mRNA metabolism. The RS model predicts the constant mRNA concentration under genome-wide genetic perturbations and cell volume changes, the slowed-down mRNA degradation after Pol II depletion, and the temporal transcription dynamics after exonuclease depletion, in agreement with multiple experiments. Finally, we present a list of X and Y candidates and propose an experimental method to identify X. Our work uncovers potentially universal pathways coupling transcription, export, and degradation that help cells maintain mRNA homeostasis.
Collapse
Affiliation(s)
- Qirun Wang
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Jie Lin
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| |
Collapse
|
9
|
Ma M, Szavits-Nossan J, Singh A, Grima R. Analysis of a detailed multi-stage model of stochastic gene expression using queueing theory and model reduction. Math Biosci 2024; 373:109204. [PMID: 38710441 PMCID: PMC11536769 DOI: 10.1016/j.mbs.2024.109204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/03/2024] [Accepted: 04/29/2024] [Indexed: 05/08/2024]
Abstract
We introduce a biologically detailed, stochastic model of gene expression describing the multiple rate-limiting steps of transcription, nuclear pre-mRNA processing, nuclear mRNA export, cytoplasmic mRNA degradation and translation of mRNA into protein. The processes in sub-cellular compartments are described by an arbitrary number of processing stages, thus accounting for a significantly finer molecular description of gene expression than conventional models such as the telegraph, two-stage and three-stage models of gene expression. We use two distinct tools, queueing theory and model reduction using the slow-scale linear-noise approximation, to derive exact or approximate analytic expressions for the moments or distributions of nuclear mRNA, cytoplasmic mRNA and protein fluctuations, as well as lower bounds for their Fano factors in steady-state conditions. We use these to study the phase diagram of the stochastic model; in particular we derive parametric conditions determining three types of transitions in the properties of mRNA fluctuations: from sub-Poissonian to super-Poissonian noise, from high noise in the nucleus to high noise in the cytoplasm, and from a monotonic increase to a monotonic decrease of the Fano factor with the number of processing stages. In contrast, protein fluctuations are always super-Poissonian and show weak dependence on the number of mRNA processing stages. Our results delineate the region of parameter space where conventional models give qualitatively incorrect results and provide insight into how the number of processing stages, e.g. the number of rate-limiting steps in initiation, splicing and mRNA degradation, shape stochastic gene expression by modulation of molecular memory.
Collapse
Affiliation(s)
- Muhan Ma
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, UK
| | | | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware, Newark DE 19716, USA
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, UK.
| |
Collapse
|
10
|
Zhang Z, Zabaikina I, Nieto C, Vahdat Z, Bokes P, Singh A. Stochastic Gene Expression in Proliferating Cells: Differing Noise Intensity in Single-Cell and Population Perspectives. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.06.28.601263. [PMID: 38979195 PMCID: PMC11230457 DOI: 10.1101/2024.06.28.601263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
Random fluctuations (noise) in gene expression can be studied from two complementary perspectives: following expression in a single cell over time or comparing expression between cells in a proliferating population at a given time. Here, we systematically investigated scenarios where both perspectives lead to different levels of noise in a given gene product. We first consider a stable protein, whose concentration is diluted by cellular growth, and the protein inhibits growth at high concentrations, establishing a positive feedback loop. For a stochastic model with molecular bursting of gene products, we analytically predict and contrast the steady-state distributions of protein concentration in both frameworks. Although positive feedback amplifies the noise in expression, this amplification is much higher in the population framework compared to following a single cell over time. We also study other processes that lead to different noise levels even in the absence of such dilution-based feedback. When considering randomness in the partitioning of molecules between daughters during mitosis, we find that in the single-cell perspective, the noise in protein concentration is independent of noise in the cell cycle duration. In contrast, partitioning noise is amplified in the population perspective by increasing randomness in cell-cycle time. Overall, our results show that the commonly used single-cell framework that does not account for proliferating cells can, in some cases, underestimate the noise in gene product levels. These results have important implications for studying the inter-cellular variation of different stress-related expression programs across cell types that are known to inhibit cellular growth.
Collapse
Affiliation(s)
- Zhanhao Zhang
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Iryna Zabaikina
- Department of Applied Mathematics and Statistics, Comenius University, Bratislava 84248, Slovakia
| | - César Nieto
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Zahra Vahdat
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| | - Pavol Bokes
- Department of Applied Mathematics and Statistics, Comenius University, Bratislava 84248, Slovakia
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware. Newark, DE 19716, USA
| |
Collapse
|
11
|
Jiao F, Li J, Liu T, Zhu Y, Che W, Bleris L, Jia C. What can we learn when fitting a simple telegraph model to a complex gene expression model? PLoS Comput Biol 2024; 20:e1012118. [PMID: 38743803 PMCID: PMC11125521 DOI: 10.1371/journal.pcbi.1012118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 05/24/2024] [Accepted: 04/27/2024] [Indexed: 05/16/2024] Open
Abstract
In experiments, the distributions of mRNA or protein numbers in single cells are often fitted to the random telegraph model which includes synthesis and decay of mRNA or protein, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by crucial biological mechanisms such as feedback regulation, non-exponential gene inactivation durations, and multiple gene activation pathways. Here we investigate the dynamical properties of four relatively complex gene expression models by fitting their steady-state mRNA or protein number distributions to the simple telegraph model. We show that despite the underlying complex biological mechanisms, the telegraph model with three effective parameters can accurately capture the steady-state gene product distributions, as well as the conditional distributions in the active gene state, of the complex models. Some effective parameters are reliable and can reflect realistic dynamic behaviors of the complex models, while others may deviate significantly from their real values in the complex models. The effective parameters can also be applied to characterize the capability for a complex model to exhibit multimodality. Using additional information such as single-cell data at multiple time points, we provide an effective method of distinguishing the complex models from the telegraph model. Furthermore, using measurements under varying experimental conditions, we show that fitting the mRNA or protein number distributions to the telegraph model may even reveal the underlying gene regulation mechanisms of the complex models. The effectiveness of these methods is confirmed by analysis of single-cell data for E. coli and mammalian cells. All these results are robust with respect to cooperative transcriptional regulation and extrinsic noise. In particular, we find that faster relaxation speed to the steady state results in more precise parameter inference under large extrinsic noise.
Collapse
Affiliation(s)
- Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Jing Li
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Ting Liu
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Yifeng Zhu
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Wenhao Che
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, China
| | - Leonidas Bleris
- Bioengineering Department, The University of Texas at Dallas, Richardson, Texas, United States of America
- Center for Systems Biology, The University of Texas at Dallas, Richardson, Texas, United States of America
- Department of Biological Sciences, The University of Texas at Dallas, Richardson, Texas, United States of America
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing, China
| |
Collapse
|
12
|
Zhang C, Jiao F. Using steady-state formula to estimate time-dependent parameters of stochastic gene transcription models. Biosystems 2024; 236:105128. [PMID: 38280446 DOI: 10.1016/j.biosystems.2024.105128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 01/20/2024] [Accepted: 01/21/2024] [Indexed: 01/29/2024]
Abstract
When studying stochastic gene transcription, it is important to understand how system parameters are temporally modulated in response to varying environments. Experimentally, the dynamic distribution data of RNA copy numbers measured at multiple time points are often fitted to stochastic transcription models to estimate time-dependent parameters. However, current methods require determining which parameters are time-dependent, as well as their analytical formulas, before the optimal fit. In this study, we developed a method to estimate time-dependent parameters in a classical two-state model without prior assumptions regarding the system parameters. At each measured time point, the method fitted the dynamic distribution data using a steady-state distribution formula, in which the estimated constant parameters were approximated as time-dependent parameter values at the measured time point. The accuracy of this method can be guaranteed for RNA molecules with relatively high degradation rates and genes with relatively slow responses to induction. We quantify the accuracy of the method and implemented this method on two sets of dynamic distribution data from prokaryotic and eukaryotic cells, and revealed the temporal modulation of transcription burst size in response to environmental changes.
Collapse
Affiliation(s)
- Congrun Zhang
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China
| | - Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China.
| |
Collapse
|
13
|
Grima R, Esmenjaud PM. Quantifying and correcting bias in transcriptional parameter inference from single-cell data. Biophys J 2024; 123:4-30. [PMID: 37885177 PMCID: PMC10808030 DOI: 10.1016/j.bpj.2023.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/12/2023] [Accepted: 10/19/2023] [Indexed: 10/28/2023] Open
Abstract
The snapshot distribution of mRNA counts per cell can be measured using single-molecule fluorescence in situ hybridization or single-cell RNA sequencing. These distributions are often fit to the steady-state distribution of the two-state telegraph model to estimate the three transcriptional parameters for a gene of interest: mRNA synthesis rate, the switching on rate (the on state being the active transcriptional state), and the switching off rate. This model assumes no extrinsic noise, i.e., parameters do not vary between cells, and thus estimated parameters are to be understood as approximating the average values in a population. The accuracy of this approximation is currently unclear. Here, we develop a theory that explains the size and sign of estimation bias when inferring parameters from single-cell data using the standard telegraph model. We find specific bias signatures depending on the source of extrinsic noise (which parameter is most variable across cells) and the mode of transcriptional activity. If gene expression is not bursty then the population averages of all three parameters are overestimated if extrinsic noise is in the synthesis rate; underestimation occurs if extrinsic noise is in the switching on rate; both underestimation and overestimation can occur if extrinsic noise is in the switching off rate. We find that some estimated parameters tend to infinity as the size of extrinsic noise approaches a critical threshold. In contrast when gene expression is bursty, we find that in all cases the mean burst size (ratio of the synthesis rate to the switching off rate) is overestimated while the mean burst frequency (the switching on rate) is underestimated. We estimate the size of extrinsic noise from the covariance matrix of sequencing data and use this together with our theory to correct published estimates of transcriptional parameters for mammalian genes.
Collapse
Affiliation(s)
- Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | - Pierre-Marie Esmenjaud
- Biology Department, Ecole Polytechnique, Institut Polytechnique de Paris, Palaiseau, France
| |
Collapse
|
14
|
Wang Y, Yu Z, Grima R, Cao Z. Exact solution of a three-stage model of stochastic gene expression including cell-cycle dynamics. J Chem Phys 2023; 159:224102. [PMID: 38063222 DOI: 10.1063/5.0173742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 10/04/2023] [Indexed: 12/18/2023] Open
Abstract
The classical three-stage model of stochastic gene expression predicts the statistics of single cell mRNA and protein number fluctuations as a function of the rates of promoter switching, transcription, translation, degradation and dilution. While this model is easily simulated, its analytical solution remains an unsolved problem. Here we modify this model to explicitly include cell-cycle dynamics and then derive an exact solution for the time-dependent joint distribution of mRNA and protein numbers. We show large differences between this model and the classical model which captures cell-cycle effects implicitly via effective first-order dilution reactions. In particular we find that the Fano factor of protein numbers calculated from a population snapshot measurement are underestimated by the classical model whereas the correlation between mRNA and protein can be either over- or underestimated, depending on the timescales of mRNA degradation and promoter switching relative to the mean cell-cycle duration time.
Collapse
Affiliation(s)
- Yiling Wang
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
| | - Zhenhua Yu
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
| | - Ramon Grima
- School of Biological Sciences, The University of Edinburgh, Max Born Crescent, Edinburgh EH9 3BF, Scotland, United Kingdom
| | - Zhixing Cao
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, Shanghai 200237, China
- Department of Chemical Engineering, Queen's University, Kingston, Ontario K7L 3N6, Canada
| |
Collapse
|
15
|
Ruess J, Ballif G, Aditya C. Stochastic chemical kinetics of cell fate decision systems: From single cells to populations and back. J Chem Phys 2023; 159:184103. [PMID: 37937934 DOI: 10.1063/5.0160529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Accepted: 10/14/2023] [Indexed: 11/09/2023] Open
Abstract
Stochastic chemical kinetics is a widely used formalism for studying stochasticity of chemical reactions inside single cells. Experimental studies of reaction networks are generally performed with cells that are part of a growing population, yet the population context is rarely taken into account when models are developed. Models that neglect the population context lose their validity whenever the studied system influences traits of cells that can be selected in the population, a property that naturally arises in the complex interplay between single-cell and population dynamics of cell fate decision systems. Here, we represent such systems as absorbing continuous-time Markov chains. We show that conditioning on non-absorption allows one to derive a modified master equation that tracks the time evolution of the expected population composition within a growing population. This allows us to derive consistent population dynamics models from a specification of the single-cell process. We use this approach to classify cell fate decision systems into two types that lead to different characteristic phases in emerging population dynamics. Subsequently, we deploy the gained insights to experimentally study a recurrent problem in biology: how to link plasmid copy number fluctuations and plasmid loss events inside single cells to growth of cell populations in dynamically changing environments.
Collapse
Affiliation(s)
- Jakob Ruess
- Inria Saclay, 91120 Palaiseau, France
- Institut Pasteur, Université Paris Cité, 75015 Paris, France
| | | | - Chetan Aditya
- Institut Pasteur, Université Paris Cité, 75015 Paris, France
- Department of Molecular Biology, Princeton University, Princeton, New Jersey 08544, USA
| |
Collapse
|
16
|
Wang X, Li Y, Jia C. Poisson representation: a bridge between discrete and continuous models of stochastic gene regulatory networks. J R Soc Interface 2023; 20:20230467. [PMID: 38016635 PMCID: PMC10684348 DOI: 10.1098/rsif.2023.0467] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 11/08/2023] [Indexed: 11/30/2023] Open
Abstract
Stochastic gene expression dynamics can be modelled either discretely or continuously. Previous studies have shown that the mRNA or protein number distributions of some simple discrete and continuous gene expression models are related by Gardiner's Poisson representation. Here, we systematically investigate the Poisson representation in complex stochastic gene regulatory networks. We show that when the gene of interest is unregulated, the discrete and continuous descriptions of stochastic gene expression are always related by the Poisson representation, no matter how complex the model is. This generalizes the results obtained in Dattani & Barahona (Dattani & Barahona 2017 J. R. Soc. Interface 14, 20160833 (doi:10.1098/rsif.2016.0833)). In addition, using a simple counter-example, we find that the Poisson representation in general fails to link the two descriptions when the gene is regulated. However, for a general stochastic gene regulatory network, we demonstrate that the discrete and continuous models are approximately related by the Poisson representation in the limit of large protein numbers. These theoretical results are further applied to analytically solve many complex gene expression models whose exact distributions are previously unknown.
Collapse
Affiliation(s)
- Xinyu Wang
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| | - Youming Li
- School of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu 611731, People’s Republic of China
| | - Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, People’s Republic of China
| |
Collapse
|
17
|
Gorin G, Yoshida S, Pachter L. Assessing Markovian and Delay Models for Single-Nucleus RNA Sequencing. Bull Math Biol 2023; 85:114. [PMID: 37828255 DOI: 10.1007/s11538-023-01213-9] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Accepted: 09/11/2023] [Indexed: 10/14/2023]
Abstract
The serial nature of reactions involved in the RNA life-cycle motivates the incorporation of delays in models of transcriptional dynamics. The models couple a transcriptional process to a fairly general set of delayed monomolecular reactions with no feedback. We provide numerical strategies for calculating the RNA copy number distributions induced by these models, and solve several systems with splicing, degradation, and catalysis. An analysis of single-cell and single-nucleus RNA sequencing data using these models reveals that the kinetics of nuclear export do not appear to require invocation of a non-Markovian waiting time.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Shawn Yoshida
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125, USA.
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125, USA.
| |
Collapse
|
18
|
Chari T, Gorin G, Pachter L. Biophysically Interpretable Inference of Cell Types from Multimodal Sequencing Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.17.558131. [PMID: 37745403 PMCID: PMC10516047 DOI: 10.1101/2023.09.17.558131] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
Multimodal, single-cell genomics technologies enable simultaneous capture of multiple facets of DNA and RNA processing in the cell. This creates opportunities for transcriptome-wide, mechanistic studies of cellular processing in heterogeneous cell types, with applications ranging from inferring kinetic differences between cells, to the role of stochasticity in driving heterogeneity. However, current methods for determining cell types or 'clusters' present in multimodal data often rely on ad hoc or independent treatment of modalities, and assumptions ignoring inherent properties of the count data. To enable interpretable and consistent cell cluster determination from multimodal data, we present meK-Means (mechanistic K-Means) which integrates modalities and learns underlying, shared biophysical states through a unifying model of transcription. In particular, we demonstrate how meK-Means can be used to cluster cells from unspliced and spliced mRNA count modalities. By utilizing the causal, physical relationships underlying these modalities, we identify shared transcriptional kinetics across cells, which induce the observed gene expression profiles, and provide an alternative definition for 'clusters' through the governing parameters of cellular processes.
Collapse
Affiliation(s)
- Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| | - Gennady Gorin
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California
| |
Collapse
|