1
|
Tong CY, Li C, Hurni C, Jacq A, Nie XY, Guy CR, Suh JH, Wong RKW, Merlin C, Naef F, Menet JS, Jiang Y. Single-Cell Multiomic Analysis of Circadian Rhythmicity in Mouse Liver. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.04.03.647044. [PMID: 40291723 PMCID: PMC12026578 DOI: 10.1101/2025.04.03.647044] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]
Abstract
From bacteria to humans, most organisms showcase inherent 24-hour circadian rhythms, best exemplified by the sleep-wake cycle. These rhythms are remarkably widespread, governing hormonal, metabolic, physiological, and behavioral oscillations, and are driven by "molecular clocks" that orchestrate the rhythmic expression of thousands of genes throughout the body. Here, we generate single-cell RNA and ATAC multiomic data to simultaneously characterize gene expression and chromatin accessibility of ~33,000 mouse liver cells across the 24-hour day. Our study yields several key insights, including: (i) detecting circadian rhythmicity in both discretized liver cell types and transient sub-lobule cell states, capturing space-time RNA and ATAC profiles in a cell-type- and cell-state-specific manner; (ii) delving beyond mean cyclic patterns to characterize distributions, accounting for gene expression stochasticity due to transcriptional bursting; (iii) interrogating multimodal circadian rhythmicity, encompassing RNAs, DNA regulatory elements, and transcription factors (TFs), while examining priming and lagging effects across modalities; and (iv) inferring spatiotemporal gene regulatory networks involving target genes, TFs, and cis-regulatory elements that controls circadian rhythmicity and liver physiology. Our findings apply to existing single-cell data of mouse and Drosophila brains and are further validated by time-series single molecule fluorescence in situ hybridization, as well as vast amounts of existing and orthogonal high-throughput data from chromatin immunoprecipitation followed by sequencing, capture Hi-C, and TF knockout experiments. Altogether, our study constructs a comprehensive map of the time-series transcriptomic and epigenomic landscapes that elucidate the function and mechanism of the liver peripheral clocks.
Collapse
|
2
|
Sukys A, Grima R. Cell-cycle dependence of bursty gene expression: insights from fitting mechanistic models to single-cell RNA-seq data. Nucleic Acids Res 2025; 53:gkaf295. [PMID: 40240003 PMCID: PMC12000877 DOI: 10.1093/nar/gkaf295] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/22/2025] [Accepted: 03/28/2025] [Indexed: 04/18/2025] Open
Abstract
Bursty gene expression is characterized by two intuitive parameters, burst frequency and burst size, the cell-cycle dependence of which has not been extensively profiled at the transcriptome level. In this study, we estimate the burst parameters per allele in the G1 and G2/M cell-cycle phases for thousands of mouse genes by fitting mechanistic models of gene expression to messenger RNA count data, obtained by sequencing of single cells whose cell-cycle position has been inferred using a deep-learning method. We find that upon DNA replication, the median burst frequency approximately halves, while the burst size remains mostly unchanged. Genome-wide distributions of the burst parameter ratios between the G2/M and G1 phases are broad, indicating substantial heterogeneity in transcriptional regulation. We also observe a significant negative correlation between the burst frequency and size ratios, suggesting that regulatory processes do not independently control the burst parameters. We show that to accurately estimate the burst parameter ratios, mechanistic models must explicitly account for gene copy number variation and extrinsic noise due to the coupling of transcription to cell age across the cell cycle, but corrections for technical noise due to imperfect capture of RNA molecules in sequencing experiments are less critical.
Collapse
Affiliation(s)
- Augustinas Sukys
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
- The Alan Turing Institute, London NW1 2DB, United Kingdom
- School of BioSciences, University of Melbourne, Parkville, Victoria 3052, Australia
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| |
Collapse
|
3
|
Nicoll AG, Szavits-Nossan J, Evans MR, Grima R. Transient power-law behaviour following induction distinguishes between competing models of stochastic gene expression. Nat Commun 2025; 16:2833. [PMID: 40121209 PMCID: PMC11929856 DOI: 10.1038/s41467-025-58127-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 03/10/2025] [Indexed: 03/25/2025] Open
Abstract
What features of transcription can be learnt by fitting mathematical models of gene expression to mRNA count data? Given a suite of models, fitting to data selects an optimal one, thus identifying a probable transcriptional mechanism. Whilst attractive, the utility of this methodology remains unclear. Here, we sample steady-state, single-cell mRNA count distributions from parameters in the physiological range, and show they cannot be used to confidently estimate the number of inactive gene states, i.e. the number of rate-limiting steps in transcriptional initiation. Distributions from over 99% of the parameter space generated using models with 2, 3, or 4 inactive states can be well fit by one with a single inactive state. However, we show that for many minutes following induction, eukaryotic cells show an increase in the mean mRNA count that obeys a power law whose exponent equals the sum of the number of states visited from the initial inactive to the active state and the number of rate-limiting post-transcriptional processing steps. Our study shows that estimation of the exponent from eukaryotic data can be sufficient to determine a lower bound on the total number of regulatory steps in transcription initiation, splicing, and nuclear export.
Collapse
Affiliation(s)
- Andrew G Nicoll
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Juraj Szavits-Nossan
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Martin R Evans
- School of Physics and Astronomy, University of Edinburgh, Edinburgh, United Kingdom
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| |
Collapse
|
4
|
Qu R, Cheng X, Sefik E, Stanley Iii JS, Landa B, Strino F, Platt S, Garritano J, Odell ID, Coifman R, Flavell RA, Myung P, Kluger Y. Gene trajectory inference for single-cell data by optimal transport metrics. Nat Biotechnol 2025; 43:258-268. [PMID: 38580861 PMCID: PMC11452571 DOI: 10.1038/s41587-024-02186-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Accepted: 02/26/2024] [Indexed: 04/07/2024]
Abstract
Single-cell RNA sequencing has been widely used to investigate cell state transitions and gene dynamics of biological processes. Current strategies to infer the sequential dynamics of genes in a process typically rely on constructing cell pseudotime through cell trajectory inference. However, the presence of concurrent gene processes in the same group of cells and technical noise can obscure the true progression of the processes studied. To address this challenge, we present GeneTrajectory, an approach that identifies trajectories of genes rather than trajectories of cells. Specifically, optimal transport distances are calculated between gene distributions across the cell-cell graph to extract gene programs and define their gene pseudotemporal order. Here we demonstrate that GeneTrajectory accurately extracts progressive gene dynamics in myeloid lineage maturation. Moreover, we show that GeneTrajectory deconvolves key gene programs underlying mouse skin hair follicle dermal condensate differentiation that could not be resolved by cell trajectory approaches. GeneTrajectory facilitates the discovery of gene programs that control the changes and activities of biological processes.
Collapse
Affiliation(s)
- Rihao Qu
- Computational Biology & Bioinformatics Program, Yale University, New Haven, CT, USA
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Department of Immunobiology, Yale University School of Medicine, New Haven, CT, USA
| | - Xiuyuan Cheng
- Department of Mathematics, Duke University, Durham, NC, USA
| | - Esen Sefik
- Department of Immunobiology, Yale University School of Medicine, New Haven, CT, USA
| | | | - Boris Landa
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
| | | | - Sarah Platt
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Department of Dermatology, Yale University School of Medicine, New Haven, CT, USA
| | - James Garritano
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
| | - Ian D Odell
- Department of Immunobiology, Yale University School of Medicine, New Haven, CT, USA
- Department of Dermatology, Yale University School of Medicine, New Haven, CT, USA
| | - Ronald Coifman
- Program in Applied Mathematics, Yale University, New Haven, CT, USA
- Department of Mathematics, Yale University, New Haven, CT, USA
- Department of Electrical Engineering, Yale University, New Haven, CT, USA
| | - Richard A Flavell
- Department of Immunobiology, Yale University School of Medicine, New Haven, CT, USA
- Howard Hughes Medical Institute, Yale University School of Medicine, New Haven, CT, USA
| | - Peggy Myung
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA
- Department of Dermatology, Yale University School of Medicine, New Haven, CT, USA
| | - Yuval Kluger
- Computational Biology & Bioinformatics Program, Yale University, New Haven, CT, USA.
- Department of Pathology, Yale University School of Medicine, New Haven, CT, USA.
- Program in Applied Mathematics, Yale University, New Haven, CT, USA.
| |
Collapse
|
5
|
Zaccaron AZ, Chen LH, Stergiopoulos I. Transcriptome analysis of two isolates of the tomato pathogen Cladosporium fulvum, uncovers genome-wide patterns of alternative splicing during a host infection cycle. PLoS Pathog 2024; 20:e1012791. [PMID: 39693392 DOI: 10.1371/journal.ppat.1012791] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2024] [Revised: 01/02/2025] [Accepted: 11/25/2024] [Indexed: 12/20/2024] Open
Abstract
Alternative splicing (AS) is a key element of eukaryotic gene expression that increases transcript and proteome diversity in cells, thereby altering their responses to external stimuli and stresses. While AS has been intensively researched in plants and animals, its frequency, conservation, and putative impact on virulence, are relatively still understudied in plant pathogenic fungi. Here, we profiled the AS events occurring in genes of Cladosporium fulvum isolates Race 5 and Race 4, during nearly a complete compatible infection cycle on their tomato host. Our studies revealed extensive heterogeneity in the transcript isoforms assembled from different isolates, infections, and infection timepoints, as over 80% of the transcript isoforms were singletons that were detected in only a single sample. Despite that, nearly 40% of the protein-coding genes in each isolate were predicted to be recurrently AS across the disparate infection timepoints, infections, and the two isolates. Of these, 37.5% were common to both isolates and 59% resulted in multiple protein isoforms, thereby putatively increasing proteome diversity in the pathogen by 31% during infections. An enrichment analysis showed that AS mostly affected genes likely to be involved in the transport of nutrients, regulation of gene expression, and monooxygenase activity, suggesting a role for AS in finetuning adaptation of C. fulvum on its tomato host during infections. Tracing the location of the AS genes on the fungal chromosomes showed that they were mostly located in repeat-rich regions of the core chromosomes, indicating a causal connection between gene location on the genome and propensity to AS. Finally, multiple cases of differential isoform usage in AS genes of C. fulvum were identified, suggesting that modulation of AS at different infection stages may be another way by which pathogens refine infections on their hosts.
Collapse
Affiliation(s)
- Alex Z Zaccaron
- Department of Plant Pathology, University of California Davis (UC Davis), Davis, California United States of America
- Integrative Genetics and Genomics Graduate Group, University of California Davis (UC Davis), California, United States of America
| | - Li-Hung Chen
- Department of Plant Pathology, University of California Davis (UC Davis), Davis, California United States of America
| | - Ioannis Stergiopoulos
- Department of Plant Pathology, University of California Davis (UC Davis), Davis, California United States of America
| |
Collapse
|
6
|
Ramirez Sierra MA, Sokolowski TR. AI-powered simulation-based inference of a genuinely spatial-stochastic gene regulation model of early mouse embryogenesis. PLoS Comput Biol 2024; 20:e1012473. [PMID: 39541410 PMCID: PMC11614244 DOI: 10.1371/journal.pcbi.1012473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 12/03/2024] [Accepted: 09/10/2024] [Indexed: 11/16/2024] Open
Abstract
Understanding how multicellular organisms reliably orchestrate cell-fate decisions is a central challenge in developmental biology, particularly in early mammalian development, where tissue-level differentiation arises from seemingly cell-autonomous mechanisms. In this study, we present a multi-scale, spatial-stochastic simulation framework for mouse embryogenesis, focusing on inner cell mass (ICM) differentiation into epiblast (EPI) and primitive endoderm (PRE) at the blastocyst stage. Our framework models key regulatory and tissue-scale interactions in a biophysically realistic fashion, capturing the inherent stochasticity of intracellular gene expression and intercellular signaling, while efficiently simulating these processes by advancing event-driven simulation techniques. Leveraging the power of Simulation-Based Inference (SBI) through the AI-driven Sequential Neural Posterior Estimation (SNPE) algorithm, we conduct a large-scale Bayesian inferential analysis to identify parameter sets that faithfully reproduce experimentally observed features of ICM specification. Our results reveal mechanistic insights into how the combined action of autocrine and paracrine FGF4 signaling coordinates stochastic gene expression at the cellular scale to achieve robust and reproducible ICM patterning at the tissue scale. We further demonstrate that the ICM exhibits a specific time window of sensitivity to exogenous FGF4, enabling lineage proportions to be adjusted based on timing and dosage, thereby extending current experimental findings and providing quantitative predictions for both mutant and wild-type ICM systems. Notably, FGF4 signaling not only ensures correct EPI-PRE lineage proportions but also enhances ICM resilience to perturbations, reducing fate-proportioning errors by 10-20% compared to a purely cell-autonomous system. Additionally, we uncover a surprising role for variability in intracellular initial conditions, showing that high gene-expression heterogeneity can improve both the accuracy and precision of cell-fate proportioning, which remains robust when fewer than 25% of the ICM population experiences perturbed initial conditions. Our work offers a comprehensive, spatial-stochastic description of the biochemical processes driving ICM differentiation and identifies the necessary conditions for its robust unfolding. It also provides a framework for future exploration of similar spatial-stochastic systems in developmental biology.
Collapse
Affiliation(s)
- Michael Alexander Ramirez Sierra
- Frankfurt Institute for Advanced Studies (FIAS), Frankfurt am Main, Germany
- Faculty of Computer Science and Mathematics, Goethe-Universität Frankfurt am Main, Frankfurt am Main, Germany
| | | |
Collapse
|
7
|
Zhou L, Chen H, Zhang J, Zhang J, Qiu H, Zhou T. Exact burst-size distributions for gene-expression models with complex promoter structure. Biosystems 2024; 246:105337. [PMID: 39299486 DOI: 10.1016/j.biosystems.2024.105337] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Revised: 09/14/2024] [Accepted: 09/14/2024] [Indexed: 09/22/2024]
Abstract
In prokaryotic and eukaryotic cells, most genes are transcribed in a bursty fashion on one hand and complex gene regulations may lead to complex promoter structure on the other hand. This raises an unsolved issue: how does promoter structure shape transcriptional bursting kinetics characterized by burst size and frequency? Here we analyze stochastic models of gene transcription, which consider complex regulatory mechanisms. Notably, we develop an efficient method to derive exact burst-size distributions. The analytical results show that if the promoter of a gene contains only one active state, the burst size indeed follows a geometric distribution, in agreement with the previous result derived under certain limiting conditions. However, if it contains a multitude of active states, the burst size in general obeys a non-geometric distribution, which is a linearly weighted sum of geometric distributions. This superposition principle reveals the essential feature of bursting kinetics in complex cases of transcriptional regulation although it seems that there has been no direct experimental confirmation. The derived burst-size distributions not only highlight the importance of promoter structure in regulating bursting kinetics, but can be also used in the exact inference of this kinetics based on experimental data.
Collapse
Affiliation(s)
- Liying Zhou
- School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, PR China
| | - Haowen Chen
- School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, PR China
| | - Jinqiang Zhang
- School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, PR China
| | - Jiajun Zhang
- Key Laboratory of Computational Mathematics, Guangdong Province, PR China; School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, PR China
| | - Huahai Qiu
- School of Mathematical and Physical Sciences, Wuhan Textile University, Wuhan, 430200, PR China.
| | - Tianshou Zhou
- Key Laboratory of Computational Mathematics, Guangdong Province, PR China; School of Mathematics, Sun Yat-Sen University, Guangzhou, 510275, PR China.
| |
Collapse
|
8
|
Shi C, Yang X, Zhou T, Zhang J. Nascent RNA kinetics with complex promoter architecture: Analytic results and parameter inference. Phys Rev E 2024; 110:034413. [PMID: 39425372 DOI: 10.1103/physreve.110.034413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 09/11/2024] [Indexed: 10/21/2024]
Abstract
Transcription is a stochastic process that involves several downstream operations which make it difficult to model and infer transcription kinetics from mature RNA numbers in individual cell. However, recent advances in single-cell technologies have enabled a more precise measurement of the fluctuations of nascent RNA that closely reflect transcription kinetics. In this paper we introduce a general stochastic model to mimic nascent RNA kinetics with complex promoter architecture. We derive the exact distribution and moments of nascent RNA using queuing theory techniques, which provide valuable insights into the effect of the molecular memory created by the multistep activation and deactivation on the stochastic kinetics of nascent RNA. Moreover, based on the analytical results, we develop a statistical method to infer the promoter memory from stationary nascent RNA distributions. Data analysis of synthetic data and a realistic example, the HIV-1 gene, verifies the validity of this inference method.
Collapse
|
9
|
Volteras D, Shahrezaei V, Thomas P. Global transcription regulation revealed from dynamical correlations in time-resolved single-cell RNA sequencing. Cell Syst 2024; 15:694-708.e12. [PMID: 39121860 DOI: 10.1016/j.cels.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2023] [Revised: 02/29/2024] [Accepted: 07/11/2024] [Indexed: 08/12/2024]
Abstract
Single-cell transcriptomics reveals significant variations in transcriptional activity across cells. Yet, it remains challenging to identify mechanisms of transcription dynamics from static snapshots. It is thus still unknown what drives global transcription dynamics in single cells. We present a stochastic model of gene expression with cell size- and cell cycle-dependent rates in growing and dividing cells that harnesses temporal dimensions of single-cell RNA sequencing through metabolic labeling protocols and cel lcycle reporters. We develop a parallel and highly scalable approximate Bayesian computation method that corrects for technical variation and accurately quantifies absolute burst frequency, burst size, and degradation rate along the cell cycle at a transcriptome-wide scale. Using Bayesian model selection, we reveal scaling between transcription rates and cell size and unveil waves of gene regulation across the cell cycle-dependent transcriptome. Our study shows that stochastic modeling of dynamical correlations identifies global mechanisms of transcription regulation. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Dimitris Volteras
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK
| | - Vahid Shahrezaei
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| | - Philipp Thomas
- Department of Mathematics, Faculty of Natural Sciences, Imperial College London, London, SW7 2AZ, UK.
| |
Collapse
|
10
|
Mulas C. Control of cell state transitions by post-transcriptional regulation. Philos Trans R Soc Lond B Biol Sci 2024; 379:20230050. [PMID: 38432322 PMCID: PMC10909504 DOI: 10.1098/rstb.2023.0050] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 12/19/2023] [Indexed: 03/05/2024] Open
Abstract
Cell state transitions are prevalent in biology, playing a fundamental role in development, homeostasis and repair. Dysregulation of cell state transitions can lead to or occur in a wide range of diseases. In this letter, I explore and highlight the role of post-transcriptional regulatory mechanisms in determining the dynamics of cell state transitions. I propose that regulation of protein levels after transcription provides an under-appreciated regulatory route to obtain fast and sharp transitions between distinct cell states. This article is part of a discussion meeting issue 'Causes and consequences of stochastic processes in development and disease'.
Collapse
Affiliation(s)
- Carla Mulas
- Altos Labs Cambridge Institute of Science, Granta Park, Cambridge, CB21 6GP, UK
| |
Collapse
|
11
|
Darmasaputra GS, van Rijnberk LM, Galli M. Functional consequences of somatic polyploidy in development. Development 2024; 151:dev202392. [PMID: 38415794 PMCID: PMC10946441 DOI: 10.1242/dev.202392] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/29/2024]
Abstract
Polyploid cells contain multiple genome copies and arise in many animal tissues as a regulated part of development. However, polyploid cells can also arise due to cell division failure, DNA damage or tissue damage. Although polyploidization is crucial for the integrity and function of many tissues, the cellular and tissue-wide consequences of polyploidy can be very diverse. Nonetheless, many polyploid cell types and tissues share a remarkable similarity in function, providing important information about the possible contribution of polyploidy to cell and tissue function. Here, we review studies on polyploid cells in development, underlining parallel functions between different polyploid cell types, as well as differences between developmentally-programmed and stress-induced polyploidy.
Collapse
Affiliation(s)
- Gabriella S. Darmasaputra
- Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT, Utrecht, the Netherlands
| | - Lotte M. van Rijnberk
- Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT, Utrecht, the Netherlands
| | - Matilde Galli
- Hubrecht Institute, Royal Netherlands Academy of Arts and Sciences and University Medical Center Utrecht, Uppsalalaan 8, 3584 CT, Utrecht, the Netherlands
| |
Collapse
|
12
|
Zhang C, Jiao F. Using steady-state formula to estimate time-dependent parameters of stochastic gene transcription models. Biosystems 2024; 236:105128. [PMID: 38280446 DOI: 10.1016/j.biosystems.2024.105128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 01/20/2024] [Accepted: 01/21/2024] [Indexed: 01/29/2024]
Abstract
When studying stochastic gene transcription, it is important to understand how system parameters are temporally modulated in response to varying environments. Experimentally, the dynamic distribution data of RNA copy numbers measured at multiple time points are often fitted to stochastic transcription models to estimate time-dependent parameters. However, current methods require determining which parameters are time-dependent, as well as their analytical formulas, before the optimal fit. In this study, we developed a method to estimate time-dependent parameters in a classical two-state model without prior assumptions regarding the system parameters. At each measured time point, the method fitted the dynamic distribution data using a steady-state distribution formula, in which the estimated constant parameters were approximated as time-dependent parameter values at the measured time point. The accuracy of this method can be guaranteed for RNA molecules with relatively high degradation rates and genes with relatively slow responses to induction. We quantify the accuracy of the method and implemented this method on two sets of dynamic distribution data from prokaryotic and eukaryotic cells, and revealed the temporal modulation of transcription burst size in response to environmental changes.
Collapse
Affiliation(s)
- Congrun Zhang
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China
| | - Feng Jiao
- Guangzhou Center for Applied Mathematics, Guangzhou University, Guangzhou, 510006, PR China; College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, China.
| |
Collapse
|
13
|
Bruno S, Schlaeger TM, Del Vecchio D. Epigenetic OCT4 regulatory network: stochastic analysis of cellular reprogramming. NPJ Syst Biol Appl 2024; 10:3. [PMID: 38184707 PMCID: PMC10771499 DOI: 10.1038/s41540-023-00326-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 12/08/2023] [Indexed: 01/08/2024] Open
Abstract
Experimental studies have shown that chromatin modifiers have a critical effect on cellular reprogramming, i.e., the conversion of differentiated cells to pluripotent stem cells. Here, we develop a model of the OCT4 gene regulatory network that includes genes expressing chromatin modifiers TET1 and JMJD2, and the chromatin modification circuit on which these modifiers act. We employ this model to compare three reprogramming approaches that have been considered in the literature with respect to reprogramming efficiency and latency variability. These approaches are overexpression of OCT4 alone, overexpression of OCT4 with TET1, and overexpression of OCT4 with JMJD2. Our results show more efficient and less variable reprogramming when also JMJD2 and TET1 are overexpressed, consistent with previous experimental data. Nevertheless, TET1 overexpression can lead to more efficient reprogramming compared to JMJD2 overexpression. This is the case when the recruitment of DNA methylation by H3K9me3 is weak and the methyl-CpG-binding domain (MBD) proteins are sufficiently scarce such that they do not hamper TET1 binding to methylated DNA. The model that we developed provides a mechanistic understanding of existing experimental results and is also a tool for designing optimized reprogramming approaches that combine overexpression of cell-fate specific transcription factors (TFs) with targeted recruitment of epigenetic modifiers.
Collapse
Affiliation(s)
- Simone Bruno
- Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA, 02139, USA
| | - Thorsten M Schlaeger
- Boston Children's Hospital Stem Cell Program, Boston Children's Hospital, 300 Longwood Avenue, Boston, MA, 02115, USA
| | - Domitilla Del Vecchio
- Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA, 02139, USA.
| |
Collapse
|
14
|
Grima R, Esmenjaud PM. Quantifying and correcting bias in transcriptional parameter inference from single-cell data. Biophys J 2024; 123:4-30. [PMID: 37885177 PMCID: PMC10808030 DOI: 10.1016/j.bpj.2023.10.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Revised: 09/12/2023] [Accepted: 10/19/2023] [Indexed: 10/28/2023] Open
Abstract
The snapshot distribution of mRNA counts per cell can be measured using single-molecule fluorescence in situ hybridization or single-cell RNA sequencing. These distributions are often fit to the steady-state distribution of the two-state telegraph model to estimate the three transcriptional parameters for a gene of interest: mRNA synthesis rate, the switching on rate (the on state being the active transcriptional state), and the switching off rate. This model assumes no extrinsic noise, i.e., parameters do not vary between cells, and thus estimated parameters are to be understood as approximating the average values in a population. The accuracy of this approximation is currently unclear. Here, we develop a theory that explains the size and sign of estimation bias when inferring parameters from single-cell data using the standard telegraph model. We find specific bias signatures depending on the source of extrinsic noise (which parameter is most variable across cells) and the mode of transcriptional activity. If gene expression is not bursty then the population averages of all three parameters are overestimated if extrinsic noise is in the synthesis rate; underestimation occurs if extrinsic noise is in the switching on rate; both underestimation and overestimation can occur if extrinsic noise is in the switching off rate. We find that some estimated parameters tend to infinity as the size of extrinsic noise approaches a critical threshold. In contrast when gene expression is bursty, we find that in all cases the mean burst size (ratio of the synthesis rate to the switching off rate) is overestimated while the mean burst frequency (the switching on rate) is underestimated. We estimate the size of extrinsic noise from the covariance matrix of sequencing data and use this together with our theory to correct published estimates of transcriptional parameters for mammalian genes.
Collapse
Affiliation(s)
- Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom.
| | - Pierre-Marie Esmenjaud
- Biology Department, Ecole Polytechnique, Institut Polytechnique de Paris, Palaiseau, France
| |
Collapse
|
15
|
Golding I, Amir A. Gene expression in growing cells: A biophysical primer. ARXIV 2023:arXiv:2311.12143v1. [PMID: 38045483 PMCID: PMC10690283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/05/2023]
Abstract
Cell growth and gene expression, two essential elements of all living systems, have long been the focus of biophysical interrogation. Advances in experimental single-cell methods have invigorated theoretical studies into these processes. However, until recently, there was little dialog between the two areas of study. In particular, most theoretical models for gene regulation assumed gene activity to be oblivious to the progression of the cell cycle between birth and division. But, in fact, there are numerous ways in which the periodic character of all cellular observables can modulate gene expression. The molecular factors required for transcription and translation-RNA polymerase, transcription factors, ribosomes-increase in number during the cell cycle, but are also diluted due to the continuous increase in cell volume. The replication of the genome changes the dosage of those same cellular players but also provides competing targets for regulatory binding. Finally, cell division reduces their number again, and so forth. Stochasticity is inherent to all these biological processes, manifested in fluctuations in the synthesis and degradation of new cellular components as well as the random partitioning of molecules at each cell division event. The notion of gene expression as stationary is thus hard to justify. In this review, we survey the emerging paradigm of cell-cycle regulated gene expression, with an emphasis on the global expression patterns rather than gene-specific regulation. We discuss recent experimental reports where cell growth and gene expression were simultaneously measured in individual cells, providing first glimpses into the coupling between the two, and motivating several questions. How do the levels of gene expression products - mRNA and protein - scale with the cell volume and cell-cycle progression? What are the molecular origins of the observed scaling laws, and when do they break down to yield non-canonical behavior? What are the consequences of cell-cycle dependence for the heterogeneity ("noise") in gene expression within a cell population? While the experimental findings, not surprisingly, differ among genes, organisms, and environmental conditions, several theoretical models have emerged that attempt to reconcile these differences and form a unifying framework for understanding gene expression in growing cells.
Collapse
Affiliation(s)
- Ido Golding
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Microbiology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Ariel Amir
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, USA
- Department of Physics of Complex Systems, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
16
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. Cell Syst 2023; 14:822-843.e22. [PMID: 37751736 PMCID: PMC10725240 DOI: 10.1016/j.cels.2023.08.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/16/2023] [Accepted: 08/25/2023] [Indexed: 09/28/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - John J Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
17
|
Shi C, Yang X, Zhang J, Zhou T. Stochastic modeling of the mRNA life process: A generalized master equation. Biophys J 2023; 122:4023-4041. [PMID: 37653725 PMCID: PMC10598292 DOI: 10.1016/j.bpj.2023.08.024] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 06/29/2023] [Accepted: 08/29/2023] [Indexed: 09/02/2023] Open
Abstract
The mRNA life cycle is a complex biochemical process, involving transcription initiation, elongation, termination, splicing, and degradation. Each of these molecular events is multistep and can create a memory. The effect of this molecular memory on gene expression is not clear, although there are many related yet scattered experimental reports. To address this important issue, we develop a general theoretical framework formulated as a master equation in the sense of queue theory, which can reduce to multiple previously studied gene models in limiting cases. This framework allows us to interpret experimental observations, extract kinetic parameters from experimental data, and identify how the mRNA kinetics vary under regulatory influences. Notably, it allows us to evaluate the influences of elongation processes on mature RNA distribution; e.g., we find that the non-exponential elongation time can induce the bimodal mRNA expression and there is an optimal elongation noise intensity such that the mature RNA noise achieves the lowest level. In a word, our framework can not only provide insight into complex mRNA life processes but also bridge a dialogue between theoretical studies and experimental data.
Collapse
Affiliation(s)
- Changhong Shi
- State Key Laboratory of Respiratory Disease, School of Public Health, Guangzhou Medical University, Guangzhou, China
| | - Xiyan Yang
- School of Financial Mathematics and Statistics, Guangdong University of Finance, Guangzhou, China
| | - Jiajun Zhang
- School of Mathematics and Computational Science and Guangdong Province Key Laboratory of Computational Science, Sun Yat-Sen University, Guangzhou, China.
| | - Tianshou Zhou
- School of Mathematics and Computational Science and Guangdong Province Key Laboratory of Computational Science, Sun Yat-Sen University, Guangzhou, China.
| |
Collapse
|
18
|
Liu H, Arsiè R, Schwabe D, Schilling M, Minia I, Alles J, Boltengagen A, Kocks C, Falcke M, Friedman N, Landthaler M, Rajewsky N. SLAM-Drop-seq reveals mRNA kinetic rates throughout the cell cycle. Mol Syst Biol 2023; 19:1-23. [PMID: 38778223 PMCID: PMC10568207 DOI: 10.15252/msb.202211427] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2022] [Revised: 07/24/2023] [Accepted: 08/04/2023] [Indexed: 05/25/2024] Open
Abstract
RNA abundance is tightly regulated in eukaryotic cells by modulating the kinetic rates of RNA production, processing, and degradation. To date, little is known about time‐dependent kinetic rates during dynamic processes. Here, we present SLAM‐Drop‐seq, a method that combines RNA metabolic labeling and alkylation of modified nucleotides in methanol‐fixed cells with droplet‐based sequencing to detect newly synthesized and preexisting mRNAs in single cells. As a first application, we sequenced 7280 HEK293 cells and calculated gene‐specific kinetic rates during the cell cycle using the novel package Eskrate. Of the 377 robust‐cycling genes that we identified, only a minor fraction is regulated solely by either dynamic transcription or degradation (6 and 4%, respectively). By contrast, the vast majority (89%) exhibit dynamically regulated transcription and degradation rates during the cell cycle. Our study thus shows that temporally regulated mRNA degradation is fundamental for the correct expression of a majority of cycling genes. SLAM‐Drop‐seq, combined with Eskrate, is a powerful approach to understanding the underlying mRNA kinetics of single‐cell gene expression dynamics in continuous biological processes.
Collapse
Affiliation(s)
- Haiyue Liu
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Roberto Arsiè
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Daniel Schwabe
- Mathematical Cell Physiology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Marcel Schilling
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Lübeck Interdisciplinary Platform for Genome Analytics (LIGA), University of Lübeck, Lübeck, Germany
| | - Igor Minia
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Jonathan Alles
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Anastasiya Boltengagen
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Christine Kocks
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
| | - Martin Falcke
- Mathematical Cell Physiology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany
- Department of Physics, Humboldt University Berlin, Berlin, Germany
| | - Nir Friedman
- The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University of Jerusalem, Jerusalem, Israel
- The Lautenberg Center for Immunology and Cancer Research, Institute of Medical Research Israel-Canada (IMRIC), Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
- The Center for Computational Medicine, Faculty of Medicine, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Markus Landthaler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.
- Institut für Biologie, Humboldt Universität zu Berlin, Berlin, Germany.
| | - Nikolaus Rajewsky
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin, Germany.
- Charité - Universitätsmedizin Berlin, Corporate Member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
- German Center for Cardiovascular Research (DZHK), Berlin, Germany.
- NeuroCure Cluster of Excellence, Berlin, Germany.
- German Cancer Consortium (DKTK), Berlin, Germany.
- National Center for Tumor Diseases (NCT), Berlin, Germany.
| |
Collapse
|
19
|
Das S, Singh A, Shah P. Evaluating single-cell variability in proteasomal decay. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.22.554358. [PMID: 37662347 PMCID: PMC10473619 DOI: 10.1101/2023.08.22.554358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
Gene expression is a stochastic process that leads to variability in mRNA and protein abundances even within an isogenic population of cells grown in the same environment. This variation, often called gene-expression noise, has typically been attributed to transcriptional and translational processes while ignoring the contributions of protein decay variability across cells. Here we estimate the single-cell protein decay rates of two degron GFPs in Saccharomyces cerevisiae using time-lapse microscopy. We find substantial cell-to-cell variability in the decay rates of the degron GFPs. We evaluate cellular features that explain the variability in the proteasomal decay and find that the amount of 20s catalytic beta subunit of the proteasome marginally explains the observed variability in the degron GFP half-lives. We propose alternate hypotheses that might explain the observed variability in the decay of the two degron GFPs. Overall, our study highlights the importance of studying the kinetics of the decay process at single-cell resolution and that decay rates vary at the single-cell level, and that the decay process is stochastic. A complex model of decay dynamics must be included when modeling stochastic gene expression to estimate gene expression noise.
Collapse
Affiliation(s)
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, Biomedical Engineering, University of Delaware
| | | |
Collapse
|
20
|
Alachkar N, Norton D, Wolkensdorfer Z, Muldoon M, Paszek P. Variability of the innate immune response is globally constrained by transcriptional bursting. Front Mol Biosci 2023; 10:1176107. [PMID: 37441161 PMCID: PMC10333517 DOI: 10.3389/fmolb.2023.1176107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 06/15/2023] [Indexed: 07/15/2023] Open
Abstract
Transcription of almost all mammalian genes occurs in stochastic bursts, however the fundamental control mechanisms that allow appropriate single-cell responses remain unresolved. Here we utilise single cell genomics data and stochastic models of transcription to perform global analysis of the toll-like receptor (TLR)-induced gene expression variability. Based on analysis of more than 2000 TLR-response genes across multiple experimental conditions we demonstrate that the single-cell, gene-by-gene expression variability can be empirically described by a linear function of the population mean. We show that response heterogeneity of individual genes can be characterised by the slope of the mean-variance line, which captures how cells respond to stimulus and provides insight into evolutionary differences between species. We further demonstrate that linear relationships theoretically determine the underlying transcriptional bursting kinetics, revealing different regulatory modes of TLR response heterogeneity. Stochastic modelling of temporal scRNA-seq count distributions demonstrates that increased response variability is associated with larger and more frequent transcriptional bursts, which emerge via increased complexity of transcriptional regulatory networks between genes and different species. Overall, we provide a methodology relying on inference of empirical mean-variance relationships from single cell data and new insights into control of innate immune response variability.
Collapse
Affiliation(s)
- Nissrin Alachkar
- Division of Immunology, Immunity to Infection and Respiratory Medicine, Lydia Becker Institute of Immunology and Inflammation, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom
| | - Dale Norton
- Division of Immunology, Immunity to Infection and Respiratory Medicine, Lydia Becker Institute of Immunology and Inflammation, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom
| | - Zsofia Wolkensdorfer
- Division of Immunology, Immunity to Infection and Respiratory Medicine, Lydia Becker Institute of Immunology and Inflammation, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom
| | - Mark Muldoon
- Department of Mathematics, University of Manchester, Manchester, United Kingdom
| | - Pawel Paszek
- Division of Immunology, Immunity to Infection and Respiratory Medicine, Lydia Becker Institute of Immunology and Inflammation, Faculty of Biology, Medicine and Health, Manchester Academic Health Science Centre, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
21
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.17.541250. [PMID: 37292934 PMCID: PMC10245677 DOI: 10.1101/2023.05.17.541250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125
| | - John J. Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA, 02115
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125
| |
Collapse
|
22
|
Vo HD, Forero-Quintero LS, Aguilera LU, Munsky B. Analysis and design of single-cell experiments to harvest fluctuation information while rejecting measurement noise. Front Cell Dev Biol 2023; 11:1133994. [PMID: 37305680 PMCID: PMC10250612 DOI: 10.3389/fcell.2023.1133994] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Accepted: 05/10/2023] [Indexed: 06/13/2023] Open
Abstract
Introduction: Despite continued technological improvements, measurement errors always reduce or distort the information that any real experiment can provide to quantify cellular dynamics. This problem is particularly serious for cell signaling studies to quantify heterogeneity in single-cell gene regulation, where important RNA and protein copy numbers are themselves subject to the inherently random fluctuations of biochemical reactions. Until now, it has not been clear how measurement noise should be managed in addition to other experiment design variables (e.g., sampling size, measurement times, or perturbation levels) to ensure that collected data will provide useful insights on signaling or gene expression mechanisms of interest. Methods: We propose a computational framework that takes explicit consideration of measurement errors to analyze single-cell observations, and we derive Fisher Information Matrix (FIM)-based criteria to quantify the information value of distorted experiments. Results and Discussion: We apply this framework to analyze multiple models in the context of simulated and experimental single-cell data for a reporter gene controlled by an HIV promoter. We show that the proposed approach quantitatively predicts how different types of measurement distortions affect the accuracy and precision of model identification, and we demonstrate that the effects of these distortions can be mitigated through explicit consideration during model inference. We conclude that this reformulation of the FIM could be used effectively to design single-cell experiments to optimally harvest fluctuation information while mitigating the effects of image distortion.
Collapse
Affiliation(s)
- Huy D. Vo
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Linda S. Forero-Quintero
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Luis U. Aguilera
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
| | - Brian Munsky
- Department of Chemical and Biological Engineering, Colorado State University, Fort Collins, CO, United States
- School of Biomedical Engineering, Colorado State University, Fort Collins, CO, United States
| |
Collapse
|
23
|
Wang Y, He S. Inference on autoregulation in gene expression with variance-to-mean ratio. J Math Biol 2023; 86:87. [PMID: 37131095 PMCID: PMC10154285 DOI: 10.1007/s00285-023-01924-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/14/2023] [Accepted: 04/18/2023] [Indexed: 05/04/2023]
Abstract
Some genes can promote or repress their own expressions, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation from gene expression data. This method only needs to compare the mean and variance of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, CA, 90095, USA.
- Institut des Hautes Études Scientifiques (IHÉS), Bures-sur-Yvette, 91440, Essonne, France.
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, NY, 11794, USA
| |
Collapse
|
24
|
Carilli M, Gorin G, Choi Y, Chari T, Pachter L. Biophysical modeling with variational autoencoders for bimodal, single-cell RNA sequencing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.13.523995. [PMID: 36712140 PMCID: PMC9882246 DOI: 10.1101/2023.01.13.523995] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
We motivate and present biVI, which combines the variational autoencoder framework of scVI with biophysically motivated, bivariate models for nascent and mature RNA distributions. While previous approaches to integrate bimodal data via the variational autoencoder framework ignore the causal relationship between measurements, biVI models the biophysical processes that give rise to observations. We demonstrate through simulated benchmarking that biVI captures cell type structure in a low-dimensional space and accurately recapitulates parameter values and copy number distributions. On biological data, biVI provides a scalable route for identifying the biophysical mechanisms underlying gene expression. This analytical approach outlines a generalizable strategy for treating multimodal datasets generated by high-throughput, single-cell genomic assays.
Collapse
Affiliation(s)
- Maria Carilli
- Division of Biology and Biological Engineering, California Institute of Technology
| | - Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology
| | - Yongin Choi
- Biomedical Engineering Graduate Group, University of California, Davis
- Genome Center, University of California, Davis
| | - Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology
- Department of Computing and Mathematical Sciences, California Institute of Technology
| |
Collapse
|
25
|
Ren J, Zhou H, Zeng H, Wang CK, Huang J, Qiu X, Sui X, Li Q, Wu X, Lin Z, Lo JA, Maher K, He Y, Tang X, Lam J, Chen H, Li B, Fisher DE, Liu J, Wang X. Spatiotemporally resolved transcriptomics reveals the subcellular RNA kinetic landscape. Nat Methods 2023; 20:695-705. [PMID: 37038000 PMCID: PMC10172111 DOI: 10.1038/s41592-023-01829-8] [Citation(s) in RCA: 28] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2022] [Accepted: 02/22/2023] [Indexed: 04/12/2023]
Abstract
Spatiotemporal regulation of the cellular transcriptome is crucial for proper protein expression and cellular function. However, the intricate subcellular dynamics of RNA remain obscured due to the limitations of existing transcriptomics methods. Here, we report TEMPOmap-a method that uncovers subcellular RNA profiles across time and space at the single-cell level. TEMPOmap integrates pulse-chase metabolic labeling with highly multiplexed three-dimensional in situ sequencing to simultaneously profile the age and location of individual RNA molecules. Using TEMPOmap, we constructed the subcellular RNA kinetic landscape in various human cells from transcription and translocation to degradation. Clustering analysis of RNA kinetic parameters across single cells revealed 'kinetic gene clusters' whose expression patterns were shaped by multistep kinetic sculpting. Importantly, these kinetic gene clusters are functionally segregated, suggesting that subcellular RNA kinetics are differentially regulated in a cell-state- and cell-type-dependent manner. Spatiotemporally resolved transcriptomics provides a gateway to uncovering new spatiotemporal gene regulation principles.
Collapse
Affiliation(s)
- Jingyi Ren
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Haowen Zhou
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Hu Zeng
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | | | - Jiahao Huang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Xiaojie Qiu
- Whitehead Institute for Biomedical Research Cambridge, Cambridge, MA, USA
| | - Xin Sui
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Qiang Li
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA
| | - Xunwei Wu
- Cutaneous Biology Research Center, Massachusetts General Hospital, Harvard Medical School, Charlestown, MA, USA
| | - Zuwan Lin
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA
| | - Jennifer A Lo
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Kamal Maher
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Computational and Systems Biology Program, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Yichun He
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA
| | - Xin Tang
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA
| | - Judson Lam
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Hongyu Chen
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Brian Li
- Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - David E Fisher
- Cutaneous Biology Research Center, Massachusetts General Hospital, Harvard Medical School, Charlestown, MA, USA
| | - Jia Liu
- John A. Paulson School of Engineering and Applied Sciences, Harvard University, Boston, MA, USA
| | - Xiao Wang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Broad Institute of MIT and Harvard, Cambridge, MA, USA.
| |
Collapse
|
26
|
Wang Y, He S. Using Fano factors to determine certain types of gene autoregulation. ARXIV 2023:arXiv:2301.06692v2. [PMID: 36713249 PMCID: PMC9882590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
The expression of one gene might be regulated by its corresponding protein, which is called autoregulation. Although gene regulation is a central topic in biology, autoregulation is much less studied. In general, it is extremely difficult to determine the existence of autoregulation with direct biochemical approaches. Nevertheless, some papers have observed that certain types of autoregulations are linked to noise levels in gene expression. We generalize these results by two propositions on discrete-state continuous-time Markov chains. These two propositions form a simple but robust method to infer the existence of autoregulation in certain scenarios from gene expression data. This method only depends on the Fano factor, namely the ratio of variance and mean of the gene expression level. Compared to other methods for inferring autoregulation, our method only requires non-interventional one-time data, and does not need to estimate parameters. Besides, our method has few restrictions on the model. We apply this method to four groups of experimental data and find some genes that might have autoregulation. Some inferred autoregulations have been verified by experiments or other theoretical works.
Collapse
Affiliation(s)
- Yue Wang
- Department of Computational Medicine, University of California, Los Angeles, California, United States of America
- Institut des Hautes Études Scientifiques, Bures-sur-Yvette, Essonne, France
| | - Siqi He
- Simons Center for Geometry and Physics, Stony Brook University, Stony Brook, New York, United States of America
| |
Collapse
|
27
|
Weidemann DE, Singh A, Grima R, Hauf S. The minimal intrinsic stochasticity of constitutively expressed eukaryotic genes is sub-Poissonian. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.06.531283. [PMID: 36945401 PMCID: PMC10028819 DOI: 10.1101/2023.03.06.531283] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]
Abstract
Stochastic variation in gene products ("noise") is an inescapable by-product of gene expression. Noise must be minimized to allow for the reliable execution of cellular functions. However, noise cannot be suppressed beyond an intrinsic lower limit. For constitutively expressed genes, this limit is believed to be Poissonian, meaning that the variance in mRNA numbers cannot be lower than their mean. Here, we show that several cell division genes in fission yeast have mRNA variances significantly below this limit, which cannot be explained by the classical gene expression model for low-noise genes. Our analysis reveals that multiple steps in both transcription and mRNA degradation are essential to explain this sub-Poissonian variance. The sub-Poissonian regime differs qualitatively from previously characterized noise regimes, a hallmark being that cytoplasmic noise is reduced when the mRNA export rate increases. Our study re-defines the lower limit of eukaryotic gene expression noise and identifies molecular requirements for ultra-low noise which are expected to support essential cell functions.
Collapse
Affiliation(s)
- Douglas E Weidemann
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA
- Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
| | - Abhyudai Singh
- Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA
- Department of Biomedical Engineering, University of Delaware, Newark, DE 19716, USA
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JR, Scotland, UK
| | - Silke Hauf
- Department of Biological Sciences, Virginia Tech, Blacksburg, VA 24061, USA
- Fralin Life Sciences Institute, Virginia Tech, Blacksburg, VA 24061, USA
| |
Collapse
|
28
|
Jia C, Grima R. Coupling gene expression dynamics to cell size dynamics and cell cycle events: Exact and approximate solutions of the extended telegraph model. iScience 2023; 26:105746. [PMID: 36619980 PMCID: PMC9813732 DOI: 10.1016/j.isci.2022.105746] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 11/02/2022] [Accepted: 12/02/2022] [Indexed: 12/12/2022] Open
Abstract
The standard model describing the fluctuations of mRNA numbers in single cells is the telegraph model which includes synthesis and degradation of mRNA, and switching of the gene between active and inactive states. While commonly used, this model does not describe how fluctuations are influenced by the cell cycle phase, cellular growth and division, and other crucial aspects of cellular biology. Here, we derive the analytical time-dependent solution of an extended telegraph model that explicitly considers the doubling of gene copy numbers upon DNA replication, dependence of the mRNA synthesis rate on cellular volume, gene dosage compensation, partitioning of molecules during cell division, cell-cycle duration variability, and cell-size control strategies. Based on the time-dependent solution, we obtain the analytical distributions of transcript numbers for lineage and population measurements in steady-state growth and also find a linear relation between the Fano factor of mRNA fluctuations and cell volume fluctuations. We show that generally the lineage and population distributions in steady-state growth cannot be accurately approximated by the steady-state solution of extrinsic noise models, i.e. a telegraph model with parameters drawn from probability distributions. This is because the mRNA lifetime is often not small enough compared to the cell cycle duration to erase the memory of division and replication. Accurate approximations are possible when this memory is weak, e.g. for genes with bursty expression and for which there is sufficient gene dosage compensation when replication occurs.
Collapse
Affiliation(s)
- Chen Jia
- Applied and Computational Mathematics Division, Beijing Computational Science Research Center, Beijing 100193, China
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
| |
Collapse
|
29
|
Luo S, Wang Z, Zhang Z, Zhou T, Zhang J. Genome-wide inference reveals that feedback regulations constrain promoter-dependent transcriptional burst kinetics. Nucleic Acids Res 2022; 51:68-83. [PMID: 36583343 PMCID: PMC9874261 DOI: 10.1093/nar/gkac1204] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Revised: 11/06/2022] [Accepted: 12/06/2022] [Indexed: 12/31/2022] Open
Abstract
Gene expression in mammalian cells is highly variable and episodic, resulting in a series of discontinuous bursts of mRNAs. A challenge is to understand how static promoter architecture and dynamic feedback regulations dictate bursting on a genome-wide scale. Although single-cell RNA sequencing (scRNA-seq) provides an opportunity to address this challenge, effective analytical methods are scarce. We developed an interpretable and scalable inference framework, which combined experimental data with a mechanistic model to infer transcriptional burst kinetics (sizes and frequencies) and feedback regulations. Applying this framework to scRNA-seq data generated from embryonic mouse fibroblast cells, we found Simpson's paradoxes, i.e. genome-wide burst kinetics exhibit different characteristics in two cases without and with distinguishing feedback regulations. We also showed that feedbacks differently modulate burst frequencies and sizes and conceal the effects of transcription start site distributions on burst kinetics. Notably, only in the presence of positive feedback, TATA genes are expressed with high burst frequencies and enhancer-promoter interactions mainly modulate burst frequencies. The developed inference method provided a flexible and efficient way to investigate transcriptional burst kinetics and the obtained results would be helpful for understanding cell development and fate decision.
Collapse
Affiliation(s)
| | | | - Zhenquan Zhang
- Guangdong Province Key Laboratory of Computational Science, Sun Yat-sen University, Guangzhou, 510275, P. R. China,School of Mathematics, Sun Yat-sen University, Guangzhou, Guangdong Province, 510275, P. R. China
| | - Tianshou Zhou
- Correspondence may also be addressed to Tianshou Zhou. Tel: +86 20 84134958;
| | - Jiajun Zhang
- To whom correspondence should be addressed. Tel: +86 20 84111829;
| |
Collapse
|
30
|
Boe RH, Ayyappan V, Schuh L, Raj A. Allelic correlation is a marker of trade-offs between barriers to transmission of expression variability and signal responsiveness in genetic networks. Cell Syst 2022; 13:1016-1032.e6. [PMID: 36450286 PMCID: PMC9811561 DOI: 10.1016/j.cels.2022.10.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 06/28/2022] [Accepted: 10/28/2022] [Indexed: 12/03/2022]
Abstract
Genetic networks should respond to signals but prevent the transmission of spontaneous fluctuations. Limited data from mammalian cells suggest that noise transmission is uncommon, but systematic claims about noise transmission have been limited by the inability to directly measure it. Here, we build a mathematical framework modeling allelic correlation and noise transmission, showing that allelic correlation and noise transmission correspond across model parameters and network architectures. Limiting noise transmission comes with the trade-off of being unresponsive to signals, and within responsive regimes, there is a further trade-off between response time and basal noise transmission. Analysis of allele-specific single-cell RNA-sequencing data revealed that genes encoding upstream factors in signaling pathways and cell-type-specific factors have higher allelic correlation than downstream factors, suggesting they are more subject to regulation. Overall, our findings suggest that some noise transmission must result from signal responsiveness, but it can be minimized by trading off for a slower response. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Ryan H Boe
- Genetics and Epigenetics, Cell and Molecular Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Vinay Ayyappan
- Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, USA
| | - Lea Schuh
- Institute of AI for Health, Helmholtz Zentrum München, German Research Center for Environmental Health, 85764 Neuherberg, Germany; Department of Mathematics, Technical University of Munich, Garching 85748, Germany
| | - Arjun Raj
- Department of Bioengineering, School of Engineering and Applied Sciences, University of Pennsylvania, Philadelphia, PA, USA; Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA.
| |
Collapse
|
31
|
Govindaraj V, Sarma S, Karulkar A, Purwar R, Kar S. Transcriptional Fluctuations Govern the Serum-Dependent Cell Cycle Duration Heterogeneities in Mammalian Cells. ACS Synth Biol 2022; 11:3743-3758. [PMID: 36325971 DOI: 10.1021/acssynbio.2c00347] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Mammalian cells exhibit a high degree of intercellular variability in cell cycle period and phase durations. However, the factors orchestrating the cell cycle duration heterogeneities remain unclear. Herein, by combining cell cycle network-based mathematical models with live single-cell imaging studies under varied serum conditions, we demonstrate that fluctuating transcription rates of cell cycle regulatory genes across cell lineages and during cell cycle progression in mammalian cells majorly govern the robust correlation patterns of cell cycle period and phase durations among sister, cousin, and mother-daughter lineage pairs. However, for the overall cellular population, alteration in the serum level modulates the fluctuation and correlation patterns of cell cycle period and phase durations in a correlated manner. These heterogeneities at the population level can be fine-tuned under limited serum conditions by perturbing the cell cycle network using a p38-signaling inhibitor without affecting the robust lineage-level correlations. Overall, our approach identifies transcriptional fluctuations as the key controlling factor for the cell cycle duration heterogeneities and predicts ways to reduce cell-to-cell variabilities by perturbing the cell cycle network regulations.
Collapse
Affiliation(s)
| | - Subrot Sarma
- Department of Chemistry, IIT Bombay, Powai, Mumbai 400076, India
| | - Atharva Karulkar
- Department of Biosciences and Bioengineering, IIT Bombay, Powai, Mumbai 400076, India
| | - Rahul Purwar
- Department of Biosciences and Bioengineering, IIT Bombay, Powai, Mumbai 400076, India
| | - Sandip Kar
- Department of Chemistry, IIT Bombay, Powai, Mumbai 400076, India
| |
Collapse
|
32
|
Fu X, Patel HP, Coppola S, Xu L, Cao Z, Lenstra TL, Grima R. Quantifying how post-transcriptional noise and gene copy number variation bias transcriptional parameter inference from mRNA distributions. eLife 2022; 11:e82493. [PMID: 36250630 PMCID: PMC9648968 DOI: 10.7554/elife.82493] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2022] [Accepted: 10/14/2022] [Indexed: 11/13/2022] Open
Abstract
Transcriptional rates are often estimated by fitting the distribution of mature mRNA numbers measured using smFISH (single molecule fluorescence in situ hybridization) with the distribution predicted by the telegraph model of gene expression, which defines two promoter states of activity and inactivity. However, fluctuations in mature mRNA numbers are strongly affected by processes downstream of transcription. In addition, the telegraph model assumes one gene copy but in experiments, cells may have two gene copies as cells replicate their genome during the cell cycle. While it is often presumed that post-transcriptional noise and gene copy number variation affect transcriptional parameter estimation, the size of the error introduced remains unclear. To address this issue, here we measure both mature and nascent mRNA distributions of GAL10 in yeast cells using smFISH and classify each cell according to its cell cycle phase. We infer transcriptional parameters from mature and nascent mRNA distributions, with and without accounting for cell cycle phase and compare the results to live-cell transcription measurements of the same gene. We find that: (i) correcting for cell cycle dynamics decreases the promoter switching rates and the initiation rate, and increases the fraction of time spent in the active state, as well as the burst size; (ii) additional correction for post-transcriptional noise leads to further increases in the burst size and to a large reduction in the errors in parameter estimation. Furthermore, we outline how to correctly adjust for measurement noise in smFISH due to uncertainty in transcription site localisation when introns cannot be labelled. Simulations with parameters estimated from nascent smFISH data, which is corrected for cell cycle phases and measurement noise, leads to autocorrelation functions that agree with those obtained from live-cell imaging.
Collapse
Affiliation(s)
- Xiaoming Fu
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and TechnologyShanghaiChina
- School of Biological Sciences, University of EdinburghEdinburghUnited Kingdom
- Center for Advanced Systems Understanding, Helmholtz-Zentrum Dresden-RossendorfGörlitzGermany
| | - Heta P Patel
- The Netherlands Cancer Institute, Oncode Institute, Division of Gene RegulationAmsterdamNetherlands
| | - Stefano Coppola
- The Netherlands Cancer Institute, Oncode Institute, Division of Gene RegulationAmsterdamNetherlands
| | - Libin Xu
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and TechnologyShanghaiChina
| | - Zhixing Cao
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, East China University of Science and TechnologyShanghaiChina
| | - Tineke L Lenstra
- The Netherlands Cancer Institute, Oncode Institute, Division of Gene RegulationAmsterdamNetherlands
| | - Ramon Grima
- School of Biological Sciences, University of EdinburghEdinburghUnited Kingdom
| |
Collapse
|
33
|
Concentration fluctuations in growing and dividing cells: Insights into the emergence of concentration homeostasis. PLoS Comput Biol 2022; 18:e1010574. [PMID: 36194626 PMCID: PMC9565450 DOI: 10.1371/journal.pcbi.1010574] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2022] [Revised: 10/14/2022] [Accepted: 09/14/2022] [Indexed: 11/19/2022] Open
Abstract
Intracellular reaction rates depend on concentrations and hence their levels are often regulated. However classical models of stochastic gene expression lack a cell size description and cannot be used to predict noise in concentrations. Here, we construct a model of gene product dynamics that includes a description of cell growth, cell division, size-dependent gene expression, gene dosage compensation, and size control mechanisms that can vary with the cell cycle phase. We obtain expressions for the approximate distributions and power spectra of concentration fluctuations which lead to insight into the emergence of concentration homeostasis. We find that (i) the conditions necessary to suppress cell division-induced concentration oscillations are difficult to achieve; (ii) mRNA concentration and number distributions can have different number of modes; (iii) two-layer size control strategies such as sizer-timer or adder-timer are ideal because they maintain constant mean concentrations whilst minimising concentration noise; (iv) accurate concentration homeostasis requires a fine tuning of dosage compensation, replication timing, and size-dependent gene expression; (v) deviations from perfect concentration homeostasis show up as deviations of the concentration distribution from a gamma distribution. Some of these predictions are confirmed using data for E. coli, fission yeast, and budding yeast.
Collapse
|
34
|
Gorin G, Fang M, Chari T, Pachter L. RNA velocity unraveled. PLoS Comput Biol 2022; 18:e1010492. [PMID: 36094956 PMCID: PMC9499228 DOI: 10.1371/journal.pcbi.1010492] [Citation(s) in RCA: 75] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 09/22/2022] [Accepted: 08/14/2022] [Indexed: 11/24/2022] Open
Abstract
We perform a thorough analysis of RNA velocity methods, with a view towards understanding the suitability of the various assumptions underlying popular implementations. In addition to providing a self-contained exposition of the underlying mathematics, we undertake simulations and perform controlled experiments on biological datasets to assess workflow sensitivity to parameter choices and underlying biology. Finally, we argue for a more rigorous approach to RNA velocity, and present a framework for Markovian analysis that points to directions for improvement and mitigation of current problems.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Meichen Fang
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Tara Chari
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, California, United States of America
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California, United States of America
| |
Collapse
|
35
|
Sun P, Zou W. Research progress of live-cell RNA imaging techniques. Zhejiang Da Xue Xue Bao Yi Xue Ban 2022; 51:362-372. [PMID: 36207827 PMCID: PMC9511491 DOI: 10.3724/zdxbyxb-2022-0017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Accepted: 04/12/2022] [Indexed: 06/16/2023]
Abstract
RNA molecules play diverse roles in many physiological and pathological processes as they interact with various nucleic acids and proteins. The various biological processes of RNA are highly dynamic. Tracking RNA dynamics in living cells is crucial for a better understanding of the spatiotemporal control of gene expression and the regulatory roles of RNA. Genetically encoded RNA-tagging systems include MS2/MCP, PP7/PCP, boxB/λN22 and CRISPR-Cas. The MS2/MCP system is the most widely applied, and it has the advantages of stable binding and high signal-to-noise ratio, while the realization of RNA imaging requires gene editing of the target RNA, which may change the characteristics of the target RNA. Recently developed CRISPR-dCas13 system does not require RNA modification, but the uncertainty in CRISPR RNA (crRNA) efficiency and low signal-to-noise ratio are its limitations. Fluorescent dye-based RNA-tagging systems include molecular beacons and fluorophore-binding aptamers. The molecular beacons have high specificity and high signal-to-noise ratio; Mango and Peppers outperform the other RNA-tagging system in signal-to-noise, but they also need gene editing. Live-cell RNA imaging allows us to visualize critical steps of RNA activities, including transcription, splicing, transport, translation (for message RNA only) and subcellular localization. It will contribute to studying biological processes such as cell differentiation and the transcriptional regulation mechanism when cells adapt to the external environment, and it improves our understanding of the pathogenic mechanism of various diseases caused by abnormal RNA behavior and helps to find potential therapeutic targets. This review provides an overview of current progress of live-cell RNA imaging techniques and highlights their major strengths and limitations.
Collapse
Affiliation(s)
- Pingping Sun
- 1. The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu 322000, Zhejiang Province, China
- 2. Institute of Translational Medicine, Zhejiang University, Hangzhou 310058, China
| | - Wei Zou
- 1. The Fourth Affiliated Hospital, Zhejiang University School of Medicine, Yiwu 322000, Zhejiang Province, China
- 2. Institute of Translational Medicine, Zhejiang University, Hangzhou 310058, China
| |
Collapse
|
36
|
Mechanisms of cellular mRNA transcript homeostasis. Trends Cell Biol 2022; 32:655-668. [PMID: 35660047 DOI: 10.1016/j.tcb.2022.05.003] [Citation(s) in RCA: 30] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 05/08/2022] [Accepted: 05/09/2022] [Indexed: 11/20/2022]
Abstract
For most genes, mRNA transcript abundance scales with cell size to ensure a constant concentration. Scaling of mRNA synthesis rates with cell size plays an important role, with regulation of the activity and abundance of RNA polymerase II (Pol II) now emerging as a key point of control. However, there is also considerable evidence for feedback mechanisms that kinetically couple the rates of mRNA synthesis, nuclear export, and degradation to allow cells to compensate for changes in one by adjusting the others. Researchers are beginning to integrate results from these different fields to reveal the mechanisms underlying transcript homeostasis. This will be crucial for moving beyond our current understanding of relative gene expression towards an appreciation of how absolute transcript levels are linked to other aspects of the cellular phenotype.
Collapse
|
37
|
Gorin G, Pachter L. Modeling bursty transcription and splicing with the chemical master equation. Biophys J 2022; 121:1056-1069. [PMID: 35143775 PMCID: PMC8943761 DOI: 10.1016/j.bpj.2022.02.004] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 11/29/2021] [Accepted: 02/03/2022] [Indexed: 11/16/2022] Open
Abstract
Splicing cascades that alter gene products posttranscriptionally also affect expression dynamics. We study a class of processes and associated distributions that emerge from models of bursty promoters coupled to directed acyclic graphs of splicing. These solutions provide full time-dependent joint distributions for an arbitrary number of species with general noise behaviors and transient phenomena, offering qualitative and quantitative insights about how splicing can regulate expression dynamics. Finally, we derive a set of quantitative constraints on the minimum complexity necessary to reproduce gene coexpression patterns using synchronized burst models. We validate these findings by analyzing long-read sequencing data, where we find evidence of expression patterns largely consistent with these constraints.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California
| | - Lior Pachter
- Division of Biology and Biological Engineering & Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California.
| |
Collapse
|
38
|
Hettich J, Gebhardt JCM. Periodic synchronization of isolated network elements facilitates simulating and inferring gene regulatory networks including stochastic molecular kinetics. BMC Bioinformatics 2022; 23:13. [PMID: 34986805 PMCID: PMC8729106 DOI: 10.1186/s12859-021-04541-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 12/16/2021] [Indexed: 11/10/2022] Open
Abstract
Background The temporal progression of many fundamental processes in cells and organisms, including homeostasis, differentiation and development, are governed by gene regulatory networks (GRNs). GRNs balance fluctuations in the output of their genes, which trace back to the stochasticity of molecular interactions. Although highly desirable to understand life processes, predicting the temporal progression of gene products within a GRN is challenging when considering stochastic events such as transcription factor–DNA interactions or protein production and degradation.
Results We report a method to simulate and infer GRNs including genes and biochemical reactions at molecular detail. In our approach, we consider each network element to be isolated from other elements during small time intervals, after which we synchronize molecule numbers across all network elements. Thereby, the temporal behaviour of network elements is decoupled and can be treated by local stochastic or deterministic solutions. We demonstrate the working principle of this modular approach with a repressive gene cascade comprising four genes. By considering a deterministic time evolution within each time interval for all elements, our method approaches the solution of the system of deterministic differential equations associated with the GRN. By allowing genes to stochastically switch between on and off states or by considering stochastic production of gene outputs, we are able to include increasing levels of stochastic detail and approximate the solution of a Gillespie simulation. Thereby, CaiNet is able to reproduce noise-induced bi-stability and oscillations in dynamically complex GRNs. Notably, our modular approach further allows for a simple consideration of deterministic delays. We further infer relevant regulatory connections and steady-state parameters of a GRN of up to ten genes from steady-state measurements by identifying each gene of the network with a single perceptron in an artificial neuronal network and using a gradient decent method originally designed to train recurrent neural networks. To facilitate setting up GRNs and using our simulation and inference method, we provide a fast computer-aided interactive network simulation environment, CaiNet. Conclusion We developed a method to simulate GRNs at molecular detail and to infer the topology and steady-state parameters of GRNs. Our method and associated user-friendly framework CaiNet should prove helpful to analyze or predict the temporal progression of reaction networks or GRNs in cellular and organismic biology. CaiNet is freely available at https://gitlab.com/GebhardtLab/CaiNet. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04541-6.
Collapse
Affiliation(s)
- Johannes Hettich
- Institute of Biophysics, Ulm University, Albert-Einstein-Allee 11, 89081, Ulm, Germany
| | - J Christof M Gebhardt
- Institute of Biophysics, Ulm University, Albert-Einstein-Allee 11, 89081, Ulm, Germany.
| |
Collapse
|
39
|
Ham L, Jackson M, Stumpf MPH. Pathway dynamics can delineate the sources of transcriptional noise in gene expression. eLife 2021; 10:e69324. [PMID: 34636320 PMCID: PMC8608387 DOI: 10.7554/elife.69324] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2021] [Accepted: 10/11/2021] [Indexed: 11/25/2022] Open
Abstract
Single-cell expression profiling opens up new vistas on cellular processes. Extensive cell-to-cell variability at the transcriptomic and proteomic level has been one of the stand-out observations. Because most experimental analyses are destructive we only have access to snapshot data of cellular states. This loss of temporal information presents significant challenges for inferring dynamics, as well as causes of cell-to-cell variability. In particular, we typically cannot separate dynamic variability from within cells ('intrinsic noise') from variability across the population ('extrinsic noise'). Here, we make this non-identifiability mathematically precise, allowing us to identify new experimental set-ups that can assist in resolving this non-identifiability. We show that multiple generic reporters from the same biochemical pathways (e.g. mRNA and protein) can infer magnitudes of intrinsic and extrinsic transcriptional noise, identifying sources of heterogeneity. Stochastic simulations support our theory, and demonstrate that 'pathway-reporters' compare favourably to the well-known, but often difficult to implement, dual-reporter method.
Collapse
Affiliation(s)
- Lucy Ham
- School of BioSciences, University of MelbourneMelbourneAustralia
| | - Marcel Jackson
- Department of Mathematics and Statistics, La Trobe UniversityMelbourneAustralia
| | - Michael PH Stumpf
- School of Mathematics and Statistics, University of MelbourneMelbourneAustralia
| |
Collapse
|
40
|
Braichenko S, Holehouse J, Grima R. Distinguishing between models of mammalian gene expression: telegraph-like models versus mechanistic models. J R Soc Interface 2021; 18:20210510. [PMID: 34610262 PMCID: PMC8492181 DOI: 10.1098/rsif.2021.0510] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Two-state models (telegraph-like models) have a successful history of predicting distributions of cellular and nascent mRNA numbers that can well fit experimental data. These models exclude key rate limiting steps, and hence it is unclear why they are able to accurately predict the number distributions. To answer this question, here we compare these models to a novel stochastic mechanistic model of transcription in mammalian cells that presents a unified description of transcriptional factor, polymerase and mature mRNA dynamics. We show that there is a large region of parameter space where the first, second and third moments of the distributions of the waiting times between two consecutively produced transcripts (nascent or mature) of two-state and mechanistic models exactly match. In this region: (i) one can uniquely express the two-state model parameters in terms of those of the mechanistic model, (ii) the models are practically indistinguishable by comparison of their transcript numbers distributions, and (iii) they are distinguishable from the shape of their waiting time distributions. Our results clarify the relationship between different gene expression models and identify a means to select between them from experimental data.
Collapse
Affiliation(s)
- Svitlana Braichenko
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK.,School of Informatics, University of Edinburgh, Edinburgh, UK
| | - James Holehouse
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh, UK
| |
Collapse
|
41
|
Kohrman AQ, Kim-Yip RP, Posfai E. Imaging developmental cell cycles. Biophys J 2021; 120:4149-4161. [PMID: 33964274 PMCID: PMC8516676 DOI: 10.1016/j.bpj.2021.04.035] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 04/14/2021] [Accepted: 04/30/2021] [Indexed: 01/05/2023] Open
Abstract
The last decade has seen a major expansion in development of live biosensors, the tools needed to genetically encode them into model organisms, and the microscopic techniques used to visualize them. When combined, these offer us powerful tools with which to make fundamental discoveries about complex biological processes. In this review, we summarize the availability of biosensors to visualize an essential cellular process, the cell cycle, and the techniques for single-cell tracking and quantification of these reporters. We also highlight studies investigating the connection of cellular behavior to the cell cycle, particularly through live imaging, and anticipate exciting discoveries with the combination of these technologies in developmental contexts.
Collapse
Affiliation(s)
- Abraham Q Kohrman
- Department of Molecular Biology, Princeton University, Princeton, New Jersey
| | - Rebecca P Kim-Yip
- Department of Molecular Biology, Princeton University, Princeton, New Jersey
| | - Eszter Posfai
- Department of Molecular Biology, Princeton University, Princeton, New Jersey.
| |
Collapse
|
42
|
Wheat JC, Steidl U. Gene expression at a single-molecule level: implications for myelodysplastic syndromes and acute myeloid leukemia. Blood 2021; 138:625-636. [PMID: 34436525 PMCID: PMC8394909 DOI: 10.1182/blood.2019004261] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2020] [Accepted: 12/03/2020] [Indexed: 12/11/2022] Open
Abstract
Nongenetic heterogeneity, or gene expression stochasticity, is an important source of variability in biological systems. With the advent and improvement of single molecule resolution technologies, it has been shown that transcription dynamics and resultant transcript number fluctuations generate significant cell-to-cell variability that has important biological effects and may contribute substantially to both tissue homeostasis and disease. In this respect, the pathophysiology of stem cell-derived malignancies such as acute myeloid leukemia and myelodysplastic syndromes, which has historically been studied at the ensemble level, may require reevaluation. To that end, it is our aim in this review to highlight the results of recent single-molecule, biophysical, and systems studies of gene expression dynamics, with the explicit purpose of demonstrating how the insights from these basic science studies may help inform and progress the field of leukemia biology and, ultimately, research into novel therapies.
Collapse
Affiliation(s)
- Justin C Wheat
- Albert Einstein College of Medicine - Montefiore Health System, Bronx, NY
| | - Ulrich Steidl
- Albert Einstein College of Medicine - Montefiore Health System, Bronx, NY
| |
Collapse
|
43
|
Jiao F, Lin G, Yu J. Approximating gene transcription dynamics using steady-state formulas. Phys Rev E 2021; 104:014401. [PMID: 34412315 DOI: 10.1103/physreve.104.014401] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Accepted: 06/02/2021] [Indexed: 01/01/2023]
Abstract
Understanding how genes in a single cell respond to dynamically changing signals has been a central question in stochastic gene transcription research. Recent studies have generated massive steady-state or snapshot mRNA distribution data of individual cells, and inferred a large spectrum of kinetic transcription parameters under varying conditions. However, there have been few algorithms to convert these static data into the temporal variation of kinetic rates. Real-time imaging has been developed to monitor stochastic transcription processes at the single-cell level, but the immense technicality has prevented its application to most endogenous loci in mammalian cells. In this article, we introduced a stochastic gene transcription model with variable kinetic rates induced by unstable cellular conditions. We approximated the transcription dynamics using easily obtained steady-state formulas in the model. We tested the approximation against experimental data in both prokaryotic and eukaryotic cells and further solidified the conditions that guarantee the robustness of the method. The method can be easily implemented to provide convenient tools for quantifying dynamic kinetics and mechanisms underlying the widespread static transcription data, and may shed a light on circumventing the limitation of current bursting data on transcriptional real-time imaging.
Collapse
Affiliation(s)
- Feng Jiao
- Center for Applied Mathematics, Guangzhou University, Guangzhou 510006, People's Republic of China.,College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, People's Republic of China
| | - Genghong Lin
- Center for Applied Mathematics, Guangzhou University, Guangzhou 510006, People's Republic of China.,College of Mathematics and Information Sciences, Guangzhou University, Guangzhou 51006, People's Republic of China
| | - Jianshe Yu
- Center for Applied Mathematics, Guangzhou University, Guangzhou 510006, People's Republic of China
| |
Collapse
|
44
|
Yang J, Lee J, Land MA, Lai S, Igoshin OA, St-Pierre F. A synthetic circuit for buffering gene dosage variation between individual mammalian cells. Nat Commun 2021; 12:4132. [PMID: 34226556 PMCID: PMC8257781 DOI: 10.1038/s41467-021-23889-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Accepted: 05/19/2021] [Indexed: 02/06/2023] Open
Abstract
Precise control of gene expression is critical for biological research and biotechnology. However, transient plasmid transfections in mammalian cells produce a wide distribution of copy numbers per cell, and consequently, high expression heterogeneity. Here, we report plasmid-based synthetic circuits - Equalizers - that buffer copy-number variation at the single-cell level. Equalizers couple a transcriptional negative feedback loop with post-transcriptional incoherent feedforward control. Computational modeling suggests that the combination of these two topologies enables Equalizers to operate over a wide range of plasmid copy numbers. We demonstrate experimentally that Equalizers outperform other gene dosage compensation topologies and produce as low cell-to-cell variation as chromosomally integrated genes. We also show that episome-encoded Equalizers enable the rapid generation of extrachromosomal cell lines with stable and uniform expression. Overall, Equalizers are simple and versatile devices for homogeneous gene expression and can facilitate the engineering of synthetic circuits that function reliably in every cell.
Collapse
Affiliation(s)
- Jin Yang
- Department of Bioengineering, Rice University, Houston, TX, USA
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Jihwan Lee
- Systems, Synthetic, and Physical Biology Program, Rice University, Houston, TX, USA
| | - Michelle A Land
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Shujuan Lai
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA
| | - Oleg A Igoshin
- Department of Bioengineering, Rice University, Houston, TX, USA
- Systems, Synthetic, and Physical Biology Program, Rice University, Houston, TX, USA
- Department of Biosciences, Rice University, Houston, TX, USA
- Department of Chemistry, Rice University, Houston, TX, USA
| | - François St-Pierre
- Systems, Synthetic, and Physical Biology Program, Rice University, Houston, TX, USA.
- Department of Neuroscience, Baylor College of Medicine, Houston, TX, USA.
- Department of Electrical and Computer Engineering, Rice University, Houston, TX, USA.
- Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
45
|
Prieto C, Nguyen DTT, Liu Z, Wheat J, Perez A, Gourkanti S, Chou T, Barin E, Velleca A, Rohwetter T, Chow A, Taggart J, Savino AM, Hoskova K, Dhodapkar M, Schurer A, Barlowe TS, Vu LP, Leslie C, Steidl U, Rabadan R, Kharas MG. Transcriptional control of CBX5 by the RNA binding proteins RBMX and RBMXL1 maintains chromatin state in myeloid leukemia. NATURE CANCER 2021; 2:741-757. [PMID: 34458856 PMCID: PMC8388313 DOI: 10.1038/s43018-021-00220-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/04/2020] [Accepted: 05/11/2021] [Indexed: 01/08/2023]
Abstract
RNA binding proteins (RBPs) are key arbiters of post-transcriptional regulation and are found to be found dysregulated in hematological malignancies. Here, we identify the RBP RBMX and its retrogene RBMXL1 to be required for murine and human myeloid leukemogenesis. RBMX/L1 are overexpressed in acute myeloid leukemia (AML) primary patients compared to healthy individuals, and RBMX/L1 loss delayed leukemia development. RBMX/L1 loss lead to significant changes in chromatin accessibility, as well as chromosomal breaks and gaps. We found that RBMX/L1 directly bind to mRNAs, affect transcription of multiple loci, including CBX5 (HP1α), and control the nascent transcription of the CBX5 locus. Forced CBX5 expression rescued the RBMX/L1 depletion effects on cell growth and apoptosis. Overall, we determine that RBMX/L1 control leukemia cell survival by regulating chromatin state through their downstream target CBX5. These findings identify a mechanism for RBPs directly promoting transcription and suggest RBMX/L1, as well as CBX5, as potential therapeutic targets in myeloid malignancies.
Collapse
Affiliation(s)
- Camila Prieto
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Diu T T Nguyen
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Zhaoqi Liu
- Program for Mathematical Genomics, Department of Systems Biology, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY, USA
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, China National Center for Bioinformation, Beijing 100101, China
| | - Justin Wheat
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, NY USA
| | - Alexendar Perez
- Computational Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Saroj Gourkanti
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Timothy Chou
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ersilia Barin
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Anthony Velleca
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Thomas Rohwetter
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Arthur Chow
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - James Taggart
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Angela M Savino
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Katerina Hoskova
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Meera Dhodapkar
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Alexandra Schurer
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Trevor S Barlowe
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ly P Vu
- Terry Fox Laboratory, British Columbia Cancer Research Centre, Vancouver, BC, Canada; Molecular Biology and Biochemistry, Simon Fraser University, Vancouver, BC, Canada
| | - Christina Leslie
- Computational Biology Program, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Ulrich Steidl
- Department of Cell Biology, Albert Einstein College of Medicine, Bronx, NY USA
| | - Raul Rabadan
- Program for Mathematical Genomics, Department of Systems Biology, Department of Biomedical Informatics, Columbia University Medical Center, New York, NY, USA
| | - Michael G Kharas
- Molecular Pharmacology Program and Center for Cell Engineering, Center for Stem Cell Biology, Center for Experimental Therapeutics, Center for Hematologic Malignancies, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| |
Collapse
|
46
|
Popp AP, Hettich J, Gebhardt J. Altering transcription factor binding reveals comprehensive transcriptional kinetics of a basic gene. Nucleic Acids Res 2021; 49:6249-6266. [PMID: 34060631 PMCID: PMC8216454 DOI: 10.1093/nar/gkab443] [Citation(s) in RCA: 53] [Impact Index Per Article: 13.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2020] [Revised: 05/03/2021] [Accepted: 05/06/2021] [Indexed: 12/17/2022] Open
Abstract
Transcription is a vital process activated by transcription factor (TF) binding. The active gene releases a burst of transcripts before turning inactive again. While the basic course of transcription is well understood, it is unclear how binding of a TF affects the frequency, duration and size of a transcriptional burst. We systematically varied the residence time and concentration of a synthetic TF and characterized the transcription of a synthetic reporter gene by combining single molecule imaging, single molecule RNA-FISH, live transcript visualisation and analysis with a novel algorithm, Burst Inference from mRNA Distributions (BIRD). For this well-defined system, we found that TF binding solely affected burst frequency and variations in TF residence time had a stronger influence than variations in concentration. This enabled us to device a model of gene transcription, in which TF binding triggers multiple successive steps before the gene transits to the active state and actual mRNA synthesis is decoupled from TF presence. We quantified all transition times of the TF and the gene, including the TF search time and the delay between TF binding and the onset of transcription. Our quantitative measurements and analysis revealed detailed kinetic insight, which may serve as basis for a bottom-up understanding of gene regulation.
Collapse
Affiliation(s)
- Achim P Popp
- Institute of Biophysics, Ulm University, Albert-Einstein-Allee 11, 89081 Ulm, Germany
| | - Johannes Hettich
- Institute of Biophysics, Ulm University, Albert-Einstein-Allee 11, 89081 Ulm, Germany
| | - J Christof M Gebhardt
- Institute of Biophysics, Ulm University, Albert-Einstein-Allee 11, 89081 Ulm, Germany
| |
Collapse
|
47
|
Liu J, Hansen D, Eck E, Kim YJ, Turner M, Alamos S, Garcia HG. Real-time single-cell characterization of the eukaryotic transcription cycle reveals correlations between RNA initiation, elongation, and cleavage. PLoS Comput Biol 2021; 17:e1008999. [PMID: 34003867 PMCID: PMC8162642 DOI: 10.1371/journal.pcbi.1008999] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2020] [Revised: 05/28/2021] [Accepted: 04/23/2021] [Indexed: 12/23/2022] Open
Abstract
The eukaryotic transcription cycle consists of three main steps: initiation, elongation, and cleavage of the nascent RNA transcript. Although each of these steps can be regulated as well as coupled with each other, their in vivo dissection has remained challenging because available experimental readouts lack sufficient spatiotemporal resolution to separate the contributions from each of these steps. Here, we describe a novel application of Bayesian inference techniques to simultaneously infer the effective parameters of the transcription cycle in real time and at the single-cell level using a two-color MS2/PP7 reporter gene and the developing fruit fly embryo as a case study. Our method enables detailed investigations into cell-to-cell variability in transcription-cycle parameters as well as single-cell correlations between these parameters. These measurements, combined with theoretical modeling, suggest a substantial variability in the elongation rate of individual RNA polymerase molecules. We further illustrate the power of this technique by uncovering a novel mechanistic connection between RNA polymerase density and nascent RNA cleavage efficiency. Thus, our approach makes it possible to shed light on the regulatory mechanisms in play during each step of the transcription cycle in individual, living cells at high spatiotemporal resolution.
Collapse
Affiliation(s)
- Jonathan Liu
- Department of Physics, University of California at Berkeley, Berkeley, California, United States of America
| | - Donald Hansen
- Institute of Pharmacy and Molecular Biotechnology, University of Heidelberg, Heidelberg, Germany
| | - Elizabeth Eck
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, United States of America
| | - Yang Joon Kim
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, United States of America
| | - Meghan Turner
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, United States of America
| | - Simon Alamos
- Department of Plant and Microbial Biology, University of California at Berkeley, Berkeley, California, United States of America
| | - Hernan G. Garcia
- Department of Physics, University of California at Berkeley, Berkeley, California, United States of America
- Biophysics Graduate Group, University of California at Berkeley, Berkeley, California, United States of America
- Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, California, United States of America
- Institute for Quantitative Biosciences-QB3, University of California at Berkeley, Berkeley, California, United States of America
| |
Collapse
|
48
|
Direct Nanopore Sequencing of mRNA Reveals Landscape of Transcript Isoforms in Apicomplexan Parasites. mSystems 2021; 6:6/2/e01081-20. [PMID: 33688018 PMCID: PMC8561664 DOI: 10.1128/msystems.01081-20] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Alternative splicing is a widespread phenomenon in metazoans by which single genes are able to produce multiple isoforms of the gene product. However, this has been poorly characterized in apicomplexans, a major phylum of some of the most important global parasites. Efforts have been hampered by atypical transcriptomic features, such as the high AU content of Plasmodium RNA, but also the limitations of short-read sequencing in deciphering complex splicing events. In this study, we utilized the long read direct RNA sequencing platform developed by Oxford Nanopore Technologies to survey the alternative splicing landscape of Toxoplasma gondii and Plasmodium falciparum. We find that while native RNA sequencing has a reduced throughput, it allows us to obtain full-length or nearly full-length transcripts with comparable quantification to Illumina sequencing. By comparing these data with available gene models, we find widespread alternative splicing, particularly intron retention, in these parasites. Most of these transcripts contain premature stop codons, suggesting that in these parasites, alternative splicing represents a pathway to transcriptomic diversity, rather than expanding proteomic diversity. Moreover, alternative splicing rates are comparable between parasites, suggesting a shared splicing machinery, despite notable transcriptomic differences between the parasites. This study highlights a strategy in using long-read sequencing to understand splicing events at the whole-transcript level and has implications in the future interpretation of transcriptome sequencing studies. IMPORTANCE We have used a novel nanopore sequencing technology to directly analyze parasite transcriptomes. The very long reads of this technology reveal the full-length genes of the parasites that cause malaria and toxoplasmosis. Gene transcripts must be processed in a process called splicing before they can be translated to protein. Our analysis reveals that these parasites very frequently only partially process their gene products, in a manner that departs dramatically from their human hosts.
Collapse
|
49
|
Wang X, He Y, Zhang Q, Ren X, Zhang Z. Direct Comparative Analyses of 10X Genomics Chromium and Smart-seq2. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:253-266. [PMID: 33662621 PMCID: PMC8602399 DOI: 10.1016/j.gpb.2020.02.005] [Citation(s) in RCA: 175] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Revised: 11/14/2019] [Accepted: 04/29/2020] [Indexed: 11/28/2022]
Abstract
Single-cell RNA sequencing (scRNA-seq) is generally used for profiling transcriptome of individual cells. The droplet-based 10X Genomics Chromium (10X) approach and the plate-based Smart-seq2 full-length method are two frequently used scRNA-seq platforms, yet there are only a few thorough and systematic comparisons of their advantages and limitations. Here, by directly comparing the scRNA-seq data generated by these two platforms from the same samples of CD45- cells, we systematically evaluated their features using a wide spectrum of analyses. Smart-seq2 detected more genes in a cell, especially low abundance transcripts as well as alternatively spliced transcripts, but captured higher proportion of mitochondrial genes. The composite of Smart-seq2 data also resembled bulk RNA-seq data more. For 10X-based data, we observed higher noise for mRNAs with low expression levels. Approximately 10%-30% of all detected transcripts by both platforms were from non-coding genes, with long non-coding RNAs (lncRNAs) accounting for a higher proportion in 10X. 10X-based data displayed more severe dropout problem, especially for genes with lower expression levels. However, 10X-data can detect rare cell types given its ability to cover a large number of cells. In addition, each platform detected distinct groups of differentially expressed genes between cell clusters, indicating the different characteristics of these technologies. Our study promotes better understanding of these two platforms and offers the basis for an informed choice of these widely used technologies.
Collapse
Affiliation(s)
- Xiliang Wang
- BIOPIC, Beijing Advanced Innovation Center for Genomics, and School of Life Sciences, Peking University, Beijing 100871, China
| | - Yao He
- Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Qiming Zhang
- BIOPIC, Beijing Advanced Innovation Center for Genomics, and School of Life Sciences, Peking University, Beijing 100871, China
| | - Xianwen Ren
- BIOPIC, Beijing Advanced Innovation Center for Genomics, and School of Life Sciences, Peking University, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China
| | - Zemin Zhang
- BIOPIC, Beijing Advanced Innovation Center for Genomics, and School of Life Sciences, Peking University, Beijing 100871, China; Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing 100871, China.
| |
Collapse
|
50
|
Cavallaro M, Walsh MD, Jones M, Teahan J, Tiberi S, Finkenstädt B, Hebenstreit D. 3 '-5 ' crosstalk contributes to transcriptional bursting. Genome Biol 2021; 22:56. [PMID: 33541397 PMCID: PMC7860045 DOI: 10.1186/s13059-020-02227-5] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 12/08/2020] [Indexed: 12/15/2022] Open
Abstract
BACKGROUND Transcription in mammalian cells is a complex stochastic process involving shuttling of polymerase between genes and phase-separated liquid condensates. It occurs in bursts, which results in vastly different numbers of an mRNA species in isogenic cell populations. Several factors contributing to transcriptional bursting have been identified, usually classified as intrinsic, in other words local to single genes, or extrinsic, relating to the macroscopic state of the cell. However, some possible contributors have not been explored yet. Here, we focus on processes at the 3 ' and 5 ' ends of a gene that enable reinitiation of transcription upon termination. RESULTS Using Bayesian methodology, we measure the transcriptional bursting in inducible transgenes, showing that perturbation of polymerase shuttling typically reduces burst size, increases burst frequency, and thus limits transcriptional noise. Analysis based on paired-end tag sequencing (PolII ChIA-PET) suggests that this effect is genome wide. The observed noise patterns are also reproduced by a generative model that captures major characteristics of the polymerase flux between the ends of a gene and a phase-separated compartment. CONCLUSIONS Interactions between the 3 ' and 5 ' ends of a gene, which facilitate polymerase recycling, are major contributors to transcriptional noise.
Collapse
Affiliation(s)
- Massimo Cavallaro
- School of Life Sciences, University of Warwick, Coventry, UK.
- Mathematics Institute and Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research, University of Warwick, Coventry, UK.
- Department of Statistics, University of Warwick, Coventry, UK.
| | - Mark D Walsh
- School of Life Sciences, University of Warwick, Coventry, UK
| | - Matt Jones
- School of Life Sciences, University of Warwick, Coventry, UK
| | - James Teahan
- Department of Chemistry, University of Warwick, Coventry, UK
| | - Simone Tiberi
- Institute of Molecular Life Sciences and Swiss Institute of Bioinformatics, University of Zurich, Zurich, Switzerland
| | | | | |
Collapse
|