1
|
Szavits-Nossan J, Grima R. Uncovering the effect of RNA polymerase steric interactions on gene expression noise: Analytical distributions of nascent and mature RNA numbers. Phys Rev E 2023; 108:034405. [PMID: 37849194 DOI: 10.1103/physreve.108.034405] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Accepted: 08/24/2023] [Indexed: 10/19/2023]
Abstract
The telegraph model is the standard model of stochastic gene expression, which can be solved exactly to obtain the distribution of mature RNA numbers per cell. A modification of this model also leads to an analytical distribution of nascent RNA numbers. These solutions are routinely used for the analysis of single-cell data, including the inference of transcriptional parameters. However, these models neglect important mechanistic features of transcription elongation, such as the stochastic movement of RNA polymerases and their steric (excluded-volume) interactions. Here we construct a model of gene expression describing promoter switching between inactive and active states, binding of RNA polymerases in the active state, their stochastic movement including steric interactions along the gene, and their unbinding leading to a mature transcript that subsequently decays. We derive the steady-state distributions of the nascent and mature RNA numbers in two important limiting cases: constitutive expression and slow promoter switching. We show that RNA fluctuations are suppressed by steric interactions between RNA polymerases, and that this suppression can in some instances even lead to sub-Poissonian fluctuations; these effects are most pronounced for nascent RNA and less prominent for mature RNA, since the latter is not a direct sensor of transcription. We find a relationship between the parameters of our microscopic mechanistic model and those of the standard models that ensures excellent consistency in their prediction of the first and second RNA number moments over vast regions of parameter space, encompassing slow, intermediate, and rapid promoter switching, provided the RNA number distributions are Poissonian or super-Poissonian. Furthermore, we identify the limitations of inference from mature RNA data, specifically showing that it cannot differentiate between highly distinct RNA polymerase traffic patterns on a gene.
Collapse
Affiliation(s)
- Juraj Szavits-Nossan
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, United Kingdom
| |
Collapse
|
2
|
Moy MA, Collins-McMillen D, Crawford L, Parkins C, Zeltzer S, Caviness K, Caposio P, Goodrum F. UL135 and UL136 Epistasis Controls Reactivation of Human Cytomegalovirus. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.24.525282. [PMID: 36747736 PMCID: PMC9900790 DOI: 10.1101/2023.01.24.525282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Human cytomegalovirus (HCMV) is beta herpesvirus that persists indefinitely in the human host through a protracted, latent infection. The polycistronic UL133-UL138 gene locus of HCMV encodes genes regulating latency and reactivation. While UL138 is pro-latency, restricting virus replication in CD34+ hematopoietic progenitor cells (HPCs), UL135 overcomes this restriction for reactivation. By contrast, UL136 is expressed with later kinetics and encodes multiple protein isoforms with differential roles in latency and reactivation. Like UL135, the largest UL136 isoform, UL136p33, is required for reactivation from latency in hematopoietic cells. Furthermore, UL136p33 is unstable, and its instability is important for the establishment of latency and sufficient accumulation of UL136p33 is a checkpoint for reactivation. We hypothesized that stabilizing UL136p33 might overcome the requirement of UL135 for reactivation. To test this, we generated recombinant viruses lacking UL135 that expressed a stabilized variant of UL136p33. Stabilizing UL136p33 did not impact replication of the UL135-mutant virus in fibroblasts. However, in the context of infection in hematopoietic cells, stabilization of UL136p33 strikingly compensated for the loss of UL135, resulting in increased replication in CD34+ HPCs and in humanized NOD- scid IL2Rγ c null (NSG) mice. This finding suggests that while UL135 is essential for reactivation, it functions at steps preceding the accumulation of UL136p33 and that stabilized expression of UL136p33 largely overcomes the requirement for UL135 in reactivation. Taken together, our genetic evidence indicates an epistatic relationship between UL136p33 and UL135 whereby UL135 may initiate events early in reactivation that will result in the accumulation of UL136p33 to a threshold required for productive reactivation. SIGNIFICANCE Human cytomegalovirus (HCMV) is one of nine human herpesviruses and a significant human pathogen. While HCMV establishes a life-long latent infection that is typically asymptomatic in healthy individuals, its reactivation from latency can have devastating consequences in the immune compromised. Defining virus-host and virus-virus interactions important for HCMV latency, reactivation and replication is critical to defining the molecular basis of latent and replicative states and in controlling infection and CMV disease. Here we define a genetic relationship between two viral genes in controlling virus reactivation from latency using primary human hematopoietic progenitor cell and humanized mouse models.
Collapse
|
3
|
Zhang J, Cavallaro M, Hebenstreit D. Timing RNA polymerase pausing with TV-PRO-seq. CELL REPORTS METHODS 2021; 1:None. [PMID: 34723238 PMCID: PMC8547241 DOI: 10.1016/j.crmeth.2021.100083] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/03/2019] [Revised: 08/03/2021] [Accepted: 08/18/2021] [Indexed: 11/28/2022]
Abstract
Transcription of many genes in metazoans is subject to polymerase pausing, which is the transient stop of transcriptionally engaged polymerases. This is known to mainly occur in promoter-proximal regions but it is not well understood. In particular, a genome-wide measurement of pausing times at high resolution has been lacking. We present here the time-variant precision nuclear run-on and sequencing (TV-PRO-seq) assay, an extension of the standard PRO-seq that allows us to estimate genome-wide pausing times at single-base resolution. Its application to human cells demonstrates that, proximal to promoters, polymerases pause more frequently but for shorter times than in other genomic regions. Comparison with single-cell gene expression data reveals that the polymerase pausing times are longer in highly expressed genes, while transcriptionally noisier genes have higher pausing frequencies and slightly longer pausing times. Analyses of histone modifications suggest that the marker H3K36me3 is related to the polymerase pausing.
Collapse
Affiliation(s)
- Jie Zhang
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
| | - Massimo Cavallaro
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
- Mathematics Institute and Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research, the University of Warwick, CV4 7AL Coventry, UK
| | - Daniel Hebenstreit
- School of Life Sciences, Gibbet Hill Campus, the University of Warwick, CV4 7AL Coventry, UK
| |
Collapse
|
4
|
Zhang T, Foreman R, Wollman R. Identifying chromatin features that regulate gene expression distribution. Sci Rep 2020; 10:20566. [PMID: 33239733 PMCID: PMC7688950 DOI: 10.1038/s41598-020-77638-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2020] [Accepted: 11/10/2020] [Indexed: 12/17/2022] Open
Abstract
Gene expression variability, differences in the number of mRNA per cell across a population of cells, is ubiquitous across diverse organisms with broad impacts on cellular phenotypes. The role of chromatin in regulating average gene expression has been extensively studied. However, what aspects of the chromatin contribute to gene expression variability is still underexplored. Here we addressed this problem by leveraging chromatin diversity and using a systematic investigation of randomly integrated expression reporters to identify what aspects of chromatin microenvironment contribute to gene expression variability. Using DNA barcoding and split-pool decoding, we created a large library of isogenic reporter clones and identified reporter integration sites in a massive and parallel manner. By mapping our measurements of reporter expression at different genomic loci with multiple epigenetic profiles including the enrichment of transcription factors and the distance to different chromatin states, we identified new factors that impact the regulation of gene expression distributions.
Collapse
Affiliation(s)
- Thanutra Zhang
- Institute for Quantitative and Computational Biosciences, UCLA, Los Angeles, CA, USA
| | - Robert Foreman
- Institute for Quantitative and Computational Biosciences, UCLA, Los Angeles, CA, USA
| | - Roy Wollman
- Institute for Quantitative and Computational Biosciences, UCLA, Los Angeles, CA, USA.
- Departments of Integrative Biology and Physiology and Chemistry and Biochemistry, UCLA, Los Angeles, CA, USA.
| |
Collapse
|
5
|
Kim S, Jacobs-Wagner C. Effects of mRNA Degradation and Site-Specific Transcriptional Pausing on Protein Expression Noise. Biophys J 2019; 114:1718-1729. [PMID: 29642040 DOI: 10.1016/j.bpj.2018.02.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 01/30/2018] [Accepted: 02/07/2018] [Indexed: 12/20/2022] Open
Abstract
Genetically identical cells exhibit diverse phenotypes even when experiencing the same environment. This phenomenon in part originates from cell-to-cell variability (noise) in protein expression. Although various kinetic schemes of stochastic transcription initiation are known to affect gene expression noise, how posttranscription initiation events contribute to noise at the protein level remains incompletely understood. To address this question, we developed a stochastic simulation-based model of bacterial gene expression that integrates well-known dependencies between transcription initiation, transcription elongation dynamics, mRNA degradation, and translation. We identified realistic conditions under which mRNA lifetime and transcriptional pauses modulate the protein expression noise initially introduced by the promoter architecture. For instance, we found that the short lifetime of bacterial mRNAs facilitates the production of protein bursts. Conversely, RNA polymerase (RNAP) pausing at specific sites during transcription elongation can attenuate protein bursts by fluidizing the RNAP traffic to the point of erasing the effect of a bursty promoter. Pause-prone sites, if located close to the promoter, can also affect noise indirectly by reducing both transcription and translation initiation due to RNAP and ribosome congestion. Our findings highlight how the interplay between transcription initiation, transcription elongation, translation, and mRNA degradation shapes the distribution in protein numbers. They also have implications for our understanding of gene evolution and suggest combinatorial strategies for modulating phenotypic variability by genetic engineering.
Collapse
Affiliation(s)
- Sangjin Kim
- Microbial Sciences Institute, West Haven, Connecticut; Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut; Howard Hughes Medical Institute, New Haven, Connecticut
| | - Christine Jacobs-Wagner
- Microbial Sciences Institute, West Haven, Connecticut; Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut; Howard Hughes Medical Institute, New Haven, Connecticut; Department of Microbial Pathogenesis, Yale School of Medicine, Yale University, New Haven, Connecticut.
| |
Collapse
|
6
|
Prajapat MK, Ribeiro AS. Added value of autoregulation and multi-step kinetics of transcription initiation. ROYAL SOCIETY OPEN SCIENCE 2018; 5:181170. [PMID: 30564410 PMCID: PMC6281912 DOI: 10.1098/rsos.181170] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2018] [Accepted: 11/01/2018] [Indexed: 06/09/2023]
Abstract
Bacterial gene expression regulation occurs mostly during transcription, which has two main rate-limiting steps: the close complex formation, when the RNA polymerase binds to an active promoter, and the subsequent open complex formation, after which it follows elongation. Tuning these steps' kinetics by the action of e.g. transcription factors, allows for a wide diversity of dynamics. For example, adding autoregulation generates single-gene circuits able to perform more complex tasks. Using stochastic models of transcription kinetics with empirically validated parameter values, we investigate how autoregulation and the multi-step transcription initiation kinetics of single-gene autoregulated circuits can be combined to fine-tune steady state mean and cell-to-cell variability in protein expression levels, as well as response times. Next, we investigate how they can be jointly tuned to control complex behaviours, namely, time counting, switching dynamics and memory storage. Overall, our finding suggests that, in bacteria, jointly regulating a single-gene circuit's topology and the transcription initiation multi-step dynamics allows enhancing complex task performance.
Collapse
Affiliation(s)
- Mahendra Kumar Prajapat
- Laboratory of Biosystem Dynamics, Faculty of Biomedical Sciences and Engineering, BioMediTech Institute, Tampere University of Technology, 33101 Tampere, Finland
| | - Andre S. Ribeiro
- Laboratory of Biosystem Dynamics, Faculty of Biomedical Sciences and Engineering, BioMediTech Institute, Tampere University of Technology, 33101 Tampere, Finland
- Multi-scaled Biodata Analysis and Modelling Research Community, Tampere University of Technology, 33101 Tampere, Finland
- CA3 CTS/UNINOVA, Faculdade de Ciencias e Tecnologia, Universidade Nova de Lisboa, Quinta da Torre, 2829-516 Caparica, Portugal
| |
Collapse
|
7
|
Bauer CR, Li S, Siegal ML. Essential gene disruptions reveal complex relationships between phenotypic robustness, pleiotropy, and fitness. Mol Syst Biol 2015; 11:773. [PMID: 25609648 PMCID: PMC4332149 DOI: 10.15252/msb.20145264] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The concept of robustness in biology has gained much attention recently, but a mechanistic understanding of how genetic networks regulate phenotypic variation has remained elusive. One approach to understand the genetic architecture of variability has been to analyze dispensable gene deletions in model organisms; however, the most important genes cannot be deleted. Here, we have utilized two systems in yeast whereby essential genes have been altered to reduce expression. Using high-throughput microscopy and image analysis, we have characterized a large number of morphological phenotypes, and their associated variation, for the majority of essential genes in yeast. Our results indicate that phenotypic robustness is more highly dependent upon the expression of essential genes than on the presence of dispensable genes. Morphological robustness appears to be a general property of a genotype that is closely related to pleiotropy. While the fitness profile across a range of expression levels is idiosyncratic to each gene, the global pattern indicates that there is a window in which phenotypic variation can be released before fitness effects are observable.
Collapse
Affiliation(s)
- Christopher R Bauer
- Department of Biology, NYU Center for Genomics and Systems Biology, New York, NY, USA
| | - Shuang Li
- Department of Biology, NYU Center for Genomics and Systems Biology, New York, NY, USA
| | - Mark L Siegal
- Department of Biology, NYU Center for Genomics and Systems Biology, New York, NY, USA
| |
Collapse
|
8
|
Ribeiro AS, Häkkinen A, Lloyd-Price J. Effects of gene length on the dynamics of gene expression. Comput Biol Chem 2012; 41:1-9. [PMID: 23142668 DOI: 10.1016/j.compbiolchem.2012.10.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2012] [Revised: 10/11/2012] [Accepted: 10/11/2012] [Indexed: 01/06/2023]
Abstract
In Escherichia coli, the nucleotide length of a gene is bound to affect its expression dynamics. From simulations of a stochastic model of gene expression at the nucleotide and codon levels we show that, within realistic parameter values, the nucleotide length affects RNA and protein mean levels, as well as the expected transient time for RNA and protein numbers to change, following a signal. Fluctuations in RNA and protein numbers are found to be minimized for a small range of lengths, which matches the means of the distributions of lengths found in E. coli of both essential and non-essential genes. The variance of the length distribution for essential genes is found to be smaller than for non-essential genes, implying that these distributions are far from random. Finally, gene lengths are shown to affect the kinetics of a genetic switch, namely, the correlation between temporal proteins numbers, the stability of the two noisy attractors of the switch, and how biased is the choice of noisy attractor. The stability increases with gene length due to increased 'memory' about the previous states of the switch. We argue that, by affecting the dynamics of gene expression and of genetic circuits, gene lengths are subject to selection.
Collapse
Affiliation(s)
- Andre S Ribeiro
- Laboratory of Biosystem Dynamics, Computational Systems Biology Research Group, Department of Signal Processing, Tampere University of Technology, PO Box 553, 33101 Tampere, Finland.
| | | | | |
Collapse
|
9
|
Emmert-Streib F, Häkkinen A, Ribeiro AS. Detecting sequence dependent transcriptional pauses from RNA and protein number time series. BMC Bioinformatics 2012; 13:152. [PMID: 22741547 PMCID: PMC3534578 DOI: 10.1186/1471-2105-13-152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Accepted: 06/20/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Evidence suggests that in prokaryotes sequence-dependent transcriptional pauses affect the dynamics of transcription and translation, as well as of small genetic circuits. So far, a few pause-prone sequences have been identified from in vitro measurements of transcription elongation kinetics. RESULTS Using a stochastic model of gene expression at the nucleotide and codon levels with realistic parameter values, we investigate three different but related questions and present statistical methods for their analysis. First, we show that information from in vivo RNA and protein temporal numbers is sufficient to discriminate between models with and without a pause site in their coding sequence. Second, we demonstrate that it is possible to separate a large variety of models from each other with pauses of various durations and locations in the template by means of a hierarchical clustering and a random forest classifier. Third, we introduce an approximate likelihood function that allows to estimate the location of a pause site. CONCLUSIONS This method can aid in detecting unknown pause-prone sequences from temporal measurements of RNA and protein numbers at a genome-wide scale and thus elucidate possible roles that these sequences play in the dynamics of genetic networks and phenotype.
Collapse
Affiliation(s)
- Frank Emmert-Streib
- Computational Biology and Machine Learning Lab, Center for CancerResearch and Cell Biology, School of Medicine, Dentistry and Biomedical Sciences, Queen’s University Belfast, Belfast, UK
| | | | | |
Collapse
|
10
|
Chalancon G, Ravarani CNJ, Balaji S, Martinez-Arias A, Aravind L, Jothi R, Babu MM. Interplay between gene expression noise and regulatory network architecture. Trends Genet 2012; 28:221-32. [PMID: 22365642 DOI: 10.1016/j.tig.2012.01.006] [Citation(s) in RCA: 200] [Impact Index Per Article: 15.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2011] [Revised: 01/23/2012] [Accepted: 01/26/2012] [Indexed: 01/24/2023]
Abstract
Complex regulatory networks orchestrate most cellular processes in biological systems. Genes in such networks are subject to expression noise, resulting in isogenic cell populations exhibiting cell-to-cell variation in protein levels. Increasing evidence suggests that cells have evolved regulatory strategies to limit, tolerate or amplify expression noise. In this context, fundamental questions arise: how can the architecture of gene regulatory networks generate, make use of or be constrained by expression noise? Here, we discuss the interplay between expression noise and gene regulatory network at different levels of organization, ranging from a single regulatory interaction to entire regulatory networks. We then consider how this interplay impacts a variety of phenomena, such as pathogenicity, disease, adaptation to changing environments, differential cell-fate outcome and incomplete or partial penetrance effects. Finally, we highlight recent technological developments that permit measurements at the single-cell level, and discuss directions for future research.
Collapse
Affiliation(s)
- Guilhem Chalancon
- MRC Laboratory of Molecular Biology, Hills Road, Cambridge, CB2 0QH, UK.
| | | | | | | | | | | | | |
Collapse
|
11
|
Häkkinen A, Ribeiro AS. Evolving kinetics of gene expression in stochastic environments. Comput Biol Chem 2012; 37:11-6. [PMID: 22410387 DOI: 10.1016/j.compbiolchem.2012.02.002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2011] [Revised: 01/18/2012] [Accepted: 02/14/2012] [Indexed: 10/28/2022]
Abstract
Recent studies have shown that the in vivo dynamics of RNA numbers in bacteria is regulated, to a great extent, by the kinetics of rate limiting steps in transcription. Strong evidence suggests that the kinetics of these steps is sequence dependent. We investigate the selective advantages of rate limiting steps of differing kinetics. For that, we model the kinetics of expression of a gene responsible for promoting cell division at the expense of resources in the environment in individual cells of a population. We model mutations that affect the kinetics of the rate limiting steps and selective pressure in various environmental conditions. Depletion of resources leads to cell death. We find that small changes in the evolutionary constraints can favor widely different noise levels in RNA and protein numbers. Increasing the cost in nutrients for division favors noisier expression. The results provide a better understanding of why different genes differ in the kinetics of production of RNA and proteins.
Collapse
Affiliation(s)
- Antti Häkkinen
- Laboratory of Biosystem Dynamics, Computational Systems Biology Research Group, Department of Signal Processing, Tampere University of Technology, Finland.
| | | |
Collapse
|
12
|
Mäkelä J, Lloyd-Price J, Yli-Harja O, Ribeiro AS. Stochastic sequence-level model of coupled transcription and translation in prokaryotes. BMC Bioinformatics 2011; 12:121. [PMID: 21521517 PMCID: PMC3113936 DOI: 10.1186/1471-2105-12-121] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2010] [Accepted: 04/26/2011] [Indexed: 12/31/2022] Open
Abstract
Background In prokaryotes, transcription and translation are dynamically coupled, as the latter starts before the former is complete. Also, from one transcript, several translation events occur in parallel. To study how events in transcription elongation affect translation elongation and fluctuations in protein levels, we propose a delayed stochastic model of prokaryotic transcription and translation at the nucleotide and codon level that includes the promoter open complex formation and alternative pathways to elongation, namely pausing, arrests, editing, pyrophosphorolysis, RNA polymerase traffic, and premature termination. Stepwise translation can start after the ribosome binding site is formed and accounts for variable codon translation rates, ribosome traffic, back-translocation, drop-off, and trans-translation. Results First, we show that the model accurately matches measurements of sequence-dependent translation elongation dynamics. Next, we characterize the degree of coupling between fluctuations in RNA and protein levels, and its dependence on the rates of transcription and translation initiation. Finally, modeling sequence-specific transcriptional pauses, we find that these affect protein noise levels. Conclusions For parameter values within realistic intervals, transcription and translation are found to be tightly coupled in Escherichia coli, as the noise in protein levels is mostly determined by the underlying noise in RNA levels. Sequence-dependent events in transcription elongation, e.g. pauses, are found to cause tangible effects in the degree of fluctuations in protein levels.
Collapse
Affiliation(s)
- Jarno Mäkelä
- Computational Systems Biology Research Group, Department of Signal Processing, Tampere University of Technology, FI-33101 Tampere, Finland
| | | | | | | |
Collapse
|