1
|
Athanasiadou R, Neymotin B, Brandt N, Wang W, Christiaen L, Gresham D, Tranchina D. A complete statistical model for calibration of RNA-seq counts using external spike-ins and maximum likelihood theory. PLoS Comput Biol 2019; 15:e1006794. [PMID: 30856174 PMCID: PMC6428340 DOI: 10.1371/journal.pcbi.1006794] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2018] [Revised: 03/21/2019] [Accepted: 01/16/2019] [Indexed: 01/09/2023] Open
Abstract
A fundamental assumption, common to the vast majority of high-throughput transcriptome analyses, is that the expression of most genes is unchanged among samples and that total cellular RNA remains constant. As the number of analyzed experimental systems increases however, different independent studies demonstrate that this assumption is often violated. We present a calibration method using RNA spike-ins that allows for the measurement of absolute cellular abundance of RNA molecules. We apply the method to pooled RNA from cell populations of known sizes. For each transcript, we compute a nominal abundance that can be converted to absolute by dividing by a scale factor determined in separate experiments: the yield coefficient of the transcript relative to that of a reference spike-in measured with the same protocol. The method is derived by maximum likelihood theory in the context of a complete statistical model for sequencing counts contributed by cellular RNA and spike-ins. The counts are based on a sample from a fixed number of cells to which a fixed population of spike-in molecules has been added. We illustrate and evaluate the method with applications to two global expression data sets, one from the model eukaryote Saccharomyces cerevisiae, proliferating at different growth rates, and differentiating cardiopharyngeal cell lineages in the chordate Ciona robusta. We tested the method in a technical replicate dilution study, and in a k-fold validation study.
Collapse
Affiliation(s)
- Rodoniki Athanasiadou
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
| | - Benjamin Neymotin
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
| | - Nathan Brandt
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
| | - Wei Wang
- Center for Developmental Genetics, Department of Biology, New York University, New York, New York, United States of America
| | - Lionel Christiaen
- Center for Developmental Genetics, Department of Biology, New York University, New York, New York, United States of America
| | - David Gresham
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York, United States of America
| | - Daniel Tranchina
- Department of Biology, New York University, New York, New York, United States of America
- Courant Institute of Mathematical Sciences, New York University, New York, New York, United States of America
| |
Collapse
|
2
|
Airoldi EM, Miller D, Athanasiadou R, Brandt N, Abdul-Rahman F, Neymotin B, Hashimoto T, Bahmani T, Gresham D. Steady-state and dynamic gene expression programs in Saccharomyces cerevisiae in response to variation in environmental nitrogen. Mol Biol Cell 2016; 27:1383-96. [PMID: 26941329 PMCID: PMC4831890 DOI: 10.1091/mbc.e14-05-1013] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2014] [Accepted: 02/23/2016] [Indexed: 11/16/2022] Open
Abstract
Steady-state and transiently perturbed nitrogen-limited chemostats show that nitrogen abundance is a primary signal controlling nitrogen-responsive gene expression. When cells experience an increase in nitrogen, some transcripts are rapidly degraded, suggesting that accelerated mRNA degradation contributes to remodeling of gene expression. Cell growth rate is regulated in response to the abundance and molecular form of essential nutrients. In Saccharomyces cerevisiae (budding yeast), the molecular form of environmental nitrogen is a major determinant of cell growth rate, supporting growth rates that vary at least threefold. Transcriptional control of nitrogen use is mediated in large part by nitrogen catabolite repression (NCR), which results in the repression of specific transcripts in the presence of a preferred nitrogen source that supports a fast growth rate, such as glutamine, that are otherwise expressed in the presence of a nonpreferred nitrogen source, such as proline, which supports a slower growth rate. Differential expression of the NCR regulon and additional nitrogen-responsive genes results in >500 transcripts that are differentially expressed in cells growing in the presence of different nitrogen sources in batch cultures. Here we find that in growth rate–controlled cultures using nitrogen-limited chemostats, gene expression programs are strikingly similar regardless of nitrogen source. NCR expression is derepressed in all nitrogen-limiting chemostat conditions regardless of nitrogen source, and in these conditions, only 34 transcripts exhibit nitrogen source–specific differential gene expression. Addition of either the preferred nitrogen source, glutamine, or the nonpreferred nitrogen source, proline, to cells growing in nitrogen-limited chemostats results in rapid, dose-dependent repression of the NCR regulon. Using a novel means of computational normalization to compare global gene expression programs in steady-state and dynamic conditions, we find evidence that the addition of nitrogen to nitrogen-limited cells results in the transient overproduction of transcripts required for protein translation. Simultaneously, we find that that accelerated mRNA degradation underlies the rapid clearing of a subset of transcripts, which is most pronounced for the highly expressed NCR-regulated permease genes GAP1, MEP2, DAL5, PUT4, and DIP5. Our results reveal novel aspects of nitrogen-regulated gene expression and highlight the need for a quantitative approach to study how the cell coordinates protein translation and nitrogen assimilation to optimize cell growth in different environments.
Collapse
Affiliation(s)
- Edoardo M Airoldi
- Department of Statistics, Harvard University, Cambridge, MA 02138 Broad Institute of Harvard and Massachusetts Institute of Technology, Cambridge, MA 02142
| | - Darach Miller
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - Rodoniki Athanasiadou
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - Nathan Brandt
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - Farah Abdul-Rahman
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - Benjamin Neymotin
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - Tatsu Hashimoto
- Department of Statistics, Harvard University, Cambridge, MA 02138
| | - Tayebeh Bahmani
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| | - David Gresham
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY 10003
| |
Collapse
|
3
|
Doidy J, Li Y, Neymotin B, Edwards MB, Varala K, Gresham D, Coruzzi GM. "Hit-and-Run" transcription: de novo transcription initiated by a transient bZIP1 "hit" persists after the "run". BMC Genomics 2016; 17:92. [PMID: 26843062 PMCID: PMC4738784 DOI: 10.1186/s12864-016-2410-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2015] [Accepted: 01/21/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Dynamic transcriptional regulation is critical for an organism's response to environmental signals and yet remains elusive to capture. Such transcriptional regulation is mediated by master transcription factors (TF) that control large gene regulatory networks. Recently, we described a dynamic mode of TF regulation named "hit-and-run". This model proposes that master TF can interact transiently with a set of targets, but the transcription of these transient targets continues after the TF dissociation from the target promoter. However, experimental evidence validating active transcription of the transient TF-targets is still lacking. RESULTS Here, we show that active transcription continues after transient TF-target interactions by tracking de novo synthesis of RNAs made in response to TF nuclear import. To do this, we introduced an affinity-labeled 4-thiouracil (4tU) nucleobase to specifically isolate newly synthesized transcripts following conditional TF nuclear import. Thus, we extended the TARGET system (Transient Assay Reporting Genome-wide Effects of Transcription factors) to include 4tU-labeling and named this new technology TARGET-tU. Our proof-of-principle example is the master TF Basic Leucine Zipper 1 (bZIP1), a central integrator of metabolic signaling in plants. Using TARGET-tU, we captured newly synthesized mRNAs made in response to bZIP1 nuclear import at a time when bZIP1 is no longer detectably bound to its target. Thus, the analysis of de novo transcripomics demonstrates that bZIP1 may act as a catalyst TF to initiate a transcriptional complex ("hit"), after which active transcription by RNA polymerase continues without the TF being bound to the gene promoter ("run"). CONCLUSION Our findings provide experimental proof for active transcription of transient TF-targets supporting a "hit-and-run" mode of action. This dynamic regulatory model allows a master TF to catalytically propagate rapid and broad transcriptional responses to changes in environment. Thus, the functional read-out of de novo transcripts produced by transient TF-target interactions allowed us to capture new models for genome-wide transcriptional control.
Collapse
Affiliation(s)
- Joan Doidy
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - Ying Li
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - Benjamin Neymotin
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - Molly B Edwards
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - Kranthi Varala
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - David Gresham
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| | - Gloria M Coruzzi
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|
4
|
Abstract
The abundance of a transcript is determined by its rate of synthesis and its rate of degradation; however, global methods for quantifying RNA abundance cannot distinguish variation in these two processes. Here, we introduce RNA approach to equilibrium sequencing (RATE-seq), which uses in vivo metabolic labeling of RNA and approach to equilibrium kinetics, to determine absolute RNA degradation and synthesis rates. RATE-seq does not disturb cellular physiology, uses straightforward normalization with exogenous spike-ins, and can be readily adapted for studies in most organisms. We demonstrate the use of RATE-seq to estimate genome-wide kinetic parameters for coding and noncoding transcripts in Saccharomyces cerevisiae.
Collapse
Affiliation(s)
- Benjamin Neymotin
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York 10003, USA
| | - Rodoniki Athanasiadou
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York 10003, USA
| | - David Gresham
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, New York 10003, USA
| |
Collapse
|