1
|
Jones T, Sigauke RF, Sanford L, Taatjes DJ, Allen MA, Dowell RD. A transcription factor (TF) inference method that broadly measures TF activity and identifies mechanistically distinct TF networks. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.15.585303. [PMID: 38559193 PMCID: PMC10980006 DOI: 10.1101/2024.03.15.585303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/04/2024]
Abstract
TF profiler is a method of inferring transcription factor regulatory activity, i.e., when a TF is present and actively regulating transcription, directly from nascent sequencing assays such as PRO-seq and GRO-seq. Transcription factors orchestrate transcription and play a critical role in cellular maintenance, identity and response to external stimuli. While ChIP assays have measured DNA localization, they fall short of identifying when and where transcription factors are actively regulating transcription. Our method, on the other hand, uses RNA polymerase activity to infer TF activity across hundreds of data sets and transcription factors. Based on these classifications we identify three distinct classes of transcription factors: ubiquitous factors that play roles in cellular homeostasis, driving basal gene programs across tissues and cell types; tissue-specific factors that act almost exclusively at enhancers and are themselves regulated at transcription; and stimulus-responsive TFs, which are regulated post-transcriptionally but act predominantly at enhancers. TF profiler is broadly applicable, providing regulatory insights on any PRO-seq sample for any transcription factor with a known binding motif.
|
2
|
Maas ZL, Dowell RD. Internal and external normalization of nascent RNA sequencing run-on experiments. BMC Bioinformatics 2024; 25:19. [PMID: 38216877 PMCID: PMC10785432 DOI: 10.1186/s12859-023-05607-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2023] [Accepted: 12/07/2023] [Indexed: 01/14/2024]
Abstract
In experiments with significant perturbations to transcription, nascent RNA sequencing protocols are dependent on external spike-ins for reliable normalization. Unlike in RNA-seq, these spike-ins are not standardized and, in many cases, depend on a run-on reaction that is assumed to have constant efficiency across samples. To assess the validity of this assumption, we analyze a large number of published nascent RNA spike-ins to quantify their variability across existing normalization methods. Furthermore, we develop a new biologically-informed Bayesian model to estimate the error in spike-in based normalization estimates, which we term Virtual Spike-In (VSI). We apply this method both to published external spike-ins and to reads at the [Formula: see text] end of long genes, building on prior work from Mahat (Mol Cell 62(1):63-78, 2016. https://doi.org/10.1016/j.molcel.2016.02.025) and Vihervaara (Nat Commun 8(1):255, 2017. https://doi.org/10.1038/s41467-017-00151-0). We find that spike-ins in existing nascent RNA experiments are typically under-sequenced, with high variability between samples. Furthermore, we show that these high-variability estimates can have significant downstream effects on analysis, complicating biological interpretations of results.
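To make concrete what spike-in based normalization refers to, here is a minimal sketch of plain spike-in scaling. It is not the VSI model described in the abstract; the function names, sample names, and counts are all hypothetical and chosen only for illustration.

```python
def spike_in_scale_factors(spike_counts):
    """Return per-sample scale factors relative to the mean spike-in count.

    Assumption: each sample received the same amount of spike-in material,
    so differences in spike-in read counts reflect technical variation.
    """
    if not spike_counts or any(c <= 0 for c in spike_counts.values()):
        raise ValueError("every sample needs a positive spike-in read count")
    reference = sum(spike_counts.values()) / len(spike_counts)
    return {sample: reference / count for sample, count in spike_counts.items()}


def normalize(counts, factors):
    """Multiply each sample's read counts by that sample's scale factor."""
    return {
        sample: [c * factors[sample] for c in values]
        for sample, values in counts.items()
    }


# Hypothetical spike-in and gene-level read counts for two samples.
spike = {"control": 1000, "treated": 2000}
genes = {"control": [50, 200], "treated": [100, 100]}

factors = spike_in_scale_factors(spike)
normalized = normalize(genes, factors)
```

Because each factor is a ratio of read counts, a sample with few spike-in reads yields a noisy factor (the relative Poisson error is roughly one over the square root of the count), and that noise carries into every normalized value for the sample.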
Affiliation(s)
- Zachary L Maas
  - Department of Computer Science, University of Colorado, Boulder, USA
  - BioFrontiers Institute, University of Colorado, Boulder, USA
- Robin D Dowell
  - Department of Computer Science, University of Colorado, Boulder, USA
  - BioFrontiers Institute, University of Colorado, Boulder, USA
  - Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, USA
|
3
|
Liu L, Zhao Y, Siepel A. DNA-sequence and epigenomic determinants of local rates of transcription elongation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.21.572932. [PMID: 38187771 PMCID: PMC10769381 DOI: 10.1101/2023.12.21.572932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 01/09/2024]
Abstract
Across all branches of life, transcription elongation is a crucial, regulated phase in gene expression. Many recent studies in eukaryotes have focused on the regulation of promoter-proximal pausing of RNA Polymerase II (Pol II), but rates of productive elongation also vary substantially throughout the gene body, both within and across genes. Here, we introduce a probabilistic model for systematically evaluating potential determinants of the local elongation rate based on nascent RNA sequencing (NRS) data. Our model is derived from a unified model for both the kinetics of Pol II movement along the DNA template and the generation of NRS read counts at steady state. It allows for a continuously variable elongation rate along the gene body, with the rate at each nucleotide defined by a generalized linear relationship with nearby genomic and epigenomic features. High-dimensional feature vectors are accommodated through a sparse-regression extension. We show with simulations that the model allows accurate detection of associated features and accurate prediction of local elongation rates. In an analysis of public PRO-seq and epigenomic data, we identify several features that are strongly associated with reductions in the local elongation rate, including DNA methylation, splice sites, RNA stem-loops, CTCF binding sites, and several histone marks, including H3K36me3 and H4K20me1. By contrast, low-complexity sequences and H3K79me2 marks are associated with increases in elongation rate. In an analysis of DNA k-mers, we find that cytosine nucleotides are strongly associated with reductions in local elongation rate, particularly when preceded by guanines and followed by adenines or thymines. Increases in elongation rate are associated with thymines and A+T-rich k-mers. These associations are generally shared across cell types, and by considering them our model is effective at predicting features of held-out PRO-seq data.
Overall, our analysis is the first to permit genome-wide predictions of relative nucleotide-specific elongation rates based on complex sets of genomic and epigenomic covariates. We have made predictions available for the K562, CD14+, MCF-7, and HeLa-S3 cell types in a UCSC Genome Browser track.
Affiliation(s)
- Lingjie Liu
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  - Graduate Program in Genetics, Stony Brook University, Stony Brook, NY
- Yixin Zhao
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
- Adam Siepel
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY
  - Graduate Program in Genetics, Stony Brook University, Stony Brook, NY
|
4
|
Zhao Y, Liu L, Hassett R, Siepel A. Model-based characterization of the equilibrium dynamics of transcription initiation and promoter-proximal pausing in human cells. Nucleic Acids Res 2023; 51:e106. [PMID: 37889042 PMCID: PMC10681744 DOI: 10.1093/nar/gkad843] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 03/06/2023] [Revised: 09/13/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023]
Abstract
In metazoans, both transcription initiation and the escape of RNA polymerase (RNAP) from promoter-proximal pausing are key rate-limiting steps in gene expression. These processes play out at physically proximal sites on the DNA template and appear to influence one another through steric interactions. Here, we examine the dynamics of these processes using a combination of statistical modeling, simulation, and analysis of real nascent RNA sequencing data. We develop a simple probabilistic model that jointly describes the kinetics of transcription initiation, pause-escape, and elongation, and the generation of nascent RNA sequencing read counts under steady-state conditions. We then extend this initial model to allow for variability across cells in promoter-proximal pause site locations and steric hindrance of transcription initiation from paused RNAPs. In an extensive series of simulations, we show that this model enables accurate estimation of initiation and pause-escape rates. Furthermore, we show by simulation and analysis of real data that pause-escape is often strongly rate-limiting and that steric hindrance can dramatically reduce initiation rates. Our modeling framework is applicable to a variety of inference problems, and our software for estimation and simulation is freely available.
Affiliation(s)
- Yixin Zhao
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Lingjie Liu
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
  - Graduate Program in Genetics, Stony Brook University, Stony Brook, NY, USA
- Rebecca Hassett
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
- Adam Siepel
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
  - Graduate Program in Genetics, Stony Brook University, Stony Brook, NY, USA
|
5
|
Zheng M, Lin Y, Wang W, Zhao Y, Bao X. Application of nucleoside or nucleotide analogues in RNA dynamics and RNA-binding protein analysis. WILEY INTERDISCIPLINARY REVIEWS. RNA 2022; 13:e1722. [PMID: 35218164 DOI: 10.1002/wrna.1722] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 08/18/2021] [Revised: 01/07/2022] [Accepted: 01/26/2022] [Indexed: 06/14/2023]
Abstract
Cellular RNAs undergo dynamic changes during RNA biological processes, which are tightly orchestrated by RNA-binding proteins (RBPs). Yet, the investigation of RNA dynamics is hindered by highly abundant steady-state RNAs, which make the signals of dynamic RNAs less detectable. Notably, the use of nucleoside or nucleotide analogue-based RNA technologies has provided a remarkable platform for RNA dynamics research, revealing diverse unnoticed features in RNA metabolism. In this review, we focus on the application of two types of analogue-based RNA sequencing, antigen-/antibody- and click chemistry-based methodologies, and summarize the RNA dynamics features revealed. Moreover, we discuss emerging single-cell newly transcribed RNA sequencing methodologies based on nucleoside analogue labeling, which provide novel insights into RNA dynamics regulation at single-cell resolution. We also emphasize the identification of RBPs that interact with polyA, non-polyA RNAs, or newly transcribed RNAs, and of their associated RNA-binding domains at the genome-wide level, through ultraviolet crosslinking and mass spectrometry in different contexts. We anticipate that further modification and development of these analogue-based RNA and RBP capture technologies will aid in obtaining an unprecedented understanding of RNA biology. This article is categorized under: RNA Interactions with Proteins and Other Molecules > Protein-RNA Recognition; RNA Structure and Dynamics > RNA Structure, Dynamics and Chemistry; RNA Methods > RNA Analyses in Cells.
Affiliation(s)
- Meifeng Zheng
  - Center for Cell Lineage and Development, CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, GIBH-HKU Guangdong-Hong Kong Stem Cell and Regenerative Medicine Research Centre, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
  - University of Chinese Academy of Sciences, Beijing, China
- Yingying Lin
  - Center for Cell Lineage and Development, CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, GIBH-HKU Guangdong-Hong Kong Stem Cell and Regenerative Medicine Research Centre, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
  - The Center for Infection and Immunity Study, School of Medicine, Sun Yat-sen University, Guangming Science City, Shenzhen, China
- Wei Wang
  - Center for Biosafety, Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou, China
- Yu Zhao
  - Molecular Cancer Research Center, School of Medicine, Sun Yat-sen University, Shenzhen, China
- Xichen Bao
  - Center for Cell Lineage and Development, CAS Key Laboratory of Regenerative Biology, Guangdong Provincial Key Laboratory of Stem Cell and Regenerative Medicine, GIBH-HKU Guangdong-Hong Kong Stem Cell and Regenerative Medicine Research Centre, Guangzhou Institutes of Biomedicine and Health, Chinese Academy of Sciences, Guangzhou, China
  - University of Chinese Academy of Sciences, Beijing, China
  - Center for Cell Lineage and Atlas, Bioland Laboratory (Guangzhou Regenerative Medicine and Health Guangdong Laboratory), Guangzhou, China
|
6
|
Chattopadhyay A, Guan P, Majumder S, Kaw K, Zhou Z, Zhang C, Prakash SK, Kaw A, Buja LM, Kwartler CS, Milewicz DM. Preventing Cholesterol-Induced Perk (Protein Kinase RNA-Like Endoplasmic Reticulum Kinase) Signaling in Smooth Muscle Cells Blocks Atherosclerotic Plaque Formation. Arterioscler Thromb Vasc Biol 2022; 42:1005-1022. [PMID: 35708026 PMCID: PMC9311463 DOI: 10.1161/atvbaha.121.317451] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Indexed: 11/28/2022]
Abstract
Vascular smooth muscle cells (SMCs) undergo complex phenotypic modulation with atherosclerotic plaque formation in hyperlipidemic mice, which is characterized by de-differentiation and heterogeneous increases in the expression of macrophage, fibroblast, osteogenic, and stem cell markers. An increase of cellular cholesterol in SMCs triggers similar phenotypic changes in vitro with exposure to free cholesterol due to cholesterol entering the endoplasmic reticulum, triggering endoplasmic reticulum stress and activating Perk (protein kinase RNA-like endoplasmic reticulum kinase) signaling.
Affiliation(s)
- Abhijnan Chattopadhyay
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- Pujun Guan
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
  - Graduate School of Biomedical Sciences, University of Texas MD Anderson Cancer Center and UTHealth, Houston (P.G.)
- Suravi Majumder
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- Kaveeta Kaw
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- Zhen Zhou
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- Chen Zhang
  - Division of Cardiothoracic Surgery, Michael E. DeBakey Department of Surgery, Baylor College of Medicine, Houston, TX (C.Z.)
  - Department of Cardiovascular Surgery, Texas Heart Institute, Houston (C.Z.)
- Anita Kaw
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- L Maximillian Buja
  - Department of Pathology and Laboratory Medicine, The University of Texas Health Science Center at Houston (L.M.B.)
- Callie S Kwartler
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
- Dianna M Milewicz
  - Division of Medical Genetics, Department of Internal Medicine, McGovern Medical School The University of Texas Health Science Center at Houston (A.C., P.G., S.M., K.K., Z.Z., A.K., C.S.K., D.M.M.)
|
7
|
Himanen SV, Puustinen MC, Da Silva AJ, Vihervaara A, Sistonen L. HSFs drive transcription of distinct genes and enhancers during oxidative stress and heat shock. Nucleic Acids Res 2022; 50:6102-6115. [PMID: 35687139 PMCID: PMC9226494 DOI: 10.1093/nar/gkac493] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Received: 04/26/2022] [Accepted: 05/25/2022] [Indexed: 11/13/2022]
Abstract
Reprogramming of transcription is critical for the survival under cellular stress. Heat shock has provided an excellent model to investigate nascent transcription in stressed cells, but the molecular mechanisms orchestrating RNA synthesis during other types of stress are unknown. We utilized PRO-seq and ChIP-seq to study how Heat Shock Factors, HSF1 and HSF2, coordinate transcription at genes and enhancers upon oxidative stress and heat shock. We show that pause-release of RNA polymerase II (Pol II) is a universal mechanism regulating gene transcription in stressed cells, while enhancers are activated at the level of Pol II recruitment. Moreover, besides functioning as conventional promoter-binding transcription factors, HSF1 and HSF2 bind to stress-induced enhancers to trigger Pol II pause-release from poised gene promoters. Importantly, HSFs act at distinct genes and enhancers in a stress type-specific manner. HSF1 binds to many chaperone genes upon oxidative and heat stress but activates them only in heat-shocked cells. Under oxidative stress, HSF1 localizes to a unique set of promoters and enhancers to trans-activate oxidative stress-specific genes. Taken together, we show that HSFs function as multi-stress-responsive factors that activate distinct genes and enhancers when encountering changes in temperature and redox state.
Affiliation(s)
- Samu V Himanen
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, 20520 Turku, Finland
  - Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
- Mikael C Puustinen
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, 20520 Turku, Finland
  - Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
- Alejandro J Da Silva
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, 20520 Turku, Finland
  - Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
- Anniina Vihervaara
  - Department of Gene Technology, Science for Life Laboratory, KTH Royal Institute of Technology, 17165 Stockholm, Sweden
- Lea Sistonen
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, 20520 Turku, Finland
  - Turku Bioscience Centre, University of Turku and Åbo Akademi University, 20520 Turku, Finland
|
8
|
Hunter S, Sigauke RF, Stanley JT, Allen MA, Dowell RD. Protocol variations in run-on transcription dataset preparation produce detectable signatures in sequencing libraries. BMC Genomics 2022; 23:187. [PMID: 35255806 PMCID: PMC8900324 DOI: 10.1186/s12864-022-08352-8] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 05/29/2021] [Accepted: 01/25/2022] [Indexed: 11/20/2022]
Abstract
Background: A variety of protocols exist for producing whole genome run-on transcription datasets. However, little is known about how differences between these protocols affect the signal within the resulting libraries. Results: Using run-on transcription datasets generated from the same biological system, we show that a variety of GRO- and PRO-seq preparation methods leave identifiable signatures within each library. Specifically we show that the library preparation method results in differences in quality control metrics, as well as differences in the signal distribution at the 5′ end of transcribed regions. These shifts lead to disparities in eRNA identification, but do not impact analyses aimed at inferring the key regulators involved in changes to transcription. Conclusions: Run-on sequencing protocol variations result in technical signatures that can be used to identify both the enrichment and library preparation method of a particular data set. These technical signatures are batch effects that limit detailed comparisons of pausing ratios and eRNAs identified across protocols. However, these batch effects have only limited impact on our ability to infer which regulators underlie the observed transcriptional changes. Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-022-08352-8.
Affiliation(s)
- Samuel Hunter
  - BioFrontiers Institute, University of Colorado, Boulder, 80309, USA
- Rutendo F Sigauke
  - Computational Bioscience Program, Anschutz Medical Campus, University of Colorado, Aurora, 80045, USA
- Jacob T Stanley
  - Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, 80301, USA
- Mary A Allen
  - BioFrontiers Institute, University of Colorado, Boulder, 80309, USA
- Robin D Dowell
  - BioFrontiers Institute, University of Colorado, Boulder, 80309, USA
  - Computational Bioscience Program, Anschutz Medical Campus, University of Colorado, Aurora, 80045, USA
  - Molecular, Cellular, and Developmental Biology, University of Colorado Boulder, Boulder, 80301, USA
  - Department of Computer Science, University of Colorado, Boulder, 80309, USA
|
9
|
Gupta A, Sasse SK, Gruca MA, Sanford L, Dowell RD, Gerber AN. Deconvolution of multiplexed transcriptional responses to wood smoke particles defines rapid aryl hydrocarbon receptor signaling dynamics. J Biol Chem 2021; 297:101147. [PMID: 34520756 PMCID: PMC8517214 DOI: 10.1016/j.jbc.2021.101147] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 07/19/2021] [Revised: 08/26/2021] [Accepted: 08/27/2021] [Indexed: 12/24/2022]
Abstract
The heterogeneity of respirable particulates and compounds complicates our understanding of transcriptional responses to air pollution. Here, we address this by applying precision nuclear run-on sequencing and the assay for transposase-accessible chromatin sequencing to measure nascent transcription and chromatin accessibility in airway epithelial cells after wood smoke particle (WSP) exposure. We used transcription factor enrichment analysis to identify temporally distinct roles for ternary response factor-serum response factor complexes, the aryl hydrocarbon receptor (AHR), and NFκB in regulating transcriptional changes induced by WSP. Transcription of canonical targets of the AHR, such as CYP1A1 and AHRR, was robustly increased after just 30 min of WSP exposure, and we discovered novel AHR-regulated pathways and targets including the DNA methyltransferase, DNMT3L. Transcription of these genes and associated enhancers rapidly returned to near baseline by 120 min after exposure. The kinetics of AHR- and NFκB-regulated responses to WSP were distinguishable based on the timing of both transcriptional responses and chromatin remodeling, with induction of several cytokines implicated in maintaining NFκB-mediated responses through 120 min of exposure. In aggregate, our data establish a direct and primary role for AHR in mediating airway epithelial responses to WSP and identify crosstalk between AHR and NFκB signaling in controlling proinflammatory gene expression. This work also defines an integrated genomics-based strategy for deconvoluting multiplexed transcriptional responses to heterogeneous environmental exposures.
Affiliation(s)
- Arnav Gupta
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- Sarah K Sasse
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
- Margaret A Gruca
  - BioFrontiers Institute, University of Colorado, Boulder, Colorado, USA
- Lynn Sanford
  - BioFrontiers Institute, University of Colorado, Boulder, Colorado, USA
- Robin D Dowell
  - BioFrontiers Institute, University of Colorado, Boulder, Colorado, USA
  - Department of Molecular, Cellular and Developmental Biology, University of Colorado, Boulder, Colorado, USA
  - Department of Computer Science, University of Colorado, Boulder, Colorado, USA
- Anthony N Gerber
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
  - Department of Immunology and Genomic Medicine, National Jewish Health, Denver, Colorado, USA
|
10
|
Zhao Y, Dukler N, Barshad G, Toneyan S, Danko CG, Siepel A. Deconvolution of Expression for Nascent RNA sequencing data (DENR) highlights pre-RNA isoform diversity in human cells. Bioinformatics 2021; 37:4727-4736. [PMID: 34382072 PMCID: PMC8665767 DOI: 10.1093/bioinformatics/btab582] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/13/2021] [Revised: 07/24/2021] [Accepted: 08/09/2021] [Indexed: 12/03/2022]
Abstract
Motivation: Quantification of isoform abundance has been extensively studied at the mature RNA level using RNA-seq but not at the level of precursor RNAs using nascent RNA sequencing. Results: We address this problem with a new computational method called Deconvolution of Expression for Nascent RNA-sequencing data (DENR), which models nascent RNA-sequencing read-counts as a mixture of user-provided isoforms. The baseline algorithm is enhanced by machine-learning predictions of active transcription start sites and an adjustment for the typical ‘shape profile’ of read-counts along a transcription unit. We show that DENR outperforms simple read-count-based methods for estimating gene and isoform abundances, and that transcription of multiple pre-RNA isoforms per gene is widespread, with frequent differences between cell types. In addition, we provide evidence that a majority of human isoform diversity derives from primary transcription rather than from post-transcriptional processes. Availability and implementation: DENR and nascentRNASim are freely available at https://github.com/CshlSiepelLab/DENR (version v1.0.0) and https://github.com/CshlSiepelLab/nascentRNASim (version v0.3.0). Supplementary information: Supplementary data are available at Bioinformatics online.
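The idea of modeling read counts as a mixture of isoforms can be illustrated with a toy non-negative least-squares fit. This is only a sketch under stated assumptions: it does not use the DENR package or reproduce its algorithm, and the function name, the bin-by-isoform design matrix, and the abundances are hypothetical.

```python
import numpy as np


def deconvolve_isoforms(design, counts, steps=5000):
    """Estimate non-negative isoform abundances a that minimize
    0.5 * ||design @ a - counts||^2 by projected gradient descent."""
    design = np.asarray(design, dtype=float)
    counts = np.asarray(counts, dtype=float)
    # The squared spectral norm bounds the gradient's Lipschitz constant,
    # so a step of 1 / lipschitz is guaranteed not to overshoot.
    lipschitz = np.linalg.norm(design, 2) ** 2
    abundances = np.zeros(design.shape[1])
    for _ in range(steps):
        gradient = design.T @ (design @ abundances - counts)
        # Take a gradient step, then project back onto abundances >= 0.
        abundances = np.maximum(abundances - gradient / lipschitz, 0.0)
    return abundances


# Rows are genomic bins, columns are two hypothetical overlapping isoforms
# (1 means the isoform covers that bin).
design = [[1, 0], [1, 1], [1, 1], [0, 1]]
true_abundances = np.array([3.0, 5.0])
counts = np.asarray(design) @ true_abundances  # noise-free simulated counts

estimate = deconvolve_isoforms(design, counts)
```

In this noise-free toy case the estimate recovers the simulated abundances; real data would add count noise and the start-site and shape-profile adjustments the abstract mentions, none of which are modeled here.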
Affiliation(s)
- Yixin Zhao
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
- Noah Dukler
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
- Gilad Barshad
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853, USA
- Shushan Toneyan
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
- Charles G Danko
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, NY 14853, USA
- Adam Siepel
  - Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA
|
11
|
Gally F, Sasse SK, Kurche JS, Gruca MA, Cardwell JH, Okamoto T, Chu HW, Hou X, Poirion OB, Buchanan J, Preissl S, Ren B, Colgan SP, Dowell RD, Yang IV, Schwartz DA, Gerber AN. The MUC5B-associated variant rs35705950 resides within an enhancer subject to lineage- and disease-dependent epigenetic remodeling. JCI Insight 2021; 6:144294. [PMID: 33320836 PMCID: PMC7934873 DOI: 10.1172/jci.insight.144294] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 09/17/2020] [Accepted: 12/09/2020] [Indexed: 12/19/2022]
Abstract
The G/T transversion rs35705950, located approximately 3 kb upstream of the MUC5B start site, is the cardinal risk factor for idiopathic pulmonary fibrosis (IPF). Here, we investigate the function and chromatin structure of this –3 kb region and provide evidence that it functions as a classically defined enhancer subject to epigenetic programming. We use nascent transcript analysis to show that RNA polymerase II loads within 10 bp of the G/T transversion site, definitively establishing enhancer function for the region. By integrating Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) analysis of fresh and cultured human airway epithelial cells with nuclease sensitivity data, we demonstrate that this region is in accessible chromatin that affects the expression of MUC5B. Through applying paired single-nucleus RNA- and ATAC-seq to frozen tissue from IPF lungs, we extend these findings directly to disease, with results indicating that epigenetic programming of the –3 kb enhancer in IPF occurs in both MUC5B-expressing and nonexpressing lineages. In aggregate, our results indicate that the MUC5B-associated variant rs35705950 resides within an enhancer that is subject to epigenetic remodeling and contributes to pathologic misexpression in IPF.
Affiliation(s)
- Fabienne Gally
  - Department of Immunology and Genomic Medicine, National Jewish Health, Denver, Colorado, USA
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- Sarah K Sasse
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
- Jonathan S Kurche
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- Margaret A Gruca
  - BioFrontiers Institute, University of Colorado-Boulder (CU Boulder), Boulder, Colorado, USA
- Tsukasa Okamoto
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
  - Department of Respiratory Medicine, Tokyo Medical and Dental University, Tokyo, Japan
- Hong W Chu
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
- Xiaomeng Hou
  - Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA
- Olivier B Poirion
  - Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA
- Justin Buchanan
  - Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA
- Sebastian Preissl
  - Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA
- Bing Ren
  - Center for Epigenomics, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California, USA
  - Ludwig Institute for Cancer Research, La Jolla, California, USA
- Sean P Colgan
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- Robin D Dowell
  - BioFrontiers Institute, University of Colorado-Boulder (CU Boulder), Boulder, Colorado, USA
  - Molecular, Cellular and Developmental Biology, and Computer Science, CU Boulder, Boulder, Colorado, USA
- Ivana V Yang
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- David A Schwartz
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
- Anthony N Gerber
  - Department of Immunology and Genomic Medicine, National Jewish Health, Denver, Colorado, USA
  - Department of Medicine, University of Colorado, Aurora, Colorado, USA
  - Department of Medicine, National Jewish Health, Denver, Colorado, USA
|
12
|
Amelkina O, Comizzoli P. Initial response of ovarian tissue transcriptome to vitrification or microwave-assisted dehydration in the domestic cat model. BMC Genomics 2020; 21:828. [PMID: 33238878 PMCID: PMC7690003 DOI: 10.1186/s12864-020-07236-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Received: 09/09/2020] [Accepted: 11/17/2020] [Indexed: 12/11/2022] Open
Abstract
Background: Long term preservation of living ovarian tissues is a critical approach in human reproductive medicine as well as in the conservation of rare animal genotypes. Compared to single cell preservation, optimization of protocols for tissues is highly complex because of the diversity of cells responding differently to non-physiological conditions. Using the prepubertal domestic cat as a model, the objective was to study immediate effects of vitrification or microwave-assisted dehydration on the global transcriptome dynamics in the ovarian cortex. RNA sequencing was performed on ovarian tissues (n = 6 individuals) from different conditions: fresh tissue after dissection (F), vitrified/warmed tissue (V), tissue dehydrated for 5 min (D5) or 10 min (D10) followed by rehydration. Differential gene expression analysis was performed for comparison pairs V vs. F, D10 vs. F, D5 vs. F and D10 vs. D5, and networks were built based on results of functional enrichment and in silico protein-protein interactions.
Results: The impact of the vitrification protocol was already measurable within 20 min after warming and involved upregulation of the expression of seven mitochondrial DNA genes related to mitochondrial respiration. The analysis of D10 vs. F revealed, 30 min after rehydration, major downregulation of gene expression with enrichment of in silico interacting genes in Ras, Rap1, PI3K-Akt and MAPK signaling pathways. However, comparison of D5 vs. F showed negligible effects of the shorter dehydration protocol with two genes enriched in Ras signaling. Comparison of D10 vs. D5 showed downregulation of only seven genes. Vitrification and dehydration protocols mainly changed the expression of different genes and functional terms, but some of the differentially expressed genes formed a major in silico protein-protein interaction cluster enriched for mitochondrial respiration and Ras/MAPK signaling pathways.
Conclusions: Our results showed, for the first time, different effects of vitrification and microwave-assisted dehydration protocols on the global transcriptome of the ovarian cortex (using the domestic cat as a biomedical model). Acquired data and networks built on the basis of differentially expressed genes (1) can help to better understand stress responses to non-physiological stresses and (2) can be used as directions for future preservation protocol optimizations.
Supplementary Information: The online version contains supplementary material available at 10.1186/s12864-020-07236-z.
Affiliation(s)
- Olga Amelkina
  - Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, USA
- Pierre Comizzoli
  - Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, USA
|
13
|
Gopal U, Pizzo SV. Cell surface GRP78 signaling: An emerging role as a transcriptional modulator in cancer. J Cell Physiol 2020; 236:2352-2363. [PMID: 32864780 DOI: 10.1002/jcp.30030] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Received: 06/11/2020] [Revised: 08/07/2020] [Accepted: 08/17/2020] [Indexed: 12/14/2022]
Abstract
Cancer cells acquire dysregulated gene expression to establish specific transcriptional dependencies, but the underlying mechanisms that are ultimately responsible for these addictions have not been fully elucidated. Glucose-regulated protein 78 (GRP78) is a stress-inducible, multifunctional, prosurvival, endoplasmic reticulum chaperone in the heat shock protein 70 family. Expression of cell surface GRP78 (CS-GRP78) is associated with increased malignant behavior and resistance to chemotherapy and radiotherapy by endowing various cancer cells with increased proliferative ability, altered metabolism, improved survival, and augmented invasive and metastatic potential. Emerging evidence has highlighted an unusual role of CS-GRP78 in regulating transcription factors (TFs) by mediating various signaling pathways involved in malignant transformation, metabolic reprogramming, and tumor progression. During the last decade, we targeted CS-GRP78 with the C38 monoclonal antibody (C38 Mab) in numerous studies, which have highlighted the epigenetic interplay between CS-GRP78 and various TFs including c-MYC, Yes-associated protein/transcriptional coactivator with PDZ-binding motif, c-Fos, and histone acetylation to potentiate subsequent modulation of tumorigenesis, invasion, and metastasis. Here, we summarize the current state of knowledge about the role of CS-GRP78 in cancer development and progression, including epigenetic regulation, and shed light on CS-GRP78 as a vulnerable target for cancer therapy. Overall, this review focuses on the mechanisms of TFs that are behind the transcriptional dysregulation in cancer and lays the groundwork for rational therapeutic use of C38 Mab based on CS-GRP78 biology.
Affiliation(s)
- Udhayakumar Gopal
  - Department of Pathology, Duke University Medical Center, Durham, North Carolina
- Salvatore V Pizzo
  - Department of Pathology, Duke University Medical Center, Durham, North Carolina
|
14
|
Wissink EM, Vihervaara A, Tippens ND, Lis JT. Nascent RNA analyses: tracking transcription and its regulation. Nat Rev Genet 2019; 20:705-723. [PMID: 31399713 PMCID: PMC6858503 DOI: 10.1038/s41576-019-0159-6] [Citation(s) in RCA: 129] [Impact Index Per Article: 25.8] [Accepted: 07/04/2019] [Indexed: 12/19/2022]
Abstract
The programmes that direct an organism's development and maintenance are encoded in its genome. Decoding of this information begins with regulated transcription of genomic DNA into RNA. Although transcription and its control can be tracked indirectly by measuring stable RNAs, it is only by directly measuring nascent RNAs that the immediate regulatory changes in response to developmental, environmental, disease and metabolic signals are revealed. Multiple complementary methods have been developed to quantitatively track nascent transcription genome-wide at nucleotide resolution, all of which have contributed novel insights into the mechanisms of gene regulation and transcription-coupled RNA processing. Here we critically evaluate the array of strategies used for investigating nascent transcription and discuss the recent conceptual advances they have provided.
Affiliation(s)
- Erin M Wissink
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
- Anniina Vihervaara
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
- Nathaniel D Tippens
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
  - Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, USA
- John T Lis
  - Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, USA
|
15
|
Himanen SV, Sistonen L. New insights into transcriptional reprogramming during cellular stress. J Cell Sci 2019; 132:132/21/jcs238402. [PMID: 31676663 DOI: 10.1242/jcs.238402] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Indexed: 12/13/2022] Open
Abstract
Cellular stress triggers reprogramming of transcription, which is required for the maintenance of homeostasis under adverse growth conditions. Stress-induced changes in transcription include induction of cyto-protective genes and repression of genes related to the regulation of the cell cycle, transcription and metabolism. Induction of transcription is mediated through the activation of stress-responsive transcription factors that facilitate the release of stalled RNA polymerase II and so allow for transcriptional elongation. Repression of transcription, in turn, involves components that retain RNA polymerase II in a paused state on gene promoters. Moreover, transcription during stress is regulated by a massive activation of enhancers and complex changes in chromatin organization. In this Review, we highlight the latest research regarding the molecular mechanisms of transcriptional reprogramming upon stress in the context of specific proteotoxic stress responses, including the heat-shock response, unfolded protein response, oxidative stress response and hypoxia response.
Affiliation(s)
- Samu V Himanen
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland; Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
- Lea Sistonen
  - Faculty of Science and Engineering, Cell Biology, Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland; Turku Bioscience Centre, University of Turku and Åbo Akademi University, Tykistökatu 6, 20520 Turku, Finland
|
16
|
Abstract
Proteotoxic stress, that is, stress caused by protein misfolding and aggregation, triggers the rapid and global reprogramming of transcription at genes and enhancers. Genome-wide assays that track transcriptionally engaged RNA polymerase II (Pol II) at nucleotide resolution have provided key insights into the underlying molecular mechanisms that regulate transcriptional responses to stress. In addition, recent kinetic analyses of transcriptional control under heat stress have shown how cells 'prewire' and rapidly execute genome-wide changes in transcription while concurrently becoming poised for recovery. The regulation of Pol II at genes and enhancers in response to heat stress is coupled to chromatin modification and compartmentalization, as well as to co-transcriptional RNA processing. These mechanistic features seem to apply broadly to other coordinated genome-regulatory responses.
|
17
|
Ball CB, Nilson KA, Price DH. Use of the nuclear walk-on methodology to determine sites of RNA polymerase II initiation and pausing and quantify nascent RNAs in cells. Methods 2019; 159-160:165-176. [PMID: 30743000 PMCID: PMC6589122 DOI: 10.1016/j.ymeth.2019.02.003] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 12/06/2018] [Revised: 02/04/2019] [Accepted: 02/06/2019] [Indexed: 01/12/2023] Open
Abstract
Transcription by RNA polymerase II (Pol II) is controlled during initiation, elongation, and termination by a large variety of transcription factors, the state of chromatin modifications, and environmental conditions. Herein we describe experimental approaches for the examination of Pol II transcription at semi-global and genome-wide scales through analysis of nascent Pol II transcripts. We begin with a description of the nuclear walk-on (NWO) assay, which involves rapid isolation of nuclei in the presence of EDTA, followed by extension of about a quarter of the nascent transcripts with 32P-CTP. Labeled nascent transcripts are then analyzed by denaturing PAGE and phosphorimaging followed by densitometry analysis to quantify the signal on the gel. A parallel reaction containing α-amanitin to inhibit Pol II reveals transcription due to Pol I and Pol III, which can be subtracted to yield a profile of Pol II transcription. We then describe how to use the NWO as a front end for PRO-Seq and PRO-Cap methods, which permit the genome-wide characterization of Pol II transcription at nucleotide resolution and provide precise information about sites of transcription initiation and pausing. We discuss strategies for optimizing sequencing methods that capture nascent Pol II transcripts, methods of bias reduction, and approaches for normalizing these and other sequencing datasets using spike-in controls.
Affiliation(s)
- Christopher B Ball
  - Department of Biochemistry, University of Iowa, Iowa City, IA 52242, USA
- Kyle A Nilson
  - Center for Eukaryotic Gene Regulation, Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, PA 16802, USA
- David H Price
  - Department of Biochemistry, University of Iowa, Iowa City, IA 52242, USA
|
18
|
Wang Z, Chu T, Choate LA, Danko CG. Identification of regulatory elements from nascent transcription using dREG. Genome Res 2019; 29:293-303. [PMID: 30573452 PMCID: PMC6360809 DOI: 10.1101/gr.238279.118] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Received: 05/13/2018] [Accepted: 12/18/2018] [Indexed: 02/02/2023]
Abstract
Our genomes encode a wealth of transcription initiation regions (TIRs) that can be identified by their distinctive patterns of actively elongating RNA polymerase. We previously introduced dREG to identify TIRs using PRO-seq data. Here, we introduce an efficient new implementation of dREG that uses PRO-seq data to identify both uni- and bidirectionally transcribed TIRs with 70% improvement in accuracy, three- to fourfold higher resolution, and >100-fold increases in computational efficiency. Using a novel strategy to identify TIRs based on their statistical confidence reveals extensive overlap with orthogonal assays, yet also reveals thousands of additional weakly transcribed TIRs that were not identified by H3K27ac ChIP-seq or DNase-seq. Novel TIRs discovered by dREG were often associated with RNA polymerase III initiation, bound by pioneer transcription factors, or located in broad domains marked by repressive chromatin modifications. Our results suggest that transcription initiation can be a powerful tool for expanding the catalog of functional elements.
Affiliation(s)
- Zhong Wang
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
- Tinyi Chu
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
  - Graduate Field of Computational Biology, Cornell University, Ithaca, New York 14853, USA
- Lauren A Choate
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
- Charles G Danko
  - Baker Institute for Animal Health, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
  - Department of Biomedical Sciences, College of Veterinary Medicine, Cornell University, Ithaca, New York 14853, USA
|
19
|
Abstract
The Mediator-associated kinases CDK8 and CDK19 function in the context of three additional proteins: CCNC and MED12, which activate CDK8/CDK19 kinase function, and MED13, which enables their association with the Mediator complex. The Mediator kinases affect RNA polymerase II (pol II) transcription indirectly, through phosphorylation of transcription factors and by controlling Mediator structure and function. In this review, we discuss cellular roles of the Mediator kinases and mechanisms that enable their biological functions. We focus on sequence-specific, DNA-binding transcription factors and other Mediator kinase substrates, and how CDK8 or CDK19 may enable metabolic and transcriptional reprogramming through enhancers and chromatin looping. We also summarize Mediator kinase inhibitors and their therapeutic potential. Throughout, we note conserved and divergent functions between yeast and mammalian CDK8, and highlight many aspects of kinase module function that remain enigmatic, ranging from potential roles in pol II promoter-proximal pausing to liquid-liquid phase separation.
Affiliation(s)
- Charli B Fant
  - Department of Biochemistry, University of Colorado, Boulder, CO, USA
- Dylan J Taatjes
  - Department of Biochemistry, University of Colorado, Boulder, CO, USA
|
20
|
Dynamic evolution of regulatory element ensembles in primate CD4+ T cells. Nat Ecol Evol 2018; 2:537-548. [PMID: 29379187 DOI: 10.1038/s41559-017-0447-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Received: 10/31/2017] [Accepted: 12/08/2017] [Indexed: 12/12/2022]
Abstract
How evolutionary changes at enhancers affect the transcription of target genes remains an important open question. Previous comparative studies of gene expression have largely measured the abundance of messenger RNA, which is affected by post-transcriptional regulatory processes, hence limiting inferences about the mechanisms underlying expression differences. Here, we directly measured nascent transcription in primate species, allowing us to separate transcription from post-transcriptional regulation. We used precision run-on and sequencing to map RNA polymerases in resting and activated CD4+ T cells in multiple human, chimpanzee and rhesus macaque individuals, with rodents as outgroups. We observed general conservation in coding and non-coding transcription, punctuated by numerous differences between species, particularly at distal enhancers and non-coding RNAs. Genes regulated by larger numbers of enhancers are more frequently transcribed at evolutionarily stable levels, despite reduced conservation at individual enhancers. Adaptive nucleotide substitutions are associated with lineage-specific transcription and at one locus, SGPP2, we predict and experimentally validate that multiple substitutions contribute to human-specific transcription. Collectively, our findings suggest a pervasive role for evolutionary compensation across ensembles of enhancers that jointly regulate target genes.
|