1
|
Lee SA, Kristjánsdóttir K, Kwak H. eRNA co-expression network uncovers TF dependency and convergent cooperativity. Sci Rep 2023; 13:19085. [PMID: 37925545 PMCID: PMC10625640 DOI: 10.1038/s41598-023-46415-2] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2023] [Accepted: 10/31/2023] [Indexed: 11/06/2023] Open
Abstract
Enhancer RNAs (eRNAs) are non-coding RNAs produced by transcriptional enhancers that are highly correlated with their activity. Using a capped nascent RNA sequencing (PRO-cap) dataset in human lymphoblastoid cell lines across 67 individuals, we identified inter-individual variation in the expression of over 80 thousand transcribed transcriptional regulatory elements (tTREs), in both enhancers and promoters. Co-expression analysis of eRNAs from tTREs across individuals revealed how enhancers are associated with each other and with promoters. Mid- to long-range co-expression showed a distance-dependent decay that was modified by TF occupancy. In particular, we found a class of "bivalent" TFs, including Cohesin, that both facilitate and isolate the interaction between enhancers and/or promoters, depending on their topology. At short distances, we observed strand-specific correlations between nearby eRNAs in both convergent and divergent orientations. Our results support a cooperative model of convergent eRNAs, consistent with eRNAs facilitating adjacent enhancers rather than interfering with each other. Therefore, our approach to infer functional interactions from co-expression analyses provided novel insights into the principles of enhancer interactions as a function of distance, orientation, and binding landscapes of TFs.
Collapse
Affiliation(s)
- Seungha Alisa Lee
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14850, USA
| | - Katla Kristjánsdóttir
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14850, USA
| | - Hojoong Kwak
- Department of Molecular Biology and Genetics, Cornell University, Ithaca, NY, 14850, USA.
| |
Collapse
|
2
|
Nikumbh S, Lenhard B. Identifying promoter sequence architectures via a chunking-based algorithm using non-negative matrix factorisation. PLoS Comput Biol 2023; 19:e1011491. [PMID: 37983292 PMCID: PMC10695386 DOI: 10.1371/journal.pcbi.1011491] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 12/04/2023] [Accepted: 09/05/2023] [Indexed: 11/22/2023] Open
Abstract
Core promoters are stretches of DNA at the beginning of genes that contain information that facilitates the binding of transcription initiation complexes. Different functional subsets of genes have core promoters with distinct architectures and characteristic motifs. Some of these motifs inform the selection of transcription start sites (TSS). By discovering motifs with fixed distances from known TSS positions, we could in principle classify promoters into different functional groups. Due to the variability and overlap of architectures, promoter classification is a difficult task that requires new approaches. In this study, we present a new method based on non-negative matrix factorisation (NMF) and the associated software called seqArchR that clusters promoter sequences based on their motifs at near-fixed distances from a reference point, such as TSS. When combined with experimental data from CAGE, seqArchR can efficiently identify TSS-directing motifs, including known ones like TATA, DPE, and nucleosome positioning signal, as well as novel lineage-specific motifs and the function of genes associated with them. By using seqArchR on developmental time courses, we reveal how relative use of promoter architectures changes over time with stage-specific expression. seqArchR is a powerful tool for initial genome-wide classification and functional characterisation of promoters. Its use cases are more general: it can also be used to discover any motifs at near-fixed distances from a reference point, even if they are present in only a small subset of sequences.
Collapse
Affiliation(s)
- Sarvesh Nikumbh
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, United Kingdom
| | - Boris Lenhard
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London, United Kingdom
- Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, Hammersmith Hospital Campus, London, United Kingdom
| |
Collapse
|
3
|
Bernardini A, Hollinger C, Willgenss D, Müller F, Devys D, Tora L. Transcription factor IID parks and drives preinitiation complexes at sharp or broad promoters. Trends Biochem Sci 2023; 48:839-848. [PMID: 37574371 PMCID: PMC10529448 DOI: 10.1016/j.tibs.2023.07.009] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 07/19/2023] [Accepted: 07/19/2023] [Indexed: 08/15/2023]
Abstract
Core promoters are sites where transcriptional regulatory inputs of a gene are integrated to direct the assembly of the preinitiation complex (PIC) and RNA polymerase II (Pol II) transcription output. Until now, core promoter functions have been investigated by distinct methods, including Pol II transcription initiation site mappings and structural characterization of PICs on distinct promoters. Here, we bring together these previously unconnected observations and hypothesize how, on metazoan TATA promoters, the precisely structured building up of transcription factor (TF) IID-based PICs results in sharp transcription start site (TSS) selection; or, in contrast, how the less strictly controlled positioning of the TATA-less promoter DNA relative to TFIID-core PIC components results in alternative broad TSS selections by Pol II.
Collapse
Affiliation(s)
- Andrea Bernardini
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67404 Illkirch, France; Centre National de la Recherche Scientifique, UMR7104, 67404 Illkirch, France; Institut National de la Santé et de la Recherche Médicale, U1258, 67404 Illkirch, France; Université de Strasbourg, 67404 Illkirch, France
| | | | | | - Ferenc Müller
- Institute of Cancer and Genomic Sciences, College of Medical and Dental Sciences, University of Birmingham, Birmingham, UK
| | - Didier Devys
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67404 Illkirch, France; Centre National de la Recherche Scientifique, UMR7104, 67404 Illkirch, France; Institut National de la Santé et de la Recherche Médicale, U1258, 67404 Illkirch, France; Université de Strasbourg, 67404 Illkirch, France.
| | - László Tora
- Institut de Génétique et de Biologie Moléculaire et Cellulaire, 67404 Illkirch, France; Centre National de la Recherche Scientifique, UMR7104, 67404 Illkirch, France; Institut National de la Santé et de la Recherche Médicale, U1258, 67404 Illkirch, France; Université de Strasbourg, 67404 Illkirch, France.
| |
Collapse
|
4
|
Herbert A. Flipons and small RNAs accentuate the asymmetries of pervasive transcription by the reset and sequence-specific microcoding of promoter conformation. J Biol Chem 2023; 299:105140. [PMID: 37544644 PMCID: PMC10474125 DOI: 10.1016/j.jbc.2023.105140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 07/25/2023] [Accepted: 07/31/2023] [Indexed: 08/08/2023] Open
Abstract
The role of alternate DNA conformations such as Z-DNA in the regulation of transcription is currently underappreciated. These structures are encoded by sequences called flipons, many of which are enriched in promoter and enhancer regions. Through a change in their conformation, flipons provide a tunable mechanism to mechanically reset promoters for the next round of transcription. They act as actuators that capture and release energy to ensure that the turnover of the proteins at promoters is optimized to cell state. Likewise, the single-stranded DNA formed as flipons cycle facilitates the docking of RNAs that are able to microcode promoter conformations and canalize the pervasive transcription commonly observed in metazoan genomes. The strand-specific nature of the interaction between RNA and DNA likely accounts for the known asymmetry of epigenetic marks present on the histone tetramers that pair to form nucleosomes. The role of these supercoil-dependent processes in promoter choice and transcriptional interference is reviewed. The evolutionary implications are examined: the resilience and canalization of flipon-dependent gene regulation is contrasted with the rapid adaptation enabled by the spread of flipon repeats throughout the genome. Overall, the current findings underscore the important role of flipons in modulating the readout of genetic information and how little we know about their biology.
Collapse
Affiliation(s)
- Alan Herbert
- Discovery Division, InsideOutBio, Charlestown, Massachusetts, USA.
| |
Collapse
|
5
|
Guzman C, Duttke S, Zhu Y, De Arruda Saldanha C, Downes N, Benner C, Heinz S. Combining TSS-MPRA and sensitive TSS profile dissimilarity scoring to study the sequence determinants of transcription initiation. Nucleic Acids Res 2023; 51:e80. [PMID: 37403796 PMCID: PMC10450201 DOI: 10.1093/nar/gkad562] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 06/13/2023] [Accepted: 06/20/2023] [Indexed: 07/06/2023] Open
Abstract
Cis-regulatory elements (CREs) can be classified by the shapes of their transcription start site (TSS) profiles, which are indicative of distinct regulatory mechanisms. Massively parallel reporter assays (MPRAs) are increasingly being used to study CRE regulatory mechanisms, yet the degree to which MPRAs replicate individual endogenous TSS profiles has not been determined. Here, we present a new low-input MPRA protocol (TSS-MPRA) that enables measuring TSS profiles of episomal reporters as well as after lentiviral reporter chromatinization. To sensitively compare MPRA and endogenous TSS profiles, we developed a novel dissimilarity scoring algorithm (WIP score) that outperforms the frequently used earth mover's distance on experimental data. Using TSS-MPRA and WIP scoring on 500 unique reporter inserts, we found that short (153 bp) MPRA promoter inserts replicate the endogenous TSS patterns of ∼60% of promoters. Lentiviral reporter chromatinization did not improve fidelity of TSS-MPRA initiation patterns, and increasing insert size frequently led to activation of extraneous TSS in the MPRA that are not active in vivo. We discuss the implications of our findings, which highlight important caveats when using MPRAs to study transcription mechanisms. Finally, we illustrate how TSS-MPRA and WIP scoring can provide novel insights into the impact of transcription factor motif mutations and genetic variants on TSS patterns and transcription levels.
Collapse
Affiliation(s)
- Carlos Guzman
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
- Department of Bioengineering, Graduate Program in Bioinformatics & Systems Biology, U.C. San Diego, La Jolla, CA 92093, USA
| | - Sascha Duttke
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Yixin Zhu
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Camila De Arruda Saldanha
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Nicholas L Downes
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Christopher Benner
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| | - Sven Heinz
- Department of Medicine, Division of Endocrinology, U.C. San Diego School of Medicine, La Jolla, CA 92093, USA
| |
Collapse
|
6
|
Brooks EG, Elorriaga E, Liu Y, Duduit JR, Yuan G, Tsai CJ, Tuskan GA, Ranney TG, Yang X, Liu W. Plant Promoters and Terminators for High-Precision Bioengineering. BIODESIGN RESEARCH 2023; 5:0013. [PMID: 37849460 PMCID: PMC10328392 DOI: 10.34133/bdr.0013] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2023] [Accepted: 06/12/2023] [Indexed: 10/19/2023] Open
Abstract
High-precision bioengineering and synthetic biology require fine-tuning gene expression at both transcriptional and posttranscriptional levels. Gene transcription is tightly regulated by promoters and terminators. Promoters determine the timing, tissues and cells, and levels of the expression of genes. Terminators mediate transcription termination of genes and affect mRNA levels posttranscriptionally, e.g., the 3'-end processing, stability, translation efficiency, and nuclear to cytoplasmic export of mRNAs. The promoter and terminator combination affects gene expression. In the present article, we review the function and features of plant core promoters, proximal and distal promoters, and terminators, and their effects on and benchmarking strategies for regulating gene expression.
Collapse
Affiliation(s)
- Emily G. Brooks
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27607, USA
| | - Estefania Elorriaga
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27607, USA
| | - Yang Liu
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | - James R. Duduit
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27607, USA
| | - Guoliang Yuan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | - Chung-Jui Tsai
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Warnell School of Forestry and Natural Resource, University of Georgia, Athens, GA 30602, USA
- Department of Plant Biology, University of Georgia, Athens, GA 30602, USA
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Gerald A. Tuskan
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
| | - Thomas G. Ranney
- Mountain Crop Improvement Lab, Department of Horticultural Science, Mountain Horticultural Crops Research and Extension Center, North Carolina State University, Mills River, NC 28759, USA
| | - Xiaohan Yang
- The Center for Bioenergy Innovation, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA
- Warnell School of Forestry and Natural Resource, University of Georgia, Athens, GA 30602, USA
| | - Wusheng Liu
- Department of Horticultural Science, North Carolina State University, Raleigh, NC 27607, USA
| |
Collapse
|
7
|
Dresch JM, Conrad RD, Klonaros D, Drewell RA. Investigating the sequence landscape in the Drosophila initiator core promoter element using an enhanced MARZ algorithm. PeerJ 2023; 11:e15597. [PMID: 37366427 PMCID: PMC10290830 DOI: 10.7717/peerj.15597] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 05/29/2023] [Indexed: 06/28/2023] Open
Abstract
The core promoter elements are important DNA sequences for the regulation of RNA polymerase II transcription in eukaryotic cells. Despite the broad evolutionary conservation of these elements, there is extensive variation in the nucleotide composition of the actual sequences. In this study, we aim to improve our understanding of the complexity of this sequence variation in the TATA box and initiator core promoter elements in Drosophila melanogaster. Using computational approaches, including an enhanced version of our previously developed MARZ algorithm that utilizes gapped nucleotide matrices, several sequence landscape features are uncovered, including an interdependency between the nucleotides in position 2 and 5 in the initiator. Incorporating this information in an expanded MARZ algorithm improves predictive performance for the identification of the initiator element. Overall our results demonstrate the need to carefully consider detailed sequence composition features in core promoter elements in order to make more robust and accurate bioinformatic predictions.
Collapse
|
8
|
Hamamoto K, Umemura Y, Makino S, Fukaya T. Dynamic interplay between non-coding enhancer transcription and gene activity in development. Nat Commun 2023; 14:826. [PMID: 36805453 PMCID: PMC9941499 DOI: 10.1038/s41467-023-36485-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2022] [Accepted: 02/03/2023] [Indexed: 02/22/2023] Open
Abstract
Non-coding transcription at the intergenic regulatory regions is a prevalent feature of metazoan genomes, but its biological function remains uncertain. Here, we devise a live-imaging system that permits simultaneous visualization of gene activity along with intergenic non-coding transcription at single-cell resolution in Drosophila. Quantitative image analysis reveals that elongation of RNA polymerase II across the internal core region of enhancers leads to suppression of transcriptional bursting from linked genes. Super-resolution imaging and genome-editing analysis further demonstrate that enhancer transcription antagonizes molecular crowding of transcription factors, thereby interrupting the formation of a transcription hub at the gene locus. We also show that a certain class of developmental enhancers are structurally optimized to co-activate gene transcription together with non-coding transcription effectively. We suggest that enhancer function is flexibly tunable through the modulation of hub formation via surrounding non-coding transcription during development.
Collapse
Affiliation(s)
- Kota Hamamoto
- Laboratory of Transcription Dynamics, Research Center for Biological Visualization, Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.,Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Yusuke Umemura
- Laboratory of Transcription Dynamics, Research Center for Biological Visualization, Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.,Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Shiho Makino
- Laboratory of Transcription Dynamics, Research Center for Biological Visualization, Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan
| | - Takashi Fukaya
- Laboratory of Transcription Dynamics, Research Center for Biological Visualization, Institute for Quantitative Biosciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan. .,Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, Japan.
| |
Collapse
|
9
|
Arcuschin CD, Pinkasz M, Schor IE. Mechanisms of robustness in gene regulatory networks involved in neural development. Front Mol Neurosci 2023; 16:1114015. [PMID: 36814969 PMCID: PMC9940843 DOI: 10.3389/fnmol.2023.1114015] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 01/16/2023] [Indexed: 02/08/2023] Open
Abstract
The functions of living organisms are affected by different kinds of perturbation, both internal and external, which in many cases have functional effects and phenotypic impact. The effects of these perturbations become particularly relevant for multicellular organisms with complex body patterns and cell type heterogeneity, where transcriptional programs controlled by gene regulatory networks determine, for example, the cell fate during embryonic development. Therefore, an essential aspect of development in these organisms is the ability to maintain the functionality of their genetic developmental programs even in the presence of genetic variation, changing environmental conditions and biochemical noise, a property commonly termed robustness. We discuss the implication of different molecular mechanisms of robustness involved in neurodevelopment, which is characterized by the interplay of many developmental programs at a molecular, cellular and systemic level. We specifically focus on processes affecting the function of gene regulatory networks, encompassing transcriptional regulatory elements and post-transcriptional processes such as miRNA-based regulation, but also higher order regulatory organization, such as gene network topology. We also present cases where impairment of robustness mechanisms can be associated with neurodevelopmental disorders, as well as reasons why understanding these mechanisms should represent an important part of the study of gene regulatory networks driving neural development.
Collapse
Affiliation(s)
- Camila D. Arcuschin
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina,Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Marina Pinkasz
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Ignacio E. Schor
- Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE), Universidad de Buenos Aires—Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina,Departamento de Fisiología, Biología Molecular y Celular, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina,*Correspondence: Ignacio E. Schor ✉
| |
Collapse
|
10
|
Kubota N, Suyama M. Mapping of promoter usage QTL using RNA-seq data reveals their contributions to complex traits. PLoS Comput Biol 2022; 18:e1010436. [PMID: 36037215 PMCID: PMC9462676 DOI: 10.1371/journal.pcbi.1010436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2022] [Revised: 09/09/2022] [Accepted: 07/25/2022] [Indexed: 11/18/2022] Open
Abstract
Genomic variations are associated with gene expression levels, which are called expression quantitative trait loci (eQTL). Most eQTL may affect the total gene expression levels by regulating transcriptional activities of a specific promoter. However, the direct exploration of genomic loci associated with promoter activities using RNA-seq data has been challenging because eQTL analyses treat the total expression levels estimated by summing those of all isoforms transcribed from distinct promoters. Here we propose a new method for identifying genomic loci associated with promoter activities, called promoter usage quantitative trait loci (puQTL), using conventional RNA-seq data. By leveraging public RNA-seq datasets from the lymphoblastoid cell lines of 438 individuals from the GEUVADIS project, we obtained promoter activity estimates and mapped 2,592 puQTL at the 10% FDR level. The results of puQTL mapping enabled us to interpret the manner in which genomic variations regulate gene expression. We found that 310 puQTL genes (16.1%) were not detected by eQTL analysis, suggesting that our pipeline can identify novel variant–gene associations. Furthermore, we identified genomic loci associated with the activity of “hidden” promoters, which the standard eQTL studies have ignored. We found that most puQTL signals were concordant with at least one genome-wide association study (GWAS) signal, enabling novel interpretations of the molecular mechanisms of complex traits. Our results emphasize the importance of the re-analysis of public RNA-seq datasets to obtain novel insights into gene regulation by genomic variations and their contributions to complex traits. Many variations exist in the human genome, creating phenotypic diversity among individuals. It is well known that they are associated with the risk of disease development by affecting the expression levels of genes. Genes are transcribed from regulatory elements called promoters. Although some genes are transcribed from multiple promoters and translated into proteins with different functions, the relationship between genomic variations and promoter activities has not been investigated in depth compared to the relationship between genomic variations and gene expression levels. In this study, we proposed a new method to detect the association between genomic variations and promoter activities. Our method identified the associations between many variations and promoters using genomic and promoter activity data from blood cells of 438 individuals. This study allowed us to identify new functional associations between genomic variations and genes. Furthermore, we identified previously undiscovered variation-gene-disease associations. Our results will help to elucidate the molecular mechanisms of diseases in which genetic factors are involved.
Collapse
Affiliation(s)
- Naoto Kubota
- Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
| | - Mikita Suyama
- Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University, Fukuoka, Japan
- * E-mail:
| |
Collapse
|
11
|
Qi Z, Jung C, Bandilla P, Ludwig C, Heron M, Sophie Kiesel A, Museridze M, Philippou‐Massier J, Nikolov M, Renna Max Schnepf A, Unnerstall U, Ceolin S, Mühlig B, Gompel N, Soeding J, Gaul U. Large‐scale analysis of
Drosophila
core promoter function using synthetic promoters. Mol Syst Biol 2022; 18:e9816. [PMID: 35156763 PMCID: PMC8842121 DOI: 10.15252/msb.20209816] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2020] [Revised: 01/11/2022] [Accepted: 01/13/2022] [Indexed: 02/02/2023] Open
Abstract
The core promoter plays a central role in setting metazoan gene expression levels, but how exactly it “computes” expression remains poorly understood. To dissect its function, we carried out a comprehensive structure–function analysis in Drosophila. First, we performed a genome‐wide bioinformatic analysis, providing an improved picture of the sequence motifs architecture. We then measured synthetic promoters’ activities of ~3,000 mutational variants with and without an external stimulus (hormonal activation), at large scale and with high accuracy using robotics and a dual luciferase reporter assay. We observed a strong impact on activity of the different types of mutations, including knockout of individual sequence motifs and motif combinations, variations of motif strength, nucleosome positioning, and flanking sequences. A linear combination of the individual motif features largely accounts for the combinatorial effects on core promoter activity. These findings shed new light on the quantitative assessment of gene expression in metazoans.
Collapse
Affiliation(s)
- Zhan Qi
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Christophe Jung
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Peter Bandilla
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Claudia Ludwig
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Mark Heron
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Anja Sophie Kiesel
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Mariam Museridze
- Department of Biology II, Evolutionary Biology Ludwig‐Maximilians‐Universität München Planegg‐Martinsried Germany
| | - Julia Philippou‐Massier
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Miroslav Nikolov
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Alessio Renna Max Schnepf
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Ulrich Unnerstall
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| | - Stefano Ceolin
- Department of Biology II, Evolutionary Biology Ludwig‐Maximilians‐Universität München Planegg‐Martinsried Germany
| | - Bettina Mühlig
- Department of Biology II, Evolutionary Biology Ludwig‐Maximilians‐Universität München Planegg‐Martinsried Germany
| | - Nicolas Gompel
- Department of Biology II, Evolutionary Biology Ludwig‐Maximilians‐Universität München Planegg‐Martinsried Germany
| | - Johannes Soeding
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
- Max Planck Institute for Biophysical Chemistry Göttingen Germany
| | - Ulrike Gaul
- Department of Biochemistry, Gene Center Ludwig‐Maximillians‐Universität München Feodor‐Lynen‐str 25 Munich Germany
| |
Collapse
|
12
|
Einarsson H, Salvatore M, Vaagensø C, Alcaraz N, Bornholdt J, Rennie S, Andersson R. Promoter sequence and architecture determine expression variability and confer robustness to genetic variants. eLife 2022; 11:80943. [PMID: 36377861 PMCID: PMC9844987 DOI: 10.7554/elife.80943] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2022] [Accepted: 11/14/2022] [Indexed: 11/16/2022] Open
Abstract
Genetic and environmental exposures cause variability in gene expression. Although most genes are affected in a population, their effect sizes vary greatly, indicating the existence of regulatory mechanisms that could amplify or attenuate expression variability. Here, we investigate the relationship between the sequence and transcription start site architectures of promoters and their expression variability across human individuals. We find that expression variability can be largely explained by a promoter's DNA sequence and its binding sites for specific transcription factors. We show that promoter expression variability reflects the biological process of a gene, demonstrating a selective trade-off between stability for metabolic genes and plasticity for responsive genes and those involved in signaling. Promoters with a rigid transcription start site architecture are more prone to have variable expression and to be associated with genetic variants with large effect sizes, while a flexible usage of transcription start sites within a promoter attenuates expression variability and limits genotypic effects. Our work provides insights into the variable nature of responsive genes and reveals a novel mechanism for supplying transcriptional and mutational robustness to essential genes through multiple transcription start site regions within a promoter.
Collapse
Affiliation(s)
| | - Marco Salvatore
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | | | - Nicolas Alcaraz
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Jette Bornholdt
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Sarah Rennie
- Department of Biology, University of CopenhagenCopenhagenDenmark
| | - Robin Andersson
- Department of Biology, University of CopenhagenCopenhagenDenmark
| |
Collapse
|
13
|
Lu Z, Berry K, Hu Z, Zhan Y, Ahn TH, Lin Z. TSSr: an R package for comprehensive analyses of TSS sequencing data. NAR Genom Bioinform 2021; 3:lqab108. [PMID: 34805991 PMCID: PMC8598296 DOI: 10.1093/nargab/lqab108] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2021] [Revised: 10/05/2021] [Accepted: 10/27/2021] [Indexed: 12/13/2022] Open
Abstract
Transcription initiation is regulated in a highly organized fashion to ensure proper cellular functions. Accurate identification of transcription start sites (TSSs) and quantitative characterization of transcription initiation activities are fundamental steps for studies of regulated transcriptions and core promoter structures. Several high-throughput techniques have been developed to sequence the very 5'end of RNA transcripts (TSS sequencing) on the genome scale. Bioinformatics tools are essential for processing, analysis, and visualization of TSS sequencing data. Here, we present TSSr, an R package that provides rich functions for mapping TSS and characterizations of structures and activities of core promoters based on all types of TSS sequencing data. Specifically, TSSr implements several newly developed algorithms for accurately identifying TSSs from mapped sequencing reads and inference of core promoters, which are a prerequisite for subsequent functional analyses of TSS data. Furthermore, TSSr also enables users to export various types of TSS data that can be visualized by genome browser for inspection of promoter activities in association with other genomic features, and to generate publication-ready TSS graphs. These user-friendly features could greatly facilitate studies of transcription initiation based on TSS sequencing data. The source code and detailed documentations of TSSr can be freely accessed at https://github.com/Linlab-slu/TSSr.
Collapse
Affiliation(s)
- Zhaolian Lu
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Keenan Berry
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Zhenbin Hu
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Yu Zhan
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Tae-Hyuk Ahn
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, MO 63103, USA
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, MO 63103, USA
| |
Collapse
|
14
|
Allen AM, B Sokolowski M. Expression of the foraging gene in adult Drosophila melanogaster. J Neurogenet 2021; 35:192-212. [PMID: 34382904 PMCID: PMC8846931 DOI: 10.1080/01677063.2021.1941946] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Abstract
The foraging gene in Drosophila melanogaster, which encodes a cGMP-dependent protein kinase, is a highly conserved, complex gene with multiple pleiotropic behavioral and physiological functions in both the larval and adult fly. Adult foraging expression is less well characterized than in the larva. We characterized foraging expression in the brain, gastric system, and reproductive systems using a T2A-Gal4 gene-trap allele. In the brain, foraging expression appears to be restricted to multiple sub-types of glia. This glial-specific cellular localization of foraging was supported by single-cell transcriptomic atlases of the adult brain. foraging is extensively expressed in most cell types in the gastric and reproductive systems. We then mapped multiple cis-regulatory elements responsible for parts of the observed expression patterns by a nested cloned promoter-Gal4 analysis. The mapped cis-regulatory elements were consistently modular when comparing the larval and adult expression patterns. These new data using the T2A-Gal4 gene-trap and cloned foraging promoter fusion GAL4's are discussed with respect to previous work using an anti-FOR antibody, which we show here to be non-specific. Future studies of foraging's function will consider roles for glial subtypes and peripheral tissues (gastric and reproductive systems) in foraging's pleiotropic behavioral and physiological effects.
Collapse
Affiliation(s)
- Aaron M Allen
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Centre for Neural Circuits and Behaviour, University of Oxford, Oxford, UK
| | - Marla B Sokolowski
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada.,Child and Brain Development Program, Canadian Institute for Advanced Research (CIFAR), Toronto, Canada
| |
Collapse
|
15
|
Shih CH, Fay J. Cis-regulatory variants affect gene expression dynamics in yeast. eLife 2021; 10:e68469. [PMID: 34369376 PMCID: PMC8367379 DOI: 10.7554/elife.68469] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2021] [Accepted: 08/06/2021] [Indexed: 12/14/2022] Open
Abstract
Evolution of cis-regulatory sequences depends on how they affect gene expression and motivates both the identification and prediction of cis-regulatory variants responsible for expression differences within and between species. While much progress has been made in relating cis-regulatory variants to expression levels, the timing of gene activation and repression may also be important to the evolution of cis-regulatory sequences. We investigated allele-specific expression (ASE) dynamics within and between Saccharomyces species during the diauxic shift and found appreciable cis-acting variation in gene expression dynamics. Within-species ASE is associated with intergenic variants, and ASE dynamics are more strongly associated with insertions and deletions than ASE levels. To refine these associations, we used a high-throughput reporter assay to test promoter regions and individual variants. Within the subset of regions that recapitulated endogenous expression, we identified and characterized cis-regulatory variants that affect expression dynamics. Between species, chimeric promoter regions generate novel patterns and indicate constraints on the evolution of gene expression dynamics. We conclude that changes in cis-regulatory sequences can tune gene expression dynamics and that the interplay between expression dynamics and other aspects of expression is relevant to the evolution of cis-regulatory sequences.
Collapse
Affiliation(s)
- Ching-Hua Shih
- Department of Biology, University of RochesterRochesterUnited States
| | - Justin Fay
- Department of Biology, University of RochesterRochesterUnited States
| |
Collapse
|
16
|
Anreiter I, Allen AM, Vasquez OE, To L, Douglas SJ, Alvarez JV, Ewer J, Sokolowski MB. The Drosophila foraging gene plays a vital role at the start of metamorphosis for subsequent adult emergence. J Neurogenet 2021; 35:179-191. [PMID: 33944658 DOI: 10.1080/01677063.2021.1914608] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The foraging (for) gene has been extensively studied in many species for its functions in development, physiology, and behavior. It is common for genes that influence behavior and development to be essential genes, and for has been found to be an essential gene in both fruit flies and mammals, with for mutants dying before reaching the adult stage. However, the biological process underlying the lethality associated with this gene is not known. Here, we show that in Drosophila melanogaster, some but not all gene products of for are essential for survival. Specifically, we show that promoter 3 of for, but not promoters 1, 2, and 4 are required for survival past pupal stage. We use full and partial genetic deletions of for, and temperature-restricted knock-down of the gene to further investigate the stage of lethality. While deletion analysis shows that flies lacking for die at the end of pupal development, as pharate adults, temperature-restricted knock-down shows that for is only required at the start of pupal development, for normal adult emergence (AE) and viability. We further show that the inability of these mutants to emerge from their pupal cases is linked to deficiencies in emergence behaviors, caused by a possible energy deficiency, and finally, that the lethality of for mutants seems to be linked to protein isoform P3, transcribed from for promoter 3.
Collapse
Affiliation(s)
- Ina Anreiter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada.,Child and Brain Development Program, Canadian Institute for Advanced Research (CIFAR), Toronto, Canada.,Department of Neurobiology, Stanford University, Stanford, CA, USA
| | - Aaron M Allen
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Centre for Neural Circuits and Behavior, University of Oxford, Oxford, UK
| | - Oscar E Vasquez
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Lydia To
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Scott J Douglas
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
| | - Javier V Alvarez
- Centro Interdisciplinario de Neurociencia de Valparaíso e Instituto de Neurociencia, Universidad de Valparaíso, Valparaíso, Chile
| | - John Ewer
- Centro Interdisciplinario de Neurociencia de Valparaíso e Instituto de Neurociencia, Universidad de Valparaíso, Valparaíso, Chile
| | - Marla B Sokolowski
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada.,Child and Brain Development Program, Canadian Institute for Advanced Research (CIFAR), Toronto, Canada.,Department of Cell and Systems Biology, University of Toronto, Toronto, Canada
| |
Collapse
|
17
|
Brázda V, Bartas M, Bowater RP. Evolution of Diverse Strategies for Promoter Regulation. Trends Genet 2021; 37:730-744. [PMID: 33931265 DOI: 10.1016/j.tig.2021.04.003] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2021] [Revised: 03/31/2021] [Accepted: 04/01/2021] [Indexed: 12/15/2022]
Abstract
DNA is fundamentally important for all cellular organisms due to its role as a store of hereditary genetic information. The precise and accurate regulation of gene transcription depends primarily on promoters, which vary significantly within and between genomes. Some promoters are rich in specific types of bases, while others have more varied, complex sequence characteristics. However, it is not only base sequence but also epigenetic modifications and altered DNA structure that regulate promoter activity. Significantly, many promoters across all organisms contain sequences that can form intrastrand hairpins (cruciforms) or four-stranded structures (G-quadruplex or i-motif). In this review we integrate recent studies on promoter regulation that highlight the importance of DNA structure in the evolutionary adaptation of promoter sequences.
Collapse
Affiliation(s)
- Václav Brázda
- Institute of Biophysics of the Czech Academy of Sciences, Královopolská 135, 612 65 Brno, Czech Republic
| | - Martin Bartas
- Department of Biology and Ecology/Institute of Environmental Technologies, Faculty of Science, University of Ostrava, 710 00 Ostrava, Czech Republic
| | - Richard P Bowater
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich NR4 7TJ, UK.
| |
Collapse
|
18
|
Liu Q, Jiang F, Zhang J, Li X, Kang L. Transcription initiation of distant core promoters in a large-sized genome of an insect. BMC Biol 2021; 19:62. [PMID: 33785021 PMCID: PMC8011201 DOI: 10.1186/s12915-021-01004-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Accepted: 03/16/2021] [Indexed: 12/30/2022] Open
Abstract
Background Core promoters have a substantial influence on various steps of transcription, including initiation, elongation, termination, polyadenylation, and finally, translation. The characterization of core promoters is crucial for exploring the regulatory code of transcription initiation. However, the current understanding of insect core promoters is focused on those of Diptera (especially Drosophila) species with small genome sizes. Results Here, we present an analysis of the transcription start sites (TSSs) in the migratory locust, Locusta migratoria, which has a genome size of 6.5 Gb. The genomic differences, including lower precision of transcription initiation and fewer constraints on the distance from transcription factor binding sites or regulatory elements to TSSs, were revealed in locusts compared with Drosophila insects. Furthermore, we found a distinct bimodal log distribution of the distances from the start codons to the core promoters of locust genes. We found stricter constraints on the exon length of mRNA leaders and widespread expression activity of the distant core promoters in locusts compared with fruit flies. We further compared core promoters in seven arthropod species across a broad range of genome sizes to reinforce our results on the emergence of distant core promoters in large-sized genomes. Conclusions In summary, our results provide novel insights into the effects of genome size expansion on distant transcription initiation. Supplementary Information The online version contains supplementary material available at 10.1186/s12915-021-01004-5.
Collapse
Affiliation(s)
- Qing Liu
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,Sino-Danish College, University of Chinese Academy of Sciences, Beijing, China.,Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Feng Jiang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China.,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China
| | - Jie Zhang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China
| | - Xiao Li
- State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China
| | - Le Kang
- Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China. .,CAS Center for Excellence in Biotic Interactions, University of Chinese Academy of Sciences, Beijing, China. .,State Key Laboratory of Integrated Management of Pest Insects and Rodents, Institute of Zoology, Chinese Academy of Sciences, Beijing, 100101, China.
| |
Collapse
|
19
|
Floc'hlay S, Wong ES, Zhao B, Viales RR, Thomas-Chollier M, Thieffry D, Garfield DA, Furlong EEM. Cis-acting variation is common across regulatory layers but is often buffered during embryonic development. Genome Res 2021; 31:211-224. [PMID: 33310749 PMCID: PMC7849415 DOI: 10.1101/gr.266338.120] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Accepted: 12/09/2020] [Indexed: 12/14/2022]
Abstract
Precise patterns of gene expression are driven by interactions between transcription factors, regulatory DNA sequences, and chromatin. How DNA mutations affecting any one of these regulatory "layers" are buffered or propagated to gene expression remains unclear. To address this, we quantified allele-specific changes in chromatin accessibility, histone modifications, and gene expression in F1 embryos generated from eight Drosophila crosses at three embryonic stages, yielding a comprehensive data set of 240 samples spanning multiple regulatory layers. Genetic variation (allelic imbalance) impacts gene expression more frequently than chromatin features, with metabolic and environmental response genes being most often affected. Allelic imbalance in cis-regulatory elements (enhancers) is common and highly heritable, yet its functional impact does not generally propagate to gene expression. When it does, genetic variation impacts RNA levels through two alternative mechanisms involving either H3K4me3 or chromatin accessibility and H3K27ac. Changes in RNA are more predictive of variation in H3K4me3 than vice versa, suggesting a role for H3K4me3 downstream from transcription. The impact of a substantial proportion of genetic variation is consistent across embryonic stages, with 50% of allelic imbalanced features at one stage being also imbalanced at subsequent developmental stages. Crucially, buffering, as well as the magnitude and evolutionary impact of genetic variants, is influenced by regulatory complexity (i.e., number of enhancers regulating a gene), with transcription factors being most robust to cis-acting, but most influenced by trans-acting, variation.
Collapse
Affiliation(s)
- Swann Floc'hlay
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - Emily S Wong
- Molecular, Structural and Computational Biology Division, Victor Chang Cardiac Research Institute, Darlinghurst, New South Wales 2010, Australia
- School of Biotechnology and Biomolecular Sciences, UNSW Sydney, Kensington, New South Wales 2052, Australia
| | - Bingqing Zhao
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Rebecca R Viales
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Morgane Thomas-Chollier
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
- Institut Universitaire de France (IUF), 75005 Paris, France
| | - Denis Thieffry
- Institut de Biologie de l'ENS (IBENS), École Normale Supérieure, CNRS, INSERM, Université PSL, 75005 Paris, France
| | - David A Garfield
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, D-69117 Heidelberg, Germany
| |
Collapse
|
20
|
Zhang H, Lu Z, Zhan Y, Rodriguez J, Lu C, Xue Y, Lin Z. Distinct roles of nucleosome sliding and histone modifications in controlling the fidelity of transcription initiation. RNA Biol 2021; 18:1642-1652. [PMID: 33280509 DOI: 10.1080/15476286.2020.1860389] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022] Open
Abstract
Regulation of gene expression starts from the transcription initiation. Regulated transcription initiation is critical for generating correct transcripts with proper abundance. The impact of epigenetic control, such as histone modifications and chromatin remodelling, on gene regulation has been extensively investigated, but their specific role in regulating transcription initiation is far from well understood. Here we aimed to better understand the roles of genes involved in histone H3 methylations and chromatin remodelling on the regulation of transcription initiation at a genome-scale using the budding yeast as a study system. We obtained and compared maps of transcription start site (TSS) at single-nucleotide resolution by nAnT-iCAGE for a strain with depletion of MINC (Mot1-Ino80C-Nc2) by Mot1p and Ino80p anchor-away (Mot1&Ino80AA) and a strain with loss of histone methylation (set1Δset2Δdot1Δ) to their wild-type controls. Our study showed that the depletion of MINC stimulated transcription initiation from many new sites flanking the dominant TSS of genes, while the loss of histone methylation generates more TSSs in the coding region. Moreover, the depletion of MINC led to less confined boundaries of TSS clusters (TCs) and resulted in broader core promoters, and such patterns are not present in the ssdΔ mutant. Our data also exhibits that the MINC has distinctive impacts on TATA-containing and TATA-less promoters. In conclusion, our study shows that MINC is required for accurate identification of bona fide TSSs, particularly in TATA-containing promoters, and histone methylation contributes to the repression of transcription initiation in coding regions.
Collapse
Affiliation(s)
- Huiming Zhang
- Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, School of Pharmacy, Jiangsu Ocean University, Lianyungang, China
| | - Zhaolian Lu
- Department of Biology, Saint Louis University, St. Louis, Missouri, USA
| | - Yu Zhan
- Department of Biology, Saint Louis University, St. Louis, Missouri, USA
| | - Judith Rodriguez
- Program of Bioinformatics and Computational Biology, Saint Louis University, St. Louis, Missouri, USA
| | - Chen Lu
- Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, School of Pharmacy, Jiangsu Ocean University, Lianyungang, China.,Jiangsu Institute of Marine Resources Development, Jiangsu Ocean University, Lianyungang, China.,Co-Innovation Center of Jiangsu Marine Bio-industry Technology, Jiangsu Ocean University, Lianyungang, China
| | - Yong Xue
- Jiangsu Key Laboratory of Marine Pharmaceutical Compound Screening, School of Pharmacy, Jiangsu Ocean University, Lianyungang, China.,Co-Innovation Center of Jiangsu Marine Bio-industry Technology, Jiangsu Ocean University, Lianyungang, China
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, Missouri, USA
| |
Collapse
|
21
|
Population-scale study of eRNA transcription reveals bipartite functional enhancer architecture. Nat Commun 2020; 11:5963. [PMID: 33235186 PMCID: PMC7687912 DOI: 10.1038/s41467-020-19829-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Accepted: 10/28/2020] [Indexed: 12/13/2022] Open
Abstract
Enhancer RNAs (eRNA) are unstable non-coding RNAs, transcribed bidirectionally from active regulatory sequences, whose expression levels correlate with enhancer activity. We use capped-nascent-RNA sequencing to efficiently capture bidirectional transcription initiation across several human lymphoblastoid cell lines (Yoruba population) and detect ~75,000 eRNA transcription sites with high sensitivity and specificity. The use of nascent-RNA sequencing sidesteps the confounding effect of eRNA instability. We identify quantitative trait loci (QTLs) associated with the level and directionality of eRNA expression. High-resolution analyses of these two types of QTLs reveal distinct positions of enrichment at the central transcription factor (TF) binding regions and at the flanking eRNA initiation regions, both of which are associated with mRNA expression QTLs. These two regions-the central TF-binding footprint and the eRNA initiation cores-define a bipartite architecture of enhancers, inform enhancer function, and can be used as an indicator of the significance of non-coding regulatory variants.
Collapse
|
22
|
Lu Z, Lin Z. The origin and evolution of a distinct mechanism of transcription initiation in yeasts. Genome Res 2020; 31:51-63. [PMID: 33219055 PMCID: PMC7849388 DOI: 10.1101/gr.264325.120] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 11/17/2020] [Indexed: 12/13/2022]
Abstract
The molecular process of transcription by RNA Polymerase II is highly conserved among eukaryotes (“classic model”). A distinct way of locating transcription start sites (TSSs) has been identified in a budding yeast Saccharomyces cerevisiae (“scanning model”). Herein, we applied genomic approaches to elucidate the origin of the scanning model and its underlying genetic mechanisms. We first identified TSSs at single-nucleotide resolution for 12 yeast species using the nAnT-iCAGE technique, which significantly improved the annotations of these genomes by providing accurate 5′ boundaries for protein-coding genes. We then inferred the initiation mechanism of each species based on its TSS maps and genome sequences. We discovered that the scanning model likely originated after the split of Yarrowia lipolytica and the other budding yeasts. Species that use the scanning model showed an adenine-rich region immediately upstream of the TSS that might facilitate TSS selection. Both initiation mechanisms share a strong preference for pyrimidine–purine dinucleotides surrounding the TSS. Our results suggest that the purine is required to accurately recruit the first nucleotide, thereby increasing the chances of a messenger RNA of being capped during mRNA maturation, which is critical for efficient translation initiation during protein biosynthesis. Based on our findings, we propose a model for TSS selection in the scanning-model species, as well as a model for the stepwise process responsible for the origin and evolution of the scanning model.
Collapse
Affiliation(s)
- Zhaolian Lu
- Department of Biology, Saint Louis University, St. Louis, Missouri 63104, USA
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, Missouri 63104, USA
| |
Collapse
|
23
|
Reddington JP, Garfield DA, Sigalova OM, Karabacak Calviello A, Marco-Ferreres R, Girardot C, Viales RR, Degner JF, Ohler U, Furlong EEM. Lineage-Resolved Enhancer and Promoter Usage during a Time Course of Embryogenesis. Dev Cell 2020; 55:648-664.e9. [PMID: 33171098 DOI: 10.1016/j.devcel.2020.10.009] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2020] [Revised: 08/04/2020] [Accepted: 10/09/2020] [Indexed: 02/01/2023]
Abstract
Enhancers are essential drivers of cell states, yet the relationship between accessibility, regulatory activity, and in vivo lineage commitment during embryogenesis remains poorly understood. Here, we measure chromatin accessibility in isolated neural and mesodermal lineages across a time course of Drosophila embryogenesis. Promoters, including tissue-specific genes, are often constitutively open, even in contexts where the gene is not expressed. In contrast, the majority of distal elements have dynamic, tissue-specific accessibility. Enhancer priming appears rarely within a lineage, perhaps reflecting the speed of Drosophila embryogenesis. However, many tissue-specific enhancers are accessible in other lineages early on and become progressively closed as embryogenesis proceeds. We demonstrate the usefulness of this tissue- and time-resolved resource to definitively identify single-cell clusters, to uncover predictive motifs, and to identify many regulators of tissue development. For one such predicted neural regulator, l(3)neo38, we generate a loss-of-function mutant and uncover an essential role for neuromuscular junction and brain development.
Collapse
Affiliation(s)
- James P Reddington
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - David A Garfield
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - Olga M Sigalova
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | | | - Raquel Marco-Ferreres
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - Charles Girardot
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - Rebecca R Viales
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - Jacob F Degner
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany
| | - Uwe Ohler
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine, Berlin, Germany
| | - Eileen E M Furlong
- European Molecular Biology Laboratory (EMBL), Genome Biology Unit, 69117 Heidelberg, Germany.
| |
Collapse
|
24
|
Liu J, Frochaux M, Gardeux V, Deplancke B, Robinson-Rechavi M. Inter-embryo gene expression variability recapitulates the hourglass pattern of evo-devo. BMC Biol 2020; 18:129. [PMID: 32950053 PMCID: PMC7502200 DOI: 10.1186/s12915-020-00842-z] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Accepted: 08/07/2020] [Indexed: 02/07/2023] Open
Abstract
BACKGROUND The evolution of embryological development has long been characterized by deep conservation. In animal development, the phylotypic stage in mid-embryogenesis is more conserved than either early or late stages among species within the same phylum. Hypotheses to explain this hourglass pattern have focused on purifying the selection of gene regulation. Here, we propose an alternative-genes are regulated in different ways at different stages and have different intrinsic capacities to respond to perturbations on gene expression. RESULTS To eliminate the influence of natural selection, we quantified the expression variability of isogenetic single embryo transcriptomes throughout fly Drosophila melanogaster embryogenesis. We found that the expression variability is lower at the phylotypic stage, supporting that the underlying regulatory architecture in this stage is more robust to stochastic variation on gene expression. We present evidence that the phylotypic stage is also robust to genetic variations on gene expression. Moreover, chromatin regulation appears to play a key role in the variation and evolution of gene expression. CONCLUSIONS We suggest that a phylum-level pattern of embryonic conservation can be explained by the intrinsic difference of gene regulatory mechanisms in different stages.
Collapse
Affiliation(s)
- Jialin Liu
- Department of Ecology and Evolution, University of Lausanne, 1015, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.
| | - Michael Frochaux
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Vincent Gardeux
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Bart Deplancke
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
- Laboratory of Systems Biology and Genetics, Institute of Bioengineering, School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Marc Robinson-Rechavi
- Department of Ecology and Evolution, University of Lausanne, 1015, Lausanne, Switzerland.
- Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland.
| |
Collapse
|
25
|
Sigalova OM, Shaeiri A, Forneris M, Furlong EEM, Zaugg JB. Predictive features of gene expression variation reveal mechanistic link with differential expression. Mol Syst Biol 2020; 16:e9539. [PMID: 32767663 PMCID: PMC7411568 DOI: 10.15252/msb.20209539] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Revised: 06/26/2020] [Accepted: 06/30/2020] [Indexed: 12/22/2022] Open
Abstract
For most biological processes, organisms must respond to extrinsic cues, while maintaining essential gene expression programmes. Although studied extensively in single cells, it is still unclear how variation is controlled in multicellular organisms. Here, we used a machine-learning approach to identify genomic features that are predictive of genes with high versus low variation in their expression across individuals, using bulk data to remove stochastic cell-to-cell variation. Using embryonic gene expression across 75 Drosophila isogenic lines, we identify features predictive of expression variation (controlling for expression level), many of which are promoter-related. Genes with low variation fall into two classes reflecting different mechanisms to maintain robust expression, while genes with high variation seem to lack both types of stabilizing mechanisms. Applying this framework to humans revealed similar predictive features, indicating that promoter architecture is an ancient mechanism to control expression variation. Remarkably, expression variation features could also partially predict differential expression after diverse perturbations in both Drosophila and humans. Differential gene expression signatures may therefore be partially explained by genetically encoded gene-specific features, unrelated to the studied treatment.
Collapse
Affiliation(s)
- Olga M Sigalova
- Genome Biology UnitEuropean Molecular Biology Laboratory (EMBL)HeidelbergGermany
| | - Amirreza Shaeiri
- Structures and Computational Biology UnitEuropean Molecular Biology Laboratory (EMBL)HeidelbergGermany
| | - Mattia Forneris
- Genome Biology UnitEuropean Molecular Biology Laboratory (EMBL)HeidelbergGermany
| | - Eileen EM Furlong
- Genome Biology UnitEuropean Molecular Biology Laboratory (EMBL)HeidelbergGermany
| | - Judith B Zaugg
- Structures and Computational Biology UnitEuropean Molecular Biology Laboratory (EMBL)HeidelbergGermany
| |
Collapse
|
26
|
Seto K, Mok W, Stone J. Bridging the gap between theory and practice in elucidating modular gene regulatory sequence organisation within genomes. Genome 2020; 63:281-289. [PMID: 32114793 DOI: 10.1139/gen-2019-0150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Changes to promoter regions probably have been responsible for many morphological evolutionary transitions, especially in animals. This idea is becoming testable, as data from genome projects amass and enable bioinformaticians to conduct comparative sequence analyses and test for correlations between genotypic similarities or differences and phenotypic likeness or disparity. Although such practical pursuits have initiated some theoretical considerations, a conceptual framework for understanding promoter region evolution, potentially effecting morphological evolution, is only starting to emerge, predominantly resulting from computational research. We contribute to this framework by specifying three big problems for promoter region research; reviewing computational research on promoter region evolution; and exemplifying a topic for future promoter region research - module evolution.
Collapse
Affiliation(s)
- Kelly Seto
- Department of Molecular & Medical Genetics, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Wendy Mok
- Department of Molecular Biology & Biophysics, University of Connecticut Health, Farmington, CT 06032, USA
| | - Jonny Stone
- Department of Biology, McMaster University, Hamilton, ON L8S 4K1, Canada; SHARCNet, McMaster University, Hamilton, ON L8S 4L8, Canada; Origins Institute, McMaster University, Hamilton, ON L8S 4M1, Canada
| |
Collapse
|
27
|
Thodberg M, Thieffry A, Vitting-Seerup K, Andersson R, Sandelin A. CAGEfightR: analysis of 5'-end data using R/Bioconductor. BMC Bioinformatics 2019; 20:487. [PMID: 31585526 PMCID: PMC6778389 DOI: 10.1186/s12859-019-3029-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2019] [Accepted: 08/15/2019] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND 5'-end sequencing assays, and Cap Analysis of Gene Expression (CAGE) in particular, have been instrumental in studying transcriptional regulation. 5'-end methods provide genome-wide maps of transcription start sites (TSSs) with base pair resolution. Because active enhancers often feature bidirectional TSSs, such data can also be used to predict enhancer candidates. The current availability of mature and comprehensive computational tools for the analysis of 5'-end data is limited, preventing efficient analysis of new and existing 5'-end data. RESULTS We present CAGEfightR, a framework for analysis of CAGE and other 5'-end data implemented as an R/Bioconductor-package. CAGEfightR can import data from BigWig files and allows for fast and memory efficient prediction and analysis of TSSs and enhancers. Downstream analyses include quantification, normalization, annotation with transcript and gene models, TSS shape statistics, linking TSSs to enhancers via co-expression, identification of enhancer clusters, and genome-browser style visualization. While built to analyze CAGE data, we demonstrate the utility of CAGEfightR in analyzing nascent RNA 5'-data (PRO-Cap). CAGEfightR is implemented using standard Bioconductor classes, making it easy to learn, use and combine with other Bioconductor packages, for example popular differential expression tools such as limma, DESeq2 and edgeR. CONCLUSIONS CAGEfightR provides a single, scalable and easy-to-use framework for comprehensive downstream analysis of 5'-end data. CAGEfightR is designed to be interoperable with other Bioconductor packages, thereby unlocking hundreds of mature transcriptomic analysis tools for 5'-end data. CAGEfightR is freely available via Bioconductor: bioconductor.org/packages/CAGEfightR .
Collapse
Affiliation(s)
- Malte Thodberg
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark.
- Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark.
| | - Axel Thieffry
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark
- Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark
| | - Kristoffer Vitting-Seerup
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark
- Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark
- Danish Cancer Society, Strandboulevarden 49 DK2100, Copenhagen Ø, Denmark
| | - Robin Andersson
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark
| | - Albin Sandelin
- Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark.
- Biotech Research and Innovation Centre, University of Copenhagen, Ole Maaløes Vej 5, DK2100, Copenhagen N, Denmark.
| |
Collapse
|
28
|
The RNA Polymerase II Core Promoter in Drosophila. Genetics 2019; 212:13-24. [PMID: 31053615 DOI: 10.1534/genetics.119.302021] [Citation(s) in RCA: 51] [Impact Index Per Article: 10.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2018] [Accepted: 03/05/2019] [Indexed: 11/18/2022] Open
Abstract
Transcription by RNA polymerase II initiates at the core promoter, which is sometimes referred to as the "gateway to transcription." Here, we describe the properties of the RNA polymerase II core promoter in Drosophila The core promoter is at a strategic position in the expression of genes, as it is the site of convergence of the signals that lead to transcriptional activation. Importantly, core promoters are diverse in terms of their structure and function. They are composed of various combinations of sequence motifs such as the TATA box, initiator (Inr), and downstream core promoter element (DPE). Different types of core promoters are transcribed via distinct mechanisms. Moreover, some transcriptional enhancers exhibit specificity for particular types of core promoters. These findings indicate that the core promoter is a central component of the transcriptional apparatus that regulates gene expression.
Collapse
|
29
|
Thodberg M, Thieffry A, Bornholdt J, Boyd M, Holmberg C, Azad A, Workman CT, Chen Y, Ekwall K, Nielsen O, Sandelin A. Comprehensive profiling of the fission yeast transcription start site activity during stress and media response. Nucleic Acids Res 2019; 47:1671-1691. [PMID: 30566651 PMCID: PMC6393241 DOI: 10.1093/nar/gky1227] [Citation(s) in RCA: 21] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Revised: 11/09/2018] [Accepted: 11/26/2018] [Indexed: 12/11/2022] Open
Abstract
Fission yeast, Schizosaccharomyces pombe, is an attractive model organism for transcriptional and chromatin biology research. Such research is contingent on accurate annotation of transcription start sites (TSSs). However, comprehensive genome-wide maps of TSSs and their usage across commonly applied laboratory conditions and treatments for S. pombe are lacking. To this end, we profiled TSS activity genome-wide in S. pombe cultures exposed to heat shock, nitrogen starvation, hydrogen peroxide and two commonly applied media, YES and EMM2, using Cap Analysis of Gene Expression (CAGE). CAGE-based annotation of TSSs is substantially more accurate than existing PomBase annotation; on average, CAGE TSSs fall 50–75 bp downstream of PomBase TSSs and co-localize with nucleosome boundaries. In contrast to higher eukaryotes, dispersed TSS distributions are not common in S. pombe. Our data recapitulate known S. pombe stress expression response patterns and identify stress- and media-responsive alternative TSSs. Notably, alteration of growth medium induces changes of similar magnitude as some stressors. We show a link between nucleosome occupancy and genetic variation, and that the proximal promoter region is genetically diverse between S. pombe strains. Our detailed TSS map constitutes a central resource for S. pombe gene regulation research.
Collapse
Affiliation(s)
- Malte Thodberg
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Axel Thieffry
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Jette Bornholdt
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Mette Boyd
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Christian Holmberg
- Department of Biology, Cell cycle and genome stability Group, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Ajuna Azad
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Christopher T Workman
- Department of Biotechnology and Biomedicine, Technical University of Denmark, DK2800 Kongens Lyngby, Denmark
| | - Yun Chen
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
| | - Karl Ekwall
- Department of Biosciences and Nutrition, Karolinska Institute, SE14183 Huddinge, Sweden
| | - Olaf Nielsen
- Department of Biology, Cell cycle and genome stability Group, University of Copenhagen, DK2100 Copenhagen N, Denmark
- Correspondence may also be addressed to Olaf Nielsen. Tel: +45 26 41 06 66; Fax: +45 3532 1281;
| | - Albin Sandelin
- Department of Biology and Biotech Research and Innovation Centre, The Bioinformatics Centre, University of Copenhagen, DK2100 Copenhagen N, Denmark
- To whom correspondence should be addressed. Tel: +45 3532 1281; Fax: +45 3532 1281;
| |
Collapse
|
30
|
Bhardwaj V, Semplicio G, Erdogdu NU, Manke T, Akhtar A. MAPCap allows high-resolution detection and differential expression analysis of transcription start sites. Nat Commun 2019; 10:3219. [PMID: 31363093 PMCID: PMC6667505 DOI: 10.1038/s41467-019-11115-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2019] [Accepted: 06/20/2019] [Indexed: 01/06/2023] Open
Abstract
The position, shape and number of transcription start sites (TSS) are critical determinants of gene regulation. Most methods developed to detect TSSs and study promoter usage are, however, of limited use in studies that demand quantification of expression changes between two or more groups. In this study, we combine high-resolution detection of transcription start sites and differential expression analysis using a simplified TSS quantification protocol, MAPCap (Multiplexed Affinity Purification of Capped RNA) along with the software icetea. Applying MAPCap on developing Drosophila melanogaster embryos and larvae, we detected stage and sex-specific promoter and enhancer activity and quantify the effect of mutants of maleless (MLE) helicase at X-chromosomal promoters. We observe that MLE mutation leads to a median 1.9 fold drop in expression of X-chromosome promoters and affects the expression of several TSSs with a sexually dimorphic expression on autosomes. Our results provide quantitative insights into promoter activity during dosage compensation. The position, shape and number of transcription start sites (TSS) regulate gene expression. Here authors present MAPCap, a method for high-resolution detection and differential expression analysis of TSS, and apply MAPCap to early fly development, detecting stage and sex-specific promoter and enhancer activity.
Collapse
Affiliation(s)
- Vivek Bhardwaj
- Max Planck Institute for Immunobiology and Epigenetics, 79108, Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104, Freiburg, Germany
| | - Giuseppe Semplicio
- Max Planck Institute for Immunobiology and Epigenetics, 79108, Freiburg, Germany
| | - Niyazi Umut Erdogdu
- Max Planck Institute for Immunobiology and Epigenetics, 79108, Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79104, Freiburg, Germany
| | - Thomas Manke
- Max Planck Institute for Immunobiology and Epigenetics, 79108, Freiburg, Germany
| | - Asifa Akhtar
- Max Planck Institute for Immunobiology and Epigenetics, 79108, Freiburg, Germany.
| |
Collapse
|
31
|
The landscape of transcription initiation across latent and lytic KSHV genomes. PLoS Pathog 2019; 15:e1007852. [PMID: 31188901 PMCID: PMC6590836 DOI: 10.1371/journal.ppat.1007852] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2019] [Revised: 06/24/2019] [Accepted: 05/20/2019] [Indexed: 11/19/2022] Open
Abstract
Precise promoter annotation is required for understanding the mechanistic basis of transcription initiation. In the context of complex genomes, such as herpesviruses where there is extensive genic overlap, identification of transcription start sites (TSSs) is particularly problematic and cannot be comprehensively accessed by standard RNA sequencing approaches. Kaposi's sarcoma-associated herpesvirus (KSHV) is an oncogenic gammaherpesvirus and the etiological agent of Kaposi's sarcoma and the B cell lymphoma primary effusion lymphoma (PEL). Here, we leverage RNA annotation and mapping of promoters for analysis of gene expression (RAMPAGE) and define KSHV TSSs transcriptome-wide and at nucleotide resolution in two widely used models of KSHV infection, namely iSLK.219 cells and the PEL cell line TREx-BCBL1-RTA. By mapping TSSs over a 96 h time course of reactivation we confirm 48 of 50 previously identified TSSs. Moreover, we identify over 100 novel transcription start site clusters (TSCs) in each cell line. Our analyses identified cell-type specific differences in TSC positions as well as promoter strength, and defined motifs within viral core promoters. Collectively, by defining TSSs at high resolution we have greatly expanded the transcriptional landscape of the KSHV genome and identified transcriptional control mechanisms at play during KSHV lytic reactivation.
Collapse
|
32
|
Lu Z, Lin Z. Pervasive and dynamic transcription initiation in Saccharomyces cerevisiae. Genome Res 2019; 29:1198-1210. [PMID: 31076411 PMCID: PMC6633255 DOI: 10.1101/gr.245456.118] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2018] [Accepted: 05/07/2019] [Indexed: 12/15/2022]
Abstract
Transcription initiation is finely regulated to ensure proper expression and function of genes. The regulated transcription initiation in response to various environmental stimuli in a classic model organism Saccharomyces cerevisiae has not been systematically investigated. In this study, we generated quantitative maps of transcription start sites (TSSs) at a single-nucleotide resolution for S. cerevisiae grown in nine different conditions using no-amplification nontagging Cap analysis of gene expression (nAnT-iCAGE) sequencing. We mapped ∼1 million well-supported TSSs, suggesting highly pervasive transcription initiation in the compact genome of the budding yeast. The comprehensive TSS maps allowed us to identify core promoters for ∼96% verified protein-coding genes. We corrected misannotation of translation start codon for 122 genes and suggested an alternative start codon for 57 genes. We found that 56% of yeast genes are controlled by multiple core promoters, and alternative core promoter usage by a gene is widespread in response to changing environments. Most core promoter shifts are coupled with altered gene expression, indicating that alternative core promoter usage might play an important role in controlling gene transcriptional activities. Based on their activities in responding to environmental cues, we divided core promoters into constitutive class (55%) and inducible class (45%). The two classes of core promoters display distinctive patterns in transcriptional abundance, chromatin structure, promoter shape, and sequence context. In summary, our study improved the annotation of the yeast genome and demonstrated a much more pervasive and dynamic nature of transcription initiation in yeast than previously recognized.
Collapse
Affiliation(s)
- Zhaolian Lu
- Department of Biology, Saint Louis University, St. Louis, Missouri 63104, USA
| | - Zhenguo Lin
- Department of Biology, Saint Louis University, St. Louis, Missouri 63104, USA
| |
Collapse
|
33
|
Liu S, Li L, Meng J, Song K, Huang B, Wang W, Zhang G. Association and Functional Analyses Revealed That PPP1R3B Plays an Important Role in the Regulation of Glycogen Content in the Pacific Oyster Crassostrea gigas. Front Genet 2019; 10:106. [PMID: 30853975 PMCID: PMC6396720 DOI: 10.3389/fgene.2019.00106] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2018] [Accepted: 01/30/2019] [Indexed: 12/15/2022] Open
Abstract
The Pacific oyster (Crassostrea gigas) is one of the most important aquaculture species worldwide. Glycogen contributes greatly to the special taste and creamy white color of oysters. Previous genome-wide association studies (GWAS) identified several single nucleotide polymorphism (SNP) sites that were strongly related to glycogen content. Genes within 100 kb upstream and downstream of the associated SNPs were screened. One gene annotated as protein phosphatase 1 regulatory subunit 3B (PPP1R3B), which can promote glycogen synthesis together with protein phosphatase 1 catalytic subunit (PPP1C) in mammals, was selected as a candidate gene in this study. First, full-length CgPPP1R3B was cloned and its function was characterized. The gene expression profiles of CgPPP1R3B in different tissues and seasons showed a close relationship to glycogen content. RNA interference (RNAi) experiments of this gene in vivo showed that decreased CgPPP1R3B levels resulted in lower glycogen contents in the experimental group than in the control group. Co-immunoprecipitation (Co-IP) and yeast two-hybrid (Y2H) assays indicated that CgPPP1R3B can interact with CgPPP1C, glycogen synthase (CgGS) and glycogen phosphorylase (CgGP), thus participating in glycogen metabolism. Co-sedimentation analysis in vitro demonstrated that the CgPPP1R3B protein can bind to glycogen molecules directly, and these results indicated the conserved function of the CgPPP1R3B protein compared to that of mammals. In addition, thirteen SNPs were precisely mapped in this gene. Ten of the thirteen SNPs were confirmed to be significantly (p < 0.05) related to glycogen content in an independent wild population (n = 288). The CgPPP1R3B levels in oysters with high glycogen content were significantly higher than those of oysters with low glycogen content, and gene expression levels were significantly associated with various genotypes of four associated SNPs (p < 0.05). The data indicated that the associated SNPs may control glycogen content by regulating CgPPP1R3B expression. These results suggest that CgPPP1R3B is an important gene for glycogen metabolic regulation and that the associated SNPs of this gene are potential markers for oyster molecular breeding for increased glycogen content.
Collapse
Affiliation(s)
- Sheng Liu
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,University of Chinese Academy of Sciences, Beijing, China
| | - Li Li
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Jie Meng
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Kai Song
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Fisheries Science and Food Production Processes, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Baoyu Huang
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Wei Wang
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| | - Guofan Zhang
- Key Laboratory of Experimental Marine Biology, Institute of Oceanology, Chinese Academy of Sciences, Qingdao, China.,Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, Qingdao, China.,National and Local Joint Engineering Laboratory of Ecological Mariculture, Qingdao, China.,Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, China
| |
Collapse
|
34
|
Lam KC, Chung HR, Semplicio G, Iyer SS, Gaub A, Bhardwaj V, Holz H, Georgiev P, Akhtar A. The NSL complex-mediated nucleosome landscape is required to maintain transcription fidelity and suppression of transcription noise. Genes Dev 2019; 33:452-465. [PMID: 30819819 PMCID: PMC6446542 DOI: 10.1101/gad.321489.118] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2018] [Accepted: 01/11/2019] [Indexed: 11/24/2022]
Abstract
Nucleosomal organization at gene promoters is critical for transcription, with a nucleosome-depleted region (NDR) at transcription start sites (TSSs) being required for transcription initiation. How NDRs and the precise positioning of the +1 nucleosomes are maintained on active genes remains unclear. Here, we report that the Drosophila nonspecific lethal (NSL) complex is necessary to maintain this stereotypical nucleosomal organization at promoters. Upon NSL1 depletion, nucleosomes invade the NDRs at TSSs of NSL-bound genes. NSL complex member NSL3 binds to TATA-less promoters in a sequence-dependent manner. The NSL complex interacts with the NURF chromatin remodeling complex and is necessary and sufficient to recruit NURF to target promoters. Not only is the NSL complex essential for transcription, but it is required for accurate TSS selection for genes with multiple TSSs. Furthermore, loss of the NSL complex leads to an increase in transcriptional noise. Thus, the NSL complex establishes a canonical nucleosomal organization that enables transcription and determines TSS fidelity.
Collapse
Affiliation(s)
- Kin Chung Lam
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| | - Ho-Ryun Chung
- Otto-Warburg-Laboratory, Epigenomics, Max Planck Institute for Molecular Genetics, 14195 Berlin, Germany.,Institute for Medical Bioinformatics and Biostatistics, Philipps-Universität Marburg, 35037 Marburg, Germany
| | - Giuseppe Semplicio
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| | - Shantanu S Iyer
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany.,Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, 79108 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79108 Freiburg, Germany
| | - Aline Gaub
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany.,Faculty of Biology, University of Freiburg, 79108 Freiburg, Germany
| | - Vivek Bhardwaj
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| | - Herbert Holz
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| | - Plamen Georgiev
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| | - Asifa Akhtar
- Department of Chromatin Regulation, Max Planck Institute of Immunobiology and Epigenetics, 79108 Freiburg, Germany
| |
Collapse
|
35
|
Schor IE, Bussotti G, Maleš M, Forneris M, Viales RR, Enright AJ, Furlong EEM. Non-coding RNA Expression, Function, and Variation during Drosophila Embryogenesis. Curr Biol 2018; 28:3547-3561.e9. [PMID: 30393032 PMCID: PMC6264527 DOI: 10.1016/j.cub.2018.09.026] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2018] [Revised: 07/10/2018] [Accepted: 09/12/2018] [Indexed: 01/02/2023]
Abstract
Long non-coding RNAs (lncRNAs) can often function in the regulation of gene expression during development; however, their generality as essential regulators in developmental processes and organismal phenotypes remains unclear. Here, we performed a tailored investigation of lncRNA expression and function during Drosophila embryogenesis, interrogating multiple stages, tissue specificity, nuclear localization, and genetic backgrounds. Our results almost double the number of annotated lncRNAs expressed at these embryonic stages. lncRNA levels are generally positively correlated with those of their neighboring genes, with little evidence of transcriptional interference. Using fluorescent in situ hybridization, we report the spatiotemporal expression of 15 new lncRNAs, revealing very dynamic tissue-specific patterns. Despite this, deletion of selected lncRNA genes had no obvious developmental defects or effects on viability under standard and stressed conditions. However, two lncRNA deletions resulted in modest expression changes of a small number of genes, suggesting that they fine-tune expression of non-essential genes. Several lncRNAs have strain-specific expression, indicating that they are not fixed within the population. This intra-species variation across genetic backgrounds may thereby be a useful tool to distinguish rapidly evolving lncRNAs with as yet non-essential roles.
Collapse
Affiliation(s)
- Ignacio E Schor
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany; Instituto de Fisiología, Biología Molecular y Neurociencias (IFIBYNE-CONICET), Departamento de Fisiología, Biología Molecular y Celular, FCEyN, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Giovanni Bussotti
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
| | - Matilda Maleš
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Mattia Forneris
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Rebecca R Viales
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany
| | - Anton J Enright
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
| | - Eileen E M Furlong
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), 69117 Heidelberg, Germany.
| |
Collapse
|
36
|
Zoller B, Little SC, Gregor T. Diverse Spatial Expression Patterns Emerge from Unified Kinetics of Transcriptional Bursting. Cell 2018; 175:835-847.e25. [PMID: 30340044 PMCID: PMC6779125 DOI: 10.1016/j.cell.2018.09.056] [Citation(s) in RCA: 75] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Revised: 07/02/2018] [Accepted: 09/26/2018] [Indexed: 01/14/2023]
Abstract
How transcriptional bursting relates to gene regulation is a central question that has persisted for more than a decade. Here, we measure nascent transcriptional activity in early Drosophila embryos and characterize the variability in absolute activity levels across expression boundaries. We demonstrate that boundary formation follows a common transcription principle: a single control parameter determines the distribution of transcriptional activity, regardless of gene identity, boundary position, or enhancer-promoter architecture. We infer the underlying bursting kinetics and identify the key regulatory parameter as the fraction of time a gene is in a transcriptionally active state. Unexpectedly, both the rate of polymerase initiation and the switching rates are tightly constrained across all expression levels, predicting synchronous patterning outcomes at all positions in the embryo. These results point to a shared simplicity underlying the apparently complex transcriptional processes of early embryonic patterning and indicate a path to general rules in transcriptional regulation.
Collapse
Affiliation(s)
- Benjamin Zoller
- Joseph Henry Laboratories of Physics and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA
| | - Shawn C Little
- Department of Molecular Biology and Howard Hughes Medical Institute, Princeton University, Princeton, NJ 08544, USA; Department of Cell and Developmental Biology, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Thomas Gregor
- Joseph Henry Laboratories of Physics and the Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, NJ 08544, USA; Department of Developmental and Stem Cell Biology, Institut Pasteur, 75015 Paris, France.
| |
Collapse
|
37
|
Haberle V, Stark A. Eukaryotic core promoters and the functional basis of transcription initiation. Nat Rev Mol Cell Biol 2018; 19:621-637. [PMID: 29946135 PMCID: PMC6205604 DOI: 10.1038/s41580-018-0028-8] [Citation(s) in RCA: 357] [Impact Index Per Article: 59.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
RNA polymerase II (Pol II) core promoters are specialized DNA sequences at transcription start sites of protein-coding and non-coding genes that support the assembly of the transcription machinery and transcription initiation. They enable the highly regulated transcription of genes by selectively integrating regulatory cues from distal enhancers and their associated regulatory proteins. In this Review, we discuss the defining properties of gene core promoters, including their sequence features, chromatin architecture and transcription initiation patterns. We provide an overview of molecular mechanisms underlying the function and regulation of core promoters and their emerging functional diversity, which defines distinct transcription programmes. On the basis of the established properties of gene core promoters, we discuss transcription start sites within enhancers and integrate recent results obtained from dedicated functional assays to propose a functional model of transcription initiation. This model can explain the nature and function of transcription initiation at gene starts and at enhancers and can explain the different roles of core promoters, of Pol II and its associated factors and of the activating cues provided by enhancers and the transcription factors and cofactors they recruit.
Collapse
Affiliation(s)
- Vanja Haberle
- Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria
| | - Alexander Stark
- Research Institute of Molecular Pathology (IMP), Vienna Biocenter (VBC), Vienna, Austria.
- Medical University of Vienna, Vienna Biocenter (VBC), Vienna, Austria.
| |
Collapse
|
38
|
Morgan MD, Marioni JC. CpG island composition differences are a source of gene expression noise indicative of promoter responsiveness. Genome Biol 2018; 19:81. [PMID: 29945659 PMCID: PMC6020341 DOI: 10.1186/s13059-018-1461-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 06/04/2018] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND Population phenotypic variation can arise from genetic differences between individuals, or from cellular heterogeneity in an isogenic group of cells or organisms. The emergence of gene expression differences between genetically identical cells is referred to as gene expression noise, the sources of which are not well understood. RESULTS In this work, by studying gene expression noise between multiple cell lineages and mammalian species, we find consistent evidence of a role for CpG islands as sources of gene expression noise. Variation in noise among CpG island promoters can be partially attributed to differences in island size, in which short islands have noisier gene expression. Building on these findings, we investigate the potential for short CpG islands to act as fast response elements to environmental stimuli. Specifically, we find that these islands are enriched amongst primary response genes in SWI/SNF-independent stimuli, suggesting that expression noise is an indicator of promoter responsiveness. CONCLUSIONS Thus, through the integration of single-cell RNA expression profiling, chromatin landscape and temporal gene expression dynamics, we have uncovered a role for short CpG island promoters as fast response elements.
Collapse
Affiliation(s)
- Michael D Morgan
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
| | - John C Marioni
- Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SA, UK.
- Cancer Research UK Cambridge Institute, University of Cambridge, Robinson Way, Li Ka Shing Centre, Cambridge, CB2 0RE, UK.
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| |
Collapse
|
39
|
Rabieian R, Moein S, Khanahmad H, Mortazavi M, Gheisari Y. Transcriptional noise in intact and TGF-beta treated human kidney cells; the importance of time-series designs. Cell Biol Int 2018; 42:1265-1269. [PMID: 29802744 DOI: 10.1002/cbin.10992] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2018] [Accepted: 05/13/2018] [Indexed: 01/15/2023]
Abstract
The transforming growth factor (TGF)-β signaling pathway plays a key role in various cellular processes. However, insufficient knowledge about the complex and sometimes paradoxical functions of this pathway hinders its therapeutic targeting. In this study, the transcriptional profile of seven mediators and downstream elements of the TGF-β pathway were assessed in TGF-β treated and untreated human kidney derived cells for 2 weeks in a time course manner. As expected the up-regulation of ACTA2 and COL1A2 was evident in the treated cells. However, we observed remarkable fluctuations in gene expression, even in the supposedly steady states. The magnitude of noise was diverse in the examined genes. Our findings underscore the significance of time-course designs for gene expression analyses and clearly show that misleading data can be obtained in single point measurements. Furthermore, we propose specific considerations in the interpretation of time-course data in the context of noisy gene expression.
Collapse
Affiliation(s)
- Reyhaneh Rabieian
- Department of Genetics and Molecular Biology, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Shiva Moein
- Department of Genetics and Molecular Biology, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Hossein Khanahmad
- Department of Genetics and Molecular Biology, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Mojgan Mortazavi
- Isfahan Kidney Diseases Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
| | - Yousof Gheisari
- Department of Genetics and Molecular Biology, Isfahan University of Medical Sciences, Isfahan, Iran.,Regenerative Medicine Lab, Isfahan Kidney Diseases Research Center, Isfahan University of Medical Sciences, Isfahan, Iran
| |
Collapse
|
40
|
Hart JC, Ellis NA, Eisen MB, Miller CT. Convergent evolution of gene expression in two high-toothed stickleback populations. PLoS Genet 2018; 14:e1007443. [PMID: 29897962 PMCID: PMC6016950 DOI: 10.1371/journal.pgen.1007443] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2018] [Revised: 06/25/2018] [Accepted: 05/23/2018] [Indexed: 12/30/2022] Open
Abstract
Changes in developmental gene regulatory networks enable evolved changes in morphology. These changes can be in cis regulatory elements that act in an allele-specific manner, or changes to the overall trans regulatory environment that interacts with cis regulatory sequences. Here we address several questions about the evolution of gene expression accompanying a convergently evolved constructive morphological trait, increases in tooth number in two independently derived freshwater populations of threespine stickleback fish (Gasterosteus aculeatus). Are convergently evolved cis and/or trans changes in gene expression associated with convergently evolved morphological evolution? Do cis or trans regulatory changes contribute more to gene expression changes accompanying an evolved morphological gain trait? Transcriptome data from dental tissue of ancestral low-toothed and two independently derived high-toothed stickleback populations revealed significantly shared gene expression changes that have convergently evolved in the two high-toothed populations. Comparing cis and trans regulatory changes using phased gene expression data from F1 hybrids, we found that trans regulatory changes were predominant and more likely to be shared among both high-toothed populations. In contrast, while cis regulatory changes have evolved in both high-toothed populations, overall these changes were distinct and not shared among high-toothed populations. Together these data suggest that a convergently evolved trait can occur through genetically distinct regulatory changes that converge on similar trans regulatory environments.
Collapse
Affiliation(s)
- James C. Hart
- Department of Molecular and Cell Biology, University of California-Berkeley, CA, United States of America
| | - Nicholas A. Ellis
- Department of Molecular and Cell Biology, University of California-Berkeley, CA, United States of America
| | - Michael B. Eisen
- Department of Molecular and Cell Biology, University of California-Berkeley, CA, United States of America
- Howard Hughes Medical Institute, University of California, Berkeley, CA, United States of America
| | - Craig T. Miller
- Department of Molecular and Cell Biology, University of California-Berkeley, CA, United States of America
- * E-mail:
| |
Collapse
|
41
|
Li C, Lenhard B, Luscombe NM. Integrated analysis sheds light on evolutionary trajectories of young transcription start sites in the human genome. Genome Res 2018; 28:676-688. [PMID: 29618487 PMCID: PMC5932608 DOI: 10.1101/gr.231449.117] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2017] [Accepted: 03/21/2018] [Indexed: 01/06/2023]
Abstract
Understanding the molecular mechanisms and evolution of the gene regulatory system remains a major challenge in biology. Transcription start sites (TSSs) are especially interesting because they are central to initiating gene expression. Previous studies revealed widespread transcription initiation and fast turnover of TSSs in mammalian genomes. Yet, how new TSSs originate and how they evolve over time remain poorly understood. To address these questions, we analyzed ∼200,000 human TSSs by integrating evolutionary (inter- and intra-species) and functional genomic data, particularly focusing on evolutionarily young TSSs that emerged in the primate lineage. TSSs were grouped according to their evolutionary age using sequence alignment information as a proxy. Comparisons of young and old TSSs revealed that (1) new TSSs emerge through a combination of intrinsic factors, like the sequence properties of transposable elements and tandem repeats, and extrinsic factors such as their proximity to existing regulatory modules; (2) new TSSs undergo rapid evolution that reduces the inherent instability of repeat sequences associated with a high propensity of TSS emergence; and (3) once established, the transcriptional competence of surviving TSSs is gradually enhanced, with evolutionary changes subject to temporal (fewer regulatory changes in younger TSSs) and spatial constraints (fewer regulatory changes in more isolated TSSs). These findings advance our understanding of how regulatory innovations arise in the genome throughout evolution and highlight the genomic robustness and evolvability in these processes.
Collapse
Affiliation(s)
- Cai Li
- The Francis Crick Institute, London NW1 1AT, United Kingdom
| | - Boris Lenhard
- Computational Regulatory Genomics, MRC London Institute of Medical Sciences, London W12 0NN, United Kingdom.,Institute of Clinical Sciences, Faculty of Medicine, Imperial College London, London W12 0NN, United Kingdom.,Sars International Centre for Marine Molecular Biology, University of Bergen, N-5008 Bergen, Norway
| | - Nicholas M Luscombe
- The Francis Crick Institute, London NW1 1AT, United Kingdom.,UCL Genetics Institute, University College London, London WC1E 6BT, United Kingdom.,Okinawa Institute of Science & Technology Graduate University, Okinawa, 904-0495, Japan
| |
Collapse
|
42
|
Marnetto D, Mantica F, Molineris I, Grassi E, Pesando I, Provero P. Evolutionary Rewiring of Human Regulatory Networks by Waves of Genome Expansion. Am J Hum Genet 2018; 102:207-218. [PMID: 29357977 DOI: 10.1016/j.ajhg.2017.12.014] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2017] [Accepted: 12/15/2017] [Indexed: 01/09/2023] Open
Abstract
Genome expansion is believed to be an important driver of the evolution of gene regulation. To investigate the role of a newly arising sequence in rewiring regulatory networks, we estimated the age of each region of the human genome by applying maximum parsimony to genome-wide alignments with 100 vertebrates. We then studied the age distribution of several types of functional regions, with a focus on regulatory elements. The age distribution of regulatory elements reveals the extensive use of newly formed genomic sequence in the evolution of regulatory interactions. Many transcription factors have expanded their repertoire of targets through waves of genomic expansions that can be traced to specific evolutionary times. Repeated elements contributed a major part of such expansion: many classes of such elements are enriched in binding sites of one or a few specific transcription factors, whose binding sites are localized in specific portions of the element and characterized by distinctive motif words. These features suggest that the binding sites were available as soon as the new sequence entered the genome, rather than being created later by accumulation of point mutations. By comparing the age of regulatory regions to the evolutionary shift in expression of nearby genes, we show that rewiring through genome expansion played an important role in shaping human regulatory networks.
Collapse
|
43
|
Mikhaylichenko O, Bondarenko V, Harnett D, Schor IE, Males M, Viales RR, Furlong EEM. The degree of enhancer or promoter activity is reflected by the levels and directionality of eRNA transcription. Genes Dev 2018; 32:42-57. [PMID: 29378788 PMCID: PMC5828394 DOI: 10.1101/gad.308619.117] [Citation(s) in RCA: 153] [Impact Index Per Article: 25.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2017] [Accepted: 12/21/2017] [Indexed: 12/03/2022]
Abstract
Here, Mikhaylichenko et al. investigate the transcriptional properties of enhancers during Drosophila embryogenesis using characterized developmental enhancers. The authors demonstrate that while the timing of enhancer transcription is correlated with enhancer activity, the levels and directionality of transcription are highly varied among active enhancers and conclude that this is likely an inherent sequence property of the elements themselves. Gene expression is regulated by promoters, which initiate transcription, and enhancers, which control their temporal and spatial activity. However, the discovery that mammalian enhancers also initiate transcription questions the inherent differences between enhancers and promoters. Here, we investigate the transcriptional properties of enhancers during Drosophila embryogenesis using characterized developmental enhancers. We show that while the timing of enhancer transcription is generally correlated with enhancer activity, the levels and directionality of transcription are highly varied among active enhancers. To assess how this impacts function, we developed a dual transgenic assay to simultaneously measure enhancer and promoter activities from a single element in the same embryo. Extensive transgenic analysis revealed a relationship between the direction of endogenous transcription and the ability to function as an enhancer or promoter in vivo, although enhancer RNA (eRNA) production and activity are not always strictly coupled. Some enhancers (mainly bidirectional) can act as weak promoters, producing overlapping spatio–temporal expression. Conversely, bidirectional promoters often act as strong enhancers, while unidirectional promoters generally cannot. The balance between enhancer and promoter activity is generally reflected in the levels and directionality of eRNA transcription and is likely an inherent sequence property of the elements themselves.
Collapse
Affiliation(s)
- Olga Mikhaylichenko
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Vladyslav Bondarenko
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Dermot Harnett
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Ignacio E Schor
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Matilda Males
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Rebecca R Viales
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| | - Eileen E M Furlong
- Genome Biology Unit, European Molecular Biology Laboratory (EMBL), D-69117 Heidelberg, Germany
| |
Collapse
|
44
|
The rewiring of transcription circuits in evolution. Curr Opin Genet Dev 2017; 47:121-127. [PMID: 29120735 DOI: 10.1016/j.gde.2017.09.004] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2017] [Revised: 09/13/2017] [Accepted: 09/14/2017] [Indexed: 12/24/2022]
Abstract
The binding of transcription regulators to cis-regulatory sequences is a key step through which all cells regulate expression of their genes. Due to gains and losses of cis-regulatory sequences and changes in the transcription regulators themselves, the binding connections between regulators and their target genes rapidly change over evolutionary time and constitute a major source of biological novelty. This review covers recent work, carried out in a wide range of species, that addresses the overall extent of these evolutionary changes, their consequences, and some of the molecular mechanisms that lie behind them.
Collapse
|
45
|
The effect of genetic variation on promoter usage and enhancer activity. Nat Commun 2017; 8:1358. [PMID: 29116076 PMCID: PMC5677018 DOI: 10.1038/s41467-017-01467-7] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Accepted: 09/19/2017] [Indexed: 12/19/2022] Open
Abstract
The identification of genetic variants affecting gene expression, namely expression quantitative trait loci (eQTLs), has contributed to the understanding of mechanisms underlying human traits and diseases. The majority of these variants map in non-coding regulatory regions of the genome and their identification remains challenging. Here, we use natural genetic variation and CAGE transcriptomes from 154 EBV-transformed lymphoblastoid cell lines, derived from unrelated individuals, to map 5376 and 110 regulatory variants associated with promoter usage (puQTLs) and enhancer activity (eaQTLs), respectively. We characterize five categories of genes associated with puQTLs, distinguishing single from multi-promoter genes. Among multi-promoter genes, we find puQTL effects either specific to a single promoter or to multiple promoters with variable effect orientations. Regulatory variants associated with opposite effects on different mRNA isoforms suggest compensatory mechanisms occurring between alternative promoters. Our analyses identify differential promoter usage and modulation of enhancer activity as molecular mechanisms underlying eQTLs related to regulatory elements. Expression quantitative trait loci (eQTL) are widely studied, yet the mechanisms by which they exert their effects are largely unknown. Here, performing CAGE-seq on 154 lymphoblastoid cell lines, the authors map regulatory variants associated with promoter usage (puQTLs) and enhancer activity (eaQTLs).
Collapse
|
46
|
Hendy O, Campbell J, Weissman JD, Larson DR, Singer DS. Differential context-specific impact of individual core promoter elements on transcriptional dynamics. Mol Biol Cell 2017; 28:3360-3370. [PMID: 28931597 PMCID: PMC5687036 DOI: 10.1091/mbc.e17-06-0408] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Revised: 08/23/2017] [Accepted: 09/11/2017] [Indexed: 11/11/2022] Open
Abstract
The roles of individual core promoter elements in transcriptional dynamics of MHC class I gene expression were determined by smFISH in primary B-cells. The elements individually modulated transcriptional bursting, differentially contributing to burst size or burst frequency, to enable combinatorial fine-tuning of the level of transcription. Eukaryotic transcription occurs in bursts that vary in size and frequency, but the contribution of individual core promoter elements to transcriptional bursting is not known. Here we analyze the relative contributions to bursting of the individual core promoter elements—CCAAT, TATAA-like, Sp1BS, and Inr—of an MHC class I gene in primary B-cells during both basal and activated transcription. The TATAA-like, Sp1BS, and Inr elements all function as negative regulators of transcription, and each was found to contribute differentially to the overall bursting pattern of the promoter during basal transcription. Whereas the Sp1BS element regulates burst size, the Inr element regulates burst frequency. The TATAA-like element contributes to both. Surprisingly, each element has a distinct role in bursting during transcriptional activation by γ-interferon. The CCAAT element does not contribute significantly to the constitutive transcriptional dynamics of primary B-cells, but modulates both burst size and frequency in response to γ-interferon activation. The ability of core promoter elements to modulate transcriptional bursting individually allows combinatorial fine-tuning of the level of MHC class I gene expression in response to intrinsic and extrinsic signals.
Collapse
Affiliation(s)
- Oliver Hendy
- Experimental Immunology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| | - John Campbell
- Experimental Immunology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| | - Jocelyn D Weissman
- Experimental Immunology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| | - Daniel R Larson
- Laboratory of Receptor Biology and Gene Expression, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| | - Dinah S Singer
- Experimental Immunology Branch, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892
| |
Collapse
|
47
|
Abstract
This review by Vo ngoc et al. expands the view of the RNA polymerase II core promoter, which is comprised of classical DNA sequence motifs, sequence-specific DNA-binding transcription factors, chromatin signals, and DNA structure. The signals that direct the initiation of transcription ultimately converge at the core promoter, which is the gateway to transcription. Here we provide an overview of the RNA polymerase II core promoter in bilateria (bilaterally symmetric animals). The core promoter is diverse in terms of its composition and function yet is also punctilious, as it acts with strict rules and precision. We additionally describe an expanded view of the core promoter that comprises the classical DNA sequence motifs, sequence-specific DNA-binding transcription factors, chromatin signals, and DNA structure. This model may eventually lead to a more unified conceptual understanding of the core promoter.
Collapse
Affiliation(s)
- Long Vo Ngoc
- Section of Molecular Biology, University of California at San Diego, La Jolla, California 92093, USA
| | - Yuan-Liang Wang
- Section of Molecular Biology, University of California at San Diego, La Jolla, California 92093, USA
| | - George A Kassavetis
- Section of Molecular Biology, University of California at San Diego, La Jolla, California 92093, USA
| | - James T Kadonaga
- Section of Molecular Biology, University of California at San Diego, La Jolla, California 92093, USA
| |
Collapse
|
48
|
Bussemaker HJ, Causton HC, Fazlollahi M, Lee E, Muroff I. Network-based approaches that exploit inferred transcription factor activity to analyze the impact of genetic variation on gene expression. ACTA ACUST UNITED AC 2017; 2:98-102. [PMID: 28691107 DOI: 10.1016/j.coisb.2017.04.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Over the past decade, a number of methods have emerged for inferring protein-level transcription factor activities in individual samples based on prior information about the structure of the gene regulatory network. We discuss how this has enabled new methods for dissecting trans-acting mechanisms that underpin genetic variation in gene expression.
Collapse
Affiliation(s)
- Harmen J Bussemaker
- Department of Biological Sciences, Columbia University, New York, NY 10027.,Department of Systems Biology, Columbia University, New York, NY 10032
| | - Helen C Causton
- Department of Pathology and Cell Biology, Columbia University Medical Center, New York, NY 10032
| | - Mina Fazlollahi
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029
| | - Eunjee Lee
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, NY 10029
| | - Ivor Muroff
- Department of Biological Sciences, Columbia University, New York, NY 10027
| |
Collapse
|