1
|
Zhan C, Tang T, Wu E, Zhang Y, He M, Wu R, Bi C, Wang J, Zhang Y, Shen B. From multi-omics approaches to personalized medicine in myocardial infarction. Front Cardiovasc Med 2023; 10:1250340. [PMID: 37965091 PMCID: PMC10642346 DOI: 10.3389/fcvm.2023.1250340] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 10/17/2023] [Indexed: 11/16/2023] Open
Abstract
Myocardial infarction (MI) is a prevalent cardiovascular disease characterized by myocardial necrosis resulting from coronary artery ischemia and hypoxia, which can lead to severe complications such as arrhythmia, cardiac rupture, heart failure, and sudden death. Despite being a research hotspot, the etiological mechanism of MI remains unclear. The emergence and widespread use of omics technologies, including genomics, transcriptomics, proteomics, metabolomics, and other omics, have provided new opportunities for exploring the molecular mechanism of MI and identifying a large number of disease biomarkers. However, a single-omics approach has limitations in understanding the complex biological pathways of diseases. The multi-omics approach can reveal the interaction network among molecules at various levels and overcome the limitations of the single-omics approaches. This review focuses on the omics studies of MI, including genomics, epigenomics, transcriptomics, proteomics, metabolomics, and other omics. The exploration extended into the domain of multi-omics integrative analysis, accompanied by a compilation of diverse online resources, databases, and tools conducive to these investigations. Additionally, we discussed the role and prospects of multi-omics approaches in personalized medicine, highlighting the potential for improving diagnosis, treatment, and prognosis of MI.
Collapse
Affiliation(s)
- Chaoying Zhan
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Tong Tang
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Erman Wu
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Yuxin Zhang
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- KeyLaboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Mengqiao He
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Rongrong Wu
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Cheng Bi
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- KeyLaboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, China
| | - Jiao Wang
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| | - Yingbo Zhang
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
- Tropical Crops Genetic Resources Institute, Chinese Academy of Tropical Agricultural Sciences, Haikou, China
| | - Bairong Shen
- Department of Cardiology and Institutes for Systems Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, China
| |
Collapse
|
2
|
Olsen SN, Godfrey L, Healy JP, Choi YA, Kai Y, Hatton C, Perner F, Haarer EL, Nabet B, Yuan GC, Armstrong SA. MLL::AF9 degradation induces rapid changes in transcriptional elongation and subsequent loss of an active chromatin landscape. Mol Cell 2022; 82:1140-1155.e11. [PMID: 35245435 PMCID: PMC9044330 DOI: 10.1016/j.molcel.2022.02.013] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Revised: 11/17/2021] [Accepted: 02/06/2022] [Indexed: 12/15/2022]
Abstract
MLL rearrangements produce fusion oncoproteins that drive leukemia development, but the direct effects of MLL-fusion inactivation remain poorly defined. We designed models with degradable MLL::AF9 where treatment with small molecules induces rapid degradation. We leveraged the kinetics of this system to identify a core subset of MLL::AF9 target genes where MLL::AF9 degradation induces changes in transcriptional elongation within 15 minutes. MLL::AF9 degradation subsequently causes loss of a transcriptionally active chromatin landscape. We used this insight to assess the effectiveness of small molecules that target members of the MLL::AF9 multiprotein complex, specifically DOT1L and MENIN. Combined DOT1L/MENIN inhibition resembles MLL::AF9 degradation, whereas single-agent treatment has more modest effects on MLL::AF9 occupancy and gene expression. Our data show that MLL::AF9 degradation leads to decreases in transcriptional elongation prior to changes in chromatin landscape at select loci and that combined inhibition of chromatin complexes releases the MLL::AF9 oncoprotein from chromatin globally.
Collapse
Affiliation(s)
- Sarah Naomi Olsen
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Laura Godfrey
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - James P Healy
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Yoolim A Choi
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Yan Kai
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Charles Hatton
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Florian Perner
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA; Internal Medicine C, University Medical Center Greifswald, 17475 Greifswald, Germany
| | - Elena L Haarer
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA
| | - Behnam Nabet
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Guo-Cheng Yuan
- Department of Genetics and Genomic Sciences and Charles Bronfman Institute for Personalized Medicine, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Scott A Armstrong
- Department of Pediatric Oncology, Dana-Farber Cancer Institute/Boston Children's Hospital, Harvard Medical School, Boston, MA 02215, USA.
| |
Collapse
|
3
|
Tripp BA, Otu HH. Integration of Multi-Omics Data Using Probabilistic Graph Models and
External Knowledge. Curr Bioinform 2022. [DOI: 10.2174/1574893616666210906141545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Background:
High-throughput sequencing technologies have revolutionized the ability to
perform systems-level biology and elucidate molecular mechanisms of disease through the comprehensive
characterization of different layers of biological information. Integration of these heterogeneous
layers can provide insight into the underlying biology but is challenged by modeling complex interactions.
Objective:
We introduce OBaNK: omics integration using Bayesian networks and external knowledge,
an algorithm to model interactions between heterogeneous high-dimensional biological data to elucidate
complex functional clusters and emergent relationships associated with an observed phenotype.
Method:
Using Bayesian network learning, we modeled the statistical dependencies and interactions
between lipidomics, proteomics, and metabolomics data. The strength of a learned interaction between
molecules was altered based on external knowledge.
Results :
Networks learned from synthetic datasets based on real pathways achieved an average area under
the curve score of ~0.85, an improvement of ~0.23 from baseline methods. When applied to real
multi-omics data collected during pregnancy, five distinct functional networks of heterogeneous biological
data were identified, and the results were compared to other multi-omics integration approaches.
Conclusion:
OBaNK successfully improved the accuracy of learning interaction networks from data integrating
external knowledge, identified heterogeneous functional networks from real data, and suggested
potential novel interactions associated with the phenotype. These findings can guide future hypothesis
generation. OBaNK source code is available at: https://github.com/bridgettripp/OBaNK.git, and a
graphical user interface is available at: http://otulab.unl.edu/OBaNK.
Collapse
Affiliation(s)
- Bridget A. Tripp
- Department of Electrical and Computer Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
- PhD Program of Complex Biosystems, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Hasan H. Otu
- Department of Electrical and Computer Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| |
Collapse
|
4
|
Elrod ND, Henriques T, Huang KL, Tatomer DC, Wilusz JE, Wagner EJ, Adelman K. The Integrator Complex Attenuates Promoter-Proximal Transcription at Protein-Coding Genes. Mol Cell 2020; 76:738-752.e7. [PMID: 31809743 DOI: 10.1016/j.molcel.2019.10.034] [Citation(s) in RCA: 110] [Impact Index Per Article: 27.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Revised: 09/15/2019] [Accepted: 10/25/2019] [Indexed: 12/11/2022]
Abstract
The transition of RNA polymerase II (Pol II) from initiation to productive elongation is a central, regulated step in metazoan gene expression. At many genes, Pol II pauses stably in early elongation, remaining engaged with the 25- to 60-nt-long nascent RNA for many minutes while awaiting signals for release into the gene body. However, 15%-20% of genes display highly unstable promoter Pol II, suggesting that paused polymerase might dissociate from template DNA at these promoters and release a short, non-productive mRNA. Here, we report that paused Pol II can be actively destabilized by the Integrator complex. Specifically, we present evidence that Integrator utilizes its RNA endonuclease activity to cleave nascent RNA and drive termination of paused Pol II. These findings uncover a previously unappreciated mechanism of metazoan gene repression, akin to bacterial transcription attenuation, wherein promoter-proximal Pol II is prevented from entering productive elongation through factor-regulated termination.
Collapse
Affiliation(s)
- Nathan D Elrod
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA
| | - Telmo Henriques
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Kai-Lieh Huang
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA
| | - Deirdre C Tatomer
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Jeremy E Wilusz
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Eric J Wagner
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA.
| | - Karen Adelman
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
5
|
Lavender CA, Shapiro AJ, Day FS, Fargo DC. ORSO (Online Resource for Social Omics): A data-driven social network connecting scientists to genomics datasets. PLoS Comput Biol 2020; 16:e1007571. [PMID: 31978042 PMCID: PMC7001987 DOI: 10.1371/journal.pcbi.1007571] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 02/05/2020] [Accepted: 11/29/2019] [Indexed: 11/17/2022] Open
Abstract
High-throughput sequencing has become ubiquitous in biomedical sciences. As new technologies emerge and sequencing costs decline, the diversity and volume of available data increases exponentially, and successfully navigating the data becomes more challenging. Though datasets are often hosted by public repositories, scientists must rely on inconsistent annotation to identify and interpret meaningful data. Moreover, the experimental heterogeneity and wide-ranging quality of high-throughput biological data means that even data with desired cell lines, tissue types, or molecular targets may not be readily interpretable or integrated. We have developed ORSO (Online Resource for Social Omics) as an easy-to-use web application to connect life scientists with genomics data. In ORSO, users interact within a data-driven social network, where they can favorite datasets and follow other users. In addition to more than 30,000 datasets hosted from major biomedical consortia, users may contribute their own data to ORSO, facilitating its discovery by other users. Leveraging user interactions, ORSO provides a novel recommendation system to automatically connect users with hosted data. In addition to social interactions, the recommendation system considers primary read coverage information and annotated metadata. Similarities used by the recommendation system are presented by ORSO in a graph display, allowing exploration of dataset associations. The topology of the network graph reflects established biology, with samples from related systems grouped together. We tested the recommendation system using an RNA-seq time course dataset from differentiation of embryonic stem cells to cardiomyocytes. The ORSO recommendation system correctly predicted early data point sources as embryonic stem cells and late data point sources as heart and muscle samples, resulting in recommendation of related datasets. By connecting scientists with relevant data, ORSO provides a critical new service that facilitates wide-ranging research interests. New sequencing technologies have rapidly transformed biomedical research. Public data repositories now contain millions of datasets, which have the potential to accelerate and bolster research projects. However, the sheer magnitude of available data makes navigation difficult. We created ORSO (Online Resource for Social Omics) to address these challenges. ORSO is a social network where entries are not status updates or tweets, but biological datasets. Users may add their own data to ORSO, joining 30,000 validated datasets that are already hosted, and other users may find these data through intuitive search functions and informative analytics. Users can then favorite datasets relevant to their interests or follow contributing users. ORSO also uses a recommendation system like those used on commercial websites to automatically recommend data to users based on user interactions and dataset similarities. By making data more accessible and by connecting users to relevant data, we anticipate that ORSO will be an important resource for scientists. ORSO may be the first of many applications that use methods originating in social media and ecommerce to enhance and further research projects in the life sciences.
Collapse
Affiliation(s)
- Christopher A Lavender
- Integrative Bioinformatics, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, United States of America
| | - Andrew J Shapiro
- Program Operations Branch, Division of the National Toxicology Program, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, United States of America
| | - Frank S Day
- Office of Scientific Computing, Division of Intramural Research, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, United States of America
| | - David C Fargo
- Office of Scientific Computing, Division of Intramural Research, National Institute of Environmental Health Sciences, National Institutes of Health, Research Triangle Park, North Carolina, United States of America
| |
Collapse
|
6
|
Elrod ND, Henriques T, Huang KL, Tatomer DC, Wilusz JE, Wagner EJ, Adelman K. The Integrator Complex Attenuates Promoter-Proximal Transcription at Protein-Coding Genes. Mol Cell 2019; 76:738-752.e7. [PMID: 31809743 DOI: 10.1101/725507] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2019] [Revised: 09/15/2019] [Accepted: 10/25/2019] [Indexed: 05/27/2023]
Abstract
The transition of RNA polymerase II (Pol II) from initiation to productive elongation is a central, regulated step in metazoan gene expression. At many genes, Pol II pauses stably in early elongation, remaining engaged with the 25- to 60-nt-long nascent RNA for many minutes while awaiting signals for release into the gene body. However, 15%-20% of genes display highly unstable promoter Pol II, suggesting that paused polymerase might dissociate from template DNA at these promoters and release a short, non-productive mRNA. Here, we report that paused Pol II can be actively destabilized by the Integrator complex. Specifically, we present evidence that Integrator utilizes its RNA endonuclease activity to cleave nascent RNA and drive termination of paused Pol II. These findings uncover a previously unappreciated mechanism of metazoan gene repression, akin to bacterial transcription attenuation, wherein promoter-proximal Pol II is prevented from entering productive elongation through factor-regulated termination.
Collapse
Affiliation(s)
- Nathan D Elrod
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA
| | - Telmo Henriques
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA
| | - Kai-Lieh Huang
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA
| | - Deirdre C Tatomer
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Jeremy E Wilusz
- Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
| | - Eric J Wagner
- Department of Biochemistry and Molecular Biology, University of Texas Medical Branch at Galveston, Galveston, TX 77550, USA.
| | - Karen Adelman
- Department of Biological Chemistry and Molecular Pharmacology, Blavatnik Institute, Harvard Medical School, Boston, MA 02115, USA.
| |
Collapse
|
7
|
Fosslie M, Manaf A, Lerdrup M, Hansen K, Gilfillan GD, Dahl JA. Going low to reach high: Small-scale ChIP-seq maps new terrain. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2019; 12:e1465. [PMID: 31478357 DOI: 10.1002/wsbm.1465] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2019] [Revised: 07/02/2019] [Accepted: 07/25/2019] [Indexed: 12/20/2022]
Abstract
Chromatin immunoprecipitation (ChIP) enables mapping of specific histone modifications or chromatin-associated factors in the genome and represents a powerful tool in the study of chromatin and genome regulation. Importantly, recent technological advances that couple ChIP with whole-genome high-throughput sequencing (ChIP-seq) now allow the mapping of chromatin factors throughout the genome. However, the requirement for large amounts of ChIP-seq input material has long made it challenging to assess chromatin profiles of cell types only available in limited numbers. For many cell types, it is not feasible to reach high numbers when collecting them as homogeneous cell populations in vivo. Nonetheless, it is an advantage to work with pure cell populations to reach robust biological conclusions. Here, we review (a) how ChIP protocols have been scaled down for use with as little as a few hundred cells; (b) which considerations to be aware of when preparing small-scale ChIP-seq and analyzing data; and (c) the potential of small-scale ChIP-seq datasets for elucidating chromatin dynamics in various biological systems, including some examples such as oocyte maturation and preimplantation embryo development. This article is categorized under: Laboratory Methods and Technologies > Genetic/Genomic Methods Developmental Biology > Developmental Processes in Health and Disease Biological Mechanisms > Cell Fates.
Collapse
Affiliation(s)
| | - Adeel Manaf
- Department of Microbiology, Oslo University Hospital, Oslo, Norway
| | - Mads Lerdrup
- The Biotech Research and Innovation Centre, University of Copenhagen, Copenhagen, Denmark.,Centre for Epigenetics, University of Copenhagen, Copenhagen, Denmark
| | - Klaus Hansen
- The Biotech Research and Innovation Centre, University of Copenhagen, Copenhagen, Denmark.,Centre for Epigenetics, University of Copenhagen, Copenhagen, Denmark
| | - Gregor D Gilfillan
- Department of Medical Genetics, Oslo University Hospital and University of Oslo, Oslo, Norway
| | - John Arne Dahl
- Department of Microbiology, Oslo University Hospital, Oslo, Norway
| |
Collapse
|
8
|
FQStat: a parallel architecture for very high-speed assessment of sequencing quality metrics. BMC Bioinformatics 2019; 20:424. [PMID: 31416440 PMCID: PMC6694608 DOI: 10.1186/s12859-019-3015-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2019] [Accepted: 07/30/2019] [Indexed: 01/23/2023] Open
Abstract
Background High throughput DNA/RNA sequencing has revolutionized biological and clinical research. Sequencing is widely used, and generates very large amounts of data, mainly due to reduced cost and advanced technologies. Quickly assessing the quality of giga-to-tera base levels of sequencing data has become a routine but important task. Identification and elimination of low-quality sequence data is crucial for reliability of downstream analysis results. There is a need for a high-speed tool that uses optimized parallel programming for batch processing and simply gauges the quality of sequencing data from multiple datasets independent of any other processing steps. Results FQStat is a stand-alone, platform-independent software tool that assesses the quality of FASTQ files using parallel programming. Based on the machine architecture and input data, FQStat automatically determines the number of cores and the amount of memory to be allocated per file for optimum performance. Our results indicate that in a core-limited case, core assignment overhead exceeds the benefit of additional cores. In a core-unlimited case, there is a saturation point reached in performance by increasingly assigning additional cores per file. We also show that memory allocation per file has a lower priority in performance when compared to the allocation of cores. FQStat’s output is summarized in HTML web page, tab-delimited text file, and high-resolution image formats. FQStat calculates and plots read count, read length, quality score, and high-quality base statistics. FQStat identifies and marks low-quality sequencing data to suggest removal from downstream analysis. We applied FQStat on real sequencing data to optimize performance and to demonstrate its capabilities. We also compared FQStat’s performance to similar quality control (QC) tools that utilize parallel programming and attained improvements in run time. Conclusions FQStat is a user-friendly tool with a graphical interface that employs a parallel programming architecture and automatically optimizes its performance to generate quality control statistics for sequencing data. Unlike existing tools, these statistics are calculated for multiple datasets and separately at the “lane,” “sample,” and “experiment” level to identify subsets of the samples with low quality, thereby preventing the loss of complete samples when reliable data can still be obtained. Electronic supplementary material The online version of this article (10.1186/s12859-019-3015-y) contains supplementary material, which is available to authorized users.
Collapse
|
9
|
Nguyen TAT, Grimm SA, Bushel PR, Li J, Li Y, Bennett BD, Lavender CA, Ward JM, Fargo DC, Anderson CW, Li L, Resnick MA, Menendez D. Revealing a human p53 universe. Nucleic Acids Res 2019; 46:8153-8167. [PMID: 30107566 PMCID: PMC6144829 DOI: 10.1093/nar/gky720] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2018] [Accepted: 07/27/2018] [Indexed: 12/13/2022] Open
Abstract
p53 transcriptional networks are well-characterized in many organisms. However, a global understanding of requirements for in vivo p53 interactions with DNA and relationships with transcription across human biological systems in response to various p53 activating situations remains limited. Using a common analysis pipeline, we analyzed 41 data sets from genome-wide ChIP-seq studies of which 16 have associated gene expression data, including our recent primary data with normal human lymphocytes. The resulting extensive analysis, accessible at p53 BAER hub via the UCSC browser, provides a robust platform to characterize p53 binding throughout the human genome including direct influence on gene expression and underlying mechanisms. We establish the impact of spacers and mismatches from consensus on p53 binding in vivo and propose that once bound, neither significantly influences the likelihood of expression. Our rigorous approach revealed a large p53 genome-wide cistrome composed of >900 genes directly targeted by p53. Importantly, we identify a core cistrome signature composed of genes appearing in over half the data sets, and we identify signatures that are treatment- or cell-specific, demonstrating new functions for p53 in cell biology. Our analysis reveals a broad homeostatic role for human p53 that is relevant to both basic and translational studies.
Collapse
Affiliation(s)
- Thuy-Ai T Nguyen
- Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Sara A Grimm
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Pierre R Bushel
- Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Jianying Li
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Yuanyuan Li
- Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Brian D Bennett
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Christopher A Lavender
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - James M Ward
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - David C Fargo
- Integrative Bioinformatics Support Group, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA.,Office of Scientific Computing, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Carl W Anderson
- Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Leping Li
- Biostatistics & Computational Biology Branch, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Michael A Resnick
- Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| | - Daniel Menendez
- Genome Integrity & Structural Biology Laboratory, National Institute of Environmental Health Sciences/National Institutes of Health, Research Triangle Park, NC 27709, USA
| |
Collapse
|
10
|
Henriques T, Scruggs BS, Inouye MO, Muse GW, Williams LH, Burkholder AB, Lavender CA, Fargo DC, Adelman K. Widespread transcriptional pausing and elongation control at enhancers. Genes Dev 2018; 32:26-41. [PMID: 29378787 PMCID: PMC5828392 DOI: 10.1101/gad.309351.117] [Citation(s) in RCA: 215] [Impact Index Per Article: 35.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2017] [Accepted: 12/21/2017] [Indexed: 02/07/2023]
Abstract
In this study, Henriques et al. demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. Their findings provide insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to loss of factors that stabilize paused RNAPII. Regulation by gene-distal enhancers is critical for cell type-specific and condition-specific patterns of gene expression. Thus, to understand the basis of gene activity in a given cell type or tissue, we must identify the precise locations of enhancers and functionally characterize their behaviors. Here, we demonstrate that transcription is a nearly universal feature of enhancers in Drosophila and mammalian cells and that nascent RNA sequencing strategies are optimal for identification of both enhancers and superenhancers. We dissect the mechanisms governing enhancer transcription and discover remarkable similarities to transcription at protein-coding genes. We show that RNA polymerase II (RNAPII) undergoes regulated pausing and release at enhancers. However, as compared with mRNA genes, RNAPII at enhancers is less stable and more prone to early termination. Furthermore, we found that the level of histone H3 Lys4 (H3K4) methylation at enhancers corresponds to transcriptional activity such that highly active enhancers display H3K4 trimethylation rather than the H3K4 monomethylation considered a hallmark of enhancers. Finally, our work provides insights into the unique characteristics of superenhancers, which stimulate high-level gene expression through rapid pause release; interestingly, this property renders associated genes resistant to the loss of factors that stabilize paused RNAPII.
Collapse
Affiliation(s)
- Telmo Henriques
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Benjamin S Scruggs
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Michiko O Inouye
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| | - Ginger W Muse
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Lucy H Williams
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Adam B Burkholder
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Christopher A Lavender
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - David C Fargo
- Center for Integrative Bioinformatics, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA
| | - Karen Adelman
- Epigenetics and Stem Cell Biology Laboratory, National Institute of Environmental Health Sciences, Research Triangle Park, North Carolina 27709, USA.,Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115, USA
| |
Collapse
|