1
|
Nemsick S, Hansen AS. Molecular models of bidirectional promoter regulation. Curr Opin Struct Biol 2024; 87:102865. [PMID: 38905929 PMCID: PMC11550790 DOI: 10.1016/j.sbi.2024.102865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/30/2024] [Accepted: 05/27/2024] [Indexed: 06/23/2024]
Abstract
Approximately 11% of human genes are transcribed by a bidirectional promoter (BDP), defined as two genes with <1 kb between their transcription start sites. Despite their evolutionary conservation and enrichment for housekeeping genes and oncogenes, the regulatory role of BDPs remains unclear. BDPs have been suggested to facilitate gene coregulation and/or decrease expression noise. This review discusses these potential regulatory functions through the context of six prospective underlying mechanistic models: a single nucleosome free region, shared transcription factor/regulator binding, cooperative negative supercoiling, bimodal histone marks, joint activation by enhancer(s), and RNA-mediated recruitment of regulators. These molecular mechanisms may act independently and/or cooperatively to facilitate the coregulation and/or decreased expression noise predicted of BDPs.
Collapse
Affiliation(s)
- Sarah Nemsick
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; The Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA; Koch Institute for Integrative Cancer Research, Cambridge, MA 02139, USA
| | - Anders S Hansen
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; The Gene Regulation Observatory, Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA; Koch Institute for Integrative Cancer Research, Cambridge, MA 02139, USA.
| |
Collapse
|
2
|
Di Vona C, Barba L, Ferrari R, de la Luna S. Loss of the DYRK1A Protein Kinase Results in the Reduction in Ribosomal Protein Gene Expression, Ribosome Mass and Reduced Translation. Biomolecules 2023; 14:31. [PMID: 38254631 PMCID: PMC10813206 DOI: 10.3390/biom14010031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 12/19/2023] [Accepted: 12/21/2023] [Indexed: 01/24/2024] Open
Abstract
Ribosomal proteins (RPs) are evolutionary conserved proteins that are essential for protein translation. RP expression must be tightly regulated to ensure the appropriate assembly of ribosomes and to respond to the growth demands of cells. The elements regulating the transcription of RP genes (RPGs) have been characterized in yeast and Drosophila, yet how cells regulate the production of RPs in mammals is less well understood. Here, we show that a subset of RPG promoters is characterized by the presence of the palindromic TCTCGCGAGA motif and marked by the recruitment of the protein kinase DYRK1A. The presence of DYRK1A at these promoters is associated with the enhanced binding of the TATA-binding protein, TBP, and it is negatively correlated with the binding of the GABP transcription factor, establishing at least two clusters of RPGs that could be coordinately regulated. However, DYRK1A silencing leads to a global reduction in RPGs mRNAs, pointing at DYRK1A activities beyond those dependent on its chromatin association. Significantly, cells in which DYRK1A is depleted have reduced RP levels, fewer ribosomes, reduced global protein synthesis and a smaller size. We therefore propose a novel role for DYRK1A in coordinating the expression of genes encoding RPs, thereby controlling cell growth in mammals.
Collapse
Affiliation(s)
- Chiara Di Vona
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Dr Aiguader 88, 08003 Barcelona, Spain
- Centro de Investigación Biomédica en Red en Enfermedades Raras (CIBERER), 28029 Madrid, Spain
| | - Laura Barba
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Dr Aiguader 88, 08003 Barcelona, Spain
- Centro de Investigación Biomédica en Red en Enfermedades Raras (CIBERER), 28029 Madrid, Spain
| | - Roberto Ferrari
- Department of Chemistry, Life Sciences and Environmental Sustainability, University of Parma, Viale delle Scienze 23/A, 43124 Parma, Italy;
| | - Susana de la Luna
- Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology (BIST), Dr Aiguader 88, 08003 Barcelona, Spain
- Centro de Investigación Biomédica en Red en Enfermedades Raras (CIBERER), 28029 Madrid, Spain
- Department of Medicine and Life Sciences, Universitat Pompeu Fabra (UPF), Dr Aiguader 88, 08003 Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Passeig Lluís Companys 23, 08010 Barcelona, Spain
| |
Collapse
|
3
|
Tokunaga M, Imamura T. Emerging concepts involving inhibitory and activating RNA functionalization towards the understanding of microcephaly phenotypes and brain diseases in humans. Front Cell Dev Biol 2023; 11:1168072. [PMID: 37408531 PMCID: PMC10318543 DOI: 10.3389/fcell.2023.1168072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 06/12/2023] [Indexed: 07/07/2023] Open
Abstract
Microcephaly is characterized as a small head circumference, and is often accompanied by developmental disorders. Several candidate risk genes for this disease have been described, and mutations in non-coding regions are occasionally found in patients with microcephaly. Various non-coding RNAs (ncRNAs), such as microRNAs (miRNAs), SINEUPs, telomerase RNA component (TERC), and promoter-associated lncRNAs (pancRNAs) are now being characterized. These ncRNAs regulate gene expression, enzyme activity, telomere length, and chromatin structure through RNA binding proteins (RBPs)-RNA interaction. Elucidating the potential roles of ncRNA-protein coordination in microcephaly pathogenesis might contribute to its prevention or recovery. Here, we introduce several syndromes whose clinical features include microcephaly. In particular, we focus on syndromes for which ncRNAs or genes that interact with ncRNAs may play roles. We discuss the possibility that the huge ncRNA field will provide possible new therapeutic approaches for microcephaly and also reveal clues about the factors enabling the evolutionary acquisition of the human-specific "large brain."
Collapse
|
4
|
Babu S, Takeuchi Y, Masai I. Banp regulates DNA damage response and chromosome segregation during the cell cycle in zebrafish retina. eLife 2022; 11:74611. [PMID: 35942692 PMCID: PMC9363121 DOI: 10.7554/elife.74611] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2021] [Accepted: 07/05/2022] [Indexed: 11/25/2022] Open
Abstract
Btg3-associated nuclear protein (Banp) was originally identified as a nuclear matrix-associated region (MAR)-binding protein and it functions as a tumor suppressor. At the molecular level, Banp regulates transcription of metabolic genes via a CGCG-containing motif called the Banp motif. However, its physiological roles in embryonic development are unknown. Here, we report that Banp is indispensable for the DNA damage response and chromosome segregation during mitosis. Zebrafish banp mutants show mitotic cell accumulation and apoptosis in developing retina. We found that DNA replication stress and tp53-dependent DNA damage responses were activated to induce apoptosis in banp mutants, suggesting that Banp is required for regulation of DNA replication and DNA damage repair. Furthermore, consistent with mitotic cell accumulation, chromosome segregation was not smoothly processed from prometaphase to anaphase in banp morphants, leading to a prolonged M-phase. Our RNA- and ATAC-sequencing identified 31 candidates for direct Banp target genes that carry the Banp motif. Interestingly, a DNA replication fork regulator, wrnip1, and two chromosome segregation regulators, cenpt and ncapg, are included in this list. Thus, Banp directly regulates transcription of wrnip1 for recovery from DNA replication stress, and cenpt and ncapg for chromosome segregation during mitosis. Our findings provide the first in vivo evidence that Banp is required for cell-cycle progression and cell survival by regulating DNA damage responses and chromosome segregation during mitosis. In order for a cell to divide, it must progress through a series of carefully controlled steps known as the cell cycle. First, the cell replicates its DNA and both copies get segregated to opposite ends. The cell then splits into two and each new cell receives a copy of the duplicated genetic material. If any of the stages in the cell cycle become disrupted or mis-regulated this can lead to uncontrolled divisions that may result in cancer. Researchers have often used a structure within the eye known as the retina to study the cell cycle in zebrafish and other animals as cells in the retina rapidly divide in a highly controlled manner. A protein called Banp is known to help stop tumors from growing in humans and mice, but its normal role in the body, particularly the cell cycle, has remained unclear. To investigate, Babu et al. studied the retina of mutant zebrafish that were unable to make the Banp protein. The experiments revealed that two stress responses indicating DNA damage or defects in copying DNA were active in the retinal cells of the mutant zebrafish. This suggested that Banp allows cell to progress through the cell cycle by repairing any DNA damage that may arise during replication. Banp does this by activating the gene for another protein called Wrnip1. Babu et al. also found that Banp helps segregate the two copies of DNA during cell division by promoting the activation of two other proteins called Cenpt and Ncapg. Further experiments identified 31 genes that were directly regulated by Banp. These findings demonstrate that Banp is required for zebrafish cells to be able to accurately copy their DNA and divide in to two new cells. In the future, the work of Babu et al. will provide a useful resource to investigate how tumors grow and spread around the body, and may contribute to the development of new treatments for cancer.
Collapse
Affiliation(s)
- Swathy Babu
- Developmental Neurobiology Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan
| | - Yuki Takeuchi
- Developmental Neurobiology Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan
| | - Ichiro Masai
- Developmental Neurobiology Unit, Okinawa Institute of Science and Technology Graduate University, Onna, Japan
| |
Collapse
|
5
|
Database of Potential Promoter Sequences in the Capsicum annuum Genome. BIOLOGY 2022; 11:biology11081117. [PMID: 35892972 PMCID: PMC9332048 DOI: 10.3390/biology11081117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Revised: 07/19/2022] [Accepted: 07/23/2022] [Indexed: 11/16/2022]
Abstract
In this study, we used a mathematical method for the multiple alignment of highly divergent sequences (MAHDS) to create a database of potential promoter sequences (PPSs) in the Capsicum annuum genome. To search for PPSs, 20 statistically significant classes of sequences located in the range from −499 to +100 nucleotides near the annotated genes were calculated. For each class, a position–weight matrix (PWM) was computed and then used to identify PPSs in the C. annuum genome. In total, 825,136 PPSs were detected, with a false positive rate of 0.13%. The PPSs obtained with the MAHDS method were tested using TSSFinder, which detects transcription start sites. The databank of the found PPSs provides their coordinates in chromosomes, the alignment of each PPS with the PWM, and the level of statistical significance as a normal distribution argument, and can be used in genetic engineering and biotechnology.
Collapse
|
6
|
Murach KA, Dungan CM, von Walden F, Wen Y. Epigenetic evidence for distinct contributions of resident and acquired myonuclei during long-term exercise adaptation using timed in vivo myonuclear labeling. Am J Physiol Cell Physiol 2022; 322:C86-C93. [PMID: 34817266 PMCID: PMC8765804 DOI: 10.1152/ajpcell.00358.2021] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Muscle fibers are syncytial postmitotic cells that can acquire exogenous nuclei from resident muscle stem cells, called satellite cells. Myonuclei are added to muscle fibers by satellite cells during conditions such as load-induced hypertrophy. It is difficult to dissect the molecular contributions of resident versus satellite cell-derived myonuclei during adaptation due to the complexity of labeling distinct nuclear populations in multinuclear cells without label transference between nuclei. To sidestep this barrier, we used a genetic mouse model where myonuclear DNA can be specifically and stably labeled via nonconstitutive H2B-GFP at any point in the lifespan. Resident myonuclei (Mn) were GFP-tagged in vivo before 8 wk of progressive weighted wheel running (PoWeR) in adult mice (>4-mo-old). Resident + satellite cell-derived myonuclei (Mn+SC Mn) were labeled at the end of PoWeR in a separate cohort. Following myonuclear isolation, promoter DNA methylation profiles acquired with low-input reduced representation bisulfite sequencing (RRBS) were compared to deduce epigenetic contributions of satellite cell-derived myonuclei during adaptation. Resident myonuclear DNA has hypomethylated promoters in genes related to protein turnover, whereas the addition of satellite cell-derived myonuclei shifts myonuclear methylation profiles to favor transcription factor regulation and cell-cell signaling. By comparing myonucleus-specific methylation profiling to previously published single-nucleus transcriptional analysis in the absence (Mn) versus the presence of satellite cells (Mn+SC Mn) with PoWeR, we provide evidence that satellite cell-derived myonuclei may preferentially supply specific ribosomal proteins to growing myofibers and retain an epigenetic "memory" of prior stem cell identity. These data offer insights on distinct epigenetic myonuclear characteristics and contributions during adult muscle growth.
Collapse
Affiliation(s)
- Kevin A. Murach
- 1Molecular Muscle Mass Regulation Laboratory, Exercise Science Research Center, Department of Health, Human Performance, and Recreation, University of Arkansas, Fayetteville, Arkansas,2Cell and Molecular Biology Program, University of Arkansas, Fayetteville, Arkansas,3The Center for Muscle Biology, University of Kentucky, Lexington, Kentucky
| | - Cory M. Dungan
- 3The Center for Muscle Biology, University of Kentucky, Lexington, Kentucky,4Department of Physical Therapy, College of Health Sciences, University of Kentucky, Lexington, Kentucky
| | - Ferdinand von Walden
- 5Department of Women’s and Children’s Health, Karolinska Institute, Stockholm, Sweden
| | - Yuan Wen
- 3The Center for Muscle Biology, University of Kentucky, Lexington, Kentucky,6Department of Physiology, College of Medicine, University of Kentucky, Lexington, Kentucky,7Myoanalytics, LLC, Lexington, Kentucky
| |
Collapse
|
7
|
Ahmad SS, Samia NSN, Khan AS, Turjya RR, Khan MAAK. Bidirectional promoters: an enigmatic genome architecture and their roles in cancers. Mol Biol Rep 2021; 48:6637-6644. [PMID: 34378109 DOI: 10.1007/s11033-021-06612-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2021] [Accepted: 07/29/2021] [Indexed: 11/28/2022]
Abstract
Bidirectional promoters are the transcription regulatory regions of genes positioned head-to-head on opposite strands. Specific sequence signals, chromatin modifications and three-dimensional structures of the transcription site facilitate the unconventional yet tightly regulated transcription proceeding in both directions from these promoters. Mutations or aberrant epigenetic changes can lead to abnormal enhanced or reduced expression from either of the bidirectionally transcribed genes resulting in tumorigenesis. Moreover, bidirectionally transcribed genes might also contribute towards the immune regulation in tumor microenvironment. In this review, we aimed to expound the characteristic features of bidirectional promoters alongside their transcriptional regulations, and ultimately, the association of these enigmatic genomic elements in different cancers.
Collapse
Affiliation(s)
- Sheikh Shafin Ahmad
- Department of Mathematics and Natural Sciences, Brac University, Dhaka, Bangladesh
| | | | - Auroni Semonti Khan
- Department of Genetic Engineering and Biotechnology, Jagannath University, Dhaka, Bangladesh
| | - Rafeed Rahman Turjya
- Department of Mathematics and Natural Sciences, Brac University, Dhaka, Bangladesh
| | | |
Collapse
|
8
|
Grand RS, Burger L, Gräwe C, Michael AK, Isbel L, Hess D, Hoerner L, Iesmantavicius V, Durdu S, Pregnolato M, Krebs AR, Smallwood SA, Thomä N, Vermeulen M, Schübeler D. BANP opens chromatin and activates CpG-island-regulated genes. Nature 2021; 596:133-137. [PMID: 34234345 DOI: 10.1038/s41586-021-03689-8] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2020] [Accepted: 06/03/2021] [Indexed: 02/06/2023]
Abstract
The majority of gene transcripts generated by RNA polymerase II in mammalian genomes initiate at CpG island (CGI) promoters1,2, yet our understanding of their regulation remains limited. This is in part due to the incomplete information that we have on transcription factors, their DNA-binding motifs and which genomic binding sites are functional in any given cell type3-5. In addition, there are orphan motifs without known binders, such as the CGCG element, which is associated with highly expressed genes across human tissues and enriched near the transcription start site of a subset of CGI promoters6-8. Here we combine single-molecule footprinting with interaction proteomics to identify BTG3-associated nuclear protein (BANP) as the transcription factor that binds this element in the mouse and human genome. We show that BANP is a strong CGI activator that controls essential metabolic genes in pluripotent stem and terminally differentiated neuronal cells. BANP binding is repelled by DNA methylation of its motif in vitro and in vivo, which epigenetically restricts most binding to CGIs and accounts for differential binding at aberrantly methylated CGI promoters in cancer cells. Upon binding to an unmethylated motif, BANP opens chromatin and phases nucleosomes. These findings establish BANP as a critical activator of a set of essential genes and suggest a model in which the activity of CGI promoters relies on methylation-sensitive transcription factors that are capable of chromatin opening.
Collapse
Affiliation(s)
- Ralph S Grand
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Lukas Burger
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.,Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Cathrin Gräwe
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Alicia K Michael
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Luke Isbel
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.,School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Daniel Hess
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Leslie Hoerner
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | | | - Sevi Durdu
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Marco Pregnolato
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.,Faculty of Science, University of Basel, Basel, Switzerland
| | - Arnaud R Krebs
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland.,Genome Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | | | - Nicolas Thomä
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
| | - Michiel Vermeulen
- Department of Molecular Biology, Faculty of Science, Radboud Institute for Molecular Life Sciences, Oncode Institute, Radboud University Nijmegen, Nijmegen, The Netherlands
| | - Dirk Schübeler
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland. .,Faculty of Science, University of Basel, Basel, Switzerland.
| |
Collapse
|
9
|
The evolutionary acquisition and mode of functions of promoter-associated non-coding RNAs (pancRNAs) for mammalian development. Essays Biochem 2021; 65:697-708. [PMID: 34328174 DOI: 10.1042/ebc20200143] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Revised: 05/13/2021] [Accepted: 07/16/2021] [Indexed: 12/22/2022]
Abstract
Increasing evidence has shown that many long non-coding RNAs (lncRNAs) are involved in gene regulation in a variety of ways such as transcriptional, post-transcriptional and epigenetic regulation. Promoter-associated non-coding RNAs (pancRNAs), which are categorized into the most abundant single-copy lncRNA biotype, play vital regulatory roles in finely tuning cellular specification at the epigenomic level. In short, pancRNAs can directly or indirectly regulate downstream genes to participate in the development of organisms in a cell-specific manner. In this review, we will introduce the evolutionarily acquired characteristics of pancRNAs as determined by comparative epigenomics and elaborate on the research progress on pancRNA-involving processes in mammalian embryonic development, including neural differentiation.
Collapse
|
10
|
Zhou H, Simion V, Pierce JB, Haemmig S, Chen AF, Feinberg MW. LncRNA-MAP3K4 regulates vascular inflammation through the p38 MAPK signaling pathway and cis-modulation of MAP3K4. FASEB J 2020; 35:e21133. [PMID: 33184917 DOI: 10.1096/fj.202001654rr] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 09/23/2020] [Accepted: 10/08/2020] [Indexed: 12/12/2022]
Abstract
Chronic vascular inflammation plays a key role in the pathogenesis of atherosclerosis. Long non-coding RNAs (lncRNAs) have emerged as essential inflammation regulators. We identify a novel lncRNA termed lncRNA-MAP3K4 that is enriched in the vessel wall and regulates vascular inflammation. In the aortic intima, lncRNA-MAP3K4 expression was reduced by 50% during the progression of atherosclerosis (chronic inflammation) and 70% during endotoxemia (acute inflammation). lncRNA-MAP3K4 knockdown reduced the expression of key inflammatory factors (eg, ICAM-1, E-selectin, MCP-1) in endothelial cells or vascular smooth muscle cells and decreased monocytes adhesion to endothelium, as well as reducing TNF-α, IL-1β, COX2 expression in macrophages. Mechanistically, lncRNA-MAP3K4 regulates inflammation through the p38 MAPK signaling pathway. lncRNA-MAP3K4 shares a bidirectional promoter with MAP3K4, an upstream regulator of the MAPK signaling pathway, and regulates its transcription in cis. lncRNA-MAP3K4 and MAP3K4 show coordinated expression in response to inflammation in vivo and in vitro. Similar to lncRNA-MAP3K4, MAP3K4 knockdown reduced the expression of inflammatory factors in several different vascular cells. Furthermore, lncRNA-MAP3K4 and MAP3K4 knockdown showed cooperativity in reducing inflammation in endothelial cells. Collectively, these findings unveil the role of a novel lncRNA in vascular inflammation by cis-regulating MAP3K4 via a p38 MAPK pathway.
Collapse
Affiliation(s)
- Haoyang Zhou
- Department of Medicine, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Department of Cardiology, The Third Xiangya Hospital of Central South University, Changsha, China
| | - Viorel Simion
- Department of Medicine, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Jacob B Pierce
- Department of Medicine, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.,Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Stefan Haemmig
- Department of Medicine, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Alex F Chen
- Department of Cardiology, The Third Xiangya Hospital of Central South University, Changsha, China
| | - Mark W Feinberg
- Department of Medicine, Cardiovascular Division, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
11
|
Evaluating the informativeness of deep learning annotations for human complex diseases. Nat Commun 2020; 11:4703. [PMID: 32943643 PMCID: PMC7499261 DOI: 10.1038/s41467-020-18515-4] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2020] [Accepted: 08/25/2020] [Indexed: 12/12/2022] Open
Abstract
Deep learning models have shown great promise in predicting regulatory effects from DNA sequence, but their informativeness for human complex diseases is not fully understood. Here, we evaluate genome-wide SNP annotations from two previous deep learning models, DeepSEA and Basenji, by applying stratified LD score regression to 41 diseases and traits (average N = 320K), conditioning on a broad set of coding, conserved and regulatory annotations. We aggregated annotations across all (respectively blood or brain) tissues/cell-types in meta-analyses across all (respectively 11 blood or 8 brain) traits. The annotations were highly enriched for disease heritability, but produced only limited conditionally significant results: non-tissue-specific and brain-specific Basenji-H3K4me3 for all traits and brain traits respectively. We conclude that deep learning models have yet to achieve their full potential to provide considerable unique information for complex disease, and that their conditional informativeness for disease cannot be inferred from their accuracy in predicting regulatory annotations. Deep learning models have shown great promise in predicting regulatory effects from DNA sequence. Here the authors evaluate sequence-based epigenomic deep learning models and conclude that these models are not yet ready to inform our knowledge of human disease.
Collapse
|