1
Calluori S, Stark R, Pearson BL. Gene-Environment Interactions in Repeat Expansion Diseases: Mechanisms of Environmentally Induced Repeat Instability. Biomedicines 2023; 11:515. [PMID: 36831049 PMCID: PMC9953593 DOI: 10.3390/biomedicines11020515] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Received: 01/12/2023] [Revised: 02/06/2023] [Accepted: 02/07/2023] [Indexed: 02/12/2023]
Abstract
Short tandem repeats (STRs) are units of 1-6 base pairs that occur in tandem repetition to form a repeat tract. STRs exhibit repeat instability, which generates expansions or contractions of the repeat tract. Over 50 diseases, primarily affecting the central nervous system and muscles, are characterized by repeat instability. Longer repeat tracts are typically associated with earlier age of onset and increased disease severity. Environmental exposures are suspected to play a role in the pathogenesis of repeat expansion diseases. Here, we review the current knowledge of mechanisms of environmentally induced repeat instability in repeat expansion diseases. The current evidence demonstrates that environmental factors modulate repeat instability via DNA damage and induction of DNA repair pathways, with distinct mechanisms for repeat expansion and contraction. Of particular note, oxidative stress is a key mediator of environmentally induced repeat instability. The preliminary evidence suggests epigenetic modifications as potential mediators of environmentally induced repeat instability. Future research incorporating an array of environmental exposures, new human cohorts, and improved model systems, with a continued focus on cell-types, tissues, and critical windows, will aid in identifying mechanisms of environmentally induced repeat instability. Identifying environmental modulators of repeat instability and their mechanisms of action will inform preventions, therapies, and public health measures.
Affiliation(s)
- Stephanie Calluori
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
- Barnard College of Columbia University, 3009 Broadway, New York, NY 10027, USA
- Rebecca Stark
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
- Brandon L. Pearson
- Department of Environmental Health Sciences, Mailman School of Public Health, Columbia University, New York, NY 10032, USA
2
Gershman A, Sauria MEG, Guitart X, Vollger MR, Hook PW, Hoyt SJ, Jain M, Shumate A, Razaghi R, Koren S, Altemose N, Caldas GV, Logsdon GA, Rhie A, Eichler EE, Schatz MC, O'Neill RJ, Phillippy AM, Miga KH, Timp W. Epigenetic patterns in a complete human genome. Science 2022; 376:eabj5089. [PMID: 35357915 PMCID: PMC9170183 DOI: 10.1126/science.abj5089] [Citation(s) in RCA: 164] [Impact Index Per Article: 54.7] [Indexed: 12/11/2022]
Abstract
The completion of a telomere-to-telomere human reference genome, T2T-CHM13, has resolved complex regions of the genome, including repetitive and homologous regions. Here, we present a high-resolution epigenetic study of previously unresolved sequences, representing entire acrocentric chromosome short arms, gene family expansions, and a diverse collection of repeat classes. This resource precisely maps CpG methylation (32.28 million CpGs), DNA accessibility, and short-read datasets (166,058 previously unresolved chromatin immunoprecipitation sequencing peaks) to provide evidence of activity across previously unidentified or corrected genes and reveals clinically relevant paralog-specific regulation. Probing CpG methylation across human centromeres from six diverse individuals generated an estimate of variability in kinetochore localization. This analysis provides a framework with which to investigate the most elusive regions of the human genome, granting insights into epigenetic regulation.
Affiliation(s)
- Ariel Gershman
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Michael E G Sauria
- Department of Biology and Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Xavi Guitart
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Mitchell R Vollger
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Paul W Hook
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Savannah J Hoyt
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Miten Jain
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Alaina Shumate
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Roham Razaghi
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- Sergey Koren
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Nicolas Altemose
- Department of Bioengineering, University of California Berkeley, Berkeley, CA, USA
- Gina V Caldas
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- Glennis A Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Arang Rhie
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
- Michael C Schatz
- Department of Biology and Computer Science, Johns Hopkins University, Baltimore, MD, USA
- Rachel J O'Neill
- Institute for Systems Genomics, University of Connecticut, Storrs, CT, USA
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, USA
- Adam M Phillippy
- Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD, USA
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California Santa Cruz, Santa Cruz, CA, USA
- Winston Timp
- Department of Molecular Biology and Genetics, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
3
Dumbovic G, Forcales SV, Perucho M. Emerging roles of macrosatellite repeats in genome organization and disease development. Epigenetics 2017; 12:515-526. [PMID: 28426282 PMCID: PMC5687341 DOI: 10.1080/15592294.2017.1318235] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Received: 02/27/2017] [Revised: 04/01/2017] [Accepted: 04/06/2017] [Indexed: 11/24/2022]
Abstract
Abundant repetitive DNA sequences are an enigmatic part of the human genome. Despite increasing evidence on the functionality of DNA repeats, their biologic role is still elusive and under frequent debate. Macrosatellites are the largest of the tandem DNA repeats, located on one or multiple chromosomes. The contribution of macrosatellites to genome regulation and human health was demonstrated for the D4Z4 macrosatellite repeat array on chromosome 4q35. Reduced copy number of D4Z4 repeats is associated with local euchromatinization and the onset of facioscapulohumeral muscular dystrophy. Although the role other macrosatellite families may play remains rather obscure, their diverse functionalities within the genome are being gradually revealed. In this review, we will outline structural and functional features of coding and noncoding macrosatellite repeats, and highlight recent findings that bring these sequences into the spotlight of genome organization and disease development.
Affiliation(s)
- Gabrijela Dumbovic
- Program of Predictive and Personalized Medicine of Cancer (PMPPC), Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol (IGTP), Campus Can Ruti, Badalona, Barcelona, Spain
- Sonia-V. Forcales
- Program of Predictive and Personalized Medicine of Cancer (PMPPC), Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol (IGTP), Campus Can Ruti, Badalona, Barcelona, Spain
- Manuel Perucho
- Program of Predictive and Personalized Medicine of Cancer (PMPPC), Institut d'Investigació en Ciències de la Salut Germans Trias i Pujol (IGTP), Campus Can Ruti, Badalona, Barcelona, Spain
- Sanford-Burnham-Prebys Medical Discovery Institute (SBP), La Jolla, CA, USA
4
Casa V, Runfola V, Micheloni S, Aziz A, Dilworth FJ, Gabellini D. Polycomb repressive complex 1 provides a molecular explanation for repeat copy number dependency in FSHD muscular dystrophy. Hum Mol Genet 2017; 26:753-767. [PMID: 28040729 PMCID: PMC5409123 DOI: 10.1093/hmg/ddw426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 09/21/2016] [Accepted: 12/15/2016] [Indexed: 11/13/2022]
Abstract
Repression of repetitive elements is crucial to preserve genome integrity and has been traditionally ascribed to constitutive heterochromatin pathways. FacioScapuloHumeral Muscular Dystrophy (FSHD), one of the most common myopathies, is characterized by a complex interplay of genetic and epigenetic events. The main FSHD form is linked to a reduced copy number of the D4Z4 macrosatellite repeat on 4q35, causing loss of silencing and aberrant expression of the D4Z4-embedded DUX4 gene leading to disease. By an unknown mechanism, D4Z4 copy-number correlates with FSHD phenotype. Here we show that the DUX4 proximal promoter (DUX4p) is sufficient to nucleate the enrichment of both constitutive and facultative heterochromatin components and to mediate a copy-number dependent gene silencing. We found that both the CpG/GC dense DNA content and the repetitive nature of DUX4p arrays are important for their repressive ability. We showed that DUX4p mediates a copy number-dependent Polycomb Repressive Complex 1 (PRC1) recruitment, which is responsible for the copy-number dependent gene repression. Overall, we directly link genetic and epigenetic defects in FSHD by proposing a novel molecular explanation for the copy number-dependency in FSHD pathogenesis, and offer insight into the molecular functions of repeats in chromatin regulation.
Affiliation(s)
- Valentina Casa
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy; Università Vita-Salute San Raffaele, Milan 20132, Italy
- Valeria Runfola
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
- Stefano Micheloni
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
- Arif Aziz
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
- F Jeffrey Dilworth
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
- Davide Gabellini
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy; Dulbecco Telethon Institute, Milan 20132, Italy
5
Influence of Repressive Histone and DNA Methylation upon D4Z4 Transcription in Non-Myogenic Cells. PLoS One 2016; 11:e0160022. [PMID: 27467759 PMCID: PMC4965136 DOI: 10.1371/journal.pone.0160022] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Received: 12/16/2015] [Accepted: 07/12/2016] [Indexed: 01/11/2023]
Abstract
We looked at a disease-associated macrosatellite array D4Z4 and focused on epigenetic factors influencing its chromatin state outside of the disease-context. We used the HCT116 cell line that contains the non-canonical polyadenylation (poly-A) signal required to stabilize somatic transcripts of the human double homeobox gene DUX4, encoded from D4Z4. In HCT116, D4Z4 is packaged into constitutive heterochromatin, characterized by DNA methylation and histone H3 tri-methylation at lysine 9 (H3K9me3), resulting in low basal levels of D4Z4-derived transcripts. However, a double knockout (DKO) of DNA methyltransferase genes, DNMT1 and DNMT3B, but not either alone, results in significant loss of DNA and H3K9 methylation. This is coupled with upregulation of transcript levels from the array, including DUX4 isoforms (DUX4-fl) that are abnormally expressed in somatic muscle in the disease Facioscapulohumeral muscular dystrophy (FSHD) along with DUX4 protein, as indicated indirectly by upregulation of bona fide targets of DUX4 in DKO but not HCT116 cells. Results from treatment with a chemical inhibitor of histone methylation in HCT116 suggest that in the absence of DNA hypomethylation, H3K9me3 loss alone is sufficient to facilitate DUX4-fl transcription. Additionally, characterization of a cell line from a patient with Immunodeficiency, Centromeric instability and Facial anomalies syndrome 1 (ICF1) possessing a non-canonical poly-A signal and DNA hypomethylation at D4Z4 showed DUX4 target gene upregulation in the patient when compared to controls in spite of retention of H3K9me3. Taken together, these data suggest that both DNA methylation and H3K9me3 are determinants of D4Z4 silencing.
Moreover, we show that in addition to testis, there is appreciable expression of spliced and polyadenylated D4Z4 derived transcripts that contain the complete DUX4 open reading frame (ORF) along with DUX4 target gene expression in the thymus, suggesting that DUX4 may provide normal function in this somatic tissue.
6
Tessereau C, Lesecque Y, Monnet N, Buisson M, Barjhoux L, Léoné M, Feng B, Goldgar DE, Sinilnikova OM, Mousset S, Duret L, Mazoyer S. Estimation of the RNU2 macrosatellite mutation rate by BRCA1 mutation tracing. Nucleic Acids Res 2014; 42:9121-30. [PMID: 25034697 PMCID: PMC4132748 DOI: 10.1093/nar/gku639] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Indexed: 12/02/2022]
Abstract
Large tandem repeat sequences have been poorly investigated as severe technical limitations and their frequent absence from the genome reference hinder their analysis. Extensive allelotyping of this class of variation has not been possible until now and their mutational dynamics are still poorly known. In order to estimate the mutation rate of a macrosatellite, we analysed in detail the RNU2 locus, which displays at least 50 different alleles containing 5-82 copies of a 6.1 kb repeat unit. Mining data from the 1000 Genomes Project allowed us to precisely estimate copy numbers of the RNU2 repeat unit using read depth of coverage. This further revealed significantly different mean values in various recent modern human populations, favoring a scenario of fast evolution of this locus. Its proximity to a disease gene with numerous founder mutations, BRCA1, within the same linkage disequilibrium block, offered the unique opportunity to trace RNU2 arrays over a large timescale. Analysis of the transmission of RNU2 arrays associated with one ‘private’ mutation in an extended kindred and four founder mutations in multiple kindreds gave an estimation by maximum likelihood of 5 × 10⁻³ mutations per generation, which is close to that of microsatellites.
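The read-depth idea mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `estimate_total_copies`, the example depths, and the assumption that the reference holds exactly one copy of the repeat unit (so reads from every copy pile up on it) are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def estimate_total_copies(unit_depths, flank_depths, ploidy=2):
    """Estimate the total repeat copies in a sample (summed over alleles).

    Assumes (hypothetically) that the reference contains a single copy of
    the repeat unit, so its depth scales with the number of copies, while
    flank_depths come from a nearby single-copy region of the same sample.
    """
    flank = mean(flank_depths)
    if flank <= 0:
        raise ValueError("flanking depth must be positive")
    # Depth ratio gives copies per haploid genome; scale by ploidy.
    return ploidy * mean(unit_depths) / flank


# Made-up per-window depths: about 600x over the unit, 30x over the flank.
unit_depths = [590, 610, 600, 600]
flank_depths = [29, 31, 30, 30]
print(estimate_total_copies(unit_depths, flank_depths))  # 40.0
```

The result is the sum across both alleles; splitting it into per-allele array sizes needs additional information such as family transmission data.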
Affiliation(s)
- Chloé Tessereau
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France; Genomic Vision, Bagneux, Paris, France
- Yann Lesecque
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
- Nastasia Monnet
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
- Monique Buisson
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
- Laure Barjhoux
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
- Mélanie Léoné
- Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
- Bingjian Feng
- Department of Dermatology and Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, Utah, USA
- David E Goldgar
- Department of Dermatology and Huntsman Cancer Institute, University of Utah School of Medicine, Salt Lake City, Utah, USA
- Olga M Sinilnikova
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France; Unité Mixte de Génétique Constitutionnelle des Cancers Fréquents, Hospices Civils de Lyon/Centre Léon Bérard, Lyon, France
- Sylvain Mousset
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
- Laurent Duret
- Laboratoire de Biométrie et Biologie Evolutive, CNRS UMR5558, Université Lyon 1, France
- Sylvie Mazoyer
- Genetics of Breast Cancer Team, Cancer Research Centre of Lyon, CNRS UMR5286, Inserm U1052, Université Lyon 1, Centre Léon Bérard, Lyon, France
7
Darrow EM, Chadwick BP. A novel tRNA variable number tandem repeat at human chromosome 1q23.3 is implicated as a boundary element based on conservation of a CTCF motif in mouse. Nucleic Acids Res 2014; 42:6421-35. [PMID: 24753417 PMCID: PMC4041453 DOI: 10.1093/nar/gku280] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 02/14/2014] [Revised: 03/24/2014] [Accepted: 03/25/2014] [Indexed: 01/08/2023]
Abstract
The human genome contains numerous large tandem repeats, many of which remain poorly characterized. Here we report a novel transfer RNA (tRNA) tandem repeat on human chromosome 1q23.3 that shows extensive copy number variation with 9-43 repeat units per allele and displays evidence of meiotic and mitotic instability. Each repeat unit consists of a 7.3 kb GC-rich sequence that binds the insulator protein CTCF and bears the chromatin hallmarks of a bivalent domain in human embryonic stem cells. A tRNA-containing tandem repeat composed of at least three 7.6-kb GC-rich repeat units resides within a syntenic region of mouse chromosome 1. However, DNA sequence analysis reveals that, with the exception of the tRNA genes that account for less than 6% of a repeat unit, the remaining 7.2 kb is not conserved with the notable exception of a 24 base pair sequence corresponding to the CTCF binding site, suggesting an important role for this protein at the locus.
Affiliation(s)
- Emily M Darrow
- Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295, USA
- Brian P Chadwick
- Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295, USA
8
Goldstein DB, Allen A, Keebler J, Margulies EH, Petrou S, Petrovski S, Sunyaev S. Sequencing studies in human genetics: design and interpretation. Nat Rev Genet 2013; 14:460-70. [PMID: 23752795 DOI: 10.1038/nrg3455] [Citation(s) in RCA: 190] [Impact Index Per Article: 15.8] [Indexed: 12/11/2022]
Abstract
Next-generation sequencing is becoming the primary discovery tool in human genetics. There have been many clear successes in identifying genes that are responsible for Mendelian diseases, and sequencing approaches are now poised to identify the mutations that cause undiagnosed childhood genetic diseases and those that predispose individuals to more common complex diseases. There are, however, growing concerns that the complexity and magnitude of complete sequence data could lead to an explosion of weakly justified claims of association between genetic variants and disease. Here, we provide an overview of the basic workflow in next-generation sequencing studies and emphasize, where possible, measures and considerations that facilitate accurate inferences from human sequencing studies.
Affiliation(s)
- David B Goldstein
- Center for Human Genome Variation, Duke University School of Medicine, 308 Research Drive, Box 91009, LSRC B Wing, Room 330, Durham, North Carolina 27708, USA.
9
Schaap M, Lemmers RJLF, Maassen R, van der Vliet PJ, Hoogerheide LF, van Dijk HK, Baştürk N, de Knijff P, van der Maarel SM. Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions. BMC Genomics 2013; 14:143. [PMID: 23496858 PMCID: PMC3599962 DOI: 10.1186/1471-2164-14-143] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Received: 09/02/2012] [Accepted: 02/25/2013] [Indexed: 11/27/2022]
Abstract
Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely understudied. Here, we describe a detailed study of six autosomal and two X chromosomal MSRs among 270 HapMap individuals from Central Europe, Asia and Africa. Copy number variation, stability and genetic heterogeneity of the autosomal macrosatellite repeats RS447 (chromosome 4p), MSR5p (5p), FLJ40296 (13q), RNU2 (17q) and D4Z4 (4q and 10q) and X chromosomal DXZ4 and CT47 were investigated. Results: Repeat array size distribution analysis shows that all of these MSRs are highly polymorphic with the most genetic variation among Africans and the least among Asians. A mitotic mutation rate of 0.4-2.2% was observed, exceeding meiotic mutation rates and possibly explaining the large size variability found for these MSRs. By means of a novel Bayesian approach, statistical support for a distinct multimodal rather than a uniform allele size distribution was detected in seven out of eight MSRs, with evidence for equidistant intervals between the modes. Conclusions: The multimodal distributions with evidence for equidistant intervals, in combination with the observation of MSR-specific constraints on minimum array size, suggest that MSRs are limited in their configurations and that deviations thereof may cause disease, as is the case for facioscapulohumeral muscular dystrophy. However, at present we cannot exclude that there are mechanistic constraints for MSRs that are not directly disease-related. This study represents the first comprehensive study of MSRs in different human populations by applying novel statistical methods and identifies commonalities and differences in their organization and function in the human genome.
Affiliation(s)
- Mireille Schaap
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
10
Cacabelos R, Cacabelos P, Aliev G. Genomics of schizophrenia and pharmacogenomics of antipsychotic drugs. Open Journal of Psychiatry 2013. [DOI: 10.4236/ojpsych.2013.31008] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Indexed: 11/20/2022]
11
Horakova AH, Moseley SC, McLaughlin CR, Tremblay DC, Chadwick BP. The macrosatellite DXZ4 mediates CTCF-dependent long-range intrachromosomal interactions on the human inactive X chromosome. Hum Mol Genet 2012; 21:4367-77. [PMID: 22791747 PMCID: PMC3459461 DOI: 10.1093/hmg/dds270] [Citation(s) in RCA: 66] [Impact Index Per Article: 5.1] [Received: 04/28/2012] [Revised: 06/19/2012] [Accepted: 07/06/2012] [Indexed: 12/31/2022]
Abstract
The human X-linked macrosatellite DXZ4 is a large tandem repeat located at Xq23 that is packaged into heterochromatin on the male X chromosome and female active X chromosome and, in response to X chromosome inactivation, is organized into euchromatin bound by the insulator protein CCCTC-binding factor (CTCF) on the inactive X chromosome (Xi). The purpose served by this unusual epigenetic regulation is unclear, but suggests a Xi-specific gain of function for DXZ4. Other less extensive bands of euchromatin can be observed on the Xi, but the identity of the underlying DNA sequences is unknown. Here, we report the identification of two novel human X-linked tandem repeats, located 58 Mb proximal and 16 Mb distal to the macrosatellite DXZ4. Both tandem repeats are entirely contained within the transcriptional unit of novel spliced transcripts. Like DXZ4, the tandem repeats are packaged into Xi-specific CTCF-bound euchromatin. These sequences undergo frequent CTCF-dependent interactions with DXZ4 on the Xi, implicating DXZ4 as an epigenetically regulated Xi-specific structural element and providing the first putative functional attribute of a macrosatellite in the human genome.
Affiliation(s)
- Brian P. Chadwick
- Department of Biological Science, Florida State University, Tallahassee, FL 32306-4295, USA
12
The mouse DXZ4 homolog retains Ctcf binding and proximity to Pls3 despite substantial organizational differences compared to the primate macrosatellite. Genome Biol 2012; 13:R70. [PMID: 22906166 PMCID: PMC3491370 DOI: 10.1186/gb-2012-13-8-r70] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.8] [Received: 04/25/2012] [Accepted: 08/20/2012] [Indexed: 12/20/2022]
Abstract
Background: The X-linked macrosatellite DXZ4 is a large homogenous tandem repeat that in females adopts an alternative chromatin organization on the primate X chromosome in response to X-chromosome inactivation. It is packaged into heterochromatin on the active X chromosome but into euchromatin and bound by the epigenetic organizer protein CTCF on the inactive X chromosome. Because its DNA sequence diverges rapidly beyond the New World monkeys, the existence of DXZ4 outside the primate lineage is unknown. Results: Here we extend our comparative genome analysis and report the identification and characterization of the mouse homolog of the macrosatellite. Furthermore, we provide evidence of DXZ4 in a conserved location downstream of the PLS3 gene in a diverse group of mammals, and reveal that DNA sequence conservation is restricted to the CTCF binding motif, supporting a central role for this protein at this locus. However, many features that characterize primate DXZ4 differ in mouse, including the overall size of the array, the mode of transcription, the chromatin organization and conservation between adjacent repeat units of DNA sequence and length. Ctcf binds Dxz4 but is not exclusive to the inactive X chromosome, as evidenced by association in some males and equal binding to both X chromosomes in trophoblast stem cells. Conclusions: Characterization of Dxz4 reveals substantial differences in the organization of DNA sequence, chromatin packaging, and the mode of transcription, so the potential roles performed by this sequence in mouse have probably diverged from those on the primate X chromosome.
13
Cabianca DS, Casa V, Bodega B, Xynos A, Ginelli E, Tanaka Y, Gabellini D. A long ncRNA links copy number variation to a polycomb/trithorax epigenetic switch in FSHD muscular dystrophy. Cell 2012; 149:819-31. [PMID: 22541069 PMCID: PMC3350859 DOI: 10.1016/j.cell.2012.03.035] [Citation(s) in RCA: 284] [Impact Index Per Article: 21.8] [Received: 07/22/2011] [Revised: 12/21/2011] [Accepted: 03/22/2012] [Indexed: 02/05/2023]
Abstract
Repetitive sequences account for more than 50% of the human genome. Facioscapulohumeral muscular dystrophy (FSHD) is an autosomal-dominant disease associated with reduction in the copy number of the D4Z4 repeat mapping to 4q35. By an unknown mechanism, D4Z4 deletion causes an epigenetic switch leading to de-repression of 4q35 genes. Here we show that the Polycomb group of epigenetic repressors targets D4Z4 in healthy subjects and that D4Z4 deletion is associated with reduced Polycomb silencing in FSHD patients. We identify DBE-T, a chromatin-associated noncoding RNA produced selectively in FSHD patients that coordinates de-repression of 4q35 genes. DBE-T recruits the Trithorax group protein Ash1L to the FSHD locus, driving histone H3 lysine 36 dimethylation, chromatin remodeling, and 4q35 gene transcription. This study provides insights into the biological function of repetitive sequences in regulating gene expression and shows how mutations of such elements can influence the progression of a human genetic disease.
Affiliation(s)
- Daphne S Cabianca
- Dulbecco Telethon Institute at San Raffaele Scientific Institute, Division of Regenerative Medicine, Stem Cells, and Gene Therapy, Milan, Italy
14
Lee KW, Woon PS, Teo YY, Sim K. Genome wide association studies (GWAS) and copy number variation (CNV) studies of the major psychoses: what have we learnt? Neurosci Biobehav Rev 2011; 36:556-71. [PMID: 21946175 DOI: 10.1016/j.neubiorev.2011.09.001] [Citation(s) in RCA: 73] [Impact Index Per Article: 5.2] [Received: 02/06/2011] [Revised: 09/03/2011] [Accepted: 09/13/2011] [Indexed: 12/29/2022]
Abstract
Schizophrenia (SZ) and bipolar disorder (BPD) have high heritabilities and are clinically and genetically complex. Genome wide association studies (GWAS) and studies of copy number variations (CNV) in SZ and BPD have allowed probing of their underlying genetic risks. In this systematic review, we assess extant genetic signals from published GWAS and CNV studies of SZ and BPD up till March 2011. Risk genes associated with SZ at genome wide significance level (p value < 7.2 × 10⁻⁸) include zinc finger binding protein 804A (ZNF804A), major histocompatibility (MHC) region on chromosome 6, neurogranin (NRGN) and transcription factor 4 (TCF4). Risk genes associated with BPD include ankyrin 3, node of Ranvier (ANK3), calcium channel, voltage dependent, L type, alpha 1C subunit (CACNA1C), diacylglycerol kinase eta (DGKH), gene locus on chromosome 16p12, and polybromo-1 (PBRM1) and very recently neurocan gene (NCAN). Possible common genes underlying psychosis include ZNF804A, CACNA1C, NRGN and PBRM1. The CNV studies suggest that whilst CNVs are found in both SZ and BPD, the large deletions and duplications are more likely found in SZ rather than BPD. The validation of any genetic signal is likely confounded by genetic and phenotypic heterogeneities which are influenced by epistatic, epigenetic and gene-environment interactions. There is a pressing need to better integrate the multiple research platforms including systems biology computational models, genomics, cross disorder phenotyping studies, transcriptomics, proteomics, metabolomics, neuroimaging and clinical correlations in order to get us closer to a more enlightened understanding of the genetic and biological basis underlying these potentially crippling conditions.
Affiliation(s)
- Kok Wei Lee
- Institute of Mental Health/Woodbridge Hospital 10, Buangkok View, Singapore 539747, Singapore
|
15
|
Epigenetic regulation of the X-chromosomal macrosatellite repeat encoding for the cancer/testis gene CT47. Eur J Hum Genet 2011; 20:185-91. [PMID: 21811308 DOI: 10.1038/ejhg.2011.150] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Indexed: 01/18/2023] Open
Abstract
Macrosatellite repeats (MSRs) present an extreme example of copy number variation, yet their epigenetic regulation in normal and malignant cells is largely understudied. The CT47 cancer/testis antigen located on human Xq24 is organized as an array of 4.8 kb large units. CT47 is expressed in the testis and in certain types of cancer, but not in non-malignant somatic tissue. We used CT47 as a model to study a possible correlation between copy number variation, epigenetic regulation and transcription originating from MSRs in normal and malignant cells. In lymphoblastoid cell lines and primary fibroblasts, CT47 expression was absent, consistent with the observed heterochromatic structure and DNA hypermethylation of the CT47 promoter. Heterochromatinization of CT47 occurs early during development as human embryonic stem cells show high levels of DNA methylation and repressive chromatin modifications in the absence of CT47 expression. In small-cell lung carcinoma cell lines with low levels of CT47 transcripts, we observed reduced levels of histone 3 lysine 9 trimethylation (H3K9me3) and trimethylated lysine 27 of histone H3 (H3K27me3) without concomitant increase in euchromatic histone modifications. DNA methylation levels in the promoter region of CT47 are also significantly reduced in these cells. This supports a model in which during oncogenic transformation, there is a relative loss of repressive chromatin markers resulting in leaky expression of CT47. We conclude that some MSRs, like CT47 and the autosomal MSRs TAF11-Like, PRR20, ZAV and D4Z4, the latter being involved in facioscapulohumeral muscular dystrophy, seem to be governed by common regulatory mechanisms with their abundant expression mostly being restricted to the germ line.
|
16
|
Tremblay DC, Moseley S, Chadwick BP. Variation in array size, monomer composition and expression of the macrosatellite DXZ4. PLoS One 2011; 6:e18969. [PMID: 21544201 PMCID: PMC3081327 DOI: 10.1371/journal.pone.0018969] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.3] [Received: 12/28/2010] [Accepted: 03/25/2011] [Indexed: 11/18/2022] Open
Abstract
Macrosatellites are some of the most polymorphic regions of the human genome, yet many remain uncharacterized despite the association of some arrays with disease susceptibility. This study sought to explore the polymorphic nature of the X-linked macrosatellite DXZ4. Four aspects of DXZ4 were explored in detail, including tandem repeat copy number variation, array instability, monomer sequence polymorphism and array expression. DXZ4 arrays contained between 12 and 100 3.0 kb repeat units with an average array containing 57. Monomers were confirmed to be arranged in uninterrupted tandem arrays by restriction digest analysis and extended fiber FISH, and therefore DXZ4 encompasses 36–288 kb of Xq23. Transmission of DXZ4 through three generations in three families displayed a high degree of meiotic instability (8.3%), consistent with other macrosatellite arrays, further highlighting the unstable nature of these sequences in the human genome. Subcloning and sequencing of complete DXZ4 monomers identified numerous single nucleotide polymorphisms and alleles for the three microsatellite repeats located within each monomer. Pairwise comparisons of DXZ4 monomer sequences revealed that repeat units from an array are more similar to one another than those originating from different arrays. RNA fluorescence in situ hybridization revealed significant variation in DXZ4 expression both within and between cell lines. DXZ4 transcripts could be detected originating from both the active and inactive X chromosome. Expression levels of DXZ4 varied significantly between males, but did not relate to the size of the array, nor did inheritance of the same array result in similar expression levels. Collectively, these studies provide considerable insight into the polymorphic nature of DXZ4, further highlighting the instability and variation potential of macrosatellites in the human genome.
Affiliation(s)
- Deanna C. Tremblay
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Shawn Moseley
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
- Brian P. Chadwick
- Department of Biological Science, Florida State University, Tallahassee, Florida, United States of America
|
17
|
McLaughlin CR, Chadwick BP. Characterization of DXZ4 conservation in primates implies important functional roles for CTCF binding, array expression and tandem repeat organization on the X chromosome. Genome Biol 2011; 12:R37. [PMID: 21489251 PMCID: PMC3218863 DOI: 10.1186/gb-2011-12-4-r37] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.6] [Received: 01/27/2011] [Revised: 02/28/2011] [Accepted: 04/13/2011] [Indexed: 01/02/2023] Open
Abstract
BACKGROUND Comparative sequence analysis is a powerful means with which to identify functionally relevant non-coding DNA elements through conserved nucleotide sequence. The macrosatellite DXZ4 is a polymorphic, uninterrupted, tandem array of 3-kb repeat units located exclusively on the human X chromosome. While not obviously protein coding, its chromatin organization suggests differing roles for the array on the active and inactive X chromosomes. RESULTS In order to identify important elements within DXZ4, we explored preservation of DNA sequence and chromatin conformation of the macrosatellite in primates. We found that DXZ4 DNA sequence conservation beyond New World monkeys is limited to the promoter and CTCF binding site, although DXZ4 remains a GC-rich tandem array. Investigation of chromatin organization in macaques revealed that DXZ4 in males and on the active X chromosome is packaged into heterochromatin, whereas on the inactive X, DXZ4 was euchromatic and bound by CTCF. CONCLUSIONS Collectively, these data suggest an important conserved role for DXZ4 on the X chromosome involving expression, CTCF binding and tandem organization.
Affiliation(s)
- Christine R McLaughlin
- Department of Biological Science, Florida State University, 319 Stadium Drive, 3076 King Building, Tallahassee, FL 32306-4295, USA
|
18
|
Tremblay DC, Alexander G, Moseley S, Chadwick BP. Expression, tandem repeat copy number variation and stability of four macrosatellite arrays in the human genome. BMC Genomics 2010; 11:632. [PMID: 21078170 PMCID: PMC3018141 DOI: 10.1186/1471-2164-11-632] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Received: 10/08/2010] [Accepted: 11/15/2010] [Indexed: 11/13/2022] Open
Abstract
BACKGROUND Macrosatellites are some of the largest variable number tandem repeats in the human genome, but what role these unusual sequences perform is unknown. Their importance to human health is clearly demonstrated by the 4q35 macrosatellite D4Z4 that is associated with the onset of the muscle degenerative disease facioscapulohumeral muscular dystrophy. Nevertheless, many other macrosatellite arrays in the human genome remain poorly characterized. RESULTS Here we describe the organization, tandem repeat copy number variation, transmission stability and expression of four macrosatellite arrays in the human genome: the TAF11-Like array located on chromosome 5p15.1, the SST1 arrays on 4q28.3 and 19q13.12, the PRR20 array located on chromosome 13q21.1, and the ZAV array at 9q32. All are polymorphic macrosatellite arrays that at least for TAF11-Like and SST1 show evidence of meiotic instability. With the exception of the SST1 array that is ubiquitously expressed, all are expressed at high levels in the testis and to a lesser extent in the brain. CONCLUSIONS Our results extend the number of characterized macrosatellite arrays in the human genome and provide the foundation for formulation of hypotheses to begin assessing their functional role in the human genome.
Affiliation(s)
- Deanna C Tremblay
- Department of Biological Sciences, Florida State University, King Life Science Building, Tallahassee, FL 32306-4295, USA
|
19
|
Hannan AJ. Tandem repeat polymorphisms: modulators of disease susceptibility and candidates for ‘missing heritability’. Trends Genet 2010; 26:59-65. [PMID: 20036436 DOI: 10.1016/j.tig.2009.11.008] [Citation(s) in RCA: 123] [Impact Index Per Article: 8.2] [Received: 10/23/2009] [Revised: 11/27/2009] [Accepted: 11/30/2009] [Indexed: 01/26/2023]
|