1
|
Logsdon GA, Eichler EE. The Dynamic Structure and Rapid Evolution of Human Centromeric Satellite DNA. Genes (Basel) 2022; 14:92. [PMID: 36672831 PMCID: PMC9859433 DOI: 10.3390/genes14010092] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 12/22/2022] [Accepted: 12/24/2022] [Indexed: 12/31/2022] Open
Abstract
The complete sequence of a human genome provided our first comprehensive view of the organization of satellite DNA associated with heterochromatin. We review how our understanding of the genetic architecture and epigenetic properties of human centromeric DNA have advanced as a result. Preliminary studies of human and nonhuman ape centromeres reveal complex, saltatory mutational changes organized around distinct evolutionary layers. Pockets of regional hypomethylation within higher-order α-satellite DNA, termed centromere dip regions, appear to define the site of kinetochore attachment in all human chromosomes, although such epigenetic features can vary even within the same chromosome. Sequence resolution of satellite DNA is providing new insights into centromeric function with potential implications for improving our understanding of human biology and health.
Collapse
Affiliation(s)
- Glennis A. Logsdon
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
| | - Evan E. Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
2
|
Mirceta M, Shum N, Schmidt MHM, Pearson CE. Fragile sites, chromosomal lesions, tandem repeats, and disease. Front Genet 2022; 13:985975. [PMID: 36468036 PMCID: PMC9714581 DOI: 10.3389/fgene.2022.985975] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2022] [Accepted: 09/02/2022] [Indexed: 09/16/2023] Open
Abstract
Expanded tandem repeat DNAs are associated with various unusual chromosomal lesions, despiralizations, multi-branched inter-chromosomal associations, and fragile sites. Fragile sites cytogenetically manifest as localized gaps or discontinuities in chromosome structure and are an important genetic, biological, and health-related phenomena. Common fragile sites (∼230), present in most individuals, are induced by aphidicolin and can be associated with cancer; of the 27 molecularly-mapped common sites, none are associated with a particular DNA sequence motif. Rare fragile sites ( ≳ 40 known), ≤ 5% of the population (may be as few as a single individual), can be associated with neurodevelopmental disease. All 10 molecularly-mapped folate-sensitive fragile sites, the largest category of rare fragile sites, are caused by gene-specific CGG/CCG tandem repeat expansions that are aberrantly CpG methylated and include FRAXA, FRAXE, FRAXF, FRA2A, FRA7A, FRA10A, FRA11A, FRA11B, FRA12A, and FRA16A. The minisatellite-associated rare fragile sites, FRA10B, FRA16B, can be induced by AT-rich DNA-ligands or nucleotide analogs. Despiralized lesions and multi-branched inter-chromosomal associations at the heterochromatic satellite repeats of chromosomes 1, 9, 16 are inducible by de-methylating agents like 5-azadeoxycytidine and can spontaneously arise in patients with ICF syndrome (Immunodeficiency Centromeric instability and Facial anomalies) with mutations in genes regulating DNA methylation. ICF individuals have hypomethylated satellites I-III, alpha-satellites, and subtelomeric repeats. Ribosomal repeats and subtelomeric D4Z4 megasatellites/macrosatellites, are associated with chromosome location, fragility, and disease. Telomere repeats can also assume fragile sites. Dietary deficiencies of folate or vitamin B12, or drug insults are associated with megaloblastic and/or pernicious anemia, that display chromosomes with fragile sites. The recent discovery of many new tandem repeat expansion loci, with varied repeat motifs, where motif lengths can range from mono-nucleotides to megabase units, could be the molecular cause of new fragile sites, or other chromosomal lesions. This review focuses on repeat-associated fragility, covering their induction, cytogenetics, epigenetics, cell type specificity, genetic instability (repeat instability, micronuclei, deletions/rearrangements, and sister chromatid exchange), unusual heritability, disease association, and penetrance. Understanding tandem repeat-associated chromosomal fragile sites provides insight to chromosome structure, genome packaging, genetic instability, and disease.
Collapse
Affiliation(s)
- Mila Mirceta
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Natalie Shum
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Monika H. M. Schmidt
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Christopher E. Pearson
- Program of Genetics and Genome Biology, The Hospital for Sick Children, The Peter Gilgan Centre for Research and Learning, Toronto, ON, Canada
- Program of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
3
|
Hartley GA, Okhovat M, O'Neill RJ, Carbone L. Comparative analyses of gibbon centromeres reveal dynamic genus specific shifts in repeat composition. Mol Biol Evol 2021; 38:3972-3992. [PMID: 33983366 PMCID: PMC8382927 DOI: 10.1093/molbev/msab148] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Centromeres are functionally conserved chromosomal loci essential for proper chromosome segregation during cell division, yet they show high sequence diversity across species. Despite their variation, a near universal feature of centromeres is the presence of repetitive sequences, such as DNA satellites and transposable elements (TEs). Because of their rapidly evolving karyotypes, gibbons represent a compelling model to investigate divergence of functional centromere sequences across short evolutionary timescales. In this study, we use ChIP-seq, RNA-seq, and fluorescence in situ hybridization to comprehensively investigate the centromeric repeat content of the four extant gibbon genera (Hoolock, Hylobates, Nomascus, and Siamang). In all gibbon genera, we find that CENP-A nucleosomes and the DNA-proteins that interface with the inner kinetochore preferentially bind retroelements of broad classes rather than satellite DNA. A previously identified gibbon-specific composite retrotransposon, LAVA, known to be expanded within the centromere regions of one gibbon genus (Hoolock), displays centromere- and species-specific sequence differences, potentially as a result of its co-option to a centromeric function. When dissecting centromere satellite composition, we discovered the presence of the retroelement-derived macrosatellite SST1 in multiple centromeres of Hoolock, whereas alpha-satellites represent the predominate satellite in the other genera, further suggesting an independent evolutionary trajectory for Hoolock centromeres. Finally, using de novo assembly of centromere sequences, we determined that transcripts originating from gibbon centromeres recapitulate the species-specific TE composition. Combined, our data reveal dynamic shifts in the repeat content that define gibbon centromeres and coincide with the extensive karyotypic diversity within this lineage.
Collapse
Affiliation(s)
- Gabrielle A Hartley
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269
| | - Mariam Okhovat
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239
| | - Rachel J O'Neill
- Department of Molecular and Cell Biology, University of Connecticut, Storrs, CT, 06269.,Institute for Systems Genomics, University of Connecticut, Storrs, CT, 06269.,Department of Genomics and Genome Sciences, UConn Health, Farmington, CT, 06030
| | - Lucia Carbone
- Department of Medicine, Knight Cardiovascular Institute, Oregon Health and Science University, Portland, OR, 97239.,Division of Genetics, Oregon National Primate Research Center, Beaverton, OR, 97006.,Department of Molecular and Medical Genetics, Oregon Health and Science University, Portland, OR, 97239.,Department of Medical Informatics and Clinical Epidemiology, Oregon Health and Science University, Portland, OR, 97239
| |
Collapse
|
4
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
5
|
Balzano E, Giunta S. Centromeres under Pressure: Evolutionary Innovation in Conflict with Conserved Function. Genes (Basel) 2020; 11:E912. [PMID: 32784998 PMCID: PMC7463522 DOI: 10.3390/genes11080912] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2020] [Revised: 08/04/2020] [Accepted: 08/04/2020] [Indexed: 12/22/2022] Open
Abstract
Centromeres are essential genetic elements that enable spindle microtubule attachment for chromosome segregation during mitosis and meiosis. While this function is preserved across species, centromeres display an array of dynamic features, including: (1) rapidly evolving DNA; (2) wide evolutionary diversity in size, shape and organization; (3) evidence of mutational processes to generate homogenized repetitive arrays that characterize centromeres in several species; (4) tolerance to changes in position, as in the case of neocentromeres; and (5) intrinsic fragility derived by sequence composition and secondary DNA structures. Centromere drive underlies rapid centromere DNA evolution due to the "selfish" pursuit to bias meiotic transmission and promote the propagation of stronger centromeres. Yet, the origins of other dynamic features of centromeres remain unclear. Here, we review our current understanding of centromere evolution and plasticity. We also detail the mutagenic processes proposed to shape the divergent genetic nature of centromeres. Changes to centromeres are not simply evolutionary relics, but ongoing shifts that on one side promote centromere flexibility, but on the other can undermine centromere integrity and function with potential pathological implications such as genome instability.
Collapse
Affiliation(s)
- Elisa Balzano
- Dipartimento di Biologia e Biotecnologie “Charles Darwin”, Sapienza Università di Roma, 00185 Roma, Italy;
| | - Simona Giunta
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA
| |
Collapse
|
6
|
Achrem M, Szućko I, Kalinka A. The epigenetic regulation of centromeres and telomeres in plants and animals. COMPARATIVE CYTOGENETICS 2020; 14:265-311. [PMID: 32733650 PMCID: PMC7360632 DOI: 10.3897/compcytogen.v14i2.51895] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 05/18/2020] [Indexed: 05/10/2023]
Abstract
The centromere is a chromosomal region where the kinetochore is formed, which is the attachment point of spindle fibers. Thus, it is responsible for the correct chromosome segregation during cell division. Telomeres protect chromosome ends against enzymatic degradation and fusions, and localize chromosomes in the cell nucleus. For this reason, centromeres and telomeres are parts of each linear chromosome that are necessary for their proper functioning. More and more research results show that the identity and functions of these chromosomal regions are epigenetically determined. Telomeres and centromeres are both usually described as highly condensed heterochromatin regions. However, the epigenetic nature of centromeres and telomeres is unique, as epigenetic modifications characteristic of both eu- and heterochromatin have been found in these areas. This specificity allows for the proper functioning of both regions, thereby affecting chromosome homeostasis. This review focuses on demonstrating the role of epigenetic mechanisms in the functioning of centromeres and telomeres in plants and animals.
Collapse
Affiliation(s)
- Magdalena Achrem
- Institute of Biology, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
- Molecular Biology and Biotechnology Center, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
| | - Izabela Szućko
- Institute of Biology, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
- Molecular Biology and Biotechnology Center, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
| | - Anna Kalinka
- Institute of Biology, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
- Molecular Biology and Biotechnology Center, University of Szczecin, Szczecin, PolandUniversity of SzczecinSzczecinPoland
| |
Collapse
|
7
|
Black EM, Giunta S. Repetitive Fragile Sites: Centromere Satellite DNA As a Source of Genome Instability in Human Diseases. Genes (Basel) 2018; 9:E615. [PMID: 30544645 PMCID: PMC6315641 DOI: 10.3390/genes9120615] [Citation(s) in RCA: 57] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 12/03/2018] [Accepted: 12/03/2018] [Indexed: 12/31/2022] Open
Abstract
Maintenance of an intact genome is essential for cellular and organismal homeostasis. The centromere is a specialized chromosomal locus required for faithful genome inheritance at each round of cell division. Human centromeres are composed of large tandem arrays of repetitive alpha-satellite DNA, which are often sites of aberrant rearrangements that may lead to chromosome fusions and genetic abnormalities. While the centromere has an essential role in chromosome segregation during mitosis, the long and repetitive nature of the highly identical repeats has greatly hindered in-depth genetic studies, and complete annotation of all human centromeres is still lacking. Here, we review our current understanding of human centromere genetics and epigenetics as well as recent investigations into the role of centromere DNA in disease, with a special focus on cancer, aging, and human immunodeficiency⁻centromeric instability⁻facial anomalies (ICF) syndrome. We also highlight the causes and consequences of genomic instability at these large repetitive arrays and describe the possible sources of centromere fragility. The novel connection between alpha-satellite DNA instability and human pathological conditions emphasizes the importance of obtaining a truly complete human genome assembly and accelerating our understanding of centromere repeats' role in physiology and beyond.
Collapse
Affiliation(s)
- Elizabeth M Black
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA.
| | - Simona Giunta
- Laboratory of Chromosome and Cell Biology, The Rockefeller University, 1230 York Avenue, New York, NY 10065, USA.
| |
Collapse
|
8
|
Cacheux L, Ponger L, Gerbault-Seureau M, Loll F, Gey D, Richard FA, Escudé C. The Targeted Sequencing of Alpha Satellite DNA in Cercopithecus pogonias Provides New Insight Into the Diversity and Dynamics of Centromeric Repeats in Old World Monkeys. Genome Biol Evol 2018; 10:1837-1851. [PMID: 29860303 PMCID: PMC6061836 DOI: 10.1093/gbe/evy109] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/29/2018] [Indexed: 02/06/2023] Open
Abstract
Alpha satellite is the major repeated DNA element of primate centromeres. Specific evolutionary mechanisms have led to a great diversity of sequence families with peculiar genomic organization and distribution, which have till now been studied mostly in great apes. Using high throughput sequencing of alpha satellite monomers obtained by enzymatic digestion followed by computational and cytogenetic analysis, we compare here the diversity and genomic distribution of alpha satellite DNA in two related Old World monkey species, Cercopithecus pogonias and Cercopithecus solatus, which are known to have diverged about 7 Ma. Two main families of monomers, called C1 and C2, are found in both species. A detailed analysis of our data sets revealed the existence of numerous subfamilies within the centromeric C1 family. Although the most abundant subfamily is conserved between both species, our fluorescence in situ hybridization (FISH) experiments clearly show that some subfamilies are specific for each species and that their distribution is restricted to a subset of chromosomes, thereby pointing to the existence of recurrent amplification/homogenization events. The pericentromeric C2 family is very abundant on the short arm of all acrocentric chromosomes in both species, pointing to specific mechanisms that lead to this distribution. Results obtained using two different restriction enzymes are fully consistent with a predominant monomeric organization of alpha satellite DNA that coexists with higher order organization patterns in the C. pogonias genome. Our study suggests a high dynamics of alpha satellite DNA in Cercopithecini, with recurrent apparition of new sequence variants and interchromosomal sequence transfer.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Loïc Ponger
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - François Loll
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Delphine Gey
- Service de Systématique Moléculaire, UMS 2700 CNRS, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Florence Anne Richard
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| |
Collapse
|
9
|
Casa V, Runfola V, Micheloni S, Aziz A, Dilworth FJ, Gabellini D. Polycomb repressive complex 1 provides a molecular explanation for repeat copy number dependency in FSHD muscular dystrophy. Hum Mol Genet 2017; 26:753-767. [PMID: 28040729 PMCID: PMC5409123 DOI: 10.1093/hmg/ddw426] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2016] [Accepted: 12/15/2016] [Indexed: 11/13/2022] Open
Abstract
Repression of repetitive elements is crucial to preserve genome integrity and has been traditionally ascribed to constitutive heterochromatin pathways. FacioScapuloHumeral Muscular Dystrophy (FSHD), one of the most common myopathies, is characterized by a complex interplay of genetic and epigenetic events. The main FSHD form is linked to a reduced copy number of the D4Z4 macrosatellite repeat on 4q35, causing loss of silencing and aberrant expression of the D4Z4-embedded DUX4 gene leading to disease. By an unknown mechanism, D4Z4 copy-number correlates with FSHD phenotype. Here we show that the DUX4 proximal promoter (DUX4p) is sufficient to nucleate the enrichment of both constitutive and facultative heterochromatin components and to mediate a copy-number dependent gene silencing. We found that both the CpG/GC dense DNA content and the repetitive nature of DUX4p arrays are important for their repressive ability. We showed that DUX4p mediates a copy number-dependent Polycomb Repressive Complex 1 (PRC1) recruitment, which is responsible for the copy-number dependent gene repression. Overall, we directly link genetic and epigenetic defects in FSHD by proposing a novel molecular explanation for the copy number-dependency in FSHD pathogenesis, and offer insight into the molecular functions of repeats in chromatin regulation.
Collapse
Affiliation(s)
- Valentina Casa
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy.,Università Vita-Salute San Raffaele, Milan 20132, Italy
| | - Valeria Runfola
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | - Stefano Micheloni
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy
| | - Arif Aziz
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
| | - F Jeffrey Dilworth
- The Sprott Center for Stem Cell Research, Regenerative Medicine Program, Ottawa Hospital Research Institute, Ottawa, ON K1Y 4E9, Canada
| | - Davide Gabellini
- Gene Expression and Muscular Dystrophy Unit, Division of Regenerative Medicine, IRCCS San Raffaele Scientific Institute, Milan 20132, Italy.,Dulbecco Telethon Institute, Milan 20132, Italy
| |
Collapse
|
10
|
Cacheux L, Ponger L, Gerbault-Seureau M, Richard FA, Escudé C. Diversity and distribution of alpha satellite DNA in the genome of an Old World monkey: Cercopithecus solatus. BMC Genomics 2016; 17:916. [PMID: 27842493 PMCID: PMC5109768 DOI: 10.1186/s12864-016-3246-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2016] [Accepted: 11/02/2016] [Indexed: 11/10/2022] Open
Abstract
Background Alpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA. Results We carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes. Conclusions Our new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3246-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Loïc Ponger
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Florence Anne Richard
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.
| |
Collapse
|
11
|
Chromosome-Specific Centromere Sequences Provide an Estimate of the Ancestral Chromosome 2 Fusion Event in Hominin Genomes. J Hered 2016; 108:45-52. [DOI: 10.1093/jhered/esw039] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 06/20/2016] [Indexed: 12/14/2022] Open
|
12
|
Catacchio CR, Ragone R, Chiatante G, Ventura M. Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches. Sci Rep 2015; 5:14189. [PMID: 26387916 PMCID: PMC4585704 DOI: 10.1038/srep14189] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2015] [Accepted: 08/18/2015] [Indexed: 11/09/2022] Open
Abstract
The centromere/kinetochore interaction is responsible for the pairing and segregation of replicated chromosomes in eukaryotes. Centromere DNA is portrayed as scarcely conserved, repetitive in nature, quickly evolving and protein-binding competent. Among primates, the major class of centromeric DNA is the pancentromeric α-satellite, made of arrays of 171 bp monomers, repeated in a head-to-tail pattern. α-satellite sequences can either form tandem heterogeneous monomeric arrays or assemble in higher-order repeats (HORs). Gorilla centromere DNA has barely been characterized, and data are mainly based on hybridizations of human alphoid sequences. We isolated and finely characterized gorilla α-satellite sequences and revealed relevant structure and chromosomal distribution similarities with other great apes as well as gorilla-specific features, such as the uniquely octameric structure of the suprachromosomal family-2 (SF2). We demonstrated for the first time the orthologous localization of alphoid suprachromosomal families-1 and −2 (SF1 and SF2) between human and gorilla in contrast to chimpanzee centromeres. Finally, the discovery of a new 189 bp monomer type in gorilla centromeres unravels clues to the role of the centromere protein B, paving the way to solve the significance of the centromere DNA’s essential repetitive nature in association with its function and the peculiar evolution of the α-satellite sequence.
Collapse
Affiliation(s)
- C R Catacchio
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - R Ragone
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - G Chiatante
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| | - M Ventura
- University of Bari Aldo Moro, Department of Biology, Via Orabona 4, Bari, 70125, Italy
| |
Collapse
|
13
|
Melters DP, Bradnam KR, Young HA, Telis N, May MR, Ruby JG, Sebra R, Peluso P, Eid J, Rank D, Garcia JF, DeRisi JL, Smith T, Tobias C, Ross-Ibarra J, Korf I, Chan SWL. Comparative analysis of tandem repeats from hundreds of species reveals unique insights into centromere evolution. Genome Biol 2013; 14:R10. [PMID: 23363705 PMCID: PMC4053949 DOI: 10.1186/gb-2013-14-1-r10] [Citation(s) in RCA: 308] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 01/30/2013] [Indexed: 01/01/2023] Open
Abstract
Background Centromeres are essential for chromosome segregation, yet their DNA sequences evolve rapidly. In most animals and plants that have been studied, centromeres contain megabase-scale arrays of tandem repeats. Despite their importance, very little is known about the degree to which centromere tandem repeats share common properties between different species across different phyla. We used bioinformatic methods to identify high-copy tandem repeats from 282 species using publicly available genomic sequence and our own data. Results Our methods are compatible with all current sequencing technologies. Long Pacific Biosciences sequence reads allowed us to find tandem repeat monomers up to 1,419 bp. We assumed that the most abundant tandem repeat is the centromere DNA, which was true for most species whose centromeres have been previously characterized, suggesting this is a general property of genomes. High-copy centromere tandem repeats were found in almost all animal and plant genomes, but repeat monomers were highly variable in sequence composition and length. Furthermore, phylogenetic analysis of sequence homology showed little evidence of sequence conservation beyond approximately 50 million years of divergence. We find that despite an overall lack of sequence conservation, centromere tandem repeats from diverse species showed similar modes of evolution. Conclusions While centromere position in most eukaryotes is epigenetically determined, our results indicate that tandem repeats are highly prevalent at centromeres of both animal and plant genomes. This suggests a functional role for such repeats, perhaps in promoting concerted evolution of centromere DNA across chromosomes.
Collapse
|
14
|
Casa V, Gabellini D. A repetitive elements perspective in Polycomb epigenetics. Front Genet 2012; 3:199. [PMID: 23060903 PMCID: PMC3465993 DOI: 10.3389/fgene.2012.00199] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2012] [Accepted: 09/17/2012] [Indexed: 01/10/2023] Open
Abstract
Repetitive elements comprise over two-thirds of the human genome. For a long time, these elements have received little attention since they were considered non-functional. On the contrary, recent evidence indicates that they play central roles in genome integrity, gene expression, and disease. Indeed, repeats display meiotic instability associated with disease and are located within common fragile sites, which are hotspots of chromosome re-arrangements in tumors. Moreover, a variety of diseases have been associated with aberrant transcription of repetitive elements. Overall this indicates that appropriate regulation of repetitive elements' activity is fundamental. Polycomb group (PcG) proteins are epigenetic regulators that are essential for the normal development of multicellular organisms. Mammalian PcG proteins are involved in fundamental processes, such as cellular memory, cell proliferation, genomic imprinting, X-inactivation, and cancer development. PcG proteins can convey their activity through long-distance interactions also on different chromosomes. This indicates that the 3D organization of PcG proteins contributes significantly to their function. However, it is still unclear how these complex mechanisms are orchestrated and which role PcG proteins play in the multi-level organization of gene regulation. Intriguingly, the greatest proportion of Polycomb-mediated chromatin modifications is located in genomic repeats and it has been suggested that they could provide a binding platform for Polycomb proteins. Here, these lines of evidence are woven together to discuss how repetitive elements could contribute to chromatin organization in the 3D nuclear space.
Collapse
Affiliation(s)
- Valentina Casa
- Division of Regenerative Medicine, Stem Cells, and Gene Therapy, Dulbecco Telethon Institute and San Raffaele Scientific Institute Milano, Italy ; Università Vita-Salute San Raffaele Milano, Italy
| | | |
Collapse
|
15
|
Rosandić M, Glunčić M, Paar V. Start/stop codon like trinucleotides extensions in primate alpha satellites. J Theor Biol 2012; 317:301-9. [PMID: 23026763 DOI: 10.1016/j.jtbi.2012.09.022] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2012] [Revised: 09/07/2012] [Accepted: 09/19/2012] [Indexed: 11/28/2022]
Abstract
The centromeres remain "the final frontier" in unexplored segments of genome landscape in primate genomes, characterized by 2-5 Mb arrays of evolutionary rapidly evolving alpha satellite (AS) higher order repeats (HORs). Alpha satellites as specific noncoding sequences may be also significant in light of regulatory role of noncoding sequences. Using the Global Repeat Map (GRM) algorithm we identify in NCBI assemblies of chromosome 5 the species-specific alpha satellite HORs: 13mer in human, 5mer in chimpanzee, 14mer in orangutan and 3mers in macaque. The suprachromosomal family (SF) classification of alpha satellite HORs and surrounding monomeric alpha satellites is performed and specific segmental structure was found for major alpha satellite arrays in chromosome 5 of primates. In the framework of our novel concept of start/stop Codon Like Trinucleotides (CLTs) as a "new DNA language in noncoding sequences", we find characteristics and differences of these species in CLT extensions, in particular the extensions of stop-TGA CLT. We hypothesize that these are regulators in noncoding sequences, acting at a distance, and that they can amplify or weaken the activity of start/stop codons in coding sequences in protein genesis, increasing the richness of regulatory phenomena.
Collapse
Affiliation(s)
- Marija Rosandić
- Faculty of Science, University of Zagreb, 10000 Zagreb, Croatia.
| | | | | |
Collapse
|
16
|
Morozov VM, Gavrilova EV, Ogryzko VV, Ishov AM. Dualistic function of Daxx at centromeric and pericentromeric heterochromatin in normal and stress conditions. Nucleus 2012; 3:276-85. [PMID: 22572957 DOI: 10.4161/nucl.20180] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Nuclear structures ND10/PML NBs are linked to multiple processes, including the maintenance of intranuclear homeostasis by sequestering proteins into "nuclear depot." This function presumes release of proteins from PML NBs and their redistribution to the alternative, supposedly "active" locations, in response to the external stress application. To further investigate this nuclear depot function, we focused on the intranuclear distribution of protein Daxx that in normal conditions is mainly accumulated at PML NBs, and has a minor association with centromeres and pericentromeres (CEN/periCEN). Here we report that application of physiological Heat Shock (HS) changes this balance forcing very robust and reversible accumulation of Daxx on CEN/periCEN heterochromatin. Heterochromatin architecture is essential for the proper orchestration of nuclear processes, while transcription from this part of genome is required for its maintenance. To understand functional consequences of Daxx deposition at CEN/periCEN, we tested for Daxx-dependency of heterochromatin transcription. Depletion of Daxx reduces accumulation of CEN RNA in normal conditions and periCEN RNA after HS application. Searching for the mechanism of Daxx-dependent regulation of heterochromatin transcription, we found that depletion of Daxx decreases incorporation of transcription-associated histone H3 variant, H3.3, into both CEN and periCEN. Surprisingly, HS-induced deposition of Daxx does not further elevate incorporation of H3.3 into CEN/periCEN that remained steady during stress and recovery. Instead, depletion of Daxx leads to HS-induced changes in the balance of epigenetic modifications at heterochromatin, most dramatically elevating levels of active H3K4Me2 modification at periCEN. We propose dualistic function of Daxx-containing complexes at CEN/periCEN: (1) regulation of H3.3 loading in normal conditions and (2) protection of epigenetic status upon stress-induced accumulation, thus collectively guarding epigenetic identity of CEN/periCEN heterochromatin.
Collapse
Affiliation(s)
- Viacheslav M Morozov
- University of Florida, Cancer & Genetics Research Complex and Department of Anatomy and Cell Biology, Gainesville, FL, USA
| | | | | | | |
Collapse
|
17
|
Komissarov AS, Gavrilova EV, Demin SJ, Ishov AM, Podgornaya OI. Tandemly repeated DNA families in the mouse genome. BMC Genomics 2011; 12:531. [PMID: 22035034 PMCID: PMC3218096 DOI: 10.1186/1471-2164-12-531] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2011] [Accepted: 10/28/2011] [Indexed: 12/23/2022] Open
Abstract
Background Functional and morphological studies of tandem DNA repeats, that combine high portion of most genomes, are mostly limited due to the incomplete characterization of these genome elements. We report here a genome wide analysis of the large tandem repeats (TR) found in the mouse genome assemblies. Results Using a bioinformatics approach, we identified large TR with array size more than 3 kb in two mouse whole genome shotgun (WGS) assemblies. Large TR were classified based on sequence similarity, chromosome position, monomer length, array variability, and GC content; we identified four superfamilies, eight families, and 62 subfamilies - including 60 not previously described. 1) The superfamily of centromeric minor satellite is only found in the unassembled part of the reference genome. 2) The pericentromeric major satellite is the most abundant superfamily and reveals high order repeat structure. 3) Transposable elements related superfamily contains two families. 4) The superfamily of heterogeneous tandem repeats includes four families. One family is found only in the WGS, while two families represent tandem repeats with either single or multi locus location. Despite multi locus location, TRPC-21A-MM is placed into a separated family due to its abundance, strictly pericentromeric location, and resemblance to big human satellites. To confirm our data, we next performed in situ hybridization with three repeats from distinct families. TRPC-21A-MM probe hybridized to chromosomes 3 and 17, multi locus TR-22A-MM probe hybridized to ten chromosomes, and single locus TR-54B-MM probe hybridized with the long loops that emerge from chromosome ends. In addition to in silico predicted several extra-chromosomes were positive for TR by in situ analysis, potentially indicating inaccurate genome assembly of the heterochromatic genome regions. Conclusions Chromosome-specific TR had been predicted for mouse but no reliable cytogenetic probes were available before. We report new analysis that identified in silico and confirmed in situ 3/17 chromosome-specific probe TRPC-21-MM. Thus, the new classification had proven to be useful tool for continuation of genome study, while annotated TR can be the valuable source of cytogenetic probes for chromosome recognition.
Collapse
|
18
|
Lee HR, Hayden KE, Willard HF. Organization and molecular evolution of CENP-A--associated satellite DNA families in a basal primate genome. Genome Biol Evol 2011; 3:1136-49. [PMID: 21828373 PMCID: PMC3194837 DOI: 10.1093/gbe/evr083] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Centromeric regions in many complex eukaryotic species contain highly repetitive satellite DNAs. Despite the diversity of centromeric DNA sequences among species, the functional centromeres in all species studied to date are marked by CENP-A, a centromere-specific histone H3 variant. Although it is well established that families of multimeric higher-order alpha satellite are conserved at the centromeres of human and great ape chromosomes and that diverged monomeric alpha satellite is found in old and new world monkey genomes, little is known about the organization, function, and evolution of centromeric sequences in more distant primates, including lemurs. Aye-Aye (Daubentonia madagascariensis) is a basal primate and is located at a key position in the evolutionary tree to study centromeric satellite transitions in primate genomes. Using the approach of chromatin immunoprecipitation with antibodies directed to CENP-A, we have identified two satellite families, Daubentonia madagascariensis Aye-Aye 1 (DMA1) and Daubentonia madagascariensis Aye-Aye 2 (DMA2), related to each other but unrelated in sequence to alpha satellite or any other previously described primate or mammalian satellite DNA families. Here, we describe the initial genomic and phylogenetic organization of DMA1 and DMA2 and present evidence of higher-order repeats in Aye-Aye centromeric domains, providing an opportunity to study the emergence of chromosome-specific modes of satellite DNA evolution in primate genomes.
Collapse
Affiliation(s)
- Hye-Ran Lee
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, USA
| | | | | |
Collapse
|
19
|
Paar V, Glunčić M, Basar I, Rosandić M, Paar P, Cvitković M. Large Tandem, Higher Order Repeats and Regularly Dispersed Repeat Units Contribute Substantially to Divergence Between Human and Chimpanzee Y Chromosomes. J Mol Evol 2010; 72:34-55. [DOI: 10.1007/s00239-010-9401-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2010] [Accepted: 10/25/2010] [Indexed: 10/18/2022]
|
20
|
Alkan C, Cardone MF, Catacchio CR, Antonacci F, O'Brien SJ, Ryder OA, Purgato S, Zoli M, Della Valle G, Eichler EE, Ventura M. Genome-wide characterization of centromeric satellites from multiple mammalian genomes. Genome Res 2010; 21:137-45. [PMID: 21081712 DOI: 10.1101/gr.111278.110] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Collapse
Affiliation(s)
- Can Alkan
- Department of Genome Sciences, Howard Hughes Medical Institute, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
21
|
Cellamare A, Catacchio CR, Alkan C, Giannuzzi G, Antonacci F, Cardone MF, Della Valle G, Malig M, Rocchi M, Eichler EE, Ventura M. New insights into centromere organization and evolution from the white-cheeked gibbon and marmoset. Mol Biol Evol 2009; 26:1889-900. [PMID: 19429672 DOI: 10.1093/molbev/msp101] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The evolutionary history of alpha-satellite DNA, the major component of primate centromeres, is hardly defined because of the difficulty in its sequence assembly and its rapid evolution when compared with most genomic sequences. By using several approaches, we have cloned, sequenced, and characterized alpha-satellite sequences from two species representing critical nodes in the primate phylogeny: the white-cheeked gibbon, a lesser ape, and marmoset, a New World monkey. Sequence analyses demonstrate that white-cheeked gibbon and marmoset alpha-satellite sequences are formed by units of approximately 171 and approximately 342 bp, respectively, and they both lack the high-order structure found in humans and great apes. Fluorescent in situ hybridization characterization shows a broad dispersal of alpha-satellite in the white-cheeked gibbon genome including centromeric, telomeric, and chromosomal interstitial localizations. On the other hand, centromeres in marmoset appear organized in highly divergent dimers roughly of 342 bp that show a similarity between monomers much lower than previously reported dimers, thus representing an ancient dimeric structure. All these data shed light on the evolution of the centromeric sequences in Primates. Our results suggest radical differences in the structure, organization, and evolution of alpha-satellite DNA among different primate species, supporting the notion that 1) all the centromeric sequence in Primates evolved by genomic amplification, unequal crossover, and sequence homogenization using a 171 bp monomer as the basic seeding unit and 2) centromeric function is linked to relatively short repeated elements, more than higher-order structure. Moreover, our data indicate that complex higher-order repeat structures are a peculiarity of the hominid lineage, showing the more complex organization in humans.
Collapse
Affiliation(s)
- A Cellamare
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Zhou H, Li D, Song L, Liu R, Chen J, Huang X. Thr11 phosphorylated H3 is associated with centromere DNA during mitosis in MCF-7 cells. Mol Cell Biochem 2007; 311:45-50. [PMID: 18163200 DOI: 10.1007/s11010-007-9692-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2007] [Accepted: 12/17/2007] [Indexed: 11/29/2022]
Abstract
In this study, indirect immunofluorescence labeling was used to examine the cellular dynamic distribution of Thr11 phosphorylated H3 at mitosis in MCF-7 cells. The Thr11 phosphorylation was observed beginning at prophase at centromeres. Upon progression of mitosis, fluorescence signal was enhanced in the central region of the metaphase plate and maintained till anaphase at centromeres. During telophase, the fluorescent signal of Thr11 phosphorylated H3 disappears from centromeres, but the signal appears again at the midbody during cytokinesis, which suggests that the modified histones may take part in the formation of the midbody and play a crucial role in cytokinesis. Chromatin immunoprecipitation (ChIP) was used to confirm that Thr11 phosphorylated H3 is specifically associated with centromere DNA at prophase to metaphase, which is coincident with the results observed by immunofluorescence. In conclusion, there was a precise spatial and temporal correlation between H3 phosphorylation of Thr11 and stages of chromatin condensation. The timing of Thr11 phosphorylation and dephosphorylation in mitosis were similar to that reported for Ser10 phosphorylation of H3. The Thr11 phosphorylated H3 localized at centromeres during mitosis, which was different from the Ser10 phosphorylated H3 localized at telomere regions and Thr3 phosphorylated H3 localized along the chromosome arms. The results suggest that the Thr11 phosphorylation of histone H3 may play a specific role which was different from Ser10 and Thr3 phosphorylation in mitosis.
Collapse
Affiliation(s)
- Hao Zhou
- Department of Biochemistry and Molecular biology, College of Life Sciences, Nankai University, Tianjin, China
| | | | | | | | | | | |
Collapse
|
23
|
Alkan C, Ventura M, Archidiacono N, Rocchi M, Sahinalp SC, Eichler EE. Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data. PLoS Comput Biol 2007; 3:1807-18. [PMID: 17907796 PMCID: PMC1994983 DOI: 10.1371/journal.pcbi.0030181] [Citation(s) in RCA: 70] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2007] [Accepted: 07/31/2007] [Indexed: 11/18/2022] Open
Abstract
The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%–5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution. Centromeric DNA has been described as the last frontier of genomic sequencing; such regions are typically poorly assembled during the whole-genome shotgun sequence assembly process due to their repetitive complexity. This paper develops a computational algorithm to systematically extract data regarding primate centromeric DNA structure and organization from that ∼5% of sequence that is not included as part of standard genome sequence assemblies. Using this computational approach, we identify and reconstruct published human higher-order alpha satellite arrays and discover new families in human, chimpanzee, and Old World monkeys. Experimental validation confirms the utility of this computational approach to understanding the centromere organization of other nonhuman primates. An evolutionary analysis in diverse primate genomes supports fundamental differences in the structure and organization of centromere DNA between ape and Old World monkey lineages. The ability to extract meaningful biological data from random shotgun sequence data helps to fill an important void in large-scale sequencing of primate genomes, with implications for other genome sequencing projects.
Collapse
Affiliation(s)
- Can Alkan
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, United States of America
| | - Mario Ventura
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
| | | | - Mariano Rocchi
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
| | - S. Cenk Sahinalp
- Department of Computing Science, Simon Fraser University, Burnaby, British Columbia, Canada
| | - Evan E Eichler
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington, United States of America
- Howard Hughes Medical Institute, Seattle, Washington, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
24
|
Kehrer-Sawatzki H, Cooper DN. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons. Hum Mutat 2007; 28:99-130. [PMID: 17024666 DOI: 10.1002/humu.20420] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.
Collapse
|
25
|
Abstract
Centromeres are the elements of chromosomes that assemble the proteinaceous kinetochore, maintain sister chromatid cohesion, regulate chromosome attachment to the spindle, and direct chromosome movement during cell division. Although the functions of centromeres and the proteins that contribute to their complex structure and function are conserved in eukaryotes, centromeric DNA diverges rapidly. Human centromeres are particularly complicated. Here, we review studies on the organization of homogeneous arrays of chromosome-specific alpha-satellite repeats and evolutionary links among eukaryotic centromeric sequences. We also discuss epigenetic mechanisms of centromere identity that confer structural and functional features of the centromere through DNA-protein interactions and post-translational modifications, producing centromere-specific chromatin signatures. The assembly and organization of human centromeres, the contributions of satellite DNA to centromere identity and diversity, and the mechanism whereby centromeres are distinguished from the rest of the genome reflect ongoing puzzles in chromosome biology.
Collapse
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | |
Collapse
|
26
|
Kawabe A, Nasuda S. Polymorphic chromosomal specificity of centromere satellite families in Arabidopsis halleri ssp. gemmifera. Genetica 2006; 126:335-42. [PMID: 16636927 DOI: 10.1007/s10709-005-0718-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2005] [Accepted: 07/07/2005] [Indexed: 11/26/2022]
Abstract
The chromosomal localizations of repetitive DNA clusters (ribosomal DNA and centromere satellites) were analyzed by fluorescent in situ hybridization in five strains of Arabidopsis halleri ssp. gemmifera. All five A. gemmifera strains have three chromosome pairs with 45S (5.8S-16S-26S) rDNA loci, and one pair with both 5S and 45S rDNA loci. These localizations are different from that of A. thaliana. Very unusually, there are three families of centromeric satellite DNAs (pAa, pAge1, and pAge2), and they showed polymorphism among the five strains studied. Overall, we found four different centromere satellite compositions. A plant from Fumuro was heterozygous for the chromosome specificities of centromere satellite families, possibly due to a reciprocal translocation involving centromere regions. Changes of centromeric satellite repeats appear to be rapid and frequent events in the history of A. gemmifera, and seem to occur by exchanging clusters as units.
Collapse
Affiliation(s)
- Akira Kawabe
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kitashirakawa Oiwake-cho, 606-8502, Sakyo-ku, Kyoto, Japan.
| | | |
Collapse
|
27
|
Sjöberg MK, Shestakova E, Mansuroglu Z, Maccioni RB, Bonnefoy E. Tau protein binds to pericentromeric DNA: a putative role for nuclear tau in nucleolar organization. J Cell Sci 2006; 119:2025-34. [PMID: 16638814 DOI: 10.1242/jcs.02907] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The microtubule-associated tau protein participates in the organization and integrity of the neuronal cytoskeleton. A nuclear form of tau has been described in neuronal and non-neuronal cells, which displays a nucleolar localization during interphase but is associated with nucleolar-organizing regions in mitotic cells. In the present study, based on immunofluorescence, immuno-FISH and confocal microscopy, we show that nuclear tau is mainly present at the internal periphery of nucleoli, partially colocalizing with the nucleolar protein nucleolin and human AT-rich alpha-satellite DNA sequences organized as constitutive heterochromatin. By using gel retardation, we demonstrate that tau not only colocalizes with, but also specifically binds to, AT-rich satellite DNA sequences apparently through the recognition of AT-rich DNA stretches. Here we propose a functional role for nuclear tau in relation to the nucleolar organization and/or heterochromatinization of a portion of RNA genes. Since nuclear tau has also been found in neurons from patients with Alzheimer's disease (AD), aberrant nuclear tau could affect the nucleolar organization during the course of AD. We discuss nucleolar tau associated with AT-rich alpha-satellite DNA sequences as a potential molecular link between trisomy 21 and AD.
Collapse
Affiliation(s)
- Marcela K Sjöberg
- Laboratory of Cellular, Molecular Biology and Neurosciences, Millennium Institute for Advanced Studies in Cell Biology and Biotechnology (CBB), Faculty of Sciences, University of Chile, Las Encinas 3370, Nuñoa, Santiago, Chile.
| | | | | | | | | |
Collapse
|
28
|
Abstract
Alpha-satellite is a family of tandemly repeated sequences found at all normal human centromeres. In addition to its significance for understanding centromere function, alpha-satellite is also a model for concerted evolution, as alpha-satellite repeats are more similar within a species than between species. There are two types of alpha-satellite in the human genome; while both are made up of approximately 171-bp monomers, they can be distinguished by whether monomers are arranged in extremely homogeneous higher-order, multimeric repeat units or exist as more divergent monomeric alpha-satellite that lacks any multimeric periodicity. In this study, as a model to examine the genomic and evolutionary relationships between these two types, we have focused on the chromosome 17 centromeric region that has reached both higher-order and monomeric alpha-satellite in the human genome assembly. Monomeric and higher-order alpha-satellites on chromosome 17 are phylogenetically distinct, consistent with a model in which higher-order evolved independently of monomeric alpha-satellite. Comparative analysis between human chromosome 17 and the orthologous chimpanzee chromosome indicates that monomeric alpha-satellite is evolving at approximately the same rate as the adjacent non-alpha-satellite DNA. However, higher-order alpha-satellite is less conserved, suggesting different evolutionary rates for the two types of alpha-satellite.
Collapse
Affiliation(s)
- M Katharine Rudd
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina 27708, USA
| | | | | |
Collapse
|
29
|
Hall SE, Luo S, Hall AE, Preuss D. Differential rates of local and global homogenization in centromere satellites from Arabidopsis relatives. Genetics 2005; 170:1913-27. [PMID: 15937135 PMCID: PMC1449784 DOI: 10.1534/genetics.104.038208] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Higher eukaryotic centromeres contain thousands of satellite repeats organized into tandem arrays. As species diverge, new satellite variants are homogenized within and between chromosomes, yet the processes by which particular sequences are dispersed are poorly understood. Here, we isolated and analyzed centromere satellites in plants separated from Arabidopsis thaliana by 5-20 million years, uncovering more rapid satellite divergence compared to primate alpha-satellite repeats. We also found that satellites derived from the same genomic locus were more similar to each other than satellites derived from disparate genomic regions, indicating that new sequence alterations were homogenized more efficiently at a local, rather than global, level. Nonetheless, the presence of higher-order satellite arrays, similar to those identified in human centromeres, indicated limits to local homogenization and suggested that sequence polymorphisms may play important functional roles. In two species, we defined more extensive polymorphisms, identifying physically separated and highly distinct satellite types. Taken together, these data show that there is a balance between plant satellite homogenization and the persistence of satellite variants. This balance could ultimately generate sufficient sequence divergence to cause mating incompatibilities between plant species, while maintaining adequate conservation within a species for centromere activity.
Collapse
MESH Headings
- Amino Acid Sequence
- Arabidopsis/genetics
- Base Sequence
- Centromere/genetics
- Chromatin Immunoprecipitation
- Consensus Sequence
- DNA, Plant/analysis
- DNA, Satellite/genetics
- DNA, Satellite/metabolism
- Fluorescein-5-isothiocyanate
- Fluorescent Antibody Technique, Direct
- Fluorescent Dyes
- Genome, Plant
- Heterochromatin/metabolism
- In Situ Hybridization, Fluorescence
- Indoles
- Microscopy, Fluorescence
- Molecular Sequence Data
- Phylogeny
- Sequence Analysis, DNA
- Sequence Homology, Amino Acid
- Sequence Homology, Nucleic Acid
Collapse
Affiliation(s)
- Sarah E Hall
- Howard Hughes Medical Institute, University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | |
Collapse
|
30
|
Kawabe A, Nasuda S. Structure and genomic organization of centromeric repeats in Arabidopsis species. Mol Genet Genomics 2004; 272:593-602. [PMID: 15586291 DOI: 10.1007/s00438-004-1081-x] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2004] [Accepted: 10/05/2004] [Indexed: 10/26/2022]
Abstract
Centromeric repetitive sequences were isolated from Arabidopsis halleri ssp. gemmifera and A. lyrata ssp. kawasakiana. Two novel repeat families isolated from A. gemmifera were designated pAge1 and pAge2. These repeats are 180 bp in length and are organized in a head-to-tail manner. They are similar to the pAL1 repeats of A. thaliana and the pAa units of A. arenosa. Both A. gemmifera and A. kawasakiana possess the pAa, pAge1 and pAge2 repeat families. Sequence comparisons of different centromeric repeats revealed that these families share a highly conserved region of approximately 50 bp. Within each of the four repeat families, two or three regions showed low levels of sequence variation. The average difference in nucleotide sequence was approximately 10% within families and 30% between families, which resulted in clear distinctions between families upon phylogenetic analysis. FISH analysis revealed that the localization patterns for the pAa, pAge1 and pAge2 families were chromosome specific in A. gemmifera and A. kawasakiana. In one pair of chromosomes in A. gemmifera, and three pairs of chromosomes in A. kawasakiana, two repeat families were present. The presence of three families of centromeric repeats in A. gemmifera and A. kawasakiana indicates that the first step toward homogenization of centromeric repeats occurred at the chromosome level.
Collapse
Affiliation(s)
- A Kawabe
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, 606-8502, Kyoto, Japan
| | | |
Collapse
|
31
|
Modi WS, Ivanov S, Gallagher DS. Concerted evolution and higher-order repeat structure of the 1.709 (satellite IV) family in bovids. J Mol Evol 2004; 58:460-5. [PMID: 15114424 DOI: 10.1007/s00239-003-2567-6] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2003] [Accepted: 11/06/2003] [Indexed: 11/24/2022]
Abstract
The 1.709 or satellite IV repeated DNA family originally isolated from the domestic cow was analyzed using Southern blotting, pulsed field gel electrophoresis, fluorescence in situ hybridization, and DNA sequencing in species belonging to the genera Bos, Bison, Bubalus, Syncerus, Boselaphus, and Tragelaphus. Hybridization indicates that the family has been amplified in Bos, Bison, Bubalus, and Syncerus but not in Boselaphus or Tragelaphus. Pericentromeric, higher-order repeat substructure exists in all species, with multimeric arrays ranging in size from 10 to 1500 kb. Sequence analysis of a 492-bp PCR product revealed comparable levels (0.2-4.5%) of intra- and interspecific divergence when species of Bos and Bison were compared, supporting the idea that species of these two genera should be recognized under the genus Bos. Alternatively, all Syncerus sequences cluster as a monophyletic group on an evolutionary tree and differ from those of Bos/ Bison by about 13%. Comparing these findings with the fossil record indicates that concerted evolution has occurred since Bos/ Bison and Syncerus last shared a common ancestor (5.0 MYA) but before the radiation of the genus Bos (2.5 MYA): GenBank accession numbers AY517856-AY517904.
Collapse
Affiliation(s)
- William S Modi
- Basic Research Program, SAIC-Frederick, National Cancer Institute at Frederick, Frederick, MD 21702-1201, USA.
| | | | | |
Collapse
|
32
|
Amor DJ, Bentley K, Ryan J, Perry J, Wong L, Slater H, Choo KHA. Human centromere repositioning "in progress". Proc Natl Acad Sci U S A 2004; 101:6542-7. [PMID: 15084747 PMCID: PMC404081 DOI: 10.1073/pnas.0308637101] [Citation(s) in RCA: 168] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2003] [Accepted: 03/12/2004] [Indexed: 01/31/2023] Open
Abstract
Centromere repositioning provides a potentially powerful evolutionary force for reproductive isolation and speciation, but the underlying mechanisms remain ill-defined. An attractive model is through the simultaneous inactivation of a normal centromere and the formation of a new centromere at a hitherto noncentromeric chromosomal location with minimal detrimental effect. We report a two-generation family in which the centromeric activity of one chromosome 4 has been relocated to a euchromatic site at 4q21.3 through the epigenetic formation of a neocentromere in otherwise cytogenetically normal and mitotically stable karyotypes. Strong epigenetic inactivation of the original centromere is suggested by retention of 1.3 megabases of centromeric alpha-satellite DNA, absence of detectable molecular alteration in chromosome 4-centromereproximal p- and q-arm sequences, and failure of the inactive centromere to be reactivated through extensive culturing or treatment with histone deacetylase inhibitor trichostatin A. The neocentromere binds functionally essential centromere proteins (CENP-A, CENP-C, CENP-E, CENP-I, BUB1, and HP1), although a moderate reduction in CENP-A binding and sister-chromatid cohesion compared with the typical centromeres suggests possible underlying structural/functional differences. The stable mitotic and meiotic transmissibility of this pseudodicentric-neocentric chromosome in healthy individuals and the ability of the neocentric activity to form in a euchromatic site in preference to a preexisting alphoid domain provide direct evidence for an inherent mechanism of human centromere repositioning and karyotype evolution "in progress." We discuss the wider implication of such a mechanism for meiotic drive and the evolution of primate and other species.
Collapse
Affiliation(s)
- David J Amor
- Murdoch Children's Research Institute and Department of Paediatrics, Genetic Health Services Victoria, Royal Children's Hospital, Flemington Road, Victoria 3052, Australia
| | | | | | | | | | | | | |
Collapse
|
33
|
Abstract
Centromeres are the site for kinetochore formation and spindle attachment and are embedded in heterochromatin in most eukaryotes. The repeat-rich nature of heterochromatin has hindered obtaining a detailed understanding of the composition and organization of heterochromatic and centromeric DNA sequences. Here, we report the results of extensive sequence analysis of a fully functional centromere present in the Drosophila Dp1187 minichromosome. Approximately 8.4% (31 kb) of the highly repeated satellite DNA (AATAT and TTCTC) was sequenced, representing the largest data set of Drosophila satellite DNA sequence to date. Sequence analysis revealed that the orientation of the arrays is uniform and that individual repeats within the arrays mostly differ by rare, single-base polymorphisms. The entire complex DNA component of this centromere (69.7 kb) was sequenced and assembled. The 39-kb "complex island" Maupiti contains long stretches of a complex A+T rich repeat interspersed with transposon fragments, and most of these elements are organized as direct repeats. Surprisingly, five single, intact transposons are directly inserted at different locations in the AATAT satellite arrays. We find no evidence for centromere-specific sequences within this centromere, providing further evidence for sequence-independent, epigenetic determination of centromere identity and function in higher eukaryotes. Our results also demonstrate that the sequence composition and organization of large regions of centric heterochromatin can be determined, despite the presence of repeated DNA.
Collapse
Affiliation(s)
- Xiaoping Sun
- Molecular and Cell Biology Laboratory, The Salk Institute, La Jolla, CA 92037, USA
| | | | | | | |
Collapse
|
34
|
Hall SE, Kettler G, Preuss D. Centromere satellites from Arabidopsis populations: maintenance of conserved and variable domains. Genome Res 2003; 13:195-205. [PMID: 12566397 PMCID: PMC420371 DOI: 10.1101/gr.593403] [Citation(s) in RCA: 99] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The rapid evolution of centromere sequences between species has led to a debate over whether centromere activity is sequence-dependent. The Arabidopsis thaliana centromere regions contain approximately 20,000 copies of a 178-bp satellite repeat. Here, we analyzed satellites from 41 Arabidopsis ecotypes, providing the first broad population survey of satellite variation within a species. We found highly conserved segments and consistent sequence lengths in the Arabidopsis satellites and in the published collection of human alpha-satellites, supporting models for a functional role. Despite this conservation, polymorphisms are significantly enriched at some sites, yielding variation that could restrict binding proteins to a subset of repeat monomers. Some satellite regions vary considerably; at certain bases, consensus sequences derived from each ecotype diverge significantly from the Arabidopsis consensus, indicating substitutions sweep through a genome in less than 5 million years. Such rapid changes generate more variation within the set of Arabidopsis satellites than in genes from the chromosome arms or from the recombinationally suppressed centromere regions. These studies highlight a balance between the mechanisms that maintain particular satellite domains and the forces that disperse sequence changes throughout the satellite repeats in the genome.
Collapse
Affiliation(s)
- Sarah E Hall
- Committee on Genetics, University of Chicago, Chicago, Illinois 60637, USA
| | | | | |
Collapse
|
35
|
Crovella S, Del Pero M, Marziliano N, Garagna S, Pecile V, Morgutti M, Boniotto M, Amoroso A, Montagnon D, Campa CC, Lamberti L, Bigatti MP, Ardito G. MFASAT: a new alphoid DNA sequence isolated from Macaca fascicularis (Cercopithecidae, Primates). Genome 1999; 42:1066-70. [PMID: 10659771 DOI: 10.1139/g99-054] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
A new highly repeated DNA fragment isolated from Macaca fascicularis (MFASAT) is described. Our findings obtained by sequencing, Southern blot analysis, and fluorescent in situ hybridization (FISH) on metaphasic chromosomes strongly suggest that MFASAT can be considered as a member of the alphoid DNA family characteristic of Old World monkeys. The chromosomal localization of MFASAT, obtained by FISH, showed that this alphoid DNA is present in the peri-centromeric area of all the chromosomes. MFASAT showed a high degree of conservation when compared, by sequence alignment, to other Macaca species and Papio papio as expected for species with considerable genome conservation. A low degree of homology has been found comparing M. fascicularis alphoid DNA with a more distantly related Cercopithecidae species such as Cercopithecus aethiops.
Collapse
Affiliation(s)
- S Crovella
- Cattedra di Genetica e Servizio di Genetica, IRCCS Burlo-Garofolo, Trieste, Italy.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
36
|
Abstract
Primate pericentromeric regions recently have been shown to exhibit extraordinary evolutionary plasticity. In this paper we report an additional peculiar feature of these regions that we discovered while analyzing, by FISH, the evolutionary conservation of primate phylogenetic chromosome IX. If the position of the centromere is not taken into account, a relatively small number of rearrangements must be invoked to account for interspecific differences. Conversely, if the centromere is included, a paradox emerges: The position of the centromere seems to have undergone, in some species, an evolutionary history independent from the surrounding markers. A significant number of additional rearrangements must be proposed to reconcile the order of the markers with centromere position. Alternatively, the evolutionary emergence of neocentromeres can be postulated.
Collapse
|
37
|
|
38
|
Gosden JR. Identification and quantitation of human chromosomes by primed in situ synthesis. Chromosome Res 1996; 4:331-4. [PMID: 8871820 DOI: 10.1007/bf02257267] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Affiliation(s)
- J R Gosden
- MRC Human Genetics Unit, Western General Hospital, Edinburgh, UK.
| |
Collapse
|
39
|
Haaf T, Bray-Ward P. Region-specific YAC banding and painting probes for comparative genome mapping: implications for the evolution of human chromosome 2. Chromosoma 1996; 104:537-44. [PMID: 8662246 DOI: 10.1007/bf00352293] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
To date, several hundred nonchimeric yeast artificial chromosomes (YACs) from the Centre d'Etude du Polymorphisme Humain containing polymorphic sequence-tagged sites have been mapped by fluoresence in situ hybridization (FISH) on human metaphase chromosomes. Because they carry an average of 1 Mb of human genomic DNA, CEPH YACs generate high-intensity in situ hybridization signals. The available set of cytogenetically and genetically anchored YACs, approximately one every 5-10 cM evenly spaced over almost the entire human genome, provides complex region-specific probes for molecular cytogenetics. YAC probes can be adapted with unlimited flexibility to specific FISH applications such as the study of chromosomal evolution. We have generated representational probes for YAC banding and painting of human chromosome 2 and its great ape homologs. Convergent inversions were found in the pericentric region of the gorilla and orangutan homologs of chromosome 2p.
Collapse
Affiliation(s)
- T Haaf
- Max Planck Institute of Molecular Genetics, Ihnestrasse 73, D-14195 Berlin, Germany
| | | |
Collapse
|