1
|
Cechova M, Miga KH. Satellite DNAs and human sex chromosome variation. Semin Cell Dev Biol 2022; 128:15-25. [PMID: 35644878 DOI: 10.1016/j.semcdb.2022.04.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 11/17/2022]
Abstract
Satellite DNAs are present on every chromosome in the cell and are typically enriched in repetitive, heterochromatic parts of the human genome. Sex chromosomes represent a unique genomic and epigenetic context. In this review, we first report what is known about satellite DNA biology on human X and Y chromosomes, including repeat content and organization, as well as satellite variation in typical euploid individuals. Then, we review sex chromosome aneuploidies that are among the most common types of aneuploidies in the general population, and are better tolerated than autosomal aneuploidies. This is demonstrated also by the fact that aging is associated with the loss of the X, and especially the Y chromosome. In addition, supernumerary sex chromosomes enable us to study general processes in a cell, such as analyzing heterochromatin dosage (i.e. additional Barr bodies and long heterochromatin arrays on Yq) and their downstream consequences. Finally, genomic and epigenetic organization and regulation of satellite DNA could influence chromosome stability and lead to aneuploidy. In this review, we argue that the complete annotation of satellite DNA on sex chromosomes in human, and especially in centromeric regions, will aid in explaining the prevalence and the consequences of sex chromosome aneuploidies.
Collapse
Affiliation(s)
- Monika Cechova
- Faculty of Informatics, Masaryk University, Czech Republic
| | - Karen H Miga
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA; UC Santa Cruz Genomics Institute, University of California Santa Cruz, CA 95064, USA
| |
Collapse
|
2
|
Haig D. Paradox lost: Concerted evolution and centromeric instability: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost.: Centromeres are hospitable habitats for repeats that evolve adaptations for proliferation within the nucleus sometimes at organismal cost. Bioessays 2022; 44:e2200023. [PMID: 35748194 DOI: 10.1002/bies.202200023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 06/07/2022] [Accepted: 06/09/2022] [Indexed: 11/11/2022]
Abstract
Homologous centromeres compete for segregation to the secondary oocyte nucleus at female meiosis I. Centromeric repeats also compete with each other to populate centromeres in mitotic cells of the germline and have become adapted to use the recombinational machinery present at centromeres to promote their own propagation. Repeats are not needed at centromeres, rather centromeres appear to be hospitable habitats for the colonization and proliferation of repeats. This is probably an indirect consequence of two distinctive features of centromeric DNA. Centromeres are subject to breakage by the mechanical forces exerted by microtubules and meiotic crossing-over is suppressed. Centromeric proteins acting in trans are under selection to mitigate the costs of centromeric repeats acting in cis. Collateral costs of mitotic competition at centromeres may help to explain the high rates of aneuploidy observed in early human embryos.
Collapse
Affiliation(s)
- David Haig
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, USA
| |
Collapse
|
3
|
Population Scale Analysis of Centromeric Satellite DNA Reveals Highly Dynamic Evolutionary Patterns and Genomic Organization in Long-Tailed and Rhesus Macaques. Cells 2022; 11:cells11121953. [PMID: 35741082 PMCID: PMC9221937 DOI: 10.3390/cells11121953] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2022] [Revised: 06/12/2022] [Accepted: 06/14/2022] [Indexed: 02/04/2023] Open
Abstract
Centromeric satellite DNA (cen-satDNA) consists of highly divergent repeat monomers, each approximately 171 base pairs in length. Here, we investigated the genetic diversity in the centromeric region of two primate species: long-tailed (Macaca fascicularis) and rhesus (Macaca mulatta) macaques. Fluorescence in situ hybridization and bioinformatic analysis showed the chromosome-specific organization and dynamic nature of cen-satDNAsequences, and their substantial diversity, with distinct subfamilies across macaque populations, suggesting increased turnovers. Comparative genomics identified high level polymorphisms spanning a 120 bp deletion region and a remarkable interspecific variability in cen-satDNA size and structure. Population structure analysis detected admixture patterns within populations, indicating their high divergence and rapid evolution. However, differences in cen-satDNA profiles appear to not be involved in hybrid incompatibility between the two species. Our study provides a genomic landscape of centromeric repeats in wild macaques and opens new avenues for exploring their impact on the adaptive evolution and speciation of primates.
Collapse
|
4
|
Jeffery D, Lochhead M, Almouzni G. CENP-A: A Histone H3 Variant with Key Roles in Centromere Architecture in Healthy and Diseased States. Results Probl Cell Differ 2022; 70:221-261. [PMID: 36348109 DOI: 10.1007/978-3-031-06573-6_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Centromeres are key architectural components of chromosomes. Here, we examine their construction, maintenance, and functionality. Focusing on the mammalian centromere- specific histone H3 variant, CENP-A, we highlight its coevolution with both centromeric DNA and its chaperone, HJURP. We then consider CENP-A de novo deposition and the importance of centromeric DNA recently uncovered with the added value from new ultra-long-read sequencing. We next review how to ensure the maintenance of CENP-A at the centromere throughout the cell cycle. Finally, we discuss the impact of disrupting CENP-A regulation on cancer and cell fate.
Collapse
Affiliation(s)
- Daniel Jeffery
- Equipe Labellisée Ligue contre le Cancer, Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, UMR3664, Paris, France
| | - Marina Lochhead
- Equipe Labellisée Ligue contre le Cancer, Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, UMR3664, Paris, France
| | - Geneviève Almouzni
- Equipe Labellisée Ligue contre le Cancer, Institut Curie, PSL Research University, CNRS, Sorbonne Université, Nuclear Dynamics Unit, UMR3664, Paris, France.
| |
Collapse
|
5
|
Abstract
We are entering a new era in genomics where entire centromeric regions are accurately represented in human reference assemblies. Access to these high-resolution maps will enable new surveys of sequence and epigenetic variation in the population and offer new insight into satellite array genomics and centromere function. Here, we focus on the sequence organization and evolution of alpha satellites, which are credited as the genetic and genomic definition of human centromeres due to their interaction with inner kinetochore proteins and their importance in the development of human artificial chromosome assays. We provide an overview of alpha satellite repeat structure and array organization in the context of these high-quality reference data sets; discuss the emergence of variation-based surveys; and provide perspective on the role of this new source of genetic and epigenetic variation in the context of chromosome biology, genome instability, and human disease.
Collapse
Affiliation(s)
- Karen H Miga
- UC Santa Cruz Genomics Institute, University of California, Santa Cruz, California 95064, USA; .,Department of Biomolecular Engineering, University of California, Santa Cruz, California 95064, USA
| | - Ivan A Alexandrov
- Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia; .,Center for Algorithmic Biotechnology, Institute of Translational Biomedicine, Saint Petersburg State University, Saint Petersburg 199004, Russia.,Research Center of Biotechnology of the Russian Academy of Sciences, Moscow 119071, Russia
| |
Collapse
|
6
|
Vojvoda Zeljko T, Ugarković Đ, Pezer Ž. Differential enrichment of H3K9me3 at annotated satellite DNA repeats in human cell lines and during fetal development in mouse. Epigenetics Chromatin 2021; 14:47. [PMID: 34663449 PMCID: PMC8524813 DOI: 10.1186/s13072-021-00423-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2021] [Accepted: 10/05/2021] [Indexed: 01/24/2023] Open
Abstract
BACKGROUND Trimethylation of histone H3 on lysine 9 (H3K9me3) at satellite DNA sequences has been primarily studied at (peri)centromeric regions, where its level shows differences associated with various processes such as development and malignant transformation. However, the dynamics of H3K9me3 at distal satellite DNA repeats has not been thoroughly investigated. RESULTS We exploit the sets of publicly available data derived from chromatin immunoprecipitation combined with massively parallel DNA sequencing (ChIP-Seq), produced by the The Encyclopedia of DNA Elements (ENCODE) project, to analyze H3K9me3 at assembled satellite DNA repeats in genomes of human cell lines and during mouse fetal development. We show that annotated satellite elements are generally enriched for H3K9me3, but its level in cancer cell lines is on average lower than in normal cell lines. We find 407 satellite DNA instances with differential H3K9me3 enrichment between cancer and normal cells including a large 115-kb cluster of GSATII elements on chromosome 12. Differentially enriched regions are not limited to satellite DNA instances, but instead encompass a wider region of flanking sequences. We found no correlation between the levels of H3K9me3 and noncoding RNA at corresponding satellite DNA loci. The analysis of data derived from multiple tissues identified 864 instances of satellite DNA sequences in the mouse reference genome that are differentially enriched between fetal developmental stages. CONCLUSIONS Our study reveals significant differences in H3K9me3 level at a subset of satellite repeats between biological states and as such contributes to understanding of the role of satellite DNA repeats in epigenetic regulation during development and carcinogenesis.
Collapse
Affiliation(s)
| | | | - Željka Pezer
- Ruđer Bošković Institute, Bijenička 54, 10000, Zagreb, Croatia.
| |
Collapse
|
7
|
Feliciello I, Pezer Ž, Kordiš D, Bruvo Mađarić B, Ugarković Đ. Evolutionary History of Alpha Satellite DNA Repeats Dispersed within Human Genome Euchromatin. Genome Biol Evol 2021; 12:2125-2138. [PMID: 33078196 PMCID: PMC7719264 DOI: 10.1093/gbe/evaa224] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/14/2020] [Indexed: 01/03/2023] Open
Abstract
Major human alpha satellite DNA repeats are preferentially assembled within (peri)centromeric regions but are also dispersed within euchromatin in the form of clustered or short single repeat arrays. To study the evolutionary history of single euchromatic human alpha satellite repeats (ARs), we analyzed their orthologous loci across the primate genomes. The continuous insertion of euchromatic ARs throughout the evolutionary history of primates starting with the ancestors of Simiformes (45-60 Ma) and continuing up to the ancestors of Homo is revealed. Once inserted, the euchromatic ARs were stably transmitted to the descendant species, some exhibiting copy number variation, whereas their sequence divergence followed the species phylogeny. Many euchromatic ARs have sequence characteristics of (peri)centromeric alpha repeats suggesting heterochromatin as a source of dispersed euchromatic ARs. The majority of euchromatic ARs are inserted in the vicinity of other repetitive elements such as L1, Alu, and ERV or are embedded within them. Irrespective of the insertion context, each AR insertion seems to be unique and once inserted, ARs do not seem to be subsequently spread to new genomic locations. In spite of association with (retro)transposable elements, there is no indication that such elements play a role in ARs proliferation. The presence of short duplications at most of ARs insertion sites suggests site-directed recombination between homologous motifs in ARs and in the target genomic sequence, probably mediated by extrachromosomal circular DNA, as a mechanism of spreading within euchromatin.
Collapse
Affiliation(s)
- Isidoro Feliciello
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia.,Dipartimento di Medicina Clinica e Chirurgia, Universita' degli Studi di Napoli Federico II, Italy
| | - Željka Pezer
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| | - Dušan Kordiš
- Department of Molecular and Biomedical Sciences, Jožef Stefan Institute, Ljubljana, Slovenia
| | | | - Đurđica Ugarković
- Department of Molecular Biology, Ruđer Bošković Institute, Zagreb, Croatia
| |
Collapse
|
8
|
Lopes M, Louzada S, Gama-Carvalho M, Chaves R. Genomic Tackling of Human Satellite DNA: Breaking Barriers through Time. Int J Mol Sci 2021; 22:4707. [PMID: 33946766 PMCID: PMC8125562 DOI: 10.3390/ijms22094707] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2021] [Revised: 04/24/2021] [Accepted: 04/27/2021] [Indexed: 12/12/2022] Open
Abstract
(Peri)centromeric repetitive sequences and, more specifically, satellite DNA (satDNA) sequences, constitute a major human genomic component. SatDNA sequences can vary on a large number of features, including nucleotide composition, complexity, and abundance. Several satDNA families have been identified and characterized in the human genome through time, albeit at different speeds. Human satDNA families present a high degree of sub-variability, leading to the definition of various subfamilies with different organization and clustered localization. Evolution of satDNA analysis has enabled the progressive characterization of satDNA features. Despite recent advances in the sequencing of centromeric arrays, comprehensive genomic studies to assess their variability are still required to provide accurate and proportional representation of satDNA (peri)centromeric/acrocentric short arm sequences. Approaches combining multiple techniques have been successfully applied and seem to be the path to follow for generating integrated knowledge in the promising field of human satDNA biology.
Collapse
Affiliation(s)
- Mariana Lopes
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Sandra Louzada
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Margarida Gama-Carvalho
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| | - Raquel Chaves
- Laboratory of Cytogenomics and Animal Genomics (CAG), Department of Genetics and Biotechnology (DGB), University of Trás-os-Montes and Alto Douro (UTAD), 5000-801 Vila Real, Portugal; (M.L.); (S.L.)
- Biosystems and Integrative Sciences Institute (BioISI), Faculty of Sciences, University of Lisbon, 1749-016 Lisbon, Portugal;
| |
Collapse
|
9
|
Thakur J, Packiaraj J, Henikoff S. Sequence, Chromatin and Evolution of Satellite DNA. Int J Mol Sci 2021; 22:ijms22094309. [PMID: 33919233 PMCID: PMC8122249 DOI: 10.3390/ijms22094309] [Citation(s) in RCA: 70] [Impact Index Per Article: 23.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Revised: 04/16/2021] [Accepted: 04/17/2021] [Indexed: 12/15/2022] Open
Abstract
Satellite DNA consists of abundant tandem repeats that play important roles in cellular processes, including chromosome segregation, genome organization and chromosome end protection. Most satellite DNA repeat units are either of nucleosomal length or 5–10 bp long and occupy centromeric, pericentromeric or telomeric regions. Due to high repetitiveness, satellite DNA sequences have largely been absent from genome assemblies. Although few conserved satellite-specific sequence motifs have been identified, DNA curvature, dyad symmetries and inverted repeats are features of various satellite DNAs in several organisms. Satellite DNA sequences are either embedded in highly compact gene-poor heterochromatin or specialized chromatin that is distinct from euchromatin. Nevertheless, some satellite DNAs are transcribed into non-coding RNAs that may play important roles in satellite DNA function. Intriguingly, satellite DNAs are among the most rapidly evolving genomic elements, such that a large fraction is species-specific in most organisms. Here we describe the different classes of satellite DNA sequences, their satellite-specific chromatin features, and how these features may contribute to satellite DNA biology and evolution. We also discuss how the evolution of functional satellite DNA classes may contribute to speciation in plants and animals.
Collapse
Affiliation(s)
- Jitendra Thakur
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
- Correspondence:
| | - Jenika Packiaraj
- Department of Biology, Emory University, Atlanta, GA 30322, USA;
| | - Steven Henikoff
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA;
- Fred Hutchinson Cancer Research Center, Howard Hughes Medical Institute, Seattle, WA 98109, USA
| |
Collapse
|
10
|
Ahmad SF, Singchat W, Jehangir M, Suntronpong A, Panthum T, Malaivijitnond S, Srikulnath K. Dark Matter of Primate Genomes: Satellite DNA Repeats and Their Evolutionary Dynamics. Cells 2020; 9:E2714. [PMID: 33352976 PMCID: PMC7767330 DOI: 10.3390/cells9122714] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/15/2020] [Accepted: 12/16/2020] [Indexed: 12/12/2022] Open
Abstract
A substantial portion of the primate genome is composed of non-coding regions, so-called "dark matter", which includes an abundance of tandemly repeated sequences called satellite DNA. Collectively known as the satellitome, this genomic component offers exciting evolutionary insights into aspects of primate genome biology that raise new questions and challenge existing paradigms. A complete human reference genome was recently reported with telomere-to-telomere human X chromosome assembly that resolved hundreds of dark regions, encompassing a 3.1 Mb centromeric satellite array that had not been identified previously. With the recent exponential increase in the availability of primate genomes, and the development of modern genomic and bioinformatics tools, extensive growth in our knowledge concerning the structure, function, and evolution of satellite elements is expected. The current state of knowledge on this topic is summarized, highlighting various types of primate-specific satellite repeats to compare their proportions across diverse lineages. Inter- and intraspecific variation of satellite repeats in the primate genome are reviewed. The functional significance of these sequences is discussed by describing how the transcriptional activity of satellite repeats can affect gene expression during different cellular processes. Sex-linked satellites are outlined, together with their respective genomic organization. Mechanisms are proposed whereby satellite repeats might have emerged as novel sequences during different evolutionary phases. Finally, the main challenges that hinder the detection of satellite DNA are outlined and an overview of the latest methodologies to address technological limitations is presented.
Collapse
Affiliation(s)
- Syed Farhan Ahmad
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Worapong Singchat
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Maryam Jehangir
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Department of Structural and Functional Biology, Institute of Bioscience at Botucatu, São Paulo State University (UNESP), Botucatu, São Paulo 18618-689, Brazil
| | - Aorarat Suntronpong
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Thitipong Panthum
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
| | - Suchinda Malaivijitnond
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Department of Biology, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand
| | - Kornsorn Srikulnath
- Laboratory of Animal Cytogenetics and Comparative Genomics (ACCG), Department of Genetics, Faculty of Science, Kasetsart University, Bangkok 10900, Thailand; (S.F.A.); (W.S.); (M.J.); (A.S.); (T.P.)
- Special Research Unit for Wildlife Genomics (SRUWG), Department of Forest Biology, Faculty of Forestry, Kasetsart University, Bangkok 10900, Thailand
- National Primate Research Center of Thailand, Chulalongkorn University, Saraburi 18110, Thailand;
- Center of Excellence on Agricultural Biotechnology (AG-BIO/PERDO-CHE), Bangkok 10900, Thailand
- Omics Center for Agriculture, Bioresources, Food and Health, Kasetsart University (OmiKU), Bangkok 10900, Thailand
| |
Collapse
|
11
|
Suzuki Y, Myers EW, Morishita S. Rapid and ongoing evolution of repetitive sequence structures in human centromeres. SCIENCE ADVANCES 2020; 6:6/50/eabd9230. [PMID: 33310858 PMCID: PMC7732198 DOI: 10.1126/sciadv.abd9230] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2020] [Accepted: 10/30/2020] [Indexed: 06/12/2023]
Abstract
Our understanding of centromere sequence variation across human populations is limited by its extremely long nested repeat structures called higher-order repeats that are challenging to sequence. Here, we analyzed chromosomes 11, 17, and X using long-read sequencing data for 36 individuals from diverse populations including a Han Chinese trio and 21 Japanese. We revealed substantial structural diversity with many previously unidentified variant higher-order repeats specific to individuals characterizing rapid, haplotype-specific evolution of human centromeric arrays, while frequent single-nucleotide variants are largely conserved. We found a characteristic pattern shared among prevalent variants in human and chimpanzee. Our findings pave the way for studying sequence evolution in human and primate centromeres.
Collapse
Affiliation(s)
- Yuta Suzuki
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| | - Eugene W Myers
- Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany
| | - Shinichi Morishita
- The University of Tokyo, Graduate School of Frontier Sciences, Department of Computational Biology and Medical Sciences, Kashiwa, Chiba 277-8568, Japan.
| |
Collapse
|
12
|
Tolmacheva EN, Vasilyev SA, Lebedev IN. Aneuploidy and DNA Methylation as Mirrored Features of Early Human Embryo Development. Genes (Basel) 2020; 11:E1084. [PMID: 32957536 PMCID: PMC7564410 DOI: 10.3390/genes11091084] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 09/08/2020] [Accepted: 09/15/2020] [Indexed: 12/18/2022] Open
Abstract
Genome stability is an integral feature of all living organisms. Aneuploidy is the most common cause of fetal death in humans. The timing of bursts in increased aneuploidy frequency coincides with the waves of global epigenetic reprogramming in mammals. During gametogenesis and early embryogenesis, parental genomes undergo two waves of DNA methylation reprogramming. Failure of these processes can critically affect genome stability, including chromosome segregation during cell division. Abnormal methylation due to errors in the reprogramming process can potentially lead to aneuploidy. On the other hand, the presence of an entire additional chromosome, or chromosome loss, can affect the global genome methylation level. The associations of these two phenomena are well studied in the context of carcinogenesis, but here, we consider the relationship of DNA methylation and aneuploidy in early human and mammalian ontogenesis. In this review, we link these two phenomena and highlight the critical ontogenesis periods and genome regions that play a significant role in human reproduction and in the formation of pathological phenotypes in newborns with chromosomal aneuploidy.
Collapse
Affiliation(s)
- Ekaterina N. Tolmacheva
- Research Institute of Medical Genetics, Tomsk National Research Medical Center, 634050 Tomsk, Russia; (S.A.V.); (I.N.L.)
| | | | | |
Collapse
|
13
|
Talbert PB, Henikoff S. What makes a centromere? Exp Cell Res 2020; 389:111895. [PMID: 32035948 DOI: 10.1016/j.yexcr.2020.111895] [Citation(s) in RCA: 87] [Impact Index Per Article: 21.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2019] [Revised: 01/18/2020] [Accepted: 02/05/2020] [Indexed: 12/26/2022]
Abstract
Centromeres are the eukaryotic chromosomal sites at which the kinetochore forms and attaches to spindle microtubules to orchestrate chromosomal segregation in mitosis and meiosis. Although centromeres are essential for cell division, their sequences are not conserved and evolve rapidly. Centromeres vary dramatically in size and organization. Here we categorize their diversity and explore the evolutionary forces shaping them. Nearly all centromeres favor AT-rich DNA that is gene-free and transcribed at a very low level. Repair of frequent centromere-proximal breaks probably contributes to their rapid sequence evolution. Point centromeres are only ~125 bp and are specified by common protein-binding motifs, whereas short regional centromeres are 1-5 kb, typically have unique sequences, and may have pericentromeric repeats adapted to facilitate centromere clustering. Transposon-rich centromeres are often ~100-300 kb and are favored by RNAi machinery that silences transposons, by suppression of meiotic crossovers at centromeres, and by the ability of some transposons to target centromeres. Megabase-length satellite centromeres arise in plants and animals with asymmetric female meiosis that creates centromere competition, and favors satellite monomers one or two nucleosomes in length that position and stabilize centromeric nucleosomes. Holocentromeres encompass the length of a chromosome and may differ dramatically between mitosis and meiosis. We propose a model in which low level transcription of centromeres facilitates the formation of non-B DNA that specifies centromeres and promotes loading of centromeric nucleosomes.
Collapse
Affiliation(s)
- Paul B Talbert
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA
| | - Steven Henikoff
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA, 98109, USA.
| |
Collapse
|
14
|
Eskandarian Z, Fliegauf M, Bulashevska A, Proietti M, Hague R, Smulski CR, Schubert D, Warnatz K, Grimbacher B. Assessing the Functional Relevance of Variants in the IKAROS Family Zinc Finger Protein 1 ( IKZF1) in a Cohort of Patients With Primary Immunodeficiency. Front Immunol 2019; 10:568. [PMID: 31057532 PMCID: PMC6477086 DOI: 10.3389/fimmu.2019.00568] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Accepted: 03/04/2019] [Indexed: 12/13/2022] Open
Abstract
Common variable immunodeficiency (CVID) is the most frequent symptomatic primary immunodeficiency. Patients with CVID are prone to recurrent bacterial infection due to the failure of adequate immunoglobulin production. Monogenetic defects have been identified in ~25% of CVID patients. Recently, mutations in IKZF1, encoding the zinc-finger transcription factor IKAROS which is broadly expressed in hematopoietic cells, have been associated with a CVID-like phenotype. Herein we describe 11 patients with heterozygous IKZF1 variants from eight different families with autosomal dominant CVID and two siblings with an IKZF1 variant presenting with inflammatory bowel disease (IBD). This study shows that mutations affecting the DNA binding domain of IKAROS can impair the interaction with the target DNA sequence thereby preventing heterochromatin and pericentromeric localization (HC-PC) of the protein. Our results also indicate an impairment of pericentromeric localization of IKAROS by overexpression of a truncated variant, caused by an immature stop codon in IKZF1. We also describe an additional variant in TNFSF10, encoding Tumor Necrosis Factor Related Apoptosis Inducing Ligand (TRAIL), additionally presented in individuals of Family A. Our results indicate that this variant may impair the TRAIL-induced apoptosis in target cell lines and prohibit the NFκB activation by TRAIL and may act as a modifier in Family A.
Collapse
Affiliation(s)
- Zoya Eskandarian
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany.,Faculty of Biology, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Manfred Fliegauf
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany.,Centre for Integrative Biological Signalling Studies, Albert-Ludwigs University of Freiburg, Freiburg, Germany
| | - Alla Bulashevska
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Michele Proietti
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Rosie Hague
- Royal Hospital for Children, Glasgow, United Kingdom
| | - Cristian Roberto Smulski
- Department of Medical Physics, Centro Atómico Bariloche, CONICET, San Carlos de Bariloche, Argentina
| | - Desirée Schubert
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Klaus Warnatz
- Clinic for Rheumatology and Clinical Immunology, Faculty of Medicine, CCI, Medical Center, Albert-Ludwigs-University of Freiburg, Freiburg, Germany
| | - Bodo Grimbacher
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency, Medical Center, Faculty of Medicine, Albert-Ludwigs-University of Freiburg, Freiburg, Germany.,Centre for Integrative Biological Signalling Studies, Albert-Ludwigs University of Freiburg, Freiburg, Germany.,Satellite Center Freiburg, RESIST-Cluster of Excellence 2155, Hanover Medical School, Freiburg, Germany.,Satellite Center Freiburg, German Center for Infection Research, Freiburg, Germany.,Institute of Immunity and Transplantation, Royal Free Hospital, University College London, London, United Kingdom
| |
Collapse
|
15
|
McNulty SM, Sullivan BA. Alpha satellite DNA biology: finding function in the recesses of the genome. Chromosome Res 2018; 26:115-138. [PMID: 29974361 DOI: 10.1007/s10577-018-9582-3] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Accepted: 06/14/2018] [Indexed: 02/05/2023]
Abstract
Repetitive DNA, formerly referred to by the misnomer "junk DNA," comprises a majority of the human genome. One class of this DNA, alpha satellite, comprises up to 10% of the genome. Alpha satellite is enriched at all human centromere regions and is competent for de novo centromere assembly. Because of the highly repetitive nature of alpha satellite, it has been difficult to achieve genome assemblies at centromeres using traditional next-generation sequencing approaches, and thus, centromeres represent gaps in the current human genome assembly. Moreover, alpha satellite DNA is transcribed into repetitive noncoding RNA and contributes to a large portion of the transcriptome. Recent efforts to characterize these transcripts and their function have uncovered pivotal roles for satellite RNA in genome stability, including silencing "selfish" DNA elements and recruiting centromere and kinetochore proteins. This review will describe the genomic and epigenetic features of alpha satellite DNA, discuss recent findings of noncoding transcripts produced from distinct alpha satellite arrays, and address current progress in the functional understanding of this oft-neglected repetitive sequence. We will discuss unique challenges of studying human satellite DNAs and RNAs and point toward new technologies that will continue to advance our understanding of this largely untapped portion of the genome.
Collapse
Affiliation(s)
- Shannon M McNulty
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA
| | - Beth A Sullivan
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, NC, 27710, USA. .,Division of Human Genetics, Duke University Medical Center, Durham, NC, 27710, USA.
| |
Collapse
|
16
|
Cacheux L, Ponger L, Gerbault-Seureau M, Loll F, Gey D, Richard FA, Escudé C. The Targeted Sequencing of Alpha Satellite DNA in Cercopithecus pogonias Provides New Insight Into the Diversity and Dynamics of Centromeric Repeats in Old World Monkeys. Genome Biol Evol 2018; 10:1837-1851. [PMID: 29860303 PMCID: PMC6061836 DOI: 10.1093/gbe/evy109] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/29/2018] [Indexed: 02/06/2023] Open
Abstract
Alpha satellite is the major repeated DNA element of primate centromeres. Specific evolutionary mechanisms have led to a great diversity of sequence families with peculiar genomic organization and distribution, which have till now been studied mostly in great apes. Using high throughput sequencing of alpha satellite monomers obtained by enzymatic digestion followed by computational and cytogenetic analysis, we compare here the diversity and genomic distribution of alpha satellite DNA in two related Old World monkey species, Cercopithecus pogonias and Cercopithecus solatus, which are known to have diverged about 7 Ma. Two main families of monomers, called C1 and C2, are found in both species. A detailed analysis of our data sets revealed the existence of numerous subfamilies within the centromeric C1 family. Although the most abundant subfamily is conserved between both species, our fluorescence in situ hybridization (FISH) experiments clearly show that some subfamilies are specific for each species and that their distribution is restricted to a subset of chromosomes, thereby pointing to the existence of recurrent amplification/homogenization events. The pericentromeric C2 family is very abundant on the short arm of all acrocentric chromosomes in both species, pointing to specific mechanisms that lead to this distribution. Results obtained using two different restriction enzymes are fully consistent with a predominant monomeric organization of alpha satellite DNA that coexists with higher order organization patterns in the C. pogonias genome. Our study suggests a high dynamics of alpha satellite DNA in Cercopithecini, with recurrent apparition of new sequence variants and interchromosomal sequence transfer.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Loïc Ponger
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - François Loll
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Delphine Gey
- Service de Systématique Moléculaire, UMS 2700 CNRS, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| | - Florence Anne Richard
- Département Origines et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
- Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Adaptations du Vivant, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum National d’Histoire Naturelle, Paris, France
| |
Collapse
|
17
|
Garrido-Ramos MA. Satellite DNA: An Evolving Topic. Genes (Basel) 2017; 8:genes8090230. [PMID: 28926993 PMCID: PMC5615363 DOI: 10.3390/genes8090230] [Citation(s) in RCA: 222] [Impact Index Per Article: 31.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2017] [Revised: 09/12/2017] [Accepted: 09/13/2017] [Indexed: 12/22/2022] Open
Abstract
Satellite DNA represents one of the most fascinating parts of the repetitive fraction of the eukaryotic genome. Since the discovery of highly repetitive tandem DNA in the 1960s, a lot of literature has extensively covered various topics related to the structure, organization, function, and evolution of such sequences. Today, with the advent of genomic tools, the study of satellite DNA has regained a great interest. Thus, Next-Generation Sequencing (NGS), together with high-throughput in silico analysis of the information contained in NGS reads, has revolutionized the analysis of the repetitive fraction of the eukaryotic genomes. The whole of the historical and current approaches to the topic gives us a broad view of the function and evolution of satellite DNA and its role in chromosomal evolution. Currently, we have extensive information on the molecular, chromosomal, biological, and population factors that affect the evolutionary fate of satellite DNA, knowledge that gives rise to a series of hypotheses that get on well with each other about the origin, spreading, and evolution of satellite DNA. In this paper, I review these hypotheses from a methodological, conceptual, and historical perspective and frame them in the context of chromosomal organization and evolution.
Collapse
Affiliation(s)
- Manuel A Garrido-Ramos
- Departamento de Genética, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain.
| |
Collapse
|
18
|
Miga KH. The Promises and Challenges of Genomic Studies of Human Centromeres. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2017; 56:285-304. [PMID: 28840242 DOI: 10.1007/978-3-319-58592-5_12] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Human centromeres are genomic regions that act as sites of kinetochore assembly to ensure proper chromosome segregation during mitosis and meiosis. Although the biological importance of centromeres in genome stability, and ultimately, cell viability are well understood, the complete sequence content and organization in these multi-megabase-sized regions remains unknown. The lack of a high-resolution reference assembly inhibits standard bioinformatics protocols, and as a result, sequence-based studies involving human centromeres lag far behind the advances made for the non-repetitive sequences in the human genome. In this chapter, I introduce what is known about the genomic organization in the highly repetitive regions spanning human centromeres, and discuss the challenges these sequences pose for assembly, alignment, and data interpretation. Overcoming these obstacles is expected to issue a new era for centromere genomics, which will offer new discoveries in basic cell biology and human biomedical research.
Collapse
Affiliation(s)
- Karen H Miga
- Center for Biomolecular Science and Engineering, University of California, Santa Cruz, CA, USA.
| |
Collapse
|
19
|
Cacheux L, Ponger L, Gerbault-Seureau M, Richard FA, Escudé C. Diversity and distribution of alpha satellite DNA in the genome of an Old World monkey: Cercopithecus solatus. BMC Genomics 2016; 17:916. [PMID: 27842493 PMCID: PMC5109768 DOI: 10.1186/s12864-016-3246-5] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2016] [Accepted: 11/02/2016] [Indexed: 11/10/2022] Open
Abstract
Background Alpha satellite is the major repeated DNA element of primate centromeres. Evolution of these tandemly repeated sequences has led to the existence of numerous families of monomers exhibiting specific organizational patterns. The limited amount of information available in non-human primates is a restriction to the understanding of the evolutionary dynamics of alpha satellite DNA. Results We carried out the targeted high-throughput sequencing of alpha satellite monomers and dimers from the Cercopithecus solatus genome, an Old World monkey from the Cercopithecini tribe. Computational approaches were used to infer the existence of sequence families and to study how these families are organized with respect to each other. While previous studies had suggested that alpha satellites in Old World monkeys were poorly diversified, our analysis provides evidence for the existence of at least four distinct families of sequences within the studied species and of higher order organizational patterns. Fluorescence in situ hybridization using oligonucleotide probes that are able to target each family in a specific way showed that the different families had distinct distributions on chromosomes and were not homogeneously distributed between chromosomes. Conclusions Our new approach provides an unprecedented and comprehensive view of the diversity and organization of alpha satellites in a species outside the hominoid group. We consider these data with respect to previously known alpha satellite families and to potential mechanisms for satellite DNA evolution. Applying this approach to other species will open new perspectives regarding the integration of satellite DNA into comparative genomic and cytogenetic studies. Electronic supplementary material The online version of this article (doi:10.1186/s12864-016-3246-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Lauriane Cacheux
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Loïc Ponger
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Michèle Gerbault-Seureau
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France
| | - Florence Anne Richard
- Département Systématique et Evolution, Institut de Systématique, Evolution, Biodiversité, UMR 7205 MNHN, CNRS, UPMC, EPHE, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.,Université Versailles St-Quentin, Montigny-le-Bretonneux, France
| | - Christophe Escudé
- Département Régulations, Développement et Diversité Moléculaire, Structure et Instabilité des Génomes, INSERM U1154, CNRS UMR7196, Sorbonne Universités, Muséum national d'Histoire naturelle, Paris, France.
| |
Collapse
|
20
|
Meštrović N, Mravinac B, Pavlek M, Vojvoda-Zeljko T, Šatović E, Plohl M. Structural and functional liaisons between transposable elements and satellite DNAs. Chromosome Res 2016; 23:583-96. [PMID: 26293606 DOI: 10.1007/s10577-015-9483-7] [Citation(s) in RCA: 82] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Transposable elements (TEs) and satellite DNAs (satDNAs) are typically identified as major repetitive DNA components in eukaryotic genomes. TEs are DNA segments able to move throughout a genome while satDNAs are tandemly repeated sequences organized in long arrays. Both classes of repetitive sequences are extremely diverse, and many TEs and satDNAs exist within a genome. Although they differ in structure, genomic organization, mechanisms of spread, and evolutionary dynamics, TEs and satDNAs can share sequence similarity and organizational patterns, thus indicating that complex mutual relationships can determine their evolution, and ultimately define roles they might have on genome architecture and function. Motivated by accumulating data about sequence elements that incorporate features of both TEs and satDNAs, here we present an overview of their structural and functional liaisons.
Collapse
Affiliation(s)
| | | | - Martina Pavlek
- Ruđer Bošković Institute, Bijenička 54, HR-10000, Zagreb, Croatia
| | | | - Eva Šatović
- Ruđer Bošković Institute, Bijenička 54, HR-10000, Zagreb, Croatia
| | - Miroslav Plohl
- Ruđer Bošković Institute, Bijenička 54, HR-10000, Zagreb, Croatia.
| |
Collapse
|
21
|
Clusters of alpha satellite on human chromosome 21 are dispersed far onto the short arm and lack ancient layers. Chromosome Res 2016; 24:421-36. [PMID: 27430641 DOI: 10.1007/s10577-016-9530-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Accepted: 06/03/2016] [Indexed: 10/21/2022]
Abstract
Human alpha satellite (AS) sequence domains that currently function as centromeres are typically flanked by layers of evolutionarily older AS that presumably represent the remnants of earlier primate centromeres. Studies on several human chromosomes reveal that these older AS arrays are arranged in an age gradient, with the oldest arrays farthest from the functional centromere and arrays progressively closer to the centromere being progressively younger. The organization of AS on human chromosome 21 (HC21) has not been well-characterized. We have used newly available HC21 sequence data and an HC21p YAC map to determine the size, organization, and location of the AS arrays, and compared them to AS arrays found on other chromosomes. We find that the majority of the HC21 AS sequences are present on the p-arm of the chromosome and are organized into at least five distinct isolated clusters which are distributed over a larger distance from the functional centromere than that typically seen for AS on other chromosomes. Using both phylogenetic and L1 element age estimations, we found that all of the HC21 AS clusters outside the functional centromere are of a similar relatively recent evolutionary origin. HC21 contains none of the ancient AS layers associated with early primate evolution which is present on other chromosomes, possibly due to the fact that the p-arm of HC21 and the other acrocentric chromosomes underwent substantial reorganization about 20 million years ago.
Collapse
|
22
|
Nambiar M, Smith GR. Repression of harmful meiotic recombination in centromeric regions. Semin Cell Dev Biol 2016; 54:188-97. [PMID: 26849908 DOI: 10.1016/j.semcdb.2016.01.042] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2015] [Accepted: 01/27/2016] [Indexed: 11/16/2022]
Abstract
During the first division of meiosis, segregation of homologous chromosomes reduces the chromosome number by half. In most species, sister chromatid cohesion and reciprocal recombination (crossing-over) between homologous chromosomes are essential to provide tension to signal proper chromosome segregation during the first meiotic division. Crossovers are not distributed uniformly throughout the genome and are repressed at and near the centromeres. Rare crossovers that occur too near or in the centromere interfere with proper segregation and can give rise to aneuploid progeny, which can be severely defective or inviable. We review here how crossing-over occurs and how it is prevented in and around the centromeres. Molecular mechanisms of centromeric repression are only now being elucidated. However, rapid advances in understanding crossing-over, chromosome structure, and centromere functions promise to explain how potentially deleterious crossovers are avoided in certain chromosomal regions while allowing beneficial crossovers in others.
Collapse
Affiliation(s)
- Mridula Nambiar
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA, United States.
| | - Gerald R Smith
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, 1100 Fairview Avenue North, Seattle, WA, United States.
| |
Collapse
|
23
|
Craddock EM, Gall JG, Jonas M. Hawaiian Drosophila genomes: size variation and evolutionary expansions. Genetica 2016; 144:107-24. [PMID: 26790663 DOI: 10.1007/s10709-016-9882-5] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2015] [Accepted: 01/09/2016] [Indexed: 01/24/2023]
Abstract
This paper reports genome sizes of one Hawaiian Scaptomyza and 16 endemic Hawaiian Drosophila species that include five members of the antopocerus species group, one member of the modified mouthpart group, and ten members of the picture wing clade. Genome size expansions have occurred independently multiple times among Hawaiian Drosophila lineages, and have resulted in an over 2.3-fold range of genome sizes among species, with the largest observed in Drosophila cyrtoloma (1C = 0.41 pg). We find evidence that these repeated genome size expansions were likely driven by the addition of significant amounts of heterochromatin and satellite DNA. For example, our data reveal that the addition of seven heterochromatic chromosome arms to the ancestral haploid karyotype, and a remarkable proportion of ~70 % satellite DNA, account for the greatly expanded size of the D. cyrtoloma genome. Moreover, the genomes of 13/17 Hawaiian picture wing species are composed of substantial proportions (22-70 %) of detectable satellites (all but one of which are AT-rich). Our results suggest that in this tightly knit group of recently evolved species, genomes have expanded, in large part, via evolutionary amplifications of satellite DNA sequences in centric and pericentric domains (especially of the X and dot chromosomes), which have resulted in longer acrocentric chromosomes or metacentrics with an added heterochromatic chromosome arm. We discuss possible evolutionary mechanisms that may have shaped these patterns, including rapid fixation of novel expanded genomes during founder-effect speciation.
Collapse
Affiliation(s)
- Elysse M Craddock
- Natural Sciences Building, Purchase College, State University of New York, 735 Anderson Hill Road, Purchase, NY, 10577, USA.
| | - Joseph G Gall
- Department of Embryology, Carnegie Institution for Science, Baltimore, MD, USA
| | - Mark Jonas
- Natural Sciences Building, Purchase College, State University of New York, 735 Anderson Hill Road, Purchase, NY, 10577, USA
| |
Collapse
|
24
|
Pavlek M, Gelfand Y, Plohl M, Meštrović N. Genome-wide analysis of tandem repeats in Tribolium castaneum genome reveals abundant and highly dynamic tandem repeat families with satellite DNA features in euchromatic chromosomal arms. DNA Res 2015; 22:387-401. [PMID: 26428853 PMCID: PMC4675708 DOI: 10.1093/dnares/dsv021] [Citation(s) in RCA: 44] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2015] [Accepted: 08/26/2015] [Indexed: 12/31/2022] Open
Abstract
Although satellite DNAs are well-explored components of heterochromatin and centromeres, little is known about emergence, dispersal and possible impact of comparably structured tandem repeats (TRs) on the genome-wide scale. Our bioinformatics analysis of assembled Tribolium castaneum genome disclosed significant contribution of TRs in euchromatic chromosomal arms and clear predominance of satellite DNA-typical 170 bp monomers in arrays of ≥5 repeats. By applying different experimental approaches, we revealed that the nine most prominent TR families Cast1-Cast9 extracted from the assembly comprise ∼4.3% of the entire genome and reside almost exclusively in euchromatic regions. Among them, seven families that build ∼3.9% of the genome are based on ∼170 and ∼340 bp long monomers. Results of phylogenetic analyses of 2500 monomers originating from these families show high-sequence dynamics, evident by extensive exchanges between arrays on non-homologous chromosomes. In addition, our analysis shows that concerted evolution acts more efficiently on longer than on shorter arrays. Efficient genome-wide distribution of nine TR families implies the role of transposition only in expansion of the most dispersed family, and involvement of other mechanisms is anticipated. Despite similarities in sequence features, FISH experiments indicate high-level compartmentalization of centromeric and euchromatic tandem repeats.
Collapse
Affiliation(s)
- Martina Pavlek
- Ruđer Bošković Institute, Bijenička 54, Zagreb HR-10002, Croatia
| | - Yevgeniy Gelfand
- Laboratory for Biocomputing and Informatics, Boston University, Boston, MA 02215, USA
| | - Miroslav Plohl
- Ruđer Bošković Institute, Bijenička 54, Zagreb HR-10002, Croatia
| | | |
Collapse
|
25
|
Shepelev VA, Uralsky LI, Alexandrov AA, Yurov YB, Rogaev EI, Alexandrov IA. Annotation of suprachromosomal families reveals uncommon types of alpha satellite organization in pericentromeric regions of hg38 human genome assembly. GENOMICS DATA 2015; 5:139-146. [PMID: 26167452 PMCID: PMC4496801 DOI: 10.1016/j.gdata.2015.05.035] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- V A Shepelev
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia ; Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - L I Uralsky
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia
| | - A A Alexandrov
- Institute of Molecular Genetics, Russian Academy of Sciences, Kurchatov sq. 2, Moscow 123182, Russia
| | - Y B Yurov
- Research Center of Mental Health, Russian Academy of Medical Sciences, Zagorodnoe sh. 2, Moscow 113152, Russia
| | - E I Rogaev
- Department of Genomics and Human Genetics, Vavilov Institute of General Genetics, Russian Academy of Sciences, Moscow 119991, Russia ; Center for Brain Neurobiology and Neurogenetics, Institute of Cytology and Genetics, Siberian Branch of the Russian Academy of Sciences, Novosibirsk 630090, Russia ; Department of Psychiatry, Brudnick Neuropsychiatric Research Institute, University of Massachusetts Medical School, Worcester, MA 01604, USA ; Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow 119234, Russia
| | - I A Alexandrov
- Research Center of Mental Health, Russian Academy of Medical Sciences, Zagorodnoe sh. 2, Moscow 113152, Russia
| |
Collapse
|
26
|
Louzada S, Vieira-da-Silva A, Mendes-da-Silva A, Kubickova S, Rubes J, Adega F, Chaves R. A novel satellite DNA sequence in the Peromyscus genome (PMSat): Evolution via copy number fluctuation. Mol Phylogenet Evol 2015; 92:193-203. [PMID: 26103000 DOI: 10.1016/j.ympev.2015.06.008] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2014] [Revised: 06/11/2015] [Accepted: 06/12/2015] [Indexed: 12/16/2022]
Abstract
Satellite DNAs (satDNA) are tandemly arrayed repeated sequences largely present in eukaryotic genomes, which play important roles in genome evolution and function, and therefore, their analysis is vital. Here, we describe the isolation of a novel satellite DNA family (PMSat) from the rodent Peromyscus eremicus (Cricetidae, Rodentia), which is located in pericentromeric regions and exhibits a typical satellite DNA genome organization. Orthologous PMSat sequences were isolated and characterized from three species belonging to Cricetidae: Cricetus cricetus, Phodopus sungorus and Microtus arvalis. In these species, PMSat is highly conserved, with the absence of fixed species-specific mutations. Strikingly, different numbers of copies of this sequence were found among the species, suggesting evolution by copy number fluctuation. Repeat units of PMSat were also found in the Peromyscus maniculatus bairdii BioProject, but our results suggest that these repeat units are from genome regions outside the pericentromere. The remarkably high evolutionary sequence conservation along with the preservation of a few numbers of copies of this sequence in the analyzed genomes may suggest functional significance but a different sequence nature/organization. Our data highlight that repeats are difficult to analyze due to the limited tools available to dissect genomes and the fact that assemblies do not cover regions of constitutive heterochromatin.
Collapse
Affiliation(s)
- Sandra Louzada
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal
| | - Ana Vieira-da-Silva
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | - Ana Mendes-da-Silva
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | | | - Jiri Rubes
- Veterinary Research Institute, Brno, Czech Republic
| | - Filomena Adega
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal
| | - Raquel Chaves
- University of Trás-os-Montes and Alto Douro (UTAD), Department of Genetics and Biotechnology (DGB), Laboratory of Cytogenomics and Animal Genomics (CAG), Apdo 1013, 5001-801 Vila Real, Portugal; University of Lisboa, Faculty of Sciences, BioISI - Biosystems & Integrative Sciences Institute, Campo Grande, Lisboa, Portugal.
| |
Collapse
|
27
|
Hsieh LJ, Cheng YM, Wang YC, Lin CC, Li YC. Organization and evolution of a novel cervid satellite DNA with yeast CDEI-like repeats. Zool Stud 2014. [DOI: 10.1186/s40555-014-0025-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Abstract
Background
It has been proposed that pericentromeric satellite DNA arises from the progressive proximal expansion of ancient centromeric DNA. In an attempt to recover putative ancestral centromeric DNA, we microdissected the pericentromeric/centromeric DNA from the chromosome X + 3 of Indian muntjac (Muntiacus muntjak vaginalis) and constructed a microclone-library of the X + 3 centromeric DNA.
Results
A new cervid satellite DNA element, designated as satellite VI, was isolated from this library. Fluorescence in situ hybridization (FISH) studies revealed that satellite VI is predominately located on the distal pericentromeric region of the Indian muntjac chromosome X + 3 and on the pericentromeres of several Old World deer species studied. Its sequence is organized as 11-bp monomeric (ATCACGTGGGA) tandem repeats. Further sequencing on a BAC clone of Indian muntjac harboring this repeat showed that an array of this repeat stretches over approximately 5 kb followed by approximately 3 kb of interspersed repetitive sequences, such as long interspersed elements (LINEs), short interspersed elements (SINEs), and long terminal repeats (LTRs).
Conclusions
Based on the chromosomal localization, genomic and sequence organization, and copy numbers of satellite VI in deer species studied, we postulate that this newly found satellite DNA could be a putative ancient cervidic centromeric DNA that is still preserved in some Old World deer. Interestingly, the first eight nucleotides of the 11-bp monomeric consensus sequences are highly conserved and identical to the CDEI element in the centromere of the budding yeast Saccharomyces cerevisiae. The centromeric/pericentromeric satellite DNA harboring abundant copies of CDEI sequences is the first found in a mammalian species. Several zipper-like d (GGGA)2 motifs were also found in the (ATCACGTGGGA)n repeat of satellite VI DNA. Whether the satellite VI is structurally and functionally correlated with the CDEI of centromere of budding yeast and whether a zipper-like structure forms in satellite VI require further studies.
Collapse
|
28
|
Cazaux B, Catalan J, Justy F, Escudé C, Desmarais E, Britton-Davidian J. Evolution of the structure and composition of house mouse satellite DNA sequences in the subgenus Mus (Rodentia: Muridea): a cytogenomic approach. Chromosoma 2013; 122:209-20. [PMID: 23515652 DOI: 10.1007/s00412-013-0402-4] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2012] [Revised: 01/29/2013] [Accepted: 02/23/2013] [Indexed: 12/16/2022]
Abstract
The composition and orientation of the house mouse satellite DNA sequences (minor, major, TLC) were investigated by a FISH and CO-FISH approach in 11 taxa belonging to three clades of the subgenus Mus. Using a phylogenetic framework, our results highlighted two distribution patterns. The TLC satellite, the most recently discovered satellite, was present in all clades but varied quantitatively among species. This distribution supported its appearance in the ancestor of the subgenus followed by independent evolution in species of each clade. In contrast, the minor and major satellites occurred in only two clades of the subgenus indicating the simultaneous and recent amplification of these sequences. In addition, although qualitative differences in the composition and orientation of the satellite sequences were observed among the taxa, none of the features studied were unique to the house mouse and could account for the extensive chromosomal plasticity evidenced in Mus musculus domesticus.
Collapse
Affiliation(s)
- B Cazaux
- Institut des Sciences de l'Evolution, Université Montpellier 2, cc065, Pl. E. Bataillon, 34095 Montpellier Cedex 05, France
| | | | | | | | | | | |
Collapse
|
29
|
Abstract
Advances in human genomics have accelerated studies in evolution, disease, and cellular regulation. However, centromere sequences, defining the chromosomal interface with spindle microtubules, remain largely absent from ongoing genomic studies and disconnected from functional, genome-wide analyses. This disparity results from the challenge of predicting the linear order of multi-megabase-sized regions that are composed almost entirely of near-identical satellite DNA. Acknowledging these challenges, the field of human centromere genomics possesses the potential to rapidly advance given the availability of individual, or personalized, genome projects matched with the promise of long-read sequencing technologies. Here I review the current genomic model of human centromeres in consideration of those studies involving functional datasets that examine the role of sequence in centromere identity.
Collapse
|
30
|
Localization of centromeric breaks in head and neck squamous cell carcinoma. Cancer Genet 2012; 205:622-9. [DOI: 10.1016/j.cancergen.2012.10.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2012] [Revised: 10/01/2012] [Accepted: 10/02/2012] [Indexed: 11/20/2022]
|
31
|
Hayden KE, Willard HF. Composition and organization of active centromere sequences in complex genomes. BMC Genomics 2012; 13:324. [PMID: 22817545 PMCID: PMC3422206 DOI: 10.1186/1471-2164-13-324] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2012] [Accepted: 07/20/2012] [Indexed: 01/13/2023] Open
Abstract
Background Centromeres are sites of chromosomal spindle attachment during mitosis and meiosis. While the sequence basis for centromere identity remains a subject of considerable debate, one approach is to examine the genomic organization at these active sites that are correlated with epigenetic marks of centromere function. Results We have developed an approach to characterize both satellite and non-satellite centromeric sequences that are missing from current assemblies in complex genomes, using the dog genome as an example. Combining this genomic reference with an epigenetic dataset corresponding to sequences associated with the histone H3 variant centromere protein A (CENP-A), we identify active satellite sequence domains that appear to be both functionally and spatially distinct within the overall definition of satellite families. Conclusions These findings establish a genomic and epigenetic foundation for exploring the functional role of centromeric sequences in the previously sequenced dog genome and provide a model for similar studies within the context of less-characterized genomes.
Collapse
Affiliation(s)
- Karen E Hayden
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, Durham, NC, USA.
| | | |
Collapse
|
32
|
Noskov VN, Lee NC, Larionov V, Kouprina N. Rapid generation of long tandem DNA repeat arrays by homologous recombination in yeast to study their function in mammalian genomes. Biol Proced Online 2011; 13:8. [PMID: 21982381 PMCID: PMC3200152 DOI: 10.1186/1480-9222-13-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2011] [Accepted: 10/07/2011] [Indexed: 12/18/2022] Open
Abstract
We describe here a method to rapidly convert any desirable DNA fragment, as small as 100 bp, into long tandem DNA arrays up to 140 kb in size that are inserted into a microbe vector. This method includes rolling-circle phi29 amplification (RCA) of the sequence in vitro and assembly of the RCA products in vivo by homologous recombination in the yeast Saccharomyces cerevisiae. The method was successfully used for a functional analysis of centromeric and pericentromeric repeats and construction of new vehicles for gene delivery to mammalian cells. The method may have general application in elucidating the role of tandem repeats in chromosome organization and dynamics. Each cycle of the protocol takes ~ two weeks to complete.
Collapse
Affiliation(s)
- Vladimir N Noskov
- Laboratory of Molecular Pharmacology, National Cancer Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, Maryland 20892, USA.
| | | | | | | |
Collapse
|
33
|
Lee HR, Hayden KE, Willard HF. Organization and molecular evolution of CENP-A--associated satellite DNA families in a basal primate genome. Genome Biol Evol 2011; 3:1136-49. [PMID: 21828373 PMCID: PMC3194837 DOI: 10.1093/gbe/evr083] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
Centromeric regions in many complex eukaryotic species contain highly repetitive satellite DNAs. Despite the diversity of centromeric DNA sequences among species, the functional centromeres in all species studied to date are marked by CENP-A, a centromere-specific histone H3 variant. Although it is well established that families of multimeric higher-order alpha satellite are conserved at the centromeres of human and great ape chromosomes and that diverged monomeric alpha satellite is found in old and new world monkey genomes, little is known about the organization, function, and evolution of centromeric sequences in more distant primates, including lemurs. Aye-Aye (Daubentonia madagascariensis) is a basal primate and is located at a key position in the evolutionary tree to study centromeric satellite transitions in primate genomes. Using the approach of chromatin immunoprecipitation with antibodies directed to CENP-A, we have identified two satellite families, Daubentonia madagascariensis Aye-Aye 1 (DMA1) and Daubentonia madagascariensis Aye-Aye 2 (DMA2), related to each other but unrelated in sequence to alpha satellite or any other previously described primate or mammalian satellite DNA families. Here, we describe the initial genomic and phylogenetic organization of DMA1 and DMA2 and present evidence of higher-order repeats in Aye-Aye centromeric domains, providing an opportunity to study the emergence of chromosome-specific modes of satellite DNA evolution in primate genomes.
Collapse
Affiliation(s)
- Hye-Ran Lee
- Genome Biology Group, Duke Institute for Genome Sciences & Policy, Duke University, USA
| | | | | |
Collapse
|
34
|
Kuhn GCS, Küttler H, Moreira-Filho O, Heslop-Harrison JS. The 1.688 repetitive DNA of Drosophila: concerted evolution at different genomic scales and association with genes. Mol Biol Evol 2011; 29:7-11. [PMID: 21712468 DOI: 10.1093/molbev/msr173] [Citation(s) in RCA: 82] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Concerted evolution leading to homogenization of tandemly repeated DNA arrays is widespread and important for genome evolution. We investigated the range and nature of the process at chromosomal and array levels using the 1.688 tandem repeats of Drosophila melanogaster where large arrays are present in the heterochromatin of chromosomes 2, 3, and X, and short arrays are found in the euchromatin of the same chromosomes. Analysis of 326 euchromatic and heterochromatic repeats from 52 arrays showed that the homogenization of 1.688 repeats occurred differentially for distinct genomic regions, from euchromatin to heterochromatin and from local arrays to chromosomes. We further found that most euchromatic arrays are either close to, or are within introns of, genes. The short size of euchromatic arrays (one to five repeats) could be selectively constrained by their role as gene regulators, a situation similar to the so-called "tuning knobs."
Collapse
|
35
|
Koumbaris G, Hatzisevastou-Loukidou H, Alexandrou A, Ioannides M, Christodoulou C, Fitzgerald T, Rajan D, Clayton S, Kitsiou-Tzeli S, Vermeesch JR, Skordis N, Antoniou P, Kurg A, Georgiou I, Carter NP, Patsalis PC. FoSTeS, MMBIR and NAHR at the human proximal Xp region and the mechanisms of human Xq isochromosome formation. Hum Mol Genet 2011; 20:1925-36. [PMID: 21349920 PMCID: PMC3428953 DOI: 10.1093/hmg/ddr074] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
The recently described DNA replication-based mechanisms of fork stalling and template switching (FoSTeS) and microhomology-mediated break-induced replication (MMBIR) were previously shown to catalyze complex exonic, genic and genomic rearrangements. By analyzing a large number of isochromosomes of the long arm of chromosome X (i(Xq)), using whole-genome tiling path array comparative genomic hybridization (aCGH), ultra-high resolution targeted aCGH and sequencing, we provide evidence that the FoSTeS and MMBIR mechanisms can generate large-scale gross chromosomal rearrangements leading to the deletion and duplication of entire chromosome arms, thus suggesting an important role for DNA replication-based mechanisms in both the development of genomic disorders and cancer. Furthermore, we elucidate the mechanisms of dicentric i(Xq) (idic(Xq)) formation and show that most idic(Xq) chromosomes result from non-allelic homologous recombination between palindromic low copy repeats and highly homologous palindromic LINE elements. We also show that non-recurrent-breakpoint idic(Xq) chromosomes have microhomology-associated breakpoint junctions and are likely catalyzed by microhomology-mediated replication-dependent recombination mechanisms such as FoSTeS and MMBIR. Finally, we stress the role of the proximal Xp region as a chromosomal rearrangement hotspot.
Collapse
Affiliation(s)
- George Koumbaris
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | | | - Angelos Alexandrou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Marios Ioannides
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Christodoulos Christodoulou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Tomas Fitzgerald
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Diana Rajan
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Stephen Clayton
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Sophia Kitsiou-Tzeli
- Department of Medical Genetics, University of Athens, St Sophia Children’s Hospital, Athens 11527, Greece
| | - Joris R. Vermeesch
- Centre for Human Genetics, University Hospital, Catholic University of Leuven, 3000 Leuven, Belgium
| | - Nicos Skordis
- Pediatric Endocrine Unit, Makarios III Hospital, Nicosia 1474, Cyprus
| | - Pavlos Antoniou
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| | - Ants Kurg
- Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia
| | | | - Nigel P. Carter
- The Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK
| | - Philippos C. Patsalis
- Department of Cytogenetics and Genomics, The Cyprus Institute of Neurology and Genetics, Nicosia 2370, Cyprus
| |
Collapse
|
36
|
Alkan C, Cardone MF, Catacchio CR, Antonacci F, O'Brien SJ, Ryder OA, Purgato S, Zoli M, Della Valle G, Eichler EE, Ventura M. Genome-wide characterization of centromeric satellites from multiple mammalian genomes. Genome Res 2010; 21:137-45. [PMID: 21081712 DOI: 10.1101/gr.111278.110] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.
Collapse
Affiliation(s)
- Can Alkan
- Department of Genome Sciences, Howard Hughes Medical Institute, University of Washington School of Medicine, Seattle, Washington 98195, USA
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
37
|
Pironon N, Puechberty J, Roizès G. Molecular and evolutionary characteristics of the fraction of human alpha satellite DNA associated with CENP-A at the centromeres of chromosomes 1, 5, 19, and 21. BMC Genomics 2010; 11:195. [PMID: 20331851 PMCID: PMC2853522 DOI: 10.1186/1471-2164-11-195] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2009] [Accepted: 03/23/2010] [Indexed: 11/15/2022] Open
Abstract
Background The mode of evolution of the highly homogeneous Higher-Order-Repeat-containing alpha satellite arrays is still subject to discussion. This is also true of the CENP-A associated repeats where the centromere is formed. Results In this paper, we show that the molecular mechanisms by which these arrays evolve are identical in multiple chromosomes: i) accumulation of crossovers that homogenise and expand the arrays into different domains and subdomains that are mostly unshared between homologues and ii) sporadic mutations and conversion events that simultaneously differentiate them from one another. Individual arrays are affected by these mechanisms to different extents that presumably increase with time. Repeats associated with CENP-A, where the centromere is formed, are subjected to the same evolutionary mechanisms, but constitute minor subsets that exhibit subtle sequence differences from those of the bulk repeats. While the DNA sequence per se is not essential for centromere localisation along an array, it appears that certain sequences can be selected against. On chromosomes 1 and 19, which are more affected by the above evolutionary mechanisms than are chromosomes 21 and 5, CENP-A associated repeats were also recovered from a second homogeneous array present on each chromosome. This could be a way for chromosomes to sustain mitosis and meiosis when the normal centromere locus is ineluctably undermined by the above mechanisms. Conclusion We discuss, in light of these observations, possible scenarios for the normal evolutionary fates of human centromeric regions.
Collapse
Affiliation(s)
- Nathalie Pironon
- Institut de Génétique Humaine, UPR 1142, CNRS, 141 Rue de Cardonille, 34396 Montpellier Cedex 5, France
| | | | | |
Collapse
|
38
|
Abstract
Geneticists have long known that centromeres suppress crossing over, but considerable evidence indicates that they appear to recombine. Confirmation of gene conversion in maize centromeres explains this paradox.
Collapse
Affiliation(s)
- Paul B Talbert
- Howard Hughes Medical Institute, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | | |
Collapse
|
39
|
Schueler MG, Swanson W, Thomas PJ, Green ED. Adaptive evolution of foundation kinetochore proteins in primates. Mol Biol Evol 2010; 27:1585-97. [PMID: 20142441 DOI: 10.1093/molbev/msq043] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Rapid evolution is a hallmark of centromeric DNA in eukaryotic genomes. Yet, the centromere itself has a conserved functional role that is mediated by the kinetochore protein complex. To broaden our understanding about both the DNA and proteins that interact at the functional centromere, we sought to gain a detailed view of the evolutionary events that have shaped the primate kinetochore. Specifically, we performed comparative mapping and sequencing of the genomic regions encompassing the genes encoding three foundation kinetochore proteins: Centromere Proteins A, B, and C (CENP-A, CENP-B, and CENP-C). A histone H3 variant, CENP-A provides the foundation of the centromere-specific nucleosome. Comparative sequence analyses of the CENP-A gene in 14 primate species revealed encoded amino-acid residues within both the histone-fold domain and the N-terminal tail that are under strong positive selection. Similar comparative analyses of CENP-C, another foundation protein essential for centromere function, identified amino-acid residues throughout the protein under positive selection in the primate lineage, including several in the centromere localization and DNA-binding regions. Perhaps surprisingly, the gene encoding CENP-B, a kinetochore protein that binds specifically to alpha-satellite DNA, was not found to be associated with signatures of positive selection. These findings point to important and distinct evolutionary forces operating on the DNA and proteins of the primate centromere.
Collapse
Affiliation(s)
- Mary G Schueler
- Genome Technology Branch, National Institutes of Health, Bethesda, MD, USA.
| | | | | | | | | |
Collapse
|
40
|
Blakesley RW, Hansen NF, Gupta J, McDowell JC, Maskeri B, Barnabas BB, Brooks SY, Coleman H, Haghighi P, Ho SL, Schandler K, Stantripop S, Vogt JL, Thomas PJ, Bouffard GG, Green ED. Effort required to finish shotgun-generated genome sequences differs significantly among vertebrates. BMC Genomics 2010; 11:21. [PMID: 20064230 PMCID: PMC2827409 DOI: 10.1186/1471-2164-11-21] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2009] [Accepted: 01/11/2010] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND The approaches for shotgun-based sequencing of vertebrate genomes are now well-established, and have resulted in the generation of numerous draft whole-genome sequence assemblies. In contrast, the process of refining those assemblies to improve contiguity and increase accuracy (known as 'sequence finishing') remains tedious, labor-intensive, and expensive. As a result, the vast majority of vertebrate genome sequences generated to date remain at a draft stage. RESULTS To date, our genome sequencing efforts have focused on comparative studies of targeted genomic regions, requiring sequence finishing of large blocks of orthologous sequence (average size 0.5-2 Mb) from various subsets of 75 vertebrates. This experience has provided a unique opportunity to compare the relative effort required to finish shotgun-generated genome sequence assemblies from different species, which we report here. Importantly, we found that the sequence assemblies generated for the same orthologous regions from various vertebrates show substantial variation with respect to misassemblies and, in particular, the frequency and characteristics of sequence gaps. As a consequence, the work required to finish different species' sequences varied greatly. Application of the same standardized methods for finishing provided a novel opportunity to "assay" characteristics of genome sequences among many vertebrate species. It is important to note that many of the problems we have encountered during sequence finishing reflect unique architectural features of a particular vertebrate's genome, which in some cases may have important functional and/or evolutionary implications. Finally, based on our analyses, we have been able to improve our procedures to overcome some of these problems and to increase the overall efficiency of the sequence-finishing process, although significant challenges still remain. CONCLUSION Our findings have important implications for the eventual finishing of the draft whole-genome sequences that have now been generated for a large number of vertebrates.
Collapse
Affiliation(s)
- Robert W Blakesley
- NIH Intramural Sequencing Center, National Human Genome Research Institute, National Institutes of Health, Bethesda, MD 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
41
|
Shepelev VA, Alexandrov AA, Yurov YB, Alexandrov IA. The evolutionary origin of man can be traced in the layers of defunct ancestral alpha satellites flanking the active centromeres of human chromosomes. PLoS Genet 2009; 5:e1000641. [PMID: 19749981 PMCID: PMC2729386 DOI: 10.1371/journal.pgen.1000641] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2009] [Accepted: 08/11/2009] [Indexed: 02/01/2023] Open
Abstract
Alpha satellite domains that currently function as centromeres of human chromosomes are flanked by layers of older alpha satellite, thought to contain dead centromeres of primate progenitors, which lost their function and the ability to homogenize satellite repeats, upon appearance of a new centromere. Using cladistic analysis of alpha satellite monomers, we elucidated complete layer patterns on chromosomes 8, 17, and X and related them to each other and to primate alpha satellites. We show that discrete and chronologically ordered alpha satellite layers are partially symmetrical around an active centromere and their succession is partially shared in non-homologous chromosomes. The layer structure forms a visual representation of the human evolutionary lineage with layers corresponding to ancestors of living primates and to entirely fossil taxa. Surprisingly, phylogenetic comparisons suggest that alpha satellite arrays went through periods of unusual hypermutability after they became "dead" centromeres. The layer structure supports a model of centromere evolution where new variants of a satellite repeat expanded periodically in the genome by rounds of inter-chromosomal transfer/amplification. Each wave of expansion covered all or many chromosomes and corresponded to a new primate taxon. Complete elucidation of the alpha satellite phylogenetic record would give a unique opportunity to number and locate the positions of major extinct taxa in relation to human ancestors shared with extant primates. If applicable to other satellites in non-primate taxa, analysis of centromeric layers could become an invaluable tool for phylogenetic studies.
Collapse
Affiliation(s)
- Valery A. Shepelev
- Institute of Molecular Genetics, Russian Academy of Sciences, Moscow, Russia
| | | | - Yuri B. Yurov
- Mental Health Research Centre, Russian Academy of Medical Sciences, Moscow, Russia
| | - Ivan A. Alexandrov
- Mental Health Research Centre, Russian Academy of Medical Sciences, Moscow, Russia
- * E-mail:
| |
Collapse
|
42
|
Pertile MD, Graham AN, Choo KHA, Kalitsis P. Rapid evolution of mouse Y centromere repeat DNA belies recent sequence stability. Genome Res 2009; 19:2202-13. [PMID: 19737860 DOI: 10.1101/gr.092080.109] [Citation(s) in RCA: 42] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The Y centromere sequence of house mouse, Mus musculus, remains unknown despite our otherwise significant knowledge of the genome sequence of this important mammalian model organism. Here, we report the complete molecular characterization of the C57BL/6J chromosome Y centromere, which comprises a highly diverged minor satellite-like sequence (designated Ymin) with higher-order repeat (HOR) sequence organization previously undescribed at mouse centromeres. The Ymin array is approximately 90 kb in length and resides within a single BAC clone that provides sequence information spanning an endogenous animal centromere for the first time. By exploiting direct patrilineal inheritance of the Y chromosome, we demonstrate stability of the Y centromere DNA structure spanning at least 175 inbred generations to beyond the time of domestication of the East Asian M.m. molossinus "fancy" mouse through which the Y chromosome was first introduced into the classical inbred laboratory mouse strains. Despite this stability, at least three unequal genetic exchange events have altered Ymin HOR unit length and sequence structure since divergence of the ancestral Mus musculus subspecies around 900,000 yr ago, with major turnover of the HOR arrays driving rapid divergence of sequence and higher-order structure at the mouse Y centromere. A comparative sequence analysis between the human and chimpanzee centromeres indicates a similar rapid divergence of the primate Y centromere. Our data point to a unique DNA sequence and organizational architecture for the mouse Y centromere that has evolved independently of all other mouse centromeres.
Collapse
Affiliation(s)
- Mark D Pertile
- Murdoch Childrens Research Institute, Victoria, Australia
| | | | | | | |
Collapse
|
43
|
Mravinac B, Sullivan LL, Reeves JW, Yan CM, Kopf KS, Farr CJ, Schueler MG, Sullivan BA. Histone modifications within the human X centromere region. PLoS One 2009; 4:e6602. [PMID: 19672304 PMCID: PMC2719913 DOI: 10.1371/journal.pone.0006602] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2009] [Accepted: 07/06/2009] [Indexed: 12/11/2022] Open
Abstract
Human centromeres are multi-megabase regions of highly ordered arrays of alpha satellite DNA that are separated from chromosome arms by unordered alpha satellite monomers and other repetitive elements. Complexities in assembling such large repetitive regions have limited detailed studies of centromeric chromatin organization. However, a genomic map of the human X centromere has provided new opportunities to explore genomic architecture of a complex locus. We used ChIP to examine the distribution of modified histones within centromere regions of multiple X chromosomes. Methylation of H3 at lysine 4 coincided with DXZ1 higher order alpha satellite, the site of CENP-A localization. Heterochromatic histone modifications were distributed across the 400–500 kb pericentromeric regions. The large arrays of alpha satellite and gamma satellite DNA were enriched for both euchromatic and heterochromatic modifications, implying that some pericentromeric repeats have multiple chromatin characteristics. Partial truncation of the X centromere resulted in reduction in the size of the CENP-A/Cenp-A domain and increased heterochromatic modifications in the flanking pericentromere. Although the deletion removed ∼1/3 of centromeric DNA, the ratio of CENP-A to alpha satellite array size was maintained in the same proportion, suggesting that a limited, but defined linear region of the centromeric DNA is necessary for kinetochore assembly. Our results indicate that the human X centromere contains multiple types of chromatin, is organized similarly to smaller eukaryotic centromeres, and responds to structural changes by expanding or contracting domains.
Collapse
Affiliation(s)
- Brankica Mravinac
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Lori L. Sullivan
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Jason W. Reeves
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- University Program in Genetics and Genomics, Duke University, Durham, North Carolina, United States of America
| | - Christopher M. Yan
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, United States of America
| | - Kristen S. Kopf
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
| | - Christine J. Farr
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Mary G. Schueler
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Beth A. Sullivan
- Institute for Genome Sciences & Policy, Duke University, Durham, North Carolina, United States of America
- Department of Molecular Genetics and Microbiology, Duke University Medical Center, Durham, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
44
|
Cellamare A, Catacchio CR, Alkan C, Giannuzzi G, Antonacci F, Cardone MF, Della Valle G, Malig M, Rocchi M, Eichler EE, Ventura M. New insights into centromere organization and evolution from the white-cheeked gibbon and marmoset. Mol Biol Evol 2009; 26:1889-900. [PMID: 19429672 DOI: 10.1093/molbev/msp101] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
The evolutionary history of alpha-satellite DNA, the major component of primate centromeres, is hardly defined because of the difficulty in its sequence assembly and its rapid evolution when compared with most genomic sequences. By using several approaches, we have cloned, sequenced, and characterized alpha-satellite sequences from two species representing critical nodes in the primate phylogeny: the white-cheeked gibbon, a lesser ape, and marmoset, a New World monkey. Sequence analyses demonstrate that white-cheeked gibbon and marmoset alpha-satellite sequences are formed by units of approximately 171 and approximately 342 bp, respectively, and they both lack the high-order structure found in humans and great apes. Fluorescent in situ hybridization characterization shows a broad dispersal of alpha-satellite in the white-cheeked gibbon genome including centromeric, telomeric, and chromosomal interstitial localizations. On the other hand, centromeres in marmoset appear organized in highly divergent dimers roughly of 342 bp that show a similarity between monomers much lower than previously reported dimers, thus representing an ancient dimeric structure. All these data shed light on the evolution of the centromeric sequences in Primates. Our results suggest radical differences in the structure, organization, and evolution of alpha-satellite DNA among different primate species, supporting the notion that 1) all the centromeric sequence in Primates evolved by genomic amplification, unequal crossover, and sequence homogenization using a 171 bp monomer as the basic seeding unit and 2) centromeric function is linked to relatively short repeated elements, more than higher-order structure. Moreover, our data indicate that complex higher-order repeat structures are a peculiarity of the hominid lineage, showing the more complex organization in humans.
Collapse
Affiliation(s)
- A Cellamare
- Department of Genetics and Microbiology, University of Bari, Bari, Italy
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Lu S, Xie YM, Li X, Luo J, Shi XQ, Hong X, Pan YH, Ma X. Mass spectrometry analysis of dynamic post-translational modifications of TH2B during spermatogenesis. Mol Hum Reprod 2009; 15:373-8. [PMID: 19346237 DOI: 10.1093/molehr/gap028] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
TH2B, an important testis histone, plays a key role in remodeling chromatin structure during spermatogenesis. We present a detailed study of post-translational modifications (PTMs) of histone TH2B from different developmental stages of sperm cells, using a combination of high performance liquid chromatography, enzymatic Glu-c digestions of peptides, liquid chromatography-mass spectrometry (LC-MS) and LC-MS/MS analysis. The results showed modification patterns of the intact histone TH2B during spermatogenesis. Acetylated TH2B was most abundant in spermatogonia (28.9%) when compared with the spermatocytes (8.3%) and round spermatids (11.2%). Several new PTMs of TH2B were identified. In spermatogonia, spermatocytes and round spermatids, T116 and K117, were modified by phosphorylation and methylation, respectively, forming a novel 'phospho switch' site. The identified modification patterns of histone TH2B in spermatogenic cells provides a basis for future studies on histone coding and epigenetic regulation during spermatogenesis.
Collapse
Affiliation(s)
- Shuang Lu
- Graduate School, Peking Union Medical College, Beijing, China
| | | | | | | | | | | | | | | |
Collapse
|
46
|
Kim JH, Ebersole T, Kouprina N, Noskov VN, Ohzeki JI, Masumoto H, Mravinac B, Sullivan BA, Pavlicek A, Dovat S, Pack SD, Kwon YW, Flanagan PT, Loukinov D, Lobanenkov V, Larionov V. Human gamma-satellite DNA maintains open chromatin structure and protects a transgene from epigenetic silencing. Genome Res 2009; 19:533-44. [PMID: 19141594 DOI: 10.1101/gr.086496.108] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
The role of repetitive DNA sequences in pericentromeric regions with respect to kinetochore/heterochromatin structure and function is poorly understood. Here, we use a mouse erythroleukemia cell (MEL) system for studying how repetitive DNA assumes or is assembled into different chromatin structures. We show that human gamma-satellite DNA arrays allow a transcriptionally permissive chromatin conformation in an adjacent transgene and efficiently protect it from epigenetic silencing. These arrays contain CTCF and Ikaros binding sites. In MEL cells, this gamma-satellite DNA activity depends on binding of Ikaros proteins involved in differentiation along the hematopoietic pathway. Given our discovery of gamma-satellite DNA in pericentromeric regions of most human chromosomes and a dynamic chromatin state of gamma-satellite arrays in their natural location, we suggest that gamma-satellite DNA represents a unique region of the functional centromere with a possible role in preventing heterochromatin spreading beyond the pericentromeric region.
Collapse
Affiliation(s)
- Jung-Hyun Kim
- Laboratory of Molecular Pharmacology, National Cancer Institute, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
47
|
Ugarković DI. Centromere-competent DNA: structure and evolution. PROGRESS IN MOLECULAR AND SUBCELLULAR BIOLOGY 2009; 48:53-76. [PMID: 19521812 DOI: 10.1007/978-3-642-00182-6_3] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Although extant data favour centromere being an epigenetic structure, it is also clear that centromere formation is based on DNA, in particular, tandemly repeated satellite DNA and its transcripts. Presence of conserved structural motifs within satellite DNAs such as periodically distributed AT tracts, protein binding sites, or promoter elements indicate that despite sequence flexibility, there are structural determinants that are prerequisite for centromere function. In addition, existence of functional centromeric DNA transcripts indicates possible importance of structural elements at the level of RNA secondary or tertiary structure. Rapid centromere evolution is explained by homologous recombination followed by extrachromosomal rolling circle replication. This could lead to amplification of different satellite sequences within a genome. However, only those satellites that have inherent centromere-competence in the form of structural requirements necessary for centromere function are after amplification fixed in a population as a new centromere.
Collapse
Affiliation(s)
- Durd Ica Ugarković
- Department of Molecular Biology, Rud er Bosković Institute, Bijenicka 54, HR-10002, Zagreb, Croatia.
| |
Collapse
|
48
|
Pereira V. Automated paleontology of repetitive DNA with REANNOTATE. BMC Genomics 2008; 9:614. [PMID: 19094224 PMCID: PMC2672092 DOI: 10.1186/1471-2164-9-614] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2008] [Accepted: 12/18/2008] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Dispersed repeats are a major component of eukaryotic genomes and drivers of genome evolution. Annotation of DNA sequences homologous to known repetitive elements has been mainly performed with the program REPEATMASKER. Sequences annotated by REPEATMASKER often correspond to fragments of repetitive elements resulting from the insertion of younger elements or other rearrangements. Although REPEATMASKER annotation is indispensable for studying genome biology, this annotation does not contain much information on the common origin of fossil fragments that share an insertion event, especially where clusters of nested insertions of repetitive elements have occurred. RESULTS Here I present REANNOTATE, a computational tool to process REPEATMASKER annotation for automated i) defragmentation of dispersed repetitive elements, ii) resolution of the temporal order of insertions in clusters of nested elements, and iii) estimating the age of the elements, if they have long terminal repeats. I have re-annotated the repetitive content of human chromosomes, providing evidence for a recent expansion of satellite repeats on the Y chromosome and, from the retroviral age distribution, for a higher rate of evolution on the Y relative to autosomes. CONCLUSION REANNOTATE is ready to process existing annotation for automated evolutionary analysis of all types of complex repeats in any genome. The tool is freely available under the GPL at http://www.bioinformatics.org/reannotate.
Collapse
Affiliation(s)
- Vini Pereira
- Department of Life Sciences, Imperial College London, Silwood Park campus, Ascot, Berkshire SL5 7PY, UK.
| |
Collapse
|
49
|
Rosandić M, Glunčić M, Paar V, Basar I. The role of alphoid higher order repeats (HORs) in the centromere folding. J Theor Biol 2008; 254:555-60. [DOI: 10.1016/j.jtbi.2008.06.012] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2007] [Revised: 05/13/2008] [Accepted: 06/06/2008] [Indexed: 10/21/2022]
|
50
|
Giordano J, Ge Y, Gelfand Y, Abrusán G, Benson G, Warburton PE. Evolutionary history of mammalian transposons determined by genome-wide defragmentation. PLoS Comput Biol 2008; 3:e137. [PMID: 17630829 PMCID: PMC1914374 DOI: 10.1371/journal.pcbi.0030137] [Citation(s) in RCA: 86] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2007] [Accepted: 05/31/2007] [Indexed: 01/30/2023] Open
Abstract
The constant bombardment of mammalian genomes by transposable elements (TEs) has resulted in TEs comprising at least 45% of the human genome. Because of their great age and abundance, TEs are important in comparative phylogenomics. However, estimates of TE age were previously based on divergence from derived consensus sequences or phylogenetic analysis, which can be unreliable, especially for older more diverged elements. Therefore, a novel genome-wide analysis of TE organization and fragmentation was performed to estimate TE age independently of sequence composition and divergence or the assumption of a constant molecular clock. Analysis of TEs in the human genome revealed ∼600,000 examples where TEs have transposed into and fragmented other TEs, covering >40% of all TEs or ∼542 Mbp of genomic sequence. The relative age of these TEs over evolutionary time is implicit in their organization, because newer TEs have necessarily transposed into older TEs that were already present. A matrix of the number of times that each TE has transposed into every other TE was constructed, and a novel objective function was developed that derived the chronological order and relative ages of human TEs spanning >100 million years. This method has been used to infer the relative ages across all four major TE classes, including the oldest, most diverged elements. Analysis of DNA transposons over the history of the human genome has revealed the early activity of some MER2 transposons, and the relatively recent activity of MER1 transposons during primate lineages. The TEs from six additional mammalian genomes were defragmented and analyzed. Pairwise comparison of the independent chronological orders of TEs in these mammalian genomes revealed species phylogeny, the fact that transposons shared between genomes are older than species-specific transposons, and a subset of TEs that were potentially active during periods of speciation. Transposable elements (TEs) are interspersed repetitive DNA families that are capable of copying themselves from place to place; they have literally infested our genome over evolutionary time, and now comprise as much as 45% of our total DNA. Because of their great age and abundance, TEs are important in evolutionary genomics. However, estimates of their age based on DNA sequence composition have been unreliable, especially for older more diverged elements. Therefore, a novel method to estimate the age of TEs was developed based on the fact that as TEs spread throughout the genome, they inserted into and fragmented older TEs that were already present. Therefore, the age of TEs can be revealed by how often they have been fragmented over evolutionary time. We performed a genome-wide defragmention of TEs, and developed a novel objective function to derive the chronological order of TEs spanning >100 million years. This method has been used to infer the relative ages of TEs from seven sequenced mammalian genomes across all four major TE classes, including the oldest, most diverged elements. This age estimate is independent of TE sequence composition or divergence and does not rely on the assumption of a constant molecular clock. This study provides a novel analysis of the evolutionary history of some of the most abundant and ancient repetitive DNA elements in mammalian genomes, which is important for understanding the dynamic forces that shape our genomes during evolution.
Collapse
Affiliation(s)
- Joti Giordano
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, United States of America
| | - Yongchao Ge
- Department of Neurology, Mount Sinai School of Medicine, New York, New York, United States of America
- Center for Translational Systems Biology, Mount Sinai School of Medicine, New York, New York, United States of America
| | - Yevgeniy Gelfand
- Laboratory for Biocomputing and Informatics, Boston University, Boston, Massachusetts, United States of America
| | - György Abrusán
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, United States of America
| | - Gary Benson
- Departments of Computer Science and Biology, Boston University, Boston, Massachusetts, United States of America
| | - Peter E Warburton
- Department of Genetics and Genomic Sciences, Mount Sinai School of Medicine, New York, New York, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|