1
Dennenmoser S, Sedlazeck FJ, Schatz MC, Altmüller J, Zytnicki M, Nolte AW. Genome‐wide patterns of transposon proliferation in an evolutionary young hybrid fish. Mol Ecol 2019; 28:1491-1505. [DOI: 10.1111/mec.14969] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Received: 06/30/2018] [Revised: 10/15/2018] [Accepted: 10/23/2018] [Indexed: 01/19/2023]
Affiliation(s)
- Stefan Dennenmoser
- Institute for Biology and Environmental Sciences, Carl von Ossietzky University Oldenburg, Oldenburg, Germany
- Michael C. Schatz
- Cold Spring Harbor Laboratory, Cold Spring Harbor, New York
- Departments of Computer Science and Biology, Johns Hopkins University, Baltimore, Maryland
- Janine Altmüller
- Cologne Center for Genomics, and Institute of Human Genetics, University of Cologne, Cologne, Germany
- Arne W. Nolte
- Institute for Biology and Environmental Sciences, Carl von Ossietzky University Oldenburg, Oldenburg, Germany
2
Dennenmoser S, Sedlazeck FJ, Iwaszkiewicz E, Li X, Altmüller J, Nolte AW. Copy number increases of transposable elements and protein-coding genes in an invasive fish of hybrid origin. Mol Ecol 2017; 26:4712-4724. [PMID: 28390096 PMCID: PMC5638112 DOI: 10.1111/mec.14134] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Received: 01/04/2017] [Revised: 03/23/2017] [Accepted: 03/27/2017] [Indexed: 12/25/2022]
Abstract
Evolutionary dynamics of structural genetic variation in lineages of hybrid origin is not well explored, although structural mutations may increase in controlled hybrid crosses. We therefore tested whether structural variants accumulate in a fish of recent hybrid origin, invasive Cottus, relative to both parental species Cottus rhenanus and Cottus perifretum. Copy-number variation in exons of 10,979 genes was assessed using comparative genome hybridization arrays. Twelve genes showed significantly higher copy numbers in invasive Cottus compared to both parents. This coincided with increased expression for three genes related to vision, detoxification and muscle development, suggesting possible gene dosage effects. Copy number increases of putative transposons were assessed by comparative mapping of genomic DNA reads against a de novo assembly of 1,005 repetitive elements. In contrast to exons, copy number increases of repetitive elements were common (20.7%) in invasive Cottus, whereas decrease was very rare (0.01%). Among the increased repetitive elements, 53.8% occurred at higher numbers in C. perifretum compared to C. rhenanus, while only 1.4% were more abundant in C. rhenanus. This implies a biased mutational process that amplifies genetic material from one ancestor. To assess the frequency of de novo mutations through hybridization, we screened 64 laboratory-bred F2 offspring between the parental species for copy-number changes at five candidate loci. We found no evidence for new structural variants, indicating that they are too rare to be detected given our sampling scheme. Instead, they must have accumulated over more generations than we observed in a controlled cross.
Affiliation(s)
- Stefan Dennenmoser
- Department for Evolutionary Genetics, Max‐Planck Institute for Evolutionary Biology, Plön, Germany
- Institute for Biology, Carl von Ossietzky University Oldenburg, Oldenburg, Germany
- Elzbieta Iwaszkiewicz
- Department for Evolutionary Genetics, Max‐Planck Institute for Evolutionary Biology, Plön, Germany
- Xiang‐Yi Li
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Janine Altmüller
- Cologne Center for Genomics, and Institute of Human Genetics, University of Cologne, Cologne, Germany
- Arne W. Nolte
- Department for Evolutionary Genetics, Max‐Planck Institute for Evolutionary Biology, Plön, Germany
- Institute for Biology, Carl von Ossietzky University Oldenburg, Oldenburg, Germany
3
Xu D, Pavlidis P, Thamadilok S, Redwood E, Fox S, Blekhman R, Ruhl S, Gokcumen O. Recent evolution of the salivary mucin MUC7. Sci Rep 2016; 6:31791. [PMID: 27558399 PMCID: PMC4997351 DOI: 10.1038/srep31791] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Received: 02/02/2016] [Accepted: 07/26/2016] [Indexed: 11/23/2022] Open
Abstract
Genomic structural variants constitute the majority of variable base pairs in primate genomes and affect gene function in multiple ways. While whole gene duplications and deletions are relatively well-studied, the biology of subexonic (i.e., within coding exon sequences) copy number variation remains elusive. The salivary MUC7 gene provides an opportunity for studying such variation, as it harbors copy number variable subexonic repeat sequences that encode for densely O-glycosylated domains (PTS-repeats) with microbe-binding properties. To understand the evolution of this gene, we analyzed mammalian and primate genomes within a comparative framework. Our analyses revealed that (i) MUC7 has emerged in the placental mammal ancestor and rapidly gained multiple sites for O-glycosylation; (ii) MUC7 has retained its extracellular activity in saliva in placental mammals; (iii) the anti-fungal domain of the protein was remodified under positive selection in the primate lineage; and (iv) MUC7 PTS-repeats have evolved recurrently and under adaptive constraints. Our results establish MUC7 as a major player in salivary adaptation, likely as a response to diverse pathogenic exposure in primates. On a broader scale, our study highlights variable subexonic repeats as a primary source for modular evolutionary innovation that leads to rapid functional adaptation.
Affiliation(s)
- Duo Xu
- Department of Biological Sciences, State University of New York at Buffalo, New York 14260, USA
- Pavlos Pavlidis
- Institute of Computer Science (ICS), Foundation of Research and Technology-Hellas, Heraklion, Crete, Greece
- Supaporn Thamadilok
- Department of Oral Biology, School of Dental Medicine, State University of New York at Buffalo, New York 14214, USA
- Emilie Redwood
- Department of Biological Sciences, State University of New York at Buffalo, New York 14260, USA
- Sara Fox
- Department of Biological Sciences, State University of New York at Buffalo, New York 14260, USA
- Ran Blekhman
- Department of Genetics, Cell Biology, and Development, University of Minnesota, Twin Cities, Minnesota 55455, USA
- Stefan Ruhl
- Department of Oral Biology, School of Dental Medicine, State University of New York at Buffalo, New York 14214, USA
- Omer Gokcumen
- Department of Biological Sciences, State University of New York at Buffalo, New York 14260, USA
4
Holocentromeres in Rhynchospora are associated with genome-wide centromere-specific repeat arrays interspersed among euchromatin. Proc Natl Acad Sci U S A 2015; 112:13633-8. [PMID: 26489653 DOI: 10.1073/pnas.1512255112] [Citation(s) in RCA: 71] [Impact Index Per Article: 7.9] [Indexed: 11/18/2022] Open
Abstract
Holocentric chromosomes lack a primary constriction, in contrast to monocentrics. They form kinetochores distributed along almost the entire poleward surface of the chromatids, to which spindle fibers attach. No centromere-specific DNA sequence has been found for any holocentric organism studied so far. It was proposed that centromeric repeats, typical for many monocentric species, could not occur in holocentrics, most likely because of differences in the centromere organization. Here we show that the holokinetic centromeres of the Cyperaceae Rhynchospora pubera are highly enriched by a centromeric histone H3 variant-interacting centromere-specific satellite family designated "Tyba" and by centromeric retrotransposons (i.e., CRRh) occurring as genome-wide interspersed arrays. Centromeric arrays vary in length from 3 to 16 kb and are intermingled with gene-coding sequences and transposable elements. We show that holocentromeres of metaphase chromosomes are composed of multiple centromeric units rather than possessing a diffuse organization, thus favoring the polycentric model. A cell-cycle-dependent shuffling of multiple centromeric units results in the formation of functional (poly)centromeres during mitosis. The genome-wide distribution of centromeric repeat arrays interspersing the euchromatin provides a previously unidentified type of centromeric chromatin organization among eukaryotes. Thus, different types of holocentromeres exist in different species, namely with and without centromeric repetitive sequences.
5
Centromere identity from the DNA point of view. Chromosoma 2014; 123:313-25. [PMID: 24763964 PMCID: PMC4107277 DOI: 10.1007/s00412-014-0462-0] [Citation(s) in RCA: 135] [Impact Index Per Article: 13.5] [Received: 11/27/2013] [Revised: 03/28/2014] [Accepted: 04/01/2014] [Indexed: 02/05/2023]
Abstract
The centromere is a chromosomal locus responsible for the faithful segregation of genetic material during cell division. It has become evident that centromeres can be established literally on any DNA sequence, and the possible synergy between DNA sequences and the most prominent centromere identifiers, protein components, and epigenetic marks remains uncertain. However, some evolutionary preferences seem to exist, and long-term established centromeres are frequently formed on long arrays of satellite DNAs and/or transposable elements. Recent progress in understanding functional centromere sequences is based largely on the high-resolution DNA mapping of sequences that interact with the centromere-specific histone H3 variant, the most reliable marker of active centromeres. In addition, sequence assembly and mapping of large repetitive centromeric regions, as well as comparative genome analyses offer insight into their complex organization and evolution. The rapidly advancing field of transcription in centromere regions highlights the functional importance of centromeric transcripts. Here, we comprehensively review the current state of knowledge on the composition and functionality of DNA sequences underlying active centromeres and discuss their contribution to the functioning of different centromere types in higher eukaryotes.
6
Genomic structure and evolution of multigene families: "flowers" on the human genome. Int J Evol Biol 2012; 2012:917678. [PMID: 22779033 PMCID: PMC3388347 DOI: 10.1155/2012/917678] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.8] [Received: 12/29/2011] [Revised: 04/06/2012] [Accepted: 04/09/2012] [Indexed: 11/17/2022]
Abstract
We report the results of an extensive investigation of genomic structures in the human genome, with a particular focus on relatively large repeats (>50 kb) in adjacent chromosomal regions. We named such structures “Flowers” because the pattern observed on dot plots resembles a flower. We detected a total of 291 Flowers in the human genome. They were predominantly located in euchromatic regions. Flowers are gene-rich compared to the average gene density of the genome. Genes involved in systems receiving environmental information, such as immunity and detoxification, were overrepresented in Flowers. Within a Flower, the mean number of duplication units was approximately four. The maximum and minimum identities between homologs in a Flower showed different distributions; the maximum identity was often concentrated to 100% identity, while the minimum identity was evenly distributed in the range of 78% to 100%. Using a gene conversion detection test, we found frequent and/or recent gene conversion events within the tested Flowers. Interestingly, many of those converted regions contained protein-coding genes. Computer simulation studies suggest that one role of such frequent gene conversions is the elongation of the life span of gene families in a Flower by the resurrection of pseudogenes.
7
Baker RH, Kuehl JV, Wilkinson GS. The Enhancer of split complex arose prior to the diversification of schizophoran flies and is strongly conserved between Drosophila and stalk-eyed flies (Diopsidae). BMC Evol Biol 2011; 11:354. [PMID: 22151427 PMCID: PMC3261227 DOI: 10.1186/1471-2148-11-354] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Received: 08/15/2011] [Accepted: 12/08/2011] [Indexed: 02/03/2023] Open
Abstract
Background: In Drosophila, the Enhancer of split complex (E(spl)-C) comprises 11 bHLH and Bearded genes that function during Notch signaling to repress proneural identity in the developing peripheral nervous system. Comparison with other insects indicates that the basal state for Diptera is a single bHLH and Bearded homolog and that the expansion of the gene complex occurred in the lineage leading to Drosophila. However, comparative genomic data from other fly species that would elucidate the origin and sequence of gene duplication for the complex is lacking. Therefore, in order to examine the evolutionary history of the complex within Diptera, we reconstructed, using several fosmid clones, the entire E(spl)-complex in the stalk-eyed fly, Teleopsis dalmanni and collected additional homologs of E(spl)-C genes from searches of dipteran EST databases and the Glossina morsitans genome assembly. Results: Comparison of the Teleopsis E(spl)-C gene organization with Drosophila indicates complete conservation in gene number and orientation between the species except that T. dalmanni contains a duplicated copy of E(spl)m5 that is not present in Drosophila. Phylogenetic analysis of E(spl)-complex bHLH and Bearded genes for several dipteran species clearly demonstrates that all members of the complex were present prior to the diversification of schizophoran flies. Comparison of upstream regulatory elements and 3' UTR domains between the species also reveals strong conservation for many of the genes and identifies several novel characteristics of E(spl)-C regulatory evolution including the discovery of a previously unidentified, highly conserved SPS+A domain between E(spl)mγ and E(spl)mβ. Conclusion: Identifying the phylogenetic origin of E(spl)-C genes and their associated regulatory DNA is essential to understanding the functional significance of this well-studied gene complex. Results from this study provide numerous insights into the evolutionary history of the complex and will help refine the focus of studies examining the adaptive consequences of this gene expansion.
Affiliation(s)
- Richard H Baker
- Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY 10024, USA.
8
9
Developmental diseases and the hypothetical Master Development Program. Med Hypotheses 2010; 74:564-73. [DOI: 10.1016/j.mehy.2009.09.035] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Received: 09/14/2009] [Accepted: 09/17/2009] [Indexed: 11/24/2022]
10
Parris GE. Scope of medical implications of the Master Development Program hypothesis. Med Hypotheses 2009; 74:953. [PMID: 20031335 DOI: 10.1016/j.mehy.2009.12.001] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Received: 11/21/2009] [Revised: 11/28/2009] [Accepted: 12/02/2009] [Indexed: 12/21/2022]
11
Mucin CYS domains are ancient and highly conserved modules that evolved in concert. Mol Phylogenet Evol 2009; 52:284-92. [DOI: 10.1016/j.ympev.2009.03.035] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.1] [Received: 09/17/2008] [Revised: 03/17/2009] [Accepted: 03/27/2009] [Indexed: 11/22/2022]
12
Schmidt J, Kirsch S, Rappold GA, Schempp W. Complex evolution of a Y-chromosomal double homeobox 4 (DUX4)-related gene family in hominoids. PLoS One 2009; 4:e5288. [PMID: 19404400 PMCID: PMC2671837 DOI: 10.1371/journal.pone.0005288] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Received: 02/26/2009] [Accepted: 03/24/2009] [Indexed: 12/21/2022] Open
Abstract
The human Y chromosome carries four human Y-chromosomal euchromatin/heterochromatin transition regions, all of which are characterized by the presence of interchromosomal segmental duplications. The Yq11.1/Yq11.21 transition region harbours a peculiar segment composed of an imperfectly organized tandem-repeat structure encoding four members of the double homeobox (DUX) gene family. By comparative fluorescence in situ hybridization (FISH) analysis we have documented the primary appearance of Y-chromosomal DUX genes (DUXY) on the gibbon Y chromosome. The major amplification and dispersal of DUXY paralogs occurred after the gibbon and hominid lineages had diverged. Orthologous DUXY loci of human and chimpanzee show a highly similar structural organization. Sequence alignment survey, phylogenetic reconstruction and recombination detection analyses of human and chimpanzee DUXY genes revealed the existence of all copies in a common ancestor. Comparative analysis of the circumjacent beta-satellites indicated that DUXY genes and beta-satellites evolved in concert. However, evolutionary forces acting on DUXY genes may have induced amino acid sequence differences in the orthologous chimpanzee and human DUXY open reading frames (ORFs). The acquisition of complete ORFs in human copies might relate to evolutionary advantageous functions indicating neo-functionalization. We propose an evolutionary scenario in which an ancestral tandem array DUX gene cassette transposed to the hominoid Y chromosome followed by lineage-specific chromosomal rearrangements paved the way for a species-specific evolution of the Y-chromosomal members of a large highly diverged homeobox gene family.
Affiliation(s)
- Julia Schmidt
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
- Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
- Gudrun A. Rappold
- Institute of Human Genetics, University of Heidelberg, Heidelberg, Germany
- Werner Schempp
- Institute of Human Genetics, University of Freiburg, Freiburg, Germany
13
Housley DJE, Nikolas M, Venta PJ, Jernigan KA, Waldman ID, Nigg JT, Friderici KH. SNP discovery and haplotype analysis in the segmentally duplicated DRD5 coding region. Ann Hum Genet 2009; 73:274-82. [PMID: 19397556 DOI: 10.1111/j.1469-1809.2009.00513.x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Indexed: 11/27/2022]
Abstract
The dopamine receptor 5 gene (DRD5) holds much promise as a candidate locus for contributing to neuropsychiatric disorders and other diseases influenced by the dopaminergic system, as well as having potential to affect normal behavioral variation. However, detailed analyses of this gene have been complicated by its location within a segmentally duplicated chromosomal region. Microsatellites and SNPs upstream from the coding region have been used for association studies, but we find, using bioinformatics resources, that these markers all lie within a previously unrecognized second segmental duplication (SD). In order to accurately analyze the DRD5 locus for polymorphisms in the absence of contaminating pseudogene sequences, we developed a fast and reliable method for sequence analysis and genotyping within the DRD5 coding region. We employed restriction enzyme digestion of genomic DNA to eliminate the pseudogenes prior to PCR amplification of the functional gene. This approach allowed us to determine the DRD5 haplotype structure using 31 trios and to reveal additional rare variants in 171 unrelated individuals. We clarify the inconsistencies and errors of the recorded SNPs in dbSNP and HapMap and illustrate the importance of using caution when choosing SNPs in regions of suspected duplications. The simple and relatively inexpensive method presented herein allows for convenient analysis of sequence variation in DRD5 and can be easily adapted to other duplicated genomic regions in order to obtain good quality sequence data.
Affiliation(s)
- Donna J E Housley
- Department of Microbiology & Molecular Genetics, Michigan State University, East Lansing, MI, USA
14
Varki A, Geschwind DH, Eichler EE. Explaining human uniqueness: genome interactions with environment, behaviour and culture. Nat Rev Genet 2008; 9:749-63. [PMID: 18802414 DOI: 10.1038/nrg2428] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.7] [Indexed: 12/14/2022]
Abstract
What makes us human? Specialists in each discipline respond through the lens of their own expertise. In fact, 'anthropogeny' (explaining the origin of humans) requires a transdisciplinary approach that eschews such barriers. Here we take a genomic and genetic perspective towards molecular variation, explore systems analysis of gene expression and discuss an organ-systems approach. Rejecting any 'genes versus environment' dichotomy, we then consider genome interactions with environment, behaviour and culture, finally speculating that aspects of human uniqueness arose because of a primate evolutionary trend towards increasing and irreversible dependence on learned behaviours and culture - perhaps relaxing allowable thresholds for large-scale genomic diversity.
Affiliation(s)
- Ajit Varki
- Center for Academic Research and Training in Anthropogeny, University of California, San Diego, La Jolla, California 92093, USA.
15
Münch C, Kirsch S, Fernandes AMG, Schempp W. Evolutionary analysis of the highly dynamic CHEK2 duplicon in anthropoids. BMC Evol Biol 2008; 8:269. [PMID: 18831734 PMCID: PMC2566985 DOI: 10.1186/1471-2148-8-269] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Received: 05/06/2008] [Accepted: 10/02/2008] [Indexed: 12/03/2022] Open
Abstract
Background: Segmental duplications (SDs) are euchromatic portions of genomic DNA (≥ 1 kb) that occur at more than one site within the genome, and typically share a high level of sequence identity (>90%). Approximately 5% of the human genome is composed of such duplicated sequences. Here we report the detailed investigation of CHEK2 duplications. CHEK2 is a multiorgan cancer susceptibility gene encoding a cell cycle checkpoint kinase acting in the DNA-damage response signalling pathway. The continuous presence of the CHEK2 gene in all eukaryotes and its important role in maintaining genome stability prompted us to investigate the duplicative evolution and phylogeny of CHEK2 and its paralogs during anthropoid evolution. Results: To study CHEK2 duplicon evolution in anthropoids we applied a combination of comparative FISH and in silico analyses. Our comparative FISH results with a CHEK2 fosmid probe revealed the single-copy status of CHEK2 in New World monkeys, Old World monkeys and gibbons. Whereas a single CHEK2 duplication was detected in orangutan, a multi-site signal pattern indicated a burst of duplication in African great apes and human. Phylogenetic analysis of paralogous and ancestral CHEK2 sequences in human, chimpanzee and rhesus macaque confirmed this burst of duplication, which occurred after the radiation of orangutan and African great apes. In addition, we used inter-species quantitative PCR to determine CHEK2 copy numbers. An amplification of CHEK2 was detected in African great apes and the highest CHEK2 copy number of all analysed species was observed in the human genome. Furthermore, we detected variation in CHEK2 copy numbers within the analysed set of human samples. Conclusion: Our detailed analysis revealed the highly dynamic nature of CHEK2 duplication during anthropoid evolution. We determined a burst of CHEK2 duplication after the radiation of orangutan and African great apes and identified the highest CHEK2 copy number in human. In conclusion, our analysis of CHEK2 duplicon evolution revealed that SDs contribute to inter-species variation. Furthermore, our qPCR analysis led us to presume CHEK2 copy number variation in human, and molecular diagnostics of the cancer susceptibility gene CHEK2 inside the duplicated region might be hampered by the individual-specific set of duplicons.
Affiliation(s)
- Claudia Münch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
- Stefan Kirsch
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
- António MG Fernandes
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
- Werner Schempp
- Institute of Human Genetics and Anthropology, University of Freiburg, Breisacher Str. 33, 79106 Freiburg, Germany
16
Affiliation(s)
- Crystal L Kahn
- Department of Computer Science, Brown University, Providence, RI, USA.
17
Kirsch S, Münch C, Jiang Z, Cheng Z, Chen L, Batz C, Eichler EE, Schempp W. Evolutionary dynamics of segmental duplications from human Y-chromosomal euchromatin/heterochromatin transition regions. Genome Res 2008; 18:1030-42. [PMID: 18445620 DOI: 10.1101/gr.076711.108] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Indexed: 11/25/2022]
Abstract
Human chromosomal regions enriched in segmental duplications are subject to extensive genomic reorganization. Such regions are particularly informative for illuminating the evolutionary history of a given chromosome. We have analyzed 866 kb of Y-chromosomal non-palindromic segmental duplications delineating four euchromatin/heterochromatin transition regions (Yp11.2/Yp11.1, Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2). Several computational methods were applied to decipher the segmental duplication architecture and identify the ancestral origin of the 41 different duplicons. Combining computational and comparative FISH analysis, we reconstruct the evolutionary history of these regions. Our analysis indicates a continuous process of transposition of duplicated sequences onto the evolving higher primate Y chromosome, providing unique insights into the development of species-specific Y-chromosomal and autosomal duplicons. Phylogenetic sequence comparisons show that duplicons of the human Yp11.2/Yp11.1 region were already present in the macaque-human ancestor as multiple paralogs located predominantly in subtelomeric regions. In contrast, duplicons from the Yq11.1/Yq11.21, Yq11.23/Yq12, and Yq12/PAR2 regions show no evidence of duplication in rhesus macaque, but map to the pericentromeric regions in chimpanzee and human. This suggests an evolutionary shift in the direction of duplicative transposition events from subtelomeric in Old World monkeys to pericentromeric in the human/ape lineage. Extensive chromosomal relocation of autosomal-duplicated sequences from euchromatin/heterochromatin transition regions to interstitial regions, as demonstrated on the pygmy chimpanzee Y chromosome, supports a model in which substantial reorganization and amplification of duplicated sequences may contribute to speciation.
Affiliation(s)
- Stefan Kirsch
- Institute of Human Genetics, University of Freiburg, 79106 Freiburg, Germany
18
Ambrosini A, Paul S, Hu S, Riethman H. Human subtelomeric duplicon structure and organization. Genome Biol 2008; 8:R151. [PMID: 17663781 PMCID: PMC2323237 DOI: 10.1186/gb-2007-8-7-r151] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.6] [Received: 03/29/2007] [Revised: 06/25/2007] [Accepted: 07/30/2007] [Indexed: 01/27/2023] Open
Abstract
The sequence divergence within subtelomeric duplicon families varies considerably, as does the organization of duplicon blocks at subtelomere alleles; a class of duplicon blocks was identified that are subtelomere-specific. Background: Human subtelomeric segmental duplications ('subtelomeric repeats') comprise about 25% of the most distal 500 kb and 80% of the most distal 100 kb in human DNA. A systematic analysis of the duplication substructure of human subtelomeric regions was done in order to develop a detailed understanding of subtelomeric sequence organization and a nucleotide sequence-level characterization of subtelomeric duplicon families. Results: The extent of nucleotide sequence divergence within subtelomeric duplicon families varies considerably, as does the organization of duplicon blocks at subtelomere alleles. Subtelomeric internal (TTAGGG)n-like tracts occur at duplicon boundaries, suggesting their involvement in the generation of the complex sequence organization. Most duplicons have copies at both subtelomere and non-subtelomere locations, but a class of duplicon blocks is identified that are subtelomere-specific. In addition, a group of six subterminal duplicon families are identified that, together with six single-copy telomere-adjacent segments, include all of the (TTAGGG)n-adjacent sequence identified so far in the human genome. Conclusion: Identification of a class of duplicon blocks that is subtelomere-specific will facilitate high-resolution analysis of subtelomere repeat copy number variation as well as studies involving somatic subtelomere rearrangements. The significant levels of nucleotide sequence divergence within many duplicon families as well as the differential organization of duplicon blocks on subtelomere alleles may provide opportunities for allele-specific subtelomere marker development; this is especially true for subterminal regions, where divergence and organizational differences are the greatest. These subterminal sequence families comprise the immediate cis-elements for (TTAGGG)n tracts, and are prime candidates for subtelomeric sequences regulating telomere-specific (TTAGGG)n tract length in humans.
Affiliation(s)
- Anthony Ambrosini
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
- Department of Molecular Biology, Princeton University, Princeton, NJ 08544, USA
- Sheila Paul
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
- Sufen Hu
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
- Harold Riethman
- The Wistar Institute, Spruce St, Philadelphia, PA 19104, USA
19
Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution. Nat Genet 2007; 39:1361-8. [PMID: 17922013 DOI: 10.1038/ng.2007.9] [Citation(s) in RCA: 139] [Impact Index Per Article: 8.2] [Received: 04/26/2007] [Accepted: 08/07/2007] [Indexed: 01/22/2023]
Abstract
Human segmental duplications are hotspots for nonallelic homologous recombination leading to genomic disorders, copy-number polymorphisms and gene and transcript innovations. The complex structure and history of these regions have precluded a global evolutionary analysis. Combining a modified A-Bruijn graph algorithm with comparative genome sequence data, we identify the origin of 4,692 ancestral duplication loci and use these to cluster 437 complex duplication blocks into 24 distinct groups. The sequence-divergence data between ancestral-derivative pairs and a comparison with the chimpanzee and macaque genome support a 'punctuated' model of evolution. Our analysis reveals that human segmental duplications are frequently organized around 'core' duplicons, which are enriched for transcripts and, in some cases, encode primate-specific genes undergoing positive selection. We hypothesize that the rapid expansion and fixation of some intrachromosomal segmental duplications during great-ape evolution has been due to the selective advantage conferred by these genes and transcripts embedded within these core duplications.
20
Fickelscher I, Liehr T, Watts K, Bryant V, Barber JCK, Heidemann S, Siebert R, Hertz JM, Tumer Z, Simon Thomas N. The variant inv(2)(p11.2q13) is a genuinely recurrent rearrangement but displays some breakpoint heterogeneity. Am J Hum Genet 2007; 81:847-56. [PMID: 17847011 PMCID: PMC2227935 DOI: 10.1086/521226] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Received: 04/23/2007] [Accepted: 06/28/2007] [Indexed: 02/04/2023] Open
Abstract
Human chromosome 2 contains large blocks of segmental duplications (SDs), both within and between proximal 2p and proximal 2q, and these may contribute to the frequency of the common variant inversion inv(2)(p11.2q13). Despite their being cytogenetically homogeneous, we have identified four different breakpoint combinations by fluorescence in situ hybridization mapping of 40 cases of inv(2)(p11.2q13) of European origin. For the vast majority of inversions (35/40), the breakpoints fell within the same spanning BACs, which hybridized to both 2p11.2 and 2q13 on the normal and inverted homologues. Sequence analysis revealed that these BACs contain a significant proportion of intrachromosomal SDs with sequence homology to the reciprocal breakpoint region. In contrast, BACs spanning the rare breakpoint combinations contain fewer SDs and with sequence homology only to the same chromosome arm. Using haplotype analysis, we identified a number of related family subgroups with identical or very closely related haplotypes. However, the majority of cases were not related, demonstrating for the first time that the inv(2)(p11.2q13) is a truly recurrent rearrangement. Therefore, there are three explanations to account for the frequent observation of the inv(2)(p11.2q13): the majority have arisen independently in different ancestors, while a minority either have been transmitted from a common founder or have different breakpoints at the molecular cytogenetic level.
Affiliation(s)
- Ina Fickelscher
- Institut für Humangenetik und Anthropologie, Friedrich-Schiller University, Jena, Germany
21
Portin P. Evolution of man in the light of molecular genetics: a review. Part I. Our evolutionary history and genomics. Hereditas 2007; 144:80-95. [PMID: 17663700 DOI: 10.1111/j.2007.0018-0661.02003.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Indexed: 10/23/2022] Open
Abstract
The discovery in the mid-1970s of efficient methods of DNA sequencing, their subsequent development into ever more rapid procedures, and the sequencing of the genomes of many species, including man in 2001, revolutionised the whole of biology. Remarkably, new light could be cast on the evolutionary relations of different species, and the tempo and mode of evolution within a given species, notably man, could be illuminated quantitatively, including ongoing evolution that possibly also involves brain size. This review is a short summary of the results of molecular genetic investigations of human evolution, including the time and place of the formation of our species and our evolutionary relation to the closest living relatives as well as to extinct forms of the genus Homo. The nature and amount of genetic polymorphism in man is also considered, with special emphasis on the causes of this variation and the role of natural selection in human evolution. A consensus about the mosaic nature of our genome and the rather dynamic structure of our ancestral population is gradually emerging. The modern gene pool has most likely been contributed to by several different ancestral demes, either before or after the emergence of the anatomically modern human phenotype, to the extent that even the nature of the evolutionary lineage leading to anatomically modern man as a distinct biological species is disputable. Regulation of the function of genes, as well as the evolution of the brain, will be dealt with in the second part of this review.
Affiliation(s)
- Petter Portin
- Laboratory of Genetics, Department of Biology, University of Turku, Turku, Finland.
22
Ventura M, Antonacci F, Cardone MF, Stanyon R, D'Addabbo P, Cellamare A, Sprague LJ, Eichler EE, Archidiacono N, Rocchi M. Evolutionary Formation of New Centromeres in Macaque. Science 2007; 316:243-6. [PMID: 17431171 DOI: 10.1126/science.1140615] [Citation(s) in RCA: 107] [Impact Index Per Article: 6.3] [Indexed: 11/02/2022]
Abstract
A systematic fluorescence in situ hybridization comparison of macaque and human synteny organization disclosed five additional macaque evolutionary new centromeres (ENCs) for a total of nine ENCs. To understand the dynamics of ENC formation and progression, we compared the ENC of macaque chromosome 4 with the human orthologous region, at 6q24.3, that conserves the ancestral genomic organization. A 250-kilobase segment was extensively duplicated around the macaque centromere. These duplications were strictly intrachromosomal. Our results suggest that novel centromeres may trigger only local duplication activity and that the absence of genes in the seeding region may have been important in ENC maintenance and progression.
Affiliation(s)
- Mario Ventura
- Department of Genetics and Microbiology, University of Bari, 70126 Bari, Italy
23
Kehrer-Sawatzki H, Cooper DN. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons. Hum Mutat 2007; 28:99-130. [PMID: 17024666 DOI: 10.1002/humu.20420] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.5] [Indexed: 12/20/2022]
Abstract
The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.
24
Lambert LA, Mitchell SL. Molecular Evolution of the Transferrin Receptor/Glutamate Carboxypeptidase II Family. J Mol Evol 2006; 64:113-28. [PMID: 17160644 DOI: 10.1007/s00239-006-0137-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.9] [Received: 06/13/2006] [Accepted: 10/03/2006] [Indexed: 02/07/2023]
Abstract
The transferrin receptor family is represented by at least seven different homologous proteins in primates. Transferrin receptor (TfR1) is a type II membrane glycoprotein that, as a cell surface homodimer, binds iron-loaded transferrin as part of the process of iron transfer and uptake. Other family members include transferrin receptor 2 (TfR2), glutamate carboxypeptidase II (GCP2 or PSMA), N-acetylated alpha-linked acidic dipeptidase-like protein (NLDL), N-acetylated alpha-linked acidic dipeptidase 2 (NAALAD2), and prostate-specific membrane antigen-like protein (PMSAL/GCPIII). We compared 86 different sequences from 24 different species, from mammals to fungi. Through this comparison, we have identified several highly conserved residues specific to each family not previously associated with clinical mutations. The evolutionary history of the TfR/GCP2 family shows repeated episodes of duplications consistent with recent theories that nondispensable, slowly evolving genes are more likely to form multiple gene families.
Affiliation(s)
- Lisa Ann Lambert
- Department of Biology, Chatham College, Woodland Road, Pittsburgh, PA 15232, USA.
25
Popesco MC, Maclaren EJ, Hopkins J, Dumas L, Cox M, Meltesen L, McGavran L, Wyckoff GJ, Sikela JM. Human lineage-specific amplification, selection, and neuronal expression of DUF1220 domains. Science 2006; 313:1304-7. [PMID: 16946073 DOI: 10.1126/science.1127980] [Citation(s) in RCA: 143] [Impact Index Per Article: 7.9] [Indexed: 11/02/2022]
Abstract
Extreme gene duplication is a major source of evolutionary novelty. A genome-wide survey of gene copy number variation among human and great ape lineages revealed that the most striking human lineage-specific amplification was due to an unknown gene, MGC8902, which is predicted to encode multiple copies of a protein domain of unknown function (DUF1220). Sequences encoding these domains are virtually all primate-specific, show signs of positive selection, and are increasingly amplified generally as a function of a species' evolutionary proximity to humans, where the greatest number of copies (212) is found. DUF1220 domains are highly expressed in brain regions associated with higher cognitive function, and in brain show neuron-specific expression preferentially in cell bodies and dendrites.
Affiliation(s)
- Magdalena C Popesco
- Human Medical Genetics, University of Colorado at Denver and Health Sciences Center, Aurora, CO 80045, USA
26
Bailey JA, Eichler EE. Primate segmental duplications: crucibles of evolution, diversity and disease. Nat Rev Genet 2006; 7:552-64. [PMID: 16770338 DOI: 10.1038/nrg1895] [Citation(s) in RCA: 441] [Impact Index Per Article: 24.5] [Indexed: 12/25/2022]
Abstract
Compared with other mammals, the genomes of humans and other primates show an enrichment of large, interspersed segmental duplications (SDs) with high levels of sequence identity. Recent evidence has begun to shed light on the origin of primate SDs, pointing to a complex interplay of mechanisms and indicating that distinct waves of duplication took place during primate evolution. There is also evidence for a strong association between duplication, genomic instability and large-scale chromosomal rearrangements. Exciting new findings suggest that SDs have not only created novel primate gene families, but might have also influenced current human genic and phenotypic variation on a previously unappreciated scale. A growing number of examples link natural human genetic variation of these regions to susceptibility to common disease.
Affiliation(s)
- Jeffrey A Bailey
- Department of Pathology, Case Western University School of Medicine and University Hospitals of Cleveland, Ohio 44106, USA
27
Durand D, Hoberman R. Diagnosing duplications – can it be done? Trends Genet 2006; 22:156-64. [PMID: 16442663 DOI: 10.1016/j.tig.2006.01.002] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.7] [Received: 09/16/2005] [Revised: 11/30/2005] [Accepted: 01/11/2006] [Indexed: 01/10/2023]
Abstract
New genes arise through duplication and modification of DNA sequences on a range of scales: single gene duplication, duplication of large chromosomal fragments and whole-genome duplication. Each duplication mechanism has specific characteristics that influence the fate of the resulting duplicates, such as the size of the duplicated fragment, the potential for dosage imbalance, the preservation or disruption of regulatory control and genomic context. The ability to diagnose or identify the mechanism that produced a pair of paralogs has the potential to increase our ability to reconstruct evolutionary history, to understand the processes that govern genome evolution and to make functional predictions based on paralogy. The recent availability of large amounts of whole-genome sequence, often from several closely related species, has stimulated a wealth of new computational methods to diagnose gene duplications.
Affiliation(s)
- Dannie Durand
- Department of Biological Sciences, Carnegie Mellon University, Pittsburgh, PA 15213, USA.
28
Abstract
Pericentromeres are exceptional genomic regions: in animals they contain extensive segmental duplications implicated in gene creation, and in plants they sustain rearrangements and insertions uncommon in euchromatin. To examine the mechanisms and patterns of plant pericentromere evolution, we compared pericentromere sequence from four Brassicaceae species separated by <15 million years (Myr). This flowering plant family is ideal for studying relationships between genome reorganization and pericentromere evolution: its members have undergone recent polyploidization and hybridization, with close relatives changing in genome size and chromosome number. Through sequence and hybridization analyses, we examined regions from Arabidopsis arenosa, Capsella rubella, and Olimarabidopsis pumila that are homologous to Arabidopsis thaliana pericentromeres (peri-CENs) III and V, and used FISH to demonstrate they have been maintained near centromere satellite arrays in each species. Sequence analysis revealed a set of highly conserved genes, yet we discovered substantial differences in intergenic length and species-specific changes in sequence content and gene density. We discovered that A. thaliana has undergone recent, significant expansions within its pericentromeres, in some cases measuring hundreds of kilobases; these findings are in marked contrast to euchromatic segments in these species that exhibit only minor length changes. While plant pericentromeres do contain some duplications, we did not find evidence of extensive segmental duplications, as has been documented in primates. Our data support a model in which plant pericentromeres may experience selective pressures distinct from euchromatin, tolerating rapid, dynamic changes in structure and sequence content, including large insertions of mobile elements, 5S rDNA arrays and pseudogenes.
Affiliation(s)
- Anne E Hall
- Howard Hughes Medical Institute, The University of Chicago, Chicago, Illinois 60637, USA
29
Ma J, Jackson SA. Retrotransposon accumulation and satellite amplification mediated by segmental duplication facilitate centromere expansion in rice. Genome Res 2005; 16:251-9. [PMID: 16354755 PMCID: PMC1361721 DOI: 10.1101/gr.4583106] [Citation(s) in RCA: 66] [Impact Index Per Article: 3.5] [Indexed: 12/28/2022]
Abstract
The abundance of repetitive DNA varies greatly across centromeres within an individual or between different organisms. To shed light on the molecular mechanisms of centromere repeat proliferation, we performed structural analysis of LTR-retrotransposons, mostly centromere retrotransposons of rice (CRRs), and phylogenetic analysis of CentO satellite repeats harbored in the core region of the rice chromosome 4 centromere (CEN4). The data obtained demonstrate that the CRRs in the centromeric region we investigated have been enriched more significantly by recent rounds of segmental duplication than by original integration of active elements, suggesting that segmental duplication is an important process for CRR accumulation in the centromeric region. Our results also indicate that segmental duplication of large arrays of satellite repeats is primarily responsible for the amplification of satellite repeats, contributing to rapid reshuffling of CentO satellites. Intercentromere satellite homogenization was revealed by genome-wide comparison of CentO satellite monomers. However, a 10-bp duplication present in nearly half of the CEN4 monomers was found to be completely absent in rice centromere 8 (CEN8), suggesting that CEN4 and CEN8 may represent two different stages in the evolution of rice centromeres. These observations, obtained from the only complex eukaryotic centromeres to have been completely sequenced thus far, depict the evolutionary dynamics of rice centromeres with respect to the nature, timing, and process of centromeric repeat amplification.
Affiliation(s)
- Jianxin Ma
- Department of Agronomy, Purdue University, West Lafayette, IN 47907, USA