351
|
Yu YH, Lin YW, Yu JF, Schempp W, Yen PH. Evolution of the DAZ gene and the AZFc region on primate Y chromosomes. BMC Evol Biol 2008; 8:96. [PMID: 18366765 PMCID: PMC2322974 DOI: 10.1186/1471-2148-8-96] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2007] [Accepted: 03/26/2008] [Indexed: 12/13/2022] Open
Abstract
Background The Azoospermia Factor c (AZFc) region of the human Y chromosome is a unique product of segmental duplication. It consists almost entirely of very long amplicons, represented by different colors, and is frequently deleted in subfertile men. Most of the AZFc amplicons have high sequence similarity with autosomal segments, indicating recent duplication and transposition to the Y chromosome. The Deleted in Azoospermia (DAZ) gene within the red-amplicon arose from an ancestral autosomal DAZ-like (DAZL) gene. It varies significantly between different men regarding to its copy number and the numbers of RNA recognition motif and DAZ repeat it encodes. We used Southern analyses to study the evolution of DAZ and AZFc amplicons on the Y chromosomes of primates. Results The Old World monkey rhesus macaque has only one DAZ gene. In contrast, the great apes have multiple copies of DAZ, ranging from 2 copies in bonobos and gorillas to at least 6 copies in orangutans, and these DAZ genes have polymorphic structures similar to those of their human counterparts. Sequences homologous to the various AZFc amplicons are present on the Y chromosomes of some but not all primates, indicating that they arrived on the Y chromosome at different times during primate evolution. Conclusion The duplication and transposition of AZFc amplicons to the human Y chromosome occurred in three waves, i.e., after the branching of the New World monkey, the gorilla, and the chimpanzee/bonobo lineages, respectively. The red-amplicon, one of the first to arrive on the Y chromosome, amplified by inverted duplication followed by direct duplication after the separation of the Old World monkey and the great ape lineages. Subsequent duplication/deletion in the various lineages gave rise to a spectrum of DAZ gene structure and copy number found in today's great apes.
Collapse
Affiliation(s)
- Yueh-Hsiang Yu
- Graduate Institute of Life Sciences, National Defense Medical Center, Taipei, Taiwan.
| | | | | | | | | |
Collapse
|
352
|
Huerta-Cepas J, Dopazo H, Dopazo J, Gabaldón T. The human phylome. Genome Biol 2008; 8:R109. [PMID: 17567924 PMCID: PMC2394744 DOI: 10.1186/gb-2007-8-6-r109] [Citation(s) in RCA: 112] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2006] [Revised: 03/16/2007] [Accepted: 06/13/2007] [Indexed: 01/09/2023] Open
Abstract
The human phylome, which includes evolutionary relationships of all human proteins and their homologs among thirty-nine fully sequenced eukaryotes, is reconstructed. Background: Phylogenomics analyses serve to establish evolutionary relationships among organisms and their genes. A phylome, the complete collection of all gene phylogenies in a genome, constitutes a valuable source of information, but its use in large genomes still constitutes a technical challenge. The use of phylomes also requires the development of new methods that help us to interpret them. Results: We reconstruct here the human phylome, which includes the evolutionary relationships of all human proteins and their homologs among 39 fully sequenced eukaryotes. Phylogenetic techniques used include alignment trimming, branch length optimization, evolutionary model testing and maximum likelihood and Bayesian methods. Although differences with alternative topologies are minor, most of the trees support the Coelomata and Unikont hypotheses as well as the grouping of primates with laurasatheria to the exclusion of rodents. We assess the extent of gene duplication events and their relationship with the functional roles of the protein families involved. We find support for at least one, and probably two, rounds of whole genome duplications before vertebrate radiation. Using a novel algorithm that is independent from a species phylogeny, we derive orthology and paralogy relationships of human proteins among eukaryotic genomes. Conclusion: Topological variations among phylogenies for different genes are to be expected, highlighting the danger of gene-sampling effects in phylogenomic analyses. Several links can be established between the functions of gene families duplicated at certain phylogenetic splits and major evolutionary transitions in those lineages. The pipeline implemented here can be easily adapted for use in other organisms.
Collapse
Affiliation(s)
- Jaime Huerta-Cepas
- Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler, 46013 Valencia, Spain
| | - Hernán Dopazo
- Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler, 46013 Valencia, Spain
| | - Joaquín Dopazo
- Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler, 46013 Valencia, Spain
| | - Toni Gabaldón
- Bioinformatics Department, Centro de Investigación Príncipe Felipe, Autopista del Saler, 46013 Valencia, Spain
| |
Collapse
|
353
|
Makoff AJ, Flomen RH. Detailed analysis of 15q11-q14 sequence corrects errors and gaps in the public access sequence to fully reveal large segmental duplications at breakpoints for Prader-Willi, Angelman, and inv dup(15) syndromes. Genome Biol 2008; 8:R114. [PMID: 17573966 PMCID: PMC2394762 DOI: 10.1186/gb-2007-8-6-r114] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2006] [Revised: 04/23/2007] [Accepted: 06/15/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Chromosome 15 contains many segmental duplications, including some at 15q11-q13 that appear to be responsible for the deletions that cause Prader-Willi and Angelman syndromes and for other genomic disorders. The current version of the human genome sequence is incomplete, with seven gaps in the proximal region of 15q, some of which are flanked by duplicated sequence. We have investigated this region by conducting a detailed examination of the sequenced genomic clones in the public database, focusing on clones from the RP11 library that originates from one individual. RESULTS Our analysis has revealed assembly errors, including contig NT_078094 being in the wrong orientation, and has enabled most of the gaps between contigs to be closed. We have constructed a map in which segmental duplications are no longer interrupted by gaps and which together reveals a complex region. There are two pairs of large direct repeats that are located in regions consistent with the two classes of deletions associated with Prader-Willi and Angelman syndromes. There are also large inverted repeats that account for the formation of the observed supernumerary marker chromosomes containing two copies of the proximal end of 15q and associated with autism spectrum disorders when involving duplications of maternal origin (inv dup[15] syndrome). CONCLUSION We have produced a segmental map of 15q11-q14 that reveals several large direct and inverted repeats that are incompletely and inaccurately represented on the current human genome sequence. Some of these repeats are clearly responsible for deletions and duplications in known genomic disorders, whereas some may increase susceptibility to other disorders.
Collapse
Affiliation(s)
- Andrew J Makoff
- Department of Psychological Medicine, King's College London, Institute of Psychiatry, Denmark Hill, London SE5 8AF, UK
| | - Rachel H Flomen
- Department of Psychological Medicine, King's College London, Institute of Psychiatry, Denmark Hill, London SE5 8AF, UK
| |
Collapse
|
354
|
Yang HP, Barbash DA. Abundant and species-specific DINE-1 transposable elements in 12 Drosophila genomes. Genome Biol 2008; 9:R39. [PMID: 18291035 PMCID: PMC2374699 DOI: 10.1186/gb-2008-9-2-r39] [Citation(s) in RCA: 74] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2007] [Revised: 12/17/2007] [Accepted: 02/21/2008] [Indexed: 02/08/2023] Open
Abstract
Evidence is presented that DINE-1 is a highly abundant miniature inverted-repeat transposable element (MITE) family present in all 12 Drosophila species with whole-genome sequence available. Background Miniature inverted-repeat transposable elements (MITEs) are non-autonomous DNA-mediated transposable elements (TEs) derived from autonomous TEs. Unlike in many plants or animals, MITEs and other types of DNA-mediated TEs were previously thought to be either rare or absent in Drosophila. Most other TE families in Drosophila exist at low or intermediate copy number (around < 100 per genome). Results We present evidence here that the dispersed repeat Drosophila interspersed element 1 (DINE-1; also named INE-1 and DNAREP1) is a highly abundant DNA-mediated TE containing inverted repeats found in all 12 sequenced Drosophila genomes. All DINE-1s share a similar sequence structure, but are more homogeneous within species than they are among species. The inferred phylogenetic relationship of the DINE-1 consensus sequence from each species is generally consistent with the known species phylogeny, suggesting vertical transmission as the major mechanism for DINE-1 propagation. Exceptions observed in D. willistoni and D. ananassae could be due to either horizontal transfer or reactivation of ancestral copies. Our analysis of pairwise percentage identity of DINE-1 copies within species suggests that the transpositional activity of DINE-1 is extremely dynamic, with some lineages showing evidence for recent transpositional bursts and other lineages appearing to have silenced their DINE-1s for long periods of time. We also find that all species have many DINE-1 insertions in introns and adjacent to protein-coding genes. Finally, we discuss our results in light of a recent proposal that DINE-1s belong to the Helitron family of TEs. Conclusion We find that all 12 Drosophila species with whole-genome sequence contain the high copy element DINE-1. Although all DINE-1s share a similar structure, species-specific variation in the distribution of average pairwise divergence suggests that DINE-1 has gone through multiple independent cycles of activation and suppression. DINE-1 also has had a significant impact on gene structure evolution.
Collapse
Affiliation(s)
- Hsiao-Pei Yang
- Institute of Genetics, National Yang-Ming University, Taipei 112, Taiwan.
| | | |
Collapse
|
355
|
Cardone MF, Jiang Z, D'Addabbo P, Archidiacono N, Rocchi M, Eichler EE, Ventura M. Hominoid chromosomal rearrangements on 17q map to complex regions of segmental duplication. Genome Biol 2008; 9:R28. [PMID: 18257913 PMCID: PMC2374708 DOI: 10.1186/gb-2008-9-2-r28] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2007] [Revised: 01/24/2008] [Accepted: 02/07/2008] [Indexed: 01/30/2023] Open
Abstract
BACKGROUND Chromosomal rearrangements, such as translocations and inversions, are recurrent phenomena during evolution, and both of them are involved in reproductive isolation and speciation. To better understand the molecular basis of chromosome rearrangements and their part in karyotype evolution, we have investigated the history of human chromosome 17 by comparative fluorescence in situ hybridization (FISH) and sequence analysis. RESULTS Human bacterial artificial chromosome/p1 artificial chromosome probes spanning the length of chromosome 17 were used in FISH experiments on great apes, Old World monkeys and New World monkeys to study the evolutionary history of this chromosome. We observed that the macaque marker order represents the ancestral organization. Human, chimpanzee and gorilla homologous chromosomes differ by a paracentric inversion that occurred specifically in the Homo sapiens/Pan troglodytes/Gorilla gorilla ancestor. Detailed analyses of the paracentric inversion revealed that the breakpoints mapped to two regions syntenic to human 17q12/21 and 17q23, both rich in segmental duplications. CONCLUSION Sequence analyses of the human and macaque organization suggest that the duplication events occurred in the catarrhine ancestor with the duplication blocks continuing to duplicate or undergo gene conversion during evolution of the hominoid lineage. We propose that the presence of these duplicons has mediated the inversion in the H. sapiens/P. troglodytes/G. gorilla ancestor. Recently, the same duplication blocks have been shown to be polymorphic in the human population and to be involved in triggering microdeletion and duplication in human. These results further support a model where genomic architecture has a direct role in both rearrangement involved in karyotype evolution and genomic instability in human.
Collapse
Affiliation(s)
- Maria Francesca Cardone
- Department of Genetics and Microbiology, University of Bari, Via Amendola, Bari, 70126, Italy.
| | | | | | | | | | | | | |
Collapse
|
356
|
Coop G, Wen X, Ober C, Pritchard JK, Przeworski M. High-resolution mapping of crossovers reveals extensive variation in fine-scale recombination patterns among humans. Science 2008; 319:1395-8. [PMID: 18239090 DOI: 10.1126/science.1151851] [Citation(s) in RCA: 273] [Impact Index Per Article: 16.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Recombination plays a crucial role in meiosis, ensuring the proper segregation of chromosomes. Recent linkage disequilibrium (LD) and sperm-typing studies suggest that recombination rates vary tremendously across the human genome, with most events occurring in narrow "hotspots." To examine variation in fine-scale recombination patterns among individuals, we used dense, genome-wide single-nucleotide polymorphism data collected in nuclear families to localize crossovers with high spatial resolution. This analysis revealed that overall recombination hotspot usage is similar in males and females, with individual hotspots often active in both sexes. Across the genome, roughly 60% of crossovers occurred in hotspots inferred from LD studies. Notably, however, we found extensive and heritable variation among both males and females in the proportion of crossovers occurring in these hotspots.
Collapse
Affiliation(s)
- Graham Coop
- Department of Human Genetics, University of Chicago, 920 East 58th Street, Cummings Life Science Center, Chicago, IL 60637, USA.
| | | | | | | | | |
Collapse
|
357
|
Darai-Ramqvist E, Sandlund A, Müller S, Klein G, Imreh S, Kost-Alimova M. Segmental duplications and evolutionary plasticity at tumor chromosome break-prone regions. Genome Res 2008; 18:370-9. [PMID: 18230801 DOI: 10.1101/gr.7010208] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We have previously found that the borders of evolutionarily conserved chromosomal regions often coincide with tumor-associated deletion breakpoints within human 3p12-p22. Moreover, a detailed analysis of a frequently deleted region at 3p21.3 (CER1) showed associations between tumor breaks and gene duplications. We now report on the analysis of 54 chromosome 3 breaks by multipoint FISH (mpFISH) in 10 carcinoma-derived cell lines. The centromeric region was broken in five lines. In lines with highly complex karyotypes, breaks were clustered near known fragile sites, FRA3B, FRA3C, and FRA3D (three lines), and in two other regions: 3p12.3-p13 ( approximately 75 Mb position) and 3q21.3-q22.1 ( approximately 130 Mb position) (six lines). All locations are shown based on NCBI Build 36.1 human genome sequence. The last two regions participated in three of four chromosome 3 inversions during primate evolution. Regions at 75, 127, and 131 Mb positions carry a large ( approximately 250 kb) segmental duplication (tumor break-prone segmental duplication [TBSD]). TBSD homologous sequences were found at 15 sites on different chromosomes. They were located within bands frequently involved in carcinoma-associated breaks. Thirteen of them have been involved in inversions during primate evolution; 10 were reused by breaks during mammalian evolution; 14 showed copy number polymorphism in man. TBSD sites showed an increase in satellite repeats, retrotransposed sequences, and other segmental duplications. We propose that the instability of these sites stems from specific organization of the chromosomal region, associated with location at a boundary between different CG-content isochores and with the presence of TBSDs and "instability elements," including satellite repeats and retroviral sequences.
Collapse
Affiliation(s)
- Eva Darai-Ramqvist
- Department of Microbiology, Tumor and Cell Biology, Karolinska Institute, Stockholm SE-171 77, Sweden
| | | | | | | | | | | |
Collapse
|
358
|
Lee AS, Gutiérrez-Arcelus M, Perry GH, Vallender EJ, Johnson WE, Miller GM, Korbel JO, Lee C. Analysis of copy number variation in the rhesus macaque genome identifies candidate loci for evolutionary and human disease studies. Hum Mol Genet 2008; 17:1127-36. [PMID: 18180252 DOI: 10.1093/hmg/ddn002] [Citation(s) in RCA: 93] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Copy number variants (CNVs) are heritable gains and losses of genomic DNA in normal individuals. While copy number variation is widely studied in humans, our knowledge of CNVs in other mammalian species is more limited. We have designed a custom array-based comparative genomic hybridization (aCGH) platform with 385 000 oligonucleotide probes based on the reference genome sequence of the rhesus macaque (Macaca mulatta), the most widely studied non-human primate in biomedical research. We used this platform to identify 123 CNVs among 10 unrelated macaque individuals, with 24% of the CNVs observed in multiple individuals. We found that segmental duplications were significantly enriched at macaque CNV loci. We also observed significant overlap between rhesus macaque and human CNVs, suggesting that certain genomic regions are prone to recurrent CNV formation and instability, even across a total of approximately 50 million years of primate evolution ( approximately 25 million years in each lineage). Furthermore, for eight of the CNVs that were observed in both humans and macaques, previous human studies have reported a relationship between copy number and gene expression or disease susceptibility. Therefore, the rhesus macaque offers an intriguing, non-human primate outbred model organism with which hypotheses concerning the specific functions of phenotypically relevant human CNVs can be tested.
Collapse
Affiliation(s)
- Arthur S Lee
- Department of Pathology, Brigham and Women's Hospital, 221 Longwood Ave., Boston, MA 02115, USA
| | | | | | | | | | | | | | | |
Collapse
|
359
|
Feder ME. Evolvability of physiological and biochemical traits: evolutionary mechanisms including and beyond single-nucleotide mutation. ACTA ACUST UNITED AC 2008; 210:1653-60. [PMID: 17449831 DOI: 10.1242/jeb.02725] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
A longstanding challenge for biologists has been to explain not just how organisms are adapted to diverse environments, but how these adaptations arise. Although natural selection is clearly sufficient to act on heritable variation, is this heritable variation sufficient to yield complex adaptations and how does this variation itself arise? Much prior focus has been on mutation of single nucleotides in genes. This process is common and can have dramatic phenotypes, but could be limited in its ability to culminate in complex adaptations for two kinds of reasons: (i) because natural selection is powerful, it can purge genetic variation, and (ii) evolutionary transition from the absence to the presence of a complex adaptation seemingly requires multiple mutations at the right place and time and in the right sequence, with each intermediate stage having increased overall fitness; this seems highly improbable. Because the networks that organisms comprise are hierarchical and redundant and have modular structure, however, single-nucleotide mutations can have large and tolerable impacts. Diverse mechanisms, collectively evolutionary capacitors, can shield genetic variation from the purgative of selection. These features can enable evolution to proceed via single-nucleotide mutation. Importantly, single-nucleotide mutation usually only modifies existing genes rather than creating new ones, and numerous other mechanisms eclipse single-nucleotide mutation in creating genetic variation. These include gene duplication (both segmental and whole-genome), lateral gene transfer, hybridization, mobile genetic elements and symbiosis. Other processes can scramble and reassemble nucleotide sequence. The mechanisms beyond single-gene mutation offer considerable promise in detailing the evolution of complex physiological and biochemical traits, and have already done so for several morphological traits.
Collapse
Affiliation(s)
- Martin E Feder
- Department of Organismal Biology and Anatomy and The Committees on Evolutionary Biology, Genetics, and Molecular Medicine, The University of Chicago, 1027 E. 57th Street, Chicago, IL 60637, USA.
| |
Collapse
|
360
|
Wong P, Frishman D. Designability and disease. Methods Mol Biol 2008; 484:491-504. [PMID: 18592197 DOI: 10.1007/978-1-59745-398-1_29] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]
Abstract
Structural designability is the number of ways it is possible to encode for structure. A protein's designability has been equated with the size of sequence space encoding for the protein's structure, a measure that reflects the structure's robustness to mutation. Current evidence suggests that designability is fundamental to our understanding of the evolvability and distribution of structures in nature and is a significant factor associated with human disease. Here, we describe definitions and principles underlying the concept of designability and discuss its relation to disease.
Collapse
Affiliation(s)
- Philip Wong
- Institute for Bioinformatics, GSF-National Research Center for Environment and Health, Neuherberg, Germany
| | | |
Collapse
|
361
|
Kim PM, Korbel JO, Gerstein MB. Positive selection at the protein network periphery: evaluation in terms of structural constraints and cellular context. Proc Natl Acad Sci U S A 2007; 104:20274-9. [PMID: 18077332 PMCID: PMC2154421 DOI: 10.1073/pnas.0710183104] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2007] [Indexed: 12/30/2022] Open
Abstract
Because of recent advances in genotyping and sequencing, human genetic variation and adaptive evolution in the primate lineage have become major research foci. Here, we examine the relationship between genetic signatures of adaptive evolution and network topology. We find a striking tendency of proteins that have been under positive selection (as compared with the chimpanzee) to be located at the periphery of the interaction network. Our results are based on the analysis of two types of genome evolution, both in terms of intra- and interspecies variation. First, we looked at single-nucleotide polymorphisms and their fixed variants, single-nucleotide differences in the human genome relative to the chimpanzee. Second, we examine fixed structural variants, specifically large segmental duplications and their polymorphic precursors known as copy number variants. We propose two complementary mechanisms that lead to the observed trends. First, we can rationalize them in terms of constraints imposed by protein structure: We find that positively selected sites are preferentially located on the exposed surface of proteins. Because central network proteins (hubs) are likely to have a larger fraction of their surface involved in interactions, they tend to be constrained and under negative selection. Conversely, we show that the interaction network roughly maps to cellular organization, with the periphery of the network corresponding to the cellular periphery (i.e., extracellular space or cell membrane). This suggests that the observed positive selection at the network periphery may be due to an increase of adaptive events on the cellular periphery responding to changing environments.
Collapse
Affiliation(s)
- Philip M. Kim
- *Department of Molecular Biophysics and Biochemistry
| | - Jan O. Korbel
- *Department of Molecular Biophysics and Biochemistry
| | - Mark B. Gerstein
- *Department of Molecular Biophysics and Biochemistry
- Department of Computer Science, and
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT 06520
| |
Collapse
|
362
|
Ranz JM, Maurin D, Chan YS, von Grotthuss M, Hillier LW, Roote J, Ashburner M, Bergman CM. Principles of genome evolution in the Drosophila melanogaster species group. PLoS Biol 2007; 5:e152. [PMID: 17550304 PMCID: PMC1885836 DOI: 10.1371/journal.pbio.0050152] [Citation(s) in RCA: 129] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2006] [Accepted: 04/02/2007] [Indexed: 12/19/2022] Open
Abstract
That closely related species often differ by chromosomal inversions was discovered by Sturtevant and Plunkett in 1926. Our knowledge of how these inversions originate is still very limited, although a prevailing view is that they are facilitated by ectopic recombination events between inverted repetitive sequences. The availability of genome sequences of related species now allows us to study in detail the mechanisms that generate interspecific inversions. We have analyzed the breakpoint regions of the 29 inversions that differentiate the chromosomes of Drosophila melanogaster and two closely related species, D. simulans and D. yakuba, and reconstructed the molecular events that underlie their origin. Experimental and computational analysis revealed that the breakpoint regions of 59% of the inversions (17/29) are associated with inverted duplications of genes or other nonrepetitive sequences. In only two cases do we find evidence for inverted repetitive sequences in inversion breakpoints. We propose that the presence of inverted duplications associated with inversion breakpoint regions is the result of staggered breaks, either isochromatid or chromatid, and that this, rather than ectopic exchange between inverted repetitive sequences, is the prevalent mechanism for the generation of inversions in the melanogaster species group. Outgroup analysis also revealed evidence for widespread breakpoint recycling. Lastly, we have found that expression domains in D. melanogaster may be disrupted in D. yakuba, bringing into question their potential adaptive significance. The organization of genes on chromosomes changes over evolutionary time. In some organisms, such as fruit flies and mosquitoes, inversions of chromosome regions are widespread. This has been associated with adaptation to environmental pressures and speciation. However, the mechanisms by which inversions are generated at the molecular level are poorly understood. The prevailing view involves the interactions of sequences that are moderately repeated in the genome. Here, we use molecular and computational methods to study 29 inversions that differentiate the chromosomes of three closely related fruit fly species. We find little support for a causal role of repetitive sequences in the origin of inversions and, instead, detect the presence of inverted duplications of ancestrally unique sequences (generally protein-coding genes) in the breakpoint regions of many inversions. This leads us to propose an alternative model in which the generation of inversions is coupled with the generation of duplications of flanking sequences. Additionally, we find evidence for genomic regions that are prone to breakage, being associated with inversions generated independently during the evolution of the ancestors of existing species. Chromosomal inversion breakpoints were compared between three closely related Drosophila species. Many are associated with inverted gene duplications, suggesting that the prevalent mechanism for their generation involves staggered breakpoints.
Collapse
Affiliation(s)
- José M Ranz
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom.
| | | | | | | | | | | | | | | |
Collapse
|
363
|
Yang S, Arguello JR, Li X, Ding Y, Zhou Q, Chen Y, Zhang Y, Zhao R, Brunet F, Peng L, Long M, Wang W. Repetitive element-mediated recombination as a mechanism for new gene origination in Drosophila. PLoS Genet 2007; 4:e3. [PMID: 18208328 PMCID: PMC2211543 DOI: 10.1371/journal.pgen.0040003] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2007] [Accepted: 11/27/2007] [Indexed: 01/05/2023] Open
Abstract
Previous studies of repetitive elements (REs) have implicated a mechanistic role in generating new chimerical genes. Such examples are consistent with the classic model for exon shuffling, which relies on non-homologous recombination. However, recent data for chromosomal aberrations in model organisms suggest that ectopic homology-dependent recombination may also be important. Lack of a dataset comprising experimentally verified young duplicates has hampered an effective examination of these models as well as an investigation of sequence features that mediate the rearrangements. Here we use ∼7,000 cDNA probes (∼112,000 primary images) to screen eight species within the Drosophila melanogaster subgroup and identify 17 duplicates that were generated through ectopic recombination within the last 12 mys. Most of these are functional and have evolved divergent expression patterns and novel chimeric structures. Examination of their flanking sequences revealed an excess of repetitive sequences, with the majority belonging to the transposable element DNAREP1 family, associated with the new genes. Our dataset strongly suggests an important role for REs in the generation of chimeric genes within these species. In numerous organisms, many new genes have been found to originate through dispersed gene duplication and exon/domain shuffling. What recombination mechanisms were involved in the duplication and the shuffling processes? Lack of the intermediate products of recombination that share adequate sequence identity between homologous sequences, or the parental sequences from which the new genes were derived, often makes answering these questions difficult. We identified a number of young genes that originated in recently diverged branches in the evolutionary tree of the eight Drosophila melanogaster subgroup species, by using fluorescence in situ hybridization with polytene chromosomes. We analyzed the genomic regions surrounding 17 new dispersed duplicate genes and observed that most of these genes are flanked by repetitive elements (REs), including a large and diverged transposable element family, DNAREP1. Several copies of these REs are kept in both new and parental gene regions, and their degeneration is correlated with the increasing ages of the identified new genes. These data suggest that REs mediate the recombination responsible for the new gene origination.
Collapse
Affiliation(s)
- Shuang Yang
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
- Graduate School of Chinese Academy Sciences, Beijing, China
| | - J. Roman Arguello
- Committee on Evolutionary Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Xin Li
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
- Graduate School of Chinese Academy Sciences, Beijing, China
| | - Yun Ding
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
- Graduate School of Chinese Academy Sciences, Beijing, China
| | - Qi Zhou
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
- Graduate School of Chinese Academy Sciences, Beijing, China
| | - Ying Chen
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America
| | - Yue Zhang
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Ruoping Zhao
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Frédéric Brunet
- Committee on Evolutionary Biology, The University of Chicago, Chicago, Illinois, United States of America
| | - Lixin Peng
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
| | - Manyuan Long
- Committee on Evolutionary Biology, The University of Chicago, Chicago, Illinois, United States of America
- Department of Ecology and Evolution, The University of Chicago, Chicago, Illinois, United States of America
- * To whom correspondence should be addressed. E-mail: (ML); (WW)
| | - Wen Wang
- Chinese Academy of Sciences (CAS)—Max Planck Junior Research Group, Key Laboratory of Cellular and Molecular Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming, Yunnan, China
- * To whom correspondence should be addressed. E-mail: (ML); (WW)
| |
Collapse
|
364
|
Abstract
Chromosomal inversions have an important role in evolution, and an increasing number of inversion polymorphisms are being identified in the human population. The evolutionary history of these inversions and the mechanisms by which they arise are therefore of significant interest. Previously, a polymorphic inversion on human chromosome Xq28 that includes the FLNA and EMD loci was discovered and hypothesized to have been the result of nonallelic homologous recombination (NAHR) between near-identical inverted duplications flanking this region. Here, we carried out an in-depth study of the orthologous region in 27 additional eutherians and report that this inversion is not specific to humans, but has occurred independently and repeatedly at least 10 times in multiple eutherian lineages. Moreover, inverted duplications flank the FLNA-EMD region in all 16 species for which high-quality sequence assemblies are available. Based on detailed sequence analyses, we propose a model in which the observed inverted duplications originated from a common duplication event that predates the eutherian radiation. Subsequent gene conversion homogenized the duplications, thereby providing a continuous substrate for NAHR that led to the recurrent inversion of this segment of the genome. These results provide an extreme example in support of the evolutionary breakpoint reusage hypothesis and point out that some near-identical human segmental duplications may, in fact, have originated >100 million years ago.
Collapse
|
365
|
Emanuel BS, Saitta SC. From microscopes to microarrays: dissecting recurrent chromosomal rearrangements. Nat Rev Genet 2007; 8:869-83. [PMID: 17943194 DOI: 10.1038/nrg2136] [Citation(s) in RCA: 68] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2022]
Abstract
Submicroscopic chromosomal rearrangements that lead to copy-number changes have been shown to underlie distinctive and recognizable clinical phenotypes. The sensitivity to detect copy-number variation has escalated with the advent of array comparative genomic hybridization (CGH), including BAC and oligonucleotide-based platforms. Coupled with improved assemblies and annotation of genome sequence data, these technologies are facilitating the identification of new syndromes that are associated with submicroscopic genomic changes. Their characterization reveals the role of genome architecture in the aetiology of many clinical disorders. We review a group of genomic disorders that are mediated by segmental duplications, emphasizing the impact that high-throughput detection methods and the availability of the human genome sequence have had on their dissection and diagnosis.
Collapse
Affiliation(s)
- Beverly S Emanuel
- Division of Human Genetics, The Children's Hospital of Philadelphia, Abramson Research Center, Department of Pediatrics, University of Pennsylvania School of Medicine, Philadelphia, Philadelphia 19104-4318, USA.
| | | |
Collapse
|
366
|
Recurrent DNA copy number variation in the laboratory mouse. Nat Genet 2007; 39:1384-9. [PMID: 17965714 DOI: 10.1038/ng.2007.19] [Citation(s) in RCA: 107] [Impact Index Per Article: 5.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2007] [Accepted: 08/27/2007] [Indexed: 11/08/2022]
Abstract
Different species, populations and individuals vary considerably in the copy number of discrete segments of their genomes. The manner and frequency with which these genetic differences arise over generational time is not well understood. Taking advantage of divergence among lineages sharing a recent common ancestry, we have conducted a genome-wide analysis of spontaneous copy number variation (CNV) in the laboratory mouse. We used high-resolution microarrays to identify 38 CNVs among 14 colonies of the C57BL/6 strain spanning approximately 967 generations of inbreeding, and we examined these loci in 12 additional strains. It is clear from our results that many CNVs arise through a highly nonrandom process: 18 of 38 were the product of recurrent mutation, and rates of change varied roughly four orders of magnitude across different loci. Recurrent CNVs are found throughout the genome, affect 43 genes and fluctuate in copy number over mere hundreds of generations, observations that raise questions about their contribution to natural variation.
Collapse
|
367
|
Nickel GC, Tefft D, Adams MD. Human PAML browser: a database of positive selection on human genes using phylogenetic methods. Nucleic Acids Res 2007; 36:D800-8. [PMID: 17962310 PMCID: PMC2238824 DOI: 10.1093/nar/gkm764] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
With the recent increase in the number of mammalian genomes being sequenced, large-scale genome scans for human-specific positive selection are now possible. Selection can be inferred through phylogenetic analysis by comparing the rates of silent and replacement substitution between related species. Maximum-likelihood (ML) analysis of codon substitution models can be used to identify genes with an accelerated pattern of amino acid substitution on a particular lineage. However, the ML methods are computationally intensive and awkward to configure. We have created a database that contains the results of tests for positive selection along the human lineage in 13 721 genes with orthologs in the UCSC multispecies genome alignments. The Human PAML Browser is a resource through which researchers can search for a gene of interest or groups of genes by Gene Ontology category, and obtain coding sequence alignments for the gene and as well as results from tests of positive selection from the software package Phylogenetic Analysis by Maximum Likelihood. The Human PAML Browser is available at http://mendel.gene.cwru.edu/adamslab/pbrowser.py.
Collapse
Affiliation(s)
- Gabrielle C Nickel
- Department of Genetics, Case Western Reserve University, Cleveland, OH, USA
| | | | | |
Collapse
|
368
|
Fumasoni I, Meani N, Rambaldi D, Scafetta G, Alcalay M, Ciccarelli FD. Family expansion and gene rearrangements contributed to the functional specialization of PRDM genes in vertebrates. BMC Evol Biol 2007; 7:187. [PMID: 17916234 PMCID: PMC2082429 DOI: 10.1186/1471-2148-7-187] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2007] [Accepted: 10/04/2007] [Indexed: 12/11/2022] Open
Abstract
Background Progressive diversification of paralogs after gene expansion is essential to increase their functional specialization. However, mode and tempo of this divergence remain mostly unclear. Here we report the comparative analysis of PRDM genes, a family of putative transcriptional regulators involved in human tumorigenesis. Results Our analysis assessed that the PRDM genes originated in metazoans, expanded in vertebrates and further duplicated in primates. We experimentally showed that fast-evolving paralogs are poorly expressed, and that the most recent duplicates, such as primate-specific PRDM7, acquire tissue-specificity. PRDM7 underwent major structural rearrangements that decreased the number of encoded Zn-Fingers and modified gene splicing. Through internal duplication and activation of a non-canonical splice site (GC-AG), PRDM7 can acquire a novel intron. We also detected an alternative isoform that can retain the intron in the mature transcript and that is predominantly expressed in human melanocytes. Conclusion Our findings show that (a) molecular evolution of paralogs correlates with their expression pattern; (b) gene diversification is obtained through massive genomic rearrangements; and (c) splicing modification contributes to the functional specialization of novel genes.
Collapse
Affiliation(s)
- Irene Fumasoni
- Department of Experimental Oncology, European Institute of Oncology, IFOM-IEO Campus, Via Adamello 16, 20139 Milan, Italy.
| | | | | | | | | | | |
Collapse
|
369
|
Fickelscher I, Liehr T, Watts K, Bryant V, Barber JCK, Heidemann S, Siebert R, Hertz JM, Tumer Z, Simon Thomas N. The variant inv(2)(p11.2q13) is a genuinely recurrent rearrangement but displays some breakpoint heterogeneity. Am J Hum Genet 2007; 81:847-56. [PMID: 17847011 PMCID: PMC2227935 DOI: 10.1086/521226] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2007] [Accepted: 06/28/2007] [Indexed: 02/04/2023] Open
Abstract
Human chromosome 2 contains large blocks of segmental duplications (SDs), both within and between proximal 2p and proximal 2q, and these may contribute to the frequency of the common variant inversion inv(2)(p11.2q13). Despite their being cytogenetically homogeneous, we have identified four different breakpoint combinations by fluorescence in situ hybridization mapping of 40 cases of inv(2)(p11.2q13) of European origin. For the vast majority of inversions (35/40), the breakpoints fell within the same spanning BACs, which hybridized to both 2p11.2 and 2q13 on the normal and inverted homologues. Sequence analysis revealed that these BACs contain a significant proportion of intrachromosomal SDs with sequence homology to the reciprocal breakpoint region. In contrast, BACs spanning the rare breakpoint combinations contain fewer SDs and with sequence homology only to the same chromosome arm. Using haplotype analysis, we identified a number of related family subgroups with identical or very closely related haplotypes. However, the majority of cases were not related, demonstrating for the first time that the inv(2)(p11.2q13) is a truly recurrent rearrangement. Therefore, there are three explanations to account for the frequent observation of the inv(2)(p11.2q13): the majority have arisen independently in different ancestors, while a minority either have been transmitted from a common founder or have different breakpoints at the molecular cytogenetic level.
Collapse
Affiliation(s)
- Ina Fickelscher
- Institut fur Humangenetik und Anthropologie, Friedrich-Schiller University, Jena, Germany
| | | | | | | | | | | | | | | | | | | |
Collapse
|
370
|
Scherer SW, Lee C, Birney E, Altshuler DM, Eichler EE, Carter NP, Hurles ME, Feuk L. Challenges and standards in integrating surveys of structural variation. Nat Genet 2007; 39:S7-15. [PMID: 17597783 PMCID: PMC2698291 DOI: 10.1038/ng2093] [Citation(s) in RCA: 274] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
There has been an explosion of data describing newly recognized structural variants in the human genome. In the flurry of reporting, there has been no standard approach to collecting the data, assessing its quality or describing identified features. This risks becoming a rampant problem, in particular with respect to surveys of copy number variation and their application to disease studies. Here, we consider the challenges in characterizing and documenting genomic structural variants. From this, we derive recommendations for standards to be adopted, with the aim of ensuring the accurate presentation of this form of genetic variation to facilitate ongoing research.
Collapse
Affiliation(s)
- Stephen W Scherer
- The Centre for Applied Genomics and Program in Genetics and Genomic Biology, The Hospital for Sick Children, 101 College Street, Room 14-701, Ontario M5G 1L7, Canada.
| | | | | | | | | | | | | | | |
Collapse
|
371
|
Portin P. Evolution of man in the light of molecular genetics: a review. Part I. Our evolutionary history and genomics. Hereditas 2007; 144:80-95. [PMID: 17663700 DOI: 10.1111/j.2007.0018-0661.02003.x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/23/2022] Open
Abstract
The discovery in the mid 1970s of efficient methods of DNA sequencing and their subsequent development into more and more rapid procedures followed by sequencing the genomes of many species, including man in 2001, revolutionised the whole of biology. Remarkably, new light could be cast on the evolutionary relations of different species, and the tempo and mode of evolution within a given species, notably man, could quantitatively be illuminated including ongoing evolution possibly involving also the size of the brains. This review is a short summary of the results of the molecular genetic investigations of human evolution including the time and place of the formation of our species, our evolutionary relation to the closest living species relatives as well as extinct forms of the genus Homo. The nature and amount of genetic polymorphism in man is also considered with special emphasis on the causes of this variation, and the role of natural selection in human evolution. A consensus about the mosaic nature of our genome and the rather dynamic structure of our ancestral population is gradually emerging. The modern gene pool has most likely been contributed to several different ancestral demes either before or after the emergence of the anatomically modern human phenotype in the extent that even the nature of the evolutionary lineage leading to the anatomically modern man as a distinct biological species is disputable. Regulation of the function of genes, as well as the evolution of brains will be dealt with in the second part of this review.
Collapse
Affiliation(s)
- Petter Portin
- Laboratory of Genetics, Department of Biology, University of Turku, Turku, Finland.
| |
Collapse
|
372
|
Parrott AM, Mathews MB. Novel rapidly evolving hominid RNAs bind nuclear factor 90 and display tissue-restricted distribution. Nucleic Acids Res 2007; 35:6249-58. [PMID: 17855395 PMCID: PMC2094060 DOI: 10.1093/nar/gkm668] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Nuclear factor 90 (NF90) is a double-stranded RNA-binding protein implicated in multiple cellular functions, but with few identified RNA partners. Using in vivo cross-linking followed by immunoprecipitation, we discovered a family of small NF90-associated RNAs (snaR). These highly structured non-coding RNAs of ∼117 nucleotides are expressed in immortalized human cell lines of diverse lineages. In human tissues, they are abundant in testis, with minor distribution in brain, placenta and some other organs. Two snaR subsets were isolated from human 293 cells, and additional species were found by bioinformatic analysis. Their genes often occur in multiple copies arranged in two inverted regions of tandem repeats on chromosome 19. snaR-A is transcribed by RNA polymerase III from an intragenic promoter, turns over rapidly, and shares sequence identity with Alu RNA and two potential piRNAs. It interacts with NF90's double-stranded RNA-binding motifs. snaR orthologs are present in chimpanzee but not other mammals, and include genes located in the promoter of two chorionic gonadotropin hormone genes. snaRs appear to have undergone accelerated evolution and differential expansion in the great apes.
Collapse
|
373
|
Han K, Lee J, Meyer TJ, Wang J, Sen SK, Srikanta D, Liang P, Batzer MA. Alu recombination-mediated structural deletions in the chimpanzee genome. PLoS Genet 2007; 3:1939-49. [PMID: 17953488 PMCID: PMC2041999 DOI: 10.1371/journal.pgen.0030184] [Citation(s) in RCA: 71] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2007] [Accepted: 09/07/2007] [Indexed: 12/02/2022] Open
Abstract
With more than 1.2 million copies, Alu elements are one of the most important sources of structural variation in primate genomes. Here, we compare the chimpanzee and human genomes to determine the extent of Alu recombination-mediated deletion (ARMD) in the chimpanzee genome since the divergence of the chimpanzee and human lineages (∼6 million y ago). Combining computational data analysis and experimental verification, we have identified 663 chimpanzee lineage-specific deletions (involving a total of ∼771 kb of genomic sequence) attributable to this process. The ARMD events essentially counteract the genomic expansion caused by chimpanzee-specific Alu inserts. The RefSeq databases indicate that 13 exons in six genes, annotated as either demonstrably or putatively functional in the human genome, and 299 intronic regions have been deleted through ARMDs in the chimpanzee lineage. Therefore, our data suggest that this process may contribute to the genomic and phenotypic diversity between chimpanzees and humans. In addition, we found four independent ARMD events at orthologous loci in the gorilla or orangutan genomes. This suggests that human orthologs of loci at which ARMD events have already occurred in other nonhuman primate genomes may be “at-risk” motifs for future deletions, which may subsequently contribute to human lineage-specific genetic rearrangements and disorders. The recent sequencing of a number of primate genomes shows that small segments of DNA known as Alu elements are found repeatedly along all chromosomes, and indeed comprise ∼10% of the human genome. Although older Alu elements that have been in the genome for a long time accumulate some random mutations, overall these elements retain high levels of sequence identity among themselves. The presence of many near-identical Alu elements located close to each other makes primate genomes prone to DNA recombination events that generate genomic deletions of varying sizes. Here, by scanning the chimpanzee genome for such deletions, we determined the role of the Alu recombination-mediated deletion process in creating structural differences between the chimpanzee and human genomes. Using a combination of computational and experimental techniques, we identified 663 deletions, involving the removal of ∼771 kb of genomic sequence. Interestingly, about half of these deletions were located within known or predicted genes, and in several cases, the deletions removed coding exons from chimpanzee genes as compared to their human counterparts. Alu recombination-mediated deletion shows signs of being a major sculptor of primate genomes and may be responsible for generating some of the genetic differences between humans and chimpanzees.
Collapse
Affiliation(s)
- Kyudong Han
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Jungnam Lee
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Thomas J Meyer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Jianxin Wang
- Department of Cancer Genetics, Roswell Park Cancer Institute, New York, United States of America
| | - Shurjo K Sen
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Deepa Srikanta
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
| | - Ping Liang
- Department of Cancer Genetics, Roswell Park Cancer Institute, New York, United States of America
| | - Mark A Batzer
- Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Biological Computation and Visualization Center, Louisiana State University, Baton Rouge, Louisiana, United States of America
- Center for BioModular Multi-Scale Systems, Louisiana State University, Baton Rouge, Louisiana, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
374
|
Bosch N, Cáceres M, Cardone MF, Carreras A, Ballana E, Rocchi M, Armengol L, Estivill X. Characterization and evolution of the novel gene family FAM90A in primates originated by multiple duplication and rearrangement events. Hum Mol Genet 2007; 16:2572-82. [PMID: 17684299 DOI: 10.1093/hmg/ddm209] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/30/2023] Open
Abstract
Genomic plasticity of human chromosome 8p23.1 region is highly influenced by two groups of complex segmental duplications (SDs), termed REPD and REPP, that mediate different kinds of rearrangements. Part of the difficulty to explain the wide range of phenotypes associated with 8p23.1 rearrangements is that REPP and REPD are not yet well characterized, probably due to their polymorphic status. Here, we describe a novel primate-specific gene family, named FAM90A (family with sequence similarity 90), found within these SDs. According to the current human reference sequence assembly, the FAM90A family includes 24 members along 8p23.1 region plus a single member on chromosome 12p13.31, showing copy number variation (CNV) between individuals. These genes can be classified into subfamilies I and II, which differ in their upstream and 5'-untranslated region sequences, but both share the same open reading frame and are ubiquitously expressed. Sequence analysis and comparative fluorescence in situ hybridization studies showed that FAM90A subfamily II suffered a big expansion in the hominoid lineage, whereas subfamily I members were likely generated sometime around the divergence of orangutan and African great apes by a fusion process. In addition, the analysis of the Ka/Ks ratios provides evidence of functional constraint of some FAM90A genes in all species. The characterization of the FAM90A gene family contributes to a better understanding of the structural polymorphism of the human 8p23.1 region and constitutes a good example of how SDs, CNVs and rearrangements within themselves can promote the formation of new gene sequences with potential functional consequences.
Collapse
Affiliation(s)
- Nina Bosch
- Genes and Disease Program, Center for Genomic Regulation (CRG-UPF) and CIBERESP, Barcelona, Catalonia, Spain
| | | | | | | | | | | | | | | |
Collapse
|
375
|
Ellegren H. Molecular evolutionary genomics of birds. Cytogenet Genome Res 2007; 117:120-30. [PMID: 17675852 DOI: 10.1159/000103172] [Citation(s) in RCA: 110] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2006] [Accepted: 09/09/2006] [Indexed: 11/19/2022] Open
Abstract
Insight into the molecular evolution of birds has been offered by the steady accumulation of avian DNA sequence data, recently culminating in the first draft sequence of an avian genome, that of chicken. By studying avian molecular evolution we can learn about adaptations and phenotypic evolution in birds, and also gain an understanding of the similarities and differences between mammalian and avian genomes. In both these lineages, there is pronounced isochore structure with highly variable GC content. However, while mammalian isochores are decaying, they are maintained in the chicken lineage, which is consistent with a biased gene conversion model where the high and variable recombination rate of birds reinforces heterogeneity in GC. In Galliformes, GC is positively correlated with the rate of nucleotide substitution; the mean neutral mutation rate is 0.12-0.15% at each site per million years but this estimate comes with significant local variation in the rate of mutation. Comparative genomics reveals lower d(N)/d(S) ratios on micro- compared to macrochromosomes, possibly due to population genetic effects or a non-random distribution of genes with respect to chromosome size. A non-random genomic distribution is shown by genes with sex-biased expression, with male-biased genes over-represented and female-biased genes under-represented on the Z chromosome. A strong effect of selection is evident on the non-recombining W chromosome with high d(N)/d(S) ratios and limited polymorphism. Nucleotide diversity in chicken is estimated at 4-5 x 10(-3) which might be seen as surprisingly high given presumed bottlenecks during domestication, but is lower than that recently observed in several natural populations of other species. Several important aspects of the molecular evolutionary process of birds remain to be understood and it can be anticipated that the upcoming genome sequence of a second bird species, the zebra finch, as well as the integration of data on gene expression, shall further advance our knowledge of avian evolution.
Collapse
Affiliation(s)
- H Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden.
| |
Collapse
|
376
|
Affiliation(s)
- Haixu Tang
- School of Informatics, Center for Genomics and Bioinformatics, Indiana University, Bloomington, Indiana 47408, USA.
| |
Collapse
|
377
|
Skipper M. Rhesus macaque joins the club. Nat Rev Genet 2007. [DOI: 10.1038/nrg2121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
|
378
|
Stevenson BJ, Iseli C, Panji S, Zahn-Zabal M, Hide W, Old LJ, Simpson AJ, Jongeneel CV. Rapid evolution of cancer/testis genes on the X chromosome. BMC Genomics 2007; 8:129. [PMID: 17521433 PMCID: PMC1890293 DOI: 10.1186/1471-2164-8-129] [Citation(s) in RCA: 72] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2007] [Accepted: 05/23/2007] [Indexed: 11/15/2022] Open
Abstract
Background Cancer/testis (CT) genes are normally expressed only in germ cells, but can be activated in the cancer state. This unusual property, together with the finding that many CT proteins elicit an antigenic response in cancer patients, has established a role for this class of genes as targets in immunotherapy regimes. Many families of CT genes have been identified in the human genome, but their biological function for the most part remains unclear. While it has been shown that some CT genes are under diversifying selection, this question has not been addressed before for the class as a whole. Results To shed more light on this interesting group of genes, we exploited the generation of a draft chimpanzee (Pan troglodytes) genomic sequence to examine CT genes in an organism that is closely related to human, and generated a high-quality, manually curated set of human:chimpanzee CT gene alignments. We find that the chimpanzee genome contains homologues to most of the human CT families, and that the genes are located on the same chromosome and at a similar copy number to those in human. Comparison of putative human:chimpanzee orthologues indicates that CT genes located on chromosome X are diverging faster and are undergoing stronger diversifying selection than those on the autosomes or than a set of control genes on either chromosome X or autosomes. Conclusion Given their high level of diversifying selection, we suggest that CT genes are primarily responsible for the observed rapid evolution of protein-coding genes on the X chromosome.
Collapse
Affiliation(s)
- Brian J Stevenson
- Ludwig Institute for Cancer Research and Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Christian Iseli
- Ludwig Institute for Cancer Research and Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Sumir Panji
- South African National Bioinformatics Institute, University of the Western Cape, Bellville, 7535, South Africa
| | - Monique Zahn-Zabal
- Ludwig Institute for Cancer Research and Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Winston Hide
- South African National Bioinformatics Institute, University of the Western Cape, Bellville, 7535, South Africa
| | - Lloyd J Old
- Ludwig Institute for Cancer Research, New York Branch at Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, New York 10021, USA
| | - Andrew J Simpson
- Ludwig Institute for Cancer Research, New York Branch at Memorial Sloan-Kettering Cancer Center, 1275 York Avenue, New York, New York 10021, USA
| | - C Victor Jongeneel
- Ludwig Institute for Cancer Research and Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| |
Collapse
|
379
|
Gibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, Venter JC, Wilson RK, Batzer MA, Bustamante CD, Eichler EE, Hahn MW, Hardison RC, Makova KD, Miller W, Milosavljevic A, Palermo RE, Siepel A, Sikela JM, Attaway T, Bell S, Bernard KE, Buhay CJ, Chandrabose MN, Dao M, Davis C, Delehaunty KD, Ding Y, Dinh HH, Dugan-Rocha S, Fulton LA, Gabisi RA, Garner TT, Godfrey J, Hawes AC, Hernandez J, Hines S, Holder M, Hume J, Jhangiani SN, Joshi V, Khan ZM, Kirkness EF, Cree A, Fowler RG, Lee S, Lewis LR, Li Z, Liu YS, Moore SM, Muzny D, Nazareth LV, Ngo DN, Okwuonu GO, Pai G, Parker D, Paul HA, Pfannkoch C, Pohl CS, Rogers YH, Ruiz SJ, Sabo A, Santibanez J, Schneider BW, Smith SM, Sodergren E, Svatek AF, Utterback TR, Vattathil S, Warren W, White CS, Chinwalla AT, Feng Y, Halpern AL, Hillier LW, Huang X, Minx P, Nelson JO, Pepin KH, Qin X, Sutton GG, Venter E, Walenz BP, Wallis JW, Worley KC, Yang SP, Jones SM, Marra MA, Rocchi M, Schein JE, Baertsch R, Clarke L, Csürös M, Glasscock J, Harris RA, Havlak P, Jackson AR, Jiang H, et alGibbs RA, Rogers J, Katze MG, Bumgarner R, Weinstock GM, Mardis ER, Remington KA, Strausberg RL, Venter JC, Wilson RK, Batzer MA, Bustamante CD, Eichler EE, Hahn MW, Hardison RC, Makova KD, Miller W, Milosavljevic A, Palermo RE, Siepel A, Sikela JM, Attaway T, Bell S, Bernard KE, Buhay CJ, Chandrabose MN, Dao M, Davis C, Delehaunty KD, Ding Y, Dinh HH, Dugan-Rocha S, Fulton LA, Gabisi RA, Garner TT, Godfrey J, Hawes AC, Hernandez J, Hines S, Holder M, Hume J, Jhangiani SN, Joshi V, Khan ZM, Kirkness EF, Cree A, Fowler RG, Lee S, Lewis LR, Li Z, Liu YS, Moore SM, Muzny D, Nazareth LV, Ngo DN, Okwuonu GO, Pai G, Parker D, Paul HA, Pfannkoch C, Pohl CS, Rogers YH, Ruiz SJ, Sabo A, Santibanez J, Schneider BW, Smith SM, Sodergren E, Svatek AF, Utterback TR, Vattathil S, Warren W, White CS, Chinwalla AT, Feng Y, Halpern AL, Hillier LW, Huang X, Minx P, Nelson JO, Pepin KH, Qin X, Sutton GG, Venter E, Walenz BP, Wallis JW, Worley KC, Yang SP, Jones SM, Marra MA, Rocchi M, Schein JE, Baertsch R, Clarke L, Csürös M, Glasscock J, Harris RA, Havlak P, Jackson AR, Jiang H, Liu Y, Messina DN, Shen Y, Song HXZ, Wylie T, Zhang L, Birney E, Han K, Konkel MK, Lee J, Smit AFA, Ullmer B, Wang H, Xing J, Burhans R, Cheng Z, Karro JE, Ma J, Raney B, She X, Cox MJ, Demuth JP, Dumas LJ, Han SG, Hopkins J, Karimpour-Fard A, Kim YH, Pollack JR, Vinar T, Addo-Quaye C, Degenhardt J, Denby A, Hubisz MJ, Indap A, Kosiol C, Lahn BT, Lawson HA, Marklein A, Nielsen R, Vallender EJ, Clark AG, Ferguson B, Hernandez RD, Hirani K, Kehrer-Sawatzki H, Kolb J, Patil S, Pu LL, Ren Y, Smith DG, Wheeler DA, Schenck I, Ball EV, Chen R, Cooper DN, Giardine B, Hsu F, Kent WJ, Lesk A, Nelson DL, O'brien WE, Prüfer K, Stenson PD, Wallace JC, Ke H, Liu XM, Wang P, Xiang AP, Yang F, Barber GP, Haussler D, Karolchik D, Kern AD, Kuhn RM, Smith KE, Zwieg AS. Evolutionary and biomedical insights from the rhesus macaque genome. Science 2007; 316:222-34. [PMID: 17431167 DOI: 10.1126/science.1139247] [Show More Authors] [Citation(s) in RCA: 1023] [Impact Index Per Article: 56.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The rhesus macaque (Macaca mulatta) is an abundant primate species that diverged from the ancestors of Homo sapiens about 25 million years ago. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used nonhuman primate in basic and applied biomedical research. We determined the genome sequence of an Indian-origin Macaca mulatta female and compared the data with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families. A comparison of sequences from individual animals was used to investigate their underlying genetic diversity. The complete description of the macaque genome blueprint enhances the utility of this animal model for biomedical research and improves our understanding of the basic biology of the species.
Collapse
|
380
|
Ventura M, Antonacci F, Cardone MF, Stanyon R, D'Addabbo P, Cellamare A, Sprague LJ, Eichler EE, Archidiacono N, Rocchi M. Evolutionary Formation of New Centromeres in Macaque. Science 2007; 316:243-6. [PMID: 17431171 DOI: 10.1126/science.1140615] [Citation(s) in RCA: 109] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
A systematic fluorescence in situ hybridization comparison of macaque and human synteny organization disclosed five additional macaque evolutionary new centromeres (ENCs) for a total of nine ENCs. To understand the dynamics of ENC formation and progression, we compared the ENC of macaque chromosome 4 with the human orthologous region, at 6q24.3, that conserves the ancestral genomic organization. A 250-kilobase segment was extensively duplicated around the macaque centromere. These duplications were strictly intrachromosomal. Our results suggest that novel centromeres may trigger only local duplication activity and that the absence of genes in the seeding region may have been important in ENC maintenance and progression.
Collapse
Affiliation(s)
- Mario Ventura
- Department of Genetics and Microbiology, University of Bari, 70126 Bari, Italy
| | | | | | | | | | | | | | | | | | | |
Collapse
|
381
|
Sawyer JR, Binz RL, Swanson CM, Lim C. De novo proximal duplication of 1(q12q22) in a female infant with multiple congenital anomalies. Am J Med Genet A 2007; 143:338-42. [PMID: 17230489 DOI: 10.1002/ajmg.a.31604] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Reports of small proximal 1q duplications are rare. We report a 1 month-old female who was referred to clinic because she was believed to have features suggestive of Turner syndrome. The patient's dysmorphic features included a prominent nose, low-set and crumpled ears, slightly high palate, short neck, high-pitched cry, mild micrognathia, hypoplastic labia majora, and somewhat deep palmar creases. Traditional G-band chromosome studies of the patient were interpreted as 46,XX,dup(1)(q12q21). To further evaluate the extent of the chromosome 1 duplication, Spectral Karyotyping and a series of six fluorescence in situ hybridization (FISH) probes were utilized. The FISH probes refined the extent of the duplication to involve the region 1(q12q22) indicating the duplicated segment was larger than interpreted by the G-banding studies. This first case of non-mosaic proximal duplication of 1q to be characterized by multiple locus specific FISH probes should allow a more refined delineation of the phenotypic findings and clinical significance associated with this rare chromosomal duplication.
Collapse
Affiliation(s)
- Jeffrey R Sawyer
- Cytogenetics Laboratory, Department of Pathology, University of Arkansas for Medical Sciences, Little Rock 72204, USA.
| | | | | | | |
Collapse
|
382
|
Thomas JH. Rapid birth-death evolution specific to xenobiotic cytochrome P450 genes in vertebrates. PLoS Genet 2007; 3:e67. [PMID: 17500592 PMCID: PMC1866355 DOI: 10.1371/journal.pgen.0030067] [Citation(s) in RCA: 150] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2006] [Accepted: 03/14/2007] [Indexed: 12/15/2022] Open
Abstract
Genes vary greatly in their long-term phylogenetic stability and there exists no general explanation for these differences. The cytochrome P450 (CYP450) gene superfamily is well suited to investigating this problem because it is large and well studied, and it includes both stable and unstable genes. CYP450 genes encode oxidase enzymes that function in metabolism of endogenous small molecules and in detoxification of xenobiotic compounds. Both types of enzymes have been intensively studied. My analysis of ten nearly complete vertebrate genomes indicates that each genome contains 50-80 CYP450 genes, which are about evenly divided between phylogenetically stable and unstable genes. The stable genes are characterized by few or no gene duplications or losses in species ranging from bony fish to mammals, whereas unstable genes are characterized by frequent gene duplications and losses (birth-death evolution) even among closely related species. All of the CYP450 genes that encode enzymes with known endogenous substrates are phylogenetically stable. In contrast, most of the unstable genes encode enzymes that function as xenobiotic detoxifiers. Nearly all unstable CYP450 genes in the mouse and human genomes reside in a few dense gene clusters, forming unstable gene islands that arose by recurrent local gene duplication. Evidence for positive selection in amino acid sequence is restricted to these unstable CYP450 genes, and sites of selection are associated with substrate-binding regions in the protein structure. These results can be explained by a general model in which phylogenetically stable genes have core functions in development and physiology, whereas unstable genes have accessory functions associated with unstable environmental interactions such as toxin and pathogen exposure. Unstable gene islands in vertebrates share some functional properties with bacterial genomic islands, though they arise by local gene duplication rather than horizontal gene transfer.
Collapse
Affiliation(s)
- James H Thomas
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America.
| |
Collapse
|
383
|
Kehrer-Sawatzki H, Cooper DN. Understanding the recent evolution of the human genome: insights from human-chimpanzee genome comparisons. Hum Mutat 2007; 28:99-130. [PMID: 17024666 DOI: 10.1002/humu.20420] [Citation(s) in RCA: 94] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
The sequencing of the chimpanzee genome and the comparison with its human counterpart have begun to reveal the spectrum of genetic changes that has accompanied human evolution. In addition to gross karyotypic rearrangements such as the fusion that formed human chromosome 2 and the human-specific pericentric inversions of chromosomes 1 and 18, there is considerable submicroscopic structural variation involving deletions, duplications, and inversions. Lineage-specific segmental duplications, detected by array comparative genomic hybridization and direct sequence comparison, have made a very significant contribution to this structural divergence, which is at least three-fold greater than that due to nucleotide substitutions. Since structural genomic changes may have given rise to irreversible functional differences between the diverging species, their detailed analysis could help to identify the biological processes that have accompanied speciation. To this end, interspecies comparisons have revealed numerous human-specific gains and losses of genes as well as changes in gene expression. The very considerable structural diversity (polymorphism) evident within both lineages has, however, hampered the analysis of the structural divergence between the human and chimpanzee genomes. The concomitant evaluation of genetic divergence and diversity at the nucleotide level has nevertheless served to identify many genes that have evolved under positive selection and may thus have been involved in the development of human lineage-specific traits. Genes that display signs of weak negative selection have also been identified and could represent candidate loci for complex genomic disorders. Here, we review recent progress in comparing the human and chimpanzee genomes and discuss how the differences detected have improved our understanding of the evolution of the human genome.
Collapse
|
384
|
Abstract
Chromosome deletions do abound in cancer and are detected in certain regions in a non-random manner. Although their relevance remains elusive, it is a general agreement that segmental losses provide the cell with selective growth advantage. Consequently these may contain genes and/or regulatory sequences that control normal growth and inhibit malignancy. We have developed a monochromosomal hybrid based experimental model for the generation and functional analysis of deletions, that is called "elimination test" (Et). Focused on human chromosome 3 - that was known to carry multiple 3p deletions - the Et was expected to restrict a 3p tumor suppressor region to a sufficiently small segment that permits the selection of a critically important candidate gene. Surprisingly, we detected three regions that were lost in all or majority of tumors: CER1 (3p21.3, Mb: 43.32-45.74), CER2 (3p22, Mb: 37.83-39.06) and FER (3p14.3-p21.2, Mb: 50.12-58.03). In contrast a 3q26-qter region (CRR) was regularly retained. CER1 - our main focus - contains multiple genes that may inhibit tumor growth, but 3 genes, RIS1, LF (LTF) and LIMD1 have already the necessary experimental support to be considered bona fide tumor suppressors. Tumor suppressor region borders display instability features including: (1) they break in evolution and in tumors, (2) they evolve horizontally, and (3) they are enriched with pseudogene insertions. The most remarkable features at the breakpoint cluster regions were segmental duplications that drive horizontal evolution and contribute to cancer associated instability.
Collapse
Affiliation(s)
- Maria Kost-Alimova
- Karolinska Institutet, Microbiology Tumor and Cell Biology Center (MTC), Box 280, 171 77 Stockholm, Sweden
| | | |
Collapse
|
385
|
Weiss KM, Smith FH. Out of the veil of death rode the one million! Neandertals and their genes. Bioessays 2007; 29:105-10. [PMID: 17226793 DOI: 10.1002/bies.20535] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Two recent papers report extensive nuclear DNA sequence from a 38,000-year-old Neandertal fossil, comparing it to modern humans and estimating when it diverged from, and whether it contributed to, our gene pool. Based on 65,250 and over a million base-pairs of sequence across the genome, respectively, the groups arrived at slightly different interpretations. The data are an exciting and interesting new contribution, but are not surprising, and a sense of history and question helps put them in perspective.
Collapse
Affiliation(s)
- Kenneth M Weiss
- Department of Anthropology, Penn State University, PA 16802, USA.
| | | |
Collapse
|
386
|
Demuth JP, Bie TD, Stajich JE, Cristianini N, Hahn MW. The evolution of mammalian gene families. PLoS One 2006; 1:e85. [PMID: 17183716 PMCID: PMC1762380 DOI: 10.1371/journal.pone.0000085] [Citation(s) in RCA: 225] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2006] [Accepted: 11/14/2006] [Indexed: 11/18/2022] Open
Abstract
Gene families are groups of homologous genes that are likely to have highly similar functions. Differences in family size due to lineage-specific gene duplication and gene loss may provide clues to the evolutionary forces that have shaped mammalian genomes. Here we analyze the gene families contained within the whole genomes of human, chimpanzee, mouse, rat, and dog. In total we find that more than half of the 9,990 families present in the mammalian common ancestor have either expanded or contracted along at least one lineage. Additionally, we find that a large number of families are completely lost from one or more mammalian genomes, and a similar number of gene families have arisen subsequent to the mammalian common ancestor. Along the lineage leading to modern humans we infer the gain of 689 genes and the loss of 86 genes since the split from chimpanzees, including changes likely driven by adaptive natural selection. Our results imply that humans and chimpanzees differ by at least 6% (1,418 of 22,000 genes) in their complement of genes, which stands in stark contrast to the oft-cited 1.5% difference between orthologous nucleotide sequences. This genomic “revolving door” of gene gain and loss represents a large number of genetic differences separating humans from our closest relatives.
Collapse
Affiliation(s)
- Jeffery P. Demuth
- Department of Biology and School of Informatics, Indiana UniversityBloomington, Indiana, United States of America
| | - Tijl De Bie
- School of Electronics and Computer Science, ISIS Group, University of SouthamptonSouthampton, United Kingdom
| | - Jason E. Stajich
- Department of Molecular Genetics and Microbiology, Duke UniversityDurham, North Carolina, United States of America
| | - Nello Cristianini
- Department of Statistics, University of California DavisDavis, California, United States of America
| | - Matthew W. Hahn
- Department of Biology and School of Informatics, Indiana UniversityBloomington, Indiana, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
387
|
Gibcus JH, Kok K, Menkema L, Hermsen MA, Mastik M, Kluin PM, van der Wal JE, Schuuring E. High-resolution mapping identifies a commonly amplified 11q13.3 region containing multiple genes flanked by segmental duplications. Hum Genet 2006; 121:187-201. [PMID: 17171571 DOI: 10.1007/s00439-006-0299-6] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2006] [Accepted: 11/09/2006] [Indexed: 11/28/2022]
Abstract
DNA amplification of the 11q13 region is observed frequently in many carcinomas. Within the amplified region several candidate oncogenes have been mapped, including cyclin D1, TAOS1 and cortactin. Yet, it is unknown which gene(s) is/are responsible for the selective pressure enabling amplicon formation. This is probably due to the use of low-resolution detection methods. Furthermore, the size and structure of the amplified 11q13 region is complex and consists of multiple amplicon cores that differ between different tumor types. We set out to test whether the borders of the 11q13 amplicon are restricted to regions that enable DNA breakage and subsequent amplification. A high-resolution array of the 11q13 region was generated to study the structure of the 11q13 amplicon and analyzed 29 laryngeal and pharyngeal carcinomas and nine cell lines with 11q13 amplification. We found that boundaries of the commonly amplified region were restricted to four segments. Three boundaries coincided with a syntenic breakpoint. Such regions have been suggested to be putatively fragile. Sequence comparisons revealed that the amplicon was flanked by two large low copy repeats known as segmental duplications. These segmental duplications might be responsible for the typical structure and size of the 11q13 amplicon. We hypothesize that the selection for genes through amplification of the 11q13.3 region is determined by the ability to form DNA breaks within specific regions and, consequently, results in large amplicons containing multiple genes.
Collapse
Affiliation(s)
- Johan H Gibcus
- Department of Pathology, University Medical Center Groningen, University of Groningen, 9700 RB Groningen, The Netherlands
| | | | | | | | | | | | | | | |
Collapse
|
388
|
Abstract
Recombination has essential functions in mammalian meiosis, which impose several constraints on the recombination process. However, recent studies have shown that, in spite of these roles, recombination rates vary tremendously among humans, and show marked differences between humans and closely related species. These findings provide important insights into the determinants of recombination rates and raise new questions about the selective pressures that affect recombination over different genomic scales, with implications for human genetics and evolutionary biology.
Collapse
Affiliation(s)
- Graham Coop
- Department of Human Genetics, University of Chicago, 920 East 58th Street, Chicago, Illinois 60637, USA
| | | |
Collapse
|
389
|
|
390
|
Abstract
Although genetic association studies have been with us for many years, even for the simplest analyses there is little consensus on the most appropriate statistical procedures. Here I give an overview of statistical approaches to population association studies, including preliminary analyses (Hardy-Weinberg equilibrium testing, inference of phase and missing data, and SNP tagging), and single-SNP and multipoint tests for association. My goal is to outline the key methods with a brief discussion of problems (population structure and multiple testing), avenues for solutions and some ongoing developments.
Collapse
Affiliation(s)
- David J Balding
- Department of Epidemiology and Public Health, Imperial College, St Marys Campus, Norfolk Place, London W2 1PG, UK.
| |
Collapse
|
391
|
Kehrer-Sawatzki H, Cooper DN. Structural divergence between the human and chimpanzee genomes. Hum Genet 2006; 120:759-78. [PMID: 17066299 DOI: 10.1007/s00439-006-0270-6] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2006] [Accepted: 09/19/2006] [Indexed: 01/17/2023]
Abstract
The structural microheterogeneity evident between the human and chimpanzee genomes is quite considerable and includes inversions and duplications as well as deletions, ranging in size from a few base-pairs up to several megabases (Mb). Insertions and deletions have together given rise to at least 150 Mb of genomic DNA sequence that is either present or absent in humans as compared to chimpanzees. Such regions often contain paralogous sequences and members of multigene families thereby ensuring that the human and chimpanzee genomes differ by a significant fraction of their gene content. There is as yet no evidence to suggest that the large chromosomal rearrangements which serve to distinguish the human and chimpanzee karyotypes have influenced either speciation or the evolution of lineage-specific traits. However, the myriad submicroscopic rearrangements in both genomes, particularly those involving copy number variation, are unlikely to represent exclusively neutral changes and hence promise to facilitate the identification of genes that have been important for human-specific evolution.
Collapse
|
392
|
Rocchi M, Archidiacono N, Stanyon R. Ancestral genomes reconstruction: An integrated, multi-disciplinary approach is needed. Genome Res 2006; 16:1441-4. [PMID: 17053088 DOI: 10.1101/gr.5687906] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Affiliation(s)
- Mariano Rocchi
- Department of Genetics and Microbiology, University of Bari, Bari 70126, Italy.
| | | | | |
Collapse
|
393
|
Chan CX, Beiko RG, Ragan MA. Detecting recombination in evolving nucleotide sequences. BMC Bioinformatics 2006; 7:412. [PMID: 16978423 PMCID: PMC1592127 DOI: 10.1186/1471-2105-7-412] [Citation(s) in RCA: 35] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2006] [Accepted: 09/18/2006] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation.
Collapse
Affiliation(s)
- Cheong Xin Chan
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| | - Robert G Beiko
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| | - Mark A Ragan
- ARC Centre in Bioinformatics and Institute for Molecular Bioscience, the University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|