51
|
Maside X, Charlesworth B. Patterns of molecular variation and evolution in Drosophila americana and its relatives. Genetics 2007; 176:2293-305. [PMID: 17507679 PMCID: PMC1950632 DOI: 10.1534/genetics.107.071191] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present the results of a survey of DNA sequence variability at X-linked and autosomal loci in Drosophila americana and of patterns of DNA sequence evolution among D. americana and four other related species in the virilis group of Drosophila. D. americana shows a typical level of silent polymorphism for a Drosophila species, but has an unusually low ratio of nonsynonymous to silent variation. Both D. virilis and D. americana also show a low ratio of nonsynonymous to synonymous substitutions along their respective lineages since the split from their common ancestor. The proportion of amino acid substitutions between D. americana and its relatives that are caused by positive selection, as estimated by extensions of the McDonald-Kreitman test, appears to be unusually high. We cannot, however, exclude the possibility that this reflects a recent increase in the intensity of selection on nonsynonymous mutations in D. americana and D. virilis. We also find that base composition at neutral sites appears to be in overall equilibrium among these species, but there is evidence for departure from equilibrium for codon usage in some lineages.
Collapse
Affiliation(s)
- Xulio Maside
- Grupo de Medicina Xenómica, Instituto de Medicina Legal, Universidade de Santiago de Compostela, Rúa de San Francisco s/n, 15782 Santiago de Compostela, Spain.
| | | |
Collapse
|
52
|
Chen K, Rajewsky N. Natural selection on human microRNA binding sites inferred from SNP data. Nat Genet 2006; 38:1452-6. [PMID: 17072316 DOI: 10.1038/ng1910] [Citation(s) in RCA: 355] [Impact Index Per Article: 18.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2006] [Accepted: 09/21/2006] [Indexed: 01/29/2023]
Abstract
A fundamental problem in biology is understanding how natural selection has shaped the evolution of gene regulation. Here we use SNP genotype data and techniques from population genetics to study an entire layer of short, cis-regulatory sites in the human genome. MicroRNAs (miRNAs) are a class of small noncoding RNAs that post-transcriptionally repress mRNA through cis-regulatory sites in 3' UTRs. We show that negative selection in humans is stronger on computationally predicted conserved miRNA binding sites than on other conserved sequence motifs in 3' UTRs, thus providing independent support for the target prediction model and explicitly demonstrating the contribution of miRNAs to darwinian fitness. Our techniques extend to nonconserved miRNA binding sites, and we estimate that 30%-50% of these are functional when the mRNA and miRNA are endogenously coexpressed. As we show that polymorphisms in predicted miRNA binding sites are likely to be deleterious, they are candidates for causal variants of human disease. We believe that our approach can be extended to studying other classes of cis-regulatory sites.
Collapse
Affiliation(s)
- Kevin Chen
- Center for Comparative Functional Genomics, Department of Biology, New York University, New York, New York 10003, USA
| | | |
Collapse
|
53
|
Bartolomé C, Charlesworth B. Evolution of amino-acid sequences and codon usage on the Drosophila miranda neo-sex chromosomes. Genetics 2006; 174:2033-44. [PMID: 17028318 PMCID: PMC1698622 DOI: 10.1534/genetics.106.064113] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have studied patterns of DNA sequence variation and evolution for 22 genes located on the neo-X and neo-Y chromosomes of Drosophila miranda. As found previously, nucleotide site diversity is greatly reduced on the neo-Y chromosome, with a severely distorted frequency spectrum. There is also an accelerated rate of amino-acid sequence evolution on the neo-Y chromosome. Comparisons of nonsynonymous and silent variation and divergence suggest that amino-acid sequences on the neo-X chromosome are subject to purifying selection, whereas this is much weaker on the neo-Y. The same applies to synonymous variants affecting codon usage. There is also an indication of a recent relaxation of selection on synonymous mutations for genes on other chromosomes. Genes that are weakly expressed on the neo-Y chromosome appear to have a faster rate of accumulation of both nonsynonymous and unpreferred synonymous mutations than genes with high levels of expression, although the rate of accumulation when both types of mutation are pooled is higher for the neo-Y chromosome than for the neo-X chromosome even for highly expressed genes.
Collapse
Affiliation(s)
- Carolina Bartolomé
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
54
|
Ko WY, Piao S, Akashi H. Strong regional heterogeneity in base composition evolution on the Drosophila X chromosome. Genetics 2006; 174:349-62. [PMID: 16547109 PMCID: PMC1569809 DOI: 10.1534/genetics.105.054346] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2005] [Accepted: 05/08/2006] [Indexed: 11/18/2022] Open
Abstract
Fluctuations in base composition appear to be prevalent in Drosophila and mammal genome evolution, but their timescale, genomic breadth, and causes remain obscure. Here, we study base composition evolution within the X chromosomes of Drosophila melanogaster and five of its close relatives. Substitutions were inferred on six extant and two ancestral lineages for 14 near-telomeric and 9 nontelomeric genes. GC content evolution is highly variable both within the genome and within the phylogenetic tree. In the lineages leading to D. yakuba and D. orena, GC content at silent sites has increased rapidly near telomeres, but has decreased in more proximal (nontelomeric) regions. D. orena shows a 17-fold excess of GC-increasing vs. AT-increasing synonymous changes within a small (approximately 130-kb) region close to the telomeric end. Base composition changes within introns are consistent with changes in mutation patterns, but stronger GC elevation at synonymous sites suggests contributions of natural selection or biased gene conversion. The Drosophila yakuba lineage shows a less extreme elevation of GC content distributed over a wider genetic region (approximately 1.2 Mb). A lack of change in GC content for most introns within this region suggests a role of natural selection in localized base composition fluctuations.
Collapse
Affiliation(s)
- Wen-Ya Ko
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | | | | |
Collapse
|
55
|
Björnerfeldt S, Webster MT, Vilà C. Relaxation of selective constraint on dog mitochondrial DNA following domestication. Genome Res 2006; 16:990-4. [PMID: 16809672 PMCID: PMC1524871 DOI: 10.1101/gr.5117706] [Citation(s) in RCA: 137] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The domestication of dogs caused a dramatic change in their way of life compared with that of their ancestor, the gray wolf. We hypothesize that this new life style changed the selective forces that acted upon the species, which in turn had an effect on the dog's genome. We sequenced the complete mitochondrial DNA genome in 14 dogs, six wolves, and three coyotes. Here we show that dogs have accumulated nonsynonymous changes in mitochondrial genes at a faster rate than wolves, leading to elevated levels of variation in their proteins. This suggests that a major consequence of domestication in dogs was a general relaxation of selective constraint on their mitochondrial genome. If this change also affected other parts of the dog genome, it could have facilitated the generation of novel functional genetic diversity. This diversity could thus have contributed raw material upon which artificial selection has shaped modern breeds and may therefore be an important source of the extreme phenotypic variation present in modern-day dogs.
Collapse
Affiliation(s)
- Susanne Björnerfeldt
- Department of Evolutionary Biology, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Matthew T. Webster
- Department of Evolutionary Biology, Uppsala University, SE-752 36 Uppsala, Sweden
| | - Carles Vilà
- Department of Evolutionary Biology, Uppsala University, SE-752 36 Uppsala, Sweden
- Corresponding author.E-mail ; fax. +46-18-471-6310
| |
Collapse
|
56
|
Gilad Y, Oshlack A, Rifkin SA. Natural selection on gene expression. Trends Genet 2006; 22:456-61. [PMID: 16806568 DOI: 10.1016/j.tig.2006.06.002] [Citation(s) in RCA: 155] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2006] [Revised: 04/07/2006] [Accepted: 06/05/2006] [Indexed: 01/30/2023]
Abstract
Changes in genetic regulation contribute to adaptations in natural populations and influence susceptibility to human diseases. Despite their potential phenotypic importance, the selective pressures acting on regulatory processes in general and gene expression levels in particular are largely unknown. Studies in model organisms suggest that the expression levels of most genes evolve under stabilizing selection, although a few are consistent with adaptive evolution. However, it has been proposed that gene expression levels in primates evolve largely in the absence of selective constraints. In this article, we discuss the microarray-based observations that led to these disparate interpretations. We conclude that in both primates and model organisms, stabilizing selection is likely to be the dominant mode of gene expression evolution. An important implication is that mutations affecting gene expression will often be deleterious and might underlie many human diseases.
Collapse
Affiliation(s)
- Yoav Gilad
- Department of Human Genetics, University of Chicago, Chicago, Il 60637, USA.
| | | | | |
Collapse
|
57
|
Cutter AD, Félix MA, Barrière A, Charlesworth D. Patterns of nucleotide polymorphism distinguish temperate and tropical wild isolates of Caenorhabditis briggsae. Genetics 2006; 173:2021-31. [PMID: 16783011 PMCID: PMC1569728 DOI: 10.1534/genetics.106.058651] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Caenorhabditis briggsae provides a natural comparison species for the model nematode C. elegans, given their similar morphology, life history, and hermaphroditic mode of reproduction. Despite C. briggsae boasting a published genome sequence and establishing Caenorhabditis as a model genus for genetics and development, little is known about genetic variation across the geographic range of this species. In this study, we greatly expand the collection of natural isolates and characterize patterns of nucleotide variation for six loci in 63 strains from three continents. The pattern of polymorphisms reveals differentiation between C. briggsae strains found in temperate localities in the northern hemisphere from those sampled near the Tropic of Cancer, with diversity within the tropical region comparable to what is found for C. elegans in Europe. As in C. elegans, linkage disequilibrium is pervasive, although recombination is evident among some variant sites, indicating that outcrossing has occurred at a low rate in the history of the sample. In contrast to C. elegans, temperate regions harbor extremely little variation, perhaps reflecting colonization and recent expansion of C. briggsae into northern latitudes. We discuss these findings in relation to their implications for selection, demographic history, and the persistence of self-fertilization.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, UK.
| | | | | | | |
Collapse
|
58
|
Nielsen R, Williamson S, Kim Y, Hubisz MJ, Clark AG, Bustamante C. Genomic scans for selective sweeps using SNP data. Genome Res 2006; 15:1566-75. [PMID: 16251466 PMCID: PMC1310644 DOI: 10.1101/gr.4252305] [Citation(s) in RCA: 699] [Impact Index Per Article: 36.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Detecting selective sweeps from genomic SNP data is complicated by the intricate ascertainment schemes used to discover SNPs, and by the confounding influence of the underlying complex demographics and varying mutation and recombination rates. Current methods for detecting selective sweeps have little or no robustness to the demographic assumptions and varying recombination rates, and provide no method for correcting for ascertainment biases. Here, we present several new tests aimed at detecting selective sweeps from genomic SNP data. Using extensive simulations, we show that a new parametric test, based on composite likelihood, has a high power to detect selective sweeps and is surprisingly robust to assumptions regarding recombination rates and demography (i.e., has low Type I error). Our new test also provides estimates of the location of the selective sweep(s) and the magnitude of the selection coefficient. To illustrate the method, we apply our approach to data from the Seattle SNP project and to Chromosome 2 data from the HapMap project. In Chromosome 2, the most extreme signal is found in the lactase gene, which previously has been shown to be undergoing positive selection. Evidence for selective sweeps is also found in many other regions, including genes known to be associated with disease risk such as DPP10 and COL4A3.
Collapse
Affiliation(s)
- Rasmus Nielsen
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA.
| | | | | | | | | | | |
Collapse
|
59
|
Comeron JM. Weak selection and recent mutational changes influence polymorphic synonymous mutations in humans. Proc Natl Acad Sci U S A 2006; 103:6940-5. [PMID: 16632609 PMCID: PMC1458998 DOI: 10.1073/pnas.0510638103] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recent large-scale genomic and evolutionary studies have revealed the small but detectable signature of weak selection on synonymous mutations during mammalian evolution, likely acting at the level of translational efficacy (i.e., translational selection). To investigate whether weak selection, and translational selection in particular, plays any role in shaping the fate of synonymous mutations that are present today in human populations, we studied genetic variation at the polymorphic level and patterns of evolution in the human lineage after human-chimpanzee separation. We find evidence that neutral mechanisms are influencing the frequency of polymorphic mutations in humans. Our results suggest a recent increase in mutational tendencies toward AT, observed in all isochores, that is responsible for AT mutations segregating at lower frequencies than GC mutations. In all, however, changes in mutational tendencies and other neutral scenarios are not sufficient to explain a difference between synonymous and noncoding mutations or a difference between synonymous mutations potentially advantageous or deleterious under a translational selection model. Furthermore, several estimates of selection intensity on synonymous mutations all suggest a detectable influence of weak selection acting at the level of translational selection. Thus, random genetic drift, recent changes in mutational tendencies, and weak selection influence the fate of synonymous mutations that are present today as polymorphisms. All of these features, neutral and selective, should be taken into account in evolutionary analyses that often assume constancy of mutational tendencies and complete neutrality of synonymous mutations.
Collapse
Affiliation(s)
- Josep M Comeron
- Department of Biological Sciences, University of Iowa, 212 Biology Building, Iowa City, IA 52242, USA.
| |
Collapse
|
60
|
Akashi H, Ko WY, Piao S, John A, Goel P, Lin CF, Vitins AP. Molecular evolution in the Drosophila melanogaster species subgroup: frequent parameter fluctuations on the timescale of molecular divergence. Genetics 2005; 172:1711-26. [PMID: 16387879 PMCID: PMC1456288 DOI: 10.1534/genetics.105.049676] [Citation(s) in RCA: 39] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Although mutation, genetic drift, and natural selection are well established as determinants of genome evolution, the importance (frequency and magnitude) of parameter fluctuations in molecular evolution is less understood. DNA sequence comparisons among closely related species allow specific substitutions to be assigned to lineages on a phylogenetic tree. In this study, we compare patterns of codon usage and protein evolution in 22 genes (>11,000 codons) among Drosophila melanogaster and five relatives within the D. melanogaster subgroup. We assign changes to eight lineages using a maximum-likelihood approach to infer ancestral states. Uncertainty in ancestral reconstructions is taken into account, at least to some extent, by weighting reconstructions by their posterior probabilities. Four of the eight lineages show potentially genomewide departures from equilibrium synonymous codon usage; three are decreasing and one is increasing in major codon usage. Several of these departures are consistent with lineage-specific changes in selection intensity (selection coefficients scaled to effective population size) at silent sites. Intron base composition and rates and patterns of protein evolution are also heterogeneous among these lineages. The magnitude of forces governing silent, intron, and protein evolution appears to have varied frequently, and in a lineage-specific manner, within the D. melanogaster subgroup.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Institute of Molecular Evolutionary Genetics and Department of Biology, Pennsylvania State University, University Park, Pennsylvania 16802, USA.
| | | | | | | | | | | | | |
Collapse
|
61
|
Kondrashov FA, Ogurtsov AY, Kondrashov AS. Selection in favor of nucleotides G and C diversifies evolution rates and levels of polymorphism at mammalian synonymous sites. J Theor Biol 2005; 240:616-26. [PMID: 16343547 DOI: 10.1016/j.jtbi.2005.10.020] [Citation(s) in RCA: 52] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2005] [Revised: 10/26/2005] [Accepted: 10/27/2005] [Indexed: 11/24/2022]
Abstract
The impact of synonymous nucleotide substitutions on fitness in mammals remains controversial. Despite some indications of selective constraint, synonymous sites are often assumed to be neutral, and the rate of their evolution is used as a proxy for mutation rate. We subdivide all sites into four classes in terms of the mutable CpG context, nonCpG, postC, preG, and postCpreG, and compare four-fold synonymous sites and intron sites residing outside transposable elements. The distribution of the rate of evolution across all synonymous sites is trimodal. Rate of evolution at nonCpG synonymous sites, not preceded by C and not followed by G, is approximately 10% below that at such intron sites. In contrast, rate of evolution at postCpreG synonymous sites is approximately 30% above that at such intron sites. Finally, synonymous and intron postC and preG sites evolve at similar rates. The relationship between the levels of polymorphism at the corresponding synonymous and intron sites is very similar to that between their rates of evolution. Within every class, synonymous sites are occupied by G or C much more often than intron sites, whose nucleotide composition is consistent with neutral mutation-drift equilibrium. These patterns suggest that synonymous sites are under weak selection in favor of G and C, with the average coefficient s approximately 0.25/Ne approximately 10(-5), where Ne is the effective population size. Such selection decelerates evolution and reduces variability at sites with symmetric mutation, but has the opposite effects at sites where the favored nucleotides are more mutable. The amino-acid composition of proteins dictates that many synonymous sites are CpGprone, which causes them, on average, to evolve faster and to be more polymorphic than intron sites. An average genotype carries approximately 10(7) suboptimal nucleotides at synonymous sites, implying synergistic epistasis in selection against them.
Collapse
Affiliation(s)
- Fyodor A Kondrashov
- Section of Ecology, Behavior and Evolution, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093-0346, USA.
| | | | | |
Collapse
|
62
|
Abstract
The distribution of mutational effects on fitness is of fundamental importance for many aspects of evolution. We develop two methods for characterizing the fitness effects of deleterious, nonsynonymous mutations, using polymorphism data from two related species. These methods also provide estimates of the proportion of amino acid substitutions that are selectively favorable, when combined with data on between-species sequence divergence. The methods are applicable to species with different effective population sizes, but that share the same distribution of mutational effects. The first, simpler, method assumes that diversity for all nonneutral mutations is given by the value under mutation-selection balance, while the second method allows for stronger effects of genetic drift and yields estimates of the parameters of the probability distribution of mutational effects. We apply these methods to data on populations of Drosophila miranda and D. pseudoobscura and find evidence for the presence of deleterious nonsynonymous mutations, mostly with small heterozygous selection coefficients (a mean of the order of 10(-5) for segregating variants). A leptokurtic gamma distribution of mutational effects with a shape parameter between 0.1 and 1 can explain observed diversities, in the absence of a separate class of completely neutral nonsynonymous mutations. We also describe a simple approximate method for estimating the harmonic mean selection coefficient from diversity data on a single species.
Collapse
Affiliation(s)
- Laurence Loewe
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | | | | | |
Collapse
|
63
|
Woolfit M, Bromham L. Population size and molecular evolution on islands. Proc Biol Sci 2005; 272:2277-82. [PMID: 16191640 PMCID: PMC1560185 DOI: 10.1098/rspb.2005.3217] [Citation(s) in RCA: 113] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2005] [Accepted: 06/19/2005] [Indexed: 11/12/2022] Open
Abstract
The nearly neutral theory predicts that the rate and pattern of molecular evolution will be influenced by effective population size (Ne), because in small populations more slightly deleterious mutations are expected to drift to fixation. This important prediction has not been widely empirically tested, largely because of the difficulty of comparing rates of molecular evolution in sufficient numbers of independent lineages which differ only in Ne. Island endemic species provide an ideal test of the effect of Ne on molecular evolution because species restricted to islands frequently have smaller Ne than closely related mainland species, and island endemics have arisen from mainland lineages many times in a wide range of taxa. We collated a dataset of 70 phylogenetically independent comparisons between island and mainland taxa, including vertebrates, invertebrates and plants, from 19 different island groups. The rate of molecular evolution in these lineages was estimated by maximum likelihood using two measures: overall substitution rate and the ratio of non-synonymous to synonymous substitution rates. We show that island lineages have significantly higher ratios of non-synonymous to synonymous substitution rates than mainland lineages, as predicted by the nearly neutral theory, although overall substitution rates do not differ significantly.
Collapse
Affiliation(s)
- Megan Woolfit
- Department of Genetics, Trinity College, University of Dublin, Dublin 2, Ireland.
| | | |
Collapse
|
64
|
Cutter AD. Nucleotide polymorphism and linkage disequilibrium in wild populations of the partial selfer Caenorhabditis elegans. Genetics 2005; 172:171-84. [PMID: 16272415 PMCID: PMC1456145 DOI: 10.1534/genetics.105.048207] [Citation(s) in RCA: 133] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
An understanding of the relative contributions of different evolutionary forces on an organism's genome requires an accurate description of the patterns of genetic variation within and between natural populations. To this end, I report a survey of nucleotide polymorphism in six loci from 118 strains of the nematode Caenorhabditis elegans. These strains derive from wild populations of several regions within France, Germany, and new localities in Scotland, in addition to stock center isolates. Overall levels of silent-site diversity are low within and between populations of this self-fertile species, averaging 0.2% in European samples and 0.3% worldwide. Population structure is present despite a lack of association of sequences with geography, and migration appears to occur at all geographic scales. Linkage disequilibrium is extensive in the C. elegans genome, extending even between chromosomes. Nevertheless, recombination is clearly present in the pattern of polymorphisms, indicating that outcrossing is an infrequent, but important, feature in this species ancestry. The range of outcrossing rates consistent with the data is inferred from linkage disequilibrium, using "scattered" samples representing the collecting phase of the coalescent process in a subdivided population. I propose that genetic variation in this species is shaped largely by population subdivision due to self-fertilization coupled with long- and short-range migration between subpopulations.
Collapse
Affiliation(s)
- Asher D Cutter
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom
| |
Collapse
|
65
|
Haddrill PR, Thornton KR, Charlesworth B, Andolfatto P. Multilocus patterns of nucleotide variability and the demographic and selection history of Drosophila melanogaster populations. Genome Res 2005; 15:790-9. [PMID: 15930491 PMCID: PMC1142469 DOI: 10.1101/gr.3541005] [Citation(s) in RCA: 208] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
Uncertainty about the demographic history of populations can hamper genome-wide scans for selection based on population genetic models. To obtain a portrait of the effects of demographic history on genome variability patterns in Drosophila melanogaster populations, we surveyed noncoding DNA polymorphism at 10 X-linked loci in large samples from three African and two non-African populations. All five populations show significant departures from expectations under the standard neutral model. We detect weak but significant differentiation between East (Kenya and Zimbabwe) and West/Central sub-Saharan (Gabon) African populations. A skew toward high-frequency-derived polymorphisms, elevated levels of linkage disequilibrium (LD) and significant heterogeneity in levels of polymorphism and divergence in the Gabon sample suggest that this population is further from mutation-drift equilibrium than the two Eastern African populations. Both non-African populations harbor significantly higher levels of LD, a large excess of high-frequency-derived mutations and extreme heterogeneity among loci in levels of polymorphism and divergence. Rejections of the neutral model in D. melanogaster populations using these and similar features have been interpreted as evidence for an important role for natural selection in shaping genome variability patterns. Based on simulations, we conclude that simple bottleneck models are sufficient to account for most, if not all, polymorphism features of both African and non-African populations. In contrast, we show that a steady-state recurrent hitchhiking model fails to account for several aspects of the data. Demographic departures from equilibrium expectations in both ancestral and derived populations thus represent a serious challenge to detecting positive selection in genome-wide scans using current methodologies.
Collapse
Affiliation(s)
- Penelope R Haddrill
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
| | | | | | | |
Collapse
|
66
|
Williamson SH, Hernandez R, Fledel-Alon A, Zhu L, Nielsen R, Bustamante CD. Simultaneous inference of selection and population growth from patterns of variation in the human genome. Proc Natl Acad Sci U S A 2005; 102:7882-7. [PMID: 15905331 PMCID: PMC1142382 DOI: 10.1073/pnas.0502300102] [Citation(s) in RCA: 249] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2004] [Indexed: 11/18/2022] Open
Abstract
Natural selection and demographic forces can have similar effects on patterns of DNA polymorphism. Therefore, to infer selection from samples of DNA sequences, one must simultaneously account for demographic effects. Here we take a model-based approach to this problem by developing predictions for patterns of polymorphism in the presence of both population size change and natural selection. If data are available from different functional classes of variation, and a priori information suggests that mutations in one of those classes are selectively neutral, then the putatively neutral class can be used to infer demographic parameters, and inferences regarding selection on other classes can be performed given demographic parameter estimates. This procedure is more robust to assumptions regarding the true underlying demography than previous approaches to detecting and analyzing selection. We apply this method to a large polymorphism data set from 301 human genes and find (i) widespread negative selection acting on standing nonsynonymous variation, (ii) that the fitness effects of nonsynonymous mutations are well predicted by several measures of amino acid exchangeability, especially site-specific methods, and (iii) strong evidence for very recent population growth.
Collapse
Affiliation(s)
- Scott H Williamson
- Department of Biological Statistics and Computational Biology, 101 Biotechnology Building, Cornell University, Ithaca, NY 14853, USA
| | | | | | | | | | | |
Collapse
|
67
|
Williamson S, Fledel-Alon A, Bustamante CD. Population genetics of polymorphism and divergence for diploid selection models with arbitrary dominance. Genetics 2005; 168:463-75. [PMID: 15454557 PMCID: PMC1448126 DOI: 10.1534/genetics.103.024745] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We develop a Poisson random-field model of polymorphism and divergence that allows arbitrary dominance relations in a diploid context. This model provides a maximum-likelihood framework for estimating both selection and dominance parameters of new mutations using information on the frequency spectrum of sequence polymorphisms. This is the first DNA sequence-based estimator of the dominance parameter. Our model also leads to a likelihood-ratio test for distinguishing nongenic from genic selection; simulations indicate that this test is quite powerful when a large number of segregating sites are available. We also use simulations to explore the bias in selection parameter estimates caused by unacknowledged dominance relations. When inference is based on the frequency spectrum of polymorphisms, genic selection estimates of the selection parameter can be very strongly biased even for minor deviations from the genic selection model. Surprisingly, however, when inference is based on polymorphism and divergence (McDonald-Kreitman) data, genic selection estimates of the selection parameter are nearly unbiased, even for completely dominant or recessive mutations. Further, we find that weak overdominant selection can increase, rather than decrease, the substitution rate relative to levels of polymorphism. This nonintuitive result has major implications for the interpretation of several popular tests of neutrality.
Collapse
Affiliation(s)
- Scott Williamson
- Department of Biological Statistics and Computational Biology, Cornell University, Ithaca, New York 14853, USA.
| | | | | |
Collapse
|
68
|
Wernegreen JJ, Funk DJ. Mutation exposed: a neutral explanation for extreme base composition of an endosymbiont genome. J Mol Evol 2005; 59:849-58. [PMID: 15599516 DOI: 10.1007/s00239-003-0192-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2003] [Accepted: 06/29/2004] [Indexed: 10/26/2022]
Abstract
The influence of neutral mutation pressure versus selection on base composition evolution is a subject of considerable controversy. Yet the present study represents the first explicit population genetic analysis of this issue in prokaryotes, the group in which base composition variation is most dramatic. Here, we explore the impact of mutation and selection on the dynamics of synonymous changes in Buchnera aphidicola, the AT-rich bacterial endosymbiont of aphids. Specifically, we evaluated three forms of evidence. (i) We compared the frequencies of directional base changes (AT-->GC vs. GC-->AT) at synonymous sites within and between Buchnera species, to test for selective preference versus effective neutrality of these mutational categories. Reconstructed mutational changes across a robust intraspecific phylogeny showed a nearly 1:1 AT-->GC:GC-->AT ratio. Likewise, stationarity of base composition among Buchnera species indicated equal rates of AT-->GC and GC-->AT substitutions. The similarity of these patterns within and between species supported the neutral model. (ii) We observed an equivalence of relative per-site AT mutation rate and current AT content at synonymous sites, indicating that base composition is at mutational equilibrium. (iii) We demonstrated statistically greater equality in the frequency of mutational categories in Buchnera than in parallel mammalian studies that documented selection on synonymous sites. Our results indicate that effectively neutral mutational pressure, rather than selection, represents the major force driving base composition evolution in Buchnera. Thus they further corroborate recent evidence for the critical role of reduced N(e) in the molecular evolution of bacterial endosymbionts.
Collapse
Affiliation(s)
- Jennifer J Wernegreen
- Josephine Bay Paul Center for Comparative Molecular Biology & Evolution, The Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA.
| | | |
Collapse
|
69
|
Inomata N, Goto H, Itoh M, Isono K. A single-amino-acid change of the gustatory receptor gene, Gr5a, has a major effect on trehalose sensitivity in a natural population of Drosophila melanogaster. Genetics 2005; 167:1749-58. [PMID: 15342513 PMCID: PMC1471011 DOI: 10.1534/genetics.104.027045] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Variation in trehalose sensitivity and nucleotide sequence polymorphism of the Gr5a gene encoding the gustatory receptor to sugar trehalose were investigated in 152 male lines of Drosophila melanogaster collected from a natural population. Among the observed 59 segregating sites, some pairs of sites showed significant linkage disequilibrium. A single SNP, which results in the Ala218Thr amino acid change, was significantly associated with trehalose sensitivity, as previously suggested. Threonine at amino acid position 218 was found to be the ancestral form in D. melanogaster, suggesting that low trehalose sensitivity was an ancestral form with respect to the receptor function. There was large genetic variation in trehalose sensitivity. It was continuously distributed, indicating that trehalose sensitivity measured by the behavioral assay is a quantitative trait. These results suggest that apart from the Gr5a gene, other genetic factors contribute to variation in trehalose sensitivity. Nucleotide diversity (pi) and nucleotide variation (theta) per site were 0.00874 and 0.00590, respectively. Fu and Li's test and the MK test showed no significant departure from the expectation of selective neutrality in the Gr5a gene. However, we rejected selective neutrality by Tajima's test and Fay and Wu's test with the observed level of recombination. We discuss possible causes of the observed pattern of nucleotide variation in the gustatory receptor Gr5a gene.
Collapse
Affiliation(s)
- Nobuyuki Inomata
- Department of Biology, Graduate School of Sciences, Kyushu University, Hakozaki, Higashi-ku, Fukuoka 812-8581, Japan.
| | | | | | | |
Collapse
|
70
|
Moyer GR, Winemeller KO, McPhee MV, Turner TF. HISTORICAL DEMOGRAPHY, SELECTION, AND COALESCENCE OF MITOCHONDRIAL AND NUCLEAR GENES IN PROCHILODUS SPECIES OF NORTHERN SOUTH AMERICA. Evolution 2005. [DOI: 10.1111/j.0014-3820.2005.tb01019.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
71
|
Moyer GR, Winemiller KO, McPhee MV, Turner TF. HISTORICAL DEMOGRAPHY, SELECTION, AND COALESCENCE OF MITOCHONDRIAL AND NUCLEAR GENES IN PROCHILODUS SPECIES OF NORTHERN SOUTH AMERICA. Evolution 2005. [DOI: 10.1554/04-280] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
72
|
Rose LE, Bittner-Eddy PD, Langley CH, Holub EB, Michelmore RW, Beynon JL. The maintenance of extreme amino acid diversity at the disease resistance gene, RPP13, in Arabidopsis thaliana. Genetics 2004; 166:1517-27. [PMID: 15082565 PMCID: PMC1470773 DOI: 10.1534/genetics.166.3.1517] [Citation(s) in RCA: 221] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We have used the naturally occurring plant-parasite system of Arabidopsis thaliana and its common parasite Peronospora parasitica (downy mildew) to study the evolution of resistance specificity in the host population. DNA sequence of the resistance gene, RPP13, from 24 accessions, including 20 from the United Kingdom, revealed amino acid sequence diversity higher than that of any protein coding gene reported so far in A. thaliana. A significant excess of amino acid polymorphism segregating within this species is localized within the leucine-rich repeat (LRR) domain of RPP13. These results indicate that single alleles of the gene have not swept through the population, but instead, a diverse collection of alleles have been maintained. Transgenic complementation experiments demonstrate functional differences among alleles in their resistance to various pathogen isolates, suggesting that the extreme amino acid polymorphism in RPP13 is maintained through continual reciprocal selection between host and pathogen.
Collapse
Affiliation(s)
- Laura E Rose
- Center for Population Biology, University of California, Davis, California 95616, USA
| | | | | | | | | | | |
Collapse
|
73
|
Wright SI, Gaut BS. Molecular Population Genetics and the Search for Adaptive Evolution in Plants. Mol Biol Evol 2004; 22:506-19. [PMID: 15525701 DOI: 10.1093/molbev/msi035] [Citation(s) in RCA: 201] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The first papers on plant molecular population genetics were published approximately 10 years ago. Since that time, well over 50 additional studies of plant nucleotide polymorphism have been published, and many of these studies focused on detecting the signature of balancing or positive selection at a locus. In this review, we discuss some of the theoretical and statistical issues surrounding the detection of selection, with focus on plant populations, and we also summarize the empirical plant molecular population genetics literature. At face value, the literature suggests that a history of balancing or positive selection in plant genes is rampant. In two well-studied taxa (maize and Arabidopsis) over 20% of studied genes have been interpreted as containing the signature of selection. We argue that this is probably an overstatement of the prevalence of natural selection in plant genomes, for two reasons. First, demographic effects are difficult to incorporate and have generally not been well integrated into the plant population genetics literature. Second, the genes studied to date are not a random sample, so selected genes may be overrepresented. The next generation of studies in plant molecular population genetics requires additional sampling of local populations, explicit comparisons among loci, and improved theoretical methods to control for demography. Eventually, candidate loci should be confirmed by explicit consideration of phenotypic effects.
Collapse
Affiliation(s)
- Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of California, Irvine, USA
| | | |
Collapse
|
74
|
Verrelli BC, Tishkoff SA. Signatures of selection and gene conversion associated with human color vision variation. Am J Hum Genet 2004; 75:363-75. [PMID: 15252758 PMCID: PMC1182016 DOI: 10.1086/423287] [Citation(s) in RCA: 81] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2004] [Accepted: 06/10/2004] [Indexed: 11/03/2022] Open
Abstract
Trichromatic color vision in humans results from the combination of red, green, and blue photopigment opsins. Although color vision genes have been the targets of active molecular and psychophysical research on color vision abnormalities, little is known about patterns of normal genetic variation in these genes among global human populations. The current study presents nucleotide sequence analyses and tests of neutrality for a 5.5-kb region of the X-linked long-wave "red" opsin gene (OPN1LW) in 236 individuals from ethnically diverse human populations. Our analysis of the recombination landscape across OPN1LW reveals an unusual haplotype structure associated with amino acid replacement variation in exon 3 that is consistent with gene conversion. Compared with the absence of OPN1LW amino acid replacement fixation since divergence from chimpanzee, the human population exhibits a significant excess of high-frequency OPN1LW replacements. Our results suggest that subtle changes in L-cone opsin wavelength absorption may have been adaptive during human evolution.
Collapse
Affiliation(s)
- Brian C Verrelli
- Department of Biology, University of Maryland, College Park 20742, USA
| | | |
Collapse
|
75
|
Bierne N, Eyre-Walker A. The genomic rate of adaptive amino acid substitution in Drosophila. Mol Biol Evol 2004; 21:1350-60. [PMID: 15044594 DOI: 10.1093/molbev/msh134] [Citation(s) in RCA: 193] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The proportion of amino acid substitutions driven by adaptive evolution can potentially be estimated from polymorphism and divergence data by an extension of the McDonald-Kreitman test. We have developed a maximum-likelihood method to do this and have applied our method to several data sets from three Drosophila species: D. melanogaster, D. simulans, and D. yakuba. The estimated number of adaptive substitutions per codon is not uniformly distributed among genes, but follows a leptokurtic distribution. However, the proportion of amino acid substitutions fixed by adaptive evolution seems to be remarkably constant across the genome (i.e., the proportion of amino acid substitutions that are adaptive appears to be the same in fast-evolving and slow-evolving genes; fast-evolving genes have higher numbers of both adaptive and neutral substitutions). Our estimates do not seem to be significantly biased by selection on synonymous codon use or by the assumption of independence among sites. Nevertheless, an accurate estimate is hampered by the existence of slightly deleterious mutations and variations in effective population size. The analysis of several Drosophila data sets suggests that approximately 25% +/- 20% of amino acid substitutions were driven by positive selection in the divergence between D. simulans and D. yakuba.
Collapse
Affiliation(s)
- Nicolas Bierne
- Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton, UK
| | | |
Collapse
|
76
|
Halligan DL, Eyre-Walker A, Andolfatto P, Keightley PD. Patterns of evolutionary constraints in intronic and intergenic DNA of Drosophila. Genome Res 2004; 14:273-9. [PMID: 14762063 PMCID: PMC327102 DOI: 10.1101/gr.1329204] [Citation(s) in RCA: 91] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
We develop methods to infer levels of evolutionary constraints in the genome by comparing rates of nucleotide substitution in noncoding DNA with rates predicted from rates of synonymous site evolution in adjacent genes or other putatively neutrally evolving sites, while accounting for differences in base composition. We apply the methods to estimate levels of constraint in noncoding DNA of Drosophila. In introns, constraint (the estimated fraction of mutations that are selectively eliminated) is absolute at the 5' and 3' splice junction dinucleotides, and averages 72% in base pairs 3-6 at the 5'-end. Constraint at the 5' base pairs 3-6 is significantly lower in the lineage leading to Drosophila melanogaster than in Drosophila simulans, a finding that agrees with other features of genome evolution in Drosophila and indicates that the effect of selection on intron function has been weaker in the melanogaster lineage. Elsewhere in intron sequences, the rate of nucleotide substitution is significantly higher than at synonymous sites. By using intronic sites outside splice control regions as a putative neutrally evolving standard, constraint in the 500 bp of intergenic DNA upstream and downstream regions of protein-coding genes averages approximately 44%. Although the estimated level of constraint in intergenic regions close to genes is only about one-half of that of amino acid sites, selection against single-nucleotide mutations in intergenic DNA makes a substantial contribution to the mutation load in Drosophila.
Collapse
Affiliation(s)
- Daniel L Halligan
- University of Edinburgh, School of Biological Sciences, Edinburgh EH9 3JT, UK
| | | | | | | |
Collapse
|
77
|
Abstract
Synonymous codons are not used at random, significantly influencing the base composition of the genome. The selection-mutation-drift model proposes that this bias reflects natural selection in favor of a subset of preferred codons. Previous estimates in Drosophila of the intensity of selective forces involved seem too large to be reconciled with theoretical predictions of the level of codon bias. This probably results from confounding effects of the demographic histories of the species concerned. We have studied three species of the virilis group of Drosophila, which are more likely to satisfy the assumptions of the evolutionary models. We analyzed the patterns of polymorphism and divergence in a sample of 18 genes and applied a new method for estimating the intensity of selection on synonymous mutations based on the frequencies of unpreferred mutations among polymorphic sites. This yielded estimates of selection intensities (N(e)s) of the order of 0.65, which is more compatible with the observed levels of codon bias. Our results support the action of both selection and mutational bias on codon usage bias and suggest that codon usage and genome base composition in the D. americana lineage are in approximate equilibrium. Biased gene conversion may also contribute to the observed patterns.
Collapse
Affiliation(s)
- Xulio Maside
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Kings Buildings, West Mains Road, EH9 3JT Edinburgh, UK.
| | | | | |
Collapse
|
78
|
Schein M, Yang Z, Mitchell-Olds T, Schmid KJ. Rapid evolution of a pollen-specific oleosin-like gene family from Arabidopsis thaliana and closely related species. Mol Biol Evol 2004; 21:659-69. [PMID: 14739246 DOI: 10.1093/molbev/msh059] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
It has been shown in a variety of species that genes expressed in reproductive tissues evolve rapidly, which often appears to be the result of positive Darwinian selection. We investigated the evolution of a family of seven pollen-specific oleosin-like proteins (or oleopollenins) in Arabidopsis thaliana and two closely related species. More than 30 kb of a genomic region that harbors the complete, tandemly repeated oleopollenin cluster were sequenced from Arabidopsis lyrata ssp. lyrata, and Boechera drummondii. A phylogenetic analysis of the complete gene cluster from these three species and from Brassica oleracea confirmed its rapid evolution resulting from gene duplication and gene loss events, numerous amino acid substitutions, and insertions/deletions in the coding sequence. Independent duplications were inferred in the lineages leading to Arabidopsis and to Brassica, and gene loss was inferred in the lineage leading to B. drummondii. Comparisons of the ratio of nonsynonymous (d(N)) and synonymous (d(S)) divergence revealed that the oleopollenins are among the most rapidly evolving proteins currently known from Arabidopsis and that they may evolve under positive Darwinian selection. Reverse transcriptase polymerase chain reaction analysis demonstrated the expression of oleopollenins in flowers of the outcrossing A. lyrata, the selfing B. drummondii, and the apomictic Boechera holboellii, suggesting that oleopollenins play an important role in species with different breeding systems. These results are consistent with a putative function in species recognition, but further analyses of protein function and sequence variation in species with different breeding systems are necessary to reveal the underlying causes for the rapid evolution of oleopollenins.
Collapse
Affiliation(s)
- Manja Schein
- Department of Genetics and Evolution, Max-Planck-Institute of Chemical Ecology, Jena, Germany
| | | | | | | |
Collapse
|
79
|
Piganeau G, Eyre-Walker A. Estimating the distribution of fitness effects from DNA sequence data: implications for the molecular clock. Proc Natl Acad Sci U S A 2003; 100:10335-40. [PMID: 12925735 PMCID: PMC193562 DOI: 10.1073/pnas.1833064100] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2003] [Indexed: 11/18/2022] Open
Abstract
We present a method for estimating the distribution of fitness effects of new amino acid mutations when those mutations can be assumed to be slightly advantageous, slightly deleterious, or strongly deleterious. We apply the method to mitochondrial data from several different species. In the majority of the data sets, the shape of the distribution is approximately exponential. Our results provide an estimate of the distribution of fitness effects of weakly selected mutations and provide a possible explanation for why the molecular clock is fairly constant across taxa and time.
Collapse
Affiliation(s)
- Gwenaël Piganeau
- Centre for the Study of Evolution and School of Biological Sciences, University of Sussex, Brighton BN1 9QG, United Kingdom
| | | |
Collapse
|
80
|
Abstract
The primary structures of peptides may be adapted for efficient synthesis as well as proper function. Here, the Saccharomyces cerevisiae genome sequence, DNA microarray expression data, tRNA gene numbers, and functional categorizations of proteins are employed to determine whether the amino acid composition of peptides reflects natural selection to optimize the speed and accuracy of translation. Strong relationships between synonymous codon usage bias and estimates of transcript abundance suggest that DNA array data serve as adequate predictors of translation rates. Amino acid usage also shows striking relationships with expression levels. Stronger correlations between tRNA concentrations and amino acid abundances among highly expressed proteins than among less abundant proteins support adaptation of both tRNA abundances and amino acid usage to enhance the speed and accuracy of protein synthesis. Natural selection for efficient synthesis appears to also favor shorter proteins as a function of their expression levels. Comparisons restricted to proteins within functional classes are employed to control for differences in amino acid composition and protein size that reflect differences in the functional requirements of proteins expressed at different levels.
Collapse
Affiliation(s)
- Hiroshi Akashi
- Institute of Molecular Evolutionary Genetics and Department of Biology, 208 Mueller Laboratory, Pennsylvania State University, University Park, PA 16802, USA.
| |
Collapse
|
81
|
Abstract
Evidence from disparate sources suggests that natural selection may often play a role in the evolution of host immune system proteins. However, there have been few attempts to make general population genetic inferences on the basis of analysis of several immune-system-related genes from a single species. Here we present DNA polymorphism and divergence data from 34 genes thought to function in the innate immune system of Drosophila simulans and compare these data to those from 28 nonimmunity genes sequenced from the same lines. Several statistics, including average K(A)/K(S) ratio, average silent heterozygosity, and average haplotype diversity, significantly differ between the immunity and nonimmunity genes, suggesting an important role for directional selection in immune system protein evolution. In contrast to data from mammalian immunoglobulins and other proteins, we find no strong evidence for the selective maintenance of protein diversity in Drosophila immune system proteins. This may be a consequence of Drosophila's generalized innate immune response.
Collapse
Affiliation(s)
- Todd A Schlenke
- Section of Evolution and Ecology, Division of Biological Sciences, Storer Hall, University of California, Davis, CA 95616, USA.
| | | |
Collapse
|
82
|
Bustamante CD, Nielsen R, Hartl DL. Maximum likelihood and Bayesian methods for estimating the distribution of selective effects among classes of mutations using DNA polymorphism data. Theor Popul Biol 2003; 63:91-103. [PMID: 12615493 DOI: 10.1016/s0040-5809(02)00050-3] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Maximum likelihood and Bayesian approaches are presented for analyzing hierarchical statistical models of natural selection operating on DNA polymorphism within a panmictic population. For analyzing Bayesian models, we present Markov chain Monte-Carlo (MCMC) methods for sampling from the joint posterior distribution of parameters. For frequentist analysis, an Expectation-Maximization (EM) algorithm is presented for finding the maximum likelihood estimate of the genome wide mean and variance in selection intensity among classes of mutations. The framework presented here provides an ideal setting for modeling mutations dispersed through the genome and, in particular, for the analysis of how natural selection operates on different classes of single nucleotide polymorphisms (SNPs).
Collapse
Affiliation(s)
- Carlos D Bustamante
- Mathematical Genetics Group, Department of Statistics, University of Oxford, 1 South Parks Road, Oxford, UK OX1 3TG.
| | | | | |
Collapse
|
83
|
Abstract
Estimates of the scaled selection coefficient, gamma of Sawyer and Hartl, are shown to be remarkably robust to population subdivision. Estimates of mutation parameters and divergence times, in contrast, are very sensitive to subdivision. These results follow from an analysis of natural selection and genetic drift in the island model of subdivision in the limit of a very large number of subpopulations, or demes. In particular, a diffusion process is shown to hold for the average allele frequency among demes in which the level of subdivision sets the timescale of drift and selection and determines the dynamic equilibrium of allele frequencies among demes. This provides a framework for inference about mutation, selection, divergence, and migration when data are available from a number of unlinked nucleotide sites. The effects of subdivision on parameter estimates depend on the distribution of samples among demes. If samples are taken singly from different demes, the only effect of subdivision is in the rescaling of mutation and divergence-time parameters. If multiple samples are taken from one or more demes, high levels of within-deme relatedness lead to low levels of intraspecies polymorphism and increase the number of fixed differences between samples from two species. If subdivision is ignored, mutation parameters are underestimated and the species divergence time is overestimated, sometimes quite drastically. Estimates of the strength of selection are much less strongly affected and always in a conservative direction.
Collapse
Affiliation(s)
- John Wakeley
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA.
| |
Collapse
|
84
|
Lercher MJ, Smith NGC, Eyre-Walker A, Hurst LD. The evolution of isochores: evidence from SNP frequency distributions. Genetics 2002; 162:1805-10. [PMID: 12524350 PMCID: PMC1462390 DOI: 10.1093/genetics/162.4.1805] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The large-scale systematic variation in nucleotide composition along mammalian and avian genomes has been a focus of the debate between neutralist and selectionist views of molecular evolution. Here we test whether the compositional variation is due to mutation bias using two new tests, which do not assume compositional equilibrium. In the first test we assume a standard population genetics model, but in the second we make no assumptions about the underlying population genetics. We apply the tests to single-nucleotide polymorphism data from noncoding regions of the human genome. Both models of neutral mutation bias fit the frequency distributions of SNPs segregating in low- and medium-GC-content regions of the genome adequately, although both suggest compositional nonequilibrium. However, neither model fits the frequency distribution of SNPs from the high-GC-content regions. In contrast, a simple population genetics model that incorporates selection or biased gene conversion cannot be rejected. The results suggest that mutation biases are not solely responsible for the compositional biases found in noncoding regions.
Collapse
Affiliation(s)
- Martin J Lercher
- Department of Biology and Biochemistry, University of Bath, Bath BA2 7AY, United Kingdom.
| | | | | | | |
Collapse
|
85
|
Abstract
Selective fixation of beneficial mutations reduces levels of linked, neutral variation. The magnitude of this "hitchhiking effect" is determined by the strength of selection and the recombination rate between selected and neutral sites. Thus, depending on the values of these parameters and the frequency with which directional selection occurs, the genomic scale over which directional selection reduces levels of linked variation may vary widely. Here we present a permutation-based analysis of nucleotide polymorphisms and fixations in Drosophila simulans. We show evidence of pervasive small-scale hitchhiking effects in this lineage. Furthermore, our results reveal that different types of fixations are associated with different levels of linked variation.
Collapse
Affiliation(s)
- Andrew D Kern
- Center for Population Biology, University of California, Davis, California 95616, USA.
| | | | | |
Collapse
|
86
|
Begun DJ, Whitley P. Molecular population genetics of Xdh and the evolution of base composition in Drosophila. Genetics 2002; 162:1725-35. [PMID: 12524344 PMCID: PMC1462376 DOI: 10.1093/genetics/162.4.1725] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Few loci have been measured for DNA polymorphism and divergence in several species. Here we report such data from the protein-coding region of xanthine dehydrogenase (Xdh) in 22 species of Drosophila. Many of our samples were from closely related species, allowing us to confidently assign substitutions to individual lineages. Surprisingly, Xdh appears to be fixing more A/T mutations than G/C mutations in most lineages, leading to evolution of higher A/T content in the recent past. We found no compelling evidence for selection on protein variation, though some aspects of the data support the notion that a significant fraction of amino acid polymorphisms are slightly deleterious. Finally, we found no convincing evidence that levels of silent heterozygosity are associated with rates of protein evolution.
Collapse
Affiliation(s)
- David J Begun
- Section of Integrative Biology, University of Texas, Austin, Texas 78712, USA.
| | | |
Collapse
|
87
|
Abstract
Artifactual evidence of adaptive amino acid substitution can be generated within a McDonald-Kreitman test if some amino acid mutations are slightly deleterious and there has been an increase in effective population size. Here I investigate the conditions under which this occurs. I show that fairly small increases in effective population size can generate artifactual evidence of positive selection if there is no selection upon synonymous codon use. This problem is exacerbated by the removal of low-frequency polymorphisms. However, selection on synonymous codon use restricts the conditions under which artifactual evidence of adaptive evolution is produced.
Collapse
Affiliation(s)
- Adam Eyre-Walker
- Centre for the Study of Evolution, University of Sussex, Brighton BN1 9QG, United Kingdom.
| |
Collapse
|
88
|
Wright SI, Lauga B, Charlesworth D. Rates and patterns of molecular evolution in inbred and outbred Arabidopsis. Mol Biol Evol 2002; 19:1407-20. [PMID: 12200469 DOI: 10.1093/oxfordjournals.molbev.a004204] [Citation(s) in RCA: 167] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The evolution of self-fertilization is associated with a large reduction in the effective rate of recombination and a corresponding decline in effective population size. If many spontaneous mutations are slightly deleterious, this shift in the breeding system is expected to lead to a reduced efficacy of natural selection and genome-wide changes in the rates of molecular evolution. Here, we investigate the effects of the breeding system on molecular evolution in the highly self-fertilizing plant Arabidopsis thaliana by comparing its coding and noncoding genomic regions with those of its close outcrossing relative, the self-incompatible A. lyrata. More distantly related species in the Brassicaceae are used as outgroups to polarize the substitutions along each lineage. In contrast to expectations, no significant difference in the rates of protein evolution is observed between selfing and outcrossing Arabidopsis species. Similarly, no consistent overall difference in codon bias is observed between the species, although for low-biased genes A. lyrata shows significantly higher major codon usage. There is also evidence of intron size evolution in A. thaliana, which has consistently smaller introns than its outcrossing congener, potentially reflecting directional selection on intron size. The results are discussed in the context of heterogeneity in selection coefficients across loci and the effects of life history and population structure on rates of molecular evolution. Using estimates of substitution rates in coding regions and approximate estimates of divergence and generation times, the genomic deleterious mutation rate (U) for amino acid substitutions in Arabidopsis is estimated to be approximately 0.2-0.6 per generation.
Collapse
Affiliation(s)
- Stephen I Wright
- Institute of Cell, Animal, and Population Biology, Ashworth Laboratories, University of Edinburgh.
| | | | | |
Collapse
|
89
|
Hahn MW, Rausher MD, Cunningham CW. Distinguishing between selection and population expansion in an experimental lineage of bacteriophage T7. Genetics 2002; 161:11-20. [PMID: 12019219 PMCID: PMC1462109 DOI: 10.1093/genetics/161.1.11] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Experimental evolution of short-lived organisms offers the opportunity to study the dynamics of polymorphism over time in a controlled environment. Here, we characterize DNA polymorphism data over time for four genes in bacteriophage T7. Our experiment ran for 2500 generations and populations were sampled after 500, 2000, and 2500 generations. We detect positive selection, purifying ("negative") selection, and population expansion in our experiment. We also present a statistical test that is able to distinguish demographic from selective events, processes that are hard to identify individually because both often produce an excess of rare mutations. Our "heterogeneity test" modifies common statistics measuring the frequency spectrum of polymorphism (e.g., Fu and Li's D) by looking for processes producing different patterns on nonsynonymous and synonymous mutations. Test results agree with the known conditions of the experiment, and we are therefore confident that this test offers a tool to evaluate natural populations. Our results suggest that instances of segregating deleterious mutations may be common, but as yet undetected, in nature.
Collapse
Affiliation(s)
- Matthew W Hahn
- Evolution, Ecology, and Organismal Biology Group, Department of Biology, Duke University, Durham, North Carolina 27708, USA.
| | | | | |
Collapse
|
90
|
Bustamante CD, Wakeley J, Sawyer S, Hartl DL. Directional selection and the site-frequency spectrum. Genetics 2001; 159:1779-88. [PMID: 11779814 PMCID: PMC1461920 DOI: 10.1093/genetics/159.4.1779] [Citation(s) in RCA: 144] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
In this article we explore statistical properties of the maximum-likelihood estimates (MLEs) of the selection and mutation parameters in a Poisson random field population genetics model of directional selection at DNA sites. We derive the asymptotic variances and covariance of the MLEs and explore the power of the likelihood ratio tests (LRT) of neutrality for varying levels of mutation and selection as well as the robustness of the LRT to deviations from the assumption of free recombination among sites. We also discuss the coverage of confidence intervals on the basis of two standard-likelihood methods. We find that the LRT has high power to detect deviations from neutrality and that the maximum-likelihood estimation performs very well when the ancestral states of all mutations in the sample are known. When the ancestral states are not known, the test has high power to detect deviations from neutrality for negative selection but not for positive selection. We also find that the LRT is not robust to deviations from the assumption of independence among sites.
Collapse
Affiliation(s)
- C D Bustamante
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA
| | | | | | | |
Collapse
|
91
|
Munte A, Aguade M, Segarra C. Changes in the recombinational environment affect divergence in the yellow gene of Drosophila. Mol Biol Evol 2001; 18:1045-56. [PMID: 11371593 DOI: 10.1093/oxfordjournals.molbev.a003876] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The complete coding region of the yellow (y) gene was sequenced in different Drosophila species. In the species of the melanogaster subgroup (D. melanogaster, D. simulans, D. mauritiana, D. yakuba, and D. erecta), this gene is located at the tip of the X chromosome in a region with a strong reduction in recombination rate. In contrast, in D. ananassae (included in the ananassae subgroup of the melanogaster group) and in the obscura group species (D. subobscura, D. madeirensis, D. guanche, and D. pseudoobscura), the y gene is located in regions with normal recombination rates. As predicted by the hitchhiking and background selection models, this change in the recombinational environment affected synonymous divergence in the y-gene-coding region. Estimates of the number of synonymous substitutions per site were much lower between the obscura group species and D. ananassae than between the species of the obscura group and the melanogaster subgroup. In fact, a highly significant increase in the rate of synonymous substitution was detected in all lineages leading to the species of the melanogaster subgroup relative to the D. ananassae lineage. This increase can be explained by a higher fixation rate of mutations from preferred to unpreferred codons (slightly deleterious mutations). The lower codon bias detected in all species of the melanogaster subgroup relative to D. ananassae (or to the obscura group species) would be consistent with this proposal. Therefore, at least in Drosophila, changes in the recombination rate in different lineages might cause deviations of the molecular-clock hypothesis and contribute to the overdispersion of the rate of synonymous substitution. In contrast, the change in the recombinational environment of the y gene has no detectable effect on the rate of amino acid replacement in the Yellow protein.
Collapse
Affiliation(s)
- A Munte
- Departament de Genètica, Facultat de Biologia, Universitat de Barcelona, Diagonal 645, 08071 Barcelona, Spain
| | | | | |
Collapse
|
92
|
Abstract
Patterns of codon bias in Drosophila suggest that silent mutations can be classified into two types: unpreferred (slightly deleterious) and preferred (slightly beneficial). Results of previous analyses of polymorphism and divergence in Drosophila simulans were interpreted as supporting a mutation-selection-drift model in which slightly deleterious, silent mutants make significantly greater contributions to polymorphism than to divergence. Frequencies of unpreferred polymorphisms were inferred to be lower than frequencies of other silent polymorphisms. Here, I analyzed additional D. simulans data to reevaluate the support for these ideas. I found that D. simulans has fixed more unpreferred than preferred mutations, suggesting that this lineage has not been at mutation-selection-drift equilibrium at silent sites. Frequencies of polarized unpreferred polymorphisms are not skewed toward rare alleles. However, frequencies of unpolarized unpreferred codons are lower in high-bias genes than in low-bias genes. This supports the idea that unpreferred codons are borderline deleterious mutations. Purifying selection on silent sites appears to be stronger at twofold-degenerate codons than at fourfold-degenerate codons. Finally, I found that X-linked polymorphisms occur at a higher average frequency than polymorphisms on chromosome arm 3R, even though an average X-linked site is significantly less likely to be polymorphic than an average site on 3R. This result supports a previous analysis of D. simulans indicating different population genetics of X-linked versus autosomal mutations.
Collapse
Affiliation(s)
- D J Begun
- Section of Evolution and Ecology, University of California at Davis, 95616, USA.
| |
Collapse
|
93
|
Tiffin P, Gaut BS. Sequence diversity in the tetraploid Zea perennis and the closely related diploid Z. diploperennis: insights from four nuclear loci. Genetics 2001; 158:401-12. [PMID: 11333248 PMCID: PMC1461631 DOI: 10.1093/genetics/158.1.401] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Polyploidy has been an extremely common phenomenon in the evolutionary history of angiosperms. Despite this there are few data available to evaluate the effects of polyploidy on genetic diversity and to compare the relative effects of drift and selection in polyploids and related diploids. We investigated DNA sequence diversity at four nuclear loci (adh1, glb1, c1, and waxy) from the tetraploid Zea perennis and the closely related diploid Z. diploperennis. Contrary to expectations, we detected no strong evidence for greater genetic diversity in the tetraploid, or for consistent differences in the effects of either drift or selection between the tetraploid and the diploid. Our failure to find greater genetic diversity in Z. perennis may result from its relatively recent origin or demographic factors associated with its origin. In addition to comparing genetic diversity in the two species, we constructed genealogies to infer the evolutionary origin of Z. perennis. Although these genealogies are equivocal regarding the mode of origin, several aspects of these genealogies support an autotetraploid origin. Consistent with previous molecular data the genealogies do not, however, support the division of Zea into two sections, the section Zea and the section Luxuriantes.
Collapse
Affiliation(s)
- P Tiffin
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA 92697-2525, USA.
| | | |
Collapse
|
94
|
Przeworski M, Wall JD, Andolfatto P. Recombination and the frequency spectrum in Drosophila melanogaster and Drosophila simulans. Mol Biol Evol 2001; 18:291-8. [PMID: 11230530 DOI: 10.1093/oxfordjournals.molbev.a003805] [Citation(s) in RCA: 45] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Most "tests of neutrality" assess whether particular data sets depart from the predictions of a standard neutral model with no recombination. For Drosophila, where nuclear polymorphism data routinely show evidence of genetic exchange, the assumption of no recombination is often unrealistic. In addition, while conservative, this assumption is made at the cost of a great loss in power. Perhaps as a result, tests of the frequency spectrum based on zero recombination suggest an adequate fit of Drosophila polymorphism data to the predictions of the standard neutral model. Here, we analyze the frequency spectrum of a large number of loci in Drosophila melanogaster and D. simulans using two summary statistics. We use an estimate of the population recombination rate based on a laboratory estimate of the rate of crossing over per physical length and an estimate of the species' effective population size. In contrast to previous studies, we find that roughly half of the loci depart from the predictions of the standard neutral model. The extent of the departure depends on the exact recombination rate, but the global pattern that emerges is robust. Interestingly, these departures from neutral expectations are not unidirectional. The large variance in outcomes may be due to a complex demographic history and inconsistent sampling, or to the pervasive action of natural selection.
Collapse
Affiliation(s)
- M Przeworski
- Department of Statistics, Oxford University, Oxford, England.
| | | | | |
Collapse
|
95
|
McVean GA, Vieira J. Inferring parameters of mutation, selection and demography from patterns of synonymous site evolution in Drosophila. Genetics 2001; 157:245-57. [PMID: 11139506 PMCID: PMC1461462 DOI: 10.1093/genetics/157.1.245] [Citation(s) in RCA: 117] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Selection acting on codon usage can cause patterns of synonymous evolution to deviate considerably from those expected under neutrality. To investigate the quantitative relationship between parameters of mutation, selection, and demography, and patterns of synonymous site divergence, we have developed a novel combination of population genetic models and likelihood methods of phylogenetic sequence analysis. Comparing 50 orthologous gene pairs from Drosophila melanogaster and D. virilis and 27 from D. melanogaster and D. simulans, we show considerable variation between amino acids and genes in the strength of selection acting on codon usage and find evidence for both long-term and short-term changes in the strength of selection between species. Remarkably, D. melanogaster shows no evidence of current selection on codon usage, while its sister species D. simulans experiences only half the selection pressure for codon usage of their common ancestor. We also find evidence for considerable base asymmetries in the rate of mutation, such that the average synonymous mutation rate is 20-30% higher than in noncoding regions. A Bayesian approach is adopted to investigate how accounting for selection on codon usage influences estimates of the parameters of mutation.
Collapse
Affiliation(s)
- G A McVean
- Institute of Cell, Animal and Population Biology, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
96
|
Rand DM, Weinreich DM, Cezairliyan BO. Neutrality tests of conservative-radical amino acid changes in nuclear- and mitochondrially-encoded proteins. Gene 2000; 261:115-25. [PMID: 11164043 DOI: 10.1016/s0378-1119(00)00483-2] [Citation(s) in RCA: 50] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
The neutralist-selectionist debate should not be viewed as a dichotomy but as a continuum. While the strictly neutral model suggests a neutralist-selectionist dichotomy, the nearly neutral model is a continuous model spanning strict neutrality through weak selection (Ns approximately 1) to deterministic selection (Ns>3). We illustrate these points with polymorphism and divergence data from a sample of 73 genes (31 mitochondrial, 36 nuclear genes from Drosophila, and six Arabidopsis data sets). In an earlier study we used the McDonald-Kreitman (MK) test to show that amino acid replacement polymorphism in animal mitochondrial genes and Arabidopsis genes show a consistent trend toward negative selection, whereas nuclear genes from Drosophila span a range from negative selection, through neutrality, to positive selection. Here we analyze a subset of these genes (13 Drosophila nuclear, ten mitochondrial, and six Arabidopsis nuclear) for polymorphism and divergence of conservative and radical amino acid replacements (a protein-based conservative-radical MK, or pMK, test). The distinct patterns of selection between the different genomes is not apparent with the pMK test. Different definitions of conservative and radical (based on amino acid polarity, volume or charge) give inconsistent results across genes. We suggest that segregating fitness difference between silent and replacement mutations are more visible to selection than are segregating fitness differences between conservative and radical amino acid mutations. New data on the variation among genes with different opportunities for positive and negative selection are as important to the continuum view of the neutralist-selectionist debate as is the distribution of selection coefficients within individual genes.
Collapse
Affiliation(s)
- D M Rand
- Department of Ecology and Evolutionary Biology, Box G-W, 69 Brown Street, Brown University, Providence, RI 02912, USA.
| | | | | |
Collapse
|
97
|
Verrelli BC, Eanes WF. Extensive amino acid polymorphism at the pgm locus is consistent with adaptive protein evolution in Drosophila melanogaster. Genetics 2000; 156:1737-52. [PMID: 11102370 PMCID: PMC1461360 DOI: 10.1093/genetics/156.4.1737] [Citation(s) in RCA: 40] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
PGM plays a central role in the glycolytic pathway at the branch point leading to glycogen metabolism and is highly polymorphic in allozyme studies of many species. We have characterized the nucleotide diversity across the Pgm gene in Drosophila melanogaster and D. simulans to investigate the role that protein polymorphism plays at this crucial metabolic branch point shared with several other enzymes. Although D. melanogaster and D. simulans share common allozyme mobility alleles, we find these allozymes are the result of many different amino acid changes at the nucleotide level. In addition, specific allozyme classes within species contain several amino acid changes, which may explain the absence of latitudinal clines for PGM allozyme alleles, the lack of association of PGM allozymes with the cosmopolitan In(3L)P inversion, and the failure to detect differences between PGM allozymes in functional studies. We find a significant excess of amino acid polymorphisms within D. melanogaster when compared to the complete absence of fixed replacements with D. simulans. There is also strong linkage disequilibrium across the 2354 bp of the Pgm locus, which may be explained by a specific amino acid haplotype that is high in frequency yet contains an excess of singleton polymorphisms. Like G6pd, Pgm shows strong evidence for a branch point enzyme that exhibits adaptive protein evolution.
Collapse
Affiliation(s)
- B C Verrelli
- Department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794-5245, USA.
| | | |
Collapse
|
98
|
Comeron JM, Kreitman M. The correlation between intron length and recombination in drosophila. Dynamic equilibrium between mutational and selective forces. Genetics 2000; 156:1175-90. [PMID: 11063693 PMCID: PMC1461334 DOI: 10.1093/genetics/156.3.1175] [Citation(s) in RCA: 150] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Intron length is negatively correlated with recombination in both Drosophila melanogaster and humans. This correlation is not likely to be the result of mutational processes alone: evolutionary analysis of intron length polymorphism in D. melanogaster reveals equivalent ratios of deletion to insertion in regions of high and low recombination. The polymorphism data do reveal, however, an excess of deletions relative to insertions (i.e., a deletion bias), with an overall deletion-to-insertion events ratio of 1.35. We propose two types of selection favoring longer intron lengths. First, the natural mutational bias toward deletion must be opposed by strong selection in very short introns to maintain the minimum intron length needed for the intron splicing reaction. Second, selection will favor insertions in introns that increase recombination between mutations under the influence of selection in adjacent exons. Mutations that increase recombination, even slightly, will be selectively favored because they reduce interference among selected mutations. Interference selection acting on intron length mutations must be very weak, as indicated by frequency spectrum analysis of Drosophila intron length polymorphism, making the equilibrium for intron length sensitive to changes in the recombinational environment and population size. One consequence of this sensitivity is that the advantage of longer introns is expected to decrease inversely with the rate of recombination, thus leading to a negative correlation between intron length and recombination rate. Also in accord with this model, intron length differs between closely related Drosophila species, with the longest variant present more often in D. melanogaster than in D. simulans. We suggest that the study of the proposed dynamic model, taking into account interference among selected sites, might shed light on many aspects of the comparative biology of genome sizes including the C value paradox.
Collapse
Affiliation(s)
- J M Comeron
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois 60637, USA.
| | | |
Collapse
|
99
|
Duvernell DD, Eanes WF. Contrasting molecular population genetics of four hexokinases in Drosophila melanogaster, D. simulans and D. yakuba. Genetics 2000; 156:1191-201. [PMID: 11063694 PMCID: PMC1461338 DOI: 10.1093/genetics/156.3.1191] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
As part of a larger study contrasting patterns of variation in regulatory and nonregulatory enzymes of the central metabolic pathways we have examined the molecular variation in four uncharacterized hexokinase genes unique to muscle, fat body, and testis in Drosophila melanogaster, D. simulans, and D. yakuba. Earlier isoenzyme studies had designated these genes as Hex-A, Hex-C, and Hex-t. There are two tightly linked testes-specific genes designated here as Hex-t1 and Hex-t2. Substantial and concordant differences across species are seen in levels of both amino acid and silent polymorphism. The flight muscle form Hex-A is the most conserved followed by the fat body hexokinase Hex-C and testis-specific hexokinases Hex-t1 and Hex-t2. While constraints acting at the amino acid level are expected, the silent polymorphisms follow this pattern as well. All genes are in regions of normal recombination, therefore hitchhiking and background selection are not likely causes of interlocus differences. In D. melanogaster latitudinal clines are seen for amino acid polymorphisms at the Hex-C and Hex-t2 loci. There is evidence for accelerated amino acid substitution in Hex-t1 that has lost residues known to be associated with glucose and glucose-6-phosphate binding. D. simulans shows substantial linkage phase structuring that suggests historical population subdivision.
Collapse
Affiliation(s)
- D D Duvernell
- Department of Ecology and Evolution, State University of New York, Stony Brook, New York 11794, USA
| | | |
Collapse
|
100
|
Weinreich DM, Rand DM. Contrasting patterns of nonneutral evolution in proteins encoded in nuclear and mitochondrial genomes. Genetics 2000; 156:385-99. [PMID: 10978302 PMCID: PMC1461243 DOI: 10.1093/genetics/156.1.385] [Citation(s) in RCA: 92] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We report that patterns of nonneutral DNA sequence evolution among published nuclear and mitochondrially encoded protein-coding loci differ significantly in animals. Whereas an apparent excess of amino acid polymorphism is seen in most (25/31) mitochondrial genes, this pattern is seen in fewer than half (15/36) of the nuclear data sets. This differentiation is even greater among data sets with significant departures from neutrality (14/15 vs. 1/6). Using forward simulations, we examined patterns of nonneutral evolution using parameters chosen to mimic the differences between mitochondrial and nuclear genetics (we varied recombination rate, population size, mutation rate, selective dominance, and intensity of germ line bottleneck). Patterns of evolution were correlated only with effective population size and strength of selection, and no single genetic factor explains the empirical contrast in patterns. We further report that in Arabidopsis thaliana, a highly self-fertilizing plant with effectively low recombination, five of six published nuclear data sets also exhibit an excess of amino acid polymorphism. We suggest that the contrast between nuclear and mitochondrial nonneutrality in animals stems from differences in rates of recombination in conjunction with a distribution of selective effects. If the majority of mutations segregating in populations are deleterious, high linkage may hinder the spread of the occasional beneficial mutation.
Collapse
Affiliation(s)
- D M Weinreich
- Department of Ecology and Evolutionary Biology, Brown University, Providence, Rhode Island 02912, USA.
| | | |
Collapse
|