1
|
Bailey SF, Alonso Morales LA, Kassen R. Effects of synonymous mutations beyond codon bias: The evidence for adaptive synonymous substitutions from microbial evolution experiments. Genome Biol Evol 2021; 13:6300525. [PMID: 34132772 PMCID: PMC8410137 DOI: 10.1093/gbe/evab141] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/10/2021] [Indexed: 12/22/2022] Open
Abstract
Synonymous mutations are often assumed to be neutral with respect to fitness because they do not alter the encoded amino acid and so cannot be 'seen' by natural selection. Yet a growing body of evidence suggests that synonymous mutations can have fitness effects that drive adaptive evolution through their impacts on gene expression and protein folding. Here, we review what microbial experiments have taught us about the contribution of synonymous mutations to adaptation. A survey of site-directed mutagenesis experiments reveals the distributions of fitness effects for nonsynonymous and synonymous mutations are more similar, especially for beneficial mutations, than expected if all synonymous mutations were neutral, suggesting they should drive adaptive evolution more often than is typically observed. A review of experimental evolution studies where synonymous mutations have contributed to adaptation shows they can impact fitness through a range of mechanisms including the creation of illicit RNA polymerase binding sites impacting transcription and changes to mRNA folding stability that modulate translation. We suggest that clonal interference in evolving microbial populations may be the reason synonymous mutations play a smaller role in adaptive evolution than expected based on their observed fitness effects. We finish by discussing the impacts of falsely assuming synonymous mutations are neutral and discuss directions for future work exploring the role of synonymous mutations in adaptive evolution.
Collapse
Affiliation(s)
- Susan F Bailey
- Department of Biology, Clarkson University, Potsdam, NY 13699, USA
| | | | - Rees Kassen
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
2
|
Vogl C, Mikula LC. A nearly-neutral biallelic Moran model with biased mutation and linear and quadratic selection. Theor Popul Biol 2021; 139:1-17. [PMID: 33964284 DOI: 10.1016/j.tpb.2021.03.003] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2020] [Revised: 03/28/2021] [Accepted: 03/29/2021] [Indexed: 01/27/2023]
Abstract
In this article, a biallelic reversible mutation model with linear and quadratic selection is analysed. The approach reconnects to one proposed by Kimura (1979), who starts from a diffusion model and derives its equilibrium distribution up to a constant. We use a boundary-mutation Moran model, which approximates a general mutation model for small effective mutation rates, and derive its equilibrium distribution for polymorphic and monomorphic variants in small to moderately sized populations. Using this model, we show that biased mutation rates and linear selection alone can cause patterns of polymorphism within and substitution rates between populations that are usually ascribed to balancing or overdominant selection. We illustrate this using a data set of short introns and fourfold degenerate sites from Drosophila simulans and Drosophila melanogaster.
Collapse
Affiliation(s)
- Claus Vogl
- Department of Biomedical Sciences, Vetmeduni Vienna, Veterinärplatz 1, A-1210 Wien, Austria; Vienna Graduate School of Population Genetics, A-1210 Wien, Austria.
| | - Lynette Caitlin Mikula
- Centre for Biological Diversity, School of Biology, University of St. Andrews, St Andrews KY16 9TH, UK.
| |
Collapse
|
3
|
McCandlish DM, Lang GI. Evolution of Epistasis: Small Populations Go Their Separate Ways. J Mol Evol 2020; 88:418-420. [PMID: 32435963 DOI: 10.1007/s00239-020-09950-4] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Accepted: 05/06/2020] [Indexed: 11/26/2022]
Affiliation(s)
- David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 11724, USA.
| | - Gregory I Lang
- Department of Biological Sciences, Lehigh University, Bethlehem, PA, 18015, USA
| |
Collapse
|
4
|
Machado HE, Lawrie DS, Petrov DA. Pervasive Strong Selection at the Level of Codon Usage Bias in Drosophila melanogaster. Genetics 2020; 214:511-528. [PMID: 31871131 PMCID: PMC7017021 DOI: 10.1534/genetics.119.302542] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2019] [Accepted: 12/12/2019] [Indexed: 11/18/2022] Open
Abstract
Codon usage bias (CUB), where certain codons are used more frequently than expected by chance, is a ubiquitous phenomenon and occurs across the tree of life. The dominant paradigm is that the proportion of preferred codons is set by weak selection. While experimental changes in codon usage have at times shown large phenotypic effects in contrast to this paradigm, genome-wide population genetic estimates have supported the weak selection model. Here we use deep genomic population sequencing of two Drosophila melanogaster populations to measure selection on synonymous sites in a way that allowed us to estimate the prevalence of both weak and strong purifying selection. We find that selection in favor of preferred codons ranges from weak (|Nes| ∼ 1) to strong (|Nes| > 10), with strong selection acting on 10-20% of synonymous sites in preferred codons. While previous studies indicated that selection at synonymous sites could be strong, this is the first study to detect and quantify strong selection specifically at the level of CUB. Further, we find that CUB-associated polymorphism accounts for the majority of strong selection on synonymous sites, with secondary contributions of splicing (selection on alternatively spliced genes, splice junctions, and spliceosome-bound sites) and transcription factor binding. Our findings support a new model of CUB and indicate that the functional importance of CUB, as well as synonymous sites in general, have been underestimated.
Collapse
Affiliation(s)
- Heather E Machado
- Cancer, Ageing, and Somatic Mutation, Wellcome Sanger Institute, Hinxton CB10 1SA, UK
| | - David S Lawrie
- Department of Ecology and Evolutionary Biology, University of California, Irvine, California 92697-3958
| | - Dmitri A Petrov
- Department of Biology, Stanford University, California 94305-5020
| |
Collapse
|
5
|
Szövényi P, Ullrich KK, Rensing SA, Lang D, van Gessel N, Stenøien HK, Conti E, Reski R. Selfing in Haploid Plants and Efficacy of Selection: Codon Usage Bias in the Model Moss Physcomitrella patens. Genome Biol Evol 2017; 9:1528-1546. [PMID: 28549175 PMCID: PMC5507605 DOI: 10.1093/gbe/evx098] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/25/2017] [Indexed: 12/15/2022] Open
Abstract
A long-term reduction in effective population size will lead to major shift in genome evolution. In particular, when effective population size is small, genetic drift becomes dominant over natural selection. The onset of self-fertilization is one evolutionary event considerably reducing effective size of populations. Theory predicts that this reduction should be more dramatic in organisms capable for haploid than for diploid selfing. Although theoretically well-grounded, this assertion received mixed experimental support. Here, we test this hypothesis by analyzing synonymous codon usage bias of genes in the model moss Physcomitrella patens frequently undergoing haploid selfing. In line with population genetic theory, we found that the effect of natural selection on synonymous codon usage bias is very weak. Our conclusion is supported by four independent lines of evidence: 1) Very weak or nonsignificant correlation between gene expression and codon usage bias, 2) no increased codon usage bias in more broadly expressed genes, 3) no evidence that codon usage bias would constrain synonymous and nonsynonymous divergence, and 4) predominant role of genetic drift on synonymous codon usage predicted by a model-based analysis. These findings show striking similarity to those observed in AT-rich genomes with weak selection for optimal codon usage and GC content overall. Our finding is in contrast to a previous study reporting adaptive codon usage bias in the moss P. patens.
Collapse
Affiliation(s)
- Péter Szövényi
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Kristian K. Ullrich
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- Present address: Max-Planck-Insitut für Evolutionsbiologie, Plön, Germany
| | - Stefan A. Rensing
- Plant Cell Biology, Faculty of Biology, University of Marburg, Germany
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
| | - Daniel Lang
- Plant Genome and Systems Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Nico van Gessel
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| | | | - Elena Conti
- Department of Systematic and Evolutionary Botany, University of Zurich, Switzerland
| | - Ralf Reski
- BIOSS—Centre for Biological Signalling Studies, University of Freiburg, Germany
- Plant Biotechnology, Faculty of Biology, University of Freiburg, Germany
| |
Collapse
|
6
|
Zhu YO, Sherlock G, Petrov DA. Extremely Rare Polymorphisms in Saccharomyces cerevisiae Allow Inference of the Mutational Spectrum. PLoS Genet 2017; 13:e1006455. [PMID: 28046117 PMCID: PMC5207638 DOI: 10.1371/journal.pgen.1006455] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Accepted: 11/03/2016] [Indexed: 12/04/2022] Open
Abstract
The characterization of mutational spectra is usually carried out in one of three ways-by direct observation through mutation accumulation (MA) experiments, through parent-offspring sequencing, or by indirect inference from sequence data. Direct observations of spontaneous mutations with MA experiments are limited, given (i) the rarity of spontaneous mutations, (ii) applicability only to laboratory model species with short generation times, and (iii) the possibility that mutational spectra under lab conditions might be different from those observed in nature. Trio sequencing is an elegant solution, but it is not applicable in all organisms. Indirect inference, usually from divergence data, faces no such technical limitations, but rely upon critical assumptions regarding the strength of natural selection that are likely to be violated. Ideally, new mutational events would be directly observed before the biased filter of selection, and without the technical limitations common to lab experiments. One approach is to identify very young mutations from population sequencing data. Here we do so by leveraging two characteristics common to all new mutations-new mutations are necessarily rare in the population, and absent in the genomes of immediate relatives. From 132 clinical yeast strains, we were able to identify 1,425 putatively new mutations and show that they exhibit extremely low signatures of selection, as well as display a mutational spectrum that is similar to that identified by a large scale MA experiment. We verify that population sequencing data are a potential wealth of information for inferring mutational spectra, and should be considered for analysis where MA experiments are infeasible or especially tedious.
Collapse
Affiliation(s)
- Yuan O. Zhu
- Department of Genetics, Stanford University, Stanford, CA, United States of America
- Department of Biology, Stanford University, Stanford, CA, United States of America
- Genome Institute of Singapore, Singapore
| | - Gavin Sherlock
- Department of Genetics, Stanford University, Stanford, CA, United States of America
| | - Dmitri A. Petrov
- Department of Biology, Stanford University, Stanford, CA, United States of America
| |
Collapse
|
7
|
Pouyet F, Bailly-Bechet M, Mouchiroud D, Guéguen L. SENCA: A Multilayered Codon Model to Study the Origins and Dynamics of Codon Usage. Genome Biol Evol 2016; 8:2427-41. [PMID: 27401173 PMCID: PMC5010899 DOI: 10.1093/gbe/evw165] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Gene sequences are the target of evolution operating at different levels, including the nucleotide, codon, and amino acid levels. Disentangling the impact of those different levels on gene sequences requires developing a probabilistic model with three layers. Here we present SENCA (site evolution of nucleotides, codons, and amino acids), a codon substitution model that separately describes 1) nucleotide processes which apply on all sites of a sequence such as the mutational bias, 2) preferences between synonymous codons, and 3) preferences among amino acids. We argue that most synonymous substitutions are not neutral and that SENCA provides more accurate estimates of selection compared with more classical codon sequence models. We study the forces that drive the genomic content evolution, intraspecifically in the core genome of 21 prokaryotes and interspecifically for five Enterobacteria. We retrieve the existence of a universal mutational bias toward AT, and that taking into account selection on synonymous codon usage has consequences on the measurement of selection on nonsynonymous substitutions. We also confirm that codon usage bias is mostly driven by selection on preferred codons. We propose new summary statistics to measure the relative importance of the different evolutionary processes acting on sequences.
Collapse
Affiliation(s)
- Fanny Pouyet
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Marc Bailly-Bechet
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Dominique Mouchiroud
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| | - Laurent Guéguen
- Laboratoire de Biologie et Biométrie Evolutive, University Claude Bernard Lyon 1-University of Lyon, Villeurbanne, France
| |
Collapse
|
8
|
Price N, Graur D. Are Synonymous Sites in Primates and Rodents Functionally Constrained? J Mol Evol 2015; 82:51-64. [PMID: 26563252 DOI: 10.1007/s00239-015-9719-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2015] [Accepted: 11/04/2015] [Indexed: 11/28/2022]
Abstract
It has been claimed that synonymous sites in mammals are under selective constraint. Furthermore, in many studies the selective constraint at such sites in primates was claimed to be more stringent than that in rodents. Given the larger effective population sizes in rodents than in primates, the theoretical expectation is that selection in rodents would be more effective than that in primates. To resolve this contradiction between expectations and observations, we used processed pseudogenes as a model for strict neutral evolution, and estimated selective constraint on synonymous sites using the rate of substitution at pseudosynonymous and pseudononsynonymous sites in pseudogenes as the neutral expectation. After controlling for the effects of GC content, our results were similar to those from previous studies, i.e., synonymous sites in primates exhibited evidence for higher selective constraint that those in rodents. Specifically, our results indicated that in primates up to 24% of synonymous sites could be under purifying selection, while in rodents synonymous sites evolved neutrally. To further control for shifts in GC content, we estimated selective constraint at fourfold degenerate sites using a maximum parsimony approach. This allowed us to estimate selective constraint using mutational patterns that cause a shift in GC content (GT ↔ TG, CT ↔ TC, GA ↔ AG, and CA ↔ AC) and ones that do not (AT ↔ TA and CG ↔ GC). Using this approach, we found that synonymous sites evolve neutrally in both primates and rodents. Apparent deviations from neutrality were caused by a higher rate of C → A and C → T mutations in pseudogenes. Such differences are most likely caused by the shift in GC content experienced by pseudogenes. We conclude that previous estimates according to which 20-40% of synonymous sites in primates were under selective constraint were most likely artifacts of the biased pattern of mutation.
Collapse
Affiliation(s)
- Nicholas Price
- Department of Bioagricultural Sciences and Pest Management, Colorado State University, Fort Collins, CO, 80523, USA.
| | - Dan Graur
- Department of Biology and Biochemistry, University of Houston, Houston, TX, 77204-5001, USA
| |
Collapse
|
9
|
Arenas M. Trends in substitution models of molecular evolution. Front Genet 2015; 6:319. [PMID: 26579193 PMCID: PMC4620419 DOI: 10.3389/fgene.2015.00319] [Citation(s) in RCA: 78] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2015] [Accepted: 10/09/2015] [Indexed: 11/13/2022] Open
Abstract
Substitution models of evolution describe the process of genetic variation through fixed mutations and constitute the basis of the evolutionary analysis at the molecular level. Almost 40 years after the development of first substitution models, highly sophisticated, and data-specific substitution models continue emerging with the aim of better mimicking real evolutionary processes. Here I describe current trends in substitution models of DNA, codon and amino acid sequence evolution, including advantages and pitfalls of the most popular models. The perspective concludes that despite the large number of currently available substitution models, further research is required for more realistic modeling, especially for DNA coding and amino acid data. Additionally, the development of more accurate complex models should be coupled with new implementations and improvements of methods and frameworks for substitution model selection and downstream evolutionary analysis.
Collapse
Affiliation(s)
- Miguel Arenas
- Institute of Molecular Pathology and Immunology of the University of Porto Porto, Portugal
| |
Collapse
|
10
|
Hübner S, Korol AB, Schmid KJ. RNA-Seq analysis identifies genes associated with differential reproductive success under drought-stress in accessions of wild barley Hordeum spontaneum. BMC PLANT BIOLOGY 2015; 15:134. [PMID: 26055625 PMCID: PMC4459662 DOI: 10.1186/s12870-015-0528-z] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/28/2014] [Accepted: 05/20/2015] [Indexed: 05/20/2023]
Abstract
BACKGROUND The evolutionary basis of reproductive success in different environments is of major interest in the study of plant adaptation. Since the reproductive stage is particularly sensitive to drought, genes affecting reproductive success during this stage are key players in the evolution of adaptive mechanisms. We used an ecological genomics approach to investigate the reproductive response of drought-tolerant and sensitive wild barley accessions originating from different habitats in the Levant. RESULTS We sequenced mRNA extracted from spikelets at the flowering stage in drought-treated and control plants. The barley genome was used for a reference-guided assembly and differential expression analysis. Our approach enabled to detect biological processes affecting grain production under drought stress. We detected novel candidate genes and differentially expressed alleles associated with drought tolerance. Drought associated genes were shown to be more conserved than non-associated genes, and drought-tolerance genes were found to evolve more rapidly than other drought associated genes. CONCLUSIONS We show that reproductive success under drought stress is not a habitat-specific trait but a shared physiological adaptation that appeared to evolve recently in the evolutionary history of wild barley. Exploring the genomic basis of reproductive success under stress in crop wild progenitors is expected to have considerable ecological and economical applications.
Collapse
Affiliation(s)
- Sariel Hübner
- Department of Evolutionary and Environmental Biology, University of Haifa, Mt. Carmel 31905, Haifa, Israel.
- Current address: Department of Botany, University of British Columbia, Vancouver, Canada.
| | - Abraham B Korol
- Department of Evolutionary and Environmental Biology, University of Haifa, Mt. Carmel 31905, Haifa, Israel.
| | - Karl J Schmid
- Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, D-70593, Stuttgart, Germany.
| |
Collapse
|
11
|
Goncearenco A, Berezovsky IN. The fundamental tradeoff in genomes and proteomes of prokaryotes established by the genetic code, codon entropy, and physics of nucleic acids and proteins. Biol Direct 2014; 9:29. [PMID: 25496919 PMCID: PMC4273451 DOI: 10.1186/s13062-014-0029-2] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 12/01/2014] [Indexed: 11/26/2022] Open
Abstract
Background Mutations in nucleotide sequences provide a foundation for genetic variability, and selection is the driving force of the evolution and molecular adaptation. Despite considerable progress in the understanding of selective forces and their compositional determinants, the very nature of underlying mutational biases remains unclear. Results We explore here a fundamental tradeoff, which analytically describes mutual adjustment of the nucleotide and amino acid compositions and its possible effect on the mutational biases. The tradeoff is determined by the interplay between the genetic code, optimization of the codon entropy, and demands on the structure and stability of nucleic acids and proteins. Conclusion The tradeoff is the unifying property of all prokaryotes regardless of the differences in their phylogenies, life styles, and extreme environments. It underlies mutational biases characteristic for genomes with different nucleotide and amino acid compositions, providing foundation for evolution and adaptation. Reviewers This article was reviewed by Eugene Koonin, Michael Gromiha, and Alexander Schleiffer. Electronic supplementary material The online version of this article (doi:10.1186/s13062-014-0029-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Alexander Goncearenco
- Computational Biology Unit and Department of Informatics, University of Bergen, N-5008, Bergen, Norway. .,Current address: Computational Biology Branch of the National Center for Biotechnology Information in Bethesda, Maryland, USA.
| | - Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, Singapore, 138671, Singapore. .,Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117597, Singapore, Singapore.
| |
Collapse
|
12
|
McCandlish DM, Stoltzfus A. Modeling evolution using the probability of fixation: history and implications. QUARTERLY REVIEW OF BIOLOGY 2014; 89:225-52. [PMID: 25195318 DOI: 10.1086/677571] [Citation(s) in RCA: 123] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/03/2022]
Abstract
Many models of evolution calculate the rate of evolution by multiplying the rate at which new mutations originate within a population by a probability of fixation. Here we review the historical origins, contemporary applications, and evolutionary implications of these "origin-fixation" models, which are widely used in evolutionary genetics, molecular evolution, and phylogenetics. Origin-fixation models were first introduced in 1969, in association with an emerging view of "molecular" evolution. Early origin-fixation models were used to calculate an instantaneous rate of evolution across a large number of independently evolving loci; in the 1980s and 1990s, a second wave of origin-fixation models emerged to address a sequence of fixation events at a single locus. Although origin fixation models have been applied to a broad array of problems in contemporary evolutionary research, their rise in popularity has not been accompanied by an increased appreciation of their restrictive assumptions or their distinctive implications. We argue that origin-fixation models constitute a coherent theory of mutation-limited evolution that contrasts sharply with theories of evolution that rely on the presence of standing genetic variation. A major unsolved question in evolutionary biology is the degree to which these models provide an accurate approximation of evolution in natural populations.
Collapse
|
13
|
Abstract
Some species exhibit very high levels of DNA sequence variability; there is also evidence for the existence of heritable epigenetic variants that experience state changes at a much higher rate than sequence variants. In both cases, the resulting high diversity levels within a population (hyperdiversity) mean that standard population genetics methods are not trustworthy. We analyze a population genetics model that incorporates purifying selection, reversible mutations, and genetic drift, assuming a stationary population size. We derive analytical results for both population parameters and sample statistics and discuss their implications for studies of natural genetic and epigenetic variation. In particular, we find that (1) many more intermediate-frequency variants are expected than under standard models, even with moderately strong purifying selection, and (2) rates of evolution under purifying selection may be close to, or even exceed, neutral rates. These findings are related to empirical studies of sequence and epigenetic variation.
Collapse
|
14
|
Abstract
Mutation is the ultimate source of genetic variation. The most direct and unbiased method of studying spontaneous mutations is via mutation accumulation (MA) lines. Until recently, MA experiments were limited by the cost of sequencing and thus provided us with small numbers of mutational events and therefore imprecise estimates of rates and patterns of mutation. We used whole-genome sequencing to identify nearly 1,000 spontaneous mutation events accumulated over ∼311,000 generations in 145 diploid MA lines of the budding yeast Saccharomyces cerevisiae. MA experiments are usually assumed to have negligible levels of selection, but even mild selection will remove strongly deleterious events. We take advantage of such patterns of selection and show that mutation classes such as indels and aneuploidies (especially monosomies) are proportionately much more likely to contribute mutations of large effect. We also provide conservative estimates of indel, aneuploidy, environment-dependent dominant lethal, and recessive lethal mutation rates. To our knowledge, for the first time in yeast MA data, we identified a sufficiently large number of single-nucleotide mutations to measure context-dependent mutation rates and were able to (i) confirm strong AT bias of mutation in yeast driven by high rate of mutations from C/G to T/A and (ii) detect a higher rate of mutation at C/G nucleotides in two specific contexts consistent with cytosine methylation in S. cerevisiae.
Collapse
|
15
|
Lawrie DS, Petrov DA. Comparative population genomics: power and principles for the inference of functionality. Trends Genet 2014; 30:133-9. [PMID: 24656563 DOI: 10.1016/j.tig.2014.02.002] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2013] [Revised: 01/31/2014] [Accepted: 02/06/2014] [Indexed: 11/19/2022]
Abstract
The availability of sequenced genomes from multiple related organisms allows the detection and localization of functional genomic elements based on the idea that such elements evolve more slowly than neutral sequences. Although such comparative genomics methods have proven useful in discovering functional elements and ascertaining levels of functional constraint in the genome as a whole, here we outline limitations intrinsic to this approach that cannot be overcome by sequencing more species. We argue that it is essential to supplement comparative genomics with ultra-deep sampling of populations from closely related species to enable substantially more powerful genomic scans for functional elements. The convergence of sequencing technology and population genetics theory has made such projects feasible and has exciting implications for functional genomics.
Collapse
Affiliation(s)
- David S Lawrie
- Department of Genetics, Stanford University, Stanford, CA, USA; Department of Biology, Stanford University, Stanford, CA, USA.
| | - Dmitri A Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| |
Collapse
|
16
|
Sloan DB, Triant DA, Forrester NJ, Bergner LM, Wu M, Taylor DR. A recurring syndrome of accelerated plastid genome evolution in the angiosperm tribe Sileneae (Caryophyllaceae). Mol Phylogenet Evol 2013; 72:82-9. [PMID: 24373909 DOI: 10.1016/j.ympev.2013.12.004] [Citation(s) in RCA: 84] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2013] [Revised: 12/05/2013] [Accepted: 12/17/2013] [Indexed: 11/27/2022]
Abstract
In flowering plants, plastid genomes are generally conserved, exhibiting slower rates of sequence evolution than the nucleus and little or no change in structural organization. However, accelerated plastid genome evolution has occurred in scattered angiosperm lineages. For example, some species within the genus Silene have experienced a suite of recent changes to their plastid genomes, including inversions, shifts in inverted repeat boundaries, large indels, intron losses, and rapid rates of amino acid sequence evolution in a subset of protein genes, with the most extreme divergence occurring in the protease gene clpP. To investigate the relationship between the rates of sequence and structural evolution, we sequenced complete plastid genomes from three species (Silene conoidea, S. paradoxa, and Lychnis chalcedonica), representing independent lineages within the tribe Sileneae that were previously shown to have accelerated rates of clpP evolution. We found a high degree of parallel evolution. Elevated rates of amino acid substitution have occurred repeatedly in the same subset of plastid genes and have been accompanied by a recurring pattern of structural change, including cases of identical inversions and intron loss. This "syndrome" of changes was not observed in the closely related outgroup Agrostemma githago or in the more slowly evolving Silene species that were sequenced previously. Although no single mechanism has yet been identified to explain the correlated suite of changes in plastid genome sequence and structure that has occurred repeatedly in angiosperm evolution, we discuss a possible mixture of adaptive and non-adaptive forces that may be responsible.
Collapse
Affiliation(s)
- Daniel B Sloan
- Department of Biology, Colorado State University, Fort Collins, CO 80523, United States.
| | - Deborah A Triant
- Department of Biology, University of Virginia, Charlottesville, VA 22904, United States
| | - Nicole J Forrester
- Department of Biology, University of Virginia, Charlottesville, VA 22904, United States
| | - Laura M Bergner
- Department of Biology, University of Virginia, Charlottesville, VA 22904, United States
| | - Martin Wu
- Department of Biology, University of Virginia, Charlottesville, VA 22904, United States
| | - Douglas R Taylor
- Department of Biology, University of Virginia, Charlottesville, VA 22904, United States
| |
Collapse
|
17
|
Strong purifying selection at synonymous sites in D. melanogaster. PLoS Genet 2013; 9:e1003527. [PMID: 23737754 PMCID: PMC3667748 DOI: 10.1371/journal.pgen.1003527] [Citation(s) in RCA: 143] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2013] [Accepted: 04/08/2013] [Indexed: 11/19/2022] Open
Abstract
Synonymous sites are generally assumed to be subject to weak selective constraint. For this reason, they are often neglected as a possible source of important functional variation. We use site frequency spectra from deep population sequencing data to show that, contrary to this expectation, 22% of four-fold synonymous (4D) sites in Drosophila melanogaster evolve under very strong selective constraint while few, if any, appear to be under weak constraint. Linking polymorphism with divergence data, we further find that the fraction of synonymous sites exposed to strong purifying selection is higher for those positions that show slower evolution on the Drosophila phylogeny. The function underlying the inferred strong constraint appears to be separate from splicing enhancers, nucleosome positioning, and the translational optimization generating canonical codon bias. The fraction of synonymous sites under strong constraint within a gene correlates well with gene expression, particularly in the mid-late embryo, pupae, and adult developmental stages. Genes enriched in strongly constrained synonymous sites tend to be particularly functionally important and are often involved in key developmental pathways. Given that the observed widespread constraint acting on synonymous sites is likely not limited to Drosophila, the role of synonymous sites in genetic disease and adaptation should be reevaluated.
Collapse
|
18
|
Maruki T, Kumar S, Kim Y. Purifying selection modulates the estimates of population differentiation and confounds genome-wide comparisons across single-nucleotide polymorphisms. Mol Biol Evol 2012; 29:3617-23. [PMID: 22826460 PMCID: PMC3494274 DOI: 10.1093/molbev/mss187] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
An improved understanding of the biological and numerical properties of measures of population differentiation across loci is becoming increasingly more important because of their growing use in analyzing genome-wide polymorphism data for detecting population structures, inferring the rates of migration, and identifying local adaptations. In a genome-wide analysis, we discovered that the estimates of population differentiation (e.g., F(ST), θ, and Jost's D) calculated for human single-nucleotide polymorphisms (SNPs) are strongly and positively correlated to the position-specific evolutionary rates measured from multispecies alignments. That is, genomic positions (loci) experiencing higher purifying selection (lower evolutionary rates) produce lower values for the degree of population differentiation than those evolving with faster rates. We show that this pattern is completely mediated by the negative effects of purifying selection on the minor allele frequency (MAF) at individual loci. Our results suggest that inferences and methods relying on the comparison of population differentiation estimates (F(ST), θ, and Jost's D) based on SNPs across genomic positions should be restricted to loci with similar MAFs and/or the rates of evolution in genome scale surveys.
Collapse
Affiliation(s)
- Takahiro Maruki
- Center for Evolutionary Medicine and Informatics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| | - Sudhir Kumar
- Center for Evolutionary Medicine and Informatics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
| | - Yuseob Kim
- Center for Evolutionary Medicine and Informatics, The Biodesign Institute, Arizona State University
- School of Life Sciences, Arizona State University
- Department of Life Science, Ewha Womans University, Seoul, Korea
| |
Collapse
|
19
|
Clemente F, Vogl C. Evidence for complex selection on four-fold degenerate sites in Drosophila melanogaster. J Evol Biol 2012; 25:2582-95. [PMID: 23020078 DOI: 10.1111/jeb.12003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2012] [Revised: 08/31/2012] [Accepted: 08/31/2012] [Indexed: 01/04/2023]
Abstract
We considered genome-wide four-fold degenerate sites from an African Drosophila melanogaster population and compared them to short introns. To include divergence and to polarize the data, we used its close relatives Drosophila simulans, Drosophila sechellia, Drosophila erecta and Drosophila yakuba as outgroups. In D. melanogaster, the GC content at four-fold degenerate sites is higher than in short introns; compared to its relatives, more AT than GC is fixed. The former has been explained by codon usage bias (CUB) favouring GC; the latter by decreased intensity of directional selection or by increased mutation bias towards AT. With a biallelic equilibrium model, evidence for directional selection comes mostly from the GC-rich ancestral base composition. Together with a slight mutation bias, it leads to an asymmetry of the unpolarized allele frequency spectrum, from which directional selection is inferred. Using a quasi-equilibrium model and polarized spectra, however, only purifying and no directional selection is detected. Furthermore, polarized spectra are proportional to those of the presumably unselected short introns. As we have no evidence for a decrease in effective population size, relaxed CUB must be due to a reduction in the selection coefficient. Going beyond the biallelic model and considering all four bases, signs of directional selection are stronger. In contrast to short introns, complementary bases show strand specificity and allele frequency spectra depend on mutation directions. Hence, the traditional biallelic model to describe the evolution of four-fold degenerate sites should be replaced by more complex models assuming only quasi-equilibrium and accounting for all four bases.
Collapse
Affiliation(s)
- F Clemente
- Institute of Population Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| | | |
Collapse
|
20
|
Clemente F, Vogl C. Unconstrained evolution in short introns? - an analysis of genome-wide polymorphism and divergence data from Drosophila. J Evol Biol 2012; 25:1975-1990. [PMID: 22901008 DOI: 10.1111/j.1420-9101.2012.02580.x] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2012] [Revised: 06/15/2012] [Accepted: 06/22/2012] [Indexed: 12/23/2022]
Abstract
An unconstrained reference sequence facilitates the detection of selection. In Drosophila, sequence variation in short introns seems to be least influenced by selection and dominated by mutation and drift. Here, we test this with genome-wide sequences using an African population (Malawi) of D. melanogaster and data from the related outgroup species D. simulans, D. sechellia, D. erecta and D. yakuba. The distribution of mutations deviates from equilibrium, and the content of A and T (AT) nucleotides shows an excess of variance among introns. We explain this by a complex mutational pattern: a shift in mutational bias towards AT, leading to a slight nonequilibrium in base composition and context-dependent mutation rates, with G or C (GC) sites mutating most frequently in AT-rich introns. By comparing the corresponding allele frequency spectra of AT-rich vs. GC-rich introns, we can rule out the influence of directional selection or biased gene conversion on the mutational pattern. Compared with neutral equilibrium expectations, polymorphism spectra show an excess of low frequency and a paucity of intermediate frequency variants, irrespective of the direction of mutation. Combining the information from different outgroups with the polymorphism data and using a generalized linear model, we find evidence for shared ancestral polymorphism between D. melanogaster and D. simulans, D. sechellia, arguing against a bottleneck in D. melanogaster. Generally, we find that short introns can be used as a neutral reference on a genome-wide level, if the spatially and temporally varying mutational pattern is accounted for.
Collapse
Affiliation(s)
- F Clemente
- Institute of Population Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| | - C Vogl
- Institute of Animal Breeding and Genetics, Veterinärmedizinische Universität Wien, Vienna, Austria
| |
Collapse
|
21
|
Campos JL, Charlesworth B, Haddrill PR. Molecular evolution in nonrecombining regions of the Drosophila melanogaster genome. Genome Biol Evol 2012; 4:278-88. [PMID: 22275518 PMCID: PMC3318434 DOI: 10.1093/gbe/evs010] [Citation(s) in RCA: 45] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
We study the evolutionary effects of reduced recombination on the Drosophila melanogaster genome, analyzing more than 200 new genes that lack crossing-over and employing a novel orthology search among species of the melanogaster subgroup. These genes are located in the heterochromatin of chromosomes other than the dot (fourth) chromosome. Noncrossover regions of the genome all exhibited an elevated level of evolutionary divergence from D. yakuba at nonsynonymous sites, lower codon usage bias, lower GC content in coding and noncoding regions, and longer introns. Levels of gene expression are similar for genes in regions with and without crossing-over, which rules out the possibility that the reduced level of adaptation that we detect is caused by relaxed selection due to lower levels of gene expression in the heterochromatin. The patterns observed are consistent with a reduction in the efficacy of selection in all regions of the genome of D. melanogaster that lack crossing-over, as a result of the effects of enhanced Hill-Robertson interference. However, we also detected differences among nonrecombining locations: The X chromosome seems to exhibit the weakest effects, whereas the fourth chromosome and the heterochromatic genes on the autosomes located most proximal to the centromere showed the largest effects. However, signatures of selection on both nonsynonymous mutations and on codon usage persist in all heterochromatic regions.
Collapse
Affiliation(s)
- José L Campos
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, United Kingdom.
| | | | | |
Collapse
|
22
|
Vogl C, Clemente F. The allele-frequency spectrum in a decoupled Moran model with mutation, drift, and directional selection, assuming small mutation rates. Theor Popul Biol 2012; 81:197-209. [PMID: 22269092 PMCID: PMC3315028 DOI: 10.1016/j.tpb.2012.01.001] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2011] [Revised: 12/23/2011] [Accepted: 01/04/2012] [Indexed: 01/22/2023]
Abstract
We analyze a decoupled Moran model with haploid population size N, a biallelic locus under mutation and drift with scaled forward and backward mutation rates θ1=μ1N and θ0=μ0N, and directional selection with scaled strength γ=sN. With small scaled mutation rates θ0 and θ1, which is appropriate for single nucleotide polymorphism data in highly recombining regions, we derive a simple approximate equilibrium distribution for polymorphic alleles with a constant of proportionality. We also put forth an even simpler model, where all mutations originate from monomorphic states. Using this model we derive the sojourn times, conditional on the ancestral and fixed allele, and under equilibrium the distributions of fixed and polymorphic alleles and fixation rates. Furthermore, we also derive the distribution of small samples in the diffusion limit and provide convenient recurrence relations for calculating this distribution. This enables us to give formulas analogous to the Ewens–Watterson estimator of θ for biased mutation rates and selection. We apply this theory to a polymorphism dataset of fourfold degenerate sites in Drosophila melanogaster.
Collapse
Affiliation(s)
- Claus Vogl
- Institute of Animal Breeding and Genetics, Veterinärmedizinische Universität Wien, Veterinärplatz 1, A-1210 Vienna, Austria.
| | | |
Collapse
|
23
|
Abstract
In this paper, the comprehensive analysis of codon usage bias of Duck enteritis virus (DEV) UL21 gene was performed by using CAI, CHIPS and CUSP program of EMBOSS. Our study showed that codon usage bias of DEV UL21 had strong bias towards the A-ended or T-ended codons, and GC3s contents of the codon usage bias in DEV UL21 gene were significantly varied compared with those of other 27 reference herpesviruses. The CAI, ENC value of DEV CHv strain UL21 gene is 0.615 and 55.167, respectively, indicating that the codon usage bias of this gene is weak and lowly expressed. The plot of ENC versus GC3S content revealed that DEV UL21 gene is subject to GC compositional constraints. The phylogentic analysis about amino acids codon usage bias of DEV UL21 and the27 reference herpesviruses showed that DEV was evolutionarily closer to herpesviruses Mardivirus. In addition, the codon usage bias of DEV UL21 gene was compared with those of E. coli, yeast and humans. There are 42, 45, 39 same codons usage bias between the DEV UL21 to E.coli, Yeast, H.sapiens, respectively, indicaiting that UL21 gene of DEV may be more efficiently expressed in the yeast system.
Collapse
|
24
|
Kostka D, Hubisz MJ, Siepel A, Pollard KS. The role of GC-biased gene conversion in shaping the fastest evolving regions of the human genome. Mol Biol Evol 2011; 29:1047-57. [PMID: 22075116 PMCID: PMC3278478 DOI: 10.1093/molbev/msr279] [Citation(s) in RCA: 61] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
GC-biased gene conversion (gBGC) is a recombination-associated evolutionary process that accelerates the fixation of guanine or cytosine alleles, regardless of their effects on fitness. gBGC can increase the overall rate of substitutions, a hallmark of positive selection. Many fast-evolving genes and noncoding sequences in the human genome have GC-biased substitution patterns, suggesting that gBGC-in contrast to adaptive processes-may have driven the human changes in these sequences. To investigate this hypothesis, we developed a substitution model for DNA sequence evolution that quantifies the nonlinear interacting effects of selection and gBGC on substitution rates and patterns. Based on this model, we used a series of lineage-specific likelihood ratio tests to evaluate sequence alignments for evidence of changes in mode of selection, action of gBGC, or both. With a false positive rate of less than 5% for individual tests, we found that the majority (76%) of previously identified human accelerated regions are best explained without gBGC, whereas a substantial minority (19%) are best explained by the action of gBGC alone. Further, more than half (55%) have substitution rates that significantly exceed local estimates of the neutral rate, suggesting that these regions may have been shaped by positive selection rather than by relaxation of constraint. By distinguishing the effects of gBGC, relaxation of constraint, and positive selection we provide an integrated analysis of the evolutionary forces that shaped the fastest evolving regions of the human genome, which facilitates the design of targeted functional studies of adaptation in humans.
Collapse
Affiliation(s)
- Dennis Kostka
- Gladstone Institute of Cardiovascular Disease, University of California, San Francisco, USA.
| | | | | | | |
Collapse
|
25
|
Osada N, Akashi H. Mitochondrial-nuclear interactions and accelerated compensatory evolution: evidence from the primate cytochrome C oxidase complex. Mol Biol Evol 2011; 29:337-46. [PMID: 21890478 DOI: 10.1093/molbev/msr211] [Citation(s) in RCA: 141] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Accelerated rates of mitochondrial protein evolution have been proposed to reflect Darwinian coadaptation for efficient energy production for mammalian flight and brain activity. However, several features of mammalian mtDNA (absence of recombination, small effective population size, and high mutation rate) promote genome degradation through the accumulation of weakly deleterious mutations. Here, we present evidence for "compensatory" adaptive substitutions in nuclear DNA- (nDNA) encoded mitochondrial proteins to prevent fitness decline in primate mitochondrial protein complexes. We show that high mutation rate and small effective population size, key features of primate mitochondrial genomes, can accelerate compensatory adaptive evolution in nDNA-encoded genes. We combine phylogenetic information and the 3D structure of the cytochrome c oxidase (COX) complex to test for accelerated compensatory changes among interacting sites. Physical interactions among mtDNA- and nDNA-encoded components are critical in COX evolution; amino acids in close physical proximity in the 3D structure show a strong tendency for correlated evolution among lineages. Only nuclear-encoded components of COX show evidence for positive selection and adaptive nDNA-encoded changes tend to follow mtDNA-encoded amino acid changes at nearby sites in the 3D structure. This bias in the temporal order of substitutions supports compensatory weak selection as a major factor in accelerated primate COX evolution.
Collapse
Affiliation(s)
- Naoki Osada
- Division of Evolutionary Genetics, Department of Population Genetics, National Institute of Genetics, Mishima, Japan.
| | | |
Collapse
|