1
|
Gutiérrez-Valencia J, Zervakis PI, Postel Z, Fracassetti M, Losvik A, Mehrabi S, Bunikis I, Soler L, Hughes PW, Désamoré A, Laenen B, Abdelaziz M, Pettersson OV, Arroyo J, Slotte T. Genetic causes and genomic consequences of breakdown of distyly in Linum trigynum. Mol Biol Evol 2024:msae087. [PMID: 38709782 DOI: 10.1093/molbev/msae087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Revised: 03/22/2024] [Accepted: 04/29/2024] [Indexed: 05/08/2024] Open
Abstract
Distyly is an iconic floral polymorphism governed by a supergene, which promotes efficient pollen transfer and outcrossing through reciprocal differences in the position of sexual organs in flowers, often coupled with heteromorphic self-incompatibility (SI). Distyly has evolved convergently in multiple flowering plant lineages, but has also broken down repeatedly, often resulting in homostylous, self-compatible populations with elevated rates of self-fertilization. Here, we aimed to study the genetic causes and genomic consequences of the shift to homostyly in Linum trigynum, which is closely related to distylous Linum tenue. Building on a high-quality genome assembly, we show that L. trigynum harbors a genomic region homologous to the dominant haplotype of the distyly supergene conferring long stamens and short styles in L. tenue, suggesting that loss of distyly first occurred in a short-styled individual. In contrast to homostylous Primula and Fagopyrum, L. trigynum harbors no fixed loss-of-function mutations in coding sequences of S-linked distyly candidate genes. Instead, floral gene expression analyses and controlled crosses suggest that mutations downregulating the S-linked LtWDR-44 candidate gene for male SI and/or anther height could underlie homostyly and self-compatibility (SC) in L. trigynum. Population genomic analyses of 224 whole-genome sequences further demonstrate that L. trigynum is highly self-fertilizing, exhibits significantly lower genetic diversity genome-wide, and is experiencing relaxed purifying selection and less frequent positive selection on nonsynonymous mutations relative to L. tenue. Our analyses shed light on the loss of distyly in L. trigynum, and advance our understanding of a common evolutionary transition in flowering plants.
Collapse
Affiliation(s)
- Juanita Gutiérrez-Valencia
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Panagiotis-Ioannis Zervakis
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Zoé Postel
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Marco Fracassetti
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Aleksandra Losvik
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Sara Mehrabi
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Ignas Bunikis
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Lucile Soler
- Department of Medical Biochemistry and Microbiology, Uppsala University, National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - P William Hughes
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Aurélie Désamoré
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Benjamin Laenen
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | | | - Olga Vinnere Pettersson
- Uppsala Genome Center, Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Juan Arroyo
- Department of Plant Biology and Ecology, University of Seville, Seville, Spain
| | - Tanja Slotte
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| |
Collapse
|
2
|
Soni V, Jensen JD. Temporal challenges in detecting balancing selection from population genomic data. G3 (Bethesda) 2024:jkae069. [PMID: 38551137 DOI: 10.1093/g3journal/jkae069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 12/21/2023] [Accepted: 03/19/2024] [Indexed: 04/28/2024]
Abstract
The role of balancing selection in maintaining genetic variation remains an open question in population genetics. Recent years have seen numerous studies identifying candidate loci potentially experiencing balancing selection, most predominantly in human populations. There are however numerous alternative evolutionary processes that may leave similar patterns of variation, thereby potentially confounding inference, and the expected signatures of balancing selection additionally change in a temporal fashion. Here we use forward-in-time simulations to quantify expected statistical power to detect balancing selection using both site frequency spectrum (SFS)- and linkage disequilibrium (LD)-based methods under a variety of evolutionarily realistic null models. We find that whilst SFS-based methods have little power immediately after a balanced mutation begins segregating, power increases with time since the introduction of the balanced allele. Conversely, LD-based methods have considerable power whilst the allele is young, and power dissipates rapidly as the time since introduction increases. Taken together, this suggests that SFS-based methods are most effective at detecting long-term balancing selection (>25N generations since the introduction of the balanced allele) whilst LD-based methods are effective over much shorter timescales (<1N generations), thereby leaving a large time frame over which current methods have little power to detect the action of balancing selection. Finally, we investigate the extent to which alternative evolutionary processes may mimic these patterns, and demonstrate the need for caution in attempting to distinguish the signatures of balancing selection from those of both neutral processes (e.g., population structure and admixture) as well as of alternative selective processes (e.g., partial selective sweeps).
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, 85281, USA
| | - Jeffrey D Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine, Tempe, Arizona, 85281, USA
| |
Collapse
|
3
|
Soni V, Pfeifer SP, Jensen JD. The Effects of Mutation and Recombination Rate Heterogeneity on the Inference of Demography and the Distribution of Fitness Effects. Genome Biol Evol 2024; 16:evae004. [PMID: 38207127 PMCID: PMC10834165 DOI: 10.1093/gbe/evae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/12/2023] [Accepted: 01/07/2024] [Indexed: 01/13/2024] Open
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavor; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modeled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination before utilizing population genomic data to quantify the effects of genetic drift (i.e. as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modeled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
4
|
Wong HWS, Holman L. Pleiotropic fitness effects across sexes and ages in the Drosophila genome and transcriptome. Evolution 2023; 77:2642-2655. [PMID: 37738246 DOI: 10.1093/evolut/qpad163] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Revised: 08/20/2023] [Accepted: 09/19/2023] [Indexed: 09/24/2023]
Abstract
Selection varies between categories of individuals, with far-reaching ramifications: Sex-specific selection can impede or accelerate adaptation, and differences in selection between young and old individuals are ultimately responsible for senescence. Here, we measure early- and late-life fitness in adults of both sexes from the Drosophila genetic reference panel and perform quantitative genetic and transcriptomic analyses. Fitness was heritable, showed positive pleiotropy across sexes and age classes, and appeared to be influenced by very large numbers of loci with small effects plus a smaller number with moderate effects. Most loci affected male and female fitness in the same direction; relatively few candidate sexually antagonistic loci were found, though these were enriched on the X chromosome as predicted by theory. The expression level of many genes showed an opposite correlation with fitness in males and females, consistent with unresolved sexual conflict over transcription. The load of deleterious mutations correlated negatively with fitness across genotypes, and we found some evidence for the mutation accumulation (but not the antagonistic pleiotropy) theory of aging.
Collapse
Affiliation(s)
- Heidi W S Wong
- School of Biosciences, University of Melbourne, Parkville, VIC, Australia
| | - Luke Holman
- School of Applied Sciences, Edinburgh Napier University, Edinburgh, United Kingdom
| |
Collapse
|
5
|
Kyriazis CC, Robinson JA, Lohmueller KE. Using Computational Simulations to Model Deleterious Variation and Genetic Load in Natural Populations. Am Nat 2023; 202:737-752. [PMID: 38033186 PMCID: PMC10897732 DOI: 10.1086/726736] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
AbstractDeleterious genetic variation is abundant in wild populations, and understanding the ecological and conservation implications of such variation is an area of active research. Genomic methods are increasingly used to quantify the impacts of deleterious variation in natural populations; however, these approaches remain limited by an inability to accurately predict the selective and dominance effects of mutations. Computational simulations of deleterious variation offer a complementary tool that can help overcome these limitations, although such approaches have yet to be widely employed. In this perspective article, we aim to encourage ecological and conservation genomics researchers to adopt greater use of computational simulations to aid in deepening our understanding of deleterious variation in natural populations. We first provide an overview of the components of a simulation of deleterious variation, describing the key parameters involved in such models. Next, we discuss several approaches for validating simulation models. Finally, we compare and validate several recently proposed deleterious mutation models, demonstrating that models based on estimates of selection parameters from experimental systems are biased toward highly deleterious mutations. We describe a new model that is supported by multiple orthogonal lines of evidence and provide example scripts for implementing this model (https://github.com/ckyriazis/simulations_review).
Collapse
|
6
|
Soni V, Pfeifer SP, Jensen JD. The effects of mutation and recombination rate heterogeneity on the inference of demography and the distribution of fitness effects. bioRxiv 2023:2023.11.11.566703. [PMID: 38014252 PMCID: PMC10680612 DOI: 10.1101/2023.11.11.566703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Susanne P. Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| |
Collapse
|
7
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. Evolution 2023; 77:2113-2127. [PMID: 37395482 PMCID: PMC10547124 DOI: 10.1093/evolut/qpad120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 06/15/2023] [Accepted: 06/30/2023] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modeled by a realistic mutation rate and as part of a realistic distribution of fitness effects, as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modeled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false-positive rates are in excess of true-positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, United States
| |
Collapse
|
8
|
Abstract
AbstractEvolutionary biologists have thought about the role of genetic variation during adaptation for a very long time-before we understood the organization of the genetic code, the provenance of genetic variation, and how such variation influenced the phenotypes on which natural selection acts. Half a century after the discovery of the structure of DNA and the unraveling of the genetic code, we have a rich understanding of these problems and the means to both delve deeper and widen our perspective across organisms and natural populations. The 2022 Vice Presidential Symposium of the American Society of Naturalists highlighted examples of recent insights into the role of genetic variation in adaptive processes, which are compiled in this special section. The work was conducted in different parts of the world, included theoretical and empirical studies with diverse organisms, and addressed distinct aspects of how genetic variation influences adaptation. In our introductory article to the special section, we discuss some important recent insights about the generation and maintenance of genetic variation, its impacts on phenotype and fitness, its fate in natural populations, and its role in driving adaptation. By placing the special section articles in the broader context of recent developments, we hope that this overview will also serve as a useful introduction to the field.
Collapse
|
9
|
Charmouh AP, Bocedi G, Hartfield M. Inferring the distributions of fitness effects and proportions of strongly deleterious mutations. G3 (Bethesda) 2023; 13:jkad140. [PMID: 37337692 PMCID: PMC10468728 DOI: 10.1093/g3journal/jkad140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 06/05/2023] [Accepted: 06/05/2023] [Indexed: 06/21/2023]
Abstract
The distribution of fitness effects is a key property in evolutionary genetics as it has implications for several evolutionary phenomena including the evolution of sex and mating systems, the rate of adaptive evolution, and the prevalence of deleterious mutations. Despite the distribution of fitness effects being extensively studied, the effects of strongly deleterious mutations are difficult to infer since such mutations are unlikely to be present in a sample of haplotypes, so genetic data may contain very little information about them. Recent work has attempted to correct for this issue by expanding the classic gamma-distributed model to explicitly account for strongly deleterious mutations. Here, we use simulations to investigate one such method, adding a parameter (plth) to capture the proportion of strongly deleterious mutations. We show that plth can improve the model fit when applied to individual species but underestimates the true proportion of strongly deleterious mutations. The parameter can also artificially maximize the likelihood when used to jointly infer a distribution of fitness effects from multiple species. As plth and related parameters are used in current inference algorithms, our results are relevant with respect to avoiding model artifacts and improving future tools for inferring the distribution of fitness effects.
Collapse
Affiliation(s)
- Anders P Charmouh
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
- Bioinformatics Research Centre Aarhus University, University City 81, building 1872, 3rd floor. DK-8000 Aarhus C, Denmark
| | - Greta Bocedi
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
| | - Matthew Hartfield
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
10
|
Plavskin Y, de Biase MS, Ziv N, Janská L, Zhu YO, Hall DW, Schwarz RF, Tranchina D, Siegal M. Spontaneous single-nucleotide substitutions and microsatellite mutations have distinct distributions of fitness effects: Distributions of fitness effects of spontaneous mutations. bioRxiv 2023:2023.07.04.547687. [PMID: 37461506 PMCID: PMC10349969 DOI: 10.1101/2023.07.04.547687] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/28/2023]
Abstract
The fitness effects of new mutations determine key properties of evolutionary processes. Beneficial mutations drive evolution, yet selection is also shaped by the frequency of small-effect deleterious mutations, whose combined effect can burden otherwise adaptive lineages and alter evolutionary trajectories and outcomes in clonally evolving organisms such as viruses, microbes, and tumors. The small effect sizes of these important mutations have made accurate measurements of their rates difficult. In microbes, assessing the effect of mutations on growth can be especially instructive, as this complex phenotype is closely linked to fitness in clonally evolving organisms. Here, we perform high-throughput time-lapse microscopy on cells from mutation-accumulation strains to precisely infer the distribution of mutational effects on growth rate in the budding yeast, Saccharomyces cerevisiae. We show that mutational effects on growth rate are overwhelmingly negative, highly skewed towards very small effect sizes, and frequent enough to suggest that deleterious hitchhikers may impose a significant burden on evolving lineages. By using lines that accumulated mutations in either wild-type or slippage repair-defective backgrounds, we further disentangle the effects of two common types of mutations, single-nucleotide substitutions and simple sequence repeat indels, and show that they have distinct effects on yeast growth rate. Although the average effect of a simple sequence repeat mutation is very small (~0.3%), many do alter growth rate, implying that this class of frequent mutations has an important evolutionary impact.
Collapse
Affiliation(s)
- Yevgeniy Plavskin
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Maria Stella de Biase
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin 10115, Germany
- Humboldt-Universität zu Berlin, Department of Biology, Berlin 10099, Germany
| | - Naomi Ziv
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
- Current address: Shmunis School of Biomedical and Cancer Research, Tel Aviv University, Ramat Aviv, Israel
| | - Libuše Janská
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| | - Yuan O. Zhu
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Department of Biology, Stanford University, Stanford, CA 94305, USA
| | - David W. Hall
- Department of Genetics, University of Georgia, Athens, GA 30602, USA
| | - Roland F Schwarz
- Berlin Institute for Medical Systems Biology, Max Delbrück Center for Molecular Medicine in the Helmholtz Association, Berlin 10115, Germany
- Institute for Computational Cancer Biology, Center for Integrated Oncology (CIO), Cancer Research Center Cologne Essen (CCCE), Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne 50937, Germany
- Berlin Institute for the Foundations of Learning and Data (BIFOLD), Berlin 10623, Germany
| | - Daniel Tranchina
- Department of Biology, New York University, New York, NY 10003, USA
- Courant Math Institute, New York University, New York, NY 1003, USA
| | - Mark Siegal
- Center for Genomics and Systems Biology, New York University, New York, NY 10003, USA
- Department of Biology, New York University, New York, NY 10003, USA
| |
Collapse
|
11
|
Wade EE, Kyriazis CC, Cavassim MIA, Lohmueller KE. Quantifying the fraction of new mutations that are recessive lethal. Evolution 2023; 77:1539-1549. [PMID: 37074880 PMCID: PMC10309970 DOI: 10.1093/evolut/qpad061] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 03/21/2023] [Accepted: 04/14/2023] [Indexed: 04/20/2023]
Abstract
The presence and impact of recessive lethal mutations have been widely documented in diploid outcrossing species. However, precise estimates of the proportion of new mutations that are recessive lethal remain limited. Here, we evaluate the performance of Fit∂a∂i, a commonly used method for inferring the distribution of fitness effects (DFE), in the presence of lethal mutations. Using simulations, we demonstrate that in both additive and recessive cases, inference of the deleterious nonlethal portion of the DFE is minimally affected by a small proportion (<10%) of lethal mutations. Additionally, we demonstrate that while Fit∂a∂i cannot estimate the fraction of recessive lethal mutations, Fit∂a∂i can accurately infer the fraction of additive lethal mutations. Finally, as an alternative approach to estimate the proportion of mutations that are recessive lethal, we employ models of mutation-selection-drift balance using existing genomic parameters and estimates of segregating recessive lethals for humans and Drosophila melanogaster. In both species, the segregating recessive lethal load can be explained by a very small fraction (<1%) of new nonsynonymous mutations being recessive lethal. Our results refute recent assertions of a much higher proportion of mutations being recessive lethal (4%-5%), while highlighting the need for additional information on the joint distribution of selection and dominance coefficients.
Collapse
Affiliation(s)
- Emma E Wade
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, United States
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
| | - Maria Izabel A Cavassim
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
- Interdepartmental Program in Bioinformatics, University of California–Los Angeles, Los Angeles, CA, United States
- Department of Human Genetics, David Geffen School of Medicine, University of California–Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
12
|
Vihinen M. Nonsynonymous Synonymous Variants Demand for a Paradigm Shift in Genetics. Curr Genomics 2023; 24:18-23. [PMID: 37920730 PMCID: PMC10334700 DOI: 10.2174/1389202924666230417101020] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2022] [Revised: 02/20/2023] [Accepted: 03/01/2023] [Indexed: 11/04/2023] Open
Abstract
Synonymous (also known as silent) variations are by definition not considered to change the coded protein. Still many variations in this category affect either protein abundance or properties. As this situation is confusing, we have recently introduced systematics for synonymous variations and those that may on the surface look like synonymous, but these may affect the coded protein in various ways. A new category, unsense variation, was introduced to describe variants that do not introduce a stop codon into the variation site, but which lead to different types of changes in the coded protein. Many of these variations lead to mRNA degradation and missing protein. Here, consequences of the systematics are discussed from the perspectives of variation annotation and interpretation, evolutionary calculations, nonsynonymous-to-synonymous substitution rates, phylogenetics and other evolutionary inferences that are based on the principle of (nearly) neutral synonymous variations. It may be necessary to reassess published results. Further, databases for synonymous variations and prediction methods for such variations should consider unsense variations. Thus, there is a need to evaluate and reflect principles of numerous aspects in genetics, ranging from variation naming and classification to evolutionary calculations.
Collapse
Affiliation(s)
- Mauno Vihinen
- Department of Experimental Medical Science, Lund University, Lund, BMC B13, Sweden
| |
Collapse
|
13
|
Soni V, Johri P, Jensen JD. Evaluating power to detect recurrent selective sweeps under increasingly realistic evolutionary null models. bioRxiv 2023:2023.06.15.545166. [PMID: 37398347 PMCID: PMC10312679 DOI: 10.1101/2023.06.15.545166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
The detection of selective sweeps from population genomic data often relies on the premise that the beneficial mutations in question have fixed very near the sampling time. As it has been previously shown that the power to detect a selective sweep is strongly dependent on the time since fixation as well as the strength of selection, it is naturally the case that strong, recent sweeps leave the strongest signatures. However, the biological reality is that beneficial mutations enter populations at a rate, one that partially determines the mean wait time between sweep events and hence their age distribution. An important question thus remains about the power to detect recurrent selective sweeps when they are modelled by a realistic mutation rate and as part of a realistic distribution of fitness effects (DFE), as opposed to a single, recent, isolated event on a purely neutral background as is more commonly modelled. Here we use forward-in-time simulations to study the performance of commonly used sweep statistics, within the context of more realistic evolutionary baseline models incorporating purifying and background selection, population size change, and mutation and recombination rate heterogeneity. Results demonstrate the important interplay of these processes, necessitating caution when interpreting selection scans; specifically, false positive rates are in excess of true positive across much of the evaluated parameter space, and selective sweeps are often undetectable unless the strength of selection is exceptionally strong. Teaser Text Outlier-based genomic scans have proven a popular approach for identifying loci that have potentially experienced recent positive selection. However, it has previously been shown that an evolutionarily appropriate baseline model that incorporates non-equilibrium population histories, purifying and background selection, and variation in mutation and recombination rates is necessary to reduce often extreme false positive rates when performing genomic scans. Here we evaluate the power to detect recurrent selective sweeps using common SFS-based and haplotype-based methods under these increasingly realistic models. We find that while these appropriate evolutionary baselines are essential to reduce false positive rates, the power to accurately detect recurrent selective sweeps is generally low across much of the biologically relevant parameter space.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
- Present address: Department of Biology, Department of Genetics, University of North Carolina, Chapel Hill, NC, USA
| | | |
Collapse
|
14
|
Sane M, Diwan GD, Bhat BA, Wahl LM, Agashe D. Shifts in mutation spectra enhance access to beneficial mutations. Proc Natl Acad Sci U S A 2023; 120:e2207355120. [PMID: 37216547 PMCID: PMC10235995 DOI: 10.1073/pnas.2207355120] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2022] [Accepted: 03/27/2023] [Indexed: 05/24/2023] Open
Abstract
Biased mutation spectra are pervasive, with wide variation in the magnitude of mutational biases that influence genome evolution and adaptation. How do such diverse biases evolve? Our experiments show that changing the mutation spectrum allows populations to sample previously undersampled mutational space, including beneficial mutations. The resulting shift in the distribution of fitness effects is advantageous: Beneficial mutation supply and beneficial pleiotropy both increase, while deleterious load reduces. More broadly, simulations indicate that reducing or reversing the direction of a long-term bias is always selectively favored. Such changes in mutation bias can occur easily via altered function of DNA repair genes. A phylogenetic analysis shows that these genes are repeatedly gained and lost in bacterial lineages, leading to frequent bias shifts in opposite directions. Thus, shifts in mutation spectra may evolve under selection and can directly alter the outcome of adaptive evolution by facilitating access to beneficial mutations.
Collapse
Affiliation(s)
- Mrudula Sane
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru560065, India
| | - Gaurav D. Diwan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru560065, India
- Bioquant, University of Heidelberg,69120Heidelberg, Germany
- Heidelberg University Biochemistry Center (BZH), 69120Heidelberg, Germany
| | - Bhoomika A. Bhat
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru560065, India
- Undergraduate Programme, Indian Institute of Science, Bengaluru 560012, India
| | - Lindi M. Wahl
- Mathematics, Western University, London, ON, N6A 5B7, Canada
| | - Deepa Agashe
- National Centre for Biological Sciences, Tata Institute of Fundamental Research, Bengaluru560065, India
| |
Collapse
|
15
|
Johri P, Pfeifer SP, Jensen JD. Developing an evolutionary baseline model for humans: jointly inferring purifying selection with population history. Mol Biol Evol 2023; 40:7147633. [PMID: 37128989 DOI: 10.1093/molbev/msad100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 04/11/2023] [Accepted: 04/27/2023] [Indexed: 05/03/2023] Open
Abstract
Building evolutionarily appropriate baseline models for natural populations is not only important for answering fundamental questions in population genetics - including quantifying the relative contributions of adaptive vs. non-adaptive processes - but it is also essential for identifying candidate loci experiencing relatively rare and episodic forms of selection (e.g., positive or balancing selection). Here, a baseline model was developed for a human population of West African ancestry, the Yoruba, comprising processes constantly operating on the genome (i.e., purifying and background selection, population size changes, recombination rate heterogeneity, and gene conversion). Specifically, to perform joint inference of selective effects with demography, an approximate Bayesian approach was employed that utilizes the decay of background selection effects around functional elements, taking into account genomic architecture. This approach inferred a recent 6-fold population growth together with a distribution of fitness effects that is skewed towards effectively neutral mutations. Importantly, these results further suggest that, while strong and/or frequent recurrent positive selection is inconsistent with observed data, weak to moderate positive selection is consistent but unidentifiable if rare.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | | | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
16
|
Dapa T, Wong DP, Vasquez KS, Xavier KB, Huang KC, Good BH. Within-host evolution of the gut microbiome. Curr Opin Microbiol 2023; 71:102258. [PMID: 36608574 PMCID: PMC9993085 DOI: 10.1016/j.mib.2022.102258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2022] [Revised: 12/05/2022] [Accepted: 12/05/2022] [Indexed: 01/06/2023]
Abstract
Gut bacteria inhabit a complex environment that is shaped by interactions with their host and the other members of the community. While these ecological interactions have evolved over millions of years, mounting evidence suggests that gut commensals can evolve on much shorter timescales as well, by acquiring new mutations within individual hosts. In this review, we highlight recent progress in understanding the causes and consequences of short-term evolution in the mammalian gut, from experimental evolution in murine hosts to longitudinal tracking of human cohorts. We also discuss new opportunities for future progress by expanding the repertoire of focal species, hosts, and surrounding communities, and by combining deep-sequencing technologies with quantitative frameworks from population genetics.
Collapse
Affiliation(s)
- Tanja Dapa
- Instituto Gulbenkian de Ciência, 2780-156 Oeiras, Portugal
| | - Daniel Pgh Wong
- Department of Applied Physics, Stanford University, Stanford, CA 94305, USA
| | - Kimberly S Vasquez
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | | | - Kerwyn Casey Huang
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Bioengineering, Stanford University, Stanford, CA 94305, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| | - Benjamin H Good
- Department of Applied Physics, Stanford University, Stanford, CA 94305, USA.
| |
Collapse
|
17
|
Agarwal I, Fuller ZL, Myers SR, Przeworski M. Relating pathogenic loss-of-function mutations in humans to their evolutionary fitness costs. eLife 2023; 12:83172. [PMID: 36648429 PMCID: PMC9937649 DOI: 10.7554/elife.83172] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 01/16/2023] [Indexed: 01/18/2023] Open
Abstract
Causal loss-of-function (LOF) variants for Mendelian and severe complex diseases are enriched in 'mutation intolerant' genes. We show how such observations can be interpreted in light of a model of mutation-selection balance and use the model to relate the pathogenic consequences of LOF mutations at present to their evolutionary fitness effects. To this end, we first infer posterior distributions for the fitness costs of LOF mutations in 17,318 autosomal and 679 X-linked genes from exome sequences in 56,855 individuals. Estimated fitness costs for the loss of a gene copy are typically above 1%; they tend to be largest for X-linked genes, whether or not they have a Y homolog, followed by autosomal genes and genes in the pseudoautosomal region. We compare inferred fitness effects for all possible de novo LOF mutations to those of de novo mutations identified in individuals diagnosed with one of six severe, complex diseases or developmental disorders. Probands carry an excess of mutations with estimated fitness effects above 10%; as we show by simulation, when sampled in the population, such highly deleterious mutations are typically only a couple of generations old. Moreover, the proportion of highly deleterious mutations carried by probands reflects the typical age of onset of the disease. The study design also has a discernible influence: a greater proportion of highly deleterious mutations is detected in pedigree than case-control studies, and for autism, in simplex than multiplex families and in female versus male probands. Thus, anchoring observations in human genetics to a population genetic model allows us to learn about the fitness effects of mutations identified by different mapping strategies and for different traits.
Collapse
Affiliation(s)
- Ipsita Agarwal
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Statistics, University of OxfordOxfordUnited Kingdom
| | - Zachary L Fuller
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
| | - Simon R Myers
- Department of Statistics, University of OxfordOxfordUnited Kingdom
- The Wellcome Centre for Human Genetics, University of OxfordOxfordUnited Kingdom
| | - Molly Przeworski
- Department of Biological Sciences, Columbia UniversityNew YorkUnited States
- Department of Systems Biology, Columbia UniversityNew YorkUnited States
| |
Collapse
|
18
|
Abstract
Gene-by-environment interactions play a crucial role in horizontal gene transfer by affecting how the transferred genes alter host fitness. However, how the environment modulates the fitness effect of transferred genes has not been tested systematically in an experimental study. We adapted a high-throughput technique for obtaining very precise estimates of bacterial fitness, in order to measure the fitness effects of 44 orthologs transferred from Salmonella Typhimurium to Escherichia coli in six physiologically relevant environments. We found that the fitness effects of individual genes were highly dependent on the environment, while the distributions of fitness effects across genes were not, with all tested environments resulting in distributions of same shape and spread. Furthermore, the extent to which the fitness effects of a gene varied between environments depended on the average fitness effect of that gene across all environments, with nearly neutral and nearly lethal genes having more consistent fitness effects across all environments compared to deleterious genes. Put together, our results reveal the unpredictable nature of how environmental conditions impact the fitness effects of each individual gene. At the same time, distributions of fitness effects across environments exhibit consistent features, pointing to the generalizability of factors that shape horizontal gene transfer of orthologous genes.
Collapse
Affiliation(s)
- Hande Acar Kirit
- Veterinary and Ecological Sciences, Institute of Infection, University of Liverpool, Liverpool, Merseyside, United Kingdom
- Laboratories of Molecular Anthropology and Microbiome Research, University of Oklahoma, Norman, OK
- Department of Anthropology, University of Oklahoma, Norman, OK
| | - Jonathan P Bollback
- Veterinary and Ecological Sciences, Institute of Infection, University of Liverpool, Liverpool, Merseyside, United Kingdom
| | - Mato Lagator
- School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, United Kingdom
| |
Collapse
|
19
|
Das SG, Krug J. Unpredictable repeatability in molecular evolution. Proc Natl Acad Sci U S A 2022; 119:e2209373119. [PMID: 36122210 DOI: 10.1073/pnas.2209373119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022] Open
Abstract
The extent of parallel evolution at the genotypic level is quantitatively linked to the distribution of beneficial fitness effects (DBFE) of mutations. The standard view, based on light-tailed distributions (i.e., distributions with finite moments), is that the probability of parallel evolution in duplicate populations is inversely proportional to the number of available mutations and, moreover, that the DBFE is sufficient to determine the probability when the number of available mutations is large. Here, we show that when the DBFE is heavy-tailed, as found in several recent experiments, these expectations are defied. The probability of parallel evolution decays anomalously slowly in the number of mutations or even becomes independent of it, implying higher repeatability of evolution. At the same time, the probability of parallel evolution is non-self-averaging—that is, it does not converge to its mean value, even when a large number of mutations are involved. This behavior arises because the evolutionary process is dominated by only a few mutations of high weight. Consequently, the probability varies widely across systems with the same DBFE. Contrary to the standard view, the DBFE is no longer sufficient to determine the extent of parallel evolution, making it much less predictable. We illustrate these ideas theoretically and through analysis of empirical data on antibiotic-resistance evolution.
Collapse
|
20
|
Abstract
Whole-genome duplications (WGDs) have occurred in many eukaryotic lineages. However, the underlying evolutionary forces and molecular mechanisms responsible for the long-term retention of gene duplicates created by WGDs are not well understood. We employ a population-genomic approach to understand the selective forces acting on paralogs and investigate ongoing duplicate-gene loss in multiple species of Paramecium that share an ancient WGD. We show that mutations that abolish protein function are more likely to be segregating in retained WGD paralogs than in single-copy genes, most likely because of ongoing nonfunctionalization post-WGD. This relaxation of purifying selection occurs in only one WGD paralog, accompanied by the gradual fixation of nonsynonymous mutations and reduction in levels of expression, and occurs over a long period of evolutionary time, “marking” one locus for future loss. Concordantly, the fitness effects of new nonsynonymous mutations and frameshift-causing indels are significantly more deleterious in the highly expressed copy compared with their paralogs with lower expression. Our results provide a novel mechanistic model of gene duplicate loss following WGDs, wherein selection acts on the sum of functional activity of both duplicate genes, allowing the two to wander in expression and functional space, until one duplicate locus eventually degenerates enough in functional efficiency or expression that its contribution to total activity is too insignificant to be retained by purifying selection. Retention of duplicates by such mechanisms predicts long times to duplicate-gene loss, which should not be falsely attributed to retention due to gain/change in function.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ 85287, USA
| | - Jean-Francois Gout
- Department of Biological Sciences, Mississippi State University, Mississippi State, MS 39762, USA
| | - Thomas G Doak
- Department of Biology, Indiana University, Bloomington, IN 47405, USA.,National Center for Genome Analysis Support, Indiana University, Bloomington, IN 47405, USA
| | - Michael Lynch
- Center for Mechanisms of Evolution, The Biodesign Institute, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
21
|
Liang YY, Chen XY, Zhou BF, Mitchell-Olds T, Wang B. Globally Relaxed Selection and Local Adaptation in Boechera stricta. Genome Biol Evol 2022; 14:evac043. [PMID: 35349686 PMCID: PMC9011030 DOI: 10.1093/gbe/evac043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2022] [Indexed: 11/25/2022] Open
Abstract
The strength of selection varies among populations and across the genome, but the determinants of efficacy of selection remain unclear. In this study, we used whole-genome sequencing data from 467 Boechera stricta accessions to quantify the strength of selection and characterize the pattern of local adaptation. We found low genetic diversity on 0-fold degenerate sites and conserved non-coding sites, indicating functional constraints on these regions. The estimated distribution of fitness effects and the proportion of fixed substitutions suggest relaxed negative and positive selection in B. stricta. Among the four population groups, the NOR and WES groups have smaller effective population size (Ne), higher proportions of effectively neutral sites, and lower rates of adaptive evolution compared with UTA and COL groups, reflecting the effect of Ne on the efficacy of natural selection. We also found weaker selection on GC-biased sites compared with GC-conservative (unbiased) sites, suggested that GC-biased gene conversion has affected the strength of selection in B. stricta. We found mixed evidence for the role of the recombination rate on the efficacy of selection. The positive and negative selection was stronger in high-recombination regions compared with low-recombination regions in COL but not in other groups. By scanning the genome, we found different subsets of selected genes suggesting differential adaptation among B. stricta groups. These results show that differences in effective population size, nucleotide composition, and recombination rate are important determinants of the efficacy of selection. This study enriches our understanding of the roles of natural selection and local adaptation in shaping genomic variation.
Collapse
Affiliation(s)
- Yi-Ye Liang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Xue-Yan Chen
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Biao-Feng Zhou
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | | | - Baosheng Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, China
| |
Collapse
|
22
|
Chen J, Bataillon T, Glémin S, Lascoux M. What does the distribution of fitness effects of new mutations reflect? Insights from plants. New Phytol 2022; 233:1613-1619. [PMID: 34704271 DOI: 10.1111/nph.17826] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 09/28/2021] [Indexed: 06/13/2023]
Abstract
The distribution of fitness effects (DFE) of new mutations plays a central role in molecular evolution. It is therefore crucial to be able to estimate it accurately from genomic data and to understand the factors that shape it. After a rapid overview of available methods to characterize the fitness effects of mutations, we review what is known on the factors affecting them in plants. Available data indicate that life history traits (e.g. mating system and longevity) have a major effect on the DFE. By contrast, the impact of demography within species appears to be more limited. These results remain to be confirmed, and methods to estimate the joint evolution of demography, life history traits, and the DFE need to be developed.
Collapse
Affiliation(s)
- Jun Chen
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, C.F. Möllers Allé 8, Aarhus C, DK-8000, Denmark
| | - Sylvain Glémin
- Centre National de la Recherche Scientifique (CNRS), ECOBIO (Ecosystèmes, Biodiversité, Evolution) - Unité Mixte de Recherche (UMR) 6553, Université de Rennes, Rennes, F-35000, France
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| | - Martin Lascoux
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| |
Collapse
|
23
|
Abstract
Predicting fitness in natural populations is a major challenge in biology. It may be possible to leverage fast-accumulating genomic data sets to infer the fitness effects of mutant alleles, allowing evolutionary questions to be addressed in any organism. In this paper, we investigate the utility of one such tool, called PROVEAN. This program compares a query sequence with existing data to provide an alignment-based score for any protein variant, with scores categorized as neutral or deleterious based on a pre-set threshold. PROVEAN has been used widely in evolutionary studies, for example, to estimate mutation load in natural populations, but has not been formally tested as a predictor of aggregate mutational effects on fitness. Using three large published data sets on the genome sequences of laboratory mutation accumulation lines, we assessed how well PROVEAN predicted the actual fitness patterns observed, relative to other metrics. In most cases, we find that a simple count of the total number of mutant proteins is a better predictor of fitness than the number of proteins with variants scored as deleterious by PROVEAN. We also find that the sum of all mutant protein scores explains variation in fitness better than the number of mutant proteins in one of the data sets. We discuss the implications of these results for studies of populations in the wild.
Collapse
Affiliation(s)
- Linnea Sandell
- Department of Zoology, University of British Columbia, Vancouver, British Columbia, Canada
- Systematic Biology, Department of Organismal Biology, Uppsala University, Sweden
| | | |
Collapse
|
24
|
Horvath R, Josephs EB, Pesquet E, Stinchcombe JR, Wright SI, Scofield D, Slotte T. Selection on Accessible Chromatin Regions in Capsella grandiflora. Mol Biol Evol 2021; 38:5563-5575. [PMID: 34498072 PMCID: PMC8662636 DOI: 10.1093/molbev/msab270] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.
Collapse
Affiliation(s)
- Robert Horvath
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Emily B Josephs
- Department of Plant Biology, Michigan State University, Lansing, MI, USA
| | - Edouard Pesquet
- Department of Ecology, Environment and Plant Sciences, Stockholm University, Stockholm, Sweden
| | - John R Stinchcombe
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Stephen I Wright
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Douglas Scofield
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| | - Tanja Slotte
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| |
Collapse
|
25
|
Gilbert KJ, Zdraljevic S, Cook DE, Cutter AD, Andersen EC, Baer CF. The distribution of mutational effects on fitness in Caenorhabditis elegans inferred from standing genetic variation. Genetics 2021; 220:6383146. [PMID: 34791202 DOI: 10.1093/genetics/iyab166] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 09/27/2021] [Indexed: 11/14/2022] Open
Abstract
The distribution of fitness effects (DFE) for new mutations is one of the most theoretically important but difficult to estimate properties in population genetics. A crucial challenge to inferring the DFE from natural genetic variation is the sensitivity of the site frequency spectrum to factors like population size change, population substructure, genome structure, and nonrandom mating. Although inference methods aim to control for population size changes, the influence of nonrandom mating remains incompletely understood, despite being a common feature of many species. We report the DFE estimated from 326 genomes of Caenorhabditis elegans, a nematode roundworm with a high rate of self-fertilization. We evaluate the robustness of DFE inferences using simulated data that mimics the genomic structure and reproductive life history of C. elegans. Our observations demonstrate how the combined influence of self-fertilization, genome structure, and natural selection on linked sites can conspire to compromise estimates of the DFE from extant polymorphisms with existing methods. These factors together tend to bias inferences toward weakly deleterious mutations, making it challenging to have full confidence in the inferred DFE of new mutations as deduced from standing genetic variation in species like C. elegans. Improved methods for inferring the DFE are needed to appropriately handle strong linked selection and selfing. These results highlight the importance of understanding the combined effects of processes that can bias our interpretations of evolution in natural populations.
Collapse
Affiliation(s)
| | - Stefan Zdraljevic
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA.,Department of Human Genetics, Department of Biological Chemistry, and Howard Hughes Medical Institute, University of California, Los Angeles, CA 90095, USA
| | - Daniel E Cook
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Asher D Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON M5S 3B2, Canada
| | - Erik C Andersen
- Department of Molecular Biosciences, Northwestern University, Evanston, IL 60208, USA
| | - Charles F Baer
- Department of Biology, University of Florida, Gainesville, FL 32611-8525, USA.,University of Florida Genetics Institute, Gainesville, FL 32611, USA
| |
Collapse
|
26
|
Desbiez-Piat A, Le Rouzic A, Tenaillon MI, Dillmann C. Interplay between extreme drift and selection intensities favors the fixation of beneficial mutations in selfing maize populations. Genetics 2021; 219:6339583. [PMID: 34849881 DOI: 10.1093/genetics/iyab123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 07/21/2021] [Indexed: 11/13/2022] Open
Abstract
Population and quantitative genetic models provide useful approximations to predict long-term selection responses sustaining phenotypic shifts, and underlying multilocus adaptive dynamics. Valid across a broad range of parameters, their use for understanding the adaptive dynamics of small selfing populations undergoing strong selection intensity (thereafter High Drift-High selection regime, HDHS) remains to be explored. Saclay Divergent Selection Experiments (DSEs) on maize flowering time provide an interesting example of populations evolving under HDHS, with significant selection responses over 20 generations in two directions. We combined experimental data from Saclay DSEs, forward individual-based simulations, and theoretical predictions to dissect the evolutionary mechanisms at play in the observed selection responses. We asked two main questions: How do mutations arise, spread, and reach fixation in populations evolving under HDHS? How does the interplay between drift and selection influence observed phenotypic shifts? We showed that the long-lasting response to selection in small populations is due to the rapid fixation of mutations occurring during the generations of selection. Among fixed mutations, we also found a clear signal of enrichment for beneficial mutations revealing a limited cost of selection. Both environmental stochasticity and variation in selection coefficients likely contributed to exacerbate mutational effects, thereby facilitating selection grasp and fixation of small-effect mutations. Together our results highlight that despite a small number of polymorphic loci expected under HDHS, adaptive variation is continuously fueled by a vast mutational target. We discuss our results in the context of breeding and long-term survival of small selfing populations.
Collapse
Affiliation(s)
- Arnaud Desbiez-Piat
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Arnaud Le Rouzic
- Université Paris-Saclay, CNRS, IRD, UMR Évolution, Génomes, Comportement et Écologie, 91120 Gif-sur-Yvette, France
| | - Maud I Tenaillon
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| | - Christine Dillmann
- Université Paris-Saclay, INRAE, CNRS, AgroParisTech, GQE-Le Moulon, 91190 Gif-sur-Yvette, France
| |
Collapse
|
27
|
Deatherage DE, Barrick JE. High-throughput characterization of mutations in genes that drive clonal evolution using multiplex adaptome capture sequencing. Cell Syst 2021; 12:1187-1200.e4. [PMID: 34536379 DOI: 10.1016/j.cels.2021.08.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Revised: 07/14/2021] [Accepted: 08/20/2021] [Indexed: 11/17/2022]
Abstract
Understanding how cells are likely to evolve can guide medical interventions and bioengineering efforts that must contend with unwanted mutations. The adaptome of a cell-the neighborhood of genetic changes that are most likely to drive adaptation in a given environment-can be mapped by tracking rare beneficial variants during the early stages of clonal evolution. We used multiplex adaptome capture sequencing (mAdCap-seq), a procedure that combines unique molecular identifiers and hybridization-based enrichment, to characterize mutations in eight Escherichia coli genes known to be under selection in a laboratory environment. We tracked 301 mutations at frequencies as low as 0.01% and inferred the fitness effects of 240 of these mutations. There were distinct molecular signatures of selection on protein structure and function for the three genes with the most beneficial mutations. Our results demonstrate how mAdCap-seq can be used to deeply profile a targeted portion of a cell's adaptome.
Collapse
Affiliation(s)
- Daniel E Deatherage
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX 78712, USA
| | - Jeffrey E Barrick
- Department of Molecular Biosciences, Center for Systems and Synthetic Biology, The University of Texas at Austin, Austin, TX 78712, USA.
| |
Collapse
|
28
|
Bailey SF, Alonso Morales LA, Kassen R. Effects of synonymous mutations beyond codon bias: The evidence for adaptive synonymous substitutions from microbial evolution experiments. Genome Biol Evol 2021; 13:6300525. [PMID: 34132772 PMCID: PMC8410137 DOI: 10.1093/gbe/evab141] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/10/2021] [Indexed: 12/22/2022] Open
Abstract
Synonymous mutations are often assumed to be neutral with respect to fitness because they do not alter the encoded amino acid and so cannot be 'seen' by natural selection. Yet a growing body of evidence suggests that synonymous mutations can have fitness effects that drive adaptive evolution through their impacts on gene expression and protein folding. Here, we review what microbial experiments have taught us about the contribution of synonymous mutations to adaptation. A survey of site-directed mutagenesis experiments reveals the distributions of fitness effects for nonsynonymous and synonymous mutations are more similar, especially for beneficial mutations, than expected if all synonymous mutations were neutral, suggesting they should drive adaptive evolution more often than is typically observed. A review of experimental evolution studies where synonymous mutations have contributed to adaptation shows they can impact fitness through a range of mechanisms including the creation of illicit RNA polymerase binding sites impacting transcription and changes to mRNA folding stability that modulate translation. We suggest that clonal interference in evolving microbial populations may be the reason synonymous mutations play a smaller role in adaptive evolution than expected based on their observed fitness effects. We finish by discussing the impacts of falsely assuming synonymous mutations are neutral and discuss directions for future work exploring the role of synonymous mutations in adaptive evolution.
Collapse
Affiliation(s)
- Susan F Bailey
- Department of Biology, Clarkson University, Potsdam, NY 13699, USA
| | | | - Rees Kassen
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada
| |
Collapse
|
29
|
Berdan EL, Blanckaert A, Slotte T, Suh A, Westram AM, Fragata I. Unboxing mutations: Connecting mutation types with evolutionary consequences. Mol Ecol 2021; 30:2710-2723. [PMID: 33955064 DOI: 10.1111/mec.15936] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 03/30/2021] [Accepted: 04/20/2021] [Indexed: 01/09/2023]
Abstract
A key step in understanding the genetic basis of different evolutionary outcomes (e.g., adaptation) is to determine the roles played by different mutation types (e.g., SNPs, translocations and inversions). To do this we must simultaneously consider different mutation types in an evolutionary framework. Here, we propose a research framework that directly utilizes the most important characteristics of mutations, their population genetic effects, to determine their relative evolutionary significance in a given scenario. We review known population genetic effects of different mutation types and show how these may be connected to different evolutionary outcomes. We provide examples of how to implement this framework and pinpoint areas where more data, theory and synthesis are needed. Linking experimental and theoretical approaches to examine different mutation types simultaneously is a critical step towards understanding their evolutionary significance.
Collapse
Affiliation(s)
- Emma L Berdan
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | | | - Tanja Slotte
- Department of Ecology, Environment and Plant Sciences, Science for Life Laboratory, Stockholm University, Stockholm, Sweden
| | - Alexander Suh
- School of Biological Sciences - Organisms and the Environment, University of East Anglia, Norwich, UK.,Department of Organismal Biology - Systematic Biology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| | - Anja M Westram
- IST Austria, Klosterneuburg, Austria.,Faculty of Biosciences and Aquaculture, Nord University, Bodø, Norway
| | - Inês Fragata
- cE3c - Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, Lisboa, Portugal
| |
Collapse
|
30
|
Verta JP, Barton HJ, Pritchard V, Primmer CR. Genetic Drift Dominates Genome-Wide Regulatory Evolution Following an Ancient Whole-Genome Duplication in Atlantic Salmon. Genome Biol Evol 2021; 13:evab059. [PMID: 33749748 PMCID: PMC8140206 DOI: 10.1093/gbe/evab059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/17/2021] [Indexed: 11/23/2022] Open
Abstract
Whole-genome duplications (WGD) have been considered as springboards that potentiate lineage diversification through increasing functional redundancy. Divergence in gene regulatory elements is a central mechanism for evolutionary diversification, yet the patterns and processes governing regulatory divergence following events that lead to massive functional redundancy, such as WGD, remain largely unknown. We studied the patterns of divergence and strength of natural selection on regulatory elements in the Atlantic salmon (Salmo salar) genome, which has undergone WGD 100-80 Ma. Using ChIPmentation, we first show that H3K27ac, a histone modification typical to enhancers and promoters, is associated with genic regions, tissue-specific transcription factor binding motifs, and with gene transcription levels in immature testes. Divergence in transcription between duplicated genes from WGD (ohnologs) correlated with difference in the number of proximal regulatory elements, but not with promoter elements, suggesting that functional divergence between ohnologs after WGD is mainly driven by enhancers. By comparing H3K27ac regions between duplicated genome blocks, we further show that a longer polyploid state post-WGD has constrained regulatory divergence. Patterns of genetic diversity across natural populations inferred from resequencing indicate that recent evolutionary pressures on H3K27ac regions are dominated by largely neutral evolution. In sum, our results suggest that post-WGD functional redundancy in regulatory elements continues to have an impact on the evolution of the salmon genome, promoting largely neutral evolution of regulatory elements despite their association with transcription levels. These results highlight a case where genome-wide regulatory evolution following an ancient WGD is dominated by genetic drift.
Collapse
Affiliation(s)
- Jukka-Pekka Verta
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Finland
- Institute of Biotechnology, HiLIFE, University of Helsinki, Finland
| | - Henry J Barton
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Finland
- Institute of Biotechnology, HiLIFE, University of Helsinki, Finland
| | - Victoria Pritchard
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Finland
- Institute of Biotechnology, HiLIFE, University of Helsinki, Finland
| | - Craig R Primmer
- Organismal and Evolutionary Biology Research Programme, University of Helsinki, Finland
- Institute of Biotechnology, HiLIFE, University of Helsinki, Finland
| |
Collapse
|
31
|
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality—that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly-used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increases, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effect as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Kellen Riall
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | - Hannes Becher
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Laurent Excoffier
- Institute of Ecology and Evolution, University of Berne, Berne, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
32
|
Jaramillo-Correa JP, Bagnoli F, Grivet D, Fady B, Aravanopoulos FA, Vendramin GG, González-Martínez SC. Evolutionary rate and genetic load in an emblematic Mediterranean tree following an ancient and prolonged population collapse. Mol Ecol 2020; 29:4797-4811. [PMID: 33063352 DOI: 10.1111/mec.15684] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Revised: 09/25/2020] [Accepted: 09/28/2020] [Indexed: 12/18/2022]
Abstract
Severe bottlenecks significantly diminish the amount of genetic diversity and the speed at which it accumulates (i.e., evolutionary rate). They further compromise the efficiency of natural selection to eliminate deleterious variants, which may reach fixation in the surviving populations. Consequently, expanding and adapting to new environments may pose a significant challenge when strong bottlenecks result in genetic pauperization. Herein, we surveyed the patterns of nucleotide diversity, molecular adaptation and genetic load across 177 gene-loci in a circum-Mediterranean conifer (Pinus pinea L.) that represents one of the most extreme cases of genetic pauperization in widespread outbreeding taxa. We found very little genetic variation in both hypervariable nuclear microsatellites (SSRs) and gene-loci, which translated into genetic diversity estimates one order of magnitude lower than those previously reported for pines. Such values were consistent with a strong population decline that began some ~1 Ma. Comparisons with the related and parapatric maritime pine (Pinus pinaster Ait.) revealed reduced rates of adaptive evolution (α and ωa ) and a significant accumulation of genetic load. It is unlikely that these are the result from differences in mutation rate or linkage disequilibrium between the two species; instead they are the presumable outcome of contrasting demographic histories affecting both the speed at which these taxa accumulate genetic diversity, and the global efficacy of selection. Future studies, and programs for conservation and management, should thus start testing for the effects of genetic load on fitness, and integrating such effects into predictive models.
Collapse
Affiliation(s)
- Juan P Jaramillo-Correa
- Department of Evolutionary Ecology, Instituto de Ecología, Universidad Nacional Autónoma de México, Mexico City, Mexico
| | - Francesca Bagnoli
- Division of Florence, Institute of Biosciences and Bioresources, National Research Council, Sesto Fiorentino, Italy
| | - Delphine Grivet
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA-CIFOR, Madrid, Spain
| | - Bruno Fady
- INRAE, Unité de Recherche Écologie des Forêts Méditerranéennes (URFM), Avignon, France
| | - Filippos A Aravanopoulos
- Laboratory of Forest Genetics and Tree Breeding, Department of Forestry and Natural Environment, Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Giovanni G Vendramin
- Division of Florence, Institute of Biosciences and Bioresources, National Research Council, Sesto Fiorentino, Italy
| | | |
Collapse
|
33
|
Kutschera VE, Poelstra JW, Botero-Castro F, Dussex N, Gemmell NJ, Hunt GR, Ritchie MG, Rutz C, Wiberg RAW, Wolf JBW. Purifying Selection in Corvids Is Less Efficient on Islands. Mol Biol Evol 2020; 37:469-474. [PMID: 31633794 PMCID: PMC6993847 DOI: 10.1093/molbev/msz233] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Theory predicts that deleterious mutations accumulate more readily in small populations. As a consequence, mutation load is expected to be elevated in species where life-history strategies and geographic or historical contingencies reduce the number of reproducing individuals. Yet, few studies have empirically tested this prediction using genome-wide data in a comparative framework. We collected whole-genome sequencing data for 147 individuals across seven crow species (Corvus spp.). For each species, we estimated the distribution of fitness effects of deleterious mutations and compared it with proxies of the effective population size Ne. Island species with comparatively smaller geographic range sizes had a significantly increased mutation load. These results support the view that small populations have an elevated risk of mutational meltdown, which may contribute to the higher extinction rates observed in island species.
Collapse
Affiliation(s)
- Verena E Kutschera
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden.,Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | | | - Fidel Botero-Castro
- Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Planegg-Martinsried, Germany
| | - Nicolas Dussex
- Department of Anatomy, University of Otago, Dunedin, New Zealand.,Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| | - Neil J Gemmell
- Department of Anatomy, University of Otago, Dunedin, New Zealand
| | | | - Michael G Ritchie
- Centre for Biological Diversity, School of Biology, University of St Andrews, St Andrews, United Kingdom
| | - Christian Rutz
- Centre for Biological Diversity, School of Biology, University of St Andrews, St Andrews, United Kingdom
| | - R Axel W Wiberg
- Centre for Biological Diversity, School of Biology, University of St Andrews, St Andrews, United Kingdom.,Department of Environmental Sciences, Evolutionary Biology, University of Basel, Basel, Switzerland
| | - Jochen B W Wolf
- Department of Evolutionary Biology, Uppsala University, Uppsala, Sweden.,Division of Evolutionary Biology, Faculty of Biology, LMU Munich, Planegg-Martinsried, Germany
| |
Collapse
|
34
|
Abstract
Genetic drift is an important evolutionary force of strength inversely proportional to Ne , the effective population size. The impact of drift on genome diversity and evolution is known to vary among species, but quantifying this effect is a difficult task. Here we assess the magnitude of variation in drift power among species of animals via its effect on the mutation load - which implies also inferring the distribution of fitness effects of deleterious mutations. To this aim, we analyze the nonsynonymous (amino-acid changing) and synonymous (amino-acid conservative) allele frequency spectra in a large sample of metazoan species, with a focus on the primates vs. fruit flies contrast. We show that a Gamma model of the distribution of fitness effects is not suitable due to strong differences in estimated shape parameters among taxa, while adding a class of lethal mutations essentially solves the problem. Using the Gamma + lethal model and assuming that the mean deleterious effects of nonsynonymous mutations is shared among species, we estimate that the power of drift varies by a factor of at least 500 between large-Ne and small-Ne species of animals, i.e., an order of magnitude more than the among-species variation in genetic diversity. Our results are relevant to Lewontin's paradox while further questioning the meaning of the Ne parameter in population genomics.
Collapse
Affiliation(s)
- Nicolas Galtier
- Institute of Evolution Sciences of Montpellier (ISEM), CNRS, University of Montpellier, IRD, EPHE, 34095 Montpellier, France
| | - Marjolaine Rousselle
- Institute of Evolution Sciences of Montpellier (ISEM), CNRS, University of Montpellier, IRD, EPHE, 34095 Montpellier, France
- Bioinformatics Research Centre, Aarhus University, DK Aarhus, Denmark
| |
Collapse
|
35
|
Booker TR. Inferring Parameters of the Distribution of Fitness Effects of New Mutations When Beneficial Mutations Are Strongly Advantageous and Rare. G3 (Bethesda) 2020; 10:2317-2326. [PMID: 32371451 PMCID: PMC7341129 DOI: 10.1534/g3.120.401052] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Accepted: 05/01/2020] [Indexed: 12/13/2022]
Abstract
Characterizing the distribution of fitness effects (DFE) for new mutations is central in evolutionary genetics. Analysis of molecular data under the McDonald-Kreitman test has suggested that adaptive substitutions make a substantial contribution to between-species divergence. Methods have been proposed to estimate the parameters of the distribution of fitness effects for positively selected mutations from the unfolded site frequency spectrum (uSFS). Such methods perform well when beneficial mutations are mildly selected and frequent. However, when beneficial mutations are strongly selected and rare, they may make little contribution to standing variation and will thus be difficult to detect from the uSFS. In this study, I analyze uSFS data from simulated populations subject to advantageous mutations with effects on fitness ranging from mildly to strongly beneficial. As expected, frequent, mildly beneficial mutations contribute substantially to standing genetic variation and parameters are accurately recovered from the uSFS. However, when advantageous mutations are strongly selected and rare, there are very few segregating in populations at any one time. Fitting the uSFS in such cases leads to underestimates of the strength of positive selection and may lead researchers to false conclusions regarding the relative contribution adaptive mutations make to molecular evolution. Fortunately, the parameters for the distribution of fitness effects for harmful mutations are estimated with high accuracy and precision. The results from this study suggest that the parameters of positively selected mutations obtained by analysis of the uSFS should be treated with caution and that variability at linked sites should be used in conjunction with standing variability to estimate parameters of the distribution of fitness effects in the future.
Collapse
Affiliation(s)
- Tom R Booker
- Department of Forest and Conservation Sciences, University of British Columbia, Vancouver, Canada and
- Biodiversity Research Centre, University of British Columbia, Vancouver, Canada
| |
Collapse
|
36
|
Johri P, Charlesworth B, Jensen JD. Toward an Evolutionarily Appropriate Null Model: Jointly Inferring Demography and Purifying Selection. Genetics 2020; 215:173-192. [PMID: 32152045 PMCID: PMC7198275 DOI: 10.1534/genetics.119.303002] [Citation(s) in RCA: 67] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 03/05/2020] [Indexed: 01/27/2023] Open
Abstract
The question of the relative evolutionary roles of adaptive and nonadaptive processes has been a central debate in population genetics for nearly a century. While advances have been made in the theoretical development of the underlying models, and statistical methods for estimating their parameters from large-scale genomic data, a framework for an appropriate null model remains elusive. A model incorporating evolutionary processes known to be in constant operation, genetic drift (as modulated by the demographic history of the population) and purifying selection, is lacking. Without such a null model, the role of adaptive processes in shaping within- and between-population variation may not be accurately assessed. Here, we investigate how population size changes and the strength of purifying selection affect patterns of variation at "neutral" sites near functional genomic components. We propose a novel statistical framework for jointly inferring the contribution of the relevant selective and demographic parameters. By means of extensive performance analyses, we quantify the utility of the approach, identify the most important statistics for parameter estimation, and compare the results with existing methods. Finally, we reanalyze genome-wide population-level data from a Zambian population of Drosophila melanogaster, and find that it has experienced a much slower rate of population growth than was inferred when the effects of purifying selection were neglected. Our approach represents an appropriate null model, against which the effects of positive selection can be assessed.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| | - Brian Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, EH9 3FL, United Kingdom
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, Arizona 85287
| |
Collapse
|
37
|
Chen J, Glémin S, Lascoux M. From Drift to Draft: How Much Do Beneficial Mutations Actually Contribute to Predictions of Ohta's Slightly Deleterious Model of Molecular Evolution? Genetics 2020; 214:1005-1018. [PMID: 32015019 PMCID: PMC7153929 DOI: 10.1534/genetics.119.302869] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2019] [Accepted: 01/26/2020] [Indexed: 12/18/2022] Open
Abstract
Since its inception in 1973, the slightly deleterious model of molecular evolution, also known as the nearly neutral theory of molecular evolution, remains a central model to explain the main patterns of DNA polymorphism in natural populations. This is not to say that the quantitative fit to data are perfect. A recent study used polymorphism data from Drosophila melanogaster to test whether, as predicted by the nearly neutral theory, the proportion of effectively neutral mutations depends on the effective population size (Ne ). It showed that a nearly neutral model simply scaling with Ne variation across the genome could not alone explain the data, but that consideration of linked positive selection improves the fit between observations and predictions. In the present article, we extended the work in two main directions. First, we confirmed the observed pattern on a set of 59 species, including high-quality genomic data from 11 animal and plant species with different mating systems and effective population sizes, hence a priori different levels of linked selection. Second, for the 11 species with high-quality genomic data we also estimated the full distribution of fitness effects (DFE) of mutations, and not solely the DFE of deleterious mutations. Both Ne and beneficial mutations contributed to the relationship between the proportion of effectively neutral mutations and local Ne across the genome. In conclusion, the predictions of the slightly deleterious model of molecular evolution hold well for species with small Ne , but for species with large Ne , the fit is improved by incorporating linked positive selection to the model.
Collapse
Affiliation(s)
- Jun Chen
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang 310058, China
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, 75236 Uppsala, Sweden
| | - Sylvain Glémin
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, 75236 Uppsala, Sweden
- Université de Rennes, Centre National de la Recherche Scientifique (CNRS), ECOBIO (Ecosystèmes, Biodiversité, Evolution) - Unité Mixte de Recherche (UMR) 6553, F-35000 Rennes, France
| | - Martin Lascoux
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, 75236 Uppsala, Sweden
| |
Collapse
|
38
|
Williams MJ, Zapata L, Werner B, Barnes CP, Sottoriva A, Graham TA. Measuring the distribution of fitness effects in somatic evolution by combining clonal dynamics with dN/dS ratios. eLife 2020; 9:e48714. [PMID: 32223898 PMCID: PMC7105384 DOI: 10.7554/elife.48714] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2019] [Accepted: 03/09/2020] [Indexed: 12/22/2022] Open
Abstract
The distribution of fitness effects (DFE) defines how new mutations spread through an evolving population. The ratio of non-synonymous to synonymous mutations (dN/dS) has become a popular method to detect selection in somatic cells. However the link, in somatic evolution, between dN/dS values and fitness coefficients is missing. Here we present a quantitative model of somatic evolutionary dynamics that determines the selective coefficients of individual driver mutations from dN/dS estimates. We then measure the DFE for somatic mutant clones in ostensibly normal oesophagus and skin. We reveal a broad distribution of fitness effects, with the largest fitness increases found for TP53 and NOTCH1 mutants (proliferative bias 1-5%). This study provides the theoretical link between dN/dS values and selective coefficients in somatic evolution, and measures the DFE of mutations in human tissues.
Collapse
Affiliation(s)
- Marc J Williams
- Centre for Genomics and Computational Biology, Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonLondonUnited Kingdom
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer CenterNew YorkUnited States
| | - Luis Zapata
- Evolutionary Genomics and Modelling Lab, Centre for Evolution and Cancer, The Institute of Cancer ResearchLondonUnited Kingdom
| | - Benjamin Werner
- Centre for Genomics and Computational Biology, Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonLondonUnited Kingdom
| | - Chris P Barnes
- Department of Cell and Developmental Biology, University College LondonLondonUnited Kingdom
| | - Andrea Sottoriva
- Evolutionary Genomics and Modelling Lab, Centre for Evolution and Cancer, The Institute of Cancer ResearchLondonUnited Kingdom
| | - Trevor A Graham
- Centre for Genomics and Computational Biology, Barts Cancer Institute, Barts and the London School of Medicine and Dentistry, Queen Mary University of LondonLondonUnited Kingdom
| |
Collapse
|
39
|
Barton HJ, Zeng K. The Impact of Natural Selection on Short Insertion and Deletion Variation in the Great Tit Genome. Genome Biol Evol 2019; 11:1514-1524. [PMID: 30924871 PMCID: PMC6543879 DOI: 10.1093/gbe/evz068] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/27/2019] [Indexed: 12/11/2022] Open
Abstract
Insertions and deletions (INDELs) remain understudied, despite being the most common form of genetic variation after single nucleotide polymorphisms. This stems partly from the challenge of correctly identifying the ancestral state of an INDEL and thus identifying it as an insertion or a deletion. Erroneously assigned ancestral states can skew the site frequency spectrum, leading to artificial signals of selection. Consequently, the selective pressures acting on INDELs are, at present, poorly resolved. To tackle this issue, we have recently published a maximum likelihood approach to estimate the mutation rate and the distribution of fitness effects for INDELs. Our approach estimates and controls for the rate of ancestral state misidentification, overcoming issues plaguing previous INDEL studies. Here, we apply the method to INDEL polymorphism data from ten high coverage (∼44×) European great tit (Parus major) genomes. We demonstrate that coding INDELs are under strong purifying selection with a small proportion making it into the population (∼4%). However, among fixed coding INDELs, 71% of insertions and 86% of deletions are fixed by positive selection. In noncoding regions, we estimate ∼80% of insertions and ∼52% of deletions are effectively neutral, the remainder show signatures of purifying selection. Additionally, we see evidence of linked selection reducing INDEL diversity below background levels, both in proximity to exons and in areas of low recombination.
Collapse
Affiliation(s)
- Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, United Kingdom
| |
Collapse
|
40
|
Kemble H, Nghe P, Tenaillon O. Recent insights into the genotype-phenotype relationship from massively parallel genetic assays. Evol Appl 2019; 12:1721-1742. [PMID: 31548853 PMCID: PMC6752143 DOI: 10.1111/eva.12846] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2019] [Revised: 06/21/2019] [Accepted: 07/02/2019] [Indexed: 12/20/2022] Open
Abstract
With the molecular revolution in Biology, a mechanistic understanding of the genotype-phenotype relationship became possible. Recently, advances in DNA synthesis and sequencing have enabled the development of deep mutational scanning assays, capable of scoring comprehensive libraries of genotypes for fitness and a variety of phenotypes in massively parallel fashion. The resulting empirical genotype-fitness maps pave the way to predictive models, potentially accelerating our ability to anticipate the behaviour of pathogen and cancerous cell populations from sequencing data. Besides from cellular fitness, phenotypes of direct application in industry (e.g. enzyme activity) and medicine (e.g. antibody binding) can be quantified and even selected directly by these assays. This review discusses the technological basis of and recent developments in massively parallel genetics, along with the trends it is uncovering in the genotype-phenotype relationship (distribution of mutation effects, epistasis), their possible mechanistic bases and future directions for advancing towards the goal of predictive genetics.
Collapse
Affiliation(s)
- Harry Kemble
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Philippe Nghe
- École Supérieure de Physique et de Chimie Industrielles de la Ville de Paris (ESPCI Paris), UMR CNRS‐ESPCI CBI 8231PSL Research UniversityParis Cedex 05France
| | - Olivier Tenaillon
- Infection, Antimicrobials, Modelling, Evolution, INSERM, Unité Mixte de Recherche 1137Université Paris Diderot, Université Paris NordParisFrance
| |
Collapse
|
41
|
Mattila TM, Laenen B, Horvath R, Hämälä T, Savolainen O, Slotte T. Impact of demography on linked selection in two outcrossing Brassicaceae species. Ecol Evol 2019; 9:9532-9545. [PMID: 31534673 PMCID: PMC6745670 DOI: 10.1002/ece3.5463] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2019] [Revised: 06/28/2019] [Accepted: 07/02/2019] [Indexed: 12/13/2022] Open
Abstract
Genetic diversity is shaped by mutation, genetic drift, gene flow, recombination, and selection. The dynamics and interactions of these forces shape genetic diversity across different parts of the genome, between populations and species. Here, we have studied the effects of linked selection on nucleotide diversity in outcrossing populations of two Brassicaceae species, Arabidopsis lyrata and Capsella grandiflora, with contrasting demographic history. In agreement with previous estimates, we found evidence for a modest population size expansion thousands of generations ago, as well as efficient purifying selection in C. grandiflora. In contrast, the A. lyrata population exhibited evidence for very recent strong population size decline and weaker efficacy of purifying selection. Using multiple regression analyses with recombination rate and other genomic covariates as explanatory variables, we can explain 47% of the variance in neutral diversity in the C. grandiflora population, while in the A. lyrata population, only 11% of the variance was explained by the model. Recombination rate had a significant positive effect on neutral diversity in both species, suggesting that selection at linked sites has an effect on patterns of neutral variation. In line with this finding, we also found reduced neutral diversity in the vicinity of genes in the C. grandiflora population. However, in A. lyrata no such reduction in diversity was evident, a finding that is consistent with expectations of the impact of a recent bottleneck on patterns of neutral diversity near genes. This study thus empirically demonstrates how differences in demographic history modulate the impact of selection at linked sites in natural populations.
Collapse
Affiliation(s)
- Tiina M. Mattila
- Department of Ecology and GeneticsUniversity of OuluOuluFinland
- Present address:
Department of Organismal BiologyUppsala UniversityUppsalaSweden
| | - Benjamin Laenen
- Science for Life Laboratory, Department of Ecology, Environment, and Plant SciencesStockholm UniversityStockholmSweden
| | - Robert Horvath
- Science for Life Laboratory, Department of Ecology, Environment, and Plant SciencesStockholm UniversityStockholmSweden
| | - Tuomas Hämälä
- Department of Ecology and GeneticsUniversity of OuluOuluFinland
- Biocenter OuluUniversity of OuluOuluFinland
- Present address:
Department of Plant and Microbial BiologyUniversity of Minnesota Twin CitiesSt. PaulMNUSA
| | - Outi Savolainen
- Department of Ecology and GeneticsUniversity of OuluOuluFinland
- Biocenter OuluUniversity of OuluOuluFinland
| | - Tanja Slotte
- Science for Life Laboratory, Department of Ecology, Environment, and Plant SciencesStockholm UniversityStockholmSweden
| |
Collapse
|
42
|
Abstract
For nearly a century adaptive landscapes have provided overviews of the evolutionary process and yet they remain metaphors. We redefine adaptive landscapes in terms of biological processes rather than descriptive phenomenology. We focus on the underlying mechanisms that generate emergent properties such as epistasis, dominance, trade-offs and adaptive peaks. We illustrate the utility of landscapes in predicting the course of adaptation and the distribution of fitness effects. We abandon aged arguments concerning landscape ruggedness in favor of empirically determining landscape architecture. In so doing, we transform the landscape metaphor into a scientific framework within which causal hypotheses can be tested.
Collapse
Affiliation(s)
- Xiao Yi
- BioTechnology Institute, University of Minnesota, St. Paul, MN
| | - Antony M Dean
- BioTechnology Institute, University of Minnesota, St. Paul, MN.,Department of Ecology, Evolution, and Behavior, University of Minnesota, St. Paul, MN
| |
Collapse
|
43
|
Corcoran P, Gossmann TI, Barton HJ, Slate J, Zeng K. Determinants of the Efficacy of Natural Selection on Coding and Noncoding Variability in Two Passerine Species. Genome Biol Evol 2018; 9:2987-3007. [PMID: 29045655 PMCID: PMC5714183 DOI: 10.1093/gbe/evx213] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/16/2017] [Indexed: 02/06/2023] Open
Abstract
Population genetic theory predicts that selection should be more effective when the effective population size (Ne) is larger, and that the efficacy of selection should correlate positively with recombination rate. Here, we analyzed the genomes of ten great tits and ten zebra finches. Nucleotide diversity at 4-fold degenerate sites indicates that zebra finches have a 2.83-fold larger Ne. We obtained clear evidence that purifying selection is more effective in zebra finches. The proportion of substitutions at 0-fold degenerate sites fixed by positive selection (α) is high in both species (great tit 48%; zebra finch 64%) and is significantly higher in zebra finches. When α was estimated on GC-conservative changes (i.e., between A and T and between G and C), the estimates reduced in both species (great tit 22%; zebra finch 53%). A theoretical model presented herein suggests that failing to control for the effects of GC-biased gene conversion (gBGC) is potentially a contributor to the overestimation of α, and that this effect cannot be alleviated by first fitting a demographic model to neutral variants. We present the first estimates in birds for α in the untranslated regions, and found evidence for substantial adaptive changes. Finally, although purifying selection is stronger in high-recombination regions, we obtained mixed evidence for α increasing with recombination rate, especially after accounting for gBGC. These results highlight that it is important to consider the potential confounding effects of gBGC when quantifying selection and that our understanding of what determines the efficacy of selection is incomplete.
Collapse
Affiliation(s)
- Pádraic Corcoran
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
| | - Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
| | - Henry J Barton
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
| | | | - Jon Slate
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
| | - Kai Zeng
- Department of Animal and Plant Sciences, University of Sheffield, South Yorkshire, United Kingdom
| |
Collapse
|
44
|
Abstract
Pleiotropic effects of mutations underlie diverse biological phenomena such as ageing and specialization. In particular, antagonistic pleiotropy ("AP": when a mutation has opposite fitness effects in different environments) generates tradeoffs, which may constrain adaptation. Models of adaptation typically assume that AP is common - especially among large-effect mutations - and that pleiotropic effect sizes are positively correlated. Empirical tests of these assumptions have focused on de novo beneficial mutations arising under strong selection. However, most mutations are actually deleterious or neutral, and may contribute to standing genetic variation that can subsequently drive adaptation. We quantified the incidence, nature, and effect size of pleiotropy for carbon utilization across 80 single mutations in Escherichia coli that arose under mutation accumulation (i.e., weak selection). Although ∼46% of the mutations were pleiotropic, only 11% showed AP; among beneficial mutations, only ∼4% showed AP. In some environments, AP was more common in large-effect mutations; and AP effect sizes across environments were often negatively correlated. Thus, AP for carbon use is generally rare (especially among beneficial mutations); is not consistently enriched in large-effect mutations; and often involves weakly deleterious antagonistic effects. Our unbiased quantification of mutational effects therefore suggests that antagonistic pleiotropy may be unlikely to cause maladaptive tradeoffs.
Collapse
Affiliation(s)
- Mrudula Sane
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| | - Joshua John Miranda
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| | - Deepa Agashe
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| |
Collapse
|
45
|
Abstract
Kimura's neutral theory argued that positive selection was not responsible for an appreciable fraction of molecular substitutions. Correspondingly, quantitative analysis reveals that the vast majority of substitutions in cancer genomes are not detectably under selection. Insights from the somatic evolution of cancer reveal that beneficial substitutions in cancer constitute a small but important fraction of the molecular variants. The molecular evolution of cancer community will benefit by incorporating the neutral theory of molecular evolution into their understanding and analysis of cancer evolution-and accepting the use of tractable, predictive models, even when there is some evidence that they are not perfect.
Collapse
Affiliation(s)
| | - Jeffrey P Townsend
- Department of Biostatistics, Yale University, New Haven, CT
- Program in Computational Biology and Bioinformatics
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT
| |
Collapse
|
46
|
Luijckx P, Ho EKH, Stanić A, Agrawal AF. Mutation accumulation in populations of varying size: large effect mutations cause most mutational decline in the rotifer Brachionus calyciflorus under UV-C radiation. J Evol Biol 2018; 31:924-932. [PMID: 29672987 DOI: 10.1111/jeb.13282] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2017] [Revised: 02/19/2018] [Accepted: 04/06/2018] [Indexed: 12/22/2022]
Abstract
Theory predicts that fitness decline via mutation accumulation will depend on population size, but there are only a few direct tests of this key idea. To gain a qualitative understanding of the fitness effect of new mutations, we performed a mutation accumulation experiment with the facultative sexual rotifer Brachionus calyciflorus at six different population sizes under UV-C radiation. Lifetime reproduction assays conducted after ten and sixteen UV-C radiations showed that while small populations lost fitness, fitness losses diminished rapidly with increasing population size. Populations kept as low as 10 individuals were able to maintain fitness close to the nonmutagenized populations throughout the experiment indicating that selection was able to remove the majority of large effect mutations in small populations. Although our results also seem to imply that small populations are effectively immune to mutational decay, we caution against this interpretation. Given sufficient time, populations of moderate to large size can experience declines in fitness from accumulating weakly deleterious mutations as demonstrated by fitness estimates from simulations and, tentatively, from a long-term experiment with populations of moderate size. There is mounting evidence to suggest that mutational distributions contain a heavier tail of large effects. Our results suggest that this is also true when the mutational spectrum is altered by UV radiation.
Collapse
Affiliation(s)
- Pepijn Luijckx
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada.,Zoology, School of Natural Sciences, Trinity College Dublin, Dublin 2, Ireland
| | - Eddie K H Ho
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Andrijana Stanić
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| | - Aneil F Agrawal
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
47
|
Grivet D, Avia K, Vaattovaara A, Eckert AJ, Neale DB, Savolainen O, González-Martínez SC. High rate of adaptive evolution in two widespread European pines. Mol Ecol 2017; 26:6857-6870. [PMID: 29110402 DOI: 10.1111/mec.14402] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Revised: 09/14/2017] [Accepted: 09/25/2017] [Indexed: 12/18/2022]
Abstract
Comparing related organisms with differing ecological requirements and evolutionary histories can shed light on the mechanisms and drivers underlying genetic adaptation. Here, by examining a common set of hundreds of loci, we compare patterns of nucleotide diversity and molecular adaptation of two European conifers (Scots pine and maritime pine) living in contrasted environments and characterized by distinct population genetic structure (low and clinal in Scots pine, high and ecotypic in maritime pine) and demographic histories. We found higher nucleotide diversity in Scots pine than in maritime pine, whereas rates of new adaptive substitutions (ωa ), as estimated from the distribution of fitness effects, were similar across species and among the highest found in plants. Sample size and population genetic structure did not appear to have resulted in significant bias in estimates of ωa . Moreover, population contraction-expansion dynamics for each species did not affect differentially the rate of adaptive substitution in these two pines. Several methodological and biological factors may underlie the unusually high rate of adaptive evolution of Scots pine and maritime pine. By providing two new case studies with contrasting evolutionary histories, we contribute to disentangling the multiple factors potentially affecting adaptive evolution in natural plant populations.
Collapse
Affiliation(s)
- Delphine Grivet
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA-CIFOR, Madrid, Spain.,Sustainable Forest Management Research Institute, INIA - University of Valladolid, Palencia, Spain
| | - Komlan Avia
- Department of Ecology and Genetics and Biocenter Oulu, University of Oulu, Oulu, Finland.,Algal Genetics Group, UMR 8227, CNRS, Sorbonne Universités, UPMC, Station Biologique Roscoff, Roscoff, France.,UMI 3614 Evolutionary Biology and Ecology of Algae, CNRS, Sorbonne Universités, UPMC, Pontificia Universidad Católica de Chile, Universidad Austral de Chile, Station Biologique Roscoff, Roscoff, France
| | - Aleksia Vaattovaara
- Department of Ecology and Genetics and Biocenter Oulu, University of Oulu, Oulu, Finland.,Division of Plant Biology, Department of Biosciences, Viikki Plant Science Centre (ViPS), University of Helsinki, Helsinki, Finland
| | - Andrew J Eckert
- Department of Biology, Virginia Commonwealth University, Richmond, VA, USA
| | - David B Neale
- Department of Plant Sciences, University of California at Davis, Davis, CA, USA
| | - Outi Savolainen
- Department of Ecology and Genetics and Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Santiago C González-Martínez
- Department of Forest Ecology and Genetics, Forest Research Centre, INIA-CIFOR, Madrid, Spain.,Sustainable Forest Management Research Institute, INIA - University of Valladolid, Palencia, Spain.,BIOGECO, INRA, Univ. Bordeaux, Cestas, France
| |
Collapse
|
48
|
Tataru P, Mollion M, Glémin S, Bataillon T. Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data. Genetics 2017; 207:1103-19. [PMID: 28951530 DOI: 10.1534/genetics.117.300323] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2017] [Accepted: 09/13/2017] [Indexed: 11/18/2022] Open
Abstract
The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral, and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through the site frequency spectrum (SFS) and divergence data, has been a longstanding goal of evolutionary genetics. A widespread assumption shared by previous inference methods is that beneficial mutations only contribute negligibly to the polymorphism data. Hence, a DFE comprising only deleterious mutations tends to be estimated from SFS data, and α is then predicted by contrasting the SFS with divergence data from an outgroup. We develop a hierarchical probabilistic framework that extends previous methods to infer DFE and α from polymorphism data alone. We use extensive simulations to examine the performance of our method. While an outgroup is still needed to obtain an unfolded SFS, we show that both a DFE, comprising both deleterious and beneficial mutations, and α can be inferred without using divergence data. We also show that not accounting for the contribution of beneficial mutations to polymorphism data leads to substantially biased estimates of the DFE and α. We compare our framework with one of the most widely used inference methods available and apply it on a recently published chimpanzee exome data set.
Collapse
|
49
|
Abstract
Molecular population genetics aims to explain genetic variation and molecular evolution from population genetics principles. The field was born 50 years ago with the first measures of genetic variation in allozyme loci, continued with the nucleotide sequencing era, and is currently in the era of population genomics. During this period, molecular population genetics has been revolutionized by progress in data acquisition and theoretical developments. The conceptual elegance of the neutral theory of molecular evolution or the footprint carved by natural selection on the patterns of genetic variation are two examples of the vast number of inspiring findings of population genetics research. Since the inception of the field, Drosophila has been the prominent model species: molecular variation in populations was first described in Drosophila and most of the population genetics hypotheses were tested in Drosophila species. In this review, we describe the main concepts, methods, and landmarks of molecular population genetics, using the Drosophila model as a reference. We describe the different genetic data sets made available by advances in molecular technologies, and the theoretical developments fostered by these data. Finally, we review the results and new insights provided by the population genomics approach, and conclude by enumerating challenges and new lines of inquiry posed by increasingly large population scale sequence data.
Collapse
|
50
|
Abstract
The distribution of fitness effects (DFE) of new mutations plays a fundamental role in evolutionary genetics. However, the extent to which the DFE differs across species has yet to be systematically investigated. Furthermore, the biological mechanisms determining the DFE in natural populations remain unclear. Here, we show that theoretical models emphasizing different biological factors at determining the DFE, such as protein stability, back-mutations, species complexity, and mutational robustness make distinct predictions about how the DFE will differ between species. Analyzing amino acid-changing variants from natural populations in a comparative population genomic framework, we find that humans have a higher proportion of strongly deleterious mutations than Drosophila melanogaster. Furthermore, when comparing the DFE across yeast, Drosophila, mice, and humans, the average selection coefficient becomes more deleterious with increasing species complexity. Last, pleiotropic genes have a DFE that is less variable than that of nonpleiotropic genes. Comparing four categories of theoretical models, only Fisher's geometrical model (FGM) is consistent with our findings. FGM assumes that multiple phenotypes are under stabilizing selection, with the number of phenotypes defining the complexity of the organism. Our results suggest that long-term population size and cost of complexity drive the evolution of the DFE, with many implications for evolutionary and medical genomics.
Collapse
Affiliation(s)
- Christian D Huber
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095;
| | - Bernard Y Kim
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095
| | - Clare D Marsden
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095;
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, CA 90095
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA 90095
| |
Collapse
|