1
|
Daigle A, Johri P. Hill-Robertson interference may bias the inference of fitness effects of new mutations in highly selfing species. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.06.579142. [PMID: 38370745 PMCID: PMC10871249 DOI: 10.1101/2024.02.06.579142] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
The accurate estimation of the distribution of fitness effects (DFE) of new mutations is critical for population genetic inference but remains a challenging task. While various methods have been developed for DFE inference using the site frequency spectrum of putatively neutral and selected sites, their applicability in species with diverse life history traits and complex demographic scenarios is not well understood. Selfing is common among eukaryotic species and can lead to decreased effective recombination rates, increasing the effects of selection at linked sites, including interference between selected alleles. We employ forward simulations to investigate the limitations of current DFE estimation approaches in the presence of selfing and other model violations, such as linkage, departures from semidominance, population structure, and uneven sampling. We find that distortions of the site frequency spectrum due to Hill-Robertson interference in highly selfing populations lead to mis-inference of the deleterious DFE of new mutations. Specifically, when inferring the distribution of selection coefficients, there is an overestimation of nearly neutral and strongly deleterious mutations and an underestimation of mildly deleterious mutations when interference between selected alleles is pervasive. In addition, the presence of cryptic population structure with low rates of migration and uneven sampling across subpopulations leads to the false inference of a deleterious DFE skewed towards effectively neutral/mildly deleterious mutations. Finally, the proportion of adaptive substitutions estimated at high rates of selfing is substantially overestimated. Our observations apply broadly to species and genomic regions with little/no recombination and where interference might be pervasive.
Collapse
|
2
|
Rios-Carlos H, Segovia-Ramírez MG, Fujita MK, Rovito SM. Genomic Gigantism is not Associated with Reduced Selection Efficiency in Neotropical Salamanders. J Mol Evol 2024; 92:371-380. [PMID: 38844681 PMCID: PMC11291587 DOI: 10.1007/s00239-024-10177-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2024] [Accepted: 05/16/2024] [Indexed: 08/03/2024]
Abstract
Genome size variation in eukaryotes has myriad effects on organismal biology from the genomic to whole-organism level. Large genome size may be associated with lower selection efficiency because lower effective population sizes allow fixation of deleterious mutations via genetic drift, increasing genome size and decreasing selection efficiency. Because of a hypothesized negative relationship between genome size and recombination rate per base pair, increased genome size could also increase the effect of linked selection in the genome, decreasing the efficiency with which natural selection can fix or remove mutations. We used a transcriptomic dataset of 15 and a subset of six Neotropical salamander species ranging in genome size from 12 to 87 pg to study the relationship between genome size and efficiency of selection. We estimated dN/dS of salamanders with small and large genomes and tested for relaxation of selection in the larger genomes. Contrary to our expectations, we did not find a significant relationship between genome size and selection efficiency or strong evidence for higher dN/dS values in species with larger genomes for either species set. We also found little evidence for relaxation of selection in species with larger genomes. A positive correlation between genome size and range size (a proxy of population size) in this group disagrees with predictions of stronger drift in species with larger genomes. Our results highlight the complex interactions between the many forces shaping genomic variation in organisms with genomic gigantism.
Collapse
Affiliation(s)
- Hairo Rios-Carlos
- Unidad de Genómica Avanzada, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, km 9.6 Libramiento Norte Carretera Irapuato-León, Irapuato, Guanajuato, México
| | - María Guadalupe Segovia-Ramírez
- Unidad de Genómica Avanzada, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, km 9.6 Libramiento Norte Carretera Irapuato-León, Irapuato, Guanajuato, México
| | - Matthew K Fujita
- Department of Biology, Amphibian and Reptile Diversity Research Center, The University of Texas, Arlington, TX, USA
| | - Sean M Rovito
- Unidad de Genómica Avanzada, Centro de Investigación y de Estudios Avanzados del Instituto Politécnico Nacional, km 9.6 Libramiento Norte Carretera Irapuato-León, Irapuato, Guanajuato, México.
| |
Collapse
|
3
|
Marsh JI, Johri P. Biases in ARG-Based Inference of Historical Population Size in Populations Experiencing Selection. Mol Biol Evol 2024; 41:msae118. [PMID: 38874402 PMCID: PMC11245712 DOI: 10.1093/molbev/msae118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 06/05/2024] [Accepted: 06/11/2024] [Indexed: 06/15/2024] Open
Abstract
Inferring the demographic history of populations provides fundamental insights into species dynamics and is essential for developing a null model to accurately study selective processes. However, background selection and selective sweeps can produce genomic signatures at linked sites that mimic or mask signals associated with historical population size change. While the theoretical biases introduced by the linked effects of selection have been well established, it is unclear whether ancestral recombination graph (ARG)-based approaches to demographic inference in typical empirical analyses are susceptible to misinference due to these effects. To address this, we developed highly realistic forward simulations of human and Drosophila melanogaster populations, including empirically estimated variability of gene density, mutation rates, recombination rates, purifying, and positive selection, across different historical demographic scenarios, to broadly assess the impact of selection on demographic inference using a genealogy-based approach. Our results indicate that the linked effects of selection minimally impact demographic inference for human populations, although it could cause misinference in populations with similar genome architecture and population parameters experiencing more frequent recurrent sweeps. We found that accurate demographic inference of D. melanogaster populations by ARG-based methods is compromised by the presence of pervasive background selection alone, leading to spurious inferences of recent population expansion, which may be further worsened by recurrent sweeps, depending on the proportion and strength of beneficial mutations. Caution and additional testing with species-specific simulations are needed when inferring population history with non-human populations using ARG-based approaches to avoid misinference due to the linked effects of selection.
Collapse
Affiliation(s)
- Jacob I Marsh
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Parul Johri
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
- Integrative Program for Biological and Genome Sciences, University of North Carolina, Chapel Hill, NC 27599, USA
| |
Collapse
|
4
|
Bénitière F, Duret L, Necsulea A. GTDrift: a resource for exploring the interplay between genetic drift, genomic and transcriptomic characteristics in eukaryotes. NAR Genom Bioinform 2024; 6:lqae064. [PMID: 38867915 PMCID: PMC11167491 DOI: 10.1093/nargab/lqae064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Revised: 04/22/2024] [Accepted: 05/27/2024] [Indexed: 06/14/2024] Open
Abstract
We present GTDrift, a comprehensive data resource that enables explorations of genomic and transcriptomic characteristics alongside proxies of the intensity of genetic drift in individual species. This resource encompasses data for 1506 eukaryotic species, including 1413 animals and 93 green plants, and is organized in three components. The first two components contain approximations of the effective population size, which serve as indicators of the extent of random genetic drift within each species. In the first component, we meticulously investigated public databases to assemble data on life history traits such as longevity, adult body length and body mass for a set of 979 species. The second component includes estimations of the ratio between the rate of non-synonymous substitutions and the rate of synonymous substitutions (dN/dS) in protein-coding sequences for 1324 species. This ratio provides an estimate of the efficiency of natural selection in purging deleterious substitutions. Additionally, we present polymorphism-derived N e estimates for 66 species. The third component encompasses various genomic and transcriptomic characteristics. With this component, we aim to facilitate comparative transcriptomics analyses across species, by providing easy-to-use processed data for more than 16 000 RNA-seq samples across 491 species. These data include intron-centered alternative splicing frequencies, gene expression levels and sequencing depth statistics for each species, obtained with a homogeneous analysis protocol. To enable cross-species comparisons, we provide orthology predictions for conserved single-copy genes based on BUSCO gene sets. To illustrate the possible uses of this database, we identify the most frequently used introns for each gene and we assess how the sequencing depth available for each species affects our power to identify major and minor splice variants.
Collapse
Affiliation(s)
- Florian Bénitière
- Laboratoire de Biométrie et Biologie Évolutive, Université Lyon 1, UMR CNRS 5558, Villeurbanne, France
- Laboratoire d’Écologie des Hydrosystèmes Naturels et Anthropisés, Université Lyon 1, UMR CNRS 5023, Villeurbanne, France
| | - Laurent Duret
- Laboratoire de Biométrie et Biologie Évolutive, Université Lyon 1, UMR CNRS 5558, Villeurbanne, France
| | - Anamaria Necsulea
- Laboratoire de Biométrie et Biologie Évolutive, Université Lyon 1, UMR CNRS 5558, Villeurbanne, France
| |
Collapse
|
5
|
Sendrowski J, Bataillon T. fastDFE: Fast and Flexible Inference of the Distribution of Fitness Effects. Mol Biol Evol 2024; 41:msae070. [PMID: 38577958 PMCID: PMC11140822 DOI: 10.1093/molbev/msae070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2024] [Revised: 03/19/2024] [Accepted: 03/25/2024] [Indexed: 04/06/2024] Open
Abstract
Estimating the distribution of fitness effects (DFE) of new mutations is of fundamental importance in evolutionary biology, ecology, and conservation. However, existing methods for DFE estimation suffer from limitations, such as slow computation speed and limited scalability. To address these issues, we introduce fastDFE, a Python-based software package, offering fast, and flexible DFE inference from site-frequency spectrum (SFS) data. Apart from providing efficient joint inference of multiple DFEs that share parameters, it offers the feature of introducing genomic covariates that influence the DFEs and testing their significance. To further simplify usage, fastDFE is equipped with comprehensive VCF-to-SFS parsing utilities. These include options for site filtering and stratification, as well as site-degeneracy annotation and probabilistic ancestral-allele inference. fastDFE thereby covers the entire workflow of DFE inference from the moment of acquiring a raw VCF file. Despite its Python foundation, fastDFE incorporates a full R interface, including native R visualization capabilities. The package is comprehensively tested and documented at fastdfe.readthedocs.io.
Collapse
Affiliation(s)
- Janek Sendrowski
- Bioinformatics Research Center, Aarhus University, Aarhus, Denmark
| | - Thomas Bataillon
- Bioinformatics Research Center, Aarhus University, Aarhus, Denmark
| |
Collapse
|
6
|
Raas MWD, Dutheil JY. The rate of adaptive molecular evolution in wild and domesticated Saccharomyces cerevisiae populations. Mol Ecol 2024; 33:e16980. [PMID: 37157166 DOI: 10.1111/mec.16980] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 04/22/2023] [Accepted: 04/26/2023] [Indexed: 05/10/2023]
Abstract
Through its fermentative capacities, Saccharomyces cerevisiae was central in the development of civilisation during the Neolithic period, and the yeast remains of importance in industry and biotechnology, giving rise to bona fide domesticated populations. Here, we conduct a population genomic study of domesticated and wild populations of S. cerevisiae. Using coalescent analyses, we report that the effective population size of yeast populations decreased since the divergence with S. paradoxus. We fitted models of distributions of fitness effects to infer the rate of adaptive (ω a ) and non-adaptive (ω na ) non-synonymous substitutions in protein-coding genes. We report an overall limited contribution of positive selection to S. cerevisiae protein evolution, albeit with higher rates of adaptive evolution in wild compared to domesticated populations. Our analyses revealed the signature of background selection and possibly Hill-Robertson interference, as recombination was found to be negatively correlated withω na and positively correlated withω a . However, the effect of recombination onω a was found to be labile, as it is only apparent after removing the impact of codon usage bias on the synonymous site frequency spectrum and disappears if we control for the correlation withω na , suggesting that it could be an artefact of the decreasing population size. Furthermore, the rate of adaptive non-synonymous substitutions is significantly correlated with the residue solvent exposure, a relation that cannot be explained by the population's demography. Together, our results provide a detailed characterisation of adaptive mutations in protein-coding genes across S. cerevisiae populations.
Collapse
Affiliation(s)
- Maximilian W D Raas
- Research Group Molecular Systems Evolution, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Julien Y Dutheil
- Research Group Molecular Systems Evolution, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Unité Mixte de Recherche 5554 Institut des Sciences de l'Evolution, CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| |
Collapse
|
7
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. Genetics 2024; 226:iyae006. [PMID: 38242701 PMCID: PMC10990428 DOI: 10.1093/genetics/iyae006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Revised: 10/26/2023] [Accepted: 01/03/2024] [Indexed: 01/21/2024] Open
Abstract
For at least the past 5 decades, population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modeling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well-sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations, we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modeling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F Rodrigues
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Andrew D Kern
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
| | - Peter L Ralph
- Institute of Ecology and Evolution, University of Oregon, Eugene, OR 97403, USA
- Department of Biology, University of Oregon, Eugene, OR 97403, USA
- Department of Mathematics, University of Oregon, Eugene, OR 97403, USA
| |
Collapse
|
8
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. G3 (BETHESDA, MD.) 2024; 14:jkae031. [PMID: 38365205 PMCID: PMC11090462 DOI: 10.1093/g3journal/jkae031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 10/10/2023] [Accepted: 01/29/2024] [Indexed: 02/18/2024]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in nonmodel species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to nonmodel genomes. We apply ABC-MK to the human proteome and a set of known virus interacting proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85719, USA
| |
Collapse
|
9
|
Zurita AMI, Kyriazis CC, Lohmueller KE. The impact of non-neutral synonymous mutations when inferring selection on non-synonymous mutations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.07.579314. [PMID: 38370782 PMCID: PMC10871344 DOI: 10.1101/2024.02.07.579314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/20/2024]
Abstract
The distribution of fitness effects (DFE) describes the proportions of new mutations that have different effects on reproductive fitness. Accurate measurements of the DFE are important because the DFE is a fundamental parameter in evolutionary genetics and has implications for our understanding of other phenomena like complex disease or inbreeding depression. Current computational methods to infer the DFE for nonsynonymous mutations from natural variation first estimate demographic parameters from synonymous variants to control for the effects of demography and background selection. Then, conditional on these parameters, the DFE is then inferred for nonsynonymous mutations. This approach relies on the assumption that synonymous variants are neutrally evolving. However, some evidence points toward synonymous mutations having measurable effects on fitness. To test whether selection on synonymous mutations affects inference of the DFE of nonsynonymous mutations, we simulated several possible models of selection on synonymous mutations using SLiM and attempted to recover the DFE of nonsynonymous mutations using Fit∂a∂i, a common method for DFE inference. Our results show that the presence of selection on synonymous variants leads to incorrect inferences of recent population growth. Furthermore, under certain parameter combinations, inferences of the DFE can have an inflated proportion of highly deleterious nonsynonymous mutations. However, this bias can be eliminated if the correct demographic parameters are used for DFE inference instead of the biased ones inferred from synonymous variants. Our work demonstrates how unmodeled selection on synonymous mutations may affect downstream inferences of the DFE.
Collapse
Affiliation(s)
- Aina Martinez I Zurita
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, USA
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA
| | - Kirk E Lohmueller
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, USA
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, USA
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, USA
| |
Collapse
|
10
|
Galtier N. Half a Century of Controversy: The Neutralist/Selectionist Debate in Molecular Evolution. Genome Biol Evol 2024; 16:evae003. [PMID: 38311843 PMCID: PMC10839204 DOI: 10.1093/gbe/evae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/01/2024] [Indexed: 02/06/2024] Open
Abstract
The neutral and nearly neutral theories, introduced more than 50 yr ago, have raised and still raise passionate discussion regarding the forces governing molecular evolution and their relative importance. The debate, initially focused on the amount of within-species polymorphism and constancy of the substitution rate, has spread, matured, and now underlies a wide range of topics and questions. The neutralist/selectionist controversy has structured the field and influences the way molecular evolutionary scientists conceive their research.
Collapse
Affiliation(s)
- Nicolas Galtier
- ISEM, CNRS, IRD, Université de Montpellier, Montpellier, France
| |
Collapse
|
11
|
Soni V, Pfeifer SP, Jensen JD. The Effects of Mutation and Recombination Rate Heterogeneity on the Inference of Demography and the Distribution of Fitness Effects. Genome Biol Evol 2024; 16:evae004. [PMID: 38207127 PMCID: PMC10834165 DOI: 10.1093/gbe/evae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 12/12/2023] [Accepted: 01/07/2024] [Indexed: 01/13/2024] Open
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavor; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modeled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination before utilizing population genomic data to quantify the effects of genetic drift (i.e. as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modeled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Susanne P Pfeifer
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Center for Evolution & Medicine, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
12
|
de Jong MJ, van Oosterhout C, Hoelzel AR, Janke A. Moderating the neutralist-selectionist debate: exactly which propositions are we debating, and which arguments are valid? Biol Rev Camb Philos Soc 2024; 99:23-55. [PMID: 37621151 DOI: 10.1111/brv.13010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 08/26/2023]
Abstract
Half a century after its foundation, the neutral theory of molecular evolution continues to attract controversy. The debate has been hampered by the coexistence of different interpretations of the core proposition of the neutral theory, the 'neutral mutation-random drift' hypothesis. In this review, we trace the origins of these ambiguities and suggest potential solutions. We highlight the difference between the original, the revised and the nearly neutral hypothesis, and re-emphasise that none of them equates to the null hypothesis of strict neutrality. We distinguish the neutral hypothesis of protein evolution, the main focus of the ongoing debate, from the neutral hypotheses of genomic and functional DNA evolution, which for many species are generally accepted. We advocate a further distinction between a narrow and an extended neutral hypothesis (of which the latter posits that random non-conservative amino acid substitutions can cause non-ecological phenotypic divergence), and we discuss the implications for evolutionary biology beyond the domain of molecular evolution. We furthermore point out that the debate has widened from its initial focus on point mutations, and also concerns the fitness effects of large-scale mutations, which can alter the dosage of genes and regulatory sequences. We evaluate the validity of neutralist and selectionist arguments and find that the tested predictions, apart from being sensitive to violation of underlying assumptions, are often derived from the null hypothesis of strict neutrality, or equally consistent with the opposing selectionist hypothesis, except when assuming molecular panselectionism. Our review aims to facilitate a constructive neutralist-selectionist debate, and thereby to contribute to answering a key question of evolutionary biology: what proportions of amino acid and nucleotide substitutions and polymorphisms are adaptive?
Collapse
Affiliation(s)
- Menno J de Jong
- Senckenberg Biodiversity and Climate Research Institute (SBiK-F), Georg-Voigt-Strasse 14-16, Frankfurt am Main, 60325, Germany
| | - Cock van Oosterhout
- Centre for Ecology, Evolution and Conservation, University of East Anglia, Norwich Research Park, Norwich, NR4 7TJ, UK
| | - A Rus Hoelzel
- Department of Biosciences, Durham University, South Road, Durham, DH1 3LE, UK
| | - Axel Janke
- Senckenberg Biodiversity and Climate Research Institute (SBiK-F), Georg-Voigt-Strasse 14-16, Frankfurt am Main, 60325, Germany
- Institute for Ecology, Evolution and Diversity, Goethe University, Max-von-Laue-Strasse 9, Frankfurt am Main, 60438, Germany
- LOEWE-Centre for Translational Biodiversity Genomics (TBG), Senckenberg Nature Research Society, Georg-Voigt-Straße 14-16, Frankfurt am Main, 60325, Germany
| |
Collapse
|
13
|
Schrider DR. Allelic gene conversion softens selective sweeps. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.05.570141. [PMID: 38106127 PMCID: PMC10723294 DOI: 10.1101/2023.12.05.570141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
The prominence of positive selection, in which beneficial mutations are favored by natural selection and rapidly increase in frequency, is a subject of intense debate. Positive selection can result in selective sweeps, in which the haplotype(s) bearing the adaptive allele "sweep" through the population, thereby removing much of the genetic diversity from the region surrounding the target of selection. Two models of selective sweeps have been proposed: classical sweeps, or "hard sweeps", in which a single copy of the adaptive allele sweeps to fixation, and "soft sweeps", in which multiple distinct copies of the adaptive allele leave descendants after the sweep. Soft sweeps can be the outcome of recurrent mutation to the adaptive allele, or the presence of standing genetic variation consisting of multiple copies of the adaptive allele prior to the onset of selection. Importantly, soft sweeps will be common when populations can rapidly adapt to novel selective pressures, either because of a high mutation rate or because adaptive alleles are already present. The prevalence of soft sweeps is especially controversial, and it has been noted that selection on standing variation or recurrent mutations may not always produce soft sweeps. Here, we show that the inverse is true: selection on single-origin de novo mutations may often result in an outcome that is indistinguishable from a soft sweep. This is made possible by allelic gene conversion, which "softens" hard sweeps by copying the adaptive allele onto multiple genetic backgrounds, a process we refer to as a "pseudo-soft" sweep. We carried out a simulation study examining the impact of gene conversion on sweeps from a single de novo variant in models of human, Drosophila, and Arabidopsis populations. The fraction of simulations in which gene conversion had produced multiple haplotypes with the adaptive allele upon fixation was appreciable. Indeed, under realistic demographic histories and gene conversion rates, even if selection always acts on a single-origin mutation, sweeps involving multiple haplotypes are more likely than hard sweeps in large populations, especially when selection is not extremely strong. Thus, even when the mutation rate is low or there is no standing variation, hard sweeps are expected to be the exception rather than the rule in large populations. These results also imply that the presence of signatures of soft sweeps does not necessarily mean that adaptation has been especially rapid or is not mutation limited.
Collapse
Affiliation(s)
- Daniel R Schrider
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599
| |
Collapse
|
14
|
Soni V, Pfeifer SP, Jensen JD. The effects of mutation and recombination rate heterogeneity on the inference of demography and the distribution of fitness effects. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.11.566703. [PMID: 38014252 PMCID: PMC10680612 DOI: 10.1101/2023.11.11.566703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
Disentangling the effects of demography and selection has remained a focal point of population genetic analysis. Knowledge about mutation and recombination is essential in this endeavour; however, despite clear evidence that both mutation and recombination rates vary across genomes, it is common practice to model both rates as fixed. In this study, we quantify how this unaccounted for rate heterogeneity may impact inference using common approaches for inferring selection (DFE-alpha, Grapes, and polyDFE) and/or demography (fastsimcoal2 and δaδi). We demonstrate that, if not properly modelled, this heterogeneity can increase uncertainty in the estimation of demographic and selective parameters and in some scenarios may result in mis-leading inference. These results highlight the importance of quantifying the fundamental evolutionary parameters of mutation and recombination prior to utilizing population genomic data to quantify the effects of genetic drift (i.e., as modulated by demographic history) and selection; or, at the least, that the effects of uncertainty in these parameters can and should be directly modelled in downstream inference.
Collapse
Affiliation(s)
- Vivak Soni
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Susanne P. Pfeifer
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| | - Jeffrey D. Jensen
- Arizona State University, School of Life Sciences, Center for Evolution & Medicine
| |
Collapse
|
15
|
Choquet M, Lenner F, Cocco A, Toullec G, Corre E, Toullec JY, Wallberg A. Comparative Population Transcriptomics Provide New Insight into the Evolutionary History and Adaptive Potential of World Ocean Krill. Mol Biol Evol 2023; 40:msad225. [PMID: 37816123 PMCID: PMC10642690 DOI: 10.1093/molbev/msad225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2023] [Revised: 08/31/2023] [Accepted: 09/25/2023] [Indexed: 10/12/2023] Open
Abstract
Genetic variation is instrumental for adaptation to changing environments but it is unclear how it is structured and contributes to adaptation in pelagic species lacking clear barriers to gene flow. Here, we applied comparative genomics to extensive transcriptome datasets from 20 krill species collected across the Atlantic, Indian, Pacific, and Southern Oceans. We compared genetic variation both within and between species to elucidate their evolutionary history and genomic bases of adaptation. We resolved phylogenetic interrelationships and uncovered genomic evidence to elevate the cryptic Euphausia similis var. armata into species. Levels of genetic variation and rates of adaptive protein evolution vary widely. Species endemic to the cold Southern Ocean, such as the Antarctic krill Euphausia superba, showed less genetic variation and lower evolutionary rates than other species. This could suggest a low adaptive potential to rapid climate change. We uncovered hundreds of candidate genes with signatures of adaptive evolution among Antarctic Euphausia but did not observe strong evidence of adaptive convergence with the predominantly Arctic Thysanoessa. We instead identified candidates for cold-adaptation that have also been detected in Antarctic fish, including genes that govern thermal reception such as TrpA1. Our results suggest parallel genetic responses to similar selection pressures across Antarctic taxa and provide new insights into the adaptive potential of important zooplankton already affected by climate change.
Collapse
Affiliation(s)
- Marvin Choquet
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Natural History Museum, University of Oslo, Oslo, Norway
| | - Felix Lenner
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
| | - Arianna Cocco
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Gaëlle Toullec
- Laboratory for Biological Geochemistry, School of Architecture, Civil and Environmental Engineering, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
| | - Erwan Corre
- CNRS, Sorbonne Université, FR 2424, ABiMS Platform, Station Biologique de Roscoff, Roscoff, France
| | - Jean-Yves Toullec
- CNRS, UMR 7144, AD2M, Sorbonne Université, Station Biologique de Roscoff, Roscoff, France
| | - Andreas Wallberg
- Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| |
Collapse
|
16
|
James J, Kastally C, Budde KB, González-Martínez SC, Milesi P, Pyhäjärvi T, Lascoux M. Between but Not Within-Species Variation in the Distribution of Fitness Effects. Mol Biol Evol 2023; 40:msad228. [PMID: 37832225 PMCID: PMC10630145 DOI: 10.1093/molbev/msad228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 09/04/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023] Open
Abstract
New mutations provide the raw material for evolution and adaptation. The distribution of fitness effects (DFE) describes the spectrum of effects of new mutations that can occur along a genome, and is, therefore, of vital interest in evolutionary biology. Recent work has uncovered striking similarities in the DFE between closely related species, prompting us to ask whether there is variation in the DFE among populations of the same species, or among species with different degrees of divergence, that is whether there is variation in the DFE at different levels of evolution. Using exome capture data from six tree species sampled across Europe we characterized the DFE for multiple species, and for each species, multiple populations, and investigated the factors potentially influencing the DFE, such as demography, population divergence, and genetic background. We find statistical support for the presence of variation in the DFE at the species level, even among relatively closely related species. However, we find very little difference at the population level, suggesting that differences in the DFE are primarily driven by deep features of species biology, and those evolutionarily recent events, such as demographic changes and local adaptation, have little impact.
Collapse
Affiliation(s)
- Jennifer James
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Swedish Collegium of Advanced Study, Uppsala University, Uppsala, Sweden
| | - Chedly Kastally
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Katharina B Budde
- Department of Forest Genetics and Forest Tree Breeding, Georg-August-University Goettingen, Goettingen, Germany
- Center of Biodiversity and Sustainable Land Use (CBL), University of Goettingen, Goettingen, Germany
| | - Santiago C González-Martínez
- National Research Institute for Agriculture, Food and the Environment (INRAE), University of Bordeaux, BIOGECO, Cestas, France
| | - Pascal Milesi
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, Uppsala, Sweden
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Martin Lascoux
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| |
Collapse
|
17
|
Rodrigues MF, Kern AD, Ralph PL. Shared evolutionary processes shape landscapes of genomic variation in the great apes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.07.527547. [PMID: 36798346 PMCID: PMC9934647 DOI: 10.1101/2023.02.07.527547] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/10/2023]
Abstract
For at least the past five decades population genetics, as a field, has worked to describe the precise balance of forces that shape patterns of variation in genomes. The problem is challenging because modelling the interactions between evolutionary processes is difficult, and different processes can impact genetic variation in similar ways. In this paper, we describe how diversity and divergence between closely related species change with time, using correlations between landscapes of genetic variation as a tool to understand the interplay between evolutionary processes. We find strong correlations between landscapes of diversity and divergence in a well sampled set of great ape genomes, and explore how various processes such as incomplete lineage sorting, mutation rate variation, GC-biased gene conversion and selection contribute to these correlations. Through highly realistic, chromosome-scale, forward-in-time simulations we show that the landscapes of diversity and divergence in the great apes are too well correlated to be explained via strictly neutral processes alone. Our best fitting simulation includes both deleterious and beneficial mutations in functional portions of the genome, in which 9% of fixations within those regions is driven by positive selection. This study provides a framework for modelling genetic variation in closely related species, an approach which can shed light on the complex balance of forces that have shaped genetic variation.
Collapse
Affiliation(s)
- Murillo F. Rodrigues
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Andrew D. Kern
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
| | - Peter L. Ralph
- Institute of Ecology and Evolution, University of Oregon
- Department of Biology, University of Oregon
- Department of Mathematics, University of Oregon
| |
Collapse
|
18
|
Pivirotto AM, Platt A, Patel R, Kumar S, Hey J. Analyses of allele age and fitness impact reveal human beneficial alleles to be older than neutral controls. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.10.09.561569. [PMID: 37873438 PMCID: PMC10592680 DOI: 10.1101/2023.10.09.561569] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/25/2023]
Abstract
A classic population genetic prediction is that alleles experiencing directional selection should swiftly traverse allele frequency space, leaving detectable reductions in genetic variation in linked regions. However, despite this expectation, identifying clear footprints of beneficial allele passage has proven to be surprisingly challenging. We addressed the basic premise underlying this expectation by estimating the ages of large numbers of beneficial and deleterious alleles in a human population genomic data set. Deleterious alleles were found to be young, on average, given their allele frequency. However, beneficial alleles were older on average than non-coding, non-regulatory alleles of the same frequency. This finding is not consistent with directional selection and instead indicates some type of balancing selection. Among derived beneficial alleles, those fixed in the population show higher local recombination rates than those still segregating, consistent with a model in which new beneficial alleles experience an initial period of balancing selection due to linkage disequilibrium with deleterious recessive alleles. Alleles that ultimately fix following a period of balancing selection will leave a modest 'soft' sweep impact on the local variation, consistent with the overall paucity of species-wide 'hard' sweeps in human genomes.
Collapse
Affiliation(s)
| | - Alexander Platt
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- University of Pennsylvania, Department of Genetics, Philadelphia PA 19104, USA
| | - Ravi Patel
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Sudhir Kumar
- Temple University, Department of Biology, Philadelphia PA 19122, USA
- Institute for Genomics and Evolutionary Medicine, Temple University, PA 19122, USA
| | - Jody Hey
- Temple University, Department of Biology, Philadelphia PA 19122, USA
| |
Collapse
|
19
|
Murga-Moreno J, Casillas S, Barbadilla A, Uricchio L, Enard D. An efficient and robust ABC approach to infer the rate and strength of adaptation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.29.555322. [PMID: 37693550 PMCID: PMC10491248 DOI: 10.1101/2023.08.29.555322] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/12/2023]
Abstract
Inferring the effects of positive selection on genomes remains a critical step in characterizing the ultimate and proximate causes of adaptation across species, and quantifying positive selection remains a challenge due to the confounding effects of many other evolutionary processes. Robust and efficient approaches for adaptation inference could help characterize the rate and strength of adaptation in non-model species for which demographic history, mutational processes, and recombination patterns are not currently well-described. Here, we introduce an efficient and user-friendly extension of the McDonald-Kreitman test (ABC-MK) for quantifying long-term protein adaptation in specific lineages of interest. We characterize the performance of our approach with forward simulations and find that it is robust to many demographic perturbations and positive selection configurations, demonstrating its suitability for applications to non-model genomes. We apply ABC-MK to the human proteome and a set of known Virus Interacting Proteins (VIPs) to test the long-term adaptation in genes interacting with viruses. We find substantially stronger signatures of positive selection on RNA-VIPs than DNA-VIPs, suggesting that RNA viruses may be an important driver of human adaptation over deep evolutionary time scales.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| | - Sònia Casillas
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | - Antonio Barbadilla
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona, Bellaterra, Barcelona 08193, Spain
| | | | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, USA
| |
Collapse
|
20
|
Rougemont Q, Leroy T, Rondeau EB, Koop B, Bernatchez L. Allele surfing causes maladaptation in a Pacific salmon of conservation concern. PLoS Genet 2023; 19:e1010918. [PMID: 37683018 PMCID: PMC10545117 DOI: 10.1371/journal.pgen.1010918] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 10/02/2023] [Accepted: 08/11/2023] [Indexed: 09/10/2023] Open
Abstract
How various factors, including demography, recombination or genome duplication, may impact the efficacy of natural selection and the burden of deleterious mutations, is a central question in evolutionary biology and genetics. In this study, we show that key evolutionary processes, including variations in i) effective population size (Ne) ii) recombination rates and iii) chromosome inheritance, have influenced the genetic load and efficacy of selection in Coho salmon (Oncorhynchus kisutch), a widely distributed salmonid species on the west coast of North America. Using whole genome resequencing data from 14 populations at different migratory distances from their southern glacial refugium, we found evidence supporting gene surfing, wherein reduced Ne at the postglacial recolonization front, leads to a decrease in the efficacy of selection and a surf of deleterious alleles in the northernmost populations. Furthermore, our results indicate that recombination rates play a prime role in shaping the load along the genome. Additionally, we identified variation in polyploidy as a contributing factor to within-genome variation of the load. Overall, our results align remarkably well with expectations under the nearly neutral theory of molecular evolution. We discuss the fundamental and applied implications of these findings for evolutionary and conservation genomics.
Collapse
Affiliation(s)
- Quentin Rougemont
- Centre d’Ecologie Fonctionnelle et Evolutive, Univ Montpellier, CNRS, EPHE, IRD, Montpellier, France
| | - Thibault Leroy
- GenPhySE, INRAE, INP, ENVT, Université de Toulouse, Auzeville- Tolosane, France
| | - Eric B. Rondeau
- Department of Fisheries and Ocean, Pacific Biological Station, Nanaimo, Canada
| | - Ben Koop
- Department of Biology, University of Victoria, Victoria, Canada
| | - Louis Bernatchez
- Département de Biologie, Institut de Biologie Intégrative et des Systèmes (IBIS), Université Laval, Québec, Canada
| |
Collapse
|
21
|
Charmouh AP, Bocedi G, Hartfield M. Inferring the distributions of fitness effects and proportions of strongly deleterious mutations. G3 (BETHESDA, MD.) 2023; 13:jkad140. [PMID: 37337692 PMCID: PMC10468728 DOI: 10.1093/g3journal/jkad140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 06/05/2023] [Accepted: 06/05/2023] [Indexed: 06/21/2023]
Abstract
The distribution of fitness effects is a key property in evolutionary genetics as it has implications for several evolutionary phenomena including the evolution of sex and mating systems, the rate of adaptive evolution, and the prevalence of deleterious mutations. Despite the distribution of fitness effects being extensively studied, the effects of strongly deleterious mutations are difficult to infer since such mutations are unlikely to be present in a sample of haplotypes, so genetic data may contain very little information about them. Recent work has attempted to correct for this issue by expanding the classic gamma-distributed model to explicitly account for strongly deleterious mutations. Here, we use simulations to investigate one such method, adding a parameter (plth) to capture the proportion of strongly deleterious mutations. We show that plth can improve the model fit when applied to individual species but underestimates the true proportion of strongly deleterious mutations. The parameter can also artificially maximize the likelihood when used to jointly infer a distribution of fitness effects from multiple species. As plth and related parameters are used in current inference algorithms, our results are relevant with respect to avoiding model artifacts and improving future tools for inferring the distribution of fitness effects.
Collapse
Affiliation(s)
- Anders P Charmouh
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
- Bioinformatics Research Centre Aarhus University, University City 81, building 1872, 3rd floor. DK-8000 Aarhus C, Denmark
| | - Greta Bocedi
- School of Biological Sciences, University of Aberdeen, Aberdeen AB24 3FX, UK
| | - Matthew Hartfield
- Institute of Ecology and Evolution, The University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
22
|
Näsvall K, Boman J, Höök L, Vila R, Wiklund C, Backström N. Nascent evolution of recombination rate differences as a consequence of chromosomal rearrangements. PLoS Genet 2023; 19:e1010717. [PMID: 37549188 PMCID: PMC10434929 DOI: 10.1371/journal.pgen.1010717] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2023] [Revised: 08/17/2023] [Accepted: 07/17/2023] [Indexed: 08/09/2023] Open
Abstract
Reshuffling of genetic variation occurs both by independent assortment of chromosomes and by homologous recombination. Such reshuffling can generate novel allele combinations and break linkage between advantageous and deleterious variants which increases both the potential and the efficacy of natural selection. Here we used high-density linkage maps to characterize global and regional recombination rate variation in two populations of the wood white butterfly (Leptidea sinapis) that differ considerably in their karyotype as a consequence of at least 27 chromosome fissions and fusions. The recombination data were compared to estimates of genetic diversity and measures of selection to assess the relationship between chromosomal rearrangements, crossing over, maintenance of genetic diversity and adaptation. Our data show that the recombination rate is influenced by both chromosome size and number, but that the difference in the number of crossovers between karyotypes is reduced as a consequence of a higher frequency of double crossovers in larger chromosomes. As expected from effects of selection on linked sites, we observed an overall positive association between recombination rate and genetic diversity in both populations. Our results also revealed a significant effect of chromosomal rearrangements on the rate of intergenic diversity change between populations, but limited effects on polymorphisms in coding sequence. We conclude that chromosomal rearrangements can have considerable effects on the recombination landscape and consequently influence both maintenance of genetic diversity and efficiency of selection in natural populations.
Collapse
Affiliation(s)
- Karin Näsvall
- Evolutionary Biology Program, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, Uppsala, Sweden
| | - Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, Uppsala, Sweden
| | - Lars Höök
- Evolutionary Biology Program, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, Uppsala, Sweden
| | - Roger Vila
- Butterfly Diversity and Evolution Lab, Institut de Biologia Evolutiva (CSIC-Univ. Pompeu Fabra), Barcelona, Spain
| | - Christer Wiklund
- Department of Zoology: Division of Ecology, Stockholm University, Stockholm, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, Uppsala, Sweden
| |
Collapse
|
23
|
Näsvall K, Boman J, Talla V, Backström N. Base Composition, Codon Usage, and Patterns of Gene Sequence Evolution in Butterflies. Genome Biol Evol 2023; 15:evad150. [PMID: 37565492 PMCID: PMC10462419 DOI: 10.1093/gbe/evad150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 07/17/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023] Open
Abstract
Coding sequence evolution is influenced by both natural selection and neutral evolutionary forces. In many species, the effects of mutation bias, codon usage, and GC-biased gene conversion (gBGC) on gene sequence evolution have not been detailed. Quantification of how these forces shape substitution patterns is therefore necessary to understand the strength and direction of natural selection. Here, we used comparative genomics to investigate the association between base composition and codon usage bias on gene sequence evolution in butterflies and moths (Lepidoptera), including an in-depth analysis of underlying patterns and processes in one species, Leptidea sinapis. The data revealed significant G/C to A/T substitution bias at third codon position with some variation in the strength among different butterfly lineages. However, the substitution bias was lower than expected from previously estimated mutation rate ratios, partly due to the influence of gBGC. We found that A/T-ending codons were overrepresented in most species, but there was a positive association between the magnitude of codon usage bias and GC-content in third codon positions. In addition, the tRNA-gene population in L. sinapis showed higher GC-content at third codon positions compared to coding sequences in general and less overrepresentation of A/T-ending codons. There was an inverse relationship between synonymous substitutions and codon usage bias indicating selection on synonymous sites. We conclude that the evolutionary rate in Lepidoptera is affected by a complex interaction between underlying G/C -> A/T mutation bias and partly counteracting fixation biases, predominantly conferred by overall purifying selection, gBGC, and selection on codon usage.
Collapse
Affiliation(s)
- Karin Näsvall
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Jesper Boman
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Venkat Talla
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| | - Niclas Backström
- Evolutionary Biology Program, Department of Ecology and Genetics (IEG), Uppsala University, Uppsala, Sweden
| |
Collapse
|
24
|
Wade EE, Kyriazis CC, Cavassim MIA, Lohmueller KE. Quantifying the fraction of new mutations that are recessive lethal. Evolution 2023; 77:1539-1549. [PMID: 37074880 PMCID: PMC10309970 DOI: 10.1093/evolut/qpad061] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 03/21/2023] [Accepted: 04/14/2023] [Indexed: 04/20/2023]
Abstract
The presence and impact of recessive lethal mutations have been widely documented in diploid outcrossing species. However, precise estimates of the proportion of new mutations that are recessive lethal remain limited. Here, we evaluate the performance of Fit∂a∂i, a commonly used method for inferring the distribution of fitness effects (DFE), in the presence of lethal mutations. Using simulations, we demonstrate that in both additive and recessive cases, inference of the deleterious nonlethal portion of the DFE is minimally affected by a small proportion (<10%) of lethal mutations. Additionally, we demonstrate that while Fit∂a∂i cannot estimate the fraction of recessive lethal mutations, Fit∂a∂i can accurately infer the fraction of additive lethal mutations. Finally, as an alternative approach to estimate the proportion of mutations that are recessive lethal, we employ models of mutation-selection-drift balance using existing genomic parameters and estimates of segregating recessive lethals for humans and Drosophila melanogaster. In both species, the segregating recessive lethal load can be explained by a very small fraction (<1%) of new nonsynonymous mutations being recessive lethal. Our results refute recent assertions of a much higher proportion of mutations being recessive lethal (4%-5%), while highlighting the need for additional information on the joint distribution of selection and dominance coefficients.
Collapse
Affiliation(s)
- Emma E Wade
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
- Department of Computer Science and Engineering, Mississippi State University, Starkville, MS, United States
| | - Christopher C Kyriazis
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
| | - Maria Izabel A Cavassim
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California–Los Angeles, Los Angeles, CA, United States
- Interdepartmental Program in Bioinformatics, University of California–Los Angeles, Los Angeles, CA, United States
- Department of Human Genetics, David Geffen School of Medicine, University of California–Los Angeles, Los Angeles, CA, United States
| |
Collapse
|
25
|
Barroso GV, Lohmueller KE. Inferring the mode and strength of ongoing selection. Genome Res 2023; 33:632-643. [PMID: 37055196 PMCID: PMC10234300 DOI: 10.1101/gr.276386.121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 03/29/2023] [Indexed: 04/15/2023]
Abstract
Genome sequence data are no longer scarce. The UK Biobank alone comprises 200,000 individual genomes, with more on the way, leading the field of human genetics toward sequencing entire populations. Within the next decades, other model organisms will follow suit, especially domesticated species such as crops and livestock. Having sequences from most individuals in a population will present new challenges for using these data to improve health and agriculture in the pursuit of a sustainable future. Existing population genetic methods are designed to model hundreds of randomly sampled sequences but are not optimized for extracting the information contained in the larger and richer data sets that are beginning to emerge, with thousands of closely related individuals. Here we develop a new method called trio-based inference of dominance and selection (TIDES) that uses data from tens of thousands of family trios to make inferences about natural selection acting in a single generation. TIDES further improves on the state of the art by making no assumptions regarding demography, linkage, or dominance. We discuss how our method paves the way for studying natural selection from new angles.
Collapse
Affiliation(s)
- Gustavo V Barroso
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, California 90095-1606, USA; Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, California 90095, USA
| |
Collapse
|
26
|
Latrille T, Rodrigue N, Lartillot N. Genes and sites under adaptation at the phylogenetic scale also exhibit adaptation at the population-genetic scale. Proc Natl Acad Sci U S A 2023; 120:e2214977120. [PMID: 36897968 PMCID: PMC10089192 DOI: 10.1073/pnas.2214977120] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 02/11/2023] [Indexed: 03/12/2023] Open
Abstract
Adaptation in protein-coding sequences can be detected from multiple sequence alignments across species or alternatively by leveraging polymorphism data within a population. Across species, quantification of the adaptive rate relies on phylogenetic codon models, classically formulated in terms of the ratio of nonsynonymous over synonymous substitution rates. Evidence of an accelerated nonsynonymous substitution rate is considered a signature of pervasive adaptation. However, because of the background of purifying selection, these models are potentially limited in their sensitivity. Recent developments have led to more sophisticated mutation-selection codon models aimed at making a more detailed quantitative assessment of the interplay between mutation, purifying, and positive selection. In this study, we conducted a large-scale exome-wide analysis of placental mammals with mutation-selection models, assessing their performance at detecting proteins and sites under adaptation. Importantly, mutation-selection codon models are based on a population-genetic formalism and thus are directly comparable to the McDonald and Kreitman test at the population level to quantify adaptation. Taking advantage of this relationship between phylogenetic and population genetics analyses, we integrated divergence and polymorphism data across the entire exome for 29 populations across 7 genera and showed that proteins and sites detected to be under adaptation at the phylogenetic scale are also under adaptation at the population-genetic scale. Altogether, our exome-wide analysis shows that phylogenetic mutation-selection codon models and the population-genetic test of adaptation can be reconciled and are congruent, paving the way for integrative models and analyses across individuals and populations.
Collapse
Affiliation(s)
- Thibault Latrille
- Université de Lyon, Université Lyon 1, CNRS, VetAgro Sup, Laboratoire de Biométrie et Biologie Evolutive, UMR5558, 69100Villeurbanne, France
- École Normale Supérieure de Lyon, Université de Lyon, 69342Lyon, France
- Department of Computational Biology, Université de Lausanne, 1015Lausanne, Switzerland
| | - Nicolas Rodrigue
- Department of Biology, Institute of Biochemistry, and School of Mathematics and Statistics, Carleton University, K1S 5B6Ottawa, Canada
| | - Nicolas Lartillot
- Université de Lyon, Université Lyon 1, CNRS, VetAgro Sup, Laboratoire de Biométrie et Biologie Evolutive, UMR5558, 69100Villeurbanne, France
| |
Collapse
|
27
|
The use of evolutionary analyses to predict functionally relevant traits in filamentous plant pathogens. Curr Opin Microbiol 2023; 73:102244. [PMID: 36889024 DOI: 10.1016/j.mib.2022.102244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 10/27/2022] [Accepted: 11/03/2022] [Indexed: 03/08/2023]
Abstract
Identifying traits involved in plant-pathogen interactions is one of the major objectives in molecular plant pathology. Evolutionary analyses may assist in the identification of genes encoding traits that are involved in virulence and local adaptation, including adaptation to agricultural intervention strategies. In the past decades, the number of available genome sequences of fungal plant pathogens has rapidly increased, providing a rich source for the discovery of functionally important genes as well as inference of species histories. Positive selection in the form of diversifying or directional selection leaves particular signatures in genome alignments and can be identified with statistical genetics methods. This review summarises the concepts and approaches used in evolutionary genomics and lists major discoveries related to plant-pathogen adaptative evolution. We underline the significant contribution of evolutionary genomics in discovering virulence-related traits and the study of plant-pathogen ecology and adaptive evolution.
Collapse
|
28
|
Vlček J, Miláček M, Vinkler M, Štefka J. Effect of population size and selection on Toll-like receptor diversity in populations of Galápagos mockingbirds. J Evol Biol 2023; 36:109-120. [PMID: 36398499 DOI: 10.1111/jeb.14121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Revised: 08/25/2022] [Accepted: 09/10/2022] [Indexed: 11/19/2022]
Abstract
The interactions of evolutionary forces are difficult to analyse in free-living populations. However, when properly understood, they provide valuable insights into evolutionary biology and conservation genetics. This is particularly important for the interplay of genetic drift and natural selection in immune genes that confer resistance to disease. The Galápagos Islands are inhabited by four closely related species of mockingbirds (Mimus spp.). We used 12 different-sized populations of Galápagos mockingbirds and one population of their continental relative northern mockingbird (Mimus polyglottos) to study the effects of genetic drift on the molecular evolution of immune genes, the Toll-like receptors (TLRs: TLR1B, TLR4 and TLR15). We found that neutral genetic diversity was positively correlated with island size, indicating an important effect of genetic drift. However, for TLR1B and TLR4, there was little correlation between functional (e.g., protein) diversity and island size, and protein structural properties were largely conserved, indicating only a limited effect of genetic drift on molecular phenotype. By contrast, TLR15 was less conserved and even its putative functional polymorphism correlated with island size. The patterns observed for the three genes suggest that genetic drift does not necessarily dominate selection even in relatively small populations, but that the final outcome depends on the degree of selection constraint that is specific for each TLR locus.
Collapse
Affiliation(s)
- Jakub Vlček
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic.,Department of Zoology, University of South Bohemia in České Budějovice Faculty of Science, České Budějovice, Czech Republic.,Department of Botany, Charles University Faculty of Science, Prague, Czech Republic
| | - Matěj Miláček
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic.,Department of Zoology, University of South Bohemia in České Budějovice Faculty of Science, České Budějovice, Czech Republic
| | - Michal Vinkler
- Department of Zoology, Charles University Faculty of Science, Prague, Czech Republic
| | - Jan Štefka
- Institute of Parasitology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic.,Department of Zoology, University of South Bohemia in České Budějovice Faculty of Science, České Budějovice, Czech Republic
| |
Collapse
|
29
|
Angst P, Ameline C, Haag CR, Ben-Ami F, Ebert D, Fields PD. Genetic Drift Shapes the Evolution of a Highly Dynamic Metapopulation. Mol Biol Evol 2022; 39:msac264. [PMID: 36472514 PMCID: PMC9778854 DOI: 10.1093/molbev/msac264] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 11/28/2022] [Accepted: 12/01/2022] [Indexed: 12/12/2022] Open
Abstract
The dynamics of extinction and (re)colonization in habitat patches are characterizing features of dynamic metapopulations, causing them to evolve differently than large, stable populations. The propagule model, which assumes genetic bottlenecks during colonization, posits that newly founded subpopulations have low genetic diversity and are genetically highly differentiated from each other. Immigration may then increase diversity and decrease differentiation between subpopulations. Thus, older and/or less isolated subpopulations are expected to have higher genetic diversity and less genetic differentiation. We tested this theory using whole-genome pool-sequencing to characterize nucleotide diversity and differentiation in 60 subpopulations of a natural metapopulation of the cyclical parthenogen Daphnia magna. For comparison, we characterized diversity in a single, large, and stable D. magna population. We found reduced (synonymous) genomic diversity, a proxy for effective population size, weak purifying selection, and low rates of adaptive evolution in the metapopulation compared with the large, stable population. These differences suggest that genetic bottlenecks during colonization reduce effective population sizes, which leads to strong genetic drift and reduced selection efficacy in the metapopulation. Consistent with the propagule model, we found lower diversity and increased differentiation in younger and also in more isolated subpopulations. Our study sheds light on the genomic consequences of extinction-(re)colonization dynamics to an unprecedented degree, giving strong support for the propagule model. We demonstrate that the metapopulation evolves differently from a large, stable population and that evolution is largely driven by genetic drift.
Collapse
Affiliation(s)
- Pascal Angst
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
| | - Camille Ameline
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
- Evolutionary Biology, Instituto Gulbenkian de Ciência, Oeiras 2780-156, Portugal
| | - Christoph R Haag
- CEFE, Université de Montpellier, CNRS, EPHE, IRD, Montpellier 34293, France
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Frida Ben-Ami
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
- George S. Wise Faculty of Life Sciences, School of Zoology, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dieter Ebert
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Peter D Fields
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| |
Collapse
|
30
|
Thorpe HA, Tourrette E, Yahara K, Vale FF, Liu S, Oleastro M, Alarcon T, Perets TT, Latifi-Navid S, Yamaoka Y, Martinez-Gonzalez B, Karayiannis I, Karamitros T, Sgouras DN, Elamin W, Pascoe B, Sheppard SK, Ronkainen J, Aro P, Engstrand L, Agreus L, Suerbaum S, Thorell K, Falush D. Repeated out-of-Africa expansions of Helicobacter pylori driven by replacement of deleterious mutations. Nat Commun 2022; 13:6842. [PMID: 36369175 PMCID: PMC9652371 DOI: 10.1038/s41467-022-34475-3] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2022] [Accepted: 10/26/2022] [Indexed: 11/13/2022] Open
Abstract
Helicobacter pylori lives in the human stomach and has a population structure resembling that of its host. However, H. pylori from Europe and the Middle East trace substantially more ancestry from modern African populations than the humans that carry them. Here, we use a collection of Afro-Eurasian H. pylori genomes to show that this African ancestry is due to at least three distinct admixture events. H. pylori from East Asia, which have undergone little admixture, have accumulated many more non-synonymous mutations than African strains. European and Middle Eastern bacteria have elevated African ancestry at the sites of these mutations, implying selection to remove them during admixture. Simulations show that population fitness can be restored after bottlenecks by migration and subsequent admixture of small numbers of bacteria from non-bottlenecked populations. We conclude that recent spread of African DNA has been driven by deleterious mutations accumulated during the original out-of-Africa bottleneck.
Collapse
Affiliation(s)
- Harry A Thorpe
- Department of Biostatistics, University of Oslo, Oslo, Norway
| | - Elise Tourrette
- CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai, China
| | - Koji Yahara
- Antimicrobial Resistance Research Center, National Institute of Infectious Diseases, Tokyo, Japan
| | - Filipa F Vale
- Pathogen Genome Bioinformatics and Computational Biology, Research Institute for Medicines (iMed.ULisboa), Faculty of Pharmacy, Universidade de Lisboa, Lisbon, Portugal
| | - Siqi Liu
- CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai, China
- University of Chinese Academy of Sciences, Beijing, China
| | - Mónica Oleastro
- National Reference Laboratory for Gastrointestinal Infections, Department of Infectious Diseases, National Institute of Health Dr Ricardo Jorge, Lisbon, Portugal
| | - Teresa Alarcon
- Department of Microbiology, Hospital Universitario La Princesa, Instituto de Investigación Sanitaria Princesa, Madrid, Spain
| | - Tsachi-Tsadok Perets
- Gastroenterology Laboratory, Rabin Medical Center, Petah Tikva, Israel
- Department of Digital Medical Technologies, Holon Institute of Technology, Holon, Israel
| | - Saeid Latifi-Navid
- Department of Biology, Faculty of Sciences, University of Mohaghegh Ardabili, Ardabil, Iran
| | - Yoshio Yamaoka
- Department of Environmental and Preventive Medicine, Oita University Faculty of Medicine, Yufu, Oita, Japan
- Department of Medicine-Gastroenterology, Baylor College of Medicine, Houston, TX, USA
| | | | - Ioannis Karayiannis
- Laboratory of Medical Microbiology, Hellenic Pasteur Institute, Athens, Greece
| | | | | | - Wael Elamin
- G42 Healthcare, Abu Dhabi, UAE
- Elrazi University, Khartoum, Sudan
| | - Ben Pascoe
- Department of Biology, University of Oxford, Oxford, UK
| | - Samuel K Sheppard
- Ineos Oxford Institute, Department of Biology, University of Oxford, Oxford, UK
| | - Jukka Ronkainen
- Center for Life Course Health Research, University of Oulu, Oulu, Finland
- Primary Health Care Center, Tornio, Finland
| | | | - Lars Engstrand
- Center for Translational Microbiome Research, Department for Microbiology, Tumor, and Cell Biology, Karolinska Institutet, Stockholm, Sweden
| | - Lars Agreus
- Division of Family Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Sebastian Suerbaum
- Department of Medical Microbiology and Hospital Epidemiology, Max von Pettenkofer Institute, Faculty of Medicine, LMU Munich, Munich, Germany
- Department of Medical Microbiology and Hospital Epidemiology, Hannover Medical School, Hanover, Germany
- DZIF German Center for Infection Research, Hannover-Braunschweig and Munich Partner Sites, Munich, Germany
| | - Kaisa Thorell
- Institute of Biomedicine, Department of Infectious Diseases, University of Gothenburg, Gothenburg, Sweden
- Department of Clinical Microbiology, Sahlgrenska University Hospital, Gothenburg, Sweden
| | - Daniel Falush
- CAS Key Laboratory of Molecular Virology and Immunology, Institut Pasteur of Shanghai, Chinese Academy of Sciences, Shanghai, China.
| |
Collapse
|
31
|
Murga-Moreno J, Coronado-Zamora M, Casillas S, Barbadilla A. impMKT: the imputed McDonald and Kreitman test, a straightforward correction that significantly increases the evidence of positive selection of the McDonald and Kreitman test at the gene level. G3 GENES|GENOMES|GENETICS 2022; 12:6670623. [PMID: 35976111 PMCID: PMC9526038 DOI: 10.1093/g3journal/jkac206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Accepted: 07/28/2022] [Indexed: 11/14/2022]
Abstract
The McDonald and Kreitman test is one of the most powerful and widely used methods to detect and quantify recurrent natural selection in DNA sequence data. One of its main limitations is the underestimation of positive selection due to the presence of slightly deleterious variants segregating at low frequencies. Although several approaches have been developed to overcome this limitation, most of them work on gene pooled analyses. Here, we present the imputed McDonald and Kreitman test (impMKT), a new straightforward approach for the detection of positive selection and other selection components of the distribution of fitness effects at the gene level. We compare imputed McDonald and Kreitman test with other widely used McDonald and Kreitman test approaches considering both simulated and empirical data. By applying imputed McDonald and Kreitman test to humans and Drosophila data at the gene level, we substantially increase the statistical evidence of positive selection with respect to previous approaches (e.g. by 50% and 157% compared with the McDonald and Kreitman test in Drosophila and humans, respectively). Finally, we review the minimum number of genes required to obtain a reliable estimation of the proportion of adaptive substitution (α) in gene pooled analyses by using the imputed McDonald and Kreitman test compared with other McDonald and Kreitman test implementations. Because of its simplicity and increased power to detect recurrent positive selection on genes, we propose the imputed McDonald and Kreitman test as the first straightforward approach for testing specific evolutionary hypotheses at the gene level. The software implementation and population genomics data are available at the web-server imkt.uab.cat.
Collapse
Affiliation(s)
- Jesús Murga-Moreno
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
| | - Marta Coronado-Zamora
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
| | - Sònia Casillas
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
| | - Antonio Barbadilla
- Institute of Biotechnology and Biomedicine, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
- Department of Genetics and Microbiology, Universitat Autònoma de Barcelona , Barcelona 08193, Spain
| |
Collapse
|
32
|
Moutinho AF, Eyre-Walker A, Dutheil JY. Strong evidence for the adaptive walk model of gene evolution in Drosophila and Arabidopsis. PLoS Biol 2022; 20:e3001775. [PMID: 36099311 PMCID: PMC9470001 DOI: 10.1371/journal.pbio.3001775] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 08/01/2022] [Indexed: 11/19/2022] Open
Abstract
Understanding the dynamics of species adaptation to their environments has long been a central focus of the study of evolution. Theories of adaptation propose that populations evolve by “walking” in a fitness landscape. This “adaptive walk” is characterised by a pattern of diminishing returns, where populations further away from their fitness optimum take larger steps than those closer to their optimal conditions. Hence, we expect young genes to evolve faster and experience mutations with stronger fitness effects than older genes because they are further away from their fitness optimum. Testing this hypothesis, however, constitutes an arduous task. Young genes are small, encode proteins with a higher degree of intrinsic disorder, are expressed at lower levels, and are involved in species-specific adaptations. Since all these factors lead to increased protein evolutionary rates, they could be masking the effect of gene age. While controlling for these factors, we used population genomic data sets of Arabidopsis and Drosophila and estimated the rate of adaptive substitutions across genes from different phylostrata. We found that a gene’s evolutionary age significantly impacts the molecular rate of adaptation. Moreover, we observed that substitutions in young genes tend to have larger physicochemical effects. Our study, therefore, provides strong evidence that molecular evolution follows an adaptive walk model across a large evolutionary timescale. This study uses population genomic datasets from Arabidopsis and Drosophila to show that young genes adapt faster and are subject to mutations of larger fitness effects, providing strong evidence that molecular evolution follows an adaptive walk model across a large evolutionary timescale.
Collapse
Affiliation(s)
- Ana Filipa Moutinho
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- * E-mail:
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Julien Y. Dutheil
- Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plön, Germany
- Unité Mixte de Recherche 5554 Institut des Sciences de l’Evolution, CNRS, IRD, EPHE, Université de Montpellier, Montpellier, France
| |
Collapse
|
33
|
Johri P, Eyre-Walker A, Gutenkunst RN, Lohmueller KE, Jensen JD. On the prospect of achieving accurate joint estimation of selection with population history. Genome Biol Evol 2022; 14:evac088. [PMID: 35675379 PMCID: PMC9254643 DOI: 10.1093/gbe/evac088] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/02/2022] [Indexed: 11/15/2022] Open
Abstract
As both natural selection and population history can affect genome-wide patterns of variation, disentangling the contributions of each has remained as a major challenge in population genetics. We here discuss historical and recent progress towards this goal-highlighting theoretical and computational challenges that remain to be addressed, as well as inherent difficulties in dealing with model complexity and model violations-and offer thoughts on potentially fruitful next steps.
Collapse
Affiliation(s)
- Parul Johri
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| | | | - Ryan N Gutenkunst
- Department of Molecular and Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Kirk E Lohmueller
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA, USA
- Department of Human Genetics, University of California, Los Angeles, CA, USA
| | - Jeffrey D Jensen
- School of Life Sciences, Arizona State University, Tempe, AZ, USA
| |
Collapse
|
34
|
Soni V, Vos M, Eyre-Walker A. A new test suggests hundreds of amino acid polymorphisms in humans are subject to balancing selection. PLoS Biol 2022; 20:e3001645. [PMID: 35653351 PMCID: PMC9162324 DOI: 10.1371/journal.pbio.3001645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Accepted: 04/25/2022] [Indexed: 11/18/2022] Open
Abstract
The role that balancing selection plays in the maintenance of genetic diversity remains unresolved. Here, we introduce a new test, based on the McDonald–Kreitman test, in which the number of polymorphisms that are shared between populations is contrasted to those that are private at selected and neutral sites. We show that this simple test is robust to a variety of demographic changes, and that it can also give a direct estimate of the number of shared polymorphisms that are directly maintained by balancing selection. We apply our method to population genomic data from humans and provide some evidence that hundreds of nonsynonymous polymorphisms are subject to balancing selection.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Michiel Vos
- European Centre for Environment and Human Health, University of Exeter Medical School, Environment and Sustainability Institute, Penryn, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- * E-mail:
| |
Collapse
|
35
|
Liang YY, Chen XY, Zhou BF, Mitchell-Olds T, Wang B. Globally Relaxed Selection and Local Adaptation in Boechera stricta. Genome Biol Evol 2022; 14:evac043. [PMID: 35349686 PMCID: PMC9011030 DOI: 10.1093/gbe/evac043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/23/2022] [Indexed: 11/25/2022] Open
Abstract
The strength of selection varies among populations and across the genome, but the determinants of efficacy of selection remain unclear. In this study, we used whole-genome sequencing data from 467 Boechera stricta accessions to quantify the strength of selection and characterize the pattern of local adaptation. We found low genetic diversity on 0-fold degenerate sites and conserved non-coding sites, indicating functional constraints on these regions. The estimated distribution of fitness effects and the proportion of fixed substitutions suggest relaxed negative and positive selection in B. stricta. Among the four population groups, the NOR and WES groups have smaller effective population size (Ne), higher proportions of effectively neutral sites, and lower rates of adaptive evolution compared with UTA and COL groups, reflecting the effect of Ne on the efficacy of natural selection. We also found weaker selection on GC-biased sites compared with GC-conservative (unbiased) sites, suggested that GC-biased gene conversion has affected the strength of selection in B. stricta. We found mixed evidence for the role of the recombination rate on the efficacy of selection. The positive and negative selection was stronger in high-recombination regions compared with low-recombination regions in COL but not in other groups. By scanning the genome, we found different subsets of selected genes suggesting differential adaptation among B. stricta groups. These results show that differences in effective population size, nucleotide composition, and recombination rate are important determinants of the efficacy of selection. This study enriches our understanding of the roles of natural selection and local adaptation in shaping genomic variation.
Collapse
Affiliation(s)
- Yi-Ye Liang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Xue-Yan Chen
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | - Biao-Feng Zhou
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- University of the Chinese Academy of Sciences, Beijing, China
| | | | - Baosheng Wang
- Key Laboratory of Plant Resources Conservation and Sustainable Utilization, South China Botanical Garden, Chinese Academy of Sciences,
Guangzhou, China
- Center of Conservation Biology, Core Botanical Gardens, Chinese Academy of Sciences, Guangzhou, China
| |
Collapse
|
36
|
Angst P, Ebert D, Fields PD. Demographic history shapes genomic variation in an intracellular parasite with a wide geographic distribution. Mol Ecol 2022; 31:2528-2544. [DOI: 10.1111/mec.16419] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2021] [Revised: 02/14/2022] [Accepted: 02/28/2022] [Indexed: 11/27/2022]
Affiliation(s)
- Pascal Angst
- Department of Environmental Sciences, Zoology University of Basel Vesalgasse 1 4051 Basel Switzerland
| | - Dieter Ebert
- Department of Environmental Sciences, Zoology University of Basel Vesalgasse 1 4051 Basel Switzerland
| | - Peter D. Fields
- Department of Environmental Sciences, Zoology University of Basel Vesalgasse 1 4051 Basel Switzerland
| |
Collapse
|
37
|
Fields PD, McTaggart S, Reisser CMO, Haag C, Palmer WH, Little TJ, Ebert D, Obbard DJ. Population-genomic analysis identifies a low rate of global adaptive fixation in the proteins of the cyclical parthenogen Daphnia magna. Mol Biol Evol 2022; 39:6542319. [PMID: 35244177 PMCID: PMC8963301 DOI: 10.1093/molbev/msac048] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
Daphnia are well-established ecological and evolutionary models, and the interaction between D. magna and its microparasites is widely considered a paragon of the host-parasite coevolutionary process. Like other well-studied arthropods such as Drosophila melanogaster and Anopheles gambiae, D. magna is a small, widespread, and abundant species that is therefore expected to display a large long-term population size and high rates of adaptive protein evolution. However, unlike these other species, D. magna is cyclically asexual and lives in a highly structured environment (ponds and lakes) with moderate levels of dispersal, both of which are predicted to impact upon long-term effective population size and adaptive protein evolution. To investigate patterns of adaptive protein fixation, we produced the complete coding genomes of 36 D. magna clones sampled from across the European range (Western Palaearctic), along with draft sequences for the close relatives D. similis and D. lumholtzi, used as outgroups. We analyzed genome-wide patterns of adaptive fixation, with a particular focus on genes that have an a priori expectation of high rates, such as those likely to mediate immune responses, RNA interference against viruses and transposable elements, and those with a strongly male-biased expression pattern. We find that, as expected, D. magna displays high levels of diversity and that this is highly structured among populations. However, compared with Drosophila, we find that D. magna proteins appear to have a high proportion of weakly deleterious variants and do not show evidence of pervasive adaptive fixation across its entire range. This is true of the genome as a whole, and also of putative ‘arms race’ genes that often show elevated levels of adaptive substitution in other species. In addition to the likely impact of extensive, and previously documented, local adaptation, we speculate that these findings may reflect reduced efficacy of selection associated with cyclical asexual reproduction.
Collapse
Affiliation(s)
- Peter D Fields
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, Basel, CH-4051, Switzerland
| | - Seanna McTaggart
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
| | - Céline M O Reisser
- Centre d'Ecologie Fonctionnelle et Evolutive CEFE UMR 5175, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, campus CNRS, 1919, route de Mende, 34293 Montpellier Cedex 5, France.,MARBEC, Univ Montpellier, CNRS, IFREMER, IRD, Montpellier, France
| | - Christoph Haag
- Centre d'Ecologie Fonctionnelle et Evolutive CEFE UMR 5175, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, campus CNRS, 1919, route de Mende, 34293 Montpellier Cedex 5, France
| | - William H Palmer
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
| | - Tom J Little
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
| | - Dieter Ebert
- University of Basel, Department of Environmental Sciences, Zoology, Vesalgasse 1, Basel, CH-4051, Switzerland
| | - Darren J Obbard
- Institute of Evolutionary Biology; School of Biological Sciences University of Edinburgh, Edinburgh, EH9 3JT, United Kingdom
| |
Collapse
|
38
|
Chen J, Bataillon T, Glémin S, Lascoux M. What does the distribution of fitness effects of new mutations reflect? Insights from plants. THE NEW PHYTOLOGIST 2022; 233:1613-1619. [PMID: 34704271 DOI: 10.1111/nph.17826] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/01/2021] [Accepted: 09/28/2021] [Indexed: 06/13/2023]
Abstract
The distribution of fitness effects (DFE) of new mutations plays a central role in molecular evolution. It is therefore crucial to be able to estimate it accurately from genomic data and to understand the factors that shape it. After a rapid overview of available methods to characterize the fitness effects of mutations, we review what is known on the factors affecting them in plants. Available data indicate that life history traits (e.g. mating system and longevity) have a major effect on the DFE. By contrast, the impact of demography within species appears to be more limited. These results remain to be confirmed, and methods to estimate the joint evolution of demography, life history traits, and the DFE need to be developed.
Collapse
Affiliation(s)
- Jun Chen
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, C.F. Möllers Allé 8, Aarhus C, DK-8000, Denmark
| | - Sylvain Glémin
- Centre National de la Recherche Scientifique (CNRS), ECOBIO (Ecosystèmes, Biodiversité, Evolution) - Unité Mixte de Recherche (UMR) 6553, Université de Rennes, Rennes, F-35000, France
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| | - Martin Lascoux
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| |
Collapse
|
39
|
Pélissié B, Chen YH, Cohen ZP, Crossley MS, Hawthorne DJ, Izzo V, Schoville SD. Genome resequencing reveals rapid, repeated evolution in the Colorado potato beetle. Mol Biol Evol 2022; 39:6511499. [PMID: 35044459 PMCID: PMC8826761 DOI: 10.1093/molbev/msac016] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Insecticide resistance and rapid pest evolution threatens food security and the development of sustainable agricultural practices, yet the evolutionary mechanisms that allow pests to rapidly adapt to control tactics remains unclear. Here we examine how a global super-pest, the Colorado potato beetle (CPB), Leptinotarsa decemlineata, rapidly evolves resistance to insecticides. Using whole genome resequencing and transcriptomic data focused on its ancestral and pest range in North America, we assess evidence for three, non-mutually exclusive models of rapid evolution: pervasive selection on novel mutations, rapid regulatory evolution, and repeated selection on standing genetic variation. Population genomic analysis demonstrates that CPB is geographically structured, even among recently established pest populations. Pest populations exhibit similar levels of nucleotide diversity, relative to non-pest populations, and show evidence of recent expansion. Genome scans provide clear signatures of repeated adaptation across CPB populations, with especially strong evidence of selection on insecticide resistance genes in different populations. Analyses of gene expression show that constitutive upregulation of candidate insecticide resistance genes drives distinctive population patterns. CPB evolves insecticide resistance repeatedly across agricultural regions, leveraging similar genetic pathways but different genes, demonstrating a polygenic trait architecture for insecticide resistance that can evolve from standing genetic variation. Despite expectations, we do not find support for strong selection on novel mutations, or rapid evolution from selection on regulatory genes. These results suggest that integrated pest management practices must mitigate the evolution of polygenic resistance phenotypes among local pest populations, in order to maintain the efficacy and sustainability of novel control techniques.
Collapse
Affiliation(s)
- Benjamin Pélissié
- Department of Entomology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Yolanda H Chen
- Department of Plant and Soil Science, University of Vermont, Burlington, VT 05405, USA
| | - Zachary P Cohen
- Department of Entomology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Michael S Crossley
- Department of Entomology, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - David J Hawthorne
- Department of Entomology, University of Maryland, College Park, MD 20742, USA
| | - Victor Izzo
- Department of Plant and Soil Science, University of Vermont, Burlington, VT 05405, USA
| | - Sean D Schoville
- Department of Entomology, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
40
|
Vecchyo DOD, Lohmueller KE, Novembre J. Haplotype-based inference of the distribution of fitness effects. Genetics 2022; 220:6501446. [PMID: 35100400 PMCID: PMC8982047 DOI: 10.1093/genetics/iyac002] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2021] [Accepted: 12/18/2021] [Indexed: 11/13/2022] Open
Abstract
Abstract
Recent genome sequencing studies with large sample sizes in humans have discovered a vast quantity of low-frequency variants, providing an important source of information to analyze how selection is acting on human genetic variation. In order to estimate the strength of natural selection acting on low-frequency variants, we have developed a likelihood-based method that uses the lengths of pairwise identity-by-state between haplotypes carrying low-frequency variants. We show that in some non-equilibrium populations (such as those that have had recent population expansions) it is possible to distinguish between positive or negative selection acting on a set of variants. With our new framework, one can infer a fixed selection intensity acting on a set of variants at a particular frequency, or a distribution of selection coefficients for standing variants and new mutations. We show an application of our method to the UK10K phased haplotype dataset of individuals.
Collapse
Affiliation(s)
- Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de México, Juriquilla, Querétaro, 76230, México
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
| | - Kirk E Lohmueller
- Interdepartmental Program in Bioinformatics, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
- Department of Ecology and Evolutionary Biology, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, California, 90095, United States of America
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, Illinois, 60637, United States of America
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois, 60637, United States of America
| |
Collapse
|
41
|
Yeaman S. Evolution of polygenic traits under global vs local adaptation. Genetics 2022; 220:iyab134. [PMID: 35134196 PMCID: PMC8733419 DOI: 10.1093/genetics/iyab134] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 08/05/2021] [Indexed: 12/14/2022] Open
Abstract
Observations about the number, frequency, effect size, and genomic distribution of alleles associated with complex traits must be interpreted in light of evolutionary process. These characteristics, which constitute a trait's genetic architecture, can dramatically affect evolutionary outcomes in applications from agriculture to medicine, and can provide a window into how evolution works. Here, I review theoretical predictions about the evolution of genetic architecture under spatially homogeneous, global adaptation as compared with spatially heterogeneous, local adaptation. Due to the tension between divergent selection and migration, local adaptation can favor "concentrated" genetic architectures that are enriched for alleles of larger effect, clustered in a smaller number of genomic regions, relative to expectations under global adaptation. However, the evolution of such architectures may be limited by many factors, including the genotypic redundancy of the trait, mutation rate, and temporal variability of environment. I review the circumstances in which predictions differ for global vs local adaptation and discuss where progress can be made in testing hypotheses using data from natural populations and lab experiments. As the field of comparative population genomics expands in scope, differences in architecture among traits and species will provide insights into how evolution works, and such differences must be interpreted in light of which kind of selection has been operating.
Collapse
Affiliation(s)
- Sam Yeaman
- Department of Biological Sciences, University of Calgary, Calgary, AB T2N 1N4, Canada
| |
Collapse
|
42
|
Soni V, Eyre-Walker A. OUP accepted manuscript. Genome Biol Evol 2022; 14:6528851. [PMID: 35166775 PMCID: PMC8882387 DOI: 10.1093/gbe/evac028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/09/2022] [Indexed: 12/05/2022] Open
Abstract
The rate of amino acid substitution has been shown to be correlated to a number of factors including the rate of recombination, the age of the gene, the length of the protein, mean expression level, and gene function. However, the extent to which these correlations are due to adaptive and nonadaptive evolution has not been studied in detail, at least not in hominids. We find that the rate of adaptive evolution is significantly positively correlated to the rate of recombination, protein length and gene expression level, and negatively correlated to gene age. These correlations remain significant when each factor is controlled for in turn, except when controlling for expression in an analysis of protein length; and they also generally remain significant when biased gene conversion is taken into account. However, the positive correlations could be an artifact of population size contraction. We also find that the rate of nonadaptive evolution is negatively correlated to each factor, and all these correlations survive controlling for each other and biased gene conversion. Finally, we examine the effect of gene function on rates of adaptive and nonadaptive evolution; we confirm that virus-interacting proteins (VIPs) have higher rates of adaptive and lower rates of nonadaptive evolution, but we also demonstrate that there is significant variation in the rate of adaptive and nonadaptive evolution between GO categories when removing VIPs. We estimate that the VIP/non-VIP axis explains about 5–8 fold more of the variance in evolutionary rate than GO categories.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Corresponding author: E-mail:
| |
Collapse
|
43
|
Abstract
It is known that methods to estimate the rate of adaptive evolution, which are based on the McDonald–Kreitman test, can be biased by changes in effective population size. Here, we demonstrate theoretically that changes in population size can also generate an artifactual correlation between the rate of adaptive evolution and any factor that is correlated to the strength of selection acting against deleterious mutations. In this context, we have investigated whether several site-level factors influence the rate of adaptive evolution in the divergence of humans and chimpanzees, two species that have been inferred to have undergone population size contraction since they diverged. We find that the rate of adaptive evolution, relative to the rate of mutation, is higher for more exposed amino acids, lower for amino acid pairs that are more dissimilar in terms of their polarity, volume, and lower for amino acid pairs that are subject to stronger purifying selection, as measured by the ratio of the numbers of nonsynonymous to synonymous polymorphisms (pN/pS). All of these correlations are opposite to the artifactual correlations expected under contracting population size. We therefore conclude that these correlations are genuine.
Collapse
Affiliation(s)
- Vivak Soni
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
| | - Ana Filipa Moutinho
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, Plon, Germany
| | - Adam Eyre-Walker
- School of Life Sciences, University of Sussex, Brighton, United Kingdom
- Corresponding author: E-mail:
| |
Collapse
|
44
|
Di C, Murga Moreno J, Salazar-Tortosa DF, Lauterbur ME, Enard D. Decreased recent adaptation at human mendelian disease genes as a possible consequence of interference between advantageous and deleterious variants. eLife 2021; 10:69026. [PMID: 34636724 PMCID: PMC8526059 DOI: 10.7554/elife.69026] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2021] [Accepted: 10/02/2021] [Indexed: 11/27/2022] Open
Abstract
Advances in genome sequencing have improved our understanding of the genetic basis of human diseases, and thousands of human genes have been associated with different diseases. Recent genomic adaptation at disease genes has not been well characterized. Here, we compare the rate of strong recent adaptation in the form of selective sweeps between mendelian, non-infectious disease genes and non-disease genes across distinct human populations from the 1000 Genomes Project. We find that mendelian disease genes have experienced far less selective sweeps compared to non-disease genes especially in Africa. Investigating further the possible causes of the sweep deficit at disease genes, we find that this deficit is very strong at disease genes with both low recombination rates and with high numbers of associated disease variants, but is almost non-existent at disease genes with higher recombination rates or lower numbers of associated disease variants. Because segregating recessive deleterious variants have the ability to interfere with adaptive ones, these observations strongly suggest that adaptation has been slowed down by the presence of interfering recessive deleterious variants at disease genes. These results suggest that disease genes suffer from a transient inability to adapt as fast as the rest of the genome.
Collapse
Affiliation(s)
- Chenlu Di
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| | - Jesus Murga Moreno
- Institut de Biotecnologia i de Biomedicina and Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, Barcelona, Spain
| | | | - M Elise Lauterbur
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| | - David Enard
- University of Arizona Department of Ecology and Evolutionary Biology, Tucson, United States
| |
Collapse
|
45
|
Huang YF. Dissecting genomic determinants of positive selection with an evolution-guided regression model. Mol Biol Evol 2021; 39:6379733. [PMID: 34597406 PMCID: PMC8763110 DOI: 10.1093/molbev/msab291] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
In evolutionary genomics, it is fundamentally important to understand how characteristics of genomic sequences, such as gene expression level, determine the rate of adaptive evolution. While numerous statistical methods, such as the McDonald–Kreitman (MK) test, are available to examine the association between genomic features and the rate of adaptation, we currently lack a statistical approach to disentangle the independent effect of a genomic feature from the effects of other correlated genomic features. To address this problem, I present a novel statistical model, the MK regression, which augments the MK test with a generalized linear model. Analogous to the classical multiple regression model, the MK regression can analyze multiple genomic features simultaneously to infer the independent effect of a genomic feature, holding constant all other genomic features. Using the MK regression, I identify numerous genomic features driving positive selection in chimpanzees. These features include well-known ones, such as local mutation rate, residue exposure level, tissue specificity, and immune genes, as well as new features not previously reported, such as gene expression level and metabolic genes. In particular, I show that highly expressed genes may have a higher adaptation rate than their weakly expressed counterparts, even though a higher expression level may impose stronger negative selection. Also, I show that metabolic genes may have a higher adaptation rate than their nonmetabolic counterparts, possibly due to recent changes in diet in primate evolution. Overall, the MK regression is a powerful approach to elucidate the genomic basis of adaptation.
Collapse
Affiliation(s)
- Yi-Fei Huang
- Department of Biology, Pennsylvania State University, University Park, PA, 16802, USA.,Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA, 16802, USA
| |
Collapse
|
46
|
Catania F, Ujvari B, Roche B, Capp JP, Thomas F. Bridging Tumorigenesis and Therapy Resistance With a Non-Darwinian and Non-Lamarckian Mechanism of Adaptive Evolution. Front Oncol 2021; 11:732081. [PMID: 34568068 PMCID: PMC8462274 DOI: 10.3389/fonc.2021.732081] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2021] [Accepted: 08/25/2021] [Indexed: 12/13/2022] Open
Abstract
Although neo-Darwinian (and less often Lamarckian) dynamics are regularly invoked to interpret cancer's multifarious molecular profiles, they shine little light on how tumorigenesis unfolds and often fail to fully capture the frequency and breadth of resistance mechanisms. This uncertainty frames one of the most problematic gaps between science and practice in modern times. Here, we offer a theory of adaptive cancer evolution, which builds on a molecular mechanism that lies outside neo-Darwinian and Lamarckian schemes. This mechanism coherently integrates non-genetic and genetic changes, ecological and evolutionary time scales, and shifts the spotlight away from positive selection towards purifying selection, genetic drift, and the creative-disruptive power of environmental change. The surprisingly simple use-it or lose-it rationale of the proposed theory can help predict molecular dynamics during tumorigenesis. It also provides simple rules of thumb that should help improve therapeutic approaches in cancer.
Collapse
Affiliation(s)
- Francesco Catania
- Institute for Evolution and Biodiversity, University of Münster, Münster, Germany
| | - Beata Ujvari
- Centre for Integrative Ecology, School of Life and Environmental Sciences, Deakin University, Deakin, VIC, Australia
| | - Benjamin Roche
- CREEC/CANECEV, MIVEGEC (CREES), Centre de Recherches Ecologiques et Evolutives sur le Cancer, University of Montpellier, CNRS, IRD, Montpellier, France
| | - Jean-Pascal Capp
- Toulouse Biotechnology Institute, University of Toulouse, INSA, CNRS, INRAE, Toulouse, France
| | - Frédéric Thomas
- CREEC/CANECEV, MIVEGEC (CREES), Centre de Recherches Ecologiques et Evolutives sur le Cancer, University of Montpellier, CNRS, IRD, Montpellier, France
| |
Collapse
|
47
|
Jackson B, Charlesworth B. Evidence for a force favoring GC over AT at short intronic sites in Drosophila simulans and Drosophila melanogaster. G3 GENES|GENOMES|GENETICS 2021; 11:6321237. [PMID: 34544137 PMCID: PMC8496279 DOI: 10.1093/g3journal/jkab240] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Accepted: 07/06/2021] [Indexed: 11/13/2022]
Abstract
Population genetics studies often make use of a class of nucleotide site free from selective pressures, in order to make inferences about population size changes or natural selection at other sites. If such neutral sites can be identified, they offer the opportunity to avoid any confounding effects of selection. Here, we investigate evolution at putatively neutrally evolving short intronic sites in natural populations of Drosophila melanogaster and Drosophila simulans, in order to understand the properties of spontaneous mutations and the extent of GC-biased gene conversion in these species. Use of data on the genetics of natural populations is advantageous because it integrates information from large numbers of individuals over long timescales. In agreement with direct evidence from observations of spontaneous mutations in Drosophila, we find a bias in the spectrum of mutations toward AT basepairs. In addition, we find that this bias is stronger in the D. melanogaster lineage than in the D. simulans lineage. The evidence for GC-biased gene conversion in Drosophila has been equivocal. Here, we provide evidence for a weak force favoring GC in both species, which is correlated with the GC content of introns and is stronger in D. simulans than in D. melanogaster.
Collapse
Affiliation(s)
- Ben Jackson
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| | - Brian Charlesworth
- School of Biological Sciences, Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, UK
| |
Collapse
|
48
|
Buffalo V. Quantifying the relationship between genetic diversity and population size suggests natural selection cannot explain Lewontin's Paradox. eLife 2021; 10:e67509. [PMID: 34409937 PMCID: PMC8486380 DOI: 10.7554/elife.67509] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2021] [Accepted: 08/16/2021] [Indexed: 12/21/2022] Open
Abstract
Neutral theory predicts that genetic diversity increases with population size, yet observed levels of diversity across metazoans vary only two orders of magnitude while population sizes vary over several. This unexpectedly narrow range of diversity is known as Lewontin's Paradox of Variation (1974). While some have suggested selection constrains diversity, tests of this hypothesis seem to fall short. Here, I revisit Lewontin's Paradox to assess whether current models of linked selection are capable of reducing diversity to this extent. To quantify the discrepancy between pairwise diversity and census population sizes across species, I combine previously-published estimates of pairwise diversity from 172 metazoan taxa with newly derived estimates of census sizes. Using phylogenetic comparative methods, I show this relationship is significant accounting for phylogeny, but with high phylogenetic signal and evidence that some lineages experience shifts in the evolutionary rate of diversity deep in the past. Additionally, I find a negative relationship between recombination map length and census size, suggesting abundant species have less recombination and experience greater reductions in diversity due to linked selection. However, I show that even assuming strong and abundant selection, models of linked selection are unlikely to explain the observed relationship between diversity and census sizes across species.
Collapse
Affiliation(s)
- Vince Buffalo
- Institute for Ecology and Evolution, University of OregonEugeneUnited States
| |
Collapse
|
49
|
Cavassim MIA, Andersen SU, Bataillon T, Schierup MH. Recombination facilitates adaptive evolution in rhizobial soil bacteria. Mol Biol Evol 2021; 38:5480-5490. [PMID: 34410427 PMCID: PMC8662638 DOI: 10.1093/molbev/msab247] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
Homologous recombination is expected to increase natural selection efficacy by decoupling the fate of beneficial and deleterious mutations and by readily creating new combinations of beneficial alleles. Here, we investigate how the proportion of amino acid substitutions fixed by adaptive evolution (α) depends on the recombination rate in bacteria. We analyze 3,086 core protein-coding sequences from 196 genomes belonging to five closely related species of the genus Rhizobium. These genes are found in all species and do not display any signs of introgression between species. We estimate α using the site frequency spectrum (SFS) and divergence data for all pairs of species. We evaluate the impact of recombination within each species by dividing genes into three equally sized recombination classes based on their average level of intragenic linkage disequilibrium. We find that α varies from 0.07 to 0.39 across species and is positively correlated with the level of recombination. This is both due to a higher estimated rate of adaptive evolution and a lower estimated rate of nonadaptive evolution, suggesting that recombination both increases the fixation probability of advantageous variants and decreases the probability of fixation of deleterious variants. Our results demonstrate that homologous recombination facilitates adaptive evolution measured by α in the core genome of prokaryote species in agreement with studies in eukaryotes.
Collapse
Affiliation(s)
- Maria Izabel A Cavassim
- Bioinformatics Research Centre, Aarhus University, Aarhus, 8000, Denmark.,Department of Molecular Biology and Genetics, Aarhus University, Aarhus, 8000, Denmark
| | - Stig U Andersen
- Department of Molecular Biology and Genetics, Aarhus University, Aarhus, 8000, Denmark
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, Aarhus, 8000, Denmark
| | | |
Collapse
|
50
|
Faria R, Johannesson K, Stankowski S. Speciation in marine environments: Diving under the surface. J Evol Biol 2021; 34:4-15. [PMID: 33460491 DOI: 10.1111/jeb.13756] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2020] [Revised: 12/31/2020] [Accepted: 01/03/2021] [Indexed: 12/28/2022]
Abstract
Marine environments are inhabited by a broad representation of the tree of life, yet our understanding of speciation in marine ecosystems is extremely limited compared with terrestrial and freshwater environments. Developing a more comprehensive picture of speciation in marine environments requires that we 'dive under the surface' by studying a wider range of taxa and ecosystems is necessary for a more comprehensive picture of speciation. Although studying marine evolutionary processes is often challenging, recent technological advances in different fields, from maritime engineering to genomics, are making it increasingly possible to study speciation of marine life forms across diverse ecosystems and taxa. Motivated by recent research in the field, including the 14 contributions in this issue, we highlight and discuss six axes of research that we think will deepen our understanding of speciation in the marine realm: (a) study a broader range of marine environments and organisms; (b) identify the reproductive barriers driving speciation between marine taxa; (c) understand the role of different genomic architectures underlying reproductive isolation; (d) infer the evolutionary history of divergence using model-based approaches; (e) study patterns of hybridization and introgression between marine taxa; and (f) implement highly interdisciplinary, collaborative research programmes. In outlining these goals, we hope to inspire researchers to continue filling this critical knowledge gap surrounding the origins of marine biodiversity.
Collapse
Affiliation(s)
- Rui Faria
- CIBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, InBIO, Laboratório Associado, Universidade do Porto, Vairão, Portugal.,CIIMAR, Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Portugal.,Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom
| | - Kerstin Johannesson
- Department of Marine Sciences-Tjärnö, University of Gothenburg, Strömstad, Sweden
| | - Sean Stankowski
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield, United Kingdom.,IST Austria, Klosterneuburg, Austria
| |
Collapse
|