26
Horvath R, Josephs EB, Pesquet E, Stinchcombe JR, Wright SI, Scofield D, Slotte T. Selection on Accessible Chromatin Regions in Capsella grandiflora. Mol Biol Evol 2021; 38:5563-5575. [PMID: 34498072 PMCID: PMC8662636 DOI: 10.1093/molbev/msab270] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/25/2022] Open
Abstract
Accurate estimates of genome-wide rates and fitness effects of new mutations are essential for an improved understanding of molecular evolutionary processes. Although eukaryotic genomes generally contain a large noncoding fraction, functional noncoding regions and fitness effects of mutations in such regions are still incompletely characterized. A promising approach to characterize functional noncoding regions relies on identifying accessible chromatin regions (ACRs) tightly associated with regulatory DNA. Here, we applied this approach to identify and estimate selection on ACRs in Capsella grandiflora, a crucifer species ideal for population genomic quantification of selection due to its favorable population demography. We describe a population-wide ACR distribution based on ATAC-seq data for leaf samples of 16 individuals from a natural population. We use population genomic methods to estimate fitness effects and proportions of positively selected fixations (α) in ACRs and find that intergenic ACRs harbor a considerable fraction of weakly deleterious new mutations, as well as a significantly higher proportion of strongly deleterious mutations than comparable inaccessible intergenic regions. ACRs are enriched for expression quantitative trait loci (eQTL) and depleted of transposable element insertions, as expected if intergenic ACRs are under selection because they harbor regulatory regions. By integrating empirical identification of intergenic ACRs with analyses of eQTL and population genomic analyses of selection, we demonstrate that intergenic regulatory regions are an important source of nearly neutral mutations. These results improve our understanding of selection on noncoding regions and the role of nearly neutral mutations for evolutionary processes in outcrossing Brassicaceae species.
27
Desbiez-Piat A, Le Rouzic A, Tenaillon MI, Dillmann C. Interplay between extreme drift and selection intensities favors the fixation of beneficial mutations in selfing maize populations. Genetics 2021; 219:6339583. [PMID: 34849881 DOI: 10.1093/genetics/iyab123] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 05/07/2021] [Accepted: 07/21/2021] [Indexed: 11/13/2022] Open
Abstract
Population and quantitative genetic models provide useful approximations to predict long-term selection responses sustaining phenotypic shifts, and underlying multilocus adaptive dynamics. Valid across a broad range of parameters, their use for understanding the adaptive dynamics of small selfing populations undergoing strong selection intensity (hereafter High Drift-High Selection regime, HDHS) remains to be explored. Saclay Divergent Selection Experiments (DSEs) on maize flowering time provide an interesting example of populations evolving under HDHS, with significant selection responses over 20 generations in two directions. We combined experimental data from Saclay DSEs, forward individual-based simulations, and theoretical predictions to dissect the evolutionary mechanisms at play in the observed selection responses. We asked two main questions: How do mutations arise, spread, and reach fixation in populations evolving under HDHS? How does the interplay between drift and selection influence observed phenotypic shifts? We showed that the long-lasting response to selection in small populations is due to the rapid fixation of mutations occurring during the generations of selection. Among fixed mutations, we also found a clear signal of enrichment for beneficial mutations revealing a limited cost of selection. Both environmental stochasticity and variation in selection coefficients likely contributed to exacerbating mutational effects, thereby facilitating selection grasp and fixation of small-effect mutations. Together, our results highlight that despite a small number of polymorphic loci expected under HDHS, adaptive variation is continuously fueled by a vast mutational target. We discuss our results in the context of breeding and long-term survival of small selfing populations.
28
Deatherage DE, Barrick JE. High-throughput characterization of mutations in genes that drive clonal evolution using multiplex adaptome capture sequencing. Cell Syst 2021; 12:1187-1200.e4. [PMID: 34536379 DOI: 10.1016/j.cels.2021.08.011] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 01/04/2021] [Revised: 07/14/2021] [Accepted: 08/20/2021] [Indexed: 11/17/2022]
Abstract
Understanding how cells are likely to evolve can guide medical interventions and bioengineering efforts that must contend with unwanted mutations. The adaptome of a cell, the neighborhood of genetic changes that are most likely to drive adaptation in a given environment, can be mapped by tracking rare beneficial variants during the early stages of clonal evolution. We used multiplex adaptome capture sequencing (mAdCap-seq), a procedure that combines unique molecular identifiers and hybridization-based enrichment, to characterize mutations in eight Escherichia coli genes known to be under selection in a laboratory environment. We tracked 301 mutations at frequencies as low as 0.01% and inferred the fitness effects of 240 of these mutations. There were distinct molecular signatures of selection on protein structure and function for the three genes with the most beneficial mutations. Our results demonstrate how mAdCap-seq can be used to deeply profile a targeted portion of a cell's adaptome.
29
Bailey SF, Alonso Morales LA, Kassen R. Effects of synonymous mutations beyond codon bias: The evidence for adaptive synonymous substitutions from microbial evolution experiments. Genome Biol Evol 2021; 13:6300525. [PMID: 34132772 PMCID: PMC8410137 DOI: 10.1093/gbe/evab141] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Accepted: 06/10/2021] [Indexed: 12/22/2022] Open
Abstract
Synonymous mutations are often assumed to be neutral with respect to fitness because they do not alter the encoded amino acid and so cannot be 'seen' by natural selection. Yet a growing body of evidence suggests that synonymous mutations can have fitness effects that drive adaptive evolution through their impacts on gene expression and protein folding. Here, we review what microbial experiments have taught us about the contribution of synonymous mutations to adaptation. A survey of site-directed mutagenesis experiments reveals the distributions of fitness effects for nonsynonymous and synonymous mutations are more similar, especially for beneficial mutations, than expected if all synonymous mutations were neutral, suggesting they should drive adaptive evolution more often than is typically observed. A review of experimental evolution studies where synonymous mutations have contributed to adaptation shows they can impact fitness through a range of mechanisms including the creation of illicit RNA polymerase binding sites impacting transcription and changes to mRNA folding stability that modulate translation. We suggest that clonal interference in evolving microbial populations may be the reason synonymous mutations play a smaller role in adaptive evolution than expected based on their observed fitness effects. We finish by discussing the impacts of falsely assuming synonymous mutations are neutral and discuss directions for future work exploring the role of synonymous mutations in adaptive evolution.
30
Berdan EL, Blanckaert A, Slotte T, Suh A, Westram AM, Fragata I. Unboxing mutations: Connecting mutation types with evolutionary consequences. Mol Ecol 2021; 30:2710-2723. [PMID: 33955064 DOI: 10.1111/mec.15936] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 01/14/2021] [Revised: 03/30/2021] [Accepted: 04/20/2021] [Indexed: 01/09/2023]
Abstract
A key step in understanding the genetic basis of different evolutionary outcomes (e.g., adaptation) is to determine the roles played by different mutation types (e.g., SNPs, translocations and inversions). To do this we must simultaneously consider different mutation types in an evolutionary framework. Here, we propose a research framework that directly utilizes the most important characteristics of mutations, their population genetic effects, to determine their relative evolutionary significance in a given scenario. We review known population genetic effects of different mutation types and show how these may be connected to different evolutionary outcomes. We provide examples of how to implement this framework and pinpoint areas where more data, theory and synthesis are needed. Linking experimental and theoretical approaches to examine different mutation types simultaneously is a critical step towards understanding their evolutionary significance.
31
Verta JP, Barton HJ, Pritchard V, Primmer CR. Genetic Drift Dominates Genome-Wide Regulatory Evolution Following an Ancient Whole-Genome Duplication in Atlantic Salmon. Genome Biol Evol 2021; 13:evab059. [PMID: 33749748 PMCID: PMC8140206 DOI: 10.1093/gbe/evab059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 03/17/2021] [Indexed: 11/23/2022] Open
Abstract
Whole-genome duplications (WGD) have been considered springboards that potentiate lineage diversification through increasing functional redundancy. Divergence in gene regulatory elements is a central mechanism for evolutionary diversification, yet the patterns and processes governing regulatory divergence following events that lead to massive functional redundancy, such as WGD, remain largely unknown. We studied the patterns of divergence and strength of natural selection on regulatory elements in the Atlantic salmon (Salmo salar) genome, which has undergone WGD 100-80 Ma. Using ChIPmentation, we first show that H3K27ac, a histone modification typical of enhancers and promoters, is associated with genic regions, tissue-specific transcription factor binding motifs, and with gene transcription levels in immature testes. Divergence in transcription between duplicated genes from WGD (ohnologs) correlated with difference in the number of proximal regulatory elements, but not with promoter elements, suggesting that functional divergence between ohnologs after WGD is mainly driven by enhancers. By comparing H3K27ac regions between duplicated genome blocks, we further show that a longer polyploid state post-WGD has constrained regulatory divergence. Patterns of genetic diversity across natural populations inferred from resequencing indicate that recent evolutionary pressures on H3K27ac regions are dominated by largely neutral evolution. In sum, our results suggest that post-WGD functional redundancy in regulatory elements continues to have an impact on the evolution of the salmon genome, promoting largely neutral evolution of regulatory elements despite their association with transcription levels. These results highlight a case where genome-wide regulatory evolution following an ancient WGD is dominated by genetic drift.
32
Johri P, Riall K, Becher H, Excoffier L, Charlesworth B, Jensen JD. The Impact of Purifying and Background Selection on the Inference of Population History: Problems and Prospects. Mol Biol Evol 2021; 38:2986-3003. [PMID: 33591322 PMCID: PMC8233493 DOI: 10.1093/molbev/msab050] [Citation(s) in RCA: 42] [Impact Index Per Article: 14.0] [Indexed: 12/17/2022] Open
Abstract
Current procedures for inferring population history generally assume complete neutrality; that is, they neglect both direct selection and the effects of selection on linked sites. We here examine how the presence of direct purifying selection and background selection may bias demographic inference by evaluating two commonly used methods (MSMC and fastsimcoal2), specifically studying how the underlying shape of the distribution of fitness effects and the fraction of directly selected sites interact with demographic parameter estimation. The results show that, even after masking functional genomic regions, background selection may cause the mis-inference of population growth under models of both constant population size and decline. This effect is amplified as the strength of purifying selection and the density of directly selected sites increase, as indicated by the distortion of the site frequency spectrum and levels of nucleotide diversity at linked neutral sites. We also show how simulated changes in background selection effects caused by population size changes can be predicted analytically. We propose a potential method for correcting for the mis-inference of population growth caused by selection. By treating the distribution of fitness effects as a nuisance parameter and averaging across all potential realizations, we demonstrate that even directly selected sites can be used to infer demographic histories with reasonable accuracy.
33
Jaramillo-Correa JP, Bagnoli F, Grivet D, Fady B, Aravanopoulos FA, Vendramin GG, González-Martínez SC. Evolutionary rate and genetic load in an emblematic Mediterranean tree following an ancient and prolonged population collapse. Mol Ecol 2020; 29:4797-4811. [PMID: 33063352 DOI: 10.1111/mec.15684] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Received: 08/31/2019] [Revised: 09/25/2020] [Accepted: 09/28/2020] [Indexed: 12/18/2022]
Abstract
Severe bottlenecks significantly diminish the amount of genetic diversity and the speed at which it accumulates (i.e., evolutionary rate). They further compromise the efficiency of natural selection to eliminate deleterious variants, which may reach fixation in the surviving populations. Consequently, expanding and adapting to new environments may pose a significant challenge when strong bottlenecks result in genetic pauperization. Herein, we surveyed the patterns of nucleotide diversity, molecular adaptation and genetic load across 177 gene-loci in a circum-Mediterranean conifer (Pinus pinea L.) that represents one of the most extreme cases of genetic pauperization in widespread outbreeding taxa. We found very little genetic variation in both hypervariable nuclear microsatellites (SSRs) and gene-loci, which translated into genetic diversity estimates one order of magnitude lower than those previously reported for pines. Such values were consistent with a strong population decline that began ~1 Ma. Comparisons with the related and parapatric maritime pine (Pinus pinaster Ait.) revealed reduced rates of adaptive evolution (α and ωa) and a significant accumulation of genetic load. It is unlikely that these are the result of differences in mutation rate or linkage disequilibrium between the two species; instead they are the presumable outcome of contrasting demographic histories affecting both the speed at which these taxa accumulate genetic diversity, and the global efficacy of selection. Future studies, and programs for conservation and management, should thus start testing for the effects of genetic load on fitness, and integrating such effects into predictive models.
34
Kutschera VE, Poelstra JW, Botero-Castro F, Dussex N, Gemmell NJ, Hunt GR, Ritchie MG, Rutz C, Wiberg RAW, Wolf JBW. Purifying Selection in Corvids Is Less Efficient on Islands. Mol Biol Evol 2020; 37:469-474. [PMID: 31633794 PMCID: PMC6993847 DOI: 10.1093/molbev/msz233] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Indexed: 12/17/2022] Open
Abstract
Theory predicts that deleterious mutations accumulate more readily in small populations. As a consequence, mutation load is expected to be elevated in species where life-history strategies and geographic or historical contingencies reduce the number of reproducing individuals. Yet, few studies have empirically tested this prediction using genome-wide data in a comparative framework. We collected whole-genome sequencing data for 147 individuals across seven crow species (Corvus spp.). For each species, we estimated the distribution of fitness effects of deleterious mutations and compared it with proxies of the effective population size Ne. Island species with comparatively smaller geographic range sizes had a significantly increased mutation load. These results support the view that small populations have an elevated risk of mutational meltdown, which may contribute to the higher extinction rates observed in island species.
35
Galtier N, Rousselle M. How Much Does Ne Vary Among Species? Genetics 2020; 216:559-572. [PMID: 32839240 PMCID: PMC7536855 DOI: 10.1534/genetics.120.303622] [Citation(s) in RCA: 24] [Impact Index Per Article: 6.0] [Received: 05/26/2020] [Accepted: 08/20/2020] [Indexed: 11/18/2022] Open
Abstract
Genetic drift is an important evolutionary force of strength inversely proportional to Ne, the effective population size. The impact of drift on genome diversity and evolution is known to vary among species, but quantifying this effect is a difficult task. Here we assess the magnitude of variation in drift power among species of animals via its effect on the mutation load, which also implies inferring the distribution of fitness effects of deleterious mutations. To this aim, we analyze the nonsynonymous (amino-acid changing) and synonymous (amino-acid conservative) allele frequency spectra in a large sample of metazoan species, with a focus on the primates vs. fruit flies contrast. We show that a Gamma model of the distribution of fitness effects is not suitable due to strong differences in estimated shape parameters among taxa, while adding a class of lethal mutations essentially solves the problem. Using the Gamma + lethal model and assuming that the mean deleterious effect of nonsynonymous mutations is shared among species, we estimate that the power of drift varies by a factor of at least 500 between large-Ne and small-Ne species of animals, i.e., an order of magnitude more than the among-species variation in genetic diversity. Our results are relevant to Lewontin's paradox while further questioning the meaning of the Ne parameter in population genomics.
36
Booker TR. Inferring Parameters of the Distribution of Fitness Effects of New Mutations When Beneficial Mutations Are Strongly Advantageous and Rare. G3 (Bethesda, Md.) 2020; 10:2317-2326. [PMID: 32371451 PMCID: PMC7341129 DOI: 10.1534/g3.120.401052] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Received: 01/07/2020] [Accepted: 05/01/2020] [Indexed: 12/13/2022]
Abstract
Characterizing the distribution of fitness effects (DFE) for new mutations is central in evolutionary genetics. Analysis of molecular data under the McDonald-Kreitman test has suggested that adaptive substitutions make a substantial contribution to between-species divergence. Methods have been proposed to estimate the parameters of the distribution of fitness effects for positively selected mutations from the unfolded site frequency spectrum (uSFS). Such methods perform well when beneficial mutations are mildly selected and frequent. However, when beneficial mutations are strongly selected and rare, they may make little contribution to standing variation and will thus be difficult to detect from the uSFS. In this study, I analyze uSFS data from simulated populations subject to advantageous mutations with effects on fitness ranging from mildly to strongly beneficial. As expected, frequent, mildly beneficial mutations contribute substantially to standing genetic variation and parameters are accurately recovered from the uSFS. However, when advantageous mutations are strongly selected and rare, there are very few segregating in populations at any one time. Fitting the uSFS in such cases leads to underestimates of the strength of positive selection and may lead researchers to false conclusions regarding the relative contribution adaptive mutations make to molecular evolution. Fortunately, the parameters for the distribution of fitness effects for harmful mutations are estimated with high accuracy and precision. The results from this study suggest that the parameters of positively selected mutations obtained by analysis of the uSFS should be treated with caution and that variability at linked sites should be used in conjunction with standing variability to estimate parameters of the distribution of fitness effects in the future.
37
Johri P, Charlesworth B, Jensen JD. Toward an Evolutionarily Appropriate Null Model: Jointly Inferring Demography and Purifying Selection. Genetics 2020; 215:173-192. [PMID: 32152045 PMCID: PMC7198275 DOI: 10.1534/genetics.119.303002] [Citation(s) in RCA: 86] [Impact Index Per Article: 21.5] [Received: 12/18/2019] [Accepted: 03/05/2020] [Indexed: 01/27/2023] Open
Abstract
The question of the relative evolutionary roles of adaptive and nonadaptive processes has been a central debate in population genetics for nearly a century. While advances have been made in the theoretical development of the underlying models, and statistical methods for estimating their parameters from large-scale genomic data, a framework for an appropriate null model remains elusive. A model incorporating evolutionary processes known to be in constant operation, genetic drift (as modulated by the demographic history of the population) and purifying selection, is lacking. Without such a null model, the role of adaptive processes in shaping within- and between-population variation may not be accurately assessed. Here, we investigate how population size changes and the strength of purifying selection affect patterns of variation at "neutral" sites near functional genomic components. We propose a novel statistical framework for jointly inferring the contribution of the relevant selective and demographic parameters. By means of extensive performance analyses, we quantify the utility of the approach, identify the most important statistics for parameter estimation, and compare the results with existing methods. Finally, we reanalyze genome-wide population-level data from a Zambian population of Drosophila melanogaster, and find that it has experienced a much slower rate of population growth than was inferred when the effects of purifying selection were neglected. Our approach represents an appropriate null model, against which the effects of positive selection can be assessed.
38
Chen J, Glémin S, Lascoux M. From Drift to Draft: How Much Do Beneficial Mutations Actually Contribute to Predictions of Ohta's Slightly Deleterious Model of Molecular Evolution? Genetics 2020; 214:1005-1018. [PMID: 32015019 PMCID: PMC7153929 DOI: 10.1534/genetics.119.302869] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Received: 11/03/2019] [Accepted: 01/26/2020] [Indexed: 12/18/2022] Open
Abstract
Since its inception in 1973, the slightly deleterious model of molecular evolution, also known as the nearly neutral theory of molecular evolution, has remained a central model to explain the main patterns of DNA polymorphism in natural populations. This is not to say that the quantitative fit to data is perfect. A recent study used polymorphism data from Drosophila melanogaster to test whether, as predicted by the nearly neutral theory, the proportion of effectively neutral mutations depends on the effective population size (Ne). It showed that a nearly neutral model simply scaling with Ne variation across the genome could not alone explain the data, but that consideration of linked positive selection improves the fit between observations and predictions. In the present article, we extended the work in two main directions. First, we confirmed the observed pattern on a set of 59 species, including high-quality genomic data from 11 animal and plant species with different mating systems and effective population sizes, hence a priori different levels of linked selection. Second, for the 11 species with high-quality genomic data we also estimated the full distribution of fitness effects (DFE) of mutations, and not solely the DFE of deleterious mutations. Both Ne and beneficial mutations contributed to the relationship between the proportion of effectively neutral mutations and local Ne across the genome. In conclusion, the predictions of the slightly deleterious model of molecular evolution hold well for species with small Ne, but for species with large Ne, the fit is improved by incorporating linked positive selection into the model.
39
Williams MJ, Zapata L, Werner B, Barnes CP, Sottoriva A, Graham TA. Measuring the distribution of fitness effects in somatic evolution by combining clonal dynamics with dN/dS ratios. eLife 2020; 9:e48714. [PMID: 32223898 PMCID: PMC7105384 DOI: 10.7554/elife.48714] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Received: 05/23/2019] [Accepted: 03/09/2020] [Indexed: 12/22/2022] Open
Abstract
The distribution of fitness effects (DFE) defines how new mutations spread through an evolving population. The ratio of non-synonymous to synonymous mutations (dN/dS) has become a popular method to detect selection in somatic cells. However, the link between dN/dS values and fitness coefficients in somatic evolution is missing. Here we present a quantitative model of somatic evolutionary dynamics that determines the selective coefficients of individual driver mutations from dN/dS estimates. We then measure the DFE for somatic mutant clones in ostensibly normal oesophagus and skin. We reveal a broad distribution of fitness effects, with the largest fitness increases found for TP53 and NOTCH1 mutants (proliferative bias 1-5%). This study provides the theoretical link between dN/dS values and selective coefficients in somatic evolution, and measures the DFE of mutations in human tissues.
40
Barton HJ, Zeng K. The Impact of Natural Selection on Short Insertion and Deletion Variation in the Great Tit Genome. Genome Biol Evol 2019; 11:1514-1524. [PMID: 30924871 PMCID: PMC6543879 DOI: 10.1093/gbe/evz068] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Accepted: 03/27/2019] [Indexed: 12/11/2022] Open
Abstract
Insertions and deletions (INDELs) remain understudied, despite being the most common form of genetic variation after single nucleotide polymorphisms. This stems partly from the challenge of correctly identifying the ancestral state of an INDEL and thus identifying it as an insertion or a deletion. Erroneously assigned ancestral states can skew the site frequency spectrum, leading to artificial signals of selection. Consequently, the selective pressures acting on INDELs are, at present, poorly resolved. To tackle this issue, we have recently published a maximum likelihood approach to estimate the mutation rate and the distribution of fitness effects for INDELs. Our approach estimates and controls for the rate of ancestral state misidentification, overcoming issues plaguing previous INDEL studies. Here, we apply the method to INDEL polymorphism data from ten high coverage (∼44×) European great tit (Parus major) genomes. We demonstrate that coding INDELs are under strong purifying selection with a small proportion making it into the population (∼4%). However, among fixed coding INDELs, 71% of insertions and 86% of deletions are fixed by positive selection. In noncoding regions, we estimate that ∼80% of insertions and ∼52% of deletions are effectively neutral; the remainder show signatures of purifying selection. Additionally, we see evidence of linked selection reducing INDEL diversity below background levels, both in proximity to exons and in areas of low recombination.
41
Kemble H, Nghe P, Tenaillon O. Recent insights into the genotype-phenotype relationship from massively parallel genetic assays. Evol Appl 2019; 12:1721-1742. [PMID: 31548853 PMCID: PMC6752143 DOI: 10.1111/eva.12846] [Citation(s) in RCA: 32] [Impact Index Per Article: 6.4] [Received: 03/27/2019] [Revised: 06/21/2019] [Accepted: 07/02/2019] [Indexed: 12/20/2022] Open
Abstract
With the molecular revolution in biology, a mechanistic understanding of the genotype-phenotype relationship became possible. Recently, advances in DNA synthesis and sequencing have enabled the development of deep mutational scanning assays, capable of scoring comprehensive libraries of genotypes for fitness and a variety of phenotypes in massively parallel fashion. The resulting empirical genotype-fitness maps pave the way to predictive models, potentially accelerating our ability to anticipate the behaviour of pathogen and cancerous cell populations from sequencing data. Besides cellular fitness, phenotypes of direct application in industry (e.g. enzyme activity) and medicine (e.g. antibody binding) can be quantified and even selected directly by these assays. This review discusses the technological basis of and recent developments in massively parallel genetics, along with the trends it is uncovering in the genotype-phenotype relationship (distribution of mutation effects, epistasis), their possible mechanistic bases and future directions for advancing towards the goal of predictive genetics.
42
Mattila TM, Laenen B, Horvath R, Hämälä T, Savolainen O, Slotte T. Impact of demography on linked selection in two outcrossing Brassicaceae species. Ecol Evol 2019; 9:9532-9545. [PMID: 31534673 PMCID: PMC6745670 DOI: 10.1002/ece3.5463] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Received: 02/25/2019] [Revised: 06/28/2019] [Accepted: 07/02/2019] [Indexed: 12/13/2022] Open
Abstract
Genetic diversity is shaped by mutation, genetic drift, gene flow, recombination, and selection. The dynamics and interactions of these forces shape genetic diversity across different parts of the genome, between populations and species. Here, we have studied the effects of linked selection on nucleotide diversity in outcrossing populations of two Brassicaceae species, Arabidopsis lyrata and Capsella grandiflora, with contrasting demographic history. In agreement with previous estimates, we found evidence for a modest population size expansion thousands of generations ago, as well as efficient purifying selection in C. grandiflora. In contrast, the A. lyrata population exhibited evidence for very recent strong population size decline and weaker efficacy of purifying selection. Using multiple regression analyses with recombination rate and other genomic covariates as explanatory variables, we can explain 47% of the variance in neutral diversity in the C. grandiflora population, while in the A. lyrata population, only 11% of the variance was explained by the model. Recombination rate had a significant positive effect on neutral diversity in both species, suggesting that selection at linked sites has an effect on patterns of neutral variation. In line with this finding, we also found reduced neutral diversity in the vicinity of genes in the C. grandiflora population. However, in A. lyrata no such reduction in diversity was evident, a finding that is consistent with expectations of the impact of a recent bottleneck on patterns of neutral diversity near genes. This study thus empirically demonstrates how differences in demographic history modulate the impact of selection at linked sites in natural populations.
|
43
|
Abstract
For nearly a century adaptive landscapes have provided overviews of the evolutionary process and yet they remain metaphors. We redefine adaptive landscapes in terms of biological processes rather than descriptive phenomenology. We focus on the underlying mechanisms that generate emergent properties such as epistasis, dominance, trade-offs and adaptive peaks. We illustrate the utility of landscapes in predicting the course of adaptation and the distribution of fitness effects. We abandon aged arguments concerning landscape ruggedness in favor of empirically determining landscape architecture. In so doing, we transform the landscape metaphor into a scientific framework within which causal hypotheses can be tested.
|
44
|
Corcoran P, Gossmann TI, Barton HJ, Slate J, Zeng K. Determinants of the Efficacy of Natural Selection on Coding and Noncoding Variability in Two Passerine Species. Genome Biol Evol 2018; 9:2987-3007. [PMID: 29045655 PMCID: PMC5714183 DOI: 10.1093/gbe/evx213] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Accepted: 10/16/2017] [Indexed: 02/06/2023] Open
Abstract
Population genetic theory predicts that selection should be more effective when the effective population size (Ne) is larger, and that the efficacy of selection should correlate positively with recombination rate. Here, we analyzed the genomes of ten great tits and ten zebra finches. Nucleotide diversity at 4-fold degenerate sites indicates that zebra finches have a 2.83-fold larger Ne. We obtained clear evidence that purifying selection is more effective in zebra finches. The proportion of substitutions at 0-fold degenerate sites fixed by positive selection (α) is high in both species (great tit 48%; zebra finch 64%) and is significantly higher in zebra finches. When α was estimated on GC-conservative changes (i.e., between A and T and between G and C), the estimates reduced in both species (great tit 22%; zebra finch 53%). A theoretical model presented herein suggests that failing to control for the effects of GC-biased gene conversion (gBGC) is potentially a contributor to the overestimation of α, and that this effect cannot be alleviated by first fitting a demographic model to neutral variants. We present the first estimates in birds for α in the untranslated regions, and found evidence for substantial adaptive changes. Finally, although purifying selection is stronger in high-recombination regions, we obtained mixed evidence for α increasing with recombination rate, especially after accounting for gBGC. These results highlight that it is important to consider the potential confounding effects of gBGC when quantifying selection and that our understanding of what determines the efficacy of selection is incomplete.
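The step from 4-fold diversity to a relative Ne can be illustrated with a minimal sketch. It assumes the standard neutral expectation for diploids, π = 4·Ne·μ, and an identical per-site mutation rate in both species. The function name, the diversity values and the mutation rate below are hypothetical placeholders, not figures from the study.

```python
def ne_from_diversity(pi: float, mu: float) -> float:
    """Effective population size implied by neutral diversity.

    Assumes a diploid population at mutation-drift equilibrium,
    where expected nucleotide diversity is pi = 4 * Ne * mu.
    """
    if mu <= 0:
        raise ValueError("mutation rate must be positive")
    return pi / (4.0 * mu)


# Hypothetical inputs for illustration only (not data from the paper).
MU = 1e-9            # assumed per-site, per-generation mutation rate
PI_SPECIES_A = 0.0283
PI_SPECIES_B = 0.0100

ne_a = ne_from_diversity(PI_SPECIES_A, MU)
ne_b = ne_from_diversity(PI_SPECIES_B, MU)

# With a shared mutation rate, the Ne ratio reduces to the ratio of diversities.
ne_ratio = ne_a / ne_b
print(f"Ne ratio (A / B): {ne_ratio:.2f}")
```

If the two species differ in mutation rate, the diversity ratio no longer equals the Ne ratio, which is why the shared-rate assumption is stated explicitly.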
|
45
|
Sane M, Miranda JJ, Agashe D. Antagonistic pleiotropy for carbon use is rare in new mutations. Evolution 2018; 72:2202-2213. [PMID: 30095155 PMCID: PMC6203952 DOI: 10.1111/evo.13569] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Received: 05/11/2018] [Revised: 07/20/2018] [Accepted: 07/25/2018] [Indexed: 12/21/2022]
Abstract
Pleiotropic effects of mutations underlie diverse biological phenomena such as ageing and specialization. In particular, antagonistic pleiotropy ("AP": when a mutation has opposite fitness effects in different environments) generates tradeoffs, which may constrain adaptation. Models of adaptation typically assume that AP is common - especially among large-effect mutations - and that pleiotropic effect sizes are positively correlated. Empirical tests of these assumptions have focused on de novo beneficial mutations arising under strong selection. However, most mutations are actually deleterious or neutral, and may contribute to standing genetic variation that can subsequently drive adaptation. We quantified the incidence, nature, and effect size of pleiotropy for carbon utilization across 80 single mutations in Escherichia coli that arose under mutation accumulation (i.e., weak selection). Although ∼46% of the mutations were pleiotropic, only 11% showed AP; among beneficial mutations, only ∼4% showed AP. In some environments, AP was more common in large-effect mutations; and AP effect sizes across environments were often negatively correlated. Thus, AP for carbon use is generally rare (especially among beneficial mutations); is not consistently enriched in large-effect mutations; and often involves weakly deleterious antagonistic effects. Our unbiased quantification of mutational effects therefore suggests that antagonistic pleiotropy may be unlikely to cause maladaptive tradeoffs.
|
46
|
Abstract
Kimura's neutral theory argued that positive selection was not responsible for an appreciable fraction of molecular substitutions. Correspondingly, quantitative analysis reveals that the vast majority of substitutions in cancer genomes are not detectably under selection. Insights from the somatic evolution of cancer reveal that beneficial substitutions in cancer constitute a small but important fraction of the molecular variants. The molecular evolution of cancer community will benefit by incorporating the neutral theory of molecular evolution into their understanding and analysis of cancer evolution, and by accepting the use of tractable, predictive models, even when there is some evidence that they are not perfect.
|
47
|
Luijckx P, Ho EKH, Stanić A, Agrawal AF. Mutation accumulation in populations of varying size: large effect mutations cause most mutational decline in the rotifer Brachionus calyciflorus under UV-C radiation. J Evol Biol 2018; 31:924-932. [PMID: 29672987 DOI: 10.1111/jeb.13282] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Received: 11/06/2017] [Revised: 02/19/2018] [Accepted: 04/06/2018] [Indexed: 12/22/2022]
Abstract
Theory predicts that fitness decline via mutation accumulation will depend on population size, but there are only a few direct tests of this key idea. To gain a qualitative understanding of the fitness effect of new mutations, we performed a mutation accumulation experiment with the facultative sexual rotifer Brachionus calyciflorus at six different population sizes under UV-C radiation. Lifetime reproduction assays conducted after ten and sixteen UV-C radiations showed that while small populations lost fitness, fitness losses diminished rapidly with increasing population size. Populations kept as low as 10 individuals were able to maintain fitness close to the nonmutagenized populations throughout the experiment indicating that selection was able to remove the majority of large effect mutations in small populations. Although our results also seem to imply that small populations are effectively immune to mutational decay, we caution against this interpretation. Given sufficient time, populations of moderate to large size can experience declines in fitness from accumulating weakly deleterious mutations as demonstrated by fitness estimates from simulations and, tentatively, from a long-term experiment with populations of moderate size. There is mounting evidence to suggest that mutational distributions contain a heavier tail of large effects. Our results suggest that this is also true when the mutational spectrum is altered by UV radiation.
|
48
|
Grivet D, Avia K, Vaattovaara A, Eckert AJ, Neale DB, Savolainen O, González-Martínez SC. High rate of adaptive evolution in two widespread European pines. Mol Ecol 2017; 26:6857-6870. [PMID: 29110402 DOI: 10.1111/mec.14402] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Received: 04/28/2015] [Revised: 09/14/2017] [Accepted: 09/25/2017] [Indexed: 12/18/2022]
Abstract
Comparing related organisms with differing ecological requirements and evolutionary histories can shed light on the mechanisms and drivers underlying genetic adaptation. Here, by examining a common set of hundreds of loci, we compare patterns of nucleotide diversity and molecular adaptation of two European conifers (Scots pine and maritime pine) living in contrasted environments and characterized by distinct population genetic structure (low and clinal in Scots pine, high and ecotypic in maritime pine) and demographic histories. We found higher nucleotide diversity in Scots pine than in maritime pine, whereas rates of new adaptive substitutions (ωa), as estimated from the distribution of fitness effects, were similar across species and among the highest found in plants. Sample size and population genetic structure did not appear to have resulted in significant bias in estimates of ωa. Moreover, population contraction-expansion dynamics for each species did not affect differentially the rate of adaptive substitution in these two pines. Several methodological and biological factors may underlie the unusually high rate of adaptive evolution of Scots pine and maritime pine. By providing two new case studies with contrasting evolutionary histories, we contribute to disentangling the multiple factors potentially affecting adaptive evolution in natural plant populations.
|
49
|
Inference of Distribution of Fitness Effects and Proportion of Adaptive Substitutions from Polymorphism Data. Genetics 2017; 207:1103-1119. [PMID: 28951530 PMCID: PMC5676230 DOI: 10.1534/genetics.117.300323] [Citation(s) in RCA: 87] [Impact Index Per Article: 12.4] [Received: 07/05/2017] [Accepted: 09/13/2017] [Indexed: 11/18/2022] Open
Abstract
The distribution of fitness effects (DFE) encompasses the fraction of deleterious, neutral, and beneficial mutations. It conditions the evolutionary trajectory of populations, as well as the rate of adaptive molecular evolution (α). Inferring DFE and α from patterns of polymorphism, as given through the site frequency spectrum (SFS) and divergence data, has been a longstanding goal of evolutionary genetics. A widespread assumption shared by previous inference methods is that beneficial mutations only contribute negligibly to the polymorphism data. Hence, a DFE comprising only deleterious mutations tends to be estimated from SFS data, and α is then predicted by contrasting the SFS with divergence data from an outgroup. We develop a hierarchical probabilistic framework that extends previous methods to infer DFE and α from polymorphism data alone. We use extensive simulations to examine the performance of our method. While an outgroup is still needed to obtain an unfolded SFS, we show that both a DFE, comprising both deleterious and beneficial mutations, and α can be inferred without using divergence data. We also show that not accounting for the contribution of beneficial mutations to polymorphism data leads to substantially biased estimates of the DFE and α. We compare our framework with one of the most widely used inference methods available and apply it on a recently published chimpanzee exome data set.
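For orientation, the divergence-based approach that this abstract contrasts itself with can be sketched using the classic McDonald-Kreitman-style point estimate, α = 1 − (Ds·Pn)/(Dn·Ps). This is a simple stand-in for illustration, not the hierarchical probabilistic framework the paper develops. The function name and the counts are hypothetical.

```python
def mk_alpha(dn: int, ds: int, pn: int, ps: int) -> float:
    """McDonald-Kreitman-style estimate of the proportion of adaptive substitutions.

    dn, ds: counts of selected-class (e.g. nonsynonymous) and neutral-class
            (e.g. synonymous) fixed differences relative to an outgroup.
    pn, ps: counts of selected-class and neutral-class polymorphisms.

    This simple estimator ignores segregating weakly deleterious and
    beneficial variants, which is the bias DFE-based methods try to correct.
    """
    if dn == 0 or ps == 0:
        raise ValueError("dn and ps must be non-zero")
    return 1.0 - (ds * pn) / (dn * ps)


# Hypothetical counts, for illustration only.
alpha = mk_alpha(dn=60, ds=100, pn=30, ps=100)
print(f"alpha = {alpha:.2f}")  # prints "alpha = 0.50"
```

A negative value from this estimator usually signals an excess of selected-class polymorphism, for example from segregating slightly deleterious mutations, rather than a literally negative rate of adaptation.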
|
50
|
Abstract
Molecular population genetics aims to explain genetic variation and molecular evolution from population genetics principles. The field was born 50 years ago with the first measures of genetic variation in allozyme loci, continued with the nucleotide sequencing era, and is currently in the era of population genomics. During this period, molecular population genetics has been revolutionized by progress in data acquisition and theoretical developments. The conceptual elegance of the neutral theory of molecular evolution or the footprint carved by natural selection on the patterns of genetic variation are two examples of the vast number of inspiring findings of population genetics research. Since the inception of the field, Drosophila has been the prominent model species: molecular variation in populations was first described in Drosophila and most of the population genetics hypotheses were tested in Drosophila species. In this review, we describe the main concepts, methods, and landmarks of molecular population genetics, using the Drosophila model as a reference. We describe the different genetic data sets made available by advances in molecular technologies, and the theoretical developments fostered by these data. Finally, we review the results and new insights provided by the population genomics approach, and conclude by enumerating challenges and new lines of inquiry posed by increasingly large population scale sequence data.
|