1
|
James J, Kastally C, Budde KB, González-Martínez SC, Milesi P, Pyhäjärvi T, Lascoux M. Between but Not Within-Species Variation in the Distribution of Fitness Effects. Mol Biol Evol 2023; 40:msad228. [PMID: 37832225 PMCID: PMC10630145 DOI: 10.1093/molbev/msad228] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Revised: 09/04/2023] [Accepted: 09/25/2023] [Indexed: 10/15/2023] Open
Abstract
New mutations provide the raw material for evolution and adaptation. The distribution of fitness effects (DFE) describes the spectrum of effects of new mutations that can occur along a genome, and is, therefore, of vital interest in evolutionary biology. Recent work has uncovered striking similarities in the DFE between closely related species, prompting us to ask whether there is variation in the DFE among populations of the same species, or among species with different degrees of divergence, that is whether there is variation in the DFE at different levels of evolution. Using exome capture data from six tree species sampled across Europe we characterized the DFE for multiple species, and for each species, multiple populations, and investigated the factors potentially influencing the DFE, such as demography, population divergence, and genetic background. We find statistical support for the presence of variation in the DFE at the species level, even among relatively closely related species. However, we find very little difference at the population level, suggesting that differences in the DFE are primarily driven by deep features of species biology, and those evolutionarily recent events, such as demographic changes and local adaptation, have little impact.
Collapse
Affiliation(s)
- Jennifer James
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Swedish Collegium of Advanced Study, Uppsala University, Uppsala, Sweden
| | - Chedly Kastally
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Katharina B Budde
- Department of Forest Genetics and Forest Tree Breeding, Georg-August-University Goettingen, Goettingen, Germany
- Center of Biodiversity and Sustainable Land Use (CBL), University of Goettingen, Goettingen, Germany
| | - Santiago C González-Martínez
- National Research Institute for Agriculture, Food and the Environment (INRAE), University of Bordeaux, BIOGECO, Cestas, France
| | - Pascal Milesi
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Science for Life Laboratory (SciLifeLab), Uppsala University, Uppsala, Sweden
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
- Viikki Plant Science Centre, University of Helsinki, Helsinki, Finland
| | - Martin Lascoux
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
| |
Collapse
|
2
|
Andersson BA, Zhao W, Haller BC, Brännström Å, Wang XR. Inference of the distribution of fitness effects of mutations is affected by single nucleotide polymorphism filtering methods, sample size and population structure. Mol Ecol Resour 2023; 23:1589-1603. [PMID: 37340611 DOI: 10.1111/1755-0998.13825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 06/02/2023] [Accepted: 06/08/2023] [Indexed: 06/22/2023]
Abstract
The distribution of fitness effects (DFE) of new mutations has been of interest to evolutionary biologists since the concept of mutations arose. Modern population genomic data enable us to quantify the DFE empirically, but few studies have examined how data processing, sample size and cryptic population structure might affect the accuracy of DFE inference. We used simulated and empirical data (from Arabidopsis lyrata) to show the effects of missing data filtering, sample size, number of single nucleotide polymorphisms (SNPs) and population structure on the accuracy and variance of DFE estimates. Our analyses focus on three filtering methods-downsampling, imputation and subsampling-with sample sizes of 4-100 individuals. We show that (1) the choice of missing-data treatment directly affects the estimated DFE, with downsampling performing better than imputation and subsampling; (2) the estimated DFE is less reliable in small samples (<8 individuals), and becomes unpredictable with too few SNPs (<5000, the sum of 0- and 4-fold SNPs); and (3) population structure may skew the inferred DFE towards more strongly deleterious mutations. We suggest that future studies should consider downsampling for small data sets, and use samples larger than 4 (ideally larger than 8) individuals, with more than 5000 SNPs in order to improve the robustness of DFE inference and enable comparative analyses.
Collapse
Affiliation(s)
| | - Wei Zhao
- Department of Ecology and Environmental Sciences, Umeå University, Umeå, Sweden
| | - Benjamin C Haller
- Department of Computational Biology, Cornell University, Ithaca, New York, USA
| | - Åke Brännström
- Department of Mathematics and Mathematical Statistics, Umeå University, Umeå, Sweden
- Advancing Systems Analysis Program, International Institute for Applied Systems Analysis, Laxenburg, Austria
- Complexity Science and Evolution Unit, Okinawa Institute of Science and Technology Graduate University, Kunigami, Japan
| | - Xiao-Ru Wang
- Department of Ecology and Environmental Sciences, Umeå University, Umeå, Sweden
| |
Collapse
|
3
|
Cervantes S, Kesälahti R, Kumpula TA, Mattila TM, Helanterä H, Pyhäjärvi T. Strong Purifying Selection in Haploid Tissue-Specific Genes of Scots Pine Supports the Masking Theory. Mol Biol Evol 2023; 40:msad183. [PMID: 37565532 PMCID: PMC10457172 DOI: 10.1093/molbev/msad183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Revised: 06/16/2023] [Accepted: 08/10/2023] [Indexed: 08/12/2023] Open
Abstract
The masking theory states that genes expressed in a haploid stage will be under more efficient selection. In contrast, selection will be less efficient in genes expressed in a diploid stage, where the fitness effects of recessive deleterious or beneficial mutations can be hidden from selection in heterozygous form. This difference can influence several evolutionary processes such as the maintenance of genetic variation, adaptation rate, and genetic load. Masking theory expectations have been confirmed in single-cell haploid and diploid organisms. However, in multicellular organisms, such as plants, the effects of haploid selection are not clear-cut. In plants, the great majority of studies indicating haploid selection have been carried out using male haploid tissues in angiosperms. Hence, evidence in these systems is confounded with the effects of sexual selection and intraspecific competition. Evidence from other plant groups is scarce, and results show no support for the masking theory. Here, we have used a gymnosperm Scots pine megagametophyte, a maternally derived seed haploid tissue, and four diploid tissues to test the strength of purifying selection on a set of genes with tissue-specific expression. By using targeted resequencing data of those genes, we obtained estimates of genetic diversity, the site frequency spectrum of 0-fold and 4-fold sites, and inferred the distribution of fitness effects of new mutations in haploid and diploid tissue-specific genes. Our results show that purifying selection is stronger for tissue-specific genes expressed in the haploid megagametophyte tissue and that this signal of strong selection is not an artifact driven by high expression levels.
Collapse
Affiliation(s)
- Sandra Cervantes
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Robert Kesälahti
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Timo A Kumpula
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Laboratory of Cancer Genetics and Tumor Biology, Research Unit of Translational Medicine, University of Oulu, Oulu, Finland
| | - Tiina M Mattila
- Human Evolution, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Heikki Helanterä
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
| | - Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
| |
Collapse
|
4
|
Fernández-Calvet A, Toribio-Celestino L, Alonso-del Valle A, Sastre-Dominguez J, Valdes-Chiara P, San Millan A, DelaFuente J. The distribution of fitness effects of plasmid pOXA-48 in clinical enterobacteria. Microbiology (Reading) 2023; 169:001369. [PMID: 37505800 PMCID: PMC10433420 DOI: 10.1099/mic.0.001369] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/05/2023] [Accepted: 07/12/2023] [Indexed: 07/29/2023]
Abstract
Antimicrobial resistance (AMR) in bacteria is a major public health problem. The main route for AMR acquisition in clinically important bacteria is the horizontal transfer of plasmids carrying resistance genes. AMR plasmids allow bacteria to survive antibiotics, but they also entail physiological alterations in the host cell. Multiple studies over the last few years have indicated that these alterations can translate into a fitness cost when antibiotics are absent. However, due to technical limitations, most of these studies are based on analysing new associations between plasmids and bacteria generated in vitro, and we know very little about the effects of plasmids in their native bacterial hosts. In this study, we used a CRISPR-Cas9-tool to selectively cure plasmids from clinical enterobacteria to overcome this limitation. Using this approach, we were able to study the fitness effects of the carbapenem resistance plasmid pOXA-48 in 35 pOXA-48-carrying isolates recovered from hospitalized patients. Our results revealed that pOXA-48 produces variable effects across the collection of wild-type enterobacterial strains naturally carrying the plasmid, ranging from fitness costs to fitness benefits. Importantly, the plasmid was only associated with a significant fitness reduction in four out of 35 clones, and produced no significant changes in fitness in the great majority of isolates. Our results suggest that plasmids produce neutral fitness effects in most native bacterial hosts, helping to explain the great prevalence of plasmids in natural microbial communities.
Collapse
Affiliation(s)
| | | | | | | | | | - Alvaro San Millan
- Centro Nacional de Biotecnología (CNB-CSIC), Madrid, Spain
- Centro de Investigación Biológica en Red de Epidemiología y Salud Pública (CIBERESP), Instituto de Salud Carlos III, Madrid, Spain
| | | |
Collapse
|
5
|
Turck D, Bohn T, Castenmiller J, De Henauw S, Hirsch-Ernst KI, Knutsen HK, Maciuk A, Mangelsdorf I, McArdle HJ, Naska A, Peláez C, Siani A, Thies F, Tsabouri S, Vinceti M, Cubadda F, Abrahantes JC, Dumas C, Ercolano V, Titz A, Pentieva K. Conversion of calcium-l-methylfolate and (6S)-5-methyltetrahydrofolic acid glucosamine salt into dietary folate equivalents. EFSA J 2022; 20:e07452. [PMID: 36034319 PMCID: PMC9399872 DOI: 10.2903/j.efsa.2022.7452] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Following a request from the European Commission, the EFSA Panel on Nutrition, Novel Foods and Food Allergens (NDA) was asked to deliver a scientific opinion on the conversion of calcium‐l‐methylfolate and (6S)‐5‐methyltetrahydrofolic acid glucosamine salt (collectively called 5‐MTHF hereafter) into dietary folate equivalents (DFE). Following a systematic review, the conclusions of the opinion are based on one intervention study in adults for intakes < 400 μg/day and three intervention studies in adults for intakes ≥ 400 μg/day. At intakes below 400 μg/day, folic acid (FA) is assumed to be linearly related to responses of biomarkers of intake and status and is an appropriate comparator for deriving a DFE conversion factor for 5‐MTHF. It is proposed to use the same factor as for folic acid for conversion of 5‐MTHF into DFE for intakes < 400 μg/day. As such intake levels are unlikely to be exceeded through fortified food consumption, the conversion factor of 1.7 relative to natural food folate (NF) could be applied to 5‐MTHF added to foods and to food supplements providing < 400 μg/day. At 400 μg/day, 5‐MTHF was found to be more bioavailable than folic acid and a conversion factor of 2 is proposed for this intake level and for higher intakes. The derived DFE equations are DFE = NF + 1.7 × FA + 1.7 × 5‐MTHF for fortified foods and food supplements providing intakes < 400 μg/day; and DFE = NF + 1.7 × FA + 2.0 × 5‐MTHF for food supplements providing intakes ≥ 400 μg/day. Although this assessment applies to calcium‐L‐methylfolate and 5‐MTHF glucosamine salt, it is considered that the influence of the cation on bioavailability is likely to be within the margin of error of the proposed DFE equations. Therefore, the proposed equations can also be applied to 5‐MTHF associated with other cations.
Collapse
|
6
|
Chen J, Bataillon T, Glémin S, Lascoux M. Hunting for beneficial mutations: conditioning on SIFT scores when estimating the distribution of fitness effect of new mutations. Genome Biol Evol 2021; 14:6310736. [PMID: 34180988 PMCID: PMC8743036 DOI: 10.1093/gbe/evab151] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/21/2021] [Indexed: 11/13/2022] Open
Abstract
The Distribution of Fitness Effects (DFE) of new mutations is a key parameter of molecular evolution. The DFE can in principle be estimated by comparing the Site Frequency Spectra (SFS) of putatively neutral and functional polymorphisms. Unfortunately the DFE is intrinsically hard to estimate, especially for beneficial mutations since these tend to be exceedingly rare. There is therefore a strong incentive to find out whether conditioning on properties of mutations that are independent of the SFS could provide additional information. In the present study, we developed a new measure based on SIFT scores. SIFT scores are assigned to nucleotide sites based on their level of conservation across a multi species alignment: the more conserved a site, the more likely mutations occurring at this site are deleterious and the lower the SIFT score. If one knows the ancestral state at a given site, one can assign a value to new mutations occurring at the site based on the change of SIFT score associated with the mutation. We called this new measure δ. We show that properties of the DFE as well as the flux of beneficial mutations across classes covary with δ and, hence, that SIFT scores are informative when estimating the fitness effect of new mutations. In particular, conditioning on SIFT scores can help to characterize beneficial mutations.
Collapse
Affiliation(s)
- J Chen
- College of Life Sciences, Zhejiang University, Hangzhou, Zhejiang, 310058, China
| | - T Bataillon
- Bioinformatics Research Centre, Aarhus University, C.F. Møllers Allé 8, Aarhus C, DK-8000, Denmark
| | - S Glémin
- Université de Rennes, Centre National de la Recherche Scientifique (CNRS), ECOBIO (Ecosystèmes, Biodiversité, Evolution) - Unité Mixte de Recherche (UMR) 6553, Rennes, F-35000, France.,Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| | - M Lascoux
- Program in Plant Ecology and Evolution, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, 75236, Sweden
| |
Collapse
|
7
|
Dai L, Du Y, Qi H, Huber CD, Chen D, Zhang TH, Wu NC, Wang E, Lloyd-Smith JO, Sun R. Quantifying the Evolutionary Constraints and Potential of Hepatitis C Virus NS5A Protein. mSystems 2021; 6:e01111-20. [PMID: 33850042 DOI: 10.1128/mSystems.01111-20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022] Open
Abstract
RNA viruses, such as hepatitis C virus (HCV), influenza virus, and SARS-CoV-2, are notorious for their ability to evolve rapidly under selection in novel environments. It is known that the high mutation rate of RNA viruses can generate huge genetic diversity to facilitate viral adaptation. However, less attention has been paid to the underlying fitness landscape that represents the selection forces on viral genomes, especially under different selection conditions. Here, we systematically quantified the distribution of fitness effects of about 1,600 single amino acid substitutions in the drug-targeted region of NS5A protein of HCV. We found that the majority of nonsynonymous substitutions incur large fitness costs, suggesting that NS5A protein is highly optimized. The replication fitness of viruses is correlated with the pattern of sequence conservation in nature, and viral evolution is constrained by the need to maintain protein stability. We characterized the adaptive potential of HCV by subjecting the mutant viruses to selection by the antiviral drug daclatasvir at multiple concentrations. Both the relative fitness values and the number of beneficial mutations were found to increase with the increasing concentrations of daclatasvir. The changes in the spectrum of beneficial mutations in NS5A protein can be explained by a pharmacodynamics model describing viral fitness as a function of drug concentration. Overall, our results show that the distribution of fitness effects of mutations is modulated by both the constraints on the biophysical properties of proteins (i.e., selection pressure for protein stability) and the level of environmental stress (i.e., selection pressure for drug resistance). IMPORTANCE Many viruses adapt rapidly to novel selection pressures, such as antiviral drugs. Understanding how pathogens evolve under drug selection is critical for the success of antiviral therapy against human pathogens. By combining deep sequencing with selection experiments in cell culture, we have quantified the distribution of fitness effects of mutations in hepatitis C virus (HCV) NS5A protein. Our results indicate that the majority of single amino acid substitutions in NS5A protein incur large fitness costs. Simulation of protein stability suggests viral evolution is constrained by the need to maintain protein stability. By subjecting the mutant viruses to selection under an antiviral drug, we find that the adaptive potential of viral proteins in a novel environment is modulated by the level of environmental stress, which can be explained by a pharmacodynamics model. Our comprehensive characterization of the fitness landscapes of NS5A can potentially guide the design of effective strategies to limit viral evolution.
Collapse
|
8
|
Mahmutovic A, Abel Zur Wiesch P, Abel S. Selection or drift: The population biology underlying transposon insertion sequencing experiments. Comput Struct Biotechnol J 2020; 18:791-804. [PMID: 32280434 DOI: 10.1016/j.csbj.2020.03.021] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2019] [Revised: 03/06/2020] [Accepted: 03/22/2020] [Indexed: 01/23/2023] Open
Abstract
Transposon insertion sequencing methods such as Tn-seq revolutionized microbiology by allowing the identification of genomic loci that are critical for viability in a specific environment on a genome-wide scale. While powerful, transposon insertion sequencing suffers from limited reproducibility when different analysis methods are compared. From the perspective of population biology, this may be explained by changes in mutant frequency due to chance (drift) rather than differential fitness (selection). Here, we develop a mathematical model of the population biology of transposon insertion sequencing experiments, i.e. the changes in size and composition of the transposon-mutagenized population during the experiment. We use this model to investigate mutagenesis, the growth of the mutant library, and its passage through bottlenecks. Specifically, we study how these processes can lead to extinction of individual mutants depending on their fitness and the distribution of fitness effects (DFE) of the entire mutant population. We find that in typical in vitro experiments few mutants with high fitness go extinct. However, bottlenecks of a size that is common in animal infection models lead to so much random extinction that a large number of viable mutants would be misclassified. While mutants with low fitness are more likely to be lost during the experiment, mutants with intermediate fitness are expected to be much more abundant and can constitute a large proportion of detected hits, i.e. false positives. Thus, incorporating the DFEs of randomly generated mutations in the analysis may improve the reproducibility of transposon insertion experiments, especially when strong bottlenecks are encountered.
Collapse
|
9
|
Castellano D, Macià MC, Tataru P, Bataillon T, Munch K. Comparison of the Full Distribution of Fitness Effects of New Amino Acid Mutations Across Great Apes. Genetics 2019; 213:953-966. [PMID: 31488516 PMCID: PMC6827385 DOI: 10.1534/genetics.119.302494] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Accepted: 08/29/2019] [Indexed: 12/31/2022] Open
Abstract
The distribution of fitness effects (DFE) is central to many questions in evolutionary biology. However, little is known about the differences in DFE between closely related species. We use >9000 coding genes orthologous one-to-one across great apes, gibbons, and macaques to assess the stability of the DFE across great apes. We use the unfolded site frequency spectrum of polymorphic mutations (n = 8 haploid chromosomes per population) to estimate the DFE. We find that the shape of the deleterious DFE is strikingly similar across great apes. We confirm that effective population size (Ne ) is a strong predictor of the strength of negative selection, consistent with the nearly neutral theory. However, we also find that the strength of negative selection varies more than expected given the differences in Ne between species. Across species, mean fitness effects of new deleterious mutations covaries with Ne , consistent with positive epistasis among deleterious mutations. We find that the strength of negative selection for the smallest populations, bonobos and western chimpanzees, is higher than expected given their Ne This may result from a more efficient purging of strongly deleterious recessive variants in these populations. Forward simulations confirm that these findings are not artifacts of the way we are inferring Ne and DFE parameters. All findings are replicated using only GC-conservative mutations, thereby confirming that GC-biased gene conversion is not affecting our conclusions.
Collapse
Affiliation(s)
- David Castellano
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Moisès Coll Macià
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Paula Tataru
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Thomas Bataillon
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| | - Kasper Munch
- Bioinformatics Research Centre, Aarhus University, DK-8000 Aarhus C, Denmark
| |
Collapse
|
10
|
Abstract
Pleiotropic effects of mutations underlie diverse biological phenomena such as ageing and specialization. In particular, antagonistic pleiotropy ("AP": when a mutation has opposite fitness effects in different environments) generates tradeoffs, which may constrain adaptation. Models of adaptation typically assume that AP is common - especially among large-effect mutations - and that pleiotropic effect sizes are positively correlated. Empirical tests of these assumptions have focused on de novo beneficial mutations arising under strong selection. However, most mutations are actually deleterious or neutral, and may contribute to standing genetic variation that can subsequently drive adaptation. We quantified the incidence, nature, and effect size of pleiotropy for carbon utilization across 80 single mutations in Escherichia coli that arose under mutation accumulation (i.e., weak selection). Although ∼46% of the mutations were pleiotropic, only 11% showed AP; among beneficial mutations, only ∼4% showed AP. In some environments, AP was more common in large-effect mutations; and AP effect sizes across environments were often negatively correlated. Thus, AP for carbon use is generally rare (especially among beneficial mutations); is not consistently enriched in large-effect mutations; and often involves weakly deleterious antagonistic effects. Our unbiased quantification of mutational effects therefore suggests that antagonistic pleiotropy may be unlikely to cause maladaptive tradeoffs.
Collapse
Affiliation(s)
- Mrudula Sane
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| | - Joshua John Miranda
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| | - Deepa Agashe
- National Centre for Biological SciencesTata Institute of Fundamental ResearchBangaloreIndia
| |
Collapse
|
11
|
Abstract
The patterns of polymorphisms in genomes are imprints of the evolutionary forces at play in nature. In particular, polymorphisms have been extensively used to infer the fitness effects of mutations and their dynamics of fixation. However, the role and contribution of molecular biophysics to these observations remain unclear. Here, we couple robust findings from protein biophysics, enzymatic flux theory, the selection against the cytotoxic effects of protein misfolding, and explicit population dynamics simulations in the polyclonal regime. First, we recapitulate results on the dynamics of clonal interference and on the shape of the DFE, thus providing them with a molecular and mechanistic foundation. Second, we predict that if evolution is indeed under the dynamic equilibrium of mutation-selection balance, the fraction of stabilizing and destabilizing mutations is almost equal among single-nucleotide polymorphisms segregating at high allele frequencies. This prediction is proven true for polymorphisms in the human coding region. Overall, our results show how selection for protein folding stability predominantly shapes the patterns of polymorphisms in coding regions.
Collapse
|