1
|
García-Pintos LP. Limits on the evolutionary rates of biological traits. Sci Rep 2024; 14:11314. [PMID: 38760507 PMCID: PMC11101453 DOI: 10.1038/s41598-024-61872-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 05/10/2024] [Indexed: 05/19/2024] Open
Abstract
This paper focuses on the maximum speed at which biological evolution can occur. I derive inequalities that limit the rate of evolutionary processes driven by natural selection, mutations, or genetic drift. These rate limits link the variability in a population to evolutionary rates. In particular, high variances in the fitness of a population and of a quantitative trait allow for fast changes in the trait's average. In contrast, low variability makes a trait less susceptible to random changes due to genetic drift. The results in this article generalize Fisher's fundamental theorem of natural selection to dynamics that allow for mutations and genetic drift, via trade-off relations that constrain the evolutionary rates of arbitrary traits. The rate limits can be used to probe questions in various evolutionary biology and ecology settings. They apply, for instance, to trait dynamics within or across species or to the evolution of bacteria strains. They apply to any quantitative trait, e.g., from species' weights to the lengths of DNA strands.
Collapse
Affiliation(s)
- Luis Pedro García-Pintos
- Theoretical Division (T4), Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
- Joint Center for Quantum Information and Computer Science and Joint Quantum Institute, NIST/University of Maryland, College Park, MD, 20742, USA.
| |
Collapse
|
2
|
Kawakatsu M, Kessinger TA, Plotkin JB. A mechanistic model of gossip, reputations, and cooperation. Proc Natl Acad Sci U S A 2024; 121:e2400689121. [PMID: 38717858 PMCID: PMC11098103 DOI: 10.1073/pnas.2400689121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Accepted: 04/12/2024] [Indexed: 05/18/2024] Open
Abstract
Social reputations facilitate cooperation: those who help others gain a good reputation, making them more likely to receive help themselves. But when people hold private views of one another, this cycle of indirect reciprocity breaks down, as disagreements lead to the perception of unjustified behavior that ultimately undermines cooperation. Theoretical studies often assume population-wide agreement about reputations, invoking rapid gossip as an endogenous mechanism for reaching consensus. However, the theory of indirect reciprocity lacks a mechanistic description of how gossip actually generates consensus. Here, we develop a mechanistic model of gossip-based indirect reciprocity that incorporates two alternative forms of gossip: exchanging information with randomly selected peers or consulting a single gossip source. We show that these two forms of gossip are mathematically equivalent under an appropriate transformation of parameters. We derive an analytical expression for the minimum amount of gossip required to reach sufficient consensus and stabilize cooperation. We analyze how the amount of gossip necessary for cooperation depends on the benefits and costs of cooperation, the assessment rule (social norm), and errors in reputation assessment, strategy execution, and gossip transmission. Finally, we show that biased gossip can either facilitate or hinder cooperation, depending on the direction and magnitude of the bias. Our results contribute to the growing literature on cooperation facilitated by communication, and they highlight the need to study strategic interactions coupled with the spread of social information.
Collapse
Affiliation(s)
- Mari Kawakatsu
- Department of Biology, University of Pennsylvania, Philadelphia, PA19104
- Center for Mathematical Biology, University of Pennsylvania, Philadelphia, PA19104
| | | | - Joshua B. Plotkin
- Department of Biology, University of Pennsylvania, Philadelphia, PA19104
- Center for Mathematical Biology, University of Pennsylvania, Philadelphia, PA19104
| |
Collapse
|
3
|
Fine AG, Steinrücken M. A novel expectation-maximization approach to infer general diploid selection from time-series genetic data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.10.593575. [PMID: 38798346 PMCID: PMC11118272 DOI: 10.1101/2024.05.10.593575] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Detecting and quantifying the strength of selection is a main objective in population genetics. Since selection acts over multiple generations, many approaches have been developed to detect and quantify selection using genetic data sampled at multiple points in time. Such time series genetic data is commonly analyzed using Hidden Markov Models, but in most cases, under the assumption of additive selection. However, many examples of genetic variation exhibiting non-additive mechanisms exist, making it critical to develop methods that can characterize selection in more general scenarios. Thus, we extend a previously introduced expectation-maximization algorithm for the inference of additive selection coefficients to the case of general diploid selection, in which heterozygote and homozygote fitnesses are parameterized independently. We furthermore introduce a framework to identify bespoke modes of diploid selection from given data, as well as a procedure for aggregating data across linked loci to increase power and robustness. Using extensive simulation studies, we find that our method accurately and efficiently estimates selection coefficients for different modes of diploid selection across a wide range of scenarios; however, power to classify the mode of selection is low unless selection is very strong. We apply our method to ancient DNA samples from Great Britain in the last 4,450 years, and detect evidence for selection in six genomic regions, including the well-characterized LCT locus. Our work is the first genome-wide scan characterizing signals of general diploid selection.
Collapse
Affiliation(s)
- Adam G Fine
- Department of Ecology and Evolution, University of Chicago
- Graduate Program in Biophysical Sciences, University of Chicago
| | - Matthias Steinrücken
- Department of Ecology and Evolution, University of Chicago
- Department of Human Genetics, University of Chicago
| |
Collapse
|
4
|
Guerrero Montero J, Blythe RA. Self-contained Beta-with-Spikes approximation for inference under a Wright-Fisher model. Genetics 2023; 225:iyad092. [PMID: 37226886 PMCID: PMC10550310 DOI: 10.1093/genetics/iyad092] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 03/10/2023] [Accepted: 05/10/2023] [Indexed: 05/26/2023] Open
Abstract
We construct a reliable estimation method for evolutionary parameters within the Wright-Fisher model, which describes changes in allele frequencies due to selection and genetic drift, from time-series data. Such data exist for biological populations, for example via artificial evolution experiments, and for the cultural evolution of behavior, such as linguistic corpora that document historical usage of different words with similar meanings. Our method of analysis builds on a Beta-with-Spikes approximation to the distribution of allele frequencies predicted by the Wright-Fisher model. We introduce a self-contained scheme for estimating parameters in the approximation, and demonstrate its robustness with synthetic data, especially in the strong-selection and near-extinction regimes where previous approaches fail. We further apply the method to allele frequency data for baker's yeast (Saccharomyces cerevisiae), finding a significant signal of selection in cases where independent evidence supports such a conclusion. We further demonstrate the possibility of detecting time points at which evolutionary parameters change in the context of a historical spelling reform in the Spanish language.
Collapse
Affiliation(s)
- Juan Guerrero Montero
- SUPA, School of Physics and Astronomy, University of Edinburgh, Edinburgh, EH9 3FD, UK
| | - Richard A Blythe
- Corresponding author: SUPA, School of Physics and Astronomy, University of Edinburgh, Edinburgh EH9 3FD, UK.
| |
Collapse
|
5
|
Torrillo PA, Lieberman TD. Reversions mask the contribution of adaptive evolution in microbiomes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.09.14.557751. [PMID: 37745437 PMCID: PMC10515931 DOI: 10.1101/2023.09.14.557751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/26/2023]
Abstract
When examining bacterial genomes for evidence of past selection, the results obtained depend heavily on the mutational distance between chosen genomes. Even within a bacterial species, genomes separated by larger mutational distances exhibit stronger evidence of purifying selection as assessed by d N / d S , the normalized ratio of nonsynonymous to synonymous mutations. This dependence on mutational distance, and thus time, has been proposed to arise from the inefficiency of purifying selection at removing weakly deleterious mutations. Here, we revisit this assumption in light of abundant genomes from gut microbiomes and show that a model of weak purifying selection that fits the data leads to problematic mutation accumulation. We propose an alternative model to explain the timescale dependence of d N / d S , in which constantly changing environmental pressures select for revertants of previously adaptive mutations. Reversions that sweep within-host populations are nearly guaranteed in microbiomes due to large population sizes, short generation times, and variable environments. Using analytical and simulation approaches, we fit the adaptive reversion model to d N / d S decay curves and obtain estimates of local adaptation that are realistic in the context of bacterial genomes. These results argue for interpreting low values of d N / d S with caution, as they may emerge even when adaptive sweeps are frequent. This work reframes an old observation in bacterial evolution, illustrates the potential of mutation reversions to shape genomic landscapes over time, and highlights the need for additional research on bacterial genomic evolution on short time scales.
Collapse
Affiliation(s)
- Paul A. Torrillo
- Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Tami D. Lieberman
- Institute for Medical Engineering and Sciences, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Ragon Institute of MGH, MIT, and Harvard, Cambridge, MA 02139, USA
| |
Collapse
|
6
|
Shimagaki K, Barton JP. Bézier interpolation improves the inference of dynamical models from data. Phys Rev E 2023; 107:024116. [PMID: 36932614 PMCID: PMC10027371 DOI: 10.1103/physreve.107.024116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Accepted: 01/23/2023] [Indexed: 06/18/2023]
Abstract
Many dynamical systems, from quantum many-body systems to evolving populations to financial markets, are described by stochastic processes. Parameters characterizing such processes can often be inferred using information integrated over stochastic paths. However, estimating time-integrated quantities from real data with limited time resolution is challenging. Here, we propose a framework for accurately estimating time-integrated quantities using Bézier interpolation. We applied our approach to two dynamical inference problems: Determining fitness parameters for evolving populations and inferring forces driving Ornstein-Uhlenbeck processes. We found that Bézier interpolation reduces the estimation bias for both dynamical inference problems. This improvement was especially noticeable for data sets with limited time resolution. Our method could be broadly applied to improve accuracy for other dynamical inference problems using finitely sampled data.
Collapse
Affiliation(s)
- Kai Shimagaki
- Department of Physics and Astronomy, University of California, Riverside, California 92521, USA
| | - John P. Barton
- Department of Physics and Astronomy, University of California, Riverside, California 92521, USA
- Department of Computational and Systems Biology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|
7
|
Jensen AJ, Hagen IJ, Czorlich Y, Bolstad GH, Bremset G, Finstad B, Hindar K, Skaala Ø, Karlsson S. Large-effect loci mediate rapid adaptation of salmon body size after river regulation. Proc Natl Acad Sci U S A 2022; 119:e2207634119. [PMID: 36279467 PMCID: PMC9636922 DOI: 10.1073/pnas.2207634119] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2022] [Accepted: 09/20/2022] [Indexed: 02/18/2024] Open
Abstract
Understanding the potential of natural populations to adapt to altered environments is becoming increasingly relevant in evolutionary research. Currently, our understanding of adaptation to human alteration of the environment is hampered by lack of knowledge on the genetic basis of traits, lack of time series, and little or no information on changes in optimal trait values. Here, we used time series data spanning nearly a century to investigate how the body mass of Atlantic salmon (Salmo salar) adapts to river regulation. We found that the change in body mass followed the change in waterflow, both decreasing to ∼1/3 of their original values. Allele frequency changes at two loci in the regions of vgll3 and six6 predicted more than 80% of the observed body mass reduction. Modeling the adaptive dynamics revealed that the population mean lagged behind its optimum before catching up approximately six salmon generations after the initial waterflow reduction. Our results demonstrate rapid adaptation mediated by large-effect loci and provide insight into the temporal dynamics of evolutionary rescue following human disturbance.
Collapse
Affiliation(s)
- Arne J. Jensen
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| | - Ingerid J. Hagen
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| | - Yann Czorlich
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| | - Geir H. Bolstad
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| | | | - Bengt Finstad
- Department of Biology, Norwegian University of Science and Technology, NO-7491 Trondheim, Norway
| | - Kjetil Hindar
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| | | | - Sten Karlsson
- Norwegian Institute for Nature Research, NO-7485 Trondheim, Norway
| |
Collapse
|
8
|
Sohail MS, Louie RHY, Hong Z, Barton JP, McKay MR. Inferring Epistasis from Genetic Time-series Data. Mol Biol Evol 2022; 39:6710201. [PMID: 36130322 PMCID: PMC9558069 DOI: 10.1093/molbev/msac199] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023] Open
Abstract
Epistasis refers to fitness or functional effects of mutations that depend on the sequence background in which these mutations arise. Epistasis is prevalent in nature, including populations of viruses, bacteria, and cancers, and can contribute to the evolution of drug resistance and immune escape. However, it is difficult to directly estimate epistatic effects from sampled observations of a population. At present, there are very few methods that can disentangle the effects of selection (including epistasis), mutation, recombination, genetic drift, and genetic linkage in evolving populations. Here we develop a method to infer epistasis, along with the fitness effects of individual mutations, from observed evolutionary histories. Simulations show that we can accurately infer pairwise epistatic interactions provided that there is sufficient genetic diversity in the data. Our method also allows us to identify which fitness parameters can be reliably inferred from a particular data set and which ones are unidentifiable. Our approach therefore allows for the inference of more complex models of selection from time-series genetic data, while also quantifying uncertainty in the inferred parameters.
Collapse
Affiliation(s)
- Muhammad Saqib Sohail
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong SAR, People’s Republic of China
| | - Raymond H Y Louie
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
| | - Zhenchen Hong
- Department of Physics and Astronomy, University of California, Riverside, CA, USA
| | | | | |
Collapse
|
9
|
Mathieson I, Terhorst J. Direct detection of natural selection in Bronze Age Britain. Genome Res 2022; 32:2057-2067. [PMID: 36316157 PMCID: PMC9808619 DOI: 10.1101/gr.276862.122] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2022] [Accepted: 08/29/2022] [Indexed: 11/04/2022]
Abstract
We developed a novel method for efficiently estimating time-varying selection coefficients from genome-wide ancient DNA data. In simulations, our method accurately recovers selective trajectories and is robust to misspecification of population size. We applied it to a large data set of ancient and present-day human genomes from Britain and identified seven loci with genome-wide significant evidence of selection in the past 4500 yr. Almost all of them can be related to increased vitamin D or calcium levels, suggesting strong selective pressure on these or related phenotypes. However, the strength of selection on individual loci varied substantially over time, suggesting that cultural or environmental factors moderated the genetic response. Of 28 complex anthropometric and metabolic traits, skin pigmentation was the only one with significant evidence of polygenic selection, further underscoring the importance of phenotypes related to vitamin D. Our approach illustrates the power of ancient DNA to characterize selection in human populations and illuminates the recent evolutionary history of Britain.
Collapse
Affiliation(s)
- Iain Mathieson
- Department of Genetics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Jonathan Terhorst
- Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
10
|
Okada T, Hallatschek O. Dynamic sampling bias and overdispersion induced by skewed offspring distributions. Genetics 2021; 219:6363801. [PMID: 34718557 DOI: 10.1093/genetics/iyab135] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2021] [Accepted: 08/06/2021] [Indexed: 11/14/2022] Open
Abstract
Natural populations often show enhanced genetic drift consistent with a strong skew in their offspring number distribution. The skew arises because the variability of family sizes is either inherently strong or amplified by population expansions. The resulting allele-frequency fluctuations are large and, therefore, challenge standard models of population genetics, which assume sufficiently narrow offspring distributions. While the neutral dynamics backward in time can be readily analyzed using coalescent approaches, we still know little about the effect of broad offspring distributions on the forward-in-time dynamics, especially with selection. Here, we employ an asymptotic analysis combined with a scaling hypothesis to demonstrate that over-dispersed frequency trajectories emerge from the competition of conventional forces, such as selection or mutations, with an emerging time-dependent sampling bias against the minor allele. The sampling bias arises from the characteristic time-dependence of the largest sampled family size within each allelic type. Using this insight, we establish simple scaling relations for allele-frequency fluctuations, fixation probabilities, extinction times, and the site frequency spectra that arise when offspring numbers are distributed according to a power law.
Collapse
Affiliation(s)
- Takashi Okada
- Departments of Physics and Integrative Biology, University of California, Berkeley, CA 94720, USA.,RIKEN iTHEMS, Wako, Saitama 351-0198, Japan
| | - Oskar Hallatschek
- Departments of Physics and Integrative Biology, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
11
|
Stark TL, Liberles DA. Characterizing Amino Acid Substitution with Complete Linkage of Sites on a Lineage. Genome Biol Evol 2021; 13:6377338. [PMID: 34581792 PMCID: PMC8557849 DOI: 10.1093/gbe/evab225] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/17/2021] [Indexed: 11/16/2022] Open
Abstract
Amino acid substitution models are commonly used for phylogenetic inference, for ancestral sequence reconstruction, and for the inference of positive selection. All commonly used models explicitly assume that each site evolves independently, an assumption that is violated by both linkage and protein structural and functional constraints. We introduce two new models for amino acid substitution which incorporate linkage between sites, each based on the (population-genetic) Moran model. The first model is a generalized population process tracking arbitrarily many sites which undergo mutation, with individuals replaced according to their fitnesses. This model provides a reasonably complete framework for simulations but is numerically and analytically intractable. We also introduce a second model which includes several simplifying assumptions but for which some theoretical results can be derived. We analyze the simplified model to determine conditions where linkage is likely to have meaningful effects on sitewise substitution probabilities, as well as conditions under which the effects are likely to be negligible. These findings are an important step in the generation of tractable phylogenetic models that parameterize selective coefficients for amino acid substitution while accounting for linkage of sites leading to both hitchhiking and background selection.
Collapse
Affiliation(s)
- Tristan L Stark
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| |
Collapse
|
12
|
Salehi S, Kabeer F, Ceglia N, Andronescu M, Williams MJ, Campbell KR, Masud T, Wang B, Biele J, Brimhall J, Gee D, Lee H, Ting J, Zhang AW, Tran H, O'Flanagan C, Dorri F, Rusk N, de Algara TR, Lee SR, Cheng BYC, Eirew P, Kono T, Pham J, Grewal D, Lai D, Moore R, Mungall AJ, Marra MA, McPherson A, Bouchard-Côté A, Aparicio S, Shah SP. Clonal fitness inferred from time-series modelling of single-cell cancer genomes. Nature 2021; 595:585-590. [PMID: 34163070 PMCID: PMC8396073 DOI: 10.1038/s41586-021-03648-3] [Citation(s) in RCA: 52] [Impact Index Per Article: 17.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2020] [Accepted: 05/17/2021] [Indexed: 02/02/2023]
Abstract
Progress in defining genomic fitness landscapes in cancer, especially those defined by copy number alterations (CNAs), has been impeded by lack of time-series single-cell sampling of polyclonal populations and temporal statistical models1-7. Here we generated 42,000 genomes from multi-year time-series single-cell whole-genome sequencing of breast epithelium and primary triple-negative breast cancer (TNBC) patient-derived xenografts (PDXs), revealing the nature of CNA-defined clonal fitness dynamics induced by TP53 mutation and cisplatin chemotherapy. Using a new Wright-Fisher population genetics model8,9 to infer clonal fitness, we found that TP53 mutation alters the fitness landscape, reproducibly distributing fitness over a larger number of clones associated with distinct CNAs. Furthermore, in TNBC PDX models with mutated TP53, inferred fitness coefficients from CNA-based genotypes accurately forecast experimentally enforced clonal competition dynamics. Drug treatment in three long-term serially passaged TNBC PDXs resulted in cisplatin-resistant clones emerging from low-fitness phylogenetic lineages in the untreated setting. Conversely, high-fitness clones from treatment-naive controls were eradicated, signalling an inversion of the fitness landscape. Finally, upon release of drug, selection pressure dynamics were reversed, indicating a fitness cost of treatment resistance. Together, our findings define clonal fitness linked to both CNA and therapeutic resistance in polyclonal tumours.
Collapse
Affiliation(s)
- Sohrab Salehi
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Farhia Kabeer
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Nicholas Ceglia
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Mirela Andronescu
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada
| | - Marc J Williams
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Kieran R Campbell
- Lunenfeld-Tanenbaum Research Institute Mount Sinai Hospital Joseph & Wolf Lebovic Health Complex, Molecular Genetics, University of Toronto, Toronto, Ontario, Canada
| | - Tehmina Masud
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Beixi Wang
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Justina Biele
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Jazmine Brimhall
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - David Gee
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Hakwoo Lee
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Jerome Ting
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Allen W Zhang
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Hoa Tran
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Ciara O'Flanagan
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Fatemeh Dorri
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
- Department of Computer Science, University of British Columbia, Vancouver, British Columbia, Canada
| | - Nicole Rusk
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | | | - So Ra Lee
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Brian Yu Chieh Cheng
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Peter Eirew
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Takako Kono
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Jenifer Pham
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Diljot Grewal
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Daniel Lai
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada
| | - Richard Moore
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada
| | - Andrew J Mungall
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada
| | - Marco A Marra
- Canada's Michael Smith Genome Sciences Centre, BC Cancer, Vancouver, British Columbia, Canada
| | - Andrew McPherson
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA
| | - Alexandre Bouchard-Côté
- Department of Statistics, University of British Columbia, Vancouver, British Columbia, Canada
| | - Samuel Aparicio
- Department of Molecular Oncology, BC Cancer, Vancouver, British Columbia, Canada.
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, British Columbia, Canada.
| | - Sohrab P Shah
- Computational Oncology, Department of Epidemiology and Biostatistics, Memorial Sloan Kettering Cancer Center, New York, NY, USA.
| |
Collapse
|
13
|
Sohail MS, Louie RHY, McKay MR, Barton JP. MPL resolves genetic linkage in fitness inference from complex evolutionary histories. Nat Biotechnol 2021; 39:472-479. [PMID: 33257862 PMCID: PMC8044047 DOI: 10.1038/s41587-020-0737-3] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2019] [Accepted: 10/14/2020] [Indexed: 12/13/2022]
Abstract
Genetic linkage causes the fate of new mutations in a population to be contingent on the genetic background on which they appear. This makes it challenging to identify how individual mutations affect fitness. To overcome this challenge, we developed marginal path likelihood (MPL), a method to infer selection from evolutionary histories that resolves genetic linkage. Validation on real and simulated data sets shows that MPL is fast and accurate, outperforming existing inference approaches. We found that resolving linkage is crucial for accurately quantifying selection in complex evolving populations, which we demonstrate through a quantitative analysis of intrahost HIV-1 evolution using multiple patient data sets. Linkage effects generated by variants that sweep rapidly through the population are particularly strong, extending far across the genome. Taken together, our results argue for the importance of resolving linkage in studies of natural selection.
Collapse
Affiliation(s)
- Muhammad Saqib Sohail
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong, China
| | - Raymond H Y Louie
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong, China
- Institute for Advanced Study, Hong Kong University of Science and Technology, Hong Kong, China
- The Kirby Institute, University of New South Wales, Sydney, New South Wales, Australia
- School of Medical Sciences, University of New South Wales, Sydney, New South Wales, Australia
| | - Matthew R McKay
- Department of Electronic and Computer Engineering, Hong Kong University of Science and Technology, Hong Kong, China.
- Department of Chemical and Biological Engineering, Hong Kong University of Science and Technology, Hong Kong, China.
| | - John P Barton
- Department of Physics and Astronomy, University of California, Riverside, Riverside, CA, USA.
| |
Collapse
|
14
|
Sachdeva V, Mora T, Walczak AM, Palmer SE. Optimal prediction with resource constraints using the information bottleneck. PLoS Comput Biol 2021; 17:e1008743. [PMID: 33684112 PMCID: PMC7971903 DOI: 10.1371/journal.pcbi.1008743] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2020] [Revised: 03/18/2021] [Accepted: 01/27/2021] [Indexed: 11/19/2022] Open
Abstract
Responding to stimuli requires that organisms encode information about the external world. Not all parts of the input are important for behavior, and resource limitations demand that signals be compressed. Prediction of the future input is widely beneficial in many biological systems. We compute the trade-offs between representing the past faithfully and predicting the future using the information bottleneck approach, for input dynamics with different levels of complexity. For motion prediction, we show that, depending on the parameters in the input dynamics, velocity or position information is more useful for accurate prediction. We show which motion representations are easiest to re-use for accurate prediction in other motion contexts, and identify and quantify those with the highest transferability. For non-Markovian dynamics, we explore the role of long-term memory in shaping the internal representation. Lastly, we show that prediction in evolutionary population dynamics is linked to clustering allele frequencies into non-overlapping memories.
Collapse
Affiliation(s)
- Vedant Sachdeva
- Graduate Program in Biophysical Sciences, University of Chicago, Chicago, Illinois, United States of America
| | - Thierry Mora
- Laboratoire de physique de l’École normale supérieure, Centre National de la Recherche Scientifique, Paris, France
- Paris Sciences et Lettres University Paris, Paris, France
- Sorbonne Université Paris, Paris, France
- Université de Paris, Paris, France
| | - Aleksandra M. Walczak
- Laboratoire de physique de l’École normale supérieure, Centre National de la Recherche Scientifique, Paris, France
- Paris Sciences et Lettres University Paris, Paris, France
- Sorbonne Université Paris, Paris, France
- Université de Paris, Paris, France
| | - Stephanie E. Palmer
- Department of Organismal Biology and Anatomy, University of Chicago, Chicago, Illinois, United States of America
- Department of Physics, University of Chicago, Chicago, Illinois, United States of America
| |
Collapse
|
15
|
Mukhopadhyay A, Chakraborty S. Replicator equations induced by microscopic processes in nonoverlapping population playing bimatrix games. CHAOS (WOODBURY, N.Y.) 2021; 31:023123. [PMID: 33653037 DOI: 10.1063/5.0032311] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Accepted: 01/27/2021] [Indexed: 06/12/2023]
Abstract
This paper is concerned with exploring the microscopic basis for the discrete versions of the standard replicator equation and the adjusted replicator equation. To this end, we introduce frequency-dependent selection-as a result of competition fashioned by game-theoretic consideration-into the Wright-Fisher process, a stochastic birth-death process. The process is further considered to be active in a generation-wise nonoverlapping finite population where individuals play a two-strategy bimatrix population game. Subsequently, connections among the corresponding master equation, the Fokker-Planck equation, and the Langevin equation are exploited to arrive at the deterministic discrete replicator maps in the limit of infinite population size.
Collapse
Affiliation(s)
- Archan Mukhopadhyay
- Department of Physics, Indian Institute of Technology Kanpur, Uttar Pradesh 208016, India
| | - Sagar Chakraborty
- Department of Physics, Indian Institute of Technology Kanpur, Uttar Pradesh 208016, India
| |
Collapse
|
16
|
He Z, Dai X, Beaumont M, Yu F. Detecting and Quantifying Natural Selection at Two Linked Loci from Time Series Data of Allele Frequencies with Forward-in-Time Simulations. Genetics 2020; 216:521-541. [PMID: 32826299 PMCID: PMC7536848 DOI: 10.1534/genetics.120.303463] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2019] [Accepted: 08/15/2020] [Indexed: 12/16/2022] Open
Abstract
Recent advances in DNA sequencing techniques have made it possible to monitor genomes in great detail over time. This improvement provides an opportunity for us to study natural selection based on time serial samples of genomes while accounting for genetic recombination effect and local linkage information. Such time series genomic data allow for more accurate estimation of population genetic parameters and hypothesis testing on the recent action of natural selection. In this work, we develop a novel Bayesian statistical framework for inferring natural selection at a pair of linked loci by capitalising on the temporal aspect of DNA data with the additional flexibility of modeling the sampled chromosomes that contain unknown alleles. Our approach is built on a hidden Markov model where the underlying process is a two-locus Wright-Fisher diffusion with selection, which enables us to explicitly model genetic recombination and local linkage. The posterior probability distribution for selection coefficients is computed by applying the particle marginal Metropolis-Hastings algorithm, which allows us to efficiently calculate the likelihood. We evaluate the performance of our Bayesian inference procedure through extensive simulations, showing that our approach can deliver accurate estimates of selection coefficients, and the addition of genetic recombination and local linkage brings about significant improvement in the inference of natural selection. We also illustrate the utility of our method on real data with an application to ancient DNA data associated with white spotting patterns in horses.
Collapse
Affiliation(s)
- Zhangyi He
- School of Mathematics, University of Bristol, BS8 1UG, United Kingdom
| | - Xiaoyang Dai
- School of Biological Sciences, University of Bristol, BS8 1TQ, United Kingdom
| | - Mark Beaumont
- School of Biological Sciences, University of Bristol, BS8 1TQ, United Kingdom
| | - Feng Yu
- School of Mathematics, University of Bristol, BS8 1UG, United Kingdom
| |
Collapse
|
17
|
Vicuña L, Klimenkova O, Norambuena T, Martinez FI, Fernandez MI, Shchur V, Eyheramendy S. Postadmixture Selection on Chileans Targets Haplotype Involved in Pigmentation, Thermogenesis and Immune Defense against Pathogens. Genome Biol Evol 2020; 12:1459-1470. [PMID: 32614437 PMCID: PMC7487163 DOI: 10.1093/gbe/evaa136] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/24/2020] [Indexed: 12/13/2022] Open
Abstract
Detection of positive selection signatures in populations around the world is helping to uncover recent human evolutionary history as well as the genetic basis of diseases. Most human evolutionary genomic studies have been performed in European, African, and Asian populations. However, populations with Native American ancestry have been largely underrepresented. Here, we used a genome-wide local ancestry enrichment approach complemented with neutral simulations to identify postadmixture adaptations underwent by admixed Chileans through gene flow from Europeans into local Native Americans. The top significant hits (P = 2.4×10-7) are variants in a region on chromosome 12 comprising multiple regulatory elements. This region includes rs12821256, which regulates the expression of KITLG, a well-known gene involved in lighter hair and skin pigmentation in Europeans as well as in thermogenesis. Another variant from that region is associated with the long noncoding RNA RP11-13A1.1, which has been specifically involved in the innate immune response against infectious pathogens. Our results suggest that these genes were relevant for adaptation in Chileans following the Columbian exchange.
Collapse
Affiliation(s)
- Lucas Vicuña
- Faculty of Engineering and Sciences, Universidad Adolfo Ibañez, Peñalolén, Santiago, Chile
| | - Olga Klimenkova
- National Research University Higher School of Economics, Russian Federation, Moscow, Russia
| | - Tomás Norambuena
- Faculty of Engineering and Sciences, Universidad Adolfo Ibañez, Peñalolén, Santiago, Chile
| | - Felipe I Martinez
- Center for Intercultural and Indigenous Research, School of Anthropology, Faculty of Social Sciences, Pontificia Universidad Católica de Chile, Santiago, Chile
| | - Mario I Fernandez
- Department of Urology, Clínica Alemana, Santiago, Chile
- Center for Genetics and Genomics, Faculty of Medicine, Clínica Alemana, Universidad del Desarrollo, Santiago, Chile
| | - Vladimir Shchur
- National Research University Higher School of Economics, Russian Federation, Moscow, Russia
| | - Susana Eyheramendy
- Faculty of Engineering and Sciences, Universidad Adolfo Ibañez, Peñalolén, Santiago, Chile
- Instituto Milenio de Investigación sobre los Fundamentos de los Datos (IMFD)
| |
Collapse
|
18
|
Abstract
Neutral models of evolution assume the absence of natural selection. Formerly confined to ecology and evolutionary biology, neutral models are spreading. In recent years they've been applied to explaining the diversity of baby names, scientific citations, cryptocurrencies, pot decorations, literary lexica, tumour variants and much more besides. Here, we survey important neutral models and highlight their similarities. We investigate the most widely used tests of neutrality, show that they are weak and suggest more powerful methods. We conclude by discussing the role of neutral models in the explanation of diversity. We suggest that the ability of neutral models to fit low-information distributions should not be taken as evidence for the absence of selection. Nevertheless, many studies, in increasingly diverse fields, make just such claims. We call this tendency 'neutral syndrome'.
Collapse
|
19
|
Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model. G3-GENES GENOMES GENETICS 2019; 9:4073-4086. [PMID: 31597676 PMCID: PMC6893182 DOI: 10.1534/g3.119.400778] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Detecting genomic regions under selection is an important objective of population genetics. Typical analyses for this goal are based on exploiting genetic diversity patterns in present time data but rapid advances in DNA sequencing have increased the availability of time series genomic data. A common approach to analyze such data is to model the temporal evolution of an allele frequency as a Markov chain. Based on this principle, several methods have been proposed to infer selection intensity. One of their differences lies in how they model the transition probabilities of the Markov chain. Using the Wright-Fisher model is a natural choice but its computational cost is prohibitive for large population sizes so approximations to this model based on parametric distributions have been proposed. Here, we compared the performance of some of these approximations with respect to their power to detect selection and their estimation of the selection coefficient. We developped a new generic Hidden Markov Model likelihood calculator and applied it on genetic time series simulated under various evolutionary scenarios. The Beta with spikes approximation, which combines discrete fixation probabilities with a continuous Beta distribution, was found to perform consistently better than the others. This distribution provides an almost perfect fit to the Wright-Fisher model in terms of selection inference, for a computational cost that does not increase with population size. We further evaluated this model for population sizes not accessible to the Wright-Fisher model and illustrated its performance on a dataset of two divergently selected chicken populations.
Collapse
|
20
|
Librado P, Orlando L. Detecting Signatures of Positive Selection along Defined Branches of a Population Tree Using LSD. Mol Biol Evol 2019; 35:1520-1535. [PMID: 29617830 PMCID: PMC5967574 DOI: 10.1093/molbev/msy053] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Identifying the genomic basis underlying local adaptation is paramount to evolutionary biology, and bears many applications in the fields of conservation biology, crop, and animal breeding, as well as personalized medicine. Although many approaches have been developed to detect signatures of positive selection within single populations and population pairs, the increasing wealth of high-throughput sequencing data requires improved methods capable of handling multiple, and ideally large number of, populations in a single analysis. In this study, we introduce LSD (levels of exclusively shared differences), a fast and flexible framework to perform genome-wide selection scans, along the internal and external branches of a given population tree. We use forward simulations to demonstrate that LSD can identify branches targeted by positive selection with remarkable sensitivity and specificity. We illustrate a range of potential applications by analyzing data from the 1000 Genomes Project and uncover a list of adaptive candidates accompanying the expansion of anatomically modern humans out of Africa and their spread to Europe.
Collapse
Affiliation(s)
- Pablo Librado
- Centre for GeoGenetics, Natural History Museum of Denmark, Copenhagen, Denmark
- Laboratoire d’Anthropobiologie Moléculaire et d’Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, Toulouse, France
- Corresponding author: E-mail:
| | - Ludovic Orlando
- Centre for GeoGenetics, Natural History Museum of Denmark, Copenhagen, Denmark
- Laboratoire d’Anthropobiologie Moléculaire et d’Imagerie de Synthèse, CNRS UMR 5288, Université de Toulouse, Université Paul Sabatier, Toulouse, France
| |
Collapse
|
21
|
Zhao L, Abbasi AB, Illingworth CJR. Mutational load causes stochastic evolutionary outcomes in acute RNA viral infection. Virus Evol 2019; 5:vez008. [PMID: 31024738 PMCID: PMC6476161 DOI: 10.1093/ve/vez008] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Mutational load is known to be of importance for the evolution of RNA viruses, the combination of a high mutation rate and large population size leading to an accumulation of deleterious mutations. However, while the effects of mutational load on global viral populations have been considered, its quantitative effects at the within-host scale of infection are less well understood. We here show that even on the rapid timescale of acute disease, mutational load has an effect on within-host viral adaptation, reducing the effective selection acting upon beneficial variants by ∼10 per cent. Furthermore, mutational load induces considerable stochasticity in the pattern of evolution, causing a more than five-fold uncertainty in the effective fitness of a transmitted beneficial variant. Our work aims to bridge the gap between classic models from population genetic theory and the biology of viral infection. In an advance on some previous models of mutational load, we replace the assumption of a constant variant fitness cost with an experimentally-derived distribution of fitness effects. Expanding previous frameworks for evolutionary simulation, we introduce the Wright-Fisher model with continuous mutation, which describes a continuum of possible modes of replication within a cell. Our results advance our understanding of adaptation in the context of strong selection and a high mutation rate. Despite viral populations having large absolute sizes, critical events in viral adaptation, including antigenic drift and the onset of drug resistance, arise through stochastic evolutionary processes.
Collapse
Affiliation(s)
- Lei Zhao
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Ali B Abbasi
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Christopher J R Illingworth
- Department of Genetics, University of Cambridge, Cambridge, UK
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, UK
| |
Collapse
|
22
|
Lumby CK, Nene NR, Illingworth CJR. A novel framework for inferring parameters of transmission from viral sequence data. PLoS Genet 2018; 14:e1007718. [PMID: 30325921 PMCID: PMC6203404 DOI: 10.1371/journal.pgen.1007718] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2018] [Revised: 10/26/2018] [Accepted: 09/26/2018] [Indexed: 11/18/2022] Open
Abstract
Transmission between hosts is a critical part of the viral lifecycle. Recent studies of viral transmission have used genome sequence data to evaluate the number of particles transmitted between hosts, and the role of selection as it operates during the transmission process. However, the interpretation of sequence data describing transmission events is a challenging task. We here present a novel and comprehensive framework for using short-read sequence data to understand viral transmission events, designed for influenza virus, but adaptable to other viral species. Our approach solves multiple shortcomings of previous methods for this purpose; for example, we consider transmission as an event involving whole viruses, rather than sets of independent alleles. We demonstrate how selection during transmission and noisy sequence data may each affect naive inferences of the population bottleneck, accounting for these in our framework so as to achieve a correct inference. We identify circumstances in which selection for increased viral transmission may or may not be identified from data. Applying our method to experimental data in which transmission occurs in the presence of strong selection, we show that our framework grants a more quantitative insight into transmission events than previous approaches, inferring the bottleneck in a manner that accounts for selection, both for within-host virulence, and for inherent viral transmissibility. Our work provides new opportunities for studying transmission processes in influenza, and by extension, in other infectious diseases.
Collapse
Affiliation(s)
- Casper K. Lumby
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Nuno R. Nene
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Christopher J. R. Illingworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- Department of Applied Mathematics and Theoretical Physics, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
23
|
Rapid sex-specific evolution of age at maturity is shaped by genetic architecture in Atlantic salmon. Nat Ecol Evol 2018; 2:1800-1807. [PMID: 30275465 PMCID: PMC6322654 DOI: 10.1038/s41559-018-0681-5] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2018] [Accepted: 08/29/2018] [Indexed: 11/16/2022]
Abstract
Understanding the mechanisms by which populations adapt to their
environments is a fundamental aim in biology. However, it remains challenging to
identify the genetic basis of traits, provide evidence of genetic changes and
quantify phenotypic responses. Age at maturity in Atlantic salmon represents an
ideal trait to study contemporary adaptive evolution as it has been associated
with a single locus in the vgll3 region, and has also strongly
changed in recent decades. Here, we provide an empirical example of contemporary
adaptive evolution of a large effect locus driving contrasting sex-specific
evolutionary responses at the phenotypic level. We identified an 18% decrease in
the vgll3 allele associated with late maturity
(L) in a large and diverse salmon population over 36 years,
induced by sex-specific selection during the sea migration. Those genetic
changes resulted in a significant evolutionary response in males only, due to
sex-specific dominance patterns and vgll3 allelic effects. The
vgll3 allelic and dominance effects differed greatly in a
second population and were likely to generate different selection and
evolutionary patterns. Our study highlights the importance of knowledge of
genetic architecture to better understand fitness trait evolution and phenotypic
diversity. It also emphasizes the potential role of adaptive evolution in the
trend toward earlier maturation observed in numerous Atlantic salmon populations
worldwide.
Collapse
|
24
|
Inferring Fitness Effects from Time-Resolved Sequence Data with a Delay-Deterministic Model. Genetics 2018; 209:255-264. [PMID: 29500183 PMCID: PMC5937181 DOI: 10.1534/genetics.118.300790] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2018] [Accepted: 02/28/2018] [Indexed: 11/30/2022] Open
Abstract
A broad range of approaches have considered the challenge of inferring selection from time-resolved genome sequence data. Models describing deterministic changes in allele or haplotype frequency have been highlighted as providing accurate and computationally... A common challenge arising from the observation of an evolutionary system over time is to infer the magnitude of selection acting upon a specific genetic variant, or variants, within the population. The inference of selection may be confounded by the effects of genetic drift in a system, leading to the development of inference procedures to account for these effects. However, recent work has suggested that deterministic models of evolution may be effective in capturing the effects of selection even under complex models of demography, suggesting the more general application of deterministic approaches to inference. Responding to this literature, we here note a case in which a deterministic model of evolution may give highly misleading inferences, resulting from the nondeterministic properties of mutation in a finite population. We propose an alternative approach that acts to correct for this error, and which we denote the delay-deterministic model. Applying our model to a simple evolutionary system, we demonstrate its performance in quantifying the extent of selection acting within that system. We further consider the application of our model to sequence data from an evolutionary experiment. We outline scenarios in which our model may produce improved results for the inference of selection, noting that such situations can be easily identified via the use of a regular deterministic model.
Collapse
|
25
|
Inferring sex-specific demographic history from SNP data. PLoS Genet 2018; 14:e1007191. [PMID: 29385127 PMCID: PMC5809101 DOI: 10.1371/journal.pgen.1007191] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Revised: 02/12/2018] [Accepted: 01/08/2018] [Indexed: 12/04/2022] Open
Abstract
The relative female and male contributions to demography are of great importance to better understand the history and dynamics of populations. While earlier studies relied on uniparental markers to investigate sex-specific questions, the increasing amount of sequence data now enables us to take advantage of tens to hundreds of thousands of independent loci from autosomes and the X chromosome. Here, we develop a novel method to estimate effective sex ratios or ESR (defined as the female proportion of the effective population) from allele count data for each branch of a rooted tree topology that summarizes the history of the populations of interest. Our method relies on Kimura’s time-dependent diffusion approximation for genetic drift, and is based on a hierarchical Bayesian model to integrate over the allele frequencies along the branches. We show via simulations that parameters are inferred robustly, even under scenarios that violate some of the model assumptions. Analyzing bovine SNP data, we infer a strongly female-biased ESR in both dairy and beef cattle, as expected from the underlying breeding scheme. Conversely, we observe a strongly male-biased ESR in early domestication times, consistent with an easier taming and management of cows, and/or introgression from wild auroch males, that would both cause a relative increase in male effective population size. In humans, analyzing a subsample of non-African populations, we find a male-biased ESR in Oceanians that may reflect complex marriage patterns in Aboriginal Australians. Because our approach relies on allele count data, it may be applied on a wide range of species. The history of populations and their social organization is often intricate due to breeding structures, migration patterns or population bottlenecks. Estimation of the female proportion of the effective population (sex ratio) is therefore important to better understand this underlying social structure and dynamics. This question has been mainly investigated so far by comparing genetic variation of mitochondrial DNA and the Y chromosome, two uniparentally inherited markers that reflect the demographic history of females and males, respectively. To overcome the intrinsic limitations of these genetic markers, and to take advantage of the increasing amount of sequence data, we propose a new approach that uses large numbers of independent polymorphisms from autosomes and the X chromosome to estimate sex ratios, throughout the history of populations. This method allows us to confirm a strongly female-biased sex ratio in modern dairy and beef cattle breeds. Yet, we find a strongly male-biased sex ratio during domestication times, consistent with an easier taming and management of cows, and/or introgression from wild auroch males. Analyzing human data from a sample of non-African populations, we find a male bias in Oceanians, possibly indicating complex marriage patterns among Aboriginal Australian groups.
Collapse
|
26
|
R Nené N, Mustonen V, J R Illingworth C. Evaluating genetic drift in time-series evolutionary analysis. J Theor Biol 2018; 437:51-57. [PMID: 28958783 PMCID: PMC5703635 DOI: 10.1016/j.jtbi.2017.09.021] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Revised: 06/20/2017] [Accepted: 09/18/2017] [Indexed: 11/15/2022]
Abstract
The Wright-Fisher model is the most popular population model for describing the behaviour of evolutionary systems with a finite population size. Approximations have commonly been used but the model itself has rarely been tested against time-resolved genomic data. Here, we evaluate the extent to which it can be inferred as the correct model under a likelihood framework. Given genome-wide data from an evolutionary experiment, we validate the Wright-Fisher drift model as the better option for describing evolutionary trajectories in a finite population. This was found by evaluating its performance against a Gaussian model of allele frequency propagation. However, we note a range of circumstances under which standard Wright-Fisher drift cannot be correctly identified.
Collapse
Affiliation(s)
- Nuno R Nené
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Ville Mustonen
- Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK; Department of Biosciences, Department of Computer Science, Institute of Biotechnology, University of Helsinki, Helsinki 00014, Finland
| | | |
Collapse
|
27
|
Effects of the Ordering of Natural Selection and Population Regulation Mechanisms on Wright-Fisher Models. G3-GENES GENOMES GENETICS 2017; 7:2095-2106. [PMID: 28500051 PMCID: PMC5499119 DOI: 10.1534/g3.117.041038] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
We explore the effect of different mechanisms of natural selection on the evolution of populations for one- and two-locus systems. We compare the effect of viability and fecundity selection in the context of the Wright-Fisher model with selection under the assumption of multiplicative fitness. We show that these two modes of natural selection correspond to different orderings of the processes of population regulation and natural selection in the Wright-Fisher model. We find that under the Wright-Fisher model these two different orderings can affect the distribution of trajectories of haplotype frequencies evolving with genetic recombination. However, the difference in the distribution of trajectories is only appreciable when the population is in significant linkage disequilibrium. We find that as linkage disequilibrium decays the trajectories for the two different models rapidly become indistinguishable. We discuss the significance of these findings in terms of biological examples of viability and fecundity selection, and speculate that the effect may be significant when factors such as gene migration maintain a degree of linkage disequilibrium.
Collapse
|