1
|
Steiner MC, Rice DP, Biddanda A, Ianni-Ravn MK, Porras C, Novembre J. Study design and the sampling of deleterious rare variants in biobank-scale datasets. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.12.02.626424. [PMID: 39677632 PMCID: PMC11642817 DOI: 10.1101/2024.12.02.626424] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2024]
Abstract
One key component of study design in population genetics is the "geographic breadth" of a sample (i.e., how broad a region across which individuals are sampled). How the geographic breadth of a sample impacts observations of rare, deleterious variants is unclear, even though such variants are of particular interest for biomedical and evolutionary applications. Here, in order to gain insight into the effects of sample design on ascertained genetic variants, we formulate a stochastic model of dispersal, genetic drift, selection, mutation, and geographically concentrated sampling. We use this model to understand the effects of the geographic breadth of sampling effort on the discovery of negatively selected variants. We find that samples which are more geographically broad will discover a greater number variants as compared geographically narrow samples (an effect we label "discovery"); though the variants will be detected at lower average frequency than in narrow samples (e.g. as singletons, an effect we label "dilution"). Importantly, these effects are amplified for larger sample sizes and moderated by the magnitude of fitness effects. We validate these results using both population genetic simulations and empirical analyses in the UK Biobank. Our results are particularly important in two contexts: the association of large-effect rare variants with particular phenotypes and the inference of negative selection from allele frequency data. Overall, our findings emphasize the importance of considering geographic breadth when designing and carrying out genetic studies, especially at biobank scale.
Collapse
Affiliation(s)
| | - Daniel P. Rice
- Media Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139
- SecureBio, Cambridge, MA 02142
| | - Arjun Biddanda
- Department of Biology, Johns Hopkins University, Baltimore, MD 21218
| | | | - Christian Porras
- Department of Genetics and Genomic Sciences, Icahn School of Medicine, New York, NY 10029
| | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL 60637
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637
| |
Collapse
|
2
|
Chotai M, Wei X, Messer PW. Signatures of selective sweeps in continuous-space populations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.26.605365. [PMID: 39091822 PMCID: PMC11291165 DOI: 10.1101/2024.07.26.605365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Selective sweeps describe the process by which an adaptive mutation arises and rapidly fixes in the population, thereby removing genetic variation in its genomic vicinity. The expected signatures of selective sweeps are relatively well understood in panmictic population models, yet natural populations often extend across larger geographic ranges where individuals are more likely to mate with those born nearby. To investigate how such spatial population structure can affect sweep dynamics and signatures, we simulated selective sweeps in populations inhabiting a two-dimensional continuous landscape. The maximum dispersal distance of offspring from their parents can be varied in our simulations from an essentially panmictic population to scenarios with increasingly limited dispersal. We find that in low-dispersal populations, adaptive mutations spread more slowly than in panmictic ones, while recombination becomes less effective at breaking up genetic linkage around the sweep locus. Together, these factors result in a trough of reduced genetic diversity around the sweep locus that looks very similar across dispersal rates. We also find that the site frequency spectrum around hard sweeps in low-dispersal populations becomes enriched for intermediate-frequency variants, making these sweeps appear softer than they are. Furthermore, haplotype heterozygosity at the sweep locus tends to be elevated in low-dispersal scenarios as compared to panmixia, contrary to what we observe in neutral scenarios without sweeps. The haplotype patterns generated by these hard sweeps in low-dispersal populations can resemble soft sweeps from standing genetic variation that arose from substantially older alleles. Our results highlight the need for better accounting for spatial population structure when making inferences about selective sweeps.
Collapse
Affiliation(s)
- Meera Chotai
- Department of Computational Biology, Cornell University
| | - Xinzhu Wei
- Department of Computational Biology, Cornell University
| | | |
Collapse
|
3
|
Angst P, Haag CR, Ben-Ami F, Fields PD, Ebert D. Genome-Wide Allele Frequency Changes Reveal That Dynamic Metapopulations Evolve Differently. Mol Biol Evol 2024; 41:msae128. [PMID: 38935572 PMCID: PMC11229820 DOI: 10.1093/molbev/msae128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 06/14/2024] [Accepted: 06/20/2024] [Indexed: 06/29/2024] Open
Abstract
Two important characteristics of metapopulations are extinction-(re)colonization dynamics and gene flow between subpopulations. These processes can cause strong shifts in genome-wide allele frequencies that are generally not observed in "classical" (large, stable, and panmictic) populations. Subpopulations founded by one or a few individuals, the so-called propagule model, are initially expected to show intermediate allele frequencies at polymorphic sites until natural selection and genetic drift drive allele frequencies toward a mutation-selection-drift equilibrium characterized by a negative exponential-like distribution of the site frequency spectrum. We followed changes in site frequency spectrum distribution in a natural metapopulation of the cyclically parthenogenetic pond-dwelling microcrustacean Daphnia magna using biannual pool-seq samples collected over a 5-yr period from 118 ponds occupied by subpopulations of known age. As expected under the propagule model, site frequency spectra in newly founded subpopulations trended toward intermediate allele frequencies and shifted toward right-skewed distributions as the populations aged. Immigration and subsequent hybrid vigor altered this dynamic. We show that the analysis of site frequency spectrum dynamics is a powerful approach to understand evolution in metapopulations. It allowed us to disentangle evolutionary processes occurring in a natural metapopulation, where many subpopulations evolve in parallel. Thereby, stochastic processes like founder and immigration events lead to a pattern of subpopulation divergence, while genetic drift leads to converging site frequency spectrum distributions in the persisting subpopulations. The observed processes are well explained by the propagule model and highlight that metapopulations evolve differently from classical populations.
Collapse
Affiliation(s)
- Pascal Angst
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
| | - Christoph R Haag
- CEFE, Université de Montpellier, CNRS, EPHE, IRD, Montpellier 34293, France
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Frida Ben-Ami
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
- School of Zoology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Peter D Fields
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| | - Dieter Ebert
- Department of Environmental Sciences, Zoology, University of Basel, Basel 4051, Switzerland
- Tvärminne Zoological Station, University of Helsinki, Hanko 10900, Finland
| |
Collapse
|
4
|
Ye Z, Pfrender ME, Lynch M. Evolutionary Genomics of Sister Species Differing in Effective Population Sizes and Recombination Rates. Genome Biol Evol 2023; 15:evad202. [PMID: 37946625 PMCID: PMC10664402 DOI: 10.1093/gbe/evad202] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Revised: 10/16/2023] [Accepted: 11/02/2023] [Indexed: 11/12/2023] Open
Abstract
Studies of closely related species with known ecological differences provide exceptional opportunities for understanding the genetic mechanisms of evolution. In this study, we compared population-genomics data between Daphnia pulex and Daphnia pulicaria, two reproductively compatible sister species experiencing ecological speciation, the first largely confined to intermittent ponds and the second to permanent lakes in the same geographic region. Daphnia pulicaria has lower genome-wide nucleotide diversity, a smaller effective population size, a higher incidence of private alleles, and a substantially more linkage disequilibrium than D. pulex. Positively selected genes in D. pulicaria are enriched in potentially aging-related categories such as cellular homeostasis, which may explain the extended life span in D. pulicaria. We also found that opsin-related genes, which may mediate photoperiodic responses, are under different selection pressures in these two species. Genes involved in mitochondrial functions, ribosomes, and responses to environmental stimuli are found to be under positive selection in both species. Additionally, we found that the two species have similar average evolutionary rates at the DNA-sequence level, although approximately 160 genes have significantly different rates in the two lineages. Our results provide insights into the physiological traits that differ within this regionally sympatric sister-species pair that occupies unique microhabitats.
Collapse
Affiliation(s)
- Zhiqiang Ye
- Hubei Key Laboratory of Genetic Regulation and Integrative Biology, School of Life Sciences, Central China Normal University, Wuhan, China
| | - Michael E Pfrender
- Department of Biological Sciences, University of Notre Dame, Notre Dame, Indiana, USA
| | - Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, Arizona, USA
| |
Collapse
|
5
|
Medina-Muñoz SG, Ortega-Del Vecchyo D, Cruz-Hervert LP, Ferreyra-Reyes L, García-García L, Moreno-Estrada A, Ragsdale AP. Demographic modeling of admixed Latin American populations from whole genomes. Am J Hum Genet 2023; 110:1804-1816. [PMID: 37725976 PMCID: PMC10577084 DOI: 10.1016/j.ajhg.2023.08.015] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 08/17/2023] [Accepted: 08/23/2023] [Indexed: 09/21/2023] Open
Abstract
Demographic models of Latin American populations often fail to fully capture their complex evolutionary history, which has been shaped by both recent admixture and deeper-in-time demographic events. To address this gap, we used high-coverage whole-genome data from Indigenous American ancestries in present-day Mexico and existing genomes from across Latin America to infer multiple demographic models that capture the impact of different timescales on genetic diversity. Our approach, which combines analyses of allele frequencies and ancestry tract length distributions, represents a significant improvement over current models in predicting patterns of genetic variation in admixed Latin American populations. We jointly modeled the contribution of European, African, East Asian, and Indigenous American ancestries into present-day Latin American populations. We infer that the ancestors of Indigenous Americans and East Asians diverged ∼30 thousand years ago, and we characterize genetic contributions of recent migrations from East and Southeast Asia to Peru and Mexico. Our inferred demographic histories are consistent across different genomic regions and annotations, suggesting that our inferences are robust to the potential effects of linked selection. In conjunction with published distributions of fitness effects for new nonsynonymous mutations in humans, we show in large-scale simulations that our models recover important features of both neutral and deleterious variation. By providing a more realistic framework for understanding the evolutionary history of Latin American populations, our models can help address the historical under-representation of admixed groups in genomics research and can be a valuable resource for future studies of populations with complex admixture and demographic histories.
Collapse
Affiliation(s)
- Santiago G Medina-Muñoz
- National Laboratory of Genomics for Biodiversity (LANGEBIO), Advanced Genomics Unit (UGA), CINVESTAV, Irapuato, Guanajuato 36824, Mexico
| | - Diego Ortega-Del Vecchyo
- Laboratorio Internacional de Investigación sobre el Genoma Humano, Universidad Nacional Autónoma de Mexico, Juriquilla, Querétaro 76230, Mexico
| | | | | | | | - Andrés Moreno-Estrada
- National Laboratory of Genomics for Biodiversity (LANGEBIO), Advanced Genomics Unit (UGA), CINVESTAV, Irapuato, Guanajuato 36824, Mexico.
| | - Aaron P Ragsdale
- National Laboratory of Genomics for Biodiversity (LANGEBIO), Advanced Genomics Unit (UGA), CINVESTAV, Irapuato, Guanajuato 36824, Mexico; Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
6
|
Gloss AD, Steiner MC, Novembre J, Bergelson J. The design of mapping populations: Impacts of geographic scale on genetic architecture and mapping efficacy for defense and immunity. CURRENT OPINION IN PLANT BIOLOGY 2023; 74:102399. [PMID: 37307746 PMCID: PMC10441534 DOI: 10.1016/j.pbi.2023.102399] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/27/2022] [Revised: 04/29/2023] [Accepted: 05/15/2023] [Indexed: 06/14/2023]
Abstract
Genome-wide association studies (GWAS) have yielded tremendous insight into the genetic architecture of trait variation. However, the collections of loci they uncover are far from exhaustive. As many of the complicating factors that confound or limit the efficacy of GWAS are exaggerated over broad geographic scales, a shift toward more analyses using mapping panels sampled from narrow geographic localities ("local" populations) could provide novel, complementary insights. Here, we present an overview of the major complicating factors, review mounting evidence from genomic analyses that these factors are pervasive, and synthesize theoretical and empirical evidence for the power of GWAS in local populations.
Collapse
Affiliation(s)
- Andrew D Gloss
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, USA.
| | | | - John Novembre
- Department of Human Genetics, University of Chicago, Chicago, IL, USA; Department of Ecology & Evolution, University of Chicago, Chicago, IL, USA
| | - Joy Bergelson
- Department of Biology, Center for Genomics and Systems Biology, New York University, New York, NY, USA.
| |
Collapse
|
7
|
Lynch M, Ye Z, Urban L, Maruki T, Wei W. The Linkage-Disequilibrium and Recombinational Landscape in Daphnia pulex. Genome Biol Evol 2022; 14:evac145. [PMID: 36170345 PMCID: PMC9642108 DOI: 10.1093/gbe/evac145] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/20/2022] [Indexed: 11/24/2022] Open
Abstract
By revealing the influence of recombinational activity beyond what can be achieved with controlled crosses, measures of linkage disequilibrium (LD) in natural populations provide a powerful means of defining the recombinational landscape within which genes evolve. In one of the most comprehensive studies of this sort ever performed, involving whole-genome analyses on nearly 1,000 individuals of the cyclically parthenogenetic microcrustacean Daphnia pulex, the data suggest a relatively uniform pattern of recombination across the genome. Patterns of LD are quite consistent among populations; average rates of recombination are quite similar for all chromosomes; and although some chromosomal regions have elevated recombination rates, the degree of inflation is not large, and the overall spatial pattern of recombination is close to the random expectation. Contrary to expectations for models in which crossing-over is the primary mechanism of recombination, and consistent with data for other species, the distance-dependent pattern of LD indicates excessively high levels at both short and long distances and unexpectedly low levels of decay at long distances, suggesting significant roles for factors such as nonindependent mutation, population subdivision, and recombination mechanisms unassociated with crossing over. These observations raise issues regarding the classical LD equilibrium model widely applied in population genetics to infer recombination rates across various length scales on chromosomes.
Collapse
Affiliation(s)
- Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Zhiqiang Ye
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Lina Urban
- Department for Evolutionary Genetics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
| | - Takahiro Maruki
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Wen Wei
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
8
|
Abstract
The ways in which genetic variation is distributed within and among populations is a key determinant of the evolutionary features of a species. However, most comprehensive studies of these features have been restricted to studies of subdivision in settings known to have been driven by local adaptation, leaving our understanding of the natural dispersion of allelic variation less than ideal. Here, we present a geographic population-genomic analysis of 10 populations of the freshwater microcrustacean Daphnia pulex, an emerging model system in evolutionary genomics. These populations exhibit a pattern of moderate isolation-by-distance, with an average migration rate of 0.6 individuals per generation, and average effective population sizes of ∼650,000 individuals. Most populations contain numerous private alleles, and genomic scans highlight the presence of islands of excessively high population subdivision for more common alleles. A large fraction of such islands of population divergence likely reflect historical neutral changes, including rare stochastic migration and hybridization events. The data do point to local adaptive divergence, although the precise nature of the relevant variation is diffuse and cannot be associated with particular loci, despite the very large sample sizes involved in this study. In contrast, an analysis of between-species divergence highlights positive selection operating on a large set of genes with functions nearly nonoverlapping with those involved in local adaptation, in particular ribosome structure, mitochondrial bioenergetics, light reception and response, detoxification, and gene regulation. These results set the stage for using D. pulex as a model for understanding the relationship between molecular and cellular evolution in the context of natural environments.
Collapse
Affiliation(s)
- Takahiro Maruki
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Zhiqiang Ye
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| | - Michael Lynch
- Biodesign Center for Mechanisms of Evolution, Arizona State University, Tempe, AZ 85287, USA
| |
Collapse
|
9
|
Koropoulis A, Alachiotis N, Pavlidis P. Detecting Positive Selection in Populations Using Genetic Data. Methods Mol Biol 2020; 2090:87-123. [PMID: 31975165 DOI: 10.1007/978-1-0716-0199-0_5] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
High-throughput genomic sequencing allows to disentangle the evolutionary forces acting in populations. Among evolutionary forces, positive selection has received a lot of attention because it is related to the adaptation of populations in their environments, both biotic and abiotic. Positive selection, also known as Darwinian selection, occurs when an allele is favored by natural selection. The frequency of the favored allele increases in the population and, due to genetic hitchhiking, neighboring linked variation diminishes, creating so-called selective sweeps. Such a process leaves traces in genomes that can be detected in a future time point. Detecting traces of positive selection in genomes is achieved by searching for signatures introduced by selective sweeps, such as regions of reduced variation, a specific shift of the site frequency spectrum, and particular linkage disequilibrium (LD) patterns in the region. A variety of approaches can be used for detecting selective sweeps, ranging from simple implementations that compute summary statistics to more advanced statistical approaches, e.g., Bayesian approaches, maximum-likelihood-based methods, and machine learning methods. In this chapter, we discuss selective sweep detection methodologies on the basis of their capacity to analyze whole genomes or just subgenomic regions, and on the specific polymorphism patterns they exploit as selective sweep signatures. We also summarize the results of comparisons among five open-source software releases (SweeD, SweepFinder, SweepFinder2, OmegaPlus, and RAiSD) regarding sensitivity, specificity, and execution times. Furthermore, we test and discuss machine learning methods and present a thorough performance analysis. In equilibrium neutral models or mild bottlenecks, most methods are able to detect selective sweeps accurately. Methods and tools that rely on linkage disequilibrium (LD) rather than single SNPs exhibit higher true positive rates than the site frequency spectrum (SFS)-based methods under the model of a single sweep or recurrent hitchhiking. However, their false positive rate is elevated when a misspecified demographic model is used to build the distribution of the statistic under the null hypothesis. Both LD and SFS-based approaches suffer from decreased accuracy on localizing the true target of selection in bottleneck scenarios. Furthermore, we present an extensive analysis of the effects of gene flow on selective sweep detection, a problem that has been understudied in selective sweep literature.
Collapse
Affiliation(s)
- Angelos Koropoulis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
- Computer Science Department, University of Crete, Crete, Heraklion, Greece
| | - Nikolaos Alachiotis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece
| | - Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology Hellas, Heraklion, Greece.
| |
Collapse
|
10
|
Koskela J, Wilke Berenguer M. Robust model selection between population growth and multiple merger coalescents. Math Biosci 2019; 311:1-12. [PMID: 30851276 DOI: 10.1016/j.mbs.2019.03.004] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2018] [Revised: 03/05/2019] [Accepted: 03/05/2019] [Indexed: 11/24/2022]
Abstract
We study the effect of biological confounders on the model selection problem between Kingman coalescents with population growth, and Ξ-coalescents involving simultaneous multiple mergers. We use a low dimensional, computationally tractable summary statistic, dubbed the singleton-tail statistic, to carry out approximate likelihood ratio tests between these model classes. The singleton-tail statistic has been shown to distinguish between them with high power in the simple setting of neutrally evolving, panmictic populations without recombination. We extend this work by showing that cryptic recombination and selection do not diminish the power of the test, but that misspecifying population structure does. Furthermore, we demonstrate that the singleton-tail statistic can also solve the more challenging model selection problem between multiple mergers due to selective sweeps, and multiple mergers due to high fecundity with moderate power of up to 30%.
Collapse
Affiliation(s)
- Jere Koskela
- Department of Statistics, University of Warwick, Coventry CV4 7AL, UK.
| | - Maite Wilke Berenguer
- Fakultät für Mathematik, Ruhr Universität Bochum, Universitätstraße 150, Bochum 44780, Germany.
| |
Collapse
|
11
|
Pavlidis P, Alachiotis N. A survey of methods and tools to detect recent and strong positive selection. ACTA ACUST UNITED AC 2017; 24:7. [PMID: 28405579 PMCID: PMC5385031 DOI: 10.1186/s40709-017-0064-0] [Citation(s) in RCA: 68] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2016] [Accepted: 03/29/2017] [Indexed: 01/25/2023]
Abstract
Positive selection occurs when an allele is favored by natural selection. The frequency of the favored allele increases in the population and due to genetic hitchhiking the neighboring linked variation diminishes, creating so-called selective sweeps. Detecting traces of positive selection in genomes is achieved by searching for signatures introduced by selective sweeps, such as regions of reduced variation, a specific shift of the site frequency spectrum, and particular LD patterns in the region. A variety of methods and tools can be used for detecting sweeps, ranging from simple implementations that compute summary statistics such as Tajima's D, to more advanced statistical approaches that use combinations of statistics, maximum likelihood, machine learning etc. In this survey, we present and discuss summary statistics and software tools, and classify them based on the selective sweep signature they detect, i.e., SFS-based vs. LD-based, as well as their capacity to analyze whole genomes or just subgenomic regions. Additionally, we summarize the results of comparisons among four open-source software releases (SweeD, SweepFinder, SweepFinder2, and OmegaPlus) regarding sensitivity, specificity, and execution times. In equilibrium neutral models or mild bottlenecks, both SFS- and LD-based methods are able to detect selective sweeps accurately. Methods and tools that rely on LD exhibit higher true positive rates than SFS-based ones under the model of a single sweep or recurrent hitchhiking. However, their false positive rate is elevated when a misspecified demographic model is used to represent the null hypothesis. When the correct (or similar to the correct) demographic model is used instead, the false positive rates are considerably reduced. The accuracy of detecting the true target of selection is decreased in bottleneck scenarios. In terms of execution time, LD-based methods are typically faster than SFS-based methods, due to the nature of required arithmetic.
Collapse
Affiliation(s)
- Pavlos Pavlidis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 70013 Crete, Greece
| | - Nikolaos Alachiotis
- Institute of Computer Science, Foundation for Research and Technology-Hellas, 70013 Crete, Greece
| |
Collapse
|
12
|
Theunert C, Slatkin M. Distinguishing recent admixture from ancestral population structure. Genome Biol Evol 2017; 9:2982377. [PMID: 28186554 PMCID: PMC5381645 DOI: 10.1093/gbe/evx018] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2016] [Revised: 01/17/2017] [Accepted: 02/04/2017] [Indexed: 12/19/2022] Open
Abstract
We develop and test two methods for distinguishing between recent admixture and ancestral population structure as explanations for greater similarity of one of two populations to an outgroup population. This problem arose when Neanderthals were found to be slightly more similar to nonAfrican than to African populations. The excess similarity is consistent with both recent admixture from Neanderthals into the ancestors of nonAfricans and subdivision in the ancestral population. Although later studies showed that there had been recent admixture, distinguishing between these two classes of models will be important in other situations, particularly when high-coverage genomes cannot be obtained for all populations. One of our two methods is based on the properties of the doubly conditioned frequency spectrum combined with the unconditional frequency spectrum. This method does not require a linkage map and can be used when there is relatively low coverage. The second method uses the extent of linkage disequilibrium among closely linked markers.
Collapse
Affiliation(s)
- Christoph Theunert
- Department of Integrative Biology, University of California, Berkeley
- Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
| | | |
Collapse
|
13
|
Zhang D, Ye Z, Yamada K, Zhen Y, Zheng C, Bu W. Pleistocene sea level fluctuation and host plant habitat requirement influenced the historical phylogeography of the invasive species Amphiareus obscuriceps (Hemiptera: Anthocoridae) in its native range. BMC Evol Biol 2016; 16:174. [PMID: 27582259 PMCID: PMC5007872 DOI: 10.1186/s12862-016-0748-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2016] [Accepted: 08/18/2016] [Indexed: 11/22/2022] Open
Abstract
BACKGROUND On account of repeated exposure and submergence of the East China Sea (ECS) land bridge, sea level fluctuation played an important role in shaping the population structure of many temperate species across the ECS during the glacial period. The flower bug Amphiareus obscuriceps (Poppius, 1909) (Hemiptera: Anthocoridae) is an invasive species native to the Sino-Japanese Region (SJR) of East Asia. We tested the hypothesis of the ECS land bridge acting as a dispersal corridor or filter for A. obscuriceps during the glacial period. Specifically, we tested whether and the extent to which dispersal ability and host plant habitat requirement influenced the genetic structure of A. obscuriceps during the exposure of the ECS land bridge. RESULTS Phylogenetic and network analyses indicated that A. obscuriceps is composed of two major lineages, i.e., China and Japan. Divergence time on both sides of the ECS was estimated to be approximately 1.07 (0.79-1.32) Ma, which was about the same period that the sea level increased. No significant Isolation by Distance (IBD) relationship was found between Фst and Euclidean distances in the Mantel tests, which is consistent with the hypothesis that this species has a good dispersal ability. Our Last Glacial Maximum (LGM) niche modeling of plants that constitute preferred habitats for A. obscuriceps exhibited a similar habitat gap on the exposed ECS continental shelf between China and Japan, but showed a continuous distribution across the Taiwan Strait. CONCLUSION Our results suggest that ecological properties (habitat requirement and dispersal ability), together with sea level fluctuation during the Pleistocene across the ECS, have shaped the genetic structure and demographic history of A. obscuriceps in its native area. The host plant habitat requirement could also be a key to the colonization of the A. obscuriceps species during the exposure of the ECS land bridge. Our findings will shed light on the potential role of habitat requirement in the process of biological invasion in future studies.
Collapse
Affiliation(s)
- Danli Zhang
- Institute of Entomology, College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin, 300071 China
| | - Zhen Ye
- Institute of Entomology, College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin, 300071 China
| | - Kazutaka Yamada
- Tokushima Prefectural Museum, Bunka-no-Mori Park, Hachiman-chô, Tokushima 770-8070 Japan
| | - Yahui Zhen
- Institute of Entomology, College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin, 300071 China
| | - Chenguang Zheng
- Institute of Entomology, College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin, 300071 China
| | - Wenjun Bu
- Institute of Entomology, College of Life Sciences, Nankai University, 94 Weijin Road, Tianjin, 300071 China
| |
Collapse
|
14
|
Duforet-Frebourg N, Slatkin M. Isolation-by-distance-and-time in a stepping-stone model. Theor Popul Biol 2016; 108:24-35. [PMID: 26592162 PMCID: PMC4779737 DOI: 10.1016/j.tpb.2015.11.003] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2015] [Revised: 10/26/2015] [Accepted: 11/03/2015] [Indexed: 01/30/2023]
Abstract
With the great advances in ancient DNA extraction, genetic data are now obtained from geographically separated individuals from both present and past. However, population genetics theory about the joint effect of space and time has not been thoroughly studied. Based on the classical stepping-stone model, we develop the theory of Isolation by distance and time. We derive the correlation of allele frequencies between demes in the case where ancient samples are present, and investigate the impact of edge effects with forward-in-time simulations. We also derive results about coalescent times in circular and toroidal models. As one of the most common ways to investigate population structure is principal components analysis (PCA), we evaluate the impact of our theory on PCA plots. Our results demonstrate that time between samples is an important factor. Ancient samples tend to be drawn to the center of a PCA plot.
Collapse
Affiliation(s)
- Nicolas Duforet-Frebourg
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, United States.
| | - Montgomery Slatkin
- Department of Integrative Biology, University of California Berkeley, Berkeley, CA 94720, United States
| |
Collapse
|
15
|
O'Loughlin SM, Magesa SM, Mbogo C, Mosha F, Midega J, Burt A. Genomic signatures of population decline in the malaria mosquito Anopheles gambiae. Malar J 2016; 15:182. [PMID: 27013475 PMCID: PMC4806450 DOI: 10.1186/s12936-016-1214-9] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2015] [Accepted: 03/05/2016] [Indexed: 01/15/2023] Open
Abstract
Background Population genomic features such as nucleotide diversity and linkage disequilibrium are expected to be strongly shaped by changes in population size, and might therefore be useful for monitoring the success of a control campaign. In the Kilifi district of Kenya, there has been a marked decline in the abundance of the malaria vector Anopheles gambiae subsequent to the rollout of insecticide-treated bed nets. Methods To investigate whether this decline left a detectable population genomic signature, simulations were performed to compare the effect of population crashes on nucleotide diversity, Tajima’s D, and linkage disequilibrium (as measured by the population recombination parameter ρ). Linkage disequilibrium and ρ were estimated for An. gambiae from Kilifi, and compared them to values for Anopheles arabiensis and Anopheles merus at the same location, and for An. gambiae in a location 200 km from Kilifi. Results In the first simulations ρ changed more rapidly after a population crash than the other statistics, and therefore is a more sensitive indicator of recent population decline. In the empirical data, linkage disequilibrium extends 100–1000 times further, and ρ is 100–1000 times smaller, for the Kilifi population of An. gambiae than for any of the other populations. There were also significant runs of homozygosity in many of the individual An. gambiae mosquitoes from Kilifi. Conclusions These results support the hypothesis that the recent decline in An. gambiae was driven by the rollout of bed nets. Measuring population genomic parameters in a small sample of individuals before, during and after vector or pest control may be a valuable method of tracking the effectiveness of interventions. Electronic supplementary material The online version of this article (doi:10.1186/s12936-016-1214-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Samantha M O'Loughlin
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, SL5 7PY, UK.
| | - Stephen M Magesa
- NIMR Amani Research Centre, P.O. Box 81, Muheza, Tanzania.,Global Health Division, RTI International, Dar es Salaam, Tanzania
| | - Charles Mbogo
- Centre for Geographic Medicine Research Coast, Kenya Medical Research Institute, P.O. Box 428, Kilifi, Kenya.,Malaria Public Health Department, Centre for Geographic Medicine, KEMRI-Wellcome Trust Research Programme, Kenyatta National Hospital Grounds, P.O. Box 43640-00100, Nairobi, Kenya
| | - Franklin Mosha
- Kilimanjaro Christian Medical University College, Moshi, Tanzania
| | - Janet Midega
- Centre for Geographic Medicine Research Coast, Kenya Medical Research Institute, P.O. Box 428, Kilifi, Kenya.,Department of Life Sciences, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK.,Wellcome Trust Centre for Human Genetics, Oxford, OX3 7BN, UK
| | - Austin Burt
- Department of Life Sciences, Imperial College London, Silwood Park, Ascot, SL5 7PY, UK
| |
Collapse
|
16
|
Bradburd GS, Ralph PL, Coop GM. A Spatial Framework for Understanding Population Structure and Admixture. PLoS Genet 2016; 12:e1005703. [PMID: 26771578 PMCID: PMC4714911 DOI: 10.1371/journal.pgen.1005703] [Citation(s) in RCA: 89] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2015] [Accepted: 11/05/2015] [Indexed: 01/26/2023] Open
Abstract
Geographic patterns of genetic variation within modern populations, produced by complex histories of migration, can be difficult to infer and visually summarize. A general consequence of geographically limited dispersal is that samples from nearby locations tend to be more closely related than samples from distant locations, and so genetic covariance often recapitulates geographic proximity. We use genome-wide polymorphism data to build "geogenetic maps," which, when applied to stationary populations, produces a map of the geographic positions of the populations, but with distances distorted to reflect historical rates of gene flow. In the underlying model, allele frequency covariance is a decreasing function of geogenetic distance, and nonlocal gene flow such as admixture can be identified as anomalously strong covariance over long distances. This admixture is explicitly co-estimated and depicted as arrows, from the source of admixture to the recipient, on the geogenetic map. We demonstrate the utility of this method on a circum-Tibetan sampling of the greenish warbler (Phylloscopus trochiloides), in which we find evidence for gene flow between the adjacent, terminal populations of the ring species. We also analyze a global sampling of human populations, for which we largely recover the geography of the sampling, with support for significant histories of admixture in many samples. This new tool for understanding and visualizing patterns of population structure is implemented in a Bayesian framework in the program SpaceMix.
Collapse
Affiliation(s)
- Gideon S. Bradburd
- Center for Population Biology, Department of Evolution and Ecology, University of California, Davis, California, United States of America
| | - Peter L. Ralph
- Department of Molecular and Computational Biology, University of Southern California, Los Angeles, California, United States of America
| | - Graham M. Coop
- Center for Population Biology, Department of Evolution and Ecology, University of California, Davis, California, United States of America
| |
Collapse
|
17
|
Zhang L, Li H, Li S, Zhang A, Kou F, Xun H, Wang P, Wang Y, Song F, Cui J, Cui J, Gouge DH, Cai W. Phylogeographic structure of cotton pest Adelphocoris suturalis (Hemiptera: Miridae): strong subdivision in China inferred from mtDNA and rDNA ITS markers. Sci Rep 2015; 5:14009. [PMID: 26388034 PMCID: PMC4585665 DOI: 10.1038/srep14009] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2015] [Accepted: 08/13/2015] [Indexed: 11/09/2022] Open
Abstract
Phylogeographic patterns of some extant plant and vertebrate species have been well studied; however, they are poorly understood in the majority of insects. The study documents analysis of mitochondrial (COI, CYTB and ND5) and nuclear (5.8S rDNA, ITS2 and 28S rDNA) data from 419 individuals of Adelphocoris suturalis, which is one of the main cotton pests found in the 31 locations in China and Japan involved in the study. Results show that the species is highly differentiated between populations from central China and peripheral China regions. Analysis of molecular variance showed a high level of geographical differentiation at different hierarchical levels. Isolation-by-distance test showed no significant correlation between genetic distance and geographical distance among A. suturalis populations, which suggested gene flow is not restricted by distance. In seven peripheral populations, the high levels of genetic differentiation and the small Nem values implied that geographic barriers were more likely restrict gene flow. Neutrality tests and the Bayesian skyline plot suggested population expansion likely happened during the cooling transition between Last Interglacial and Last Glacial Maximum. All lines of evidence suggest that physical barriers, Pleistocene climatic oscillations and geographical heterogeneity have affected the population structure and distribution of this insect in China.
Collapse
Affiliation(s)
- Lijuan Zhang
- Department of Entomology, China Agricultural University, Beijing 100193, China.,Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan 455000, China
| | - Hu Li
- Department of Entomology, China Agricultural University, Beijing 100193, China
| | - Shujuan Li
- Maricopa Agricultural Center, University of Arizona, Maricopa, AZ 85138, USA
| | - Aibing Zhang
- College of Life Sciences, Capital Normal University, Beijing 100048, China
| | - Fei Kou
- Department of Entomology, China Agricultural University, Beijing 100193, China
| | - Huaizhu Xun
- Department of Entomology, China Agricultural University, Beijing 100193, China
| | - Pei Wang
- Key Laboratory of Molluscan Quarantine and Identification of AQSIQ, Fujian Entry-Exit Inspection &Quarantine Bureau, Fuzhou, Fujian 350001, China
| | - Ying Wang
- Henan Institute of Science and Technology, Xinxiang, Henan 453003, China
| | - Fan Song
- Department of Entomology, China Agricultural University, Beijing 100193, China
| | - Jianxin Cui
- Henan Institute of Science and Technology, Xinxiang, Henan 453003, China
| | - Jinjie Cui
- Cotton Research Institute, Chinese Academy of Agricultural Sciences, Anyang, Henan 455000, China
| | - Dawn H Gouge
- Department of Entomology, University of Arizona, Maricopa, AZ 85721, USA
| | - Wanzhi Cai
- Department of Entomology, China Agricultural University, Beijing 100193, China
| |
Collapse
|
18
|
Pyhäjärvi T, Hufford MB, Mezmouk S, Ross-Ibarra J. Complex patterns of local adaptation in teosinte. Genome Biol Evol 2014; 5:1594-609. [PMID: 23902747 PMCID: PMC3787665 DOI: 10.1093/gbe/evt109] [Citation(s) in RCA: 115] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
Populations of widely distributed species encounter and must adapt to local environmental conditions. However, comprehensive characterization of the genetic basis of adaptation is demanding, requiring genome-wide genotype data, multiple sampled populations, and an understanding of population structure and potential selection pressures. Here, we used single-nucleotide polymorphism genotyping and data on numerous environmental variables to describe the genetic basis of local adaptation in 21 populations of teosinte, the wild ancestor of maize. We found complex hierarchical genetic structure created by altitude, dispersal events, and admixture among subspecies, which complicated identification of locally beneficial alleles. Patterns of linkage disequilibrium revealed four large putative inversion polymorphisms showing clinal patterns of frequency. Population differentiation and environmental correlations suggest that both inversions and intergenic polymorphisms are involved in local adaptation.
Collapse
Affiliation(s)
- Tanja Pyhäjärvi
- Department of Plant Sciences, University of California, Davis
| | | | | | | |
Collapse
|
19
|
Mona S, Mordret E, Veuille M, Tommaseo-Ponzetta M. Investigating sex-specific dynamics using uniparental markers: West New Guinea as a case study. Ecol Evol 2013; 3:2647-60. [PMID: 24567829 PMCID: PMC3930047 DOI: 10.1002/ece3.660] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2013] [Revised: 05/27/2013] [Accepted: 05/31/2013] [Indexed: 12/24/2022] Open
Abstract
Mitochondrial DNA (mtDNA) and Y chromosome (NRY) genetic markers have been often contrasted to investigate sex-specific dynamics. Traditionally, isolation by distance, intrapopulation genetic diversity and population differentiation are estimated from both markers and compared. Two possible sources of bias are often neglected. First, kilometric distances are frequently used as predictor of the connectivity between groups, hiding the role played by environmental features at a microgeographic scale. Second, the comparison of intrapopulation diversity and population differentiation between mtDNA and NRY is hampered by their different mutational mechanisms and rates. Here, we show how to account for these biases by analyzing from a different perspective a published dataset of eight West New Guinea (WNG) populations for which mtDNA control region sequences and seven linked NRY microsatellites had been typed. First, we modeled the connectivity among sampled populations by computing the number of days required to travel between groups. Then, we investigated the differences between the two sexes accounting for the molecular characteristics of the markers examined to obtain estimates on the product of the effective population size and the migration rate among demes (Nm). We achieved this goal by studying the shape of the gene genealogy at several sampling levels and using spatial explicit simulations. Both the direction and the rate of migration differ between male and females, with an Nm estimated to be >6 times higher in the latter under many evolutionary scenarios. We finally highlight the importance of applying metapopulation models when analyzing the genetic diversity of a species. We have applied the prediction of the sampling theory in a meta-population and we have corroborated our finding using spatial explicit simulations. Both approaches are fundamentally meant to deal with structured populations: we strongly believe in the importance of tacking structure into account when inferring the demographic history of a species.
Collapse
Affiliation(s)
- Stefano Mona
- Laboratoire Biologie intégrative des populations, Ecole Pratique des Hautes Etudes 46 rue de Lille, 75007, Paris, France ; CNRS UMR 7205, Muséum National d'Histoire Naturelle Rue Buffon, 75005, Paris, France
| | - Ernest Mordret
- Laboratoire Biologie intégrative des populations, Ecole Pratique des Hautes Etudes 46 rue de Lille, 75007, Paris, France ; CNRS UMR 7205, Muséum National d'Histoire Naturelle Rue Buffon, 75005, Paris, France
| | - Michel Veuille
- Laboratoire Biologie intégrative des populations, Ecole Pratique des Hautes Etudes 46 rue de Lille, 75007, Paris, France ; CNRS UMR 7205, Muséum National d'Histoire Naturelle Rue Buffon, 75005, Paris, France
| | | |
Collapse
|
20
|
Gattepaille LM, Jakobsson M, Blum MGB. Inferring population size changes with sequence and SNP data: lessons from human bottlenecks. Heredity (Edinb) 2013; 110:409-19. [PMID: 23423148 PMCID: PMC3630807 DOI: 10.1038/hdy.2012.120] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Reconstructing historical variation of population size from sequence and single-nucleotide polymorphism (SNP) data is valuable for understanding the evolutionary history of species. Changes in the population size of humans have been thoroughly investigated, and we review different methodologies of demographic reconstruction, specifically focusing on human bottlenecks. In addition to the classical approaches based on the site-frequency spectrum (SFS) or based on linkage disequilibrium, we also review more recent approaches that utilize atypical shared genomic fragments, such as identical by descent or homozygous segments between or within individuals. Compared with methods based on the SFS, these methods are well suited for detecting recent bottlenecks. In general, all these various methods suffer from bias and dependencies on confounding factors such as population structure or poor specification of the mutational and recombination processes, which can affect the demographic reconstruction. With the exception of SFS-based methods, the effects of confounding factors on the inference methods remain poorly investigated. We conclude that an important step when investigating population size changes rests on validating the demographic model by investigating to what extent the fitted demographic model can reproduce the main features of the polymorphism data.
Collapse
Affiliation(s)
- L M Gattepaille
- Department of Evolutionary Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| | | | | |
Collapse
|
21
|
Wang B, Jiang J, Xie F, Li C. Phylogeographic patterns of mtDNA variation revealed multiple glacial refugia for the frog species Feirana taihangnica endemic to the Qinling Mountains. J Mol Evol 2013; 76:112-28. [PMID: 23381112 DOI: 10.1007/s00239-013-9544-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2011] [Accepted: 01/19/2013] [Indexed: 11/24/2022]
Abstract
Diversification patterns and demography of montane species are affected by Pleistocene climate fluctuations. Empirical cases from the Qinling Mountains (QM) region, which is a major biogeographic divider of East Asia, are few. We used DNA sequence data of the complete mitochondrial ND2 gene to detect effects of the Pleistocene glaciations on phylogeographic profiles of a frog species, Feirana taihangnica, which is endemic to the QM. Four distinct lineages consisting of seven sublineages were revealed. The strongest signal of biogeographical structure (F(ct) = 0.971, P < 0.01) was found when populations were grouped according to these seven sublineages. One narrow secondary contact zone was detected in the middle QM between the lineage from middle QM and the lineage from eastern QM. Coalescent simulations indicated that this species colonized the QM region by a stepping-stone model. Divergences among lineages had likely been influenced by the uplift of the Tibetan Plateau during the late Miocene-to-late Pleistocene, as well as by the Pleistocene climatic cycles. Coalescent simulations also suggested that F. taihangnica populations have persisted through the Pleistocene glacial periods in multiple refugia across the QM region. Demographic analyses indicated that all lineages, except the lineage in the Funiu Mountains, have been experienced postglacial expansion of population size and distribution range. In conclusion, Pleistocene climate fluctuations and tectonic changes during the late Miocene-late Pleistocene have profoundly influenced the phylogeography and historical demography of F. taihangnica.
Collapse
Affiliation(s)
- Bin Wang
- Chengdu Institute of Biology, Chinese Academy of Sciences, Chengdu, China
| | | | | | | |
Collapse
|
22
|
Pyhäjärvi T, Hufford MB, Mezmouk S, Ross-Ibarra J. Complex patterns of local adaptation in teosinte. Genome Biol Evol 2013. [PMID: 23902747 DOI: 10.1093/gbe.evt109] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/10/2023] Open
Abstract
Populations of widely distributed species encounter and must adapt to local environmental conditions. However, comprehensive characterization of the genetic basis of adaptation is demanding, requiring genome-wide genotype data, multiple sampled populations, and an understanding of population structure and potential selection pressures. Here, we used single-nucleotide polymorphism genotyping and data on numerous environmental variables to describe the genetic basis of local adaptation in 21 populations of teosinte, the wild ancestor of maize. We found complex hierarchical genetic structure created by altitude, dispersal events, and admixture among subspecies, which complicated identification of locally beneficial alleles. Patterns of linkage disequilibrium revealed four large putative inversion polymorphisms showing clinal patterns of frequency. Population differentiation and environmental correlations suggest that both inversions and intergenic polymorphisms are involved in local adaptation.
Collapse
Affiliation(s)
- Tanja Pyhäjärvi
- Department of Plant Sciences, University of California, Davis
| | | | | | | |
Collapse
|
23
|
Tachida H. Linkage disequilibrium in a population undergoing periodic fragmentation and admixture. Genes Genet Syst 2012; 87:125-35. [PMID: 22820386 DOI: 10.1266/ggs.87.125] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Glacial and interglacial cycles are considered to have caused the fragmentation and admixture of populations in many organisms. A simple model incorporating such periodic changes of the population structure is analysed in order to investigate the behaviour of neutral genetic variation at one and two loci. The equilibrium is reached very quickly in terms of cycles if the length of a cycle is long, as would be expected of the glaciation cycles. Heterozygosity and linkage disequilibrium are shown to depend on the length of time of the fragmented and admixed phases, population sizes, and number (n) of subpopulations in the fragmented phase. If the population size is small in the fragmented phase and its duration is long, the squared correlation coefficient of two loci (a measure of linkage disequilibrium) just after the admixture is approximated by 1/(n-1) for n > 1. After admixture, the correlation decays at a rate of approximately twice the recombination rate. Therefore, if post-glaciation admixture created linkage disequilibrium, we expect to observe linkage disequilibrium even between moderately linked loci, and its decay pattern along the chromosome is very different from that in a random mating population at equilibrium. This is especially true in organisms with long generation times such as trees.
Collapse
Affiliation(s)
- Hidenori Tachida
- Department of Biology, Faculty of Sciences, Kyushu University, Higashi-ku, Fukuoka, Japan.
| |
Collapse
|
24
|
Impact of sampling schemes on demographic inference: an empirical study in two species with different mating systems and demographic histories. G3-GENES GENOMES GENETICS 2012; 2:803-14. [PMID: 22870403 PMCID: PMC3385986 DOI: 10.1534/g3.112.002410] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2012] [Accepted: 05/10/2012] [Indexed: 12/12/2022]
Abstract
Most species have at least some level of genetic structure. Recent simulation studies have shown that it is important to consider population structure when sampling individuals to infer past population history. The relevance of the results of these computer simulations for empirical studies, however, remains unclear. In the present study, we use DNA sequence datasets collected from two closely related species with very different histories, the selfing species Capsella rubella and its outcrossing relative C. grandiflora, to assess the impact of different sampling strategies on summary statistics and the inference of historical demography. Sampling strategy did not strongly influence the mean values of Tajima's D in either species, but it had some impact on the variance. The general conclusions about demographic history were comparable across sampling schemes even when resampled data were analyzed with approximate Bayesian computation (ABC). We used simulations to explore the effects of sampling scheme under different demographic models. We conclude that when sequences from modest numbers of loci (<60) are analyzed, the sampling strategy is generally of limited importance. The same is true under intermediate or high levels of gene flow (4Nm > 2-10) in models in which global expansion is combined with either local expansion or hierarchical population structure. Although we observe a less severe effect of sampling than predicted under some earlier simulation models, our results should not be seen as an encouragement to neglect this issue. In general, a good coverage of the natural range, both within and between populations, will be needed to obtain a reliable reconstruction of a species's demographic history, and in fact, the effect of sampling scheme on polymorphism patterns may itself provide important information about demographic history.
Collapse
|
25
|
Small effective population size and genetic homogeneity in the Val Borbera isolate. Eur J Hum Genet 2012; 21:89-94. [PMID: 22713810 DOI: 10.1038/ejhg.2012.113] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022] Open
Abstract
Population isolates are a valuable resource for medical genetics because of their reduced genetic, phenotypic and environmental heterogeneity. Further, extended linkage disequilibrium (LD) allows accurate haplotyping and imputation. In this study, we use nuclear and mitochondrial DNA data to determine to what extent the geographically isolated population of the Val Borbera valley also presents features of genetic isolation. We performed a comparative analysis of population structure and estimated effective population size exploiting LD data. We also evaluated haplotype sharing through the analysis of segments of autozygosity. Our findings reveal that the valley has features characteristic of a genetic isolate, including reduced genetic heterogeneity and reduced effective population size. We show that this population has been subject to prolonged genetic drift and thus we expect many variants that are rare in the general population to reach significant frequency values in the valley, making this population suitable for the identification of rare variants underlying complex traits.
Collapse
|
26
|
Cutter AD, Wang GX, Ai H, Peng Y. Influence of finite-sites mutation, population subdivision and sampling schemes on patterns of nucleotide polymorphism for species with molecular hyperdiversity. Mol Ecol 2012; 21:1345-59. [PMID: 22320847 DOI: 10.1111/j.1365-294x.2012.05475.x] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Molecular hyperdiversity has been documented in viruses, prokaryotes and eukaryotes. Such organisms undermine the assumptions of the infinite-sites mutational model, because multiple mutational events at a site comprise a non-negligible portion of polymorphisms. Moreover, different sampling schemes of individuals from species with subdivided populations can profoundly influence resulting patterns and interpretations of molecular variation. Inspired by molecular hyperdiversity in the nematode Caenorhabditis sp. 5, which exhibits average pairwise differences among synonymous sites of >5% as well as modest population structure, we investigated via coalescent simulation the joint effects of a finite-sites mutation (FSM) process and population subdivision on the variant frequency spectrum. From many demes interconnected through a stepping-stone migration model, we constructed local samples from a single deme, pooled samples from several demes and scattered samples of a single individual from numerous demes. Compared with a single panmictic population at equilibrium, we find that high population mutation rates induce a deficit of rare variants (positive Tajima's D) under a FSM model. Population structure also induces such a skew for local samples when migration is high and for pooled samples when migration is low. Contrasts of sampling schemes for C. sp. 5 imply high mutational input coupled with high migration. We propose that joint analysis of local, pooled and scattered samples for species with subdivided populations provides a means of improving inference of demographic history, by virtue of the partially distinct patterns of polymorphism that manifest when sequences are analyzed according to differing sampling schemes.
Collapse
Affiliation(s)
- Asher D Cutter
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Ontario, Canada.
| | | | | | | |
Collapse
|
27
|
Pettengill JB, Moeller DA. TEMPO AND MODE OF MATING SYSTEM EVOLUTION BETWEEN INCIPIENT CLARKIA SPECIES. Evolution 2011; 66:1210-25. [DOI: 10.1111/j.1558-5646.2011.01521.x] [Citation(s) in RCA: 46] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
|
28
|
Bürger R, Akerman A. The effects of linkage and gene flow on local adaptation: a two-locus continent-island model. Theor Popul Biol 2011; 80:272-88. [PMID: 21801739 PMCID: PMC3257863 DOI: 10.1016/j.tpb.2011.07.002] [Citation(s) in RCA: 65] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2011] [Revised: 07/08/2011] [Accepted: 07/11/2011] [Indexed: 11/24/2022]
Abstract
Population subdivision and migration are generally considered to be important causes of linkage disequilibrium (LD). We explore the combined effects of recombination and gene flow on the amount of LD, the maintenance of polymorphism, and the degree of local adaptation in a subdivided population by analyzing a diploid, deterministic continent-island model with genic selection on two linked loci (i.e., no dominance or epistasis). For this simple model, we characterize explicitly all possible equilibrium configurations. Simple and intuitive approximations for many quantities of interest are obtained in limiting cases, such as weak migration, weak selection, weak or strong recombination. For instance, we derive explicit expressions for the measures D(=p(AB)-p(A)p(B)) and r(2) (the squared correlation in allelic state) of LD. They depend in qualitatively different ways on the migration rate. Remarkably high values of r(2) are maintained between weakly linked loci, especially if gene flow is low. We determine how the maximum amount of gene flow that admits preservation of the locally adapted haplotype, hence of polymorphism at both loci, depends on recombination rate and selection coefficients. We also investigate the evolution of differentiation by examining the invasion of beneficial mutants of small effect that are linked to an already present, locally adapted allele. Mutants of much smaller effect can invade successfully than predicted by naive single-locus theory provided they are at least weakly linked. Finally, the influence of linkage on the degree of local adaptation, the migration load, and the effective migration rate at a neutral locus is explored. We discuss possible consequences for the evolution of genetic architecture, in particular, for the emergence of clusters of tightly linked, slightly beneficial mutations and the evolution of recombination and chromosome inversions.
Collapse
|
29
|
EVANS BENJ, BLISS SHIREENM, MENDEL SIMONEA, TINSLEY RICHARDC. The Rift Valley is a major barrier to dispersal of African clawed frogs (Xenopus) in Ethiopia. Mol Ecol 2011; 20:4216-30. [DOI: 10.1111/j.1365-294x.2011.05262.x] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
30
|
Muir G, Bergero R, Charlesworth D, Filatov DA. Does local adaptation cause high population differentiation of Silene latifolia Y chromosomes? Evolution 2011; 65:3368-80. [PMID: 22133212 DOI: 10.1111/j.1558-5646.2011.01410.x] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
Abstract
Natural selection can reduce the effective population size of the nonrecombining Y chromosome, whereas local adaptation of Y-linked genes can increase the population divergence and overall intra-species polymorphism of Y-linked sequences. The plant Silene latifolia evolved a Y chromosome relatively recently, and most known X-linked genes have functional Y homologues, making the species interesting for comparisons of X- and Y-linked diversity and subdivision. Y-linked genes show higher population differentiation, compared to X-linked genes, and this might be maintained by local adaptation in Y-linked genes (or low sequence diversity). Here we attempt to test between these causes by investigating DNA polymorphism and population differentiation using a larger set of Y-linked and X-linked S. latifolia genes (than used previously), and show that net sequence divergence for Y-linked sequences (measured by D(a) , also known as δ) is low, and not consistently higher than X-linked genes. This does not support local adaptation, instead, the higher values of differentiation measures for the Y-linked genes probably result largely from reduced total variation on the Y chromosome, which in turn reflect deterministic processes lowering effective population sizes of evolving Y-chromosomes.
Collapse
Affiliation(s)
- Graham Muir
- Department of Plant Sciences, University of Oxford, South Parks Road, Oxford, OX1 3RB, United Kingdom.
| | | | | | | |
Collapse
|
31
|
St Onge KR, Källman T, Slotte T, Lascoux M, Palmé AE. Contrasting demographic history and population structure in Capsella rubella and Capsella grandiflora, two closely related species with different mating systems. Mol Ecol 2011; 20:3306-20. [PMID: 21777317 DOI: 10.1111/j.1365-294x.2011.05189.x] [Citation(s) in RCA: 50] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
Both mating system and population history can have large impacts on genetic diversity and population structure. Here, we use multilocus sequence data to investigate how these factors impact two closely related Brassicaceae species: the selfing Capsella rubella and the outcrossing C. grandiflora. To do this, we have sequenced 16 loci in approximately 70 individuals from 7 populations of each species. Patterns of population structure differ strongly between the two species. In C. grandiflora, we observe an isolation-by-distance pattern and identify three clearly delineated genetic groups. In C. rubella, where we estimate the selfing rate to be 0.90-0.94, the pattern is less clear with some sampling populations forming separate genetic clusters while others are highly mixed. The two species also have divergent histories. Our analysis gives support for a bottleneck approximately 73 kya (20-139 kya) in C. rubella, which most likely represents speciation from C. grandiflora. In C. grandiflora, there is moderate support for the standard neutral model in 2 of 3 genetic clusters, while the third cluster and the total data set show evidence of expansion. It is clear that mating system has an impact on these two species, for example affecting the level of genetic variation and the genetic structure. However, our results also clearly show that a combination of past and present processes, some of which are not affected by mating system, is needed to explain the differences between C. rubella and C. grandiflora.
Collapse
Affiliation(s)
- Kate R St Onge
- Department of Plant Ecology and Evolution, Uppsala University, Norbyvägen 18D, 752 36 Uppsala, Sweden
| | | | | | | | | |
Collapse
|
32
|
Zeng K, Charlesworth B. The effects of demography and linkage on the estimation of selection and mutation parameters. Genetics 2010; 186:1411-24. [PMID: 20923980 PMCID: PMC2998320 DOI: 10.1534/genetics.110.122150] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2010] [Accepted: 09/27/2010] [Indexed: 11/18/2022] Open
Abstract
We explore the effects of demography and linkage on a maximum-likelihood (ML) method for estimating selection and mutation parameters in a reversible mutation model. This method assumes free recombination between sites and a randomly mating population of constant size and uses information from both polymorphic and monomorphic sites in the sample. Two likelihood-ratio test statistics were constructed under this ML framework: LRTγ for detecting selection and LRTκ for detecting mutational bias. By carrying out extensive simulations, we obtain the following results. When mutations are neutral and population size is constant, LRTγ and LRTκ follow a chi-square distribution with 1 d.f. regardless of the level of linkage, as long as the mutation rate is not very high. In addition, LRTγ and LRTκ are relatively insensitive to demographic effects and selection at linked sites. We find that the ML estimators of the selection and mutation parameters are usually approximately unbiased and that LRTκ usually has good power to detect mutational bias. Finally, with a recombination rate that is typical for Drosophila, LRTγ has good power to detect weak selection acting on synonymous sites. These results suggest that the method should be useful under many different circumstances.
Collapse
Affiliation(s)
- Kai Zeng
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JT, United Kingdom.
| | | |
Collapse
|
33
|
Qiu S, Bergero R, Forrest A, Kaiser VB, Charlesworth D. Nucleotide diversity in Silene latifolia autosomal and sex-linked genes. Proc Biol Sci 2010; 277:3283-90. [PMID: 20519224 PMCID: PMC2981921 DOI: 10.1098/rspb.2010.0606] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2010] [Accepted: 05/13/2010] [Indexed: 11/12/2022] Open
Abstract
The plant Silene latifolia has separate sexes and sex chromosomes, and is of interest for studying the early stages of sex chromosome evolution, especially the evolution of non-recombining regions on the Y chromosome. Hitch-hiking processes associated with ongoing genetic degeneration of the non-recombining Y chromosome are predicted to reduce Y-linked genes' effective population sizes, and S. latifolia Y-linked genes indeed have lower diversity than X-linked ones. We tested whether this represents a true diversity reduction on the Y, versus the alternative possibility, elevated diversity at X-linked genes, by collecting new data on nucleotide diversity for autosomal genes, which had previously been little studied. We find clear evidence that Y-linked genes have reduced diversity. However, another alternative explanation for a low Y effective size is a high variance in male reproductive success. Autosomal genes should then also have lower diversity than expected, relative to the X, but this is not found in our loci. Taking into account the higher mutation rate of Y-linked genes, their low sequence diversity indicates a strong effect of within-population hitch-hiking on the Y chromosome.
Collapse
Affiliation(s)
- Suo Qiu
- Institute of Evolutionary Biology, School of Biological Sciences, The University of Edinburgh, Ashworth Laboratory, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK
- State Key Laboratory of Biocontrol and Key Laboratory of Gene Engineering of the Ministry of Education, Sun Yat-Sen University, Guangzhou 510275, People's Republic of China
| | - Roberta Bergero
- Institute of Evolutionary Biology, School of Biological Sciences, The University of Edinburgh, Ashworth Laboratory, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK
| | - Alan Forrest
- Institute of Evolutionary Biology, School of Biological Sciences, The University of Edinburgh, Ashworth Laboratory, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK
| | - Vera B. Kaiser
- Institute of Evolutionary Biology, School of Biological Sciences, The University of Edinburgh, Ashworth Laboratory, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK
| | - Deborah Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, The University of Edinburgh, Ashworth Laboratory, King's Buildings, West Mains Road, Edinburgh EH9 3JT, UK
| |
Collapse
|
34
|
Abstract
There has been an enormous increase in the amount of data on DNA sequence polymorphism available for many organisms in the last decade. New sequencing technologies provide great potential for investigating natural selection in plants using population genomic approaches. However, plant populations frequently show significant departures from the assumptions of standard models used to detect selection and many forms of directional selection do not fit with classical population genetics theory. Here, we explore the extent to which plant populations show departures from standard model assumptions, and the implications this has for detecting selection on molecular variation. A growing number of multilocus studies of nucleotide variation suggest that changes in population size, particularly bottlenecks, and strong subdivision may be common in plants. This demographic variation presents important challenges for models used to infer selection. In addition, selection from standing genetic variation and multiple independent adaptive substitutions can further complicate efforts to understand the nature of selection. We discuss emerging patterns from plant studies and propose that, rather than treating population history as a nuisance variable when testing for selection, the interaction between demography and selection is of fundamental importance for evolutionary studies of plant populations using molecular data.
Collapse
Affiliation(s)
- Mathieu Siol
- Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, ON, Canada.
| | | | | |
Collapse
|
35
|
Panchal M, Beaumont MA. Evaluating nested clade phylogeographic analysis under models of restricted gene flow. Syst Biol 2010; 59:415-32. [PMID: 20547778 DOI: 10.1093/sysbio/syq022] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Nested clade phylogeographic analysis (NCPA) is a widely used method that aims to identify past demographic events that have shaped the history of a population. In an earlier study, NCPA has been fully automated, allowing it to be tested with simulated data sets generated under a null model in which samples simulated from a panmictic population are geographically distributed. It was noted that NCPA was prone to inferring false positives, corroborating earlier findings. The present study aims to evaluate both single-locus and multilocus NCPA under the scenario of restricted gene flow among spatially distributed populations. We have developed a new program, ANeCA-ML, which implements multilocus NCPA. Data were simulated under 3 models of gene flow: a stepping stone model, an island model, and a stepping stone model with some long-distance dispersal. Results indicate that single-locus NCPA tends to give a high frequency of false positives, but, unlike the random-mating scenario presented previously, inferences are not limited to restricted gene flow with isolation by distance or contiguous range expansion. The proportion of single-locus data sets that contained false inferences was 76% for the panmictic case, 87% for the stepping stone model, 79% for the stepping stone model with long-distance dispersal, and more than 99% for the island model. The frequency of inferences is inversely related to the amount of gene flow between demes. We performed multilocus NCPA by grouping the simulated loci into data sets of 5 loci. The false-positive rate was reduced in multilocus NCPA for some inferences but remained high for others. The proportion of multilocus data sets that contained false inferences was 17% for the panmictic case, 30% for the stepping stone model, 4% for the stepping stone model with long-distance dispersal, and 54% for the island model. Multilocus NCPA reduces the false-positive rate by restricting the sensitivity of the method but does not appear to increase the accuracy of the approach. Three classical tests-the analysis of molecular variance method, Fu's Fs, and the Mantel test-show that there is information in the data that gives rise to explicable results using these standard approaches. In conclusion, for the scenarios that we have examined, our simulation study suggests that the NCPA method is unreliable and its inferences may be misleading. We suggest that the NCPA method should not be used without objective simulation-based testing by independent researchers.
Collapse
Affiliation(s)
- Mahesh Panchal
- School of Biological Sciences, University of Reading, Whiteknights, PO Box 228, Reading RG6 6AJ, UK.
| | | |
Collapse
|
36
|
Xie X. The Site-Frequency Spectrum of Linked Sites. Bull Math Biol 2010; 73:459-94. [DOI: 10.1007/s11538-010-9534-3] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2009] [Accepted: 03/08/2010] [Indexed: 11/27/2022]
|
37
|
Pepperell C, Hoeppner VH, Lipatov M, Wobeser W, Schoolnik GK, Feldman MW. Bacterial genetic signatures of human social phenomena among M. tuberculosis from an Aboriginal Canadian population. Mol Biol Evol 2009; 27:427-40. [PMID: 19861642 DOI: 10.1093/molbev/msp261] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
Despite a widespread global distribution and highly variable disease phenotype, there is little DNA sequence diversity among isolates of Mycobacterium tuberculosis. In addition, many regional population genetic surveys have revealed a stereotypical structure in which a single clone, lineage, or clade makes up the majority of the population. It is often assumed that dominant clones are highly adapted, that is, the overall structure of M. tuberculosis populations is the result of positive selection. In order to test this assumption, we analyzed genetic data from extant populations of bacteria circulating in Aboriginal communities in Saskatchewan, Canada. Demographic parameters of the bacterial population were estimated from archival epidemiological data collected over approximately 130 years since the onset of epidemic tuberculosis in the host communities. Bacterial genetic data were tested against neutral theory expectations and the local evolutionary history of M. tuberculosis investigated by phylogenetic analysis. Our findings are not consistent with positive selection on the bacterial population. Instead, we uncovered founder effects persisting over decades and barriers to gene flow within the bacterial population. Simulation experiments suggested that a combination of these neutral influences could result in the stereotypical structure of M. tuberculosis populations. Some aspects of population structure were suggestive of background selection, and data were on the whole consistent with combined effects of population bottlenecks, subdivision, and background selection. Neutral phenomena, namely, bottlenecks and partitions within populations, are prominent influences on the evolution of M. tuberculosis and likely contribute to restricted genetic diversity observed within this species. Given these influences, a complex evolutionary model will be required to define the relative fitness of different M. tuberculosis lineages and, ultimately, to uncover the genetic basis for its success as a pathogen.
Collapse
|
38
|
Abstract
The spatial genetic composition of hybrid zones exhibits a range of possible patterns, with many characterized by patchy distributions. While several hypothetical explanations exist for the maintenance of these "mosaic" hybrid zones, they remain virtually unexplored theoretically. Using computer simulations we investigate the roles of dispersal and assortative mating in the formation and persistence of hybrid zone structure. To quantify mosaic structure we develop a likelihood method, which we apply to simulation and empirical data. We find that long distance dispersal can lead to a patchy distribution that assortative mating can then reinforce, ultimately producing a mosaic capable of persisting over evolutionarily significant periods of time. By reducing the mating success of rare males, assortative mating creates a positive within-patch frequency-dependent selective pressure. Selection against heterozygotes can similarly create a rare-type disadvantage and we show that it can also preserve structure. We find that mosaic structure is maintained across a range of assumptions regarding the form and strength of assortative mating. Interestingly, we find that higher levels of mosaic structure are sometimes observed for intermediate assortment strengths. The high incidence of assortment documented in hybrid zones suggests that it may play a key role in stabilizing their form and structure.
Collapse
Affiliation(s)
- Leithen K M'Gonigle
- Department of Zoology, University of British Columbia, Vancouver, BC, V6T 1Z4, Canada
| | | |
Collapse
|
39
|
Städler T, Haubold B, Merino C, Stephan W, Pfaffelhuber P. The impact of sampling schemes on the site frequency spectrum in nonequilibrium subdivided populations. Genetics 2009; 182:205-16. [PMID: 19237689 PMCID: PMC2674817 DOI: 10.1534/genetics.108.094904] [Citation(s) in RCA: 162] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2008] [Accepted: 02/17/2009] [Indexed: 11/18/2022] Open
Abstract
Using coalescent simulations, we study the impact of three different sampling schemes on patterns of neutral diversity in structured populations. Specifically, we are interested in two summary statistics based on the site frequency spectrum as a function of migration rate, demographic history of the entire substructured population (including timing and magnitude of specieswide expansions), and the sampling scheme. Using simulations implementing both finite-island and two-dimensional stepping-stone spatial structure, we demonstrate strong effects of the sampling scheme on Tajima's D (D(T)) and Fu and Li's D (D(FL)) statistics, particularly under specieswide (range) expansions. Pooled samples yield average D(T) and D(FL) values that are generally intermediate between those of local and scattered samples. Local samples (and to a lesser extent, pooled samples) are influenced by local, rapid coalescence events in the underlying coalescent process. These processes result in lower proportions of external branch lengths and hence lower proportions of singletons, explaining our finding that the sampling scheme affects D(FL) more than it does D(T). Under specieswide expansion scenarios, these effects of spatial sampling may persist up to very high levels of gene flow (Nm > 25), implying that local samples cannot be regarded as being drawn from a panmictic population. Importantly, many data sets on humans, Drosophila, and plants contain signatures of specieswide expansions and effects of sampling scheme that are predicted by our simulation results. This suggests that validating the assumption of panmixia is crucial if robust demographic inferences are to be made from local or pooled samples. However, future studies should consider adopting a framework that explicitly accounts for the genealogical effects of population subdivision and empirical sampling schemes.
Collapse
Affiliation(s)
- Thomas Städler
- Faculty of Mathematics and Physics, University of Freiburg, D-79104 Freiburg, Germany.
| | | | | | | | | |
Collapse
|
40
|
LI SHOUHSIEN, YEUNG CAROLKL, FEINSTEIN JULIE, HAN LIANXIAN, LE MANHHUNG, WANG CHIXAN, DING PING. Sailing through the Late Pleistocene: unusual historical demography of an East Asian endemic, the Chinese Hwamei (Leucodioptron canorum canorum), during the last glacial period. Mol Ecol 2009; 18:622-33. [DOI: 10.1111/j.1365-294x.2008.04028.x] [Citation(s) in RCA: 91] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
41
|
Kamiya K, Moritsuka E, Yoshida T, Yahara T, Tachida H. High population differentiation and unusual haplotype structure in a shade-intolerant pioneer tree species, Zanthoxylum ailanthoides (Rutaceae) revealed by analysis of DNA polymorphism at four nuclear loci. Mol Ecol 2008; 17:2329-38. [PMID: 18429965 DOI: 10.1111/j.1365-294x.2008.03756.x] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Differences in demographic history, life-history traits, and breeding systems affect nucleotide variation patterns. It is expected that shade-intolerant pioneer tree species have different patterns of genetic polymorphism and population structure than climax species. We studied patterns of nucleotide polymorphism at four putative starch pathway loci (agpSA, agpSB, agpL, and GBSSI) in Zanthoxylum ailanthoides, a shade-intolerant pioneer tree species that occupies forest gaps in warm-temperate forests of East Asia. Genetic diversity was lower within each population than among populations, and differentiation among populations was high across the loci (F(ST) = 0.32-0.64), as expected from the insect-pollinated breeding system and the metapopulation structure of this pioneer species. Numbers of haplotypes were smaller than those expected from the observed numbers of segregating sites. Single haplotypes accounted for more than 47% of all the sampled genes at the respective loci. These variation patterns were incompatible with neutral predictions for populations of a finite island model. Complex population dynamics, such as bottleneck and/or admixture, in the history of this pioneer tree species might have resulted in the observed patterns of genetic variation and population structure, which are different from those of climax wind-pollinated tree species, such as conifers. In contrast to the other loci investigated in this study, agpL showed nearly no variation in Z. ailanthoides (one singleton only), but there was some extent of variation in a closely related species, Zanthoxylum schinifolium. This suggests possibly a recent selective sweep at or near the locus in Z. ailanthoides.
Collapse
Affiliation(s)
- K Kamiya
- Department of Biology, Faculty of Sciences, Kyushu University, 4-2-1 Ropponmatsu, Chuo-ku, Fukuoka 810-8560, Japan
| | | | | | | | | |
Collapse
|
42
|
Population genetics of speciation in two closely related wild tomatoes (Solanum section Lycopersicon). Genetics 2008; 178:339-50. [PMID: 18202377 DOI: 10.1534/genetics.107.081810] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present a multilocus sequencing study to assess patterns of polymorphism and divergence in the closely related wild tomato species, Solanum peruvianum and S. chilense (Solanum section Lycopersicon, Solanaceae). The data set comprises seven mapped nuclear loci (approximately 9.3 kb of analyzed sequence across loci) and four local population samples per species that cover much of the species' range (between 80 and 88 sequenced alleles across both species). We employ the analytical framework of divergence population genetics (DPG) in evaluating the utility of the "isolation" model of speciation to explain observed patterns of polymorphism and divergence. Whereas the isolation model is not rejected by goodness-of-fit criteria established via coalescent simulations, patterns of intragenic linkage disequilibrium provide evidence for postdivergence gene flow at two of the seven loci. These results suggest that speciation occurred under residual gene flow, implying that natural selection is one of the evolutionary forces driving the divergence of these tomato species. This inference is fully consistent with their recent divergence, conservatively estimated to be <or=0.55 million years. We discuss possible biases in the demographic parameter estimates due to the current restriction of DPG algorithms to panmictic species.
Collapse
|
43
|
Arunyawat U, Stephan W, Städler T. Using multilocus sequence data to assess population structure, natural selection, and linkage disequilibrium in wild tomatoes. Mol Biol Evol 2007; 24:2310-22. [PMID: 17675653 DOI: 10.1093/molbev/msm162] [Citation(s) in RCA: 80] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
We employed a multilocus approach to examine the effects of population subdivision and natural selection on DNA polymorphism in 2 closely related wild tomato species (Solanum peruvianum and Solanum chilense), using sequence data for 8 nuclear loci from populations across much of the species' range. Both species exhibit substantial levels of nucleotide variation. The species-wide level of silent nucleotide diversity is 18% higher in S. peruvianum (pi(sil) approximately 2.50%) than in S. chilense (pi(sil) approximately 2.12%). One of the loci deviates from neutral expectations, showing a clinal pattern of nucleotide diversity and haplotype structure in S. chilense. This geographic pattern of variation is suggestive of an incomplete (ongoing) selective sweep, but neutral explanations cannot be entirely dismissed. Both wild tomato species exhibit moderate levels of population differentiation (average F(ST) approximately 0.20). Interestingly, the pooled samples (across different demes) exhibit more negative Tajima's D and Fu and Li's D values; this marked excess of low-frequency polymorphism can only be explained by population (or range) expansion and is unlikely to be due to population structure per se. We thus propose that population structure and population/range expansion are among the most important evolutionary forces shaping patterns of nucleotide diversity within and among demes in these wild tomatoes. Patterns of population differentiation may also be impacted by soil seed banks and historical associations mediated by climatic cycles. Intragenic linkage disequilibrium (LD) decays very rapidly with physical distance, suggesting high recombination rates and effective population sizes in both species. The rapid decline of LD seems very promising for future association studies with the purpose of mapping functional variation in wild tomatoes.
Collapse
Affiliation(s)
- Uraiwan Arunyawat
- Section of Evolutionary Biology, Department Biologie II, University of Munich (LMU), Planegg-Martinsried, Germany
| | | | | |
Collapse
|