1
|
Fabiano EC, Bonatto SL, Schmidt-Küntzel A, O’Brien SJ, Marker L, Eizirik E. Inferring the historical demography of southern African cheetahs (Acinonyx jubatus) using Bayesian analyses of molecular genetic data. Genet Mol Biol 2025; 48:e20240253. [PMID: 40408594 PMCID: PMC12091599 DOI: 10.1590/1678-4685-gmb-2024-0253] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2024] [Accepted: 03/05/2025] [Indexed: 05/25/2025] Open
Abstract
The contemporary genetic diversity of the cheetah (Acinonyx jubatus) has been the focus of several studies, which have revealed very low levels of variation. Different hypotheses have been proposed to explain this pattern of low diversity, and require additional scrutiny. Here, we used published microsatellite data and coalescence-based analytical methods to explore the historical demography of the largest free-ranging cheetah population, aiming to assess whether present-day diversity may have been impacted by a historical demographic decline. Our results support the hypothesis of a historical (and most likely gradual) demographic decline over the past ~10,000 years, leading to a present-day N e ranging from 700 to 1,600 individuals. This decline was likely induced by climate-driven vegetational shifts affecting habitat suitability and possibly also interspecies interactions with prey and competitors. These results help clarify the demographic history of cheetahs in southern Africa and its impact on the current genetic diversity of this population.
Collapse
Affiliation(s)
- Ezequiel Chimbioputo Fabiano
- Cheetah Conservation Fund, Otjiwarongo, Namibia
- Pontifícia Universidade Católica do Rio Grande do Sul, Escola de Ciências da Saúde e da Vida, Laboratório de Biologia Genômica e Molecular, Porto Alegre, RS, Brazil
- University of Namibia, Ngweze, Katima Mulilo, Namibia
| | - Sandro Luis Bonatto
- Pontifícia Universidade Católica do Rio Grande do Sul, Escola de Ciências da Saúde e da Vida, Laboratório de Biologia Genômica e Molecular, Porto Alegre, RS, Brazil
| | | | | | | | - Eduardo Eizirik
- Pontifícia Universidade Católica do Rio Grande do Sul, Escola de Ciências da Saúde e da Vida, Laboratório de Biologia Genômica e Molecular, Porto Alegre, RS, Brazil
| |
Collapse
|
2
|
Ben Chehida Y, Stelwagen T, Hoekendijk JPA, Ferreira M, Eira C, Torres‐Pereira A, Nicolau L, Thumloup J, Fontaine MC. Harbor porpoise losing its edge: Genetic time series suggests a rapid population decline in Iberian waters over the last 30 years. Ecol Evol 2023; 13:e10819. [PMID: 38089896 PMCID: PMC10714065 DOI: 10.1002/ece3.10819] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/23/2023] [Accepted: 11/27/2023] [Indexed: 10/16/2024] Open
Abstract
Impact of climate change is expected to be especially noticeable at the edges of a species' distribution, where they meet suboptimal habitat conditions. In Mauritania and Iberia, two genetically differentiated populations of harbor porpoises (Phocoena phocoena) form an ecotype adapted to local upwelling conditions and distinct from other ecotypes further north on the NE Atlantic continental shelf and in the Black Sea. By analyzing the evolution of mitochondrial genetic variation in the Iberian population between two temporal cohorts (1990-2002 vs. 2012-2015), we report a substantial decrease in genetic diversity. Phylogenetic analyses including neighboring populations identified two porpoises in southern Iberia carrying a divergent haplotype closely related to those from the Mauritanian population, yet forming a distinct lineage. This suggests that Iberian porpoises may not be as isolated as previously thought, indicating possible dispersion from Mauritania or an unknown population in between, but none from the northern ecotype. Demo-genetic scenario testing by approximate Bayesian computation showed that the rapid decline in the Iberian mitochondrial diversity was not simply due to the genetic drift of a small population, but models support instead a substantial decline in effective population size, possibly resulting from environmental stochasticity, prey depletion, or acute fishery bycatches. These results illustrate the value of genetics time series to inform demographic trends and emphasize the urgent need for conservation measures to ensure the viability of this small harbor porpoise population in Iberian waters.
Collapse
Affiliation(s)
- Yacine Ben Chehida
- Groningen Institute for Evolutionary Life Sciences (GELIFES)University of GroningenGroningenThe Netherlands
- Department of BiologyUniversity of YorkYorkUK
- Ecology and Evolutionary Biology, School of BiosciencesUniversity of SheffieldSheffieldUK
| | - Tjibbe Stelwagen
- Groningen Institute for Evolutionary Life Sciences (GELIFES)University of GroningenGroningenThe Netherlands
- BirdEyes, Centre for Global Ecological Change at the Faculties of Science & Engineering and Campus FryslânUniversity of GroningenLeeuwardenThe Netherlands
| | - Jeroen P. A. Hoekendijk
- Groningen Institute for Evolutionary Life Sciences (GELIFES)University of GroningenGroningenThe Netherlands
- Department of Coastal Systems, NIOZ Royal Netherlands Institute for Sea ResearchUtrecht UniversityTexelThe Netherlands
- Wageningen University & Research CentreWageningenThe Netherlands
| | - Marisa Ferreira
- MATB‐Portuguese Wildlife Society (SPVS)Figueira da FozPortugal
| | - Catarina Eira
- MATB‐Portuguese Wildlife Society (SPVS)Figueira da FozPortugal
- ECOMARE, Universidade de AveiroAveiroPortugal
- Centre for Environmental and Marine Studies CESAMUniversity of AveiroAveiroPortugal
| | - Andreia Torres‐Pereira
- MATB‐Portuguese Wildlife Society (SPVS)Figueira da FozPortugal
- ECOMARE, Universidade de AveiroAveiroPortugal
- Centre for Environmental and Marine Studies CESAMUniversity of AveiroAveiroPortugal
| | - Lidia Nicolau
- MATB‐Portuguese Wildlife Society (SPVS)Figueira da FozPortugal
| | - Julie Thumloup
- Groningen Institute for Evolutionary Life Sciences (GELIFES)University of GroningenGroningenThe Netherlands
| | - Michael C. Fontaine
- Groningen Institute for Evolutionary Life Sciences (GELIFES)University of GroningenGroningenThe Netherlands
- MIVEGEC, Univ. Montpellier, CNRS, IRDMontpellierFrance
| |
Collapse
|
3
|
Huang X, Athrey GN, Kaufman PE, Fredregill C, Slotman MA. Effective population size of Culex quinquefasciatus under insecticide-based vector management and following Hurricane Harvey in Harris County, Texas. Front Genet 2023; 14:1297271. [PMID: 38075683 PMCID: PMC10702589 DOI: 10.3389/fgene.2023.1297271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 10/24/2023] [Indexed: 02/12/2024] Open
Abstract
Introduction: Culex quinquefasciatus is a mosquito species of significant public health importance due to its ability to transmit multiple pathogens that can cause mosquito-borne diseases, such as West Nile fever and St. Louis encephalitis. In Harris County, Texas, Cx. quinquefasciatus is a common vector species and is subjected to insecticide-based management by the Harris County Public Health Department. However, insecticide resistance in mosquitoes has increased rapidly worldwide and raises concerns about maintaining the effectiveness of vector control approaches. This concern is highly relevant in Texas, with its humid subtropical climate along the Gulf Coast that provides suitable habitat for Cx. quinquefasciatus and other mosquito species that are known disease vectors. Therefore, there is an urgent and ongoing need to monitor the effectiveness of current vector control programs. Methods: In this study, we evaluated the impact of vector control approaches by estimating the effective population size of Cx. quinquefasciatus in Harris County. We applied Approximate Bayesian Computation to microsatellite data to estimate effective population size. We collected Cx. quinquefasciatus samples from two mosquito control operation areas; 415 and 802, during routine vector monitoring in 2016 and 2017. No county mosquito control operations were applied at area 415 in 2016 and 2017, whereas extensive adulticide spraying operations were in effect at area 802 during the summer of 2016. We collected data for eighteen microsatellite markers for 713 and 723 mosquitoes at eight timepoints from 2016 to 2017 in areas 415 and 802, respectively. We also investigated the impact of Hurricane Harvey's landfall in the Houston area in August of 2017 on Cx. quinquefasciatus population fluctuation. Results: We found that the bottleneck scenario was the most probable historical scenario describing the impact of the winter season at area 415 and area 802, with the highest posterior probability of 0.9167 and 0.4966, respectively. We also detected an expansion event following Hurricane Harvey at area 802, showing a 3.03-fold increase in 2017. Discussion: Although we did not detect significant effects of vector control interventions, we found considerable influences of the winter season and a major hurricane on the effective population size of Cx. quinquefasciatus. The fluctuations in effective population size in both areas showed a significant seasonal pattern. Additionally, the significant population expansion following Hurricane Harvey in 2017 supports the necessity for post-hurricane vector-control interventions.
Collapse
Affiliation(s)
- Xinyue Huang
- Department of Entomology, Texas A&M University, College Station, TX, United States
| | - Giridhar N. Athrey
- Department of Poultry Science, Texas A&M University, College Station, TX, United States
| | - Phillip E. Kaufman
- Department of Entomology, Texas A&M University, College Station, TX, United States
| | - Chris Fredregill
- Harris County Public Health, Mosquito & Vector Control Division, Houston, TX, United States
| | - Michel A. Slotman
- Department of Entomology, Texas A&M University, College Station, TX, United States
| |
Collapse
|
4
|
Korfmann K, Gaggiotti OE, Fumagalli M. Deep Learning in Population Genetics. Genome Biol Evol 2023; 15:evad008. [PMID: 36683406 PMCID: PMC9897193 DOI: 10.1093/gbe/evad008] [Citation(s) in RCA: 23] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2022] [Revised: 12/19/2022] [Accepted: 01/16/2023] [Indexed: 01/24/2023] Open
Abstract
Population genetics is transitioning into a data-driven discipline thanks to the availability of large-scale genomic data and the need to study increasingly complex evolutionary scenarios. With likelihood and Bayesian approaches becoming either intractable or computationally unfeasible, machine learning, and in particular deep learning, algorithms are emerging as popular techniques for population genetic inferences. These approaches rely on algorithms that learn non-linear relationships between the input data and the model parameters being estimated through representation learning from training data sets. Deep learning algorithms currently employed in the field comprise discriminative and generative models with fully connected, convolutional, or recurrent layers. Additionally, a wide range of powerful simulators to generate training data under complex scenarios are now available. The application of deep learning to empirical data sets mostly replicates previous findings of demography reconstruction and signals of natural selection in model organisms. To showcase the feasibility of deep learning to tackle new challenges, we designed a branched architecture to detect signals of recent balancing selection from temporal haplotypic data, which exhibited good predictive performance on simulated data. Investigations on the interpretability of neural networks, their robustness to uncertain training data, and creative representation of population genetic data, will provide further opportunities for technological advancements in the field.
Collapse
Affiliation(s)
- Kevin Korfmann
- Professorship for Population Genetics, Department of Life Science Systems, Technical University of Munich, Germany
| | - Oscar E Gaggiotti
- Centre for Biological Diversity, Sir Harold Mitchell Building, University of St Andrews, Fife KY16 9TF, UK
| | - Matteo Fumagalli
- Department of Biological and Behavioural Sciences, Queen Mary University of London, UK
| |
Collapse
|
5
|
Talis EJ, Che-Castaldo C, Şen B, Krumhardt K, Lynch HJ. Variability, skipped breeding and heavy-tailed dynamics in an Antarctic seabird. J Anim Ecol 2022; 91:2437-2450. [PMID: 36266757 DOI: 10.1111/1365-2656.13827] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/29/2022] [Indexed: 12/14/2022]
Abstract
The population dynamics of many colonially breeding seabirds are characterized by large interannual fluctuations that cannot be explained by environmental conditions alone. This variation may be particularly confounded by the use of skipped breeding by seabirds as a life-history strategy, which directly impacts the number of breeding pairs and may affect the accuracy of breeding abundance as a metric of population health. Additionally, large fluctuations in time series may suggest that the underlying population dynamics are heavy tailed, allowing for a higher likelihood of extreme events than expected under Gaussian dynamics. Here, we investigated the effect of demography on time series for abundance of the Adélie penguin Pygoscelis adeliae and explored the occurrence of heavy-tailed dynamics in observed Adélie time series. We focus this study on the Adélie penguin as it is an important bellwether species long used to track the impacts of climate change and fishing on the Southern Ocean ecosystem and shares life-history traits with many colonial seabirds. We quantified the impacts of demographic rates, including skipped breeding, on time series of Adélie abundance simulated using an age-structured model. We also used observed time series of Adélie breeding abundance at all known Antarctic colonies to classify distributions for abundance as Gaussian or non-Gaussian heavy tailed. We then identified the cause of such heavy-tailed dynamics in simulated time series and linked these to spatial patterns in Adélie food resource variability. We found that breeding propensity drives observed breeding fluctuations more than any other vital rate, with high variability in skipped breeding decoupling true abundance from observed breeding abundance. We also found several Antarctic regions characterized by heavy-tailed dynamics in abundance. These regions were often also characterized by high variability in zooplankton availability. In simulated time series, heavy-tailed dynamics were strongly linked to high variability in adult survival. Our results illustrate that stochastic variability in abundance dynamics, particularly the presence of variable rates of skipped breeding, can challenge our interpretation of fluctuations in abundance through time and obscure the relationship between key environmental drivers and population abundance.
Collapse
Affiliation(s)
- Emma J Talis
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York, USA.,Institute for Advanced Computational Science, Stony Brook University, Stony Brook, New York, USA
| | - Christian Che-Castaldo
- Institute for Advanced Computational Science, Stony Brook University, Stony Brook, New York, USA
| | - Bilgecan Şen
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA
| | - Kristen Krumhardt
- Climate and Global Dynamics, National Center for Atmospheric Research (NCAR), Boulder, Colorado, USA
| | - Heather J Lynch
- Institute for Advanced Computational Science, Stony Brook University, Stony Brook, New York, USA.,Department of Ecology and Evolution, Stony Brook University, Stony Brook, New York, USA
| |
Collapse
|
6
|
Abstract
Alleles that introgress between species can influence the evolutionary and ecological fate of species exposed to novel environments. Hybrid offspring of different species are often unfit, and yet it has long been argued that introgression can be a potent force in evolution, especially in plants. Over the last two decades, genomic data have increasingly provided evidence that introgression is a critically important source of genetic variation and that this additional variation can be useful in adaptive evolution of both animals and plants. Here, we review factors that influence the probability that foreign genetic variants provide long-term benefits (so-called adaptive introgression) and discuss their potential benefits. We find that introgression plays an important role in adaptive evolution, particularly when a species is far from its fitness optimum, such as when they expand their range or are subject to changing environments.
Collapse
Affiliation(s)
- Nathaniel B Edelman
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA; .,Current affiliation: Yale Institute for Biospheric Studies and Yale School of the Environment, Yale University, New Haven, Connecticut 06511, USA;
| | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138, USA;
| |
Collapse
|
7
|
Ruiz-Suarez S, Leos-Barajas V, Alvarez-Castro I, Morales JM. Using approximate Bayesian inference for a "steps and turns" continuous-time random walk observed at regular time intervals. PeerJ 2020; 8:e8452. [PMID: 32095333 PMCID: PMC7020826 DOI: 10.7717/peerj.8452] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2019] [Accepted: 12/23/2019] [Indexed: 11/20/2022] Open
Abstract
The study of animal movement is challenging because movement is a process modulated by many factors acting at different spatial and temporal scales. In order to describe and analyse animal movement, several models have been proposed which differ primarily in the temporal conceptualization, namely continuous and discrete time formulations. Naturally, animal movement occurs in continuous time but we tend to observe it at fixed time intervals. To account for the temporal mismatch between observations and movement decisions, we used a state-space model where movement decisions (steps and turns) are made in continuous time. That is, at any time there is a non-zero probability of making a change in movement direction. The movement process is then observed at regular time intervals. As the likelihood function of this state-space model turned out to be intractable yet simulating data is straightforward, we conduct inference using different variations of Approximate Bayesian Computation (ABC). We explore the applicability of this approach as a function of the discrepancy between the temporal scale of the observations and that of the movement process in a simulation study. Simulation results suggest that the model parameters can be recovered if the observation time scale is moderately close to the average time between changes in movement direction. Good estimates were obtained when the scale of observation was up to five times that of the scale of changes in direction. We demonstrate the application of this model to a trajectory of a sheep that was reconstructed in high resolution using information from magnetometer and GPS devices. The state-space model used here allowed us to connect the scales of the observations and movement decisions in an intuitive and easy to interpret way. Our findings underscore the idea that the time scale at which animal movement decisions are made needs to be considered when designing data collection protocols. In principle, ABC methods allow to make inferences about movement processes defined in continuous time but in terms of easily interpreted steps and turns.
Collapse
Affiliation(s)
- Sofia Ruiz-Suarez
- INIBIOMA (CONICET-Universidad Nacional del Comahue), Rio Negro, Argentina
- Facultad de Ciencias Económicas, Universidad Nacional de Rosario, Rosario, Argentina
| | - Vianey Leos-Barajas
- Department of Statistics, North Carolina State University, Raleigh, United States of America
- Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC, United States of America
| | | | | |
Collapse
|
8
|
Dumas E, Feurtey A, Rodríguez de la Vega RC, Le Prieur S, Snirc A, Coton M, Thierry A, Coton E, Le Piver M, Roueyre D, Ropars J, Branca A, Giraud T. Independent domestication events in the blue-cheese fungus Penicillium roqueforti. Mol Ecol 2020; 29:2639-2660. [PMID: 31960565 PMCID: PMC7497015 DOI: 10.1111/mec.15359] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2019] [Revised: 01/02/2020] [Accepted: 01/04/2020] [Indexed: 12/13/2022]
Abstract
Domestication provides an excellent framework for studying adaptive divergence. Using population genomics and phenotypic assays, we reconstructed the domestication history of the blue cheese mould Penicillium roqueforti. We showed that this fungus was domesticated twice independently. The population used in Roquefort originated from an old domestication event associated with weak bottlenecks and exhibited traits beneficial for pre‐industrial cheese production (slower growth in cheese and greater spore production on bread, the traditional multiplication medium). The other cheese population originated more recently from the selection of a single clonal lineage, was associated with all types of blue cheese worldwide except Roquefort, and displayed phenotypes more suited for industrial cheese production (high lipolytic activity, efficient cheese cavity colonization ability and salt tolerance). We detected genomic regions affected by recent positive selection and putative horizontal gene transfers. This study sheds light on the processes of rapid adaptation and raises questions about genetic resource conservation. see also the Perspective by Brigida Gallone, Jan Steensels and Kevin J. Verstrepen.
Collapse
Affiliation(s)
- Emilie Dumas
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France.,Laboratory for Molecular Immunology and Inflammation, Department of Rheumatology, University Hospital Ghent, The Vlaams Instituut voor Biotechnologie (VIB) Center for Inflammation Research (IRC), Ghent, Belgium
| | - Alice Feurtey
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France.,Environmental Genomics, Max Planck Institute for Evolutionary Biology, Plön, Germany
| | - Ricardo C Rodríguez de la Vega
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| | - Stéphanie Le Prieur
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| | - Alodie Snirc
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| | - Monika Coton
- Univ Brest, Laboratoire Universitaire de Biodiversité et Ecologie Microbienne, Plouzané, France
| | - Anne Thierry
- Science et Technologie du Lait et de l'Œuf (STLO), UMR1253, Agrocampus Ouest, INRAE, Rennes, France
| | - Emmanuel Coton
- Univ Brest, Laboratoire Universitaire de Biodiversité et Ecologie Microbienne, Plouzané, France
| | - Mélanie Le Piver
- Laboratoire Interprofessionnel de Production - SAS L.I.P, Aurillac, France
| | - Daniel Roueyre
- Laboratoire Interprofessionnel de Production - SAS L.I.P, Aurillac, France
| | - Jeanne Ropars
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| | - Antoine Branca
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| | - Tatiana Giraud
- Ecologie Systématique et Evolution, CNRS, AgroParisTech, Ecologie Systématique Evolution, Université Paris-Saclay, Orsay, France
| |
Collapse
|
9
|
Approximate Bayesian estimation of coevolutionary arms races. PLoS Comput Biol 2019; 15:e1006988. [PMID: 30986245 PMCID: PMC6483265 DOI: 10.1371/journal.pcbi.1006988] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2018] [Revised: 04/25/2019] [Accepted: 03/29/2019] [Indexed: 11/19/2022] Open
Abstract
Exaggerated traits involved in species interactions have long captivated the imagination of evolutionary biologists and inspired the durable metaphor of the coevolutionary arms race. Despite decades of research, however, we have only a handful of examples where reciprocal coevolutionary change has been rigorously established as the cause of trait exaggeration. Support for a coevolutionary mechanism remains elusive because we lack generally applicable tools for quantifying the intensity of coevolutionary selection. Here we develop an approximate Bayesian computation (ABC) approach for estimating the intensity of coevolutionary selection using population mean phenotypes of traits mediating interspecific interactions. Our approach relaxes important assumptions of a previous maximum likelihood approach by allowing gene flow among populations, variable abiotic environments, and strong coevolutionary selection. Using simulated data, we show that our ABC method accurately infers the strength of coevolutionary selection if reliable estimates are available for key background parameters and ten or more populations are sampled. Applying our approach to the putative arms race between the plant Camellia japonica and its seed predatory weevil, Curculio camelliae, provides support for a coevolutionary hypothesis but fails to preclude the possibility of unilateral evolution. Comparing independently estimated selection gradients acting on Camellia pericarp thickness with values simulated by our model reveals a correlation between predicted and observed selection gradients of 0.941. The strong agreement between predicted and observed selection gradients validates our method.
Collapse
|
10
|
Acosta JJ, Fahrenkrog AM, Neves LG, Resende MFR, Dervinis C, Davis JM, Holliday JA, Kirst M. Exome Resequencing Reveals Evolutionary History, Genomic Diversity, and Targets of Selection in the Conifers Pinus taeda and Pinus elliottii. Genome Biol Evol 2019; 11:508-520. [PMID: 30689841 PMCID: PMC6385631 DOI: 10.1093/gbe/evz016] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/22/2019] [Indexed: 12/22/2022] Open
Abstract
Loblolly pine (Pinus taeda) and slash pine (Pinus elliottii) are ecologically and economically important pine species that dominate many forest ecosystems in the southern United States, but like all conifers, the study of their genetic diversity and demographic history has been hampered by their large genome size. A small number of studies mainly based on candidate-gene sequencing have been reported for P. taeda to date, whereas none are available for P. elliottii. Targeted exome resequencing has recently enabled population genomics studies for conifers, approach used here to assess genomic diversity, signatures of selection, population structure, and demographic history of P. elliottii and P. taeda. Extensive similarities were revealed between these species: both species feature rapid linkage disequilibrium decay and high levels of genetic diversity. Moreover, genome-wide positive correlations for measures of genetic diversity between the species were also observed, likely due to shared structural genomic constraints. Also, positive selection appears to be targeting a common set of genes in both pines. Demographic history differs between both species, with only P. taeda being affected by a dramatic bottleneck during the last glacial period. The ability of P. taeda to recover from a dramatic reduction in population size while still retaining high levels of genetic diversity shows promise for other pines facing environmental stressors associated with climate change, indicating that these too may be able to adapt successfully to new future conditions even after a drastic population size contraction.
Collapse
Affiliation(s)
- Juan J Acosta
- School of Forest Resources and Conservation, University of Florida.,University of Florida Genetics Institute, University of Florida.,Camcore, Department of Forestry and Environmental Resources, North Carolina State University, Raleigh, NC
| | - Annette M Fahrenkrog
- School of Forest Resources and Conservation, University of Florida.,Plant Molecular and Cellular Biology Graduate Program, University of Florida
| | - Leandro G Neves
- School of Forest Resources and Conservation, University of Florida.,Plant Molecular and Cellular Biology Graduate Program, University of Florida.,RAPiD Genomics, Gainesville, FL
| | | | | | - John M Davis
- School of Forest Resources and Conservation, University of Florida
| | - Jason A Holliday
- Department of Forest Resources and Environmental Conservation, Virginia Polytechnic Institute and State University
| | - Matias Kirst
- School of Forest Resources and Conservation, University of Florida.,Plant Molecular and Cellular Biology Graduate Program, University of Florida.,University of Florida Genetics Institute, University of Florida
| |
Collapse
|
11
|
Kendall EA, Azman AS, Maartens G, Boulle A, Wilkinson RJ, Dowdy DW, Rangaka MX. Projected population-wide impact of antiretroviral therapy-linked isoniazid preventive therapy in a high-burden setting. AIDS 2019; 33:525-536. [PMID: 30325773 PMCID: PMC6355370 DOI: 10.1097/qad.0000000000002053] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
OBJECTIVE Both isoniazid preventive therapy (IPT) and antiretroviral therapy (ART) reduce tuberculosis risk in individuals living with HIV. We sought to estimate the broader, population-wide impact of providing a pragmatically implemented 12-month IPT regimen to ART recipients in a high-burden community. DESIGN Dynamic transmission model of a tuberculosis (TB)-HIV epidemic, calibrated to site-specific, historical epidemiologic and clinical trial data from Khayelitsha, South Africa. METHODS We projected the 5-year impact of delivering a 12-month IPT regimen community-wide to 85% of new ART initiators and 15%/year of those already on ART, accounting for IPT-attributable reductions in TB infection, progression, and transmission. We also evaluated scenarios of continuously-delivered IPT, ongoing ART scale-up, and lower tuberculosis incidence. RESULTS Under historical (early 2010) ART coverage, this ART-linked IPT intervention prevented one tuberculosis case per 18 [95% credible interval (CrI) 11-29] people treated. It lowered TB incidence by a projected 23% (95% CrI 14-30%) among people receiving ART, and by 5.2% (95% CrI 2.9-8.7%) in the total population. Continuous IPT reduced the number needed to treat to prevent one case of TB to 10 (95% CrI 7-16), though it required 74% more person-years of therapy (95% CrI 64-94%) to prevent one TB case, relative to 12-month therapy. Under expanding ART coverage, the tuberculosis incidence reduction achieved by 12-month IPT grew to 7.6% (95% CrI 4.3-12.6%). Effect sizes were similar in a simulated setting of lower TB incidence. CONCLUSIONS IPT in conjunction with ART reduces tuberculosis incidence among those who receive therapy and has additional impact on tuberculosis transmission in the population.
Collapse
Affiliation(s)
- Emily A Kendall
- Division of Infectious Diseases, Johns Hopkins University School of Medicine
| | - Andrew S Azman
- Division of Infectious Disease Epidemiology, Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Gary Maartens
- Division of Clinical Pharmacology, Department of Medicine, University of Cape Town
| | - Andrew Boulle
- Centre for Infectious Disease Epidemiology and Research, Department of Public Health and Family Medicine, University of Cape Town
| | - Robert J Wilkinson
- Wellcome Centre for Infectious Diseases Research in Africa, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
- Department of Medicine, Imperial College
- Francis Crick Institute
| | - David W Dowdy
- Division of Infectious Disease Epidemiology, Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, Maryland, USA
| | - Molebogeng X Rangaka
- Wellcome Centre for Infectious Diseases Research in Africa, Institute of Infectious Disease and Molecular Medicine, University of Cape Town, Cape Town, South Africa
- Institute for Global Health, University College London, London, UK
| |
Collapse
|
12
|
Species delimitation in the presence of strong incomplete lineage sorting and hybridization: Lessons from Ophioderma (Ophiuroidea: Echinodermata). Mol Phylogenet Evol 2019; 131:138-148. [DOI: 10.1016/j.ympev.2018.11.014] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2018] [Revised: 11/15/2018] [Accepted: 11/16/2018] [Indexed: 01/01/2023]
|
13
|
Dexter E, Bollens SM, Cordell J, Soh HY, Rollwagen-Bollens G, Pfeifer SP, Goudet J, Vuilleumier S. A genetic reconstruction of the invasion of the calanoid copepod Pseudodiaptomus inopinus across the North American Pacific Coast. Biol Invasions 2017. [DOI: 10.1007/s10530-017-1649-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
14
|
Xue AT, Hickerson MJ. multi-dice: r package for comparative population genomic inference under hierarchical co-demographic models of independent single-population size changes. Mol Ecol Resour 2017; 17:e212-e224. [PMID: 28449263 PMCID: PMC5724483 DOI: 10.1111/1755-0998.12686] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2017] [Revised: 03/14/2017] [Accepted: 04/14/2017] [Indexed: 01/25/2023]
Abstract
Population genetic data from multiple taxa can address comparative phylogeographic questions about community-scale response to environmental shifts, and a useful strategy to this end is to employ hierarchical co-demographic models that directly test multi-taxa hypotheses within a single, unified analysis. This approach has been applied to classical phylogeographic data sets such as mitochondrial barcodes as well as reduced-genome polymorphism data sets that can yield 10,000s of SNPs, produced by emergent technologies such as RAD-seq and GBS. A strategy for the latter had been accomplished by adapting the site frequency spectrum to a novel summarization of population genomic data across multiple taxa called the aggregate site frequency spectrum (aSFS), which potentially can be deployed under various inferential frameworks including approximate Bayesian computation, random forest and composite likelihood optimization. Here, we introduce the r package multi-dice, a wrapper program that exploits existing simulation software for flexible execution of hierarchical model-based inference using the aSFS, which is derived from reduced genome data, as well as mitochondrial data. We validate several novel software features such as applying alternative inferential frameworks, enforcing a minimal threshold of time surrounding co-demographic pulses and specifying flexible hyperprior distributions. In sum, multi-dice provides comparative analysis within the familiar R environment while allowing a high degree of user customization, and will thus serve as a tool for comparative phylogeography and population genomics.
Collapse
Affiliation(s)
- Alexander T. Xue
- Department of Biology: Subprogram in Ecology, Evolutionary Biology, and BehaviorCity College and Graduate Center of City University of New YorkNew YorkNYUSA
| | - Michael J. Hickerson
- Department of Biology: Subprogram in Ecology, Evolutionary Biology, and BehaviorCity College and Graduate Center of City University of New YorkNew YorkNYUSA
- Division of Invertebrate ZoologyAmerican Museum of Natural HistoryNew YorkNYUSA
| |
Collapse
|
15
|
Technow F, Messina CD, Totir LR, Cooper M. Integrating Crop Growth Models with Whole Genome Prediction through Approximate Bayesian Computation. PLoS One 2015; 10:e0130855. [PMID: 26121133 PMCID: PMC4488317 DOI: 10.1371/journal.pone.0130855] [Citation(s) in RCA: 114] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2015] [Accepted: 05/25/2015] [Indexed: 11/18/2022] Open
Abstract
Genomic selection, enabled by whole genome prediction (WGP) methods, is revolutionizing plant breeding. Existing WGP methods have been shown to deliver accurate predictions in the most common settings, such as prediction of across environment performance for traits with additive gene effects. However, prediction of traits with non-additive gene effects and prediction of genotype by environment interaction (G×E), continues to be challenging. Previous attempts to increase prediction accuracy for these particularly difficult tasks employed prediction methods that are purely statistical in nature. Augmenting the statistical methods with biological knowledge has been largely overlooked thus far. Crop growth models (CGMs) attempt to represent the impact of functional relationships between plant physiology and the environment in the formation of yield and similar output traits of interest. Thus, they can explain the impact of G×E and certain types of non-additive gene effects on the expressed phenotype. Approximate Bayesian computation (ABC), a novel and powerful computational procedure, allows the incorporation of CGMs directly into the estimation of whole genome marker effects in WGP. Here we provide a proof of concept study for this novel approach and demonstrate its use with synthetic data sets. We show that this novel approach can be considerably more accurate than the benchmark WGP method GBLUP in predicting performance in environments represented in the estimation set as well as in previously unobserved environments for traits determined by non-additive gene effects. We conclude that this proof of concept demonstrates that using ABC for incorporating biological knowledge in the form of CGMs into WGP is a very promising and novel approach to improving prediction accuracy for some of the most challenging scenarios in plant breeding and applied genetics.
Collapse
Affiliation(s)
- Frank Technow
- Breeding Technologies, DuPont Pioneer, Johnston, IA, USA
- * E-mail:
| | - Carlos D. Messina
- Trait Characterization & Development, DuPont Pioneer, Johnston, IA, USA
| | - L. Radu Totir
- Breeding Technologies, DuPont Pioneer, Johnston, IA, USA
| | - Mark Cooper
- Trait Characterization & Development, DuPont Pioneer, Johnston, IA, USA
| |
Collapse
|
16
|
Pratdesaba R, Segarra C, Aguadé M. Inferring the demographic history of Drosophila subobscura from nucleotide variation at regions not affected by chromosomal inversions. Mol Ecol 2015; 24:1729-41. [PMID: 25776124 DOI: 10.1111/mec.13155] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2014] [Revised: 03/09/2015] [Accepted: 03/11/2015] [Indexed: 11/29/2022]
Abstract
Drosophila subobscura presents a rich and complex chromosomal inversion polymorphism. It can thus be considered a model system (i) to study the mechanisms originating inversions and how inversions affect the levels and patterns of variation in the inverted regions and (ii) to study adaptation at both the single-gene and chromosomal inversion levels. It is therefore important to infer its demographic history as previous information indicated that its nucleotide variation is not at mutation-drift equilibrium. For that purpose, we sequenced 16 noncoding regions distributed across those parts of the J chromosome not affected by inversions in the studied population and possibly either by other selective events. The pattern of variation detected in these 16 regions is similar to that previously reported within different chromosomal arrangements, suggesting that the latter results would, thus, mainly reflect recent demographic events rather than the partial selective sweep imposed by the origin and frequency increase of inversions. Among the simple demographic models considered in our Approximate Bayesian Computation analysis of variation at the 16 regions, the model best supported by the data implies a population size expansion soon after the penultimate glacial period. This model constitutes a better null model, and it is therefore an important resource for subsequent studies aiming among others to uncover selective events across the species genome. Our results also highlight the importance of introducing the possibility of multiple hits in the coalescent simulations with an outgroup.
Collapse
Affiliation(s)
- Roser Pratdesaba
- Departament de Genètica, Facultat de Biologia and Institut de Recerca de la Biodiversitat (IRBio), Universitat de Barcelona, Diagonal 643, 08028, Barcelona, Spain
| | | | | |
Collapse
|
17
|
Schulte MB, Draghi JA, Plotkin JB, Andino R. Experimentally guided models reveal replication principles that shape the mutation distribution of RNA viruses. eLife 2015; 4. [PMID: 25635405 PMCID: PMC4311501 DOI: 10.7554/elife.03753] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2014] [Accepted: 12/31/2014] [Indexed: 12/31/2022] Open
Abstract
Life history theory posits that the sequence and timing of events in an organism's lifespan are fine-tuned by evolution to maximize the production of viable offspring. In a virus, a life history strategy is largely manifested in its replication mode. Here, we develop a stochastic mathematical model to infer the replication mode shaping the structure and mutation distribution of a poliovirus population in an intact single infected cell. We measure production of RNA and poliovirus particles through the infection cycle, and use these data to infer the parameters of our model. We find that on average the viral progeny produced from each cell are approximately five generations removed from the infecting virus. Multiple generations within a single cell infection provide opportunities for significant accumulation of mutations per viral genome and for intracellular selection. DOI:http://dx.doi.org/10.7554/eLife.03753.001 Viruses with genetic information made up of molecules of RNA can multiply quickly, but not very accurately. This means that many errors, or mutations, occur when the RNA is copied to create new viruses. The advantage of this rapid, but mistake-filled, RNA replication process is that some of the mutations will be beneficial to the virus. This allows viruses to rapidly evolve, for example, to develop resistance against drugs. The poliovirus is an RNA virus that can cause paralysis and death in humans. To prevent such infections, scientists have extensively studied the poliovirus and have developed effective vaccines against it that have eliminated the virus from all but a few countries. Because so much is known about the poliovirus and because it has a very simple structure, scientists continue to use the poliovirus as a model to study virus behavior. One unknown aspect of the poliovirus' behavior is how it replicates after invading a cell. Are all of its RNA copies made from the original viral RNA that first infected the cell, in what is known as a ‘stamping machine’ model? Or do the new copies of the RNA also get copied themselves in a ‘geometric replication mode’ that increases the likelihood of mutations and enables the virus to evolve more rapidly? Viral RNA molecules are copied by one of the virus's own proteins and so before the viral RNA can be replicated, it must first be translated to form viral proteins. When and where replication begins depends on the concentration of translated proteins around the RNA and so replication tends to begin in particular areas of the cell at different times. Schulte, Draghi et al. used mathematical modeling to create computer simulations of the number of polioviruses in a cell that take into account these time and space constraints. By including random elements in the model, the simulated behavior more accurately follows experimentally recorded data than previously used models. The results of the model led Schulte, Draghi et al. to conclude that the poliovirus replicates by the ‘geometric mode’; as new copies of the poliovirus RNA are made, each copy goes on to make more copies. This means that in a single infected cell there are multiple generations of RNA, and each generation may undergo distinct mutations that are passed on to the next set of RNA copies. In fact, Schulte, Draghi et al. found that the average virus released from an infected cell is the great-great-great-granddaughter of the original virus that infected the cell. With so many different generations of virus coexisting in a cell, there are a lot of opportunities for new genetic combinations to occur and for viruses to evolve new abilities. DOI:http://dx.doi.org/10.7554/eLife.03753.002
Collapse
Affiliation(s)
- Michael B Schulte
- Tetrad Graduate Program, University of California, San Francisco, San Francisco, United States
| | - Jeremy A Draghi
- Department of Biology, University of Pennsylvania, Philadelphia, United States
| | - Joshua B Plotkin
- Department of Biology, University of Pennsylvania, Philadelphia, United States
| | - Raul Andino
- Department of Microbiology and Immunology, University of California, San Francisco, San Francisco, United States
| |
Collapse
|
18
|
Sandoval-Castellanos E, Palkopoulou E, Dalén L. Back to BaySICS: a user-friendly program for Bayesian Statistical Inference from Coalescent Simulations. PLoS One 2014; 9:e98011. [PMID: 24865457 PMCID: PMC4035278 DOI: 10.1371/journal.pone.0098011] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Accepted: 04/28/2014] [Indexed: 12/02/2022] Open
Abstract
Inference of population demographic history has vastly improved in recent years due to a number of technological and theoretical advances including the use of ancient DNA. Approximate Bayesian computation (ABC) stands among the most promising methods due to its simple theoretical fundament and exceptional flexibility. However, limited availability of user-friendly programs that perform ABC analysis renders it difficult to implement, and hence programming skills are frequently required. In addition, there is limited availability of programs able to deal with heterochronous data. Here we present the software BaySICS: Bayesian Statistical Inference of Coalescent Simulations. BaySICS provides an integrated and user-friendly platform that performs ABC analyses by means of coalescent simulations from DNA sequence data. It estimates historical demographic population parameters and performs hypothesis testing by means of Bayes factors obtained from model comparisons. Although providing specific features that improve inference from datasets with heterochronous data, BaySICS also has several capabilities making it a suitable tool for analysing contemporary genetic datasets. Those capabilities include joint analysis of independent tables, a graphical interface and the implementation of Markov-chain Monte Carlo without likelihoods.
Collapse
Affiliation(s)
- Edson Sandoval-Castellanos
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden; Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Eleftheria Palkopoulou
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden; Department of Zoology, Stockholm University, Stockholm, Sweden
| | - Love Dalén
- Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
| |
Collapse
|
19
|
Liepe J, Kirk P, Filippi S, Toni T, Barnes CP, Stumpf MP. A framework for parameter estimation and model selection from experimental data in systems biology using approximate Bayesian computation. Nat Protoc 2014; 9:439-56. [PMID: 24457334 PMCID: PMC5081097 DOI: 10.1038/nprot.2014.025] [Citation(s) in RCA: 113] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
As modeling becomes a more widespread practice in the life sciences and biomedical sciences, researchers need reliable tools to calibrate models against ever more complex and detailed data. Here we present an approximate Bayesian computation (ABC) framework and software environment, ABC-SysBio, which is a Python package that runs on Linux and Mac OS X systems and that enables parameter estimation and model selection in the Bayesian formalism by using sequential Monte Carlo (SMC) approaches. We outline the underlying rationale, discuss the computational and practical issues and provide detailed guidance as to how the important tasks of parameter inference and model selection can be performed in practice. Unlike other available packages, ABC-SysBio is highly suited for investigating, in particular, the challenging problem of fitting stochastic models to data. In order to demonstrate the use of ABC-SysBio, in this protocol we postulate the existence of an imaginary reaction network composed of seven interrelated biological reactions (involving a specific mRNA, the protein it encodes and a post-translationally modified version of the protein), a network that is defined by two files containing 'observed' data that we provide as supplementary information. In the first part of the PROCEDURE, ABC-SysBio is used to infer the parameters of this system, whereas in the second part we use ABC-SysBio's relevant functionality to discriminate between two different reaction network models, one of them being the 'true' one. Although computationally expensive, the additional insights gained in the Bayesian formalism more than make up for this cost, especially in complex problems.
Collapse
Affiliation(s)
- Juliane Liepe
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London
| | - Paul Kirk
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London
| | - Sarah Filippi
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London
| | - Tina Toni
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London
| | - Chris P. Barnes
- Department of Cell and Developmental Biology, University College London
| | - Michael P.H. Stumpf
- Centre for Integrative Systems Biology and Bioinformatics, Department of Life Sciences, Imperial College London
- Institute of Chemical Biology, Imperial College London
| |
Collapse
|
20
|
Excoffier L, Dupanloup I, Huerta-Sánchez E, Sousa VC, Foll M. Robust demographic inference from genomic and SNP data. PLoS Genet 2013; 9:e1003905. [PMID: 24204310 PMCID: PMC3812088 DOI: 10.1371/journal.pgen.1003905] [Citation(s) in RCA: 909] [Impact Index Per Article: 75.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2013] [Accepted: 09/11/2013] [Indexed: 01/09/2023] Open
Abstract
We introduce a flexible and robust simulation-based framework to infer demographic parameters from the site frequency spectrum (SFS) computed on large genomic datasets. We show that our composite-likelihood approach allows one to study evolutionary models of arbitrary complexity, which cannot be tackled by other current likelihood-based methods. For simple scenarios, our approach compares favorably in terms of accuracy and speed with ∂a∂i, the current reference in the field, while showing better convergence properties for complex models. We first apply our methodology to non-coding genomic SNP data from four human populations. To infer their demographic history, we compare neutral evolutionary models of increasing complexity, including unsampled populations. We further show the versatility of our framework by extending it to the inference of demographic parameters from SNP chips with known ascertainment, such as that recently released by Affymetrix to study human origins. Whereas previous ways of handling ascertained SNPs were either restricted to a single population or only allowed the inference of divergence time between a pair of populations, our framework can correctly infer parameters of more complex models including the divergence of several populations, bottlenecks and migration. We apply this approach to the reconstruction of African demography using two distinct ascertained human SNP panels studied under two evolutionary models. The two SNP panels lead to globally very similar estimates and confidence intervals, and suggest an ancient divergence (>110 Ky) between Yoruba and San populations. Our methodology appears well suited to the study of complex scenarios from large genomic data sets.
Collapse
Affiliation(s)
- Laurent Excoffier
- CMPG, Institute of Ecology and Evolution, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Isabelle Dupanloup
- CMPG, Institute of Ecology and Evolution, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Emilia Huerta-Sánchez
- Center for Theoretical Evolutionary Genomics, Department of Integrative Biology, University of California, Berkeley, Berkeley, California, United States of America
| | - Vitor C. Sousa
- CMPG, Institute of Ecology and Evolution, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Matthieu Foll
- CMPG, Institute of Ecology and Evolution, Berne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| |
Collapse
|
21
|
Gilabert A, Wasmuth JD. Unravelling parasitic nematode natural history using population genetics. Trends Parasitol 2013; 29:438-48. [PMID: 23948430 DOI: 10.1016/j.pt.2013.07.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2013] [Revised: 07/10/2013] [Accepted: 07/11/2013] [Indexed: 01/01/2023]
Abstract
The health and economic importance of parasitic nematodes cannot be overstated. Moreover, they offer a complex and diverse array of life strategies, raising a multitude of evolutionary questions. Researchers are applying population genetics to parasitic nematodes in order to disentangle some aspects of their life strategies, improve our knowledge about disease epidemiology, and design control strategies. However, population genetics studies of nematodes have been constrained due to the difficulty in sampling nematodes and developing molecular markers. In this context, new computational and sequencing technologies represent promising tools to investigate population genomics of parasitic, non-model, nematode species in an epidemiological context.
Collapse
Affiliation(s)
- Aude Gilabert
- Department of Ecosystem and Public Health, Faculty of Veterinary Medicine, University of Calgary, 3280 Hospital Drive NW, Calgary, Alberta, T2N 4Z6, Canada
| | | |
Collapse
|
22
|
Matsuoka Y, Nasuda S, Ashida Y, Nitta M, Tsujimoto H, Takumi S, Kawahara T. Genetic basis for spontaneous hybrid genome doubling during allopolyploid speciation of common wheat shown by natural variation analyses of the paternal species. PLoS One 2013; 8:e68310. [PMID: 23950867 PMCID: PMC3738567 DOI: 10.1371/journal.pone.0068310] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2013] [Accepted: 05/28/2013] [Indexed: 11/19/2022] Open
Abstract
The complex process of allopolyploid speciation includes various mechanisms ranging from species crosses and hybrid genome doubling to genome alterations and the establishment of new allopolyploids as persisting natural entities. Currently, little is known about the genetic mechanisms that underlie hybrid genome doubling, despite the fact that natural allopolyploid formation is highly dependent on this phenomenon. We examined the genetic basis for the spontaneous genome doubling of triploid F1 hybrids between the direct ancestors of allohexaploid common wheat (Triticum aestivum L., AABBDD genome), namely Triticumturgidum L. (AABB genome) and Aegilopstauschii Coss. (DD genome). An Ae. tauschii intraspecific lineage that is closely related to the D genome of common wheat was identified by population-based analysis. Two representative accessions, one that produces a high-genome-doubling-frequency hybrid when crossed with a T. turgidum cultivar and the other that produces a low-genome-doubling-frequency hybrid with the same cultivar, were chosen from that lineage for further analyses. A series of investigations including fertility analysis, immunostaining, and quantitative trait locus (QTL) analysis showed that (1) production of functional unreduced gametes through nonreductional meiosis is an early step key to successful hybrid genome doubling, (2) first division restitution is one of the cytological mechanisms that cause meiotic nonreduction during the production of functional male unreduced gametes, and (3) six QTLs in the Ae. tauschii genome, most of which likely regulate nonreductional meiosis and its subsequent gamete production processes, are involved in hybrid genome doubling. Interlineage comparisons of Ae. tauschii's ability to cause hybrid genome doubling suggested an evolutionary model for the natural variation pattern of the trait in which non-deleterious mutations in six QTLs may have important roles. The findings of this study demonstrated that the genetic mechanisms for hybrid genome doubling could be studied based on the intrinsic natural variation that exists in the parental species.
Collapse
Affiliation(s)
| | - Shuhei Nasuda
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kitashirakawaoiwake-cho, Sakyo-ku, Kyoto, Japan
| | - Yasuyo Ashida
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kitashirakawaoiwake-cho, Sakyo-ku, Kyoto, Japan
| | - Miyuki Nitta
- Laboratory of Plant Genetics, Graduate School of Agriculture, Kyoto University, Kitashirakawaoiwake-cho, Sakyo-ku, Kyoto, Japan
| | - Hisashi Tsujimoto
- Laboratory of Molecular Breeding, Arid Land Research Center, Tottori University, Tottori-shi, Tottori, Japan
| | - Shigeo Takumi
- Laboratory of Plant Genetics, Graduate School of Agricultural Science, Kobe University, Nada-ku, Kobe, Japan
| | - Taihachi Kawahara
- Laboratory of Crop Evolution, Plant Germ-plasm Institute, Graduate School of Agriculture, Kyoto University, Mozume, Muko, Kyoto, Japan
| |
Collapse
|
23
|
Blum MGB, Nunes MA, Prangle D, Sisson SA. A Comparative Review of Dimension Reduction Methods in Approximate Bayesian Computation. Stat Sci 2013. [DOI: 10.1214/12-sts406] [Citation(s) in RCA: 198] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
|
24
|
Frost SDW, Volz EM. Modelling tree shape and structure in viral phylodynamics. Philos Trans R Soc Lond B Biol Sci 2013; 368:20120208. [PMID: 23382430 PMCID: PMC3678332 DOI: 10.1098/rstb.2012.0208] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Epidemiological models have highlighted the importance of population structure in the transmission dynamics of infectious diseases. Using HIV-1 as an example of a model evolutionary system, we consider how population structure affects the shape and the structure of a viral phylogeny in the absence of strong selection at the population level. For structured populations, the number of lineages as a function of time is insufficient to describe the shape of the phylogeny. We develop deterministic approximations for the dynamics of tips of the phylogeny over evolutionary time, the number of ‘cherries’, tips that share a direct common ancestor, and Sackin's index, a commonly used measure of phylogenetic imbalance or asymmetry. We employ cherries both as a measure of asymmetry of the tree as well as a measure of the association between sequences from different groups. We consider heterogeneity in infectiousness associated with different stages of HIV infection, and in contact rates between groups of individuals. In the absence of selection, we find that population structure may have relatively little impact on the overall asymmetry of a tree, especially when only a small fraction of infected individuals is sampled, but may have marked effects on how sequences from different subpopulations cluster and co-cluster.
Collapse
Affiliation(s)
- Simon D W Frost
- Department of Veterinary Medicine, University of Cambridge, Madingley Road, Cambridge, Cambridgeshire CB3 0ES, UK.
| | | |
Collapse
|
25
|
Abstract
Approximate Bayesian computation has become an essential tool for the analysis of complex stochastic models when the likelihood function is numerically unavailable. However, the well-established statistical method of empirical likelihood provides another route to such settings that bypasses simulations from the model and the choices of the approximate Bayesian computation parameters (summary statistics, distance, tolerance), while being convergent in the number of observations. Furthermore, bypassing model simulations may lead to significant time savings in complex models, for instance those found in population genetics. The Bayesian computation with empirical likelihood algorithm we develop in this paper also provides an evaluation of its own performance through an associated effective sample size. The method is illustrated using several examples, including estimation of standard distributions, time series, and population genetics models.
Collapse
Affiliation(s)
- Kerrie L. Mengersen
- School of Mathematical Sciences, Queensland University of Technology, Brisbane, QLD 4001, Australia
| | - Pierre Pudlo
- Centre de Biologie pour la Gestion des Populations, Institut National de la Recherche Agronomique, 34988 Montferrier-sur-Lez Cedex, France
- Université Montpellier 2, Institut de Mathématiques et de Modélisation de Montpellier, 34095 Montpellier Cedex 5, France
- Institut de Biologie Computationnelle, Montpellier, France
| | - Christian P. Robert
- Université Paris Dauphine, Centre de Recherche en Mathematiques de la Decision, 75775 Paris Cedex 16, France
- Institut Universitaire de France, Paris, France; and
- Centre de Recherche en Statistique et Economie, 92245 Malakoff Cedex, France
| |
Collapse
|
26
|
Athrey G, Hodges TK, Reddy MR, Overgaard HJ, Matias A, Ridl FC, Kleinschmidt I, Caccone A, Slotman MA. The effective population size of malaria mosquitoes: large impact of vector control. PLoS Genet 2012; 8:e1003097. [PMID: 23271973 PMCID: PMC3521722 DOI: 10.1371/journal.pgen.1003097] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2012] [Accepted: 10/01/2012] [Indexed: 12/01/2022] Open
Abstract
Malaria vectors in sub-Saharan Africa have proven themselves very difficult adversaries in the global struggle against malaria. Decades of anti-vector interventions have yielded mixed results--with successful reductions in transmission in some areas and limited impacts in others. These varying successes can be ascribed to a lack of universally effective vector control tools, as well as the development of insecticide resistance in mosquito populations. Understanding the impact of vector control on mosquito populations is crucial for planning new interventions and evaluating existing ones. However, estimates of population size changes in response to control efforts are often inaccurate because of limitations and biases in collection methods. Attempts to evaluate the impact of vector control on mosquito effective population size (N(e)) have produced inconclusive results thus far. Therefore, we obtained data for 13-15 microsatellite markers for more than 1,500 mosquitoes representing multiple time points for seven populations of three important vector species--Anopheles gambiae, An. melas, and An. moucheti--in Equatorial Guinea. These populations were exposed to indoor residual spraying or long-lasting insecticidal nets in recent years. For comparison, we also analyzed data from two populations that have no history of organized vector control. We used Approximate Bayesian Computation to reconstruct their demographic history, allowing us to evaluate the impact of these interventions on the effective population size. In six of the seven study populations, vector control had a dramatic impact on the effective population size, reducing N(e) between 55%-87%, the exception being a single An. melas population. In contrast, the two negative control populations did not experience a reduction in effective population size. This study is the first to conclusively link anti-vector intervention programs in Africa to sharply reduced effective population sizes of malaria vectors.
Collapse
Affiliation(s)
- Giridhar Athrey
- Department of Entomology, Texas A&M University, College Station, Texas, United States of America.
| | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Topping CJ, Dalkvist T, Grimm V. Post-hoc pattern-oriented testing and tuning of an existing large model: lessons from the field vole. PLoS One 2012; 7:e45872. [PMID: 23049882 PMCID: PMC3457952 DOI: 10.1371/journal.pone.0045872] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2012] [Accepted: 08/27/2012] [Indexed: 11/24/2022] Open
Abstract
Pattern-oriented modeling (POM) is a general strategy for modeling complex systems. In POM, multiple patterns observed at different scales and hierarchical levels are used to optimize model structure, to test and select sub-models of key processes, and for calibration. So far, POM has been used for developing new models and for models of low to moderate complexity. It remains unclear, though, whether the basic idea of POM to utilize multiple patterns, could also be used to test and possibly develop existing and established models of high complexity. Here, we use POM to test, calibrate, and further develop an existing agent-based model of the field vole (Microtus agrestis), which was developed and tested within the ALMaSS framework. This framework is complex because it includes a high-resolution representation of the landscape and its dynamics, of the individual’s behavior, and of the interaction between landscape and individual behavior. Results of fitting to the range of patterns chosen were generally very good, but the procedure required to achieve this was long and complicated. To obtain good correspondence between model and the real world it was often necessary to model the real world environment closely. We therefore conclude that post-hoc POM is a useful and viable way to test a highly complex simulation model, but also warn against the dangers of over-fitting to real world patterns that lack details in their explanatory driving factors. To overcome some of these obstacles we suggest the adoption of open-science and open-source approaches to ecological simulation modeling.
Collapse
|
28
|
Impact of sampling schemes on demographic inference: an empirical study in two species with different mating systems and demographic histories. G3-GENES GENOMES GENETICS 2012; 2:803-14. [PMID: 22870403 PMCID: PMC3385986 DOI: 10.1534/g3.112.002410] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2012] [Accepted: 05/10/2012] [Indexed: 12/12/2022]
Abstract
Most species have at least some level of genetic structure. Recent simulation studies have shown that it is important to consider population structure when sampling individuals to infer past population history. The relevance of the results of these computer simulations for empirical studies, however, remains unclear. In the present study, we use DNA sequence datasets collected from two closely related species with very different histories, the selfing species Capsella rubella and its outcrossing relative C. grandiflora, to assess the impact of different sampling strategies on summary statistics and the inference of historical demography. Sampling strategy did not strongly influence the mean values of Tajima's D in either species, but it had some impact on the variance. The general conclusions about demographic history were comparable across sampling schemes even when resampled data were analyzed with approximate Bayesian computation (ABC). We used simulations to explore the effects of sampling scheme under different demographic models. We conclude that when sequences from modest numbers of loci (<60) are analyzed, the sampling strategy is generally of limited importance. The same is true under intermediate or high levels of gene flow (4Nm > 2-10) in models in which global expansion is combined with either local expansion or hierarchical population structure. Although we observe a less severe effect of sampling than predicted under some earlier simulation models, our results should not be seen as an encouragement to neglect this issue. In general, a good coverage of the natural range, both within and between populations, will be needed to obtain a reliable reconstruction of a species's demographic history, and in fact, the effect of sampling scheme on polymorphism patterns may itself provide important information about demographic history.
Collapse
|
29
|
Werneck FP, Gamble T, Colli GR, Rodrigues MT, Sites Jr JW. DEEP DIVERSIFICATION AND LONG-TERM PERSISTENCE IN THE SOUTH AMERICAN ‘DRY DIAGONAL’: INTEGRATING CONTINENT-WIDE PHYLOGEOGRAPHY AND DISTRIBUTION MODELING OF GECKOS. Evolution 2012; 66:3014-34. [DOI: 10.1111/j.1558-5646.2012.01682.x] [Citation(s) in RCA: 144] [Impact Index Per Article: 11.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
30
|
Camargo A, Morando M, Avila LJ, Sites JW. SPECIES DELIMITATION WITH ABC AND OTHER COALESCENT-BASED METHODS: A TEST OF ACCURACY WITH SIMULATIONS AND AN EMPIRICAL EXAMPLE WITH LIZARDS OF THE LIOLAEMUS DARWINII COMPLEX (SQUAMATA: LIOLAEMIDAE). Evolution 2012; 66:2834-49. [PMID: 22946806 DOI: 10.1111/j.1558-5646.2012.01640.x] [Citation(s) in RCA: 157] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022]
Affiliation(s)
- Arley Camargo
- Department of Biology & Monte L Bean Museum, Brigham Young University, Provo, Utah 84602, USA.
| | | | | | | |
Collapse
|
31
|
Lenstra JA, Groeneveld LF, Eding H, Kantanen J, Williams JL, Taberlet P, Nicolazzi EL, Sölkner J, Simianer H, Ciani E, Garcia JF, Bruford MW, Ajmone-Marsan P, Weigend S. Molecular tools and analytical approaches for the characterization of farm animal genetic diversity. Anim Genet 2012; 43:483-502. [DOI: 10.1111/j.1365-2052.2011.02309.x] [Citation(s) in RCA: 86] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/15/2011] [Indexed: 12/30/2022]
Affiliation(s)
- J. A. Lenstra
- Faculty of Veterinary Medicine; Utrecht University; Utrecht; The Netherlands
| | - L. F. Groeneveld
- Institute of Farm Animal Genetics; Friedrich-Loeffler-Institut; Hoeltystr. 10; 31535; Neustadt; Germany
| | - H. Eding
- Animal Evaluations Unit; CRV; Arnhem; The Netherlands
| | - J. Kantanen
- Biotechnology and Food Research; MTT Agrifood Research Finland; FI-31600; Jokioinen; Finland
| | - J. L. Williams
- Parco Tecnologico Padano; via Einstein; 2600; Lodi; Italy
| | - P. Taberlet
- Laboratoire d'Ecologie Alpine; Université Joseph Fourier; BP 53; Grenoble; France
| | - E. L. Nicolazzi
- Istituto di Zootecnica and BioDNA Research Centre; Università Cattolica del Sacro Cuore; Piacenza; Italy
| | - J. Sölkner
- Department of Sustainable Agricultural Systems; Animal Breeding Group; BOKU - University of Natural Resources and Life Sciences; Vienna; Austria
| | - H. Simianer
- Department of Animal Sciences; Animal Breeding and Genetics Group; Georg-August-University Göttingen; 37075; Göttingen; Germany
| | - E. Ciani
- Department of General and Environmental Physiology; University of Bari “Aldo Moro”; Bari; Italy
| | - J. F. Garcia
- Universidade Estadual Paulista; Araçatuba; Brazil
| | - M. W. Bruford
- Organisms and Environment Division; School of Biosciences; Cardiff University; Cardiff; UK
| | - P. Ajmone-Marsan
- Istituto di Zootecnica and BioDNA Research Centre; Università Cattolica del Sacro Cuore; Piacenza; Italy
| | - S. Weigend
- Institute of Farm Animal Genetics; Friedrich-Loeffler-Institut; Hoeltystr. 10; 31535; Neustadt; Germany
| |
Collapse
|
32
|
Mamidi S, Rossi M, Annam D, Moghaddam S, Lee R, Papa R, McClean P. Investigation of the domestication of common bean (Phaseolus vulgaris) using multilocus sequence data. FUNCTIONAL PLANT BIOLOGY : FPB 2011; 38:953-967. [PMID: 32480954 DOI: 10.1071/fp11124] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/21/2011] [Accepted: 09/15/2011] [Indexed: 05/24/2023]
Abstract
Multilocus sequence data collected from domesticated and related wild relatives provides a rich source of information on the effect of human selection on the diversity and adaptability of a species to complex environments. To evaluate the domestication history of common bean (Phaseolus vulgaris L.), multilocus sequence data from landraces representing the various races within the Middle American (MA) and Andean gene pools was evaluated. Across 13 loci, nucleotide diversity was similar between landraces and wild germplasm in both gene pools. The diversity data were evaluated using the approximate Bayesian computation approach to test multiple domestication models and estimate population demographic parameters. A model with a single domestication event coupled with bidirectional migration between wild and domesticated genotypes fitted the data better than models consisting of two or three domestication events in each genepool. The effective bottleneck population size was ~50% of the base population in each genepool. The bottleneck began ~8200 and ~8500 years before present and ended at ~6300 and ~7000 years before present in MA and Andean gene pools respectively. Linkage disequilibrium decayed to a greater extent in the MA genepool. Given the (1) geographical adaptation bottleneck in each wild gene pool, (2) a subsequent domestication bottleneck within each gene pool, (3) differentiation into gene-pool specific races and (4) variable extents of linkage disequilibrium, association mapping experiments for common bean would more appropriately be performed within each genepool.
Collapse
Affiliation(s)
- Sujan Mamidi
- North Dakota State University, Department of Plant Sciences, Fargo, ND 58102, USA
| | - Monica Rossi
- Università Politecnica delle Marche, Scienze Ambientali e delle Produzioni Vegetali, Ancona, Italy
| | - Deepti Annam
- North Dakota State University, Department of Statistics, Fargo, ND 58102, USA
| | - Samira Moghaddam
- North Dakota State University, Department of Plant Sciences, Fargo, ND 58102, USA
| | - Rian Lee
- North Dakota State University, Department of Plant Sciences, Fargo, ND 58102, USA
| | - Roberto Papa
- Università Politecnica delle Marche, Scienze Ambientali e delle Produzioni Vegetali, Ancona, Italy
| | - Phillip McClean
- North Dakota State University, Department of Plant Sciences, Fargo, ND 58102, USA
| |
Collapse
|
33
|
Thomas JC, Godfrey PA, Feldgarden M, Robinson DA. Candidate targets of balancing selection in the genome of Staphylococcus aureus. Mol Biol Evol 2011; 29:1175-86. [PMID: 22114360 DOI: 10.1093/molbev/msr286] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Abstract
Signatures of balancing selection can highlight polymorphisms and functions that are important to the long-term fitness of a species. We performed a first genome-wide scan for balancing selection in a bacterial species, Staphylococcus aureus, which is a common cause of serious antimicrobial-resistant infections of humans. Using a sliding window approach, the genomes of 16 strains of S. aureus, including 5 new genome sequences presented here, and 1 outgroup strain of S. epidermidis were scanned for signatures of balancing selection. A total of 195 short windows were investigated based on their extreme values of both Tajima's D (>2.03) and π/K ratios (>0.12) relative to the rest of the genome. To test the unusualness of these windows, an Approximate Bayesian Computation framework was used to select a null demographic model that better accounted for the observed data than did the standard neutral model. A total of 186 windows were demonstrated to be unusual under the null model and, thus, represented candidate loci under balancing selection. These 186 candidate windows were located within 99 candidate genes that were spread across 62 different loci. Nearly all the signal (97.2%) was located within coding sequences; balancing selection on gene regulation apparently occurs through the targeting of global regulators such as agr and gra/aps. The agr locus had some of the strongest signatures of balancing selection, which provides new insight into the causes of diversity at this locus. The list of candidate genes included multiple virulence-associated genes and was significantly enriched for functions in amino acid and inorganic ion transport and metabolism and in defense mechanisms against innate immunity and antimicrobials, highlighting these particular functions as important to the fitness of this pathogen.
Collapse
Affiliation(s)
- Jonathan C Thomas
- Department of Microbiology, University of Mississippi Medical Center, USA
| | | | | | | |
Collapse
|
34
|
Choice of summary statistic weights in approximate Bayesian computation. Stat Appl Genet Mol Biol 2011; 10:/j/sagmb.2011.10.issue-1/1544-6115.1586/1544-6115.1586.xml. [PMID: 23089822 DOI: 10.2202/1544-6115.1586] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Abstract
In this paper, we develop a Genetic Algorithm that can address the fundamental problem of how one should weight the summary statistics included in an approximate Bayesian computation analysis built around an accept/reject algorithm, and how one might choose the tolerance for that analysis. We then demonstrate that using weighted statistics, and a well-chosen tolerance, in such an approximate Bayesian computation approach can result in improved performance, when compared to unweighted analyses, using one example drawn purely from statistics and two drawn from the estimation of population genetics parameters.
Collapse
|
35
|
Abstract
Approximate Bayesian computation (ABC) have become an essential tool for the analysis of complex stochastic models. Grelaud et al. [(2009) Bayesian Anal 3:427-442] advocated the use of ABC for model choice in the specific case of Gibbs random fields, relying on an intermodel sufficiency property to show that the approximation was legitimate. We implemented ABC model choice in a wide range of phylogenetic models in the Do It Yourself-ABC (DIY-ABC) software [Cornuet et al. (2008) Bioinformatics 24:2713-2719]. We now present arguments as to why the theoretical arguments for ABC model choice are missing, because the algorithm involves an unknown loss of information induced by the use of insufficient summary statistics. The approximation error of the posterior probabilities of the models under comparison may thus be unrelated with the computational effort spent in running an ABC algorithm. We then conclude that additional empirical verifications of the performances of the ABC procedure as those available in DIY-ABC are necessary to conduct model choice.
Collapse
|
36
|
Beaumont MA. Approximate Bayesian Computation in Evolution and Ecology. ANNUAL REVIEW OF ECOLOGY EVOLUTION AND SYSTEMATICS 2010. [DOI: 10.1146/annurev-ecolsys-102209-144621] [Citation(s) in RCA: 725] [Impact Index Per Article: 48.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Mark A. Beaumont
- Department of Mathematics and School of Biological Sciences, University of Bristol, Bristol BS8 1TNW, United Kingdom;
| |
Collapse
|
37
|
Cornuet JM, Ravigné V, Estoup A. Inference on population history and model checking using DNA sequence and microsatellite data with the software DIYABC (v1.0). BMC Bioinformatics 2010; 11:401. [PMID: 20667077 PMCID: PMC2919520 DOI: 10.1186/1471-2105-11-401] [Citation(s) in RCA: 331] [Impact Index Per Article: 22.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2010] [Accepted: 07/28/2010] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Approximate Bayesian computation (ABC) is a recent flexible class of Monte-Carlo algorithms increasingly used to make model-based inference on complex evolutionary scenarios that have acted on natural populations. The software DIYABC offers a user-friendly interface allowing non-expert users to consider population histories involving any combination of population divergences, admixtures and population size changes. We here describe and illustrate new developments of this software that mainly include (i) inference from DNA sequence data in addition or separately to microsatellite data, (ii) the possibility to analyze five categories of loci considering balanced or non balanced sex ratios: autosomal diploid, autosomal haploid, X-linked, Y-linked and mitochondrial, and (iii) the possibility to perform model checking computation to assess the "goodness-of-fit" of a model, a feature of ABC analysis that has been so far neglected. RESULTS We used controlled simulated data sets generated under evolutionary scenarios involving various divergence and admixture events to evaluate the effect of mixing autosomal microsatellite, mtDNA and/or nuclear autosomal DNA sequence data on inferences. This evaluation included the comparison of competing scenarios and the quantification of their relative support, and the estimation of parameter posterior distributions under a given scenario. We also considered a set of scenarios often compared when making ABC inferences on the routes of introduction of invasive species to illustrate the interest of the new model checking option of DIYABC to assess model misfit. CONCLUSIONS Our new developments of the integrated software DIYABC should be particularly useful to make inference on complex evolutionary scenarios involving both recent and ancient historical events and using various types of molecular markers in diploid or haploid organisms. They offer a handy way for non-expert users to achieve model checking computation within an ABC framework, hence filling up a gap of ABC analysis. The software DIYABC V1.0 is freely available at http://www1.montpellier.inra.fr/CBGP/diyabc.
Collapse
Affiliation(s)
- Jean-Marie Cornuet
- INRA, UMR CBGP (INRA/IRD/Cirad/Montpellier SupAgro), Campus international de Baillarguet, CS 30016, F-34988 Montferrier-sur-Lez cedex, France
| | | | | |
Collapse
|
38
|
Bertorelle G, Benazzo A, Mona S. ABC as a flexible framework to estimate demography over space and time: some cons, many pros. Mol Ecol 2010; 19:2609-25. [PMID: 20561199 DOI: 10.1111/j.1365-294x.2010.04690.x] [Citation(s) in RCA: 296] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The analysis of genetic variation to estimate demographic and historical parameters and to quantitatively compare alternative scenarios recently gained a powerful and flexible approach: the Approximate Bayesian Computation (ABC). The likelihood functions does not need to be theoretically specified, but posterior distributions can be approximated by simulation even assuming very complex population models including both natural and human-induced processes. Prior information can be easily incorporated and the quality of the results can be analysed with rather limited additional effort. ABC is not a statistical analysis per se, but rather a statistical framework and any specific application is a sort of hybrid between a simulation and a data-analysis study. Complete software packages performing the necessary steps under a set of models and for specific genetic markers are already available, but the flexibility of the method is better exploited combining different programs. Many questions relevant in ecology can be addressed using ABC, but adequate amount of time should be dedicated to decide among alternative options and to evaluate the results. In this paper we will describe and critically comment on the different steps of an ABC analysis, analyse some of the published applications of ABC and provide user guidelines.
Collapse
Affiliation(s)
- G Bertorelle
- Department of Biology and Evolution, University of Ferrara, Via Borsari 46, 44100 Ferrara, Italy.
| | | | | |
Collapse
|