1
|
Cousins T, Scally A, Durbin R. A structured coalescent model reveals deep ancestral structure shared by all modern humans. Nat Genet 2025; 57:856-864. [PMID: 40102687 PMCID: PMC11985351 DOI: 10.1038/s41588-025-02117-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2024] [Accepted: 02/05/2025] [Indexed: 03/20/2025]
Abstract
Understanding the history of admixture events and population size changes leading to modern humans is central to human evolutionary genetics. Here we introduce a coalescence-based hidden Markov model, cobraa, that explicitly represents an ancestral population split and rejoin, and demonstrate its application on simulated and real data across multiple species. Using cobraa, we present evidence for an extended period of structure in the history of all modern humans, in which two ancestral populations that diverged ~1.5 million years ago came together in an admixture event ~300 thousand years ago, in a ratio of ~80:20%. Immediately after their divergence, we detect a strong bottleneck in the major ancestral population. We inferred regions of the present-day genome derived from each ancestral population, finding that material from the minority correlates strongly with distance to coding sequence, suggesting it was deleterious against the majority background. Moreover, we found a strong correlation between regions of majority ancestry and human-Neanderthal or human-Denisovan divergence, suggesting the majority population was also ancestral to those archaic humans.
Collapse
Affiliation(s)
- Trevor Cousins
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Aylwyn Scally
- Department of Genetics, University of Cambridge, Cambridge, UK
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, UK.
| |
Collapse
|
2
|
Kornai D, Jiao X, Ji J, Flouri T, Yang Z. Hierarchical Heuristic Species Delimitation Under the Multispecies Coalescent Model with Migration. Syst Biol 2024; 73:1015-1037. [PMID: 39180155 PMCID: PMC11637770 DOI: 10.1093/sysbio/syae050] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 08/12/2024] [Accepted: 08/20/2024] [Indexed: 08/26/2024] Open
Abstract
The multispecies coalescent (MSC) model accommodates genealogical fluctuations across the genome and provides a natural framework for comparative analysis of genomic sequence data from closely related species to infer the history of species divergence and gene flow. Given a set of populations, hypotheses of species delimitation (and species phylogeny) may be formulated as instances of MSC models (e.g., MSC for 1 species versus MSC for 2 species) and compared using Bayesian model selection. This approach, implemented in the program bpp, has been found to be prone to over-splitting. Alternatively, heuristic criteria based on population parameters (such as population split times, population sizes, and migration rates) estimated from genomic data may be used to delimit species. Here, we develop hierarchical merge and split algorithms for heuristic species delimitation based on the genealogical divergence index (gdi) and implement them in a Python pipeline called hhsd. We characterize the behavior of the gdi under a few simple scenarios of gene flow. We apply the new approaches to a dataset simulated under a model of isolation by distance as well as 3 empirical datasets. Our tests suggest that the new approaches produced sensible results and were less prone to oversplitting. We discuss possible strategies for accommodating paraphyletic species in the hierarchical algorithm, as well as the challenges of species delimitation based on heuristic criteria.
Collapse
Affiliation(s)
- Daniel Kornai
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| | - Xiyun Jiao
- Department of Statistics and Data Science, China Southern University of Science and Technology, Shenzhen, Guangdong 518055, China
| | - Jiayi Ji
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| | - Tomáš Flouri
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| | - Ziheng Yang
- Department of Genetics, Evolution, and Environment, University College London, Gower Street, London WC1E 6BT, UK
| |
Collapse
|
3
|
Ceccarelli FS, Sankey Alamilla LRA. Coastal dunes as drivers of genetic differentiation in the honeypot ant Myrmecocystus baja (Formicidae: Formicinae). THE SCIENCE OF NATURE - NATURWISSENSCHAFTEN 2024; 111:39. [PMID: 39008082 DOI: 10.1007/s00114-024-01925-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 07/04/2024] [Accepted: 07/08/2024] [Indexed: 07/16/2024]
Abstract
Coastal dunes are unique habitats, threatened by human activities. In biogeographical terms, coastal dunes are habitat islands, being discrete and distinct patches of similar habitat among themselves, separated from each other by a different type of habitat. Furthermore, coastal dunes harbor endemic species, adapted to living solely in the habitats found on specific dune systems. For example, the honeypot ant Myrmecocystus baja is endemic and restricted to coastal dunes of Mexico's Baja California Pacific coast. This ecological and biogeographical scenario led to the questions whether their geographical isolation is reflected in their genetic diversity and structuring, and how their demographic history is related with the formation of the dune system habitats. To answer these questions, population genetic, isolation-with-migration, and phylogeographical analyses were carried out, based on mitochondrial and five nuclear intronic markers. Minimal gene flow was detected only between two of the dune systems sampled; otherwise, the M. baja populations were found to be isolated and genetically structured, and their divergence generally pre-dated the modern-day dune systems. It is therefore highly likely that these ants were already present in paleodunes and that each of the populations was established from founder populations as the dunes formed. These findings highlight the importance of coastal dunes for species such as the honeypot ant from Baja California, in promoting genetic differentiation.
Collapse
Affiliation(s)
- F Sara Ceccarelli
- Departamento de Biología de La Conservación, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, B.C, México.
| | - Le Roy A Sankey Alamilla
- Departamento de Biología de La Conservación, Centro de Investigación Científica y de Educación Superior de Ensenada, Ensenada, B.C, México
| |
Collapse
|
4
|
Fan Z, Zhang R, Zhou A, Hey J, Song Y, Osada N, Hamada Y, Yue B, Xing J, Li J. Genomic Evidence for the Complex Evolutionary History of Macaques (Genus Macaca). J Mol Evol 2024; 92:286-299. [PMID: 38634872 DOI: 10.1007/s00239-024-10166-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 03/20/2024] [Indexed: 04/19/2024]
Abstract
The genus Macaca is widely distributed, occupies a variety of habitats, shows diverse phenotypic characteristics, and is one of the best-studied genera of nonhuman primates. Here, we reported five re-sequencing Macaca genomes, including one M. cyclopis, one M. fuscata, one M. thibetana, one M. silenus, and one M. sylvanus. Together with published genomes of other macaque species, we combined 20 genome sequences of 10 macaque species to investigate the gene introgression and genetic differences among the species. The network analysis of the SNV-fragment trees indicates a reticular phylogeny of macaque species. Combining the results from various analytical methods, we identified extensive ancient introgression events among macaque species. The multiple introgression signals between different species groups were also observed, such as between fascicularis group species and silenus group species. However, gene flow signals between fascicularis and sinica group were not as strong as those between fascicularis group and silenus group. On the other hand, the unidirect gene flow in M. arctoides probably occurred between the progenitor of M. arctoides and the common ancestor of fascicularis group. Our study also shows that the genetic backgrounds and genetic diversity of different macaques vary dramatically among species, even among populations of the same species. In conclusion, using whole genome sequences and multiple methods, we have studied the evolutionary history of the genus Macaca and provided evidence for extensive introgression among the species.
Collapse
Affiliation(s)
- Zhenxin Fan
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
| | - Rusong Zhang
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
| | - Anbo Zhou
- Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Jody Hey
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| | - Yang Song
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
| | - Naoki Osada
- Graduate School of Information Science and Technology, Hokkaido University, Sapporo, Hokkaido, 060-0814, Japan
| | - Yuzuru Hamada
- National Primate Research Center of Thailand, Chulalongkorn University, Bangkok, Thailand
| | - Bisong Yue
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China
| | - Jinchuan Xing
- Department of Genetics, Rutgers, the State University of New Jersey, Piscataway, NJ, 08854, USA
| | - Jing Li
- Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu, 610065, Sichuan, People's Republic of China.
| |
Collapse
|
5
|
Tran LN, Sun CK, Struck TJ, Sajan M, Gutenkunst RN. Computationally Efficient Demographic History Inference from Allele Frequencies with Supervised Machine Learning. Mol Biol Evol 2024; 41:msae077. [PMID: 38636507 PMCID: PMC11082913 DOI: 10.1093/molbev/msae077] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2023] [Revised: 04/08/2024] [Accepted: 04/12/2024] [Indexed: 04/20/2024] Open
Abstract
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite-likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we present donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future genomic data summarized by an AFS. We demonstrate that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
Collapse
Affiliation(s)
- Linh N Tran
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ 85721, USA
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Connie K Sun
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Travis J Struck
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Mathews Sajan
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Ryan N Gutenkunst
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ 85721, USA
| |
Collapse
|
6
|
Tran LN, Sun CK, Struck TJ, Sajan M, Gutenkunst RN. Computationally efficient demographic history inference from allele frequencies with supervised machine learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.24.542158. [PMID: 38405827 PMCID: PMC10888863 DOI: 10.1101/2023.05.24.542158] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Inferring past demographic history of natural populations from genomic data is of central concern in many studies across research fields. Previously, our group had developed dadi, a widely used demographic history inference method based on the allele frequency spectrum (AFS) and maximum composite likelihood optimization. However, dadi's optimization procedure can be computationally expensive. Here, we developed donni (demography optimization via neural network inference), a new inference method based on dadi that is more efficient while maintaining comparable inference accuracy. For each dadi-supported demographic model, donni simulates the expected AFS for a range of model parameters then trains a set of Mean Variance Estimation neural networks using the simulated AFS. Trained networks can then be used to instantaneously infer the model parameters from future input data AFS. We demonstrated that for many demographic models, donni can infer some parameters, such as population size changes, very well and other parameters, such as migration rates and times of demographic events, fairly well. Importantly, donni provides both parameter and confidence interval estimates from input AFS with accuracy comparable to parameters inferred by dadi's likelihood optimization while bypassing its long and computationally intensive evaluation process. donni's performance demonstrates that supervised machine learning algorithms may be a promising avenue for developing more sustainable and computationally efficient demographic history inference methods.
Collapse
Affiliation(s)
- Linh N. Tran
- Genetics Graduate Interdisciplinary Program, University of Arizona, Tucson, AZ, USA
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Connie K. Sun
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Travis J. Struck
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Mathews Sajan
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| | - Ryan N. Gutenkunst
- Department of Molecular & Cellular Biology, University of Arizona, Tucson, AZ, USA
| |
Collapse
|
7
|
Tiley GP, Flouri T, Jiao X, Poelstra JW, Xu B, Zhu T, Rannala B, Yoder AD, Yang Z. Estimation of species divergence times in presence of cross-species gene flow. Syst Biol 2023; 72:820-836. [PMID: 36961245 PMCID: PMC10405360 DOI: 10.1093/sysbio/syad015] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2021] [Accepted: 03/22/2023] [Indexed: 03/25/2023] Open
Abstract
Cross-species introgression can have significant impacts on phylogenomic reconstruction of species divergence events. Here, we used simulations to show how the presence of even a small amount of introgression can bias divergence time estimates when gene flow is ignored in the analysis. Using advances in analytical methods under the multispecies coalescent (MSC) model, we demonstrate that by accounting for incomplete lineage sorting and introgression using large phylogenomic data sets this problem can be avoided. The multispecies-coalescent-with-introgression (MSci) model is capable of accurately estimating both divergence times and ancestral effective population sizes, even when only a single diploid individual per species is sampled. We characterize some general expectations for biases in divergence time estimation under three different scenarios: 1) introgression between sister species, 2) introgression between non-sister species, and 3) introgression from an unsampled (i.e., ghost) outgroup lineage. We also conducted simulations under the isolation-with-migration (IM) model and found that the MSci model assuming episodic gene flow was able to accurately estimate species divergence times despite high levels of continuous gene flow. We estimated divergence times under the MSC and MSci models from two published empirical datasets with previous evidence of introgression, one of 372 target-enrichment loci from baobabs (Adansonia), and another of 1000 transcriptome loci from 14 species of the tomato relative, Jaltomata. The empirical analyses not only confirm our findings from simulations, demonstrating that the MSci model can reliably estimate divergence times but also show that divergence time estimation under the MSC can be robust to the presence of small amounts of introgression in empirical datasets with extensive taxon sampling. [divergence time; gene flow; hybridization; introgression; MSci model; multispecies coalescent].
Collapse
Affiliation(s)
| | - Tomáš Flouri
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Xiyun Jiao
- Department of Genetics, Evolution and Environment, University College London, London, UK
- Department of Statistics and Data Science, China Southern University of Science and Technology, Shenzhen, Guangdong, China
| | | | - Bo Xu
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
| | - Tianqi Zhu
- National Center for Mathematics and Interdisciplinary Sciences, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, China
- Key Laboratory of Random Complex Structures and Data Science, Academy of Mathematics and Systems Science, Chinese Academy of Sciences, China
| | - Bruce Rannala
- Department of Evolution and Ecology, University of California, Davis, Davis, CA, USA
| | - Anne D Yoder
- Department of Biology, Duke University, Durham, NC, USA
| | - Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, London, UK
| |
Collapse
|
8
|
Ragsdale AP, Thornton KR. Multiple Sources of Uncertainty Confound Inference of Historical Human Generation Times. Mol Biol Evol 2023; 40:msad160. [PMID: 37450583 PMCID: PMC10404577 DOI: 10.1093/molbev/msad160] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 07/05/2023] [Accepted: 07/07/2023] [Indexed: 07/18/2023] Open
Abstract
Wang et al. (2023) recently proposed an approach to infer the history of human generation intervals from changes in mutation profiles over time. As the relative proportions of different mutation types depend on the ages of parents, binning variants by the time they arose allows for the inference of changes in average paternal and maternal generation intervals. Applying this approach to published allele age estimates, Wang et al. (2023) inferred long-lasting sex differences in average generation times and surprisingly found that ancestral generation times of West African populations remained substantially higher than those of Eurasian populations extending tens of thousands of generations into the past. Here, we argue that the results and interpretations in Wang et al. (2023) are primarily driven by noise and biases in input data and a lack of validation using independent approaches for estimating allele ages. With the recent development of methods to reconstruct genome-wide gene genealogies, coalescence times, and allele ages, we caution that downstream analyses may be strongly influenced by uncharacterized biases in their output.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | - Kevin R Thornton
- Department of Ecology and Evolutionary Biology, University of California, Irvine, CA, USA
| |
Collapse
|
9
|
Ji J, Jackson DJ, Leaché AD, Yang Z. Power of Bayesian and Heuristic Tests to Detect Cross-Species Introgression with Reference to Gene Flow in the Tamias quadrivittatus Group of North American Chipmunks. Syst Biol 2023; 72:446-465. [PMID: 36504374 PMCID: PMC10275556 DOI: 10.1093/sysbio/syac077] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 11/15/2022] [Accepted: 12/01/2022] [Indexed: 10/25/2023] Open
Abstract
In the past two decades, genomic data have been widely used to detect historical gene flow between species in a variety of plants and animals. The Tamias quadrivittatus group of North America chipmunks, which originated through a series of rapid speciation events, are known to undergo massive amounts of mitochondrial introgression. Yet in a recent analysis of targeted nuclear loci from the group, no evidence for cross-species introgression was detected, indicating widespread cytonuclear discordance. The study used the heuristic method HYDE to detect gene flow, which may suffer from low power. Here we use the Bayesian method implemented in the program BPP to re-analyze these data. We develop a Bayesian test of introgression, calculating the Bayes factor via the Savage-Dickey density ratio using the Markov chain Monte Carlo (MCMC) sample under the model of introgression. We take a stepwise approach to constructing an introgression model by adding introgression events onto a well-supported binary species tree. The analysis detected robust evidence for multiple ancient introgression events affecting the nuclear genome, with introgression probabilities reaching 63%. We estimate population parameters and highlight the fact that species divergence times may be seriously underestimated if ancient cross-species gene flow is ignored in the analysis. We examine the assumptions and performance of HYDE and demonstrate that it lacks power if gene flow occurs between sister lineages or if the mode of gene flow does not match the assumed hybrid-speciation model with symmetrical population sizes. Our analyses highlight the power of likelihood-based inference of cross-species gene flow using genomic sequence data. [Bayesian test; BPP; chipmunks; introgression; MSci; multispecies coalescent; Savage-Dickey density ratio.].
Collapse
Affiliation(s)
- Jiayi Ji
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Donavan J Jackson
- Department of Biology and Burke Museum of Natural History and Culture, University of Washington, Box 351800, Seattle, WA 98195-1800, USA
| | - Adam D Leaché
- Department of Biology and Burke Museum of Natural History and Culture, University of Washington, Box 351800, Seattle, WA 98195-1800, USA
| | - Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| |
Collapse
|
10
|
Chung Y. Assessing the impact of recombination on the estimation of isolation-with-migration models using genomic data: a simulation study. Genomics Inform 2023; 21:e27. [PMID: 37415456 PMCID: PMC10326538 DOI: 10.5808/gi.23016] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 05/20/2023] [Accepted: 05/22/2023] [Indexed: 07/08/2023] Open
Abstract
Recombination events complicate the evolutionary history of populations and species and have a significant impact on the inference of isolation-with-migration (IM) models. However, several existing methods have been developed, assuming no recombination within a locus and free recombination between loci. In this study, we investigated the effect of recombination on the estimation of IM models using genomic data. We conducted a simulation study to evaluate the consistency of the parameter estimators with up to 1,000 loci and analyze true gene trees to examine the sources of errors in estimating the IM model parameters. The results showed that the presence of recombination led to biased estimates of the IM model parameters, with population sizes being more overestimated and migration rates being more underestimated as the number of loci increased. The magnitude of the biases tended to increase with the recombination rates when using 100 or more loci. On the other hand, the estimation of splitting times remained consistent as the number of loci increased. In the absence of recombination, the estimators of the IM model parameters remained consistent.
Collapse
Affiliation(s)
- Yujin Chung
- Department of Applied Statistics, Kyonggi University, Suwon 16227, Korea
| |
Collapse
|
11
|
Cao Y, Almeida-Silva F, Zhang WP, Ding YM, Bai D, Bai WN, Zhang BW, Van de Peer Y, Zhang DY. Genomic Insights into Adaptation to Karst Limestone and Incipient Speciation in East Asian Platycarya spp. (Juglandaceae). Mol Biol Evol 2023; 40:msad121. [PMID: 37216901 PMCID: PMC10257982 DOI: 10.1093/molbev/msad121] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2023] [Revised: 04/06/2023] [Accepted: 05/16/2023] [Indexed: 05/24/2023] Open
Abstract
When challenged by similar environmental conditions, phylogenetically distant taxa often independently evolve similar traits (convergent evolution). Meanwhile, adaptation to extreme habitats might lead to divergence between taxa that are otherwise closely related. These processes have long existed in the conceptual sphere, yet molecular evidence, especially for woody perennials, is scarce. The karst endemic Platycarya longipes and its only congeneric species, Platycarya strobilacea, which is widely distributed in the mountains in East Asia, provide an ideal model for examining the molecular basis of both convergent evolution and speciation. Using chromosome-level genome assemblies of both species, and whole-genome resequencing data from 207 individuals spanning their entire distribution range, we demonstrate that P. longipes and P. strobilacea form two species-specific clades, which diverged around 2.09 million years ago. We find an excess of genomic regions exhibiting extreme interspecific differentiation, potentially due to long-term selection in P. longipes, likely contributing to the incipient speciation of the genus Platycarya. Interestingly, our results unveil underlying karst adaptation in both copies of the calcium influx channel gene TPC1 in P. longipes. TPC1 has previously been identified as a selective target in certain karst-endemic herbs, indicating a convergent adaptation to high calcium stress among karst-endemic species. Our study reveals the genic convergence of TPC1 among karst endemics and the driving forces underneath the incipient speciation of the two Platycarya lineages.
Collapse
Affiliation(s)
- Yu Cao
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Fabricio Almeida-Silva
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
| | - Wei-Ping Zhang
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Ya-Mei Ding
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Dan Bai
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Wei-Ning Bai
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Bo-Wen Zhang
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| | - Yves Van de Peer
- Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium
- Center for Plant Systems Biology, VIB, Ghent, Belgium
- Center for Microbial Ecology and Genomics, Department of Biochemistry, Genetics and Microbiology, University of Pretoria, Pretoria, South Africa
- College of Horticulture, Nanjing Agricultural University, Nanjing, China
| | - Da-Yong Zhang
- State Key Laboratory of Earth Surface Process and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, China
| |
Collapse
|
12
|
Sakaba T, Soejima A, Fujii S, Ikeda H, Iwasaki T, Saito H, Suyama Y, Matsuo A, Kozhevnikov AE, Kozhevnikova ZV, Wang H, Wang S, Pak JH, Fujii N. Phylogeography of the temperate grassland plant Tephroseris kirilowii (Asteraceae) inferred from multiplexed inter-simple sequence repeat genotyping by sequencing (MIG-seq) data. JOURNAL OF PLANT RESEARCH 2023:10.1007/s10265-023-01452-w. [PMID: 37148377 DOI: 10.1007/s10265-023-01452-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 03/12/2023] [Indexed: 05/08/2023]
Abstract
A group of temperate grassland plant species termed the "Mansen elements" occurs in Japan and is widely distributed in the grasslands of continental East Asia. It has been hypothesized that these species are continental grassland relicts in Japan that stretch back to a colder age, but their migration history has not been elucidated. To assess the migration history of the Mansen elements, we performed phylogeographic analyses of Tephroseris kirilowii, a member of this group, using single-nucleotide polymorphisms (SNPs) obtained from multiplexed inter-simple sequence repeat genotyping by sequencing (MIG-seq). It was estimated that the Japanese populations of T. kirilowii were divided from those of continental East Asia at 25.2 thousand years ago (ka) with 95% highest probability density interval (HPD) of 15.3-40.0 ka and that Japanese clades first diverged at 20.2 ka with 95% HPD of 10.4-30.1 ka. As the climatically suitable range during the last glacial maximum (LGM) estimated using ecological niche modeling (ENM) was limited in Japan and there was a slight genetic differentiation among Japanese populations, a post-glacial expansion of T. kirilowii in the Japanese Archipelago was indicated.
Collapse
Affiliation(s)
- Tomoka Sakaba
- Graduate School of Science and Technology, Kumamoto University, Kurokami 2-39-1, Chuo-Ku, Kumamoto, 860-8555, Japan
| | - Akiko Soejima
- Department of Biological Science, Faculty of Advanced Science and Technology, Kumamoto University, Kurokami 2-39-1, Chuo-Ku, Kumamoto, 860-8555, Japan.
| | - Shinji Fujii
- Department of Field Ecology, University of Human Environments, 6-2 Kamisanbonmatsu, Okazaki, 444-3505, Japan
| | - Hajime Ikeda
- Institute of Plant Science and Resources, Okayama University, 2-20-1 Chuo, Kurashiki, Okayama, 710-0046, Japan
| | - Takaya Iwasaki
- Natural Science Division, Faculty of Core Research, Ochanomizu University, 2-1-1 Otsuka, Bunkyo-Ku, Tokyo, 112-8610, Japan
| | - Hiroaki Saito
- Graduate School of Science and Technology, Kumamoto University, Kurokami 2-39-1, Chuo-Ku, Kumamoto, 860-8555, Japan
| | - Yoshihisa Suyama
- Graduate School of Agricultural Science, Kawatabi Field Science Center, Tohoku University, 232-3 Yomogida, Naruko-Onsen, Osaki, Miyagi, 989-6711, Japan
| | - Ayumi Matsuo
- Graduate School of Agricultural Science, Kawatabi Field Science Center, Tohoku University, 232-3 Yomogida, Naruko-Onsen, Osaki, Miyagi, 989-6711, Japan
| | - Andrey E Kozhevnikov
- Federal Scientific Center of the East Asia Terrestrial Biodiversity, Far Eastern Branch of the Russian Academy of Sciences, Vladivostok, Russia
| | - Zoya V Kozhevnikova
- Federal Scientific Center of the East Asia Terrestrial Biodiversity, Far Eastern Branch of the Russian Academy of Sciences, Vladivostok, Russia
| | - Hongfeng Wang
- Department of Forestry, Northeast Forestry University, 26 Hexing Road, Xiangfang District, Harbin, 150040, China
| | - Siqi Wang
- Department of Forestry, Northeast Forestry University, 26 Hexing Road, Xiangfang District, Harbin, 150040, China
| | - Jae-Hong Pak
- East Coast Research Institute of Life Science, Gangneung-wonju National University, Jukheon-gil 7, Gangneung city, Gangwon-do, 25457, South Korea
| | - Noriyuki Fujii
- Department of Biological Science, Faculty of Advanced Science and Technology, Kumamoto University, Kurokami 2-39-1, Chuo-Ku, Kumamoto, 860-8555, Japan
| |
Collapse
|
13
|
Ragsdale AP, Weaver TD, Atkinson EG, Hoal EG, Möller M, Henn BM, Gravel S. A weakly structured stem for human origins in Africa. Nature 2023; 617:755-763. [PMID: 37198480 PMCID: PMC10208968 DOI: 10.1038/s41586-023-06055-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Accepted: 04/05/2023] [Indexed: 05/19/2023]
Abstract
Despite broad agreement that Homo sapiens originated in Africa, considerable uncertainty surrounds specific models of divergence and migration across the continent1. Progress is hampered by a shortage of fossil and genomic data, as well as variability in previous estimates of divergence times1. Here we seek to discriminate among such models by considering linkage disequilibrium and diversity-based statistics, optimized for rapid, complex demographic inference2. We infer detailed demographic models for populations across Africa, including eastern and western representatives, and newly sequenced whole genomes from 44 Nama (Khoe-San) individuals from southern Africa. We infer a reticulated African population history in which present-day population structure dates back to Marine Isotope Stage 5. The earliest population divergence among contemporary populations occurred 120,000 to 135,000 years ago and was preceded by links between two or more weakly differentiated ancestral Homo populations connected by gene flow over hundreds of thousands of years. Such weakly structured stem models explain patterns of polymorphism that had previously been attributed to contributions from archaic hominins in Africa2-7. In contrast to models with archaic introgression, we predict that fossil remains from coexisting ancestral populations should be genetically and morphologically similar, and that only an inferred 1-4% of genetic differentiation among contemporary human populations can be attributed to genetic drift between stem populations. We show that model misspecification explains the variation in previous estimates of divergence times, and argue that studying a range of models is key to making robust inferences about deep history.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Integrative Biology, University of Wisconsin-Madison, Madison, WI, USA
| | - Timothy D Weaver
- Department of Anthropology, University of California, Davis, CA, USA
| | - Elizabeth G Atkinson
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Eileen G Hoal
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Marlo Möller
- DSI-NRF Centre of Excellence for Biomedical Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- South African Medical Research Council Centre for Tuberculosis Research, Stellenbosch University, Cape Town, South Africa
- Division of Molecular Biology and Human Genetics, Faculty of Medicine and Health Sciences, Stellenbosch University, Cape Town, South Africa
| | - Brenna M Henn
- Department of Anthropology, University of California, Davis, CA, USA.
- Genome Center, University of California, Davis, CA, USA.
| | - Simon Gravel
- Department of Human Genetics, McGill University, Montreal, Quebec, Canada.
| |
Collapse
|
14
|
Pfennig A, Petersen LN, Kachambwa P, Lachance J. Evolutionary Genetics and Admixture in African Populations. Genome Biol Evol 2023; 15:evad054. [PMID: 36987563 PMCID: PMC10118306 DOI: 10.1093/gbe/evad054] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Revised: 03/15/2023] [Accepted: 03/22/2023] [Indexed: 03/30/2023] Open
Abstract
As the ancestral homeland of our species, Africa contains elevated levels of genetic diversity and substantial population structure. Importantly, African genomes are heterogeneous: They contain mixtures of multiple ancestries, each of which have experienced different evolutionary histories. In this review, we view population genetics through the lens of admixture, highlighting how multiple demographic events have shaped African genomes. Each of these historical vignettes paints a recurring picture of population divergence followed by secondary contact. First, we give a brief overview of genetic variation in Africa and examine deep population structure within Africa, including the evidence of ancient introgression from archaic "ghost" populations. Second, we describe the genetic legacies of admixture events that have occurred during the past 10,000 years. This includes gene flow between different click-speaking Khoe-San populations, the stepwise spread of pastoralism from eastern to southern Africa, multiple migrations of Bantu speakers across the continent, as well as admixture from the Middle East and Europe into the Sahel region and North Africa. Furthermore, the genomic signatures of more recent admixture can be found in the Cape Peninsula and throughout the African diaspora. Third, we highlight how natural selection has shaped patterns of genetic variation across the continent, noting that gene flow provides a potent source of adaptive variation and that selective pressures vary across Africa. Finally, we explore the biomedical implications of population structure in Africa on health and disease and call for more ethically conducted studies of genetic variation in Africa.
Collapse
Affiliation(s)
- Aaron Pfennig
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| | | | | | - Joseph Lachance
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia
| |
Collapse
|
15
|
Huang J, Thawornwattana Y, Flouri T, Mallet J, Yang Z. Inference of Gene Flow between Species under Misspecified Models. Mol Biol Evol 2022; 39:6783212. [PMID: 36317198 PMCID: PMC9729068 DOI: 10.1093/molbev/msac237] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
Abstract
Genomic sequence data provide a rich source of information about the history of species divergence and interspecific hybridization or introgression. Despite recent advances in genomics and statistical methods, it remains challenging to infer gene flow, and as a result, one may have to estimate introgression rates and times under misspecified models. Here we use mathematical analysis and computer simulation to examine estimation bias and issues of interpretation when the model of gene flow is misspecified in analysis of genomic datasets, for example, if introgression is assigned to the wrong lineages. In the case of two species, we establish a correspondence between the migration rate in the continuous migration model and the introgression probability in the introgression model. When gene flow occurs continuously through time but in the analysis is assumed to occur at a fixed time point, common evolutionary parameters such as species divergence times are surprisingly well estimated. However, the time of introgression tends to be estimated towards the recent end of the period of continuous gene flow. When introgression events are assigned incorrectly to the parental or daughter lineages, introgression times tend to collapse onto species divergence times, with introgression probabilities underestimated. Overall, our analyses suggest that the simple introgression model is useful for extracting information concerning between-specific gene flow and divergence even when the model may be misspecified. However, for reliable inference of gene flow it is important to include multiple samples per species, in particular, from hybridizing species.
Collapse
Affiliation(s)
| | | | - Tomáš Flouri
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, United Kingdom
| | - James Mallet
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138
| | | |
Collapse
|
16
|
Molo MS, White JB, Cornish V, Gell RM, Baars O, Singh R, Carbone MA, Isakeit T, Wise KA, Woloshuk CP, Bluhm BH, Horn BW, Heiniger RW, Carbone I. Asymmetrical lineage introgression and recombination in populations of Aspergillus flavus: Implications for biological control. PLoS One 2022; 17:e0276556. [PMID: 36301851 PMCID: PMC9620740 DOI: 10.1371/journal.pone.0276556] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2022] [Accepted: 10/08/2022] [Indexed: 11/23/2022] Open
Abstract
Aspergillus flavus is an agriculturally important fungus that causes ear rot of maize and produces aflatoxins, of which B1 is the most carcinogenic naturally-produced compound. In the US, the management of aflatoxins includes the deployment of biological control agents that comprise two nonaflatoxigenic A. flavus strains, either Afla-Guard (member of lineage IB) or AF36 (lineage IC). We used genotyping-by-sequencing to examine the influence of both biocontrol agents on native populations of A. flavus in cornfields in Texas, North Carolina, Arkansas, and Indiana. This study examined up to 27,529 single-nucleotide polymorphisms (SNPs) in a total of 815 A. flavus isolates, and 353 genome-wide haplotypes sampled before biocontrol application, three months after biocontrol application, and up to three years after initial application. Here, we report that the two distinct A. flavus evolutionary lineages IB and IC differ significantly in their frequency distributions across states. We provide evidence of increased unidirectional gene flow from lineage IB into IC, inferred to be due to the applied Afla-Guard biocontrol strain. Genetic exchange and recombination of biocontrol strains with native strains was detected in as little as three months after biocontrol application and up to one and three years later. There was limited inter-lineage migration in the untreated fields. These findings suggest that biocontrol products that include strains from lineage IB offer the greatest potential for sustained reductions in aflatoxin levels over several years. This knowledge has important implications for developing new biocontrol strategies.
Collapse
Affiliation(s)
- Megan S. Molo
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
| | - James B. White
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
| | - Vicki Cornish
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
| | - Richard M. Gell
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
- Program of Genetics, North Carolina State University, Raleigh, North
Carolina, United States of America
| | - Oliver Baars
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
| | - Rakhi Singh
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
| | - Mary Anna Carbone
- Center for Integrated Fungal Research and Department of Plant and
Microbial Biology, North Carolina State University, Raleigh, NC, United States
of America
| | - Thomas Isakeit
- Department of Plant Pathology and Microbiology, Texas AgriLife Extension
Service, Texas A&M University, College Station, TX, United States of
America
| | - Kiersten A. Wise
- Department of Plant Pathology, University of Kentucky, Princeton, KY,
United States of America
| | - Charles P. Woloshuk
- Department of Plant Pathology and Botany, Purdue University, West
Lafayette, IN, United States of America
| | - Burton H. Bluhm
- University of Arkansas Division of Agriculture, Department of Entomology
and Plant Pathology, Fayetteville, AR, United States of
America
| | - Bruce W. Horn
- United States Department of Agriculture, Agriculture Research Service,
Dawson, GA, United States of America
| | - Ron W. Heiniger
- Department of Crop and Soil Sciences, North Carolina State University,
Raleigh, NC, United States of America
| | - Ignazio Carbone
- Department of Entomology and Plant Pathology, Center for Integrated
Fungal Research, North Carolina State University, Raleigh, NC, United States of
America
- Program of Genetics, North Carolina State University, Raleigh, North
Carolina, United States of America
- * E-mail:
| |
Collapse
|
17
|
Roycroft E, Moritz C, Rowe KC, Moussalli A, Eldridge MDB, Portela Miguez R, Piggott MP, Potter S. Sequence Capture From Historical Museum Specimens: Maximizing Value for Population and Phylogenomic Studies. Front Ecol Evol 2022. [DOI: 10.3389/fevo.2022.931644] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
The application of high-throughput, short-read sequencing to degraded DNA has greatly increased the feasibility of generating genomic data from historical museum specimens. While many published studies report successful sequencing results from historical specimens; in reality, success and quality of sequence data can be highly variable. To examine predictors of sequencing quality, and methodological approaches to improving data accuracy, we generated and analyzed genomic sequence data from 115 historically collected museum specimens up to 180 years old. Data span both population genomic and phylogenomic scales, including historically collected specimens from 34 specimens of four species of Australian rock-wallabies (genus Petrogale) and 92 samples from 79 specimens of Australo-Papuan murine rodents (subfamily Murinae). For historical rodent specimens, where the focus was sampling for phylogenomics, we found that regardless of specimen age, DNA sequence libraries prepared from toe pad or bone subsamples performed significantly better than those taken from the skin (in terms of proportion of reads on target, number of loci captured, and data accuracy). In total, 93% of DNA libraries from toe pad or bone subsamples resulted in reliable data for phylogenetic inference, compared to 63% of skin subsamples. For skin subsamples, proportion of reads on target weakly correlated with collection year. Then using population genomic data from rock-wallaby skins as a test case, we found substantial improvement in final data quality by mapping to a high-quality “closest sister” de novo assembly from fresh tissues, compared to mapping to a sample-specific historical de novo assembly. Choice of mapping approach also affected final estimates of the number of segregating sites and Watterson's θ, both important parameters for population genomic inference. The incorporation of accurate and reliable sequence data from historical specimens has important outcomes for evolutionary studies at both population and phylogenomic scales. By assessing the outcomes of different approaches to specimen subsampling, library preparation and bioinformatic processing, our results provide a framework for increasing sequencing success for irreplaceable historical specimens.
Collapse
|
18
|
Ding YM, Cao Y, Zhang WP, Chen J, Liu J, Li P, Renner SS, Zhang DY, Bai WN. Population-genomic analyses reveal bottlenecks and asymmetric introgression from Persian into iron walnut during domestication. Genome Biol 2022; 23:145. [PMID: 35787713 PMCID: PMC9254524 DOI: 10.1186/s13059-022-02720-z] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 06/25/2022] [Indexed: 12/05/2022] Open
Abstract
Background Persian walnut, Juglans regia, occurs naturally from Greece to western China, while its closest relative, the iron walnut, Juglans sigillata, is endemic in southwest China; both species are cultivated for their nuts and wood. Here, we infer their demographic histories and the time and direction of possible hybridization and introgression between them. Results We use whole-genome resequencing data, different population-genetic approaches (PSMC and GONE), and isolation-with-migration models (IMa3) on individuals from Europe, Iran, Kazakhstan, Pakistan, and China. IMa3 analyses indicate that the two species diverged from each other by 0.85 million years ago, with unidirectional gene flow from eastern J. regia and its ancestor into J. sigillata, including the shell-thickness gene. Within J. regia, a western group, located from Europe to Iran, and an eastern group with individuals from northern China, experienced dramatically declining population sizes about 80 generations ago (roughly 2400 to 4000 years), followed by an expansion at about 40 generations, while J. sigillata had a constant population size from about 100 to 20 generations ago, followed by a rapid decline. Conclusions Both J. regia and J. sigillata appear to have suffered sudden population declines during their domestication, suggesting that the bottleneck scenario of plant domestication may well apply in at least some perennial crop species. Introgression from introduced J. regia appears to have played a role in the domestication of J. sigillata. Supplementary Information The online version contains supplementary material available at 10.1186/s13059-022-02720-z.
Collapse
Affiliation(s)
- Ya-Mei Ding
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Yu Cao
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Wei-Ping Zhang
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Jun Chen
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.,China National Botanical Garden, Beijing, 100093, China
| | - Jie Liu
- CAS Key Laboratory for Plant Diversity and Biogeography of East Asia, Kunming Institute of Botany, Chinese Academy of Sciences, Kunming, 650201, Yunnan, China
| | - Pan Li
- The Key Laboratory of Conservation Biology for Endangered Wildlife of the Ministry of Education, College of Life Sciences, Zhejiang University, Hangzhou, 310058, China
| | - Susanne S Renner
- Department of Biology, Washington University, Saint Louis, MO, 63130, USA.
| | - Da-Yong Zhang
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| | - Wei-Ning Bai
- State Key Laboratory of Earth Surface Processes and Resource Ecology and Ministry of Education Key Laboratory for Biodiversity Science and Ecological Engineering, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
19
|
Buck R, Flores-Rentería L. The Syngameon Enigma. PLANTS (BASEL, SWITZERLAND) 2022; 11:895. [PMID: 35406874 PMCID: PMC9002738 DOI: 10.3390/plants11070895] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Revised: 03/21/2022] [Accepted: 03/25/2022] [Indexed: 05/17/2023]
Abstract
Despite their evolutionary relevance, multispecies networks or syngameons are rarely reported in the literature. Discovering how syngameons form and how they are maintained can give insight into processes such as adaptive radiations, island colonizations, and the creation of new hybrid lineages. Understanding these complex hybridization networks is even more pressing with anthropogenic climate change, as syngameons may have unique synergistic properties that will allow participating species to persist. The formation of a syngameon is not insurmountable, as several ways for a syngameon to form have been proposed, depending mostly on the magnitude and frequency of gene flow events, as well as the relatedness of its participants. Episodic hybridization with small amounts of introgression may keep syngameons stable and protect their participants from any detrimental effects of gene flow. As genomic sequencing becomes cheaper and more species are included in studies, the number of known syngameons is expected to increase. Syngameons must be considered in conservation efforts as the extinction of one participating species may have detrimental effects on the survival of all other species in the network.
Collapse
Affiliation(s)
- Ryan Buck
- Department of Biology, San Diego State University, San Diego, CA 92182, USA;
| | | |
Collapse
|
20
|
Balmori-de la Puente A, Ventura J, Miñarro M, Somoano A, Hey J, Castresana J. Divergence time estimation using ddRAD data and an isolation-with-migration model applied to water vole populations of Arvicola. Sci Rep 2022; 12:4065. [PMID: 35260719 PMCID: PMC8904462 DOI: 10.1038/s41598-022-07877-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2021] [Accepted: 02/21/2022] [Indexed: 01/18/2023] Open
Abstract
Molecular dating methods of population splits are crucial in evolutionary biology, but they present important difficulties due to the complexity of the genealogical relationships of genes and past migrations between populations. Using the double digest restriction-site associated DNA (ddRAD) technique and an isolation-with-migration (IM) model, we studied the evolutionary history of water vole populations of the genus Arvicola, a group of complex evolution with fossorial and semi-aquatic ecotypes. To do this, we first estimated mutation rates of ddRAD loci using a phylogenetic approach. An IM model was then used to estimate split times and other relevant demographic parameters. A set of 300 ddRAD loci that included 85 calibrated loci resulted in good mixing and model convergence. The results showed that the two populations of A. scherman present in the Iberian Peninsula split 34 thousand years ago, during the last glaciation. In addition, the much greater divergence from its sister species, A. amphibius, may help to clarify the controversial taxonomy of the genus. We conclude that this approach, based on ddRAD data and an IM model, is highly useful for analyzing the origin of populations and species.
Collapse
Affiliation(s)
- Alfonso Balmori-de la Puente
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37, 08003, Barcelona, Spain
| | - Jacint Ventura
- Departament de Biologia Animal, de Biologia Vegetal i d'Ecologia, Facultat de Biociències, Universitat Autònoma de Barcelona, 08193, Cerdanyola del Vallès, Barcelona, Spain.,Àrea de Recerca en Petits Mamífers, Granollers Museum of Natural Sciences, Palaudàries, 102, 08402, Granollers, Barcelona, Spain
| | - Marcos Miñarro
- Servicio Regional de Investigación y Desarrollo Agroalimentario (SERIDA), Ctra AS-267, PK 19, 33300, Villaviciosa, Asturias, Spain
| | - Aitor Somoano
- Servicio Regional de Investigación y Desarrollo Agroalimentario (SERIDA), Ctra AS-267, PK 19, 33300, Villaviciosa, Asturias, Spain
| | - Jody Hey
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Jose Castresana
- Institute of Evolutionary Biology (CSIC-Universitat Pompeu Fabra), Passeig Marítim de la Barceloneta 37, 08003, Barcelona, Spain.
| |
Collapse
|
21
|
Jiao X, Flouri T, Yang Z. Multispecies coalescent and its applications to infer species phylogenies and cross-species gene flow. Natl Sci Rev 2022; 8:nwab127. [PMID: 34987842 PMCID: PMC8692950 DOI: 10.1093/nsr/nwab127] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2021] [Revised: 07/10/2021] [Accepted: 07/11/2021] [Indexed: 02/06/2023] Open
Abstract
Multispecies coalescent (MSC) is the extension of the single-population coalescent model to multiple species. It integrates the phylogenetic process of species divergences and the population genetic process of coalescent, and provides a powerful framework for a number of inference problems using genomic sequence data from multiple species, including estimation of species divergence times and population sizes, estimation of species trees accommodating discordant gene trees, inference of cross-species gene flow and species delimitation. In this review, we introduce the major features of the MSC model, discuss full-likelihood and heuristic methods of species tree estimation and summarize recent methodological advances in inference of cross-species gene flow. We discuss the statistical and computational challenges in the field and research directions where breakthroughs may be likely in the next few years.
Collapse
Affiliation(s)
- Xiyun Jiao
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Tomáš Flouri
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| | - Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
| |
Collapse
|
22
|
Phylogeography of the Rough Greensnake, Opheodrys aestivus (Squamata: Colubridae), Using Multilocus Sanger Sequence and Genomic ddRADseq Data. J HERPETOL 2021. [DOI: 10.1670/20-040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
23
|
Larsson DJ, Pan D, Schneeweiss GM. Addressing alpine plant phylogeography using integrative distributional, demographic and coalescent modeling. ALPINE BOTANY 2021; 132:5-19. [PMID: 35368907 PMCID: PMC8933363 DOI: 10.1007/s00035-021-00263-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2021] [Accepted: 07/05/2021] [Indexed: 06/14/2023]
Abstract
Phylogeographic studies of alpine plants have evolved considerably in the last two decades from ad hoc interpretations of genetic data to statistical model-based approaches. In this review we outline the developments in alpine plant phylogeography focusing on the recent approach of integrative distributional, demographic and coalescent (iDDC) modeling. By integrating distributional data with spatially explicit demographic modeling and subsequent coalescent simulations, the history of alpine species can be inferred and long-standing hypotheses, such as species-specific responses to climate change or survival on nunataks during the last glacial maximum, can be efficiently tested as exemplified by available case studies. We also discuss future prospects and improvements of iDDC.
Collapse
Affiliation(s)
- Dennis J. Larsson
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| | - Da Pan
- Jiangsu Key Laboratory for Biodiversity and Biotechnology, College of Life Sciences, Nanjing Normal University, Nanjing, China
| | - Gerald M. Schneeweiss
- Department of Botany and Biodiversity Research, University of Vienna, Vienna, Austria
| |
Collapse
|
24
|
Olsson S, Lorenzo Z, Zabal-Aguirre M, Piotti A, Vendramin GG, González-Martínez SC, Grivet D. Evolutionary history of the mediterranean Pinus halepensis-brutia species complex using gene-resequencing and transcriptomic approaches. PLANT MOLECULAR BIOLOGY 2021; 106:367-380. [PMID: 33934278 DOI: 10.1007/s11103-021-01155-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2020] [Accepted: 04/22/2021] [Indexed: 06/12/2023]
Abstract
Complementary gene-resequencing and transcriptomic approaches reveal contrasted evolutionary histories in a species complex. Pinus halepensis and Pinus brutia are closely related species that can intercross, but occupy different geographical ranges and bioclimates. To study the evolution of this species complex and to provide genomic resources for further research, we produce and analyze two new complementary sets of genetic resources: (i) a set of 172 re-sequenced genomic target loci analyzed in 45 individuals, and (ii) a set of 11 transcriptome assemblies. These two datasets provide insights congruent with previous studies: P. brutia displays high level of genetic diversity and no genetic sub-structure, while P. halepensis shows three main genetic clusters, the western Mediterranean and North African clusters displaying much lower genetic diversity than the eastern Mediterranean cluster, the latter cluster having similar genetic diversity to P. brutia. In addition, these datasets provide new insights on the timing of the species-complex history: the two species would have split at the end of the tertiary, and the changing climatic conditions of the Mediterranean region at the end of the Tertiary-beginning of the Quaternary, together with the distinct species tolerance to harsh climatic conditions would have resulted in different geographic distributions, demographic histories and genetic patterns of the two pines. The multiple glacial-interglacial cycles during the Quaternary would have led to the expansion of P. brutia in the Middle East, while P. halepensis would have been through bottlenecks. The last glaciations, from 0.6 Mya on, would have affected further the Western genetic pool of P. halepensis.
Collapse
Affiliation(s)
- Sanna Olsson
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain.
| | - Zaida Lorenzo
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Mario Zabal-Aguirre
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain
| | - Andrea Piotti
- Institute of Biosciences and Bioresources, Division of Florence, National Research Council, 50019, Sesto Fiorentino, Florence, Italy
| | - Giovanni G Vendramin
- Institute of Biosciences and Bioresources, Division of Florence, National Research Council, 50019, Sesto Fiorentino, Florence, Italy
| | - Santiago C González-Martínez
- UMR BIOGECO, INRAE, University of Bordeaux, 33610, Cestas, France
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain
| | - Delphine Grivet
- Department of Forest Ecology & Genetics, Forest Research Centre, INIA-CSIC, Carretera de la Coruña km 7.5, 28040, Madrid, Spain.
- Sustainable Forest Management Research Institute, INIA - University of Valladolid, Avda. Madrid 44, 34004, Palencia, Spain.
| |
Collapse
|
25
|
Webb A, Knoblauch J, Sabankar N, Kallur AS, Hey J, Sethuraman A. The Pop-Gen Pipeline Platform: A Software Platform for Population Genomic Analyses. Mol Biol Evol 2021; 38:3478-3485. [PMID: 33950197 PMCID: PMC8321520 DOI: 10.1093/molbev/msab113] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
The Pop-Gen Pipeline Platform (PPP) is a software platform for population genomic analyses. The PPP was designed as a collection of scripts that facilitate common population genomic workflows in a consistent and standardized Python environment. Functions were developed to encompass entire workflows, including input preparation, file format conversion, various population genomic analyses, and output generation. The platform has also been developed with reproducibility and extensibility of analyses in mind. The PPP is an open-source package that is available for download and use at https://ppp.readthedocs.io/en/latest/PPP_pages/install.html.
Collapse
Affiliation(s)
- Andrew Webb
- Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| | - Jared Knoblauch
- Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| | - Nitesh Sabankar
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA, USA
| | - Apeksha Sukesh Kallur
- Department of Biological Sciences, California State University San Marcos, San Marcos, CA, USA
| | - Jody Hey
- Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| | - Arun Sethuraman
- Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA.,Department of Biological Sciences, California State University San Marcos, San Marcos, CA, USA
| |
Collapse
|
26
|
Hill P, Wapstra E, Ezaz T, Burridge CP. Pleistocene divergence in the absence of gene flow among populations of a viviparous reptile with intraspecific variation in sex determination. Ecol Evol 2021; 11:5575-5583. [PMID: 34026030 PMCID: PMC8131762 DOI: 10.1002/ece3.7458] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2020] [Revised: 02/21/2021] [Accepted: 02/25/2021] [Indexed: 12/26/2022] Open
Abstract
Polymorphisms can lead to genetic isolation if there is differential mating success among conspecifics divergent for a trait. Polymorphism for sex-determining system may fall into this category, given strong selection for the production of viable males and females and the low success of heterogametic hybrids when sex chromosomes differ (Haldane's rule). Here we investigated whether populations exhibiting polymorphism for sex determination are genetically isolated, using the viviparous snow skink Carinascincus ocellatus. While a comparatively high elevation population has genotypic sex determination, in a lower elevation population there is an additional temperature component to sex determination. Based on 11,107 SNP markers, these populations appear genetically isolated. "Isolation with Migration" analysis also suggests these populations diverged in the absence of gene flow, across a period encompassing multiple Pleistocene glaciations and likely greater geographic proximity of populations. However, further experiments are required to establish whether genetic isolation may be a cause or consequence of differences in sex determination. Given the influence of temperature on sex in one lineage, we also discuss the implications for the persistence of this polymorphism under climate change.
Collapse
Affiliation(s)
- Peta Hill
- Discipline of Biological SciencesUniversity of TasmaniaSandy BayTas.Australia
| | - Erik Wapstra
- Discipline of Biological SciencesUniversity of TasmaniaSandy BayTas.Australia
| | - Tariq Ezaz
- Institute for Applied EcologyUniversity of CanberraBruceACTAustralia
| | | |
Collapse
|
27
|
Hollfelder N, Breton G, Sjödin P, Jakobsson M. The deep population history in Africa. Hum Mol Genet 2021; 30:R2-R10. [PMID: 33438014 PMCID: PMC8117439 DOI: 10.1093/hmg/ddab005] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 12/22/2020] [Accepted: 01/05/2021] [Indexed: 12/28/2022] Open
Abstract
Africa is the continent with the greatest genetic diversity among humans and the level of diversity is further enhanced by incorporating non-majority groups, which are often understudied. Many of today's minority populations historically practiced foraging lifestyles, which were the only subsistence strategies prior to the rise of agriculture and pastoralism, but only a few groups practicing these strategies remain today. Genomic investigations of Holocene human remains excavated across the African continent show that the genetic landscape was vastly different compared to today's genetic landscape and that many groups that today are population isolate inhabited larger regions in the past. It is becoming clear that there are periods of isolation among groups and geographic areas, but also genetic contact over large distances throughout human history in Africa. Genomic information from minority populations and from prehistoric remains provide an invaluable source of information on the human past, in particular deep human population history, as Holocene large-scale population movements obscure past patterns of population structure. Here we revisit questions on the nature and time of the radiation of early humans in Africa, the extent of gene-flow among human populations as well as introgression from archaic and extinct lineages on the continent.
Collapse
Affiliation(s)
- Nina Hollfelder
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Gwenna Breton
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Per Sjödin
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
| | - Mattias Jakobsson
- Human Evolution, Department of Organismal Biology, Uppsala University, Norbyvägen 18C, 75236 Uppsala, Sweden
- Palaeo-Research Institute, University of Johannesburg, Physical, Cnr Kingsway & University Roads, Auckland Park, Johannesburg 2092, South Africa
- SciLifeLab, Stockholm and Uppsala, Entrance C11, BMC, Husargatan 3, 752 37 Uppsala, Sweden
| |
Collapse
|
28
|
Flouri T, Jiao X, Rannala B, Yang Z. A Bayesian Implementation of the Multispecies Coalescent Model with Introgression for Phylogenomic Analysis. Mol Biol Evol 2021; 37:1211-1223. [PMID: 31825513 PMCID: PMC7086182 DOI: 10.1093/molbev/msz296] [Citation(s) in RCA: 81] [Impact Index Per Article: 20.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Recent analyses suggest that cross-species gene flow or introgression is common in nature, especially during species divergences. Genomic sequence data can be used to infer introgression events and to estimate the timing and intensity of introgression, providing an important means to advance our understanding of the role of gene flow in speciation. Here, we implement the multispecies-coalescent-with-introgression model, an extension of the multispecies-coalescent model to incorporate introgression, in our Bayesian Markov chain Monte Carlo program Bpp. The multispecies-coalescent-with-introgression model accommodates deep coalescence (or incomplete lineage sorting) and introgression and provides a natural framework for inference using genomic sequence data. Computer simulation confirms the good statistical properties of the method, although hundreds or thousands of loci are typically needed to estimate introgression probabilities reliably. Reanalysis of data sets from the purple cone spruce confirms the hypothesis of homoploid hybrid speciation. We estimated the introgression probability using the genomic sequence data from six mosquito species in the Anopheles gambiae species complex, which varies considerably across the genome, likely driven by differential selection against introgressed alleles.
Collapse
Affiliation(s)
- Tomáš Flouri
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| | - Xiyun Jiao
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| | - Bruce Rannala
- Department of Evolution and Ecology, University of California, Davis, Davis, CA
| | - Ziheng Yang
- Department of Genetics, Evolution and Environment, University College London, London, United Kingdom
| |
Collapse
|
29
|
Selberg AGA, Gaucher EA, Liberles DA. Ancestral Sequence Reconstruction: From Chemical Paleogenetics to Maximum Likelihood Algorithms and Beyond. J Mol Evol 2021; 89:157-164. [PMID: 33486547 PMCID: PMC7828096 DOI: 10.1007/s00239-021-09993-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2020] [Accepted: 01/04/2021] [Indexed: 12/13/2022]
Abstract
As both a computational and an experimental endeavor, ancestral sequence reconstruction remains a timely and important technique. Modern approaches to conduct ancestral sequence reconstruction for proteins are built upon a conceptual framework from journal founder Emile Zuckerkandl. On top of this, work on maximum likelihood phylogenetics published in Journal of Molecular Evolution in 1996 was one of the first approaches for generating maximum likelihood ancestral sequences of proteins. From its computational history, future model development needs as well as potential applications in areas as diverse as computational systems biology, molecular community ecology, infectious disease therapeutics and other biomedical applications, and biotechnology are discussed. From its past in this journal, there is a bright future for ancestral sequence reconstruction in the field of evolutionary biology.
Collapse
Affiliation(s)
- Avery G A Selberg
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA
| | - Eric A Gaucher
- Department of Biology, Georgia State University, Atlanta, GA, 30303, USA
| | - David A Liberles
- Department of Biology and Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, 19122, USA.
| |
Collapse
|
30
|
Koch H, DeGiorgio M. Maximum Likelihood Estimation of Species Trees from Gene Trees in the Presence of Ancestral Population Structure. Genome Biol Evol 2020; 12:3977-3995. [PMID: 32022857 PMCID: PMC7061232 DOI: 10.1093/gbe/evaa022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/23/2020] [Indexed: 11/12/2022] Open
Abstract
Though large multilocus genomic data sets have led to overall improvements in phylogenetic inference, they have posed the new challenge of addressing conflicting signals across the genome. In particular, ancestral population structure, which has been uncovered in a number of diverse species, can skew gene tree frequencies, thereby hindering the performance of species tree estimators. Here we develop a novel maximum likelihood method, termed TASTI (Taxa with Ancestral structure Species Tree Inference), that can infer phylogenies under such scenarios, and find that it has increasing accuracy with increasing numbers of input gene trees, contrasting with the relatively poor performances of methods not tailored for ancestral structure. Moreover, we propose a supertree approach that allows TASTI to scale computationally with increasing numbers of input taxa. We use genetic simulations to assess TASTI's performance in the three- and four-taxon settings and demonstrate the application of TASTI on a six-species Afrotropical mosquito data set. Finally, we have implemented TASTI in an open-source software package for ease of use by the scientific community.
Collapse
Affiliation(s)
- Hillary Koch
- Department of Statistics, Pennsylvania State University
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University
| |
Collapse
|
31
|
Phifer-Rixey M, Harr B, Hey J. Further resolution of the house mouse (Mus musculus) phylogeny by integration over isolation-with-migration histories. BMC Evol Biol 2020; 20:120. [PMID: 32933487 PMCID: PMC7493149 DOI: 10.1186/s12862-020-01666-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2020] [Accepted: 07/27/2020] [Indexed: 12/28/2022] Open
Abstract
BACKGROUND The three main subspecies of house mice, Mus musculus castaneus, Mus musculus domesticus, and Mus musculus musculus, are estimated to have diverged ~ 350-500KYA. Resolution of the details of their evolutionary history is complicated by their relatively recent divergence, ongoing gene flow among the subspecies, and complex demographic histories. Previous studies have been limited to some extent by the number of loci surveyed and/or by the scope of the method used. Here, we apply a method (IMa3) that provides an estimate of a population phylogeny while allowing for complex histories of gene exchange. RESULTS Results strongly support a topology with M. m. domesticus as sister to M. m. castaneus and M. m. musculus. In addition, we find evidence of gene flow between all pairs of subspecies, but that gene flow is most restricted from M. m. musculus into M. m. domesticus. Estimates of other key parameters are dependent on assumptions regarding generation time and mutation rate in house mice. Nevertheless, our results support previous findings that the effective population size, Ne, of M. m. castaneus is larger than that of the other two subspecies, that the three subspecies began diverging ~ 130 - 420KYA, and that the time between divergence events was short. CONCLUSIONS Joint demographic and phylogenetic analyses of genomic data provide a clearer picture of the history of divergence in house mice.
Collapse
Affiliation(s)
| | - Bettina Harr
- Department of Evolutionary Genetics, Max-Planck-Institute for Evolutionary Biology, Plön, Germany
| | - Jody Hey
- Department of Biology, Center for Computational Genetics and Genomics, Temple University, Philadelphia, PA, USA
| |
Collapse
|
32
|
Nowicki M, Houston LC, Boggess SL, Aiello AS, Payá‐Milans M, Staton ME, Hayashida M, Yamanaka M, Eda S, Trigiano RN. Species diversity and phylogeography of Cornus kousa (Asian dogwood) captured by genomic and genic microsatellites. Ecol Evol 2020; 10:8299-8312. [PMID: 32788980 PMCID: PMC7417245 DOI: 10.1002/ece3.6537] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2020] [Accepted: 06/09/2020] [Indexed: 01/01/2023] Open
Abstract
Cornus kousa (Asian dogwood), an East Asia native tree, is the most economically important species of the dogwood genus, owing to its desirable horticultural traits and ability to hybridize with North America-native dogwoods. To assess the species genetic diversity and to better inform the ongoing and future breeding efforts, we assembled an herbarium and arboretum collection of 131 noncultivated C. kousa specimens. Genotyping and capillary electrophoresis analyses of our C. kousa collection with the newly developed genic and published nuclear genomic microsatellites permitted assessment of genetic diversity and evolutionary history of the species. Regardless of the microsatellite type used, the study yielded generally similar insights into the C. kousa diversity with subtle differences deriving from and underlining the marker used. The accrued evidence pointed to the species distinct genetic pools related to the plant country of origin. This can be helpful in the development of the commercial cultivars for this important ornamental crop with increased pyramided utility traits. Analyses of the C. kousa evolutionary history using the accrued genotyping datasets pointed to an unsampled ancestor population, possibly now extinct, as per the phylogeography of the region. To our knowledge, there are few studies utilizing the same gDNA collection to compare performance of genomic and genic microsatellites. This is the first detailed report on C. kousa species diversity and evolutionary history inference.
Collapse
Affiliation(s)
- Marcin Nowicki
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
| | - Logan C. Houston
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
| | - Sarah L. Boggess
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
| | | | - Miriam Payá‐Milans
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
- Present address:
Centro de Biotecnología y Genómica de PlantasUPM‐INIAMadridSpain
| | - Margaret E. Staton
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
| | | | - Masahiro Yamanaka
- Department of Pharmaceytical SciencesInternational University of Health and WelfareOhtawaraJapan
| | - Shigetoshi Eda
- Department of Forestry, Wildlife and FisheriesCenter for Wildlife Health ORUKnoxvilleTNUSA
- Department of MicrobiologyCenter for Wildlife Health ORUKnoxvilleTNUSA
| | - Robert N. Trigiano
- Department of Entomology and Plant PathologyThe University of TennesseeKnoxvilleTNUSA
| |
Collapse
|
33
|
A maximum likelihood approach to infer demographic models. COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS 2020. [DOI: 10.29220/csam.2020.27.3.385] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
34
|
Abstract
Knowing phylogenetic relationships among species is fundamental for many studies in biology. An accurate phylogenetic tree underpins our understanding of the major transitions in evolution, such as the emergence of new body plans or metabolism, and is key to inferring the origin of new genes, detecting molecular adaptation, understanding morphological character evolution and reconstructing demographic changes in recently diverged species. Although data are ever more plentiful and powerful analysis methods are available, there remain many challenges to reliable tree building. Here, we discuss the major steps of phylogenetic analysis, including identification of orthologous genes or proteins, multiple sequence alignment, and choice of substitution models and inference methodologies. Understanding the different sources of errors and the strategies to mitigate them is essential for assembling an accurate tree of life.
Collapse
|
35
|
Liu J, Liu Q, Yang Q. mstree: A Multispecies Coalescent Approach for Estimating Ancestral Population Size and Divergence Time during Speciation with Gene Flow. Genome Biol Evol 2020; 12:715-719. [PMID: 32365209 PMCID: PMC7259675 DOI: 10.1093/gbe/evaa087] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/27/2020] [Indexed: 11/28/2022] Open
Abstract
Gene flow between species may cause variations in branch length and topology of gene tree, which are beyond the expected variations from ancestral processes. These additional variations make it difficult to estimate parameters during speciation with gene flow, as the pattern of these additional variations differs with the relationship between isolation and migration. As far as we know, most methods rely on the assumption about the relationship between isolation and migration by a given model, such as the isolation-with-migration model, when estimating parameters during speciation with gene flow. In this article, we develop a multispecies coalescent approach which does not rely on any assumption about the relationship between isolation and migration when estimating parameters and is called mstree. mstree is available at https://github.com/liujunfengtop/MStree/ and uses some mathematical inequalities among several factors, which include the species divergence time, the ancestral population size, and the number of gene trees, to estimate parameters during speciation with gene flow. Using simulations, we show that the estimated values of ancestral population sizes and species divergence times are close to the true values when analyzing the simulation data sets, which are generated based on the isolation-with-initial-migration model, secondary contact model, and isolation-with-migration model. Therefore, our method is able to estimate ancestral population sizes and speciation times in the presence of different modes of gene flow and may be helpful to test different theories of speciation.
Collapse
Affiliation(s)
- Junfeng Liu
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China
| | - Qiao Liu
- Department of Automation, Tsinghua University, Beijing, China
| | - Qingzhu Yang
- Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China.,Department of Automation, Tsinghua University, Beijing, China
| |
Collapse
|
36
|
Gabrielli M, Nabholz B, Leroy T, Milá B, Thébaud C. Within-island diversification in a passerine bird. Proc Biol Sci 2020; 287:20192999. [PMID: 32183633 DOI: 10.1098/rspb.2019.2999] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The presence of congeneric taxa on the same island suggests the possibility of in situ divergence, but can also result from multiple colonizations of previously diverged lineages. Here, using genome-wide data from a large population sample, we test the hypothesis that intra-island divergence explains the occurrence of four geographical forms meeting at hybrid zones in the Reunion grey white-eye (Zosterops borbonicus), a species complex endemic to the small volcanic island of Reunion. Using population genomic and phylogenetic analyses, we reconstructed the population history of the different forms. We confirmed the monophyly of the complex and found that one of the lowland forms is paraphyletic and basal relative to others, a pattern highly consistent with in situ divergence. Our results suggest initial colonization of the island through the lowlands, followed by expansion into the highlands, which led to the evolution of a distinct geographical form, genetically and ecologically different from the lowland ones. Lowland forms seem to have experienced periods of geographical isolation, but they diverged from one another by sexual selection rather than niche change. Overall, low dispersal capabilities in this island bird combined with both geographical and ecological opportunities seem to explain how divergence occurred at such a small spatial scale.
Collapse
Affiliation(s)
- Maëva Gabrielli
- Laboratoire Évolution et Diversité Biologique (EDB), UMR 5174 (Université Paul Sabatier, CNRS, IRD), Toulouse, France
| | - Benoit Nabholz
- Institut des Sciences de l'Evolution de Montpellier, UMR 5554 (Université de Montpellier, CNRS, IRD, EPHE), Montpellier, France
| | - Thibault Leroy
- Institut des Sciences de l'Evolution de Montpellier, UMR 5554 (Université de Montpellier, CNRS, IRD, EPHE), Montpellier, France
| | - Borja Milá
- National Museum of Natural Sciences, Spanish National Research Council (CSIC), Madrid, Spain
| | - Christophe Thébaud
- Laboratoire Évolution et Diversité Biologique (EDB), UMR 5174 (Université Paul Sabatier, CNRS, IRD), Toulouse, France
| |
Collapse
|
37
|
Durvasula A, Sankararaman S. Recovering signals of ghost archaic introgression in African populations. SCIENCE ADVANCES 2020; 6:eaax5097. [PMID: 32095519 PMCID: PMC7015685 DOI: 10.1126/sciadv.aax5097] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/29/2019] [Accepted: 12/03/2019] [Indexed: 05/18/2023]
Abstract
While introgression from Neanderthals and Denisovans has been documented in modern humans outside Africa, the contribution of archaic hominins to the genetic variation of present-day Africans remains poorly understood. We provide complementary lines of evidence for archaic introgression into four West African populations. Our analyses of site frequency spectra indicate that these populations derive 2 to 19% of their genetic ancestry from an archaic population that diverged before the split of Neanderthals and modern humans. Using a method that can identify segments of archaic ancestry without the need for reference archaic genomes, we built genome-wide maps of archaic ancestry in the Yoruba and the Mende populations. Analyses of these maps reveal segments of archaic ancestry at high frequency in these populations that represent potential targets of adaptive introgression. Our results reveal the substantial contribution of archaic ancestry in shaping the gene pool of present-day West African populations.
Collapse
Affiliation(s)
- Arun Durvasula
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| | - Sriram Sankararaman
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, USA
- Bioinformatics Interdepartmental Program, University of California, Los Angeles, Los Angeles, CA, USA
- Department of Computational Medicine, University of California, Los Angeles, Los Angeles, CA, USA
| |
Collapse
|
38
|
Duckett DJ, Pelletier TA, Carstens BC. Identifying model violations under the multispecies coalescent model using P2C2M.SNAPP. PeerJ 2020; 8:e8271. [PMID: 31949994 PMCID: PMC6956792 DOI: 10.7717/peerj.8271] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2019] [Accepted: 11/22/2019] [Indexed: 11/20/2022] Open
Abstract
Phylogenetic estimation under the multispecies coalescent model (MSCM) assumes all incongruence among loci is caused by incomplete lineage sorting. Therefore, applying the MSCM to datasets that contain incongruence that is caused by other processes, such as gene flow, can lead to biased phylogeny estimates. To identify possible bias when using the MSCM, we present P2C2M.SNAPP. P2C2M.SNAPP is an R package that identifies model violations using posterior predictive simulation. P2C2M.SNAPP uses the posterior distribution of species trees output by the software package SNAPP to simulate posterior predictive datasets under the MSCM, and then uses summary statistics to compare either the empirical data or the posterior distribution to the posterior predictive distribution to identify model violations. In simulation testing, P2C2M.SNAPP correctly classified up to 83% of datasets (depending on the summary statistic used) as to whether or not they violated the MSCM model. P2C2M.SNAPP represents a user-friendly way for researchers to perform posterior predictive model checks when using the popular SNAPP phylogenetic estimation program. It is freely available as an R package, along with additional program details and tutorials.
Collapse
Affiliation(s)
- Drew J Duckett
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, OH, USA
| | | | - Bryan C Carstens
- Department of Evolution, Ecology, and Organismal Biology, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
39
|
Chung Y. Recent advances in Bayesian inference of isolation-with-migration models. Genomics Inform 2020; 17:e37. [PMID: 31896237 PMCID: PMC6944047 DOI: 10.5808/gi.2019.17.4.e37] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2019] [Accepted: 10/23/2019] [Indexed: 12/03/2022] Open
Abstract
Isolation-with-migration (IM) models have become popular for explaining population divergence in the presence of migrations. Bayesian methods are commonly used to estimate IM models, but they are limited to small data analysis or simple model inference. Recently three methods, IMa3, MIST, and AIM, resolved these limitations. Here, we describe the major problems addressed by these three software and compare differences among their inference methods, despite their use of the same standard likelihood function.
Collapse
Affiliation(s)
- Yujin Chung
- Department of Applied Statistics, Kyonggi University, Suwon 16227, Korea
| |
Collapse
|
40
|
Lipson M, Ribot I, Mallick S, Rohland N, Olalde I, Adamski N, Broomandkhoshbacht N, Lawson AM, López S, Oppenheimer J, Stewardson K, Asombang RN, Bocherens H, Bradman N, Culleton BJ, Cornelissen E, Crevecoeur I, de Maret P, Fomine FLM, Lavachery P, Mindzie CM, Orban R, Sawchuk E, Semal P, Thomas MG, Van Neer W, Veeramah KR, Kennett DJ, Patterson N, Hellenthal G, Lalueza-Fox C, MacEachern S, Prendergast ME, Reich D. Ancient West African foragers in the context of African population history. Nature 2020; 577:665-670. [PMID: 31969706 PMCID: PMC8386425 DOI: 10.1038/s41586-020-1929-1] [Citation(s) in RCA: 58] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2018] [Accepted: 11/29/2019] [Indexed: 12/31/2022]
Abstract
Our knowledge of ancient human population structure in sub-Saharan Africa, particularly prior to the advent of food production, remains limited. Here we report genome-wide DNA data from four children-two of whom were buried approximately 8,000 years ago and two 3,000 years ago-from Shum Laka (Cameroon), one of the earliest known archaeological sites within the probable homeland of the Bantu language group1-11. One individual carried the deeply divergent Y chromosome haplogroup A00, which today is found almost exclusively in the same region12,13. However, the genome-wide ancestry profiles of all four individuals are most similar to those of present-day hunter-gatherers from western Central Africa, which implies that populations in western Cameroon today-as well as speakers of Bantu languages from across the continent-are not descended substantially from the population represented by these four people. We infer an Africa-wide phylogeny that features widespread admixture and three prominent radiations, including one that gave rise to at least four major lineages deep in the history of modern humans.
Collapse
Affiliation(s)
- Mark Lipson
- Department of Genetics, Harvard Medical School, Boston, MA, USA.
| | - Isabelle Ribot
- Département d'Anthropologie, Université de Montréal, Montreal, Quebec, Canada
| | - Swapan Mallick
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nadin Rohland
- Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - Iñigo Olalde
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Institute of Evolutionary Biology (CSIC-UPF), Barcelona, Spain
| | - Nicole Adamski
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Nasreen Broomandkhoshbacht
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Anthropology, University of California, Santa Cruz, CA, USA
| | - Ann Marie Lawson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | - Saioa López
- UCL Genetics Institute, University College London, London, UK
| | - Jonas Oppenheimer
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Biomolecular Engineering, University of California, Santa Cruz, CA, USA
| | - Kristin Stewardson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
| | | | - Hervé Bocherens
- Department of Geosciences, Biogeology, University of Tübingen, Tübingen, Germany
- Senckenberg Research Centre for Human Evolution and Palaeoenvironment, University of Tübingen, Tübingen, Germany
| | - Neil Bradman
- UCL Genetics Institute, University College London, London, UK
- The Henry Stewart Group, London, UK
| | - Brendan J Culleton
- Institutes of Energy and the Environment, Pennsylvania State University, University Park, PA, USA
| | - Els Cornelissen
- Department of Cultural Anthropology and History, Royal Museum for Central Africa, Tervuren, Belgium
| | | | - Pierre de Maret
- Faculté de Philosophie et Sciences Sociales, Université Libre de Bruxelles, Brussels, Belgium
| | | | - Philippe Lavachery
- Agence Wallonne du Patrimoine, Service Public de Wallonie, Namur, Belgium
| | | | - Rosine Orban
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
| | - Elizabeth Sawchuk
- Department of Anthropology, Stony Brook University, Stony Brook, NY, USA
| | - Patrick Semal
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
| | - Mark G Thomas
- UCL Genetics Institute, University College London, London, UK
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | - Wim Van Neer
- Royal Belgian Institute of Natural Sciences, Brussels, Belgium
- Department of Biology, University of Leuven, Leuven, Belgium
| | - Krishna R Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY, USA
| | - Douglas J Kennett
- Department of Anthropology, University of California, Santa Barbara, CA, USA
| | - Nick Patterson
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| | - Garrett Hellenthal
- UCL Genetics Institute, University College London, London, UK
- Department of Genetics, Evolution and Environment, University College London, London, UK
| | | | - Scott MacEachern
- Division of Social Science, Duke Kunshan University, Kunshan, China
| | - Mary E Prendergast
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Department of Sociology and Anthropology, Saint Louis University, Madrid, Spain
| | - David Reich
- Department of Genetics, Harvard Medical School, Boston, MA, USA
- Medical and Population Genetics Program, Broad Institute of MIT and Harvard, Cambridge, MA, USA
- Howard Hughes Medical Institute, Harvard Medical School, Boston, MA, USA
- Department of Human Evolutionary Biology, Harvard University, Cambridge, MA, USA
| |
Collapse
|
41
|
Marko PB, Zaslavskaya NI. Geographic origin and timing of colonization of the Pacific Coast of North America by the rocky shore gastropod Littorina sitkana. PeerJ 2019; 7:e7987. [PMID: 31720112 PMCID: PMC6836758 DOI: 10.7717/peerj.7987] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2019] [Accepted: 10/04/2019] [Indexed: 11/20/2022] Open
Abstract
The demographic history of a species can have a lasting impact on its contemporary population genetic structure. Northeastern Pacific (NEP) populations of the rocky shore gastropod Littorina sitkana have very little mitochondrial DNA (mtDNA) sequence diversity and show no significant population structure despite lacking dispersive planktonic larvae. A contrasting pattern of high mtDNA diversity in the northwestern Pacific (NWP) suggests that L. sitkana may have recently colonized the NEP from the NWP via stepping-stone colonization through the Aleutian-Commander Archipelago (ACA) following the end of the last glacial 20,000 years ago. Here, we use multi-locus sequence data to test that hypothesis using a combination of descriptive statistics and population divergence modeling aimed at resolving the timing and the geographic origin of NEP populations. Our results show that NEP populations share a common ancestor with a population of L. sitkana on the Kamchatka Peninsula ∼46,900 years ago and that NEP populations diverged from each other ∼21,400 years ago. A more recent population divergence between Kamchatka and NEP populations, than between Kamchatka and other populations in the NWP, suggests that the ACA was the most probable dispersal route. Taking into account the confidence intervals for the estimates, we conservatively estimate that L. sitkana arrived in the NEP between 107,400 and 4,100 years ago, a range of dates that is compatible with post-glacial colonization of the NEP. Unlike other congeners that are relatively abundant in the Pleistocene fossil record of the NEP, only one report of L. sitkana exists from the NEP fossil record. Although broadly consistent with the molecular data, the biogeographic significance of these fossils is difficult to evaluate, as the shells cannot be distinguished from the closely-related congener L. subrotundata.
Collapse
Affiliation(s)
- Peter B Marko
- School of Life Sciences, University of Hawai'i at Mānoa, Honolulu, Hawai'i, United States of America
| | - Nadezhda I Zaslavskaya
- National Scientific Center of Marine Biology, Far Eastern Branch, Russian Academy of Sciences, Vladivostok, Russian Federation
| |
Collapse
|
42
|
Wollenberg Valero KC, Marshall JC, Bastiaans E, Caccone A, Camargo A, Morando M, Niemiller ML, Pabijan M, Russello MA, Sinervo B, Werneck FP, Sites JW, Wiens JJ, Steinfartz S. Patterns, Mechanisms and Genetics of Speciation in Reptiles and Amphibians. Genes (Basel) 2019; 10:genes10090646. [PMID: 31455040 PMCID: PMC6769790 DOI: 10.3390/genes10090646] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2019] [Revised: 07/21/2019] [Accepted: 08/05/2019] [Indexed: 12/22/2022] Open
Abstract
In this contribution, the aspects of reptile and amphibian speciation that emerged from research performed over the past decade are reviewed. First, this study assesses how patterns and processes of speciation depend on knowing the taxonomy of the group in question, and discuss how integrative taxonomy has contributed to speciation research in these groups. This study then reviews the research on different aspects of speciation in reptiles and amphibians, including biogeography and climatic niches, ecological speciation, the relationship between speciation rates and phenotypic traits, and genetics and genomics. Further, several case studies of speciation in reptiles and amphibians that exemplify many of these themes are discussed. These include studies of integrative taxonomy and biogeography in South American lizards, ecological speciation in European salamanders, speciation and phenotypic evolution in frogs and lizards. The final case study combines genomics and biogeography in tortoises. The field of amphibian and reptile speciation research has steadily moved forward from the assessment of geographic and ecological aspects, to incorporating other dimensions of speciation, such as genetic mechanisms and evolutionary forces. A higher degree of integration among all these dimensions emerges as a goal for future research.
Collapse
Affiliation(s)
| | - Jonathon C Marshall
- Department of Zoology, Weber State University, 1415 Edvalson Street, Dept. 2505, Ogden, UT 84401, USA
| | - Elizabeth Bastiaans
- Department of Biology, State University of New York, College at Oneonta, Oneonta, NY 13820, USA
| | - Adalgisa Caccone
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| | - Arley Camargo
- Centro Universitario de Rivera, Universidad de la República, Ituzaingó 667, Rivera 40000, Uruguay
| | - Mariana Morando
- Instituto Patagónico para el Estudio de los Ecosistemas Continentales (IPEEC, CENPAT-CONICET) Bv. Brown 2915, Puerto Madryn U9120ACD, Argentina
| | - Matthew L Niemiller
- Department of Biological Sciences, The University of Alabama in Huntsville, Huntsville, AL 35899, USA
| | - Maciej Pabijan
- Department of Comparative Anatomy, Institute of Zoology and Biomedical Research, Jagiellonian University, ul. Gronostajowa 9, 30-387 Kraków, Poland
| | - Michael A Russello
- Department of Biology, University of British Columbia, Okanagan Campus, 3247 University Way, Kelowna, BC V1V 1V7, Canada
| | - Barry Sinervo
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, Coastal Biology Building, 130 McAllister Way, Santa Cruz, CA 95060, USA
| | - Fernanda P Werneck
- Programa de Coleções Científicas Biológicas, Coordenação de Biodiversidade, Instituto Nacional de Pesquisas da Amazônia, Manaus 69060-000, Brazil
| | - Jack W Sites
- Department of Biological and Marine Sciences, University of Hull, Cottingham Road, Hull HU6 7RX, UK
| | - John J Wiens
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA
| | - Sebastian Steinfartz
- Molecular Evolution and Systematics of Animals, Institute of Biology, University of Leipzig, Talstrasse 33, 04103 Leipzig, Germany
| |
Collapse
|
43
|
Gundlach S, Junge O, Wienbrandt L, Krawczak M, Caliebe A. Comparison of Markov Chain Monte Carlo Software for the Evolutionary Analysis of Y-Chromosomal Microsatellite Data. Comput Struct Biotechnol J 2019; 17:1082-1090. [PMID: 31452861 PMCID: PMC6700485 DOI: 10.1016/j.csbj.2019.07.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2019] [Revised: 06/17/2019] [Accepted: 07/25/2019] [Indexed: 11/28/2022] Open
Abstract
The evolutionary analysis of genetic data is an important subject of modern bioscience, with practical applications in diverse fields. Parameters of interest in this context include effective population sizes, mutation rates, population growth rates and the times to most recent common ancestors. Studying Y-chromosomal microsatellite data, in particular, has proven useful to unravel the recent patrilineal history of Homo sapiens populations. We compared the individual analysis options and technical details of four software tools that are widely used for this purpose, namely BATWING, BEAST, IMa2 and LAMARC, all of which use Bayesian coalescent-based Markov chain Monte Carlo (MCMC) methods for parameter estimation. More specifically, we simulated datasets for either eight or 20 hypothetical Y-chromosomal microsatellites, assuming a mutation rate of 0.0030 per generation and a constant or exponentially increasing population size, and used these data to evaluate the parameter estimation capacity of each tool. The datasets comprised between 100 and 1000 samples. In addition to runtime, the practical utility of the tools of interest can also be expected to depend critically upon the convergence behavior of the actual MCMC implementation. In fact, we found that runtime increased, and convergence rate decreased, with increasing sample size as expected. BATWING performed best with respect to runtime and convergence behavior, but only supports simple evolutionary models. As regards the spectrum of evolutionary models covered, and also in terms of cross-platform usability, BEAST provided the greatest flexibility. Finally, IMa2 and LAMARC turned out best to incorporate elaborate migration models in the analysis process.
Collapse
Affiliation(s)
- Sven Gundlach
- Institute of Medical Informatics and Statistics, Kiel University, University Hospital Schleswig-Holstein, Brunswiker Strasse 10, 24105 Kiel, Germany
| | - Olaf Junge
- Institute of Medical Informatics and Statistics, Kiel University, University Hospital Schleswig-Holstein, Brunswiker Strasse 10, 24105 Kiel, Germany
| | - Lars Wienbrandt
- Institute of Clinical Molecular Biology, Kiel University, University Hospital Schleswig-Holstein, Rosalind-Franklin-Strasse 12, 24105 Kiel, Germany
| | - Michael Krawczak
- Institute of Medical Informatics and Statistics, Kiel University, University Hospital Schleswig-Holstein, Brunswiker Strasse 10, 24105 Kiel, Germany
| | - Amke Caliebe
- Institute of Medical Informatics and Statistics, Kiel University, University Hospital Schleswig-Holstein, Brunswiker Strasse 10, 24105 Kiel, Germany
| |
Collapse
|
44
|
Ragsdale AP, Gravel S. Models of archaic admixture and recent history from two-locus statistics. PLoS Genet 2019; 15:e1008204. [PMID: 31181058 PMCID: PMC6586359 DOI: 10.1371/journal.pgen.1008204] [Citation(s) in RCA: 46] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2019] [Revised: 06/20/2019] [Accepted: 05/17/2019] [Indexed: 11/18/2022] Open
Abstract
We learn about population history and underlying evolutionary biology through patterns of genetic polymorphism. Many approaches to reconstruct evolutionary histories focus on a limited number of informative statistics describing distributions of allele frequencies or patterns of linkage disequilibrium. We show that many commonly used statistics are part of a broad family of two-locus moments whose expectation can be computed jointly and rapidly under a wide range of scenarios, including complex multi-population demographies with continuous migration and admixture events. A full inspection of these statistics reveals that widely used models of human history fail to predict simple patterns of linkage disequilibrium. To jointly capture the information contained in classical and novel statistics, we implemented a tractable likelihood-based inference framework for demographic history. Using this approach, we show that human evolutionary models that include archaic admixture in Africa, Asia, and Europe provide a much better description of patterns of genetic diversity across the human genome. We estimate that an unidentified, deeply diverged population admixed with modern humans within Africa both before and after the split of African and Eurasian populations, contributing 4 - 8% genetic ancestry to individuals in world-wide populations.
Collapse
Affiliation(s)
- Aaron P Ragsdale
- Department of Human Genetics, McGill University, Montreal, QC, Canada
| | - Simon Gravel
- Department of Human Genetics, McGill University, Montreal, QC, Canada
| |
Collapse
|