1
Kent TV, Schrider DR, Matute DR. Demographic History, Genetic Load, and the Efficacy of Selection in the Globally Invasive Mosquito Aedes aegypti. Genome Biol Evol 2025; 17:evaf066. [PMID: 40181735 PMCID: PMC12034524 DOI: 10.1093/gbe/evaf066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/13/2025] [Accepted: 03/21/2025] [Indexed: 04/05/2025] Open
Abstract
Aedes aegypti is the main vector species of yellow fever, dengue, Zika, and chikungunya. The species is originally from Africa but has experienced a spectacular expansion in its geographic range to a large swath of the world, the demographic effects of which have remained largely understudied. In this report, we examine whole-genome sequences from six countries in Africa, North America, and South America to investigate the demographic history of the spread of A. aegypti into the Americas and its impact on genomic diversity and deleterious genetic load. In the Americas, we observe patterns of strong population structure consistent with relatively low (but probably nonzero) levels of gene flow but occasional long-range dispersal and/or recolonization events. We also find evidence that the colonization of the Americas has resulted in introduction bottlenecks. However, while each sampling location shows evidence of a past population contraction and subsequent recovery, our results suggest that the bottlenecks in America have led to a reduction in genetic diversity of only ∼35% relative to African populations, and the American samples have retained high levels of genetic diversity (expected heterozygosity of ∼0.02 at synonymous sites). We additionally find that American populations of A. aegypti have experienced only a minor reduction in the efficacy of selection, with evidence for both an accumulation of deleterious alleles and some purging of strongly deleterious alleles. These results exemplify how an invasive species can expand its range with remarkable genetic resilience in the face of strong eradication pressure.
Affiliation(s)
- Tyler V Kent
  - Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
  - Department of Biology, College of Arts and Sciences, University of North Carolina, Chapel Hill, NC, USA
  - Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC, USA
- Daniel R Schrider
  - Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC, USA
- Daniel R Matute
  - Department of Biology, College of Arts and Sciences, University of North Carolina, Chapel Hill, NC, USA

2
Adams R, Lozano JR, Duncan M, Green J, Assis R, DeGiorgio M. A Tale of Too Many Trees: A Conundrum for Phylogenetic Regression. Mol Biol Evol 2025; 42:msaf032. [PMID: 39930867 PMCID: PMC11884811 DOI: 10.1093/molbev/msaf032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/03/2024] [Revised: 12/20/2024] [Accepted: 01/21/2025] [Indexed: 03/08/2025] Open
Abstract
Just exactly which tree(s) should we assume when testing evolutionary hypotheses? This question has plagued comparative biologists for decades. Though all phylogenetic comparative methods require input trees, we seldom know with certainty whether even a perfectly estimated tree (if this is possible in practice) is appropriate for our studied traits. Yet, we also know that phylogenetic conflict is ubiquitous in modern comparative biology, and we are still learning about its dangers when testing evolutionary hypotheses. Here, we investigate the consequences of tree-trait mismatch for phylogenetic regression in the presence of gene tree-species tree conflict. Our simulation experiments reveal excessively high false positive rates for mismatched models with both small and large trees, simple and complex traits, and known and estimated phylogenies. In some cases, we find evidence of a directionality of error: assuming a species tree for traits that evolved according to a gene tree sometimes fares worse than the opposite. We also explored the impacts of tree choice using an expansive, cross-species gene expression dataset as an arguably "best-case" scenario in which one may have a better chance of matching tree with trait. Offering a potential path forward, we found promise in the application of a robust estimator as a potential, albeit imperfect, solution to some issues raised by tree mismatch. Collectively, our results emphasize the importance of careful study design for comparative methods, highlighting the need to fully appreciate the role of accurate and thoughtful phylogenetic modeling.
Affiliation(s)
- Richard Adams
  - Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
  - Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
- Jenniffer Roa Lozano
  - Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
  - Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
- Mataya Duncan
  - Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
  - Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
- Jack Green
  - Department of Entomology and Plant Pathology, University of Arkansas, Fayetteville, AR, USA
  - Center for Agricultural Data Analytics, University of Arkansas, Fayetteville, AR, USA
- Raquel Assis
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA
  - Institute for Human Health and Disease Intervention, Florida Atlantic University, Boca Raton, FL, USA
- Michael DeGiorgio
  - Department of Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL, USA

3
Atağ G, Waldman S, Carmi S, Somel M. An explanation for the sister repulsion phenomenon in Patterson's f-statistics. Genetics 2024; 228:iyae144. [PMID: 39292210 PMCID: PMC11538414 DOI: 10.1093/genetics/iyae144] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/29/2024] [Accepted: 08/19/2024] [Indexed: 09/19/2024] Open
Abstract
Patterson's f-statistics are among the most heavily utilized tools for analyzing genome-wide allele frequency data for demographic inference. Beyond studying admixture, f3- and f4-statistics are also used for clustering populations to identify groups with similar histories. However, previous studies have noted an unexpected behavior of f-statistics: multiple populations from a certain region systematically show higher genetic affinity to a more distant population than to their neighbors, a pattern that is mismatched with alternative measures of genetic similarity. We call this counter-intuitive pattern "sister repulsion". We first present a novel instance of sister repulsion, where genomes from Bronze Age East Anatolian sites show higher affinity toward Bronze Age Greece rather than each other. This is observed both using f3- and f4-statistics, contrasts with archaeological/historical expectation, and also contradicts genetic affinity patterns captured using principal components analysis or multidimensional scaling on genetic distances. We then propose a simple demographic model to explain this pattern, where sister populations receive gene flow from a genetically distant source. We calculate f3- and f4-statistics using simulated genetic data with varying population genetic parameters, confirming that low-level gene flow from an external source into populations from 1 region can create sister repulsion in f-statistics. Unidirectional gene flow between the studied regions (without an external source) can likewise create repulsion. Meanwhile, similar to our empirical observations, multidimensional scaling analyses of genetic distances still cluster sister populations together. Overall, our results highlight the impact of low-level admixture events when inferring demographic history using f-statistics.
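As a rough illustration of the quantities this abstract refers to, the sketch below computes f3 and f4 from per-SNP population allele frequencies, using the standard definitions as averages of products of frequency differences. It is a minimal sketch, not the authors' pipeline: the function names and toy frequencies are hypothetical, and the finite-sample bias correction usually applied to f3 is omitted.

```python
from statistics import mean


def f4(a, b, c, d):
    """f4(A, B; C, D): mean over SNPs of (a - b) * (c - d).

    Each argument is a sequence of allele frequencies (one value per SNP)
    for one population. Values near zero suggest the A/B and C/D
    frequency differences are uncorrelated.
    """
    return mean((ai - bi) * (ci - di) for ai, bi, ci, di in zip(a, b, c, d))


def f3(c, a, b):
    """f3(C; A, B): mean over SNPs of (c - a) * (c - b).

    Uncorrected form (assumption: no finite-sample bias correction).
    A clearly negative value is the classic signal that C is admixed
    between sources related to A and B.
    """
    return mean((ci - ai) * (ci - bi) for ci, ai, bi in zip(c, a, b))


# Toy frequencies, invented purely for illustration.
f4_identical = f4([0.1, 0.5], [0.1, 0.5], [0.2, 0.9], [0.7, 0.3])
f4_shared = f4([1.0, 0.0], [0.0, 0.0], [1.0, 0.0], [0.0, 0.0])
f3_admixed = f3([0.5], [0.0], [1.0])
```

In this toy input, the target population in `f3_admixed` sits exactly between the two sources, which yields a negative value.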
Affiliation(s)
- Gözde Atağ
  - Department of Biological Sciences, Middle East Technical University, Ankara 06800, Turkey
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig 04103, Germany
- Shamam Waldman
  - Department of Human Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Shai Carmi
  - Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, Jerusalem 9112102, Israel
- Mehmet Somel
  - Department of Biological Sciences, Middle East Technical University, Ankara 06800, Turkey

4
Lanfear R, Hahn MW. The Meaning and Measure of Concordance Factors in Phylogenomics. Mol Biol Evol 2024; 41:msae214. [PMID: 39418118 PMCID: PMC11532913 DOI: 10.1093/molbev/msae214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/22/2023] [Revised: 09/25/2024] [Accepted: 10/04/2024] [Indexed: 10/19/2024] Open
Abstract
As phylogenomic datasets have grown in size, researchers have developed new ways to measure biological variation and to assess statistical support for specific branches. Larger datasets have more sites and loci and therefore less sampling variance. While we can more accurately measure the mean signal in these datasets, lower sampling variance is often reflected in uniformly high measures of branch support (such as the bootstrap and posterior probability), limiting their utility. Larger datasets have also revealed substantial biological variation in the topologies found across individual loci, such that the single species tree inferred by most phylogenetic methods represents a limited summary of the data for many purposes. In contrast to measures of statistical support, the degree of underlying topological variation among loci should be approximately constant regardless of the size of the dataset. "Concordance factors" (CFs) and similar statistics have therefore become increasingly important tools in phylogenetics. In this review, we explain why CFs should be thought of as descriptors of topological variation rather than as measures of statistical support, and argue that they provide important information about the predictive power of the species tree not contained in measures of support. We review a growing suite of statistics for measuring concordance, compare them in a common framework that reveals their interrelationships, and demonstrate how to calculate them using an example from birds. We also discuss how measures of topological variation might change in the future as we move beyond estimating a single "tree of life" toward estimating the myriad evolutionary histories underlying genomic variation.
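To make the idea of a concordance factor concrete: a gene concordance factor for a species-tree branch is commonly reported as the percentage of decisive gene trees that contain that branch. The sketch below is an illustration under simplifying assumptions, not the review's worked example: it assumes every gene tree samples all taxa (so every tree is decisive) and that each tree has already been reduced to a set of canonical splits. The function name and toy data are hypothetical.

```python
def gene_concordance_factor(branch, gene_tree_splits):
    """Percentage of gene trees whose split set contains `branch`.

    branch: frozenset of taxon names on one side of the species-tree branch.
    gene_tree_splits: list with one set of frozensets per gene tree.
    Assumption: splits are canonicalized the same way everywhere
    (e.g. always the side that excludes a fixed reference taxon).
    """
    if not gene_tree_splits:
        raise ValueError("need at least one gene tree")
    concordant = sum(1 for splits in gene_tree_splits if branch in splits)
    return 100.0 * concordant / len(gene_tree_splits)


# Toy example: three gene trees group A with B, one groups A with C.
ab = frozenset({"A", "B"})
ac = frozenset({"A", "C"})
toy_gcf = gene_concordance_factor(ab, [{ab}, {ab}, {ab}, {ac}])
```

A value like this describes how much the loci vary in topology around the branch; it is not a measure of statistical support, which is the distinction the abstract draws.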
Affiliation(s)
- Robert Lanfear
  - Ecology and Evolution, Research School of Biology, Australian National University, Canberra, Australia
- Matthew W Hahn
  - Department of Biology, Indiana University, Bloomington, IN, USA
  - Department of Computer Science, Indiana University, Bloomington, IN, USA

5
Mackintosh A, Setter D. Genealogical asymmetry under the IM model and a two-taxon test for gene flow. Genetics 2024; 228:iyae157. [PMID: 39344660 PMCID: PMC11631468 DOI: 10.1093/genetics/iyae157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/24/2024] [Accepted: 09/26/2024] [Indexed: 10/01/2024] Open
Abstract
Methods for detecting gene flow between populations often rely on asymmetry in the average length of particular genealogical branches, with the ABBA-BABA test being a well known example. Currently, asymmetry-based methods cannot be applied to a pair of populations and such analyses are instead performed using model-based methods. Here we investigate genealogical asymmetry under a two-population Isolation with Migration model. We focus on genealogies where the first coalescence event is between lineages sampled from different populations, as the external branches of these genealogies have equal expected length as long as there is no post-divergence gene flow. We show that unidirectional gene flow breaks this symmetry and results in the recipient population having longer external branches. We derive expectations for the probability of this genealogical asymmetry and propose a simple statistic (Am) to detect it from genome sequence data. Am provides a two-taxon test for gene flow that only requires a single unphased diploid genome from each population, with no outgroup information. We use analytic expectations and simulations to explore how recombination, unequal effective population sizes, bidirectional gene flow and background selection influence Am and find that the statistic provides unambiguous evidence for gene flow under a continent-island history. We estimate Am for genome sequence data from Heliconius butterflies and Odocoileus deer, generating results consistent with previous model-based analyses. Our work highlights a signal of gene flow overlooked to date and provides a method that complements existing approaches for investigating the demographic history of recently diverged populations.
Affiliation(s)
- Alexander Mackintosh
  - Department of Ecology and Genetics, Uppsala University, Norbyvägen 18D, Uppsala 75236, Sweden
  - Institute of Ecology and Evolution, Ashworth Laboratories, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK
- Derek Setter
  - Institute of Ecology and Evolution, Ashworth Laboratories, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK

6
Kent TV, Schrider DR, Matute DR. Demographic history and the efficacy of selection in the globally invasive mosquito Aedes aegypti. bioRxiv 2024:2024.03.07.584008. [PMID: 38559089 PMCID: PMC10979846 DOI: 10.1101/2024.03.07.584008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/04/2024]
Abstract
Aedes aegypti is the main vector species of yellow fever, dengue, Zika and chikungunya. The species is originally from Africa but has experienced a spectacular expansion in its geographic range to a large swath of the world, the demographic effects of which have remained largely understudied. In this report, we examine whole-genome sequences from 6 countries in Africa, North America, and South America to investigate the demographic history of the spread of Ae. aegypti into the Americas and its impact on genomic diversity. In the Americas, we observe patterns of strong population structure consistent with relatively low (but probably non-zero) levels of gene flow but occasional long-range dispersal and/or recolonization events. We also find evidence that the colonization of the Americas has resulted in introduction bottlenecks. However, while each sampling location shows evidence of a past population contraction and subsequent recovery, our results suggest that the bottlenecks in America have led to a reduction in genetic diversity of only ~35% relative to African populations, and the American samples have retained high levels of genetic diversity (expected heterozygosity of ~0.02 at synonymous sites) and have experienced only a minor reduction in the efficacy of selection. These results evoke the image of an invasive species that has expanded its range with remarkable genetic resilience in the face of strong eradication pressure.
Affiliation(s)
- Tyler V. Kent
  - Department of Biology, College of Arts and Sciences, University of North Carolina, Chapel Hill, NC, USA
  - Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC, USA
- Daniel R. Schrider
  - Department of Genetics, School of Medicine, University of North Carolina, Chapel Hill, NC, USA
- Daniel R. Matute
  - Department of Biology, College of Arts and Sciences, University of North Carolina, Chapel Hill, NC, USA

7
Westbury MV, Cabrera AA, Rey-Iglesia A, De Cahsan B, Duchêne DA, Hartmann S, Lorenzen ED. A genomic assessment of the marine-speciation paradox within the toothed whale superfamily Delphinoidea. Mol Ecol 2023; 32:4829-4843. [PMID: 37448145 DOI: 10.1111/mec.17069] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/27/2022] [Revised: 06/21/2023] [Accepted: 07/03/2023] [Indexed: 07/15/2023]
Abstract
The impact of post-divergence gene flow in speciation has been documented across a range of taxa in recent years, and may have been especially widespread in highly mobile, wide-ranging marine species, such as cetaceans. Here, we studied individual genomes from nine species across the three families of the toothed whale superfamily Delphinoidea (Delphinidae, Phocoenidae and Monodontidae). To investigate the role of post-divergence gene flow in the speciation process, we used a multifaceted approach, including (i) phylogenomics, (ii) the distribution of shared derived alleles and (iii) demographic inference. We found the divergence of lineages within Delphinoidea did not follow a process of pure bifurcation, but was much more complex. Sliding-window phylogenomics reveal a high prevalence of discordant topologies within the superfamily, with further analyses indicating these discordances arose due to both incomplete lineage sorting and gene flow. D-statistics and f-branch analyses supported gene flow between members of Delphinoidea, with the vast majority of gene flow occurring as ancient interfamilial events. Demographic analyses provided evidence that introgressive gene flow has likely ceased between all species pairs tested, despite reports of contemporary interspecific hybrids. Our study provides the first steps towards resolving the large complexity of speciation within Delphinoidea; we reveal the prevalence of ancient interfamilial gene flow events prior to the diversification of each family, and suggest that contemporary hybridisation events may be disadvantageous, as hybrid individuals do not appear to contribute to the parental species' gene pools.
Affiliation(s)
- Binia De Cahsan
  - Globe Institute, University of Copenhagen, Copenhagen, Denmark
- David A Duchêne
  - Globe Institute, University of Copenhagen, Copenhagen, Denmark
- Stefanie Hartmann
  - Institute of Biochemistry and Biology, University of Potsdam, Potsdam, Germany

8
Sianta SA, Kay KM. Phylogenomic analysis does not support a classic but controversial hypothesis of progenitor-derivative origins for the serpentine endemic Clarkia franciscana. Evolution 2022; 76:1246-1259. [PMID: 35403214 PMCID: PMC9322428 DOI: 10.1111/evo.14484] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/04/2021] [Revised: 02/25/2022] [Accepted: 03/04/2022] [Indexed: 01/21/2023]
Abstract
Budding speciation involves isolation of marginal populations at the periphery of a species range and is thought to be a prominent mode of speciation in organisms with low dispersal and/or strong local adaptation among populations. Budding speciation is typically evidenced by abutting, asymmetric ranges of ecologically divergent sister species and low genetic diversity in putative budded species. Yet these indirect patterns may be unreliable, instead caused by postspeciation processes such as range or demographic shifts. Nested phylogenetic relationships provide the most conclusive evidence of budding speciation. A putative case of budding speciation in the serpentine endemic Clarkia franciscana and two closely related widespread congeners was studied by Harlan Lewis, Peter Raven, Leslie Gottlieb, and others over a 20-year period, yet the origin of C. franciscana remains controversial. Here, we reinvestigate this system with phylogenomic analyses to determine whether C. franciscana is a recently derived budded species, phylogenetically nested within one of the other two putative progenitor species. In contrast to the hypothesized pattern of relatedness among the three Clarkia species, we find no evidence for recent budding speciation. Instead, the data suggest the three species diverged simultaneously. We urge caution in using contemporary range patterns to infer geographic modes of speciation.
Affiliation(s)
- Shelley A. Sianta
  - Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, California 95060
  - Current address: Department of Plant and Microbial Biology, University of Minnesota, St. Paul, Minnesota 55108
- Kathleen M. Kay
  - Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, California 95060

9
Hancock ZB, Lehmberg ES, Blackmon H. Phylogenetics in Space: How Continuous Spatial Structure Impacts Tree Inference. Mol Phylogenet Evol 2022; 173:107505. [PMID: 35577296 DOI: 10.1016/j.ympev.2022.107505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2021] [Revised: 04/08/2022] [Accepted: 05/06/2022] [Indexed: 11/26/2022]
Abstract
The tendency to discretize biology permeates taxonomy and systematics, leading to models that simplify the often continuous nature of populations. Even when the assumption of panmixia is relaxed, most models still assume some degree of discrete structure. The multispecies coalescent has emerged as a powerful model in phylogenetics, but in its common implementation is entirely space-independent - what we call the "missing z-axis". In this article, we review the many lines of evidence for how continuous spatial structure can impact phylogenetic inference. We illustrate and expand on these by using complex continuous-space demographic models that include distinct modes of speciation. We find that the impact of spatial structure permeates all aspects of phylogenetic inference, including gene tree stoichiometry, topological and branch-length variance, network estimation, and species delimitation. We conclude by utilizing our results to suggest how researchers can identify spatial structure in phylogenetic datasets.
10
Hibbins MS, Hahn MW. Phylogenomic approaches to detecting and characterizing introgression. Genetics 2022; 220:iyab173. [PMID: 34788444 PMCID: PMC9208645 DOI: 10.1093/genetics/iyab173] [Citation(s) in RCA: 78] [Impact Index Per Article: 26.0] [Received: 07/16/2021] [Accepted: 10/02/2021] [Indexed: 12/26/2022] Open
Abstract
Phylogenomics has revealed the remarkable frequency with which introgression occurs across the tree of life. These discoveries have been enabled by the rapid growth of methods designed to detect and characterize introgression from whole-genome sequencing data. A large class of phylogenomic methods makes use of data across species to infer and characterize introgression based on expectations from the multispecies coalescent. These methods range from simple tests, such as the D-statistic, to model-based approaches for inferring phylogenetic networks. Here, we provide a detailed overview of the various signals that different modes of introgression are expected to leave in the genome, and how current methods are designed to detect them. We discuss the strengths and pitfalls of these approaches and identify areas for future development, highlighting the different signals of introgression, and the power of each method to detect them. We conclude with a discussion of current challenges in inferring introgression and how they could potentially be addressed.
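The D-statistic named in this abstract is, in its simplest form, the normalized difference between counts of ABBA and BABA site patterns for a four-taxon tree ((P1,P2),P3),O, where A is the ancestral allele and B the derived one. The sketch below is a minimal illustration under that definition, assuming one haploid sequence per taxon and sites already encoded as four-letter pattern strings; the function name and toy counts are hypothetical, and no significance test (such as a block jackknife) is included.

```python
def patterson_d(site_patterns):
    """Patterson's D from site patterns ordered (P1, P2, P3, outgroup).

    site_patterns: iterable of strings such as "ABBA", "BABA", "BBAA".
    Only ABBA and BABA sites are informative; all others are ignored.
    D near 0 is expected under incomplete lineage sorting alone, while
    an excess of either pattern suggests introgression involving P3.
    """
    abba = 0
    baba = 0
    for pattern in site_patterns:
        if pattern == "ABBA":
            abba += 1
        elif pattern == "BABA":
            baba += 1
    if abba + baba == 0:
        raise ValueError("no ABBA or BABA sites; D is undefined")
    return (abba - baba) / (abba + baba)


# Toy counts, invented for illustration: 30 ABBA, 10 BABA, 60 BBAA.
toy_sites = ["ABBA"] * 30 + ["BABA"] * 10 + ["BBAA"] * 60
toy_d = patterson_d(toy_sites)
```

Here the excess of ABBA over BABA gives a positive D, the direction conventionally read as allele sharing between P2 and P3.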
Affiliation(s)
- Mark S Hibbins
  - Department of Biology, Indiana University, Bloomington, IN 47405, USA
- Matthew W Hahn
  - Department of Biology, Indiana University, Bloomington, IN 47405, USA
  - Department of Computer Science, Indiana University, Bloomington, IN 47405, USA

11
Jiao X, Flouri T, Yang Z. Multispecies coalescent and its applications to infer species phylogenies and cross-species gene flow. Natl Sci Rev 2022; 8:nwab127. [PMID: 34987842 PMCID: PMC8692950 DOI: 10.1093/nsr/nwab127] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 06/02/2021] [Revised: 07/10/2021] [Accepted: 07/11/2021] [Indexed: 02/06/2023] Open
Abstract
Multispecies coalescent (MSC) is the extension of the single-population coalescent model to multiple species. It integrates the phylogenetic process of species divergences and the population genetic process of coalescent, and provides a powerful framework for a number of inference problems using genomic sequence data from multiple species, including estimation of species divergence times and population sizes, estimation of species trees accommodating discordant gene trees, inference of cross-species gene flow and species delimitation. In this review, we introduce the major features of the MSC model, discuss full-likelihood and heuristic methods of species tree estimation and summarize recent methodological advances in inference of cross-species gene flow. We discuss the statistical and computational challenges in the field and research directions where breakthroughs may be likely in the next few years.
Affiliation(s)
- Xiyun Jiao
  - Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Tomáš Flouri
  - Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK
- Ziheng Yang
  - Department of Genetics, Evolution and Environment, University College London, London WC1E 6BT, UK

12
Singhal S, Derryberry GE, Bravo GA, Derryberry EP, Brumfield RT, Harvey MG. The dynamics of introgression across an avian radiation. Evol Lett 2021; 5:568-581. [PMID: 34917397 PMCID: PMC8645201 DOI: 10.1002/evl3.256] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 04/06/2021] [Revised: 07/11/2021] [Accepted: 08/31/2021] [Indexed: 01/20/2023] Open
Abstract
Hybridization and resulting introgression can play both a destructive and a creative role in the evolution of diversity. Thus, characterizing when and where introgression is most likely to occur can help us understand the causes of diversification dynamics. Here, we examine the prevalence of and variation in introgression using phylogenomic data from a large (1300+ species), geographically widespread avian group, the suboscine birds. We first examine patterns of gene tree discordance across the geographic distribution of the entire clade. We then evaluate the signal of introgression in a subset of 206 species triads using Patterson's D‐statistic and test for associations between introgression signal and evolutionary, geographic, and environmental variables. We find that gene tree discordance varies across lineages and geographic regions. The signal of introgression is highest in cases where species occur in close geographic proximity and in regions with more dynamic climates since the Pleistocene. Our results highlight the potential of phylogenomic datasets for examining broad patterns of hybridization and suggest that the degree of introgression between diverging lineages might be predictable based on the setting in which they occur.
Affiliation(s)
- Sonal Singhal
  - Department of Biology, California State University, Dominguez Hills, Carson, California 90747
- Graham E Derryberry
  - Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, Tennessee 37996
- Gustavo A Bravo
  - Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts 02138
  - Museum of Comparative Zoology, Harvard University, Cambridge, Massachusetts 02138
- Elizabeth P Derryberry
  - Department of Ecology and Evolutionary Biology, University of Tennessee, Knoxville, Tennessee 37996
- Robb T Brumfield
  - Museum of Natural Science, Louisiana State University, Baton Rouge, Louisiana 70803
  - Department of Biological Sciences, Louisiana State University, Baton Rouge, Louisiana 70803
- Michael G Harvey
  - Department of Biological Sciences, The University of Texas at El Paso, El Paso, Texas 79968
  - Biodiversity Collections, The University of Texas at El Paso, El Paso, Texas 79968

13
Korunes KL, Machado CA, Noor MAF. Inversions shape the divergence of Drosophila pseudoobscura and Drosophila persimilis on multiple timescales. Evolution 2021; 75:1820-1834. [PMID: 34041743 DOI: 10.1111/evo.14278] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Received: 03/20/2020] [Revised: 05/03/2021] [Accepted: 05/17/2021] [Indexed: 02/02/2023]
Abstract
By shaping meiotic recombination, chromosomal inversions can influence genetic exchange between hybridizing species. Despite the recognized importance of inversions in evolutionary processes such as divergence and speciation, teasing apart the effects of inversions over time remains challenging. For example, are their effects on sequence divergence primarily generated through creating blocks of linkage disequilibrium prespeciation or through preventing gene flux after speciation? We provide a comprehensive look into the influence of inversions on gene flow throughout the evolutionary history of a classic system: Drosophila pseudoobscura and Drosophila persimilis. We use extensive whole-genome sequence data to report patterns of introgression and divergence with respect to chromosomal arrangements. Overall, we find evidence that inversions have contributed to divergence patterns between D. pseudoobscura and D. persimilis over three distinct timescales: (1) segregation of ancestral polymorphism early in the speciation process, (2) gene flow after the split of D. pseudoobscura and D. persimilis, but prior to the split of D. pseudoobscura subspecies, and (3) recent gene flow between sympatric D. pseudoobscura and D. persimilis, after the split of D. pseudoobscura subspecies. We discuss these results in terms of our understanding of evolution in this classic system and provide cautions for interpreting divergence measures in other systems.
Affiliation(s)
- Katharine L Korunes
  - Department of Evolutionary Anthropology, Duke University, Durham, North Carolina, 27708
- Carlos A Machado
  - Department of Biology, University of Maryland, College Park, Maryland, 20742
- Mohamed A F Noor
  - Department of Biology, Duke University, Durham, North Carolina, 27708

14
Alanzi AAR, Degnan JH. Statistical inconsistency of the unrooted minimize deep coalescence criterion. PLoS One 2021; 16:e0251107. [PMID: 33970931 PMCID: PMC8109837 DOI: 10.1371/journal.pone.0251107] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 09/20/2020] [Accepted: 04/20/2021] [Indexed: 11/24/2022] Open
Abstract
Species trees, which describe the evolutionary relationships between species, are often inferred from gene trees, which describe the ancestral relationships between sequences sampled at different loci from the species of interest. A common approach to inferring species trees from gene trees is motivated by supposing that gene tree variation is due to incomplete lineage sorting, also known as deep coalescence. One of the earliest methods motivated by deep coalescence is to find the species tree that minimizes the number of deep coalescent events needed to explain discrepancies between the species tree and input gene trees. This minimize deep coalescence (MDC) criterion can be applied in both rooted and unrooted settings, where either rooted or unrooted gene trees can be used to infer a rooted species tree. Previous work has shown that MDC is statistically inconsistent in the rooted setting, meaning that under a probabilistic model for deep coalescence, the multispecies coalescent, for some species trees, increasing the number of input gene trees does not make the method more likely to return a correct species tree. Here, we obtain analogous results in the unrooted setting, showing conditions leading to inconsistency of the MDC criterion using the multispecies coalescent model with unrooted gene trees for four taxa and five taxa.
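As a minimal sketch of the criterion described above, the following Python code counts deep coalescences (extra lineages) for a rooted gene tree embedded in a rooted species tree, assuming one sampled sequence per species, and then picks the candidate species tree with the lowest total cost. It covers only the rooted case, not the unrooted setting the paper analyzes; the nested-tuple tree encoding and all function names are assumptions made for illustration.

```python
def leaves(tree):
    """Return the frozenset of leaf labels under a nested-tuple tree.

    A leaf is a string; an internal node is a tuple of child subtrees.
    """
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset().union(*(leaves(child) for child in tree))


def lineages_within(gene_tree, cluster):
    """Count maximal gene-tree clades whose leaves all lie inside `cluster`.

    This is the number of gene lineages that leave the top of the
    species-tree branch subtending `cluster`.
    """
    if leaves(gene_tree) <= cluster:
        return 1
    if isinstance(gene_tree, str):
        return 0
    return sum(lineages_within(child, cluster) for child in gene_tree)


def deep_coalescence_cost(species_tree, gene_tree):
    """Sum of extra lineages (lineages minus one) over non-root branches."""
    def visit(node, is_root):
        cost = 0
        if not is_root:
            cost = lineages_within(gene_tree, leaves(node)) - 1
        if not isinstance(node, str):
            cost += sum(visit(child, False) for child in node)
        return cost

    return visit(species_tree, True)


def mdc_species_tree(candidates, gene_trees):
    """Return the candidate species tree with the smallest total cost."""
    return min(
        candidates,
        key=lambda s: sum(deep_coalescence_cost(s, g) for g in gene_trees),
    )
```

For example, with species tree ((A,B),C), the gene tree ((A,C),B) costs one extra lineage, while the matching gene tree costs none.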
Affiliation(s)
- Ayed A. R. Alanzi
  - Mathematics Department, College of Science and Human Studies of Hotat Sudair, Majmaah University, Majmaah, Saudi Arabia
- James H. Degnan
  - Department of Mathematics and Statistics, University of New Mexico, Albuquerque, NM, United States of America
|
15
|
Moodley Y, Westbury MV, Russo IRM, Gopalakrishnan S, Rakotoarivelo A, Olsen RA, Prost S, Tunstall T, Ryder OA, Dalén L, Bruford MW. Interspecific Gene Flow and the Evolution of Specialization in Black and White Rhinoceros. Mol Biol Evol 2021; 37:3105-3117. [PMID: 32585004 DOI: 10.1093/molbev/msaa148] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Indexed: 12/16/2022] Open
Abstract
Africa's black (Diceros bicornis) and white (Ceratotherium simum) rhinoceros are closely related sister-taxa that evolved highly divergent obligate browsing and grazing feeding strategies. Although their precursor species Diceros praecox and Ceratotherium mauritanicum appear in the fossil record ∼5.2 Ma, by 4 Ma both were still mixed feeders, and were even spatiotemporally sympatric at several Pliocene sites in what is today Africa's Rift Valley. Here, we ask whether or not D. praecox and C. mauritanicum were reproductively isolated when they came into Pliocene secondary contact. We sequenced and de novo assembled the first annotated black rhinoceros reference genome and compared it with available genomes of other black and white rhinoceros. We show that ancestral gene flow between D. praecox and C. mauritanicum ceased sometime between 3.3 and 4.1 Ma, despite conventional methods for the detection of gene flow from whole genome data returning false positive signatures of recent interspecific migration due to incomplete lineage sorting. We propose that ongoing Pliocene genetic exchange, for up to 2 My after initial divergence, could have potentially hindered the development of obligate feeding strategies until both species were fully reproductively isolated, but that the more severe and shifting paleoclimate of the early Pleistocene was likely the ultimate driver of ecological specialization in African rhinoceros.
Affiliation(s)
- Yoshan Moodley
  - Department of Zoology, University of Venda, Thohoyandou, Republic of South Africa
- Michael V Westbury
  - Section for Evolutionary Genomics, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- Isa-Rita M Russo
  - School of Biosciences, Cardiff University, Cardiff, United Kingdom
- Shyam Gopalakrishnan
  - Section for Evolutionary Genomics, GLOBE Institute, University of Copenhagen, Copenhagen, Denmark
- Andrinajoro Rakotoarivelo
  - Department of Zoology, University of Venda, Thohoyandou, Republic of South Africa
  - Natiora Ahy Madagasikara, Ampahibe, Antananarivo, Madagascar
- Remi-Andre Olsen
  - Science for Life Laboratory, Department of Biochemistry and Biophysics, Stockholm University, Solna, Sweden
- Stefan Prost
  - LOEWE-Centre for Translational Biodiversity Genomics, Senckenberg Museum, Frankfurt, Germany
  - South African National Biodiversity Institute, National Zoological Gardens, Pretoria, Republic of South Africa
- Tate Tunstall
  - San Diego Zoo Institute for Conservation Research, San Diego Zoo Global, Escondido, CA
- Oliver A Ryder
  - San Diego Zoo Institute for Conservation Research, San Diego Zoo Global, Escondido, CA
- Love Dalén
  - Centre for Palaeogenetics, Stockholm, Sweden
  - Department of Bioinformatics and Genetics, Swedish Museum of Natural History, Stockholm, Sweden
- Michael W Bruford
  - School of Biosciences, Cardiff University, Cardiff, United Kingdom
  - Sustainable Places Research Institute, Cardiff University, Cardiff, United Kingdom
|
16
|
Koch H, DeGiorgio M. Maximum Likelihood Estimation of Species Trees from Gene Trees in the Presence of Ancestral Population Structure. Genome Biol Evol 2020; 12:3977-3995. [PMID: 32022857 PMCID: PMC7061232 DOI: 10.1093/gbe/evaa022] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Accepted: 01/23/2020] [Indexed: 11/12/2022] Open
Abstract
Though large multilocus genomic data sets have led to overall improvements in phylogenetic inference, they have posed the new challenge of addressing conflicting signals across the genome. In particular, ancestral population structure, which has been uncovered in a number of diverse species, can skew gene tree frequencies, thereby hindering the performance of species tree estimators. Here we develop a novel maximum likelihood method, termed TASTI (Taxa with Ancestral structure Species Tree Inference), that can infer phylogenies under such scenarios, and find that it has increasing accuracy with increasing numbers of input gene trees, contrasting with the relatively poor performances of methods not tailored for ancestral structure. Moreover, we propose a supertree approach that allows TASTI to scale computationally with increasing numbers of input taxa. We use genetic simulations to assess TASTI's performance in the three- and four-taxon settings and demonstrate the application of TASTI on a six-species Afrotropical mosquito data set. Finally, we have implemented TASTI in an open-source software package for ease of use by the scientific community.
Affiliation(s)
- Hillary Koch
  - Department of Statistics, Pennsylvania State University
- Michael DeGiorgio
  - Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University
|
17
|
Primate phylogenomics uncovers multiple rapid radiations and ancient interspecific introgression. PLoS Biol 2020; 18:e3000954. [PMID: 33270638 PMCID: PMC7738166 DOI: 10.1371/journal.pbio.3000954] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Received: 04/15/2020] [Revised: 12/15/2020] [Accepted: 11/02/2020] [Indexed: 12/17/2022] Open
Abstract
Our understanding of the evolutionary history of primates is undergoing continual revision due to ongoing genome sequencing efforts. Bolstered by growing fossil evidence, these data have led to increased acceptance of once controversial hypotheses regarding phylogenetic relationships, hybridization and introgression, and the biogeographical history of primate groups. Among these findings is a pattern of recent introgression between species within all major primate groups examined to date, though little is known about introgression deeper in time. To address this and other phylogenetic questions, here, we present new reference genome assemblies for 3 Old World monkey (OWM) species: Colobus angolensis ssp. palliatus (the black and white colobus), Macaca nemestrina (southern pig-tailed macaque), and Mandrillus leucophaeus (the drill). We combine these data with 23 additional primate genomes to estimate both the species tree and individual gene trees using thousands of loci. While our species tree is largely consistent with previous phylogenetic hypotheses, the gene trees reveal high levels of genealogical discordance associated with multiple primate radiations. We use strongly asymmetric patterns of gene tree discordance around specific branches to identify multiple instances of introgression between ancestral primate lineages. In addition, we exploit recent fossil evidence to perform fossil-calibrated molecular dating analyses across the tree. Taken together, our genome-wide data help to resolve multiple contentious sets of relationships among primates, while also providing insight into the biological processes and technical artifacts that led to the disagreements in the first place. Combining three newly sequenced primate genomes with other published genomes, this study adapts a little-known method for detecting ancient introgression to genome-scale data, revealing multiple previously unknown examples of hybridization between primate species.
|
18
|
Cai L, Xi Z, Lemmon EM, Lemmon AR, Mast A, Buddenhagen CE, Liu L, Davis CC. The Perfect Storm: Gene Tree Estimation Error, Incomplete Lineage Sorting, and Ancient Gene Flow Explain the Most Recalcitrant Ancient Angiosperm Clade, Malpighiales. Syst Biol 2020; 70:491-507. [PMID: 33169797 DOI: 10.1093/sysbio/syaa083] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Received: 03/19/2019] [Revised: 10/20/2020] [Accepted: 10/28/2020] [Indexed: 12/20/2022] Open
Abstract
The genomic revolution offers renewed hope of resolving rapid radiations in the Tree of Life. The development of the multispecies coalescent model and improved gene tree estimation methods can better accommodate gene tree heterogeneity caused by incomplete lineage sorting (ILS) and gene tree estimation error stemming from the short internal branches. However, the relative influence of these factors in species tree inference is not well understood. Using anchored hybrid enrichment, we generated a data set including 423 single-copy loci from 64 taxa representing 39 families to infer the species tree of the flowering plant order Malpighiales. This order includes 9 of the top 10 most unstable nodes in angiosperms, which have been hypothesized to arise from the rapid radiation during the Cretaceous. Here, we show that coalescent-based methods do not resolve the backbone of Malpighiales and concatenation methods yield inconsistent estimations, providing evidence that gene tree heterogeneity is high in this clade. Despite high levels of ILS and gene tree estimation error, our simulations demonstrate that these two factors alone are insufficient to explain the lack of resolution in this order. To explore this further, we examined triplet frequencies among empirical gene trees and discovered some of them deviated significantly from those attributed to ILS and estimation error, suggesting gene flow as an additional and previously unappreciated phenomenon promoting gene tree variation in Malpighiales. Finally, we applied a novel method to quantify the relative contribution of these three primary sources of gene tree heterogeneity and demonstrated that ILS, gene tree estimation error, and gene flow contributed to 10.0%, 34.8%, and 21.4% of the variation, respectively. Together, our results suggest that a perfect storm of factors likely influence this lack of resolution, and further indicate that recalcitrant phylogenetic relationships like the backbone of Malpighiales may be better represented as phylogenetic networks. Thus, reducing such groups solely to existing models that adhere strictly to bifurcating trees greatly oversimplifies reality, and obscures our ability to more clearly discern the process of evolution. [Coalescent; concatenation; flanking region; hybrid enrichment; introgression; phylogenomics; rapid radiation; triplet frequency.]
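One simple way to ask whether triplet frequencies depart from the ILS expectation is to test whether the two minor (discordant) topologies are equally frequent, since ILS alone predicts equal counts. The sketch below is a generic one-degree-of-freedom chi-square test in Python, not the authors' procedure; it ignores gene tree estimation error, and the function name and inputs are hypothetical.

```python
import math


def minor_triplet_asymmetry(n_minor1, n_minor2):
    """Chi-square test (1 d.f.) that two discordant triplet counts are equal.

    Returns (chi2, p_value). Under ILS alone the two minor topologies are
    expected to be equally frequent, so a small p-value flags asymmetry.
    """
    total = n_minor1 + n_minor2
    if total == 0:
        raise ValueError("at least one discordant gene tree is required")
    # Expected count for each minor topology is total / 2, which simplifies
    # the usual chi-square statistic to (n1 - n2)^2 / (n1 + n2).
    chi2 = (n_minor1 - n_minor2) ** 2 / total
    # Survival function of a chi-square variable with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p_value
```

A significant result is compatible with gene flow but does not by itself rule out other causes of asymmetry, such as systematic estimation error.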
Affiliation(s)
- Liming Cai
  - Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
  - Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
- Zhenxiang Xi
  - Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
  - Key Laboratory of Bio-Resource and Eco-Environment of Ministry of Education, College of Life Sciences, Sichuan University, Chengdu 610065, China
- Emily Moriarty Lemmon
  - Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
- Alan R Lemmon
  - Department of Scientific Computing, Florida State University, Tallahassee, FL 32306, USA
- Austin Mast
  - Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
- Christopher E Buddenhagen
  - Department of Biological Sciences, Florida State University, Tallahassee, FL 32306, USA
  - AgResearch, 10 Bisley Road, Hamilton 3214, New Zealand
- Liang Liu
  - Department of Statistics and Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
- Charles C Davis
  - Department of Organismic and Evolutionary Biology, Harvard University Herbaria, Cambridge, MA 02138, USA
|
19
|
Rancilhac L, Irisarri I, Angelini C, Arntzen JW, Babik W, Bossuyt F, Künzel S, Lüddecke T, Pasmans F, Sanchez E, Weisrock D, Veith M, Wielstra B, Steinfartz S, Hofreiter M, Philippe H, Vences M. Phylotranscriptomic evidence for pervasive ancient hybridization among Old World salamanders. Mol Phylogenet Evol 2020; 155:106967. [PMID: 33031928 DOI: 10.1016/j.ympev.2020.106967] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Received: 12/17/2019] [Revised: 07/09/2020] [Accepted: 09/28/2020] [Indexed: 11/18/2022]
Abstract
Hybridization can leave genealogical signatures in an organism's genome, originating from the parental lineages and persisting over time. This potentially confounds phylogenetic inference methods that aim to represent evolution as a strictly bifurcating tree. We apply a phylotranscriptomic approach to study the evolutionary history of, and test for inter-lineage introgression in, the Salamandridae, a Holarctic salamander group of interest in studies of toxicity and aposematism, courtship behavior, and molecular evolution. Although the relationships between the 21 currently recognized salamandrid genera have been the subject of numerous molecular phylogenetic studies, some branches have remained controversial and sometimes affected by discordances between mitochondrial vs. nuclear trees. To resolve the phylogeny of this family, and understand the source of mito-nuclear discordance, we generated new transcriptomic (RNAseq) data for 20 salamandrids and used these along with published data, including 28 mitochondrial genomes, to obtain a comprehensive nuclear and mitochondrial perspective on salamandrid evolution. Our final phylotranscriptomic data set included 5455 gene alignments for 40 species representing 17 of the 21 salamandrid genera. Using concatenation and species-tree phylogenetic methods, we find (1) Salamandrina sister to the clade of the "True Salamanders" (consisting of Chioglossa, Mertensiella, Lyciasalamandra, and Salamandra), (2) Ichthyosaura sister to the Near Eastern genera Neurergus and Ommatotriton, (3) Triturus sister to Lissotriton, and (4) Cynops paraphyletic with respect to Paramesotriton and Pachytriton. Combining introgression tests and phylogenetic networks, we find evidence for introgression among taxa within the clades of "Modern Asian Newts" and "Modern European Newts". However, we could not unambiguously identify the number, position, and direction of introgressive events. Combining evidence from nuclear gene analysis with the observed mito-nuclear phylogenetic discordances, we hypothesize a scenario with hybridization and mitochondrial capture among ancestral lineages of (1) Lissotriton into Ichthyosaura and (2) Triturus into Calotriton, plus introgression of nuclear genes from Triturus into Lissotriton. Furthermore, both mitochondrial capture and nuclear introgression may have occurred among lineages assigned to Cynops. More comprehensive genomic data will, in the future, allow testing this against alternative scenarios involving hybridization with other, extinct lineages of newts.
Affiliation(s)
- Loïs Rancilhac
  - Zoological Institute, Technische Universität Braunschweig, Mendelssohnstr. 4, 38106 Braunschweig, Germany
- Iker Irisarri
  - Department of Biodiversity and Evolutionary Biology, Museo Nacional de Ciencias Naturales, José Gutiérrez Abascal 2, 28006 Madrid, Spain
- Jan W Arntzen
  - Naturalis Biodiversity Center, 2300 RA Leiden, the Netherlands
- Wiesław Babik
  - Institute of Environmental Sciences, Jagiellonian University, ul. Gronostajowa 7, 30-387 Kraków, Poland
- Franky Bossuyt
  - Amphibian Evolution Lab, Biology Department, Vrije Universiteit Brussel, Pleinlaan 2, B-1050 Brussels, Belgium
- Sven Künzel
  - Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany
- Tim Lüddecke
  - Department of Bioresources, Fraunhofer Institute for Molecular Biology and Applied Ecology, Winchesterstr. 2, 35394 Gießen, Germany
  - LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325 Frankfurt, Germany
- Frank Pasmans
  - Department of Pathology, Bacteriology and Avian Diseases, Faculty of Veterinary Medicine, Ghent University, 9820 Merelbeke, Belgium
- Eugenia Sanchez
  - Zoological Institute, Technische Universität Braunschweig, Mendelssohnstr. 4, 38106 Braunschweig, Germany
  - Department of Biology, Stanford University, Stanford, CA 94305, USA
- David Weisrock
  - Department of Biology, University of Kentucky, Lexington, KY 40506, USA
- Michael Veith
  - Biogeography Department, Trier University, 54286 Trier, Germany
- Ben Wielstra
  - Institute of Biology Leiden, Leiden University, 2300 RA Leiden, the Netherlands
- Sebastian Steinfartz
  - Institute of Biology, Molecular Evolution and Systematics of Animals, University of Leipzig, Talstrasse 33, 04103, Leipzig, Germany
- Michael Hofreiter
  - Faculty of Mathematics and Natural Sciences, Institute for Biochemistry and Biology, University of Potsdam, Karl-Liebknecht-Str. 24-25, 14476 Potsdam, Germany
- Hervé Philippe
  - Centre for Biodiversity Theory and Modelling, UMR CNRS 5321, Station of Theoretical and Experimental Ecology, 2 route du CNRS, 09200 Moulis, France
- Miguel Vences
  - Zoological Institute, Technische Universität Braunschweig, Mendelssohnstr. 4, 38106 Braunschweig, Germany
|
20
|
Jiang Y, Yuan Z, Hu H, Ye X, Zheng Z, Wei Y, Zheng YL, Wang YG, Liu C. Differentiating homoploid hybridization from ancestral subdivision in evaluating the origin of the D lineage in wheat. New Phytol 2020; 228:409-414. [PMID: 32255512 DOI: 10.1111/nph.16578] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Received: 07/07/2019] [Accepted: 03/19/2020] [Indexed: 06/11/2023]
Affiliation(s)
- Yunfeng Jiang
  - Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
- Zhongwei Yuan
  - Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
- Haiyan Hu
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
  - College of Life Science and Technology, Henan Institute of Science and Technology, Xinxiang, Henan, 453003, China
- Xueling Ye
  - Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
- Zhi Zheng
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
- Yuming Wei
  - Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
- You-Liang Zheng
  - Triticeae Research Institute, Sichuan Agricultural University, Wenjiang, Chengdu, 611130, China
- You-Gan Wang
  - Science and Engineering Facility, Queensland University of Technology, Brisbane, Qld, 4000, Australia
- Chunji Liu
  - CSIRO Agriculture and Food, St Lucia, Qld, 4067, Australia
|
21
|
Degnan JH. Meng and Kubatko (2009): Modeling hybridization with coalescence. Theor Popul Biol 2020; 133:36-37. [DOI: 10.1016/j.tpb.2019.07.008] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Received: 05/23/2019] [Revised: 07/06/2019] [Accepted: 07/08/2019] [Indexed: 11/16/2022]
|
22
|
Duranton M, Allal F, Valière S, Bouchez O, Bonhomme F, Gagnaire PA. The contribution of ancient admixture to reproductive isolation between European sea bass lineages. Evol Lett 2020; 4:226-242. [PMID: 32547783 PMCID: PMC7293100 DOI: 10.1002/evl3.169] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Received: 05/21/2019] [Revised: 01/02/2020] [Accepted: 03/05/2020] [Indexed: 12/20/2022] Open
Abstract
Understanding how new species arise through the progressive establishment of reproductive isolation (RI) barriers between diverging populations is a major goal in Evolutionary Biology. An important result of speciation genomics studies is that genomic regions involved in RI frequently harbor anciently diverged haplotypes that predate the reconstructed history of species divergence. The possible origins of these old alleles remain much debated, as they relate to contrasting mechanisms of speciation that are not yet fully understood. In the European sea bass (Dicentrarchus labrax), the genomic regions involved in RI between Atlantic and Mediterranean lineages are enriched for anciently diverged alleles of unknown origin. Here, we used haplotype-resolved whole-genome sequences to test whether divergent haplotypes could have originated from a closely related species, the spotted sea bass (Dicentrarchus punctatus). We found that an ancient admixture event between D. labrax and D. punctatus is responsible for the presence of shared derived alleles that segregate at low frequencies in both lineages of D. labrax. An exception to this was found within regions involved in RI between the two D. labrax lineages. In those regions, archaic tracts originating from D. punctatus locally reached high frequencies or even fixation in Atlantic genomes but were almost absent in the Mediterranean. We showed that the ancient admixture event most likely occurred between D. punctatus and the D. labrax Atlantic lineage, while Atlantic and Mediterranean D. labrax lineages were experiencing allopatric isolation. Our results suggest that local adaptive introgression and/or the resolution of genomic conflicts provoked by ancient admixture have probably contributed to the establishment of RI between the two D. labrax lineages.
Affiliation(s)
- Maud Duranton
  - ISEM Univ Montpellier, CNRS, EPHE, IRD Montpellier France
- François Allal
  - MARBEC Université de Montpellier, Ifremer-CNRS-IRD-UM Palavas-les-Flots 34250 France
- Sophie Valière
  - INRA, US 1426, GeT-PlaGe Genotoul Castanet-Tolosan 31326 France
- Olivier Bouchez
  - INRA, US 1426, GeT-PlaGe Genotoul Castanet-Tolosan 31326 France
|
23
|
Kim H, Yoshihara M, Suyama M. Comparative genomic analysis of inbred rat strains reveals the existence of ancestral polymorphisms. Mamm Genome 2020; 31:86-94. [PMID: 32166433 PMCID: PMC7200647 DOI: 10.1007/s00335-020-09831-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/13/2019] [Accepted: 03/02/2020] [Indexed: 11/25/2022]
Abstract
In an alignment of closely related genomic sequences, the existence of discordant mutation sites, which do not reflect the phylogenetic relationship of the genomes, is often observed. Although these discordant mutation sites are thought to have emerged by ancestral polymorphism or gene flow, their frequency and distribution in the genome have not yet been analyzed in detail. Using the genome sequences of all protein coding genes of 25 inbred rat strains, we analyzed the frequency and genome-wide distribution of the discordant mutation sites. From the comparison of different substrains, it was found that these loci are not substrain specific, but are common among different groups of substrains, suggesting that the discordant sites might have mainly emerged through ancestral polymorphism. It was also revealed that the discordant sites are not uniformly distributed along chromosomes, but are concentrated at certain genomic loci, such as RT1, major histocompatibility complex of rats, and olfactory receptors, indicating that genes known to be highly polymorphic tend to have more discordant sites. Our results also showed that loci with a high density of discordant sites are also rich in heterozygous variants, even though these are inbred strains.
Affiliation(s)
- Hyeonjeong Kim
  - Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka, 812-8582, Japan
- Minako Yoshihara
  - Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka, 812-8582, Japan
- Mikita Suyama
  - Division of Bioinformatics, Medical Institute of Bioregulation, Kyushu University, Maidashi 3-1-1, Higashi-ku, Fukuoka, 812-8582, Japan
|
24
|
van der Valk T, Gonda CM, Silegowa H, Almanza S, Sifuentes-Romero I, Hart TB, Hart JA, Detwiler KM, Guschanski K. The Genome of the Endangered Dryas Monkey Provides New Insights into the Evolutionary History of the Vervets. Mol Biol Evol 2020; 37:183-194. [PMID: 31529046 PMCID: PMC6984364 DOI: 10.1093/molbev/msz213] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Indexed: 12/21/2022] Open
Abstract
Genomic data can be a powerful tool for inferring ecology, behavior, and conservation needs of highly elusive species, particularly, when other sources of information are hard to come by. Here, we focus on the Dryas monkey (Cercopithecus dryas), an endangered primate endemic to the Congo Basin with cryptic behavior and possibly <250 remaining adult individuals. Using whole-genome sequencing data, we show that the Dryas monkey represents a sister lineage to the vervets (Chlorocebus sp.) and has diverged from them ∼1.4 Ma with additional bidirectional gene flow ∼750,000–∼500,000 years ago that has likely involved the crossing of the Congo River. Together with evidence of gene flow across the Congo River in bonobos and okapis, our results suggest that the fluvial topology of the Congo River might have been more dynamic than previously recognized. Despite the presence of several homozygous loss-of-function mutations in genes associated with sperm mobility and immunity, we find high genetic diversity and low levels of inbreeding and genetic load in the studied Dryas monkey individual. This suggests that the current population carries sufficient genetic variability for long-term survival and might be larger than currently recognized. We thus provide an example of how genomic data can directly improve our understanding of highly elusive species.
Affiliation(s)
- Tom van der Valk
  - Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Catalina M Gonda
  - Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
- Henri Silegowa
  - Frankfurt Zoological Society, TL2 Project, Kinshasa, Democratic Republic of the Congo
- Sandra Almanza
  - Department of Anthropology, Florida Atlantic University, Boca Raton, FL
- Terese B Hart
  - Frankfurt Zoological Society, TL2 Project, Kinshasa, Democratic Republic of the Congo
- John A Hart
  - Frankfurt Zoological Society, TL2 Project, Kinshasa, Democratic Republic of the Congo
- Kate M Detwiler
  - Department of Anthropology, Florida Atlantic University, Boca Raton, FL
  - Department of Biological Sciences, Florida Atlantic University, Boca Raton, FL
- Katerina Guschanski
  - Animal Ecology, Department of Ecology and Genetics, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
|
25
|
Abstract
As the number of available genome sequences from both closely related species and individuals within species has increased, theoretical and methodological convergences between the fields of phylogenomics and population genomics have emerged. Population genomics typically focuses on the analysis of variants, while phylogenomics relies heavily on genome alignments. However, genome alignments are playing an increasingly important role in studies at the population level. Multiple genome alignments of individuals are used when structural variation is of primary interest and when genome architecture permits de novo assembly of genome sequences. Here I describe MafFilter, a command-line-driven program for processing genome alignments in the Multiple Alignment Format (MAF). Using concrete examples based on publicly available datasets, I demonstrate how MafFilter can be used to develop efficient and reproducible pipelines with quality assurance for downstream analyses. I further show how MafFilter can be used to perform both basic and advanced population genomic analyses in order to infer the patterns of nucleotide diversity along genomes.
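For readers unfamiliar with the statistic, nucleotide diversity in one alignment block is the average number of pairwise differences per site. The Python sketch below illustrates that definition only; it is not MafFilter's implementation or interface, and the function name, the input format (a list of aligned strings), and the choice to skip columns containing gaps or ambiguous bases are assumptions.

```python
from itertools import combinations


def nucleotide_diversity(sequences):
    """Average pairwise differences per site for one alignment block.

    `sequences` is a list of equal-length aligned strings. Columns that
    contain anything other than A, C, G, or T are skipped (an assumption).
    """
    if len(sequences) < 2:
        raise ValueError("need at least two sequences")
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("sequences must be aligned to the same length")

    valid = set("ACGT")
    columns = [
        col for col in zip(*(seq.upper() for seq in sequences))
        if set(col) <= valid
    ]
    if not columns:
        raise ValueError("no fully resolved alignment columns")

    pairs = list(combinations(range(len(sequences)), 2))
    # Count mismatches over every pair of sequences and every kept column.
    differences = sum(col[i] != col[j] for col in columns for i, j in pairs)
    return differences / (len(pairs) * len(columns))
```

Applying the function to consecutive blocks or fixed-size windows would give a profile of diversity along a genome, which is the kind of summary the abstract refers to.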
|
26
|
Springer MS, Molloy EK, Sloan DB, Simmons MP, Gatesy J. ILS-Aware Analysis of Low-Homoplasy Retroelement Insertions: Inference of Species Trees and Introgression Using Quartets. J Hered 2019; 111:147-168. [DOI: 10.1093/jhered/esz076] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Received: 07/13/2019] [Accepted: 12/12/2019] [Indexed: 12/20/2022] Open
Abstract
DNA sequence alignments have provided the majority of data for inferring phylogenetic relationships with both concatenation and coalescent methods. However, DNA sequences are susceptible to extensive homoplasy, especially for deep divergences in the Tree of Life. Retroelement insertions have emerged as a powerful alternative to sequences for deciphering evolutionary relationships because these data are nearly homoplasy-free. In addition, retroelement insertions satisfy the “no intralocus-recombination” assumption of summary coalescent methods because they are singular events and better approximate neutrality relative to DNA loci commonly sampled in phylogenomic studies. Retroelements have traditionally been analyzed with parsimony, distance, and network methods. Here, we analyze retroelement data sets for vertebrate clades (Placentalia, Laurasiatheria, Balaenopteroidea, Palaeognathae) with 2 ILS-aware methods that operate by extracting, weighting, and then assembling unrooted quartets into a species tree. The first approach constructs a species tree from retroelement bipartitions with ASTRAL, and the second method is based on split-decomposition with parsimony. We also develop a Quartet-Asymmetry test to detect hybridization using retroelements. Both ILS-aware methods recovered the same species-tree topology for each data set. The ASTRAL species trees for Laurasiatheria have consecutive short branch lengths in the anomaly zone whereas Palaeognathae is outside of this zone. For the Balaenopteroidea data set, which includes rorquals (Balaenopteridae) and gray whale (Eschrichtiidae), both ILS-aware methods resolved balaenopterids as paraphyletic. Application of the Quartet-Asymmetry test to this data set detected 19 different quartets of species for which historical introgression may be inferred. Evidence for introgression was not detected in the other data sets.
Affiliation(s)
- Mark S Springer
  - Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA
- Erin K Molloy
  - Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL
- Daniel B Sloan
  - Department of Biology, Colorado State University, Fort Collins, CO
- Mark P Simmons
  - Department of Biology, Colorado State University, Fort Collins, CO
- John Gatesy
  - Division of Vertebrate Zoology and Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, NY
|
27
|
He C, Liang D, Zhang P. Asymmetric Distribution of Gene Trees Can Arise under Purifying Selection If Differences in Population Size Exist. Mol Biol Evol 2019; 37:881-892. [DOI: 10.1093/molbev/msz232] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Indexed: 12/16/2022] Open
Abstract
Incomplete lineage sorting (ILS) is an important factor that causes gene tree discordance. For gene trees of three species, under neutrality, random mating, and the absence of interspecific gene flow, ILS creates a symmetric distribution of gene trees: the gene tree that accords with the species tree has the highest frequency, and the two discordant trees are equally frequent. If the neutral condition is violated, the impact of ILS may change, altering the gene tree distribution. Here, we show that under purifying selection, even assuming that the fitness effect of mutations is constant throughout the species tree, if differences in population size exist among species, asymmetric distributions of gene trees will arise, which is different from the expectation under neutrality. In extremes, one of the discordant trees rather than the concordant tree becomes the most frequent gene tree. In addition, we found that in a real case, the position of Scandentia relative to Primate and Glires, the symmetry in the gene tree distribution can be influenced by the strength of purifying selection. In current phylogenetic inference, the impact of purifying selection on the gene tree distribution is rarely considered by researchers. This study highlights the necessity of considering this impact.
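The symmetric neutral expectation described in this abstract can be illustrated with the standard three-taxon multispecies coalescent result, which the abstract itself does not state and which is added here only as a sketch. It assumes one sampled lineage per species, neutrality, and no gene flow; the function name and the parameter `t` (length of the internal species-tree branch in coalescent units) are hypothetical names chosen for this example.

```python
import math


def neutral_triplet_probs(t):
    """Gene tree topology probabilities for three species under neutral ILS.

    t is the internal branch length of the species tree in coalescent
    units (an assumed parameterization, not taken from the abstract).
    Returns (concordant, discordant_1, discordant_2).
    """
    if t < 0:
        raise ValueError("branch length must be non-negative")
    # With probability exp(-t) the two sister lineages fail to coalesce
    # on the internal branch; the three lineages then coalesce at random
    # in the ancestor, so each topology gets one third of that mass.
    p_discordant = math.exp(-t) / 3.0
    p_concordant = 1.0 - 2.0 * p_discordant
    return p_concordant, p_discordant, p_discordant


if __name__ == "__main__":
    for t in (0.1, 0.5, 1.0, 2.0):
        conc, disc1, disc2 = neutral_triplet_probs(t)
        print(f"t={t}: concordant={conc:.3f}, discordant={disc1:.3f}, {disc2:.3f}")
```

Under these assumptions the two discordant topologies always have equal probability, so an observed excess of one discordant tree is a departure from this neutral baseline. The sketch does not model the purifying selection or population size differences studied in the paper.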
Affiliation(s)
- Chong He
  - State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Dan Liang
  - State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
- Peng Zhang
  - State Key Laboratory of Biocontrol, College of Ecology and Evolution, School of Life Sciences, Sun Yat-Sen University, Guangzhou, China
|
28
|
Abstract
Many methods exist for detecting introgression between nonsister species, but the most commonly used require either a single sequence from four or more taxa or multiple sequences from each of three taxa. Here, we present a test for introgression that uses only a single sequence from three taxa. This test, denoted D3, uses similar logic as the standard D-test for introgression, but by using pairwise distances instead of site patterns it is able to detect the same signal of introgression with fewer species. We use simulations to show that D3 has statistical power almost equal to D, demonstrating its use on a data set of wild bananas (Musa). The new test is easy to apply and easy to interpret, and should find wide use among currently available data sets.
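As a rough illustration of a test built on pairwise distances, the sketch below computes a D3-style statistic from one aligned sequence per taxon. The abstract does not give the formula; the form used here, (d_BC - d_AC) / (d_BC + d_AC) with A and B as the sister pair and C as the third taxon, is an assumption about how the statistic is defined, and the function names and the use of uncorrected p-distances are likewise choices made for this example.

```python
def p_distance(seq1, seq2):
    """Proportion of differing sites between two aligned sequences.

    Sites where either sequence has a gap or ambiguous base are skipped.
    """
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differences = 0
    for a, b in zip(seq1.upper(), seq2.upper()):
        if a in "ACGT" and b in "ACGT":
            compared += 1
            if a != b:
                differences += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differences / compared


def d3_statistic(seq_a, seq_b, seq_c):
    """D3-style statistic from pairwise distances (assumed form).

    A and B are taken to be sister taxa and C the third taxon. Values
    near zero are expected without introgression; a negative value means
    A is closer to C than B is, a positive value the reverse.
    """
    d_ac = p_distance(seq_a, seq_c)
    d_bc = p_distance(seq_b, seq_c)
    if d_ac + d_bc == 0:
        raise ValueError("both distances to C are zero")
    return (d_bc - d_ac) / (d_bc + d_ac)


if __name__ == "__main__":
    # Toy sequences for illustration only, not real data.
    a = "AAAAAAAAAA"
    b = "AAAAAAAATT"
    c = "AAAAAATTTT"
    print(round(d3_statistic(a, b, c), 3))
```

A single value of the statistic says nothing about significance on its own; in practice a resampling scheme over genomic windows would be needed to judge whether it differs from zero, and that step is omitted here.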
Affiliation(s)
- Matthew W Hahn
  - Department of Biology, Indiana University, Bloomington, IN
  - Department of Computer Science, Indiana University, Bloomington, IN
- Mark S Hibbins
  - Department of Biology, Indiana University, Bloomington, IN
|
29
|
Huynh S, Marcussen T, Felber F, Parisod C. Hybridization preceded radiation in diploid wheats. Mol Phylogenet Evol 2019; 139:106554. [PMID: 31288105 DOI: 10.1016/j.ympev.2019.106554] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 04/10/2019] [Revised: 07/03/2019] [Accepted: 07/03/2019] [Indexed: 01/06/2023]
Abstract
Evolutionary relationships among the Aegilops-Triticum relatives of cultivated wheats have been difficult to resolve owing to incomplete lineage sorting and reticulate evolution. Recent studies have suggested that the wheat D-genome lineage (progenitor of Ae. tauschii) originated through homoploid hybridization between the A-genome lineage (progenitor of Triticum s.str.) and the B-genome lineage (progenitor of Ae. speltoides). This scenario of reticulation has been debated, calling for adequate phylogenetic analyses based on comprehensive sampling. To reconstruct the evolution of Aegilops-Triticum diploids, we here combined high-throughput sequencing of 38 nuclear low-copy loci of multiple accessions of all 13 species with inferences of the species phylogeny using the full-parameterized MCMC_SEQ method. Phylogenies recovered a monophyletic Aegilops-Triticum lineage that began diversifying ~6.6 Ma ago and gave rise to four sublineages, i.e. the A- (2 species), B- (1 species), D- (9 species) and T- (Ae. mutica) genome lineage. Full-parameterized phylogenies as well as patterns of tree dilation and tree compression supported a hybrid origin of the D-genome lineage from A and B ~3.0-4.0 Ma ago, and did not indicate additional hybridization events. Conflicting ABBA-BABA tests suggestive of further reticulation were shown here to result from ancestral population structure rather than hybridization. This comprehensive and dated phylogeny of wheat relatives indicates that the origin of the hybrid D-genome was followed by intense diversification into the majority of extant diploid as well as allopolyploid wild wheats.
Affiliation(s)
- Stella Huynh
  - Institute of Biology, University of Neuchâtel, Switzerland
- Thomas Marcussen
  - Centre for Ecological and Evolutionary Synthesis, University of Oslo, Norway
- François Felber
  - Institute of Biology, University of Neuchâtel, Switzerland; Musée et Jardins botaniques cantonaux de Lausanne et Pont-de-Nant, Switzerland
|
30
|
Adams RH, Schield DR, Castoe TA. Recent Advances in the Inference of Gene Flow from Population Genomic Data. Curr Mol Biol Rep 2019. [DOI: 10.1007/s40610-019-00120-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Indexed: 12/21/2022]
|
31
|
Dutheil JY, Hobolth A. Ancestral Population Genomics. Methods Mol Biol 2019; 1910:555-589. [PMID: 31278677 DOI: 10.1007/978-1-4939-9074-0_18] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/09/2023]
Abstract
Borrowing both from population genetics and phylogenetics, the field of population genomics emerged as full genomes of several closely related species became available. Provided we can properly model sequence evolution within populations undergoing speciation events, this resource enables us to estimate key population genetics parameters such as ancestral population sizes and split times. Furthermore, we can enhance our understanding of the recombination process and investigate various selective forces. With the advent of resequencing technologies, genome-wide patterns of diversity in extant populations have now come to complement this picture, offering increasing power to study more recent genetic history. We discuss the basic models of genomes in populations, including speciation models for closely related species. A major point in our discussion is that even a few complete genomes contain a great deal of information about the whole population, because recombination unlinks genomic regions, and therefore a few genomes contain many segments with distinct histories. The challenge of population genomics is to decode this mosaic of histories in order to infer scenarios of demography and selection. We survey modeling strategies for understanding genetic variation in ancestral populations and species. The underlying models build on the coalescent with recombination process and introduce further assumptions to scale the analyses to genomic data sets.
Affiliation(s)
- Julien Y Dutheil
  - Department of Evolutionary Genetics, Max Planck Institute of Evolutionary Biology, Plön, Germany
- Asger Hobolth
  - Bioinformatics Research Center (BiRC), Aarhus University, Aarhus, Denmark
|
32
|
Degnan JH. Modeling Hybridization Under the Network Multispecies Coalescent. Syst Biol 2018; 67:786-799. [PMID: 29846734 PMCID: PMC6101600 DOI: 10.1093/sysbio/syy040] [Citation(s) in RCA: 65] [Impact Index Per Article: 9.3] [Received: 09/16/2017] [Revised: 05/13/2018] [Accepted: 05/16/2018] [Indexed: 11/13/2022] Open
Abstract
Simultaneously modeling hybridization and the multispecies coalescent is becoming increasingly common, and inference of species networks in this context is now implemented in several software packages. This article addresses some of the conceptual issues and decisions to be made in this modeling, including whether or not to use branch lengths and issues with model identifiability. This article is based on a talk given at a Spotlight Session at Evolution 2017 meeting in Portland, Oregon. This session included several talks about modeling hybridization and gene flow in the presence of incomplete lineage sorting. Other talks given at this meeting are also included in this special issue of Systematic Biology.
Affiliation(s)
- James H Degnan
  - Department of Mathematics and Statistics, University of New Mexico, Albuquerque, NM 87131, USA
|
33
|
Zhu S, Degnan JH. Displayed Trees Do Not Determine Distinguishability Under the Network Multispecies Coalescent. Syst Biol 2018; 66:283-298. [PMID: 27780899 DOI: 10.1093/sysbio/syw097] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Received: 03/10/2015] [Accepted: 03/08/2016] [Indexed: 11/13/2022] Open
Abstract
Recent work in estimating species relationships from gene trees has included inferring networks assuming that past hybridization has occurred between species. Probabilistic models using the multispecies coalescent can be used in this framework for likelihood-based inference of both network topologies and parameters, including branch lengths and hybridization parameters. A difficulty for such methods is that it is not always clear whether, or to what extent, networks are identifiable, that is, whether there could be two distinct networks that lead to the same distribution of gene trees. For cases in which incomplete lineage sorting occurs in addition to hybridization, we demonstrate a new representation of the species network likelihood that expresses the probability distribution of the gene tree topologies as a linear combination of gene tree distributions given a set of species trees. This representation makes it clear that in some cases in which two distinct networks give the same distribution of gene trees when sampling one allele per species, the two networks can be distinguished theoretically when multiple individuals are sampled per species. This result means that network identifiability is not only a function of the trees displayed by the networks but also depends on allele sampling within species. We additionally give an example in which two networks that display exactly the same trees can be distinguished from their gene trees even when there is only one lineage sampled per species. [Keywords: gene tree, hybridization, identifiability, maximum likelihood, species tree, phylogeny.]
Affiliation(s)
- Sha Zhu
  - Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford OX3 7BN, UK
- James H Degnan
  - Department of Mathematics and Statistics, University of New Mexico, Albuquerque, NM 87110, USA
|
34
|
Theunert C, Slatkin M. Distinguishing recent admixture from ancestral population structure. Genome Biol Evol 2017; 9:2982377. [PMID: 28186554 PMCID: PMC5381645 DOI: 10.1093/gbe/evx018] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Received: 07/21/2016] [Revised: 01/17/2017] [Accepted: 02/04/2017] [Indexed: 12/19/2022] Open
Abstract
We develop and test two methods for distinguishing between recent admixture and ancestral population structure as explanations for greater similarity of one of two populations to an outgroup population. This problem arose when Neanderthals were found to be slightly more similar to non-African than to African populations. The excess similarity is consistent with both recent admixture from Neanderthals into the ancestors of non-Africans and subdivision in the ancestral population. Although later studies showed that there had been recent admixture, distinguishing between these two classes of models will be important in other situations, particularly when high-coverage genomes cannot be obtained for all populations. One of our two methods is based on the properties of the doubly conditioned frequency spectrum combined with the unconditional frequency spectrum. This method does not require a linkage map and can be used when there is relatively low coverage. The second method uses the extent of linkage disequilibrium among closely linked markers.
Affiliation(s)
- Christoph Theunert
  - Department of Integrative Biology, University of California, Berkeley
  - Department of Evolutionary Genetics, Max-Planck Institute for Evolutionary Anthropology, Leipzig, Germany
|
35
|
de Manuel M, Kuhlwilm M, Frandsen P, Sousa VC, Desai T, Prado-Martinez J, Hernandez-Rodriguez J, Dupanloup I, Lao O, Hallast P, Schmidt JM, Heredia-Genestar JM, Benazzo A, Barbujani G, Peter BM, Kuderna LFK, Casals F, Angedakin S, Arandjelovic M, Boesch C, Kühl H, Vigilant L, Langergraber K, Novembre J, Gut M, Gut I, Navarro A, Carlsen F, Andrés AM, Siegismund HR, Scally A, Excoffier L, Tyler-Smith C, Castellano S, Xue Y, Hvilsom C, Marques-Bonet T. Chimpanzee genomic diversity reveals ancient admixture with bonobos. Science 2016; 354:477-481. [PMID: 27789843 DOI: 10.1126/science.aag2602] [Citation(s) in RCA: 166] [Impact Index Per Article: 18.4] [Received: 06/02/2016] [Accepted: 09/09/2016] [Indexed: 12/13/2022]
Abstract
Our closest living relatives, chimpanzees and bonobos, have a complex demographic history. We analyzed the high-coverage whole genomes of 75 wild-born chimpanzees and bonobos from 10 countries in Africa. We found that chimpanzee population substructure makes genetic information a good predictor of geographic origin at country and regional scales. Multiple lines of evidence suggest that gene flow occurred from bonobos into the ancestors of central and eastern chimpanzees between 200,000 and 550,000 years ago, probably with subsequent spread into Nigeria-Cameroon chimpanzees. Together with another, possibly more recent contact (after 200,000 years ago), bonobos contributed less than 1% to the central chimpanzee genomes. Admixture thus appears to have been widespread during hominid evolution.
Affiliation(s)
- Marc de Manuel
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Martin Kuhlwilm
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Peter Frandsen
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark. Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark
- Vitor C Sousa
  - Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Tariq Desai
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
- Javier Prado-Martinez
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Jessica Hernandez-Rodriguez
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Isabelle Dupanloup
  - Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Oscar Lao
  - National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Universitat Pompeu Fabra, 08003 Barcelona, Spain
- Pille Hallast
  - Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK. Institute of Molecular and Cell Biology, University of Tartu, Tartu 51010, Estonia
- Joshua M Schmidt
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
- José María Heredia-Genestar
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Andrea Benazzo
  - Department of Life Sciences and Biotechnology, University of Ferrara, 44121 Ferrara, Italy
- Guido Barbujani
  - Department of Life Sciences and Biotechnology, University of Ferrara, 44121 Ferrara, Italy
- Benjamin M Peter
  - Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
- Lukas F K Kuderna
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Ferran Casals
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain
- Samuel Angedakin
  - Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
- Mimi Arandjelovic
  - Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
- Christophe Boesch
  - Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
- Hjalmar Kühl
  - Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
- Linda Vigilant
  - Department of Primatology, Max Planck Institute for Evolutionary Anthropology, 04103 Leipzig, Germany
- Kevin Langergraber
  - School of Human Evolution and Social Change and Institute of Human Origins, Arizona State University, Tempe, AZ 85287, USA
- John Novembre
  - Department of Human Genetics, University of Chicago, Chicago, IL 60637, USA
- Marta Gut
  - National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
- Ivo Gut
  - National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain
- Arcadi Navarro
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia 08010, Spain
- Frands Carlsen
  - Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark
- Aida M Andrés
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
- Hans R Siegismund
  - Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, 2200 Copenhagen, Denmark
- Aylwyn Scally
  - Department of Genetics, University of Cambridge, Cambridge CB2 3EH, UK
- Laurent Excoffier
  - Computational and Molecular Population Genetics, Institute of Ecology and Evolution, University of Berne, 3012 Berne, Switzerland. Swiss Institute of Bioinformatics, 1015 Lausanne, Switzerland
- Chris Tyler-Smith
  - Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Sergi Castellano
  - Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, 04103, Leipzig, Germany
- Yali Xue
  - Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton CB10 1SA, UK
- Christina Hvilsom
  - Center for Zoo and Wild Animal Health, Copenhagen Zoo, 2000 Frederiksberg, Denmark
- Tomas Marques-Bonet
  - Institut de Biologia Evolutiva (Consejo Superior de Investigaciones Científicas-Universitat Pompeu Fabra), Barcelona Biomedical Research Park, Doctor Aiguader 88, Barcelona, Catalonia 08003, Spain. National Centre for Genomic Analysis-Centre for Genomic Regulation, Barcelona Institute of Science and Technology, 08028 Barcelona, Spain. Institucio Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia 08010, Spain
|
36
|
Novikova PY, Hohmann N, Nizhynska V, Tsuchimatsu T, Ali J, Muir G, Guggisberg A, Paape T, Schmid K, Fedorenko OM, Holm S, Säll T, Schlötterer C, Marhold K, Widmer A, Sese J, Shimizu KK, Weigel D, Krämer U, Koch MA, Nordborg M. Sequencing of the genus Arabidopsis identifies a complex history of nonbifurcating speciation and abundant trans-specific polymorphism. Nat Genet 2016; 48:1077-82. [PMID: 27428747 DOI: 10.1038/ng.3617] [Citation(s) in RCA: 134] [Impact Index Per Article: 14.9] [Received: 12/21/2015] [Accepted: 06/14/2016] [Indexed: 12/17/2022]
Abstract
The notion of species as reproductively isolated units related through a bifurcating tree implies that gene trees should generally agree with the species tree and that sister taxa should not share polymorphisms unless they diverged recently and should be equally closely related to outgroups. It is now possible to evaluate this model systematically. We sequenced multiple individuals from 27 described taxa representing the entire Arabidopsis genus. Cluster analysis identified seven groups, corresponding to described species that capture the structure of the genus. However, at the level of gene trees, only the separation of Arabidopsis thaliana from the remaining species was universally supported, and, overall, the amount of shared polymorphism demonstrated that reproductive isolation was considerably more recent than the estimated divergence times. We uncovered multiple cases of past gene flow that contradict a bifurcating species tree. Finally, we showed that the pattern of divergence differs between gene ontologies, suggesting a role for selection.
Affiliation(s)
- Polina Yu Novikova
  - Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), Vienna, Austria; Vienna Graduate School of Population Genetics, Institut für Populationsgenetik, Vetmeduni, Vienna, Austria
- Nora Hohmann
  - Centre for Organismal Studies Heidelberg, University of Heidelberg, Heidelberg, Germany
- Viktoria Nizhynska
  - Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), Vienna, Austria
- Takashi Tsuchimatsu
  - Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), Vienna, Austria
- Jamshaid Ali
  - Department of Plant Physiology, Ruhr-Universität Bochum, Bochum, Germany
- Graham Muir
  - Vienna Graduate School of Population Genetics, Institut für Populationsgenetik, Vetmeduni, Vienna, Austria
- Tim Paape
  - Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Karl Schmid
  - Institute of Plant Breeding, Seed Science and Population Genetics, University of Hohenheim, Stuttgart, Germany
- Olga M Fedorenko
  - Institute of Biology, Karelian Research Center of the Russian Academy of Sciences, Petrozavodsk, Russia
- Svante Holm
  - Faculty of Science, Technology and Media, Department of Natural Sciences, Mid Sweden University, Sundsvall, Sweden
- Torbjörn Säll
  - Department of Biology, Lund University, Lund, Sweden
- Karol Marhold
  - Department of Botany, Faculty of Science, Charles University, Prague, Czech Republic; Institute of Botany, Slovak Academy of Sciences, Bratislava, Slovakia
- Alex Widmer
  - Department of Plant Physiology, Ruhr-Universität Bochum, Bochum, Germany
- Jun Sese
  - Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology, Tokyo, Japan
- Kentaro K Shimizu
  - Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Detlef Weigel
  - Max Planck Institute for Developmental Biology, Tübingen, Germany
- Ute Krämer
  - Department of Plant Physiology, Ruhr-Universität Bochum, Bochum, Germany
- Marcus A Koch
  - Centre for Organismal Studies Heidelberg, University of Heidelberg, Heidelberg, Germany
- Magnus Nordborg
  - Gregor Mendel Institute, Austrian Academy of Sciences, Vienna Biocenter (VBC), Vienna, Austria
|
37
|
Solís-Lemus C, Yang M, Ané C. Inconsistency of Species Tree Methods under Gene Flow. Syst Biol 2016; 65:843-51. [DOI: 10.1093/sysbio/syw030] [Citation(s) in RCA: 107] [Impact Index Per Article: 11.9] [Received: 10/20/2015] [Accepted: 04/01/2016] [Indexed: 11/14/2022] Open
|
38
|
Consistency and inconsistency of consensus methods for inferring species trees from gene trees in the presence of ancestral population structure. Theor Popul Biol 2016; 110:12-24. [PMID: 27086043 DOI: 10.1016/j.tpb.2016.02.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Received: 01/28/2014] [Revised: 12/22/2015] [Accepted: 02/05/2016] [Indexed: 11/21/2022]
Abstract
In the last few years, several statistically consistent consensus methods for species tree inference have been devised that are robust to the gene tree discordance caused by incomplete lineage sorting in unstructured ancestral populations. One source of gene tree discordance that has only recently been identified as a potential obstacle for phylogenetic inference is ancestral population structure. In this article, we describe a general model of ancestral population structure, and by relying on a single carefully constructed example scenario, we show that the consensus methods Democratic Vote, STEAC, STAR, R* Consensus, Rooted Triple Consensus, Minimize Deep Coalescences, and Majority-Rule Consensus are statistically inconsistent under the model. We find that among the consensus methods evaluated, the only method that is statistically consistent in the presence of ancestral population structure is GLASS/Maximum Tree. We use simulations to evaluate the behavior of the various consensus methods in a model with ancestral population structure, showing that as the number of gene trees increases, estimates on the basis of GLASS/Maximum Tree approach the true species tree topology irrespective of the level of population structure, whereas estimates based on the remaining methods only approach the true species tree topology if the level of structure is low. However, through simulations using species trees both with and without ancestral population structure, we show that GLASS/Maximum Tree performs unusually poorly on gene trees inferred from alignments with little information. This practical limitation of GLASS/Maximum Tree together with the inconsistency of other methods prompts the need for both further testing of additional existing methods and development of novel methods under conditions that incorporate ancestral population structure.
|
39
|
Meiklejohn KA, Faircloth BC, Glenn TC, Kimball RT, Braun EL. Analysis of a Rapid Evolutionary Radiation Using Ultraconserved Elements: Evidence for a Bias in Some Multispecies Coalescent Methods. Syst Biol 2016; 65:612-27. [DOI: 10.1093/sysbio/syw014] [Citation(s) in RCA: 114] [Impact Index Per Article: 12.7] [Received: 05/25/2015] [Accepted: 01/25/2016] [Indexed: 01/30/2023] Open
|
40
|
Richart CH, Hayashi CY, Hedin M. Phylogenomic analyses resolve an ancient trichotomy at the base of Ischyropsalidoidea (Arachnida, Opiliones) despite high levels of gene tree conflict and unequal minority resolution frequencies. Mol Phylogenet Evol 2015; 95:171-82. [PMID: 26691642 DOI: 10.1016/j.ympev.2015.11.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Received: 03/13/2015] [Revised: 09/16/2015] [Accepted: 11/13/2015] [Indexed: 11/19/2022]
Abstract
Phylogenetic resolution of ancient rapid radiations has remained problematic despite major advances in statistical approaches and DNA sequencing technologies. Here we report on a combined phylogenetic approach utilizing transcriptome data in conjunction with Sanger sequence data to investigate a tandem of ancient divergences in the harvestmen superfamily Ischyropsalidoidea (Arachnida, Opiliones, Dyspnoi). We rely on Sanger sequences to resolve nodes within and between closely related genera, and use RNA-seq data from a subset of taxa to resolve a short and ancient internal branch. We use several analytical approaches to explore this succession of ancient diversification events, including concatenated and coalescent-based analyses and maximum likelihood gene trees for each locus. We evaluate the robustness of phylogenetic inferences using a randomized locus sub-sampling approach, and find congruence across these methods despite considerable incongruence across gene trees. Incongruent gene trees are not recovered in frequencies expected from a simple multispecies coalescent model, and we reject incomplete lineage sorting as the sole contributor to gene tree conflict. Using these approaches we attain robust support for higher-level phylogenetic relationships within Ischyropsalidoidea.
Affiliation(s)
- Casey H Richart
  - Department of Biology, San Diego State University, 5500 Campanile Drive, San Diego, CA 92182, USA; Department of Biology, University of California, Riverside, CA 92521, USA
- Cheryl Y Hayashi
  - Department of Biology, University of California, Riverside, CA 92521, USA
- Marshal Hedin
  - Department of Biology, San Diego State University, 5500 Campanile Drive, San Diego, CA 92182, USA
|
41
|
Suh A, Smeds L, Ellegren H. The Dynamics of Incomplete Lineage Sorting across the Ancient Adaptive Radiation of Neoavian Birds. PLoS Biol 2015; 13:e1002224. [PMID: 26284513 PMCID: PMC4540587 DOI: 10.1371/journal.pbio.1002224] [Citation(s) in RCA: 170] [Impact Index Per Article: 17.0] [Received: 02/02/2015] [Accepted: 07/10/2015] [Indexed: 12/18/2022] Open
Abstract
The diversification of neoavian birds is one of the most rapid adaptive radiations of extant organisms. Recent whole-genome sequence analyses have much improved the resolution of the neoavian radiation and suggest concurrence with the Cretaceous-Paleogene (K-Pg) boundary, yet the causes of the remaining genome-level irresolvabilities appear unclear. Here we show that genome-level analyses of 2,118 retrotransposon presence/absence markers converge at a largely consistent Neoaves phylogeny and detect a highly differential temporal prevalence of incomplete lineage sorting (ILS), i.e., the persistence of ancestral genetic variation as polymorphisms during speciation events. We found that ILS-derived incongruences are spread over the genome and involve 35% and 34% of the analyzed loci on the autosomes and the Z chromosome, respectively. Surprisingly, Neoaves diversification comprises three adaptive radiations, an initial near-K-Pg super-radiation with highly discordant phylogenetic signals from near-simultaneous speciation events, followed by two post-K-Pg radiations of core landbirds and core waterbirds with much less pronounced ILS. We provide evidence that, given the extreme level of up to 100% ILS per branch in super-radiations, particularly rapid speciation events may neither resemble a fully bifurcating tree nor are they resolvable as such. As a consequence, their complex demographic history is more accurately represented as local networks within a species tree.
A study of ancient genetic variation reveals genomic evidence for near-simultaneous speciation at the base of Neoaves (a group containing most modern birds), which temporally coincides with the mass extinction of nonavian dinosaurs and archaic birds.
The rise of modern birds began after the mass extinction of nonavian dinosaurs and archaic birds at the Cretaceous-Paleogene (K-Pg) boundary, about 66 million years ago. This coincides with the super-rapid adaptive radiation of Neoaves (a group that contains most modern birds), which has been difficult to resolve even with whole genome sequences. We reconstructed the genealogical fates of thousands of rare genomic changes (insertions of selfish mobile elements called retrotransposons), a third of which were found to be affected by a phenomenon known as incomplete lineage sorting (ILS), namely a persistence of polymorphisms across multiple successive speciation events. Astoundingly, we found that near the K-Pg boundary, speciation events were accompanied by extreme levels of ILS, suggesting a near-simultaneous, star-like diversification process that appears plausible in the context of instantaneous niche availability that must have followed the K-Pg mass extinction. Our genome-scale results provide a population genomic explanation as to why some species radiations may be more complex than a fully bifurcating tree of life. We suggest that, under such circumstances, ILS bears witness to the biological limitation of phylogenetic resolution.
Affiliation(s)
- Alexander Suh
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala, Sweden
- Linnéa Smeds
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala, Sweden
- Hans Ellegren
- Department of Evolutionary Biology, Evolutionary Biology Centre (EBC), Uppsala University, Uppsala, Sweden
|
42
|
Lohse K, Clarke M, Ritchie MG, Etges WJ. Genome-wide tests for introgression between cactophilic Drosophila implicate a role of inversions during speciation. Evolution 2015; 69:1178-90. [PMID: 25824653 PMCID: PMC5029762 DOI: 10.1111/evo.12650] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Received: 11/10/2014] [Accepted: 03/17/2015] [Indexed: 12/25/2022]
Abstract
Models of speciation‐with‐gene‐flow have shown that the reduction in recombination between alternative chromosome arrangements can facilitate the fixation of locally adaptive genes in the face of gene flow and contribute to speciation. However, it has proven frustratingly difficult to show empirically that inversions have reduced gene flow and arose during or shortly after the onset of species divergence rather than represent ancestral polymorphisms. Here, we present an analysis of whole genome data from a pair of cactophilic fruit flies, Drosophila mojavensis and D. arizonae, which are reproductively isolated in the wild and differ by several large inversions on three chromosomes. We found an increase in divergence at rearranged compared to colinear chromosomes. Using the density of divergent sites in short sequence blocks we fit a series of explicit models of species divergence in which gene flow is restricted to an initial period after divergence and may differ between colinear and rearranged parts of the genome. These analyses show that D. mojavensis and D. arizonae have experienced postdivergence gene flow that ceased around 270 KY ago and was significantly reduced in chromosomes with fixed inversions. Moreover, we show that these inversions most likely originated around the time of species divergence which is compatible with theoretical models that posit a role of inversions in speciation with gene flow.
Affiliation(s)
- Konrad Lohse
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Magnus Clarke
- Institute of Evolutionary Biology, University of Edinburgh, Edinburgh EH9 3FL, United Kingdom
- Michael G Ritchie
- School of Biology, University of St. Andrews, St. Andrews KY16 9TH, United Kingdom
- William J Etges
- Program in Ecology and Evolutionary Biology, Department of Biological Sciences, University of Arkansas, Fayetteville, Arkansas 72701
|
43
|
Rogers AR, Bohlender RJ. Bias in estimators of archaic admixture. Theor Popul Biol 2015; 100C:63-78. [DOI: 10.1016/j.tpb.2014.12.006] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Received: 07/29/2013] [Revised: 12/20/2014] [Accepted: 12/23/2014] [Indexed: 11/30/2022]
|
44
|
Leaché AD, Harris RB, Maliska ME, Linkem CW. Comparative species divergence across eight triplets of spiny lizards (Sceloporus) using genomic sequence data. Genome Biol Evol 2014; 5:2410-9. [PMID: 24259316 PMCID: PMC3879974 DOI: 10.1093/gbe/evt186] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.5] [Indexed: 12/17/2022] Open
Abstract
Species divergence is typically thought to occur in the absence of gene flow, but many empirical studies are discovering that gene flow may be more pervasive during species formation. Although many examples of divergence with gene flow have been identified, few clades have been investigated in a comparative manner, and fewer have been studied using genome-wide sequence data. We contrast species divergence genetic histories across eight triplets of North American Sceloporus lizards using a maximum likelihood implementation of the isolation–migration (IM) model. Gene flow at the time of species divergence is modeled indirectly as variation in species divergence time across the genome or explicitly using a migration rate parameter. Likelihood ratio tests (LRTs) are used to test the null model of no gene flow at speciation against these two alternative gene flow models. We also use the Akaike information criterion to rank the models. Hundreds of loci are needed for the LRTs to have statistical power, and we use genome sequencing of reduced representation libraries to obtain DNA sequence alignments at many loci (between 340 and 3,478; mean = 1,678) for each triplet. We find that current species distributions are a poor predictor of whether a species pair diverged with gene flow. Interrogating the genome using the triplet method expedites the comparative study of species divergence history and the estimation of genetic parameters associated with speciation.
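The model comparison described in this abstract (likelihood ratio tests against a no-gene-flow null, plus ranking by the Akaike information criterion) can be sketched as follows. This is a minimal illustration that assumes a single extra parameter in the alternative model and a plain chi-square reference distribution with one degree of freedom; the log-likelihood values and function names are hypothetical and do not come from the paper. When the tested parameter sits on a boundary, as a migration rate of zero does, the plain chi-square reference is generally considered conservative.

```python
import math


def lrt_df1(lnl_null: float, lnl_alt: float) -> tuple[float, float]:
    """Likelihood ratio test for nested models differing by one parameter.

    Returns (test statistic, p-value) using a chi-square reference
    with 1 degree of freedom.
    """
    stat = max(0.0, 2.0 * (lnl_alt - lnl_null))
    # Survival function of chi-square(1): P(X > x) = erfc(sqrt(x / 2))
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


def aic(lnl: float, n_params: int) -> float:
    """Akaike information criterion: 2k - 2 ln L (lower is better)."""
    return 2.0 * n_params - 2.0 * lnl


if __name__ == "__main__":
    # Hypothetical maximized log-likelihoods for one triplet
    lnl_no_flow, lnl_with_migration = -5230.4, -5226.1
    stat, p = lrt_df1(lnl_no_flow, lnl_with_migration)
    print(f"LRT statistic = {stat:.2f}, p = {p:.4f}")
    print("AIC (no gene flow):", aic(lnl_no_flow, 3))
    print("AIC (migration):  ", aic(lnl_with_migration, 4))
```

The parameter counts passed to `aic` (3 and 4) are placeholders; the actual number of free parameters depends on how each model is specified.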
|
45
|
Zwickl DJ, Stein JC, Wing RA, Ware D, Sanderson MJ. Disentangling Methodological and Biological Sources of Gene Tree Discordance on Oryza (Poaceae) Chromosome 3. Syst Biol 2014; 63:645-59. [DOI: 10.1093/sysbio/syu027] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Indexed: 11/14/2022] Open
Affiliation(s)
- Derrick J. Zwickl
- (1) Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA; (2) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; (3) School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; and (4) Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research Service, Ithaca, NY 14853, USA
- Joshua C. Stein
- (1) Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA; (2) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; (3) School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; and (4) Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research Service, Ithaca, NY 14853, USA
- Rod A. Wing
- (1) Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA; (2) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; (3) School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; and (4) Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research Service, Ithaca, NY 14853, USA
- Doreen Ware
- (1) Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA; (2) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; (3) School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; and (4) Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research Service, Ithaca, NY 14853, USA
- Michael J. Sanderson
- (1) Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ 85721, USA; (2) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 11724, USA; (3) School of Plant Sciences, University of Arizona, Tucson, AZ 85721, USA; and (4) Robert W. Holley Center for Agriculture and Health, United States Department of Agriculture-Agricultural Research Service, Ithaca, NY 14853, USA
|
46
|
Abstract
Although there has been much interest in estimating histories of divergence and admixture from genomic data, it has proved difficult to distinguish recent admixture from long-term structure in the ancestral population. Thus, recent genome-wide analyses based on summary statistics have sparked controversy about the possibility of interbreeding between Neandertals and modern humans in Eurasia. Here we derive the probability of full mutational configurations in nonrecombining sequence blocks under both admixture and ancestral structure scenarios. Dividing the genome into short blocks gives an efficient way to compute maximum-likelihood estimates of parameters. We apply this likelihood scheme to triplets of human and Neandertal genomes and compare the relative support for a model of admixture from Neandertals into Eurasian populations after their expansion out of Africa against a history of persistent structure in their common ancestral population in Africa. Our analysis allows us to conclusively reject a model of ancestral structure in Africa and instead reveals strong support for Neandertal admixture in Eurasia at a higher rate (3.4-7.3%) than suggested previously. Using analysis and simulations we show that our inference is more powerful than previous summary statistics and robust to realistic levels of recombination.
|
47
|
Rheindt FE, Fujita MK, Wilton PR, Edwards SV. Introgression and Phenotypic Assimilation in Zimmerius Flycatchers (Tyrannidae): Population Genetic and Phylogenetic Inferences from Genome-Wide SNPs. Syst Biol 2013; 63:134-52. [DOI: 10.1093/sysbio/syt070] [Citation(s) in RCA: 77] [Impact Index Per Article: 6.4] [Indexed: 12/19/2022] Open
|
48
|
Leaché AD, Harris RB, Rannala B, Yang Z. The Influence of Gene Flow on Species Tree Estimation: A Simulation Study. Syst Biol 2013; 63:17-30. [DOI: 10.1093/sysbio/syt049] [Citation(s) in RCA: 249] [Impact Index Per Article: 20.8] [Indexed: 02/03/2023] Open
Affiliation(s)
- Adam D. Leaché
- Department of Biology and Burke Museum of Natural History and Culture, University of Washington, Seattle, WA 98195, USA
- Rebecca B. Harris
- Department of Biology and Burke Museum of Natural History and Culture, University of Washington, Seattle, WA 98195, USA
- Bruce Rannala
- Genome Center and Department of Evolution & Ecology, University of California, Davis, CA 95616, USA
- Center for Computational Genomics, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- Ziheng Yang
- Center for Computational Genomics, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China
- Department of Biology, University College London, Gower Street, London WC1E 6BT, UK
|
49
|
Lowery RK, Uribe G, Jimenez EB, Weiss MA, Herrera KJ, Regueiro M, Herrera RJ. Neanderthal and Denisova genetic affinities with contemporary humans: introgression versus common ancestral polymorphisms. Gene 2013; 530:83-94. [PMID: 23872234 DOI: 10.1016/j.gene.2013.06.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Received: 03/13/2013] [Revised: 06/08/2013] [Accepted: 06/11/2013] [Indexed: 10/26/2022]
Abstract
Analyses of the genetic relationships among modern humans, Neanderthals and Denisovans have suggested that 1-4% of the non-Sub-Saharan African gene pool may be Neanderthal derived, while 6-8% of the Melanesian gene pool may be the product of admixture between the Denisovans and the direct ancestors of Melanesians. In the present study, we analyzed single nucleotide polymorphism (SNP) diversity among a worldwide collection of contemporary human populations with respect to the genetic constitution of these two archaic hominins and Pan troglodytes (chimpanzee). We partitioned SNPs into subsets, including those that are derived in both archaic lineages, those that are ancestral in both archaic lineages and those that are only derived in one archaic lineage. By doing this, we have conducted separate examinations of subsets of mutations with higher probabilities of divergent phylogenetic origins. While previous investigations have excluded SNPs from common ancestors in principal component analyses, we included common ancestral SNPs in our analyses to visualize the relative placement of the Neanderthal and Denisova among human populations. To assess the genetic similarities among the various hominin lineages, we performed genetic structure analyses to provide a comparison of genetic patterns found within contemporary human genomes that may have archaic or common ancestral roots. Our results indicate that 3.6% of the Neanderthal genome is shared with roughly 65.4% of the average European gene pool, which clinally diminishes with distance from Europe. Our results suggest that Neanderthal genetic associations with contemporary non-Sub-Saharan African populations, as well as the genetic affinities observed between Denisovans and Melanesians most likely result from the retention of ancient mutations in these populations.
Affiliation(s)
- Robert K Lowery
- Department of Molecular and Human Genetics, College of Medicine, Florida International University, Miami, FL 33199, USA; Department of Biological Sciences, College of Arts and Sciences, Florida International University, Miami, FL 33199, USA; Department of Biological Sciences, College of Arts and Sciences, Indian River State College, Fort Pierce, FL 34981, USA.
|
50
|
Schumer M, Cui R, Boussau B, Walter R, Rosenthal G, Andolfatto P. An evaluation of the hybrid speciation hypothesis for Xiphophorus clemenciae based on whole genome sequences. Evolution 2013; 67:1155-68. [PMID: 23550763 PMCID: PMC3621027 DOI: 10.1111/evo.12009] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Indexed: 11/29/2022]
Abstract
Once thought rare in animal taxa, hybridization has been increasingly recognized as an important and common force in animal evolution. In the past decade, a number of studies have suggested that hybridization has driven speciation in some animal groups. We investigate the signature of hybridization in the genome of a putative hybrid species, Xiphophorus clemenciae, through whole genome sequencing of this species and its hypothesized progenitors. Based on analysis of this data, we find that X. clemenciae is unlikely to have been derived from admixture between its proposed parental species. However, we find significant evidence for recent gene flow between Xiphophorus species. Although we detect genetic exchange in two pairs of species analyzed, the proportion of genomic regions that can be attributed to hybrid origin is small, suggesting that strong behavioral premating isolation prevents frequent hybridization in Xiphophorus. The direction of gene flow between species is potentially consistent with a role for sexual selection in mediating hybridization.
Affiliation(s)
- Molly Schumer
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, New Jersey 08544, USA.
|