1
|
Comte A, Tricou T, Tannier E, Joseph J, Siberchicot A, Penel S, Allio R, Delsuc F, Dray S, de Vienne DM. PhylteR: Efficient Identification of Outlier Sequences in Phylogenomic Datasets. Mol Biol Evol 2023; 40:msad234. [PMID: 37879113 PMCID: PMC10655845 DOI: 10.1093/molbev/msad234] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 09/29/2023] [Accepted: 10/18/2023] [Indexed: 10/27/2023] Open
Abstract
In phylogenomics, incongruences between gene trees, resulting from both artifactual and biological reasons, can decrease the signal-to-noise ratio and complicate species tree inference. The amount of data handled today in classical phylogenomic analyses precludes manual error detection and removal. However, a simple and efficient way to automate the identification of outliers from a collection of gene trees is still missing. Here, we present PhylteR, a method that allows rapid and accurate detection of outlier sequences in phylogenomic datasets, i.e. species from individual gene trees that do not follow the general trend. PhylteR relies on DISTATIS, an extension of multidimensional scaling to 3 dimensions to compare multiple distance matrices at once. In PhylteR, these distance matrices extracted from individual gene phylogenies represent evolutionary distances between species according to each gene. On simulated datasets, we show that PhylteR identifies outliers with more sensitivity and precision than a comparable existing method. We also show that PhylteR is not sensitive to ILS-induced incongruences, which is a desirable feature. On a biological dataset of 14,463 genes for 53 species previously assembled for Carnivora phylogenomics, we show (i) that PhylteR identifies as outliers sequences that can be considered as such by other means, and (ii) that the removal of these sequences improves the concordance between the gene trees and the species tree. Thanks to the generation of numerous graphical outputs, PhylteR also allows for the rapid and easy visual characterization of the dataset at hand, thus aiding in the precise identification of errors. PhylteR is distributed as an R package on CRAN and as containerized versions (docker and singularity).
Collapse
Affiliation(s)
- Aurore Comte
- French Institute of Bioinformatics (IFB)—South Green Bioinformatics Platform, Bioversity, CIRAD, INRAE, IRD, Montpellier, France
- IRD, CIRAD, INRAE, Institut Agro, PHIM Plant Health Institute, Montpellier University, Montpellier, France
| | - Théo Tricou
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| | - Eric Tannier
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
- Centre de Recherches Inria de Lyon, Villeurbanne, France
| | - Julien Joseph
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| | - Aurélie Siberchicot
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| | - Simon Penel
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| | - Rémi Allio
- CBGP, INRAE, CIRAD, IRD, Montpellier SupAgro, Univ. Montpellier, Montpellier, France
| | | | - Stéphane Dray
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| | - Damien M de Vienne
- Université de Lyon, Université Lyon 1, UMR CNRS 5558 Laboratoire de Biométrie et Biologie Évolutive, Villeurbanne, France
| |
Collapse
|
2
|
Wilson JD, Bond JE, Harvey MS, Ramírez MJ, Rix MG. Correlation with a limited set of behavioral niches explains the convergence of somatic morphology in mygalomorph spiders. Ecol Evol 2023; 13:e9706. [PMID: 36636427 PMCID: PMC9830016 DOI: 10.1002/ece3.9706] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 12/06/2022] [Accepted: 12/15/2022] [Indexed: 01/11/2023] Open
Abstract
Understanding the drivers of morphological convergence requires investigation into its relationship with behavior and niche space, and such investigations in turn provide insights into evolutionary dynamics, functional morphology, and life history. Mygalomorph spiders (trapdoor spiders and their kin) have long been associated with high levels of morphological homoplasy, and many convergent features can be intuitively associated with different behavioral niches. Using genus-level phylogenies based on recent genomic studies and a newly assembled matrix of discrete behavioral and somatic morphological characters, we reconstruct the evolution of burrowing behavior in the Mygalomorphae, compare the influence of behavior and evolutionary history on somatic morphology, and test hypotheses of correlated evolution between specific morphological features and behavior. Our results reveal the simplicity of the mygalomorph adaptive landscape, with opportunistic, web-building taxa at one end, and burrowing/nesting taxa with structurally modified burrow entrances (e.g., a trapdoor) at the other. Shifts in behavioral niche, in both directions, are common across the evolutionary history of the Mygalomorphae, and several major clades include taxa inhabiting both behavioral extremes. Somatic morphology is heavily influenced by behavior, with taxa inhabiting the same behavioral niche often more similar morphologically than more closely related but behaviorally divergent taxa, and we were able to identify a suite of 11 somatic features that show significant correlation with particular behaviors. We discuss these findings in light of the function of particular morphological features, niche dynamics within the Mygalomorphae, and constraints on the mygalomorph adaptive landscape relative to other spiders.
Collapse
Affiliation(s)
- Jeremy D. Wilson
- Biodiversity and Geosciences ProgramQueensland Museum Collections and Research CentreHendraQueenslandAustralia
| | - Jason E. Bond
- Department of Entomology and NematologyUniversity of CaliforniaDavisCaliforniaUSA
| | - Mark S. Harvey
- Collections and ResearchWestern Australian MuseumWelshpoolWestern AustraliaAustralia
- School of Biological SciencesUniversity of Western AustraliaCrawleyWestern AustraliaAustralia
| | - Martín J. Ramírez
- Museo Argentino de Ciencias NaturalesConsejo Nacional de Investigaciones Científicas y TécnicasBuenos AiresArgentina
| | - Michael G. Rix
- Biodiversity and Geosciences ProgramQueensland Museum Collections and Research CentreHendraQueenslandAustralia
| |
Collapse
|
3
|
Wisniewski AL, Lloyd GT, Slater GJ. Extant species fail to estimate ancestral geographical ranges at older nodes in primate phylogeny. Proc Biol Sci 2022; 289:20212535. [PMID: 35582793 DOI: 10.1098/rspb.2021.2535] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
A clade's evolutionary history is shaped, in part, by geographical range expansion, sweepstakes dispersal and local extinction. A rigorous understanding of historical biogeography may therefore yield insights into macroevolutionary dynamics such as adaptive radiation. Modern historical biogeographic analyses typically fit statistical models to molecular phylogenies, but it remains unclear whether extant species provide sufficient signal or if well-sampled phylogenies of extinct and extant taxa are necessary to produce meaningful estimates of past ranges. We investigated the historical biogeography of Primates and their euarchontan relatives using a novel meta-analytical phylogeny of over 900 extant (n= 419) and extinct (n = 483) species spanning their entire evolutionary history. Ancestral range estimates for young nodes were largely congruent with those derived from molecular phylogeny. However, node age exerts a significant effect on ancestral range estimate congruence, and the probability of congruent inference dropped below 0.5 for nodes older than the late Eocene, corresponding to the origins of higher-level clades. Discordance was not observed in analyses of extinct taxa alone. Fossils are essential for robust ancestral range inference and biogeographic analyses of extant clades originating in the deep past should be viewed with scepticism without them.
Collapse
Affiliation(s)
- Anna L Wisniewski
- Department of the Geophysical Sciences, University of Chicago, Chicago IL, USA
| | - Graeme T Lloyd
- School of Earth and Environment, University of Leeds, Leeds, UK
| | - Graham J Slater
- Department of the Geophysical Sciences, University of Chicago, Chicago IL, USA
| |
Collapse
|
4
|
Lloyd GT, Slater GJ. A Total-Group Phylogenetic Metatree for Cetacea and the Importance of Fossil Data in Diversification Analyses. Syst Biol 2021; 70:922-939. [PMID: 33507304 DOI: 10.1093/sysbio/syab002] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2020] [Revised: 12/20/2020] [Accepted: 01/10/2021] [Indexed: 01/09/2023] Open
Abstract
Phylogenetic trees provide a powerful framework for testing macroevolutionary hypotheses, but it is becoming increasingly apparent that inferences derived from extant species alone can be highly misleading. Trees incorporating living and extinct taxa are are needed to address fundamental questions about the origins of diversity and disparity but it has proved challenging to generate robust, species-rich phylogenies that include large numbers of fossil taxa. As a result, most studies of diversification dynamics continue to rely on molecular phylogenies. Here, we extend and apply a recently developed meta-analytic approach for synthesizing previously published phylogenetic studies to infer a well-resolved set of species level, time-scaled phylogenetic hypotheses for extinct and extant cetaceans (whales, dolphins and allies). Our trees extend sampling from the ∼ 90 extant species to over 500 living and extinct species, and therefore allow for more robust inference of macroevolutionary dynamics. While the diversification scenarios we recover are broadly concordant with those inferred from molecular phylogenies they differ in critical ways, notably in the relative contributions of extinction and speciation rate shifts in driving rapid radiations. The metatree approach provides the most immediate route for generating higher level phylogenies of extinct taxa, and opens the door to re-evaluation of macroevolutionary hypotheses derived only from extant taxa.
Collapse
Affiliation(s)
- Graeme T Lloyd
- School of Earth and Environment, University of Leeds, Leeds, U.K
| | - Graham J Slater
- Department of the Geophysical Sciences, University of Chicago, Chicago, USA
| |
Collapse
|
5
|
Chopra K, Bawaria S, Chauhan R. Evolutionary divergence of the nuclear pore complex from fungi to metazoans. Protein Sci 2018; 28:571-586. [PMID: 30488506 PMCID: PMC6371224 DOI: 10.1002/pro.3558] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2018] [Revised: 11/16/2018] [Accepted: 11/19/2018] [Indexed: 12/19/2022]
Abstract
Nuclear pore complex (NPC) is the largest multimeric protein assembly of the eukaryotic cell, which mediates the nucleocytoplasmic transport. The constituent proteins of this assembly (nucleoporins) are present in varying copy numbers to give a size from ~ 60 MDa (yeast) to 112 MDa (human) and share common ancestry with other membrane‐associated complexes such as COPI/COPII and thus share the same structural folds. However, the nucleoporins across species exhibit very low percentage sequence similarity and this reflects in their distinct secondary structure and domain organization. We employed thorough sequence and phylogenetic analysis guided from structure‐based alignments of all the nucleoporins from fungi to metazoans to understand the evolution of NPC. Through evolutionary pressure analysis on various nucleoporins, we deduced that these proteins are under differential selection pressure and hence the homologous interacting partners do not complement each other in the in vitro pull‐down assay. The super tree analysis of all nucleoporins taken together illustrates divergent evolution of nucleoporins and notably, the degree of divergence is more apparent in higher order organisms as compared to lower species. Overall, our results support the hypothesis that the protein–protein interactions in such large multimeric assemblies are species specific in nature and hence their structure and function should also be studied in an organism‐specific manner.
Collapse
Affiliation(s)
- Kriti Chopra
- National Center for Cell Science, S.P. Pune University, Pune, 411007, Maharashtra, India
| | - Shrankhla Bawaria
- National Center for Cell Science, S.P. Pune University, Pune, 411007, Maharashtra, India
| | - Radha Chauhan
- National Center for Cell Science, S.P. Pune University, Pune, 411007, Maharashtra, India
| |
Collapse
|
6
|
Ashford OS, Kenny AJ, Barrio Froján CRS, Bonsall MB, Horton T, Brandt A, Bird GJ, Gerken S, Rogers AD. Phylogenetic and functional evidence suggests that deep-ocean ecosystems are highly sensitive to environmental change and direct human disturbance. Proc Biol Sci 2018; 285:20180923. [PMID: 30068675 PMCID: PMC6111167 DOI: 10.1098/rspb.2018.0923] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Accepted: 07/09/2018] [Indexed: 11/12/2022] Open
Abstract
An understanding of the balance of interspecific competition and the physical environment in structuring organismal communities is crucial because those communities structured primarily by their physical environment typically exhibit greater sensitivity to environmental change than those structured predominantly by competitive interactions. Here, using detailed phylogenetic and functional information, we investigate this question in macrofaunal assemblages from Northwest Atlantic Ocean continental slopes, a high seas region projected to experience substantial environmental change through the current century. We demonstrate assemblages to be both phylogenetically and functionally under-dispersed, and thus conclude that the physical environment, not competition, may dominate in structuring deep-ocean communities. Further, we find temperature and bottom trawling intensity to be among the environmental factors significantly related to assemblage diversity. These results hint that deep-ocean communities are highly sensitive to their physical environment and vulnerable to environmental perturbation, including by direct disturbance through fishing, and indirectly through the changes brought about by climate change.
Collapse
Affiliation(s)
- Oliver S Ashford
- Department of Zoology, University of Oxford, Oxford OX1 3PS, UK
- Centre for the Environment, Fisheries and Aquaculture Science (Cefas), Pakefield Road, Lowestoft NR33 0HT, UK
| | - Andrew J Kenny
- Centre for the Environment, Fisheries and Aquaculture Science (Cefas), Pakefield Road, Lowestoft NR33 0HT, UK
| | | | | | - Tammy Horton
- National Oceanography Centre, University of Southampton Waterfront Campus, European Way, Southampton, SO14 3ZH, UK
| | - Angelika Brandt
- Department of Marine Zoology, Senckenberg Research Institute and Natural History Museum, Senckenberganlage 25, 60325 Frankfurt am Main, Germany
- FB 15 Biological Sciences, Institute for Ecology, Diversity and Evolution, Goethe University Frankfurt, Campus Riedberg, Max-von-Laue-Str. 13, 60438 Frankfurt am Main, Germany
| | - Graham J Bird
- 8 Shotover Grove, Waikanae, Kāpiti 5036, New Zealand
| | - Sarah Gerken
- Department of Biological Sciences, University of Alaska, Anchorage, AK 99508, USA
| | - Alex D Rogers
- Department of Zoology, University of Oxford, Oxford OX1 3PS, UK
| |
Collapse
|
7
|
Abstract
Supertree methods merge a set of overlapping phylogenetic trees into a supertree containing all taxa of the input trees. The challenge in supertree reconstruction is the way of dealing with conflicting information in the input trees. Many different algorithms for different objective functions have been suggested to resolve these conflicts. In particular, there exist methods based on encoding the source trees in a matrix, where the supertree is constructed applying a local search heuristic to optimize the respective objective function. We present a novel heuristic supertree algorithm called Bad Clade Deletion (BCD) supertrees. It uses minimum cuts to delete a locally minimal number of columns from such a matrix representation so that it is compatible. This is the complement problem to Matrix Representation with Compatibility (Maximum Split Fit). Our algorithm has guaranteed polynomial worst-case running time and performs swiftly in practice. Different from local search heuristics, it guarantees to return the directed perfect phylogeny for the input matrix, corresponding to the parent tree of the input trees, if one exists. Comparing supertrees to model trees for simulated data, BCD shows a better accuracy (F1 score) than the state-of-the-art algorithms SuperFine (up to 3%) and Matrix Representation with Parsimony (up to 7%); at the same time, BCD is up to 7 times faster than SuperFine, and up to 600 times faster than Matrix Representation with Parsimony. Finally, using the BCD supertree as a starting tree for a combined Maximum Likelihood analysis using RAxML, we reach significantly improved accuracy (1% higher F1 score) and running time (1.7-fold speedup).
Collapse
Affiliation(s)
- Markus Fleischauer
- Chair for Bioinformatics, Institute for Computer Science, Friedrich-Schiller-University Jena, Jena, Germany
| | - Sebastian Böcker
- Chair for Bioinformatics, Institute for Computer Science, Friedrich-Schiller-University Jena, Jena, Germany
| |
Collapse
|
8
|
Abstract
Current phylogenomic data sets highlight the need for species tree methods able to deal with several sources of gene tree/species tree incongruence. At the same time, we need to make most use of all available data. Most species tree methods deal with single processes of phylogenetic discordance, namely, gene duplication and loss, incomplete lineage sorting (ILS) or horizontal gene transfer. In this manuscript, we address the problem of species tree inference from multilocus, genome-wide data sets regardless of the presence of gene duplication and loss and ILS therefore without the need to identify orthologs or to use a single individual per species. We do this by extending the idea of Maximum Likelihood (ML) supertrees to a hierarchical Bayesian model where several sources of gene tree/species tree disagreement can be accounted for in a modular manner. We implemented this model in a computer program called guenomu whose inputs are posterior distributions of unrooted gene tree topologies for multiple gene families, and whose output is the posterior distribution of rooted species tree topologies. We conducted extensive simulations to evaluate the performance of our approach in comparison with other species tree approaches able to deal with more than one leaf from the same species. Our method ranked best under simulated data sets, in spite of ignoring branch lengths, and performed well on empirical data, as well as being fast enough to analyze relatively large data sets. Our Bayesian supertree method was also very successful in obtaining better estimates of gene trees, by reducing the uncertainty in their distributions. In addition, our results show that under complex simulation scenarios, gene tree parsimony is also a competitive approach once we consider its speed, in contrast to more sophisticated models.
Collapse
Affiliation(s)
| | - Diego Mallo
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, 36310, Spain
| | - David Posada
- Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, 36310, Spain
| |
Collapse
|
9
|
Sigwart JD, Lindberg DR. Consensus and confusion in molluscan trees: evaluating morphological and molecular phylogenies. Syst Biol 2015; 64:384-95. [PMID: 25472575 PMCID: PMC4395843 DOI: 10.1093/sysbio/syu105] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2013] [Accepted: 11/21/2014] [Indexed: 11/18/2022] Open
Abstract
Mollusks are the most morphologically disparate living animal phylum, they have diversified into all habitats, and have a deep fossil record. Monophyly and identity of their eight living classes is undisputed, but relationships between these groups and patterns of their early radiation have remained elusive. Arguments about traditional morphological phylogeny focus on a small number of topological concepts but often without regard to proximity of the individual classes. In contrast, molecular studies have proposed a number of radically different, inherently contradictory, and controversial sister relationships. Here, we assembled a data set of 42 unique published trees describing molluscan interrelationships. We used these data to ask several questions about the state of resolution of molluscan phylogeny compared with a null model of the variation possible in random trees constructed from a monophyletic assemblage of eight terminals. Although 27 different unique trees have been proposed from morphological inference, the majority of these are not statistically different from each other. Within the available molecular topologies, only four studies to date have included the deep sea class Monoplacophora; but 36.4% of all trees are not significantly different. We also present supertrees derived from two data partitions and three methods, including all available molecular molluscan phylogenies, which will form the basis for future hypothesis testing. The supertrees presented here were not constructed to provide yet another hypothesis of molluscan relationships, but rather to algorithmically evaluate the relationships present in the disparate published topologies. Based on the totality of available evidence, certain patterns of relatedness among constituent taxa become clear. The internodal distance is consistently short between a few taxon pairs, particularly supporting the relatedness of Monoplacophora and the chitons, Polyplacophora. Other taxon pairs are rarely or never found in close proximity, such as the vermiform Caudofoveata and Bivalvia. Our results have specific utility for guiding constructive research planning to better test relationships in Mollusca as well as other problematic groups. Taxa with consistently proximate relationships should be the focus of a combined approach in a concerted assessment of potential genetic and anatomical homology, whereas unequivocally distant taxa will make the most constructive choices for exemplar selection in higher level phylogenomic analyses.
Collapse
Affiliation(s)
- Julia D Sigwart
- Marine Laboratory, Queen's University Belfast, BT22 1PF, Northern Ireland, UK; and Department of Integrative Biology, Museum of Paleontology and Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
| | - David R Lindberg
- Marine Laboratory, Queen's University Belfast, BT22 1PF, Northern Ireland, UK; and Department of Integrative Biology, Museum of Paleontology and Center for Computational Biology, University of California, Berkeley, CA, 94720, USA
| |
Collapse
|
10
|
Thomas DB, McGraw KJ, Butler MW, Carrano MT, Madden O, James HF. Ancient origins and multiple appearances of carotenoid-pigmented feathers in birds. Proc Biol Sci 2015; 281:20140806. [PMID: 24966316 DOI: 10.1098/rspb.2014.0806] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
The broad palette of feather colours displayed by birds serves diverse biological functions, including communication and camouflage. Fossil feathers provide evidence that some avian colours, like black and brown melanins, have existed for at least 160 million years (Myr), but no traces of bright carotenoid pigments in ancient feathers have been reported. Insight into the evolutionary history of plumage carotenoids may instead be gained from living species. We visually surveyed modern birds for carotenoid-consistent plumage colours (present in 2956 of 9993 species). We then used high-performance liquid chromatography and Raman spectroscopy to chemically assess the family-level distribution of plumage carotenoids, confirming their presence in 95 of 236 extant bird families (only 36 family-level occurrences had been confirmed previously). Using our data for all modern birds, we modelled the evolutionary history of carotenoid-consistent plumage colours on recent supertrees. Results support multiple independent origins of carotenoid plumage pigmentation in 13 orders, including six orders without previous reports of plumage carotenoids. Based on time calibrations from the supertree, the number of avian families displaying plumage carotenoids increased throughout the Cenozoic, and most plumage carotenoid originations occurred after the Miocene Epoch (23 Myr). The earliest origination of plumage carotenoids was reconstructed within Passeriformes, during the Palaeocene Epoch (66-56 Myr), and not at the base of crown-lineage birds.
Collapse
Affiliation(s)
- Daniel B Thomas
- Department of Vertebrate Zoology, MRC-116, Smithsonian Institution, Washington, DC 20013-7012, USA Museum Conservation Institute, Smithsonian Institution, Suitland, MD 20746, USA Institute of Natural and Mathematical Sciences, Massey University, Auckland 0632, New Zealand
| | - Kevin J McGraw
- School of Life Sciences, Arizona State University, Tempe, AZ 85287-4501, USA
| | - Michael W Butler
- Department of Biology, Lafayette College, Easton, PA 18042-1778, USA
| | - Matthew T Carrano
- Department of Paleobiology, MRC-121, National Museum of Natural History, Smithsonian Institution, Washington, DC 20013-7012, USA
| | - Odile Madden
- Museum Conservation Institute, Smithsonian Institution, Suitland, MD 20746, USA
| | - Helen F James
- Department of Vertebrate Zoology, MRC-116, Smithsonian Institution, Washington, DC 20013-7012, USA
| |
Collapse
|
11
|
Huang D, Roy K. Anthropogenic extinction threats and future loss of evolutionary history in reef corals. Ecol Evol 2013; 3:1184-93. [PMID: 23762506 PMCID: PMC3678474 DOI: 10.1002/ece3.527] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2013] [Revised: 02/08/2013] [Accepted: 02/14/2013] [Indexed: 11/11/2022] Open
Abstract
Extinction always results in loss of phylogenetic diversity (PD), but phylogenetically selective extinctions have long been thought to disproportionately reduce PD. Recent simulations show that tree shapes also play an important role in determining the magnitude of PD loss, potentially offsetting the effects of clustered extinctions. While patterns of PD loss under different extinction scenarios are becoming well characterized in model phylogenies, analyses of real clades that often have unbalanced tree shapes remain scarce, particularly for marine organisms. Here, we use a fossil-calibrated phylogeny of all living scleractinian reef corals in conjunction with IUCN data on extinction vulnerabilities to quantify how loss of species in different threat categories will affect the PD of this group. Our analyses reveal that predicted PD loss in corals varies substantially among different threats, with extinctions due to bleaching and disease having the largest negative effects on PD. In general, more phylogenetically clustered extinctions lead to larger losses of PD in corals, but there are notable exceptions; extinction of rare corals from distantly-related old and unique lineages can also result in substantial PD loss. Thus our results show that loss of PD in reef corals is dependent on both tree shape and the nature of extinction threats.
Collapse
Affiliation(s)
- Danwei Huang
- Department of Biological Sciences, National University of SingaporeSingapore, 117543
- Scripps Institution of Oceanography, University of California San DiegoLa Jolla, California, 92093
| | - Kaustuv Roy
- Section of Ecology, Behavior and Evolution, University of California San DiegoLa Jolla, California, 92093
| |
Collapse
|
12
|
Lyubetsky VA, Rubanov LI, Rusin LY, Gorbunov KY. Cubic time algorithms of amalgamating gene trees and building evolutionary scenarios. Biol Direct 2012; 7:48. [PMID: 23259766 PMCID: PMC3577452 DOI: 10.1186/1745-6150-7-48] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2012] [Accepted: 12/11/2012] [Indexed: 11/23/2022] Open
Abstract
BACKGROUND A long recognized problem is the inference of the supertree S that amalgamates a given set {G(j)} of trees G(j), with leaves in each G(j) being assigned homologous elements. We ground on an approach to find the tree S by minimizing the total cost of mappings α(j) of individual gene trees G(j) into S. Traditionally, this cost is defined basically as a sum of duplications and gaps in each α(j). The classical problem is to minimize the total cost, where S runs over the set of all trees that contain an exhaustive non-redundant set of species from all input G(j). RESULTS We suggest a reformulation of the classical NP-hard problem of building a supertree in terms of the global minimization of the same cost functional but only over species trees S that consist of clades belonging to a fixed set P (e.g., an exhaustive set of clades in all G(j)). We developed a deterministic solving algorithm with a low degree polynomial (typically cubic) time complexity with respect to the size of input data. We define an extensive set of elementary evolutionary events and suggest an original definition of mapping β of tree G into tree S. We introduce the cost functional c(G, S, f) and define the mapping β as the global minimum of this functional with respect to the variable f, in which sense it is a generalization of classical mapping α. We suggest a reformulation of the classical NP-hard mapping (reconciliation) problem by introducing time slices into the species tree S and present a cubic time solving algorithm to compute the mapping β. We introduce two novel definitions of the evolutionary scenario based on mapping β or a random process of gene evolution along a species tree. CONCLUSIONS Developed algorithms are mathematically proved, which justifies the following statements. The supertree building algorithm finds exactly the global minimum of the total cost if only gene duplications and losses are allowed and the given sets of gene trees satisfies a certain condition. The mapping algorithm finds exactly the minimal mapping β, the minimal total cost and the evolutionary scenario as a minimum over all possible distributions of elementary evolutionary events along the edges of tree S. The algorithms and their effective software implementations provide useful tools in many biological studies. They facilitate processing of voluminous tree data in acceptable time still largely avoiding heuristics. Performance of the tools is tested with artificial and prokaryotic tree data. REVIEWERS This article was reviewed by Prof. Anthony Almudevar, Prof. Alexander Bolshoy (nominated by Prof. Peter Olofsson), and Prof. Marek Kimmel.
Collapse
Affiliation(s)
- Vassily A Lyubetsky
- Institute for Information Transmission Problems, The Russian Academy of Sciences (Kharkevich Institute), Bolshoy Karetny per. 19, Moscow, 127994, Russia
| | - Lev I Rubanov
- Institute for Information Transmission Problems, The Russian Academy of Sciences (Kharkevich Institute), Bolshoy Karetny per. 19, Moscow, 127994, Russia
| | - Leonid Y Rusin
- Institute for Information Transmission Problems, The Russian Academy of Sciences (Kharkevich Institute), Bolshoy Karetny per. 19, Moscow, 127994, Russia
- Faculty of Biology, Moscow State University, Vorob’evy Gory 1/12, Moscow, 119991, Russia
| | - Konstantin Yu Gorbunov
- Institute for Information Transmission Problems, The Russian Academy of Sciences (Kharkevich Institute), Bolshoy Karetny per. 19, Moscow, 127994, Russia
| |
Collapse
|
13
|
Davis RB, Baldauf SL, Mayhew PJ. Many hexapod groups originated earlier and withstood extinction events better than previously realized: inferences from supertrees. Proc Biol Sci 2010; 277:1597-606. [PMID: 20129983 PMCID: PMC2871844 DOI: 10.1098/rspb.2009.2299] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2009] [Accepted: 01/11/2010] [Indexed: 11/12/2022] Open
Abstract
Comprising over half of all described species, the hexapods are central to understanding the evolution of global biodiversity. Direct fossil evidence suggests that new hexapod orders continued to originate from the Jurassic onwards, and diversity is presently higher than ever. Previous studies also suggest that several shifts in net diversification rate have occurred at higher taxonomic levels. However, their inferred timing is phylogeny dependent. We re-examine these issues using the supertree approach to provide, to our knowledge, the first composite estimates of hexapod order-level phylogeny. The Purvis matrix representation with parsimony method provides the most optimal supertree, but alternative methods are considered. Inferring ghost ranges shows richness of terminal lineages in the order-level phylogeny to peak just before the end-Permian extinction, rather than the present day, indicating that at least 11 more lineages survived this extinction than implied by fossils alone. The major upshift in diversification is associated with the origin of wings/wing folding and for the first time, to our knowledge, significant downshifts are shown associated with the origin of species-poor taxa (e.g. Neuropterida, Zoraptera). Polyneopteran phylogeny, especially the position of Zoraptera, remains important resolve because this influences findings regarding shifts in diversification. Our study shows how combining fossil with phylogenetic information can improve macroevolutionary inferences.
Collapse
Affiliation(s)
- Robert B Davis
- Department of Biology, University of York, Heslington, York, UK.
| | | | | |
Collapse
|
14
|
Zhaxybayeva O, Doolittle WF, Papke RT, Gogarten JP. Intertwined evolutionary histories of marine Synechococcus and Prochlorococcus marinus. Genome Biol Evol 2009; 1:325-39. [PMID: 20333202 PMCID: PMC2817427 DOI: 10.1093/gbe/evp032] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/28/2009] [Indexed: 02/04/2023] Open
Abstract
Prochlorococcus is a genus of marine cyanobacteria characterized by small cell and genome size, an evolutionary trend toward low GC content, the possession of chlorophyll b, and the absence of phycobilisomes. Whereas many shared derived characters define Prochlorococcus as a clade, many genome-based analyses recover them as paraphyletic, with some low-light adapted Prochlorococcus spp. grouping with marine Synechococcus. Here, we use 18 Prochlorococcus and marine Synechococcus genomes to analyze gene flow within and between these taxa. We introduce embedded quartet scatter plots as a tool to screen for genes whose phylogeny agrees or conflicts with the plurality phylogenetic signal, with accepted taxonomy and naming, with GC content, and with the ecological adaptation to high and low light intensities. We find that most gene families support high-light adapted Prochlorococcus spp. as a monophyletic clade and low-light adapted Prochlorococcus sp. as a paraphyletic group. But we also detect 16 gene families that were transferred between high-light adapted and low-light adapted Prochlorococcus sp. and 495 gene families, including 19 ribosomal proteins, that do not cluster designated Prochlorococcus and Synechococcus strains in the expected manner. To explain the observed data, we propose that frequent gene transfer between marine Synechococcus spp. and low-light adapted Prochlorococcus spp. has created a “highway of gene sharing” (Beiko RG, Harlow TJ, Ragan MA. 2005. Highways of gene sharing in prokaryotes. Proc Natl Acad Sci USA. 102:14332–14337) that tends to erode genus boundaries without erasing the Prochlorococcus-specific ecological adaptations.
Collapse
Affiliation(s)
- Olga Zhaxybayeva
- Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, Nova Scotia, Canada.
| | | | | | | |
Collapse
|
15
|
Ruta M, Pisani D, Lloyd GT, Benton MJ. A supertree of temnospondyli: cladogenetic patterns in the most species-rich group of early tetrapods. Proc Biol Sci 2007; 274:3087-95. [PMID: 17925278 PMCID: PMC2293949 DOI: 10.1098/rspb.2007.1250] [Citation(s) in RCA: 61] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2007] [Revised: 09/17/2007] [Accepted: 09/18/2007] [Indexed: 11/12/2022] Open
Abstract
As the most diverse group of early tetrapods, temnospondyls provide a unique opportunity to investigate cladogenetic patterns among basal limbed vertebrates. We present five species-level supertrees for temnospondyls, built using a variety of methods. The standard MRP majority rule consensus including minority components shows slightly greater resolution than other supertrees, and its shape matches well several currently accepted hypotheses of higher-level phylogeny for temnospondyls as a whole. Also, its node support is higher than those of other supertrees (except the combined standard plus Purvis MRP supertree). We explore the distribution of significant as well as informative changes (shifts) in branch splitting employing the standard MRP supertree as a reference, and discuss the temporal distribution of changes in time-sliced, pruned trees derived from this supertree. Also, we analyse those shifts that are most relevant to the end-Permian mass extinction. For the Palaeozoic, shifts occur almost invariably along branches that connect major Palaeozoic groups. By contrast, shifts in the Mesozoic occur predominantly within major groups. Numerous shifts bracket narrowly the end-Permian extinction, indicating not only rapid recovery and extensive diversification of temnospondyls over a short time period after the extinction event (possibly less than half a million years), but also the role of intense cladogenesis in the late part of the Permian (although this was counteracted by numerous 'background' extinctions).
Collapse
Affiliation(s)
- Marcello Ruta
- Department of Earth Sciences, University of Bristol, Wills Memorial Building, Queen's Road, Bristol BS8 1RJ, UK.
| | | | | | | |
Collapse
|
16
|
Abstract
Some species introduced into new geographical areas from their native ranges wreak ecological and economic havoc in their new environment. Although many studies have searched for either species or habitat characteristics that predict invasiveness of exotic species, the match between characteristics of the invader and those of members of the existing native community may be essential to understanding invasiveness. Here, we find that one metric, the phylogenetic relatedness of an invader to the native community, provides a predictive tool for invasiveness. Using a phylogenetic supertree of all grass species in California, we show that highly invasive grass species are, on average, significantly less related to native grasses than are introduced but noninvasive grasses. The match between the invader and the existing native community may explain why exotic pest species are not uniformly noxious in all novel habitats. Relatedness of invaders to the native biota may be one useful criterion for prioritizing management efforts of exotic species.
Collapse
Affiliation(s)
- Sharon Y Strauss
- Section of Evolution and Ecology, 2320 Storer Hall, University of California, Davis, CA 95616, USA.
| | | | | |
Collapse
|
17
|
Abstract
Because of the correlation expected between the phylogenetic relatedness of two taxa and their net ecological similarity, a measure of the overall phylogenetic relatedness of a community of interacting organisms can be used to investigate the contemporary ecological processes that structure community composition. I describe two indices that use the number of nodes that separate taxa on a phylogeny as a measure of their phylogenetic relatedness. As an example of the use of these indices in community analysis, I compared the mean observed net relatedness of trees (≥10 cm diameter at breast height) in each of 28 plots (each 0.16 ha) in a Bornean rain forest with the net relatedness expected if species were drawn randomly from the species pool (of the 324 species in the 28 plots), using a supertree that I assembled from published sources. I found that the species in plots were more phylogenetically related than expected by chance, a result that was insensitive to various modifications to the basic methodology. I tentatively infer that variation in habitat among plots causes ecologically more similar species to co-occur within plots. Finally, I suggest a range of applications for phylogenetic relatedness measures in community analysis.
Collapse
|
18
|
Abstract
Because of the correlation expected between the phylogenetic relatedness of two taxa and their net ecological similarity, a measure of the overall phylogenetic relatedness of a community of interacting organisms can be used to investigate the contemporary ecological processes that structure community composition. I describe two indices that use the number of nodes that separate taxa on a phylogeny as a measure of their phylogenetic relatedness. As an example of the use of these indices in community analysis, I compared the mean observed net relatedness of trees (≥10 cm diameter at breast height) in each of 28 plots (each 0.16 ha) in a Bornean rain forest with the net relatedness expected if species were drawn randomly from the species pool (of the 324 species in the 28 plots), using a supertree that I assembled from published sources. I found that the species in plots were more phylogenetically related than expected by chance, a result that was insensitive to various modifications to the basic methodology. I tentatively infer that variation in habitat among plots causes ecologically more similar species to co-occur within plots. Finally, I suggest a range of applications for phylogenetic relatedness measures in community analysis.
Collapse
|