1
Shedding light: a phylotranscriptomic perspective illuminates the origin of photosymbiosis in marine bivalves. BMC Evol Biol 2020; 20:50. [PMID: 32357841 PMCID: PMC7195748 DOI: 10.1186/s12862-020-01614-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Received: 09/25/2019] [Accepted: 04/15/2020] [Indexed: 11/10/2022]
Abstract
BACKGROUND Photosymbiotic associations between metazoan hosts and photosynthetic dinoflagellates are crucial to the trophic and structural integrity of many marine ecosystems, including coral reefs. Although extensive efforts have been devoted to study the short-term ecological interactions between coral hosts and their symbionts, long-term evolutionary dynamics of photosymbiosis in many marine animals are not well understood. Within Bivalvia, the second largest class of mollusks, obligate photosymbiosis is found in two marine lineages: the giant clams (subfamily Tridacninae) and the heart cockles (subfamily Fraginae), both in the family Cardiidae. Morphologically, giant clams show relatively conservative shell forms whereas photosymbiotic fragines exhibit a diverse suite of anatomical adaptations including flattened shells, leafy mantle extensions, and lens-like microstructural structures. To date, the phylogenetic relationships between these two subfamilies remain poorly resolved, and it is unclear whether photosymbiosis in cardiids originated once or twice.
RESULTS In this study, we establish a backbone phylogeny for Cardiidae utilizing RNASeq-based transcriptomic data from Tridacninae, Fraginae and other cardiids. A variety of phylogenomic approaches were used to infer the relationship between the two groups. Our analyses found conflicting gene signals and potential rapid divergence among the lineages. Overall, results support a sister group relationship between Tridacninae and Fraginae, which diverged during the Cretaceous. Although a sister group relationship is recovered, ancestral state reconstruction using maximum likelihood-based methods reveals two independent origins of photosymbiosis, one at the base of Tridacninae and the other within a symbiotic Fraginae clade.
CONCLUSIONS The newly revealed common ancestry between Tridacninae and Fraginae brings a possibility that certain genetic, metabolic, and/or anatomical exaptations existed in their last common ancestor, which promoted both lineages to independently establish photosymbiosis, possibly in response to the modern expansion of reef habitats.
2
Lemer S, Bieler R, Giribet G. Resolving the relationships of clams and cockles: dense transcriptome sampling drastically improves the bivalve tree of life. Proc Biol Sci 2020; 286:20182684. [PMID: 30963927 PMCID: PMC6408618 DOI: 10.1098/rspb.2018.2684] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Indexed: 12/13/2022]
Abstract
Bivalvia has been the subject of extensive recent phylogenetic work to attempt resolving either the backbone of the bivalve tree using transcriptomic data, or the tips using morpho-anatomical data and up to five genetic markers. Yet the first approach lacked decisive taxon sampling and the second failed to resolve many interfamilial relationships, especially within the diverse clade Imparidentia. Here we combine dense taxon sampling with 108 deep-sequenced Illumina-based transcriptomes to provide resolution in nodes that required additional study. We designed specific data matrices to address the poorly resolved relationships within Imparidentia. Our results support the overall backbone of the bivalve tree, the monophyly of Bivalvia and all its main nodes, although the monophyly of Protobranchia remains less clear. Likewise, the inter-relationships of the six main bivalve clades were fully supported. Within Imparidentia, resolution increases when analysing Imparidentia-specific matrices. Lucinidae, Thyasiridae and Gastrochaenida represent three early branches. Gastrochaenida is sister group to all remaining imparidentians, which divide into six orders. Neoheterodontei is always fully supported, and consists of Sphaeriida, Myida and Venerida, with the latter now also containing Mactroidea, Ungulinoidea and Chamidae, a family particularly difficult to place in earlier work. Overall, our study, by using densely sampled transcriptomes, provides the best-resolved bivalve phylogeny to date.
Affiliation(s)
- Sarah Lemer
  - University of Guam Marine Laboratory, 303 University Drive, UOG Station, Mangilao, GU 96923, USA
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
- Rüdiger Bieler
  - Integrative Research Center, Field Museum of Natural History, 1400 South Lake Shore Drive, Chicago, IL 60605, USA
- Gonzalo Giribet
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
3
Perron U, Kozlov AM, Stamatakis A, Goldman N, Moal IH. Modeling Structural Constraints on Protein Evolution via Side-Chain Conformational States. Mol Biol Evol 2020; 36:2086-2103. [PMID: 31114882 PMCID: PMC6736381 DOI: 10.1093/molbev/msz122] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Indexed: 12/16/2022]
Abstract
Few models of sequence evolution incorporate parameters describing protein structure, despite its high conservation, essential functional role and increasing availability. We present a structurally aware empirical substitution model for amino acid sequence evolution in which proteins are expressed using an expanded alphabet that relays both amino acid identity and structural information. Each character specifies an amino acid as well as information about the rotamer configuration of its side-chain: the discrete geometric pattern of permitted side-chain atomic positions, as defined by the dihedral angles between covalently linked atoms. By assigning rotamer states in 251,194 protein structures and identifying 4,508,390 substitutions between closely related sequences, we generate a 55-state “Dayhoff-like” model that shows that the evolutionary properties of amino acids depend strongly upon side-chain geometry. The model performs as well as or better than traditional 20-state models for divergence time estimation, tree inference, and ancestral state reconstruction. We conclude that not only is rotamer configuration a valuable source of information for phylogenetic studies, but that modeling the concomitant evolution of sequence and structure may have important implications for understanding protein folding and function.
Affiliation(s)
- Umberto Perron
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, United Kingdom
- Alexey M Kozlov
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
- Alexandros Stamatakis
  - Computational Molecular Evolution Group, Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
  - Institute for Theoretical Informatics, Karlsruhe Institute of Technology, Karlsruhe, Germany
- Nick Goldman
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, United Kingdom
- Iain H Moal
  - European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, United Kingdom
  - Computational and Modelling Sciences, GlaxoSmithKline Research and Development, Stevenage, United Kingdom
4
Claywell BC, Dinh V, Fourment M, McCoy CO, Matsen IV FA. A Surrogate Function for One-Dimensional Phylogenetic Likelihoods. Mol Biol Evol 2019; 35:242-246. [PMID: 29029199 DOI: 10.1093/molbev/msx253] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Indexed: 11/14/2022]
Abstract
Phylogenetics has seen a steady increase in data set size and substitution model complexity, which require increasing amounts of computational power to compute likelihoods. This motivates strategies to approximate the likelihood functions for branch length optimization and Bayesian sampling. In this article, we develop an approximation to the 1D likelihood function as parametrized by a single branch length. Our method uses a four-parameter surrogate function abstracted from the simplest phylogenetic likelihood function, the binary symmetric model. We show that it offers a surrogate that can be fit over a variety of branch lengths, that it is applicable to a wide variety of models and trees, and that it can be used effectively as a proposal mechanism for Bayesian sampling. The method is implemented as a stand-alone open-source C library for calling from phylogenetics algorithms; it has proven essential for good performance of our online phylogenetic algorithm sts.
Affiliation(s)
- Brian C Claywell
  - Program in Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
- Vu Dinh
  - Department of Mathematical Sciences, University of Delaware, Newark, DE
- Mathieu Fourment
  - ithree Institute, University of Technology Sydney, Ultimo, NSW, Australia
- Connor O McCoy
  - Program in Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
- Frederick A Matsen IV
  - Program in Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, WA
5
Levinstein Hallak K, Tzur S, Rosset S. Big data analysis of human mitochondrial DNA substitution models: a regression approach. BMC Genomics 2018; 19:759. [PMID: 30340456 PMCID: PMC6195736 DOI: 10.1186/s12864-018-5123-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 06/21/2018] [Accepted: 09/27/2018] [Indexed: 11/10/2022]
Abstract
BACKGROUND We study Phylotree, a comprehensive representation of the phylogeny of global human mitochondrial DNA (mtDNA) variations, to better understand the mtDNA substitution mechanism and its most influential factors. We consider a substitution model, where a set of genetic features may predict the rate at which mtDNA substitutions occur. To find an appropriate model, an exhaustive analysis on the effect of multiple factors on the substitution rate is performed through Negative Binomial and Poisson regressions. We examine three different inclusion options for each categorical factor: omission, inclusion as an explanatory variable, and by-value partitioning. The examined factors include genes, codon position, a CpG indicator, directionality, nucleotide, amino acid, codon, and context (neighboring nucleotides), in addition to other site based factors. Partitioning a model by a factor's value results in several sub-models (one for each value), where the likelihoods of the sub-models can be combined to form a score for the entire model. Eventually, the leading models are considered as viable candidates for explaining mtDNA substitution rates.
RESULTS Initially, we introduce a novel clustering technique on genes, based on three similarity tests between pairs of genes, supporting previous results regarding gene functionalities in the mtDNA. These clusters are then used as a factor in our models. We present leading models for the protein coding genes, rRNA and tRNA genes and the control region, showing it is disadvantageous to separate the models of transitions/transversions, or synonymous/non-synonymous substitutions. We identify a context effect that cannot be attributed solely to protein level constraints or CpG pairs. For protein-coding genes, we show that the substitution model should be partitioned into sub-models according to the codon position and input codon; additionally we confirm that gene identity and cluster have no significant effect once the above factors are accounted for.
CONCLUSIONS We leverage the large, high-confidence Phylotree mtDNA phylogeny to develop a new statistical approach. We model the substitution rates using regressions, allowing consideration of many factors simultaneously. This admits the use of model selection tools helping to identify the set of factors best explaining the mutational dynamics when considered in tandem.
Affiliation(s)
- Keren Levinstein Hallak
  - Department of Statistics and Operations Research, School of Mathematical Sciences, Tel-Aviv University, 6997801, Tel-Aviv, Israel
- Shay Tzur
  - Braun School of Public Health and Community Medicine, The Hebrew University of Jerusalem, 9112102, Jerusalem, Israel
- Saharon Rosset
  - Department of Statistics and Operations Research, School of Mathematical Sciences, Tel-Aviv University, 6997801, Tel-Aviv, Israel
6
Schwentner M, Richter S, Rogers DC, Giribet G. Tetraconatan phylogeny with special focus on Malacostraca and Branchiopoda: highlighting the strength of taxon-specific matrices in phylogenomics. Proc Biol Sci 2018; 285:20181524. [PMID: 30135168 PMCID: PMC6125901 DOI: 10.1098/rspb.2018.1524] [Citation(s) in RCA: 52] [Impact Index Per Article: 7.4] [Received: 07/05/2018] [Accepted: 07/18/2018] [Indexed: 01/12/2023]
Abstract
Understanding the evolution of Tetraconata or Pancrustacea (the clade that includes crustaceans and insects) requires a well-resolved hypothesis regarding the relationships within and among its constituent taxa. Here, we assembled a taxon-rich phylogenomic dataset focusing on crustacean lineages based solely on genomes and new-generation Illumina-generated transcriptomes, including 89 representatives of Tetraconata. This constitutes, to our knowledge, the first phylogenomic study specifically addressing internal relationships of Malacostraca (with 26 species included) and Branchiopoda (36 species). Seven matrices comprising 81-684 orthogroups and 17 690-242 530 amino acid positions were assembled and analysed under five different analytical approaches. To maximize gene occupancy and to improve resolution, taxon-specific matrices were designed for Malacostraca and Branchiopoda. Key tetraconatan taxa (i.e. Oligostraca, Multicrustacea, Branchiopoda, Malacostraca, Thecostraca, Copepoda and Hexapoda) were monophyletic and well supported. Within Branchiopoda, Phyllopoda, Diplostraca, Cladoceromorpha and Cladocera were monophyletic. Within Malacostraca, the clades Eumalacostraca, Decapoda and Reptantia were well supported. Recovery of Caridoida or Peracarida was highly dependent on the analysis for the complete matrix, but it was consistently monophyletic in the malacostracan-specific matrices. From such examples, we demonstrate that taxon-specific matrices and particular evolutionary models and analytical methods, namely CAT-GTR and Dayhoff recoding, outperform other approaches in resolving certain recalcitrant nodes in phylogenomic analyses.
Affiliation(s)
- Martin Schwentner
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
  - Centrum of Natural History, Universität Hamburg, Martin-Luther-King-Platz 3, 20146 Hamburg, Germany
- Stefan Richter
  - Allgemeine und Spezielle Zoologie, Universität Rostock, Universitätsplatz 2, 18055 Rostock, Germany
- D Christopher Rogers
  - Kansas Biological Survey, Kansas University, Higuchi Hall, 2101 Constant Avenue, Lawrence, KS 66047, USA
- Gonzalo Giribet
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
7
Lemer S, González VL, Bieler R, Giribet G. Cementing mussels to oysters in the pteriomorphian tree: a phylogenomic approach. Proc Biol Sci 2017; 283:rspb.2016.0857. [PMID: 27358369 PMCID: PMC4936043 DOI: 10.1098/rspb.2016.0857] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Received: 04/15/2016] [Accepted: 06/03/2016] [Indexed: 11/12/2022]
Abstract
Mussels (Mytilida) are a group of bivalves with ancient origins and some of the most important commercial shellfish worldwide. Mytilida consists of approximately 400 species found in various littoral and deep-sea environments, and are part of the higher clade Pteriomorphia, but their exact position within the group has been unstable. The multiple adaptive radiations that occurred within Pteriomorphia have rendered phylogenetic classifications difficult and uncertainty remains regarding the relationships among most families. To address this phylogenetic uncertainty, novel transcriptomic data were generated to include all five orders of Pteriomorphia. Our results, derived from complex analyses of large datasets from 41 transcriptomes and evaluating possible pitfalls affecting phylogenetic reconstruction (matrix occupancy, heterogeneity, evolutionary rates, evolutionary models), consistently recover a well-supported phylogeny of Pteriomorphia, with the only exception of the most complete but smallest data matrix (Matrix 3: 51 genes, 90% gene occupancy). Maximum-likelihood and Bayesian mixture model analyses retrieve strong support for: (i) the monophyly of Pteriomorphia, (ii) Mytilida as a sister group to Ostreida, and (iii) Arcida as sister group to all other pteriomorphians. The basal position of Arcida is congruent with its shell microstructure (solely composed of aragonitic crystals), whereas Mytilida and Ostreida display a combination of a calcitic outer layer with an aragonitic inner layer composed of nacre tablets, the latter being secondarily lost in Ostreoidea.
Affiliation(s)
- Sarah Lemer
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
- Vanessa L González
  - Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20013, USA
- Rüdiger Bieler
  - Integrative Research Center, Field Museum of Natural History, 1400 South Lake Shore Drive, Chicago, IL 60605, USA
- Gonzalo Giribet
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
8
Schwentner M, Combosch DJ, Pakes Nelson J, Giribet G. A Phylogenomic Solution to the Origin of Insects by Resolving Crustacean-Hexapod Relationships. Curr Biol 2017; 27:1818-1824.e5. [DOI: 10.1016/j.cub.2017.05.040] [Citation(s) in RCA: 87] [Impact Index Per Article: 10.9] [Received: 12/26/2016] [Revised: 04/10/2017] [Accepted: 05/10/2017] [Indexed: 12/11/2022]
9
Fernández R, Sharma PP, Tourinho AL, Giribet G. The Opiliones tree of life: shedding light on harvestmen relationships through transcriptomics. Proc Biol Sci 2017; 284:20162340. [PMID: 28228511 PMCID: PMC5326524 DOI: 10.1098/rspb.2016.2340] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Received: 10/25/2016] [Accepted: 01/27/2017] [Indexed: 12/12/2022]
Abstract
Opiliones are iconic arachnids with a Palaeozoic origin and a diversity that reflects ancient biogeographic patterns dating back at least to the times of Pangea. Owing to interest in harvestman diversity, evolution and biogeography, their relationships have been thoroughly studied using morphology and PCR-based Sanger approaches to infer their systematic relationships. More recently, two studies utilized transcriptomics-based phylogenomics to explore their basal relationships and diversification, but sampling was limiting for understanding deep evolutionary patterns, as they lacked good taxon representation at the family level. Here, we analysed a set of the 14 existing transcriptomes with 40 additional ones generated for this study, representing approximately 80% of the extant familial diversity in Opiliones. Our phylogenetic analyses, including a set of data matrices with different gene occupancy and evolutionary rates, and using a multitude of methods correcting for a diversity of factors affecting phylogenomic data matrices, provide a robust and stable Opiliones tree of life, where most families and higher taxa are precisely placed. Our dating analyses using alternative calibration points, methods and analytical parameters provide well-resolved old divergences, consistent with ancient regionalization in Pangea in some groups, and Pangean vicariance in others. The integration of state-of-the-art molecular techniques and analyses, together with the broadest taxonomic sampling to date presented in a phylogenomic study of harvestmen, provide new insights into harvestmen interrelationships, as well as an overview of the general biogeographic patterns of this ancient arthropod group.
Affiliation(s)
- Rosa Fernández
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
- Prashant P Sharma
  - Department of Zoology, University of Wisconsin-Madison, 352 Birge Hall, 430 Lincoln Drive, Madison, WI 53706, USA
- Ana Lúcia Tourinho
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
  - Instituto Nacional de Pesquisas da Amazônia, Coordenação de Biodiversidade (CBIO), Avenida André Araújo, 2936, Aleixo, CEP 69011-970, Manaus, Amazonas, Brazil
- Gonzalo Giribet
  - Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
10
Fernández R, Edgecombe GD, Giribet G. Exploring Phylogenetic Relationships within Myriapoda and the Effects of Matrix Composition and Occupancy on Phylogenomic Reconstruction. Syst Biol 2016; 65:871-89. [PMID: 27162151 PMCID: PMC4997009 DOI: 10.1093/sysbio/syw041] [Citation(s) in RCA: 82] [Impact Index Per Article: 9.1] [Received: 12/06/2015] [Accepted: 04/28/2016] [Indexed: 11/14/2022]
Abstract
Myriapods, including the diverse and familiar centipedes and millipedes, are one of the dominant terrestrial arthropod groups. Although molecular evidence has shown that Myriapoda is monophyletic, its internal phylogeny remains contentious and understudied, especially when compared to those of Chelicerata and Hexapoda. Until now, efforts have focused on taxon sampling (e.g., by including a handful of genes from many species) or on maximizing matrix size (e.g., by including hundreds or thousands of genes in just a few species), but a phylogeny maximizing sampling at both levels remains elusive. In this study, we analyzed 40 Illumina transcriptomes representing 3 of the 4 myriapod classes (Diplopoda, Chilopoda, and Symphyla); 25 transcriptomes were newly sequenced to maximize representation at the ordinal level in Diplopoda and at the family level in Chilopoda. Ten supermatrices were constructed to explore the effect of several potential phylogenetic biases (e.g., rate of evolution, heterotachy) at 3 levels of gene occupancy per taxon (50%, 75%, and 90%). Analyses based on maximum likelihood and Bayesian mixture models retrieved monophyly of each myriapod class, and resulted in 2 alternative phylogenetic positions for Symphyla, as sister group to Diplopoda + Chilopoda, or closer to Diplopoda, the latter hypothesis having been traditionally supported by morphology. Within centipedes, all orders were well supported, but 2 deep nodes remained in conflict in the different analyses despite dense taxon sampling at the family level. Relationships among centipede orders in all analyses conducted with the most complete matrix (90% occupancy) are at odds not only with the sparser but more gene-rich supermatrices (75% and 50% supermatrices) and with the matrices optimizing phylogenetic informativeness or most conserved genes, but also with previous hypotheses based on morphology, development, or other molecular data sets. 
Our results indicate that a high percentage of ribosomal proteins in the most complete matrices, in conjunction with distance from the root, can act in concert to compromise the estimated relationships within the ingroup. We discuss the implications of these findings in the context of the ever more prevalent quest for completeness in phylogenomic studies.
Affiliation(s)
- Rosa Fernández
  - Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
- Gregory D Edgecombe
  - Department of Earth Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
- Gonzalo Giribet
  - Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
11
Ješovnik A, González VL, Schultz TR. Phylogenomics and Divergence Dating of Fungus-Farming Ants (Hymenoptera: Formicidae) of the Genera Sericomyrmex and Apterostigma. PLoS One 2016; 11:e0151059. [PMID: 27466804 PMCID: PMC4965065 DOI: 10.1371/journal.pone.0151059] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Received: 11/22/2015] [Accepted: 02/22/2016] [Indexed: 01/27/2023]
Abstract
Fungus-farming ("attine") ants are model systems for studies of symbiosis, coevolution, and advanced eusociality. A New World clade of nearly 300 species in 15 genera, all attine ants cultivate fungal symbionts for food. In order to better understand the evolution of ant agriculture, we sequenced, assembled, and analyzed transcriptomes of four different attine ant species in two genera: three species in the higher-attine genus Sericomyrmex and a single lower-attine ant species, Apterostigma megacephala, representing the first genomic data for either genus. These data were combined with published genomes of nine other ant species and the honey bee Apis mellifera for phylogenomic and divergence-dating analyses. The resulting phylogeny confirms relationships inferred in previous studies of fungus-farming ants. Divergence-dating analyses recovered slightly older dates than most prior analyses, estimating that attine ants originated 53.6-66.7 million years ago, and recovered a very long branch subtending a very recent, rapid radiation of the genus Sericomyrmex. This result is further confirmed by a separate analysis of the three Sericomyrmex species, which reveals that 92.71% of orthologs have 99%-100% pairwise-identical nucleotide sequences. We searched the transcriptomes for genes of interest, most importantly argininosuccinate synthase and argininosuccinate lyase, which are functional in other ants but which are known to have been lost in seven previously studied attine ant species. Loss of the ability to produce the amino acid arginine has been hypothesized to contribute to the obligate dependence of attine ants upon their cultivated fungi, but the point in fungus-farming ant evolution at which these losses occurred has remained unknown. We did not find these genes in any of the sequenced transcriptomes. Although expected for Sericomyrmex species, the absence of arginine anabolic genes in the lower-attine ant Apterostigma megacephala strongly suggests that the loss coincided with the origin of attine ants.
Affiliation(s)
- Ana Ješovnik
  - Entomology Department, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, United States of America
  - Maryland Center for Systematic Entomology, Department of Entomology, University of Maryland, College Park, Maryland, United States of America
- Vanessa L. González
  - Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, United States of America
- Ted R. Schultz
  - Entomology Department, National Museum of Natural History, Smithsonian Institution, Washington, District of Columbia, United States of America
12
Arenas M. Trends in substitution models of molecular evolution. Front Genet 2015; 6:319. [PMID: 26579193 PMCID: PMC4620419 DOI: 10.3389/fgene.2015.00319] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Received: 07/27/2015] [Accepted: 10/09/2015] [Indexed: 11/13/2022]
Abstract
Substitution models of evolution describe the process of genetic variation through fixed mutations and constitute the basis of evolutionary analysis at the molecular level. Almost 40 years after the development of the first substitution models, highly sophisticated and data-specific substitution models continue to emerge with the aim of better mimicking real evolutionary processes. Here I describe current trends in substitution models of DNA, codon and amino acid sequence evolution, including advantages and pitfalls of the most popular models. The perspective concludes that despite the large number of currently available substitution models, further research is required for more realistic modeling, especially for DNA coding and amino acid data. Additionally, the development of more accurate complex models should be coupled with new implementations and improvements of methods and frameworks for substitution model selection and downstream evolutionary analysis.
Affiliation(s)
- Miguel Arenas
  - Institute of Molecular Pathology and Immunology of the University of Porto, Porto, Portugal
13
Fernández R, Giribet G. Unnoticed in the tropics: phylogenomic resolution of the poorly known arachnid order Ricinulei (Arachnida). Royal Society Open Science 2015; 2:150065. [PMID: 26543583 PMCID: PMC4632547 DOI: 10.1098/rsos.150065] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Received: 02/09/2015] [Accepted: 05/29/2015] [Indexed: 06/05/2023]
Abstract
Ricinulei are among the most obscure and cryptic arachnid orders, constituting a micro-diverse group with extreme endemism. The 76 extant species described to date are grouped in three genera: Ricinoides, from tropical Western and Central Africa, and the two Neotropical genera Cryptocellus and Pseudocellus. Until now, a single molecular phylogeny of Ricinulei has been published, recovering the African Ricinoides as the sister group of the American Pseudocellus and providing evidence for the diversification of the order pre-dating the fragmentation of Gondwana. Here, we present, to our knowledge, the first phylogenomic study of this neglected arachnid order based on data from five transcriptomes obtained from the five major mitochondrial lineages of Ricinulei. Our results, based on up to more than 2000 genes, strongly support a clade containing Pseudocellus and Cryptocellus, constituting the American group of Ricinulei, with the African Ricinoides nesting outside. Our dating of the diversification of the African and American clades using a 76 gene data matrix with 90% gene occupancy indicates that this arachnid lineage was distributed in the South American, North American and African plates of Gondwana and that its diversification is concordant with a biogeographic scenario (both for pattern and tempo) of Gondwanan vicariance.
Affiliation(s)
- Rosa Fernández
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
|
14
|
Averianov AO, Lopatin AV. High-level systematics of placental mammals: Current status of the problem. BIOL BULL+ 2014. [DOI: 10.1134/s1062359014090039] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Indexed: 11/22/2022]
|
15
|
Lemer S, Kawauchi GY, Andrade SCS, González VL, Boyle MJ, Giribet G. Re-evaluating the phylogeny of Sipuncula through transcriptomics. Mol Phylogenet Evol 2014; 83:174-83. [PMID: 25450098 DOI: 10.1016/j.ympev.2014.10.019] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Received: 07/21/2014] [Revised: 10/17/2014] [Accepted: 10/23/2014] [Indexed: 01/28/2023]
Abstract
Sipunculans (also known as peanut worms) are an ancient group of exclusively marine worms with a global distribution and a fossil record that dates back to the Early Cambrian. The systematics of sipunculans, now considered a distinct subclade of Annelida, has been studied for decades using morphological and molecular characters, and has reached the limits of Sanger-based approaches. Here, we reevaluate their family-level phylogeny by comparative transcriptomic analysis of eight species representing all known families within Sipuncula. Two data matrices with alternative gene occupancy levels (large matrix with 675 genes and 62% missing data; reduced matrix with 141 genes and 23% missing data) were analysed using concatenation and gene-tree methods, yielding congruent results and resolving each internal node with maximum support. We thus corroborate prior phylogenetic work based on molecular data, resolve outstanding issues with respect to the familial relationships of Aspidosiphonidae, Antillesomatidae and Phascolosomatidae, and highlight the next area of focus for sipunculan systematics.
Affiliation(s)
- Sarah Lemer
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
- Gisele Y Kawauchi
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; CEBIMar, Universidade de São Paulo, Praia do Cabelo Gordo, São Sebastião, São Paulo, Brazil
- Sónia C S Andrade
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Departamento de Zootecnia, ESALQ-USP, Piracicaba, São Paulo, Brazil
- Vanessa L González
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA; Department of Invertebrate Zoology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20013, USA
- Michael J Boyle
- Smithsonian Tropical Research Institute (STRI), Naos Marine Laboratories, Panama 0843/03092, Panama
- Gonzalo Giribet
- Museum of Comparative Zoology, Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford Street, Cambridge, MA 02138, USA
|
16
|
Fernández R, Hormiga G, Giribet G. Phylogenomic analysis of spiders reveals nonmonophyly of orb weavers. Curr Biol 2014; 24:1772-7. [PMID: 25042584 DOI: 10.1016/j.cub.2014.06.035] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Received: 05/01/2014] [Revised: 06/12/2014] [Accepted: 06/12/2014] [Indexed: 01/25/2023]
Abstract
Spiders constitute one of the most successful clades of terrestrial predators. Their extraordinary diversity, paralleled only by some insects and mites, is often attributed to the use of silk, and, in one of the largest lineages, to stereotyped behaviors for building foraging webs of remarkable biomechanical properties. However, our understanding of higher-level spider relationships is poor and is largely based on morphology. Prior molecular efforts have focused on a handful of genes but have provided little resolution to key questions such as the origin of the orb weavers. We apply a next-generation sequencing approach to resolve spider phylogeny, examining the relationships among its major lineages. We further explore possible pitfalls in phylogenomic reconstruction, including missing data, unequal rates of evolution, and others. Analyses of multiple data sets all agree on the basic structure of the spider tree and all reject the long-accepted monophyly of Orbiculariae, by placing the cribellate orb weavers (Deinopoidea) with other groups and not with the ecribellate orb weavers (Araneoidea). These results imply independent origins for the two types of orb webs (cribellate and ecribellate) or a much more ancestral origin of the orb web with subsequent loss in the so-called RTA clade. Either alternative demands a major reevaluation of our current understanding of the spider evolutionary chronicle.
Affiliation(s)
- Rosa Fernández
- Museum of Comparative Zoology and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
- Gustavo Hormiga
- Department of Biological Sciences, The George Washington University, Washington, D.C. 20052, USA
- Gonzalo Giribet
- Museum of Comparative Zoology and Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA 02138, USA
|
17
|
Fernández R, Laumer CE, Vahtera V, Libro S, Kaluziak S, Sharma PP, Pérez-Porro AR, Edgecombe GD, Giribet G. Evaluating topological conflict in centipede phylogeny using transcriptomic data sets. Mol Biol Evol 2014; 31:1500-13. [PMID: 24674821 DOI: 10.1093/molbev/msu108] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Indexed: 12/11/2022] Open
Abstract
Relationships between the five extant orders of centipedes have been considered solved based on morphology. Phylogenies based on samples of up to a few dozen genes have largely been congruent with the morphological tree apart from an alternative placement of one order, the relictual Craterostigmomorpha, consisting of two species in Tasmania and New Zealand. To address this incongruence, novel transcriptomic data were generated to sample all five orders of centipedes and also used as a test case for studying gene-tree incongruence. Maximum likelihood and Bayesian mixture model analyses of a data set composed of 1,934 orthologs with 45% missing data, as well as the 389 orthologs in the least saturated, stationary quartile, retrieve strong support for a sister-group relationship between Craterostigmomorpha and all other pleurostigmophoran centipedes, of which the latter group is newly named Amalpighiata. The Amalpighiata hypothesis, which shows little gene-tree incongruence and is robust to the influence of among-taxon compositional heterogeneity, implies convergent evolution in several morphological and behavioral characters traditionally used in centipede phylogenetics, such as maternal brood care, but accords with patterns of first appearances in the fossil record.
Affiliation(s)
- Rosa Fernández
- Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA
- Christopher E Laumer
- Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA
- Varpu Vahtera
- Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA; Zoological Museum, Department of Biology, University of Turku, Turku, Finland
- Silvia Libro
- Marine Science Center, Northeastern University, Nahant, MA
- Prashant P Sharma
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY
- Alicia R Pérez-Porro
- Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA; Centre d'Estudis Avançats de Blanes (CEAB-CSIC), Catalonia, Spain
- Gregory D Edgecombe
- Department of Earth Sciences, The Natural History Museum, London, United Kingdom
- Gonzalo Giribet
- Museum of Comparative Zoology & Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, MA
|
18
|
Miyazawa S. Superiority of a mechanistic codon substitution model even for protein sequences in phylogenetic analysis. BMC Evol Biol 2013; 13:257. [PMID: 24256155 PMCID: PMC4225520 DOI: 10.1186/1471-2148-13-257] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Received: 09/21/2013] [Accepted: 11/14/2013] [Indexed: 11/25/2022] Open
Abstract
Background Nucleotide and amino acid substitution tendencies are characteristic of each species, organelle, and protein family. Hence, various empirical amino acid substitution rate matrices have needed to be estimated for phylogenetic analysis: JTT, WAG, and LG for nuclear proteins, mtREV for mitochondrial proteins, cpREV10 and cpREV64 for chloroplast-encoded proteins, and FLU for influenza proteins. On the other hand, in a mechanistic codon substitution model, in which each codon substitution rate is proportional to the product of a codon mutation rate and the ratio of fixation depending on the type of amino acid replacement, mutation rates and the strength of selective constraint on amino acids can be tailored to each protein family with additional 11 parameters. As a result, in the evolutionary analysis of codon sequences it outperforms codon substitution models equivalent to empirical amino acid substitution matrices. Is it superior even for amino acid sequences, among which synonymous substitutions cannot be identified? Results Nucleotide mutations are assumed to occur independently of codon positions but multiple nucleotide changes in infinitesimal time are allowed. Selective constraints on the respective types of amino acid replacements are tailored to each gene with a linear function of a given estimate of selective constraints, which were estimated by maximizing the likelihood of an empirical amino acid or codon substitution frequency matrix, each of JTT, WAG, LG, and KHG. It is shown that the mechanistic codon substitution model with the assumption of equal codon usage yields better values of Akaike and Bayesian information criteria for all three phylogenetic trees of mitochondrial, chloroplast, and influenza-A hemagglutinin proteins than the empirical amino acid substitution models with mtREV, cpREV64, and FLU, which were designed specifically for those protein families, respectively. 
The variation of selective constraint across sites fits the datasets significantly better than variable codon mutation rates, confirming that substitution rate variations across sites detected by amino acid substitution models are caused primarily by the variation of selective constraint against amino acid substitutions rather than the variation of codon mutation rate. Conclusions The mechanistic codon substitution model is superior to amino acid substitution models even in the evolutionary analysis of protein sequences.
|