1
|
Cheung K, Rollins LA, Hammond JM, Barton K, Ferguson JM, Eyck HJF, Shine R, Edwards RJ. Repeat-Rich Regions Cause False-Positive Detection of NUMTs: A Case Study in Amphibians Using an Improved Cane Toad Reference Genome. Genome Biol Evol 2024; 16:evae246. [PMID: 39548850 PMCID: PMC11606642 DOI: 10.1093/gbe/evae246] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2024] [Revised: 10/08/2024] [Accepted: 11/04/2024] [Indexed: 11/18/2024] Open
Abstract
Mitochondrial DNA (mtDNA) has been widely used in genetics research for decades. Contamination from nuclear DNA of mitochondrial origin (NUMTs) can confound studies of phylogenetic relationships and mtDNA heteroplasmy. Homology searches with mtDNA are widely used to detect NUMTs in the nuclear genome. Nevertheless, false-positive detection of NUMTs is common when handling repeat-rich sequences, while fragmented genomes might result in missing true NUMTs. In this study, we investigated different NUMT detection methods and how the quality of the genome assembly affects them. We presented an improved nuclear genome assembly (aRhiMar1.3) of the invasive cane toad (Rhinella marina) with additional long-read Nanopore and 10× linked-read sequencing. The final assembly was 3.47 Gb in length with 91.3% of tetrapod universal single-copy orthologs (n = 5,310), indicating the gene-containing regions were well assembled. We used 3 complementary methods (NUMTFinder, dinumt, and PALMER) to study the NUMT landscape of the cane toad genome. All 3 methods yielded consistent results, showing very few NUMTs in the cane toad genome. Furthermore, we expanded NUMT detection analyses to other amphibians and confirmed a weak relationship between genome size and the number of NUMTs present in the nuclear genome. Amphibians are repeat-rich, and we show that the number of NUMTs found in highly repetitive genomes is prone to inflation when using homology-based detection without filters. Together, this study provides an exemplar of how to robustly identify NUMTs in complex genomes when confounding effects on mtDNA analyses are a concern.
Collapse
Affiliation(s)
- Kelton Cheung
- Evolution & Ecology Research Centre, School of Biological, Earth & Environmental Sciences, University of New South Wales, Sydney, Australia
- Evolution & Ecology Research Centre, School of Biotechnology & Biomolecular Sciences, University of New South Wales, Sydney, Australia
| | - Lee Ann Rollins
- Evolution & Ecology Research Centre, School of Biological, Earth & Environmental Sciences, University of New South Wales, Sydney, Australia
| | - Jillian M Hammond
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Kirston Barton
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | - James M Ferguson
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Darlinghurst, New South Wales, Australia
| | - Harrison J F Eyck
- National Collections and Marine Infrastructure, CSIRO, Canberra, Australian Capital Territory, Australia
| | - Richard Shine
- School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
| | - Richard J Edwards
- Evolution & Ecology Research Centre, School of Biotechnology & Biomolecular Sciences, University of New South Wales, Sydney, Australia
- Minderoo OceanOmics Centre at UWA, Oceans Institute, The University of Western Australia, Western Australia, Australia
| |
Collapse
|
2
|
Mackiewicz P, Urantówka AD, Kroczak A, Mackiewicz D. Resolving Phylogenetic Relationships within Passeriformes Based on Mitochondrial Genes and Inferring the Evolution of Their Mitogenomes in Terms of Duplications. Genome Biol Evol 2019; 11:2824-2849. [PMID: 31580435 PMCID: PMC6795242 DOI: 10.1093/gbe/evz209] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/30/2019] [Indexed: 12/29/2022] Open
Abstract
Mitochondrial genes are placed on one molecule, which implies that they should carry consistent phylogenetic information. Following this advantage, we present a well-supported phylogeny based on mitochondrial genomes from almost 300 representatives of Passeriformes, the most numerous and differentiated Aves order. The analyses resolved the phylogenetic position of paraphyletic Basal and Transitional Oscines. Passerida occurred divided into two groups, one containing Paroidea and Sylvioidea, whereas the other, Passeroidea and Muscicapoidea. Analyses of mitogenomes showed four types of rearrangements including a duplicated control region (CR) with adjacent genes. Mapping the presence and absence of duplications onto the phylogenetic tree revealed that the duplication was the ancestral state for passerines and was maintained in early diverged lineages. Next, the duplication could be lost and occurred independently at least four times according to the most parsimonious scenario. In some lineages, two CR copies have been inherited from an ancient duplication and highly diverged, whereas in others, the second copy became similar to the first one due to concerted evolution. The second CR copies accumulated over twice as many substitutions as the first ones. However, the second CRs were not completely eliminated and were retained for a long time, which suggests that both regions can fulfill an important role in mitogenomes. Phylogenetic analyses based on CR sequences subjected to the complex evolution can produce tree topologies inconsistent with real evolutionary relationships between species. Passerines with two CRs showed a higher metabolic rate in relation to their body mass.
Collapse
Affiliation(s)
- Paweł Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, Poland
| | - Adam Dawid Urantówka
- Department of Genetics, Wroclaw University of Environmental and Life Sciences, Poland
| | - Aleksandra Kroczak
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, Poland
- Department of Genetics, Wroclaw University of Environmental and Life Sciences, Poland
| | - Dorota Mackiewicz
- Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wrocław, Poland
| |
Collapse
|
3
|
Gates DJ, Pilson D, Smith SD. Filtering of target sequence capture individuals facilitates species tree construction in the plant subtribe Iochrominae (Solanaceae). Mol Phylogenet Evol 2018; 123:26-34. [DOI: 10.1016/j.ympev.2018.02.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2017] [Revised: 01/30/2018] [Accepted: 02/01/2018] [Indexed: 10/18/2022]
|
4
|
Deichmann JL, Mulcahy DG, Vanthomme H, Tobi E, Wynn AH, Zimkus BM, McDiarmid RW. How many species and under what names? Using DNA barcoding and GenBank data for west Central African amphibian conservation. PLoS One 2017; 12:e0187283. [PMID: 29131846 PMCID: PMC5683629 DOI: 10.1371/journal.pone.0187283] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2016] [Accepted: 09/06/2017] [Indexed: 11/19/2022] Open
Abstract
Development projects in west Central Africa are proceeding at an unprecedented rate, often with little concern for their effects on biodiversity. In an attempt to better understand potential impacts of a road development project on the anuran amphibian community, we conducted a biodiversity assessment employing multiple methodologies (visual encounter transects, auditory surveys, leaf litter plots and pitfall traps) to inventory species prior to construction of a new road within the buffer zone of Moukalaba-Doudou National Park, Gabon. Because of difficulties in morphological identification and taxonomic uncertainty of amphibian species observed in the area, we integrated a DNA barcoding analysis into the project to improve the overall quality and accuracy of the species inventory. Based on morphology alone, 48 species were recognized in the field and voucher specimens of each were collected. We used tissue samples from specimens collected at our field site, material available from amphibians collected in other parts of Gabon and the Republic of Congo to initiate a DNA barcode library for west Central African amphibians. We then compared our sequences with material in GenBank for the genera recorded at the study site to assist in identifications. The resulting COI and 16S barcode library allowed us to update the number of species documented at the study site to 28, thereby providing a more accurate assessment of diversity and distributions. We caution that because sequence data maintained in GenBank are often poorly curated by the original submitters and cannot be amended by third-parties, these data have limited utility for identification purposes. Nevertheless, the use of DNA barcoding is likely to benefit biodiversity inventories and long-term monitoring, particularly for taxa that can be difficult to identify based on morphology alone; likewise, inventory and monitoring programs can contribute invaluable data to the DNA barcode library and the taxonomy of complex groups. Our methods provide an example of how non-taxonomists and parataxonomists working in understudied parts of the world with limited geographic sampling and comparative morphological material can use DNA barcoding and publicly available sequence data (GenBank) to rapidly identify the number of species and assign tentative names to aid in urgent conservation management actions and contribute to taxonomic resolution.
Collapse
Affiliation(s)
- Jessica L. Deichmann
- Center for Conservation and Sustainability, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, United States of America
| | - Daniel G. Mulcahy
- Global Genome Initiative, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States of America
| | - Hadrien Vanthomme
- Center for Conservation and Sustainability, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, United States of America
| | - Elie Tobi
- Center for Conservation and Sustainability, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, United States of America
| | - Addison H. Wynn
- Department of Vertebrate Zoology, Division of Amphibians and Reptiles, National Museum of Natural History, Smithsonian Institution, Washington, DC, United States of America
| | - Breda M. Zimkus
- Museum of Comparative Zoology, Harvard University, Cambridge, MA, United States of America
| | - Roy W. McDiarmid
- USGS, Patuxent Wildlife Research Center, National Museum of Natural History, Washington DC, United States of America
| |
Collapse
|
5
|
Urantowka AD, Kroczak A, Mackiewicz P. The influence of molecular markers and methods on inferring the phylogenetic relationships between the representatives of the Arini (parrots, Psittaciformes), determined on the basis of their complete mitochondrial genomes. BMC Evol Biol 2017; 17:166. [PMID: 28705202 PMCID: PMC5513162 DOI: 10.1186/s12862-017-1012-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2016] [Accepted: 07/04/2017] [Indexed: 01/28/2023] Open
Abstract
BACKGROUND Conures are a morphologically diverse group of Neotropical parrots classified as members of the tribe Arini, which has recently been subjected to a taxonomic revision. The previously broadly defined Aratinga genus of this tribe has been split into the 'true' Aratinga and three additional genera, Eupsittula, Psittacara and Thectocercus. Popular markers used in the reconstruction of the parrots' phylogenies derive from mitochondrial DNA. However, current phylogenetic analyses seem to indicate conflicting relationships between Aratinga and other conures, and also among other Arini members. Therefore, it is not clear if the mtDNA phylogenies can reliably define the species tree. The inconsistencies may result from the variable evolution rate of the markers used or their weak phylogenetic signal. To resolve these controversies and to assess to what extent the phylogenetic relationships in the tribe Arini can be inferred from mitochondrial genomes, we compared representative Arini mitogenomes as well as examined the usefulness of the individual mitochondrial markers and the efficiency of various phylogenetic methods. RESULTS Single molecular markers produced inconsistent tree topologies, while different methods offered various topologies even for the same marker. A significant disagreement in these tree topologies occurred for cytb, nd2 and nd6 genes, which are commonly used in parrot phylogenies. The strongest phylogenetic signal was found in the control region and RNA genes. However, these markers cannot be used alone in inferring Arini phylogenies because they do not provide fully resolved trees. The most reliable phylogeny of the parrots under study is obtained only on the concatenated set of all mitochondrial markers. The analyses established significantly resolved relationships within the former Aratinga representatives and the main genera of the tribe Arini. Such mtDNA phylogeny can be in agreement with the species tree, owing to its match with synapomorphic features in plumage colouration. CONCLUSIONS Phylogenetic relationships inferred from single mitochondrial markers can be incorrect and contradictory. Therefore, such phylogenies should be considered with caution. Reliable results can be produced by concatenated sets of all or at least the majority of mitochondrial genes and the control region. The results advance a new view on the relationships among the main genera of Arini and resolve the inconsistencies between the taxa that were previously classified as the broadly defined genus Aratinga. Although gene and species trees do not always have to be consistent, the mtDNA phylogenies for Arini can reflect the species tree.
Collapse
Affiliation(s)
- Adam Dawid Urantowka
- Department of Genetics, Wroclaw University of Environmental and Life Sciences, ul. Kożuchowska7, 51-631, Wroclaw, Poland
| | - Aleksandra Kroczak
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Fryderyka Joliot-Curie 14a, 50-383 Wrocław, Poland
| | - Paweł Mackiewicz
- Department of Genomics, Faculty of Biotechnology, University of Wrocław, ul. Fryderyka Joliot-Curie 14a, 50-383 Wrocław, Poland
| |
Collapse
|
6
|
Wise MJ. dCITE: Measuring Necessary Cladistic Information Can Help You Reduce Polytomy Artefacts in Trees. PLoS One 2016; 11:e0166991. [PMID: 27898695 PMCID: PMC5127522 DOI: 10.1371/journal.pone.0166991] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2016] [Accepted: 11/07/2016] [Indexed: 11/19/2022] Open
Abstract
Biologists regularly create phylogenetic trees to better understand the evolutionary origins of their species of interest, and often use genomes as their data source. However, as more and more incomplete genomes are published, in many cases it may not be possible to compute genome-based phylogenetic trees due to large gaps in the assembled sequences. In addition, comparison of complete genomes may not even be desirable due to the presence of horizontally acquired and homologous genes. A decision must therefore be made about which gene, or gene combinations, should be used to compute a tree. Deflated Cladistic Information based on Total Entropy (dCITE) is proposed as an easily computed metric for measuring the cladistic information in multiple sequence alignments representing a range of taxa, without the need to first compute the corresponding trees. dCITE scores can be used to rank candidate genes or decide whether input sequences provide insufficient cladistic information, making artefactual polytomies more likely. The dCITE method can be applied to protein, nucleotide or encoded phenotypic data, so can be used to select which data-type is most appropriate, given the choice. In a series of experiments the dCITE method was compared with related measures. Then, as a practical demonstration, the ideas developed in the paper were applied to a dataset representing species from the order Campylobacterales; trees based on sequence combinations, selected on the basis of their dCITE scores, were compared with a tree constructed to mimic Multi-Locus Sequence Typing (MLST) combinations of fragments. We see that the greater the dCITE score the more likely it is that the computed phylogenetic tree will be free of artefactual polytomies. Secondly, cladistic information saturates, beyond which little additional cladistic information can be obtained by adding additional sequences. Finally, sequences with high cladistic information produce more consistent trees for the same taxa.
Collapse
Affiliation(s)
- Michael J. Wise
- Computer Science and Software Engineering, The University of Western Australia, Perth, Australia
- The Marshall Centre for Infectious Diseases Research and Training, The University of Western Australia, Perth, Australia
| |
Collapse
|
7
|
Maddock ST, Briscoe AG, Wilkinson M, Waeschenbach A, San Mauro D, Day JJ, Littlewood DTJ, Foster PG, Nussbaum RA, Gower DJ. Next-Generation Mitogenomics: A Comparison of Approaches Applied to Caecilian Amphibian Phylogeny. PLoS One 2016; 11:e0156757. [PMID: 27280454 PMCID: PMC4900593 DOI: 10.1371/journal.pone.0156757] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2015] [Accepted: 05/19/2016] [Indexed: 01/06/2023] Open
Abstract
Mitochondrial genome (mitogenome) sequences are being generated with increasing speed due to the advances of next-generation sequencing (NGS) technology and associated analytical tools. However, detailed comparisons to explore the utility of alternative NGS approaches applied to the same taxa have not been undertaken. We compared a 'traditional' Sanger sequencing method with two NGS approaches (shotgun sequencing and non-indexed, multiplex amplicon sequencing) on four different sequencing platforms (Illumina's HiSeq and MiSeq, Roche's 454 GS FLX, and Life Technologies' Ion Torrent) to produce seven (near-) complete mitogenomes from six species that form a small radiation of caecilian amphibians from the Seychelles. The fastest, most accurate method of obtaining mitogenome sequences that we tested was direct sequencing of genomic DNA (shotgun sequencing) using the MiSeq platform. Bayesian inference and maximum likelihood analyses using seven different partitioning strategies were unable to resolve compellingly all phylogenetic relationships among the Seychelles caecilian species, indicating the need for additional data in this case.
Collapse
Affiliation(s)
- Simon T. Maddock
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
- Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT, United Kingdom
- Department of Animal Management, Reaseheath College, Nantwich, CW5 6DF, United Kingdom
| | - Andrew G. Briscoe
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| | - Mark Wilkinson
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| | - Andrea Waeschenbach
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| | - Diego San Mauro
- Department of Zoology and Physical Anthropology, Complutense University of Madrid, 28040, Madrid, Spain
| | - Julia J. Day
- Department of Genetics, Evolution and Environment, University College London, London, WC1E 6BT, United Kingdom
| | - D. Tim J. Littlewood
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| | - Peter G. Foster
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| | - Ronald A. Nussbaum
- Museum of Zoology, University of Michigan, Ann Arbor, MI, 48109–1079, United States of America
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI, 48109–1079, United States of America
| | - David J. Gower
- Department of Life Sciences, Natural History Museum, London, SW7 5BD, United Kingdom
| |
Collapse
|
8
|
Lewis PO, Chen MH, Kuo L, Lewis LA, Fučíková K, Neupane S, Wang YB, Shi D. Estimating Bayesian Phylogenetic Information Content. Syst Biol 2016; 65:1009-1023. [PMID: 27155008 PMCID: PMC5066063 DOI: 10.1093/sysbio/syw042] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Revised: 04/15/2016] [Accepted: 05/01/2016] [Indexed: 11/13/2022] Open
Abstract
Measuring the phylogenetic information content of data has a long history in systematics. Here we explore a Bayesian approach to information content estimation. The entropy of the posterior distribution compared with the entropy of the prior distribution provides a natural way to measure information content. If the data have no information relevant to ranking tree topologies beyond the information supplied by the prior, the posterior and prior will be identical. Information in data discourages consideration of some hypotheses allowed by the prior, resulting in a posterior distribution that is more concentrated (has lower entropy) than the prior. We focus on measuring information about tree topology using marginal posterior distributions of tree topologies. We show that both the accuracy and the computational efficiency of topological information content estimation improve with use of the conditional clade distribution, which also allows topological information content to be partitioned by clade. We explore two important applications of our method: providing a compelling definition of saturation and detecting conflict among data partitions that can negatively affect analyses of concatenated data. [Bayesian; concatenation; conditional clade distribution; entropy; information; phylogenetics; saturation.].
Collapse
Affiliation(s)
- Paul O Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA;
| | - Ming-Hui Chen
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| | - Lynn Kuo
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| | - Louise A Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Karolina Fučíková
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Suman Neupane
- Department of Ecology and Evolutionary Biology, University of Connecticut, 75 N. Eagleville Road, Unit 3043, Storrs, CT 06269, USA
| | - Yu-Bo Wang
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| | - Daoyuan Shi
- Department of Statistics, University of Connecticut, 215 Glenbrook Road, Unit 4120, Storrs, CT 06269, USA
| |
Collapse
|
9
|
Collins RA, Britz R, Rüber L. Phylogenetic systematics of leaffishes (Teleostei: Polycentridae, Nandidae). J ZOOL SYST EVOL RES 2015. [DOI: 10.1111/jzs.12103] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Rupert A. Collins
- Laboratório de Evolução e Genética Animal; Departamento de Biologia; Universidade Federal do Amazonas; Manaus Amazonas Brasil
| | - Ralf Britz
- Vertebrates Division; Department of Life Sciences; Natural History Museum; London UK
| | - Lukas Rüber
- Naturhistorisches Museum der Burgergemeinde Bern; Bern Switzerland
| |
Collapse
|
10
|
San Mauro D, Gower DJ, Müller H, Loader SP, Zardoya R, Nussbaum RA, Wilkinson M. Life-history evolution and mitogenomic phylogeny of caecilian amphibians. Mol Phylogenet Evol 2014; 73:177-89. [PMID: 24480323 DOI: 10.1016/j.ympev.2014.01.009] [Citation(s) in RCA: 63] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2013] [Revised: 01/13/2014] [Accepted: 01/16/2014] [Indexed: 11/27/2022]
Abstract
We analyze mitochondrial genomes to reconstruct a robust phylogenetic framework for caecilian amphibians and use this to investigate life-history evolution within the group. Our study comprises 45 caecilian mitochondrial genomes (19 of them newly reported), representing all families and 27 of 32 currently recognized genera, including some for which molecular data had never been reported. Support for all relationships in the inferred phylogenetic tree is high to maximal, and topology tests reject all investigated alternatives, indicating an exceptionally robust molecular phylogenetic framework of caecilian evolution consistent with current morphology-based supraspecific classification. We used the mitogenomic phylogenetic framework to infer ancestral character states and to assess correlation among three life-history traits (free-living larvae, viviparity, specialized pre-adult or vernal teeth), each of which occurs only in some caecilian species. Our results provide evidence that an ancestor of the Seychelles caecilians abandoned direct development and re-evolved a free-living larval stage. This study yields insights into the concurrent evolution of direct development and of vernal teeth in an ancestor of Teresomata that likely gave rise to skin-feeding (maternal dermatophagy) behavior and subsequently enabled evolution of viviparity, with skin feeding possibly a homologous precursor of oviduct feeding in viviparous caecilians.
Collapse
Affiliation(s)
- Diego San Mauro
- Department of Life Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom.
| | - David J Gower
- Department of Life Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom
| | - Hendrik Müller
- Institut für Spezielle Zoologie und Evolutionsbiologie mit Phyletischem Museum, Friedrich-Schiller-Universität Jena, Erbertstrasse 1, 07743 Jena, Germany
| | - Simon P Loader
- University of Basel, Biogeography Research Group, Department of Environmental Sciences, Basel 4056, Switzerland
| | - Rafael Zardoya
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales - CSIC, José Gutiérrez Abascal 2, 28006 Madrid, Spain
| | - Ronald A Nussbaum
- Museum of Zoology, University of Michigan, 1109 Geddes Ave., Ann Arbor, MI 48109-1079, United States
| | - Mark Wilkinson
- Department of Life Sciences, The Natural History Museum, Cromwell Road, London SW7 5BD, United Kingdom
| |
Collapse
|
11
|
Havird JC, Santos SR. Performance of single and concatenated sets of mitochondrial genes at inferring metazoan relationships relative to full mitogenome data. PLoS One 2014; 9:e84080. [PMID: 24454717 PMCID: PMC3891902 DOI: 10.1371/journal.pone.0084080] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2013] [Accepted: 11/11/2013] [Indexed: 12/04/2022] Open
Abstract
Mitochondrial (mt) genes are some of the most popular and widely-utilized genetic loci in phylogenetic studies of metazoan taxa. However, their linked nature has raised questions on whether using the entire mitogenome for phylogenetics is overkill (at best) or pseudoreplication (at worst). Moreover, no studies have addressed the comparative phylogenetic utility of mitochondrial genes across individual lineages within the entire Metazoa. To comment on the phylogenetic utility of individual mt genes as well as concatenated subsets of genes, we analyzed mitogenomic data from 1865 metazoan taxa in 372 separate lineages spanning genera to subphyla. Specifically, phylogenies inferred from these datasets were statistically compared to ones generated from all 13 mt protein-coding (PC) genes (i.e., the “supergene” set) to determine which single genes performed “best” at, and the minimum number of genes required to, recover the “supergene” topology. Surprisingly, the popular marker COX1 performed poorest, while ND5, ND4, and ND2 were most likely to reproduce the “supergene” topology. Averaged across all lineages, the longest ∼2 mt PC genes were sufficient to recreate the “supergene” topology, although this average increased to ∼5 genes for datasets with 40 or more taxa. Furthermore, concatenation of the three “best” performing mt PC genes outperformed that of the three longest mt PC genes (i.e, ND5, COX1, and ND4). Taken together, while not all mt PC genes are equally interchangeable in phylogenetic studies of the metazoans, some subset can serve as a proxy for the 13 mt PC genes. However, the exact number and identity of these genes is specific to the lineage in question and cannot be applied indiscriminately across the Metazoa.
Collapse
Affiliation(s)
- Justin C. Havird
- Department of Biological Sciences, Molette Biology Laboratory for Environmental and Climate Change Studies, Auburn University, Auburn, Alabama, United States of America
- Cellular and Molecular Biosciences Program, Auburn University, Auburn, Alabama, United States of America
- * E-mail:
| | - Scott R. Santos
- Department of Biological Sciences, Molette Biology Laboratory for Environmental and Climate Change Studies, Auburn University, Auburn, Alabama, United States of America
- Cellular and Molecular Biosciences Program, Auburn University, Auburn, Alabama, United States of America
| |
Collapse
|
12
|
Pedraza-Lara C, Doadrio I, Breinholt JW, Crandall KA. Phylogeny and evolutionary patterns in the Dwarf crayfish subfamily (Decapoda: Cambarellinae). PLoS One 2012; 7:e48233. [PMID: 23155379 PMCID: PMC3498282 DOI: 10.1371/journal.pone.0048233] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2011] [Accepted: 09/27/2012] [Indexed: 11/18/2022] Open
Abstract
The Dwarf crayfish or Cambarellinae, is a morphologically singular subfamily of decapod crustaceans that contains only one genus, Cambarellus. Its intriguing distribution, along the river basins of the Gulf Coast of United States (Gulf Group) and into Central México (Mexican Group), has until now lacked of satisfactory explanation. This study provides a comprehensive sampling of most of the extant species of Cambarellus and sheds light on its evolutionary history, systematics and biogeography. We tested the impact of Gulf Group versus Mexican Group geography on rates of cladogenesis using a maximum likelihood framework, testing different models of birth/extinction of lineages. We propose a comprehensive phylogenetic hypothesis for the subfamily based on mitochondrial and nuclear loci (3,833 bp) using Bayesian and Maximum Likelihood methods. The phylogenetic structure found two phylogenetic groups associated to the two main geographic components (Gulf Group and Mexican Group) and is partially consistent with the historical structure of river basins. The previous hypothesis, which divided the genus into three subgenera based on genitalia morphology was only partially supported (P = 0.047), resulting in a paraphyletic subgenus Pandicambarus. We found at least two cases in which phylogenetic structure failed to recover monophyly of recognized species while detecting several cases of cryptic diversity, corresponding to lineages not assigned to any described species. Cladogenetic patterns in the entire subfamily are better explained by an allopatric model of speciation. Diversification analyses showed similar cladogenesis patterns between both groups and did not significantly differ from the constant rate models. While cladogenesis in the Gulf Group is coincident in time with changes in the sea levels, in the Mexican Group, cladogenesis is congruent with the formation of the Trans-Mexican Volcanic Belt. Our results show how similar allopatric divergence in freshwater organisms can be promoted through diverse vicariant factors.
Collapse
Affiliation(s)
- Carlos Pedraza-Lara
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales, CSIC, Madrid, Spain.
| | | | | | | |
Collapse
|
13
|
Weisrock DW. Concordance analysis in mitogenomic phylogenetics. Mol Phylogenet Evol 2012; 65:194-202. [DOI: 10.1016/j.ympev.2012.06.003] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2012] [Revised: 05/22/2012] [Accepted: 06/06/2012] [Indexed: 10/28/2022]
|
14
|
Agorreta A, Rüber L. A standardized reanalysis of molecular phylogenetic hypotheses of Gobioidei. SYST BIODIVERS 2012. [DOI: 10.1080/14772000.2012.699477] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
15
|
San Mauro D, Gower DJ, Cotton JA, Zardoya R, Wilkinson M, Massingham T. Experimental Design in Phylogenetics: Testing Predictions from Expected Information. Syst Biol 2012; 61:661-74. [DOI: 10.1093/sysbio/sys028] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Affiliation(s)
- Diego San Mauro
- Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - David J. Gower
- Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - James A. Cotton
- School of Biological and Chemical Sciences, Queen Mary, University of London, London E1 4NS, UK
| | - Rafael Zardoya
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales—CSIC, José Gutiérrez Abascal 2, 28006 Madrid, Spain
| | - Mark Wilkinson
- Department of Zoology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - Tim Massingham
- EMBL—European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, UK
| |
Collapse
|
16
|
Alexander Pyron R, Wiens JJ. A large-scale phylogeny of Amphibia including over 2800 species, and a revised classification of extant frogs, salamanders, and caecilians. Mol Phylogenet Evol 2011; 61:543-83. [DOI: 10.1016/j.ympev.2011.06.012] [Citation(s) in RCA: 936] [Impact Index Per Article: 66.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2011] [Revised: 06/10/2011] [Accepted: 06/12/2011] [Indexed: 11/27/2022]
|
17
|
Fisher-Reid MC, Wiens JJ. What are the consequences of combining nuclear and mitochondrial data for phylogenetic analysis? Lessons from Plethodon salamanders and 13 other vertebrate clades. BMC Evol Biol 2011; 11:300. [PMID: 21995558 PMCID: PMC3203092 DOI: 10.1186/1471-2148-11-300] [Citation(s) in RCA: 72] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2011] [Accepted: 10/13/2011] [Indexed: 11/15/2022] Open
Abstract
Background The use of mitochondrial DNA data in phylogenetics is controversial, yet studies that combine mitochondrial and nuclear DNA data (mtDNA and nucDNA) to estimate phylogeny are common, especially in vertebrates. Surprisingly, the consequences of combining these data types are largely unexplored, and many fundamental questions remain unaddressed in the literature. For example, how much do trees from mtDNA and nucDNA differ? How are topological conflicts between these data types typically resolved in the combined-data tree? What determines whether a node will be resolved in favor of mtDNA or nucDNA, and are there any generalities that can be made regarding resolution of mtDNA-nucDNA conflicts in combined-data trees? Here, we address these and related questions using new and published nucDNA and mtDNA data for Plethodon salamanders and published data from 13 other vertebrate clades (including fish, frogs, lizards, birds, turtles, and mammals). Results We find widespread discordance between trees from mtDNA and nucDNA (30-70% of nodes disagree per clade), but this discordance is typically not strongly supported. Despite often having larger numbers of variable characters, mtDNA data do not typically dominate combined-data analyses, and combined-data trees often share more nodes with trees from nucDNA alone. There is no relationship between the proportion of nodes shared between combined-data and mtDNA trees and relative numbers of variable characters or levels of homoplasy in the mtDNA and nucDNA data sets. Congruence between trees from mtDNA and nucDNA is higher on branches that are longer and deeper in the combined-data tree, but whether a conflicting node will be resolved in favor mtDNA or nucDNA is unrelated to branch length. Conflicts that are resolved in favor of nucDNA tend to occur at deeper nodes in the combined-data tree. In contrast to these overall trends, we find that Plethodon have an unusually large number of strongly supported conflicts between data types, which are generally resolved in favor of mtDNA in the combined-data tree (despite the large number of nuclear loci sampled). Conclusions Overall, our results from 14 vertebrate clades show that combined-data analyses are not necessarily dominated by the more variable mtDNA data sets. However, given cases like Plethodon, there is also the need for routine checking of incongruence between mtDNA and nucDNA data and its impacts on combined-data analyses.
Collapse
Affiliation(s)
- M Caitlin Fisher-Reid
- Department of Ecology and Evolution, Stony Brook University, Stony Brook NY, 11794-5245, USA.
| | | |
Collapse
|
18
|
Gower DJ, Mauro DS, Giri V, Bhatta G, Govindappa V, Kotharambath R, Oommen OV, Fatih FA, Mackenzie-Dodds JA, Nussbaum RA, Biju S, Shouche YS, Wilkinson M. Molecular systematics of caeciliid caecilians (Amphibia: Gymnophiona) of the Western Ghats, India. Mol Phylogenet Evol 2011; 59:698-707. [DOI: 10.1016/j.ympev.2011.03.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2010] [Revised: 02/28/2011] [Accepted: 03/02/2011] [Indexed: 10/18/2022]
|
19
|
Doherty-Bone TM, Ndifon RK, San Mauro D, Wilkinson M, LeGrand GN, Gower DJ. Systematics and ecology of the caecilianCrotaphatrema lamottei(Nussbaum) (Amphibia: Gymnophiona: Scolecomorphidae). J NAT HIST 2011. [DOI: 10.1080/00222933.2010.535921] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
20
|
Irisarri I, San Mauro D, Green DM, Zardoya R. The complete mitochondrial genome of the relict frog Leiopelma archeyi: insights into the root of the frog Tree of Life. ACTA ACUST UNITED AC 2011; 21:173-82. [PMID: 20958226 DOI: 10.3109/19401736.2010.513973] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
Determining the root of the anuran Tree of Life is still a contentious and open question in frog systematics. Two genera with disjunct distributions have been traditionally considered the most basal among extant frogs: Leiopelma, which is endemic to New Zealand, and Ascaphus, which lives in North America. However, their specific phylogenetic position is rather elusive because each genus shows many autapomorphies, and together they retain many symplesiomorphic characters. Therefore, several alternative hypotheses have been proposed regarding the relative phylogenetic position of both Leiopelma and Ascaphus. In order to distinguish among these competing phylogenetic hypotheses, we sequenced the complete mitochondrial (mt) genome of Leiopelma archeyi and used it along with previously reported frog mt genomes (including that of Ascaphus truei) to infer a robust phylogeny of major anuran lineages. The reconstructed maximum likelihood and Bayesian inference phylogenies recovered identical topology, which supports the sister group relationship of Ascaphus and Leiopelma, and the placement of this clade at the base of the anuran tree. Interestingly, the mt genome of L. archeyi displays a novel gene arrangement in frog mt genomes affecting the relative position of cytochrome b, trnT, NADH dehydrogenase subunit 6, trnE, and trnP genes. The tandem duplication-random loss model of gene order change explains the origin of this novel frog mt genome arrangement, which is convergent with others reported in some fishes and salamanders. These results, together with comparative data for other available vertebrate mt genomes, provide evidence that the 5' end of the control region is a hot spot for gene order rearrangement.
Collapse
Affiliation(s)
- Iker Irisarri
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales (CSIC), Madrid, Spain.
| | | | | | | |
Collapse
|
21
|
A multilocus timescale for the origin of extant amphibians. Mol Phylogenet Evol 2010; 56:554-61. [DOI: 10.1016/j.ympev.2010.04.019] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2009] [Revised: 04/06/2010] [Accepted: 04/13/2010] [Indexed: 11/15/2022]
|
22
|
Molecular systematics: A synthesis of the common methods and the state of knowledge. Cell Mol Biol Lett 2010; 15:311-41. [PMID: 20213503 PMCID: PMC6275913 DOI: 10.2478/s11658-010-0010-8] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2009] [Accepted: 03/01/2010] [Indexed: 11/21/2022] Open
Abstract
The comparative and evolutionary analysis of molecular data has allowed researchers to tackle biological questions that have long remained unresolved. The evolution of DNA and amino acid sequences can now be modeled accurately enough that the information conveyed can be used to reconstruct the past. The methods to infer phylogeny (the pattern of historical relationships among lineages of organisms and/or sequences) range from the simplest, based on parsimony, to more sophisticated and highly parametric ones based on likelihood and Bayesian approaches. In general, molecular systematics provides a powerful statistical framework for hypothesis testing and the estimation of evolutionary processes, including the estimation of divergence times among taxa. The field of molecular systematics has experienced a revolution in recent years, and, although there are still methodological problems and pitfalls, it has become an essential tool for the study of evolutionary patterns and processes at different levels of biological organization. This review aims to present a brief synthesis of the approaches and methodologies that are most widely used in the field of molecular systematics today, as well as indications of future trends and state-of-the-art approaches.
Collapse
|
23
|
Klopfstein S, Kropf C, Quicke DLJ. An evaluation of phylogenetic informativeness profiles and the molecular phylogeny of diplazontinae (Hymenoptera, Ichneumonidae). Syst Biol 2010; 59:226-41. [PMID: 20525632 DOI: 10.1093/sysbio/syp105] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
How to quantify the phylogenetic information content of a data set is a longstanding question in phylogenetics, influencing both the assessment of data quality in completed studies and the planning of future phylogenetic projects. Recently, a method has been developed that profiles the phylogenetic informativeness (PI) of a data set through time by linking its site-specific rates of change to its power to resolve relationships at different timescales. Here, we evaluate the performance of this method in the case of 2 standard genetic markers for phylogenetic reconstruction, 28S ribosomal RNA and cytochrome oxidase subunit 1 (CO1) mitochondrial DNA, with maximum parsimony, maximum likelihood, and Bayesian analyses of relationships within a group of parasitoid wasps (Hymenoptera: Ichneumonidae, Diplazontinae). Retrieving PI profiles of the 2 genes from our own and from 3 additional data sets, we find that the method repeatedly overestimates the performance of the more quickly evolving CO1 compared with 28S. We explore possible reasons for this bias, including phylogenetic uncertainty, violation of the molecular clock assumption, model misspecification, and nonstationary nucleotide composition. As none of these provides a sufficient explanation of the observed discrepancy, we use simulated data sets, based on an idealized setting, to show that the optimum evolutionary rate decreases with increasing number of taxa. We suggest that this relationship could explain why the formula derived from the 4-taxon case overrates the performance of higher versus lower rates of evolution in our case and that caution should be taken when the method is applied to data sets including more than 4 taxa.
Collapse
Affiliation(s)
- Seraina Klopfstein
- Department of Invertebrates, Natural History Museum, Bernastrasse 15, CH-3005 Bern, Switzerland.
| | | | | |
Collapse
|