1
|
Chou RT, Ouattara A, Adams M, Berry AA, Takala-Harrison S, Cummings MP. Positive-unlabeled learning identifies vaccine candidate antigens in the malaria parasite Plasmodium falciparum. NPJ Syst Biol Appl 2024; 10:44. [PMID: 38678051 PMCID: PMC11055854 DOI: 10.1038/s41540-024-00365-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Accepted: 03/29/2024] [Indexed: 04/29/2024] Open
Abstract
Malaria vaccine development is hampered by extensive antigenic variation and complex life stages of Plasmodium species. Vaccine development has focused on a small number of antigens, many of which were identified without utilizing systematic genome-level approaches. In this study, we implement a machine learning-based reverse vaccinology approach to predict potential new malaria vaccine candidate antigens. We assemble and analyze P. falciparum proteomic, structural, functional, immunological, genomic, and transcriptomic data, and use positive-unlabeled learning to predict potential antigens based on the properties of known antigens and remaining proteins. We prioritize candidate antigens based on model performance on reference antigens with different genetic diversity and quantify the protein properties that contribute most to identifying top candidates. Candidate antigens are characterized by gene essentiality, gene ontology, and gene expression in different life stages to inform future vaccine development. This approach provides a framework for identifying and prioritizing candidate vaccine antigens for a broad range of pathogens.
Collapse
Affiliation(s)
- Renee Ti Chou
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
| | - Amed Ouattara
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Matthew Adams
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Andrea A Berry
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Shannon Takala-Harrison
- Center for Vaccine Development and Global Health, University of Maryland School of Medicine, Baltimore, MD, USA.
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA.
| |
Collapse
|
2
|
Hsueh HT, Chou RT, Rai U, Kolodziejski P, Liyanage W, Pejavar J, Mozzer A, Davison C, Appell MB, Kim YC, Leo KT, Kwon H, Sista M, Anders NM, Hemingway A, Rompicharla SVK, Pitha I, Zack DJ, Hanes J, Cummings MP, Ensign LM. Engineered peptide-drug conjugate provides sustained protection of retinal ganglion cells with topical administration in rats. J Control Release 2023; 362:371-380. [PMID: 37657693 PMCID: PMC10591956 DOI: 10.1016/j.jconrel.2023.08.058] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2023] [Revised: 05/03/2023] [Accepted: 08/29/2023] [Indexed: 09/03/2023]
Abstract
Effective eye drop delivery systems for treating diseases of the posterior segment have yet to be clinically validated. Further, adherence to eye drop regimens is often problematic due to the difficulty and inconvenience of repetitive dosing. Here, we describe a strategy for topically dosing a peptide-drug conjugate to achieve effective and sustained therapeutic sunitinib concentrations to protect retinal ganglion cells (RGCs) in a rat model of optic nerve injury. We combined two promising delivery technologies, namely, a hypotonic gel-forming eye drop delivery system, and an engineered melanin binding and cell-penetrating peptide that sustains intraocular drug residence time. We found that once daily topical dosing of HR97-SunitiGel provided up to 2 weeks of neuroprotection after the last dose, effectively doubling the therapeutic window observed with SunitiGel. For chronic ocular diseases affecting the posterior segment, the convenience of an eye drop combined with intermittent dosing frequency could result in greater patient adherence, and thus, improved disease management.
Collapse
Affiliation(s)
- Henry T Hsueh
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Renee Ti Chou
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
| | - Usha Rai
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Patricia Kolodziejski
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Wathsala Liyanage
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Jahnavi Pejavar
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Ann Mozzer
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Charlotte Davison
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Matthew B Appell
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
| | - Yoo Chun Kim
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Kirby T Leo
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - HyeYoung Kwon
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Maanasa Sista
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA
| | - Nicole M Anders
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Avelina Hemingway
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Sri Vishnu Kiran Rompicharla
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ian Pitha
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Donald J Zack
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Departments of Neuroscience, Molecular Biology and Genetics, and Genetic Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Justin Hanes
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA; The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
| | - Laura M Ensign
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA; Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA; Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA; The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
3
|
Hsueh HT, Chou RT, Rai U, Liyanage W, Kim YC, Appell MB, Pejavar J, Leo KT, Davison C, Kolodziejski P, Mozzer A, Kwon H, Sista M, Anders NM, Hemingway A, Rompicharla SVK, Edwards M, Pitha I, Hanes J, Cummings MP, Ensign LM. Machine learning-driven multifunctional peptide engineering for sustained ocular drug delivery. Nat Commun 2023; 14:2509. [PMID: 37130851 PMCID: PMC10154330 DOI: 10.1038/s41467-023-38056-w] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 04/12/2023] [Indexed: 05/04/2023] Open
Abstract
Sustained drug delivery strategies have many potential benefits for treating a range of diseases, particularly chronic diseases that require treatment for years. For many chronic ocular diseases, patient adherence to eye drop dosing regimens and the need for frequent intraocular injections are significant barriers to effective disease management. Here, we utilize peptide engineering to impart melanin binding properties to peptide-drug conjugates to act as a sustained-release depot in the eye. We develop a super learning-based methodology to engineer multifunctional peptides that efficiently enter cells, bind to melanin, and have low cytotoxicity. When the lead multifunctional peptide (HR97) is conjugated to brimonidine, an intraocular pressure lowering drug that is prescribed for three times per day topical dosing, intraocular pressure reduction is observed for up to 18 days after a single intracameral injection in rabbits. Further, the cumulative intraocular pressure lowering effect increases ~17-fold compared to free brimonidine injection. Engineered multifunctional peptide-drug conjugates are a promising approach for providing sustained therapeutic delivery in the eye and beyond.
Collapse
Affiliation(s)
- Henry T Hsueh
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Renee Ti Chou
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA
| | - Usha Rai
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Wathsala Liyanage
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Yoo Chun Kim
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Matthew B Appell
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
| | - Jahnavi Pejavar
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Kirby T Leo
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Charlotte Davison
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Patricia Kolodziejski
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Ann Mozzer
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - HyeYoung Kwon
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Maanasa Sista
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Biomedical Engineering, Case Western Reserve University, Cleveland, OH, USA
| | - Nicole M Anders
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Avelina Hemingway
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Sri Vishnu Kiran Rompicharla
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Malia Edwards
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Ian Pitha
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
| | - Justin Hanes
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, College Park, MD, USA.
| | - Laura M Ensign
- Center for Nanomedicine at the Wilmer Eye Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
- Department of Chemical & Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA.
- Department of Ophthalmology, Johns Hopkins University School of Medicine, Baltimore, MD, USA.
- Department of Pharmacology and Molecular Sciences, Johns Hopkins University, Baltimore, MD, USA.
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, USA.
- The Sidney Kimmel Comprehensive Cancer Center at Johns Hopkins University, Baltimore, MD, USA.
| |
Collapse
|
4
|
Goodheart JA, Collins AG, Cummings MP, Egger B, Rawlinson KA. A phylogenomic approach to resolving interrelationships of polyclad flatworms, with implications for life-history evolution. R Soc Open Sci 2023; 10:220939. [PMID: 36998763 PMCID: PMC10049750 DOI: 10.1098/rsos.220939] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/22/2022] [Accepted: 03/07/2023] [Indexed: 06/19/2023]
Abstract
Platyhelminthes (flatworms) are a diverse invertebrate phylum useful for exploring life-history evolution. Within Platyhelminthes, only two clades develop through a larval stage: free-living polyclads and parasitic neodermatans. Neodermatan larvae are considered evolutionarily derived, whereas polyclad larvae are hypothesized to be ancestral due to ciliary band similarities among polyclad and other spiralian larvae. However, larval evolution has been challenging to investigate within polyclads due to low support for deeper phylogenetic relationships. To investigate polyclad life-history evolution, we generated transcriptomic data for 21 species of polyclads to build a well-supported phylogeny for the group. The resulting tree provides strong support for deeper nodes, and we recover a new monophyletic clade of early branching cotyleans. We then used ancestral state reconstructions to investigate ancestral modes of development within Polycladida and more broadly within flatworms. In polyclads, we were unable to reconstruct the ancestral state of deeper nodes with significant support because early branching clades show diverse modes of development. This suggests a complex history of larval evolution in polyclads that likely includes multiple losses and/or multiple gains. However, our ancestral state reconstruction across a previously published platyhelminth phylogeny supports a direct developing prorhynchid/polyclad ancestor, which suggests that a larval stage in the life cycle evolved along the polyclad stem lineage or within polyclads.
Collapse
Affiliation(s)
- Jessica A. Goodheart
- Division of Invertebrate Zoology, American Museum of Natural History, New York, NY 10024, USA
- Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA 92037, USA
| | - Allen G. Collins
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013, USA
| | - Michael P. Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Bernhard Egger
- Universität Innsbruck, Department of Zoology, Technikerstr. 25, 6020 Innsbruck, Austria
| | - Kate A. Rawlinson
- Wellcome Sanger Institute, Hinxton, Cambridgeshire CB10 1SA, UK
- Josephine Bay Paul Center, Marine Biological Laboratory, Woods Hole, MA, 02543
| |
Collapse
|
5
|
Commichaux S, Shah N, Ghurye J, Stoppel A, Goodheart JA, Luque GG, Cummings MP, Pop M. A critical assessment of gene catalogs for metagenomic analysis. Bioinformatics 2021; 37:2848-2857. [PMID: 33792639 PMCID: PMC8479683 DOI: 10.1093/bioinformatics/btab216] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Revised: 02/02/2021] [Accepted: 03/31/2021] [Indexed: 02/02/2023] Open
Abstract
MOTIVATION Microbial gene catalogs are data structures that organize genes found in microbial communities, providing a reference for standardized analysis of the microbes across samples and studies. Although gene catalogs are commonly used, they have not been critically evaluated for their effectiveness as a basis for metagenomic analyses. RESULTS As a case study, we investigate one such catalog, the Integrated Gene Catalog (IGC), however, our observations apply broadly to most gene catalogs constructed to date. We focus on both the approach used to construct this catalog and on its effectiveness when used as a reference for microbiome studies. Our results highlight important limitations of the approach used to construct the IGC and call into question the broad usefulness of gene catalogs more generally. We also recommend best practices for the construction and use of gene catalogs in microbiome studies and highlight opportunities for future research. AVAILABILITY AND IMPLEMENTATION All supporting scripts for our analyses can be found on GitHub: https://github.com/SethCommichaux/IGC.git. The supporting data can be downloaded from: https://obj.umiacs.umd.edu/igc-analysis/IGC_analysis_data.tar.gz. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Seth Commichaux
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA,Biological Science Graduate Program, University of Maryland, College Park, MD, 20742, USA,Division of Molecular Biology, Office of Applied Research and Safety Assessment, Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, Laurel, Maryland, 20708, USA
| | - Nidhi Shah
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA,Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Jay Ghurye
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA,Department of Computer Science, University of Maryland, College Park, MD, 20742, USA
| | - Alexander Stoppel
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA
| | - Jessica A Goodheart
- Scripps Institution of Oceanography, University of California, San Diego, La Jolla, CA, 92037, USA
| | - Guillermo G Luque
- Department of Microbiome Science, Max Planck Institute for Developmental Biology, Tübingen, 72076, Germany
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA
| | - Mihai Pop
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA,Department of Computer Science, University of Maryland, College Park, MD, 20742, USA,To whom correspondence should be addressed.
| |
Collapse
|
6
|
Ayres DL, Cummings MP, Baele G, Darling AE, Lewis PO, Swofford DL, Huelsenbeck JP, Lemey P, Rambaut A, Suchard MA. BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics. Syst Biol 2019; 68:1052-1061. [PMID: 31034053 PMCID: PMC6802572 DOI: 10.1093/sysbio/syz020] [Citation(s) in RCA: 101] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2018] [Revised: 04/10/2019] [Accepted: 04/10/2019] [Indexed: 11/12/2022] Open
Abstract
BEAGLE is a high-performance likelihood-calculation library for phylogenetic inference. The BEAGLE library defines a simple, but flexible, application programming interface (API), and includes a collection of efficient implementations for calculation under a variety of evolutionary models on different hardware devices. The library has been integrated into recent versions of popular phylogenetics software packages including BEAST and MrBayes and has been widely used across a diverse range of evolutionary studies. Here, we present BEAGLE 3 with new parallel implementations, increased performance for challenging data sets, improved scalability, and better usability. We have added new OpenCL and central processing unit-threaded implementations to the library, allowing the effective utilization of a wider range of modern hardware. Further, we have extended the API and library to support concurrent computation of independent partial likelihood arrays, for increased performance of nucleotide-model analyses with greater flexibility of data partitioning. For better scalability and usability, we have improved how phylogenetic software packages use BEAGLE in multi-GPU (graphics processing unit) and cluster environments, and introduced an automated method to select the fastest device given the data set, evolutionary model, and hardware. For application developers who wish to integrate the library, we also have developed an online tutorial. To evaluate the effect of the improvements, we ran a variety of benchmarks on state-of-the-art hardware. For a partitioned exemplar analysis, we observe run-time performance improvements as high as 5.9-fold over our previous GPU implementation. BEAGLE 3 is free, open-source software licensed under the Lesser GPL and available at https://beagle-dev.github.io.
Collapse
Affiliation(s)
- Daniel L Ayres
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Guy Baele
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven – University of Leuven, 3000 Leuven, Belgium
| | - Aaron E Darling
- The ithree Institute, University of Technology Sydney, Ultimo, New South Wales 2007, Australia
| | - Paul O Lewis
- Department of Ecology and Evolutionary Biology, University of Connecticut, Unit 3043, Storrs, CT 06269, USA
| | - David L Swofford
- Florida Museum of Natural History, University of Florida, Gainesville, FL 32611, USA
| | - John P Huelsenbeck
- Department of Integrative Biology, University of California, Berkeley, CA 94720 USA
| | - Philippe Lemey
- Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven – University of Leuven, 3000 Leuven, Belgium
| | - Andrew Rambaut
- Institute of Evolutionary Biology, University of Edinburgh, King’s Buildings, Edinburgh EH9 3FL, UK
- Fogarty International Center, National Institutes of Health, Bethesda, MD 20892, USA
| | - Marc A Suchard
- Department of Biomathematics University of California, Los Angeles, CA 90095, USA
- Department of Biostatistics, University of California, Los Angeles, CA 90095, USA
- Department of Human Genetics, University of California, Los Angeles, CA 90095, USA
| |
Collapse
|
7
|
Goodheart JA, Bleidißel S, Schillo D, Strong EE, Ayres DL, Preisfeld A, Collins AG, Cummings MP, Wägele H. Comparative morphology and evolution of the cnidosac in Cladobranchia (Gastropoda: Heterobranchia: Nudibranchia). Front Zool 2018; 15:43. [PMID: 30473719 PMCID: PMC6234619 DOI: 10.1186/s12983-018-0289-2] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2018] [Accepted: 10/30/2018] [Indexed: 11/10/2022] Open
Abstract
Background A number of shelled and shell-less gastropods are known to use multiple defensive mechanisms, including internally generated or externally obtained biochemically active compounds and structures. Within Nudipleura, nudibranchs within Cladobranchia possess such a special defense: the ability to sequester cnidarian nematocysts - small capsules that can inject venom into the tissues of other organisms. This ability is distributed across roughly 600 species within Cladobranchia, and many questions still remain in regard to the comparative morphology and evolution of the cnidosac - the structure that houses sequestered nematocysts (called kleptocnides). In this paper, we describe cnidosac morphology across the main groups of Cladobranchia in which it occurs, and place variation in its structure in a phylogenetic context to better understand the evolution of nematocyst sequestration. Results Overall, we find that the length, size and structure of the entrance to the cnidosac varies more than expected based on previous work, as does the structure of the exit, the musculature surrounding the cnidosac, and the position and orientation of the kleptocnides. The sequestration of nematocysts has originated at least twice within Cladobranchia based on the phylogeny presented here using 94 taxa and 409 genes. Conclusions The cnidosac is not homologous to cnidosac-like structures found in Hancockiidae. Additionally, the presence of a sac at the distal end of the digestive gland may have originated prior to the sequestration of nematocysts. This study provides a more complete picture of variation in, and evolution of, morphological characters associated with nematocyst sequestration in Cladobranchia.
Collapse
Affiliation(s)
- Jessica A Goodheart
- 1Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742 USA.,2NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013 USA.,3Department of Invertebrate Zoology, Smithsonian Institution, National Museum of Natural History, MRC 163, P.O. Box 37012, Washington, DC 20013-7012 USA.,4Present address: Marine Science Institute, University of California Santa Barbara, Santa Barbara, CA 93106 USA
| | - Sabrina Bleidißel
- 5Zoology and Didactics of Biology, University of Wuppertal, 42097 Wuppertal, Germany
| | - Dorothee Schillo
- 6Zoologisches Forschungsmuseum Alexander Koenig, 53113 Bonn, Germany
| | - Ellen E Strong
- 3Department of Invertebrate Zoology, Smithsonian Institution, National Museum of Natural History, MRC 163, P.O. Box 37012, Washington, DC 20013-7012 USA
| | - Daniel L Ayres
- 1Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742 USA
| | - Angelika Preisfeld
- 5Zoology and Didactics of Biology, University of Wuppertal, 42097 Wuppertal, Germany
| | - Allen G Collins
- 2NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013 USA
| | - Michael P Cummings
- 1Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742 USA
| | - Heike Wägele
- 6Zoologisches Forschungsmuseum Alexander Koenig, 53113 Bonn, Germany
| |
Collapse
|
8
|
Goodheart JA, Bazinet AL, Valdés Á, Collins AG, Cummings MP. Prey preference follows phylogeny: evolutionary dietary patterns within the marine gastropod group Cladobranchia (Gastropoda: Heterobranchia: Nudibranchia). BMC Evol Biol 2017; 17:221. [PMID: 29073890 PMCID: PMC5659023 DOI: 10.1186/s12862-017-1066-0] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 10/15/2017] [Indexed: 12/03/2022] Open
Abstract
Background The impact of predator-prey interactions on the evolution of many marine invertebrates is poorly understood. Since barriers to genetic exchange are less obvious in the marine realm than in terrestrial or freshwater systems, non-allopatric divergence may play a fundamental role in the generation of biodiversity. In this context, shifts between major prey types could constitute important factors explaining the biodiversity of marine taxa, particularly in groups with highly specialized diets. However, the scarcity of marine specialized consumers for which reliable phylogenies exist hampers attempts to test the role of trophic specialization in evolution. In this study, RNA-Seq data is used to produce a phylogeny of Cladobranchia, a group of marine invertebrates that feed on a diverse array of prey taxa but mostly specialize on cnidarians. The broad range of prey type preferences allegedly present in two major groups within Cladobranchia suggest that prey type shifts are relatively common over evolutionary timescales. Results In the present study, we generated a well-supported phylogeny of the major lineages within Cladobranchia using RNA-Seq data, and used ancestral state reconstruction analyses to better understand the evolution of prey preference. These analyses answered several fundamental questions regarding the evolutionary relationships within Cladobranchia, including support for a clade of species from Arminidae as sister to Tritoniidae (which both preferentially prey on Octocorallia). Ancestral state reconstruction analyses supported a cladobranchian ancestor with a preference for Hydrozoa and show that the few transitions identified only occur from lineages that prey on Hydrozoa to those that feed on other types of prey. Conclusions There is strong phylogenetic correlation with prey preference within Cladobranchia, suggesting that prey type specialization within this group has inertia. Shifts between different types of prey have occurred rarely throughout the evolution of Cladobranchia, indicating that this may not have been an important driver of the diversity within this group. Electronic supplementary material The online version of this article (10.1186/s12862-017-1066-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jessica A Goodheart
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA. .,NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC, 20013, USA.
| | - Adam L Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA.,Present address: National Biodefense Analysis and Countermeasures Center, 8300 Research Plaza, Fort Detrick, MD, 21702, USA
| | - Ángel Valdés
- Department of Biological Sciences, California State Polytechnic University, 3801 W Temple Ave, Pomona, CA, 91768, USA
| | - Allen G Collins
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC, 20013, USA
| | - Michael P Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742, USA
| |
Collapse
|
9
|
Agrawal S, Moser KA, Morton L, Cummings MP, Parihar A, Dwivedi A, Shetty AC, Drabek EF, Jacob CG, Henrich PP, Parobek CM, Jongsakul K, Huy R, Spring MD, Lanteri CA, Chaorattanakawee S, Lon C, Fukuda MM, Saunders DL, Fidock DA, Lin JT, Juliano JJ, Plowe CV, Silva JC, Takala-Harrison S. Association of a Novel Mutation in the Plasmodium falciparum Chloroquine Resistance Transporter With Decreased Piperaquine Sensitivity. J Infect Dis 2017; 216:468-476. [PMID: 28931241 DOI: 10.1093/infdis/jix334] [Citation(s) in RCA: 85] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2017] [Accepted: 07/12/2017] [Indexed: 12/30/2022] Open
Abstract
Background Amplified copy number in the plasmepsin II/III genes within Plasmodium falciparum has been associated with decreased sensitivity to piperaquine. To examine this association and test whether additional loci might also contribute, we performed a genome-wide association study of ex vivo P. falciparum susceptibility to piperaquine. Methods Plasmodium falciparum DNA from 183 samples collected primarily from Cambodia was genotyped at 33716 genome-wide single nucleotide polymorphisms (SNPs). Linear mixed models and random forests were used to estimate associations between parasite genotypes and piperaquine susceptibility. Candidate polymorphisms were evaluated for their association with dihydroartemisinin-piperaquine treatment outcomes in an independent dataset. Results Single nucleotide polymorphisms on multiple chromosomes were associated with piperaquine 90% inhibitory concentrations (IC90) in a genome-wide analysis. Fine-mapping of genomic regions implicated in genome-wide analyses identified multiple SNPs in linkage disequilibrium with each other that were significantly associated with piperaquine IC90, including a novel mutation within the gene encoding the P. falciparum chloroquine resistance transporter, PfCRT. This mutation (F145I) was associated with dihydroartemisinin-piperaquine treatment failure after adjusting for the presence of amplified plasmepsin II/III, which was also associated with decreased piperaquine sensitivity. Conclusions Our data suggest that, in addition to plasmepsin II/III copy number, other loci, including pfcrt, may also be involved in piperaquine resistance.
Collapse
Affiliation(s)
- Sonia Agrawal
- Division of Malaria Research, Institute for Global Health
| | - Kara A Moser
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | - Lindsay Morton
- Division of Malaria Research, Institute for Global Health
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park
| | - Ankita Parihar
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | - Ankit Dwivedi
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | - Amol C Shetty
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | - Elliott F Drabek
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | | | | | - Christian M Parobek
- Division of Infectious Diseases, University of North Carolina at Chapel Hill
| | - Krisada Jongsakul
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - Rekol Huy
- National Center for Parasitology Entomology and Malaria Control, Phnom Penh, Cambodia
| | - Michele D Spring
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - Charlotte A Lanteri
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - Suwanna Chaorattanakawee
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine.,Department of Parasitology and Entomology, Faculty of Public Health, Mahidol University, Bangkok, Thailand
| | - Chanthap Lon
- Armed Forces Research Institute of Medical Sciences.,Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - Mark M Fukuda
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - David L Saunders
- Armed Forces Research Institute of Medical Sciences, Department of Immunology and Medicine
| | - David A Fidock
- Department of Microbiology and Immunology.,Division of Infectious Diseases, Department of Medicine, Columbia University Medical Center, New York
| | - Jessica T Lin
- Division of Infectious Diseases, University of North Carolina at Chapel Hill
| | - Jonathan J Juliano
- Division of Infectious Diseases, University of North Carolina at Chapel Hill
| | | | - Joana C Silva
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | | |
Collapse
|
10
|
Abstract
Until recently, deep-level phylogeny in Lepidoptera, the largest single radiation of plant-feeding insects, was very poorly understood. Over the past two decades, building on a preceding era of morphological cladistic studies, molecular data have yielded robust initial estimates of relationships both within and among the ∼43 superfamilies, with unsolved problems now yielding to much larger data sets from high-throughput sequencing. Here we summarize progress on lepidopteran phylogeny since 1975, emphasizing the superfamily level, and discuss some resulting advances in our understanding of lepidopteran evolution.
Collapse
Affiliation(s)
- Charles Mitter
- Department of Entomology, University of Maryland, College Park, Maryland 20742;
| | - Donald R Davis
- Department of Entomology, National Museum of Natural History, Smithsonian Institution, Washington, DC 20560
| | - Michael P Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742
| |
Collapse
|
11
|
Huson DH, Tappu R, Bazinet AL, Xie C, Cummings MP, Nieselt K, Williams R. Fast and simple protein-alignment-guided assembly of orthologous gene families from microbiome sequencing reads. Microbiome 2017; 5:11. [PMID: 28122610 PMCID: PMC5267372 DOI: 10.1186/s40168-017-0233-2] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2016] [Accepted: 01/17/2017] [Indexed: 05/03/2023]
Abstract
BACKGROUND Microbiome sequencing projects typically collect tens of millions of short reads per sample. Depending on the goals of the project, the short reads can either be subjected to direct sequence analysis or be assembled into longer contigs. The assembly of whole genomes from metagenomic sequencing reads is a very difficult problem. However, for some questions, only specific genes of interest need to be assembled. This is then a gene-centric assembly where the goal is to assemble reads into contigs for a family of orthologous genes. METHODS We present a new method for performing gene-centric assembly, called protein-alignment-guided assembly, and provide an implementation in our metagenome analysis tool MEGAN. Genes are assembled on the fly, based on the alignment of all reads against a protein reference database such as NCBI-nr. Specifically, the user selects a gene family based on a classification such as KEGG and all reads binned to that gene family are assembled. RESULTS Using published synthetic community metagenome sequencing reads and a set of 41 gene families, we show that the performance of this approach compares favorably with that of full-featured assemblers and that of a recently published HMM-based gene-centric assembler, both in terms of the number of reference genes detected and of the percentage of reference sequence covered. CONCLUSIONS Protein-alignment-guided assembly of orthologous gene families complements whole-metagenome assembly in a new and very useful way.
Collapse
Affiliation(s)
- Daniel H Huson
- Center for Bioinformatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany.
- Life Sciences Institute, National University of Singapore, 28 Medical Drive, Singapore, 117456, Singapore.
| | - Rewati Tappu
- Center for Bioinformatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | - Adam L Bazinet
- Center for Bioinformatics and Computational Biology, University of Maryland, 8314 Paint Branch Drive, College Park, MD, 20742, USA
- National Biodefense Analysis and Countermeasures Center, 8300 Research Plaza, Fort Detrick, Frederick, MD, 21702, USA
| | - Chao Xie
- Human Longevity Inc., Singapore, Singapore
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, 8314 Paint Branch Drive, College Park, MD, 20742, USA
| | - Kay Nieselt
- Center for Bioinformatics, University of Tübingen, Sand 14, 72076, Tübingen, Germany
| | - Rohan Williams
- Life Sciences Institute, National University of Singapore, 28 Medical Drive, Singapore, 117456, Singapore
| |
Collapse
|
12
|
Goodheart JA, Bazinet AL, Collins AG, Cummings MP. Relationships within Cladobranchia (Gastropoda: Nudibranchia) based on RNA-Seq data: an initial investigation. R Soc Open Sci 2015; 2:150196. [PMID: 26473045 PMCID: PMC4593679 DOI: 10.1098/rsos.150196] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/07/2015] [Accepted: 08/26/2015] [Indexed: 05/28/2023]
Abstract
Cladobranchia (Gastropoda: Nudibranchia) is a diverse (approx. 1000 species) but understudied group of sea slug molluscs. In order to fully comprehend the diversity of nudibranchs and the evolution of character traits within Cladobranchia, a solid understanding of evolutionary relationships is necessary. To date, only two direct attempts have been made to understand the evolutionary relationships within Cladobranchia, neither of which resulted in well-supported phylogenetic hypotheses. In addition to these studies, several others have addressed some of the relationships within this clade while investigating the evolutionary history of more inclusive groups (Nudibranchia and Euthyneura). However, all of the resulting phylogenetic hypotheses contain conflicting topologies within Cladobranchia. In this study, we address some of these long-standing issues regarding the evolutionary history of Cladobranchia using RNA-Seq data (transcriptomes). We sequenced 16 transcriptomes and combined these with four transcriptomes from the NCBI Sequence Read Archive. Transcript assembly using Trinity and orthology determination using HaMStR yielded 839 orthologous groups for analysis. These data provide a well-supported and almost fully resolved phylogenetic hypothesis for Cladobranchia. Our results support the monophyly of Cladobranchia and the sub-clade Aeolidida, but reject the monophyly of Dendronotida.
Collapse
Affiliation(s)
- Jessica A. Goodheart
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013, USA
| | - Adam L. Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Allen G. Collins
- NMFS, National Systematics Laboratory, National Museum of Natural History, Smithsonian Institution, MRC-153, PO Box 37012, Washington, DC 20013, USA
| | - Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| |
Collapse
|
13
|
Brown TS, Jacob CG, Silva JC, Takala-Harrison S, Djimdé A, Dondorp AM, Fukuda M, Noedl H, Nyunt MM, Kyaw MP, Mayxay M, Hien TT, Plowe CV, Cummings MP. Plasmodium falciparum field isolates from areas of repeated emergence of drug resistant malaria show no evidence of hypermutator phenotype. Infect Genet Evol 2014; 30:318-322. [PMID: 25514047 DOI: 10.1016/j.meegid.2014.12.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/09/2014] [Revised: 12/04/2014] [Accepted: 12/07/2014] [Indexed: 11/30/2022]
Abstract
Multiple transcontinental waves of drug resistance in Plasmodium falciparum have originated in Southeast Asia before spreading westward, first into the rest of Asia and then to sub-Saharan Africa. In vitro studies have suggested that hypermutator P. falciparum parasites may exist in Southeast Asia and that an increased rate of acquisition of new mutations in these parasites may explain the repeated emergence of drug resistance in Southeast Asia. This study is the first to test the hypermutator hypothesis using field isolates. Using genome-wide SNP data from human P. falciparum infections in Southeast Asia and West Africa and a test for relative rate differences we found no evidence of increased relative substitution rates in P. falciparum isolates from Southeast Asia. Instead, we found significantly increased substitution rates in Mali and Bangladesh populations relative to those in populations from Southeast Asia. Additionally we found no association between increased relative substitution rates and parasite clearance following treatment with artemisinin derivatives.
Collapse
Affiliation(s)
- Tyler S Brown
- Johns Hopkins University School of Medicine, Baltimore, MD, USA; Howard Hughes Medical Institute/Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Christopher G Jacob
- Howard Hughes Medical Institute/Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Joana C Silva
- Institute for Genome Sciences and Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Shannon Takala-Harrison
- Howard Hughes Medical Institute/Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Abdoulaye Djimdé
- Malaria Research and Training Center, Department of Epidemiology of Parasitic Diseases, Faculty of Medicine and Pharmacy, University of Science, Techniques and Technology of Bamako, Bamako, Mali
| | - Arjen M Dondorp
- Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand; Center for Tropical Medicine, Nuffield Department of Medicine, Churchill Hospital, University of Oxford, Oxford, United Kingdom
| | - Mark Fukuda
- Armed Forces Research Institute of Medical Sciences, Bangkok, Thailand
| | - Harald Noedl
- Institute of Specific Prophylaxis and Tropical Medicine, Medical University of Vienna, Austria and Malaria Research Initiative Bandarban, Bandarban, Bangladesh
| | - Myaing Myaing Nyunt
- Howard Hughes Medical Institute/Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Myat Phone Kyaw
- Department of Medical Research (Lower Myanmar), Yangon, Myanmar
| | - Mayfong Mayxay
- Lao-Oxford-Mahosot Hospital-Wellcome Trust Research Unit (LOMWRU), Microbiology Laboratory, Mahosot Hospital, Vientiane, Lao Democratic People's Republic; Faculty of Postgraduate Studies, University of Health Sciences, Vientiane, Lao Democratic People's Republic
| | - Tran Tinh Hien
- Centre for Tropical Medicine Oxford University Clinical Research Unit Vietnam (OUCRU), Ho Chi Minh City, Viet Nam
| | - Christopher V Plowe
- Howard Hughes Medical Institute/Center for Vaccine Development, University of Maryland School of Medicine, Baltimore, MD, USA
| | - Michael P Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA.
| |
Collapse
|
14
|
Takala-Harrison S, Jacob CG, Arze C, Cummings MP, Silva JC, Dondorp AM, Fukuda MM, Hien TT, Mayxay M, Noedl H, Nosten F, Kyaw MP, Nhien NTT, Imwong M, Bethell D, Se Y, Lon C, Tyner SD, Saunders DL, Ariey F, Mercereau-Puijalon O, Menard D, Newton PN, Khanthavong M, Hongvanthong B, Starzengruber P, Fuehrer HP, Swoboda P, Khan WA, Phyo AP, Nyunt MM, Nyunt MH, Brown TS, Adams M, Pepin CS, Bailey J, Tan JC, Ferdig MT, Clark TG, Miotto O, MacInnis B, Kwiatkowski DP, White NJ, Ringwald P, Plowe CV. Independent emergence of artemisinin resistance mutations among Plasmodium falciparum in Southeast Asia. J Infect Dis 2014; 211:670-9. [PMID: 25180241 DOI: 10.1093/infdis/jiu491] [Citation(s) in RCA: 325] [Impact Index Per Article: 32.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND The emergence of artemisinin-resistant Plasmodium falciparum in Southeast Asia threatens malaria treatment efficacy. Mutations in a kelch protein encoded on P. falciparum chromosome 13 (K13) have been associated with resistance in vitro and in field samples from Cambodia. METHODS P. falciparum infections from artesunate efficacy trials in Bangladesh, Cambodia, Laos, Myanmar, and Vietnam were genotyped at 33 716 genome-wide single-nucleotide polymorphisms (SNPs). Linear mixed models were used to test associations between parasite genotypes and parasite clearance half-lives following artesunate treatment. K13 mutations were tested for association with artemisinin resistance, and extended haplotypes on chromosome 13 were examined to determine whether mutations arose focally and spread or whether they emerged independently. RESULTS The presence of nonreference K13 alleles was associated with prolonged parasite clearance half-life (P = 1.97 × 10(-12)). Parasites with a mutation in any of the K13 kelch domains displayed longer parasite clearance half-lives than parasites with wild-type alleles. Haplotype analysis revealed both population-specific emergence of mutations and independent emergence of the same mutation in different geographic areas. CONCLUSIONS K13 appears to be a major determinant of artemisinin resistance throughout Southeast Asia. While we found some evidence of spreading resistance, there was no evidence of resistance moving westward from Cambodia into Myanmar.
Collapse
Affiliation(s)
| | | | - Cesar Arze
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park
| | - Joana C Silva
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore
| | | | - Mark M Fukuda
- Armed Forces Research Institute of Medical Sciences, Bangkok
| | - Tran Tinh Hien
- Center for Tropical Medicine, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
| | - Mayfong Mayxay
- Lao-Oxford-Mahosot Hospital-Wellcome Trust Research Unit, Microbiology Laboratory, Mahosot Hospital Faculty of Postgraduate Studies, University of Health Sciences Centre for Tropical Medicine, Nuffield Department of Medicine, University of Oxford
| | - Harald Noedl
- Institute of Specific Prophylaxis and Tropical Medicine, Medical University of Vienna, Austria
| | - Francois Nosten
- Mahidol-Oxford Tropical Medicine Research Unit Centre for Tropical Medicine, Nuffield Department of Medicine, University of Oxford
| | - Myat P Kyaw
- Department of Medical Research (Lower Myanmar), Yangon
| | - Nguyen Thanh Thuy Nhien
- Center for Tropical Medicine, Oxford University Clinical Research Unit, Ho Chi Minh City, Vietnam
| | - Mallika Imwong
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University
| | - Delia Bethell
- Armed Forces Research Institute of Medical Sciences, Bangkok
| | - Youry Se
- Armed Forces Research Institute of Medical Sciences
| | - Chanthap Lon
- Armed Forces Research Institute of Medical Sciences
| | - Stuart D Tyner
- Armed Forces Research Institute of Medical Sciences, Bangkok
| | | | | | | | - Didier Menard
- Malaria Molecular Epidemiology Unit, Institut Pasteur du Cambodge, Phnom Penh, Cambodia
| | - Paul N Newton
- Lao-Oxford-Mahosot Hospital-Wellcome Trust Research Unit, Microbiology Laboratory, Mahosot Hospital Centre for Tropical Medicine, Nuffield Department of Medicine, University of Oxford
| | | | | | - Peter Starzengruber
- Institute of Specific Prophylaxis and Tropical Medicine, Medical University of Vienna, Austria
| | - Hans-Peter Fuehrer
- Institute of Specific Prophylaxis and Tropical Medicine, Medical University of Vienna, Austria
| | - Paul Swoboda
- Institute of Specific Prophylaxis and Tropical Medicine, Medical University of Vienna, Austria
| | - Wasif A Khan
- International Centre for Diarrhoeal Disease Research, Dhaka, Bangladesh
| | - Aung Pyae Phyo
- Mahidol-Oxford Tropical Medicine Research Unit Shoklo Malaria Research Unit Mahidol-Oxford Tropical Medicine Research Unit, Faculty of Tropical Medicine, Mahidol University, Mae Sot, Thailand
| | - Myaing M Nyunt
- Howard Hughes Medical Institute/Center for Vaccine Development
| | - Myat H Nyunt
- Department of Medical Research (Lower Myanmar), Yangon
| | - Tyler S Brown
- Howard Hughes Medical Institute/Center for Vaccine Development
| | - Matthew Adams
- Howard Hughes Medical Institute/Center for Vaccine Development
| | | | - Jason Bailey
- Howard Hughes Medical Institute/Center for Vaccine Development
| | - John C Tan
- Research and Development, Roche NimbleGen, Madison, Wisconsin
| | - Michael T Ferdig
- Eck Institute for Global Health, Department of Biological Sciences, University of Notre Dame, Indiana
| | - Taane G Clark
- Faculty of Epidemiology and Population Health Faculty Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine
| | - Olivo Miotto
- Mahidol-Oxford Tropical Medicine Research Unit MRC Centre for Genomics and Global Health, Oxford University and Wellcome Trust Sanger Institute Malaria Programme, Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Bronwyn MacInnis
- Malaria Programme, Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | - Dominic P Kwiatkowski
- MRC Centre for Genomics and Global Health, Oxford University and Wellcome Trust Sanger Institute Malaria Programme, Wellcome Trust Sanger Institute, Hinxton, United Kingdom
| | | | - Pascal Ringwald
- Drug Resistance and Containment Unit, Global Malaria Programme, World Health Organization, Geneva, Switzerland
| | | |
Collapse
|
15
|
Abstract
We introduce molecularevolution.org, a publicly available gateway for high-throughput, maximum-likelihood phylogenetic analysis powered by grid computing. The gateway features a garli 2.0 web service that enables a user to quickly and easily submit thousands of maximum likelihood tree searches or bootstrap searches that are executed in parallel on distributed computing resources. The garli web service allows one to easily specify partitioned substitution models using a graphical interface, and it performs sophisticated post-processing of phylogenetic results. Although the garli web service has been used by the research community for over three years, here we formally announce the availability of the service, describe its capabilities, highlight new features and recent improvements, and provide details about how the grid system efficiently delivers high-quality phylogenetic results. [garli, gateway, grid computing, maximum likelihood, molecular evolution portal, phylogenetics, web service.]
Collapse
Affiliation(s)
- Adam L Bazinet
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742-3360, USA, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721-0088, USA
| | - Derrick J Zwickl
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742-3360, USA, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721-0088, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, 20742-3360, USA, and Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, AZ, 85721-0088, USA
| |
Collapse
|
16
|
Bazinet AL, Cummings MP, Mitter KT, Mitter CW. Can RNA-Seq resolve the rapid radiation of advanced moths and butterflies (Hexapoda: Lepidoptera: Apoditrysia)? An exploratory study. PLoS One 2013; 8:e82615. [PMID: 24324810 PMCID: PMC3853519 DOI: 10.1371/journal.pone.0082615] [Citation(s) in RCA: 49] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2013] [Accepted: 10/23/2013] [Indexed: 11/19/2022] Open
Abstract
Recent molecular phylogenetic studies of the insect order Lepidoptera have robustly resolved family-level divergences within most superfamilies, and most divergences among the relatively species-poor early-arising superfamilies. In sharp contrast, relationships among the superfamilies of more advanced moths and butterflies that comprise the mega-diverse clade Apoditrysia (ca. 145,000 spp.) remain mostly poorly supported. This uncertainty, in turn, limits our ability to discern the origins, ages and evolutionary consequences of traits hypothesized to promote the spectacular diversification of Apoditrysia. Low support along the apoditrysian "backbone" probably reflects rapid diversification. If so, it may be feasible to strengthen resolution by radically increasing the gene sample, but case studies have been few. We explored the potential of next-generation sequencing to conclusively resolve apoditrysian relationships. We used transcriptome RNA-Seq to generate 1579 putatively orthologous gene sequences across a broad sample of 40 apoditrysians plus four outgroups, to which we added two taxa from previously published data. Phylogenetic analysis of a 46-taxon, 741-gene matrix, resulting from a strict filter that eliminated ortholog groups containing any apparent paralogs, yielded dramatic overall increase in bootstrap support for deeper nodes within Apoditrysia as compared to results from previous and concurrent 19-gene analyses. High support was restricted mainly to the huge subclade Obtectomera broadly defined, in which 11 of 12 nodes subtending multiple superfamilies had bootstrap support of 100%. The strongly supported nodes showed little conflict with groupings from previous studies, and were little affected by changes in taxon sampling, suggesting that they reflect true signal rather than artifacts of massive gene sampling. In contrast, strong support was seen at only 2 of 11 deeper nodes among the "lower", non-obtectomeran apoditrysians. These represent a much harder phylogenetic problem, for which one path to resolution might include further increase in gene sampling, together with improved orthology assignments.
Collapse
Affiliation(s)
- Adam L. Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Kim T. Mitter
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
| | - Charles W. Mitter
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
| |
Collapse
|
17
|
Regier JC, Mitter C, Zwick A, Bazinet AL, Cummings MP, Kawahara AY, Sohn JC, Zwickl DJ, Cho S, Davis DR, Baixeras J, Brown J, Parr C, Weller S, Lees DC, Mitter KT. A large-scale, higher-level, molecular phylogenetic study of the insect order Lepidoptera (moths and butterflies). PLoS One 2013; 8:e58568. [PMID: 23554903 PMCID: PMC3595289 DOI: 10.1371/journal.pone.0058568] [Citation(s) in RCA: 166] [Impact Index Per Article: 15.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2012] [Accepted: 02/05/2013] [Indexed: 01/22/2023] Open
Abstract
Background Higher-level relationships within the Lepidoptera, and particularly within the species-rich subclade Ditrysia, are generally not well understood, although recent studies have yielded progress. We present the most comprehensive molecular analysis of lepidopteran phylogeny to date, focusing on relationships among superfamilies. Methodology / Principal Findings 483 taxa spanning 115 of 124 families were sampled for 19 protein-coding nuclear genes, from which maximum likelihood tree estimates and bootstrap percentages were obtained using GARLI. Assessment of heuristic search effectiveness showed that better trees and higher bootstrap percentages probably remain to be discovered even after 1000 or more search replicates, but further search proved impractical even with grid computing. Other analyses explored the effects of sampling nonsynonymous change only versus partitioned and unpartitioned total nucleotide change; deletion of rogue taxa; and compositional heterogeneity. Relationships among the non-ditrysian lineages previously inferred from morphology were largely confirmed, plus some new ones, with strong support. Robust support was also found for divergences among non-apoditrysian lineages of Ditrysia, but only rarely so within Apoditrysia. Paraphyly for Tineoidea is strongly supported by analysis of nonsynonymous-only signal; conflicting, strong support for tineoid monophyly when synonymous signal was added back is shown to result from compositional heterogeneity. Conclusions / Significance Support for among-superfamily relationships outside the Apoditrysia is now generally strong. Comparable support is mostly lacking within Apoditrysia, but dramatically increased bootstrap percentages for some nodes after rogue taxon removal, and concordance with other evidence, strongly suggest that our picture of apoditrysian phylogeny is approximately correct. This study highlights the challenge of finding optimal topologies when analyzing hundreds of taxa. It also shows that some nodes get strong support only when analysis is restricted to nonsynonymous change, while total change is necessary for strong support of others. Thus, multiple types of analyses will be necessary to fully resolve lepidopteran phylogeny.
Collapse
Affiliation(s)
- Jerome C. Regier
- Institute for Bioscience and Biotechnology Research, University of Maryland, College Park, Maryland, United States of America
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
- * E-mail: (JCR); (CM)
| | - Charles Mitter
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
- * E-mail: (JCR); (CM)
| | - Andreas Zwick
- Entomology, State Museum of Natural History, Stuttgart, Germany
| | - Adam L. Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Akito Y. Kawahara
- Florida Museum of Natural History, Gainesville, Florida, United States of America
| | - Jae-Cheon Sohn
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
| | - Derrick J. Zwickl
- Department of Ecology and Evolutionary Biology, University of Arizona, Tucson, Arizona, United States of America
| | - Soowon Cho
- Department of Plant Medicine, Chungbuk National University, Cheongju, Korea
| | - Donald R. Davis
- Department of Entomology, Smithsonian Institution, Washington, District of Columbia, United States of America
| | - Joaquin Baixeras
- Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Valencia, Spain
| | - John Brown
- Systematic Entomology Lab, Agricultural Research Service, United States Department of Agriculture, Beltsville, Maryland, United States of America
| | - Cynthia Parr
- Encyclopedia of Life, Smithsonian Institution, Washington, District of Columbia, United States of America
| | - Susan Weller
- Department of Entomology, University of Minnesota, Saint Paul, Minnesota, United States of America
| | - David C. Lees
- Department of Life Sciences, Natural History Museum, London, England
| | - Kim T. Mitter
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
| |
Collapse
|
18
|
Sohn JC, Regier JC, Mitter C, Davis D, Landry JF, Zwick A, Cummings MP. A molecular phylogeny for yponomeutoidea (insecta, Lepidoptera, ditrysia) and its implications for classification, biogeography and the evolution of host plant use. PLoS One 2013; 8:e55066. [PMID: 23383061 PMCID: PMC3561450 DOI: 10.1371/journal.pone.0055066] [Citation(s) in RCA: 36] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2012] [Accepted: 12/18/2012] [Indexed: 11/19/2022] Open
Abstract
BACKGROUND Yponomeutoidea, one of the early-diverging lineages of ditrysian Lepidoptera, comprise about 1,800 species worldwide, including notable pests and insect-plant interaction models. Yponomeutoids were one of the earliest lepidopteran clades to evolve external feeding and to extensively colonize herbaceous angiosperms. Despite the group's economic importance, and its value for tracing early lepidopteran evolution, the biodiversity and phylogeny of Yponomeutoidea have been relatively little studied. METHODOLOGY/PRINCIPAL FINDINGS Eight nuclear genes (8 kb) were initially sequenced for 86 putative yponomeutoid species, spanning all previously recognized suprageneric groups, and 53 outgroups representing 22 families and 12 superfamilies. Eleven to 19 additional genes, yielding a total of 14.8 to 18.9 kb, were then sampled for a subset of taxa, including 28 yponomeutoids and 43 outgroups. Maximum likelihood analyses were conducted on data sets differing in numbers of genes, matrix completeness, inclusion/weighting of synonymous substitutions, and inclusion/exclusion of "rogue" taxa. Monophyly for Yponomeutoidea was supported very strongly when the 18 "rogue" taxa were excluded, and moderately otherwise. Results from different analyses are highly congruent and relationships within Yponomeutoidea are well supported overall. There is strong support overall for monophyly of families previously recognized on morphological grounds, including Yponomeutidae, Ypsolophidae, Plutellidae, Glyphipterigidae, Argyresthiidae, Attevidae, Praydidae, Heliodinidae, and Bedelliidae. We also assign family rank to Scythropiinae (Scythropiidae stat. rev.), which in our trees are strongly grouped with Bedelliidae, in contrast to all previous proposals. We present a working hypothesis of among-family relationships, and an informal higher classification. Host plant family associations of yponomeutoid subfamilies and families are non-random, but show no trends suggesting parallel phylogenesis. Our analyses suggest that previous characterizations of yponomeutoids as predominantly Holarctic were based on insufficient sampling. CONCLUSIONS/SIGNIFICANCE We provide the first robust molecular phylogeny for Yponomeutoidea, together with a revised classification and new insights into their life history evolution and biogeography.
Collapse
Affiliation(s)
- Jae-Cheon Sohn
- Department of Entomology, University of Maryland, College Park, MD, USA.
| | | | | | | | | | | | | |
Collapse
|
19
|
Abstract
Background A fundamental problem in modern genomics is to taxonomically or functionally classify DNA sequence fragments derived from environmental sampling (i.e., metagenomics). Several different methods have been proposed for doing this effectively and efficiently, and many have been implemented in software. In addition to varying their basic algorithmic approach to classification, some methods screen sequence reads for ’barcoding genes’ like 16S rRNA, or various types of protein-coding genes. Due to the sheer number and complexity of methods, it can be difficult for a researcher to choose one that is well-suited for a particular analysis. Results We divided the very large number of programs that have been released in recent years for solving the sequence classification problem into three main categories based on the general algorithm they use to compare a query sequence against a database of sequences. We also evaluated the performance of the leading programs in each category on data sets whose taxonomic and functional composition is known. Conclusions We found significant variability in classification accuracy, precision, and resource consumption of sequence classification programs when used to analyze various metagenomics data sets. However, we observe some general trends and patterns that will be useful to researchers who use sequence classification programs.
Collapse
Affiliation(s)
- Adam L Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20874, USA.
| | | |
Collapse
|
20
|
Regier JC, Brown JW, Mitter C, Baixeras J, Cho S, Cummings MP, Zwick A. A molecular phylogeny for the leaf-roller moths (Lepidoptera: Tortricidae) and its implications for classification and life history evolution. PLoS One 2012; 7:e35574. [PMID: 22536410 PMCID: PMC3334928 DOI: 10.1371/journal.pone.0035574] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2011] [Accepted: 03/20/2012] [Indexed: 01/08/2023] Open
Abstract
BACKGROUND Tortricidae, one of the largest families of microlepidopterans, comprise about 10,000 described species worldwide, including important pests, biological control agents and experimental models. Understanding of tortricid phylogeny, the basis for a predictive classification, is currently provisional. We present the first detailed molecular estimate of relationships across the tribes and subfamilies of Tortricidae, assess its concordance with previous morphological evidence, and re-examine postulated evolutionary trends in host plant use and biogeography. METHODOLOGY/PRINCIPAL FINDINGS We sequenced up to five nuclear genes (6,633 bp) in each of 52 tortricids spanning all three subfamilies and 19 of the 22 tribes, plus up to 14 additional genes, for a total of 14,826 bp, in 29 of those taxa plus all 14 outgroup taxa. Maximum likelihood analyses yield trees that, within Tortricidae, differ little among data sets and character treatments and are nearly always strongly supported at all levels of divergence. Support for several nodes was greatly increased by the additional 14 genes sequenced in just 29 of 52 tortricids, with no evidence of phylogenetic artifacts from deliberately incomplete gene sampling. There is strong support for the monophyly of Tortricinae and of Olethreutinae, and for grouping of these to the exclusion of Chlidanotinae. Relationships among tribes are robustly resolved in Tortricinae and mostly so in Olethreutinae. Feeding habit (internal versus external) is strongly conserved on the phylogeny. Within Tortricinae, a clade characterized by eggs being deposited in large clusters, in contrast to singly or in small batches, has markedly elevated incidence of polyphagous species. The five earliest-branching tortricid lineages are all species-poor tribes with mainly southern/tropical distributions, consistent with a hypothesized Gondwanan origin for the family. CONCLUSIONS/SIGNIFICANCE We present the first robustly supported phylogeny for Tortricidae, and a revised classification in which all of the sampled tribes are now monophyletic.
Collapse
Affiliation(s)
- Jerome C. Regier
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
- Institute for Bioscience and Biotechnology Research, College Park, Maryland, United States of America
| | - John W. Brown
- Systematic Entomology Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, Maryland, United States of America
| | - Charles Mitter
- Department of Entomology, University of Maryland, College Park, Maryland, United States of America
- * E-mail:
| | - Joaquín Baixeras
- Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Valencia, Spain
| | - Soowon Cho
- Department of Plant Medicine, Chungbuk National University, Cheongju, Korea
| | - Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland, United States of America
| | - Andreas Zwick
- Department of Entomology, State Museum of Natural History, Stuttgart, Germany
| |
Collapse
|
21
|
Cho S, Zwick A, Regier JC, Mitter C, Cummings MP, Yao J, Du Z, Zhao H, Kawahara AY, Weller S, Davis DR, Baixeras J, Brown JW, Parr C. Can deliberately incomplete gene sample augmentation improve a phylogeny estimate for the advanced moths and butterflies (Hexapoda: Lepidoptera)? Syst Biol 2011; 60:782-96. [PMID: 21840842 PMCID: PMC3193767 DOI: 10.1093/sysbio/syr079] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2009] [Revised: 03/04/2010] [Accepted: 04/12/2011] [Indexed: 11/15/2022] Open
Abstract
This paper addresses the question of whether one can economically improve the robustness of a molecular phylogeny estimate by increasing gene sampling in only a subset of taxa, without having the analysis invalidated by artifacts arising from large blocks of missing data. Our case study stems from an ongoing effort to resolve poorly understood deeper relationships in the large clade Ditrysia ( > 150,000 species) of the insect order Lepidoptera (butterflies and moths). Seeking to remedy the overall weak support for deeper divergences in an initial study based on five nuclear genes (6.6 kb) in 123 exemplars, we nearly tripled the total gene sample (to 26 genes, 18.4 kb) but only in a third (41) of the taxa. The resulting partially augmented data matrix (45% intentionally missing data) consistently increased bootstrap support for groupings previously identified in the five-gene (nearly) complete matrix, while introducing no contradictory groupings of the kind that missing data have been predicted to produce. Our results add to growing evidence that data sets differing substantially in gene and taxon sampling can often be safely and profitably combined. The strongest overall support for nodes above the family level came from including all nucleotide changes, while partitioning sites into sets undergoing mostly nonsynonymous versus mostly synonymous change. In contrast, support for the deepest node for which any persuasive molecular evidence has yet emerged (78-85% bootstrap) was weak or nonexistent unless synonymous change was entirely excluded, a result plausibly attributed to compositional heterogeneity. This node (Gelechioidea + Apoditrysia), tentatively proposed by previous authors on the basis of four morphological synapomorphies, is the first major subset of ditrysian superfamilies to receive strong statistical support in any phylogenetic study. A "more-genes-only" data set (41 taxa×26 genes) also gave strong signal for a second deep grouping (Macrolepidoptera) that was obscured, but not strongly contradicted, in more taxon-rich analyses.
Collapse
Affiliation(s)
- Soowon Cho
- Department of Entomology, University of Maryland, College Park, MD 20742, USA
- Present address: Department of Plant Medicine, Chungbuk National University, Cheongju, Korea
| | - Andreas Zwick
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
| | - Jerome C. Regier
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
| | - Charles Mitter
- Department of Entomology, University of Maryland, College Park, MD 20742, USA
| | - Michael P. Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA
| | - Jianxiu Yao
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
- Present address: Department of Entomology, Kansas State University, Manhattan, KS 66506, USA
| | - Zaile Du
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
| | - Hong Zhao
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, MD 20742, USA
| | - Akito Y. Kawahara
- Department of Entomology, University of Maryland, College Park, MD 20742, USA
| | - Susan Weller
- Department of Entomology, University of Minnesota, Saint Paul, MN 55108, USA
| | - Donald R. Davis
- Department of Entomology, Smithsonian Institution, Washington, DC 20560, USA
| | - Joaquin Baixeras
- Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Valencia, Spain
| | - John W. Brown
- Systematic Entomology Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, MD 20705, USA
| | - Cynthia Parr
- Encyclopedia of Life, Smithsonian Institution, Washington, DC 20560, USA
| |
Collapse
|
22
|
Ayres DL, Darling A, Zwickl DJ, Beerli P, Holder MT, Lewis PO, Huelsenbeck JP, Ronquist F, Swofford DL, Cummings MP, Rambaut A, Suchard MA. BEAGLE: an application programming interface and high-performance computing library for statistical phylogenetics. Syst Biol 2011; 61:170-3. [PMID: 21963610 PMCID: PMC3243739 DOI: 10.1093/sysbio/syr100] [Citation(s) in RCA: 366] [Impact Index Per Article: 28.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Phylogenetic inference is fundamental to our understanding of most aspects of the origin and evolution of life, and in recent years, there has been a concentration of interest in statistical approaches such as Bayesian inference and maximum likelihood estimation. Yet, for large data sets and realistic or interesting models of evolution, these approaches remain computationally demanding. High-throughput sequencing can yield data for thousands of taxa, but scaling to such problems using serial computing often necessitates the use of nonstatistical or approximate approaches. The recent emergence of graphics processing units (GPUs) provides an opportunity to leverage their excellent floating-point computational performance to accelerate statistical phylogenetic inference. A specialized library for phylogenetic calculation would allow existing software packages to make more effective use of available computer hardware, including GPUs. Adoption of a common library would also make it easier for other emerging computing architectures, such as field programmable gate arrays, to be used in the future. We present BEAGLE, an application programming interface (API) and library for high-performance statistical phylogenetic inference. The API provides a uniform interface for performing phylogenetic likelihood calculations on a variety of compute hardware platforms. The library includes a set of efficient implementations and can currently exploit hardware including GPUs using NVIDIA CUDA, central processing units (CPUs) with Streaming SIMD Extensions and related processor supplementary instruction sets, and multicore CPUs via OpenMP. To demonstrate the advantages of a common API, we have incorporated the library into several popular phylogenetic software packages. The BEAGLE library is free open source software licensed under the Lesser GPL and available from http://beagle-lib.googlecode.com. An example client program is available as public domain software.
Collapse
Affiliation(s)
- Daniel L Ayres
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
23
|
Kawahara AY, Ohshima I, Kawakita A, Regier JC, Mitter C, Cummings MP, Davis DR, Wagner DL, De Prins J, Lopez-Vaamonde C. Increased gene sampling strengthens support for higher-level groups within leaf-mining moths and relatives (Lepidoptera: Gracillariidae). BMC Evol Biol 2011; 11:182. [PMID: 21702958 PMCID: PMC3145599 DOI: 10.1186/1471-2148-11-182] [Citation(s) in RCA: 28] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2010] [Accepted: 06/24/2011] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND Researchers conducting molecular phylogenetic studies are frequently faced with the decision of what to do when weak branch support is obtained for key nodes of importance. As one solution, the researcher may choose to sequence additional orthologous genes of appropriate evolutionary rate for the taxa in the study. However, generating large, complete data matrices can become increasingly difficult as the number of characters increases. A few empirical studies have shown that augmenting genes even for a subset of taxa can improve branch support. However, because each study differs in the number of characters and taxa, there is still a need for additional studies that examine whether incomplete sampling designs are likely to aid at increasing deep node resolution. We target Gracillariidae, a Cretaceous-age (~100 Ma) group of leaf-mining moths to test whether the strategy of adding genes for a subset of taxa can improve branch support for deep nodes. We initially sequenced ten genes (8,418 bp) for 57 taxa that represent the major lineages of Gracillariidae plus outgroups. After finding that many deep divergences remained weakly supported, we sequenced eleven additional genes (6,375 bp) for a 27-taxon subset. We then compared results from different data sets to assess whether one sampling design can be favored over another. The concatenated data set comprising all genes and all taxa and three other data sets of different taxon and gene sub-sampling design were analyzed with maximum likelihood. Each data set was subject to five different models and partitioning schemes of non-synonymous and synonymous changes. Statistical significance of non-monophyly was examined with the Approximately Unbiased (AU) test. RESULTS Partial augmentation of genes led to high support for deep divergences, especially when non-synonymous changes were analyzed alone. Increasing the number of taxa without an increase in number of characters led to lower bootstrap support; increasing the number of characters without increasing the number of taxa generally increased bootstrap support. More than three-quarters of nodes were supported with bootstrap values greater than 80% when all taxa and genes were combined. Gracillariidae, Lithocolletinae + Leucanthiza, and Acrocercops and Parectopa groups were strongly supported in nearly every analysis. Gracillaria group was well supported in some analyses, but less so in others. We find strong evidence for the exclusion of Douglasiidae from Gracillarioidea sensu Davis and Robinson (1998). Our results strongly support the monophyly of a G.B.R.Y. clade, a group comprised of Gracillariidae + Bucculatricidae + Roeslerstammiidae + Yponomeutidae, when analyzed with non-synonymous changes only, but this group was frequently split when synonymous and non-synonymous substitutions were analyzed together. CONCLUSIONS 1) Partially or fully augmenting a data set with more characters increased bootstrap support for particular deep nodes, and this increase was dramatic when non-synonymous changes were analyzed alone. Thus, the addition of sites that have low levels of saturation and compositional heterogeneity can greatly improve results. 2) Gracillarioidea, as defined by Davis and Robinson (1998), clearly do not include Douglasiidae, and changes to current classification will be required. 3) Gracillariidae were monophyletic in all analyses conducted, and nearly all species can be placed into one of six strongly supported clades though relationships among these remain unclear. 4) The difficulty in determining the phylogenetic placement of Bucculatricidae is probably attributable to compositional heterogeneity at the third codon position. From our tests for compositional heterogeneity and strong bootstrap values obtained when synonymous changes are excluded, we tentatively conclude that Bucculatricidae is closely related to Gracillariidae + Roeslerstammiidae + Yponomeutidae.
Collapse
Affiliation(s)
- Akito Y Kawahara
- Department of Entomology, University of Maryland, College Park, MD, USA
| | - Issei Ohshima
- Division of Evolutionary Biology, National Institute for Basic Biology, Okazaki, Japan
| | | | - Jerome C Regier
- Institute for Bioscience and Biotechnology Research, University of Maryland, College Park, MD, USA
| | - Charles Mitter
- Department of Entomology, University of Maryland, College Park, MD, USA
| | - Michael P Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD, USA
| | - Donald R Davis
- Department of Entomology, Smithsonian Institution, Washington, D.C., USA
| | - David L Wagner
- Department of Ecology & Evolutionary Biology, University of Connecticut, Storrs, CT, USA
| | | | | |
Collapse
|
24
|
Abstract
The major opportunities for broader incorporation of bioinformatics in education can be placed into three general categories: general applicability of bioinformatics in life science and related curricula; inherent fit of bioinformatics for promoting student learning in most biology programs; and the general experience and associated comfort students have with computers and technology. Conversely, the major challenges for broader incorporation of bioinformatics in education can be placed into three general categories: required infrastructure and logistics; instructor knowledge of bioinformatics and continuing education; and the breadth of bioinformatics, and the diversity of students and educational objectives. Broader incorporation of bioinformatics at all education levels requires overcoming the challenges to using transformative computer-requiring learning activities, assisting faculty in collecting assessment data on mastery of student learning outcomes, as well as creating more faculty development opportunities that span diverse skill levels, with an emphasis placed on providing resource materials that are kept up-to-date as the field and tools change.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, Biomolecular Sciences Building, University of Maryland, College Park, MD 20742, USA.
| | | |
Collapse
|
25
|
Takala SL, Coulibaly D, Thera MA, Batchelor AH, Cummings MP, Escalante AA, Ouattara A, Traoré K, Niangaly A, Djimdé AA, Doumbo OK, Plowe CV. Extreme polymorphism in a vaccine antigen and risk of clinical malaria: implications for vaccine development. Sci Transl Med 2010; 1:2ra5. [PMID: 20165550 DOI: 10.1126/scitranslmed.3000257] [Citation(s) in RCA: 138] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
Vaccines directed against the blood stages of Plasmodium falciparum malaria are intended to prevent the parasite from invading and replicating within host cells. No blood-stage malaria vaccine has shown clinical efficacy in humans. Most malaria vaccine antigens are parasite surface proteins that have evolved extensive genetic diversity, and this diversity could allow malaria parasites to escape vaccine-induced immunity. We examined the extent and within-host dynamics of genetic diversity in the blood-stage malaria vaccine antigen apical membrane antigen-1 in a longitudinal study in Mali. Two hundred and fourteen unique apical membrane antigen-1 haplotypes were identified among 506 human infections, and amino acid changes near a putative invasion machinery binding site were strongly associated with the development of clinical symptoms, suggesting that these residues may be important to consider in designing polyvalent apical membrane antigen-1 vaccines and in assessing vaccine efficacy in field trials. This extreme diversity may pose a serious obstacle to an effective polyvalent recombinant subunit apical membrane antigen-1 vaccine.
Collapse
Affiliation(s)
- Shannon L Takala
- Howard Hughes Medical Institute and Center for Vaccine Development, University of Maryland School of Medicine, 685 West Baltimore Street, Baltimore, MD 21201, USA
| | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
26
|
Regier JC, Zwick A, Cummings MP, Kawahara AY, Cho S, Weller S, Roe A, Baixeras J, Brown JW, Parr C, Davis DR, Epstein M, Hallwachs W, Hausmann A, Janzen DH, Kitching IJ, Solis MA, Yen SH, Bazinet AL, Mitter C. Toward reconstructing the evolution of advanced moths and butterflies (Lepidoptera: Ditrysia): an initial molecular study. BMC Evol Biol 2009; 9:280. [PMID: 19954545 PMCID: PMC2796670 DOI: 10.1186/1471-2148-9-280] [Citation(s) in RCA: 124] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2009] [Accepted: 12/02/2009] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In the mega-diverse insect order Lepidoptera (butterflies and moths; 165,000 described species), deeper relationships are little understood within the clade Ditrysia, to which 98% of the species belong. To begin addressing this problem, we tested the ability of five protein-coding nuclear genes (6.7 kb total), and character subsets therein, to resolve relationships among 123 species representing 27 (of 33) superfamilies and 55 (of 100) families of Ditrysia under maximum likelihood analysis. RESULTS Our trees show broad concordance with previous morphological hypotheses of ditrysian phylogeny, although most relationships among superfamilies are weakly supported. There are also notable surprises, such as a consistently closer relationship of Pyraloidea than of butterflies to most Macrolepidoptera. Monophyly is significantly rejected by one or more character sets for the putative clades Macrolepidoptera as currently defined (P < 0.05) and Macrolepidoptera excluding Noctuoidea and Bombycoidea sensu lato (P < or = 0.005), and nearly so for the superfamily Drepanoidea as currently defined (P < 0.08). Superfamilies are typically recovered or nearly so, but usually without strong support. Relationships within superfamilies and families, however, are often robustly resolved. We provide some of the first strong molecular evidence on deeper splits within Pyraloidea, Tortricoidea, Geometroidea, Noctuoidea and others.Separate analyses of mostly synonymous versus non-synonymous character sets revealed notable differences (though not strong conflict), including a marked influence of compositional heterogeneity on apparent signal in the third codon position (nt3). As available model partitioning methods cannot correct for this variation, we assessed overall phylogeny resolution through separate examination of trees from each character set. Exploration of "tree space" with GARLI, using grid computing, showed that hundreds of searches are typically needed to find the best-feasible phylogeny estimate for these data. CONCLUSION Our results (a) corroborate the broad outlines of the current working phylogenetic hypothesis for Ditrysia, (b) demonstrate that some prominent features of that hypothesis, including the position of the butterflies, need revision, and (c) resolve the majority of family and subfamily relationships within superfamilies as thus far sampled. Much further gene and taxon sampling will be needed, however, to strongly resolve individual deeper nodes.
Collapse
Affiliation(s)
- Jerome C Regier
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, Maryland 20742, USA
| | - Andreas Zwick
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, Maryland 20742, USA
| | - Michael P Cummings
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Akito Y Kawahara
- Department of Entomology, University of Maryland, College Park, Maryland 20742, USA
| | - Soowon Cho
- Department of Entomology, University of Maryland, College Park, Maryland 20742, USA
- Department of Plant Medicine, Chungbuk National University, Cheongju 361-763, Korea
| | - Susan Weller
- Department of Entomology, University of Minnesota, St. Paul, Minnesota 55455, USA
| | - Amanda Roe
- Department of Entomology, University of Minnesota, St. Paul, Minnesota 55455, USA
- Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
| | - Joaquin Baixeras
- Cavanilles Institute of Biodiversity and Evolutionary Biology, University of Valencia, Apartat de correus 2085, 46071 Valencia, Spain
| | - John W Brown
- Systematic Entomology Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, Maryland 20705, USA
| | - Cynthia Parr
- Encyclopedia of Life, Smithsonian Institution, Washington, D.C. 20013-7012, USA
| | - Donald R Davis
- Department of Entomology, Smithsonian Institution, Washington, D.C. 20013-7012, USA
| | - Marc Epstein
- Plant Pest Diagnostics Branch, California Department of Food and Agriculture, 3294 Meadowview Road, Sacramento, California 95832-1448, USA
| | - Winifred Hallwachs
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Axel Hausmann
- Bavarian State Collection of Zoology, Münchhausenstrasse 21, D-81247 München, Germany
| | - Daniel H Janzen
- Department of Biology, University of Pennsylvania, Philadelphia, Pennsylvania 19104, USA
| | - Ian J Kitching
- Department of Entomology, The Natural History Museum, Cromwell Road, London SW7 5BD, UK
| | - M Alma Solis
- Systematic Entomology Laboratory, Agricultural Research Service, United States Department of Agriculture, Beltsville, Maryland 20705, USA
| | - Shen-Horn Yen
- Department of Biological Sciences, National Sun Yat-Sen University, Kaohsiung 804, Taiwan
| | - Adam L Bazinet
- Laboratory of Molecular Evolution, Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA
| | - Charles Mitter
- Department of Entomology, University of Maryland, College Park, Maryland 20742, USA
| |
Collapse
|
27
|
Regier JC, Shultz JW, Ganley ARD, Hussey A, Shi D, Ball B, Zwick A, Stajich JE, Cummings MP, Martin JW, Cunningham CW. Resolving arthropod phylogeny: exploring phylogenetic signal within 41 kb of protein-coding nuclear gene sequence. Syst Biol 2009; 57:920-38. [PMID: 19085333 DOI: 10.1080/10635150802570791] [Citation(s) in RCA: 148] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022] Open
Abstract
This study attempts to resolve relationships among and within the four basal arthropod lineages (Pancrustacea, Myriapoda, Euchelicerata, Pycnogonida) and to assess the widespread expectation that remaining phylogenetic problems will yield to increasing amounts of sequence data. Sixty-eight regions of 62 protein-coding nuclear genes (approximately 41 kilobases (kb)/taxon) were sequenced for 12 taxonomically diverse arthropod taxa and a tardigrade outgroup. Parsimony, likelihood, and Bayesian analyses of total nucleotide data generally strongly supported the monophyly of each of the basal lineages represented by more than one species. Other relationships within the Arthropoda were also supported, with support levels depending on method of analysis and inclusion/exclusion of synonymous changes. Removing third codon positions, where the assumption of base compositional homogeneity was rejected, altered the results. Removing the final class of synonymous mutations--first codon positions encoding leucine and arginine, which were also compositionally heterogeneous--yielded a data set that was consistent with a hypothesis of base compositional homogeneity. Furthermore, under such a data-exclusion regime, all 68 gene regions individually were consistent with base compositional homogeneity. Restricting likelihood analyses to nonsynonymous change recovered trees with strong support for the basal lineages but not for other groups that were variably supported with more inclusive data sets. In a further effort to increase phylogenetic signal, three types of data exploration were undertaken. (1) Individual genes were ranked by their average rate of nonsynonymous change, and three rate categories were assigned--fast, intermediate, and slow. Then, bootstrap analysis of each gene was performed separately to see which taxonomic groups received strong support. Five taxonomic groups were strongly supported independently by two or more genes, and these genes mostly belonged to the slow or intermediate categories, whereas groups supported only by a single gene region tended to be from genes of the fast category, arguing that fast genes provide a less consistent signal. (2) A sensitivity analysis was performed in which increasing numbers of genes were excluded, beginning with the fastest. The number of strongly supported nodes increased up to a point and then decreased slightly. Recovery of Hexapoda required removal of fast genes. Support for Mandibulata (Pancrustacea + Myriapoda) also increased, at times to "strong" levels, with removal of the fastest genes. (3) Concordance selection was evaluated by clustering genes according to their ability to recover Pancrustacea, Euchelicerata, or Myriapoda and analyzing the three clusters separately. All clusters of genes recovered the three concordance clades but were at times inconsistent in the relationships recovered among and within these clades, a result that indicates that the a priori concordance criteria may bias phylogenetic signal in unexpected ways. In a further attempt to increase support of taxonomic relationships, sequence data from 49 additional taxa for three slow genes (i.e., EF-1 alpha, EF-2, and Pol II) were combined with the various 13-taxon data sets. The 62-taxon analyses supported the results of the 13-taxon analyses and provided increased support for additional pancrustacean clades found in an earlier analysis including only EF-1 alpha, EF-2, and Pol II.
Collapse
Affiliation(s)
- Jerome C Regier
- Center for Biosystems Research, University of Maryland Biotechnology Institute, College Park, Maryland 20742, USA.
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Abstract
We introduce a statistic, the genealogical sorting index (gsi), for quantifying the degree of exclusive ancestry of labeled groups on a rooted genealogy and demonstrate its application. The statistic is simple, intuitive, and easily calculated. It has a normalized range to facilitate comparisons among different groups, trees, or studies and it provides information on individual groups rather than a composite measure for all groups. It naturally handles polytomies and accommodates measures of uncertainty in phylogenetic relationships. We use coalescent simulations to explore the behavior of the gsi across a range of divergence times, with the mean value increasing to 1, the maximum value when exclusivity within a group reached monophyly. Simulations also demonstrate that the power to reject the null hypothesis of mixed genealogical ancestry increased markedly as sample size increased, and that the gsi provides a statistically more powerful measure of divergence than FST. Applications to data from published studies demonstrated that the gsi provides a useful way to detect significant exclusivity even when groups are not monophyletic. Although we describe this statistic in the context of divergence, it is more broadly applicable to quantify and assess the significance of clustering of observations in labeled groups on any tree.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, Maryland 20742, USA.
| | | | | |
Collapse
|
29
|
Parr CS, Cummings MP. Data sharing in ecology and evolution. Trends Ecol Evol 2007; 20:362-3. [PMID: 16701396 DOI: 10.1016/j.tree.2005.04.023] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2005] [Revised: 04/14/2005] [Accepted: 04/22/2005] [Indexed: 10/25/2022]
|
30
|
Abstract
Complementarity-based reserve selection algorithms efficiently prioritize sites for biodiversity conservation, but they are data-intensive and most regions lack accurate distribution maps for the majority of species. We explored implications of basing conservation planning decisions on incomplete and biased data using occurrence records of the plant family Proteaceae in South Africa. Treating this high-quality database as 'complete', we introduced three realistic sampling biases characteristic of biodiversity databases: a detectability sampling bias and two forms of roads sampling bias. We then compared reserve networks constructed using complete, biased, and randomly sampled data. All forms of biased sampling performed worse than both the complete data set and equal-effort random sampling. Biased sampling failed to detect a median of 1-5% of species, and resulted in reserve networks that were 9-17% larger than those designed with complete data. Spatial congruence and the correlation of irreplaceability scores between reserve networks selected with biased and complete data were low. Thus, reserve networks based on biased data require more area to protect fewer species and identify different locations than those selected with randomly sampled or complete data.
Collapse
Affiliation(s)
- Joanna Grand
- Department of Plant Science and Landscape Architecture, University of Maryland, College Park, MD 20742, USA.
| | | | | | | | | |
Collapse
|
31
|
Cummings MP, Meyer A. Magic bullets and golden rules: data sampling in molecular phylogenetics. ZOOLOGY 2005; 108:329-36. [PMID: 16351981 DOI: 10.1016/j.zool.2005.09.006] [Citation(s) in RCA: 37] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2005] [Revised: 09/22/2005] [Accepted: 09/23/2005] [Indexed: 11/23/2022]
Abstract
Data collection for molecular phylogenetic studies is based on samples of both genes and taxa. In an ideal world, with no limitations to resources, as many genes could be sampled as deemed necessary to address phylogenetic problems. Given limited resources in the real world, inadequate (in terms of choice of genes or number of genes) sequences or restricted taxon sampling can adversely affect the reliability or information gained in phylogenetics. Recent empirical and simulation-based studies of data sampling in molecular phylogenetics have reached differing conclusions on how to deal with these problems. Some advocated sampling more genes, others more taxa. There is certainly no 'magic bullet' that will fit all phylogenetic problems, and no specific 'golden rules' have been deduced, other than that single genes may not always contain sufficient phylogenetic information. However, several general conclusions and suggestions can be made. One suggestion is that the determination of a multiple, but moderate number (e.g., 6-10) of gene sequences might take precedence over sequencing a larger set of genes and thereby permit the sampling of more taxa for a phylogenetic study.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742, USA.
| | | |
Collapse
|
32
|
Cummings MP, Segal MR. Few amino acid positions in rpoB are associated with most of the rifampin resistance in Mycobacterium tuberculosis. BMC Bioinformatics 2004; 5:137. [PMID: 15453919 PMCID: PMC524371 DOI: 10.1186/1471-2105-5-137] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2004] [Accepted: 09/28/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Mutations in rpoB, the gene encoding the beta subunit of DNA-dependent RNA polymerase, are associated with rifampin resistance in Mycobacterium tuberculosis. Several studies have been conducted where minimum inhibitory concentration (MIC, which is defined as the minimum concentration of the antibiotic in a given culture medium below which bacterial growth is not inhibited) of rifampin has been measured and partial DNA sequences have been determined for rpoB in different isolates of M. tuberculosis. However, no model has been constructed to predict rifampin resistance based on sequence information alone. Such a model might provide the basis for quantifying rifampin resistance status based exclusively on DNA sequence data and thus eliminate the requirements for time consuming culturing and antibiotic testing of clinical isolates. RESULTS Sequence data for amino acid positions 511-533 of rpoB and associated MIC of rifampin for different isolates of M. tuberculosis were taken from studies examining rifampin resistance in clinical samples from New York City and throughout Japan. We used tree-based statistical methods and random forests to generate models of the relationships between rpoB amino acid sequence and rifampin resistance. The proportion of variance explained by a relatively simple tree-based cross-validated regression model involving two amino acid positions (526 and 531) is 0.679. The first partition in the data, based on position 531, results in groups that differ one hundredfold in mean MIC (1.596 micrograms/ml and 159.676 micrograms/ml). The subsequent partition based on position 526, the most variable in this region, results in a > 354-fold difference in MIC. When considered as a classification problem (susceptible or resistant), a cross-validated tree-based model correctly classified most (0.884) of the observations and was very similar to the regression model. Random forest analysis of the MIC data as a continuous variable, a regression problem, produced a model that explained 0.861 of the variance. The random forest analysis of the MIC data as discrete classes produced a model that correctly classified 0.942 of the observations with sensitivity of 0.958 and specificity of 0.885. CONCLUSIONS Highly accurate regression and classification models of rifampin resistance can be made based on this short sequence region. Models may be better with improved (and consistent) measurements of MIC and more sequence data.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742-3360, USA
| | - Mark R Segal
- Department of Epidemiology and Biostatistics, University of California, San Francisco, CA 94143-0560, USA
| |
Collapse
|
33
|
Abstract
Background RNA editing is the process whereby an RNA sequence is modified from the sequence of the corresponding DNA template. In the mitochondria of land plants, some cytidines are converted to uridines before translation. Despite substantial study, the molecular biological mechanism by which C-to-U RNA editing proceeds remains relatively obscure, although several experimental studies have implicated a role for cis-recognition. A highly non-random distribution of nucleotides is observed in the immediate vicinity of edited sites (within 20 nucleotides 5' and 3'), but no precise consensus motif has been identified. Results Data for analysis were derived from the the complete mitochondrial genomes of Arabidopsis thaliana, Brassica napus, and Oryza sativa; additionally, a combined data set of observations across all three genomes was generated. We selected datasets based on the 20 nucleotides 5' and the 20 nucleotides 3' of edited sites and an equivalently sized and appropriately constructed null-set of non-edited sites. We used tree-based statistical methods and random forests to generate models of C-to-U RNA editing based on the nucleotides surrounding the edited/non-edited sites and on the estimated folding energies of those regions. Tree-based statistical methods based on primary sequence data surrounding edited/non-edited sites and estimates of free energy of folding yield models with optimistic re-substitution-based estimates of ~0.71 accuracy, ~0.64 sensitivity, and ~0.88 specificity. Random forest analysis yielded better models and more exact performance estimates with ~0.74 accuracy, ~0.72 sensitivity, and ~0.81 specificity for the combined observations. Conclusions Simple models do moderately well in predicting which cytidines will be edited to uridines, and provide the first quantitative predictive models for RNA edited sites in plant mitochondria. Our analysis shows that the identity of the nucleotide -1 to the edited C and the estimated free energy of folding for a 41 nt region surrounding the edited C are the most important variables that distinguish most edited from non-edited sites. However, the results suggest that primary sequence data and simple free energy of folding calculations alone are insufficient to make highly accurate predictions.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742-3360, USA
| | - Daniel S Myers
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742-3360, USA
| |
Collapse
|
34
|
Cummings MP. A book like its cover. Heredity (Edinb) 2004. [DOI: 10.1038/sj.hdy.6800475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
|
35
|
Neel MC, Cummings MP. Section-level relationships of North American Agalinis (Orobanchaceae) based on DNA sequence analysis of three chloroplast gene regions. BMC Evol Biol 2004; 4:15. [PMID: 15186507 PMCID: PMC446187 DOI: 10.1186/1471-2148-4-15] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2004] [Accepted: 06/08/2004] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The North American Agalinis are representatives of a taxonomically difficult group that has been subject to extensive taxonomic revision from species level through higher sub-generic designations (e.g., subsections and sections). Previous presentations of relationships have been ambiguous and have not conformed to modern phylogenetic standards (e.g., were not presented as phylogenetic trees). Agalinis contains a large number of putatively rare taxa that have some degree of taxonomic uncertainty. We used DNA sequence data from three chloroplast genes to examine phylogenetic relationships among sections within the genus Agalinis Raf. (=Gerardia), and between Agalinis and closely related genera within Orobanchaceae. RESULTS Maximum likelihood analysis of sequences data from rbcL, ndhF, and matK gene regions (total aligned length 7323 bp) yielded a phylogenetic tree with high bootstrap values for most branches. Likelihood ratio tests showed that all but a few branch lengths were significantly greater than zero, and an additional likelihood ratio test rejected the molecular clock hypothesis. Comparisons of substitution rates between gene regions based on linear models of pairwise distance estimates between taxa show both ndhF and matK evolve more rapidly than rbcL, although the there is substantial rate heterogeneity within gene regions due in part to rate differences among codon positions. CONCLUSIONS Phylogenetic analysis supports the monophyly of Agalinis, including species formerly in Tomanthera, and this group is sister to a group formed by the genera Aureolaria, Brachystigma, Dasistoma, and Seymeria. Many of the previously described sections within Agalinis are polyphyletic, although many of the subsections appear to form natural groups. The analysis reveals a single evolutionary event leading to a reduction in chromosome number from n = 14 to n = 13 based on the sister group relationship of section Erectae and section Purpureae subsection Pedunculares. Our results establish the evolutionary distinctiveness of A. tenella from the more widespread and common A. obtusifolia. However, further data are required to clearly resolve the relationship between A. acuta and A. tenella.
Collapse
Affiliation(s)
- Maile C Neel
- Department of Natural Resource Sciences and Landscape Architecture, University of Maryland, College Park, MD 20742, USA
- Department of Entomology, University of Maryland, College Park, MD 20742, USA
| | - Michael P Cummings
- Center for Bioinformatics and Computational Biology, University of Maryland, College Park, MD 20742-3360, USA
| |
Collapse
|
36
|
Mark Welch DB, Cummings MP, Hillis DM, Meselson M. Divergent gene copies in the asexual class Bdelloidea (Rotifera) separated before the bdelloid radiation or within bdelloid families. Proc Natl Acad Sci U S A 2004; 101:1622-5. [PMID: 14747660 PMCID: PMC341794 DOI: 10.1073/pnas.2136686100] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Rotifers of the asexual class Bdelloidea are unusual in possessing two or more divergent copies of every gene that has been examined. Phylogenetic analysis of the heat-shock gene hsp82 and the TATA-box-binding protein gene tbp in multiple bdelloid species suggested that for each gene, each copy belonged to one of two lineages that began to diverge before the bdelloid radiation. Such gene trees are consistent with the two lineages having descended from former alleles that began to diverge after meiotic segregation ceased or from subgenomes of an alloploid ancestor of the bdelloids. However, the original analyses of bdelloid gene-copy divergence used only a single outgroup species and were based on parsimony and neighbor joining. We have now used maximum likelihood and Bayesian inference methods and, for hsp82, multiple outgroups in an attempt to produce more robust gene trees. Here we report that the available data do not unambiguously discriminate between gene trees that root the origin of hsp82 and tbp copy divergence before the bdelloid radiation and those which indicate that the gene copies began to diverge within bdelloid families. The remarkable presence of multiple diverged gene copies in individual genomes is nevertheless consistent with the loss of sex in an ancient ancestor of bdelloids.
Collapse
Affiliation(s)
- David B Mark Welch
- Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biology Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA
| | | | | | | |
Collapse
|
37
|
Abstract
Assessment of the reliability of a given phylogenetic hypothesis is an important step in phylogenetic analysis. Historically, the nonparametric bootstrap procedure has been the most frequently used method for assessing the support for specific phylogenetic relationships. The recent employment of Bayesian methods for phylogenetic inference problems has resulted in clade support being expressed in terms of posterior probabilities. We used simulated data and the four-taxon case to explore the relationship between nonparametric bootstrap values (as inferred by maximum likelihood) and posterior probabilities (as inferred by Bayesian analysis). The results suggest a complex association between the two measures. Three general regions of tree space can be identified: (1) the neutral zone, where differences between mean bootstrap and mean posterior probability values are not significant, (2) near the two-branch corner, and (3) deep in the two-branch corner. In the last two regions, significant differences occur between mean bootstrap and mean posterior probability values. Whether bootstrap or posterior probability values are higher depends on the data in support of alternative topologies. Examination of star topologies revealed that both bootstrap and posterior probability values differ significantly from theoretical expectations; in particular, there are more posterior probability values in the range 0.85-1 than expected by theory. Therefore, our results corroborate the findings of others that posterior probability values are excessively high. Our results also suggest that extrapolations from single topology branch-length studies are unlikely to provide any general conclusions regarding the relationship between bootstrap and posterior probability values.
Collapse
Affiliation(s)
- Michael P Cummings
- Center for Bioinformatics and Computational Biology, 2119 A.V. Williams, University of Maryland, College Park, Maryland 20742, USA.
| | | | | | | | | | | |
Collapse
|
38
|
Cummings MP, Nugent JM, Olmstead RG, Palmer JD. Phylogenetic analysis reveals five independent transfers of the chloroplast gene rbcL to the mitochondrial genome in angiosperms. Curr Genet 2003; 43:131-8. [PMID: 12695853 DOI: 10.1007/s00294-003-0378-3] [Citation(s) in RCA: 54] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2002] [Revised: 01/13/2003] [Accepted: 01/16/2003] [Indexed: 11/29/2022]
Abstract
We used the chloroplast gene rbcL as a model to study the frequency and relative timing of transfer of chloroplast sequences to the mitochondrial genome. Southern blot survey of 20 mitochondrial DNAs confirmed three previously reported groups of plants containing rbcL in their mitochondrion, while PCR studies identified a new mitochondrial rbcL. Published and newly determined mitochondrial and chloroplast rbcL sequences were used to reconstruct rbcL phylogeny. The results imply five or six separate interorganellar transfers of rbcL among the angiosperms examined, and hundreds of successful transfers across all flowering plants. By taxonomic criteria, the crucifer transfer is the most ancient, two separate transfers within the grass family are of intermediate ancestry, and the morning-glory transfer is most recent. All five mitochondrial copies of rbcL examined exhibit insertion and/or deletion events that disrupt the reading frame (three are grossly truncated); and all are elevated in the proportion of nonsynonymous substitutions, providing clear evidence that these sequences are pseudogenes.
Collapse
Affiliation(s)
- Michael P Cummings
- The Josephine Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543-1015, USA.
| | | | | | | |
Collapse
|
39
|
|
40
|
García-Varela M, Cummings MP, Pérez-Ponce de León G, Gardner SL, Laclette JP. Phylogenetic analysis based on 18S ribosomal RNA gene sequences supports the existence of class polyacanthocephala (acanthocephala). Mol Phylogenet Evol 2002; 23:288-92. [PMID: 12069558 DOI: 10.1016/s1055-7903(02)00020-9] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
Abstract
Members of phylum Acanthocephala are parasites of vertebrates and arthropods and are distributed worldwide. The phylum has traditionally been divided into three classes, Archiacanthocephala, Palaeacanthocephala, and Eoacanthocephala; a fourth class, Polyacanthocephala, has been recently proposed. However, erection of this new class, based on morphological characters, has been controversial. We sequenced the near complete 18S rRNA gene of Polyacanthorhynchus caballeroi (Polyacanthocephala) and Rhadinorhynchus sp. (Palaeacanthocephala); these sequences were aligned with another 21 sequences of acanthocephalans representing the three widely recognized classes of the phylum and with 16 sequences from outgroup taxa. Phylogenetic relationships inferred by maximum-likelihood and maximum-parsimony analyses showed Archiacanthocephala as the most basal group within the phylum, whereas classes Polyacanthocephala + Eoacanthocephala formed a monophyletic clade, with Palaeacanthocephala as its sister group. These results are consistent with the view of Polyacanthocephala representing an independent class within Acanthocephala.
Collapse
Affiliation(s)
- Martín García-Varela
- Department of Immunology, Instituto de Investigaciones Biomédicas México, D.F., México
| | | | | | | | | |
Collapse
|
41
|
Abstract
We illustrate data analytic concerns that arise in the context of relating genotype, as represented by amino acid sequence, to phenotypes (outcomes). The present application examines whether peptides that bind to a particular major histocompatibility complex (MHC) class I molecule have characteristic amino acid sequences. However, the concerns identified and addressed are considerably more general. It is recognized that simple rules for predicting binding based solely on preferences for specific amino acids in certain (anchor) positions of the peptide's amino acid sequence are generally inadequate and that binding is potentially influenced by all sequence positions as well as between-position interactions. The desire to elucidate these more complex prediction rules has spawned various modeling attempts, the shortcomings of which provide motivation for the methods adopted here. Because of (i) this need to model between-position interactions, (ii) amino acids constituting a highly (20) multilevel unordered categorical covariate, and (iii) there frequently being numerous such covariates (i.e., positions) comprising the sequence, standard regression/classification techniques are problematic due to the proliferation of indicator variables required for encoding the sequence position covariates and attendant interactions. These difficulties have led to analyses based on (continuous) properties (e.g., molecular weights) of the amino acids. However, there is potential information loss in such an approach if the properties used are incomplete and/or do not capture the mechanism underlying association with the phenotype. Here we demonstrate that handling unordered categorical covariates with numerous levels and accompanying interactions can be done effectively using classification trees and recently devised bump-hunting methods. We further tackle the question of whether observed associations are attributable to amino acid properties as well as addressing the assessment and implications of between-position covariation.
Collapse
Affiliation(s)
- M R Segal
- Division of Biostatistics, University of California, San Francisco 94143-0560, USA.
| | | | | |
Collapse
|
42
|
Abstract
Comparative analysis is one of the most powerful methods available for understanding the diverse and complex systems found in biology, but it is often limited by a lack of comprehensive taxonomic sampling. Despite the recent development of powerful genome technologies capable of producing sequence data in large quantities (witness the recently completed first draft of the human genome), there has been relatively little change in how evolutionary studies are conducted. The application of genomic methods to evolutionary biology is a challenge, in part because gene segments from different organisms are manipulated separately, requiring individual purification, cloning, and sequencing. We suggest that a feasible approach to collecting genome-scale data sets for evolutionary biology (i.e., evolutionary genomics) may consist of combination of DNA samples prior to cloning and sequencing, followed by computational reconstruction of the original sequences. This approach will allow the full benefit of automated protocols developed by genome projects to be realized; taxon sampling levels can easily increase to thousands for targeted genomes and genomic regions. Sequence diversity at this level will dramatically improve the quality and accuracy of phylogenetic inference, as well as the accuracy and resolution of comparative evolutionary studies. In particular, it will be possible to make accurate estimates of normal evolution in the context of constant structural and functional constraints (i.e., site-specific substitution probabilities), along with accurate estimates of changes in evolutionary patterns, including pairwise coevolution between sites, adaptive bursts, and changes in selective constraints. These estimates can then be used to understand and predict the effects of protein structure and function on sequence evolution and to predict unknown details of protein structure, function, and functional divergence. In order to demonstrate the practicality of these ideas and the potential benefit for functional genomic analysis, we describe a pilot project we are conducting to simultaneously sequence large numbers of vertebrate mitochondrial genomes.
Collapse
Affiliation(s)
- D D Pollock
- Theoretical Biology and Biophysics, Theoretical Division, Los Alamos National Laboratory, Los Alamos, New Mexico, USA.
| | | | | | | |
Collapse
|
43
|
García-Varela M, Pérez-Ponce de León G, de la Torre P, Cummings MP, Sarma SS, Laclette JP. Phylogenetic relationships of Acanthocephala based on analysis of 18S ribosomal RNA gene sequences. J Mol Evol 2000; 50:532-40. [PMID: 10835483 DOI: 10.1007/s002390010056] [Citation(s) in RCA: 89] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Acanthocephala (thorny-headed worms) is a phylum of endoparasites of vertebrates and arthropods, included among the most phylogenetically basal tripoblastic pseudocoelomates. The phylum is divided into three classes: Archiacanthocephala, Palaeacanthocephala, and Eoacanthocephala. These classes are distinguished by morphological characters such as location of lacunar canals, persistence of ligament sacs in females, number and type of cement glands in males, number and size of proboscis hooks, host taxonomy, and ecology. To understand better the phylogenetic relationships within Acanthocephala, and between Acanthocephala and Rotifera, we sequenced the nearly complete 18S rRNA genes of nine species from the three classes of Acanthocephala and four species of Rotifera from the classes Bdelloidea and Monogononta. Phylogenetic relationships were inferred by maximum-likelihood analyses of these new sequences and others previously determined. The analyses showed that Acanthocephala is the sister group to a clade including Eoacanthocephala and Palaeacanthocephala. Archiacanthocephala exhibited a slower rate of evolution at the nucleotide level, as evidenced by shorter branch lengths for the group. We found statistically significant support for the monophyly of Rotifera, represented in our analysis by species from the clade Eurotatoria, which includes the classes Bdelloidea and Monogononta. Eurotatoria also appears as the sister group to Acanthocephala.
Collapse
Affiliation(s)
- M García-Varela
- Department of Immunology, Instituto de Investigaciones Biomédicas UNAM, A.P. 70228, 04510 México D.F., México
| | | | | | | | | | | |
Collapse
|
44
|
Affiliation(s)
- M P Cummings
- Department of Botany and Plant Sciences, University of California, Riverside 92521-1024, USA
| | | | | |
Collapse
|
45
|
Abstract
Nucleotide sequences of 18S ribosomal RNA from 71 species of Platyhelminthes, the flatworms, were analyzed using maximum likelihood, and the resulting phylogenetic trees were compared with previous phylogenetic hypotheses. Analyses including 15 outgroup species belonging to eight other phyla show that Platyhelminthes are monophyletic with the exception of a sequence putatively from Acoela sp., Lecithoepitheliata, Polycladida, Tricladida, Trematoda (Aspidobothrii + Digenea), Monogenea, and Cestoda (Gyrocotylidea + Amphilinidea + Eucestoda) are monophyletic groups. Catenulids form the sister group to the rest of platyhelminths, whereas a complex clade formed by Acoela, Tricladida, "Dalyellioida", and perhaps "Typhloplanoida" is sister to Neodermata. "Typhloplanoida" does not appear to be monophyletic; Fecampiida does not appear to belong within "Dalyellioida," nor Kalyptorhynchia within "Typhloplanoida." Trematoda is the sister group to the rest of Neodermata, and Monogenea is sister group to Cestoda. Within Trematoda, Aspidobothrii is the sister group of Digenea and Heronimidae is the most basal family in Digenea. Our trees support the hypothesis that parasitism evolved at least twice in Platyhelminthes, once in the ancestor to Neodermata and again in the ancestor of Fecampiida, independently to the ancestor of putatively parasitic "Dalyellioida."
Collapse
Affiliation(s)
- A Campos
- Department of Immunology, Universidad Nacional Autónoma de México, D.F., México
| | | | | | | |
Collapse
|
46
|
Cummings MP, Clegg MT. Nucleotide sequence diversity at the alcohol dehydrogenase 1 locus in wild barley (Hordeum vulgare ssp. spontaneum): an evaluation of the background selection hypothesis. Proc Natl Acad Sci U S A 1998; 95:5637-42. [PMID: 9576936 PMCID: PMC20431 DOI: 10.1073/pnas.95.10.5637] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
The background selection hypothesis predicts a reduction in nucleotide site diversity and an excess of rare variants, owing to linkage associations with deleterious alleles. This effect is expected to be amplified in species that are predominantly self-fertilizing. To examine the predictions of the background selection hypothesis in self-fertilizing species, we sequenced 1,362 bp of adh1, a gene for alcohol dehydrogenase (Adh; alcohol:NAD+ oxidoreductase, EC 1.1.1.1), in a sample of 45 accessions of wild barley, Hordeum vulgare ssp. spontaneum, drawn from throughout the species range. The region sequenced included 786 bp of exon sequence (part of exon 4, all of exons 5-9, and part of exon 10) and 576 bp of intron sequence (all of introns 4-9). There were 19 sites polymorphic for nucleotide substitutions, 8 in introns, and 11 in exons. Of the 11 nucleotide substitutions in codons, 4 were synonymous and 7 were nonsynonymous, occurring uniquely in the sample. There was no evidence of recombination in the region studied, and the estimated effective population size (Ne) based on synonymous sites was approximately 1.8-4.2 x 10(5). Several tests reveal that the pattern of nonsynonymous substitutions departs significantly from neutral expectations. However, the data do not appear to be consistent with recovery from a population bottleneck, recent population expansion, selective sweep, or strong positive selection. Though several features of the data are consistent with background selection, the distributions of polymorphic synonymous and intron sites are not perturbed toward a significant excess of rare alleles as would be predicted by background selection.
Collapse
Affiliation(s)
- M P Cummings
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521-0124, USA
| | | |
Collapse
|
47
|
King LM, Cummings MP. Satellite DNA repeat sequence variation is low in three species of burying beetles in the genus Nicrophorus (Coleoptera: Silphidae). Mol Biol Evol 1997; 14:1088-95. [PMID: 9364766 DOI: 10.1093/oxfordjournals.molbev.a025718] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2023] Open
Abstract
Three satellite DNA families were identified in three species of burying beetles, Nicrophorus orbicollis, N. marginatus, and N. americanus. Southern hybridization and nucleotide sequence analysis of individual randomly cloned repeats shows that these satellite DNA families are highly abundant in the genome, are composed of unique repeats, and are species-specific. The repeats do not have identifiable core elements or substructures that are similar in all three families, and most interspecific sequence similarity is confined to homopolymeric runs of A and T. Satellite DNA from N. marginatus and N. americanus show single-base-pair indels among repeats, but single-nucleotide substitutions characterize most of the repeat variability. Although the repeat units are of similar lengths (342, 350, and 354 bp) and A + T composition (65%, 71%, and 71%, respectively), the average nucleotide divergence among sequenced repeats is very low (0.18%, 1.22%, and 0.71%, respectively). Transition/transversion ratios from the consensus sequence are 0.20, 0.69, and 0.70, respectively.
Collapse
Affiliation(s)
- L M King
- Department of Biology, University of Miami, USA
| | | |
Collapse
|
48
|
Abstract
We analyze the evolutionary dynamics of three of the best-studied plant nuclear multigene families. The data analyzed derive from the genes that encode the small subunit of ribulose-1,5-bisphosphate carboxylase (rbcS), the gene family that encodes the enzyme chalcone synthase (Chs), and the gene family that encodes alcohol dehydrogenases (Adh). In addition, we consider the limited evolutionary data available on plant transposable elements. New Chs and rbcS genes appear to be recruited at about 10 times the rate estimated for Adh genes, and this is correlated with a much smaller average gene family size for Adh genes. In addition, duplication and divergence in function appears to be relatively common for Chs genes in flowering plant evolution. Analyses of synonymous nucleotide substitution rates for Adh genes in monocots reject a linear relationship with clock time. Replacement substitution rates vary with time in a complex fashion, which suggests that adaptive evolution has played an important role in driving divergence following gene duplication events. Molecular population genetic studies of Adh and Chs genes reveal high levels of molecular diversity within species. These studies also reveal that inter- and intralocus recombination are important forces in the generation allelic novelties. Moreover, illegitimate recombination events appear to be an important factor in transposable element loss in plants. When we consider the recruitment and loss of new gene copies, the generation of allelic diversity within plant species, and ectopic exchange among transposable elements, we conclude that recombination is a pervasive force at all levels of plant evolution.
Collapse
Affiliation(s)
- M T Clegg
- Department of Botany and Plant Sciences, University of California, Riverside, CA 92521, USA
| | | | | |
Collapse
|
49
|
Abstract
A 10-year-old boy had a 3-month history of urticarial plaques and vesicles. Histologic and immunofluorescence testing confirmed the diagnosis of linear IgA disease. Immunoelectron microscopy revealed IgA deposits in the sublamina densa area similar to those seen in epidermolysis bullosa acquisita. Milia developed after resolution of the lesions, similar to lesions of epidermolysis bullosa acquisita.
Collapse
Affiliation(s)
- D F Mutasim
- Department of Dermatology, University of Cincinnati, Ohio 45267-0523, USA
| | | |
Collapse
|
50
|
Abstract
As our appreciation of the diversity within the flatworms has grown, so too has our curiosity about the ways in which these varied creatures are related to one another. In particular, the parasitic groups (trematodes, cestodes and monogeneans have been the focus of enquiry. Until recently, morphology, anatomy and life histories have provided the raw data for building hypotheses on relationships. Now, ultrastructural evidence, and most recently, molecular data from nucleic acid sequences, have been brought to bear on the topic. Here, David Blair, Andrés Campos, Michael Cummings and Juan Pedro Laclette discuss the ways in which molecular data, in particular, are helping us recognize the various lineages of flatworms.
Collapse
Affiliation(s)
- D Blair
- Department of Zoology, James Cook University, Townsville, Queensland 4811, Australia.
| | | | | | | |
Collapse
|