1
Peona V, Martelossi J, Almojil D, Bocharkina J, Brännström I, Brown M, Cang A, Carrasco-Valenzuela T, DeVries J, Doellman M, Elsner D, Espíndola-Hernández P, Montoya GF, Gaspar B, Zagorski D, Hałakuc P, Ivanovska B, Laumer C, Lehmann R, Boštjančić LL, Mashoodh R, Mazzoleni S, Mouton A, Nilsson MA, Pei Y, Potente G, Provataris P, Pardos-Blas JR, Raut R, Sbaffi T, Schwarz F, Stapley J, Stevens L, Sultana N, Symonova R, Tahami MS, Urzì A, Yang H, Yusuf A, Pecoraro C, Suh A. Teaching transposon classification as a means to crowd source the curation of repeat annotation - a tardigrade perspective. Mob DNA 2024; 15:10. [PMID: 38711146 DOI: 10.1186/s13100-024-00319-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/22/2024] [Accepted: 04/09/2024] [Indexed: 05/08/2024]
Abstract
BACKGROUND The advancement of sequencing technologies results in the rapid release of hundreds of new genome assemblies a year providing unprecedented resources for the study of genome evolution. Within this context, the significance of in-depth analyses of repetitive elements, transposable elements (TEs) in particular, is increasingly recognized in understanding genome evolution. Despite the plethora of available bioinformatic tools for identifying and annotating TEs, the phylogenetic distance of the target species from a curated and classified database of repetitive element sequences constrains any automated annotation effort. Moreover, manual curation of raw repeat libraries is deemed essential due to the frequent incompleteness of automatically generated consensus sequences.
RESULTS Here, we present an example of a crowd-sourcing effort aimed at curating and annotating TE libraries of two non-model species built around a collaborative, peer-reviewed teaching process. Manual curation and classification are time-consuming processes that offer limited short-term academic rewards and are typically confined to a few research groups where methods are taught through hands-on experience. Crowd-sourcing efforts could therefore offer a significant opportunity to bridge the gap between learning the methods of curation effectively and empowering the scientific community with high-quality, reusable repeat libraries.
CONCLUSIONS The collaborative manual curation of TEs from two tardigrade species, for which there were no TE libraries available, resulted in the successful characterization of hundreds of new and diverse TEs in a reasonable time frame. Our crowd-sourcing setting can be used as a teaching reference guide for similar projects: A hidden treasure awaits discovery within non-model organisms.
Affiliation(s)
- Valentina Peona
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- Swiss Ornithological Institute Vogelwarte, Sempach, CH-6204, Switzerland.
- Department of Bioinformatics and Genetics, Swedish Natural History Museum, Stockholm, Sweden.
- Jacopo Martelossi
- Department of Biological Geological and Environmental Science, University of Bologna, Via Selmi 3, Bologna, 40126, Italy.
- Dareen Almojil
- New York University Abu Dhabi, Saadiyat Island, United Arab Emirates
- Ioana Brännström
- Natural History Museum, Oslo University, Oslo, Norway
- Department of Ecology and Genetics, Uppsala University, Uppsala, Sweden
- Max Brown
- Anglia Ruskin University, East Rd, Cambridge, CB1 1PT, UK
- Tomàs Carrasco-Valenzuela
- Evolutionary Genetics Department, Leibniz Institute for Zoo and Wildlife Research, 10315, Berlin, Germany
- Berlin Center for Genomics in Biodiversity Research, 14195, Berlin, Germany
- Jon DeVries
- Reed College, Portland, OR, United States of America
- Meredith Doellman
- Department of Ecology and Evolution, The University of Chicago, Chicago, IL, 60637, USA
- Department of Biological Sciences, University of Notre Dame, Notre Dame, IN, 46556, USA
- Daniel Elsner
- Evolutionary Biology & Ecology, University of Freiburg, Freiburg, Germany
- Pamela Espíndola-Hernández
- Research Unit Comparative Microbiome Analysis (COMI), Helmholtz Zentrum München, Ingolstädter Landstraße 1, D-85764, Neuherberg, Germany
- Bence Gaspar
- Institute of Evolution and Ecology, University of Tuebingen, Tuebingen, Germany
- Danijela Zagorski
- Institute of Botany, Czech Academy of Sciences, Průhonice, Czech Republic
- Paweł Hałakuc
- Institute of Evolutionary Biology, Faculty of Biology, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
- Beti Ivanovska
- Institute of Genetics and Biotechnology, Hungarian University of Agriculture and Life Sciences, Budapest, Hungary
- Robert Lehmann
- Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia
- Ljudevit Luka Boštjančić
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
- Rahia Mashoodh
- Department of Genetics, Environment & Evolution, Centre for Biodiversity & Environment Research, University College London, London, UK
- Sofia Mazzoleni
- Department of Ecology, Faculty of Science, Charles University, Prague, Czech Republic
- Alice Mouton
- INBIOS-Conservation Genetic Lab, University of Liege, Liege, Belgium
- Maria Anna Nilsson
- LOEWE Centre for Translational Biodiversity Genomics (LOEWE-TBG), Senckenberganlage 25, 60325, Frankfurt, Germany
- Yifan Pei
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden
- Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 127, 53113, Bonn, Germany
- Giacomo Potente
- Department of Systematic and Evolutionary Botany, University of Zurich, Zurich, Switzerland
- Panagiotis Provataris
- German Cancer Research Center, NGS Core Facility, DKFZ-ZMBH Alliance, 69120, Heidelberg, Germany
- José Ramón Pardos-Blas
- Departamento de Biodiversidad y Biología Evolutiva, Museo Nacional de Ciencias Naturales (MNCN-CSIC), José Gutiérrez Abascal 2, Madrid, 28006, Spain
- Ravindra Raut
- Department of Biotechnology, National Institute of Technology Durgapur, Durgapur, India
- Tomasa Sbaffi
- Molecular Ecology Group (MEG), National Research Council of Italy - Water Research Institute (CNR-IRSA), Verbania, Italy
- Florian Schwarz
- Eurofins Genomics Europe Pharma and Diagnostics Products & Services Sales GmbH, Ebersberg, Germany
- Jessica Stapley
- Plant Pathology Group, Institute of Integrative Biology, ETH Zurich, Zurich, Switzerland
- Lewis Stevens
- Tree of Life, Wellcome Sanger Institute, Cambridge, CB10 1SA, UK
- Nusrat Sultana
- Department of Botany, Jagannath University, Dhaka, 1100, Bangladesh
- Radka Symonova
- Institute of Hydrobiology, Biology Centre of the Czech Academy of Sciences, České Budějovice, Czech Republic
- Mohadeseh S Tahami
- Department of Biological and Environmental Science, University of Jyväskylä, P.O. Box 35, Jyväskylä, 40014, Finland
- Alice Urzì
- Centogene GmbH, Am Strande 7, 18055, Rostock, Germany
- Heidi Yang
- Department of Ecology & Evolutionary Biology, University of California, Los Angeles, Los Angeles, CA, United States of America
- Abdullah Yusuf
- Zell- und Molekularbiologie der Pflanzen, Technische Universität Dresden, Dresden, Germany
- Alexander Suh
- Department of Organismal Biology - Systematic Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, SE-752 36, Sweden.
- School of Biological Sciences, University of East Anglia, Norwich Research Park, Norwich, NR4 7TU, UK.
- Present address: Centre for Molecular Biodiversity Research, Leibniz Institute for the Analysis of Biodiversity Change, Adenauerallee 160, 53113, Bonn, Germany.
2
Köhler S, Gargano M, Matentzoglu N, Carmody LC, Lewis-Smith D, Vasilevsky NA, Danis D, Balagura G, Baynam G, Brower AM, Callahan TJ, Chute CG, Est JL, Galer PD, Ganesan S, Griese M, Haimel M, Pazmandi J, Hanauer M, Harris NL, Hartnett M, Hastreiter M, Hauck F, He Y, Jeske T, Kearney H, Kindle G, Klein C, Knoflach K, Krause R, Lagorce D, McMurry JA, Miller JA, Munoz-Torres M, Peters RL, Rapp CK, Rath AM, Rind SA, Rosenberg A, Segal MM, Seidel MG, Smedley D, Talmy T, Thomas Y, Wiafe SA, Xian J, Yüksel Z, Helbig I, Mungall CJ, Haendel MA, Robinson PN. The Human Phenotype Ontology in 2021. Nucleic Acids Res 2021; 49:D1207-D1217. [PMID: 33264411 PMCID: PMC7778952 DOI: 10.1093/nar/gkaa1043] [Citation(s) in RCA: 501] [Impact Index Per Article: 167.0] [Received: 09/17/2020] [Revised: 10/11/2020] [Accepted: 11/16/2020] [Indexed: 12/21/2022]
Abstract
The Human Phenotype Ontology (HPO, https://hpo.jax.org) was launched in 2008 to provide a comprehensive logical standard to describe and computationally analyze phenotypic abnormalities found in human disease. The HPO is now a worldwide standard for phenotype exchange. The HPO has grown steadily since its inception due to considerable contributions from clinical experts and researchers from a diverse range of disciplines. Here, we present recent major extensions of the HPO for neurology, nephrology, immunology, pulmonology, newborn screening, and other areas. For example, the seizure subontology now reflects the International League Against Epilepsy (ILAE) guidelines and these enhancements have already shown clinical validity. We present new efforts to harmonize computational definitions of phenotypic abnormalities across the HPO and multiple phenotype ontologies used for animal models of disease. These efforts will benefit software such as Exomiser by improving the accuracy and scope of cross-species phenotype matching. The computational modeling strategy used by the HPO to define disease entities and phenotypic features and distinguish between them is explained in detail. We also report on recent efforts to translate the HPO into indigenous languages. Finally, we summarize recent advances in the use of HPO in electronic health record systems.
Affiliation(s)
- Michael Gargano
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Nicolas Matentzoglu
- Monarch Initiative
- Semanticly Ltd, London, UK
- European Bioinformatics Institute (EMBL-EBI)
- Leigh C Carmody
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- David Lewis-Smith
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
- Clinical Neurosciences, Newcastle upon Tyne Hospitals NHS Foundation Trust, Newcastle upon Tyne, UK
- Nicole A Vasilevsky
- Monarch Initiative
- Oregon Clinical & Translational Research Institute, Oregon Health & Science University
- Ganna Balagura
- Department of Neurosciences, Rehabilitation, Ophthalmology, Genetics, and Maternal and Child Health, University of Genoa, Genoa, Italy
- Pediatric Neurology and Muscular Diseases Unit, IRCCS ‘G. Gaslini’ Institute, Genoa, Italy
- Gareth Baynam
- Western Australian Register of Developmental Anomalies, King Edward Memorial Hospital, Perth, Australia
- Telethon Kids Institute and the Division of Paediatrics, Faculty of Health and Medical Sciences, University of Western Australia, Perth, Australia
- Amy M Brower
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
- Tiffany J Callahan
- Computational Bioscience Program, University of Colorado Anschutz Medical Campus, Colorado, USA
- Johanna L Est
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Peter D Galer
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Shiva Ganesan
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Biomedical and Health Informatics (DBHi), Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Matthias Griese
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
- Matthias Haimel
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Julia Pazmandi
- Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Vienna, Austria
- CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria
- Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032, USA
- Marc Hanauer
- INSERM, US14–Orphanet, Plateforme Maladies Rares, Paris, France
- Nomi L Harris
- Monarch Initiative
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Michael J Hartnett
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
- Maximilian Hastreiter
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Fabian Hauck
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- German Centre for Infection Research (DZIF), Munich, Germany
- Yongqun He
- Unit for Laboratory Animal Medicine, Department of Microbiology and Immunology, Center for Computational Medicine and Bioinformatics, University of Michigan Medical School, Ann Arbor, MI, USA
- Tim Jeske
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Hugh Kearney
- FutureNeuro, SFI Research Centre for Chronic and Rare Neurological Diseases, Ireland
- Gerhard Kindle
- Institute for Immunodeficiency, Center for Chronic Immunodeficiency (CCI), Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
- Centre for Biobanking FREEZE, Faculty of Medicine, Medical Center - University of Freiburg, Freiburg, Germany
- Christoph Klein
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Katrin Knoflach
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
- Roland Krause
- Luxembourg Centre for Systems Biomedicine, University of Luxembourg, L-4367 Belvaux, Luxembourg
- David Lagorce
- INSERM, US14–Orphanet, Plateforme Maladies Rares, Paris, France
- Julie A McMurry
- Monarch Initiative
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
- Jillian A Miller
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
- Monica C Munoz-Torres
- Monarch Initiative
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
- Rebecca L Peters
- American College of Medical Genetics and Genomics (ACMG), Bethesda, MD, USA
- Christina K Rapp
- Department of Pediatrics, Dr. von Hauner Children's Hospital, University Hospital, Ludwig-Maximilians-Universität München, Munich, Germany
- Ludwig-Maximilians University, German Center for Lung Research (DZL), Munich, Germany
- Ana M Rath
- INSERM, US14–Orphanet, Plateforme Maladies Rares, Paris, France
- Shahmir A Rind
- WA Register of Developmental Anomalies
- Curtin University, Western Australia, Australia
- Avi Z Rosenberg
- Division of Kidney-Urologic Pathology, Johns Hopkins University, Baltimore, MD 21205, USA
- Markus G Seidel
- Research Unit for Pediatric Hematology and Immunology, Division of Pediatric Hemato-Oncology, Department of Pediatrics and Adolescent Medicine, Medical University of Graz, Graz, Austria
- Damian Smedley
- The William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London EC1M 6BQ, UK
- Tomer Talmy
- Genomic Research Department, Emedgene Technologies, Tel Aviv, Israel
- Faculty of Medicine, Hebrew University Hadassah Medical School, Jerusalem, Israel
- Yarlalu Thomas
- West Australian Register of Developmental Anomalies, East Perth, WA, Australia
- Julie Xian
- Division of Neurology, Children's Hospital of Philadelphia, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, PA, USA
- Zafer Yüksel
- Human Genetics, Bioscientia GmbH, Ingelheim, Germany
- Ingo Helbig
- Department of Neurology, University of Pennsylvania, Perelman School of Medicine, Philadelphia, PA, USA
- The Epilepsy NeuroGenetics Initiative (ENGIN), Children's Hospital of Philadelphia, Philadelphia, PA, USA
- Christopher J Mungall
- Monarch Initiative
- Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Melissa A Haendel
- Monarch Initiative
- Oregon Clinical & Translational Research Institute, Oregon Health & Science University
- Translational and Integrative Sciences Center, Department of Environmental and Molecular Toxicology, Oregon State University, OR, USA
- Peter N Robinson
- Monarch Initiative
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, USA
- Institute for Systems Genomics, University of Connecticut, Farmington, CT 06032, USA