1
|
Bach A, Raub F, Höfer H, Ottermanns R, Roß-Nickoll M. ARAapp: filling gaps in the ecological knowledge of spiders using an automated and dynamic approach to analyze systematically collected community data. Database (Oxford) 2024; 2024:baae004. [PMID: 38306294 PMCID: PMC10836506 DOI: 10.1093/database/baae004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/22/2023] [Accepted: 01/11/2024] [Indexed: 02/04/2024]
Abstract
The ARAMOB data repository compiles meticulously curated spider community datasets from systematical collections, ensuring a high standard of data quality. These datasets are enriched with crucial methodological data that enable the datasets to be aligned in time and space, facilitating data synthesis across studies, respectively, collections. To streamline the analysis of these datasets in a species-specific context, a suite of tailored ecological analysis tools named ARAapp has been developed. By harnessing the capabilities of ARAapp, users can systematically evaluate the spider species data housed within the ARAMOB repository, elucidating intricate relationships with a range of parameters such as vertical stratification, habitat occurrence, ecological niche parameters (moisture and shading) and phenological patterns. Database URL: ARAapp is available at www.aramob.de/en.
Collapse
Affiliation(s)
- Alexander Bach
- Institute for Environmental Research, RWTH Aachen University, Worringerweg 1, Aachen 52074, Germany
| | - Florian Raub
- Staatliches Museum für Naturkunde Karlsruhe, Erbprinzenstr. 13, Karlsruhe 76133, Germany
| | - Hubert Höfer
- Staatliches Museum für Naturkunde Karlsruhe, Erbprinzenstr. 13, Karlsruhe 76133, Germany
| | - Richard Ottermanns
- Institute for Environmental Research, RWTH Aachen University, Worringerweg 1, Aachen 52074, Germany
| | - Martina Roß-Nickoll
- Institute for Environmental Research, RWTH Aachen University, Worringerweg 1, Aachen 52074, Germany
| |
Collapse
|
2
|
Bouyssié D, Altıner P, Capella-Gutierrez S, Fernández JM, Hagemeijer YP, Horvatovich P, Hubálek M, Levander F, Mauri P, Palmblad M, Raffelsberger W, Rodríguez-Navas L, Di Silvestre D, Kunkli BT, Uszkoreit J, Vandenbrouck Y, Vizcaíno JA, Winkelhardt D, Schwämmle V. WOMBAT-P: Benchmarking Label-Free Proteomics Data Analysis Workflows. J Proteome Res 2024; 23:418-429. [PMID: 38038272 DOI: 10.1021/acs.jproteome.3c00636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison. WOMBAT-P simplifies the processing of public data by utilizing the sample and data relationship format for proteomics (SDRF-Proteomics) as input. This feature streamlines the analysis of annotated local or public ProteomeXchange data sets, promoting efficient comparisons among diverse outputs. Through an evaluation using experimental ground truth data and a realistic biological data set, we uncover significant disparities and a limited overlap in the quantified proteins. WOMBAT-P not only enables rapid execution and seamless comparison of workflows but also provides valuable insights into the capabilities of different software solutions. These benchmarking metrics are a valuable resource for researchers in selecting the most suitable workflow for their specific data sets. The modular architecture of WOMBAT-P promotes extensibility and customization. The software is available at https://github.com/wombat-p/WOMBAT-Pipelines.
Collapse
Affiliation(s)
- David Bouyssié
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
| | - Pınar Altıner
- Institut de Pharmacologie et de Biologie Structurale (IPBS), Université de Toulouse, CNRS, Université Toulouse III─Paul Sabatier (UT3), 31062 Toulouse, France
| | | | - José M Fernández
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Yanick Paco Hagemeijer
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
- European Research Institute for the Biology of Ageing, University Medical Center Groningen, 9713 GZ Groningen, The Netherlands
| | - Peter Horvatovich
- Department of Analytical Biochemistry, University of Groningen, Groningen Research Institute of Pharmacy, 9712 CP Groningen, The Netherlands
| | - Martin Hubálek
- Institute of Organic Chemistry and Biochemistry, CAS, 160 00 Prague, Czech Republic
| | - Fredrik Levander
- National Bioinformatics Infrastructure Sweden (NBIS), Science for Life Laboratory, Department of Immunotechnology, Lund University, 22100 Lund, Sweden
| | - Pierluigi Mauri
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Magnus Palmblad
- Leiden University Medical Center, Postbus 9600, 2300 RC Leiden, The Netherlands
| | - Wolfgang Raffelsberger
- Wolfgang Raffelsberger: Institut de Génétique et de Biologie Moléculaire et Cellulaire, Université de Strasbourg, CNRS UMR7104, INSERM U1258, Illkirch, 1 Rue Laurent Fries, 67404 Illkirch, France
| | - Laura Rodríguez-Navas
- Life Sciences Department, Barcelona Supercomputing Center (BSC), 08034 Barcelona, Spain
| | - Dario Di Silvestre
- Institute for Biomedical Technologies (ITB), Department of Biomedical Sciences, National Research Council (CNR), Segrate, 20054 Milan, Italy
| | - Balázs Tibor Kunkli
- Balázs Tibor Kunkli: Department of Biochemistry and Molecular Biology, University of Debrecen, 4032 Debrecen, Hungary
| | - Julian Uszkoreit
- Medical Faculty, Medical Bioinformatics, Ruhr University Bochum, 44801 Bochum, Germany
- Center for Protein Diagnostics (ProDi), Medical Proteome Analysis, Ruhr University Bochum, 44801 Bochum, Germany
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Yves Vandenbrouck
- Proteomics French Infrastructure, ProFI, FR 2048 Toulouse, France
- CEA, Fundamental Research Division, Proteomics French Infrastructure, 91191 Gif-sur-Yvette, France
| | - Juan Antonio Vizcaíno
- European Molecular Biology Laboratory European Bioinformatics Institute (EMBL-EBI), Wellcome Trust, Genome Campus, Hinxton, Cambridge CB10 1SD, U.K
| | - Dirk Winkelhardt
- Medical Faculty, Medizinisches Proteom-Center, Ruhr University Bochum, 44801 Bochum, Germany
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology, University of Southern Denmark, Campusvej 55, 5230 Odense M, Denmark
| |
Collapse
|
3
|
Novotný P, Seifert S, Rohn M, Diewald W, Štech M, Triebel D. Software infrastructure and data pipelines established for technical interoperability within a cross-border cooperation for the flora of the Bohemian Forest. Biodivers Data J 2022; 10:e87254. [PMID: 36761603 PMCID: PMC9836605 DOI: 10.3897/bdj.10.e87254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 09/04/2022] [Indexed: 11/12/2022] Open
Abstract
Background The timely and geographical resolutions, as well as the quantity and taxon concepts of records on the occurrence of plants near national borders is often ambiguous. This is due to the regional focus and different approaches of the contributing national and regional databases and networks of the neighbouring countries. Careful data transformation between national data providers is essential for understanding distribution patterns and its dynamics for organisms in areas along the national borders. Sharing occurrence data through the international data aggregator Global Biodiversity Information Facility (GBIF) is also complicated and has to consider that the underlying taxonomic concept and geographic information system of each single GBIF dataset might be different. In addition, some regional data providers have a restrictive (non-cc) licensing policy which does not allow data publication via the GBIF network. Therefore, it is necessary to investigate new ways to make data fit for use for a better and comprehensive understanding of the Flora of the Bohemian Forest. New information In this paper, we present a bilateral technical interoperability solution for vascular plant occurrence data for the area between the Czech Republic and Bavaria. We describe the initial state of data providers in both countries and the factual and technical challenges in finding a sustainable concept to establish mutual data sharing. The resulting solution for a functional infrastructure and an agreed data pipeline is described in a step-by-step approach. The new distributed infrastructure allows botanists and other stakeholders from both countries to work within the cross-border context of historical and current plants' distribution.
Collapse
Affiliation(s)
- Petr Novotný
- Department of Biology Education, Faculty of Science, Charles University, Praha, Czech RepublicDepartment of Biology Education, Faculty of Science, Charles UniversityPrahaCzech Republic
| | - Stefan Seifert
- Staatliche Naturwissenschaftliche Sammlungen Bayerns, SNSB IT Center, Munich, Germany, Munich, GermanyStaatliche Naturwissenschaftliche Sammlungen Bayerns, SNSB IT Center, Munich, GermanyMunichGermany
| | - Martin Rohn
- Babická 2379/1a, Praha, Czech RepublicBabická 2379/1aPrahaCzech Republic
| | - Wolfgang Diewald
- Staatliche Naturwissenschaftliche Sammlungen Bayerns, Botanische Staatssammlung München, Munich, GermanyStaatliche Naturwissenschaftliche Sammlungen Bayerns, Botanische Staatssammlung MünchenMunichGermany
| | - Milan Štech
- Department of Botany, Faculty of Science, University of South Bohemia, České Budějovice, Czech RepublicDepartment of Botany, Faculty of Science, University of South BohemiaČeské BudějoviceCzech Republic
| | - Dagmar Triebel
- Staatliche Naturwissenschaftliche Sammlungen Bayerns, SNSB IT Center, Munich, Germany, Munich, GermanyStaatliche Naturwissenschaftliche Sammlungen Bayerns, SNSB IT Center, Munich, GermanyMunichGermany
| |
Collapse
|
4
|
Peer L, Biniossek C, Betz D, Christian TM. Reproducible Research Publication Workflow: A Canonical Workflow Framework and FAIR Digital Object Approach to Quality Research Output. DATA INTELLIGENCE 2022. [DOI: 10.1162/dint_a_00133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/04/2022] Open
Abstract
Abstract
In this paper we present the Reproducible Research Publication Workflow (RRPW) as an example of how generic canonical workflows can be applied to a specific context. The RRPW includes essential steps between submission and final publication of the manuscript and the research artefacts (i.e., data, code, etc.) that underlie the scholarly claims in the manuscript. A key aspect of the RRPW is the inclusion of artefact review and metadata creation as part of the publication workflow. The paper discusses a formalized technical structure around a set of canonical steps which helps codify and standardize the process for researchers, curators, and publishers. The proposed application of canonical workflows can help achieve the goals of improved transparency and reproducibility, increase FAIR compliance of all research artefacts at all steps, and facilitate better exchange of annotated and machine-readable metadata.
Collapse
Affiliation(s)
- Limor Peer
- 1nstitution for Social and Policy Studies, Yale University, Connecticut 06520, USA
| | - Claudia Biniossek
- Center for Empirical Research in Economics and Behavioral Sciences (CEREB), University of Erfurt, Thüringen 99089, Germany
| | - Dirk Betz
- Center for Empirical Research in Economics and Behavioral Sciences (CEREB), University of Erfurt, Thüringen 99089, Germany
| | - Thu-Mai Christian
- Odum Institute for Research in Social Science, University of North Carolina System, North Carolina 27514-3916, USA
| |
Collapse
|
5
|
Chamanara J, Gaikwad J, Gerlach R, Algergawy A, Ostrowski A, König-Ries B. BEXIS2: A FAIR-aligned data management system for biodiversity, ecology and environmental data. Biodivers Data J 2021; 9:e72901. [PMID: 34785977 PMCID: PMC8589773 DOI: 10.3897/bdj.9.e72901] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2021] [Accepted: 10/15/2021] [Indexed: 11/12/2022] Open
Abstract
Background Obtaining fit-to-use data associated with diverse aspects of biodiversity, ecology and environment is challenging since often it is fragmented, sub-optimally managed and available in heterogeneous formats. Recently, with the universal acceptance of the FAIR data principles, the requirements and standards of data publications have changed substantially. Researchers are encouraged to manage the data as per the FAIR data principles and ensure that the raw data, metadata, processed data, software, codes and associated material are securely stored and the data be made available with the completion of the research. New information We have developed BEXIS2 as an open-source community-driven web-based research data management system to support research data management needs of mid to large-scale research projects with multiple sub-projects and up to several hundred researchers. BEXIS2 is a modular and extensible system providing a range of functions to realise the complete data lifecycle from data structure design to data collection, data discovery, dissemination, integration, quality assurance and research planning. It is an extensible and customisable system that allows for the development of new functions and customisation of its various components from database schemas to the user interface layout, elements and look and feel. During the development of BEXIS2, we aimed to incorporate key aspects of what is encoded in FAIR data principles. To investigate the extent to which BEXIS2 conforms to these principles, we conducted the self-assessment using the FAIR indicators, definitions and criteria provided in the FAIR Data Maturity Model. Even though the FAIR data maturity model is developed initially to judge the conformance of datasets, the self-assessment results indicated that BEXIS2 remarkably conforms and supports FAIR indicators. BEXIS2 strongly conforms to the indicators Findability and Accessibility. The indicator Interoperability is moderately supported as of now; however, for many of the lesssupported facets, we have concrete plans for improvement. Reusability (as defined by the FAIR data principles) is partially achieved. This paper also illustrates community deployment examples of the BEXIS2 instances as success stories to exemplify its capacity to meet the biodiversity and ecological data management needs of differently sized projects and serve as an organisational research data management system.
Collapse
Affiliation(s)
- Javad Chamanara
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany.,TIB - Leibniz Information Centre for Science and Technology and University Library, Hannover, Germany TIB - Leibniz Information Centre for Science and Technology and University Library Hannover Germany
| | - Jitendra Gaikwad
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany
| | - Roman Gerlach
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany
| | - Alsayed Algergawy
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany
| | - Andreas Ostrowski
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany
| | - Birgitta König-Ries
- Friedrich-Schiller-Universität Jena, Jena, Germany Friedrich-Schiller-Universität Jena Jena Germany.,German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig, Jena, Germany German Centre for Integrative Biodiversity Research (iDiv) Halle-Jena-Leipzig Jena Germany
| |
Collapse
|
6
|
Gkoutselis G, Rohrbach S, Harjes J, Obst M, Brachmann A, Horn MA, Rambold G. Microplastics accumulate fungal pathogens in terrestrial ecosystems. Sci Rep 2021; 11:13214. [PMID: 34267241 PMCID: PMC8282651 DOI: 10.1038/s41598-021-92405-7] [Citation(s) in RCA: 88] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2021] [Accepted: 06/03/2021] [Indexed: 02/05/2023] Open
Abstract
Microplastic (MP) is a pervasive pollutant in nature that is colonised by diverse groups of microbes, including potentially pathogenic species. Fungi have been largely neglected in this context, despite their affinity for plastics and their impact as pathogens. To unravel the role of MP as a carrier of fungal pathogens in terrestrial ecosystems and the immediate human environment, epiplastic mycobiomes from municipal plastic waste from Kenya were deciphered using ITS metabarcoding as well as a comprehensive meta-analysis, and visualised via scanning electron as well as confocal laser scanning microscopy. Metagenomic and microscopic findings provided complementary evidence that the terrestrial plastisphere is a suitable ecological niche for a variety of fungal organisms, including important animal and plant pathogens, which formed the plastisphere core mycobiome. We show that MPs serve as selective artificial microhabitats that not only attract distinct fungal communities, but also accumulate certain opportunistic human pathogens, such as cryptococcal and Phoma-like species. Therefore, MP must be regarded a persistent reservoir and potential vector for fungal pathogens in soil environments. Given the increasing amount of plastic waste in terrestrial ecosystems worldwide, this interrelation may have severe consequences for the trans-kingdom and multi-organismal epidemiology of fungal infections on a global scale.
Collapse
Affiliation(s)
- Gerasimos Gkoutselis
- Department of Mycology, University of Bayreuth, Universitätsstraße 30, 95447, Bayreuth, Germany
| | - Stephan Rohrbach
- Institute of Microbiology, Leibniz University Hannover, 30419, Hannover, Germany
| | - Janno Harjes
- Department of Mycology, University of Bayreuth, Universitätsstraße 30, 95447, Bayreuth, Germany
| | - Martin Obst
- Experimental Biogeochemistry, BayCEER, University of Bayreuth, 95448, Bayreuth, Germany
| | - Andreas Brachmann
- Genetics, Faculty of Biology, Ludwig Maximilian University Munich, 82152, Martinsried, Germany
| | - Marcus A Horn
- Institute of Microbiology, Leibniz University Hannover, 30419, Hannover, Germany.
| | - Gerhard Rambold
- Department of Mycology, University of Bayreuth, Universitätsstraße 30, 95447, Bayreuth, Germany.
| |
Collapse
|
7
|
Telatin A. Qiime Artifact eXtractor (qax): A Fast and Versatile Tool to Interact with Qiime2 Archives. BIOTECH 2021; 10:biotech10010005. [PMID: 35822777 PMCID: PMC9245473 DOI: 10.3390/biotech10010005] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2020] [Revised: 02/24/2021] [Accepted: 02/25/2021] [Indexed: 01/04/2023] Open
Abstract
Qiime2 is one of the most popular software tools used for analysis of output from metabarcoding experiments (e.g., sequencing of 16S, 18S, or ITS amplicons). Qiime2 introduced a novel and innovative data exchange format: the ‘Qiime2 artifact’. Qiime2 artifacts are structured compressed archives containing a dataset and its associated metadata. Examples of datasets are FASTQ reads, representative sequences in FASTA format, a phylogenetic tree in Newick format, while examples of metadata are the command that generated the artifact, information on the execution environment, citations on the used software, and all the metadata of the artifacts used to produce it. While artifacts can improve the shareability and reproducibility of Qiime2 workflows, they are less easily integrated with general bioinformatics pipelines. Accessing metadata in the artifacts also requires full Qiime2 installation. Qiime Artifact eXtractor (qax) allows users to easily interface with Qiime2 artifacts from the command line, without needing the full Qiime2 environment installed (or activated).
Collapse
Affiliation(s)
- Andrea Telatin
- Quadram Institute Bioscience, Gut Microbes and Health Program, Norwich Research Park, Norwich NR4 7UQ, UK
| |
Collapse
|