Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Harrow J, Drysdale R, Smith A, Repo S, Lanfear J, Blomberg N. ELIXIR: Providing a Sustainable Infrastructure for Life Science Data at European Scale. Bioinformatics 2021;37:2506-2511. [PMID: 34175941 PMCID: PMC8388016 DOI: 10.1093/bioinformatics/btab481] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 02/19/2021] [Accepted: 06/25/2021] [Indexed: 11/12/2022] Open

For:	Harrow J, Drysdale R, Smith A, Repo S, Lanfear J, Blomberg N. ELIXIR: Providing a Sustainable Infrastructure for Life Science Data at European Scale. Bioinformatics 2021;37:2506-2511. [PMID: 34175941 PMCID: PMC8388016 DOI: 10.1093/bioinformatics/btab481] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 02/19/2021] [Accepted: 06/25/2021] [Indexed: 11/12/2022] Open

Number

Cited by Other Article(s)

Alper P, Dĕd V, Herzinger S, Grouès V, Peter S, Lebioda J, Ebermann L, Popleteeva M, Barry ND, Welter D, Ghosh S, Becker R, Schneider R, Gu W, Trefois C, Satagopam V. DS-PACK: Tool assembly for the end-to-end support of controlled access human data sharing. Sci Data 2024;11:501. [PMID: 38750048 PMCID: PMC11096168 DOI: 10.1038/s41597-024-03326-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2023] [Accepted: 04/29/2024] [Indexed: 05/18/2024] Open

Affiliation(s)

Pinar Alper Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg. ELIXIR Luxembourg, Belvaux, Luxembourg.
Vilém Dĕd ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Sascha Herzinger ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Valentin Grouès ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Sarah Peter ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Jacek Lebioda Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Linda Ebermann Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Marina Popleteeva ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Nene Djenaba Barry Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Danielle Welter Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Soumyabrata Ghosh ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Regina Becker Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Reinhard Schneider ELIXIR Luxembourg, Belvaux, Luxembourg Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg
Wei Gu Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Christophe Trefois Luxembourg National Data Service, PNED GIE, Esch-sur-Alzette, L-4362, Luxembourg ELIXIR Luxembourg, Belvaux, Luxembourg
Venkata Satagopam ELIXIR Luxembourg, Belvaux, Luxembourg. Luxembourg Centre for Systems Biomedicine, University of Luxembourg, Belvaux, L-4367, Luxembourg.

Collapse

Brady A, Charbonneau A, Grossman RL, Creasy HH, Renner R, Pihl T, Otridge J, Kim E, Barnholtz-Sloan JS, Kerlavage AR. NCI Cancer Research Data Commons: Core Standards and Services. Cancer Res 2024;84:1384-1387. [PMID: 38488505 PMCID: PMC11067691 DOI: 10.1158/0008-5472.can-23-2655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Revised: 01/05/2024] [Accepted: 02/28/2024] [Indexed: 05/03/2024]

Jentsch M, Schneider-Lunitz V, Taron U, Braun M, Ishaque N, Wagener H, Conrad C, Twardziok S. Creating cloud platforms for supporting FAIR data management in biomedical research projects. F1000Res 2024;13:8. [PMID: 38779317 PMCID: PMC11109697 DOI: 10.12688/f1000research.140624.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 04/25/2024] [Indexed: 05/25/2024] Open

Almeida JR, Zúquete A, Pazos A, Oliveira JL. A federated authentication schema among multiple identity providers. Heliyon 2024;10:e28560. [PMID: 38590890 PMCID: PMC10999912 DOI: 10.1016/j.heliyon.2024.e28560] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Revised: 03/08/2024] [Accepted: 03/20/2024] [Indexed: 04/10/2024] Open

Insana G, Ignatchenko A, Martin M, Bateman A. MBDBMetrics: an online metrics tool to measure the impact of biological data resources. BIOINFORMATICS ADVANCES 2023;3:vbad180. [PMID: 38130879 PMCID: PMC10733715 DOI: 10.1093/bioadv/vbad180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 11/13/2023] [Indexed: 12/23/2023]

de Visser C, Johansson LF, Kulkarni P, Mei H, Neerincx P, Joeri van der Velde K, Horvatovich P, van Gool AJ, Swertz MA, Hoen PAC‘, Niehues A. Ten quick tips for building FAIR workflows. PLoS Comput Biol 2023;19:e1011369. [PMID: 37768885 PMCID: PMC10538699 DOI: 10.1371/journal.pcbi.1011369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/30/2023] Open

Affiliation(s)

Casper de Visser Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
Lennart F. Johansson Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Purva Kulkarni Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
Hailiang Mei Sequencing Analysis Support Core, Department of Biomedical Data Sciences, Leiden University Medical Center, Leiden, the Netherlands
Pieter Neerincx Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
K. Joeri van der Velde Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Péter Horvatovich Department of Analytical Biochemistry, Groningen Research Institute of Pharmacy, University of Groningen, Groningen, the Netherlands
Alain J. van Gool Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands Department of Human Genetics, Radboud University Medical Center, Nijmegen, the Netherlands
Morris A. Swertz Genomics Coordination Center and Department of Genetics, University of Groningen, University Medical Center Groningen, Groningen, the Netherlands
Peter A. C. ‘t Hoen Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands
Anna Niehues Medical BioSciences Department, Radboud University Medical Center, Nijmegen, the Netherlands Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud University Medical Center, Nijmegen, the Netherlands

Collapse

Justesen TF, Gögenur I, Tarpgaard LS, Pfeiffer P, Qvortrup C. Evaluating the efficacy and safety of neoadjuvant pembrolizumab in patients with stage I-III MMR-deficient colon cancer: a national, multicentre, prospective, single-arm, phase II study protocol. BMJ Open 2023;13:e073372. [PMID: 37349100 PMCID: PMC10314641 DOI: 10.1136/bmjopen-2023-073372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 03/03/2023] [Accepted: 06/08/2023] [Indexed: 06/24/2023] Open

Devignes MD, Smaïl-Tabbone M, Dhondge H, Dolcemascolo R, Gavaldá-García J, Higuera-Rodriguez RA, Kravchenko A, Roca Martínez J, Messini N, Pérez-Ràfols A, Pérez Ropero G, Sperotto L, Chauvot de Beauchêne I, Vranken W. Experiences with a training DSW knowledge model for early-stage researchers. OPEN RESEARCH EUROPE 2023;3:97. [PMID: 37645489 PMCID: PMC10445825 DOI: 10.12688/openreseurope.15609.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 05/30/2023] [Indexed: 08/31/2023]

Affiliation(s)

Marie-Dominique Devignes Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-5400, France
Malika Smaïl-Tabbone Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-5400, France
Hrishikesh Dhondge Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-5400, France
Roswitha Dolcemascolo Institute for Integrative Systems Biology (I2SysBio), CSIC - University of Valencia, Paterna, 46980, Spain Department of Biotechnology, Polytechnic University of Valencia, Valencia, 46022, Spain
Jose Gavaldá-García Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, 1050, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, 1050, Belgium
R. Anahí Higuera-Rodriguez Dynamic Biosensors GmbH, Munich, 81379, Germany Department of Physics, School of Natural Sciences, Technical University of Munich, Garching, 85748, Germany
Anna Kravchenko Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-5400, France
Joel Roca Martínez Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, 1050, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, 1050, Belgium
Niki Messini Department of Bioscience, School of Natural Sciences, Technical University of Munich, Garching, 85748, Germany
Anna Pérez-Ràfols Giotto Biotech s.r.l,, Florence, 50019, Italy Magnetic Resonance Center (CERM), Department of Chemistry “Ugo Schiff”, University of Florence, Florence, 50019, Italy
Guillermo Pérez Ropero Department of Chemistry-BMC, Uppsala University, Uppsala, 75123, Sweden Ridgeview Instruments AB, Uppsala, 75237, Sweden
Luca Sperotto Department of Bioscience, School of Natural Sciences, Technical University of Munich, Garching, 85748, Germany
Isaure Chauvot de Beauchêne Université de Lorraine, CNRS, Inria, LORIA, Nancy, F-5400, France
Wim Vranken Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, 1050, Belgium Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, 1050, Belgium

Collapse

Grossman RL. Ten lessons for data sharing with a data commons. Sci Data 2023;10:120. [PMID: 36878917 PMCID: PMC9988927 DOI: 10.1038/s41597-023-02029-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2022] [Accepted: 02/17/2023] [Indexed: 03/08/2023] Open

Hooft RW, Harrison E, Martin CS. The road to success: drawing parallels between 'road' and 'research data' infrastructures to foster understanding between service providers, funders and policymakers. F1000Res 2023;12:ELIXIR-88. [PMID: 37065508 PMCID: PMC10102711 DOI: 10.12688/f1000research.128167.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 12/13/2022] [Indexed: 01/24/2023] Open

Hooft RW, Harrison E, Martin CS. The road to success: drawing parallels between 'road' and 'research data' infrastructures to foster understanding between service providers, funders and policymakers. F1000Res 2023;12:ELIXIR-88. [PMID: 37065508 PMCID: PMC10102711 DOI: 10.12688/f1000research.128167.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/18/2023] [Indexed: 08/25/2023] Open

Arend D, Scholz U, Lange M. The Plant Phenomics and Genomics Research Data Repository: An On-Premise Approach for FAIR-Compliant Data Acquisition. Methods Mol Biol 2023;2703:3-22. [PMID: 37646933 DOI: 10.1007/978-1-0716-3389-2_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]

Ross-Hellauer T, Klebel T, Bannach-Brown A, Horbach SP, Jabeen H, Manola N, Metodiev T, Papageorgiou H, Reczko M, Sansone SA, Schneider J, Tijdink J, Vergoulis T. TIER2: enhancing Trust, Integrity and Efficiency in Research through next-level Reproducibility. RESEARCH IDEAS AND OUTCOMES 2022. [DOI: 10.3897/rio.8.e98457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open

Data platforms for open life sciences-A systematic analysis of management instruments. PLoS One 2022;17:e0276204. [PMID: 36282849 PMCID: PMC9595524 DOI: 10.1371/journal.pone.0276204] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2022] [Accepted: 10/02/2022] [Indexed: 11/05/2022] Open

Vescovi R, Chard R, Saint ND, Blaiszik B, Pruyne J, Bicer T, Lavens A, Liu Z, Papka ME, Narayanan S, Schwarz N, Chard K, Foster IT. Linking scientific instruments and computation: Patterns, technologies, and experiences. PATTERNS (NEW YORK, N.Y.) 2022;3:100606. [PMID: 36277824 PMCID: PMC9583115 DOI: 10.1016/j.patter.2022.100606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 08/07/2022] [Accepted: 09/14/2022] [Indexed: 11/07/2022]

Abstract

Powerful detectors at modern experimental facilities routinely collect data at multiple GB/s. Online analysis methods are needed to enable the collection of only interesting subsets of such massive data streams, such as by explicitly discarding some data elements or by directing instruments to relevant areas of experimental space. Thus, methods are required for configuring and running distributed computing pipelines—what we call flows—that link instruments, computers (e.g., for analysis, simulation, artificial intelligence [AI] model training), edge computing (e.g., for analysis), data stores, metadata catalogs, and high-speed networks. We review common patterns associated with such flows and describe methods for instantiating these patterns. We present experiences with the application of these methods to the processing of data from five different scientific instruments, each of which engages powerful computers for data inversion,model training, or other purposes. We also discuss implications of such methods for operators and users of scientific facilities.

•

Patterns for linking instruments and computers for online analysis are reviewed

•

Methods are presented for capturing such “flows” in reusable forms

•

The use of Globus automation services to run flows is described

•

Implications of these methods for scientists and facilities are discussed

The industrial revolution transformed society via large-scale automation of manufacturing. Today, AI- and robotics-driven automation of scientific research seems set to usher in a new era of accelerated discovery. But just as the industrial revolution depended on new replicable and scalable manufacturing processes and methods for delivering the copious mechanical power required by those processes, so the automated discovery revolution demands new methods for implementing research automation processes and for connecting those processes to computing and data power. We present here new methods that address these essential needs by allowing scientists to capture common automation patterns in reusable flows and to embed such flows in a global trust, data, and computing fabric that enables instant access to powerful AI, simulation, and other computational capabilities. We use examples from synchrotron light sources to show how these methods can be realized in software and applied at scale.

Collapse

Affiliation(s)

Rafael Vescovi Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Ryan Chard Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Nickolaus D Saint Globus, University of Chicago, 5730 S. Ellis Ave., Chicago, IL 60615, USA
Ben Blaiszik Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,Globus, University of Chicago, 5730 S. Ellis Ave., Chicago, IL 60615, USA
Jim Pruyne Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,Globus, University of Chicago, 5730 S. Ellis Ave., Chicago, IL 60615, USA
Tekin Bicer Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,X-ray Science Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Alex Lavens Structural Biology Center, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Zhengchun Liu Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Michael E Papka Argonne Leadership Computing Facility, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,Department of Computer Science, University of Illinois Chicago, 1200 W. Harrison St., Chicago, IL 60607, USA
Suresh Narayanan X-ray Science Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Nicholas Schwarz X-ray Science Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA
Kyle Chard Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,Department of Computer Science, University of Chicago, 5730 S. Ellis Ave., Chicago, IL 60615, USA
Ian T Foster Data Science and Learning Division, Argonne National Laboratory, 9700 S. Cass Ave., Lemont, IL 60439, USA.,Department of Computer Science, University of Chicago, 5730 S. Ellis Ave., Chicago, IL 60615, USA

Collapse

De Geest P, Coppens F, Soiland-Reyes S, Eguinoa I, Leo S. Enhancing RDM in Galaxy by integrating RO-Crate. RESEARCH IDEAS AND OUTCOMES 2022. [DOI: 10.3897/rio.8.e95164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Abstract We introduce how the Galaxy research environment (Jalili et al. 2020) integrates with RO-Crate as an implementation of Findable Accessible Interoperable Reproducible Digital Objects (FAIR Digital Objects / FDO) (Wilkinson et al. 2016, Schultes and Wittenburg 2018) and how using RO-Crate as an exchange mechanism of workflows and their execution history helps integrate Galaxy with the wider ecosystem of ELIXIR (Harrow et al. 2021) and the European Open Science Cloud (EOSC-Life) to enable FAIR and reproducible data analysis. RO-Crate (Soiland-Reyes et al. 2022) is a generic packaging format containing datasets and their description using standards for FAIR Linked Data. The format is based on schema.org (Guha et al. 2016) annotations in JSON-LD, which allows for rich metadata representation. The RO-Crate effort aims to make best-practice in formal metadata description accessible and practical for use in a wider variety of situations, from an individual researcher working with a folder of data, to large data-intensive computational research environments. The RO-Crate community brings together practitioners from very different backgrounds, and with different motivations and use cases. Among the core target users are:

researchers engaged with computation and data-intensive, workflow-driven analysis;

digital repository managers and infrastructure providers;

individual researchers looking for a straightforward tool or how-to guide to “FAIRify” their data;

data stewards supporting research projects in creating and curating datasets.

researchers engaged with computation and data-intensive, workflow-driven analysis; digital repository managers and infrastructure providers; individual researchers looking for a straightforward tool or how-to guide to “FAIRify” their data; data stewards supporting research projects in creating and curating datasets. Given the wide applicability of RO-Crate and the lack of practical implementations of FDOs, ELIXIR (Harrow et al. 2021) co-opted this initiative as the project to define a common format for research data exchange and repository entries. Thus, during the last year it’s been implemented in a wide range of services, such as: WorkflowHub (Goble et al. 2021) (a registry for describing, sharing and publishing scientific computational workflows) uses RO-Crates as an exchange format to improve reproducibility of computational workflows that follow the Workflow RO-Crate profile (Bacall et al. 2022); LifeMonitor (Leo et al. 2022) (a service to support the sustainability of computational workflows being developed as part of the EOSC-Life project) uses RO-Crate as an exchange format for describing test suites associated with workflows. Tools have been developed towards aiding the previously mentioned use cases and increasing the general usability of RO-Crates by providing a user-friendly (programmatic) interface for consumption and production of RO-Crates through programmatic libraries for consuming/producing RO-Crates (ro-crate-py De Geest et al. 2022, ro-crate-ruby Bacall and Whitwell 2022, ro-crate-js Lynch et al. 2021). The Galaxy project provides a research environment with data analysis and data management functionalities as a multi user platform, aiming to make computational biology accessible to research scientists that do not have computer programming or systems administration experience. As such, it stores not just analysis related data but also the complete analytical workflow, including its metadata. The internal data model involves the history entity, including all steps performed in a specific analysis, and the workflow entity, defining the structure of an analytical pipeline. From the start, Galaxy aims to enable reproducible analyses by providing capabilities to export (and import) all the analysis history details and workflow data and metadata in a FAIR way. As such it helps its users with the daily research data management. The Galaxy community is continuously improving and adding features, the integration of the FAIR Digital Object principles is a natural next step in this. To be able to support these FDOs, Galaxy leverages the RO-Crate Python client library (De Geest et al. 2022) and provides multiple entry points to import and export different research data objects representing its internal entities and associated metadata. These objects include:

a workflow definition, which is used to share/publish the details of an analysis pipeline, including the graph of tools that need to be executed, and metadata about the data types required

individual data files or a collection of datasets related to an analysis history

a compressed archive of the entire analysis history including the metadata associated with it such as the tools used, their versions, the parameters chosen, workflow invocation related metadata, inputs, outputs, license, author, CWLProv description (Khan et al. 2019) of the workflow, contextual references in the form of Digital Object Identifiers (DOIs), ‘EMBRACE Data And Methods’ ontology (EDAM) terms (Ison et al. 2013), etc.

a workflow definition, which is used to share/publish the details of an analysis pipeline, including the graph of tools that need to be executed, and metadata about the data types required individual data files or a collection of datasets related to an analysis history a compressed archive of the entire analysis history including the metadata associated with it such as the tools used, their versions, the parameters chosen, workflow invocation related metadata, inputs, outputs, license, author, CWLProv description (Khan et al. 2019) of the workflow, contextual references in the form of Digital Object Identifiers (DOIs), ‘EMBRACE Data And Methods’ ontology (EDAM) terms (Ison et al. 2013), etc. The adoption of RO-crate by Galaxy allows a standardised exchange of FDOs with other platforms in the ELIXIR Tools ecosystem, such as WorkflowHub and LifeMonitor. Integrating RO-Crate deeply into Galaxy and offering import and export options of various Galaxy objects such as Research Objects allows for increased standardisation, improved Research Data Management (RDM) functionalities, smoother user experience (UX) as well as improved interoperability with other systems. The integration in a platform used by biologists to do data intensive analysis, facilitates the publication of workflows and workflow invocations for all skill levels and democratises the ability to perform Open Science. Collapse

Arend D, Psaroudakis D, Memon JA, Rey-Mazón E, Schüler D, Szymanski JJ, Scholz U, Junker A, Lange M. From data to knowledge - big data needs stewardship, a plant phenomics perspective. THE PLANT JOURNAL : FOR CELL AND MOLECULAR BIOLOGY 2022;111:335-347. [PMID: 35535481 DOI: 10.1111/tpj.15804] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Revised: 05/02/2022] [Accepted: 05/06/2022] [Indexed: 06/14/2023]

Beier S, Fiebig A, Pommier C, Liyanage I, Lange M, Kersey PJ, Weise S, Finkers R, Koylass B, Cezard T, Courtot M, Contreras-Moreira B, Naamati G, Dyer S, Scholz U. Recommendations for the formatting of Variant Call Format (VCF) files to make plant genotyping data FAIR. F1000Res 2022;11. [PMID: 35811804 PMCID: PMC9218589 DOI: 10.12688/f1000research.109080.2] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 05/17/2022] [Indexed: 11/20/2022] Open

Abstract In this opinion article, we discuss the formatting of files from (plant) genotyping studies, in particular the formatting of metadata in Variant Call Format (VCF) files. The flexibility of the VCF format specification facilitates its use as a generic interchange format across domains but can lead to inconsistency between files in the presentation of metadata. To enable fully autonomous machine actionable data flow, generic elements need to be further specified. We strongly support the merits of the FAIR principles and see the need to facilitate them also through technical implementation specifications. They form a basis for the proposed VCF extensions here. We have learned from the existing application of VCF that the definition of relevant metadata using controlled standards, vocabulary and the consistent use of cross-references via resolvable identifiers (machine-readable) are particularly necessary and propose their encoding. VCF is an established standard for the exchange and publication of genotyping data. Other data formats are also used to capture variant data (for example, the HapMap and the gVCF formats), but none currently have the reach of VCF. For the sake of simplicity, we will only discuss VCF and our recommendations for its use, but these recommendations could also be applied to gVCF. However, the part of the VCF standard relating to metadata (as opposed to the actual variant calls) defines a syntactic format but no vocabulary, unique identifier or recommended content. In practice, often only sparse descriptive metadata is included. When descriptive metadata is provided, proprietary metadata fields are frequently added that have not been agreed upon within the community which may limit long-term and comprehensive interoperability. To address this, we propose recommendations for supplying and encoding metadata, focusing on use cases from plant sciences. We expect there to be overlap, but also divergence, with the needs of other domains. Collapse

Affiliation(s)

Sebastian Beier Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany Institute of Bio- and Geosciences, Bioinformatics (IBG-4), Forschungszentrum Jülich GmbH, Jülich, 52425, Germany
Anne Fiebig Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany
Cyril Pommier BioinfOmics, Plant bioinformatics facility, Université Paris-Saclay, INRAE, Versailles, France
Isuru Liyanage European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Matthias Lange Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany
Paul J. Kersey Royal Botanic Gardens, Kew, Richmond, UK
Stephan Weise Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany
Richard Finkers Plant Breeding, Wageningen University & Research, Wageningen, The Netherlands Gennovation B.V., Wageningen, The Netherlands
Baron Koylass European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Timothee Cezard European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Mélanie Courtot European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK Ontario Institute for Cancer Research, Toronto, Canada
Bruno Contreras-Moreira Laboratorio de Biología Computacional y Estructural, Estación Experimental Aula Dei-CSIC, Zaragoza, 50059, Spain
Guy Naamati European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Sarah Dyer European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
Uwe Scholz Breeding Research, Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany

Collapse

Melo AM, Oliveira S, Oliveira JS, Martin CS, Leite RB. Making European performance and impact assessment frameworks for research infrastructures glocal. F1000Res 2022;11:ELIXIR-278. [PMID: 36016992 PMCID: PMC9372636 DOI: 10.12688/f1000research.108804.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 07/29/2022] [Indexed: 11/23/2022] Open

Melo AM, Oliveira S, Oliveira JS, Martin CS, Leite RB. Making European performance and impact assessment frameworks glocal. F1000Res 2022;11:ELIXIR-278. [PMID: 36016992 PMCID: PMC9372636 DOI: 10.12688/f1000research.108804.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 02/16/2022] [Indexed: 01/30/2024] Open

Towards efficient use of data, models and tools in food microbiology. Curr Opin Food Sci 2022. [DOI: 10.1016/j.cofs.2022.100834] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]

Collaborative Data Use between Private and Public Stakeholders—A Regional Case Study. DATA 2022. [DOI: 10.3390/data7020020] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open

Freeberg MA, Fromont LA, D’Altri T, Romero AF, Ciges J, Jene A, Kerry G, Moldes M, Ariosa R, Bahena S, Barrowdale D, Barbero M, Fernandez-Orth D, Garcia-Linares C, Garcia-Rios E, Haziza F, Juhasz B, Llobet O, Milla G, Mohan A, Rueda M, Sankar A, Shaju D, Shimpi A, Singh B, Thomas C, de la Torre S, Uyan U, Vasallo C, Flicek P, Guigo R, Navarro A, Parkinson H, Keane T, Rambla J. The European Genome-phenome Archive in 2021. Nucleic Acids Res 2022;50:D980-D987. [PMID: 34791407 PMCID: PMC8728218 DOI: 10.1093/nar/gkab1059] [Citation(s) in RCA: 42] [Impact Index Per Article: 21.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 10/08/2021] [Accepted: 10/22/2021] [Indexed: 12/27/2022] Open

Affiliation(s)

Mallory Ann Freeberg European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Lauren A Fromont Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Teresa D’Altri Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Anna Foix Romero European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Jorge Izquierdo Ciges European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Aina Jene Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Giselle Kerry European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Mauricio Moldes Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Roberto Ariosa Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Silvia Bahena European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Daniel Barrowdale European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Marcos Casado Barbero European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Dietmar Fernandez-Orth Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Carles Garcia-Linares European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Emilio Garcia-Rios European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Frédéric Haziza Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Bela Juhasz European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Oscar Martinez Llobet Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Gemma Milla Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Anand Mohan European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Manuel Rueda Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Aravind Sankar European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Dona Shaju European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Ashutosh Shimpi European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Babita Singh Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Coline Thomas European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Sabela de la Torre Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Umuthan Uyan Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Claudia Vasallo Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Paul Flicek European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Roderic Guigo Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Arcadi Navarro Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain
Helen Parkinson European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Thomas Keane European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI), Hinxton CB10 1SD, UK
Jordi Rambla Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Dr Aiguader 88, Barcelona 08003, Spain

Collapse

Bansal P, Morgat A, Axelsen KB, Muthukrishnan V, Coudert E, Aimo L, Hyka-Nouspikel N, Gasteiger E, Kerhornou A, Neto TB, Pozzato M, Blatter MC, Ignatchenko A, Redaschi N, Bridge A. Rhea, the reaction knowledgebase in 2022. Nucleic Acids Res 2022;50:D693-D700. [PMID: 34755880 PMCID: PMC8728268 DOI: 10.1093/nar/gkab1016] [Citation(s) in RCA: 56] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Revised: 10/08/2021] [Accepted: 11/09/2021] [Indexed: 12/15/2022] Open

Affiliation(s)

Parit Bansal Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Anne Morgat Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Kristian B Axelsen Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Venkatesh Muthukrishnan Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Elisabeth Coudert Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Lucila Aimo Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Nevila Hyka-Nouspikel Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Elisabeth Gasteiger Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Arnaud Kerhornou Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Teresa Batista Neto Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Monica Pozzato Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Marie-Claude Blatter Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Alex Ignatchenko EMBL-EBI European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridge CB10 1SD, UK
Nicole Redaschi Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland
Alan Bridge Swiss-Prot group, SIB Swiss Institute of Bioinformatics, Centre Medical Universitaire, CH-1211 Geneva 4, Switzerland

Collapse

Grapevine and Wine Metabolomics-Based Guidelines for FAIR Data and Metadata Management. Metabolites 2021;11:metabo11110757. [PMID: 34822415 PMCID: PMC8618349 DOI: 10.3390/metabo11110757] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Revised: 10/29/2021] [Accepted: 10/30/2021] [Indexed: 01/12/2023] Open