1
|
Inau ET, Dedié A, Anastasova I, Schick R, Zdravomyslov Y, Fröhlich B, Birkenfeld AL, Hrabě de Angelis M, Roden M, Zeleke AA, Preusse M, Waltemath D. The Journey to a FAIR CORE DATA SET for Diabetes Research in Germany. Sci Data 2024; 11:1159. [PMID: 39433782 PMCID: PMC11494036 DOI: 10.1038/s41597-024-03882-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 09/16/2024] [Indexed: 10/23/2024] Open
Abstract
The German Center for Diabetes Research (DZD) established a core data set (CDS) of clinical parameters relevant for diabetes research in 2021. The CDS is central to the design of current and future DZD studies. Here, we describe the process and outcomes of FAIRifying the initial version of the CDS. We first did a baseline evaluation of the FAIRness using the FAIR Data Maturity Model. The FAIRification process and the results of this assessment led us to convert the CDS into the recommended format for spreadsheets, annotating the parameters with standardized medical codes, licensing the data set, enriching the data set with metadata, and indexing the metadata. The FAIRified version of the CDS is more suitable for data sharing in diabetes research across DZD sites and beyond. It contributes to the reusability of health research studies.
Collapse
Affiliation(s)
- Esther Thea Inau
- Medical Informatics Laboratory, University Medicine Greifswald, Greifswald, Germany.
| | - Angela Dedié
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
| | - Ivona Anastasova
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
| | - Renate Schick
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
| | | | - Brigitte Fröhlich
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
| | - Andreas L Birkenfeld
- German Center for Diabetes Research (DZD), Tübingen, Germany
- Institute for Diabetes Research and Metabolic Diseases of the Helmholtz Zentrum München at the University of Tübingen (IDM), Tübingen, Germany
- Department of Diabetology, Endocrinology, and Nephrology, University Clinic Tübingen, Eberhard Karls University Tübingen, Tübingen, Germany
| | - Martin Hrabě de Angelis
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
- Institute of Experimental Genetics and German Mouse Clinic, Helmholtz Munich, Neuherberg, Germany
- Chair of Experimental Genetics, TUM School of Life Sciences (SoLS), Technische Universität München, Freising, Germany
| | - Michael Roden
- German Center for Diabetes Research (DZD), Düsseldorf, Germany
- Department of Endocrinology and Diabetology, Medical Faculty and University Hospital Düsseldorf, Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
- Institute for Clinical Diabetology, German Diabetes Center, Leibniz Center for Diabetes Research at Heinrich-Heine-University Düsseldorf, Düsseldorf, Germany
| | | | - Martin Preusse
- German Center for Diabetes Research (DZD), München-Neuherberg, Germany
| | - Dagmar Waltemath
- Medical Informatics Laboratory, University Medicine Greifswald, Greifswald, Germany
| |
Collapse
|
2
|
Kemmer I, Keppler A, Serrano-Solano B, Rybina A, Özdemir B, Bischof J, El Ghadraoui A, Eriksson JE, Mathur A. Building a FAIR image data ecosystem for microscopy communities. Histochem Cell Biol 2023; 160:199-209. [PMID: 37341795 PMCID: PMC10492678 DOI: 10.1007/s00418-023-02203-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/27/2023] [Indexed: 06/22/2023]
Abstract
Bioimaging has now entered the era of big data with faster-than-ever development of complex microscopy technologies leading to increasingly complex datasets. This enormous increase in data size and informational complexity within those datasets has brought with it several difficulties in terms of common and harmonized data handling, analysis, and management practices, which are currently hampering the full potential of image data being realized. Here, we outline a wide range of efforts and solutions currently being developed by the microscopy community to address these challenges on the path towards FAIR bioimaging data. We also highlight how different actors in the microscopy ecosystem are working together, creating synergies that develop new approaches, and how research infrastructures, such as Euro-BioImaging, are fostering these interactions to shape the field.
Collapse
Affiliation(s)
- Isabel Kemmer
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Antje Keppler
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Beatriz Serrano-Solano
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Arina Rybina
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Buğra Özdemir
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Johanna Bischof
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - Ayoub El Ghadraoui
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany
| | - John E Eriksson
- Euro-BioImaging ERIC Statutory Seat, Tykistökatu 6, P.O. Box 123, 20521, Turku, Finland
| | - Aastha Mathur
- Euro-BioImaging ERIC Bio-Hub, European Molecular Biology Laboratory (EMBL) Heidelberg, Meyerhofstraße 1, 69117, Heidelberg, Germany.
| |
Collapse
|
3
|
Schmidt C, Hanne J, Moore J, Meesters C, Ferrando-May E, Weidtkamp-Peters S, members of the NFDI4BIOIMAGE initiative. Research data management for bioimaging: the 2021 NFDI4BIOIMAGE community survey. F1000Res 2022; 11:638. [PMID: 36405555 PMCID: PMC9641114 DOI: 10.12688/f1000research.121714.2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/16/2022] [Indexed: 01/13/2023] Open
Abstract
Background: Knowing the needs of the bioimaging community with respect to research data management (RDM) is essential for identifying measures that enable adoption of the FAIR (findable, accessible, interoperable, reusable) principles for microscopy and bioimage analysis data across disciplines. As an initiative within Germany's National Research Data Infrastructure, we conducted this community survey in summer 2021 to assess the state of the art of bioimaging RDM and the community needs. Methods: An online survey was conducted with a mixed question-type design. We created a questionnaire tailored to relevant topics of the bioimaging community, including specific questions on bioimaging methods and bioimage analysis, as well as more general questions on RDM principles and tools. 203 survey entries were included in the analysis covering the perspectives from various life and biomedical science disciplines and from participants at different career levels. Results: The results highlight the importance and value of bioimaging RDM and data sharing. However, the practical implementation of FAIR practices is impeded by technical hurdles, lack of knowledge, and insecurity about the legal aspects of data sharing. The survey participants request metadata guidelines and annotation tools and endorse the usage of image data management platforms. At present, OMERO (Open Microscopy Environment Remote Objects) is the best known and most widely used platform. Most respondents rely on image processing and analysis, which they regard as the most time-consuming step of the bioimage data workflow. While knowledge about and implementation of electronic lab notebooks and data management plans is limited, respondents acknowledge their potential value for data handling and publication. Conclusion: The bioimaging community acknowledges and endorses the value of RDM and data sharing. Still, there is a need for information, guidance, and standardization to foster the adoption of FAIR data handling. This survey may help inspiring targeted measures to close this gap.
Collapse
Affiliation(s)
- Christian Schmidt
- Enabling Technology, German Cancer Research Center, Heidelberg, Baden-Württemberg, Germany,Bioimaging Center, University of Konstanz, Konstanz, Germany,
| | - Janina Hanne
- German BioImaging - Society for Microscopy and Image Analysis e.V., Konstanz, Germany,
| | - Josh Moore
- German BioImaging - Society for Microscopy and Image Analysis e.V., Konstanz, Germany,Open Microscopy Environment Consortium, University of Dundee, Dundee, UK
| | - Christian Meesters
- High Performance Computing, Johannes Gutenberg University Mainz, Mainz, Germany
| | - Elisa Ferrando-May
- Enabling Technology, German Cancer Research Center, Heidelberg, Baden-Württemberg, Germany,Bioimaging Center, University of Konstanz, Konstanz, Germany,German BioImaging - Society for Microscopy and Image Analysis e.V., Konstanz, Germany
| | - Stefanie Weidtkamp-Peters
- German BioImaging - Society for Microscopy and Image Analysis e.V., Konstanz, Germany,Center for Advanced Imaging, Heinrich Heine University Dusseldorf, Dusseldorf, Germany
| | | |
Collapse
|
4
|
Eriksson O, Bhalla US, Blackwell KT, Crook SM, Keller D, Kramer A, Linne ML, Saudargienė A, Wade RC, Hellgren Kotaleski J. Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows. eLife 2022; 11:e69013. [PMID: 35792600 PMCID: PMC9259018 DOI: 10.7554/elife.69013] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 05/13/2022] [Indexed: 12/22/2022] Open
Abstract
Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data - such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles - also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
| | - Upinder Singh Bhalla
- National Center for Biological Sciences, Tata Institute of Fundamental ResearchBangaloreIndia
| | - Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason UniversityFairfaxUnited States
| | - Sharon M Crook
- School of Mathematical and Statistical Sciences, Arizona State UniversityTempeUnited States
| | - Daniel Keller
- Blue Brain Project, École Polytechnique Fédérale de LausanneLausanneSwitzerland
| | - Andrei Kramer
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| | - Marja-Leena Linne
- Faculty of Medicine and Health Technology, Tampere UniversityTampereFinland
| | - Ausra Saudargienė
- Neuroscience Institute, Lithuanian University of Health SciencesKaunasLithuania
- Department of Informatics, Vytautas Magnus UniversityKaunasLithuania
| | - Rebecca C Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
- Center for Molecular Biology (ZMBH), ZMBH-DKFZ Alliance, University of HeidelbergHeidelbergGermany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg UniversityHeidelbergGermany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| |
Collapse
|
5
|
Mahmud M, Kaiser MS, McGinnity TM, Hussain A. Deep Learning in Mining Biological Data. Cognit Comput 2021; 13:1-33. [PMID: 33425045 PMCID: PMC7783296 DOI: 10.1007/s12559-020-09773-x] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Accepted: 09/28/2020] [Indexed: 02/06/2023]
Abstract
Recent technological advancements in data acquisition tools allowed life scientists to acquire multimodal data from different biological application domains. Categorized in three broad types (i.e. images, signals, and sequences), these data are huge in amount and complex in nature. Mining such enormous amount of data for pattern recognition is a big challenge and requires sophisticated data-intensive machine learning techniques. Artificial neural network-based learning systems are well known for their pattern recognition capabilities, and lately their deep architectures-known as deep learning (DL)-have been successfully applied to solve many complex pattern recognition problems. To investigate how DL-especially its different architectures-has contributed and been utilized in the mining of biological data pertaining to those three types, a meta-analysis has been performed and the resulting resources have been critically analysed. Focusing on the use of DL to analyse patterns in data from diverse biological domains, this work investigates different DL architectures' applications to these data. This is followed by an exploration of available open access data sources pertaining to the three data types along with popular open-source DL tools applicable to these data. Also, comparative investigations of these tools from qualitative, quantitative, and benchmarking perspectives are provided. Finally, some open research challenges in using DL to mine biological data are outlined and a number of possible future perspectives are put forward.
Collapse
Affiliation(s)
- Mufti Mahmud
- Department of Computer Science, Nottingham Trent University, Clifton, NG11 8NS Nottingham, UK
- Medical Technology Innovation Facility, Nottingham Trent University, NG11 8NS Clifton, Nottingham, UK
| | - M. Shamim Kaiser
- Institute of Information Technology, Jahangirnagar University, Savar 1342 Dhaka, Bangladesh
| | - T. Martin McGinnity
- Department of Computer Science, Nottingham Trent University, Clifton, NG11 8NS Nottingham, UK
- Intelligent Systems Research Centre, Ulster University, Northern Ireland BT48 7JL Derry, UK
| | - Amir Hussain
- School of Computing , Edinburgh, EH11 4BN Edinburgh, UK
| |
Collapse
|
6
|
|
7
|
Comte B, Baumbach J, Benis A, Basílio J, Debeljak N, Flobak Å, Franken C, Harel N, He F, Kuiper M, Méndez Pérez JA, Pujos-Guillot E, Režen T, Rozman D, Schmid JA, Scerri J, Tieri P, Van Steen K, Vasudevan S, Watterson S, Schmidt HH. Network and Systems Medicine: Position Paper of the European Collaboration on Science and Technology Action on Open Multiscale Systems Medicine. NETWORK AND SYSTEMS MEDICINE 2020; 3:67-90. [PMID: 32954378 PMCID: PMC7500076 DOI: 10.1089/nsm.2020.0004] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2020] [Indexed: 12/14/2022] Open
Abstract
Introduction: Network and systems medicine has rapidly evolved over the past decade, thanks to computational and integrative tools, which stem in part from systems biology. However, major challenges and hurdles are still present regarding validation and translation into clinical application and decision making for precision medicine. Methods: In this context, the Collaboration on Science and Technology Action on Open Multiscale Systems Medicine (OpenMultiMed) reviewed the available advanced technologies for multidimensional data generation and integration in an open-science approach as well as key clinical applications of network and systems medicine and the main issues and opportunities for the future. Results: The development of multi-omic approaches as well as new digital tools provides a unique opportunity to explore complex biological systems and networks at different scales. Moreover, the application of findable, applicable, interoperable, and reusable principles and the adoption of standards increases data availability and sharing for multiscale integration and interpretation. These innovations have led to the first clinical applications of network and systems medicine, particularly in the field of personalized therapy and drug dosing. Enlarging network and systems medicine application would now imply to increase patient engagement and health care providers as well as to educate the novel generations of medical doctors and biomedical researchers to shift the current organ- and symptom-based medical concepts toward network- and systems-based ones for more precise diagnoses, interventions, and ideally prevention. Conclusion: In this dynamic setting, the health care system will also have to evolve, if not revolutionize, in terms of organization and management.
Collapse
Affiliation(s)
- Blandine Comte
- Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Université Clermont Auvergne, INRAE, UNH, Clermont-Ferrand, France
| | - Jan Baumbach
- TUM School of Life Sciences Weihenstephan (WZW), Technical University of Munich (TUM), Freising-Weihenstephan, Germany
| | | | - José Basílio
- Institute of Vascular Biology and Thrombosis Research, Center for Physiology and Pharmacology, Medical University of Vienna, Vienna, Austria
| | - Nataša Debeljak
- Medical Centre for Molecular Biology, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Åsmund Flobak
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
- The Cancer Clinic, St. Olav's University Hospital, Trondheim, Norway
| | - Christian Franken
- Digital Health Systems, Einsingen, Germany
- Department of Pharmacology and Personalised Medicine, Faculty of Health, Medicine and Life Science, Maastricht University, Maastricht, The Netherlands
| | | | - Feng He
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
- Institute of Medical Microbiology, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| | - Martin Kuiper
- Department of Biology, Faculty of Natural Sciences, Norwegian University of Science and Technology, Trondheim, Norway
| | - Juan Albino Méndez Pérez
- Department of Computer Science and Systems Engineering, Universidad de La Laguna, Tenerife, Spain
| | - Estelle Pujos-Guillot
- Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Université Clermont Auvergne, INRAE, UNH, Clermont-Ferrand, France
| | - Tadeja Režen
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Damjana Rozman
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Johannes A. Schmid
- Institute of Vascular Biology and Thrombosis Research, Center for Physiology and Pharmacology, Medical University of Vienna, Vienna, Austria
| | - Jeanesse Scerri
- Department of Physiology and Biochemistry, Faculty of Medicine and Surgery, University of Malta, Msida, Malta
| | - Paolo Tieri
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | | | - Sona Vasudevan
- Georgetown University Medical Centre, Washington, District of Columbia, USA
| | - Steven Watterson
- Northern Ireland Centre for Stratified Medicine, Ulster University, Londonderry, United Kingdom
| | - Harald H.H.W. Schmidt
- Department of Pharmacology and Personalised Medicine, Faculty of Health, Medicine and Life Science, MeHNS, Maastricht University, The Netherlands
| |
Collapse
|
8
|
Wibberg D, Batut B, Belmann P, Blom J, Glöckner FO, Grüning B, Hoffmann N, Kleinbölting N, Rahn R, Rey M, Scholz U, Sharan M, Tauch A, Trojahn U, Usadel B, Kohlbacher O. The de.NBI / ELIXIR-DE training platform - Bioinformatics training in Germany and across Europe within ELIXIR. F1000Res 2019; 8. [PMID: 33163154 PMCID: PMC7607484 DOI: 10.12688/f1000research.20244.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/04/2020] [Indexed: 12/25/2022] Open
Abstract
The German Network for Bioinformatics Infrastructure (de.NBI) is a national and academic infrastructure funded by the German Federal Ministry of Education and Research (BMBF). The de.NBI provides (i) service, (ii) training, and (iii) cloud computing to users in life sciences research and biomedicine in Germany and Europe and (iv) fosters the cooperation of the German bioinformatics community with international network structures. The de.NBI members also run the German node (ELIXIR-DE) within the European ELIXIR infrastructure. The de.NBI / ELIXIR-DE training platform, also known as special interest group 3 (SIG 3) ‘Training & Education’, coordinates the bioinformatics training of de.NBI and the German ELIXIR node. The network provides a high-quality, coherent, timely, and impactful training program across its eight service centers. Life scientists learn how to handle and analyze biological big data more effectively by applying tools, standards and compute services provided by de.NBI. Since 2015, more than 300 training courses were carried out with about 6,000 participants and these courses received recommendation rates of almost 90% (status as of July 2020). In addition to face-to-face training courses, online training was introduced on the de.NBI website in 2016 and guidelines for the preparation of e-learning material were established in 2018. In 2016, ELIXIR-DE joined the ELIXIR training platform. Here, the de.NBI / ELIXIR-DE training platform collaborates with ELIXIR in training activities, advertising training courses via TeSS and discussions on the exchange of data for training events essential for quality assessment on both the technical and administrative levels. The de.NBI training program trained thousands of scientists from Germany and beyond in many different areas of bioinformatics.
Collapse
Affiliation(s)
- Daniel Wibberg
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Bérénice Batut
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, 79110, Germany
| | - Peter Belmann
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus-Liebig-University Giessen, Giessen, 35392, Germany
| | - Frank Oliver Glöckner
- Alfred-Wegener-Institut - Helmholtz Zentrum für Polar- und Meeresforschung and Jacobs University Bremen, Campus Ring 1, Bremen, 28759, Germany
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, 79110, Germany
| | - Nils Hoffmann
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, 44227, Germany
| | - Nils Kleinbölting
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - René Rahn
- Algorithmic Bioinformatics, Department of Mathematics and Computer Science, Freie Universität Berlin, Takustraße 9, Berlin, 14195, Germany
| | - Maja Rey
- Scientific Databases and Visualization Group, Heidelberg Institute for Theoretical Studies (HITS) gGmbH, Schloss-Wolfsbrunnenweg 35, Heidelberg, 69118, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany
| | - Malvika Sharan
- The Heidelberg Center for Human Bioinformatics (HD-HuB), European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117, Germany
| | - Andreas Tauch
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Ulrike Trojahn
- The Heidelberg Center for Human Bioinformatics (HD-HuB), European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117, Germany
| | - Björn Usadel
- IBG-2 Plant Sciences, Forschungszentrum Jülich, Jülich, 52428, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, Tübingen, 72076, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, 72076, Germany.,Translational Bioinformatics, University Hospital Tubingen, Tübingen, 72076, Germany.,Biomolecular Interactions, Max Planck Institute for Development Biology, Tübingen, 72076, Germany
| |
Collapse
|
9
|
Zielinski T, Hay J, Millar AJ. The grant is dead, long live the data - migration as a pragmatic exit strategy for research data preservation. Wellcome Open Res 2019; 4:104. [PMID: 31363499 PMCID: PMC6652102 DOI: 10.12688/wellcomeopenres.15341.2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/18/2019] [Indexed: 11/24/2022] Open
Abstract
Open research, data sharing and data re-use have become a priority for publicly- and charity-funded research. Efficient data management naturally requires computational resources that assist in data description, preservation and discovery. While it is possible to fund development of data management systems, currently it is more difficult to sustain data resources beyond the original grants. That puts the safety of the data at risk and undermines the very purpose of data gathering. PlaSMo stands for ‘Plant Systems-biology Modelling’ and the PlaSMo model repository was envisioned by the plant systems biology community in 2005 with the initial funding lasting until 2010. We addressed the sustainability of the PlaSMo repository and assured preservation of these data by implementing an exit strategy. For our exit strategy we migrated data to an alternative, public repository with secured funding. We describe details of our decision process and aspects of the implementation. Our experience may serve as an example for other projects in a similar situation. We share our reflections on the sustainability of biological data management and the future outcomes of its funding. We expect it to be a useful input for funding bodies.
Collapse
Affiliation(s)
- Tomasz Zielinski
- SynthSys and School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, UK
| | - Johnny Hay
- EPCC, University of Edinburgh, Edinburgh, EH9 3FD, UK
| | - Andrew J Millar
- SynthSys and School of Biological Sciences, University of Edinburgh, Edinburgh, EH9 3BF, UK
| |
Collapse
|
10
|
Zielinski T, Hay J, Millar AJ. The grant is dead, long live the data - migration as a pragmatic exit strategy for research data preservation. Wellcome Open Res 2019; 4:104. [DOI: 10.12688/wellcomeopenres.15341.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 06/28/2019] [Indexed: 11/20/2022] Open
Abstract
Open research, data sharing and data re-use have become a priority for publicly- and charity-funded research. Efficient data management naturally requires computational resources that assist in data description, preservation and discovery. While it is possible to fund development of data management systems, currently it is more difficult to sustain data resources beyond the original grants. That puts the safety of the data at risk and undermines the very purpose of data gathering. PlaSMo stands for ‘Plant Systems-biology Modelling’ and the PlaSMo model repository was envisioned by the plant systems biology community in 2005 with the initial funding lasting till 2010. We addressed the sustainability of the PlaSMo repository and assured preservation of these data by implementing an exit strategy. For our exit strategy we migrated data to an alternative public repository of secured funding. We describe details of our decision process and aspects of the implementation. Our experience may serve as an example for other projects in similar situation. We share our reflections on sustainability of biological data management and the future outcomes of its funding. We expect it to be a useful input for funding bodies.
Collapse
|
11
|
Naldi A, Hernandez C, Levy N, Stoll G, Monteiro PT, Chaouiya C, Helikar T, Zinovyev A, Calzone L, Cohen-Boulakia S, Thieffry D, Paulevé L. The CoLoMoTo Interactive Notebook: Accessible and Reproducible Computational Analyses for Qualitative Biological Networks. Front Physiol 2018; 9:680. [PMID: 29971009 PMCID: PMC6018415 DOI: 10.3389/fphys.2018.00680] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 05/15/2018] [Indexed: 01/07/2023] Open
Abstract
Analysing models of biological networks typically relies on workflows in which different software tools with sensitive parameters are chained together, many times with additional manual steps. The accessibility and reproducibility of such workflows is challenging, as publications often overlook analysis details, and because some of these tools may be difficult to install, and/or have a steep learning curve. The CoLoMoTo Interactive Notebook provides a unified environment to edit, execute, share, and reproduce analyses of qualitative models of biological networks. This framework combines the power of different technologies to ensure repeatability and to reduce users' learning curve of these technologies. The framework is distributed as a Docker image with the tools ready to be run without any installation step besides Docker, and is available on Linux, macOS, and Microsoft Windows. The embedded computational workflows are edited with a Jupyter web interface, enabling the inclusion of textual annotations, along with the explicit code to execute, as well as the visualization of the results. The resulting notebook files can then be shared and re-executed in the same environment. To date, the CoLoMoTo Interactive Notebook provides access to the software tools GINsim, BioLQM, Pint, MaBoSS, and Cell Collective, for the modeling and analysis of Boolean and multi-valued networks. More tools will be included in the future. We developed a Python interface for each of these tools to offer a seamless integration in the Jupyter web interface and ease the chaining of complementary analyses.
Collapse
Affiliation(s)
- Aurélien Naldi
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Céline Hernandez
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Nicolas Levy
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
- École Normale Supérieure de Lyon, Lyon, France
| | - Gautier Stoll
- Université Paris Descartes/Paris V, Sorbonne Paris Cité, Paris, France
- Équipe 11 Labellisée Ligue Nationale Contre le Cancer, Centre de Recherche des Cordeliers, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U1138, Paris, France
- Université Pierre et Marie Curie, Paris, France
- Metabolomics and Cell Biology Platforms, Gustave Roussy Cancer, Villejuif, France
| | - Pedro T. Monteiro
- INESC-ID/Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | | | - Tomáš Helikar
- Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
- Lobachevsky University, Nizhni Novgorod, Russia
| | - Laurence Calzone
- Institut Curie, PSL Research University, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Sarah Cohen-Boulakia
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
| | - Denis Thieffry
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Loïc Paulevé
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
| |
Collapse
|