1
|
Royaux C, Mihoub JB, Jossé M, Pelletier D, Norvez O, Reecht Y, Fouilloux A, Rasche H, Hiltemann S, Batut B, Marc E, Seguineau P, Massé G, Amossé A, Bissery C, Lorrilliere R, Martin A, Bas Y, Virgoulay T, Chambon V, Arnaud E, Michon E, Urfer C, Trigodet E, Delannoy M, Loïs G, Julliard R, Grüning B, Le Bras Y, The Galaxy-E community. Guidance framework to apply best practices in ecological data analysis: lessons learned from building Galaxy-Ecology. Gigascience 2025; 14:giae122. [PMID: 39937595 PMCID: PMC11816794 DOI: 10.1093/gigascience/giae122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 12/09/2024] [Accepted: 12/19/2024] [Indexed: 02/14/2025] Open
Abstract
Numerous conceptual frameworks exist for best practices in research data and analysis (e.g., Open Science and FAIR principles). In practice, there is a need for further progress to improve transparency, reproducibility, and confidence in ecology. Here, we propose a practical and operational framework for researchers and experts in ecology to achieve best practices for building analytical procedures from individual research projects to production-level analytical pipelines. We introduce the concept of atomization to identify analytical steps that support generalization by allowing us to go beyond single analyses. The term atomization is employed to convey the idea of single analytical steps as "atoms" composing an analytical procedure. When generalized, "atoms" can be used in more than a single case analysis. These guidelines were established during the development of the Galaxy-Ecology initiative, a web platform dedicated to data analysis in ecology. Galaxy-Ecology allows us to demonstrate a way to reach higher levels of reproducibility in ecological sciences by increasing the accessibility and reusability of analytical workflows once atomized and generalized.
Collapse
Affiliation(s)
- Coline Royaux
- UMR8067 Biologie des Organismes et Ecosystèmes Aquatiques (BOREA, MNHN-CNRS-SU-IRD-UCN-UA), Sorbonne Université, Station Marine de Concarneau, 29900 Concarneau, France
- Pôle national de données de biodiversité, UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
| | - Jean-Baptiste Mihoub
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Marie Jossé
- Data Terra, Centre National de la Recherche Scientifique, 29200 Brest, France
| | | | - Olivier Norvez
- Pôle National de Données de Biodiversité, UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Fondation pour la Recherche sur la Biodiversité, Muséum national d'Histoire naturelle, 75005 Paris, France
| | - Yves Reecht
- Institute of Marine Research, 5817 Bergen, Norway
- Institut français de recherche pour l'exploitation de la mer (Ifremer), 29200 Brest, France
| | | | - Helena Rasche
- Department of Pathology and Clinical Bioinformatics, Erasmus Medical Center, 3000 CA Rotterdam, The Netherlands
| | - Saskia Hiltemann
- Institute of Pharmaceutical Sciences, Faculty of Chemistry and Pharmacy, University of Freiburg, 79104 Freiburg, Germany
| | - Bérénice Batut
- Institut Français de Bioinformatique, CNRS UAR3601, 91042 Évry, France
- Mésocentre, Clermont-Auvergne, Université Clermont Auvergne, 63000 Clermont-Ferrand, France
| | - Eléaume Marc
- Institut de Systématique Evolution, Biodiversité (UMR7205 ISYEB, MNHN-CNRS-SU-EPHE), Département Origines et Évolution, Muséum national d'Histoire naturelle, 75005 Paris, France
- Institut de Systématique Evolution, Biodiversité (UMR7205 ISYEB, MNHN-CNRS-SU-EPHE), Département Origines et Évolution, Station Marine de Concarneau, 29900 Concarneau, France
| | - Pauline Seguineau
- Institut de Systématique Evolution, Biodiversité (UMR7205 ISYEB, MNHN-CNRS-SU-EPHE), Département Origines et Évolution, Muséum national d'Histoire naturelle, 75005 Paris, France
- Institut de Systématique Evolution, Biodiversité (UMR7205 ISYEB, MNHN-CNRS-SU-EPHE), Département Origines et Évolution, Station Marine de Concarneau, 29900 Concarneau, France
| | - Guillaume Massé
- UMR LOCEAN (CNRS-SU-IRD-MNHN), Centre National de la Recherche Scientifique, Station Marine de Concarneau, 29900 Concarneau, France
| | - Alan Amossé
- Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
| | - Claire Bissery
- Institut français de recherche pour l'exploitation de la mer (Ifremer), 29200 Brest, France
- Université Claude Bernard Lyon 1, 69000 Lyon, France
| | - Romain Lorrilliere
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Alexis Martin
- UMR8067 Biologie des Organismes et Ecosystèmes Aquatiques (BOREA, MNHN-CNRS-SU-IRD-UCN-UA), Muséum national d'Histoire naturelle, 75005 Paris, France
| | - Yves Bas
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
- UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Muséum national d'Histoire naturelle, 75005 Paris, France
| | - Thimothée Virgoulay
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, 29900 Concarneau, France
- Université de Montpellier, 34000 Montpellier, France
| | - Valentin Chambon
- Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
| | - Elie Arnaud
- Pôle national de données de biodiversité, UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
| | - Elisa Michon
- Institut des Sciences de la Mer de Rimouski, Université du Québec à Rimouski, Rimouski G5L 2Z9, Québec, Canada
| | - Clara Urfer
- Pôle national de données de biodiversité, UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
- Université de Bretagne Occidentale, 29200 Brest, France
| | - Eloïse Trigodet
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, 29900 Concarneau, France
- Université de Bretagne Occidentale, 29200 Brest, France
| | - Marie Delannoy
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Gregoire Loïs
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Romain Julliard
- Centre d’Écologie et des Sciences de la Conservation (UMR7204 CESCO, MNHN-CNRS-SU), Muséum National d'Histoire Naturelle, Sorbonne Université, Centre National de la Recherche Scientifique, 75005 Paris, France
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, 79110 Freiburg, Germany
| | - Yvan Le Bras
- Pôle national de données de biodiversité, UAR2006 PatriNat (OFB-MNHN-CNRS-IRD), Muséum National d'Histoire Naturelle, Station Marine de Concarneau, 29900 Concarneau, France
| | | |
Collapse
|
2
|
van Kampen AHC, Mahamune U, Jongejan A, van Schaik BDC, Balashova D, Lashgari D, Pras-Raves M, Wever EJM, Dane AD, García-Valiente R, Moerland PD. ENCORE: a practical implementation to improve reproducibility and transparency of computational research. Nat Commun 2024; 15:8117. [PMID: 39284801 PMCID: PMC11405857 DOI: 10.1038/s41467-024-52446-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2024] [Accepted: 09/06/2024] [Indexed: 09/20/2024] Open
Abstract
Reproducibility of computational research is often challenging despite established guidelines and best practices. Translating these guidelines into practical applications remains difficult. Here, we present ENCORE, an approach to enhance transparency and reproducibility by guiding researchers in how to structure and document a computational project. ENCORE builds on previous efforts in computational reproducibility and integrates all project components into a standardized file system structure. It utilizes pre-defined files as documentation templates, leverages GitHub for software versioning, and includes an HTML-based navigator. ENCORE is designed to be agnostic to the type of computational project, data, programming language, and ICT infrastructure, and does not rely on specific software tools. We also share our group's experience using ENCORE, highlighting that the most significant challenge to the routine adoption of approaches like ours is the lack of incentives to motivate researchers to dedicate sufficient time and effort to ensure reproducibility.
Collapse
Affiliation(s)
- Antoine H C van Kampen
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands.
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands.
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands.
- Biosystems Data Analysis, Swammerdam Institute for Life Sciences, University of Amsterdam, Science Park 904, Amsterdam, Netherlands.
| | - Utkarsh Mahamune
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| | - Aldo Jongejan
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
| | - Barbera D C van Schaik
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| | - Daria Balashova
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| | - Danial Lashgari
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| | - Mia Pras-Raves
- Amsterdam UMC, University of Amsterdam, Department of Clinical Chemistry, Laboratory Genetic Metabolic Diseases, Meibergdreef 9, Amsterdam, Netherlands
- Core Facility Metabolomics, Amsterdam UMC, Amsterdam, Netherlands
| | - Eric J M Wever
- Amsterdam UMC, University of Amsterdam, Department of Clinical Chemistry, Laboratory Genetic Metabolic Diseases, Meibergdreef 9, Amsterdam, Netherlands
- Core Facility Metabolomics, Amsterdam UMC, Amsterdam, Netherlands
| | - Adrie D Dane
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Core Facility Metabolomics, Amsterdam UMC, Amsterdam, Netherlands
| | - Rodrigo García-Valiente
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| | - Perry D Moerland
- Amsterdam UMC, University of Amsterdam, Bioinformatics Laboratory, Epidemiology and Data Science, Meibergdreef 9, Amsterdam, Netherlands
- Netherlands. Amsterdam Public Health, Methodology, Amsterdam, Netherlands
- Amsterdam Institute for Immunology and Infectious Diseases, Amsterdam, Netherlands
| |
Collapse
|
3
|
Buchan DWA, Moffat L, Lau A, Kandathil S, Jones D. Deep learning for the PSIPRED Protein Analysis Workbench. Nucleic Acids Res 2024; 52:W287-W293. [PMID: 38747351 PMCID: PMC11223827 DOI: 10.1093/nar/gkae328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2024] [Revised: 04/08/2024] [Accepted: 04/24/2024] [Indexed: 07/06/2024] Open
Abstract
The PSIRED Workbench is a long established and popular bioinformatics web service offering a wide range of machine learning based analyses for characterizing protein structure and function. In this paper we provide an update of the recent additions and developments to the webserver, with a focus on new Deep Learning based methods. We briefly discuss some trends in server usage since the publication of AlphaFold2 and we give an overview of some upcoming developments for the service. The PSIPRED Workbench is available at http://bioinf.cs.ucl.ac.uk/psipred.
Collapse
Affiliation(s)
- Daniel W A Buchan
- UCL Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, UK
| | - Lewis Moffat
- UCL Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, UK
| | - Andy Lau
- UCL Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, UK
| | - Shaun M Kandathil
- UCL Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, UK
| | - David T Jones
- UCL Bioinformatics Group, Department of Computer Science, University College London, London, WC1E 6BT, UK
| |
Collapse
|
4
|
Davies A, Mueller J, Hassey A, Moulton G. Development of a core competency framework for clinical informatics. BMJ Health Care Inform 2021; 28:e100356. [PMID: 34266851 PMCID: PMC8286765 DOI: 10.1136/bmjhci-2021-100356] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2021] [Accepted: 06/07/2021] [Indexed: 11/16/2022] Open
Abstract
OBJECTIVES Until this point there was no national core competency framework for clinical informatics in the UK. We report on the final two iterations of work carried out in the formation of a national core competency framework. This follows an initial systematic literature review of existing skills and competencies and a job listing analysis.MethodsAn iterative approach was applied to framework development. Using a mixed-methods design we carried out semi-structured interviews with participants involved in informatics (n=15). The framework was updated based on the interview findings and was subsequently distributed as part of a bespoke online digital survey for wider participation (n=87). The final version of the framework is based on the findings of the survey. RESULTS Over 102 people reviewed the framework as part of the interview or survey process. This led to a final core competency framework containing 6 primary domains with 36 subdomains containing 111 individual competencies. CONCLUSIONS An iterative mixed-methods approach for competency development involving the target community was appropriate for development of the competency framework. There is some contention around the depth of technical competencies required. Care is also needed to avoid professional burnout, as clinicians and healthcare practitioners already have clinical competencies to maintain. Therefore, how the framework is applied in practice and how practitioners meet the competencies requires careful consideration.
Collapse
Affiliation(s)
- Alan Davies
- School of Health Sciences, The University of Manchester, Manchester, Cheshire, UK
| | | | - Alan Hassey
- The Faculty of Clinical Informatics (FCI), London, UK
| | - Georgina Moulton
- School of Health Sciences, The University of Manchester, Manchester, Cheshire, UK
- Health Data Research UK, London, UK
| |
Collapse
|
5
|
Beard N, Bacall F, Nenadic A, Thurston M, Goble CA, Sansone SA, Attwood TK. TeSS: a platform for discovering life-science training opportunities. Bioinformatics 2020; 36:3290-3291. [PMID: 32044952 PMCID: PMC7214044 DOI: 10.1093/bioinformatics/btaa047] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 12/10/2019] [Accepted: 02/03/2020] [Indexed: 11/13/2022] Open
Abstract
SUMMARY Dispersed across the Internet is an abundance of disparate, disconnected training information, making it hard for researchers to find training opportunities that are relevant to them. To address this issue, we have developed a new platform-TeSS-which aggregates geographically distributed information and presents it in a central, feature-rich portal. Data are gathered automatically from content providers via bespoke scripts. These resources are cross-linked with related data and tools registries, and made available via a search interface, a data API and through widgets. AVAILABILITY AND IMPLEMENTATION https://tess.elixir-europe.org.
Collapse
Affiliation(s)
- Niall Beard
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Finn Bacall
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Aleksandra Nenadic
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Milo Thurston
- Department of Engineering Science, Oxford e-Research Centre, University of Oxford, Oxford OX1 3QG, UK
| | - Carole A Goble
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Susanna-Assunta Sansone
- Department of Engineering Science, Oxford e-Research Centre, University of Oxford, Oxford OX1 3QG, UK
| | - Teresa K Attwood
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| |
Collapse
|
6
|
Tractenberg RE, Lindvall JM, Attwood TK, Via A. The Mastery Rubric for Bioinformatics: A tool to support design and evaluation of career-spanning education and training. PLoS One 2019; 14:e0225256. [PMID: 31770418 PMCID: PMC6879125 DOI: 10.1371/journal.pone.0225256] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/24/2019] [Indexed: 11/18/2022] Open
Abstract
As the life sciences have become more data intensive, the pressure to incorporate the requisite training into life-science education and training programs has increased. To facilitate curriculum development, various sets of (bio)informatics competencies have been articulated; however, these have proved difficult to implement in practice. Addressing this issue, we have created a curriculum-design and -evaluation tool to support the development of specific Knowledge, Skills and Abilities (KSAs) that reflect the scientific method and promote both bioinformatics practice and the achievement of competencies. Twelve KSAs were extracted via formal analysis, and stages along a developmental trajectory, from uninitiated student to independent practitioner, were identified. Demonstration of each KSA by a performer at each stage was initially described (Performance Level Descriptors, PLDs), evaluated, and revised at an international workshop. This work was subsequently extended and further refined to yield the Mastery Rubric for Bioinformatics (MR-Bi). The MR-Bi was validated by demonstrating alignment between the KSAs and competencies, and its consistency with principles of adult learning. The MR-Bi tool provides a formal framework to support curriculum building, training, and self-directed learning. It prioritizes the development of independence and scientific reasoning, and is structured to allow individuals (regardless of career stage, disciplinary background, or skill level) to locate themselves within the framework. The KSAs and their PLDs promote scientific problem formulation and problem solving, lending the MR-Bi durability and flexibility. With its explicit developmental trajectory, the tool can be used by developing or practicing scientists to direct their (and their team's) acquisition of new, or to deepen existing, bioinformatics KSAs. The MR-Bi is a tool that can contribute to the cultivation of a next generation of bioinformaticians who are able to design reproducible and rigorous research, and to critically analyze results from their own, and others', work.
Collapse
Affiliation(s)
- Rochelle E. Tractenberg
- Collaborative for Research on Outcomes and –Metrics, and Departments of Neurology, Biostatistics, Biomathematics and Bioinformatics, and Rehabilitation Medicine, Georgetown University, Washington, DC, United States of America
| | - Jessica M. Lindvall
- National Bioinformatics Infrastructure Sweden (NBIS)/ELIXIR-SE, Science for Life Laboratory (SciLifeLab), Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Teresa K. Attwood
- Department of Computer Science, The University of Manchester, Manchester, England, United Kingdom; The GOBLET Foundation, Radboud University, Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Allegra Via
- ELIXIR Italy, National Research Council of Italy, Institute of Molecular Biology and Pathology, Rome, Italy
| |
Collapse
|
7
|
Wibberg D, Batut B, Belmann P, Blom J, Glöckner FO, Grüning B, Hoffmann N, Kleinbölting N, Rahn R, Rey M, Scholz U, Sharan M, Tauch A, Trojahn U, Usadel B, Kohlbacher O. The de.NBI / ELIXIR-DE training platform - Bioinformatics training in Germany and across Europe within ELIXIR. F1000Res 2019; 8. [PMID: 33163154 PMCID: PMC7607484 DOI: 10.12688/f1000research.20244.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/04/2020] [Indexed: 12/25/2022] Open
Abstract
The German Network for Bioinformatics Infrastructure (de.NBI) is a national and academic infrastructure funded by the German Federal Ministry of Education and Research (BMBF). The de.NBI provides (i) service, (ii) training, and (iii) cloud computing to users in life sciences research and biomedicine in Germany and Europe and (iv) fosters the cooperation of the German bioinformatics community with international network structures. The de.NBI members also run the German node (ELIXIR-DE) within the European ELIXIR infrastructure. The de.NBI / ELIXIR-DE training platform, also known as special interest group 3 (SIG 3) ‘Training & Education’, coordinates the bioinformatics training of de.NBI and the German ELIXIR node. The network provides a high-quality, coherent, timely, and impactful training program across its eight service centers. Life scientists learn how to handle and analyze biological big data more effectively by applying tools, standards and compute services provided by de.NBI. Since 2015, more than 300 training courses were carried out with about 6,000 participants and these courses received recommendation rates of almost 90% (status as of July 2020). In addition to face-to-face training courses, online training was introduced on the de.NBI website in 2016 and guidelines for the preparation of e-learning material were established in 2018. In 2016, ELIXIR-DE joined the ELIXIR training platform. Here, the de.NBI / ELIXIR-DE training platform collaborates with ELIXIR in training activities, advertising training courses via TeSS and discussions on the exchange of data for training events essential for quality assessment on both the technical and administrative levels. The de.NBI training program trained thousands of scientists from Germany and beyond in many different areas of bioinformatics.
Collapse
Affiliation(s)
- Daniel Wibberg
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Bérénice Batut
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, 79110, Germany
| | - Peter Belmann
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Jochen Blom
- Bioinformatics and Systems Biology, Justus-Liebig-University Giessen, Giessen, 35392, Germany
| | - Frank Oliver Glöckner
- Alfred-Wegener-Institut - Helmholtz Zentrum für Polar- und Meeresforschung and Jacobs University Bremen, Campus Ring 1, Bremen, 28759, Germany
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Freiburg, 79110, Germany
| | - Nils Hoffmann
- Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, 44227, Germany
| | - Nils Kleinbölting
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - René Rahn
- Algorithmic Bioinformatics, Department of Mathematics and Computer Science, Freie Universität Berlin, Takustraße 9, Berlin, 14195, Germany
| | - Maja Rey
- Scientific Databases and Visualization Group, Heidelberg Institute for Theoretical Studies (HITS) gGmbH, Schloss-Wolfsbrunnenweg 35, Heidelberg, 69118, Germany
| | - Uwe Scholz
- Leibniz Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, Seeland, 06466, Germany
| | - Malvika Sharan
- The Heidelberg Center for Human Bioinformatics (HD-HuB), European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117, Germany
| | - Andreas Tauch
- Center for Biotechnology (CeBiTec), Bielefeld University, Bielefeld, 33501, Germany
| | - Ulrike Trojahn
- The Heidelberg Center for Human Bioinformatics (HD-HuB), European Molecular Biology Laboratory, Meyerhofstrasse 1, Heidelberg, 69117, Germany
| | - Björn Usadel
- IBG-2 Plant Sciences, Forschungszentrum Jülich, Jülich, 52428, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Department of Computer Science, University of Tübingen, Tübingen, 72076, Germany.,Institute for Bioinformatics and Medical Informatics, University of Tübingen, Tübingen, 72076, Germany.,Translational Bioinformatics, University Hospital Tubingen, Tübingen, 72076, Germany.,Biomolecular Interactions, Max Planck Institute for Development Biology, Tübingen, 72076, Germany
| |
Collapse
|
8
|
Ison J, Ménager H, Brancotte B, Jaaniso E, Salumets A, Raček T, Lamprecht AL, Palmblad M, Kalaš M, Chmura P, Hancock JM, Schwämmle V, Ienasescu HI. Community curation of bioinformatics software and data resources. Brief Bioinform 2019; 21:1697-1705. [PMID: 31624831 PMCID: PMC7947956 DOI: 10.1093/bib/bbz075] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2019] [Revised: 05/13/2019] [Accepted: 05/30/2019] [Indexed: 11/13/2022] Open
Abstract
The corpus of bioinformatics resources is huge and expanding rapidly, presenting life scientists with a growing challenge in selecting tools that fit the desired purpose. To address this, the European Infrastructure for Biological Information is supporting a systematic approach towards a comprehensive registry of tools and databases for all domains of bioinformatics, provided under a single portal (https://bio.tools). We describe here the practical means by which scientific communities, including individual developers and projects, through major service providers and research infrastructures, can describe their own bioinformatics resources and share these via bio.tools.
Collapse
Affiliation(s)
- Jon Ison
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800 Kongens Lyngby, Denmark
| | - Hervé Ménager
- Hub de Bioinformatique et Biostatistique - Département Biologie Computationnelle, Institut Pasteur, USR 3756 CNRS, Paris, France
| | - Bryan Brancotte
- Hub de Bioinformatique et Biostatistique - Département Biologie Computationnelle, Institut Pasteur, USR 3756 CNRS, Paris, France
| | - Erik Jaaniso
- ELIXIR-EE, Institute of Computer Science, University of Tartu. J Liivi 2, Tartu, Estonia
| | - Ahto Salumets
- ELIXIR-EE, Institute of Computer Science, University of Tartu. J Liivi 2, Tartu, Estonia
| | - Tomáš Raček
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00 Brno-Bohunice, Czech Republic.,Faculty of Informatics, Masaryk University, Botanická 68a, 602 00 Brno, Czech Republic
| | - Anna-Lena Lamprecht
- Department of Information and Computing Sciences, Utrecht University, Utrecht, Netherlands
| | - Magnus Palmblad
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden, Netherlands
| | - Matúš Kalaš
- Computational Biology Unit, Department of Informatics, University of Bergen, N-5020 Bergen, Norway
| | - Piotr Chmura
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen
| | - John M Hancock
- ELIXIR-Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, United Kingdom
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, 5230 Odense, Denmark
| | - Hans-Ioan Ienasescu
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800 Kongens Lyngby, Denmark
| |
Collapse
|
9
|
Batut B, Hiltemann S, Bagnacani A, Baker D, Bhardwaj V, Blank C, Bretaudeau A, Brillet-Guéguen L, Čech M, Chilton J, Clements D, Doppelt-Azeroual O, Erxleben A, Freeberg MA, Gladman S, Hoogstrate Y, Hotz HR, Houwaart T, Jagtap P, Larivière D, Le Corguillé G, Manke T, Mareuil F, Ramírez F, Ryan D, Sigloch FC, Soranzo N, Wolff J, Videm P, Wolfien M, Wubuli A, Yusuf D, Taylor J, Backofen R, Nekrutenko A, Grüning B. Community-Driven Data Analysis Training for Biology. Cell Syst 2019; 6:752-758.e1. [PMID: 29953864 DOI: 10.1016/j.cels.2018.05.012] [Citation(s) in RCA: 117] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Revised: 03/10/2018] [Accepted: 05/18/2018] [Indexed: 01/12/2023]
Abstract
The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a community-driven framework that enables modern, interactive teaching of data analytics in life sciences and facilitates the development of training materials. The key feature of our system is that it is not a static but a continuously improved collection of tutorials. By coupling tutorials with a web-based analysis framework, biomedical researchers can learn by performing computation themselves through a web browser without the need to install software or search for example datasets. Our ultimate goal is to expand the breadth of training materials to include fundamental statistical and data science topics and to precipitate a complete re-engineering of undergraduate and graduate curricula in life sciences. This project is accessible at https://training.galaxyproject.org.
Collapse
Affiliation(s)
- Bérénice Batut
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Saskia Hiltemann
- Erasmus Medical Centre, Wytemaweg 80, Rotterdam 3015 CN, the Netherlands
| | - Andrea Bagnacani
- Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstraße 69, Rostock 18051, Germany
| | - Dannon Baker
- Johns Hopkins University, 3400 N Charles Street, Mudd Hall 144, Baltimore 21218, MD, USA
| | - Vivek Bhardwaj
- Department of Biology, Albert-Ludwigs-University, Schänzlestraße 1, Freiburg 79104, Germany
| | - Clemens Blank
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Anthony Bretaudeau
- INRA, UMR IGEPP, BIPAA/GenOuest, INRIA/Irisa - Campus de Beaulieu, 35042 RENNES Cedex, France
| | | | - Martin Čech
- The Pennsylvania State University, 505 Wartik Lab, University Park, PA 16802, USA
| | - John Chilton
- The Pennsylvania State University, 505 Wartik Lab, University Park, PA 16802, USA
| | - Dave Clements
- Johns Hopkins University, 3400 N Charles Street, Mudd Hall 144, Baltimore 21218, MD, USA
| | - Olivia Doppelt-Azeroual
- Bioinformatics and Biostatistics HUB, Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI, USR 3756 Institut Pasteur et CNRS), Institut Pasteur, 25-28 Rue du Docteur Roux, 75015 Paris, France
| | - Anika Erxleben
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | | | - Simon Gladman
- Melbourne Bioinformatics, The University of Melbourne, Melbourne, VIC 3010, Australia
| | - Youri Hoogstrate
- Erasmus Medical Centre, Wytemaweg 80, Rotterdam 3015 CN, the Netherlands
| | - Hans-Rudolf Hotz
- Friedrich Miescher Institute for Biomedical Research, Maulbeerstrasse 66, Basel 4058, Switzerland
| | - Torsten Houwaart
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Pratik Jagtap
- Biochemistry, Molecular Biology and Biophysics, University of Minnesota Medical School, 420 Delaware Street SE, Minneapolis, MN 55455, USA
| | - Delphine Larivière
- The Pennsylvania State University, 505 Wartik Lab, University Park, PA 16802, USA
| | - Gildas Le Corguillé
- PMC, CNRS, FR2424, ABiMS, Station Biologique, Place Georges Teissier, Roscoff 29680, France
| | - Thomas Manke
- Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, Freiburg 79108, Germany
| | - Fabien Mareuil
- Bioinformatics and Biostatistics HUB, Centre de Bioinformatique, Biostatistique et Biologie Intégrative (C3BI, USR 3756 Institut Pasteur et CNRS), Institut Pasteur, 25-28 Rue du Docteur Roux, 75015 Paris, France
| | - Fidel Ramírez
- Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, Freiburg 79108, Germany
| | - Devon Ryan
- Max Planck Institute of Immunobiology and Epigenetics, Stübeweg 51, Freiburg 79108, Germany
| | - Florian Christoph Sigloch
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Nicola Soranzo
- Earlham Institute, Norwich Research Park, Norwich NR4 7UZ, UK
| | - Joachim Wolff
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Pavankumar Videm
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | - Markus Wolfien
- Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstraße 69, Rostock 18051, Germany
| | - Aisanjiang Wubuli
- Leibniz Institute for Farm Animal Biology (FBN), Wilhelm-Stahl-Allee 2, Dummerstorf 18196, Germany
| | - Dilmurat Yusuf
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany
| | | | - James Taylor
- Johns Hopkins University, 3400 N Charles Street, Mudd Hall 144, Baltimore 21218, MD, USA
| | - Rolf Backofen
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany.
| | - Anton Nekrutenko
- The Pennsylvania State University, 505 Wartik Lab, University Park, PA 16802, USA.
| | - Björn Grüning
- Bioinformatics Group, Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, Freiburg 79110, Germany.
| |
Collapse
|
10
|
Ison J, Ienasescu H, Chmura P, Rydza E, Ménager H, Kalaš M, Schwämmle V, Grüning B, Beard N, Lopez R, Duvaud S, Stockinger H, Persson B, Vařeková RS, Raček T, Vondrášek J, Peterson H, Salumets A, Jonassen I, Hooft R, Nyrönen T, Valencia A, Capella S, Gelpí J, Zambelli F, Savakis B, Leskošek B, Rapacki K, Blanchet C, Jimenez R, Oliveira A, Vriend G, Collin O, van Helden J, Løngreen P, Brunak S. The bio.tools registry of software tools and data resources for the life sciences. Genome Biol 2019; 20:164. [PMID: 31405382 PMCID: PMC6691543 DOI: 10.1186/s13059-019-1772-6] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2019] [Accepted: 07/22/2019] [Indexed: 11/28/2022] Open
Abstract
Bioinformaticians and biologists rely increasingly upon workflows for the flexible utilization of the many life science tools that are needed to optimally convert data into knowledge. We outline a pan-European enterprise to provide a catalogue ( https://bio.tools ) of tools and databases that can be used in these workflows. bio.tools not only lists where to find resources, but also provides a wide variety of practical information.
Collapse
Affiliation(s)
- Jon Ison
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800, Kongens Lyngby, Denmark.
| | - Hans Ienasescu
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800, Kongens Lyngby, Denmark
| | - Piotr Chmura
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200, Copenhagen, Denmark
| | - Emil Rydza
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200, Copenhagen, Denmark
| | - Hervé Ménager
- Hub de Bioinformatique et de Biostatistiques, Institut Pasteur, C3BI USR, 3756 IP CNRS, Paris, France
| | - Matúš Kalaš
- Computational Biology Unit, Department of Informatics, University of Bergen, N-5020, Bergen, Norway
| | - Veit Schwämmle
- Department of Biochemistry and Molecular Biology and VILLUM Center for Bioanalytical Sciences, University of Southern Denmark, Campusvej 55, 5230, Odense, Denmark
| | - Björn Grüning
- Department of Computer Science, Albert-Ludwigs-University Freiburg, Georges-Köhler-Allee 106, 79110, Freiburg, Germany
| | - Niall Beard
- School of Computer Science, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Rodrigo Lopez
- The EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Severine Duvaud
- SIB Swiss Institute of Bioinformatics, Quartier Sorge - Batiment Amphipole, CH-1015, Lausanne, Switzerland
| | - Heinz Stockinger
- SIB Swiss Institute of Bioinformatics, Quartier Sorge - Batiment Amphipole, CH-1015, Lausanne, Switzerland
| | - Bengt Persson
- Bioinformatics Infrastructure for Life Sciences, Science for Life Laboratory, Dept of Cell and Molecular Biology, Uppsala University, S-75124, Uppsala, Sweden
| | - Radka Svobodová Vařeková
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00, Brno-Bohunice, Czech Republic
| | - Tomáš Raček
- CEITEC - Central European Institute of Technology, Masaryk University Brno, Kamenice 5, 625 00, Brno-Bohunice, Czech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry, Czech Academy of Sciences, Flemingovo namesti 2, 160 00, Prague, Czech Republic
| | - Hedi Peterson
- ELIXIR-EE, Institute of Computer Science, University of Tartu. J Liivi 2, Tartu, Estonia
| | - Ahto Salumets
- ELIXIR-EE, Institute of Computer Science, University of Tartu. J Liivi 2, Tartu, Estonia
| | - Inge Jonassen
- Computational Biology Unit, Department of Informatics, University of Bergen, N-5020, Bergen, Norway
| | - Rob Hooft
- Dutch Techcentre for Life Sciences, Jaarbeursplein 6, 3521, AL, Utrecht, The Netherlands
| | - Tommi Nyrönen
- CSC - IT Center for Science, PO BOX 405, FI-02101, Espoo, Finland
| | - Alfonso Valencia
- Barcelona Supercomputing Centre (BSC), 08034, Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Pg. Lluıs Companys 23, 08010, Barcelona, Spain
| | | | - Josep Gelpí
- Barcelona Supercomputing Centre (BSC), 08034, Barcelona, Spain
- Department of Biochemistry and Molecular Biomedicine, University of Barcelona, INB / BSC-CNS, Barcelona, Spain
| | - Federico Zambelli
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies, National Research Council (CNR), via Amendola 165/A, Bari, Italy
- Department of Biosciences, University of Milano, Via Celoria 26, Milan, Italy
| | - Babis Savakis
- Biomedical Sciences Research Centre, Alexander Fleming 34 Al. Fleming Str, 16672, Vari, Greece
| | - Brane Leskošek
- Faculty of Medicine / ELIXIR-SI, University of Ljubljana, Vrazov trg 2, SI-1000, Ljubljana, Slovenia
| | - Kristoffer Rapacki
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800, Kongens Lyngby, Denmark
| | - Christophe Blanchet
- CNRS, UMS 3601, Institut Français de Bioinformatique, IFB-core, 2 rue Gaston Crémieux, F-91000, Evry, France
| | - Rafael Jimenez
- ELIXIR-Hub, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK
| | - Arlindo Oliveira
- INESC-ID / Instituto Superior Técnico, R. Alves Redol 9, Lisbon, Portugal
| | - Gert Vriend
- Radboud University Medical Centre, CMBI, Postbus 9101, 6500 HB, Nijmegen, Netherlands
| | - Olivier Collin
- Plateforme GenOuest Univ Rennes, Inria, CNRS, IRISA, F-35000, Rennes, France
| | - Jacques van Helden
- Aix-Marseille Univ, INSERM, lab. Theory and Approaches of Genome Complexity (TAGC), Marseille, France
| | - Peter Løngreen
- National Life Science Supercomputing Center, Technical University of Denmark, Building 208, DK-2800, Kongens Lyngby, Denmark
| | - Søren Brunak
- Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, DK-2200, Copenhagen, Denmark
- Department of Bio and Health Informatics, Technical University of Denmark, Building 208, DK-2800, Kongens Lyngby, Denmark
| |
Collapse
|