1
|
van Geest G, Haefliger Y, Zahn-Zabal M, Palagi PM. Using Glittr.org to find, compare and re-use online materials for training and education. PLoS One 2024; 19:e0308729. [PMID: 39637085 PMCID: PMC11620569 DOI: 10.1371/journal.pone.0308729] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Accepted: 11/12/2024] [Indexed: 12/07/2024] Open
Abstract
A wealth of excellent training and educational materials for the computational life sciences are scattered around the Internet, but they can be hard to find. Many materials reside in public Git repositories that are hosted on platforms such as GitHub and GitLab. Glittr.org is a manually curated database of Git repositories, which enables users to find educational materials that would otherwise be hard to identify. With the application, users can search and compare educational materials based on topic and author, but also on engagement metrics such as stargazers (bookmarks) and recency (days since last commit). Glittr.org currently contains 664 entries, which are assigned to six different categories within the domain of computational life sciences. By analysing the database, we reveal insights in the availability of materials per topic, collaboration patterns of developers, and licensing practices. This knowledge helps to understand in which areas open educational materials are scant, the importance of Git for collaboration on educational materials and how licensing can be improved to enhance sharing and reuse. Taken together, we show that Glittr.org contains a wealth of connected and openly available metadata. Therefore, it enhances adherence to the FAIR (Findable, Accessible, Interoperable, Reusable) principles, which benefits learners, teachers and trainers in the entire life sciences community and beyond.
Collapse
Affiliation(s)
- Geert van Geest
- Swiss Institute of Bioinformatics, Quartier Sorge—Bâtiment Amphipôle, Lausanne, Switzerland
- Interfaculty Bioinformatics Unit, University of Bern, Bern, Switzerland
| | - Yann Haefliger
- Swiss Institute of Bioinformatics, Quartier Sorge—Bâtiment Amphipôle, Lausanne, Switzerland
| | - Monique Zahn-Zabal
- Swiss Institute of Bioinformatics, Quartier Sorge—Bâtiment Amphipôle, Lausanne, Switzerland
| | - Patricia M. Palagi
- Swiss Institute of Bioinformatics, Quartier Sorge—Bâtiment Amphipôle, Lausanne, Switzerland
| |
Collapse
|
2
|
Pallocca M, Betti M, Baldinelli S, Palombo R, Bucci G, Mazzarella L, Tonon G, Ciliberto G. Clinical bioinformatics desiderata for molecular tumor boards. Brief Bioinform 2024; 25:bbae447. [PMID: 39297878 PMCID: PMC11411775 DOI: 10.1093/bib/bbae447] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 06/28/2024] [Accepted: 08/30/2024] [Indexed: 09/26/2024] Open
Abstract
Clinical Bioinformatics is a knowledge framework required to interpret data of medical interest via computational methods. This area became of dramatic importance in precision oncology, fueled by cancer genomic profiling: most definitions of Molecular Tumor Boards require the presence of bioinformaticians. However, all available literature remained rather vague on what are the specific needs in terms of digital tools and expertise to tackle and interpret genomics data to assign novel targeted or biomarker-driven targeted therapies to cancer patients. To fill this gap, in this article, we present a catalog of software families and human skills required for the tumor board bioinformatician, with specific examples of real-world applications associated with each element presented.
Collapse
Affiliation(s)
- Matteo Pallocca
- Institute of Experimental Endocrinology and Oncology, National Research Council, Via Sergio Pansini, 5, 80131 Naples, Italy
| | - Martina Betti
- Biostatistics, Bioinformatics and Clinical Trial Center, IRCCS Regina Elena National Cancer Institute, Via Elio Chianesi, 53, 00144 Rome, Italy
| | - Sara Baldinelli
- Biostatistics, Bioinformatics and Clinical Trial Center, IRCCS Regina Elena National Cancer Institute, Via Elio Chianesi, 53, 00144 Rome, Italy
| | - Ramona Palombo
- Institute of Experimental Endocrinology and Oncology, National Research Council, Via Sergio Pansini, 5, 80131 Naples, Italy
| | - Gabriele Bucci
- Center for OMICS Sciences, IRCCS San Raffaele Scientific Institute, Via Olgettina, 60, 20132 Milan, Italy
| | - Luca Mazzarella
- Division of Gastrointestinal Medical Oncology and Neuroendocrine Tumors, IRCCS IEO - European Institute of Oncology, Via Adamello 16, 20139 Milan, Italy
- Department of Experimental Oncology, IRCCS IEO - European Institute of Oncology, Via Adamello 16, 20139 Milan, Italy
| | - Giovanni Tonon
- Functional Genomics of Cancer Unit, Division of Experimental Oncology, and Center for Omics Sciences, IRCCS San Raffaele Scientific Institute, Via Olgettina, 60, 20132 Milan, Italy
| | - Gennaro Ciliberto
- Scientific Direction, IRCCS Regina Elena National Cancer Institute, Via Elio Chianesi, 53, 00144 Rome, Italy
| |
Collapse
|
3
|
Castro LJ, Palagi PM, Beard N, Attwood TK, Brazas MD. Bioschemas training profiles: A set of specifications for standardizing training information to facilitate the discovery of training programs and resources. PLoS Comput Biol 2023; 19:e1011120. [PMID: 37319143 DOI: 10.1371/journal.pcbi.1011120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/17/2023] Open
Abstract
Stand-alone life science training events and e-learning solutions are among the most sought-after modes of training because they address both point-of-need learning and the limited timeframes available for "upskilling." Yet, finding relevant life sciences training courses and materials is challenging because such resources are not marked up for internet searches in a consistent way. This absence of markup standards to facilitate discovery, re-use, and aggregation of training resources limits their usefulness and knowledge translation potential. Through a joint effort between the Global Organisation for Bioinformatics Learning, Education and Training (GOBLET), the Bioschemas Training community, and the ELIXIR FAIR Training Focus Group, a set of Bioschemas Training profiles has been developed, published, and implemented for life sciences training courses and materials. Here, we describe our development approach and methods, which were based on the Bioschemas model, and present the results for the 3 Bioschemas Training profiles: TrainingMaterial, Course, and CourseInstance. Several implementation challenges were encountered, which we discuss alongside potential solutions. Over time, continued implementation of these Bioschemas Training profiles by training providers will obviate the barriers to skill development, facilitating both the discovery of relevant training events to meet individuals' learning needs, and the discovery and re-use of training and instructional materials.
Collapse
Affiliation(s)
| | | | - Niall Beard
- Department of Computer Science, The University of Manchester, Manchester, United Kingdom
| | - Teresa K Attwood
- Department of Computer Science, The University of Manchester, Manchester, United Kingdom
| | | |
Collapse
|
4
|
Thurlow KE, Lovering RC, De Miranda Pinheiro S. Student biocuration projects as a learning environment. F1000Res 2021; 10:1023. [PMID: 35211294 PMCID: PMC8831850 DOI: 10.12688/f1000research.72808.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 09/22/2021] [Indexed: 08/23/2024] Open
Abstract
Background: Bioinformatics is becoming an essential tool for the majority of biological and biomedical researchers. Although bioinformatics data is exploited by academic and industrial researchers, limited focus is on teaching this area to undergraduates, postgraduates and senior scientists. Many scientists are developing their own expertise without formal training and often without appreciating the source of the data they are reliant upon. Some universities do provide courses on a variety of bioinformatics resources and tools, a few also provide biocuration projects, during which students submit data to annotation resources. Methods: To assess the usefulness and enjoyability of annotation projects a survey was sent to University College London (UCL) students who have undertaken Gene Ontology biocuration projects. Results: Analysis of survey responses suggest that these projects provide students with an opportunity not only to learn about bioinformatics resources but also to improve their literature analysis, presentation and writing skills. Conclusion: Biocuration student projects provide valuable annotations as well as enabling students to develop a variety of skills relevant to their future careers. It is also hoped that, as future scientists, these students will critically assess their own manuscripts and ensure that these are written with the biocurators of the future in mind.
Collapse
Affiliation(s)
- Katherine E. Thurlow
- Functional Gene Annotation, Preclinical and Fundamental Science, Institute of Cardiovascular Science, University College London (UCL), London, WC1E 6JF, UK
| | - Ruth C. Lovering
- Functional Gene Annotation, Preclinical and Fundamental Science, Institute of Cardiovascular Science, University College London (UCL), London, WC1E 6JF, UK
| | - Sandra De Miranda Pinheiro
- Functional Gene Annotation, Preclinical and Fundamental Science, Institute of Cardiovascular Science, University College London (UCL), London, WC1E 6JF, UK
| |
Collapse
|
5
|
Hatos A, Quaglia F, Piovesan D, Tosatto SCE. APICURON: a database to credit and acknowledge the work of biocurators. Database (Oxford) 2021; 2021:baab019. [PMID: 33882120 PMCID: PMC8060004 DOI: 10.1093/database/baab019] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 03/12/2021] [Accepted: 04/12/2021] [Indexed: 11/14/2022]
Abstract
APICURON is an open and freely accessible resource that tracks and credits the work of biocurators across multiple participating knowledgebases. Biocuration is essential to extract knowledge from research data and make it available in a structured and standardized way to the scientific community. However, processing biological data-mainly from literature-requires a huge effort that is difficult to attribute and quantify. APICURON collects biocuration events from third-party resources and aggregates this information, spotlighting biocurator contributions. APICURON promotes biocurator engagement implementing gamification concepts like badges, medals and leaderboards and at the same time provides a monitoring service for registered resources and for biocurators themselves. APICURON adopts a data model that is flexible enough to represent and track the majority of biocuration activities. Biocurators are identified through their Open Researcher and Contributor ID. The definition of curation events, scoring systems and rules for assigning badges and medals are resource-specific and easily customizable. Registered resources can transfer curation activities on the fly through a secure and robust Application Programming Interface (API). Here, we show how simple and effective it is to connect a resource to APICURON, describing the DisProt database of intrinsically disordered proteins as a use case. We believe APICURON will provide biological knowledgebases with a service to recognize and credit the effort of their biocurators, monitor their activity and promote curator engagement. Database URL: https://apicuron.org.
Collapse
Affiliation(s)
- András Hatos
- Department of Biomedical Sciences, University of Padua, Via Ugo Bassi 58/B, Padova 35131, Italy
| | - Federica Quaglia
- Department of Biomedical Sciences, University of Padua, Via Ugo Bassi 58/B, Padova 35131, Italy
| | - Damiano Piovesan
- Department of Biomedical Sciences, University of Padua, Via Ugo Bassi 58/B, Padova 35131, Italy
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padua, Via Ugo Bassi 58/B, Padova 35131, Italy
| |
Collapse
|
6
|
Beard N, Bacall F, Nenadic A, Thurston M, Goble CA, Sansone SA, Attwood TK. TeSS: a platform for discovering life-science training opportunities. Bioinformatics 2020; 36:3290-3291. [PMID: 32044952 PMCID: PMC7214044 DOI: 10.1093/bioinformatics/btaa047] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2019] [Revised: 12/10/2019] [Accepted: 02/03/2020] [Indexed: 11/13/2022] Open
Abstract
SUMMARY Dispersed across the Internet is an abundance of disparate, disconnected training information, making it hard for researchers to find training opportunities that are relevant to them. To address this issue, we have developed a new platform-TeSS-which aggregates geographically distributed information and presents it in a central, feature-rich portal. Data are gathered automatically from content providers via bespoke scripts. These resources are cross-linked with related data and tools registries, and made available via a search interface, a data API and through widgets. AVAILABILITY AND IMPLEMENTATION https://tess.elixir-europe.org.
Collapse
Affiliation(s)
- Niall Beard
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Finn Bacall
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Aleksandra Nenadic
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Milo Thurston
- Department of Engineering Science, Oxford e-Research Centre, University of Oxford, Oxford OX1 3QG, UK
| | - Carole A Goble
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Susanna-Assunta Sansone
- Department of Engineering Science, Oxford e-Research Centre, University of Oxford, Oxford OX1 3QG, UK
| | - Teresa K Attwood
- Department of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| |
Collapse
|
7
|
Tractenberg RE, Lindvall JM, Attwood TK, Via A. The Mastery Rubric for Bioinformatics: A tool to support design and evaluation of career-spanning education and training. PLoS One 2019; 14:e0225256. [PMID: 31770418 PMCID: PMC6879125 DOI: 10.1371/journal.pone.0225256] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2019] [Accepted: 10/24/2019] [Indexed: 11/18/2022] Open
Abstract
As the life sciences have become more data intensive, the pressure to incorporate the requisite training into life-science education and training programs has increased. To facilitate curriculum development, various sets of (bio)informatics competencies have been articulated; however, these have proved difficult to implement in practice. Addressing this issue, we have created a curriculum-design and -evaluation tool to support the development of specific Knowledge, Skills and Abilities (KSAs) that reflect the scientific method and promote both bioinformatics practice and the achievement of competencies. Twelve KSAs were extracted via formal analysis, and stages along a developmental trajectory, from uninitiated student to independent practitioner, were identified. Demonstration of each KSA by a performer at each stage was initially described (Performance Level Descriptors, PLDs), evaluated, and revised at an international workshop. This work was subsequently extended and further refined to yield the Mastery Rubric for Bioinformatics (MR-Bi). The MR-Bi was validated by demonstrating alignment between the KSAs and competencies, and its consistency with principles of adult learning. The MR-Bi tool provides a formal framework to support curriculum building, training, and self-directed learning. It prioritizes the development of independence and scientific reasoning, and is structured to allow individuals (regardless of career stage, disciplinary background, or skill level) to locate themselves within the framework. The KSAs and their PLDs promote scientific problem formulation and problem solving, lending the MR-Bi durability and flexibility. With its explicit developmental trajectory, the tool can be used by developing or practicing scientists to direct their (and their team's) acquisition of new, or to deepen existing, bioinformatics KSAs. The MR-Bi is a tool that can contribute to the cultivation of a next generation of bioinformaticians who are able to design reproducible and rigorous research, and to critically analyze results from their own, and others', work.
Collapse
Affiliation(s)
- Rochelle E. Tractenberg
- Collaborative for Research on Outcomes and –Metrics, and Departments of Neurology, Biostatistics, Biomathematics and Bioinformatics, and Rehabilitation Medicine, Georgetown University, Washington, DC, United States of America
| | - Jessica M. Lindvall
- National Bioinformatics Infrastructure Sweden (NBIS)/ELIXIR-SE, Science for Life Laboratory (SciLifeLab), Department of Biochemistry and Biophysics, Stockholm University, Stockholm, Sweden
| | - Teresa K. Attwood
- Department of Computer Science, The University of Manchester, Manchester, England, United Kingdom; The GOBLET Foundation, Radboud University, Nijmegen Medical Centre, Nijmegen, The Netherlands
| | - Allegra Via
- ELIXIR Italy, National Research Council of Italy, Institute of Molecular Biology and Pathology, Rome, Italy
| |
Collapse
|
8
|
van Gelder CWG, Hooft RWW, van Rijswijk MN, van den Berg L, Kok RG, Reinders M, Mons B, Heringa J. Bioinformatics in the Netherlands: the value of a nationwide community. Brief Bioinform 2019; 20:540-550. [PMID: 28968694 PMCID: PMC6433734 DOI: 10.1093/bib/bbx087] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2017] [Revised: 07/03/2017] [Indexed: 11/14/2022] Open
Abstract
This review provides a historical overview of the inception and development of bioinformatics research in the Netherlands. Rooted in theoretical biology by foundational figures such as Paulien Hogeweg (at Utrecht University since the 1970s), the developments leading to organizational structures supporting a relatively large Dutch bioinformatics community will be reviewed. We will show that the most valuable resource that we have built over these years is the close-knit national expert community that is well engaged in basic and translational life science research programmes. The Dutch bioinformatics community is accustomed to facing the ever-changing landscape of data challenges and working towards solutions together. In addition, this community is the stable factor on the road towards sustainability, especially in times where existing funding models are challenged and change rapidly.
Collapse
|
9
|
Attwood TK, Blackford S, Brazas MD, Davies A, Schneider MV. A global perspective on evolving bioinformatics and data science training needs. Brief Bioinform 2019; 20:398-404. [PMID: 28968751 PMCID: PMC6433731 DOI: 10.1093/bib/bbx100] [Citation(s) in RCA: 67] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Revised: 06/21/2017] [Indexed: 11/13/2022] Open
Abstract
Bioinformatics is now intrinsic to life science research, but the past decade has witnessed a continuing deficiency in this essential expertise. Basic data stewardship is still taught relatively rarely in life science education programmes, creating a chasm between theory and practice, and fuelling demand for bioinformatics training across all educational levels and career roles. Concerned by this, surveys have been conducted in recent years to monitor bioinformatics and computational training needs worldwide. This article briefly reviews the principal findings of a number of these studies. We see that there is still a strong appetite for short courses to improve expertise and confidence in data analysis and interpretation; strikingly, however, the most urgent appeal is for bioinformatics to be woven into the fabric of life science degree programmes. Satisfying the relentless training needs of current and future generations of life scientists will require a concerted response from stakeholders across the globe, who need to deliver sustainable solutions capable of both transforming education curricula and cultivating a new cadre of trainer scientists.
Collapse
Affiliation(s)
- Teresa K Attwood
- University of Manchester, School of Computer Science, Manchester, United Kingdom of Great Britain and Northern Ireland
| | - Sarah Blackford
- Lancaster University, Lancaster, United Kingdom of Great Britain and Northern Ireland
| | - Michelle D Brazas
- Ontario Institute for Cancer Research, Informatics and Bio-computing, 101 College St, Suite 800, Toronto, Ontario Canada
| | - Angela Davies
- The University of Manchester, School of Biological Sciences, Manchester, United Kingdom of Great Britain and Northern Ireland
| | - Maria Victoria Schneider
- University of Melbourne Melbourne Institute, Lab-14, 700 Swanston St, Melbourne Carlton Victoria, Australia
| |
Collapse
|
10
|
Stevens SLR, Kuzak M, Martinez C, Moser A, Bleeker P, Galland M. Building a local community of practice in scientific programming for life scientists. PLoS Biol 2018; 16:e2005561. [PMID: 30485260 PMCID: PMC6287879 DOI: 10.1371/journal.pbio.2005561] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Revised: 12/10/2018] [Indexed: 11/18/2022] Open
Abstract
In this paper, we describe why and how to build a local community of practice in scientific programming for life scientists who use computers and programming in their research. A community of practice is a small group of scientists who meet regularly to help each other and promote good practices in scientific programming. While most life scientists are well trained in the laboratory to conduct experiments, good practices with (big) data sets and their analysis are often missing. We propose a model on how to build such a community of practice at a local academic institution, present two real-life examples, and introduce challenges and implemented solutions. We believe that the current data deluge that life scientists face can benefit from the implementation of these small communities. Good practices spread among experimental scientists will foster open, transparent, and sound scientific results beneficial to society.
Collapse
Affiliation(s)
- Sarah L. R. Stevens
- Department of Bacteriology, University of Wisconsin–Madison, Madison, Wisconsin, United States of America
| | - Mateusz Kuzak
- Dutch Techcentre for Life Sciences, Utrecht, Netherlands
| | | | - Aurelia Moser
- Mozilla Foundation, Mountain View, California, United States of America
| | - Petra Bleeker
- Department of Plant Physiology, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, Netherlands
| | - Marc Galland
- Department of Plant Physiology, Swammerdam Institute for Life Sciences, University of Amsterdam, Amsterdam, Netherlands
| |
Collapse
|
11
|
Carvalho-Silva D, Garcia L, Morgan SL, Brooksbank C, Dunham I. Ten simple rules for delivering live distance training in bioinformatics across the globe using webinars. PLoS Comput Biol 2018; 14:e1006419. [PMID: 30439935 PMCID: PMC6237289 DOI: 10.1371/journal.pcbi.1006419] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Affiliation(s)
- Denise Carvalho-Silva
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
- Open Targets, Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Leyla Garcia
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Sarah L. Morgan
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Cath Brooksbank
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| | - Ian Dunham
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
- Open Targets, Wellcome Genome Campus, Hinxton, Cambridgeshire, United Kingdom
| |
Collapse
|
12
|
Watson-Haigh NS, Revote J, Suchecki R, Tyagi S, Corley SM, Shang CA, McGrath A. Towards an open, collaborative, reusable framework for sharing hands-on bioinformatics training workshops. Brief Bioinform 2017; 18:348-355. [PMID: 26984618 PMCID: PMC5444239 DOI: 10.1093/bib/bbw013] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2015] [Indexed: 11/13/2022] Open
Abstract
There is a clear demand for hands-on bioinformatics training. The development of bioinformatics workshop content is both time-consuming and expensive. Therefore, enabling trainers to develop bioinformatics workshops in a way that facilitates reuse is becoming increasingly important. The most widespread practice for sharing workshop content is through making PDF, PowerPoint and Word documents available online. While this effort is to be commended, such content is usually not so easy to reuse or repurpose and does not capture all the information required for a third party to rerun a workshop. We present an open, collaborative framework for developing and maintaining, reusable and shareable hands-on training workshop content.
Collapse
Affiliation(s)
- Nathan S Watson-Haigh
- Australian Centre for Plant Functional Genomics (ACPFG), University of Adelaide, PMB 1, Glen Osmond, Australia
| | - Jerico Revote
- Monash Bioinformatics Platform, Monash University, Melbourne, VIC, Australia
| | - Radoslaw Suchecki
- Australian Centre for Plant Functional Genomics (ACPFG), University of Adelaide, PMB 1, Glen Osmond, Australia
| | - Sonika Tyagi
- Oncogenomics Laboratory, Queensland Institute of Medical Research, Herston, Brisbane, QLD, Australia
| | - Susan M Corley
- Systems Biology Initiative, School of Biotechnology and Biomolecular Sciences, UNSW Australia, Sydney, New South Wales, Australia
| | - Catherine A Shang
- School of Biomedical Sciences and the Institute for Molecular Bioscience, The University of Queensland, Queensland, Brisbane, Australia
| | - Annette McGrath
- The Genome Analysis Centre, Norwich, ELIXIR, Wellcome Trust Genome Campus, Hinxton, UK, The Swedish University for Agricultural Sciences, Uppsala, Sweden, European Molecular Biology Laboratory, Heidelberg, Germany, Ontario Institute for Cancer Research, Toronto, Canada, Instituto Gulbenkian de Ciência, Oeiras, Portugal, The University of New South Wales, Sydney, Australia, Netherlands Bioinformatics Centre, Department of Bioinformatics, Radboud Medical Center, Nijmegen, The Netherlands, CSC - IT Center for Science Ltd., Espoo, Finland, Whitehead Institute for Biomedical Research, MIT, Cambridge, MA, USA, CSIRO, Bioinformatics Core, Canberra, Australia, The Sainsbury Laboratory, Norwich Research Park, Norwich, UK, SIB Swiss Institute of Bioinformatics, 1 Rue Michel Servet, Genève, Switzerland, Academis, Illstrasse 12, Berlin, Germany, The Nowgen Centre, 29 Grafton Street, Manchester, UK, Department of Physics, Sapienza University, Rome, Italy, The Roslin Institute, Edinburgh, UK and The University of Manchester, Manchester, UK
| |
Collapse
|
13
|
Horro C, Cook M, Attwood TK, Brazas MD, Hancock JM, Palagi P, Corpas M, Jimenez R. BioCIDER: a Contextualisation InDEx for biological Resources discovery. Bioinformatics 2017; 33:2607-2608. [PMID: 28407033 PMCID: PMC5870719 DOI: 10.1093/bioinformatics/btx213] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Accepted: 04/11/2017] [Indexed: 11/13/2022] Open
Abstract
Summary The vast, uncoordinated proliferation of bioinformatics resources (databases, software tools, training materials etc.) makes it difficult for users to find them. To facilitate their discovery, various services are being developed to collect such resources into registries. We have developed BioCIDER, which, rather like online shopping ‘recommendations’, provides a contextualization index to help identify biological resources relevant to the content of the sites in which it is embedded. Availability and Implementation BioCIDER (www.biocider.org) is an open-source platform. Documentation is available online (https://goo.gl/Klc51G), and source code is freely available via GitHub (https://github.com/BioCIDER). The BioJS widget that enables websites to embed contextualization is available from the BioJS registry (http://biojs.io/). All code is released under an MIT licence.
Collapse
Affiliation(s)
- Carlos Horro
- Elixir Department, Earlham Institute, Norwich Research Park Innovation Centre, Norwich NR4 7UH, UK
| | - Martin Cook
- ELIXIR Hub, The European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK
| | - Teresa K Attwood
- School of Computer Science, The University of Manchester, Manchester M13 9PL, UK
| | - Michelle D Brazas
- Informatics and Bio-computing, Ontario Institute for Cancer Research, Toronto M5G 0A3, Canada
| | - John M Hancock
- Elixir Department, Earlham Institute, Norwich Research Park Innovation Centre, Norwich NR4 7UH, UK
| | - Patricia Palagi
- SIB Training Group, SIB Swiss Institute of Bioinformatics, Lausanne 1005, Switzerland
| | - Manuel Corpas
- Repositive, Future Business Centre, Kings' Hedges Road, Cambridge CB4 2HY, UK
| | - Rafael Jimenez
- ELIXIR Hub, The European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK
| |
Collapse
|
14
|
Larcombe L, Hendricusdottir R, Attwood TK, Bacall F, Beard N, Bellis LJ, Dunn WB, Hancock JM, Nenadic A, Orengo C, Overduin B, Sansone SA, Thurston M, Viant MR, Winder CL, Goble CA, Ponting CP, Rustici G. ELIXIR-UK role in bioinformatics training at the national level and across ELIXIR. F1000Res 2017; 6. [PMID: 28781748 PMCID: PMC5521157 DOI: 10.12688/f1000research.11837.1] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 06/19/2017] [Indexed: 11/20/2022] Open
Abstract
ELIXIR-UK is the UK node of ELIXIR, the European infrastructure for life science data. Since its foundation in 2014, ELIXIR-UK has played a leading role in training both within the UK and in the ELIXIR Training Platform, which coordinates and delivers training across all ELIXIR members. ELIXIR-UK contributes to the Training Platform’s coordination and supports the development of training to address key skill gaps amongst UK scientists. As part of this work it acts as a conduit for nationally-important bioinformatics training resources to promote their activities to the ELIXIR community. ELIXIR-UK also leads ELIXIR’s flagship Training Portal, TeSS, which collects information about a diverse range of training and makes it easily accessible to the community. ELIXIR-UK also works with others to provide key digital skills training, partnering with the Software Sustainability Institute to provide Software Carpentry training to the ELIXIR community and to establish the Data Carpentry initiative, and taking a lead role amongst national stakeholders to deliver the StaTS project – a coordinated effort to drive engagement with training in statistics.
Collapse
Affiliation(s)
- L Larcombe
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - R Hendricusdottir
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - T K Attwood
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - F Bacall
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - N Beard
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - L J Bellis
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| | - W B Dunn
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | | | - A Nenadic
- The Software Sustainability Institute, University of Manchester, Manchester, M13 9PL, UK
| | - C Orengo
- University College London, London, WC1E 6BT, UK
| | - B Overduin
- Edinburgh Genomics, University of Edinburgh, Edinburgh, EH9 3FL, UK
| | - S-A Sansone
- Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, UK
| | - M Thurston
- Oxford e-Research Centre, University of Oxford, Oxford, OX1 3QG, UK
| | - M R Viant
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | - C L Winder
- Birmingham Metabolomics Training Centre, School of Biosciences, University of Birmingham, Birmingham, B15 2TT, UK
| | - C A Goble
- School of Computer Science, University of Manchester, Manchester, M13 9PL, UK
| | - C P Ponting
- MRC Human Genetics Unit, The Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh, EH4 2XU, UK
| | - G Rustici
- Department of Genetics, University of Cambridge, Cambridge, CB2 3EH, UK
| |
Collapse
|
15
|
Machluf Y, Gelbart H, Ben-Dor S, Yarden A. Making authentic science accessible-the benefits and challenges of integrating bioinformatics into a high-school science curriculum. Brief Bioinform 2017; 18:145-159. [PMID: 26801769 PMCID: PMC5221422 DOI: 10.1093/bib/bbv113] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2015] [Revised: 11/19/2015] [Accepted: 12/11/2015] [Indexed: 12/27/2022] Open
Abstract
Despite the central place held by bioinformatics in modern life sciences and related areas, it has only recently been integrated to a limited extent into high-school teaching and learning programs. Here we describe the assessment of a learning environment entitled 'Bioinformatics in the Service of Biotechnology'. Students' learning outcomes and attitudes toward the bioinformatics learning environment were measured by analyzing their answers to questions embedded within the activities, questionnaires, interviews and observations. Students' difficulties and knowledge acquisition were characterized based on four categories: the required domain-specific knowledge (declarative, procedural, strategic or situational), the scientific field that each question stems from (biology, bioinformatics or their combination), the associated cognitive-process dimension (remember, understand, apply, analyze, evaluate, create) and the type of question (open-ended or multiple choice). Analysis of students' cognitive outcomes revealed learning gains in bioinformatics and related scientific fields, as well as appropriation of the bioinformatics approach as part of the students' scientific 'toolbox'. For students, questions stemming from the 'old world' biology field and requiring declarative or strategic knowledge were harder to deal with. This stands in contrast to their teachers' prediction. Analysis of students' affective outcomes revealed positive attitudes toward bioinformatics and the learning environment, as well as their perception of the teacher's role. Insights from this analysis yielded implications and recommendations for curriculum design, classroom enactment, teacher education and research. For example, we recommend teaching bioinformatics in an integrative and comprehensive manner, through an inquiry process, and linking it to the wider science curriculum.
Collapse
Affiliation(s)
- Yossy Machluf
- Department of Science Teaching, Weizmann Institute of Science, Rehovot, Israel
| | - Hadas Gelbart
- Department of Science Teaching, Weizmann Institute of Science, Rehovot, Israel
- National Authority for Measurement and Evaluation in Education (RAMA), Ministry of Education, Ramat-Gan, Israel
| | - Shifra Ben-Dor
- Faculty of Biochemistry, Department of Biological Services, Bioinformatics and Biological Computing Unit, Weizmann Institute of Science, Rehovot, Israel
| | - Anat Yarden
- Department of Science Teaching, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
16
|
Lewis J, Breeze CE, Charlesworth J, Maclaren OJ, Cooper J. Where next for the reproducibility agenda in computational biology? BMC SYSTEMS BIOLOGY 2016; 10:52. [PMID: 27422148 PMCID: PMC4946111 DOI: 10.1186/s12918-016-0288-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 06/08/2016] [Indexed: 11/24/2022]
Abstract
Background The concept of reproducibility is a foundation of the scientific method. With the arrival of fast and powerful computers over the last few decades, there has been an explosion of results based on complex computational analyses and simulations. The reproducibility of these results has been addressed mainly in terms of exact replicability or numerical equivalence, ignoring the wider issue of the reproducibility of conclusions through equivalent, extended or alternative methods. Results We use case studies from our own research experience to illustrate how concepts of reproducibility might be applied in computational biology. Several fields have developed ‘minimum information’ checklists to support the full reporting of computational simulations, analyses and results, and standardised data formats and model description languages can facilitate the use of multiple systems to address the same research question. We note the importance of defining the key features of a result to be reproduced, and the expected agreement between original and subsequent results. Dynamic, updatable tools for publishing methods and results are becoming increasingly common, but sometimes come at the cost of clear communication. In general, the reproducibility of computational research is improving but would benefit from additional resources and incentives. Conclusions We conclude with a series of linked recommendations for improving reproducibility in computational biology through communication, policy, education and research practice. More reproducible research will lead to higher quality conclusions, deeper understanding and more valuable knowledge.
Collapse
Affiliation(s)
- Joanna Lewis
- Centre for Maths and Physics in the Life Sciences and Experimental Biology, University College London, Physics Building, Gower Place, London, WC1E 6BT, UK. .,NIHR Health Protection Research Unit in Modelling Methodology, Department of Infectious Disease Epidemiology, Imperial College London, St Mary's Campus, Norfolk Place, London, W2 1PG, UK.
| | - Charles E Breeze
- UCL Cancer Institute, University College London, 72 Huntley St, London, WC1E 6DD, UK
| | - Jane Charlesworth
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK
| | - Oliver J Maclaren
- Department of Mathematics, University of Auckland, Auckland, 1142, New Zealand.,Department of Engineering Science, University of Auckland, Auckland, 1142, New Zealand
| | - Jonathan Cooper
- Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford, OX1 3QD, UK
| |
Collapse
|
17
|
Schiffthaler B, Kostadima M, NGS Trainer Consortium, Delhomme N, Rustici G. Training in High-Throughput Sequencing: Common Guidelines to Enable Material Sharing, Dissemination, and Reusability. PLoS Comput Biol 2016; 12:e1004937. [PMID: 27309738 PMCID: PMC4910983 DOI: 10.1371/journal.pcbi.1004937] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022] Open
Abstract
The advancement of high-throughput sequencing (HTS) technologies and the rapid development of numerous analysis algorithms and pipelines in this field has resulted in an unprecedentedly high demand for training scientists in HTS data analysis. Embarking on developing new training materials is challenging for many reasons. Trainers often do not have prior experience in preparing or delivering such materials and struggle to keep them up to date. A repository of curated HTS training materials would support trainers in materials preparation, reduce the duplication of effort by increasing the usage of existing materials, and allow for the sharing of teaching experience among the HTS trainers’ community. To achieve this, we have developed a strategy for materials’ curation and dissemination. Standards for describing training materials have been proposed and applied to the curation of existing materials. A Git repository has been set up for sharing annotated materials that can now be reused, modified, or incorporated into new courses. This repository uses Git; hence, it is decentralized and self-managed by the community and can be forked/built-upon by all users. The repository is accessible at http://bioinformatics.upsc.se/htmr. In recent years, the advancement of high-throughput sequencing (HTS) and the rapid development of numerous analysis algorithms and pipelines in this field have resulted in an unprecedentedly high demand for training scientists in HTS data analysis. Generating effective training materials is time-consuming, and a large body of training materials on HTS data analysis has already been generated but is rarely shared among trainers. In this paper we provide guidelines to trainers for describing training materials to increase their reusability. The best practices standards proposed here have been used to annotate a collection of HTS training materials, which is now available to the trainers’ community in Git and discoverable through the ELIXIR and GOBLET portals. Efforts are now underway to utilize the strategy presented in this paper to annotate a wider collection of training materials and define a generic approach for the curation and dissemination of materials that should be adopted by existing training portals and new emerging initiatives.
Collapse
Affiliation(s)
- Bastian Schiffthaler
- Department of Plant Physiology, Umeå Plant Science Centre, Umeå University, Umeå, Sweden
| | - Myrto Kostadima
- Department of Haematology, University of Cambridge, Cambridge, United Kingdom
| | | | - Nicolas Delhomme
- Department of Plant Physiology, Umeå Plant Science Centre, Umeå University, Umeå, Sweden
- Department of Forest Genetics and Plant Physiology, Umeå Plant Science Centre, Swedish University of Agricultural Sciences, Umeå, Sweden
- * E-mail: (ND); (GR)
| | - Gabriella Rustici
- School of Biological Sciences, Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- * E-mail: (ND); (GR)
| |
Collapse
|
18
|
Bendou H, Entfellner JBD, van Heusden P, Gamieldien J, Tiffin N. NetCapDB: measuring bioinformatics capacity development in Africa. BMC Res Notes 2016; 9:144. [PMID: 26945860 PMCID: PMC4779554 DOI: 10.1186/s13104-016-1950-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2015] [Accepted: 02/23/2016] [Indexed: 11/24/2022] Open
Abstract
Background The National Institutes of Health (USA) has committed 5 years of funding to the Bioinformatics Network of the Human Heredity and Health in Africa initiative. This pan-African network aims to develop capacity for bioinformatics research, in order to provide support to human health genomics research programs ongoing on the continent. Over the 5 years of funding, it is imperative to track changes in bioinformatics capacity at the funded centres and to document how the funding has translated into capacity development during this time frame. Results The Network capacity database, NetCapDB, is a relational database that captures quantitative metrics for bioinformatics capacity, and tracks the changes in these metrics over time. A graphical user interface allows for straight-forward, browser-based data entry by users across Africa; and for visual and graph-based exploration of captured data. A reporting interface allows for semi-automated generation of standardized reports for monitoring and evaluation purposes. Electronic supplementary material The online version of this article (doi:10.1186/s13104-016-1950-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hocine Bendou
- South African National Bioinformatics Institute/Medical Research Council of South Africa Bioinformatics Unit, University of the Western Cape, Cape Town, South Africa.
| | - Jean-Baka Domelevo Entfellner
- South African National Bioinformatics Institute/Medical Research Council of South Africa Bioinformatics Unit, University of the Western Cape, Cape Town, South Africa.
| | - Peter van Heusden
- South African National Bioinformatics Institute/Medical Research Council of South Africa Bioinformatics Unit, University of the Western Cape, Cape Town, South Africa.
| | - Junaid Gamieldien
- South African National Bioinformatics Institute/Medical Research Council of South Africa Bioinformatics Unit, University of the Western Cape, Cape Town, South Africa.
| | - Nicki Tiffin
- South African National Bioinformatics Institute/Medical Research Council of South Africa Bioinformatics Unit, University of the Western Cape, Cape Town, South Africa.
| | | |
Collapse
|
19
|
Robson JF, Barker D. Comparison of the protein-coding gene content of Chlamydia trachomatis and Protochlamydia amoebophila using a Raspberry Pi computer. BMC Res Notes 2015; 8:561. [PMID: 26462790 PMCID: PMC4604092 DOI: 10.1186/s13104-015-1476-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Accepted: 09/21/2015] [Indexed: 12/12/2022] Open
Abstract
Background To demonstrate the bioinformatics capabilities of a low-cost computer, the Raspberry Pi, we present a comparison of the protein-coding gene content of two species in phylum Chlamydiae: Chlamydia trachomatis, a common sexually transmitted infection of humans, and Candidatus Protochlamydia amoebophila, a recently discovered amoebal endosymbiont. Identifying species-specific proteins and differences in protein families could provide insights into the unique phenotypes of the two species. Results Using a Raspberry Pi computer, sequence similarity-based protein families were predicted across the two species, C. trachomatis and P. amoebophila, and their members counted. Examples include nine multi-protein families unique to C. trachomatis, 132 multi-protein families unique to P. amoebophila and one family with multiple copies in both. Most families unique to C. trachomatis were polymorphic outer-membrane proteins. Additionally, multiple protein families lacking functional annotation were found. Predicted functional interactions suggest one of these families is involved with the exodeoxyribonuclease V complex. Conclusion The Raspberry Pi computer is adequate for a comparative genomics project of this scope. The protein families unique to P. amoebophila may provide a basis for investigating the host-endosymbiont interaction. However, additional species should be included; and further laboratory research is required to identify the functions of unknown or putative proteins. Multiple outer membrane proteins were found in C. trachomatis, suggesting importance for host evasion. The tyrosine transport protein family is shared between both species, with four proteins in C. trachomatis and two in P. amoebophila. Shared protein families could provide a starting point for discovery of wide-spectrum drugs against Chlamydiae. Electronic supplementary material The online version of this article (doi:10.1186/s13104-015-1476-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- James F Robson
- School of Biology, University of St Andrews, St Andrews, Fife, KY16 9TH, UK. .,Department of Biology, University of York, Wentworth Way, York, YO10 5DD, UK.
| | - Daniel Barker
- School of Biology, University of St Andrews, St Andrews, Fife, KY16 9TH, UK.
| |
Collapse
|
20
|
Lapatas V, Stefanidakis M. BATMat: Bioinformatics Autodiscovery of Training Materials. Brief Bioinform 2015; 17:728-30. [PMID: 26330576 DOI: 10.1093/bib/bbv071] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Indexed: 11/13/2022] Open
Abstract
UNLABELLED We present Bioinformatics Autodiscovery of Training Materials (BATMat), an open-source, Google-based, targeted, automatic search tool for training materials related to bioinformatics. BATMat helps gain access with one click to filtered and portable information containing links to existing materials (when present). It also offers functionality to sort results according to source site or title. AVAILABILITY http://imbatmat.com CONTACT piar301@gmail.com.
Collapse
|
21
|
Abstract
Open science describes the practice of carrying out scientific research in a completely transparent manner, and making the results of that research available to everyone. Isn’t that just ‘science’?
Collapse
Affiliation(s)
- Mick Watson
- The Roslin Institute, Royal (Dick) School of Veterinary Studies, University of Edinburgh, Easter Bush, Edinburgh, EH25 9RG, UK.
| |
Collapse
|
22
|
Nicolazzi EL, Biffani S, Biscarini F, Orozco Ter Wengel P, Caprera A, Nazzicari N, Stella A. Software solutions for the livestock genomics SNP array revolution. Anim Genet 2015; 46:343-53. [PMID: 25907889 DOI: 10.1111/age.12295] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2015] [Indexed: 02/04/2023]
Abstract
Since the beginning of the genomic era, the number of available single nucleotide polymorphism (SNP) arrays has grown considerably. In the bovine species alone, 11 SNP chips not completely covered by intellectual property are currently available, and the number is growing. Genomic/genotype data are not standardized, and this hampers its exchange and integration. In addition, software used for the analyses of these data usually requires not standard (i.e. case specific) input files which, considering the large amount of data to be handled, require at least some programming skills in their production. In this work, we describe a software toolkit for SNP array data management, imputation, genome-wide association studies, population genetics and genomic selection. However, this toolkit does not solve the critical need for standardization of the genotypic data and software input files. It only highlights the chaotic situation each researcher has to face on a daily basis and gives some helpful advice on the currently available tools in order to navigate the SNP array data complexity.
Collapse
Affiliation(s)
- E L Nicolazzi
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - S Biffani
- Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - F Biscarini
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - P Orozco Ter Wengel
- School of Biosciences, Cardiff University, Museum Avenue, Cardiff, CF10 3AX, UK
| | - A Caprera
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - N Nazzicari
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| | - A Stella
- Fondazione Parco Tecnologico Padano (PTP), Via Einstein, Cascina Codazza, Lodi, 26900, Italy.,Istituto di biologia e biotecnologia Agraria (IBBA-CNR), Consiglio Nazionale delle Ricerche, Via Einstein, Cascina Codazza, Lodi, 26900, Italy
| |
Collapse
|
23
|
Abstract
In recent years, high-throughput technologies have brought big data to the life sciences. The march of progress has been rapid, leaving in its wake a demand for courses in data analysis, data stewardship, computing fundamentals, etc., a need that universities have not yet been able to satisfy—paradoxically, many are actually closing “niche” bioinformatics courses at a time of critical need. The impact of this is being felt across continents, as many students and early-stage researchers are being left without appropriate skills to manage, analyse, and interpret their data with confidence. This situation has galvanised a group of scientists to address the problems on an international scale. For the first time, bioinformatics educators and trainers across the globe have come together to address common needs, rising above institutional and international boundaries to cooperate in sharing bioinformatics training expertise, experience, and resources, aiming to put ad hoc training practices on a more professional footing for the benefit of all.
Collapse
|
24
|
Weber RJM, Winder CL, Larcombe LD, Dunn WB, Viant MR. Training needs in metabolomics. Metabolomics 2015; 11:784-786. [PMID: 26109924 PMCID: PMC4475540 DOI: 10.1007/s11306-015-0815-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 05/20/2015] [Indexed: 11/29/2022]
Affiliation(s)
- Ralf J. M. Weber
- />School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| | - Catherine L. Winder
- />School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| | - Lee D. Larcombe
- />Department of Physiology, Anatomy & Genetics, University of Oxford, Oxford, OX1 3PT UK
| | - Warwick B. Dunn
- />School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| | - Mark R. Viant
- />School of Biosciences, University of Birmingham, Edgbaston, Birmingham, B15 2TT UK
| |
Collapse
|