Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Bhagat J, Tanoh F, Nzuobontane E, Laurent T, Orlowski J, Roos M, Wolstencroft K, Aleksejevs S, Stevens R, Pettifer S, Lopez R, Goble CA. BioCatalogue: a universal catalogue of web services for the life sciences. Nucleic Acids Res 2010;38:W689-94. [PMID: 20484378 PMCID: PMC2896129 DOI: 10.1093/nar/gkq394] [Citation(s) in RCA: 162] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2010] [Revised: 04/27/2010] [Accepted: 04/29/2010] [Indexed: 12/01/2022] Open

For:	Bhagat J, Tanoh F, Nzuobontane E, Laurent T, Orlowski J, Roos M, Wolstencroft K, Aleksejevs S, Stevens R, Pettifer S, Lopez R, Goble CA. BioCatalogue: a universal catalogue of web services for the life sciences. Nucleic Acids Res 2010;38:W689-94. [PMID: 20484378 PMCID: PMC2896129 DOI: 10.1093/nar/gkq394] [Citation(s) in RCA: 162] [Impact Index Per Article: 11.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2010] [Revised: 04/27/2010] [Accepted: 04/29/2010] [Indexed: 12/01/2022] Open

Number

Cited by Other Article(s)

Cai P, Liu S, Zhang D, Xing H, Han M, Liu D, Gong L, Hu QN. SynBioTools: a one-stop facility for searching and selecting synthetic biology tools. BMC Bioinformatics 2023;24:152. [PMID: 37069545 PMCID: PMC10111727 DOI: 10.1186/s12859-023-05281-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2023] [Accepted: 04/11/2023] [Indexed: 04/19/2023] Open

Lamprecht AL, Palmblad M, Ison J, Schwämmle V, Al Manir MS, Altintas I, Baker CJO, Ben Hadj Amor A, Capella-Gutierrez S, Charonyktakis P, Crusoe MR, Gil Y, Goble C, Griffin TJ, Groth P, Ienasescu H, Jagtap P, Kalaš M, Kasalica V, Khanteymoori A, Kuhn T, Mei H, Ménager H, Möller S, Richardson RA, Robert V, Soiland-Reyes S, Stevens R, Szaniszlo S, Verberne S, Verhoeven A, Wolstencroft K. Perspectives on automated composition of workflows in the life sciences. F1000Res 2021;10:897. [PMID: 34804501 PMCID: PMC8573700 DOI: 10.12688/f1000research.54159.1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/27/2021] [Indexed: 12/29/2022] Open

Abstract

Scientific data analyses often combine several computational tools in automated pipelines, or workflows. Thousands of such workflows have been used in the life sciences, though their composition has remained a cumbersome manual process due to a lack of standards for annotation, assembly, and implementation. Recent technological advances have returned the long-standing vision of automated workflow composition into focus. This article summarizes a recent Lorentz Center workshop dedicated to automated composition of workflows in the life sciences. We survey previous initiatives to automate the composition process, and discuss the current state of the art and future perspectives. We start by drawing the "big picture" of the scientific workflow development life cycle, before surveying and discussing current methods, technologies and practices for semantic domain modelling, automation in workflow development, and workflow assessment. Finally, we derive a roadmap of individual and community-based actions to work toward the vision of automated workflow development in the forthcoming years. A central outcome of the workshop is a general description of the workflow life cycle in six stages: 1) scientific question or hypothesis, 2) conceptual workflow, 3) abstract workflow, 4) concrete workflow, 5) production workflow, and 6) scientific results. The transitions between stages are facilitated by diverse tools and methods, usually incorporating domain knowledge in some form. Formal semantic domain modelling is hard and often a bottleneck for the application of semantic technologies. However, life science communities have made considerable progress here in recent years and are continuously improving, renewing interest in the application of semantic technologies for workflow exploration, composition and instantiation. Combined with systematic benchmarking with reference data and large-scale deployment of production-stage workflows, such technologies enable a more systematic process of workflow development than we know today. We believe that this can lead to more robust, reusable, and sustainable workflows in the future.

Collapse

Affiliation(s)

Anna-Lena Lamprecht Utrecht University, 3584 CS Utrecht, The Netherlands
Magnus Palmblad Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
Jon Ison French Institute of Bioinformatics, 91057 Évry, France
Veit Schwämmle University of Southern Denmark, 5230 Odense M, Denmark
Mohammad Sadnan Al Manir University of Virginia, Charlottesville, VA, 22903, USA
Ilkay Altintas University of California San Diego, La Jolla, CA, 92093, USA
Christopher J. O. Baker University of New Brunswick, Saint John, E2L 4L5, Canada IPSNP Computing Inc., Saint John, E2L 4S6, Canada
Ammar Ben Hadj Amor Westerdijk Institute, 3584 CT, Utrecht, The Netherlands
Salvador Capella-Gutierrez Barcelona Supercomputing Center (BSC), 08034, Barcelona, Spain
Paulos Charonyktakis Gnosis Data Analysis PC, GR-700 13 Heraklion, Greece
Michael R. Crusoe VU Amsterdam, 1081 HV Amsterdam, The Netherlands
Yolanda Gil University of Southern California, Marina Del Rey, CA, 90292, USA
Carole Goble Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
Timothy J. Griffin Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA
Paul Groth University of Amsterdam, 1090 GH Amsterdam, The Netherlands
Hans Ienasescu Technical University of Denmark, 2800 Kongens Lyngby, Denmark
Pratik Jagtap Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN, 55455, USA
Matúš Kalaš University of Bergen, 5020 Bergen, Norway
Vedran Kasalica Utrecht University, 3584 CS Utrecht, The Netherlands
Alireza Khanteymoori Bioinformatics Group, University of Freiburg, 79110 Freiburg, Germany
Tobias Kuhn VU Amsterdam, 1081 HV Amsterdam, The Netherlands
Hailiang Mei Sequencing Analysis Support Core, Leiden University Medical Center, 2333 ZC Leiden, The Netherlands
Hervé Ménager Institut Pasteur, 75015 Paris, France
Steffen Möller IBIMA, Rostock University Medical Center, 18057 Rostock, Germany
Robin A. Richardson Netherlands eScience Center, 1098 XG Amsterdam, The Netherlands
Vincent Robert Westerdijk Institute, 3584 CT, Utrecht, The Netherlands
Stian Soiland-Reyes Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK Informatics Institute, University of Amsterdam, 1090 GH Amsterdam, The Netherlands
Robert Stevens Department of Computer Science, The University of Manchester, Manchester, M13 9PL, UK
Szoke Szaniszlo Westerdijk Institute, 3584 CT, Utrecht, The Netherlands
Suzan Verberne Leiden Institute of Advanced Computer Science, Leiden University, 2333 BE Leiden, The Netherlands
Aswin Verhoeven Leiden University Medical Center, 2333 ZA, Leiden, The Netherlands
Katherine Wolstencroft Leiden Institute of Advanced Computer Science, Leiden University, 2333 BE Leiden, The Netherlands

Collapse

Duvaud S, Gabella C, Lisacek F, Stockinger H, Ioannidis V, Durinx C. Expasy, the Swiss Bioinformatics Resource Portal, as designed by its users. Nucleic Acids Res 2021;49:W216-W227. [PMID: 33849055 PMCID: PMC8265094 DOI: 10.1093/nar/gkab225] [Citation(s) in RCA: 269] [Impact Index Per Article: 89.7] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2021] [Revised: 03/11/2021] [Accepted: 04/01/2021] [Indexed: 12/16/2022] Open

Pettersen EF, Goddard TD, Huang CC, Meng EC, Couch GS, Croll TI, Morris JH, Ferrin TE. UCSF ChimeraX: Structure visualization for researchers, educators, and developers. Protein Sci 2021;30:70-82. [PMID: 32881101 PMCID: PMC7737788 DOI: 10.1002/pro.3943] [Citation(s) in RCA: 3559] [Impact Index Per Article: 1186.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2020] [Revised: 08/26/2020] [Accepted: 08/28/2020] [Indexed: 12/27/2022]

Lachmann A, Clarke DJB, Torre D, Xie Z, Ma'ayan A. Interoperable RNA-Seq analysis in the cloud. BIOCHIMICA ET BIOPHYSICA ACTA-GENE REGULATORY MECHANISMS 2020;1863:194521. [PMID: 32156561 DOI: 10.1016/j.bbagrm.2020.194521] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/02/2019] [Revised: 03/01/2020] [Accepted: 03/01/2020] [Indexed: 12/25/2022]

Wei Q, Zhang Y, Amith M, Lin R, Lapeyrolerie J, Tao C, Xu H. Recognizing software names in biomedical literature using machine learning. Health Informatics J 2019;26:21-33. [PMID: 31566474 PMCID: PMC7334865 DOI: 10.1177/1460458219869490] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]

RNApolis: Computational Platform for RNA Structure Analysis. FOUNDATIONS OF COMPUTING AND DECISION SCIENCES 2019. [DOI: 10.2478/fcds-2019-0012] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open

Bagnacani A, Wolfien M, Wolkenhauer O. Tools for Understanding miRNA-mRNA Interactions for Reproducible RNA Analysis. Methods Mol Biol 2019;1912:199-214. [PMID: 30635895 DOI: 10.1007/978-1-4939-8982-9_8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/09/2023]

Santos HDAD, Oliveira MIS, Lima GDFAB, da Silva KM, S. Muniz RIVC, Lóscio BF. Investigations into data published and consumed on the Web: a systematic mapping study. JOURNAL OF THE BRAZILIAN COMPUTER SOCIETY 2018. [DOI: 10.1186/s13173-018-0077-z] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]

Doppelt-Azeroual O, Mareuil F, Deveaud E, Kalaš M, Soranzo N, van den Beek M, Grüning B, Ison J, Ménager H. ReGaTE: Registration of Galaxy Tools in Elixir. Gigascience 2018;6:1-4. [PMID: 28402416 PMCID: PMC5530318 DOI: 10.1093/gigascience/gix022] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 03/21/2017] [Indexed: 11/14/2022] Open

U-Index, a dataset and an impact metric for informatics tools and databases. Sci Data 2018;5:180043. [PMID: 29557976 PMCID: PMC5859919 DOI: 10.1038/sdata.2018.43] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 02/08/2018] [Indexed: 01/28/2023] Open

LabelFlow Framework for Annotating Workflow Provenance. INFORMATICS 2018. [DOI: 10.3390/informatics5010011] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open

Goddard TD, Huang CC, Meng EC, Pettersen EF, Couch GS, Morris JH, Ferrin TE. UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Protein Sci 2018;27:14-25. [PMID: 28710774 PMCID: PMC5734306 DOI: 10.1002/pro.3235] [Citation(s) in RCA: 2618] [Impact Index Per Article: 436.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Revised: 07/07/2017] [Accepted: 07/10/2017] [Indexed: 12/18/2022]

Hillion KH, Kuzmin I, Khodak A, Rasche E, Crusoe M, Peterson H, Ison J, Ménager H. Using bio.tools to generate and annotate workbench tool descriptions. F1000Res 2017;6:ELIXIR-2074. [PMID: 29333231 PMCID: PMC5747335 DOI: 10.12688/f1000research.12974.1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/26/2017] [Indexed: 11/20/2022] Open

Urdidiales‐Nieto D, Navas‐Delgado I, Aldana‐Montes JF. Biological Web Service Repositories Review. Mol Inform 2017;36:1600035. [PMID: 27783459 PMCID: PMC5434852 DOI: 10.1002/minf.201600035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2016] [Accepted: 09/27/2016] [Indexed: 12/26/2022]

Guardia GD, Ferreira Pires L, da Silva EG, de Farias CR. SemanticSCo: A platform to support the semantic composition of services for gene expression analysis. J Biomed Inform 2017;66:116-128. [DOI: 10.1016/j.jbi.2016.12.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2016] [Revised: 11/27/2016] [Accepted: 12/31/2016] [Indexed: 10/20/2022]

From the evaluation of existing solutions to an all-inclusive package for biobanks. HEALTH AND TECHNOLOGY 2017;7:89-95. [PMID: 28344915 PMCID: PMC5346419 DOI: 10.1007/s12553-016-0175-x] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2016] [Accepted: 12/19/2016] [Indexed: 11/26/2022]

Abstract

The domain of biobanking has gone through many stages and as a result there are a wide range of commercial and open source software solutions available. The utilization of these software tools requires different levels of domain and technical skills for installation, configuration and ultimate us of these biobank software tools. To compound this complexity the biobanking community are required to work together in order to share knowledge and jointly build solutions to underpin the research infrastructure. We have evaluated the available tools, described them in a catalogue (BiobankApps) and made a selection of tools available to biobanks in a reference toolbox (BIBBOX) that are use-case driven. In the BiobankApps tool catalogue, both commercial and open source software solutions related to the biobanking domain are included, classified and evaluated. The evaluation covers: 1) “user review” by an authenticated user 2) domain expert: quick analysis by BBMRI members and 3) domain expert: detailed analysis and test installation with real world data. The evaluation is paired with a survey across the more “advanced” (from a technology perspective) biobanks to investigate what tools are currently used and summarises known benefits/drawbacks of the respective packages. In the second step we recommend tools for specific use cases, and install, configure and connect these in the BIBBOX framework. This service also builds on the existing work in the United Kingdom in seeking to establish the motivations for different stakeholders to become involved and therefore assisting in prioritising the use-cases based on the level of need and support within the research community. All tools associated to a use-case are available as BIBBOX applications (technically this is achieved by docker containers), which are integrated in the BIBBOX framework with central identification and user management. In future work we plan to share the acquired knowledge with other networks, develop an Application Programmable Interface (API) for the exchange of metadata with other tool catalogues and work on an ontology for the evaluation of biobank software.

Collapse

Zaveri A, Dastgheib S, Wu C, Whetzel T, Verborgh R, Avillach P, Korodi G, Terryn R, Jagodnik K, Assis P, Dumontier M. smartAPI: Towards a More Intelligent Network of Web APIs. THE SEMANTIC WEB 2017. [DOI: 10.1007/978-3-319-58451-5_11] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]

Exploring Protein-Protein Interactions as Drug Targets for Anti-cancer Therapy with In Silico Workflows. Methods Mol Biol 2017;1647:221-236. [PMID: 28809006 DOI: 10.1007/978-1-4939-7201-2_15] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]

White Paper on Research Data Service Discoverability. PUBLICATIONS 2016. [DOI: 10.3390/publications5010001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open

Przybyła P, Shardlow M, Aubin S, Bossy R, Eckart de Castilho R, Piperidis S, McNaught J, Ananiadou S. Text mining resources for the life sciences. Database (Oxford) 2016;2016:baw145. [PMID: 27888231 PMCID: PMC5199186 DOI: 10.1093/database/baw145] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Revised: 10/13/2016] [Accepted: 10/17/2016] [Indexed: 11/18/2022]

Hardisty AR, Bacall F, Beard N, Balcázar-Vargas MP, Balech B, Barcza Z, Bourlat SJ, De Giovanni R, de Jong Y, De Leo F, Dobor L, Donvito G, Fellows D, Guerra AF, Ferreira N, Fetyukova Y, Fosso B, Giddy J, Goble C, Güntsch A, Haines R, Ernst VH, Hettling H, Hidy D, Horváth F, Ittzés D, Ittzés P, Jones A, Kottmann R, Kulawik R, Leidenberger S, Lyytikäinen-Saarenmaa P, Mathew C, Morrison N, Nenadic A, de la Hidalga AN, Obst M, Oostermeijer G, Paymal E, Pesole G, Pinto S, Poigné A, Fernandez FQ, Santamaria M, Saarenmaa H, Sipos G, Sylla KH, Tähtinen M, Vicario S, Vos RA, Williams AR, Yilmaz P. BioVeL: a virtual laboratory for data analysis and modelling in biodiversity science and ecology. BMC Ecol 2016;16:49. [PMID: 27765035 PMCID: PMC5073428 DOI: 10.1186/s12898-016-0103-y] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2016] [Accepted: 10/13/2016] [Indexed: 02/08/2023] Open

Abstract

Background

Making forecasts about biodiversity and giving support to policy relies increasingly on large collections of data held electronically, and on substantial computational capability and capacity to analyse, model, simulate and predict using such data. However, the physically distributed nature of data resources and of expertise in advanced analytical tools creates many challenges for the modern scientist. Across the wider biological sciences, presenting such capabilities on the Internet (as “Web services”) and using scientific workflow systems to compose them for particular tasks is a practical way to carry out robust “in silico” science. However, use of this approach in biodiversity science and ecology has thus far been quite limited.

Results

BioVeL is a virtual laboratory for data analysis and modelling in biodiversity science and ecology, freely accessible via the Internet. BioVeL includes functions for accessing and analysing data through curated Web services; for performing complex in silico analysis through exposure of R programs, workflows, and batch processing functions; for on-line collaboration through sharing of workflows and workflow runs; for experiment documentation through reproducibility and repeatability; and for computational support via seamless connections to supporting computing infrastructures. We developed and improved more than 60 Web services with significant potential in many different kinds of data analysis and modelling tasks. We composed reusable workflows using these Web services, also incorporating R programs. Deploying these tools into an easy-to-use and accessible ‘virtual laboratory’, free via the Internet, we applied the workflows in several diverse case studies. We opened the virtual laboratory for public use and through a programme of external engagement we actively encouraged scientists and third party application and tool developers to try out the services and contribute to the activity.

Conclusions

Our work shows we can deliver an operational, scalable and flexible Internet-based virtual laboratory to meet new demands for data processing and analysis in biodiversity science and ecology. In particular, we have successfully integrated existing and popular tools and practices from different scientific disciplines to be used in biodiversity and ecological research.

Electronic supplementary material

The online version of this article (doi:10.1186/s12898-016-0103-y) contains supplementary material, which is available to authorized users.

Collapse

Affiliation(s)

Alex R Hardisty School of Computer Science and Informatics, Cardiff University, Queens Buildings, 5 The Parade, Cardiff, CF24 3AA, UK.
Finn Bacall School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Niall Beard School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Maria-Paula Balcázar-Vargas Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, PO Box 94248, 1090, Amsterdam, The Netherlands
Bachir Balech Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), via Amendola 165/A, 70126, Bari, Italy
Zoltán Barcza Department of Meteorology, Eötvös Loránd University, Pázmány sétány 1/A, Budapest, 1117, Hungary
Sarah J Bourlat Department of Marine Sciences, University of Gothenburg, Box 463, 405 30, Gothenburg, Sweden
Renato De Giovanni Centro de Referência em Informação Ambiental, Avenida Dr. Romeu Tórtima, 388, Campinas, SP, 13084-791, Brazil
Yde de Jong Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, PO Box 94248, 1090, Amsterdam, The Netherlands.,SIB Labs, Joensuu Science Park, University of Eastern Finland, P.O. Box 111, 80101, Joensuu, Finland
Francesca De Leo Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), via Amendola 165/A, 70126, Bari, Italy
Laura Dobor Department of Meteorology, Eötvös Loránd University, Pázmány sétány 1/A, Budapest, 1117, Hungary
Giacinto Donvito Institute of Nuclear Physics (INFN), Via E. Orabona 4, 70125, Bari, Italy
Donal Fellows School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Antonio Fernandez Guerra Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, 28359, Bremen, Germany.,Jacobs University Bremen GmbH, Campus Ring 1, 28359, Bremen, Germany
Nuno Ferreira Stichting EGI (EGI.eu), Science Park 140, 1098, Amsterdam, The Netherlands
Yuliya Fetyukova SIB Labs, Joensuu Science Park, University of Eastern Finland, P.O. Box 111, 80101, Joensuu, Finland
Bruno Fosso Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), via Amendola 165/A, 70126, Bari, Italy
Jonathan Giddy School of Computer Science and Informatics, Cardiff University, Queens Buildings, 5 The Parade, Cardiff, CF24 3AA, UK
Carole Goble School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Anton Güntsch Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Königin-Luise-Strasse 6-8, 14195, Berlin, Germany
Robert Haines IT Services, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Vera Hernández Ernst Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Schloss Birlinghoven, 53757, Sankt Augustin, Germany
Hannes Hettling Naturalis Biodiversity Center, Postbus 9517, 2300, Leiden, The Netherlands
Dóra Hidy MTA-SZIE Plant Ecology Research Group, Szent István University, Páter K. u.1., Gödöllő, 2103, Hungary
Ferenc Horváth Institute of Ecology and Botany, Centre for Ecological Research, Hungarian Academy of Sciences, Alkotmány u. 2-4., Vácrátót, 2163, Hungary
Dóra Ittzés Institute of Ecology and Botany, Centre for Ecological Research, Hungarian Academy of Sciences, Alkotmány u. 2-4., Vácrátót, 2163, Hungary
Péter Ittzés Institute of Ecology and Botany, Centre for Ecological Research, Hungarian Academy of Sciences, Alkotmány u. 2-4., Vácrátót, 2163, Hungary
Andrew Jones School of Computer Science and Informatics, Cardiff University, Queens Buildings, 5 The Parade, Cardiff, CF24 3AA, UK
Renzo Kottmann Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, 28359, Bremen, Germany
Robert Kulawik Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Schloss Birlinghoven, 53757, Sankt Augustin, Germany
Sonja Leidenberger Swedish Species Information Centre/ArtDatabanken, Swedish University of Agricultural Sciences, Bäcklösavägen 10, 750 07, Uppsala, Sweden
Päivi Lyytikäinen-Saarenmaa Department of Forest Sciences, University of Helsinki, P.O. Box 27, 00014, Helsinki, Finland
Cherian Mathew Botanic Garden and Botanical Museum Berlin, Freie Universität Berlin, Königin-Luise-Strasse 6-8, 14195, Berlin, Germany
Norman Morrison School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Aleksandra Nenadic School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Abraham Nieva de la Hidalga School of Computer Science and Informatics, Cardiff University, Queens Buildings, 5 The Parade, Cardiff, CF24 3AA, UK
Matthias Obst Department of Marine Sciences, University of Gothenburg, Box 463, 405 30, Gothenburg, Sweden
Gerard Oostermeijer Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, PO Box 94248, 1090, Amsterdam, The Netherlands
Elisabeth Paymal Fondation pour la Recherche sur la Biodiversité (FRB), 195, rue Saint-Jacques, 75005, Paris, France
Graziano Pesole Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), via Amendola 165/A, 70126, Bari, Italy.,Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari "A. Moro", via Orabona, 1514, 70126, Bari, Italy
Salvatore Pinto Stichting EGI (EGI.eu), Science Park 140, 1098, Amsterdam, The Netherlands
Axel Poigné Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Schloss Birlinghoven, 53757, Sankt Augustin, Germany
Francisco Quevedo Fernandez School of Computer Science and Informatics, Cardiff University, Queens Buildings, 5 The Parade, Cardiff, CF24 3AA, UK
Monica Santamaria Institute of Biomembranes and Bioenergetics (IBBE), National Research Council (CNR), via Amendola 165/A, 70126, Bari, Italy
Hannu Saarenmaa SIB Labs, Joensuu Science Park, University of Eastern Finland, P.O. Box 111, 80101, Joensuu, Finland
Gergely Sipos Stichting EGI (EGI.eu), Science Park 140, 1098, Amsterdam, The Netherlands
Karl-Heinz Sylla Fraunhofer Institute for Intelligent Analysis and Information Systems (IAIS), Schloss Birlinghoven, 53757, Sankt Augustin, Germany
Marko Tähtinen Finnish Museum of Natural History, University of Helsinki, P.O. Box 17, 00014, Helsinki, Finland
Saverio Vicario Institute of Biomedical Technology (ITB), National Research Council (CNR), via Amendola 122/D, 70126, Bari, Italy
Rutger Aldo Vos Institute for Biodiversity and Ecosystem Dynamics (IBED), University of Amsterdam, PO Box 94248, 1090, Amsterdam, The Netherlands.,Naturalis Biodiversity Center, Postbus 9517, 2300, Leiden, The Netherlands
Alan R Williams School of Computer Science, University of Manchester, Kilburn Building, Oxford Road, Manchester, M13 9PL, UK
Pelin Yilmaz Max Planck Institute for Marine Microbiology, Celsiusstrasse 1, 28359, Bremen, Germany

Collapse

How Aphia—The Platform behind Several Online and Taxonomically Oriented Databases—Can Serve Both the Taxonomic Community and the Field of Biodiversity Informatics. JOURNAL OF MARINE SCIENCE AND ENGINEERING 2015. [DOI: 10.3390/jmse3041448] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]

Kanterakis A, Kuiper J, Potamias G, Swertz MA. PyPedia: using the wiki paradigm as crowd sourcing environment for bioinformatics protocols. SOURCE CODE FOR BIOLOGY AND MEDICINE 2015;10:14. [PMID: 26587054 PMCID: PMC4652372 DOI: 10.1186/s13029-015-0042-6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2015] [Accepted: 10/20/2015] [Indexed: 11/10/2022]

Abstract

Background

Today researchers can choose from many bioinformatics protocols for all types of life sciences research, computational environments and coding languages. Although the majority of these are open source, few of them possess all virtues to maximize reuse and promote reproducible science. Wikipedia has proven a great tool to disseminate information and enhance collaboration between users with varying expertise and background to author qualitative content via crowdsourcing. However, it remains an open question whether the wiki paradigm can be applied to bioinformatics protocols.

Results

We piloted PyPedia, a wiki where each article is both implementation and documentation of a bioinformatics computational protocol in the python language. Hyperlinks within the wiki can be used to compose complex workflows and induce reuse. A RESTful API enables code execution outside the wiki. Initial content of PyPedia contains articles for population statistics, bioinformatics format conversions and genotype imputation. Use of the easy to learn wiki syntax effectively lowers the barriers to bring expert programmers and less computer savvy researchers on the same page.

Conclusions

PyPedia demonstrates how wiki can provide a collaborative development, sharing and even execution environment for biologists and bioinformaticians that complement existing resources, useful for local and multi-center research teams.

Availability

PyPedia is available online at: http://www.pypedia.com. The source code and installation instructions are available at: https://github.com/kantale/PyPedia_server. The PyPedia python library is available at: https://github.com/kantale/pypedia. PyPedia is open-source, available under the BSD 2-Clause License.

Electronic supplementary material

The online version of this article (doi:10.1186/s13029-015-0042-6) contains supplementary material, which is available to authorized users.

Collapse

Ison J, Rapacki K, Ménager H, Kalaš M, Rydza E, Chmura P, Anthon C, Beard N, Berka K, Bolser D, Booth T, Bretaudeau A, Brezovsky J, Casadio R, Cesareni G, Coppens F, Cornell M, Cuccuru G, Davidsen K, Vedova GD, Dogan T, Doppelt-Azeroual O, Emery L, Gasteiger E, Gatter T, Goldberg T, Grosjean M, Grüning B, Helmer-Citterich M, Ienasescu H, Ioannidis V, Jespersen MC, Jimenez R, Juty N, Juvan P, Koch M, Laibe C, Li JW, Licata L, Mareuil F, Mičetić I, Friborg RM, Moretti S, Morris C, Möller S, Nenadic A, Peterson H, Profiti G, Rice P, Romano P, Roncaglia P, Saidi R, Schafferhans A, Schwämmle V, Smith C, Sperotto MM, Stockinger H, Vařeková RS, Tosatto SCE, de la Torre V, Uva P, Via A, Yachdav G, Zambelli F, Vriend G, Rost B, Parkinson H, Løngreen P, Brunak S. Tools and data services registry: a community effort to document bioinformatics resources. Nucleic Acids Res 2015;44:D38-47. [PMID: 26538599 PMCID: PMC4702812 DOI: 10.1093/nar/gkv1116] [Citation(s) in RCA: 86] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2015] [Accepted: 10/13/2015] [Indexed: 01/24/2023] Open

Affiliation(s)

Jon Ison Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Kristoffer Rapacki Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Hervé Ménager Centre d'Informatique pour la Biologie, C3BI, Institut Pasteur, France
Matúš Kalaš Computational Biology Unit, Department of Informatics, University of Bergen, Norway
Emil Rydza Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Piotr Chmura Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Christian Anthon Department of Veterinary Clinical and Animal Sciences, Faculty for Health and Medical Sciences, University of Copenhagen, Denmark
Niall Beard School of Computer Science, University of Manchester, UK
Karel Berka Department of Physical Chemistry, RCPTM, Faculty of Science, Palacky University, Czech Republic
Dan Bolser The European Bioinformatics Institute (EMBL-EBI), UK
Tim Booth NEBC Wallingford, Centre for Ecology and Hydrology, UK
Anthony Bretaudeau INRA, UMR Institut de Génétique, Environnement et Protection des Plantes (IGEPP), BioInformatics Platform for Agroecosystems Arthropods (BIPAA), France INRIA, IRISA, GenOuest Core Facility, France
Jan Brezovsky Loschmidt Laboratories, Department of Experimental Biology and Research Centre for Toxic Compounds in the Environment RECETOX, Masaryk University, Czech Republic
Rita Casadio Bologna Biocomputing Group, University of Bologna, Italy
Gianni Cesareni Dept. of Biology, University of Rome Tor Vergata, Italy
Frederik Coppens Department of Plant Systems Biology, VIB, Belgium Department of Plant Biotechnology and Bioinformatics, Ghent University, Belgium
Michael Cornell Faculty of Life Sciences, University of Manchester, UK
Gianmauro Cuccuru Bioinformatics, CRS4, Italy
Kristian Davidsen Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Gianluca Della Vedova Dept. of Computer Science, Systems and Communication. Univ. Milano-Bicocca, Italy
Tunca Dogan UniProt, European Bioinformatics Institute (EMBL-EBI), UK
Olivia Doppelt-Azeroual Centre d'Informatique pour la Biologie, C3BI, Institut Pasteur, France
Laura Emery The European Bioinformatics Institute (EMBL-EBI), UK
Elisabeth Gasteiger SIB Swiss Institute of Bioinformatics, Switzerland
Thomas Gatter Faculty of Technology and Center for Biotechnology, Universität Bielefeld, Germany
Tatyana Goldberg Department of Informatics, Bioinformatics-I12, TUM, Germany
Marie Grosjean Institut Français de Bioinformatique (French Institute of Bioinformatics), CNRS, UMS3601, France
Björn Grüning Albert-Ludwigs-Universität Freiburg, Fahnenbergplatz, 79085 Freiburg
Manuela Helmer-Citterich Centre for Molecular Bioinformatics, Dept. of Biology, University of Rome Tor Vergata, Italy
Hans Ienasescu Bioinformatics Centre, Department of Biology, University of Copenhagen, Denmark
Vassilios Ioannidis SIB Swiss Institute of Bioinformatics, Switzerland
Martin Closter Jespersen Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Rafael Jimenez The European Bioinformatics Institute (EMBL-EBI), UK
Nick Juty The European Bioinformatics Institute (EMBL-EBI), UK
Peter Juvan Centre for Functional Genomics and Biochips, Faculty of Medicine, University of Ljubljana, Slovenia
Maximilian Koch The European Bioinformatics Institute (EMBL-EBI), UK
Camille Laibe The European Bioinformatics Institute (EMBL-EBI), UK
Jing-Woei Li Faculty of Medicine, The Chinese University of Hong Kong, China Hong Kong Bioinformatics Centre, School of Life Sciences,The Chinese University of Hong Kong, China
Luana Licata Dept. of Biology, University of Rome Tor Vergata, Italy
Fabien Mareuil Centre d'Informatique pour la Biologie, C3BI, Institut Pasteur, France
Ivan Mičetić Department of Biomedical Sciences, University of Padua, Italy
Rune Møllegaard Friborg Bioinformatics Research Centre (BiRC), Denmark
Sebastien Moretti SIB Swiss Institute of Bioinformatics, Switzerland Department of Ecology and Evolution, Biophore, Evolutionary Bioinformatics group, University of Lausanne, Switzerland
Chris Morris STFC, Daresbury Laboratory, UK
Steffen Möller Department of Dermatology, University of Lübeck, Germany Institute for Biostatistics and Informatics in Medicine and Ageing Research, Rostock University Medical Center, Germany
Aleksandra Nenadic School of Computer Science, University of Manchester, UK
Hedi Peterson Institute of Computer Science, University of Tartu, Estonia
Giuseppe Profiti Bologna Biocomputing Group, University of Bologna, Italy
Peter Rice Department of Computing, William Penney Laboratory, Imperial College London, UK
Paolo Romano IRCCS AOU San Martino IST, Italy
Paola Roncaglia The European Bioinformatics Institute (EMBL-EBI), UK
Rabie Saidi UniProt, European Bioinformatics Institute (EMBL-EBI), UK
Andrea Schafferhans Department of Informatics, Bioinformatics-I12, TUM, Germany
Veit Schwämmle Protein Research Group, Department for Biochemistry and Molecular Biology, University of Southern Denmark, Denmark
Callum Smith Instruct, WTCHG, UK
Maria Maddalena Sperotto Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Heinz Stockinger SIB Swiss Institute of Bioinformatics, Switzerland
Radka Svobodová Vařeková Central European Institute of Technology (CEITEC), Czech Republic
Silvio C E Tosatto Department of Biomedical Sciences, University of Padua, Italy
Victor de la Torre National Bioinformatics Institute Unit (INB), Fundacion Centro Nacional de Investigaciones Oncologicas, Spain
Paolo Uva Bioinformatics, CRS4, Italy
Allegra Via Dept. of Physics, Sapienza University, Italy
Guy Yachdav Department of Informatics, Bioinformatics-I12, TUM, Germany
Federico Zambelli Institute of Biomembranes and Bioenergetics, National Research Council (CNR), and Dept. of Biosciences, University of Milano, Italy
Gert Vriend Radboud University Medical Centre, CMBI, Netherlands
Burkhard Rost Department of Informatics, Bioinformatics-I12, TUM, Germany
Helen Parkinson The European Bioinformatics Institute (EMBL-EBI), UK
Peter Løngreen Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark
Søren Brunak Center for Biological Sequence Analysis Department of Systems Biology, Technical University of Denmark, Denmark Novo Nordisk Foundation Center for Protein Research, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark

Collapse

Sfakianaki P, Koumakis L, Sfakianakis S, Iatraki G, Zacharioudakis G, Graf N, Marias K, Tsiknakis M. Semantic biomedical resource discovery: a Natural Language Processing framework. BMC Med Inform Decis Mak 2015;15:77. [PMID: 26423616 PMCID: PMC4591066 DOI: 10.1186/s12911-015-0200-4] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2015] [Accepted: 09/21/2015] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

A plethora of publicly available biomedical resources do currently exist and are constantly increasing at a fast rate. In parallel, specialized repositories are been developed, indexing numerous clinical and biomedical tools. The main drawback of such repositories is the difficulty in locating appropriate resources for a clinical or biomedical decision task, especially for non-Information Technology expert users. In parallel, although NLP research in the clinical domain has been active since the 1960s, progress in the development of NLP applications has been slow and lags behind progress in the general NLP domain. The aim of the present study is to investigate the use of semantics for biomedical resources annotation with domain specific ontologies and exploit Natural Language Processing methods in empowering the non-Information Technology expert users to efficiently search for biomedical resources using natural language.

METHODS

A Natural Language Processing engine which can "translate" free text into targeted queries, automatically transforming a clinical research question into a request description that contains only terms of ontologies, has been implemented. The implementation is based on information extraction techniques for text in natural language, guided by integrated ontologies. Furthermore, knowledge from robust text mining methods has been incorporated to map descriptions into suitable domain ontologies in order to ensure that the biomedical resources descriptions are domain oriented and enhance the accuracy of services discovery. The framework is freely available as a web application at ( http://calchas.ics.forth.gr/ ).

RESULTS

For our experiments, a range of clinical questions were established based on descriptions of clinical trials from the ClinicalTrials.gov registry as well as recommendations from clinicians. Domain experts manually identified the available tools in a tools repository which are suitable for addressing the clinical questions at hand, either individually or as a set of tools forming a computational pipeline. The results were compared with those obtained from an automated discovery of candidate biomedical tools. For the evaluation of the results, precision and recall measurements were used. Our results indicate that the proposed framework has a high precision and low recall, implying that the system returns essentially more relevant results than irrelevant.

CONCLUSIONS

There are adequate biomedical ontologies already available, sufficiency of existing NLP tools and quality of biomedical annotation systems for the implementation of a biomedical resources discovery framework, based on the semantic annotation of resources and the use on NLP techniques. The results of the present study demonstrate the clinical utility of the application of the proposed framework which aims to bridge the gap between clinical question in natural language and efficient dynamic biomedical resources discovery.

Collapse

Dahlö M, Haziza F, Kallio A, Korpelainen E, Bongcam-Rudloff E, Spjuth O. BioImg.org: A Catalog of Virtual Machine Images for the Life Sciences. Bioinform Biol Insights 2015;9:125-8. [PMID: 26401099 PMCID: PMC4567039 DOI: 10.4137/bbi.s28636] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Revised: 06/29/2015] [Accepted: 07/05/2015] [Indexed: 12/14/2022] Open

JMS: An Open Source Workflow Management System and Web-Based Cluster Front-End for High Performance Computing. PLoS One 2015;10:e0134273. [PMID: 26280450 PMCID: PMC4539224 DOI: 10.1371/journal.pone.0134273] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2015] [Accepted: 07/07/2015] [Indexed: 12/04/2022] Open

Costa GCB, Braga R, David JMN, Campos F. A Scientific Software Product Line for the Bioinformatics domain. J Biomed Inform 2015;56:239-64. [DOI: 10.1016/j.jbi.2015.05.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2015] [Revised: 04/04/2015] [Accepted: 05/19/2015] [Indexed: 11/17/2022]

Dobor L, Barcza Z, Hlásny T, Havasi Á, Horváth F, Ittzés P, Bartholy J. Bridging the gap between climate models and impact studies: the FORESEE Database. GEOSCIENCE DATA JOURNAL 2015;2:1-11. [PMID: 28616227 PMCID: PMC5445562 DOI: 10.1002/gdj3.22] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/27/2014] [Revised: 11/13/2014] [Accepted: 11/28/2014] [Indexed: 06/07/2023]

Velloso H, Vialle RA, Ortega JM. BOWS (bioinformatics open web services) to centralize bioinformatics tools in web services. BMC Res Notes 2015;8:206. [PMID: 26032494 PMCID: PMC4467627 DOI: 10.1186/s13104-015-1190-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2014] [Accepted: 05/20/2015] [Indexed: 11/13/2022] Open

Trends in IT Innovation to Build a Next Generation Bioinformatics Solution to Manage and Analyse Biological Big Data Produced by NGS Technologies. BIOMED RESEARCH INTERNATIONAL 2015;2015:904541. [PMID: 26125026 PMCID: PMC4466500 DOI: 10.1155/2015/904541] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/31/2014] [Revised: 04/01/2015] [Accepted: 04/01/2015] [Indexed: 02/07/2023]

Drug discovery FAQs: workflows for answering multidomain drug discovery questions. Drug Discov Today 2015;20:399-405. [DOI: 10.1016/j.drudis.2014.11.006] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Revised: 10/22/2014] [Accepted: 11/13/2014] [Indexed: 12/26/2022]

Duck G, Nenadic G, Brass A, Robertson DL, Stevens R. Extracting patterns of database and software usage from the bioinformatics literature. Bioinformatics 2015;30:i601-8. [PMID: 25161253 PMCID: PMC4147923 DOI: 10.1093/bioinformatics/btu471] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open

Oshita K, Tomita M, Arakawa K. G-Links: a gene-centric link acquisition service. F1000Res 2014;3:285. [PMID: 26673001 PMCID: PMC4670005 DOI: 10.12688/f1000research.5754.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 11/16/2015] [Indexed: 01/20/2023] Open

Repchevsky D, Gelpi JL. BioSWR--semantic web services registry for bioinformatics. PLoS One 2014;9:e107889. [PMID: 25233118 PMCID: PMC4169436 DOI: 10.1371/journal.pone.0107889] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2014] [Accepted: 08/21/2014] [Indexed: 11/28/2022] Open

Tsiliki G, Kossida S, Friesen N, Rüping S, Tzagarakis M, Karacapilidis N. A Data Mining Based Approach for Collaborative Analysis of Biomedical Data. INT J ARTIF INTELL T 2014. [DOI: 10.1142/s0218213014600100] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]

Rak R, Batista-Navarro RT, Carter J, Rowley A, Ananiadou S. Processing biological literature with customizable Web services supporting interoperable formats. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2014;2014:bau064. [PMID: 25006225 PMCID: PMC4086403 DOI: 10.1093/database/bau064] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Malone J, Brown A, Lister AL, Ison J, Hull D, Parkinson H, Stevens R. The Software Ontology (SWO): a resource for reproducibility in biomedical data analysis, curation and digital preservation. J Biomed Semantics 2014;5:25. [PMID: 25068035 PMCID: PMC4098953 DOI: 10.1186/2041-1480-5-25] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2013] [Accepted: 04/19/2014] [Indexed: 01/07/2023] Open

Abstract

Motivation

Biomedical ontologists to date have concentrated on ontological descriptions of biomedical entities such as gene products and their attributes, phenotypes and so on. Recently, effort has diversified to descriptions of the laboratory investigations by which these entities were produced. However, much biological insight is gained from the analysis of the data produced from these investigations, and there is a lack of adequate descriptions of the wide range of software that are central to bioinformatics. We need to describe how data are analyzed for discovery, audit trails, provenance and reproducibility.

Results

The Software Ontology (SWO) is a description of software used to store, manage and analyze data. Input to the SWO has come from beyond the life sciences, but its main focus is the life sciences. We used agile techniques to gather input for the SWO and keep engagement with our users. The result is an ontology that meets the needs of a broad range of users by describing software, its information processing tasks, data inputs and outputs, data formats versions and so on. Recently, the SWO has incorporated EDAM, a vocabulary for describing data and related concepts in bioinformatics. The SWO is currently being used to describe software used in multiple biomedical applications.

Conclusion

The SWO is another element of the biomedical ontology landscape that is necessary for the description of biomedical entities and how they were discovered. An ontology of software used to analyze data produced by investigations in the life sciences can be made in such a way that it covers the important features requested and prioritized by its users. The SWO thus fits into the landscape of biomedical ontologies and is produced using techniques designed to keep it in line with user’s needs.

Availability

The Software Ontology is available under an Apache 2.0 license at http://theswo.sourceforge.net/; the Software Ontology blog can be read at http://softwareontology.wordpress.com.

Collapse

Masseroli M, Mons B, Bongcam-Rudloff E, Ceri S, Kel A, Rechenmann F, Lisacek F, Romano P. Integrated Bio-Search: challenges and trends for the integration, search and comprehensive processing of biological information. BMC Bioinformatics 2014;15 Suppl 1:S2. [PMID: 24564249 PMCID: PMC4015876 DOI: 10.1186/1471-2105-15-s1-s2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open

Abstract

Many efforts exist to design and implement approaches and tools for data capture, integration and analysis in the life sciences. Challenges are not only the heterogeneity, size and distribution of information sources, but also the danger of producing too many solutions for the same problem. Methodological, technological, infrastructural and social aspects appear to be essential for the development of a new generation of best practices and tools. In this paper, we analyse and discuss these aspects from different perspectives, by extending some of the ideas that arose during the NETTAB 2012 Workshop, making reference especially to the European context. First, relevance of using data and software models for the management and analysis of biological data is stressed. Second, some of the most relevant community achievements of the recent years, which should be taken as a starting point for future efforts in this research domain, are presented. Third, some of the main outstanding issues, challenges and trends are analysed. The challenges related to the tendency to fund and create large scale international research infrastructures and public-private partnerships in order to address the complex challenges of data intensive science are especially discussed. The needs and opportunities of Genomic Computing (the integration, search and display of genomic information at a very specific level, e.g. at the level of a single DNA region) are then considered. In the current data and network-driven era, social aspects can become crucial bottlenecks. How these may best be tackled to unleash the technical abilities for effective data integration and validation efforts is then discussed. Especially the apparent lack of incentives for already overwhelmed researchers appears to be a limitation for sharing information and knowledge with other scientists. We point out as well how the bioinformatics market is growing at an unprecedented speed due to the impact that new powerful in silico analysis promises to have on better diagnosis, prognosis, drug discovery and treatment, towards personalized medicine. An open business model for bioinformatics, which appears to be able to reduce undue duplication of efforts and support the increased reuse of valuable data sets, tools and platforms, is finally discussed.

Collapse

Masseroli M, Picozzi M, Ghisalberti G, Ceri S. Explorative search of distributed bio-data to answer complex biomedical questions. BMC Bioinformatics 2014;15 Suppl 1:S3. [PMID: 24564278 PMCID: PMC4015759 DOI: 10.1186/1471-2105-15-s1-s3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The huge amount of biomedical-molecular data increasingly produced is providing scientists with potentially valuable information. Yet, such data quantity makes difficult to find and extract those data that are most reliable and most related to the biomedical questions to be answered, which are increasingly complex and often involve many different biomedical-molecular aspects. Such questions can be addressed only by comprehensively searching and exploring different types of data, which frequently are ordered and provided by different data sources. Search Computing has been proposed for the management and integration of ranked results from heterogeneous search services. Here, we present its novel application to the explorative search of distributed biomedical-molecular data and the integration of the search results to answer complex biomedical questions.

RESULTS

A set of available bioinformatics search services has been modelled and registered in the Search Computing framework, and a Bioinformatics Search Computing application (Bio-SeCo) using such services has been created and made publicly available at http://www.bioinformatics.deib.polimi.it/bio-seco/seco/. It offers an integrated environment which eases search, exploration and ranking-aware combination of heterogeneous data provided by the available registered services, and supplies global results that can support answering complex multi-topic biomedical questions.

CONCLUSIONS

By using Bio-SeCo, scientists can explore the very large and very heterogeneous biomedical-molecular data available. They can easily make different explorative search attempts, inspect obtained results, select the most appropriate, expand or refine them and move forward and backward in the construction of a global complex biomedical query on multiple distributed sources that could eventually find the most relevant results. Thus, it provides an extremely useful automated support for exploratory integrated bio search, which is fundamental for Life Science data driven knowledge discovery.

Collapse

Kamdar MR, Zeginis D, Hasnain A, Decker S, Deus HF. ReVeaLD: a user-driven domain-specific interactive search platform for biomedical research. J Biomed Inform 2013;47:112-30. [PMID: 24135450 DOI: 10.1016/j.jbi.2013.10.001] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2013] [Revised: 09/22/2013] [Accepted: 10/01/2013] [Indexed: 10/26/2022]

Abstract

Bioinformatics research relies heavily on the ability to discover and correlate data from various sources. The specialization of life sciences over the past decade, coupled with an increasing number of biomedical datasets available through standardized interfaces, has created opportunities towards new methods in biomedical discovery. Despite the popularity of semantic web technologies in tackling the integrative bioinformatics challenge, there are many obstacles towards its usage by non-technical research audiences. In particular, the ability to fully exploit integrated information needs using improved interactive methods intuitive to the biomedical experts. In this report we present ReVeaLD (a Real-time Visual Explorer and Aggregator of Linked Data), a user-centered visual analytics platform devised to increase intuitive interaction with data from distributed sources. ReVeaLD facilitates query formulation using a domain-specific language (DSL) identified by biomedical experts and mapped to a self-updated catalogue of elements from external sources. ReVeaLD was implemented in a cancer research setting; queries included retrieving data from in silico experiments, protein modeling and gene expression. ReVeaLD was developed using Scalable Vector Graphics and JavaScript and a demo with explanatory video is available at http://www.srvgal78.deri.ie:8080/explorer. A set of user-defined graphic rules controls the display of information through media-rich user interfaces. Evaluation of ReVeaLD was carried out as a game: biomedical researchers were asked to assemble a set of 5 challenge questions and time and interactions with the platform were recorded. Preliminary results indicate that complex queries could be formulated under less than two minutes by unskilled researchers. The results also indicate that supporting the identification of the elements of a DSL significantly increased intuitiveness of the platform and usability of semantic web technologies by domain users.

Collapse

Cokelaer T, Pultz D, Harder LM, Serra-Musach J, Saez-Rodriguez J. BioServices: a common Python package to access biological Web Services programmatically. ACTA ACUST UNITED AC 2013;29:3241-2. [PMID: 24064416 PMCID: PMC3842755 DOI: 10.1093/bioinformatics/btt547] [Citation(s) in RCA: 78] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]

Pipelined data‐flow delegated orchestration for data‐intensive eScience workflows. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS 2013. [DOI: 10.1108/ijwis-05-2013-0012] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]

Ison J, Kalas M, Jonassen I, Bolser D, Uludag M, McWilliam H, Malone J, Lopez R, Pettifer S, Rice P. EDAM: an ontology of bioinformatics operations, types of data and identifiers, topics and formats. Bioinformatics 2013;29:1325-32. [PMID: 23479348 PMCID: PMC3654706 DOI: 10.1093/bioinformatics/btt113] [Citation(s) in RCA: 126] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2012] [Revised: 02/28/2013] [Accepted: 03/01/2013] [Indexed: 11/14/2022] Open

McWilliam H, Li W, Uludag M, Squizzato S, Park YM, Buso N, Cowley AP, Lopez R. Analysis Tool Web Services from the EMBL-EBI. Nucleic Acids Res 2013;41:W597-600. [PMID: 23671338 PMCID: PMC3692137 DOI: 10.1093/nar/gkt376] [Citation(s) in RCA: 1184] [Impact Index Per Article: 107.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open

Wolstencroft K, Haines R, Fellows D, Williams A, Withers D, Owen S, Soiland-Reyes S, Dunlop I, Nenadic A, Fisher P, Bhagat J, Belhajjame K, Bacall F, Hardisty A, Nieva de la Hidalga A, Balcazar Vargas MP, Sufi S, Goble C. The Taverna workflow suite: designing and executing workflows of Web Services on the desktop, web or in the cloud. Nucleic Acids Res 2013;41:W557-61. [PMID: 23640334 PMCID: PMC3692062 DOI: 10.1093/nar/gkt328] [Citation(s) in RCA: 482] [Impact Index Per Article: 43.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023] Open

Korcsmaros T, Dunai ZA, Vellai T, Csermely P. Teaching the bioinformatics of signaling networks: an integrated approach to facilitate multi-disciplinary learning. Brief Bioinform 2013;14:618-32. [PMID: 23640570 DOI: 10.1093/bib/bbt024] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open

Pérez M, Berlanga R, Sanz I, Aramburu MJ. BioUSeR: a semantic-based tool for retrieving Life Science web resources driven by text-rich user requirements. J Biomed Semantics 2013;4:12. [PMID: 23635042 PMCID: PMC3698192 DOI: 10.1186/2041-1480-4-12] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2012] [Accepted: 04/18/2013] [Indexed: 12/05/2022] Open

Wollbrett J, Larmande P, de Lamotte F, Ruiz M. Clever generation of rich SPARQL queries from annotated relational schema: application to Semantic Web Service creation for biological databases. BMC Bioinformatics 2013;14:126. [PMID: 23586394 PMCID: PMC3680174 DOI: 10.1186/1471-2105-14-126] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2012] [Accepted: 03/25/2013] [Indexed: 11/10/2022] Open