1
|
Tiemann JKS, Szczuka M, Bouarroudj L, Oussaren M, Garcia S, Howard RJ, Delemotte L, Lindahl E, Baaden M, Lindorff-Larsen K, Chavent M, Poulain P. MDverse: Shedding Light on the Dark Matter of Molecular Dynamics Simulations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.05.02.538537. [PMID: 37205542 PMCID: PMC10187166 DOI: 10.1101/2023.05.02.538537] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
The rise of open science and the absence of a global dedicated data repository for molecular dynamics (MD) simulations has led to the accumulation of MD files in generalist data repositories, constituting the dark matter of MD - data that is technically accessible, but neither indexed, curated, or easily searchable. Leveraging an original search strategy, we found and indexed about 250,000 files and 2,000 datasets from Zenodo, Figshare and Open Science Framework. With a focus on files produced by the Gromacs MD software, we illustrate the potential offered by the mining of publicly available MD data. We identified systems with specific molecular composition and were able to characterize essential parameters of MD simulation such as temperature and simulation length, and could identify model resolution, such as all-atom and coarse-grain. Based on this analysis, we inferred metadata to propose a search engine prototype to explore the MD data. To continue in this direction, we call on the community to pursue the effort of sharing MD data, and to report and standardize metadata to reuse this valuable matter.
Collapse
|
2
|
Nègre D, Larhlimi A, Bertrand S. Reconciliation and evolution of Penicillium rubens genome-scale metabolic networks-What about specialised metabolism? PLoS One 2023; 18:e0289757. [PMID: 37647283 PMCID: PMC10468094 DOI: 10.1371/journal.pone.0289757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/07/2023] [Accepted: 07/24/2023] [Indexed: 09/01/2023] Open
Abstract
In recent years, genome sequencing of filamentous fungi has revealed a high proportion of specialised metabolites with growing pharmaceutical interest. However, detecting such metabolites through in silico genome analysis does not necessarily guarantee their expression under laboratory conditions. However, one plausible strategy for enabling their production lies in modifying the growth conditions. Devising a comprehensive experimental design testing in different culture environments is time-consuming and expensive. Therefore, using in silico modelling as a preliminary step, such as Genome-Scale Metabolic Network (GSMN), represents a promising approach to predicting and understanding the observed specialised metabolite production in a given organism. To address these questions, we reconstructed a new high-quality GSMN for the Penicillium rubens Wisconsin 54-1255 strain, a commonly used model organism. Our reconstruction, iPrub22, adheres to current convention standards and quality criteria, incorporating updated functional annotations, orthology searches with different GSMN templates, data from previous reconstructions, and manual curation steps targeting primary and specialised metabolites. With a MEMOTE score of 74% and a metabolic coverage of 45%, iPrub22 includes 5,192 unique metabolites interconnected by 5,919 reactions, of which 5,033 are supported by at least one genomic sequence. Of the metabolites present in iPrub22, 13% are categorised as belonging to specialised metabolism. While our high-quality GSMN provides a valuable resource for investigating known phenotypes expressed in P. rubens, our analysis identifies bottlenecks related, in particular, to the definition of what is a specialised metabolite, which requires consensus within the scientific community. It also points out the necessity of accessible, standardised and exhaustive databases of specialised metabolites. These questions must be addressed to fully unlock the potential of natural product production in P. rubens and other filamentous fungi. Our work represents a foundational step towards the objective of rationalising the production of natural products through GSMN modelling.
Collapse
Affiliation(s)
- Delphine Nègre
- Nantes Université, Institut des Substances et Organismes de la Mer, ISOMer, Nantes, France
- Nantes Université, École Centrale Nantes, CNRS, Nantes, France
| | | | - Samuel Bertrand
- Nantes Université, Institut des Substances et Organismes de la Mer, ISOMer, Nantes, France
| |
Collapse
|
3
|
Zieliński T, Hodge JJL, Millar AJ. Keep It Simple: Using README Files to Advance Standardization in Chronobiology. Clocks Sleep 2023; 5:499-506. [PMID: 37754351 PMCID: PMC10529918 DOI: 10.3390/clockssleep5030033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 08/25/2023] [Accepted: 08/28/2023] [Indexed: 09/28/2023] Open
Abstract
Standardization plays a crucial role in ensuring the reliability, reproducibility, and interoperability of research data in the biomedical sciences. Metadata standards are one foundation for the FAIR (Findable, Accessible, Interoperable, and Reusable) principles of data management. They facilitate data discovery, understanding, and reuse. However, the adoption of metadata standards in biological research lags in practice. Barriers such as complexity, lack of incentives, technical challenges, resource constraints, and resistance to change hinder widespread adoption. In the field of chronobiology, standardization is essential but faces particular challenges due to the longitudinal nature of experimental data, diverse model organisms, and varied measurement techniques. To address these challenges, we propose an approach that emphasizes simplicity and practicality: the development of README templates tailored for particular data types and species. Through this opinion article, our intention is to initiate a dialogue and commence a community-driven standardization process by engaging potential contributors and collaborators.
Collapse
Affiliation(s)
- Tomasz Zieliński
- Centre for Engineering Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JD, UK;
| | - James J. L. Hodge
- School of Physiology, Pharmacology and Neuroscience, University of Bristol, Bristol BS8 1TD, UK;
| | - Andrew J. Millar
- Centre for Engineering Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JD, UK;
| |
Collapse
|
4
|
Tatka LT, Smith LP, Hellerstein JL, Sauro HM. Adapting modeling and simulation credibility standards to computational systems biology. J Transl Med 2023; 21:501. [PMID: 37496031 PMCID: PMC10369698 DOI: 10.1186/s12967-023-04290-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2023] [Accepted: 06/19/2023] [Indexed: 07/28/2023] Open
Abstract
Computational models are increasingly used in high-impact decision making in science, engineering, and medicine. The National Aeronautics and Space Administration (NASA) uses computational models to perform complex experiments that are otherwise prohibitively expensive or require a microgravity environment. Similarly, the Food and Drug Administration (FDA) and European Medicines Agency (EMA) have began accepting models and simulations as forms of evidence for pharmaceutical and medical device approval. It is crucial that computational models meet a standard of credibility when using them in high-stakes decision making. For this reason, institutes including NASA, the FDA, and the EMA have developed standards to promote and assess the credibility of computational models and simulations. However, due to the breadth of models these institutes assess, these credibility standards are mostly qualitative and avoid making specific recommendations. On the other hand, modeling and simulation in systems biology is a narrower domain and several standards are already in place. As systems biology models increase in complexity and influence, the development of a credibility assessment system is crucial. Here we review existing standards in systems biology, credibility standards in other science, engineering, and medical fields, and propose the development of a credibility standard for systems biology models.
Collapse
Affiliation(s)
- Lillian T Tatka
- Department of Bioengineering, University of Washington, Seattle, WA, USA.
| | - Lucian P Smith
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | | | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
5
|
Mendes P. Reproducibility and FAIR principles: the case of a segment polarity network model. Front Cell Dev Biol 2023; 11:1201673. [PMID: 37346177 PMCID: PMC10279958 DOI: 10.3389/fcell.2023.1201673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 05/30/2023] [Indexed: 06/23/2023] Open
Abstract
The issue of reproducibility of computational models and the related FAIR principles (findable, accessible, interoperable, and reusable) are examined in a specific test case. I analyze a computational model of the segment polarity network in Drosophila embryos published in 2000. Despite the high number of citations to this publication, 23 years later the model is barely accessible, and consequently not interoperable. Following the text of the original publication allowed successfully encoding the model for the open source software COPASI. Subsequently saving the model in the SBML format allowed it to be reused in other open source software packages. Submission of this SBML encoding of the model to the BioModels database enables its findability and accessibility. This demonstrates how the FAIR principles can be successfully enabled by using open source software, widely adopted standards, and public repositories, facilitating reproducibility and reuse of computational cell biology models that will outlive the specific software used.
Collapse
Affiliation(s)
- Pedro Mendes
- Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, Farmington, CT, United States
- Department of Cell Biology, University of Connecticut School of Medicine, Farmington, CT, United States
| |
Collapse
|
6
|
Vittadello ST, Stumpf MPH. Open problems in mathematical biology. Math Biosci 2022; 354:108926. [PMID: 36377100 DOI: 10.1016/j.mbs.2022.108926] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2022] [Revised: 10/21/2022] [Accepted: 10/21/2022] [Indexed: 11/06/2022]
Abstract
Biology is data-rich, and it is equally rich in concepts and hypotheses. Part of trying to understand biological processes and systems is therefore to confront our ideas and hypotheses with data using statistical methods to determine the extent to which our hypotheses agree with reality. But doing so in a systematic way is becoming increasingly challenging as our hypotheses become more detailed, and our data becomes more complex. Mathematical methods are therefore gaining in importance across the life- and biomedical sciences. Mathematical models allow us to test our understanding, make testable predictions about future behaviour, and gain insights into how we can control the behaviour of biological systems. It has been argued that mathematical methods can be of great benefit to biologists to make sense of data. But mathematics and mathematicians are set to benefit equally from considering the often bewildering complexity inherent to living systems. Here we present a small selection of open problems and challenges in mathematical biology. We have chosen these open problems because they are of both biological and mathematical interest.
Collapse
Affiliation(s)
- Sean T Vittadello
- Melbourne Integrative Genomics, University of Melbourne, Australia; School of BioSciences, University of Melbourne, Australia
| | - Michael P H Stumpf
- Melbourne Integrative Genomics, University of Melbourne, Australia; School of BioSciences, University of Melbourne, Australia; School of Mathematics and Statistics, University of Melbourne, Australia.
| |
Collapse
|
7
|
pISA-tree - a data management framework for life science research projects using a standardised directory tree. Sci Data 2022; 9:685. [DOI: 10.1038/s41597-022-01805-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 10/24/2022] [Indexed: 11/12/2022] Open
Abstract
AbstractWe developed pISA-tree, a straightforward and flexible data management solution for organisation of life science project-associated research data and metadata. pISA-tree was initiated by end-user requirements thus its strong points are practicality and low maintenance cost. It enables on-the-fly creation of enriched directory tree structure (project/Investigation/Study/Assay) based on the ISA model, in a standardised manner via consecutive batch files. Templates-based metadata is generated in parallel at each level enabling guided submission of experiment metadata. pISA-tree is complemented by two R packages, pisar and seekr. pisar facilitates integration of pISA-tree datasets into bioinformatic pipelines and generation of ISA-Tab exports. seekr enables synchronisation with the FAIRDOMHub repository. Applicability of pISA-tree was demonstrated in several national and international multi-partner projects. The system thus supports findable, accessible, interoperable and reusable (FAIR) research and is in accordance with the Open Science initiative. Source code and documentation of pISA-tree are available at https://github.com/NIB-SI/pISA-tree.
Collapse
|
8
|
Eriksson O, Bhalla US, Blackwell KT, Crook SM, Keller D, Kramer A, Linne ML, Saudargienė A, Wade RC, Hellgren Kotaleski J. Combining hypothesis- and data-driven neuroscience modeling in FAIR workflows. eLife 2022; 11:e69013. [PMID: 35792600 PMCID: PMC9259018 DOI: 10.7554/elife.69013] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Accepted: 05/13/2022] [Indexed: 12/22/2022] Open
Abstract
Modeling in neuroscience occurs at the intersection of different points of view and approaches. Typically, hypothesis-driven modeling brings a question into focus so that a model is constructed to investigate a specific hypothesis about how the system works or why certain phenomena are observed. Data-driven modeling, on the other hand, follows a more unbiased approach, with model construction informed by the computationally intensive use of data. At the same time, researchers employ models at different biological scales and at different levels of abstraction. Combining these models while validating them against experimental data increases understanding of the multiscale brain. However, a lack of interoperability, transparency, and reusability of both models and the workflows used to construct them creates barriers for the integration of models representing different biological scales and built using different modeling philosophies. We argue that the same imperatives that drive resources and policy for data - such as the FAIR (Findable, Accessible, Interoperable, Reusable) principles - also support the integration of different modeling approaches. The FAIR principles require that data be shared in formats that are Findable, Accessible, Interoperable, and Reusable. Applying these principles to models and modeling workflows, as well as the data used to constrain and validate them, would allow researchers to find, reuse, question, validate, and extend published models, regardless of whether they are implemented phenomenologically or mechanistically, as a few equations or as a multiscale, hierarchical system. To illustrate these ideas, we use a classical synaptic plasticity model, the Bienenstock-Cooper-Munro rule, as an example due to its long history, different levels of abstraction, and implementation at many scales.
Collapse
Affiliation(s)
- Olivia Eriksson
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
| | - Upinder Singh Bhalla
- National Center for Biological Sciences, Tata Institute of Fundamental ResearchBangaloreIndia
| | - Kim T Blackwell
- Department of Bioengineering, Volgenau School of Engineering, George Mason UniversityFairfaxUnited States
| | - Sharon M Crook
- School of Mathematical and Statistical Sciences, Arizona State UniversityTempeUnited States
| | - Daniel Keller
- Blue Brain Project, École Polytechnique Fédérale de LausanneLausanneSwitzerland
| | - Andrei Kramer
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| | - Marja-Leena Linne
- Faculty of Medicine and Health Technology, Tampere UniversityTampereFinland
| | - Ausra Saudargienė
- Neuroscience Institute, Lithuanian University of Health SciencesKaunasLithuania
- Department of Informatics, Vytautas Magnus UniversityKaunasLithuania
| | - Rebecca C Wade
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS)HeidelbergGermany
- Center for Molecular Biology (ZMBH), ZMBH-DKFZ Alliance, University of HeidelbergHeidelbergGermany
- Interdisciplinary Center for Scientific Computing (IWR), Heidelberg UniversityHeidelbergGermany
| | - Jeanette Hellgren Kotaleski
- Science for Life Laboratory, School of Electrical Engineering and Computer Science, KTH Royal Institute of TechnologyStockholmSweden
- Department of Neuroscience, Karolinska InstituteStockholmSweden
| |
Collapse
|
9
|
Khurana S, Schivo S, Plass JRM, Mersinis N, Scholma J, Kerkhofs J, Zhong L, van de Pol J, Langerak R, Geris L, Karperien M, Post JN. An ECHO of Cartilage: In Silico Prediction of Combinatorial Treatments to Switch Between Transient and Permanent Cartilage Phenotypes With Ex Vivo Validation. Front Bioeng Biotechnol 2021; 9:732917. [PMID: 34869253 PMCID: PMC8634894 DOI: 10.3389/fbioe.2021.732917] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 10/25/2021] [Indexed: 11/13/2022] Open
Abstract
A fundamental question in cartilage biology is: what determines the switch between permanent cartilage found in the articular joints and transient hypertrophic cartilage that functions as a template for bone? This switch is observed both in a subset of OA patients that develop osteophytes, as well as in cell-based tissue engineering strategies for joint repair. A thorough understanding of the mechanisms regulating cell fate provides opportunities for treatment of cartilage disease and tissue engineering strategies. The objective of this study was to understand the mechanisms that regulate the switch between permanent and transient cartilage using a computational model of chondrocytes, ECHO. To investigate large signaling networks that regulate cell fate decisions, we developed the software tool ANIMO, Analysis of Networks with interactive Modeling. In ANIMO, we generated an activity network integrating 7 signal transduction pathways resulting in a network containing over 50 proteins with 200 interactions. We called this model ECHO, for executable chondrocyte. Previously, we showed that ECHO could be used to characterize mechanisms of cell fate decisions. ECHO was first developed based on a Boolean model of growth plate. Here, we show how the growth plate Boolean model was translated to ANIMO and how we adapted the topology and parameters to generate an articular cartilage model. In ANIMO, many combinations of overactivation/knockout were tested that result in a switch between permanent cartilage (SOX9+) and transient, hypertrophic cartilage (RUNX2+). We used model checking to prioritize combination treatments for wet-lab validation. Three combinatorial treatments were chosen and tested on metatarsals from 1-day old rat pups that were treated for 6 days. We found that a combination of IGF1 with inhibition of ERK1/2 had a positive effect on cartilage formation and growth, whereas activation of DLX5 combined with inhibition of PKA had a negative effect on cartilage formation and growth and resulted in increased cartilage hypertrophy. We show that our model describes cartilage formation, and that model checking can aid in choosing and prioritizing combinatorial treatments that interfere with normal cartilage development. Here we show that combinatorial treatments induce changes in the zonal distribution of cartilage, indication possible switches in cell fate. This indicates that simulations in ECHO aid in describing pathologies in which switches between cell fates are observed, such as OA.
Collapse
Affiliation(s)
- Sakshi Khurana
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Stefano Schivo
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands.,Department of Formal Methods and Tools, CTIT Institute, University of Twente, Enschede, Netherlands
| | - Jacqueline R M Plass
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Nikolas Mersinis
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Jetse Scholma
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Johan Kerkhofs
- Biomechanics Research Unit, GIGA In Silico Medicine, ULiège, Liège, Belgium
| | - Leilei Zhong
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Jaco van de Pol
- Department of Formal Methods and Tools, CTIT Institute, University of Twente, Enschede, Netherlands.,Dept. of Computer Science, Aarhus University, Aarhus, Denmark
| | - Rom Langerak
- Department of Formal Methods and Tools, CTIT Institute, University of Twente, Enschede, Netherlands
| | - Liesbet Geris
- Biomechanics Section, Department of Mechanical Engineering, KU Leuven, Leuven, Belgium
| | - Marcel Karperien
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| | - Janine N Post
- Technical Medicine Centre, Department of Developmental BioEngineering, University of Twente, Enschede, Netherlands
| |
Collapse
|
10
|
Budde K, Smith J, Wilsdorf P, Haack F, Uhrmacher AM. Relating simulation studies by provenance-Developing a family of Wnt signaling models. PLoS Comput Biol 2021; 17:e1009227. [PMID: 34351901 PMCID: PMC8407594 DOI: 10.1371/journal.pcbi.1009227] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Revised: 08/31/2021] [Accepted: 06/29/2021] [Indexed: 12/28/2022] Open
Abstract
For many biological systems, a variety of simulation models exist. A new simulation model is rarely developed from scratch, but rather revises and extends an existing one. A key challenge, however, is to decide which model might be an appropriate starting point for a particular problem and why. To answer this question, we need to identify entities and activities that contributed to the development of a simulation model. Therefore, we exploit the provenance data model, PROV-DM, of the World Wide Web Consortium and, building on previous work, continue developing a PROV ontology for simulation studies. Based on a case study of 19 Wnt/β-catenin signaling models, we identify crucial entities and activities as well as useful metadata to both capture the provenance information from individual simulation studies and relate these forming a family of models. The approach is implemented in WebProv, a web application for inserting and querying provenance information. Our specialization of PROV-DM contains the entities Research Question, Assumption, Requirement, Qualitative Model, Simulation Model, Simulation Experiment, Simulation Data, and Wet-lab Data as well as activities referring to building, calibrating, validating, and analyzing a simulation model. We show that most Wnt simulation models are connected to other Wnt models by using (parts of) these models. However, the overlap, especially regarding the Wet-lab Data used for calibration or validation of the models is small. Making these aspects of developing a model explicit and queryable is an important step for assessing and reusing simulation models more effectively. Exposing this information helps to integrate a new simulation model within a family of existing ones and may lead to the development of more robust and valid simulation models. We hope that our approach becomes part of a standardization effort and that modelers adopt the benefits of provenance when considering or creating simulation models. We revise a provenance ontology for simulation studies of cellular biochemical models. Provenance information is useful for understanding the creation of a simulation model because it not only contains information about the entities and activities that have led to a simulation model but also their relations, all of which can be visualized. It provides additional structure by explicitly recording research questions, assumptions, and requirements and relating them along with data, qualitative models, simulation models, and simulation experiments through a small set of predefined but extensible activities. We have applied our concept to a family of 19 Wnt signaling models and implemented a web-based tool (WebProv) to store the provenance information from these studies. The resulting provenance graph visualizes the story line of simulation studies and demonstrates the creation and calibration of simulation models, the successive attempts of validation and extension, and shows, beyond an individual simulation study, how the Wnt models are related. Thereby, the steps and sources that contributed to a simulation model are made explicit. Our approach complements other approaches aimed at facilitating the reuse and assessment of simulation products in systems biology such as model repositories as well as annotation and documentation guidelines.
Collapse
Affiliation(s)
- Kai Budde
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
- * E-mail:
| | - Jacob Smith
- Faculty of Computer Science, University of New Brunswick, Fredericton, Canada
| | - Pia Wilsdorf
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| | - Fiete Haack
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| | - Adelinde M. Uhrmacher
- Institute for Visual and Analytic Computing, University of Rostock, Rostock, Germany
| |
Collapse
|
11
|
Wiechert W, Nöh K. Quantitative Metabolic Flux Analysis Based on Isotope Labeling. Metab Eng 2021. [DOI: 10.1002/9783527823468.ch3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
12
|
Carey MA, Dräger A, Beber ME, Papin JA, Yurkovich JT. Community standards to facilitate development and address challenges in metabolic modeling. Mol Syst Biol 2021; 16:e9235. [PMID: 32845080 PMCID: PMC8411906 DOI: 10.15252/msb.20199235] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023] Open
Abstract
Standardization of data and models facilitates effective communication, especially in computational systems biology. However, both the development and consistent use of standards and resources remain challenging. As a result, the amount, quality, and format of the information contained within systems biology models are not consistent and therefore present challenges for widespread use and communication. Here, we focused on these standards, resources, and challenges in the field of constraint-based metabolic modeling by conducting a community-wide survey. We used this feedback to (i) outline the major challenges that our field faces and to propose solutions and (ii) identify a set of features that defines what a "gold standard" metabolic network reconstruction looks like concerning content, annotation, and simulation capabilities. We anticipate that this community-driven outline will help the long-term development of community-inspired resources as well as produce high-quality, accessible models within our field. More broadly, we hope that these efforts can serve as blueprints for other computational modeling communities to ensure the continued development of both practical, usable standards and reproducible, knowledge-rich models.
Collapse
Affiliation(s)
- Maureen A Carey
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, VA, USA
| | - Andreas Dräger
- Computational Systems Biology of Infection and Antimicrobial-Resistant Pathogens, Institute for Biomedical Informatics (IBMI), University of Tübingen, Tübingen, Germany.,Department of Computer Science, University of Tübingen, Tübingen, Germany.,German Center for Infection Research (DZIF), partner site Tübingen, Tübingen, Germany
| | - Moritz E Beber
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kemitorvet, Denmark
| | - Jason A Papin
- Division of Infectious Diseases and International Health, Department of Medicine, University of Virginia, Charlottesville, VA, USA.,Department of Biomedical Engineering, University of Virginia, Charlottesville, VA, USA
| | | |
Collapse
|
13
|
Eiber CD, Delbeke J, Cardoso J, de Neeling M, John SE, Won Lee C, Skefos J, Sun A, Prodanov D, McKinney Z. Preliminary Minimum Reporting Requirements for In-Vivo Neural Interface Research: I. Implantable Neural Interfaces. IEEE OPEN JOURNAL OF ENGINEERING IN MEDICINE AND BIOLOGY 2021; 2:74-83. [PMID: 33997788 PMCID: PMC8118094 DOI: 10.1109/ojemb.2021.3060919] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The pace of research and development in neuroscience, neurotechnology, and neurorehabilitation is rapidly accelerating, with the number of publications doubling every 4.2 years. Maintaining this progress requires technological standards and scientific reporting guidelines to provide frameworks for communication and interoperability. The present lack of such neurotechnology standards limits the transparency, repro-ducibility, and meta-analysis of this growing body of literature, posing an ongoing barrier to research, clinical, and commercial objectives. Continued neurotechnological innovation requires the development of some minimal standards to promote integration between this broad spectrum of technologies and therapies. To preserve design freedom and accelerate the translation of research into safe and effective technologies with maximal user benefit, such standards must be collaboratively co-developed by the full range of neuroscience and neurotechnology stakeholders. This paper summarizes the preliminary recommendations of IEEE P2794 Standards Working Group, developing a Reporting Standard for in-vivo Neural Interface Research (RSNIR).
Collapse
Affiliation(s)
| | | | - Jorge Cardoso
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon 1649-028, Portugal
| | | | - Sam E John
- University of Melbourne, Melbourne 3010, Australia
| | | | | | - Argus Sun
- University of California, Los Angeles, CA 90095 USA
| | | | - Zach McKinney
- BioRobotics Institute and Center for Excellence in Robotics and AI, Scuola Superiore Sant'Anna, 56127 Pisa, Italy
| |
Collapse
|
14
|
|
15
|
Rampadarath A, Nickerson DP. Automated Execution of Simulation Studies in Systems Medicine Using SED-ML and COMBINE Archive. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11688-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
16
|
König M. Executable Simulation Model of the Liver. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11682-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
|
17
|
Abstract
This book chapter is drafted for biologists with experimental experiences in ROS biology but being newcomers in the field of modeling. We start with a general introduction about computational modeling in biology and an overview of software tools suitable for beginners. This chapter encompasses an introduction to computational models with special focus on simulation of ROS dynamics. A step-by-step tutorial follows providing guidance for all relevant model development processes. This course of action gives a comprehensible way to understand the benefits of computational models and to gain the necessary knowledge to build own small equation-based models. Small models can be created without any special programming expertise or in-depth technical and mathematical knowledge. Afterward in the final section, a short overview of pitfalls, challenges, and limitations is provided, combined with suggestions for further reading to improve and expand modeling skills of biologists.
Collapse
Affiliation(s)
- Jana Schleicher
- Experimental Transplantation Surgery, Department of General, Visceral and Vascular Surgery, University Hospital Jena, Jena, Germany.
| |
Collapse
|
18
|
Janzén D, Gennemark P, Hovdal D, Jansson-Löfmark R, Ahlström C. Dynamical Modeling of Different Drug Modalities in Drug Research. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11542-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
19
|
Forero DA, Lopez-Leon S, Perry G. A brief guide to the science and art of writing manuscripts in biomedicine. J Transl Med 2020; 18:425. [PMID: 33167977 PMCID: PMC7653709 DOI: 10.1186/s12967-020-02596-2] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2020] [Accepted: 10/29/2020] [Indexed: 12/12/2022] Open
Abstract
Publishing articles in international scientific journals is the primary method for the communication of validated research findings and ideas. Journal articles are commonly used as a major input for evaluations of researchers and institutions. Few articles have been published previously about the different aspects needed for writing high-quality articles. In this manuscript, we provide an updated and brief guide for the multiple dimensions needed for writing manuscripts in the health and biological sciences, from current, international and interdisciplinary perspectives and from our expertise as authors, peer reviewers and editors. We provide key suggestions for writing major sections of the manuscript (e.g. title, abstract, introduction, methods, results and discussion), for submitting the manuscript and bring an overview of the peer review process and of the post-publication impact of the articles.
Collapse
Affiliation(s)
- Diego A Forero
- Health and Sport Sciences Research Group, School of Health and Sport Sciences, Fundación Universitaria del Área Andina, Bogotá, Colombia.
- MSc Program in Epidemiology, School of Health and Sport Sciences, Fundación Universitaria del Área Andina, Bogotá, Colombia.
| | - Sandra Lopez-Leon
- Global Drug Development, Novartis Pharmaceuticals Corporation, East Hanover, NJ, USA.
| | - George Perry
- Department of Biology and Neurosciences Institute, The University of Texas at San Antonio, San Antonio, TX, USA
| |
Collapse
|
20
|
Papadiamantis AG, Klaessig FC, Exner TE, Hofer S, Hofstaetter N, Himly M, Williams MA, Doganis P, Hoover MD, Afantitis A, Melagraki G, Nolan TS, Rumble J, Maier D, Lynch I. Metadata Stewardship in Nanosafety Research: Community-Driven Organisation of Metadata Schemas to Support FAIR Nanoscience Data. NANOMATERIALS (BASEL, SWITZERLAND) 2020; 10:E2033. [PMID: 33076428 PMCID: PMC7602672 DOI: 10.3390/nano10102033] [Citation(s) in RCA: 28] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/23/2020] [Revised: 10/08/2020] [Accepted: 10/11/2020] [Indexed: 12/15/2022]
Abstract
The emergence of nanoinformatics as a key component of nanotechnology and nanosafety assessment for the prediction of engineered nanomaterials (NMs) properties, interactions, and hazards, and for grouping and read-across to reduce reliance on animal testing, has put the spotlight firmly on the need for access to high-quality, curated datasets. To date, the focus has been around what constitutes data quality and completeness, on the development of minimum reporting standards, and on the FAIR (findable, accessible, interoperable, and reusable) data principles. However, moving from the theoretical realm to practical implementation requires human intervention, which will be facilitated by the definition of clear roles and responsibilities across the complete data lifecycle and a deeper appreciation of what metadata is, and how to capture and index it. Here, we demonstrate, using specific worked case studies, how to organise the nano-community efforts to define metadata schemas, by organising the data management cycle as a joint effort of all players (data creators, analysts, curators, managers, and customers) supervised by the newly defined role of data shepherd. We propose that once researchers understand their tasks and responsibilities, they will naturally apply the available tools. Two case studies are presented (modelling of particle agglomeration for dose metrics, and consensus for NM dissolution), along with a survey of the currently implemented metadata schema in existing nanosafety databases. We conclude by offering recommendations on the steps forward and the needed workflows for metadata capture to ensure FAIR nanosafety data.
Collapse
Affiliation(s)
- Anastasios G. Papadiamantis
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK
- Novamechanics Ltd., 1065 Nicosia, Cyprus; (A.A.); (G.M.)
| | | | | | - Sabine Hofer
- Department of Biosciences, Paris Lodron University of Salzburg, 5020 Salzburg, Austria; (S.H.); (N.H.); (M.H.)
| | - Norbert Hofstaetter
- Department of Biosciences, Paris Lodron University of Salzburg, 5020 Salzburg, Austria; (S.H.); (N.H.); (M.H.)
| | - Martin Himly
- Department of Biosciences, Paris Lodron University of Salzburg, 5020 Salzburg, Austria; (S.H.); (N.H.); (M.H.)
| | - Marc A. Williams
- U.S. Army Public Health Center (APHC), Aberdeen Proving Ground—South, Aberdeen, MD 21010, USA;
| | - Philip Doganis
- School of Chemical Engineering, National Technical University of Athens, 157 80 Athens, Greece;
| | | | | | | | - Tracy S. Nolan
- Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR 72205, USA;
| | - John Rumble
- R&R Data Services, Gaithersburg, MD 20877, USA;
- CODATA-VAMAS Working Group on Nanomaterials, 75016 Paris, France
| | | | - Iseult Lynch
- School of Geography, Earth and Environmental Sciences, University of Birmingham, Birmingham B15 2TT, UK
| |
Collapse
|
21
|
Porubsky VL, Goldberg AP, Rampadarath AK, Nickerson DP, Karr JR, Sauro HM. Best Practices for Making Reproducible Biochemical Models. Cell Syst 2020; 11:109-120. [PMID: 32853539 DOI: 10.1016/j.cels.2020.06.012] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2019] [Revised: 05/15/2020] [Accepted: 06/24/2020] [Indexed: 01/03/2023]
Abstract
Like many scientific disciplines, dynamical biochemical modeling is hindered by irreproducible results. This limits the utility of biochemical models by making them difficult to understand, trust, or reuse. We comprehensively list the best practices that biochemical modelers should follow to build reproducible biochemical model artifacts-all data, model descriptions, and custom software used by the model-that can be understood and reused. The best practices provide advice for all steps of a typical biochemical modeling workflow in which a modeler collects data; constructs, trains, simulates, and validates the model; uses the predictions of a model to advance knowledge; and publicly shares the model artifacts. The best practices emphasize the benefits obtained by using standard tools and formats and provides guidance to modelers who do not or cannot use standards in some stages of their modeling workflow. Adoption of these best practices will enhance the ability of researchers to reproduce, understand, and reuse biochemical models.
Collapse
Affiliation(s)
- Veronica L Porubsky
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA.
| | - Arthur P Goldberg
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA.
| | - Anand K Rampadarath
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - David P Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Jonathan R Karr
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA; Icahn Institute for Data Science and Genomic Technology, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| | - Herbert M Sauro
- Department of Bioengineering, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
22
|
Comte B, Baumbach J, Benis A, Basílio J, Debeljak N, Flobak Å, Franken C, Harel N, He F, Kuiper M, Méndez Pérez JA, Pujos-Guillot E, Režen T, Rozman D, Schmid JA, Scerri J, Tieri P, Van Steen K, Vasudevan S, Watterson S, Schmidt HH. Network and Systems Medicine: Position Paper of the European Collaboration on Science and Technology Action on Open Multiscale Systems Medicine. NETWORK AND SYSTEMS MEDICINE 2020; 3:67-90. [PMID: 32954378 PMCID: PMC7500076 DOI: 10.1089/nsm.2020.0004] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/18/2020] [Indexed: 12/14/2022] Open
Abstract
Introduction: Network and systems medicine has rapidly evolved over the past decade, thanks to computational and integrative tools, which stem in part from systems biology. However, major challenges and hurdles are still present regarding validation and translation into clinical application and decision making for precision medicine. Methods: In this context, the Collaboration on Science and Technology Action on Open Multiscale Systems Medicine (OpenMultiMed) reviewed the available advanced technologies for multidimensional data generation and integration in an open-science approach as well as key clinical applications of network and systems medicine and the main issues and opportunities for the future. Results: The development of multi-omic approaches as well as new digital tools provides a unique opportunity to explore complex biological systems and networks at different scales. Moreover, the application of findable, applicable, interoperable, and reusable principles and the adoption of standards increases data availability and sharing for multiscale integration and interpretation. These innovations have led to the first clinical applications of network and systems medicine, particularly in the field of personalized therapy and drug dosing. Enlarging network and systems medicine application would now imply to increase patient engagement and health care providers as well as to educate the novel generations of medical doctors and biomedical researchers to shift the current organ- and symptom-based medical concepts toward network- and systems-based ones for more precise diagnoses, interventions, and ideally prevention. Conclusion: In this dynamic setting, the health care system will also have to evolve, if not revolutionize, in terms of organization and management.
Collapse
Affiliation(s)
- Blandine Comte
- Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Université Clermont Auvergne, INRAE, UNH, Clermont-Ferrand, France
| | - Jan Baumbach
- TUM School of Life Sciences Weihenstephan (WZW), Technical University of Munich (TUM), Freising-Weihenstephan, Germany
| | | | - José Basílio
- Institute of Vascular Biology and Thrombosis Research, Center for Physiology and Pharmacology, Medical University of Vienna, Vienna, Austria
| | - Nataša Debeljak
- Medical Centre for Molecular Biology, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Åsmund Flobak
- Department of Clinical and Molecular Medicine, Norwegian University of Science and Technology, Trondheim, Norway
- The Cancer Clinic, St. Olav's University Hospital, Trondheim, Norway
| | - Christian Franken
- Digital Health Systems, Einsingen, Germany
- Department of Pharmacology and Personalised Medicine, Faculty of Health, Medicine and Life Science, Maastricht University, Maastricht, The Netherlands
| | | | - Feng He
- Department of Infection and Immunity, Luxembourg Institute of Health, Esch-sur-Alzette, Luxembourg
- Institute of Medical Microbiology, University Hospital Essen, University of Duisburg-Essen, Essen, Germany
| | - Martin Kuiper
- Department of Biology, Faculty of Natural Sciences, Norwegian University of Science and Technology, Trondheim, Norway
| | - Juan Albino Méndez Pérez
- Department of Computer Science and Systems Engineering, Universidad de La Laguna, Tenerife, Spain
| | - Estelle Pujos-Guillot
- Plateforme d'Exploration du Métabolisme, MetaboHUB Clermont, Université Clermont Auvergne, INRAE, UNH, Clermont-Ferrand, France
| | - Tadeja Režen
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Damjana Rozman
- Centre for Functional Genomics and Bio-Chips, Institute of Biochemistry, Faculty of Medicine, University of Ljubljana, Ljubljana, Slovenia
| | - Johannes A. Schmid
- Institute of Vascular Biology and Thrombosis Research, Center for Physiology and Pharmacology, Medical University of Vienna, Vienna, Austria
| | - Jeanesse Scerri
- Department of Physiology and Biochemistry, Faculty of Medicine and Surgery, University of Malta, Msida, Malta
| | - Paolo Tieri
- CNR National Research Council, IAC Institute for Applied Computing, Rome, Italy
| | | | - Sona Vasudevan
- Georgetown University Medical Centre, Washington, District of Columbia, USA
| | - Steven Watterson
- Northern Ireland Centre for Stratified Medicine, Ulster University, Londonderry, United Kingdom
| | - Harald H.H.W. Schmidt
- Department of Pharmacology and Personalised Medicine, Faculty of Health, Medicine and Life Science, MeHNS, Maastricht University, The Netherlands
| |
Collapse
|
23
|
Blanco W, Lopes PH, de S Souza AA, Mascagni M. Non-replicability circumstances in a neural network model with Hodgkin-Huxley-type neurons. J Comput Neurosci 2020; 48:357-363. [PMID: 32519227 DOI: 10.1007/s10827-020-00748-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2019] [Revised: 03/30/2020] [Accepted: 04/22/2020] [Indexed: 11/24/2022]
Abstract
Building upon previous experiments can be used to accomplish new goals. In computing, it is imperative to reuse computer code to continue development on specific projects. Reproducibility is a fundamental building block in science, and experimental reproducibility issues have recently been of great concern. It may be surprising that reproducibility is also of concern in computational science. In this study, we used a previously published code to investigate neural network activity and we were unable to replicate our original results. This led us to investigate the code in question, and we found that several different aspects, attributable to floating-point arithmetic, were the cause of these replicability issues. Furthermore, we uncovered other manifestations of this lack of replicability in other parts of the computation with this model. The simulated model is a standard system of ordinary differential equations, very much like those commonly used in computational neuroscience. Thus, we believe that other researchers in the field should be vigilant when using such models and avoid drawing conclusions from calculations if their qualitative results can be substantially modified through non-reproducible circumstances.
Collapse
Affiliation(s)
- Wilfredo Blanco
- Computer Science Department, State University of Rio Grande do Norte, Natal, RN, Brazil. .,Bioinformatics Department, Federal University of Rio Grande do Norte, Natal, RN, Brazil.
| | - Paulo H Lopes
- Computer Science Department, State University of Rio Grande do Norte, Natal, RN, Brazil
| | | | - Michael Mascagni
- Computer Science Department, Florida State University, Tallahassee, FL, USA.,Applied and Computational Mathematics Division, National Institute of Standards and Technology, Gaithersburg, MD, USA
| |
Collapse
|
24
|
Yang PC, Purawat S, Ieong PU, Jeng MT, DeMarco KR, Vorobyov I, McCulloch AD, Altintas I, Amaro RE, Clancy CE. A demonstration of modularity, reuse, reproducibility, portability and scalability for modeling and simulation of cardiac electrophysiology using Kepler Workflows. PLoS Comput Biol 2019; 15:e1006856. [PMID: 30849072 PMCID: PMC6426265 DOI: 10.1371/journal.pcbi.1006856] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2018] [Revised: 03/20/2019] [Accepted: 02/08/2019] [Indexed: 01/18/2023] Open
Abstract
Multi-scale computational modeling is a major branch of computational biology as evidenced by the US federal interagency Multi-Scale Modeling Consortium and major international projects. It invariably involves specific and detailed sequences of data analysis and simulation, often with multiple tools and datasets, and the community recognizes improved modularity, reuse, reproducibility, portability and scalability as critical unmet needs in this area. Scientific workflows are a well-recognized strategy for addressing these needs in scientific computing. While there are good examples if the use of scientific workflows in bioinformatics, medical informatics, biomedical imaging and data analysis, there are fewer examples in multi-scale computational modeling in general and cardiac electrophysiology in particular. Cardiac electrophysiology simulation is a mature area of multi-scale computational biology that serves as an excellent use case for developing and testing new scientific workflows. In this article, we develop, describe and test a computational workflow that serves as a proof of concept of a platform for the robust integration and implementation of a reusable and reproducible multi-scale cardiac cell and tissue model that is expandable, modular and portable. The workflow described leverages Python and Kepler-Python actor for plotting and pre/post-processing. During all stages of the workflow design, we rely on freely available open-source tools, to make our workflow freely usable by scientists. We present a computational workflow as a proof of concept for integration and implementation of a reusable and reproducible cardiac multi-scale electrophysiology model that is expandable, modular and portable. This framework enables scientists to create intuitive, user-friendly and flexible end-to-end automated scientific workflows using a graphical user interface. Kepler is an advanced open-source platform that supports multiple models of computation. The underlying workflow engine handles scalability, provenance, reproducibility aspects of the code, performs orchestration of data flow, and automates execution on heterogeneous computing resources. One of the main advantages of workflow utilization is the integration of code written in multiple languages Standardization occurs at the interfaces of the workflow elements and allows for general applications and easy comparison and integration of code from different research groups or even multiple programmers coding in different languages for various purposes from the same group. A workflow driven problem-solving approach enables domain scientists to focus on resolving the core science questions, and delegates the computational and process management burden to the underlying Workflow. The workflow driven approach allows scaling the computational experiment with distributed data-parallel execution on multiple computing platforms, such as, HPC resources, GPU clusters, Cloud etc. The workflow framework tracks software version information along with hardware information to allow users an opportunity to trace any variation in workflow outcome to the system configurations.
Collapse
Affiliation(s)
- Pei-Chi Yang
- Department of Physiology and Membrane Biology, Department of Pharmacology, School of Medicine, University of California Davis, Davis, California, United States of America
| | - Shweta Purawat
- San Diego Supercomputer Center (SDSC), University of California, San Diego, La Jolla, California, United States of America
| | - Pek U. Ieong
- Department of Chemistry and Biochemistry, National Biomedical Computation Resource, Drug Design Data Resource (D3R), University of California San Diego, La Jolla, California, United States of America
| | - Mao-Tsuen Jeng
- Department of Physiology and Membrane Biology, Department of Pharmacology, School of Medicine, University of California Davis, Davis, California, United States of America
| | - Kevin R. DeMarco
- Department of Physiology and Membrane Biology, Department of Pharmacology, School of Medicine, University of California Davis, Davis, California, United States of America
| | - Igor Vorobyov
- Department of Physiology and Membrane Biology, Department of Pharmacology, School of Medicine, University of California Davis, Davis, California, United States of America
| | - Andrew D. McCulloch
- Departments of Bioengineering and Medicine, University of California, San Diego, La Jolla, California, United States of America
| | - Ilkay Altintas
- San Diego Supercomputer Center (SDSC), University of California, San Diego, La Jolla, California, United States of America
| | - Rommie E. Amaro
- Department of Chemistry and Biochemistry, National Biomedical Computation Resource, Drug Design Data Resource (D3R), University of California San Diego, La Jolla, California, United States of America
| | - Colleen E. Clancy
- Department of Physiology and Membrane Biology, Department of Pharmacology, School of Medicine, University of California Davis, Davis, California, United States of America
- * E-mail:
| |
Collapse
|
25
|
Mishra B, Kumar N, Mukhtar MS. Systems Biology and Machine Learning in Plant-Pathogen Interactions. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2019; 32:45-55. [PMID: 30418085 DOI: 10.1094/mpmi-08-18-0221-fi] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Systems biology is an inclusive approach to study the static and dynamic emergent properties on a global scale by integrating multiomics datasets to establish qualitative and quantitative associations among multiple biological components. With an abundance of improved high throughput -omics datasets, network-based analyses and machine learning technologies are playing a pivotal role in comprehensive understanding of biological systems. Network topological features reveal most important nodes within a network as well as prioritize significant molecular components for diverse biological networks, including coexpression, protein-protein interaction, and gene regulatory networks. Machine learning techniques provide enormous predictive power through specific feature extraction from biological data. Deep learning, a subtype of machine learning, has plausible future applications because a domain expert for feature extraction is not needed in this algorithm. Inspired by diverse domains of biology, we here review classic systems biology techniques applied in plant immunity thus far. We also discuss additional advanced approaches in both graph theory and machine learning, which may provide new insights for understanding plant-microbe interactions. Finally, we propose a hybrid approach in plant immune systems that harnesses the power of both network biology and machine learning, with a potential to be applicable to both model systems and agronomically important crop plants.
Collapse
Affiliation(s)
| | | | - M Shahid Mukhtar
- 1 Department of Biology, and
- 2 Nutrition Obesity Research Center, University of Alabama at Birmingham, 1300 University Blvd., Birmingham 35294, U.S.A
| |
Collapse
|
26
|
Stanford NJ, Scharm M, Dobson PD, Golebiewski M, Hucka M, Kothamachu VB, Nickerson D, Owen S, Pahle J, Wittig U, Waltemath D, Goble C, Mendes P, Snoep J. Data Management in Computational Systems Biology: Exploring Standards, Tools, Databases, and Packaging Best Practices. Methods Mol Biol 2019; 2049:285-314. [PMID: 31602618 DOI: 10.1007/978-1-4939-9736-7_17] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Computational systems biology involves integrating heterogeneous datasets in order to generate models. These models can assist with understanding and prediction of biological phenomena. Generating datasets and integrating them into models involves a wide range of scientific expertise. As a result these datasets are often collected by one set of researchers, and exchanged with others researchers for constructing the models. For this process to run smoothly the data and models must be FAIR-findable, accessible, interoperable, and reusable. In order for data and models to be FAIR they must be structured in consistent and predictable ways, and described sufficiently for other researchers to understand them. Furthermore, these data and models must be shared with other researchers, with appropriately controlled sharing permissions, before and after publication. In this chapter we explore the different data and model standards that assist with structuring, describing, and sharing. We also highlight the popular standards and sharing databases within computational systems biology.
Collapse
Affiliation(s)
| | - Martin Scharm
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | - Paul D Dobson
- School of Computer Science, University of Manchester, Manchester, UK
| | - Martin Golebiewski
- Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
| | - Michael Hucka
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA
| | | | - David Nickerson
- Auckland Bioengineering Institute, University of Auckland, Auckland, New Zealand
| | - Stuart Owen
- School of Computer Science, University of Manchester, Manchester, UK
| | - Jürgen Pahle
- BIOMS/BioQuant, Heidelberg University, Heidelberg, Germany.
| | - Ulrike Wittig
- Heidelberg Institute for Theoretical Studies (HITS), Heidelberg, Germany
| | - Dagmar Waltemath
- Medical Informatics, University Medicine Greifswald, Greifswald, Germany
| | - Carole Goble
- School of Computer Science, University of Manchester, Manchester, UK
| | - Pedro Mendes
- Centre for Quantitative Medicine, University of Connecticut, Farmington, CT, USA
| | - Jacky Snoep
- School of Computer Science, University of Manchester, Manchester, UK.,Biochemistry, Stellenbosch University, Stellenbosch, South Africa
| |
Collapse
|
27
|
Epelde G, Morgan F, Mujika A, Callaly F, Leškovský P, McGinley B, Álvarez R, Blau A, Krewer F. Web-Based Interfaces for Virtual C. elegans Neuron Model Definition, Network Configuration, Behavioral Experiment Definition and Experiment Results Visualization. Front Neuroinform 2018; 12:80. [PMID: 30483089 PMCID: PMC6243129 DOI: 10.3389/fninf.2018.00080] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 10/22/2018] [Indexed: 11/29/2022] Open
Abstract
The Si elegans platform targets the complete virtualization of the nematode Caenorhabditis elegans, and its environment. This paper presents a suite of unified web-based Graphical User Interfaces (GUIs) as the main user interaction point, and discusses their underlying technologies and methods. The user-friendly features of this tool suite enable users to graphically create neuron and network models, and behavioral experiments, without requiring knowledge of domain-specific computer-science tools. The framework furthermore allows the graphical visualization of all simulation results using a worm locomotion and neural activity viewer. Models, experiment definitions and results can be exported in a machine-readable format, thereby facilitating reproducible and cross-platform execution of in silico C. elegans experiments in other simulation environments. This is made possible by a novel XML-based behavioral experiment definition encoding format, a NeuroML XML-based model generation and network configuration description language, and their associated GUIs. User survey data confirms the platform usability and functionality, and provides insights into future directions for web-based simulation GUIs of C. elegans and other living organisms. The tool suite is available online to the scientific community and its source code has been made available.
Collapse
Affiliation(s)
- Gorka Epelde
- Vicomtech, San Sebastián, Spain
- IIS Biodonostia, San Sebastián, Spain
| | - Fearghal Morgan
- Bio-Inspired Electronics and Reconfigurable Computing (BIRC) Research Group, Electrical & Electronic Engineering Department, College of Engineering and Informatics, National University of Ireland Galway, Galway, Ireland
| | | | - Frank Callaly
- Bio-Inspired Electronics and Reconfigurable Computing (BIRC) Research Group, Electrical & Electronic Engineering Department, College of Engineering and Informatics, National University of Ireland Galway, Galway, Ireland
| | | | - Brian McGinley
- Bio-Inspired Electronics and Reconfigurable Computing (BIRC) Research Group, Electrical & Electronic Engineering Department, College of Engineering and Informatics, National University of Ireland Galway, Galway, Ireland
| | - Roberto Álvarez
- Vicomtech, San Sebastián, Spain
- IIS Biodonostia, San Sebastián, Spain
| | - Axel Blau
- Department of Neuroscience and Brain Technologies, Fondazione Istituto Italiano di Tecnologia, Genoa, Italy
| | - Finn Krewer
- Bio-Inspired Electronics and Reconfigurable Computing (BIRC) Research Group, Electrical & Electronic Engineering Department, College of Engineering and Informatics, National University of Ireland Galway, Galway, Ireland
| |
Collapse
|
28
|
Abstract
Like other types of computational research, modeling and simulation of biological processes (biomodels) is still largely communicated without sufficient detail to allow independent reproduction of results. But reproducibility in this area of research could easily be achieved by making use of existing resources, such as supplying models in standard formats and depositing code, models, and results in public repositories.
Collapse
|
29
|
Naldi A, Hernandez C, Levy N, Stoll G, Monteiro PT, Chaouiya C, Helikar T, Zinovyev A, Calzone L, Cohen-Boulakia S, Thieffry D, Paulevé L. The CoLoMoTo Interactive Notebook: Accessible and Reproducible Computational Analyses for Qualitative Biological Networks. Front Physiol 2018; 9:680. [PMID: 29971009 PMCID: PMC6018415 DOI: 10.3389/fphys.2018.00680] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 05/15/2018] [Indexed: 01/07/2023] Open
Abstract
Analysing models of biological networks typically relies on workflows in which different software tools with sensitive parameters are chained together, many times with additional manual steps. The accessibility and reproducibility of such workflows is challenging, as publications often overlook analysis details, and because some of these tools may be difficult to install, and/or have a steep learning curve. The CoLoMoTo Interactive Notebook provides a unified environment to edit, execute, share, and reproduce analyses of qualitative models of biological networks. This framework combines the power of different technologies to ensure repeatability and to reduce users' learning curve of these technologies. The framework is distributed as a Docker image with the tools ready to be run without any installation step besides Docker, and is available on Linux, macOS, and Microsoft Windows. The embedded computational workflows are edited with a Jupyter web interface, enabling the inclusion of textual annotations, along with the explicit code to execute, as well as the visualization of the results. The resulting notebook files can then be shared and re-executed in the same environment. To date, the CoLoMoTo Interactive Notebook provides access to the software tools GINsim, BioLQM, Pint, MaBoSS, and Cell Collective, for the modeling and analysis of Boolean and multi-valued networks. More tools will be included in the future. We developed a Python interface for each of these tools to offer a seamless integration in the Jupyter web interface and ease the chaining of complementary analyses.
Collapse
Affiliation(s)
- Aurélien Naldi
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Céline Hernandez
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Nicolas Levy
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
- École Normale Supérieure de Lyon, Lyon, France
| | - Gautier Stoll
- Université Paris Descartes/Paris V, Sorbonne Paris Cité, Paris, France
- Équipe 11 Labellisée Ligue Nationale Contre le Cancer, Centre de Recherche des Cordeliers, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U1138, Paris, France
- Université Pierre et Marie Curie, Paris, France
- Metabolomics and Cell Biology Platforms, Gustave Roussy Cancer, Villejuif, France
| | - Pedro T. Monteiro
- INESC-ID/Instituto Superior Técnico, University of Lisbon, Lisbon, Portugal
| | | | - Tomáš Helikar
- Department of Biochemistry, University of Nebraska-Lincoln, Lincoln, NE, United States
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
- Lobachevsky University, Nizhni Novgorod, Russia
| | - Laurence Calzone
- Institut Curie, PSL Research University, Paris, France
- Institut National de la Santé et de la Recherche Médicale, U900, Paris, France
- MINES ParisTech, PSL Research University, CBIO-Centre for Computational Biology, Paris, France
| | - Sarah Cohen-Boulakia
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
| | - Denis Thieffry
- Computational Systems Biology Team, Institut de Biologie de I'Ecole Normale Supérieure, Centre National de la Recherche Scientifique UMR8197, Institut National de la Santé et de la Recherche Médicale U1024, École Normale Supérieure, PSL Université, Paris, France
| | - Loïc Paulevé
- Laboratoire de Recherche en Informatique UMR8623, Université Paris-Sud, Centre National de la Recherche Scientifique, Université Paris-Saclay, Orsay, France
| |
Collapse
|
30
|
Manninen T, Aćimović J, Havela R, Teppola H, Linne ML. Challenges in Reproducibility, Replicability, and Comparability of Computational Models and Tools for Neuronal and Glial Networks, Cells, and Subcellular Structures. Front Neuroinform 2018; 12:20. [PMID: 29765315 PMCID: PMC5938413 DOI: 10.3389/fninf.2018.00020] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2018] [Accepted: 04/06/2018] [Indexed: 01/26/2023] Open
Abstract
The possibility to replicate and reproduce published research results is one of the biggest challenges in all areas of science. In computational neuroscience, there are thousands of models available. However, it is rarely possible to reimplement the models based on the information in the original publication, let alone rerun the models just because the model implementations have not been made publicly available. We evaluate and discuss the comparability of a versatile choice of simulation tools: tools for biochemical reactions and spiking neuronal networks, and relatively new tools for growth in cell cultures. The replicability and reproducibility issues are considered for computational models that are equally diverse, including the models for intracellular signal transduction of neurons and glial cells, in addition to single glial cells, neuron-glia interactions, and selected examples of spiking neuronal networks. We also address the comparability of the simulation results with one another to comprehend if the studied models can be used to answer similar research questions. In addition to presenting the challenges in reproducibility and replicability of published results in computational neuroscience, we highlight the need for developing recommendations and good practices for publishing simulation tools and computational models. Model validation and flexible model description must be an integral part of the tool used to simulate and develop computational models. Constant improvement on experimental techniques and recording protocols leads to increasing knowledge about the biophysical mechanisms in neural systems. This poses new challenges for computational neuroscience: extended or completely new computational methods and models may be required. Careful evaluation and categorization of the existing models and tools provide a foundation for these future needs, for constructing multiscale models or extending the models to incorporate additional or more detailed biophysical mechanisms. Improving the quality of publications in computational neuroscience, enabling progressive building of advanced computational models and tools, can be achieved only through adopting publishing standards which underline replicability and reproducibility of research results.
Collapse
Affiliation(s)
- Tiina Manninen
- Computational Neuroscience Group, BioMediTech Institute and Faculty of Biomedical Sciences and Engineering, Tampere University of Technology, Tampere, Finland
- Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Jugoslava Aćimović
- Computational Neuroscience Group, BioMediTech Institute and Faculty of Biomedical Sciences and Engineering, Tampere University of Technology, Tampere, Finland
- Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Riikka Havela
- Computational Neuroscience Group, BioMediTech Institute and Faculty of Biomedical Sciences and Engineering, Tampere University of Technology, Tampere, Finland
- Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Heidi Teppola
- Computational Neuroscience Group, BioMediTech Institute and Faculty of Biomedical Sciences and Engineering, Tampere University of Technology, Tampere, Finland
- Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
| | - Marja-Leena Linne
- Computational Neuroscience Group, BioMediTech Institute and Faculty of Biomedical Sciences and Engineering, Tampere University of Technology, Tampere, Finland
- Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland
| |
Collapse
|
31
|
Hinds AM, Sajobi TT, Sebille V, Sawatzky R, Lix LM. A systematic review of the quality of reporting of simulation studies about methods for the analysis of complex longitudinal patient-reported outcomes data. Qual Life Res 2018; 27:2507-2516. [PMID: 29679367 DOI: 10.1007/s11136-018-1861-0] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/16/2018] [Indexed: 01/09/2023]
Abstract
PURPOSE This study describes the characteristics and quality of reporting for published computer simulation studies about statistical methods to analyze complex longitudinal (i.e., repeated measures) patient-reported outcomes (PROs); we included methods for longitudinal latent variable measurement and growth models and response shift. METHODS Scopus, PsycINFO, PubMed, EMBASE, and Social Science Citation Index were searched for English-language studies published between 1999 and 2016 using selected keywords. Extracted information included characteristics of the study purpose/objectives, simulation design, software, execution, performance, and results. The quality of reporting was evaluated using published best-practice guidelines. SYNTHESIS A total of 1470 articles were reviewed and 42 articles met the inclusion criteria. The majority of the included studies (73.8%) investigated an existing statistical method, primarily a latent variable model (95.2%). Most studies specified the population model, including variable distributions, mean parameters, and correlation/covariances. The number of time points and sample size(s) were reported by all studies, but justification for the selected values was rarely provided. The majority of the studies (52.4%) did not report on model non-convergence. Bias, accuracy, and model fit were commonly reported performance metrics. All studies reported results descriptively, and 26.2% also used an inferential method. CONCLUSIONS While methodological research on statistical analyses of complex longitudinal PRO data is informed by computer simulation studies, current reporting practices of these studies have not been consistent with best-practice guidelines. Comprehensive reporting of simulation methods and results ensures that the strengths and limitations of the investigated statistical methods are thoroughly explored.
Collapse
Affiliation(s)
- Aynslie M Hinds
- Department of Community Health Sciences, University of Manitoba, S113-750 Bannatyne Ave, Winnipeg, MB, R3E 0W3, Canada
| | - Tolulope T Sajobi
- Department of Community Health Sciences & O'Brien Institute for Public Health, University of Calgary, 3D19 Teaching Research and Wellness Building, 3280 Hospital Drive NW, Calgary, AB, T2N 4Z6, Canada
| | - Véronique Sebille
- Institut de Recherche en Santé, Université de Nantes, Université de Tours, INSERM, SPHERE U1246, 22 Boulevard Bénoni Goullin, 44000, Nantes, France
| | - Richard Sawatzky
- School of Nursing, Trinity Western University, 7th Floor, 828 West 10th Avenue, Research Pavilion, Vancouver, BC V5Z 1M9, Canada.,Centre for Health Evaluation and Outcome Sciences, Providence Health Care, 588-1081 Burrard Street, Vancouver, BC V6Z 1Y6, Canada
| | - Lisa M Lix
- Department of Community Health Sciences, University of Manitoba, S113-750 Bannatyne Ave, Winnipeg, MB, R3E 0W3, Canada.
| |
Collapse
|
32
|
Manninen T, Havela R, Linne ML. Reproducibility and Comparability of Computational Models for Astrocyte Calcium Excitability. Front Neuroinform 2017; 11:11. [PMID: 28270761 PMCID: PMC5318440 DOI: 10.3389/fninf.2017.00011] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2016] [Accepted: 01/25/2017] [Indexed: 11/13/2022] Open
Abstract
The scientific community across all disciplines faces the same challenges of ensuring accessibility, reproducibility, and efficient comparability of scientific results. Computational neuroscience is a rapidly developing field, where reproducibility and comparability of research results have gained increasing interest over the past years. As the number of computational models of brain functions is increasing, we chose to address reproducibility using four previously published computational models of astrocyte excitability as an example. Although not conventionally taken into account when modeling neuronal systems, astrocytes have been shown to take part in a variety of in vitro and in vivo phenomena including synaptic transmission. Two of the selected astrocyte models describe spontaneous calcium excitability, and the other two neurotransmitter-evoked calcium excitability. We specifically addressed how well the original simulation results can be reproduced with a reimplementation of the models. Additionally, we studied how well the selected models can be reused and whether they are comparable in other stimulation conditions and research settings. Unexpectedly, we found out that three of the model publications did not give all the necessary information required to reimplement the models. In addition, we were able to reproduce the original results of only one of the models completely based on the information given in the original publications and in the errata. We actually found errors in the equations provided by two of the model publications; after modifying the equations accordingly, the original results were reproduced more accurately. Even though the selected models were developed to describe the same biological event, namely astrocyte calcium excitability, the models behaved quite differently compared to one another. Our findings on a specific set of published astrocyte models stress the importance of proper validation of the models against experimental wet-lab data from astrocytes as well as the careful review process of models. A variety of aspects of model development could be improved, including the presentation of models in publications and databases. Specifically, all necessary mathematical equations, as well as parameter values, initial values of variables, and stimuli used should be given precisely for successful reproduction of scientific results.
Collapse
Affiliation(s)
- Tiina Manninen
- Computational Neuroscience Group, Faculty of Biomedical Sciences and Engineering and BioMediTech Institute, Tampere University of Technology Tampere, Finland
| | - Riikka Havela
- Computational Neuroscience Group, Faculty of Biomedical Sciences and Engineering and BioMediTech Institute, Tampere University of Technology Tampere, Finland
| | - Marja-Leena Linne
- Computational Neuroscience Group, Faculty of Biomedical Sciences and Engineering and BioMediTech Institute, Tampere University of Technology Tampere, Finland
| |
Collapse
|
33
|
Hunter P. The Virtual Physiological Human: The Physiome Project Aims to Develop Reproducible, Multiscale Models for Clinical Practice. IEEE Pulse 2017; 7:36-42. [PMID: 27414633 DOI: 10.1109/mpul.2016.2563841] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
The Physiome Project was initiated by the International Union of Physiological Sciences (IUPS; www.iups.org) in 1997 to bring multiscale engineering modeling approaches to the physiological interpretation of the wealth of molecular data that was becoming available at that time [1]. The discipline of physiology, which with anatomy underpins medical practice, had lost its traditional central position in the biological sciences (at least from a funding perspective) to molecular biology, despite the very small impact molecular biology has had on the diagnosis and treatment of disease. While diseases and drugs certainly operate at the molecular level, the regulation of genetic transcription and, hence, the expression of proteins (the building blocks of life) are both highly dependent on environmental factors governed by the physical world in which molecular biology operates. Engineering-in particular, the rapidly growing field of bioengineering-is the discipline that has the integrative skills and tools to put the molecular pieces of Humpty Dumpty back together again.
Collapse
|
34
|
Cooper J, Scharm M, Mirams GR. The Cardiac Electrophysiology Web Lab. Biophys J 2016; 110:292-300. [PMID: 26789753 PMCID: PMC4724653 DOI: 10.1016/j.bpj.2015.12.012] [Citation(s) in RCA: 41] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Revised: 12/09/2015] [Accepted: 12/11/2015] [Indexed: 12/21/2022] Open
Abstract
Computational modeling of cardiac cellular electrophysiology has a long history, and many models are now available for different species, cell types, and experimental preparations. This success brings with it a challenge: how do we assess and compare the underlying hypotheses and emergent behaviors so that we can choose a model as a suitable basis for a new study or to characterize how a particular model behaves in different scenarios? We have created an online resource for the characterization and comparison of electrophysiological cell models in a wide range of experimental scenarios. The details of the mathematical model (quantitative assumptions and hypotheses formulated as ordinary differential equations) are separated from the experimental protocol being simulated. Each model and protocol is then encoded in computer-readable formats. A simulation tool runs virtual experiments on models encoded in CellML, and a website (https://chaste.cs.ox.ac.uk/WebLab) provides a friendly interface, allowing users to store and compare results. The system currently contains a sample of 36 models and 23 protocols, including current-voltage curve generation, action potential properties under steady pacing at different rates, restitution properties, block of particular channels, and hypo-/hyperkalemia. This resource is publicly available, open source, and free, and we invite the community to use it and become involved in future developments. Investigators interested in comparing competing hypotheses using models can make a more informed decision, and those developing new models can upload them for easy evaluation under the existing protocols, and even add their own protocols.
Collapse
Affiliation(s)
- Jonathan Cooper
- Department of Computer Science, University of Oxford, Oxford, United Kingdom.
| | - Martin Scharm
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
| | - Gary R Mirams
- Department of Computer Science, University of Oxford, Oxford, United Kingdom
| |
Collapse
|
35
|
Abstract
Systems pharmacology aims to holistically understand mechanisms of drug actions to support drug discovery and clinical practice. Systems pharmacology modeling (SPM) is data driven. It integrates an exponentially growing amount of data at multiple scales (genetic, molecular, cellular, organismal, and environmental). The goal of SPM is to develop mechanistic or predictive multiscale models that are interpretable and actionable. The current explosions in genomics and other omics data, as well as the tremendous advances in big data technologies, have already enabled biologists to generate novel hypotheses and gain new knowledge through computational models of genome-wide, heterogeneous, and dynamic data sets. More work is needed to interpret and predict a drug response phenotype, which is dependent on many known and unknown factors. To gain a comprehensive understanding of drug actions, SPM requires close collaborations between domain experts from diverse fields and integration of heterogeneous models from biophysics, mathematics, statistics, machine learning, and semantic webs. This creates challenges in model management, model integration, model translation, and knowledge integration. In this review, we discuss several emergent issues in SPM and potential solutions using big data technology and analytics. The concurrent development of high-throughput techniques, cloud computing, data science, and the semantic web will likely allow SPM to be findable, accessible, interoperable, reusable, reliable, interpretable, and actionable.
Collapse
Affiliation(s)
- Lei Xie
- Department of Computer Science, Hunter College, The City University of New York, New York, NY 10065; .,The Graduate Center, The City University of New York, New York, NY 10016
| | - Eli J Draizen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894; .,Program in Bioinformatics, Boston University, Boston, Massachusetts 02215
| | - Philip E Bourne
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, Maryland 20894; .,Office of the Director, National Institutes of Health, Bethesda, Maryland 20894
| |
Collapse
|
36
|
McDougal RA, Bulanova AS, Lytton WW. Reproducibility in Computational Neuroscience Models and Simulations. IEEE Trans Biomed Eng 2016; 63:2021-35. [PMID: 27046845 PMCID: PMC5016202 DOI: 10.1109/tbme.2016.2539602] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
OBJECTIVE Like all scientific research, computational neuroscience research must be reproducible. Big data science, including simulation research, cannot depend exclusively on journal articles as the method to provide the sharing and transparency required for reproducibility. METHODS Ensuring model reproducibility requires the use of multiple standard software practices and tools, including version control, strong commenting and documentation, and code modularity. RESULTS Building on these standard practices, model-sharing sites and tools have been developed that fit into several categories: 1) standardized neural simulators; 2) shared computational resources; 3) declarative model descriptors, ontologies, and standardized annotations; and 4) model-sharing repositories and sharing standards. CONCLUSION A number of complementary innovations have been proposed to enhance sharing, transparency, and reproducibility. The individual user can be encouraged to make use of version control, commenting, documentation, and modularity in development of models. The community can help by requiring model sharing as a condition of publication and funding. SIGNIFICANCE Model management will become increasingly important as multiscale models become larger, more detailed, and correspondingly more difficult to manage by any single investigator or single laboratory. Additional big data management complexity will come as the models become more useful in interpreting experiments, thus increasing the need to ensure clear alignment between modeling data, both parameters and results, and experiment.
Collapse
|
37
|
Golabgir A, Gutierrez JM, Hefzi H, Li S, Palsson BO, Herwig C, Lewis NE. Quantitative feature extraction from the Chinese hamster ovary bioprocess bibliome using a novel meta-analysis workflow. Biotechnol Adv 2016; 34:621-633. [DOI: 10.1016/j.biotechadv.2016.02.011] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2015] [Revised: 02/21/2016] [Accepted: 02/28/2016] [Indexed: 01/01/2023]
|
38
|
A scientific workflow framework for 13C metabolic flux analysis. J Biotechnol 2016; 232:12-24. [DOI: 10.1016/j.jbiotec.2015.12.032] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 12/14/2015] [Accepted: 12/17/2015] [Indexed: 12/15/2022]
|
39
|
Lewis J, Breeze CE, Charlesworth J, Maclaren OJ, Cooper J. Where next for the reproducibility agenda in computational biology? BMC SYSTEMS BIOLOGY 2016; 10:52. [PMID: 27422148 PMCID: PMC4946111 DOI: 10.1186/s12918-016-0288-x] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2016] [Accepted: 06/08/2016] [Indexed: 11/24/2022]
Abstract
Background The concept of reproducibility is a foundation of the scientific method. With the arrival of fast and powerful computers over the last few decades, there has been an explosion of results based on complex computational analyses and simulations. The reproducibility of these results has been addressed mainly in terms of exact replicability or numerical equivalence, ignoring the wider issue of the reproducibility of conclusions through equivalent, extended or alternative methods. Results We use case studies from our own research experience to illustrate how concepts of reproducibility might be applied in computational biology. Several fields have developed ‘minimum information’ checklists to support the full reporting of computational simulations, analyses and results, and standardised data formats and model description languages can facilitate the use of multiple systems to address the same research question. We note the importance of defining the key features of a result to be reproduced, and the expected agreement between original and subsequent results. Dynamic, updatable tools for publishing methods and results are becoming increasingly common, but sometimes come at the cost of clear communication. In general, the reproducibility of computational research is improving but would benefit from additional resources and incentives. Conclusions We conclude with a series of linked recommendations for improving reproducibility in computational biology through communication, policy, education and research practice. More reproducible research will lead to higher quality conclusions, deeper understanding and more valuable knowledge.
Collapse
Affiliation(s)
- Joanna Lewis
- Centre for Maths and Physics in the Life Sciences and Experimental Biology, University College London, Physics Building, Gower Place, London, WC1E 6BT, UK. .,NIHR Health Protection Research Unit in Modelling Methodology, Department of Infectious Disease Epidemiology, Imperial College London, St Mary's Campus, Norfolk Place, London, W2 1PG, UK.
| | - Charles E Breeze
- UCL Cancer Institute, University College London, 72 Huntley St, London, WC1E 6DD, UK
| | - Jane Charlesworth
- Department of Genetics, University of Cambridge, Downing Street, Cambridge, CB2 3EH, UK
| | - Oliver J Maclaren
- Department of Mathematics, University of Auckland, Auckland, 1142, New Zealand.,Department of Engineering Science, University of Auckland, Auckland, 1142, New Zealand
| | - Jonathan Cooper
- Department of Computer Science, University of Oxford, Wolfson Building, Parks Road, Oxford, OX1 3QD, UK
| |
Collapse
|
40
|
Waltemath D, Wolkenhauer O. How Modeling Standards, Software, and Initiatives Support Reproducibility in Systems Biology and Systems Medicine. IEEE Trans Biomed Eng 2016; 63:1999-2006. [PMID: 27295645 DOI: 10.1109/tbme.2016.2555481] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
OBJECTIVE Only reproducible results are of significance to science. The lack of suitable standards and appropriate support of standards in software tools has led to numerous publications with irreproducible results. Our objectives are to identify the key challenges of reproducible research and to highlight existing solutions. RESULTS In this paper, we summarize problems concerning reproducibility in systems biology and systems medicine. We focus on initiatives, standards, and software tools that aim to improve the reproducibility of simulation studies. CONCLUSIONS The long-term success of systems biology and systems medicine depends on trustworthy models and simulations. This requires openness to ensure reusability and transparency to enable reproducibility of results in these fields.
Collapse
|
41
|
Yao PJ, Chung RH. GESDB: a platform of simulation resources for genetic epidemiology studies. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw082. [PMID: 27242038 PMCID: PMC4885602 DOI: 10.1093/database/baw082] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Accepted: 04/25/2016] [Indexed: 11/12/2022]
Abstract
Computer simulations are routinely conducted to evaluate new statistical methods, to compare the properties among different methods, and to mimic the observed data in genetic epidemiology studies. Conducting simulation studies can become a complicated task as several challenges can occur, such as the selection of an appropriate simulation tool and the specification of parameters in the simulation model. Although abundant simulated data have been generated for human genetic research, currently there is no public database designed specifically as a repository for these simulated data. With the lack of such a database, for similar studies, similar simulations may have been repeated, which resulted in redundant work. Thus, we created an online platform, the Genetic Epidemiology Simulation Database (GESDB), for simulation data sharing and discussion of simulation techniques for genetic epidemiology studies. GESDB consists of a database for storing simulation scripts, simulated data and documentation from published articles as well as a discussion forum, which provides a platform for discussion of the simulated data and exchanging simulation ideas. Moreover, summary statistics such as the simulation tools that are most commonly used and datasets that are most frequently downloaded are provided. The statistics will be informative for researchers to choose an appropriate simulation tool or select a common dataset for method comparisons. GESDB can be accessed at http://gesdb.nhri.org.tw. Database URL:http://gesdb.nhri.org.tw
Collapse
Affiliation(s)
- Po-Ju Yao
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| | - Ren-Hua Chung
- Division of Biostatistics and Bioinformatics, Institute of Population Health Sciences, National Health Research Institutes, Zhunan, Taiwan
| |
Collapse
|
42
|
Olivier BG, Swat MJ, Moné MJ. Modeling and Simulation Tools: From Systems Biology to Systems Medicine. Methods Mol Biol 2016; 1386:441-63. [PMID: 26677194 DOI: 10.1007/978-1-4939-3283-2_19] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Modeling is an integral component of modern biology. In this chapter we look into the role of the model, as it pertains to Systems Medicine, and the software that is required to instantiate and run it. We do this by comparing the development, implementation, and characteristics of tools that have been developed to work with two divergent methodologies: Systems Biology and Pharmacometrics. From the Systems Biology perspective we consider the concept of "Software as a Medical Device" and what this may imply for the migration of research-oriented, simulation software into the domain of human health.In our second perspective, we see how in practice hundreds of computational tools already accompany drug discovery and development at every stage of the process. Standardized exchange formats are required to streamline the model exchange between tools, which would minimize translation errors and reduce the required time. With the emergence, almost 15 years ago, of the SBML standard, a large part of the domain of interest is already covered and models can be shared and passed from software to software without recoding them. Until recently the last stage of the process, the pharmacometric analysis used in clinical studies carried out on subject populations, lacked such an exchange medium. We describe a new emerging exchange format in Pharmacometrics which covers the non-linear mixed effects models, the standard statistical model type used in this area. By interfacing these two formats the entire domain can be covered by complementary standards and subsequently the according tools.
Collapse
Affiliation(s)
- Brett G Olivier
- Systems Bioinformatics, VU University Amsterdam, Amsterdam, The Netherlands.
| | - Maciej J Swat
- EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, UK
| | - Martijn J Moné
- Molecular Cell Physiology, VU University Amsterdam, Amsterdam, The Netherlands.,Systems and Synthetic Biology, Wageningen University, Wageningen, The Netherlands
| |
Collapse
|
43
|
Costa RS, Hartmann A, Vinga S. Kinetic modeling of cell metabolism for microbial production. J Biotechnol 2015; 219:126-41. [PMID: 26724578 DOI: 10.1016/j.jbiotec.2015.12.023] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2015] [Revised: 11/25/2015] [Accepted: 12/15/2015] [Indexed: 12/20/2022]
Abstract
Kinetic models of cellular metabolism are important tools for the rational design of metabolic engineering strategies and to explain properties of complex biological systems. The recent developments in high-throughput experimental data are leading to new computational approaches for building kinetic models of metabolism. Herein, we briefly survey the available databases, standards and software tools that can be applied for kinetic models of metabolism. In addition, we give an overview about recently developed ordinary differential equations (ODE)-based kinetic models of metabolism and some of the main applications of such models are illustrated in guiding metabolic engineering design. Finally, we review the kinetic modeling approaches of large-scale networks that are emerging, discussing their main advantages, challenges and limitations.
Collapse
Affiliation(s)
- Rafael S Costa
- IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal.
| | - Andras Hartmann
- IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal
| | - Susana Vinga
- IDMEC, Instituto Superior Técnico, Universidade de Lisboa, Av. Rovisco Pais 1, 1049-001 Lisboa, Portugal
| |
Collapse
|
44
|
Stanford NJ, Wolstencroft K, Golebiewski M, Kania R, Juty N, Tomlinson C, Owen S, Butcher S, Hermjakob H, Le Novère N, Mueller W, Snoep J, Goble C. The evolution of standards and data management practices in systems biology. Mol Syst Biol 2015; 11:851. [PMID: 26700851 PMCID: PMC4704484 DOI: 10.15252/msb.20156053] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Affiliation(s)
- Natalie J Stanford
- Manchester Institute of Biotechnology, The University of Manchester, Manchester, UK School of Computer Science, University of Manchester, Manchester, UK
| | - Katherine Wolstencroft
- Leiden Institute of Advanced Computer Science, Leiden Institute of Advanced Computer Science, Leiden University, Leiden, The Netherlands
| | | | - Renate Kania
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Nick Juty
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | | | - Stuart Owen
- School of Computer Science, University of Manchester, Manchester, UK
| | - Sarah Butcher
- Department of Surgery and Cancer, Imperial College London, London, UK
| | - Henning Hermjakob
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| | | | - Wolfgang Mueller
- Heidelberg Institute for Theoretical Studies, Heidelberg, Germany
| | - Jacky Snoep
- Department of Biochemistry, University of Stellenbosch, Matieland, South Africa School of Chemical Engineering & Analytical Science, The University of Manchester, Manchester, UK
| | - Carole Goble
- School of Computer Science, University of Manchester, Manchester, UK
| |
Collapse
|
45
|
Myokit: A simple interface to cardiac cellular electrophysiology. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2015; 120:100-14. [PMID: 26721671 DOI: 10.1016/j.pbiomolbio.2015.12.008] [Citation(s) in RCA: 64] [Impact Index Per Article: 7.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/18/2015] [Revised: 11/07/2015] [Accepted: 12/16/2015] [Indexed: 11/24/2022]
Abstract
Myokit is a new powerful and versatile software tool for modeling and simulation of cardiac cellular electrophysiology. Myokit consists of an easy-to-read modeling language, a graphical user interface, single and multi-cell simulation engines and a library of advanced analysis tools accessible through a Python interface. Models can be loaded from Myokit's native file format or imported from CellML. Model export is provided to C, MATLAB, CellML, CUDA and OpenCL. Patch-clamp data can be imported and used to estimate model parameters. In this paper, we review existing tools to simulate the cardiac cellular action potential to find that current tools do not cater specifically to model development and that there is a gap between easy-to-use but limited software and powerful tools that require strong programming skills from their users. We then describe Myokit's capabilities, focusing on its model description language, simulation engines and import/export facilities in detail. Using three examples, we show how Myokit can be used for clinically relevant investigations, multi-model testing and parameter estimation in Markov models, all with minimal programming effort from the user. This way, Myokit bridges a gap between performance, versatility and user-friendliness.
Collapse
|
46
|
Parton A, McGilligan V, O’Kane M, Baldrick FR, Watterson S. Computational modelling of atherosclerosis. Brief Bioinform 2015; 17:562-75. [DOI: 10.1093/bib/bbv081] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2015] [Indexed: 12/24/2022] Open
|
47
|
Asai Y, Abe T, Li L, Oka H, Nomura T, Kitano H. Databases for multilevel biophysiology research available at Physiome.jp. Front Physiol 2015; 6:251. [PMID: 26441671 PMCID: PMC4563878 DOI: 10.3389/fphys.2015.00251] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2015] [Accepted: 08/24/2015] [Indexed: 11/13/2022] Open
Abstract
Physiome.jp (http://physiome.jp) is a portal site inaugurated in 2007 to support model-based research in physiome and systems biology. At Physiome.jp, several tools and databases are available to support construction of physiological, multi-hierarchical, large-scale models. There are three databases in Physiome.jp, housing mathematical models, morphological data, and time-series data. In late 2013, the site was fully renovated, and in May 2015, new functions were implemented to provide information infrastructure to support collaborative activities for developing models and performing simulations within the database framework. This article describes updates to the databases implemented since 2013, including cooperation among the three databases, interactive model browsing, user management, version management of models, management of parameter sets, and interoperability with applications.
Collapse
Affiliation(s)
- Yoshiyuki Asai
- Integrated Open Systems Unit, Okinawa Institute of Science and Technology Graduate University Okinawa, Japan
| | - Takeshi Abe
- Integrated Open Systems Unit, Okinawa Institute of Science and Technology Graduate University Okinawa, Japan
| | - Li Li
- Intasect Communications Inc. Osaka, Japan
| | - Hideki Oka
- Neuroinformatics Japan Center, RIKEN Brain Science Institute Saitama, Japan
| | - Taishin Nomura
- Department of Mechanical Science and Bioengineering, Osaka University Osaka, Japan
| | - Hiroaki Kitano
- Integrated Open Systems Unit, Okinawa Institute of Science and Technology Graduate University Okinawa, Japan ; The Systems Biology Institute Tokyo, Japan
| |
Collapse
|
48
|
McKeever S, Johnson D. The role of markup for enabling interoperability in health informatics. Front Physiol 2015; 6:152. [PMID: 26042043 PMCID: PMC4434901 DOI: 10.3389/fphys.2015.00152] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2014] [Accepted: 04/27/2015] [Indexed: 11/13/2022] Open
Abstract
Interoperability is the faculty of making information systems work together. In this paper we will distinguish a number of different forms that interoperability can take and show how they are realized on a variety of physiological and health care use cases. The last 15 years has seen the rise of very cheap digital storage both on and off site. With the advent of the Internet of Things people's expectations are for greater interconnectivity and seamless interoperability. The potential impact these technologies have on healthcare are dramatic: from improved diagnoses through immediate access to a patient's electronic health record, to in silico modeling of organs and early stage drug trials, to predictive medicine based on top-down modeling of disease progression and treatment. We will begin by looking at the underlying technology, classify the various kinds of interoperability that exist in the field, and discuss how they are realized. We conclude with a discussion on future possibilities that big data and further standardizations will enable.
Collapse
Affiliation(s)
- Steve McKeever
- Department of Informatics and Media, Uppsala UniversityUppsala, Sweden
- Saint Petersburg National Research University of Information Technologies, Mechanics and Optics (ITMO)Saint Petersburg, Russia
| | - David Johnson
- Data Science Institute, Imperial College LondonLondon, UK
- Department of Computing, Imperial College LondonLondon, UK
| |
Collapse
|
49
|
|
50
|
Henkel R, Wolkenhauer O, Waltemath D. Combining computational models, semantic annotations and simulation experiments in a graph database. Database (Oxford) 2015; 2015:bau130. [PMID: 25754863 PMCID: PMC4352687 DOI: 10.1093/database/bau130] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2014] [Revised: 12/18/2014] [Accepted: 12/18/2014] [Indexed: 12/28/2022]
Abstract
Model repositories such as the BioModels Database, the CellML Model Repository or JWS Online are frequently accessed to retrieve computational models of biological systems. However, their storage concepts support only restricted types of queries and not all data inside the repositories can be retrieved. In this article we present a storage concept that meets this challenge. It grounds on a graph database, reflects the models' structure, incorporates semantic annotations and simulation descriptions and ultimately connects different types of model-related data. The connections between heterogeneous model-related data and bio-ontologies enable efficient search via biological facts and grant access to new model features. The introduced concept notably improves the access of computational models and associated simulations in a model repository. This has positive effects on tasks such as model search, retrieval, ranking, matching and filtering. Furthermore, our work for the first time enables CellML- and Systems Biology Markup Language-encoded models to be effectively maintained in one database. We show how these models can be linked via annotations and queried. Database URL: https://sems.uni-rostock.de/projects/masymos/
Collapse
Affiliation(s)
- Ron Henkel
- University of Rostock, Department of Computer Science, Albert-Einstein-Straße 22, D-18059 Rostock, Germany, Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstrasse 69, 18057 Rostock, Germany and Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa
| | - Olaf Wolkenhauer
- University of Rostock, Department of Computer Science, Albert-Einstein-Straße 22, D-18059 Rostock, Germany, Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstrasse 69, 18057 Rostock, Germany and Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa University of Rostock, Department of Computer Science, Albert-Einstein-Straße 22, D-18059 Rostock, Germany, Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstrasse 69, 18057 Rostock, Germany and Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa
| | - Dagmar Waltemath
- University of Rostock, Department of Computer Science, Albert-Einstein-Straße 22, D-18059 Rostock, Germany, Department of Systems Biology and Bioinformatics, University of Rostock, Ulmenstrasse 69, 18057 Rostock, Germany and Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre at Stellenbosch University, Stellenbosch 7600, South Africa
| |
Collapse
|