1
|
Drug-target identification in COVID-19 disease mechanisms using computational systems biology approaches. Front Immunol 2024; 14:1282859. [PMID: 38414974 PMCID: PMC10897000 DOI: 10.3389/fimmu.2023.1282859] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Accepted: 12/22/2023] [Indexed: 02/29/2024] Open
Abstract
Introduction The COVID-19 Disease Map project is a large-scale community effort uniting 277 scientists from 130 Institutions around the globe. We use high-quality, mechanistic content describing SARS-CoV-2-host interactions and develop interoperable bioinformatic pipelines for novel target identification and drug repurposing. Methods Extensive community work allowed an impressive step forward in building interfaces between Systems Biology tools and platforms. Our framework can link biomolecules from omics data analysis and computational modelling to dysregulated pathways in a cell-, tissue- or patient-specific manner. Drug repurposing using text mining and AI-assisted analysis identified potential drugs, chemicals and microRNAs that could target the identified key factors. Results Results revealed drugs already tested for anti-COVID-19 efficacy, providing a mechanistic context for their mode of action, and drugs already in clinical trials for treating other diseases, never tested against COVID-19. Discussion The key advance is that the proposed framework is versatile and expandable, offering a significant upgrade in the arsenal for virus-host interactions and other complex pathologies.
Collapse
|
2
|
MetaLo: metabolic analysis of Logical models extracted from molecular interaction maps. J Integr Bioinform 2024; 0:jib-2023-0048. [PMID: 38314776 DOI: 10.1515/jib-2023-0048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2023] [Accepted: 01/09/2024] [Indexed: 02/07/2024] Open
Abstract
Molecular interaction maps (MIMs) are static graphical representations depicting complex biochemical networks that can be formalized using one of the Systems Biology Graphical Notation languages. Regardless of their extensive coverage of various biological processes, they are limited in terms of dynamic insights. However, MIMs can serve as templates for developing dynamic computational models. We present MetaLo, an open-source Python package that enables the coupling of Boolean models inferred from process description MIMs with generic core metabolic networks. MetaLo provides a framework to study the impact of signaling cascades, gene regulation processes, and metabolic flux distribution of central energy production pathways. MetaLo computes the Boolean model's asynchronous asymptotic behavior, through the identification of trap-spaces, and extracts metabolic constraints to contextualize the generic metabolic network. MetaLo is able to handle large-scale Boolean models and genome-scale metabolic models without requiring kinetic information or manual tuning. The framework behind MetaLo enables in depth analysis of the regulatory model, and may allow tackling a lack of omics data in poorly addressed biological fields to contextualize generic metabolic networks along with improper automatic reconstructions of cell- and/or disease-specific metabolic networks. MetaLo is available at https://pypi.org/project/metalo/ under the terms of the GNU General Public License v3.
Collapse
|
3
|
A large-scale Boolean model of the rheumatoid arthritis fibroblast-like synoviocytes predicts drug synergies in the arthritic joint. NPJ Syst Biol Appl 2023; 9:33. [PMID: 37454172 DOI: 10.1038/s41540-023-00294-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 06/29/2023] [Indexed: 07/18/2023] Open
Abstract
Rheumatoid arthritis (RA) is a complex autoimmune disease with an unknown aetiology. However, rheumatoid arthritis fibroblast-like synoviocytes (RA-FLS) play a significant role in initiating and perpetuating destructive joint inflammation by expressing immuno-modulating cytokines, adhesion molecules, and matrix remodelling enzymes. In addition, RA-FLS are primary drivers of inflammation, displaying high proliferative rates and an apoptosis-resistant phenotype. Thus, RA-FLS-directed therapies could become a complementary approach to immune-directed therapies by predicting the optimal conditions that would favour RA-FLS apoptosis, limit inflammation, slow the proliferation rate and minimise bone erosion and cartilage destruction. In this paper, we present a large-scale Boolean model for RA-FLS that consists of five submodels focusing on apoptosis, cell proliferation, matrix degradation, bone erosion and inflammation. The five-phenotype-specific submodels can be simulated independently or as a global model. In silico simulations and perturbations reproduced the expected biological behaviour of the system under defined initial conditions and input values. The model was then used to mimic the effect of mono or combined therapeutic treatments and predict novel targets and drug candidates through drug repurposing analysis.
Collapse
|
4
|
A guide for developing comprehensive systems biology maps of disease mechanisms: planning, construction and maintenance. FRONTIERS IN BIOINFORMATICS 2023; 3:1197310. [PMID: 37426048 PMCID: PMC10325725 DOI: 10.3389/fbinf.2023.1197310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 06/09/2023] [Indexed: 07/11/2023] Open
Abstract
As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. It is possible to model disease mechanisms on different levels of granularity and adjust the approach to the goals of a particular project. This rich environment together with requirements for high-quality network reconstruction makes it challenging for new curators and groups to be quickly introduced to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline that involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used for managing and querying efficiently such a resource. For assessing the interoperability and reproducibility we apply FAIR principles.
Collapse
|
5
|
Integrated bioinformatics approach reveals methylation-regulated differentially expressed genes in obesity. ARCHIVES OF ENDOCRINOLOGY AND METABOLISM 2023; 67:e000604. [PMID: 37252693 PMCID: PMC10665070 DOI: 10.20945/2359-3997000000604] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Accepted: 10/08/2022] [Indexed: 05/31/2023]
Abstract
Objective To identify DNA methylation and gene expression profiles involved in obesity by implementing an integrated bioinformatics approach. Materials and methods Gene expression (GSE94752, GSE55200, and GSE48964) and DNA methylation (GSE67024 and GSE111632) datasets were obtained from the GEO database. Differentially expressed genes (DEGs) and differentially methylated genes (DMGs) in subcutaneous adipose tissue of patients with obesity were identified using GEO2R. Methylation-regulated DEGs (MeDEGs) were identified by overlapping DEGs and DMGs. The protein-protein interaction (PPI) network was constructed with the STRING database and analyzed using Cytoscape. Functional modules and hub-bottleneck genes were identified by using MCODE and CytoHubba plugins. Functional enrichment analyses were performed based on Gene Ontology terms and KEGG pathways. To prioritize and identify candidate genes for obesity, MeDEGs were compared with obesity-related genes available at the DisGeNET database. Results A total of 54 MeDEGs were identified after overlapping the lists of significant 274 DEGs and 11,556 DMGs. Of these, 25 were hypermethylated-low expression genes and 29 were hypomethylated-high expression genes. The PPI network showed three hub-bottleneck genes (PTGS2, TNFAIP3, and FBXL20) and one functional module. The 54 MeDEGs were mainly involved in the regulation of fibroblast growth factor production, the molecular function of arachidonic acid, and ubiquitin-protein transferase activity. Data collected from DisGeNET showed that 11 of the 54 MeDEGs were involved in obesity. Conclusion This study identifies new MeDEGs involved in obesity and assessed their related pathways and functions. These results data may provide a deeper understanding of methylation-mediated regulatory mechanisms of obesity.
Collapse
|
6
|
StonPy: a tool to parse and query collections of SBGN maps in a graph database. Bioinformatics 2023; 39:7075543. [PMID: 36897014 PMCID: PMC10017094 DOI: 10.1093/bioinformatics/btad100] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2022] [Revised: 01/14/2023] [Accepted: 03/09/2023] [Indexed: 03/11/2023] Open
Abstract
SUMMARY The systems biology graphical notation (SBGN) has become the de facto standard for the graphical representation of molecular maps. Having rapid and easy access to the content of large collections of maps is necessary to perform semantic or graph-based analysis of these resources. To this end, we propose StonPy, a new tool to store and query SBGN maps in a Neo4j graph database. StonPy notably includes a data model that takes into account all three SBGN languages and a completion module to automatically build valid SBGN maps from query results. StonPy is built as a library that can be integrated into other software and offers a command-line interface that allows users to easily perform all operations. AVAILABILITY AND IMPLEMENTATION StonPy is implemented in Python 3 under a GPLv3 license. Its code and complete documentation are freely available from https://github.com/adrienrougny/stonpy. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
7
|
Overview of Computational and Experimental Methods to Identify Tissue-Specific MicroRNA Targets. Methods Mol Biol 2023; 2630:155-177. [PMID: 36689183 DOI: 10.1007/978-1-0716-2982-6_12] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
As ubiquitous posttranscriptional regulators of gene expression, microRNAs (miRNAs) play key roles in cell physiology and function across taxa. In the last two decades, we have gained a good understanding about miRNA biogenesis pathways, modes of action, and consequences of miRNA-mediated gene regulation. More recently, research has focused on exploring causes for miRNA dysregulation, miRNA-mediated crosstalk between genes and signaling pathways, and the role of miRNAs in disease.This chapter discusses methods for the identification of miRNA-target interactions and causes for tissue-specific miRNA-target regulation. Computational approaches for predicting miRNA target sites and assessing tissue-specific target regulation are discussed. Moreover, there is an emphasis on features that affect miRNA target recognition and how high-throughput sequencing protocols can help in assessing miRNA-mediated gene regulation on a genome-wide scale. In addition, this chapter introduces some experimental approaches for the validation of miRNA targets as well as web-based resources sharing predicted and validated miRNA-target interactions.
Collapse
|
8
|
GraphML-SBGN bidirectional converter for metabolic networks. J Integr Bioinform 2022; 19:jib-2022-0030. [PMID: 36563404 PMCID: PMC9800040 DOI: 10.1515/jib-2022-0030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2022] [Accepted: 12/12/2022] [Indexed: 12/24/2022] Open
Abstract
Systems biology researchers need feasible solutions for editing and visualisation of large biological diagrams. Here, we present the ySBGN bidirectional converter that translates metabolic pathways, developed in the general-purpose yEd Graph Editor (using the GraphML format) into the Systems Biology Graphical Notation Markup Language (SBGN-ML) standard format and vice versa. We illustrate the functionality of this converter by applying it to the translation of the ReconMap resource (available in the SBGN-ML format) to the yEd-specific GraphML and back. The ySBGN tool makes possible to draw extensive metabolic diagrams in a powerful general-purpose graph editor while providing results in the standard SBGN format.
Collapse
|
9
|
Addressing barriers in comprehensiveness, accessibility, reusability, interoperability and reproducibility of computational models in systems biology. Brief Bioinform 2022; 23:bbac212. [PMID: 35671510 PMCID: PMC9294410 DOI: 10.1093/bib/bbac212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2022] [Revised: 04/20/2022] [Accepted: 05/06/2022] [Indexed: 11/14/2022] Open
Abstract
Computational models are often employed in systems biology to study the dynamic behaviours of complex systems. With the rise in the number of computational models, finding ways to improve the reusability of these models and their ability to reproduce virtual experiments becomes critical. Correct and effective model annotation in community-supported and standardised formats is necessary for this improvement. Here, we present recent efforts toward a common framework for annotated, accessible, reproducible and interoperable computational models in biology, and discuss key challenges of the field.
Collapse
|
10
|
Design considerations for representing systems biology information with the Systems Biology Graphical Notation. J Integr Bioinform 2022; 19:jib-2022-0024. [PMID: 35786424 PMCID: PMC9377698 DOI: 10.1515/jib-2022-0024] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 06/07/2022] [Indexed: 12/15/2022] Open
Abstract
Visual representations are commonly used to explore, analyse, and communicate information and knowledge in systems biology and beyond. Such visualisations not only need to be accurate but should also be aesthetically pleasing and informative. Using the example of the Systems Biology Graphical Notation (SBGN) we will investigate design considerations for graphically presenting information from systems biology, in particular regarding the use of glyphs for types of information, the style of graph layout for network representation, and the concept of bricks for visual network creation.
Collapse
|
11
|
Exploring the evolution of biochemical models at the network level. PLoS One 2022; 17:e0265735. [PMID: 35312734 PMCID: PMC8936491 DOI: 10.1371/journal.pone.0265735] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2021] [Accepted: 03/07/2022] [Indexed: 11/18/2022] Open
Abstract
The evolution of biochemical models is difficult to track. At present, it is not possible to inspect the differences between model versions at the network level. Biochemical models are often constructed in a distributed, non-linear process: collaborators create model versions on different branches from novel information, model extensions, during curation and adaption. To discuss and align the versions, it is helpful to abstract the changes to the network level. The differences between two model versions can be detected by the software tool BiVeS. However, it cannot show the structural changes resulting from the differences. Here, we present a method to visualise the differences between model versions effectively. We developed a JSON schema to communicate the differences at the network level and extended BiVeS accordingly. Additionally, we developed DiVil, a web-based tool to represent the model and the differences as a standardised network using D3. It combines an automatic layout with an interactive user interface to improve the visualisation and to inspect the model. The network can be exported in standardised formats as images or markup language. Our method communicates the structural differences between model versions. It facilitates the discussion of changes and thus supports the collaborative and non-linear nature of model development. Availability and implementation: DiVil prototype: https://divil.bio.informatik.uni-rostock.de, Code on GitHub: https://github.com/Gebbi8/DiVil, licensed under Apache License 2.0. Contact:url="tom.gebhardt@uni-rostock.de.
Collapse
|
12
|
Computational models in systems biology: standards, dissemination, and best practices. Curr Opin Biotechnol 2022; 75:102702. [PMID: 35217296 DOI: 10.1016/j.copbio.2022.102702] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Revised: 11/15/2021] [Accepted: 02/03/2022] [Indexed: 11/29/2022]
Abstract
Mathematical and computational models are a key technology in systems biology. Progress in the field depends on the replicability and reproducibility of their properties and behavior. For this, an essential requirement is a set of clear standards for model specification and dissemination. This review covers existing standards, and it highlights the most important areas where further work is required. This includes the specification of agent-based models, an increasingly common modeling approach.
Collapse
|
13
|
Modeling cholesterol metabolism and atherosclerosis. WIREs Mech Dis 2021; 14:e1546. [PMID: 34931487 DOI: 10.1002/wsbm.1546] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2020] [Revised: 10/11/2021] [Accepted: 10/14/2021] [Indexed: 12/19/2022]
Abstract
Atherosclerotic cardiovascular disease (ASCVD) is the leading cause of morbidity and mortality among Western populations. Many risk factors have been identified for ASCVD; however, elevated low-density lipoprotein cholesterol (LDL-C) remains the gold standard. Cholesterol metabolism at the cellular and whole-body level is maintained by an array of interacting components. These regulatory mechanisms have complex behavior. Likewise, the mechanisms which underpin atherogenesis are nontrivial and multifaceted. To help overcome the challenge of investigating these processes mathematical modeling, which is a core constituent of the systems biology paradigm has played a pivotal role in deciphering their dynamics. In so doing models have revealed new insights about the key drivers of ASCVD. The aim of this review is fourfold; to provide an overview of cholesterol metabolism and atherosclerosis, to briefly introduce mathematical approaches used in this field, to critically discuss models of cholesterol metabolism and atherosclerosis, and to highlight areas where mathematical modeling could help to investigate in the future. This article is categorized under: Cardiovascular Diseases > Computational Models.
Collapse
|
14
|
SBGNview: towards data analysis, integration and visualization on all pathways. Bioinformatics 2021; 38:1473-1476. [PMID: 34864890 PMCID: PMC8826166 DOI: 10.1093/bioinformatics/btab793] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2021] [Revised: 11/06/2021] [Accepted: 11/17/2021] [Indexed: 01/05/2023] Open
Abstract
SUMMARY Pathway analysis is widely used in genomics and omics research, but the data visualization has been highly limited in function, pathway coverage and data format. Here, we develop SBGNview a comprehensive R package to address these needs. By adopting the standard SBGN format, SBGNview greatly extend the coverage of pathway-based analysis and data visualization to essentially all major pathway databases beyond KEGG, including 5200 reference pathways and over 3000 species. In addition, SBGNview substantially extends or exceeds current tools (esp. Pathview) in both design and function, including standard input format (SBGN), high-quality output graphics (SVG format) convenient for both interpretation and further update, and flexible and open-end workflow for iterative editing and interactive visualization (Highlighter module). In addition to pathway analysis and data visualization, SBGNview provides essential infrastructure for SBGN data manipulation and processing. AVAILABILITY AND IMPLEMENTATION The data underlying this article are available as part of the SBGNview package is available on both GitHub and Bioconductor: https://github.com/datapplab/SBGNview, https://bioconductor.org/packages/SBGNview. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
|
15
|
High-Quality Genome-Scale Reconstruction of Corynebacterium glutamicum ATCC 13032. Front Microbiol 2021; 12:750206. [PMID: 34867870 PMCID: PMC8634658 DOI: 10.3389/fmicb.2021.750206] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Accepted: 10/19/2021] [Indexed: 11/30/2022] Open
Abstract
Corynebacterium glutamicum belongs to the microbes of enormous biotechnological relevance. In particular, its strain ATCC 13032 is a widely used producer of L-amino acids at an industrial scale. Its apparent robustness also turns it into a favorable platform host for a wide range of further compounds, mainly because of emerging bio-based economies. A deep understanding of the biochemical processes in C. glutamicum is essential for a sustainable enhancement of the microbe's productivity. Computational systems biology has the potential to provide a valuable basis for driving metabolic engineering and biotechnological advances, such as increased yields of healthy producer strains based on genome-scale metabolic models (GEMs). Advanced reconstruction pipelines are now available that facilitate the reconstruction of GEMs and support their manual curation. This article presents iCGB21FR, an updated and unified GEM of C. glutamicum ATCC 13032 with high quality regarding comprehensiveness and data standards, built with the latest modeling techniques and advanced reconstruction pipelines. It comprises 1042 metabolites, 1539 reactions, and 805 genes with detailed annotations and database cross-references. The model validation took place using different media and resulted in realistic growth rate predictions under aerobic and anaerobic conditions. The new GEM produces all canonical amino acids, and its phenotypic predictions are consistent with laboratory data. The in silico model proved fruitful in adding knowledge to the metabolism of C. glutamicum: iCGB21FR still produces L-glutamate with the knock-out of the enzyme pyruvate carboxylase, despite the common belief to be relevant for the amino acid's production. We conclude that integrating high standards into the reconstruction of GEMs facilitates replicating validated knowledge, closing knowledge gaps, and making it a useful basis for metabolic engineering. The model is freely available from BioModels Database under identifier MODEL2102050001.
Collapse
|
16
|
Specifications of standards in systems and synthetic biology: status and developments in 2021. J Integr Bioinform 2021; 18:jib-2021-0026. [PMID: 34674411 PMCID: PMC8573232 DOI: 10.1515/jib-2021-0026] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
This special issue of the Journal of Integrative Bioinformatics contains updated specifications of COMBINE standards in systems and synthetic biology. The 2021 special issue presents four updates of standards: Synthetic Biology Open Language Visual Version 2.3, Synthetic Biology Open Language Visual Version 3.0, Simulation Experiment Description Markup Language Level 1 Version 4, and OMEX Metadata specification Version 1.2. This document can also be consulted to identify the latest specifications of all COMBINE standards.
Collapse
|
17
|
The status of causality in biological databases: data resources and data retrieval possibilities to support logical modeling. Brief Bioinform 2021; 22:bbaa390. [PMID: 33378765 PMCID: PMC8294520 DOI: 10.1093/bib/bbaa390] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Revised: 11/26/2020] [Accepted: 11/27/2020] [Indexed: 12/16/2022] Open
Abstract
Causal molecular interactions represent key building blocks used in computational modeling, where they facilitate the assembly of regulatory networks. Logical regulatory networks can be used to predict biological and cellular behaviors by system perturbations and in silico simulations. Today, broad sets of causal interactions are available in a variety of biological knowledge resources. However, different visions, based on distinct biological interests, have led to the development of multiple ways to describe and annotate causal molecular interactions. It can therefore be challenging to efficiently explore various resources of causal interaction and maintain an overview of recorded contextual information that ensures valid use of the data. This review lists the different types of public resources with causal interactions, the different views on biological processes that they represent, the various data formats they use for data representation and storage, and the data exchange and conversion procedures that are available to extract and download these interactions. This may further raise awareness among the targeted audience, i.e. logical modelers and other scientists interested in molecular causal interactions, but also database managers and curators, about the abundance and variety of causal molecular interaction data, and the variety of tools and approaches to convert them into one interoperable resource.
Collapse
|
18
|
A detailed map of coupled circadian clock and cell cycle with qualitative dynamics validation. BMC Bioinformatics 2021; 22:240. [PMID: 33975535 PMCID: PMC8114686 DOI: 10.1186/s12859-021-04158-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 04/21/2021] [Indexed: 12/16/2022] Open
Abstract
Background The temporal coordination of biological processes by the circadian clock is an important mechanism, and its disruption has negative health outcomes, including cancer. Experimental and theoretical evidence suggests that the oscillators driving the circadian clock and the cell cycle are coupled through phase locking. Results We present a detailed and documented map of known mechanisms related to the regulation of the circadian clock, and its coupling with an existing cell cycle map which includes main interactions of the mammalian cell cycle. The coherence of the merged map has been validated with a qualitative dynamics analysis. We verified that the coupled circadian clock and cell cycle maps reproduce the observed sequence of phase markers. Moreover, we predicted mutations that contribute to regulating checkpoints of the two oscillators. Conclusions Our approach underlined the potential key role of the core clock protein NR1D1 in regulating cell cycle progression. We predicted that its activity influences negatively the progression of the cell cycle from phase G2 to M. This is consistent with the earlier experimental finding that pharmacological activation of NR1D1 inhibits tumour cell proliferation and shows that our approach can identify biologically relevant species in the context of large and complex networks. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-021-04158-9.
Collapse
|
19
|
Visual Exploration of Large Metabolic Models. Bioinformatics 2021; 37:4460-4468. [PMID: 33970212 DOI: 10.1093/bioinformatics/btab335] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2020] [Revised: 03/01/2021] [Accepted: 04/30/2021] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION Large metabolic models, including genome-scale metabolic models (GSMMs), are nowadays common in systems biology, biotechnology and pharmacology. They typically contain thousands of metabolites and reactions and therefore methods for their automatic visualisation and interactive exploration can facilitate a better understanding of these models. RESULTS We developed a novel method for the visual exploration of large metabolic models and implemented it in LMME (Large Metabolic Model Explorer), an add-on for the biological network analysis tool VANTED. The underlying idea of our method is to analyse a large model as follows. Starting from a decomposition into several subsystems, relationships between these subsystems are identified and an overview is computed and visualised. From this overview, detailed subviews may be constructed and visualised in order to explore subsystems and relationships in greater detail. Decompositions may either be predefined or computed, using built-in or self-implemented methods. Realised as add-on for VANTED, LMME is embedded in a domain-specific environment, allowing for further related analysis at any stage during the exploration. We describe the method, provide a use case, and discuss the strengths and weaknesses of different decomposition methods. AVAILABILITY The methods and algorithms presented here are implemented in LMME, an open-source add-on for VANTED. LMME can be downloaded from www.cls.uni-konstanz.de/software/lmme and VANTED can be downloaded from www.vanted.org. The source code of LMME is available from GitHub, at https://github.com/LSI-UniKonstanz/lmme.
Collapse
|
20
|
Reusability and composability in process description maps: RAS-RAF-MEK-ERK signalling. Brief Bioinform 2021; 22:6217719. [PMID: 33834185 PMCID: PMC8425390 DOI: 10.1093/bib/bbab103] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2021] [Revised: 03/05/2021] [Accepted: 03/08/2021] [Indexed: 12/05/2022] Open
Abstract
Detailed maps of the molecular basis of the disease are powerful tools for interpreting data and building predictive models. Modularity and composability are considered necessary network features for large-scale collaborative efforts to build comprehensive molecular descriptions of disease mechanisms. An effective way to create and manage large systems is to compose multiple subsystems. Composable network components could effectively harness the contributions of many individuals and enable teams to seamlessly assemble many individual components into comprehensive maps. We examine manually built versions of the RAS–RAF–MEK–ERK cascade from the Atlas of Cancer Signalling Network, PANTHER and Reactome databases and review them in terms of their reusability and composability for assembling new disease models. We identify design principles for managing complex systems that could make it easier for investigators to share and reuse network components. We demonstrate the main challenges including incompatible levels of detail and ambiguous representation of complexes and highlight the need to address these challenges.
Collapse
|
21
|
SBGN Bricks Ontology as a tool to describe recurring concepts in molecular networks. Brief Bioinform 2021; 22:6184415. [PMID: 33758926 PMCID: PMC8425392 DOI: 10.1093/bib/bbab049] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2020] [Revised: 01/13/2021] [Indexed: 11/14/2022] Open
Abstract
A comprehensible representation of a molecular network is key to communicating and understanding scientific results in systems biology. The Systems Biology Graphical Notation (SBGN) has emerged as the main standard to represent such networks graphically. It has been implemented by different software tools, and is now largely used to communicate maps in scientific publications. However, learning the standard, and using it to build large maps, can be tedious. Moreover, SBGN maps are not grounded on a formal semantic layer and therefore do not enable formal analysis. Here, we introduce a new set of patterns representing recurring concepts encountered in molecular networks, called SBGN bricks. The bricks are structured in a new ontology, the Bricks Ontology (BKO), to define clear semantics for each of the biological concepts they represent. We show the usefulness of the bricks and BKO for both the template-based construction and the semantic annotation of molecular networks. The SBGN bricks and BKO can be freely explored and downloaded at sbgnbricks.org.
Collapse
|
22
|
MINERVA, A Platform for the Exploration of Disease Maps. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11685-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
|
23
|
Converting disease maps into heavyweight ontologies: general methodology and application to Alzheimer's disease. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2021; 2021:6137817. [PMID: 33590873 DOI: 10.1093/database/baab004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 01/17/2021] [Accepted: 01/27/2021] [Indexed: 11/12/2022]
Abstract
Omics technologies offer great promises for improving our understanding of diseases. The integration and interpretation of such data pose major challenges, calling for adequate knowledge models. Disease maps provide curated knowledge about disorders' pathophysiology at the molecular level adapted to omics measurements. However, the expressiveness of disease maps could be increased to help in avoiding ambiguities and misinterpretations and to reinforce their interoperability with other knowledge resources. Ontology is an adequate framework to overcome this limitation, through their axiomatic definitions and logical reasoning properties. We introduce the Disease Map Ontology (DMO), an ontological upper model based on systems biology terms. We then propose to apply DMO to Alzheimer's disease (AD). Specifically, we use it to drive the conversion of AlzPathway, a disease map devoted to AD, into a formal ontology: Alzheimer DMO. We demonstrate that it allows one to deal with issues related to redundancy, naming, consistency, process classification and pathway relationships. Furthermore, we show that it can store and manage multi-omics data. Finally, we expand the model using elements from other resources, such as clinical features contained in the AD Ontology, resulting in an enriched model called ADMO-plus. The current versions of DMO, ADMO and ADMO-plus are freely available at http://bioportal.bioontology.org/ontologies/ADMO.
Collapse
|
24
|
Computational Model Informs Effective Control Interventions against Y. enterocolitica Co-Infection. BIOLOGY 2020; 9:E431. [PMID: 33266094 PMCID: PMC7759887 DOI: 10.3390/biology9120431] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2020] [Revised: 11/19/2020] [Accepted: 11/23/2020] [Indexed: 01/05/2023]
Abstract
The complex interplay between pathogens, host factors, and the integrity and composition of the endogenous microbiome determine the course and outcome of gastrointestinal infections. The model organism Yersinia entercolitica (Ye) is one of the five top frequent causes of bacterial gastroenteritis based on the Epidemiological Bulletin of the Robert Koch Institute (RKI), 10 September 2020. A fundamental challenge in predicting the course of an infection is to understand whether co-infection with two Yersinia strains, differing only in their capacity to resist killing by the host immune system, may decrease the overall virulence by competitive exclusion or increase it by acting cooperatively. Herein, we study the primary interactions among Ye, the host immune system and the microbiota, and their influence on Yersinia population dynamics. The employed model considers commensal bacterial in two host compartments (the intestinal mucosa the and lumen), the co-existence of wt and mut Yersinia strains, and the host immune responses. We determine four possible equilibria: disease-free, wt-free, mut-free, and co-existence of wt and mut in equilibrium. We also calculate the reproduction number for each strain as a threshold parameter to determine if the population may be eradicated or persist within the host. We conclude that the infection should disappear if the reproduction numbers for each strain fall below one, and the commensal bacteria growth rate exceeds the pathogen's growth rate. These findings will help inform medical control strategies. The supplement includes the MATLAB source script, Maple workbook, and figures.
Collapse
|
25
|
Pathway information extracted from 25 years of pathway figures. Genome Biol 2020; 21:273. [PMID: 33168034 PMCID: PMC7649569 DOI: 10.1186/s13059-020-02181-2] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2020] [Accepted: 10/16/2020] [Indexed: 12/16/2022] Open
Abstract
Thousands of pathway diagrams are published each year as static figures inaccessible to computational queries and analyses. Using a combination of machine learning, optical character recognition, and manual curation, we identified 64,643 pathway figures published between 1995 and 2019 and extracted 1,112,551 instances of human genes, comprising 13,464 unique NCBI genes, participating in a wide variety of biological processes. This collection represents an order of magnitude more genes than found in the text of the same papers, and thousands of genes missing from other pathway databases, thus presenting new opportunities for discovery and research.
Collapse
|
26
|
Computational approaches to understanding nutrient metabolism and metabolic disorders. Curr Opin Biotechnol 2020; 70:7-14. [PMID: 33038781 DOI: 10.1016/j.copbio.2020.09.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 09/01/2020] [Accepted: 09/06/2020] [Indexed: 10/23/2022]
Abstract
Computational methods are becoming more and more essential to elucidate biological systems. Many different approaches exist with pros and cons. This paper reviews the most useful technologies focusing on nutrient metabolism and metabolic disorders. Space limitation prevents from exploring the examples in details, but pointers to the relevant papers are reported.
Collapse
|
27
|
Specifications of standards in systems and synthetic biology: status and developments in 2020. J Integr Bioinform 2020; 17:jib-2020-0022. [PMID: 32598316 PMCID: PMC7756620 DOI: 10.1515/jib-2020-0022] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2020] [Accepted: 04/27/2020] [Indexed: 12/14/2022] Open
Abstract
This special issue of the Journal of Integrative Bioinformatics presents papers related to the 10th COMBINE meeting together with the annual update of COMBINE standards in systems and synthetic biology.
Collapse
|
28
|
The first 10 years of the international coordination network for standards in systems and synthetic biology (COMBINE). J Integr Bioinform 2020; 17:jib-2020-0005. [PMID: 32598315 PMCID: PMC7756615 DOI: 10.1515/jib-2020-0005] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2020] [Accepted: 05/14/2020] [Indexed: 01/23/2023] Open
Abstract
This paper presents a report on outcomes of the 10th Computational Modeling in Biology Network (COMBINE) meeting that was held in Heidelberg, Germany, in July of 2019. The annual event brings together researchers, biocurators and software engineers to present recent results and discuss future work in the area of standards for systems and synthetic biology. The COMBINE initiative coordinates the development of various community standards and formats for computational models in the life sciences. Over the past 10 years, COMBINE has brought together standard communities that have further developed and harmonized their standards for better interoperability of models and data. COMBINE 2019 was co-located with a stakeholder workshop of the European EU-STANDS4PM initiative that aims at harmonized data and model standardization for in silico models in the field of personalized medicine, as well as with the FAIRDOM PALs meeting to discuss findable, accessible, interoperable and reusable (FAIR) data sharing. This report briefly describes the work discussed in invited and contributed talks as well as during breakout sessions. It also highlights recent advancements in data, model, and annotation standardization efforts. Finally, this report concludes with some challenges and opportunities that this community will face during the next 10 years.
Collapse
|
29
|
Using Reactome to build an autophagy mechanism knowledgebase. Autophagy 2020; 17:1543-1554. [PMID: 32486891 PMCID: PMC8204961 DOI: 10.1080/15548627.2020.1761659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022] Open
Abstract
The 21st century has revealed much about the fundamental cellular process of autophagy. Autophagy controls the catabolism and recycling of various cellular components both as a constitutive process and as a response to stress and foreign material invasion. There is considerable knowledge of the molecular mechanisms of autophagy, and this is still growing as new modalities emerge. There is a need to investigate autophagy mechanisms reliably, comprehensively and conveniently. Reactome is a freely available knowledgebase that consists of manually curated molecular events (reactions) organized into cellular pathways (https://reactome.org/). Pathways/reactions in Reactome are hierarchically structured, graphically presented and extensively annotated. Data analysis tools, such as pathway enrichment, expression data overlay and species comparison, are also available. For customized analysis, information can also be programmatically queried. Here, we discuss the curation and annotation of the molecular mechanisms of autophagy in Reactome. We also demonstrate the value that Reactome adds to research by reanalyzing a previously published work on genome-wide CRISPR screening of autophagy components.Abbreviations: CMA: chaperone-mediated autophagy; GO: Gene Ontology; MA: macroautophagy; MI: microautophagy; MTOR: mechanistic target of rapamycin kinase; SQSTM1: sequestosome 1.
Collapse
|
30
|
Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources. Metabolites 2020; 10:E202. [PMID: 32429287 PMCID: PMC7281435 DOI: 10.3390/metabo10050202] [Citation(s) in RCA: 56] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 05/07/2020] [Accepted: 05/13/2020] [Indexed: 02/06/2023] Open
Abstract
As researchers are increasingly able to collect data on a large scale from multiple clinical and omics modalities, multi-omics integration is becoming a critical component of metabolomics research. This introduces a need for increased understanding by the metabolomics researcher of computational and statistical analysis methods relevant to multi-omics studies. In this review, we discuss common types of analyses performed in multi-omics studies and the computational and statistical methods that can be used for each type of analysis. We pinpoint the caveats and considerations for analysis methods, including required parameters, sample size and data distribution requirements, sources of a priori knowledge, and techniques for the evaluation of model accuracy. Finally, for the types of analyses discussed, we provide examples of the applications of corresponding methods to clinical and basic research. We intend that our review may be used as a guide for metabolomics researchers to choose effective techniques for multi-omics analyses relevant to their field of study.
Collapse
|
31
|
Visualizing metabolic network dynamics through time-series metabolomic data. BMC Bioinformatics 2020; 21:130. [PMID: 32245365 PMCID: PMC7119163 DOI: 10.1186/s12859-020-3415-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2019] [Accepted: 02/12/2020] [Indexed: 11/23/2022] Open
Abstract
Background New technologies have given rise to an abundance of -omics data, particularly metabolomic data. The scale of these data introduces new challenges for the interpretation and extraction of knowledge, requiring the development of innovative computational visualization methodologies. Here, we present GEM-Vis, an original method for the visualization of time-course metabolomic data within the context of metabolic network maps. We demonstrate the utility of the GEM-Vis method by examining previously published data for two cellular systems—the human platelet and erythrocyte under cold storage for use in transfusion medicine. Results The results comprise two animated videos that allow for new insights into the metabolic state of both cell types. In the case study of the platelet metabolome during storage, the new visualization technique elucidates a nicotinamide accumulation that mirrors that of hypoxanthine and might, therefore, reflect similar pathway usage. This visual analysis provides a possible explanation for why the salvage reactions in purine metabolism exhibit lower activity during the first few days of the storage period. The second case study displays drastic changes in specific erythrocyte metabolite pools at different times during storage at different temperatures. Conclusions The new visualization technique GEM-Vis introduced in this article constitutes a well-suitable approach for large-scale network exploration and advances hypothesis generation. This method can be applied to any system with data and a metabolic map to promote visualization and understand physiology at the network level. More broadly, we hope that our approach will provide the blueprints for new visualizations of other longitudinal -omics data types. The supplement includes a comprehensive user’s guide and links to a series of tutorial videos that explain how to prepare model and data files, and how to use the software SBMLsimulator in combination with further tools to create similar animations as highlighted in the case studies.
Collapse
|
32
|
RA-map: building a state-of-the-art interactive knowledge base for rheumatoid arthritis. Database (Oxford) 2020; 2020:baaa017. [PMID: 32311035 PMCID: PMC7170216 DOI: 10.1093/database/baaa017] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2019] [Revised: 01/21/2020] [Accepted: 02/13/2020] [Indexed: 02/07/2023]
Abstract
Rheumatoid arthritis (RA) is a progressive, inflammatory autoimmune disease of unknown aetiology. The complex mechanism of aetiopathogenesis, progress and chronicity of the disease involves genetic, epigenetic and environmental factors. To understand the molecular mechanisms underlying disease phenotypes, one has to place implicated factors in their functional context. However, integration and organization of such data in a systematic manner remains a challenging task. Molecular maps are widely used in biology to provide a useful and intuitive way of depicting a variety of biological processes and disease mechanisms. Recent large-scale collaborative efforts such as the Disease Maps Project demonstrate the utility of such maps as versatile tools to organize and formalize disease-specific knowledge in a comprehensive way, both human and machine-readable. We present a systematic effort to construct a fully annotated, expert validated, state-of-the-art knowledge base for RA in the form of a molecular map. The RA map illustrates molecular and signalling pathways implicated in the disease. Signal transduction is depicted from receptors to the nucleus using the Systems Biology Graphical Notation (SBGN) standard representation. High-quality manual curation, use of only human-specific studies and focus on small-scale experiments aim to limit false positives in the map. The state-of-the-art molecular map for RA, using information from 353 peer-reviewed scientific publications, comprises 506 species, 446 reactions and 8 phenotypes. The species in the map are classified to 303 proteins, 61 complexes, 106 genes, 106 RNA entities, 2 ions and 7 simple molecules. The RA map is available online at ramap.elixir-luxembourg.org as an open-access knowledge base allowing for easy navigation and search of molecular pathways implicated in the disease. Furthermore, the RA map can serve as a template for omics data visualization.
Collapse
|
33
|
Specifications of Standards in Systems and Synthetic Biology: Status and Developments in 2019. J Integr Bioinform 2019; 16:/j/jib.2019.16.issue-2/jib-2019-0035/jib-2019-0035.xml. [PMID: 31301675 PMCID: PMC6798822 DOI: 10.1515/jib-2019-0035] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
This special issue of the Journal of Integrative Bioinformatics presents an overview of COMBINE standards and their latest specifications. The standards cover representation formats for computational modeling in synthetic and systems biology and include BioPAX, CellML, NeuroML, SBML, SBGN, SBOL and SED-ML. The articles in this issue contain updated specifications of SBGN Process Description Level 1 Version 2, SBML Level 3 Core Version 2 Release 2, SBOL Version 2.3.0, and SBOL Visual Version 2.1.
Collapse
|