1
|
Gawron P, Hoksza D, Piñero J, Peña-Chilet M, Esteban-Medina M, Fernandez-Rueda JL, Colonna V, Smula E, Heirendt L, Ancien F, Groues V, Satagopam VP, Schneider R, Dopazo J, Furlong LI, Ostaszewski M. Visualization of automatically combined disease maps and pathway diagrams for rare diseases. FRONTIERS IN BIOINFORMATICS 2023; 3:1101505. [PMID: 37502697 PMCID: PMC10369067 DOI: 10.3389/fbinf.2023.1101505] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2022] [Accepted: 05/05/2023] [Indexed: 07/29/2023] Open
Abstract
Introduction: Investigation of molecular mechanisms of human disorders, especially rare diseases, require exploration of various knowledge repositories for building precise hypotheses and complex data interpretation. Recently, increasingly more resources offer diagrammatic representation of such mechanisms, including disease-dedicated schematics in pathway databases and disease maps. However, collection of knowledge across them is challenging, especially for research projects with limited manpower. Methods: In this article we present an automated workflow for construction of maps of molecular mechanisms for rare diseases. The workflow requires a standardized definition of a disease using Orphanet or HPO identifiers to collect relevant genes and variants, and to assemble a functional, visual repository of related mechanisms, including data overlays. The diagrams composing the final map are unified to a common systems biology format from CellDesigner SBML, GPML and SBML+layout+render. The constructed resource contains disease-relevant genes and variants as data overlays for immediate visual exploration, including embedded genetic variant browser and protein structure viewer. Results: We demonstrate the functionality of our workflow on two examples of rare diseases: Kawasaki disease and retinitis pigmentosa. Two maps are constructed based on their corresponding identifiers. Moreover, for the retinitis pigmentosa use-case, we include a list of differentially expressed genes to demonstrate how to tailor the workflow using omics datasets. Discussion: In summary, our work allows for an ad-hoc construction of molecular diagrams combined from different sources, preserving their layout and graphical style, but integrating them into a single resource. This allows to reduce time consuming tasks of prototyping of a molecular disease map, enabling visual exploration, hypothesis building, data visualization and further refinement. The code of the workflow is open and accessible at https://gitlab.lcsb.uni.lu/minerva/automap/.
Collapse
Affiliation(s)
- Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - David Hoksza
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
- Faculty of Mathematics and Physics, Charles University, Prague, Czechia
| | - Janet Piñero
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
- Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
- MedBioinformatics Solutions SL, Barcelona, Spain
| | - Maria Peña-Chilet
- Computational Medicine Platform, Fundacion Progreso y Salud, Sevilla, Spain
- Spanish Network of Research in Rare Diseases (CIBERER), Sevilla, Spain
| | | | | | - Vincenza Colonna
- Institute of Genetics and Biophysics, National Research Council of Italy, Naples, Rome
- Department of Genetics, Genomics and Informatics, College of Medicine, University of Tennessee Health Science Center, Memphis, TN, United States
| | - Ewa Smula
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Laurent Heirendt
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - François Ancien
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Valentin Groues
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Venkata P. Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| | - Joaquin Dopazo
- Computational Medicine Platform, Fundacion Progreso y Salud, Sevilla, Spain
- Spanish Network of Research in Rare Diseases (CIBERER), Sevilla, Spain
| | - Laura I. Furlong
- Research Programme on Biomedical Informatics (GRIB), Hospital del Mar Medical Research Institute (IMIM), Barcelona, Spain
- Department of Experimental and Health Sciences, Pompeu Fabra University (UPF), Barcelona, Spain
- MedBioinformatics Solutions SL, Barcelona, Spain
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Luxembourg, Luxembourg
| |
Collapse
|
2
|
Mazein A, Acencio ML, Balaur I, Rougny A, Welter D, Niarakis A, Ramirez Ardila D, Dogrusoz U, Gawron P, Satagopam V, Gu W, Kremer A, Schneider R, Ostaszewski M. A guide for developing comprehensive systems biology maps of disease mechanisms: planning, construction and maintenance. FRONTIERS IN BIOINFORMATICS 2023; 3:1197310. [PMID: 37426048 PMCID: PMC10325725 DOI: 10.3389/fbinf.2023.1197310] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Accepted: 06/09/2023] [Indexed: 07/11/2023] Open
Abstract
As a conceptual model of disease mechanisms, a disease map integrates available knowledge and is applied for data interpretation, predictions and hypothesis generation. It is possible to model disease mechanisms on different levels of granularity and adjust the approach to the goals of a particular project. This rich environment together with requirements for high-quality network reconstruction makes it challenging for new curators and groups to be quickly introduced to the development methods. In this review, we offer a step-by-step guide for developing a disease map within its mainstream pipeline that involves using the CellDesigner tool for creating and editing diagrams and the MINERVA Platform for online visualisation and exploration. We also describe how the Neo4j graph database environment can be used for managing and querying efficiently such a resource. For assessing the interoperability and reproducibility we apply FAIR principles.
Collapse
Affiliation(s)
- Alexander Mazein
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Marcio Luis Acencio
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Irina Balaur
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | | | - Danielle Welter
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Anna Niarakis
- Université Paris-Saclay, Laboratoire Européen de Recherche Pour la Polyarthrite Rhumatoïde–Genhotel, University Evry, Evry, France
- Lifeware Group, Inria Saclay-Ile de France, Palaiseau, France
| | - Diana Ramirez Ardila
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Ugur Dogrusoz
- Computer Engineering Department, Bilkent University, Ankara, Türkiye
| | - Piotr Gawron
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
| | - Venkata Satagopam
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Wei Gu
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Andreas Kremer
- ITTM Information Technology for Translational Medicine, Esch-sur-Alzette, Luxemburg
| | - Reinhard Schneider
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| | - Marek Ostaszewski
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Esch-sur-Alzette, Luxembourg
- ELIXIR Luxembourg, Belvaux, Luxembourg
| |
Collapse
|
3
|
Kermani NZ, Adcock IM, Djukanović R, Chung F, Schofield JPR. Systems Biology in Asthma. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2023; 1426:215-235. [PMID: 37464123 DOI: 10.1007/978-3-031-32259-4_10] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2023]
Abstract
The application of mathematical and computational analysis, together with the modelling of biological and physiological processes, is transforming our understanding of the pathophysiology of complex diseases. This systems biology approach incorporates large amounts of genomic, transcriptomic, proteomic, metabolomic, breathomic, metagenomic and imaging data from disease sites together with deep clinical phenotyping, including patient-reported outcomes. Integration of these datasets will provide a greater understanding of the molecular pathways associated with severe asthma in each individual patient and determine their personalised treatment regime. This chapter describes some of the data integration methods used to combine data sets and gives examples of the results obtained using single datasets and merging of multiple datasets (data fusion and data combination) from several consortia including the severe asthma research programme (SARP) and the Unbiased Biomarkers Predictive of Respiratory Disease Outcomes (U-BIOPRED) consortia. These results highlight the involvement of several different immune and inflammatory pathways and factors in distinct subsets of patients with severe asthma. These pathways often overlap in patients with distinct clinical features of asthma, which may explain the incomplete or no response in patients undergoing specific targeted therapy. Collaboration between groups will improve the predictions obtained using a systems medicine approach in severe asthma.
Collapse
Affiliation(s)
- Nazanin Zounemat Kermani
- Data Science Institute, Imperial College London, London, UK
- National Heart & Lung Institute, Imperial College London, London, UK
| | - Ian M Adcock
- National Heart & Lung Institute, Imperial College London, London, UK
| | - Ratko Djukanović
- NIHR Southampton Biomedical Research Centre, Clinical and Experimental Sciences, Faculty of Medicine, University of Southampton, Southampton, UK.
| | - Fan Chung
- National Heart & Lung Institute, Imperial College London, London, UK
- Biomedical Research Unit, Royal Brompton & Harefield NHS Trust, London, UK
| | - James P R Schofield
- Centre for Proteomic Research, Institute for Life Sciences, University of Southampton, Southampton, UK
- TopMD Precision Medicine Ltd, Southampton, UK
| |
Collapse
|
5
|
Glavaški M, Velicki L. Humans and machines in biomedical knowledge curation: hypertrophic cardiomyopathy molecular mechanisms' representation. BioData Min 2021; 14:45. [PMID: 34600580 PMCID: PMC8487578 DOI: 10.1186/s13040-021-00279-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2021] [Accepted: 09/14/2021] [Indexed: 11/25/2022] Open
Abstract
Background Biomedical knowledge is dispersed in scientific literature and is growing constantly. Curation is the extraction of knowledge from unstructured data into a computable form and could be done manually or automatically. Hypertrophic cardiomyopathy (HCM) is the most common inherited cardiac disease, with genotype–phenotype associations still incompletely understood. We compared human- and machine-curated HCM molecular mechanisms’ models and examined the performance of different machine approaches for that task. Results We created six models representing HCM molecular mechanisms using different approaches and made them publicly available, analyzed them as networks, and tried to explain the models’ differences by the analysis of factors that affect the quality of machine-curated models (query constraints and reading systems’ performance). A result of this work is also the Interactive HCM map, the only publicly available knowledge resource dedicated to HCM. Sizes and topological parameters of the networks differed notably, and a low consensus was found in terms of centrality measures between networks. Consensus about the most important nodes was achieved only with respect to one element (calcium). Models with a reduced level of noise were generated and cooperatively working elements were detected. REACH and TRIPS reading systems showed much higher accuracy than Sparser, but at the cost of extraction performance. TRIPS proved to be the best single reading system for text segments about HCM, in terms of the compromise between accuracy and extraction performance. Conclusions Different approaches in curation can produce models of the same disease with diverse characteristics, and they give rise to utterly different conclusions in subsequent analysis. The final purpose of the model should direct the choice of curation techniques. Manual curation represents the gold standard for information extraction in biomedical research and is most suitable when only high-quality elements for models are required. Automated curation provides more substance, but high level of noise is expected. Different curation strategies can reduce the level of human input needed. Biomedical knowledge would benefit overwhelmingly, especially as to its rapid growth, if computers were to be able to assist in analysis on a larger scale. Supplementary Information The online version contains supplementary material available at 10.1186/s13040-021-00279-2.
Collapse
Affiliation(s)
- Mila Glavaški
- Faculty of Medicine, University of Novi Sad, Novi Sad, Serbia.
| | - Lazar Velicki
- Faculty of Medicine, University of Novi Sad, Novi Sad, Serbia.,Institute of Cardiovascular Diseases Vojvodina, Sremska Kamenica, Serbia
| |
Collapse
|