1
|
Zhang C, Sánchez BJ, Li F, Eiden CWQ, Scott WT, Liebal UW, Blank LM, Mengers HG, Anton M, Rangel AT, Mendoza SN, Zhang L, Nielsen J, Lu H, Kerkhoven EJ. Yeast9: a consensus genome-scale metabolic model for S. cerevisiae curated by the community. Mol Syst Biol 2024; 20:1134-1150. [PMID: 39134886 PMCID: PMC11450192 DOI: 10.1038/s44320-024-00060-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2024] [Revised: 07/17/2024] [Accepted: 07/31/2024] [Indexed: 10/05/2024] Open
Abstract
Genome-scale metabolic models (GEMs) can facilitate metabolism-focused multi-omics integrative analysis. Since Yeast8, the yeast-GEM of Saccharomyces cerevisiae, published in 2019, has been continuously updated by the community. This has increased the quality and scope of the model, culminating now in Yeast9. To evaluate its predictive performance, we generated 163 condition-specific GEMs constrained by single-cell transcriptomics from osmotic pressure or reference conditions. Comparative flux analysis showed that yeast adapting to high osmotic pressure benefits from upregulating fluxes through central carbon metabolism. Furthermore, combining Yeast9 with proteomics revealed metabolic rewiring underlying its preference for nitrogen sources. Lastly, we created strain-specific GEMs (ssGEMs) constrained by transcriptomics for 1229 mutant strains. Well able to predict the strains' growth rates, fluxomics from those large-scale ssGEMs outperformed transcriptomics in predicting functional categories for all studied genes in machine learning models. Based on those findings we anticipate that Yeast9 will continue to empower systems biology studies of yeast metabolism.
Collapse
Affiliation(s)
- Chengyu Zhang
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 200240, Shanghai, China
- State Key Laboratory of Bioreactor Engineering, and School of Biotechnology, East China University of Science and Technology (ECUST), 200237, Shanghai, China
| | - Benjamín J Sánchez
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs, Lyngby, Denmark
- Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kgs, Lyngby, Denmark
| | - Feiran Li
- Institute of Biopharmaceutical and Health Engineering, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen, 518055, China
| | - Cheng Wei Quan Eiden
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, 62 Nanyang Drive, Singapore, 637459, Singapore
| | - William T Scott
- UNLOCK, Wageningen University & Research, Wageningen, The Netherlands
- Laboratory of Systems and Synthetic Biology, Wageningen University & Research, Wageningen, The Netherlands
| | - Ulf W Liebal
- Institute of Applied Microbiology - iAMB, Aachen Biology and Biotechnology - ABBt, RWTH Aachen University, 52074, Aachen, Germany
| | - Lars M Blank
- Institute of Applied Microbiology - iAMB, Aachen Biology and Biotechnology - ABBt, RWTH Aachen University, 52074, Aachen, Germany
| | - Hendrik G Mengers
- Institute of Applied Microbiology - iAMB, Aachen Biology and Biotechnology - ABBt, RWTH Aachen University, 52074, Aachen, Germany
| | - Mihail Anton
- Department of Life Sciences, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Chalmers University of Technology, Gothenburg, SE412 58, Sweden
| | - Albert Tafur Rangel
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs, Lyngby, Denmark
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, SE412 96, Sweden
| | - Sebastián N Mendoza
- Center for Mathematical Modeling, University of Chile, Santiago, Chile
- Systems Biology Lab, Vrije Universiteit Amsterdam, Amsterdam, The Netherlands
| | - Lixin Zhang
- State Key Laboratory of Bioreactor Engineering, and School of Biotechnology, East China University of Science and Technology (ECUST), 200237, Shanghai, China
| | - Jens Nielsen
- Department of Life Sciences, Chalmers University of Technology, Gothenburg, SE412 96, Sweden
- BioInnovation Institute, Ole Maaløes Vej 3, DK2200, Copenhagen N, Denmark
| | - Hongzhong Lu
- State Key Laboratory of Microbial Metabolism, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, 200240, Shanghai, China.
| | - Eduard J Kerkhoven
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, DK-2800 Kgs, Lyngby, Denmark.
- Department of Life Sciences, SciLifeLab, Chalmers University of Technology, Gothenburg, SE412 96, Sweden.
| |
Collapse
|
2
|
Kundu P, Beura S, Mondal S, Das AK, Ghosh A. Machine learning for the advancement of genome-scale metabolic modeling. Biotechnol Adv 2024; 74:108400. [PMID: 38944218 DOI: 10.1016/j.biotechadv.2024.108400] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 05/13/2024] [Accepted: 06/23/2024] [Indexed: 07/01/2024]
Abstract
Constraint-based modeling (CBM) has evolved as the core systems biology tool to map the interrelations between genotype, phenotype, and external environment. The recent advancement of high-throughput experimental approaches and multi-omics strategies has generated a plethora of new and precise information from wide-ranging biological domains. On the other hand, the continuously growing field of machine learning (ML) and its specialized branch of deep learning (DL) provide essential computational architectures for decoding complex and heterogeneous biological data. In recent years, both multi-omics and ML have assisted in the escalation of CBM. Condition-specific omics data, such as transcriptomics and proteomics, helped contextualize the model prediction while analyzing a particular phenotypic signature. At the same time, the advanced ML tools have eased the model reconstruction and analysis to increase the accuracy and prediction power. However, the development of these multi-disciplinary methodological frameworks mainly occurs independently, which limits the concatenation of biological knowledge from different domains. Hence, we have reviewed the potential of integrating multi-disciplinary tools and strategies from various fields, such as synthetic biology, CBM, omics, and ML, to explore the biochemical phenomenon beyond the conventional biological dogma. How the integrative knowledge of these intersected domains has improved bioengineering and biomedical applications has also been highlighted. We categorically explained the conventional genome-scale metabolic model (GEM) reconstruction tools and their improvement strategies through ML paradigms. Further, the crucial role of ML and DL in omics data restructuring for GEM development has also been briefly discussed. Finally, the case-study-based assessment of the state-of-the-art method for improving biomedical and metabolic engineering strategies has been elaborated. Therefore, this review demonstrates how integrating experimental and in silico strategies can help map the ever-expanding knowledge of biological systems driven by condition-specific cellular information. This multiview approach will elevate the application of ML-based CBM in the biomedical and bioengineering fields for the betterment of society and the environment.
Collapse
Affiliation(s)
- Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Satyajit Beura
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Suman Mondal
- P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Kumar Das
- Department of Bioscience and Biotechnology, Indian Institute of Technology, Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
3
|
Turanli B, Gulfidan G, Aydogan OO, Kula C, Selvaraj G, Arga KY. Genome-scale metabolic models in translational medicine: the current status and potential of machine learning in improving the effectiveness of the models. Mol Omics 2024; 20:234-247. [PMID: 38444371 DOI: 10.1039/d3mo00152k] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
The genome-scale metabolic model (GEM) has emerged as one of the leading modeling approaches for systems-level metabolic studies and has been widely explored for a broad range of organisms and applications. Owing to the development of genome sequencing technologies and available biochemical data, it is possible to reconstruct GEMs for model and non-model microorganisms as well as for multicellular organisms such as humans and animal models. GEMs will evolve in parallel with the availability of biological data, new mathematical modeling techniques and the development of automated GEM reconstruction tools. The use of high-quality, context-specific GEMs, a subset of the original GEM in which inactive reactions are removed while maintaining metabolic functions in the extracted model, for model organisms along with machine learning (ML) techniques could increase their applications and effectiveness in translational research in the near future. Here, we briefly review the current state of GEMs, discuss the potential contributions of ML approaches for more efficient and frequent application of these models in translational research, and explore the extension of GEMs to integrative cellular models.
Collapse
Affiliation(s)
- Beste Turanli
- Marmara University, Faculty of Engineering, Department of Bioengineering, Istanbul, Turkey.
- Health Biotechnology Joint Research and Application Center of Excellence, Istanbul, Turkey
| | - Gizem Gulfidan
- Marmara University, Faculty of Engineering, Department of Bioengineering, Istanbul, Turkey.
| | - Ozge Onluturk Aydogan
- Marmara University, Faculty of Engineering, Department of Bioengineering, Istanbul, Turkey.
| | - Ceyda Kula
- Marmara University, Faculty of Engineering, Department of Bioengineering, Istanbul, Turkey.
- Health Biotechnology Joint Research and Application Center of Excellence, Istanbul, Turkey
| | - Gurudeeban Selvaraj
- Concordia University, Centre for Research in Molecular Modeling & Department of Chemistry and Biochemistry, Quebec, Canada
- Saveetha Institute of Medical and Technical Sciences (SIMATS), Saveetha Dental College and Hospital, Department of Biomaterials, Bioinformatics Unit, Chennai, India
| | - Kazim Yalcin Arga
- Marmara University, Faculty of Engineering, Department of Bioengineering, Istanbul, Turkey.
- Health Biotechnology Joint Research and Application Center of Excellence, Istanbul, Turkey
- Marmara University, Genetic and Metabolic Diseases Research and Investigation Center, Istanbul, Turkey
| |
Collapse
|
4
|
Sen P, Orešič M. Integrating Omics Data in Genome-Scale Metabolic Modeling: A Methodological Perspective for Precision Medicine. Metabolites 2023; 13:855. [PMID: 37512562 PMCID: PMC10383060 DOI: 10.3390/metabo13070855] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 07/11/2023] [Accepted: 07/17/2023] [Indexed: 07/30/2023] Open
Abstract
Recent advancements in omics technologies have generated a wealth of biological data. Integrating these data within mathematical models is essential to fully leverage their potential. Genome-scale metabolic models (GEMs) provide a robust framework for studying complex biological systems. GEMs have significantly contributed to our understanding of human metabolism, including the intrinsic relationship between the gut microbiome and the host metabolism. In this review, we highlight the contributions of GEMs and discuss the critical challenges that must be overcome to ensure their reproducibility and enhance their prediction accuracy, particularly in the context of precision medicine. We also explore the role of machine learning in addressing these challenges within GEMs. The integration of omics data with GEMs has the potential to lead to new insights, and to advance our understanding of molecular mechanisms in human health and disease.
Collapse
Affiliation(s)
- Partho Sen
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| | - Matej Orešič
- Turku Bioscience Centre, University of Turku and Åbo Akademi University, FI-20520 Turku, Finland
- School of Medical Sciences, Faculty of Medicine and Health, Örebro University, 702 81 Örebro, Sweden
| |
Collapse
|
5
|
Jalili M, Scharm M, Wolkenhauer O, Salehzadeh-Yazdi A. Metabolic function-based normalization improves transcriptome data-driven reduction of genome-scale metabolic models. NPJ Syst Biol Appl 2023; 9:15. [PMID: 37210409 DOI: 10.1038/s41540-023-00281-w] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Accepted: 05/10/2023] [Indexed: 05/22/2023] Open
Abstract
Genome-scale metabolic models (GEMs) are extensively used to simulate cell metabolism and predict cell phenotypes. GEMs can also be tailored to generate context-specific GEMs, using omics data integration approaches. To date, many integration approaches have been developed, however, each with specific pros and cons; and none of these algorithms systematically outperforms the others. The key to successful implementation of such integration algorithms lies in the optimal selection of parameters, and thresholding is a crucial component in this process. To improve the predictive accuracy of context-specific models, we introduce a new integration framework that improves the ranking of related genes and homogenizes the expression values of those gene sets using single-sample Gene Set Enrichment Analysis (ssGSEA). In this study, we coupled ssGSEA with GIMME and validated the advantages of the proposed framework to predict the ethanol formation of yeast grown in the glucose-limited chemostats, and to simulate metabolic behaviors of yeast growth in four different carbon sources. This framework enhances the predictive accuracy of GIMME which we demonstrate for predicting the yeast physiology in nutrient-limited cultures.
Collapse
Affiliation(s)
- Mahdi Jalili
- Hematology, Oncology and SCT Research Center, Tehran University of Medical Sciences, Tehran, Iran
| | | | - Olaf Wolkenhauer
- Department of Systems Biology and Bioinformatics, University of Rostock, Rostock, Germany
- Stellenbosch University, Stellenbosch Institute for Advanced Study (STIAS), Wallenberg Research Centre, Stellenbosch, South Africa
- Leibniz Institute for Food Systems Biology at the Technical University Munich, Freising, Germany
| | | |
Collapse
|
6
|
Zampieri G, Campanaro S, Angione C, Treu L. Metatranscriptomics-guided genome-scale metabolic modeling of microbial communities. CELL REPORTS METHODS 2023; 3:100383. [PMID: 36814842 PMCID: PMC9939383 DOI: 10.1016/j.crmeth.2022.100383] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Revised: 10/07/2022] [Accepted: 12/12/2022] [Indexed: 01/09/2023]
Abstract
Multi-omics data integration via mechanistic models of metabolism is a scalable and flexible framework for exploring biological hypotheses in microbial systems. However, although most microorganisms are unculturable, such multi-omics modeling is limited to isolate microbes or simple synthetic communities. Here, we developed an approach for modeling microbial activity and interactions that leverages the reconstruction of metagenome-assembled genomes and associated genome-centric metatranscriptomes. At its core, we designed a method for condition-specific metabolic modeling of microbial communities through the integration of metatranscriptomic data. Using this approach, we explored the behavior of anaerobic digestion consortia driven by hydrogen availability and human gut microbiota dysbiosis associated with Crohn's disease, identifying condition-dependent amino acid requirements in archaeal species and a reduced short-chain fatty acid exchange network associated with disease, respectively. Our approach can be applied to complex microbial communities, allowing a mechanistic contextualization of multi-omics data on a metagenome scale.
Collapse
Affiliation(s)
- Guido Zampieri
- Department of Biology, University of Padova, Padova 35121, Italy
| | - Stefano Campanaro
- Department of Biology, University of Padova, Padova 35121, Italy
- CRIBI Biotechnology Center, University of Padova, Padova 35121, Italy
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough TS1 3BX, UK
- National Horizons Centre, Teesside University, Darlington DL1 1HG, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough TS1 3BX, UK
| | - Laura Treu
- Department of Biology, University of Padova, Padova 35121, Italy
| |
Collapse
|
7
|
Context-Specific Genome-Scale Metabolic Modelling and Its Application to the Analysis of COVID-19 Metabolic Signatures. Metabolites 2023; 13:metabo13010126. [PMID: 36677051 PMCID: PMC9866716 DOI: 10.3390/metabo13010126] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2022] [Revised: 12/27/2022] [Accepted: 01/10/2023] [Indexed: 01/19/2023] Open
Abstract
Genome-scale metabolic models (GEMs) have found numerous applications in different domains, ranging from biotechnology to systems medicine. Herein, we overview the most popular algorithms for the automated reconstruction of context-specific GEMs using high-throughput experimental data. Moreover, we describe different datasets applied in the process, and protocols that can be used to further automate the model reconstruction and validation. Finally, we describe recent COVID-19 applications of context-specific GEMs, focusing on the analysis of metabolic implications, identification of biomarkers and potential drug targets.
Collapse
|
8
|
Patra P, B R D, Kundu P, Das M, Ghosh A. Recent advances in machine learning applications in metabolic engineering. Biotechnol Adv 2023; 62:108069. [PMID: 36442697 DOI: 10.1016/j.biotechadv.2022.108069] [Citation(s) in RCA: 26] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Revised: 10/18/2022] [Accepted: 11/22/2022] [Indexed: 11/27/2022]
Abstract
Metabolic engineering encompasses several widely-used strategies, which currently hold a high seat in the field of biotechnology when its potential is manifesting through a plethora of research and commercial products with a strong societal impact. The genomic revolution that occurred almost three decades ago has initiated the generation of large omics-datasets which has helped in gaining a better understanding of cellular behavior. The itinerary of metabolic engineering that has occurred based on these large datasets has allowed researchers to gain detailed insights and a reasonable understanding of the intricacies of biosystems. However, the existing trail-and-error approaches for metabolic engineering are laborious and time-intensive when it comes to the production of target compounds with high yields through genetic manipulations in host organisms. Machine learning (ML) coupled with the available metabolic engineering test instances and omics data brings a comprehensive and multidisciplinary approach that enables scientists to evaluate various parameters for effective strain design. This vast amount of biological data should be standardized through knowledge engineering to train different ML models for providing accurate predictions in gene circuits designing, modification of proteins, optimization of bioprocess parameters for scaling up, and screening of hyper-producing robust cell factories. This review briefs on the premise of ML, followed by mentioning various ML methods and algorithms alongside the numerous omics datasets available to train ML models for predicting metabolic outcomes with high-accuracy. The combinative interplay between the ML algorithms and biological datasets through knowledge engineering have guided the recent advancements in applications such as CRISPR/Cas systems, gene circuits, protein engineering, metabolic pathway reconstruction, and bioprocess engineering. Finally, this review addresses the probable challenges of applying ML in metabolic engineering which will guide the researchers toward novel techniques to overcome the limitations.
Collapse
Affiliation(s)
- Pradipta Patra
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Disha B R
- B.M.S College of Engineering, Basavanagudi, Bengaluru, Karnataka 560019, India
| | - Pritam Kundu
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Manali Das
- School of Bioscience, Indian Institute of Technology Kharagpur, West Bengal 721302, India
| | - Amit Ghosh
- School School of Energy Science and Engineering, Indian Institute of Technology Kharagpur, West Bengal 721302, India; P.K. Sinha Centre for Bioenergy and Renewables, Indian Institute of Technology Kharagpur, West Bengal 721302, India.
| |
Collapse
|
9
|
Magazzù G, Zampieri G, Angione C. Clinical stratification improves the diagnostic accuracy of small omics datasets within machine learning and genome-scale metabolic modelling methods. Comput Biol Med 2022; 151:106244. [PMID: 36343407 DOI: 10.1016/j.compbiomed.2022.106244] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Revised: 10/07/2022] [Accepted: 10/22/2022] [Indexed: 12/27/2022]
Abstract
BACKGROUND Recently, multi-omic machine learning architectures have been proposed for the early detection of cancer. However, for rare cancers and their associated small datasets, it is still unclear how to use the available multi-omics data to achieve a mechanistic prediction of cancer onset and progression, due to the limited data available. Hepatoblastoma is the most frequent liver cancer in infancy and childhood, and whose incidence has been lately increasing in several developed countries. Even though some studies have been conducted to understand the causes of its onset and discover potential biomarkers, the role of metabolic rewiring has not been investigated in depth so far. METHODS Here, we propose and implement an interpretable multi-omics pipeline that combines mechanistic knowledge from genome-scale metabolic models with machine learning algorithms, and we use it to characterise the underlying mechanisms controlling hepatoblastoma. RESULTS AND CONCLUSIONS While the obtained machine learning models generally present a high diagnostic classification accuracy, our results show that the type of omics combinations used as input to the machine learning models strongly affects the detection of important genes, reactions and metabolic pathways linked to hepatoblastoma. Our method also suggests that, in the context of computer-aided diagnosis of cancer, optimal diagnostic accuracy can be achieved by adopting a combination of omics that depends on the patient's clinical characteristics.
Collapse
Affiliation(s)
- Giuseppe Magazzù
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom
| | - Guido Zampieri
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom; Department of Biology, University of Padova, Padova, Italy
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, England, United Kingdom; Centre for Digital Innovation, Teesside University, Middlesbrough, England, United Kingdom; National Horizons Centre, Teesside University, Darlington, England, United Kingdom.
| |
Collapse
|
10
|
Gosselin MRF, Mournetas V, Borczyk M, Verma S, Occhipinti A, Róg J, Bozycki L, Korostynski M, Robson SC, Angione C, Pinset C, Gorecki DC. Loss of full-length dystrophin expression results in major cell-autonomous abnormalities in proliferating myoblasts. eLife 2022; 11:e75521. [PMID: 36164827 PMCID: PMC9514850 DOI: 10.7554/elife.75521] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 09/02/2022] [Indexed: 12/05/2022] Open
Abstract
Duchenne muscular dystrophy (DMD) affects myofibers and muscle stem cells, causing progressive muscle degeneration and repair defects. It was unknown whether dystrophic myoblasts-the effector cells of muscle growth and regeneration-are affected. Using transcriptomic, genome-scale metabolic modelling and functional analyses, we demonstrate, for the first time, convergent abnormalities in primary mouse and human dystrophic myoblasts. In Dmdmdx myoblasts lacking full-length dystrophin, the expression of 170 genes was significantly altered. Myod1 and key genes controlled by MyoD (Myog, Mymk, Mymx, epigenetic regulators, ECM interactors, calcium signalling and fibrosis genes) were significantly downregulated. Gene ontology analysis indicated enrichment in genes involved in muscle development and function. Functionally, we found increased myoblast proliferation, reduced chemotaxis and accelerated differentiation, which are all essential for myoregeneration. The defects were caused by the loss of expression of full-length dystrophin, as similar and not exacerbated alterations were observed in dystrophin-null Dmdmdx-βgeo myoblasts. Corresponding abnormalities were identified in human DMD primary myoblasts and a dystrophic mouse muscle cell line, confirming the cross-species and cell-autonomous nature of these defects. The genome-scale metabolic analysis in human DMD myoblasts showed alterations in the rate of glycolysis/gluconeogenesis, leukotriene metabolism, and mitochondrial beta-oxidation of various fatty acids. These results reveal the disease continuum: DMD defects in satellite cells, the myoblast dysfunction affecting muscle regeneration, which is insufficient to counteract muscle loss due to myofiber instability. Contrary to the established belief, our data demonstrate that DMD abnormalities occur in myoblasts, making these cells a novel therapeutic target for the treatment of this lethal disease.
Collapse
Affiliation(s)
- Maxime RF Gosselin
- School of Pharmacy and Biomedical Sciences, University of PortsmouthPortsmouthUnited Kingdom
| | | | - Malgorzata Borczyk
- Laboratory of Pharmacogenomics, Maj Institute of Pharmacology PASKrakowPoland
| | - Suraj Verma
- School of Computing, Engineering and Digital Technologies, Teesside UniversityMiddlesbroughUnited Kingdom
| | - Annalisa Occhipinti
- School of Computing, Engineering and Digital Technologies, Teesside UniversityMiddlesbroughUnited Kingdom
| | - Justyna Róg
- School of Pharmacy and Biomedical Sciences, University of PortsmouthPortsmouthUnited Kingdom
- Laboratory of Cellular Metabolism, Nencki Institute of Experimental BiologyWarsawPoland
| | - Lukasz Bozycki
- School of Pharmacy and Biomedical Sciences, University of PortsmouthPortsmouthUnited Kingdom
- Laboratory of Cellular Metabolism, Nencki Institute of Experimental BiologyWarsawPoland
| | - Michal Korostynski
- Laboratory of Pharmacogenomics, Maj Institute of Pharmacology PASKrakowPoland
| | - Samuel C Robson
- School of Pharmacy and Biomedical Sciences, University of PortsmouthPortsmouthUnited Kingdom
- Centre for Enzyme Innovation, University of PortsmouthPortsmouthUnited Kingdom
| | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside UniversityMiddlesbroughUnited Kingdom
| | | | - Dariusz C Gorecki
- School of Pharmacy and Biomedical Sciences, University of PortsmouthPortsmouthUnited Kingdom
| |
Collapse
|
11
|
Vijayakumar S, Angione C. Protocol for hybrid flux balance, statistical, and machine learning analysis of multi-omic data from the cyanobacterium Synechococcus sp. PCC 7002. STAR Protoc 2021; 2:100837. [PMID: 34632416 PMCID: PMC8488602 DOI: 10.1016/j.xpro.2021.100837] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Combining a computational framework for flux balance analysis with machine learning improves the accuracy of predicting metabolic activity across conditions, while enabling mechanistic interpretation. This protocol presents a guide to condition-specific metabolic modeling that integrates regularized flux balance analysis with machine learning approaches to extract key features from transcriptomic and fluxomic data. We demonstrate the protocol as applied to Synechococcus sp. PCC 7002; we also outline how it can be adapted to any species or community with available multi-omic data. For complete details on the use and execution of this protocol, please refer to Vijayakumar et al. (2020).
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
| | - Claudio Angione
- School of Computing, Engineering & Digital Technologies, Teesside University, Middlesbrough, North Yorkshire TS1 3BX, UK
- Centre for Digital Innovation, Teesside University, Middlesbrough TS1 3BX, UK
- Healthcare Innovation Centre, Teesside University, Middlesbrough TS1 3BX, UK
| |
Collapse
|
12
|
Occhipinti A, Hamadi Y, Kugler H, Wintersteiger CM, Yordanov B, Angione C. Discovering Essential Multiple Gene Effects Through Large Scale Optimization: An Application to Human Cancer Metabolism. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:2339-2352. [PMID: 32248120 DOI: 10.1109/tcbb.2020.2973386] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/11/2023]
Abstract
Computational modelling of metabolic processes has proven to be a useful approach to formulate our knowledge and improve our understanding of core biochemical systems that are crucial to maintaining cellular functions. Towards understanding the broader role of metabolism on cellular decision-making in health and disease conditions, it is important to integrate the study of metabolism with other core regulatory systems and omics within the cell, including gene expression patterns. After quantitatively integrating gene expression profiles with a genome-scale reconstruction of human metabolism, we propose a set of combinatorial methods to reverse engineer gene expression profiles and to find pairs and higher-order combinations of genetic modifications that simultaneously optimize multi-objective cellular goals. This enables us to suggest classes of transcriptomic profiles that are most suitable to achieve given metabolic phenotypes. We demonstrate how our techniques are able to compute beneficial, neutral or "toxic" combinations of gene expression levels. We test our methods on nine tissue-specific cancer models, comparing our outcomes with the corresponding normal cells, identifying genes as targets for potential therapies. Our methods open the way to a broad class of applications that require an understanding of the interplay among genotype, metabolism, and cellular behaviour, at scale.
Collapse
|
13
|
Yaneske E, Zampieri G, Bertoldi L, Benvenuto G, Angione C. Genome-scale metabolic modelling of SARS-CoV-2 in cancer cells reveals an increased shift to glycolytic energy production. FEBS Lett 2021; 595:2350-2365. [PMID: 34409594 PMCID: PMC8427129 DOI: 10.1002/1873-3468.14180] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 08/02/2021] [Accepted: 08/15/2021] [Indexed: 01/08/2023]
Abstract
Cancer is considered a high‐risk condition for severe illness resulting from COVID‐19. The interaction between severe acute respiratory syndrome coronavirus‐2 (SARS‐CoV‐2) and human metabolism is key to elucidating the risk posed by COVID‐19 for cancer patients and identifying effective treatments, yet it is largely uncharacterised on a mechanistic level. We present a genome‐scale map of short‐term metabolic alterations triggered by SARS‐CoV‐2 infection of cancer cells. Through transcriptomic‐ and proteomic‐informed genome‐scale metabolic modelling, we characterise the role of RNA and fatty acid biosynthesis in conjunction with a rewiring in energy production pathways and enhanced cytokine secretion. These findings link together complementary aspects of viral invasion of cancer cells, while providing mechanistic insights that can inform the development of treatment strategies.
Collapse
Affiliation(s)
- Elisabeth Yaneske
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK
| | - Guido Zampieri
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.,Department of Biology, University of Padua, Italy
| | | | | | - Claudio Angione
- School of Computing, Engineering and Digital Technologies, Teesside University, Middlesbrough, UK.,Healthcare Innovation Centre, Teesside University, Middlesbrough, UK.,Centre for Digital Innovation, Teesside University, Middlesbrough, UK
| |
Collapse
|
14
|
Chen Y, Wang Y, Yang J, Zhou W, Dai S. Exploring the diversity of plant proteome. JOURNAL OF INTEGRATIVE PLANT BIOLOGY 2021; 63:1197-1210. [PMID: 33650765 DOI: 10.1111/jipb.13087] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/04/2021] [Accepted: 02/25/2021] [Indexed: 05/10/2023]
Abstract
The tremendous functional, spatial, and temporal diversity of the plant proteome is regulated by multiple factors that continuously modify protein abundance, modifications, interactions, localization, and activity to meet the dynamic needs of plants. Dissecting the proteome complexity and its underlying genetic variation is attracting increasing research attention. Mass spectrometry (MS)-based proteomics has become a powerful approach in the global study of protein functions and their relationships on a systems level. Here, we review recent breakthroughs and strategies adopted to unravel the diversity of the proteome, with a specific focus on the methods used to analyze posttranslational modifications (PTMs), protein localization, and the organization of proteins into functional modules. We also consider PTM crosstalk and multiple PTMs temporally regulating the life cycle of proteins. Finally, we discuss recent quantitative studies using MS to measure protein turnover rates and examine future directions in the study of the plant proteome.
Collapse
Affiliation(s)
- Yanmei Chen
- State Key Laboratory of Plant Physiology and Biochemistry, College of Biological Sciences, China Agricultural University, Beijing, 100193, China
| | - Yi Wang
- State Key Laboratory of Wheat and Maize Crop Science, College of Resources and Environment, Henan Agricultural University, Zhengzhou, 450002, China
| | - Jun Yang
- Shanghai Key Laboratory of Plant Functional Genomics and Resources, Shanghai Chenshan Plant Science Research Center, Chinese Academy of Sciences, Shanghai Chenshan Botanical Garden, Shanghai, 201602, China
| | - Wenbin Zhou
- Institute of Crop Sciences, Chinese Academy of Agricultural Sciences, Beijing, 100081, China
| | - Shaojun Dai
- Development Center of Plant Germplasm Resources, College of Life Sciences, Shanghai Normal University, Shanghai, 200234, China
| |
Collapse
|
15
|
LaBella AL, Opulente DA, Steenwyk JL, Hittinger CT, Rokas A. Signatures of optimal codon usage in metabolic genes inform budding yeast ecology. PLoS Biol 2021; 19:e3001185. [PMID: 33872297 PMCID: PMC8084343 DOI: 10.1371/journal.pbio.3001185] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2020] [Revised: 04/29/2021] [Accepted: 03/15/2021] [Indexed: 02/06/2023] Open
Abstract
Reverse ecology is the inference of ecological information from patterns of genomic variation. One rich, heretofore underutilized, source of ecologically relevant genomic information is codon optimality or adaptation. Bias toward codons that match the tRNA pool is robustly associated with high gene expression in diverse organisms, suggesting that codon optimization could be used in a reverse ecology framework to identify highly expressed, ecologically relevant genes. To test this hypothesis, we examined the relationship between optimal codon usage in the classic galactose metabolism (GAL) pathway and known ecological niches for 329 species of budding yeasts, a diverse subphylum of fungi. We find that optimal codon usage in the GAL pathway is positively correlated with quantitative growth on galactose, suggesting that GAL codon optimization reflects increased capacity to grow on galactose. Optimal codon usage in the GAL pathway is also positively correlated with human-associated ecological niches in yeasts of the CUG-Ser1 clade and with dairy-associated ecological niches in the family Saccharomycetaceae. For example, optimal codon usage of GAL genes is greater than 85% of all genes in the genome of the major human pathogen Candida albicans (CUG-Ser1 clade) and greater than 75% of genes in the genome of the dairy yeast Kluyveromyces lactis (family Saccharomycetaceae). We further find a correlation between optimization in the GALactose pathway genes and several genes associated with nutrient sensing and metabolism. This work suggests that codon optimization harbors information about the metabolic ecology of microbial eukaryotes. This information may be particularly useful for studying fungal dark matter-species that have yet to be cultured in the lab or have only been identified by genomic material.
Collapse
Affiliation(s)
- Abigail Leavitt LaBella
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Dana A. Opulente
- Department of Biology, Villanova University, Villanova, Pennsylvania, United States of America
| | - Jacob L. Steenwyk
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Chris Todd Hittinger
- Laboratory of Genetics, DOE Great Lakes Bioenergy Research Center, Wisconsin Energy Institute, Center for Genomic Science Innovation, J.F. Crow Institute for the Study of Evolution, University of Wisconsin-Madison, Madison, Wisconsin, United States of America
| | - Antonis Rokas
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
16
|
Antonakoudis A, Barbosa R, Kotidis P, Kontoravdi C. The era of big data: Genome-scale modelling meets machine learning. Comput Struct Biotechnol J 2020; 18:3287-3300. [PMID: 33240470 PMCID: PMC7663219 DOI: 10.1016/j.csbj.2020.10.011] [Citation(s) in RCA: 47] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Revised: 10/07/2020] [Accepted: 10/08/2020] [Indexed: 12/15/2022] Open
Abstract
With omics data being generated at an unprecedented rate, genome-scale modelling has become pivotal in its organisation and analysis. However, machine learning methods have been gaining ground in cases where knowledge is insufficient to represent the mechanisms underlying such data or as a means for data curation prior to attempting mechanistic modelling. We discuss the latest advances in genome-scale modelling and the development of optimisation algorithms for network and error reduction, intracellular constraining and applications to strain design. We further review applications of supervised and unsupervised machine learning methods to omics datasets from microbial and mammalian cell systems and present efforts to harness the potential of both modelling approaches through hybrid modelling.
Collapse
Affiliation(s)
| | | | | | - Cleo Kontoravdi
- Department of Chemical Engineering, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
17
|
A review of methods for the reconstruction and analysis of integrated genome-scale models of metabolism and regulation. Biochem Soc Trans 2020; 48:1889-1903. [DOI: 10.1042/bst20190840] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Revised: 07/16/2020] [Accepted: 08/21/2020] [Indexed: 02/07/2023]
Abstract
The current survey aims to describe the main methodologies for extending the reconstruction and analysis of genome-scale metabolic models and phenotype simulation with Flux Balance Analysis mathematical frameworks, via the integration of Transcriptional Regulatory Networks and/or gene expression data. Although the surveyed methods are aimed at improving phenotype simulations obtained from these models, the perspective of reconstructing integrated genome-scale models of metabolism and gene expression for diverse prokaryotes is still an open challenge.
Collapse
|
18
|
Basile A, Campanaro S, Kovalovszki A, Zampieri G, Rossi A, Angelidaki I, Valle G, Treu L. Revealing metabolic mechanisms of interaction in the anaerobic digestion microbiome by flux balance analysis. Metab Eng 2020; 62:138-149. [PMID: 32905861 DOI: 10.1016/j.ymben.2020.08.013] [Citation(s) in RCA: 39] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2020] [Revised: 08/03/2020] [Accepted: 08/24/2020] [Indexed: 10/23/2022]
Abstract
Anaerobic digestion is a key biological process for renewable energy, yet the mechanistic knowledge on its hidden microbial dynamics is still limited. The present work charted the interaction network in the anaerobic digestion microbiome via the full characterization of pairwise interactions and the associated metabolite exchanges. To this goal, a novel collection of 836 genome-scale metabolic models was built to represent the functional capabilities of bacteria and archaea species derived from genome-centric metagenomics. Dominant microbes were shown to prefer mutualistic, parasitic and commensalistic interactions over neutralism, amensalism and competition, and are more likely to behave as metabolite importers and profiteers of the coexistence. Additionally, external hydrogen injection positively influences microbiome dynamics by promoting commensalism over amensalism. Finally, exchanges of glucogenic amino acids were shown to overcome auxotrophies caused by an incomplete tricarboxylic acid cycle. Our novel strategy predicted the most favourable growth conditions for the microbes, overall suggesting strategies to increasing the biogas production efficiency. In principle, this approach could also be applied to microbial populations of biomedical importance, such as the gut microbiome, to allow a broad inspection of the microbial interplays.
Collapse
Affiliation(s)
- Arianna Basile
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy
| | - Stefano Campanaro
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy; CRIBI Biotechnology Center, University of Padova, 35131, Padua, Italy.
| | - Adam Kovalovszki
- Department of Environmental Engineering, Technical University of Denmark, 2800, Kgs. Lyngby, Denmark
| | - Guido Zampieri
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy; Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Alessandro Rossi
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy
| | - Irini Angelidaki
- Department of Environmental Engineering, Technical University of Denmark, 2800, Kgs. Lyngby, Denmark
| | - Giorgio Valle
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy
| | - Laura Treu
- Department of Biology, University of Padova, Via U. Bassi 58/b, 35121, Padua, Italy
| |
Collapse
|
19
|
Frioux C, Singh D, Korcsmaros T, Hildebrand F. From bag-of-genes to bag-of-genomes: metabolic modelling of communities in the era of metagenome-assembled genomes. Comput Struct Biotechnol J 2020; 18:1722-1734. [PMID: 32670511 PMCID: PMC7347713 DOI: 10.1016/j.csbj.2020.06.028] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2020] [Revised: 06/16/2020] [Accepted: 06/17/2020] [Indexed: 12/12/2022] Open
Abstract
Metagenomic sequencing of complete microbial communities has greatly enhanced our understanding of the taxonomic composition of microbiotas. This has led to breakthrough developments in bioinformatic disciplines such as assembly, gene clustering, metagenomic binning of species genomes and the discovery of an incredible, so far undiscovered, taxonomic diversity. However, functional annotations and estimating metabolic processes from single species - or communities - is still challenging. Earlier approaches relied mostly on inferring the presence of key enzymes for metabolic pathways in the whole metagenome, ignoring the genomic context of such enzymes, resulting in the 'bag-of-genes' approach to estimate functional capacities of microbiotas. Here, we review recent developments in metagenomic bioinformatics, with a special focus on emerging technologies to simulate and estimate metabolic information, that can be derived from metagenomic assembled genomes. Genome-scale metabolic models can be used to model the emergent properties of microbial consortia and whole communities, and the progress in this area is reviewed. While this subfield of metagenomics is still in its infancy, it is becoming evident that there is a dire need for further bioinformatic tools to address the complex combinatorial problems in modelling the metabolism of large communities as a 'bag-of-genomes'.
Collapse
Affiliation(s)
- Clémence Frioux
- Inria, CNRS, INRAE Bordeaux, France
- Gut Microbes and Health, Quadram Institute Bioscience, Norwich, Norfolk, UK
| | - Dipali Singh
- Microbes in the Food Chain, Quadram Institute Bioscience, Norwich, Norfolk, UK
| | - Tamas Korcsmaros
- Gut Microbes and Health, Quadram Institute Bioscience, Norwich, Norfolk, UK
- Digital Biology, Earlham Institute, Norwich, Norfolk, UK
| | - Falk Hildebrand
- Gut Microbes and Health, Quadram Institute Bioscience, Norwich, Norfolk, UK
- Digital Biology, Earlham Institute, Norwich, Norfolk, UK
| |
Collapse
|
20
|
Vijayakumar S, Conway M, Lió P, Angione C. Seeing the wood for the trees: a forest of methods for optimization and omic-network integration in metabolic modelling. Brief Bioinform 2019; 19:1218-1235. [PMID: 28575143 DOI: 10.1093/bib/bbx053] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2017] [Indexed: 11/13/2022] Open
Abstract
Metabolic modelling has entered a mature phase with dozens of methods and software implementations available to the practitioner and the theoretician. It is not easy for a modeller to be able to see the wood (or the forest) for the trees. Driven by this analogy, we here present a 'forest' of principal methods used for constraint-based modelling in systems biology. This provides a tree-based view of methods available to prospective modellers, also available in interactive version at http://modellingmetabolism.net, where it will be kept updated with new methods after the publication of the present manuscript. Our updated classification of existing methods and tools highlights the most promising in the different branches, with the aim to develop a vision of how existing methods could hybridize and become more complex. We then provide the first hands-on tutorial for multi-objective optimization of metabolic models in R. We finally discuss the implementation of multi-view machine learning approaches in poly-omic integration. Throughout this work, we demonstrate the optimization of trade-offs between multiple metabolic objectives, with a focus on omic data integration through machine learning. We anticipate that the combination of a survey, a perspective on multi-view machine learning and a step-by-step R tutorial should be of interest for both the beginner and the advanced user.
Collapse
Affiliation(s)
| | - Max Conway
- Computer Laboratory, University of Cambridge, UK
| | - Pietro Lió
- Computer Laboratory, University of Cambridge, UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, UK
| |
Collapse
|
21
|
Zampieri G, Vijayakumar S, Yaneske E, Angione C. Machine and deep learning meet genome-scale metabolic modeling. PLoS Comput Biol 2019; 15:e1007084. [PMID: 31295267 PMCID: PMC6622478 DOI: 10.1371/journal.pcbi.1007084] [Citation(s) in RCA: 174] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022] Open
Abstract
Omic data analysis is steadily growing as a driver of basic and applied molecular biology research. Core to the interpretation of complex and heterogeneous biological phenotypes are computational approaches in the fields of statistics and machine learning. In parallel, constraint-based metabolic modeling has established itself as the main tool to investigate large-scale relationships between genotype, phenotype, and environment. The development and application of these methodological frameworks have occurred independently for the most part, whereas the potential of their integration for biological, biomedical, and biotechnological research is less known. Here, we describe how machine learning and constraint-based modeling can be combined, reviewing recent works at the intersection of both domains and discussing the mathematical and practical aspects involved. We overlap systematic classifications from both frameworks, making them accessible to nonexperts. Finally, we delineate potential future scenarios, propose new joint theoretical frameworks, and suggest concrete points of investigation for this joint subfield. A multiview approach merging experimental and knowledge-driven omic data through machine learning methods can incorporate key mechanistic information in an otherwise biologically-agnostic learning process.
Collapse
Affiliation(s)
- Guido Zampieri
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Supreeta Vijayakumar
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Elisabeth Yaneske
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
- Healthcare Innovation Centre, Teesside University, Middlesbrough, United Kingdom
| |
Collapse
|
22
|
Human Systems Biology and Metabolic Modelling: A Review-From Disease Metabolism to Precision Medicine. BIOMED RESEARCH INTERNATIONAL 2019; 2019:8304260. [PMID: 31281846 PMCID: PMC6590590 DOI: 10.1155/2019/8304260] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 02/07/2019] [Accepted: 05/20/2019] [Indexed: 01/06/2023]
Abstract
In cell and molecular biology, metabolism is the only system that can be fully simulated at genome scale. Metabolic systems biology offers powerful abstraction tools to simulate all known metabolic reactions in a cell, therefore providing a snapshot that is close to its observable phenotype. In this review, we cover the 15 years of human metabolic modelling. We show that, although the past five years have not experienced large improvements in the size of the gene and metabolite sets in human metabolic models, their accuracy is rapidly increasing. We also describe how condition-, tissue-, and patient-specific metabolic models shed light on cell-specific changes occurring in the metabolic network, therefore predicting biomarkers of disease metabolism. We finally discuss current challenges and future promising directions for this research field, including machine/deep learning and precision medicine. In the omics era, profiling patients and biological processes from a multiomic point of view is becoming more common and less expensive. Starting from multiomic data collected from patients and N-of-1 trials where individual patients constitute different case studies, methods for model-building and data integration are being used to generate patient-specific models. Coupled with state-of-the-art machine learning methods, this will allow characterizing each patient's disease phenotype and delivering precision medicine solutions, therefore leading to preventative medicine, reduced treatment, and in silico clinical trials.
Collapse
|
23
|
Zoledowska S, Presta L, Fondi M, Decorosi F, Giovannetti L, Mengoni A, Lojkowska E. Metabolic Modeling of Pectobacterium parmentieri SCC3193 Provides Insights into Metabolic Pathways of Plant Pathogenic Bacteria. Microorganisms 2019; 7:E101. [PMID: 30959803 PMCID: PMC6518042 DOI: 10.3390/microorganisms7040101] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2019] [Revised: 03/26/2019] [Accepted: 04/02/2019] [Indexed: 12/17/2022] Open
Abstract
Understanding plant⁻microbe interactions is crucial for improving plants' productivity and protection. Constraint-based metabolic modeling is one of the possible ways to investigate the bacterial adaptation to different ecological niches and may give insights into the metabolic versatility of plant pathogenic bacteria. We reconstructed a raw metabolic model of the emerging plant pathogenic bacterium Pectobacterium parmentieri SCC3193 with the use of KBase. The model was curated by using inParanoind and phenotypic data generated with the use of the OmniLog system. Metabolic modeling was performed through COBRApy Toolbox v. 0.10.1. The curated metabolic model of P. parmentieri SCC3193 is highly reliable, as in silico obtained results overlapped up to 91% with experimental data on carbon utilization phenotypes. By mean of flux balance analysis (FBA), we predicted the metabolic adaptation of P. parmentieri SCC3193 to two different ecological niches, relevant for the persistence and plant colonization by this bacterium: soil and the rhizosphere. We performed in silico gene deletions to predict the set of essential core genes for this bacterium to grow in such environments. We anticipate that our metabolic model will be a valuable element for defining a set of metabolic targets to control infection and spreading of this plant pathogen.
Collapse
Affiliation(s)
- Sabina Zoledowska
- Department of Biotechnology, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, 58 Abrahama Street, 80-307 Gdansk, Poland.
| | - Luana Presta
- Department of Biology, University of Florence, via Madonna del Piano 6, Sesto Fiorentino, 50019 Florence, Italy.
| | - Marco Fondi
- Department of Biology, University of Florence, via Madonna del Piano 6, Sesto Fiorentino, 50019 Florence, Italy.
| | - Francesca Decorosi
- Department of Agri-food Production and Environmental Sciences, University of Florence, Piazzale delle Cascine 18, 50144 Florence, Italy.
| | - Luciana Giovannetti
- Department of Agri-food Production and Environmental Sciences, University of Florence, Piazzale delle Cascine 18, 50144 Florence, Italy.
| | - Alessio Mengoni
- Department of Biology, University of Florence, via Madonna del Piano 6, Sesto Fiorentino, 50019 Florence, Italy.
| | - Ewa Lojkowska
- Department of Biotechnology, Intercollegiate Faculty of Biotechnology, University of Gdansk and Medical University of Gdansk, 58 Abrahama Street, 80-307 Gdansk, Poland.
| |
Collapse
|
24
|
Di Stefano A, Scatà M, Vijayakumar S, Angione C, La Corte A, Liò P. Social dynamics modeling of chrono-nutrition. PLoS Comput Biol 2019; 15:e1006714. [PMID: 30699206 PMCID: PMC6370249 DOI: 10.1371/journal.pcbi.1006714] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2017] [Revised: 02/11/2019] [Accepted: 12/14/2018] [Indexed: 12/13/2022] Open
Abstract
Gut microbiota and human relationships are strictly connected to each other. What we eat reflects our body-mind connection and synchronizes with people around us. However, how this impacts on gut microbiota and, conversely, how gut bacteria influence our dietary behaviors has not been explored yet. To quantify the complex dynamics of this interplay between gut and human behaviors we explore the "gut-human behavior axis" and its evolutionary dynamics in a real-world scenario represented by the social multiplex network. We consider a dual type of similarity, homophily and gut similarity, other than psychological and unconscious biases. We analyze the dynamics of social and gut microbial communities, quantifying the impact of human behaviors on diets and gut microbial composition and, backwards, through a control mechanism. Meal timing mechanisms and "chrono-nutrition" play a crucial role in feeding behaviors, along with the quality and quantity of food intake. Considering a population of shift workers, we explore the dynamic interplay between their eating behaviors and gut microbiota, modeling the social dynamics of chrono-nutrition in a multiplex network. Our findings allow us to quantify the relation between human behaviors and gut microbiota through the methodological introduction of gut metabolic modeling and statistical estimators, able to capture their dynamic interplay. Moreover, we find that the timing of gut microbial communities is slower than social interactions and shift-working, and the impact of shift-working on the dynamics of chrono-nutrition is a fluctuation of strategies with a major propensity for defection (e.g. high-fat meals). A deeper understanding of the relation between gut microbiota and the dietary behavioral patterns, by embedding also the related social aspects, allows improving the overall knowledge about metabolic models and their implications for human health, opening the possibility to design promising social therapeutic dietary interventions.
Collapse
Affiliation(s)
- Alessandro Di Stefano
- Dipartimento di Ingegneria Elettrica, Elettronica e Informatica (DIEEI), CNIT (National Inter-University Consortium for Telecommunications) Catania, Italy
| | - Marialisa Scatà
- Dipartimento di Ingegneria Elettrica, Elettronica e Informatica (DIEEI), CNIT (National Inter-University Consortium for Telecommunications) Catania, Italy
| | - Supreeta Vijayakumar
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, United Kingdom
| | - Aurelio La Corte
- Dipartimento di Ingegneria Elettrica, Elettronica e Informatica (DIEEI), CNIT (National Inter-University Consortium for Telecommunications) Catania, Italy
| | - Pietro Liò
- Computer Laboratory, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
25
|
Mancini A, Eyassu F, Conway M, Occhipinti A, Liò P, Angione C, Pucciarelli S. CiliateGEM: an open-project and a tool for predictions of ciliate metabolic variations and experimental condition design. BMC Bioinformatics 2018; 19:442. [PMID: 30497359 PMCID: PMC6266953 DOI: 10.1186/s12859-018-2422-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The study of cell metabolism is becoming central in several fields such as biotechnology, evolution/adaptation and human disease investigations. Here we present CiliateGEM, the first metabolic network reconstruction draft of the freshwater ciliate Tetrahymena thermophila. We also provide the tools and resources to simulate different growth conditions and to predict metabolic variations. CiliateGEM can be extended to other ciliates in order to set up a meta-model, i.e. a metabolic network reconstruction valid for all ciliates. Ciliates are complex unicellular eukaryotes of presumably monophyletic origin, with a phylogenetic position that is equal from plants and animals. These cells represent a new concept of unicellular system with a high degree of species, population biodiversity and cell complexity. Ciliates perform in a single cell all the functions of a pluricellular organism, including locomotion, feeding, digestion, and sexual processes. RESULTS After generating the model, we performed an in-silico simulation with the presence and absence of glucose. The lack of this nutrient caused a 32.1% reduction rate in biomass synthesis. Despite the glucose starvation, the growth did not stop due to the use of alternative carbon sources such as amino acids. CONCLUSIONS The future models obtained from CiliateGEM may represent a new approach to describe the metabolism of ciliates. This tool will be a useful resource for the ciliate research community in order to extend these species as model organisms in different research fields. An improved understanding of ciliate metabolism could be relevant to elucidate the basis of biological phenomena like genotype-phenotype relationships, population genetics, and cilia-related disease mechanisms.
Collapse
Affiliation(s)
- Alessio Mancini
- School of Biosciences and Veterinary Medicine, University of Camerino, Camerino, Italy
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | - Filmon Eyassu
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| | - Maxwell Conway
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | | | - Pietro Liò
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| | - Sandra Pucciarelli
- School of Biosciences and Veterinary Medicine, University of Camerino, Camerino, Italy
| |
Collapse
|
26
|
Abstract
BACKGROUND Ageing can be classified in two different ways, chronological ageing and biological ageing. While chronological age is a measure of the time that has passed since birth, biological (also known as transcriptomic) ageing is defined by how time and the environment affect an individual in comparison to other individuals of the same chronological age. Recent research studies have shown that transcriptomic age is associated with certain genes, and that each of those genes has an effect size. Using these effect sizes we can calculate the transcriptomic age of an individual from their age-associated gene expression levels. The limitation of this approach is that it does not consider how these changes in gene expression affect the metabolism of individuals and hence their observable cellular phenotype. RESULTS We propose a method based on poly-omic constraint-based models and machine learning in order to further the understanding of transcriptomic ageing. We use normalised CD4 T-cell gene expression data from peripheral blood mononuclear cells in 499 healthy individuals to create individual metabolic models. These models are then combined with a transcriptomic age predictor and chronological age to provide new insights into the differences between transcriptomic and chronological ageing. As a result, we propose a novel metabolic age predictor. CONCLUSIONS We show that our poly-omic predictors provide a more detailed analysis of transcriptomic ageing compared to gene-based approaches, and represent a basis for furthering our knowledge of the ageing mechanisms in human cells.
Collapse
Affiliation(s)
- Elisabeth Yaneske
- Department of Computer Science and Information Systems, Teesside University, Borough Road, Middlesbrough, UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Borough Road, Middlesbrough, UK
| |
Collapse
|
27
|
Occhipinti A, Eyassu F, Rahman TJ, Rahman PKSM, Angione C. In silico engineering of Pseudomonas metabolism reveals new biomarkers for increased biosurfactant production. PeerJ 2018; 6:e6046. [PMID: 30588397 PMCID: PMC6301282 DOI: 10.7717/peerj.6046] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2018] [Accepted: 10/30/2018] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND Rhamnolipids, biosurfactants with a wide range of biomedical applications, are amphiphilic molecules produced on the surfaces of or excreted extracellularly by bacteria including Pseudomonas aeruginosa. However, Pseudomonas putida is a non-pathogenic model organism with greater metabolic versatility and potential for industrial applications. METHODS We investigate in silico the metabolic capabilities of P. putida for rhamnolipids biosynthesis using statistical, metabolic and synthetic engineering approaches after introducing key genes (RhlA and RhlB) from P. aeruginosa into a genome-scale model of P. putida. This pipeline combines machine learning methods with multi-omic modelling, and drives the engineered P. putida model toward an optimal production and export of rhamnolipids out of the membrane. RESULTS We identify a substantial increase in synthesis of rhamnolipids by the engineered model compared to the control model. We apply statistical and machine learning techniques on the metabolic reaction rates to identify distinct features on the structure of the variables and individual components driving the variation of growth and rhamnolipids production. We finally provide a computational framework for integrating multi-omics data and identifying latent pathways and genes for the production of rhamnolipids in P. putida. CONCLUSIONS We anticipate that our results will provide a versatile methodology for integrating multi-omics data for topological and functional analysis of P. putida toward maximization of biosurfactant production.
Collapse
Affiliation(s)
- Annalisa Occhipinti
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| | - Filmon Eyassu
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| | - Thahira J. Rahman
- Technology Futures Institute, School of Science, Engineering and Design, Teesside University, Middlesbrough, UK
| | - Pattanathu K. S. M. Rahman
- Technology Futures Institute, School of Science, Engineering and Design, Teesside University, Middlesbrough, UK
- Institute of Biological and Biomedical Sciences, School of Biological Sciences, University of Portsmouth, Portsmouth, UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| |
Collapse
|
28
|
Vijayakumar S, Conway M, Lió P, Angione C. Optimization of Multi-Omic Genome-Scale Models: Methodologies, Hands-on Tutorial, and Perspectives. Methods Mol Biol 2018; 1716:389-408. [PMID: 29222764 DOI: 10.1007/978-1-4939-7528-0_18] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]
Abstract
Genome-scale metabolic models are valuable tools for assessing the metabolic potential of living organisms. Being downstream of gene expression, metabolism is increasingly being used as an indicator of the phenotypic outcome for drugs and therapies. We here present a review of the principal methods used for constraint-based modelling in systems biology, and explore how the integration of multi-omic data can be used to improve phenotypic predictions of genome-scale metabolic models. We believe that the large-scale comparison of the metabolic response of an organism to different environmental conditions will be an important challenge for genome-scale models. Therefore, within the context of multi-omic methods, we describe a tutorial for multi-objective optimization using the metabolic and transcriptomics adaptation estimator (METRADE), implemented in MATLAB. METRADE uses microarray and codon usage data to model bacterial metabolic response to environmental conditions (e.g., antibiotics, temperatures, heat shock). Finally, we discuss key considerations for the integration of multi-omic networks into metabolic models, towards automatically extracting knowledge from such models.
Collapse
Affiliation(s)
- Supreeta Vijayakumar
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, Tees Valley TS1 3BX, UK
| | - Max Conway
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | - Pietro Lió
- Computer Laboratory, University of Cambridge, Cambridge, UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, Tees Valley TS1 3BX, UK.
| |
Collapse
|
29
|
Eyassu F, Angione C. Modelling pyruvate dehydrogenase under hypoxia and its role in cancer metabolism. ROYAL SOCIETY OPEN SCIENCE 2017; 4:170360. [PMID: 29134060 PMCID: PMC5666243 DOI: 10.1098/rsos.170360] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/20/2017] [Accepted: 09/25/2017] [Indexed: 05/18/2023]
Abstract
Metabolism is the only biological system that can be fully modelled at genome scale. As a result, metabolic models have been increasingly used to study the molecular mechanisms of various diseases. Hypoxia, a low-oxygen tension, is a well-known characteristic of many cancer cells. Pyruvate dehydrogenase (PDH) controls the flux of metabolites between glycolysis and the tricarboxylic acid cycle and is a key enzyme in metabolic reprogramming in cancer metabolism. Here, we develop and manually curate a constraint-based metabolic model to investigate the mechanism of pyruvate dehydrogenase under hypoxia. Our results characterize the activity of pyruvate dehydrogenase and its decline during hypoxia. This results in lactate accumulation, consistent with recent hypoxia studies and a well-known feature in cancer metabolism. We apply machine-learning techniques on the flux datasets to identify reactions that drive these variations. We also identify distinct features on the structure of the variables and individual metabolic components in the switch from normoxia to hypoxia. Our results provide a framework for future studies by integrating multi-omics data to predict condition-specific metabolic phenotypes under hypoxia.
Collapse
|
30
|
Angione C. Integrating splice-isoform expression into genome-scale models characterizes breast cancer metabolism. Bioinformatics 2017; 34:494-501. [DOI: 10.1093/bioinformatics/btx562] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2017] [Accepted: 09/06/2017] [Indexed: 12/20/2022] Open
Affiliation(s)
- Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Middlesbrough, UK
| |
Collapse
|
31
|
Otero-Muras I, Banga JR. Automated Design Framework for Synthetic Biology Exploiting Pareto Optimality. ACS Synth Biol 2017; 6:1180-1193. [PMID: 28350462 DOI: 10.1021/acssynbio.6b00306] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
In this work we consider Pareto optimality for automated design in synthetic biology. We present a generalized framework based on a mixed-integer dynamic optimization formulation that, given design specifications, allows the computation of Pareto optimal sets of designs, that is, the set of best trade-offs for the metrics of interest. We show how this framework can be used for (i) forward design, that is, finding the Pareto optimal set of synthetic designs for implementation, and (ii) reverse design, that is, analyzing and inferring motifs and/or design principles of gene regulatory networks from the Pareto set of optimal circuits. Finally, we illustrate the capabilities and performance of this framework considering four case studies. In the first problem we consider the forward design of an oscillator. In the remaining problems, we illustrate how to apply the reverse design approach to find motifs for stripe formation, rapid adaption, and fold-change detection, respectively.
Collapse
Affiliation(s)
- Irene Otero-Muras
- BioProcess Engineering Group, IIM-CSIC,
Spanish National Research Council, Vigo, 36208, Spain
| | - Julio R. Banga
- BioProcess Engineering Group, IIM-CSIC,
Spanish National Research Council, Vigo, 36208, Spain
| |
Collapse
|
32
|
Fondi M, Pinatel E, Talà A, Damiano F, Consolandi C, Mattorre B, Fico D, Testini M, De Benedetto GE, Siculella L, De Bellis G, Alifano P, Peano C. Time-Resolved Transcriptomics and Constraint-Based Modeling Identify System-Level Metabolic Features and Overexpression Targets to Increase Spiramycin Production in Streptomyces ambofaciens. Front Microbiol 2017; 8:835. [PMID: 28553270 PMCID: PMC5427115 DOI: 10.3389/fmicb.2017.00835] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2016] [Accepted: 04/24/2017] [Indexed: 12/22/2022] Open
Abstract
In this study we have applied an integrated system biology approach to characterize the metabolic landscape of Streptomyces ambofaciens and to identify a list of potential metabolic engineering targets for the overproduction of the secondary metabolites in this microorganism. We focused on an often overlooked growth period (i.e., post-first rapid growth phase) and, by integrating constraint-based metabolic modeling with time resolved RNA-seq data, we depicted the main effects of changes in gene expression on the overall metabolic reprogramming occurring in S. ambofaciens. Moreover, through metabolic modeling, we unraveled a set of candidate overexpression gene targets hypothetically leading to spiramycin overproduction. Model predictions were experimentally validated by genetic manipulation of the recently described ethylmalonyl-CoA metabolic node, providing evidence that spiramycin productivity may be increased by enhancing the carbon flow through this pathway. The goal was achieved by over-expressing the ccr paralog srm4 in an ad hoc engineered plasmid. This work embeds the first metabolic reconstruction of S. ambofaciens and the successful experimental validation of model predictions and demonstrates the validity and the importance of in silico modeling tools for the overproduction of molecules with a biotechnological interest. Finally, the proposed metabolic reconstruction, which includes manually refined pathways for several secondary metabolites with antimicrobial activity, represents a solid platform for the future exploitation of S. ambofaciens biotechnological potential.
Collapse
Affiliation(s)
- Marco Fondi
- Department of Biology, University of FlorenceFlorence, Italy
| | - Eva Pinatel
- Institute of Biomedical Technologies, National Research CouncilSegrate, Italy
| | - Adelfia Talà
- Department of Biological and Environmental Sciences and Technologies, University of SalentoLecce, Italy
| | - Fabrizio Damiano
- Department of Biological and Environmental Sciences and Technologies, University of SalentoLecce, Italy
| | - Clarissa Consolandi
- Institute of Biomedical Technologies, National Research CouncilSegrate, Italy
| | | | - Daniela Fico
- Laboratory of Analytical and Isotopic Mass Spectrometry, Department of Cultural Heritage, University of SalentoLecce, Italy
| | - Mariangela Testini
- Department of Biological and Environmental Sciences and Technologies, University of SalentoLecce, Italy
| | - Giuseppe E De Benedetto
- Laboratory of Analytical and Isotopic Mass Spectrometry, Department of Cultural Heritage, University of SalentoLecce, Italy
| | - Luisa Siculella
- Department of Biological and Environmental Sciences and Technologies, University of SalentoLecce, Italy
| | - Gianluca De Bellis
- Institute of Biomedical Technologies, National Research CouncilSegrate, Italy
| | - Pietro Alifano
- Department of Biological and Environmental Sciences and Technologies, University of SalentoLecce, Italy
| | - Clelia Peano
- Institute of Biomedical Technologies, National Research CouncilSegrate, Italy
| |
Collapse
|
33
|
Kashaf SS, Angione C, Lió P. Making life difficult for Clostridium difficile: augmenting the pathogen's metabolic model with transcriptomic and codon usage data for better therapeutic target characterization. BMC SYSTEMS BIOLOGY 2017; 11:25. [PMID: 28209199 PMCID: PMC5314682 DOI: 10.1186/s12918-017-0395-3] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/23/2016] [Accepted: 01/13/2017] [Indexed: 11/10/2022]
Abstract
BACKGROUND Clostridium difficile is a bacterium which can infect various animal species, including humans. Infection with this bacterium is a leading healthcare-associated illness. A better understanding of this organism and the relationship between its genotype and phenotype is essential to the search for an effective treatment. Genome-scale metabolic models contain all known biochemical reactions of a microorganism and can be used to investigate this relationship. RESULTS We present icdf834, an updated metabolic network of C. difficile that builds on iMLTC806cdf and features 1227 reactions, 834 genes, and 807 metabolites. We used this metabolic network to reconstruct the metabolic landscape of this bacterium. The standard metabolic model cannot account for changes in the bacterial metabolism in response to different environmental conditions. To account for this limitation, we also integrated transcriptomic data, which details the gene expression of the bacterium in a wide array of environments. Importantly, to bridge the gap between gene expression levels and protein abundance, we accounted for the synonymous codon usage bias of the bacterium in the model. To our knowledge, this is the first time codon usage has been quantified and integrated into a metabolic model. The metabolic fluxes were defined as a function of protein abundance. To determine potential therapeutic targets using the model, we conducted gene essentiality and metabolic pathway sensitivity analyses and calculated flux control coefficients. We obtained 92.3% accuracy in predicting gene essentiality when compared to experimental data for C. difficile R20291 (ribotype 027) homologs. We validated our context-specific metabolic models using sensitivity and robustness analyses and compared model predictions with literature on C. difficile. The model predicts interesting facets of the bacterium's metabolism, such as changes in the bacterium's growth in response to different environmental conditions. CONCLUSIONS After an extensive validation process, we used icdf834 to obtain state-of-the-art predictions of therapeutic targets for C. difficile. We show how context-specific metabolic models augmented with codon usage information can be a beneficial resource for better understanding C. difficile and for identifying novel therapeutic targets. We remark that our approach can be applied to investigate and treat against other pathogens.
Collapse
Affiliation(s)
- Sara Saheb Kashaf
- Computer Laboratory, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, CB3 0FD UK
| | - Claudio Angione
- Department of Computer Science and Information Systems, Teesside University, Borough road, Middlesbrough, TS1 3BA UK
| | - Pietro Lió
- Computer Laboratory, University of Cambridge, 15 JJ Thomson Avenue, Cambridge, CB3 0FD UK
| |
Collapse
|
34
|
Bassen DM, Vilkhovoy M, Minot M, Butcher JT, Varner JD. JuPOETs: a constrained multiobjective optimization approach to estimate biochemical model ensembles in the Julia programming language. BMC SYSTEMS BIOLOGY 2017; 11:10. [PMID: 28122561 PMCID: PMC5264316 DOI: 10.1186/s12918-016-0380-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 12/16/2016] [Indexed: 11/18/2022]
Abstract
Background Ensemble modeling is a promising approach for obtaining robust predictions and coarse grained population behavior in deterministic mathematical models. Ensemble approaches address model uncertainty by using parameter or model families instead of single best-fit parameters or fixed model structures. Parameter ensembles can be selected based upon simulation error, along with other criteria such as diversity or steady-state performance. Simulations using parameter ensembles can estimate confidence intervals on model variables, and robustly constrain model predictions, despite having many poorly constrained parameters. Results In this software note, we present a multiobjective based technique to estimate parameter or models ensembles, the Pareto Optimal Ensemble Technique in the Julia programming language (JuPOETs). JuPOETs integrates simulated annealing with Pareto optimality to estimate ensembles on or near the optimal tradeoff surface between competing training objectives. We demonstrate JuPOETs on a suite of multiobjective problems, including test functions with parameter bounds and system constraints as well as for the identification of a proof-of-concept biochemical model with four conflicting training objectives. JuPOETs identified optimal or near optimal solutions approximately six-fold faster than a corresponding implementation in Octave for the suite of test functions. For the proof-of-concept biochemical model, JuPOETs produced an ensemble of parameters that gave both the mean of the training data for conflicting data sets, while simultaneously estimating parameter sets that performed well on each of the individual objective functions. Conclusions JuPOETs is a promising approach for the estimation of parameter and model ensembles using multiobjective optimization. JuPOETs can be adapted to solve many problem types, including mixed binary and continuous variable types, bilevel optimization problems and constrained problems without altering the base algorithm. JuPOETs is open source, available under an MIT license, and can be installed using the Julia package manager from the JuPOETs GitHub repository
Collapse
Affiliation(s)
- David M Bassen
- Department of Biomedical Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Michael Vilkhovoy
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Mason Minot
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Jonathan T Butcher
- Department of Biomedical Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Jeffrey D Varner
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA.
| |
Collapse
|
35
|
Otero-Muras I, Banga JR. Design Principles of Biological Oscillators through Optimization: Forward and Reverse Analysis. PLoS One 2016; 11:e0166867. [PMID: 27977695 PMCID: PMC5158198 DOI: 10.1371/journal.pone.0166867] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2016] [Accepted: 11/04/2016] [Indexed: 11/18/2022] Open
Abstract
From cyanobacteria to human, sustained oscillations coordinate important biological functions. Although much has been learned concerning the sophisticated molecular mechanisms underlying biological oscillators, design principles linking structure and functional behavior are not yet fully understood. Here we explore design principles of biological oscillators from a multiobjective optimization perspective, taking into account the trade-offs between conflicting performance goals or demands. We develop a comprehensive tool for automated design of oscillators, based on multicriteria global optimization that allows two modes: (i) the automatic design (forward problem) and (ii) the inference of design principles (reverse analysis problem). From the perspective of synthetic biology, the forward mode allows the solution of design problems that mimic some of the desirable properties appearing in natural oscillators. The reverse analysis mode facilitates a systematic exploration of the design space based on Pareto optimality concepts. The method is illustrated with two case studies: the automatic design of synthetic oscillators from a library of biological parts, and the exploration of design principles in 3-gene oscillatory systems.
Collapse
Affiliation(s)
- Irene Otero-Muras
- BioProcess Engineering Group, IIM-CSIC, Spanish National Research Council, Vigo, Spain
- * E-mail:
| | - Julio R. Banga
- BioProcess Engineering Group, IIM-CSIC, Spanish National Research Council, Vigo, Spain
| |
Collapse
|
36
|
Angione C, Conway M, Lió P. Multiplex methods provide effective integration of multi-omic data in genome-scale models. BMC Bioinformatics 2016; 17 Suppl 4:83. [PMID: 26961692 PMCID: PMC4896256 DOI: 10.1186/s12859-016-0912-1] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Genomic, transcriptomic, and metabolic variations shape the complex adaptation landscape of bacteria to varying environmental conditions. Elucidating the genotype-phenotype relation paves the way for the prediction of such effects, but methods for characterizing the relationship between multiple environmental factors are still lacking. Here, we tackle the problem of extracting network-level information from collections of environmental conditions, by integrating the multiple omic levels at which the bacterial response is measured. RESULTS To this end, we model a large compendium of growth conditions as a multiplex network consisting of transcriptomic and fluxomic layers, and we propose a multi-omic network approach to infer similarity of growth conditions by integrating layers of the multiplex network. Each node of the network represents a single condition, while edges are similarities between conditions, as measured by phenotypic and transcriptomic properties on different layers of the network. We then fuse these layers into one network, therefore capturing a global network of conditions and the associated similarities across two omic levels. We apply this multi-omic fusion to an updated genome-scale reconstruction of Escherichia coli that includes underground metabolism and new gene-protein-reaction associations. CONCLUSIONS Our method can be readily used to evaluate and cross-compare different collections of conditions among different species. Acquiring multi-omic information on the topology of the space of experimental conditions makes it possible to infer the position and to build condition-specific models of untested or incomplete profiles for which experimental data is not available. Our weighted network fusion method for genome-scale models is freely available at https://github.com/maxconway/SNFtool .
Collapse
Affiliation(s)
- Claudio Angione
- School of Computing - Teesside University, Middlesbrough, UK.
| | - Max Conway
- Computer Laboratory - University of Cambridge, Cambridge, UK.
| | - Pietro Lió
- Computer Laboratory - University of Cambridge, Cambridge, UK.
| |
Collapse
|