1
|
Yang B, Wu C, Teng Y, Chou KJ, Guarnieri MT, Xiong W. Tailoring microbial fitness through computational steering and CRISPRi-driven robustness regulation. Cell Syst 2024; 15:1133-1147.e4. [PMID: 39667940 DOI: 10.1016/j.cels.2024.11.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 08/25/2024] [Accepted: 11/15/2024] [Indexed: 12/14/2024]
Abstract
The widespread application of genetically modified microorganisms (GMMs) across diverse sectors underscores the pressing need for robust strategies to mitigate the risks associated with their potential uncontrolled escape. This study merges computational modeling with CRISPR interference (CRISPRi) to refine GMM metabolic robustness. Utilizing ensemble modeling, we achieved high-throughput in silico screening for enzymatic targets susceptible to expression alterations. Translating these insights, we developed functional CRISPRi, boosting fitness control via multiplexed gene knockdown. Our method, enhanced by an insulator-improved gRNA structure and an off-switch circuit controlling a compact Cas12m, resulted in rationally engineered strains with escape frequencies below National Institutes of Health standards. The effectiveness of this approach was confirmed under various conditions, showcasing its ability for secure GMM management. This research underscores the resilience of microbial metabolism, strategically modifying key nodes to halt growth without provoking significant resistance, thereby enabling more reliable and precise GMM control. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Bin Yang
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA
| | - Chao Wu
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA
| | - Yuxi Teng
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA
| | - Katherine J Chou
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA
| | - Michael T Guarnieri
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA
| | - Wei Xiong
- Biosciences Center, National Renewable Energy Laboratory, Golden, CO 80401, USA; School of Biology and Biological Engineering, South China University of Technology, Guangzhou, Guangdong 510006, China.
| |
Collapse
|
2
|
Huber N, Alcalá-Orozco EA, Rexer T, Reichl U, Klamt S. Model-based optimization of cell-free enzyme cascades exemplified for the production of GDP-fucose. Metab Eng 2023; 81:S1096-7176(23)00147-7. [PMID: 39492471 DOI: 10.1016/j.ymben.2023.10.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 10/18/2023] [Accepted: 10/22/2023] [Indexed: 11/05/2024]
Abstract
Cell-free production systems are increasingly used for the synthesis of industrially relevant chemicals and biopharmaceuticals. Cell-free systems often utilize cell lysates, but biocatalytic cascades based on recombinant enzymes have emerged as a promising alternative strategy. However, implementing efficient enzyme cascades is a non-trivial task and mathematical modeling and optimization has become a key tool to improve their performance. In this work, we introduce a generic framework for the model-based optimization of cell-free enzyme cascades based on a given kinetic model of the system. We first formulate and systematize seven optimization problems relevant in the context of cell-free production processes including, for example, the maximization of productivity or product yield and the minimization of overall costs. We then present an approach that accounts for parameter uncertainties, not only during model calibration and model analysis but also when performing the actual optimization. After constructing a kinetic model of the enzyme cascade, experimental data are used to generate an ensemble of kinetic parameter sets reflecting their variabilities. For every parameter set, systems optimization is then performed and the resulting solution subsequently cross-validated for all other parameterizations to identify the solution with the highest overall performance under parameter uncertainty. We exemplify our approach for the cell-free synthesis of GDP-fucose, an important sugar nucleotide with various applications. We selected and solved three optimization problems based on a constructed dynamic model and validated two of them experimentally leading to significant improvements of the process (e.g., 50% increase of titer under identical total enzyme load). Overall, our results demonstrate the potential of model-driven optimization for the rational design and improvement of cell-free production systems. The developed approach for systems optimization under parameter uncertainty could also be relevant for the metabolic design of cell factories.
Collapse
Affiliation(s)
- Nicolas Huber
- Max Planck Institute for Dynamics of Complex Technical Systems, 39106, Magdeburg, Germany
| | | | - Thomas Rexer
- Max Planck Institute for Dynamics of Complex Technical Systems, 39106, Magdeburg, Germany; eversyn, 39106, Magdeburg, Germany
| | - Udo Reichl
- Max Planck Institute for Dynamics of Complex Technical Systems, 39106, Magdeburg, Germany
| | - Steffen Klamt
- Max Planck Institute for Dynamics of Complex Technical Systems, 39106, Magdeburg, Germany.
| |
Collapse
|
3
|
Alsiyabi A, Chowdhury NB, Long D, Saha R. Enhancing in silico strain design predictions through next generation metabolic modeling approaches. Biotechnol Adv 2021; 54:107806. [PMID: 34298108 DOI: 10.1016/j.biotechadv.2021.107806] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/22/2021] [Accepted: 07/15/2021] [Indexed: 02/06/2023]
Abstract
The reconstruction and analysis of metabolic models has garnered increasing attention due to the multitude of applications in which these have proven to be practical. The growing number of generated metabolic models has been accompanied by an exponentially expanding arsenal of tools used to analyze them. In this work, we discussed the biological relevance of a number of promising modeling frameworks, focusing on the questions and hypotheses each method is equipped to address. To this end, we critically analyzed the steady-state modeling approaches focusing on resource allocation and incorporation of thermodynamic considerations which produce promising results and aid in the generation and experimental validation of numerous predictions. For smaller networks involving more complex regulation, we addressed kinetic modeling techniques which show encouraging results in addressing questions outside the scope of steady-state modeling. Finally, we discussed the potential application of the discussed frameworks within the field of strain design. Adoption of such methodologies is believed to significantly enhance the accuracy of in silico predictions and hence decrease the number of design-build-test cycles required.
Collapse
Affiliation(s)
- Adil Alsiyabi
- Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, United States of America
| | - Niaz Bahar Chowdhury
- Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, United States of America
| | - Dianna Long
- Complex Biosystems, University of Nebraska-Lincoln, United States of America
| | - Rajib Saha
- Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, United States of America; Complex Biosystems, University of Nebraska-Lincoln, United States of America.
| |
Collapse
|
4
|
Haiman ZB, Zielinski DC, Koike Y, Yurkovich JT, Palsson BO. MASSpy: Building, simulating, and visualizing dynamic biological models in Python using mass action kinetics. PLoS Comput Biol 2021; 17:e1008208. [PMID: 33507922 PMCID: PMC7872247 DOI: 10.1371/journal.pcbi.1008208] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Revised: 02/09/2021] [Accepted: 12/21/2020] [Indexed: 01/01/2023] Open
Abstract
Mathematical models of metabolic networks utilize simulation to study system-level mechanisms and functions. Various approaches have been used to model the steady state behavior of metabolic networks using genome-scale reconstructions, but formulating dynamic models from such reconstructions continues to be a key challenge. Here, we present the Mass Action Stoichiometric Simulation Python (MASSpy) package, an open-source computational framework for dynamic modeling of metabolism. MASSpy utilizes mass action kinetics and detailed chemical mechanisms to build dynamic models of complex biological processes. MASSpy adds dynamic modeling tools to the COnstraint-Based Reconstruction and Analysis Python (COBRApy) package to provide an unified framework for constraint-based and kinetic modeling of metabolic networks. MASSpy supports high-performance dynamic simulation through its implementation of libRoadRunner: the Systems Biology Markup Language (SBML) simulation engine. Three examples are provided to demonstrate how to use MASSpy: (1) a validation of the MASSpy modeling tool through dynamic simulation of detailed mechanisms of enzyme regulation; (2) a feature demonstration using a workflow for generating ensemble of kinetic models using Monte Carlo sampling to approximate missing numerical values of parameters and to quantify biological uncertainty, and (3) a case study in which MASSpy is utilized to overcome issues that arise when integrating experimental data with the computation of functional states of detailed biological mechanisms. MASSpy represents a powerful tool to address challenges that arise in dynamic modeling of metabolic networks, both at small and large scales.
Collapse
Affiliation(s)
- Zachary B. Haiman
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Daniel C. Zielinski
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
| | - Yuko Koike
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - James T. Yurkovich
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Institute for Systems Biology, Seattle, Washington, United States of America
| | - Bernhard O. Palsson
- Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America
- Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| |
Collapse
|
5
|
Srinivasan S, Cluett WR, Mahadevan R. A scalable method for parameter identification in kinetic models of metabolism using steady-state data. Bioinformatics 2020; 35:5216-5225. [PMID: 31197317 DOI: 10.1093/bioinformatics/btz445] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2018] [Revised: 04/26/2019] [Accepted: 06/05/2019] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION In kinetic models of metabolism, the parameter values determine the dynamic behaviour predicted by these models. Estimating parameters from in vivo experimental data require the parameters to be structurally identifiable, and the data to be informative enough to estimate these parameters. Existing methods to determine the structural identifiability of parameters in kinetic models of metabolism can only be applied to models of small metabolic networks due to their computational complexity. Additionally, a priori experimental design, a necessity to obtain informative data for parameter estimation, also does not account for using steady-state data to estimate parameters in kinetic models. RESULTS Here, we present a scalable methodology to structurally identify parameters for each flux in a kinetic model of metabolism based on the availability of steady-state data. In doing so, we also address the issue of determining the number and nature of experiments for generating steady-state data to estimate these parameters. By using a small metabolic network as an example, we show that most parameters in fluxes expressed by mechanistic enzyme kinetic rate laws can be identified using steady-state data, and the steady-state data required for their estimation can be obtained from selective experiments involving both substrate and enzyme level perturbations. The methodology can be used in combination with other identifiability and experimental design algorithms that use dynamic data to determine the most informative experiments requiring the least resources to perform. AVAILABILITY AND IMPLEMENTATION https://github.com/LMSE/ident. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Shyam Srinivasan
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada
| | - William R Cluett
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, 200 College Street, University of Toronto, Toronto, ON, M5S3E5, Canada.,Institute of Biomaterials and Biomedical Engineering, 164 College Street, University of Toronto, Toronto, ON, M5S 3G9, Canada
| |
Collapse
|
6
|
Bromig L, Kremling A, Marin-Sanguino A. Understanding biochemical design principles with ensembles of canonical non-linear models. PLoS One 2020; 15:e0230599. [PMID: 32353072 PMCID: PMC7192416 DOI: 10.1371/journal.pone.0230599] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2020] [Accepted: 03/03/2020] [Indexed: 12/22/2022] Open
Abstract
Systems biology applies concepts from engineering in order to understand biological networks. If such an understanding was complete, biologists would be able to design ad hoc biochemical components tailored for different purposes, which is the goal of synthetic biology. Needless to say that we are far away from creating biological subsystems as intricate and precise as those found in nature, but mathematical models and high throughput techniques have brought us a long way in this direction. One of the difficulties that still needs to be overcome is finding the right values for model parameters and dealing with uncertainty, which is proving to be an extremely difficult task. In this work, we take advantage of ensemble modeling techniques, where a large number of models with different parameter values are formulated and then tested according to some performance criteria. By finding features shared by successful models, the role of different components and the synergies between them can be better understood. We will address some of the difficulties often faced by ensemble modeling approaches, such as the need to sample a space whose size grows exponentially with the number of parameters, and establishing useful selection criteria. Some methods will be shown to reduce the predictions from many models into a set of understandable “design principles” that can guide us to improve or manufacture a biochemical network. Our proposed framework formulates models within standard formalisms in order to integrate information from different sources and minimize the dimension of the parameter space. Additionally, the mathematical properties of the formalism enable a partition of the parameter space into independent subspaces. Each of these subspaces can be paired with a set of criteria that depend exclusively on it, thus allowing a separate sampling/screening in spaces of lower dimension. By applying tests in a strict order where computationally cheaper tests are applied first to each subspace and applying computationally expensive tests to the remaining subset thereafter, the use of resources is optimized and a larger number of models can be examined. This can be compared to a complex database query where the order of the requests can make a huge difference in the processing time. The method will be illustrated by analyzing a classical model of a metabolic pathway with end-product inhibition. Even for such a simple model, the method provides novel insight.
Collapse
Affiliation(s)
- Lukas Bromig
- Specialty Division for Systems Biotechnology, Technische Universität München, Garching, Germany
| | - Andreas Kremling
- Specialty Division for Systems Biotechnology, Technische Universität München, Garching, Germany
| | - Alberto Marin-Sanguino
- Specialty Division for Systems Biotechnology, Technische Universität München, Garching, Germany
- * E-mail:
| |
Collapse
|
7
|
Wu C, Jiang H, Kalra I, Wang X, Cano M, Maness P, Yu J, Xiong W. A generalized computational framework to streamline thermodynamics and kinetics analysis of metabolic pathways. Metab Eng 2020; 57:140-150. [DOI: 10.1016/j.ymben.2019.08.006] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2019] [Revised: 07/18/2019] [Accepted: 08/07/2019] [Indexed: 12/25/2022]
|
8
|
Greene J, Daniell J, Köpke M, Broadbelt L, Tyo KE. Kinetic ensemble model of gas fermenting Clostridium autoethanogenum for improved ethanol production. Biochem Eng J 2019. [DOI: 10.1016/j.bej.2019.04.021] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
|
9
|
Berndt N, Bulik S, Wallach I, Wünsch T, König M, Stockmann M, Meierhofer D, Holzhütter HG. HEPATOKIN1 is a biochemistry-based model of liver metabolism for applications in medicine and pharmacology. Nat Commun 2018; 9:2386. [PMID: 29921957 PMCID: PMC6008457 DOI: 10.1038/s41467-018-04720-9] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2017] [Accepted: 05/14/2018] [Indexed: 12/18/2022] Open
Abstract
The epidemic increase of non-alcoholic fatty liver diseases (NAFLD) requires a deeper understanding of the regulatory circuits controlling the response of liver metabolism to nutritional challenges, medical drugs, and genetic enzyme variants. As in vivo studies of human liver metabolism are encumbered with serious ethical and technical issues, we developed a comprehensive biochemistry-based kinetic model of the central liver metabolism including the regulation of enzyme activities by their reactants, allosteric effectors, and hormone-dependent phosphorylation. The utility of the model for basic research and applications in medicine and pharmacology is illustrated by simulating diurnal variations of the metabolic state of the liver at various perturbations caused by nutritional challenges (alcohol), drugs (valproate), and inherited enzyme disorders (galactosemia). Using proteomics data to scale maximal enzyme activities, the model is used to highlight differences in the metabolic functions of normal hepatocytes and malignant liver cells (adenoma and hepatocellular carcinoma). In silico models of cells can provide insight into the causes and effects of disease states and reduce the need for in vivo studies. Here, the authors present a kinetic model of hepatocyte metabolism including energy, carbohydrate, lipid and nitrogen metabolism and hormonal and allosteric regulation of enzymatic activity.
Collapse
Affiliation(s)
- Nikolaus Berndt
- Institute of Biochemistry Computational Systems Biochemistry Group, Charité - Universitätsmedizin Berlin, Charitéplatz, 110117, Berlin, Germany
| | - Sascha Bulik
- Institute of Biochemistry Computational Systems Biochemistry Group, Charité - Universitätsmedizin Berlin, Charitéplatz, 110117, Berlin, Germany.,German Federal Institute for Risk Assessment Max-Dohrn-Straße 8-10, 10589, Berlin, Germany
| | - Iwona Wallach
- Institute of Biochemistry Computational Systems Biochemistry Group, Charité - Universitätsmedizin Berlin, Charitéplatz, 110117, Berlin, Germany
| | - Tilo Wünsch
- Department of General, Visceral and Transplantation Surgery Augustenburger Platz, Charité - Universitätsmedizin Berlin - Campus Virchow-Klinikum, 113353, Berlin, Germany
| | - Matthias König
- Institute for Biology, Institute for Theoretical Biology, Humboldt-University Berlin, Invalidenstraße 43, Haus, 410115, Berlin, Germany
| | - Martin Stockmann
- German Federal Institute for Risk Assessment Max-Dohrn-Straße 8-10, 10589, Berlin, Germany
| | - David Meierhofer
- Max Planck Institute of Molecular Genetics/Mass Spectroscopy, Ihnestraße 63-73, 14195, Berlin, Germany
| | - Hermann-Georg Holzhütter
- Institute of Biochemistry Computational Systems Biochemistry Group, Charité - Universitätsmedizin Berlin, Charitéplatz, 110117, Berlin, Germany.
| |
Collapse
|
10
|
Costello Z, Martin HG. A machine learning approach to predict metabolic pathway dynamics from time-series multiomics data. NPJ Syst Biol Appl 2018; 4:19. [PMID: 29872542 PMCID: PMC5974308 DOI: 10.1038/s41540-018-0054-3] [Citation(s) in RCA: 119] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2017] [Revised: 04/11/2018] [Accepted: 04/20/2018] [Indexed: 02/01/2023] Open
Abstract
New synthetic biology capabilities hold the promise of dramatically improving our ability to engineer biological systems. However, a fundamental hurdle in realizing this potential is our inability to accurately predict biological behavior after modifying the corresponding genotype. Kinetic models have traditionally been used to predict pathway dynamics in bioengineered systems, but they take significant time to develop, and rely heavily on domain expertise. Here, we show that the combination of machine learning and abundant multiomics data (proteomics and metabolomics) can be used to effectively predict pathway dynamics in an automated fashion. The new method outperforms a classical kinetic model, and produces qualitative and quantitative predictions that can be used to productively guide bioengineering efforts. This method systematically leverages arbitrary amounts of new data to improve predictions, and does not assume any particular interactions, but rather implicitly chooses the most predictive ones.
Collapse
Affiliation(s)
- Zak Costello
- 1Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA USA.,DOE Agile Biofoundry, Emeryville, CA USA.,3DOE Joint BioEnergy Institute, Emeryville, CA USA
| | - Hector Garcia Martin
- 1Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA USA.,DOE Agile Biofoundry, Emeryville, CA USA.,3DOE Joint BioEnergy Institute, Emeryville, CA USA.,4BCAM, Basque Center for Applied Mathematics, Bilbao, Spain
| |
Collapse
|
11
|
Freed EF, Pines G, Eckert CA, Gill RT. Trackable Multiplex Recombineering (TRMR) and Next-Generation Genome Design Technologies: Modifying Gene Expression inE. coliby Inserting Synthetic DNA Cassettes and Molecular Barcodes. Synth Biol (Oxf) 2018. [DOI: 10.1002/9783527688104.ch2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023] Open
Affiliation(s)
- Emily F. Freed
- Biosciences Center, National Renewable Energy Laboratory; 15013 Denver West Parkway Golden CO 80401 USA
| | - Gur Pines
- University of Colorado; Chemical and Biological Engineering; 3415 Colorado Ave Boulder CO 80303 USA
- University of Colorado; Renewable and Sustainable Energy Institute; 4001 Discovery Dr Boulder CO 80303 USA
| | - Carrie A. Eckert
- Biosciences Center, National Renewable Energy Laboratory; 15013 Denver West Parkway Golden CO 80401 USA
- University of Colorado; Renewable and Sustainable Energy Institute; 4001 Discovery Dr Boulder CO 80303 USA
| | - Ryan T. Gill
- University of Colorado; Chemical and Biological Engineering; 3415 Colorado Ave Boulder CO 80303 USA
- University of Colorado; Renewable and Sustainable Energy Institute; 4001 Discovery Dr Boulder CO 80303 USA
| |
Collapse
|
12
|
Voit EO. The best models of metabolism. WILEY INTERDISCIPLINARY REVIEWS. SYSTEMS BIOLOGY AND MEDICINE 2017; 9:10.1002/wsbm.1391. [PMID: 28544810 PMCID: PMC5643013 DOI: 10.1002/wsbm.1391] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Revised: 03/31/2017] [Accepted: 04/01/2017] [Indexed: 12/25/2022]
Abstract
Biochemical systems are among of the oldest application areas of mathematical modeling. Spanning a time period of over one hundred years, the repertoire of options for structuring a model and for formulating reactions has been constantly growing, and yet, it is still unclear whether or to what degree some models are better than others and how the modeler is to choose among them. In fact, the variety of options has become overwhelming and difficult to maneuver for novices and experts alike. This review outlines the metabolic model design process and discusses the numerous choices for modeling frameworks and mathematical representations. It tries to be inclusive, even though it cannot be complete, and introduces the various modeling options in a manner that is as unbiased as that is feasible. However, the review does end with personal recommendations for the choices of default models. WIREs Syst Biol Med 2017, 9:e1391. doi: 10.1002/wsbm.1391 For further resources related to this article, please visit the WIREs website.
Collapse
Affiliation(s)
- Eberhard O Voit
- Department of Biomedical Engineering, Georgia Institute of Technology, Atlanta, GA, USA
| |
Collapse
|
13
|
Formulation, construction and analysis of kinetic models of metabolism: A review of modelling frameworks. Biotechnol Adv 2017; 35:981-1003. [PMID: 28916392 DOI: 10.1016/j.biotechadv.2017.09.005] [Citation(s) in RCA: 78] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2017] [Revised: 08/30/2017] [Accepted: 09/10/2017] [Indexed: 12/13/2022]
Abstract
Kinetic models are critical to predict the dynamic behaviour of metabolic networks. Mechanistic kinetic models for large networks remain uncommon due to the difficulty of fitting their parameters. Recent modelling frameworks promise new ways to overcome this obstacle while retaining predictive capabilities. In this review, we present an overview of the relevant mathematical frameworks for kinetic formulation, construction and analysis. Starting with kinetic formalisms, we next review statistical methods for parameter inference, as well as recent computational frameworks applied to the construction and analysis of kinetic models. Finally, we discuss opportunities and limitations hindering the development of larger kinetic reconstructions.
Collapse
|
14
|
Purdy HM, Reed JL. Evaluating the capabilities of microbial chemical production using genome-scale metabolic models. ACTA ACUST UNITED AC 2017. [DOI: 10.1016/j.coisb.2017.01.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
|
15
|
Henriques D, Villaverde AF, Rocha M, Saez-Rodriguez J, Banga JR. Data-driven reverse engineering of signaling pathways using ensembles of dynamic models. PLoS Comput Biol 2017; 13:e1005379. [PMID: 28166222 PMCID: PMC5319798 DOI: 10.1371/journal.pcbi.1005379] [Citation(s) in RCA: 36] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2016] [Revised: 02/21/2017] [Accepted: 01/24/2017] [Indexed: 11/19/2022] Open
Abstract
Despite significant efforts and remarkable progress, the inference of signaling networks from experimental data remains very challenging. The problem is particularly difficult when the objective is to obtain a dynamic model capable of predicting the effect of novel perturbations not considered during model training. The problem is ill-posed due to the nonlinear nature of these systems, the fact that only a fraction of the involved proteins and their post-translational modifications can be measured, and limitations on the technologies used for growing cells in vitro, perturbing them, and measuring their variations. As a consequence, there is a pervasive lack of identifiability. To overcome these issues, we present a methodology called SELDOM (enSEmbLe of Dynamic lOgic-based Models), which builds an ensemble of logic-based dynamic models, trains them to experimental data, and combines their individual simulations into an ensemble prediction. It also includes a model reduction step to prune spurious interactions and mitigate overfitting. SELDOM is a data-driven method, in the sense that it does not require any prior knowledge of the system: the interaction networks that act as scaffolds for the dynamic models are inferred from data using mutual information. We have tested SELDOM on a number of experimental and in silico signal transduction case-studies, including the recent HPN-DREAM breast cancer challenge. We found that its performance is highly competitive compared to state-of-the-art methods for the purpose of recovering network topology. More importantly, the utility of SELDOM goes beyond basic network inference (i.e. uncovering static interaction networks): it builds dynamic (based on ordinary differential equation) models, which can be used for mechanistic interpretations and reliable dynamic predictions in new experimental conditions (i.e. not used in the training). For this task, SELDOM's ensemble prediction is not only consistently better than predictions from individual models, but also often outperforms the state of the art represented by the methods used in the HPN-DREAM challenge.
Collapse
Affiliation(s)
- David Henriques
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC, Vigo, Spain
| | - Alejandro F. Villaverde
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC, Vigo, Spain
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Miguel Rocha
- Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Julio Saez-Rodriguez
- Joint Research Center for Computational Biomedicine, RWTH-Aachen University, Aachen, Germany
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust Genome Campus, Hinxton, United Kingdom
| | - Julio R. Banga
- Bioprocess Engineering Group, Spanish National Research Council, IIM-CSIC, Vigo, Spain
| |
Collapse
|
16
|
Bassen DM, Vilkhovoy M, Minot M, Butcher JT, Varner JD. JuPOETs: a constrained multiobjective optimization approach to estimate biochemical model ensembles in the Julia programming language. BMC SYSTEMS BIOLOGY 2017; 11:10. [PMID: 28122561 PMCID: PMC5264316 DOI: 10.1186/s12918-016-0380-2] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/08/2016] [Accepted: 12/16/2016] [Indexed: 11/18/2022]
Abstract
Background Ensemble modeling is a promising approach for obtaining robust predictions and coarse grained population behavior in deterministic mathematical models. Ensemble approaches address model uncertainty by using parameter or model families instead of single best-fit parameters or fixed model structures. Parameter ensembles can be selected based upon simulation error, along with other criteria such as diversity or steady-state performance. Simulations using parameter ensembles can estimate confidence intervals on model variables, and robustly constrain model predictions, despite having many poorly constrained parameters. Results In this software note, we present a multiobjective based technique to estimate parameter or models ensembles, the Pareto Optimal Ensemble Technique in the Julia programming language (JuPOETs). JuPOETs integrates simulated annealing with Pareto optimality to estimate ensembles on or near the optimal tradeoff surface between competing training objectives. We demonstrate JuPOETs on a suite of multiobjective problems, including test functions with parameter bounds and system constraints as well as for the identification of a proof-of-concept biochemical model with four conflicting training objectives. JuPOETs identified optimal or near optimal solutions approximately six-fold faster than a corresponding implementation in Octave for the suite of test functions. For the proof-of-concept biochemical model, JuPOETs produced an ensemble of parameters that gave both the mean of the training data for conflicting data sets, while simultaneously estimating parameter sets that performed well on each of the individual objective functions. Conclusions JuPOETs is a promising approach for the estimation of parameter and model ensembles using multiobjective optimization. JuPOETs can be adapted to solve many problem types, including mixed binary and continuous variable types, bilevel optimization problems and constrained problems without altering the base algorithm. JuPOETs is open source, available under an MIT license, and can be installed using the Julia package manager from the JuPOETs GitHub repository
Collapse
Affiliation(s)
- David M Bassen
- Department of Biomedical Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Michael Vilkhovoy
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Mason Minot
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Jonathan T Butcher
- Department of Biomedical Engineering, Cornell University, Ithaca, 14853, NY, USA
| | - Jeffrey D Varner
- Department of Chemical and Biomolecular Engineering, Cornell University, Ithaca, 14853, NY, USA.
| |
Collapse
|
17
|
Application of theoretical methods to increase succinate production in engineered strains. Bioprocess Biosyst Eng 2016; 40:479-497. [PMID: 28040871 DOI: 10.1007/s00449-016-1729-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2016] [Accepted: 12/16/2016] [Indexed: 12/19/2022]
Abstract
Computational methods have enabled the discovery of non-intuitive strategies to enhance the production of a variety of target molecules. In the case of succinate production, reviews covering the topic have not yet analyzed the impact and future potential that such methods may have. In this work, we review the application of computational methods to the production of succinic acid. We found that while a total of 26 theoretical studies were published between 2002 and 2016, only 10 studies reported the successful experimental implementation of any kind of theoretical knowledge. None of the experimental studies reported an exact application of the computational predictions. However, the combination of computational analysis with complementary strategies, such as directed evolution and comparative genome analysis, serves as a proof of concept and demonstrates that successful metabolic engineering can be guided by rational computational methods.
Collapse
|
18
|
Khodayari A, Maranas CD. A genome-scale Escherichia coli kinetic metabolic model k-ecoli457 satisfying flux data for multiple mutant strains. Nat Commun 2016; 7:13806. [PMID: 27996047 PMCID: PMC5187423 DOI: 10.1038/ncomms13806] [Citation(s) in RCA: 149] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2016] [Accepted: 11/03/2016] [Indexed: 01/03/2023] Open
Abstract
Kinetic models of metabolism at a genome scale that faithfully recapitulate the effect of multiple genetic interventions would be transformative in our ability to reliably design novel overproducing microbial strains. Here, we introduce k-ecoli457, a genome-scale kinetic model of Escherichia coli metabolism that satisfies fluxomic data for wild-type and 25 mutant strains under different substrates and growth conditions. The k-ecoli457 model contains 457 model reactions, 337 metabolites and 295 substrate-level regulatory interactions. Parameterization is carried out using a genetic algorithm by simultaneously imposing all available fluxomic data (about 30 measured fluxes per mutant). The Pearson correlation coefficient between experimental data and predicted product yields for 320 engineered strains spanning 24 product metabolites is 0.84. This is substantially higher than that using flux balance analysis, minimization of metabolic adjustment or maximization of product yield exhibiting systematic errors with correlation coefficients of, respectively, 0.18, 0.37 and 0.47 (k-ecoli457 is available for download at http://www.maranasgroup.com).
Collapse
Affiliation(s)
- Ali Khodayari
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Costas D. Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
19
|
Chou WK, Brynildsen MP. A biochemical engineering view of the quest for immune-potentiating anti-infectives. Curr Opin Chem Eng 2016. [DOI: 10.1016/j.coche.2016.08.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
|
20
|
Review of the important challenges and opportunities related to modeling of mammalian cell bioreactors. AIChE J 2016. [DOI: 10.1002/aic.15442] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
|
21
|
Kök MS. An integrated approach: advances in the use ofClostridiumfor biofuel. Biotechnol Genet Eng Rev 2016; 31:69-81. [DOI: 10.1080/02648725.2016.1168075] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
22
|
Lafontaine Rivera JG, Lee Y, Liao JC. An entropy-like index of bifurcational robustness for metabolic systems. Integr Biol (Camb) 2016; 7:895-903. [PMID: 25855352 DOI: 10.1039/c4ib00257a] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
Natural and synthetic metabolic pathways need to retain stability when faced against random changes in gene expression levels and kinetic parameters. In the presence of large parameter changes, a robust system should specifically avoid moving to an unstable region, an event that would dramatically change system behavior. Here we present an entropy-like index, denoted as S, for quantifying the bifurcational robustness of metabolic systems against loss of stability. We show that S enables the optimization of a metabolic model with respect to both bifurcational robustness and experimental data. We then demonstrate how the coupling of ensemble modeling and S enables us to discriminate alternative designs of a synthetic pathway according to bifurcational robustness. Finally, we show that S enables the identification of a key enzyme contributing to the bifurcational robustness of yeast glycolysis. The different applications of S demonstrated illustrate the versatile role it can play in constructing better metabolic models and designing functional non-native pathways.
Collapse
Affiliation(s)
- Jimmy G Lafontaine Rivera
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, 5531 Boelter Hall, Los Angeles, California 90095, USA.
| | | | | |
Collapse
|
23
|
Personalized Whole-Cell Kinetic Models of Metabolism for Discovery in Genomics and Pharmacodynamics. Cell Syst 2015; 1:283-92. [DOI: 10.1016/j.cels.2015.10.003] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2015] [Revised: 08/13/2015] [Accepted: 10/07/2015] [Indexed: 01/07/2023]
|
24
|
Ng CY, Khodayari A, Chowdhury A, Maranas CD. Advances in de novo strain design using integrated systems and synthetic biology tools. Curr Opin Chem Biol 2015; 28:105-14. [DOI: 10.1016/j.cbpa.2015.06.026] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2015] [Revised: 06/13/2015] [Accepted: 06/21/2015] [Indexed: 11/17/2022]
|
25
|
Improving prediction fidelity of cellular metabolism with kinetic descriptions. Curr Opin Biotechnol 2015; 36:57-64. [PMID: 26318076 DOI: 10.1016/j.copbio.2015.08.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2015] [Revised: 08/06/2015] [Accepted: 08/09/2015] [Indexed: 12/13/2022]
Abstract
Several modeling frameworks for describing and redirecting cellular metabolism have been developed keeping pace with the rapid development in high-throughput data generation and advances in metabolic engineering techniques. The incorporation of kinetic information within stoichiometry-only modeling techniques offers potential advantages for improved phenotype prediction and consequently more precise computational strain design. In addition to substrate-level kinetic regulatory information, the integration of a number of additional layers of regulation at the transcription, translation, and post-translation levels is sought after by many research groups. However, the practical integration of these complex biological processes into a unified framework amenable to design remains an ongoing challenge.
Collapse
|
26
|
Imam S, Schäuble S, Brooks AN, Baliga NS, Price ND. Data-driven integration of genome-scale regulatory and metabolic network models. Front Microbiol 2015; 6:409. [PMID: 25999934 PMCID: PMC4419725 DOI: 10.3389/fmicb.2015.00409] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Accepted: 04/20/2015] [Indexed: 12/21/2022] Open
Abstract
Microbes are diverse and extremely versatile organisms that play vital roles in all ecological niches. Understanding and harnessing microbial systems will be key to the sustainability of our planet. One approach to improving our knowledge of microbial processes is through data-driven and mechanism-informed computational modeling. Individual models of biological networks (such as metabolism, transcription, and signaling) have played pivotal roles in driving microbial research through the years. These networks, however, are highly interconnected and function in concert-a fact that has led to the development of a variety of approaches aimed at simulating the integrated functions of two or more network types. Though the task of integrating these different models is fraught with new challenges, the large amounts of high-throughput data sets being generated, and algorithms being developed, means that the time is at hand for concerted efforts to build integrated regulatory-metabolic networks in a data-driven fashion. In this perspective, we review current approaches for constructing integrated regulatory-metabolic models and outline new strategies for future development of these network models for any microbial system.
Collapse
Affiliation(s)
- Saheed Imam
- Institute for Systems Biology Seattle, WA, USA
| | - Sascha Schäuble
- Institute for Systems Biology Seattle, WA, USA ; Jena University Language and Information Engineering Lab, Friedrich-Schiller-University Jena Jena, Germany
| | | | - Nitin S Baliga
- Institute for Systems Biology Seattle, WA, USA ; Departments of Biology and Microbiology, University of Washington Seattle, WA, USA ; Molecular and Cellular Biology Program, University of Washington Seattle, WA, USA ; Lawrence Berkeley National Lab Berkeley, CA, USA
| | | |
Collapse
|
27
|
Villaverde AF, Bongard S, Mauch K, Müller D, Balsa-Canto E, Schmid J, Banga JR. A consensus approach for estimating the predictive accuracy of dynamic models in biology. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2015; 119:17-28. [PMID: 25716416 DOI: 10.1016/j.cmpb.2015.02.001] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/16/2014] [Revised: 12/19/2014] [Accepted: 02/02/2015] [Indexed: 06/04/2023]
Abstract
Mathematical models that predict the complex dynamic behaviour of cellular networks are fundamental in systems biology, and provide an important basis for biomedical and biotechnological applications. However, obtaining reliable predictions from large-scale dynamic models is commonly a challenging task due to lack of identifiability. The present work addresses this challenge by presenting a methodology for obtaining high-confidence predictions from dynamic models using time-series data. First, to preserve the complex behaviour of the network while reducing the number of estimated parameters, model parameters are combined in sets of meta-parameters, which are obtained from correlations between biochemical reaction rates and between concentrations of the chemical species. Next, an ensemble of models with different parameterizations is constructed and calibrated. Finally, the ensemble is used for assessing the reliability of model predictions by defining a measure of convergence of model outputs (consensus) that is used as an indicator of confidence. We report results of computational tests carried out on a metabolic model of Chinese Hamster Ovary (CHO) cells, which are used for recombinant protein production. Using noisy simulated data, we find that the aggregated ensemble predictions are on average more accurate than the predictions of individual ensemble models. Furthermore, ensemble predictions with high consensus are statistically more accurate than ensemble predictions with large variance. The procedure provides quantitative estimates of the confidence in model predictions and enables the analysis of sufficiently complex networks as required for practical applications.
Collapse
Affiliation(s)
| | - Sophia Bongard
- Insilico Biotechnology AG, Meitnerstraße 8, 70563 Stuttgart, Germany.
| | - Klaus Mauch
- Insilico Biotechnology AG, Meitnerstraße 8, 70563 Stuttgart, Germany.
| | - Dirk Müller
- Insilico Biotechnology AG, Meitnerstraße 8, 70563 Stuttgart, Germany.
| | - Eva Balsa-Canto
- Bioprocess Engineering Group, IIM-CSIC, Eduardo Cabello 6, 36208 Vigo, Spain.
| | - Joachim Schmid
- Insilico Biotechnology AG, Meitnerstraße 8, 70563 Stuttgart, Germany.
| | - Julio R Banga
- Bioprocess Engineering Group, IIM-CSIC, Eduardo Cabello 6, 36208 Vigo, Spain.
| |
Collapse
|
28
|
Matsuoka Y, Shimizu K. Current status and future perspectives of kinetic modeling for the cell metabolism with incorporation of the metabolic regulation mechanism. BIORESOUR BIOPROCESS 2015. [DOI: 10.1186/s40643-014-0031-7] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
|
29
|
Khodayari A, Chowdhury A, Maranas CD. Succinate Overproduction: A Case Study of Computational Strain Design Using a Comprehensive Escherichia coli Kinetic Model. Front Bioeng Biotechnol 2015; 2:76. [PMID: 25601910 PMCID: PMC4283520 DOI: 10.3389/fbioe.2014.00076] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2014] [Accepted: 12/05/2014] [Indexed: 01/25/2023] Open
Abstract
Computational strain-design prediction accuracy has been the focus for many recent efforts through the selective integration of kinetic information into metabolic models. In general, kinetic model prediction quality is determined by the range and scope of genetic and/or environmental perturbations used during parameterization. In this effort, we apply the k-OptForce procedure on a kinetic model of E. coli core metabolism constructed using the Ensemble Modeling (EM) method and parameterized using multiple mutant strains data under aerobic respiration with glucose as the carbon source. Minimal interventions are identified that improve succinate yield under both aerobic and anaerobic conditions to test the fidelity of model predictions under both genetic and environmental perturbations. Under aerobic condition, k-OptForce identifies interventions that match existing experimental strategies while pointing at a number of unexplored flux re-directions such as routing glyoxylate flux through the glycerate metabolism to improve succinate yield. Many of the identified interventions rely on the kinetic descriptions that would not be discoverable by a purely stoichiometric description. In contrast, under fermentative (anaerobic) condition, k-OptForce fails to identify key interventions including up-regulation of anaplerotic reactions and elimination of competitive fermentative products. This is due to the fact that the pathways activated under anaerobic condition were not properly parameterized as only aerobic flux data were used in the model construction. This study shed light on the importance of condition-specific model parameterization and provides insight on how to augment kinetic models so as to correctly respond to multiple environmental perturbations.
Collapse
Affiliation(s)
- Ali Khodayari
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Anupam Chowdhury
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| | - Costas D. Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA, USA
| |
Collapse
|
30
|
Khodayari A, Zomorrodi AR, Liao JC, Maranas CD. A kinetic model of Escherichia coli core metabolism satisfying multiple sets of mutant flux data. Metab Eng 2014; 25:50-62. [DOI: 10.1016/j.ymben.2014.05.014] [Citation(s) in RCA: 145] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2014] [Revised: 04/17/2014] [Accepted: 05/28/2014] [Indexed: 01/27/2023]
|
31
|
Ud-Dean SMM, Gunawan R. Ensemble inference and inferability of gene regulatory networks. PLoS One 2014; 9:e103812. [PMID: 25093509 PMCID: PMC4122380 DOI: 10.1371/journal.pone.0103812] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2014] [Accepted: 07/05/2014] [Indexed: 01/05/2023] Open
Abstract
The inference of gene regulatory network (GRN) from gene expression data is an unsolved problem of great importance. This inference has been stated, though not proven, to be underdetermined implying that there could be many equivalent (indistinguishable) solutions. Motivated by this fundamental limitation, we have developed new framework and algorithm, called TRaCE, for the ensemble inference of GRNs. The ensemble corresponds to the inherent uncertainty associated with discriminating direct and indirect gene regulations from steady-state data of gene knock-out (KO) experiments. We applied TRaCE to analyze the inferability of random GRNs and the GRNs of E. coli and yeast from single- and double-gene KO experiments. The results showed that, with the exception of networks with very few edges, GRNs are typically not inferable even when the data are ideal (unbiased and noise-free). Finally, we compared the performance of TRaCE with top performing methods of DREAM4 in silico network inference challenge.
Collapse
Affiliation(s)
| | - Rudiyanto Gunawan
- Institute for Chemical and Bioengineering, ETH Zurich, Zurich, Switzerland
- * E-mail:
| |
Collapse
|
32
|
Mitrophanov AY, Wolberg AS, Reifman J. Kinetic model facilitates analysis of fibrin generation and its modulation by clotting factors: implications for hemostasis-enhancing therapies. MOLECULAR BIOSYSTEMS 2014; 10:2347-57. [PMID: 24958246 PMCID: PMC4128477 DOI: 10.1039/c4mb00263f] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Current mechanistic knowledge of protein interactions driving blood coagulation has come largely from experiments with simple synthetic systems, which only partially represent the molecular composition of human blood plasma. Here, we investigate the ability of the suggested molecular mechanisms to account for fibrin generation and degradation kinetics in diverse, physiologically relevant in vitro systems. We represented the protein interaction network responsible for thrombin generation, fibrin formation, and fibrinolysis as a computational kinetic model and benchmarked it against published and newly generated data reflecting diverse experimental conditions. We then applied the model to investigate the ability of fibrinogen and a recently proposed prothrombin complex concentrate composition, PCC-AT (a combination of the clotting factors II, IX, X, and antithrombin), to restore normal thrombin and fibrin generation in diluted plasma. The kinetic model captured essential features of empirically detected effects of prothrombin, fibrinogen, and thrombin-activatable fibrinolysis inhibitor titrations on fibrin formation and degradation kinetics. Moreover, the model qualitatively predicted the impact of tissue factor and tPA/tenecteplase level variations on the fibrin output. In the majority of considered cases, PCC-AT combined with fibrinogen accurately approximated both normal thrombin and fibrin generation in diluted plasma, which could not be accomplished by fibrinogen or PCC-AT acting alone. We conclude that a common network of protein interactions can account for key kinetic features characterizing fibrin accumulation and degradation in human blood plasma under diverse experimental conditions. Combined PCC-AT/fibrinogen supplementation is a promising strategy to reverse the deleterious effects of dilution-induced coagulopathy associated with traumatic bleeding.
Collapse
Affiliation(s)
- Alexander Y. Mitrophanov
- DoD Biotechnology High-Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, Ft. Detrick, MD 21702
| | - Alisa S. Wolberg
- Department of Pathology and Laboratory Medicine, University of North Carolina School of Medicine, Chapel Hill, NC 27599
| | - Jaques Reifman
- DoD Biotechnology High-Performance Computing Software Applications Institute, Telemedicine and Advanced Technology Research Center, U.S. Army Medical Research and Materiel Command, Ft. Detrick, MD 21702
| |
Collapse
|
33
|
Lee Y, Lafontaine Rivera JG, Liao JC. Ensemble Modeling for Robustness Analysis in engineering non-native metabolic pathways. Metab Eng 2014; 25:63-71. [PMID: 24972370 DOI: 10.1016/j.ymben.2014.06.006] [Citation(s) in RCA: 72] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2014] [Revised: 05/06/2014] [Accepted: 06/18/2014] [Indexed: 01/08/2023]
Abstract
Metabolic pathways in cells must be sufficiently robust to tolerate fluctuations in expression levels and changes in environmental conditions. Perturbations in expression levels may lead to system failure due to the disappearance of a stable steady state. Increasing evidence has suggested that biological networks have evolved such that they are intrinsically robust in their network structure. In this article, we presented Ensemble Modeling for Robustness Analysis (EMRA), which combines a continuation method with the Ensemble Modeling approach, for investigating the robustness issue of non-native pathways. EMRA investigates a large ensemble of reference models with different parameters, and determines the effects of parameter drifting until a bifurcation point, beyond which a stable steady state disappears and system failure occurs. A pathway is considered to have high bifurcational robustness if the probability of system failure is low in the ensemble. To demonstrate the utility of EMRA, we investigate the bifurcational robustness of two synthetic central metabolic pathways that achieve carbon conservation: non-oxidative glycolysis and reverse glyoxylate cycle. With EMRA, we determined the probability of system failure of each design and demonstrated that alternative designs of these pathways indeed display varying degrees of bifurcational robustness. Furthermore, we demonstrated that target selection for flux improvement should consider the trade-offs between robustness and performance.
Collapse
Affiliation(s)
- Yun Lee
- Department of Chemical and Biomolecular Engineering, University of California, 5531 Boelter Hall, Los Angeles, CA 90095, USA
| | - Jimmy G Lafontaine Rivera
- Department of Chemical and Biomolecular Engineering, University of California, 5531 Boelter Hall, Los Angeles, CA 90095, USA
| | - James C Liao
- Department of Chemical and Biomolecular Engineering, University of California, 5531 Boelter Hall, Los Angeles, CA 90095, USA; UCLA-DOE Institute for Genomics and Proteomics, University of California, 611 Young Drive East, Los Angeles, CA 90095, USA.
| |
Collapse
|
34
|
|
35
|
Harcombe WR, Delaney NF, Leiby N, Klitgord N, Marx CJ. The ability of flux balance analysis to predict evolution of central metabolism scales with the initial distance to the optimum. PLoS Comput Biol 2013; 9:e1003091. [PMID: 23818838 PMCID: PMC3688462 DOI: 10.1371/journal.pcbi.1003091] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2012] [Accepted: 04/26/2013] [Indexed: 11/21/2022] Open
Abstract
The most powerful genome-scale framework to model metabolism, flux balance analysis (FBA), is an evolutionary optimality model. It hypothesizes selection upon a proposed optimality criterion in order to predict the set of internal fluxes that would maximize fitness. Here we present a direct test of the optimality assumption underlying FBA by comparing the central metabolic fluxes predicted by multiple criteria to changes measurable by a 13C-labeling method for experimentally-evolved strains. We considered datasets for three Escherichia coli evolution experiments that varied in their length, consistency of environment, and initial optimality. For ten populations that were evolved for 50,000 generations in glucose minimal medium, we observed modest changes in relative fluxes that led to small, but significant decreases in optimality and increased the distance to the predicted optimal flux distribution. In contrast, seven populations evolved on the poor substrate lactate for 900 generations collectively became more optimal and had flux distributions that moved toward predictions. For three pairs of central metabolic knockouts evolved on glucose for 600–800 generations, there was a balance between cases where optimality and flux patterns moved toward or away from FBA predictions. Despite this variation in predictability of changes in central metabolism, two generalities emerged. First, improved growth largely derived from evolved increases in the rate of substrate use. Second, FBA predictions bore out well for the two experiments initiated with ancestors with relatively sub-optimal yield, whereas those begun already quite optimal tended to move somewhat away from predictions. These findings suggest that the tradeoff between rate and yield is surprisingly modest. The observed positive correlation between rate and yield when adaptation initiated further from the optimum resulted in the ability of FBA to use stoichiometric constraints to predict the evolution of metabolism despite selection for rate. The most common method of modeling genome-scale metabolism, flux balance analysis, involves using known stoichiometry to define feasible metabolic states and then choosing between these states by proposing that evolution has selected a metabolic flux that optimizes fitness. But does evolution optimize metabolism, and if so, what component of metabolism equates to fitness? We directly tested the underlying assumption of stoichiometric optimality by comparing predicted flux distributions with changes in fluxes that occurred following experimental evolution. Across three experiments ranging in length from a few hundred to fifty thousand generations, we found that substrate uptake – an input to the model – always increased, but supposed optimality criteria such as yield only increased sometimes. Despite this, there was a clear trend. Highly optimal ancestors evolved slightly lower yield in the course of increasing the overall rate, whereas more sub-optimal strains were able to increase both. These results suggest that flux balance analysis is capable of predicting either the initial metabolic behavior of strains or how they will evolve, but not both.
Collapse
Affiliation(s)
- William R. Harcombe
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Nigel F. Delaney
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Nicholas Leiby
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Systems Biology Program, Harvard University, Cambridge, Massachusetts, United States of America
| | - Niels Klitgord
- Bioinformatics Graduate Program, Boston University, Boston, Massachusetts, United States of America
| | - Christopher J. Marx
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts, United States of America
- Faculty of Arts and Sciences Center for Systems Biology, Harvard University, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
36
|
Marcheschi RJ, Gronenberg LS, Liao JC. Protein engineering for metabolic engineering: current and next-generation tools. Biotechnol J 2013; 8:545-55. [PMID: 23589443 DOI: 10.1002/biot.201200371] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2012] [Revised: 03/07/2013] [Accepted: 03/20/2013] [Indexed: 11/10/2022]
Abstract
Protein engineering in the context of metabolic engineering is increasingly important to the field of industrial biotechnology. As the demand for biologically produced food, fuels, chemicals, food additives, and pharmaceuticals continues to grow, the ability to design and modify proteins to accomplish new functions will be required to meet the high productivity demands for the metabolism of engineered organisms. We review advances in selecting, modeling, and engineering proteins to improve or alter their activity. Some of the methods have only recently been developed for general use and are just beginning to find greater application in the metabolic engineering community. We also discuss methods of generating random and targeted diversity in proteins to generate mutant libraries for analysis. Recent uses of these techniques to alter cofactor use; produce non-natural amino acids, alcohols, and carboxylic acids; and alter organism phenotypes are presented and discussed as examples of the successful engineering of proteins for metabolic engineering purposes.
Collapse
Affiliation(s)
- Ryan J Marcheschi
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, CA, USA
| | | | | |
Collapse
|
37
|
Jouhten P. Metabolic modelling in the development of cell factories by synthetic biology. Comput Struct Biotechnol J 2012; 3:e201210009. [PMID: 24688669 PMCID: PMC3962133 DOI: 10.5936/csbj.201210009] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2012] [Revised: 11/05/2012] [Accepted: 11/07/2012] [Indexed: 11/22/2022] Open
Abstract
Cell factories are commonly microbial organisms utilized for bioconversion of renewable resources to bulk or high value chemicals. Introduction of novel production pathways in chassis strains is the core of the development of cell factories by synthetic biology. Synthetic biology aims to create novel biological functions and systems not found in nature by combining biology with engineering. The workflow of the development of novel cell factories with synthetic biology is ideally linear which will be attainable with the quantitative engineering approach, high-quality predictive models, and libraries of well-characterized parts. Different types of metabolic models, mathematical representations of metabolism and its components, enzymes and metabolites, are useful in particular phases of the synthetic biology workflow. In this minireview, the role of metabolic modelling in synthetic biology will be discussed with a review of current status of compatible methods and models for the in silico design and quantitative evaluation of a cell factory.
Collapse
Affiliation(s)
- Paula Jouhten
- VTT Technical Research Centre of Finland, Tietotie 2, 02044 VTT, Espoo, Finland
| |
Collapse
|
38
|
Jang YS, Malaviya A, Cho C, Lee J, Lee SY. Butanol production from renewable biomass by clostridia. BIORESOURCE TECHNOLOGY 2012; 123:653-63. [PMID: 22939593 DOI: 10.1016/j.biortech.2012.07.104] [Citation(s) in RCA: 100] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2012] [Revised: 07/25/2012] [Accepted: 07/26/2012] [Indexed: 05/24/2023]
Abstract
Global energy crisis and limited supply of petroleum fuels have rekindled the worldwide focus towards development of a sustainable technology for alternative fuel production. Utilization of abundant renewable biomass offers an excellent opportunity for the development of an economical biofuel production process at a scale sufficiently large to have an impact on sustainability and security objectives. Additionally, several environmental benefits have also been linked with the utilization of renewable biomass. Butanol is considered to be superior to ethanol due to its higher energy content and less hygroscopy. This has led to an increased research interest in butanol production from renewable biomass in recent years. In this paper, we review the various aspects of utilizing renewable biomass for clostridial butanol production. Focus is given on various alternative substrates that have been used for butanol production and on fermentation strategies recently reported to improve butanol production.
Collapse
Affiliation(s)
- Yu-Sin Jang
- Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 Program), BioProcess Engineering Research Center, Center for Systems and Synthetic Biotechnology, Institute for the BioCentury, KAIST, Republic of Korea
| | | | | | | | | |
Collapse
|
39
|
Jang YS, Kim B, Shin JH, Choi YJ, Choi S, Song CW, Lee J, Park HG, Lee SY. Bio-based production of C2-C6 platform chemicals. Biotechnol Bioeng 2012; 109:2437-59. [DOI: 10.1002/bit.24599] [Citation(s) in RCA: 299] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2012] [Revised: 06/26/2012] [Accepted: 06/26/2012] [Indexed: 12/20/2022]
|
40
|
Georgiou G, Lee SY. Editorial: Michael Shuler's legacy in biochemical engineering. Biotechnol J 2012; 7:314-6. [DOI: 10.1002/biot.201290012] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|