1
Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Juenemann R, Knudsen CNM, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc database (2025). EcoSal Plus 2025:eesp00192024. [PMID: 40304522 DOI: 10.1128/ecosalplus.esp-0019-2024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/21/2024] [Accepted: 03/18/2025] [Indexed: 05/02/2025]
Abstract
EcoCyc is a bioinformatics database (DB) available at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed via EcoCyc.org. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and the procedures by which this content is generated.
Affiliation(s)
- Peter D Karp
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Suzanne Paley
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Ron Caspi
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Anamika Kothari
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Markus Krummenacker
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Peter E Midford
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Lisa R Moore
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Pallavi Subhraveti
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Socorro Gama-Castro
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Víctor H Tierrafria
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Paloma Lara
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Luis Muñiz-Rascado
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- César Bonavides-Martinez
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Alberto Santos-Zavaleta
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Amanda Mackie
  - School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
- Gwanggyu Sun
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Travis A Ahn-Horst
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Heejo Choi
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Riley Juenemann
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Cyrus N M Knudsen
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Markus W Covert
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Julio Collado-Vides
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, Mexico
- Ian Paulsen
  - School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
2
De Domenico M, Allegri L, Caldarelli G, d'Andrea V, Di Camillo B, Rocha LM, Rozum J, Sbarbati R, Zambelli F. Challenges and opportunities for digital twins in precision medicine from a complex systems perspective. NPJ Digit Med 2025; 8:37. [PMID: 39825012 PMCID: PMC11742446 DOI: 10.1038/s41746-024-01402-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 05/10/2024] [Accepted: 12/16/2024] [Indexed: 01/20/2025] Open
Abstract
Digital twins (DTs) in precision medicine are increasingly viable, propelled by extensive data collection and advancements in artificial intelligence (AI), alongside traditional biomedical methodologies. We argue that including mechanistic simulations that produce behavior based on explicitly defined biological hypotheses and multiscale mechanisms is beneficial. It enables the exploration of diverse therapeutic strategies and supports dynamic clinical decision-making through insights from network science, quantitative biology, and digital medicine.
Affiliation(s)
- Manlio De Domenico
  - Department of Physics and Astronomy "Galileo Galilei", University of Padua, Padova, Italy
  - Padua Center for Network Medicine, University of Padua, Padova, Italy
  - Padua Neuroscience Center, University of Padua, Padova, Italy
  - Istituto Nazionale di Fisica Nucleare, sez. di Padova, Italy
- Luca Allegri
  - Department of Physics and Astronomy "Galileo Galilei", University of Padua, Padova, Italy
- Guido Caldarelli
  - DSMN and ECLT Ca' Foscari University of Venice, Venezia, Italy
  - Institute of Complex Systems (ISC) CNR unit Sapienza University, Rome, Italy
  - London Institute for Mathematical Sciences, Royal Institution, London, UK
- Valeria d'Andrea
  - Department of Physics and Astronomy "Galileo Galilei", University of Padua, Padova, Italy
  - Istituto Nazionale di Fisica Nucleare, sez. di Padova, Italy
- Barbara Di Camillo
  - Padua Center for Network Medicine, University of Padua, Padova, Italy
  - Department of Information Engineering, University of Padua, Padova, Italy
  - Department of Comparative Biomedicine and Food Science, University of Padua, Padova, Italy
- Luis M Rocha
  - School of Systems Science and Industrial Eng., Binghamton University, Binghamton, NY, USA
  - Universidade Católica Portuguesa, Católica Biomedical Research Centre, Lisbon, Portugal
- Jordan Rozum
  - School of Systems Science and Industrial Eng., Binghamton University, Binghamton, NY, USA
- Riccardo Sbarbati
  - Department of Physics and Astronomy "Galileo Galilei", University of Padua, Padova, Italy
  - Istituto Nazionale di Fisica Nucleare, sez. di Padova, Italy
- Francesco Zambelli
  - Department of Physics and Astronomy "Galileo Galilei", University of Padua, Padova, Italy
  - Istituto Nazionale di Fisica Nucleare, sez. di Padova, Italy
3
Newton MS, Azadeh AL, Morgenthaler AB, Copley SD. Challenging a decades-old paradigm: ProB and ProA do not channel the unstable intermediate in proline synthesis after all. Proc Natl Acad Sci U S A 2024; 121:e2413673121. [PMID: 39514317 PMCID: PMC11573504 DOI: 10.1073/pnas.2413673121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/08/2024] [Accepted: 10/07/2024] [Indexed: 11/16/2024] Open
Abstract
The pathway for synthesis of proline in most forms of life produces a highly unstable intermediate, γ-L-glutamyl 5-phosphate (GP). For nearly 70 y, channeling of this intermediate from the active site of glutamate 5-kinase to the active site of GP reductase has been believed to protect GP from cyclization to a dead-end product. However, the evidence presented in support of this idea is not conclusive. We show that changes in the structures of the kinase or reductase that should preclude a protein-protein interaction do not compromise proline synthesis in Escherichia coli, demonstrating that channeling does not occur. We calculate that the half-life of GP is 320 ms. Although GP is indeed unstable, it should diffuse the length of an E. coli cell in less than 3 ms. Thus, most GP produced by glutamate 5-kinase should encounter the active site of GP reductase before cyclization occurs.
Affiliation(s)
- Matilda S. Newton
  - Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO 80309
  - Royal Society Te Aparangi, Wellington 6140, New Zealand
- Ashley L. Azadeh
  - Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO 80309
- Andrew B. Morgenthaler
  - Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO 80309
  - Amyris Inc., Emeryville, CA 94608
- Shelley D. Copley
  - Department of Molecular, Cellular and Developmental Biology and the Cooperative Institute for Research in Environmental Sciences, University of Colorado, Boulder, CO 80309
4
Lange E, Kranert L, Krüger J, Benndorf D, Heyer R. Microbiome modeling: a beginner's guide. Front Microbiol 2024; 15:1368377. [PMID: 38962127 PMCID: PMC11220171 DOI: 10.3389/fmicb.2024.1368377] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/10/2024] [Accepted: 05/27/2024] [Indexed: 07/05/2024] Open
Abstract
Microbiomes, composed of diverse microbial species and viruses, play pivotal roles in human health, environmental processes, and biotechnological applications and interact with each other, their environment, and hosts via ecological interactions. Our understanding of microbiomes is still limited and hampered by their complexity. A concept improving this understanding is systems biology, which focuses on the holistic description of biological systems utilizing experimental and computational methods. An important set of such experimental methods is the metaomics methods, which analyze microbiomes and output lists of molecular features. These lists of data are integrated, interpreted, and compiled into computational microbiome models to predict, optimize, and control microbiome behavior. There exists a gap in understanding between microbiologists and modelers/bioinformaticians, stemming from a lack of interdisciplinary knowledge. This knowledge gap hinders the establishment of computational models in microbiome analysis. This review aims to bridge this gap and is tailored for microbiologists, researchers new to microbiome modeling, and bioinformaticians. To achieve this goal, it provides an interdisciplinary overview of microbiome modeling, starting with fundamental knowledge of microbiomes, metaomics methods, common modeling formalisms, and how models facilitate microbiome control. It concludes with guidelines and repositories for modeling. Each section provides entry-level information, example applications, and important references, serving as a valuable resource for comprehending and navigating the complex landscape of microbiome research and modeling.
Affiliation(s)
- Emanuel Lange
  - Multidimensional Omics Data Analysis, Department for Bioanalytics, Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany
  - Graduate School Digital Infrastructure for the Life Sciences, Bielefeld Institute for Bioinformatics Infrastructure (BIBI), Faculty of Technology, Bielefeld University, Bielefeld, Germany
- Lena Kranert
  - Institute for Automation Engineering, Otto von Guericke University Magdeburg, Magdeburg, Germany
- Jacob Krüger
  - Engineering of Software-Intensive Systems, Department of Mathematics and Computer Science, Eindhoven University of Technology, Eindhoven, Netherlands
- Dirk Benndorf
  - Applied Biosciences and Bioprocess Engineering, Anhalt University of Applied Sciences, Köthen, Germany
- Robert Heyer
  - Multidimensional Omics Data Analysis, Department for Bioanalytics, Leibniz-Institut für Analytische Wissenschaften - ISAS - e.V., Dortmund, Germany
  - Graduate School Digital Infrastructure for the Life Sciences, Bielefeld Institute for Bioinformatics Infrastructure (BIBI), Faculty of Technology, Bielefeld University, Bielefeld, Germany
  - Multidimensional Omics Data Analysis, Faculty of Technology, Bielefeld University, Bielefeld, Germany
5
Hao T, Song Z, Zhang M, Zhang L, Yang J, Li J, Sun J. Reconstruction of Metabolic-Protein Interaction Integrated Network of Eriocheir sinensis and Analysis of Ecdysone Synthesis. Genes (Basel) 2024; 15:410. [PMID: 38674345 PMCID: PMC11049885 DOI: 10.3390/genes15040410] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/28/2024] [Revised: 03/24/2024] [Accepted: 03/25/2024] [Indexed: 04/28/2024] Open
Abstract
Integrated networks have become a new interest in genome-scale network research due to their ability to comprehensively reflect and analyze the molecular processes in cells. Currently, no integrated networks have been reported for higher organisms. Eriocheir sinensis is a typical aquatic animal that grows through ecdysis. Ecdysone has been identified as a crucial regulator of ecdysis, but the influencing factors and regulatory mechanisms of ecdysone synthesis in E. sinensis are still unclear. In this work, the genome-scale metabolic network and protein-protein interaction network of E. sinensis were integrated to reconstruct a metabolic-protein interaction integrated network (MPIN). The MPIN was used to analyze the influencing factors of ecdysone synthesis through flux variation analysis. In total, 236 integrated reactions (IRs) were found to influence ecdysone synthesis, of which 16 IRs had a significant impact. These IRs constitute three ecdysone synthesis routes. The analysis suggests that E. sinensis might have alternative pathways to obtain cholesterol for ecdysone synthesis instead of absorbing it directly from feed. The MPIN reconstructed in this work is the first integrated network for a higher organism. The analysis based on the MPIN supplies important information for the mechanistic analysis of ecdysone synthesis in E. sinensis.
Affiliation(s)
- Tong Hao
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
- Zhentao Song
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
- Mingzhi Zhang
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
- Lingrui Zhang
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
- Jiarui Yang
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
- Jingjing Li
  - Tianjin Fisheries Research Institute, Tianjin 300211, China
- Jinsheng Sun
  - Tianjin Key Laboratory of Animal and Plant Resistance, College of Life Sciences, Tianjin Normal University, Tianjin 300387, China
6
Sun G, DeFelice MM, Gillies TE, Ahn-Horst TA, Andrews CJ, Krummenacker M, Karp PD, Morrison JH, Covert MW. Cross-evaluation of E. coli's operon structures via a whole-cell model suggests alternative cellular benefits for low- versus high-expressing operons. Cell Syst 2024; 15:227-245.e7. [PMID: 38417437 PMCID: PMC10957310 DOI: 10.1016/j.cels.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/30/2023] [Revised: 09/12/2023] [Accepted: 02/08/2024] [Indexed: 03/01/2024]
Abstract
Many bacteria use operons to coregulate genes, but it remains unclear how operons benefit bacteria. We integrated E. coli's 788 polycistronic operons and 1,231 transcription units into an existing whole-cell model and found inconsistencies between the proposed operon structures and the RNA-seq read counts that the model was parameterized from. We resolved these inconsistencies through iterative, model-guided corrections to both datasets, including the correction of RNA-seq counts of short genes that were misreported as zero by existing alignment algorithms. The resulting model suggested two main modes by which operons benefit bacteria. For 86% of low-expression operons, adding operons increased the co-expression probabilities of their constituent proteins, whereas for 92% of high-expression operons, adding operons resulted in more stable expression ratios between the proteins. These simulations underscored the need for further experimental work on how operons reduce noise and synchronize both the expression timing and the quantity of constituent genes. A record of this paper's transparent peer review process is included in the supplemental information.
Affiliation(s)
- Gwanggyu Sun
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Mialy M DeFelice
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Taryn E Gillies
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Travis A Ahn-Horst
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Cecelia J Andrews
  - Department of Developmental Biology, Stanford University, Stanford, CA 94305, USA
- Jerry H Morrison
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Markus W Covert
  - Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
7
Karp PD, Paley S, Caspi R, Kothari A, Krummenacker M, Midford PE, Moore LR, Subhraveti P, Gama-Castro S, Tierrafria VH, Lara P, Muñiz-Rascado L, Bonavides-Martinez C, Santos-Zavaleta A, Mackie A, Sun G, Ahn-Horst TA, Choi H, Covert MW, Collado-Vides J, Paulsen I. The EcoCyc Database (2023). EcoSal Plus 2023; 11:eesp00022023. [PMID: 37220074 PMCID: PMC10729931 DOI: 10.1128/ecosalplus.esp-0002-2023] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Received: 03/13/2023] [Accepted: 04/04/2023] [Indexed: 01/28/2024]
Abstract
EcoCyc is a bioinformatics database available online at EcoCyc.org that describes the genome and the biochemical machinery of Escherichia coli K-12 MG1655. The long-term goal of the project is to describe the complete molecular catalog of the E. coli cell, as well as the functions of each of its molecular parts, to facilitate a system-level understanding of E. coli. EcoCyc is an electronic reference source for E. coli biologists and for biologists who work with related microorganisms. The database includes information pages on each E. coli gene product, metabolite, reaction, operon, and metabolic pathway. The database also includes information on the regulation of gene expression, E. coli gene essentiality, and nutrient conditions that do or do not support the growth of E. coli. The website and downloadable software contain tools for the analysis of high-throughput data sets. In addition, a steady-state metabolic flux model is generated from each new version of EcoCyc and can be executed online. The model can predict metabolic flux rates, nutrient uptake rates, and growth rates for different gene knockouts and nutrient conditions. Data generated from a whole-cell model that is parameterized from the latest data on EcoCyc are also available. This review outlines the data content of EcoCyc and the procedures by which this content is generated.
Affiliation(s)
- Peter D. Karp
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Suzanne Paley
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Ron Caspi
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Anamika Kothari
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Markus Krummenacker
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Peter E. Midford
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Lisa R. Moore
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Pallavi Subhraveti
  - Bioinformatics Research Group, SRI International, Menlo Park, California, USA
- Socorro Gama-Castro
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Victor H. Tierrafria
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Paloma Lara
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Luis Muñiz-Rascado
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- César Bonavides-Martinez
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Alberto Santos-Zavaleta
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Amanda Mackie
  - Department of Chemistry and Biomolecular Sciences, Macquarie University, Sydney, New South Wales, Australia
- Gwanggyu Sun
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Travis A. Ahn-Horst
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Heejo Choi
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Markus W. Covert
  - Department of Bioengineering, Stanford University, Stanford, California, USA
- Julio Collado-Vides
  - Programa de Genómica Computacional, Centro de Ciencias Genómicas, Universidad Nacional Autónoma de México, Cuernavaca, Morelos, México
- Ian Paulsen
  - School of Natural Sciences, Macquarie University, Sydney, New South Wales, Australia
8
Georgouli K, Yeom JS, Blake RC, Navid A. Multi-scale models of whole cells: progress and challenges. Front Cell Dev Biol 2023; 11:1260507. [PMID: 38020904 PMCID: PMC10661945 DOI: 10.3389/fcell.2023.1260507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/18/2023] [Accepted: 10/19/2023] [Indexed: 12/01/2023] Open
Abstract
Whole-cell modeling is "the ultimate goal" of computational systems biology and "a grand challenge for 21st century" (Tomita, Trends in Biotechnology, 2001, 19(6), 205-10). These complex, highly detailed models account for the activity of every molecule in a cell and serve as comprehensive knowledgebases for the modeled system. Their scope and utility far surpass those of other systems models. In fact, whole-cell models (WCMs) are an amalgam of several types of "system" models. The models are simulated using a hybrid modeling method where the appropriate mathematical methods for each biological process are used to simulate their behavior. Given the complexity of the models, the process of developing and curating these models is labor-intensive and to date only a handful of these models have been developed. While whole-cell models provide valuable and novel biological insights, and to date have identified some novel biological phenomena, their most important contribution has been to highlight the discrepancy between available data and observations that are used for the parametrization and validation of complex biological models. Another realization has been that current whole-cell modeling simulators are slow and to run models that mimic more complex (e.g., multi-cellular) biosystems, those need to be executed in an accelerated fashion on high-performance computing platforms. In this manuscript, we review the progress of whole-cell modeling to date and discuss some of the ways that they can be improved.
Affiliation(s)
- Konstantia Georgouli
  - Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
- Jae-Seung Yeom
  - Center for Applied Scientific Computing, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
- Robert C. Blake
  - Center for Applied Scientific Computing, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
- Ali Navid
  - Biosciences and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
9
Basile A, Zampieri G, Kovalovszki A, Karkaria B, Treu L, Patil KR, Campanaro S. Modelling of microbial interactions in anaerobic digestion: from black to glass box. Curr Opin Microbiol 2023; 75:102363. [PMID: 37542746 DOI: 10.1016/j.mib.2023.102363] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 03/16/2023] [Revised: 05/20/2023] [Accepted: 07/10/2023] [Indexed: 08/07/2023]
Abstract
Anaerobic and microaerophilic environments are pervasive in nature, providing essential contributions to the maintenance of human health, biogeochemical cycles and the Earth's climate. These ecological niches are characterised by low free oxygen and oxidants, or lack thereof. Under these conditions, interactions between species are essential for supporting the growth of syntrophic species and maintaining thermodynamic feasibility of anaerobic fermentation. Kinetic models provide a simplified view of complex metabolic networks, while genome-scale metabolic models and flux-balance analysis (FBA) aim to unravel these systems as a whole. The target of this review is to outline the main similarities, differences and challenges associated with kinetic and metabolic modelling, and describe state-of-the-art modelling practices for studying syntrophies in the anaerobic digestion (AD) case study.
Affiliation(s)
- Arianna Basile
  - Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
- Guido Zampieri
  - Department of Biology, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
- Adam Kovalovszki
  - Department of Environmental and Resource Engineering, Technical University of Denmark, Building 115, Bygningstorvet, 2800 Kgs. Lyngby, Denmark
- Behzad Karkaria
  - Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
- Laura Treu
  - Department of Biology, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
- Kiran Raosaheb Patil
  - Medical Research Council Toxicology Unit, University of Cambridge, Cambridge, UK
- Stefano Campanaro
  - Department of Biology, University of Padova, Via U. Bassi 58/b, 35121 Padova, Italy
10
Choi H, Covert MW. Whole-cell modeling of E. coli confirms that in vitro tRNA aminoacylation measurements are insufficient to support cell growth and predicts a positive feedback mechanism regulating arginine biosynthesis. Nucleic Acids Res 2023; 51:5911-5930. [PMID: 37224536 PMCID: PMC10325894 DOI: 10.1093/nar/gkad435] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Received: 02/02/2023] [Revised: 05/04/2023] [Accepted: 05/09/2023] [Indexed: 05/26/2023] Open
Abstract
In Escherichia coli, inconsistencies between in vitro tRNA aminoacylation measurements and in vivo protein synthesis demands were postulated almost 40 years ago, but have proven difficult to confirm. Whole-cell modeling can test whether a cell behaves in a physiologically correct manner when parameterized with in vitro measurements by providing a holistic representation of cellular processes in vivo. Here, a mechanistic model of tRNA aminoacylation, codon-based polypeptide elongation, and N-terminal methionine cleavage was incorporated into a developing whole-cell model of E. coli. Subsequent analysis confirmed the insufficiency of aminoacyl-tRNA synthetase kinetic measurements for cellular proteome maintenance, and estimated aminoacyl-tRNA synthetase kcats that were on average 7.6-fold higher. Simulating cell growth with perturbed kcats demonstrated the global impact of these in vitro measurements on cellular phenotypes. For example, an insufficient kcat for HisRS caused protein synthesis to be less robust to the natural variability in aminoacyl-tRNA synthetase expression in single cells. More surprisingly, insufficient ArgRS activity led to catastrophic impacts on arginine biosynthesis due to underexpressed N-acetylglutamate synthase, where translation depends on repeated CGG codons. Overall, the expanded E. coli model deepens understanding of how translation operates in an in vivo context.
Affiliation(s)
- Heejo Choi
  - Department of Bioengineering, Stanford University, 443 Via Ortega, Stanford, CA 94305, USA
- Markus W Covert
  - Department of Bioengineering, Stanford University, 443 Via Ortega, Stanford, CA 94305, USA
11
Skalnik CJ, Cheah SY, Yang MY, Wolff MB, Spangler RK, Talman L, Morrison JH, Peirce SM, Agmon E, Covert MW. Whole-cell modeling of E. coli colonies enables quantification of single-cell heterogeneity in antibiotic responses. PLoS Comput Biol 2023; 19:e1011232. [PMID: 37327241 DOI: 10.1371/journal.pcbi.1011232] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 04/28/2021] [Accepted: 06/01/2023] [Indexed: 06/18/2023] Open
Abstract
Antibiotic resistance poses mounting risks to human health, as current antibiotics are losing efficacy against increasingly resistant pathogenic bacteria. Of particular concern is the emergence of multidrug-resistant strains, which has been rapid among Gram-negative bacteria such as Escherichia coli. A large body of work has established that antibiotic resistance mechanisms depend on phenotypic heterogeneity, which may be mediated by stochastic expression of antibiotic resistance genes. The link between such molecular-level expression and the population levels that result is complex and multi-scale. Therefore, to better understand antibiotic resistance, what is needed are new mechanistic models that reflect single-cell phenotypic dynamics together with population-level heterogeneity, as an integrated whole. In this work, we sought to bridge single-cell and population-scale modeling by building upon our previous experience in "whole-cell" modeling, an approach which integrates mathematical and mechanistic descriptions of biological processes to recapitulate the experimentally observed behaviors of entire cells. To extend whole-cell modeling to the "whole-colony" scale, we embedded multiple instances of a whole-cell E. coli model within a model of a dynamic spatial environment, allowing us to run large, parallelized simulations on the cloud that contained all the molecular detail of the previous whole-cell model and many interactive effects of a colony growing in a shared environment. The resulting simulations were used to explore the response of E. coli to two antibiotics with different mechanisms of action, tetracycline and ampicillin, enabling us to identify sub-generationally-expressed genes, such as the beta-lactamase ampC, which contributed greatly to dramatic cellular differences in steady-state periplasmic ampicillin and was a significant factor in determining cell survival.
Affiliation(s)
- Christopher J Skalnik
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Sean Y Cheah
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Mica Y Yang
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Mattheus B Wolff
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Ryan K Spangler
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Lee Talman
  - Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
- Jerry H Morrison
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
- Shayn M Peirce
  - Department of Biomedical Engineering, University of Virginia, Charlottesville, Virginia, United States of America
- Eran Agmon
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
  - Center for Cell Analysis and Modeling, University of Connecticut School of Medicine, Farmington, Connecticut, United States of America
- Markus W Covert
  - Department of Bioengineering, Stanford University, Stanford, California, United States of America
|
12
|
Ousalem F, Singh S, Bailey NA, Wong KH, Zhu L, Neky MJ, Sibindi C, Fei J, Gonzalez RL, Boël G, Hunt JF. Comparative genetic, biochemical, and biophysical analyses of the four E. coli ABCF paralogs support distinct functions related to mRNA translation. bioRxiv 2023:2023.06.11.543863. [PMID: 37398404 PMCID: PMC10312648 DOI: 10.1101/2023.06.11.543863] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 07/04/2023]
Abstract
Multiple paralogous ABCF ATPases are encoded in most genomes, but the physiological functions remain unknown for most of them. We herein compare the four Escherichia coli K12 ABCFs - EttA, Uup, YbiT, and YheS - using assays previously employed to demonstrate EttA gates the first step of polypeptide elongation on the ribosome dependent on ATP/ADP ratio. A Δuup knockout, like ΔettA, exhibits strongly reduced fitness when growth is restarted from long-term stationary phase, but neither ΔybiT nor ΔyheS exhibits this phenotype. All four proteins nonetheless functionally interact with ribosomes based on in vitro translation and single-molecule fluorescence resonance energy transfer experiments employing variants harboring glutamate-to-glutamine active-site mutations (EQ₂) that trap them in the ATP-bound conformation. These variants all strongly stabilize the same global conformational state of a ribosomal elongation complex harboring deacylated tRNA(Val) in the P site. However, EQ₂-Uup uniquely exchanges on/off the ribosome on a second timescale, while EQ₂-YheS-bound ribosomes uniquely sample alternative global conformations. At sub-micromolar concentrations, EQ₂-EttA and EQ₂-YbiT fully inhibit in vitro translation of an mRNA encoding luciferase, while EQ₂-Uup and EQ₂-YheS only partially inhibit it at ~10-fold higher concentrations. Moreover, tripeptide synthesis reactions are not inhibited by EQ₂-Uup or EQ₂-YheS, while EQ₂-YbiT inhibits synthesis of both peptide bonds and EQ₂-EttA specifically traps ribosomes after synthesis of the first peptide bond. These results support the four E. coli ABCF paralogs all having different activities on translating ribosomes, and they suggest that there remains a substantial amount of functionally uncharacterized "dark matter" involved in mRNA translation.
|
13
|
Favate JS, Skalenko KS, Chiles E, Su X, Yadavalli SS, Shah P. Linking genotypic and phenotypic changes in the E. coli Long-Term Evolution Experiment using metabolomics. bioRxiv 2023:2023.02.15.528756. [PMID: 36874203 PMCID: PMC9985142 DOI: 10.1101/2023.02.15.528756] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/10/2023]
Abstract
Changes in an organism's environment, genome, or gene expression patterns can lead to changes in its metabolism. The metabolic phenotype can be under selection and contributes to adaptation. However, the networked and convoluted nature of an organism's metabolism makes relating mutations, metabolic changes, and effects on fitness challenging. To overcome this challenge, we use the Long-Term Evolution Experiment (LTEE) with E. coli as a model to understand how mutations can eventually affect metabolism and perhaps fitness. We used mass-spectrometry to broadly survey the metabolomes of the ancestral strains and all 12 evolved lines. We combined this metabolic data with mutation and expression data to suggest how mutations that alter specific reaction pathways, such as the biosynthesis of nicotinamide adenine dinucleotide, might increase fitness in the system. Our work provides a better understanding of how mutations might affect fitness through the metabolic changes in the LTEE and thus provides a major step in developing a complete genotype-phenotype map for this experimental system.
Affiliation(s)
- John S. Favate
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
  - Human Genetics Institute of New Jersey, Piscataway, New Jersey, USA
- Kyle S. Skalenko
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
  - Waksman Institute, Rutgers University, Piscataway, New Jersey, USA
- Eric Chiles
  - Cancer Institute of New Jersey, New Brunswick, New Jersey, USA
- Xiaoyang Su
  - Cancer Institute of New Jersey, New Brunswick, New Jersey, USA
- Srujana S. Yadavalli
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
  - Waksman Institute, Rutgers University, Piscataway, New Jersey, USA
- Premal Shah
  - Department of Genetics, Rutgers University, Piscataway, New Jersey, USA
  - Human Genetics Institute of New Jersey, Piscataway, New Jersey, USA
|
14
|
Beer RD, Di Paolo EA. The theoretical foundations of enaction: Precariousness. Biosystems 2023; 223:104823. [PMID: 36574923 DOI: 10.1016/j.biosystems.2022.104823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/29/2022] [Revised: 11/28/2022] [Accepted: 12/14/2022] [Indexed: 12/25/2022]
Abstract
Enaction is an increasingly influential approach to cognition that grew out of Maturana and Varela's earlier work on autopoiesis and the biology of cognition. As with any relatively new scientific discipline, the enactive approach would benefit greatly from a careful analysis of its theoretical foundations. Here we initiate such an analysis for one of the core concepts of enaction, precariousness. Specifically, we consider three types of fragility: systemic, processual and thermodynamic. Using a glider in the Game of Life as a toy model, we illustrate each of these fragilities and examine the relationships between them. We also argue that each type of fragility is characterized by which aspects of a system are hardwired into its definition from the outset and which aspects are emergent and hence vulnerable to disintegration without ongoing maintenance.
Affiliation(s)
- Randall D Beer
  - Cognitive Science Program, Luddy School of Informatics, Computing and Engineering, Indiana University, USA
- Ezequiel A Di Paolo
  - Ikerbasque, Basque Foundation for Science, Bizkaia, Spain
  - IAS-Research Center for Life, Mind and Society, University of the Basque Country, Donostia, Spain
  - Department of Informatics, University of Sussex, Brighton, UK
|
15
|
Ramírez Rojas AA, Swidah R, Schindler D. Microbes of traditional fermentation processes as synthetic biology chassis to tackle future food challenges. Front Bioeng Biotechnol 2022; 10:982975. [PMID: 36185425 PMCID: PMC9523148 DOI: 10.3389/fbioe.2022.982975] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 06/30/2022] [Accepted: 08/10/2022] [Indexed: 11/23/2022] [Open access]
Abstract
Microbial diversity is magnificent and essential to almost all life on Earth. Microbes are an essential part of every human, allowing us to utilize otherwise inaccessible resources. It is no surprise that humans started, initially unconsciously, domesticating microbes for food production: one may call this microbial domestication 1.0. Sourdough bread is just one of the miracles performed by microbial fermentation, allowing extraction of more nutrients from flour and at the same time creating a fluffy and delicious loaf. There are a broad range of products the production of which requires fermentation such as chocolate, cheese, coffee and vinegar. Eventually, with the rise of microscopy, humans became aware of microbial life. Today our knowledge and technological advances allow us to genetically engineer microbes - one may call this microbial domestication 2.0. Synthetic biology and microbial chassis adaptation allow us to tackle current and future food challenges. One of the most apparent challenges is the limited space on Earth available for agriculture and its major tolls on the environment through use of pesticides and the replacement of ecosystems with monocultures. Further challenges include transport and packaging, exacerbated by the 24/7 on-demand mentality of many customers. Synthetic biology already tackles multiple food challenges and will be able to tackle many future food challenges. In this perspective article, we highlight recent microbial synthetic biology research to address future food challenges. We further give a perspective on how synthetic biology tools may teach old microbes new tricks, and what standardized microbial domestication could look like.
|
16
|
Ahn-Horst TA, Mille LS, Sun G, Morrison JH, Covert MW. An expanded whole-cell model of E. coli links cellular physiology with mechanisms of growth rate control. NPJ Syst Biol Appl 2022; 8:30. [PMID: 35986058 PMCID: PMC9391491 DOI: 10.1038/s41540-022-00242-9] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Received: 04/28/2022] [Accepted: 07/28/2022] [Indexed: 11/09/2022] [Open access]
Abstract
Growth and environmental responses are essential for living organisms to survive and adapt to constantly changing environments. In order to simulate new conditions and capture dynamic responses to environmental shifts in a developing whole-cell model of E. coli, we incorporated additional regulation, including dynamics of the global regulator guanosine tetraphosphate (ppGpp), along with dynamics of amino acid biosynthesis and translation. With the model, we show that under perturbed ppGpp conditions, small molecule feedback inhibition pathways, in addition to regulation of expression, play a role in ppGpp regulation of growth. We also found that simulations with dysregulated amino acid synthesis pathways provide average amino acid concentration predictions that are comparable to experimental results but on the single-cell level, concentrations unexpectedly show regular fluctuations. Additionally, during both an upshift and downshift in nutrient availability, the simulated cell responds similarly with a transient increase in the mRNA:rRNA ratio. This additional simulation functionality should support a variety of new applications and expansions of the E. coli Whole-Cell Modeling Project.
Affiliation(s)
- Travis A Ahn-Horst
  - Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
- Gwanggyu Sun
  - Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
- Jerry H Morrison
  - Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
- Markus W Covert
  - Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
|
17
|
Danchin A. In vivo, in vitro and in silico: an open space for the development of microbe-based applications of synthetic biology. Microb Biotechnol 2022; 15:42-64. [PMID: 34570957 PMCID: PMC8719824 DOI: 10.1111/1751-7915.13937] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Received: 09/11/2021] [Accepted: 09/14/2021] [Indexed: 12/24/2022] [Open access]
Abstract
Living systems are studied using three complementary approaches: living cells, cell-free systems and computer-mediated modelling. Progress in understanding, which allows researchers to create novel chassis and industrial processes, rests on a cycle that combines in vivo, in vitro and in silico studies. This design-build-test-learn iteration loop between experiments and analyses combines physiology, genetics, biochemistry and bioinformatics in a way that keeps going forward. Because computer-aided approaches are not directly constrained by the material nature of the entities of interest, we illustrate here how this virtuous cycle allows researchers to explore chemistry which is foreign to that present in extant life, from whole chassis to novel metabolic cycles. Particular emphasis is placed on the importance of evolution.
Affiliation(s)
- Antoine Danchin
  - Kodikos Labs, Institut Cochin, 24 rue du Faubourg Saint-Jacques, Paris 75014, France
|