1
|
Cross-evaluation of E. coli's operon structures via a whole-cell model suggests alternative cellular benefits for low- versus high-expressing operons. Cell Syst 2024; 15:227-245.e7. [PMID: 38417437 PMCID: PMC10957310 DOI: 10.1016/j.cels.2024.02.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2023] [Revised: 09/12/2023] [Accepted: 02/08/2024] [Indexed: 03/01/2024]
Abstract
Many bacteria use operons to coregulate genes, but it remains unclear how operons benefit bacteria. We integrated E. coli's 788 polycistronic operons and 1,231 transcription units into an existing whole-cell model and found inconsistencies between the proposed operon structures and the RNA-seq read counts that the model was parameterized from. We resolved these inconsistencies through iterative, model-guided corrections to both datasets, including the correction of RNA-seq counts of short genes that were misreported as zero by existing alignment algorithms. The resulting model suggested two main modes by which operons benefit bacteria. For 86% of low-expression operons, adding operons increased the co-expression probabilities of their constituent proteins, whereas for 92% of high-expression operons, adding operons resulted in more stable expression ratios between the proteins. These simulations underscored the need for further experimental work on how operons reduce noise and synchronize both the expression timing and the quantity of constituent genes. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
|
2
|
Computer Simulation for Effective Pharmaceutical Kinetics and Dynamics: A Review. Curr Comput Aided Drug Des 2024; 20:325-340. [PMID: 36852789 DOI: 10.2174/1573409919666230228104901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2022] [Revised: 12/13/2022] [Accepted: 01/04/2023] [Indexed: 03/01/2023]
Abstract
Computer-based modelling and simulation are developing as effective tools for supplementing biological data processing and interpretation. It helps to accelerate the creation of dosage forms at a lower cost and with the less human effort required to conduct the work. This paper aims to provide a comprehensive description of the different computer simulation models for various drugs along with their outcomes. The data used are taken from different sources, including review papers from Science Direct, Elsevier, NCBI, and Web of Science from 1995-2020. Keywords like - pharmacokinetic, pharmacodynamics, computer simulation, whole-cell model, and cell simulation, were used for the search process. The use of computer simulation helps speed up the creation of new dosage forms at a lower cost and less human effort required to complete the work. It is also widely used as a technique for researching the structure and dynamics of lipids and proteins found in membranes. It also facilitates both the diagnosis and prevention of illness. Conventional data analysis methods cannot assess and comprehend the huge amount, size, and complexity of data collected by in vitro, in vivo, and ex vivo experiments. As a result, numerous in silico computational e-resources, databases, and simulation software are employed to determine pharmacokinetic (PK) and pharmacodynamic (PD) parameters for illness management. These techniques aid in the provision of multiscale representations of biological processes, beginning with proteins and genes and progressing through cells, isolated tissues and organs, and the whole organism.
Collapse
|
3
|
Multi-scale models of whole cells: progress and challenges. Front Cell Dev Biol 2023; 11:1260507. [PMID: 38020904 PMCID: PMC10661945 DOI: 10.3389/fcell.2023.1260507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2023] [Accepted: 10/19/2023] [Indexed: 12/01/2023] Open
Abstract
Whole-cell modeling is "the ultimate goal" of computational systems biology and "a grand challenge for 21st century" (Tomita, Trends in Biotechnology, 2001, 19(6), 205-10). These complex, highly detailed models account for the activity of every molecule in a cell and serve as comprehensive knowledgebases for the modeled system. Their scope and utility far surpass those of other systems models. In fact, whole-cell models (WCMs) are an amalgam of several types of "system" models. The models are simulated using a hybrid modeling method where the appropriate mathematical methods for each biological process are used to simulate their behavior. Given the complexity of the models, the process of developing and curating these models is labor-intensive and to date only a handful of these models have been developed. While whole-cell models provide valuable and novel biological insights, and to date have identified some novel biological phenomena, their most important contribution has been to highlight the discrepancy between available data and observations that are used for the parametrization and validation of complex biological models. Another realization has been that current whole-cell modeling simulators are slow and to run models that mimic more complex (e.g., multi-cellular) biosystems, those need to be executed in an accelerated fashion on high-performance computing platforms. In this manuscript, we review the progress of whole-cell modeling to date and discuss some of the ways that they can be improved.
Collapse
|
4
|
Dynamics of chromosome organization in a minimal bacterial cell. Front Cell Dev Biol 2023; 11:1214962. [PMID: 37621774 PMCID: PMC10445541 DOI: 10.3389/fcell.2023.1214962] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 07/10/2023] [Indexed: 08/26/2023] Open
Abstract
Computational models of cells cannot be considered complete unless they include the most fundamental process of life, the replication and inheritance of genetic material. By creating a computational framework to model systems of replicating bacterial chromosomes as polymers at 10 bp resolution with Brownian dynamics, we investigate changes in chromosome organization during replication and extend the applicability of an existing whole-cell model (WCM) for a genetically minimal bacterium, JCVI-syn3A, to the entire cell-cycle. To achieve cell-scale chromosome structures that are realistic, we model the chromosome as a self-avoiding homopolymer with bending and torsional stiffnesses that capture the essential mechanical properties of dsDNA in Syn3A. In addition, the conformations of the circular DNA must avoid overlapping with ribosomes identitied in cryo-electron tomograms. While Syn3A lacks the complex regulatory systems known to orchestrate chromosome segregation in other bacteria, its minimized genome retains essential loop-extruding structural maintenance of chromosomes (SMC) protein complexes (SMC-scpAB) and topoisomerases. Through implementing the effects of these proteins in our simulations of replicating chromosomes, we find that they alone are sufficient for simultaneous chromosome segregation across all generations within nested theta structures. This supports previous studies suggesting loop-extrusion serves as a near-universal mechanism for chromosome organization within bacterial and eukaryotic cells. Furthermore, we analyze ribosome diffusion under the influence of the chromosome and calculate in silico chromosome contact maps that capture inter-daughter interactions. Finally, we present a methodology to map the polymer model of the chromosome to a Martini coarse-grained representation to prepare molecular dynamics models of entire Syn3A cells, which serves as an ultimate means of validation for cell states predicted by the WCM.
Collapse
|
5
|
Soft X-ray tomograms provide a structural basis for whole-cell modeling. FASEB J 2023; 37:e22681. [PMID: 36519968 PMCID: PMC10107707 DOI: 10.1096/fj.202200253r] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2022] [Revised: 11/13/2022] [Accepted: 11/21/2022] [Indexed: 12/23/2022]
Abstract
Developing in silico models that accurately reflect a whole, functional cell is an ongoing challenge in biology. Current efforts bring together mathematical models, probabilistic models, visual representations, and data to create a multi-scale description of cellular processes. A realistic whole-cell model requires imaging data since it provides spatial constraints and other critical cellular characteristics that are still impossible to obtain by calculation alone. This review introduces Soft X-ray Tomography (SXT) as a powerful imaging technique to visualize and quantify the mesoscopic (~25 nm spatial scale) organelle landscape in whole cells. SXT generates three-dimensional reconstructions of cellular ultrastructure and provides a measured structural framework for whole-cell modeling. Combining SXT with data from disparate technologies at varying spatial resolutions provides further biochemical details and constraints for modeling cellular mechanisms. We conclude, based on the results discussed here, that SXT provides a foundational dataset for a broad spectrum of whole-cell modeling experiments.
Collapse
|
6
|
Integrative modeling of the cell. Acta Biochim Biophys Sin (Shanghai) 2022; 54:1213-1221. [PMID: 36017893 PMCID: PMC9909318 DOI: 10.3724/abbs.2022115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022] Open
Abstract
A whole-cell model represents certain aspects of the cell structure and/or function. Due to the high complexity of the cell, an integrative modeling approach is often taken to utilize all available information including experimental data, prior knowledge and prior models. In this review, we summarize an emerging workflow of whole-cell modeling into five steps: (i) gather information; (ii) represent the modeled system into modules; (iii) translate input information into scoring function; (iv) sample the whole-cell model; (v) validate and interpret the model. In particular, we propose the integrative modeling of the cell by combining available (whole-cell) models to maximize the accuracy, precision, and completeness. In addition, we list quantitative predictions of various aspects of cell biology from existing whole-cell models. Moreover, we discuss the remaining challenges and future directions, and highlight the opportunity to establish an integrative spatiotemporal multi-scale whole-cell model based on a community approach.
Collapse
|
7
|
Abstract
Comprehensive modeling of a whole cell requires an integration of vast amounts of information on various aspects of the cell and its parts. To divide and conquer this task, we introduce Bayesian metamodeling, a general approach to modeling complex systems by integrating a collection of heterogeneous input models. Each input model can in principle be based on any type of data and can describe a different aspect of the modeled system using any mathematical representation, scale, and level of granularity. These input models are 1) converted to a standardized statistical representation relying on probabilistic graphical models, 2) coupled by modeling their mutual relations with the physical world, and 3) finally harmonized with respect to each other. To illustrate Bayesian metamodeling, we provide a proof-of-principle metamodel of glucose-stimulated insulin secretion by human pancreatic β-cells. The input models include a coarse-grained spatiotemporal simulation of insulin vesicle trafficking, docking, and exocytosis; a molecular network model of glucose-stimulated insulin secretion signaling; a network model of insulin metabolism; a structural model of glucagon-like peptide-1 receptor activation; a linear model of a pancreatic cell population; and ordinary differential equations for systemic postprandial insulin response. Metamodeling benefits from decentralized computing, while often producing a more accurate, precise, and complete model that contextualizes input models as well as resolves conflicting information. We anticipate Bayesian metamodeling will facilitate collaborative science by providing a framework for sharing expertise, resources, data, and models, as exemplified by the Pancreatic β-Cell Consortium.
Collapse
|
8
|
Abstract
The Escherichia coli whole-cell modeling project seeks to create the most detailed computational model of an E. coli cell in order to better understand and predict the behavior of this model organism. Details about the approach, framework, and current version of the model are discussed. Currently, the model includes the functions of 43% of characterized genes, with ongoing efforts to include additional data and mechanisms. As additional information is incorporated in the model, its utility and predictive power will continue to increase, which means that discovery efforts can be accelerated by community involvement in the generation and inclusion of data. This project will be an invaluable resource to the E. coli community that could be used to verify expected physiological behavior, to predict new outcomes and testable hypotheses for more efficient experimental design iterations, and to evaluate heterogeneous data sets in the context of each other through deep curation.
Collapse
|
9
|
A forecast for large-scale, predictive biology: Lessons from meteorology. Cell Syst 2021; 12:488-496. [PMID: 34139161 PMCID: PMC8217727 DOI: 10.1016/j.cels.2021.05.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 04/01/2021] [Accepted: 05/18/2021] [Indexed: 11/19/2022]
Abstract
Quantitative systems biology, in which predictive mathematical models are constructed to guide the design of experiments and predict experimental outcomes, is at an exciting transition point, where the foundational scientific principles are becoming established, but the impact is not yet global. The next steps necessary for mathematical modeling to transform biological research and applications, in the same way it has already transformed other fields, is not completely clear. The purpose of this perspective is to forecast possible answers to this question-what needs to happen next-by drawing on the experience gained in another field, specifically meteorology. We review here a number of lessons learned in weather prediction that are directly relevant to biological systems modeling, and that we believe can enable the same kinds of global impact in our field as atmospheric modeling makes today.
Collapse
|
10
|
Learning causal networks using inducible transcription factors and transcriptome-wide time series. Mol Syst Biol 2021; 16:e9174. [PMID: 32181581 PMCID: PMC7076914 DOI: 10.15252/msb.20199174] [Citation(s) in RCA: 35] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 02/13/2020] [Accepted: 02/19/2020] [Indexed: 11/27/2022] Open
Abstract
We present IDEA (the Induction Dynamics gene Expression Atlas), a dataset constructed by independently inducing hundreds of transcription factors (TFs) and measuring timecourses of the resulting gene expression responses in budding yeast. Each experiment captures a regulatory cascade connecting a single induced regulator to the genes it causally regulates. We discuss the regulatory cascade of a single TF, Aft1, in detail; however, IDEA contains > 200 TF induction experiments with 20 million individual observations and 100,000 signal‐containing dynamic responses. As an application of IDEA, we integrate all timecourses into a whole‐cell transcriptional model, which is used to predict and validate multiple new and underappreciated transcriptional regulators. We also find that the magnitudes of coefficients in this model are predictive of genetic interaction profile similarities. In addition to being a resource for exploring regulatory connectivity between TFs and their target genes, our modeling approach shows that combining rapid perturbations of individual genes with genome‐scale time‐series measurements is an effective strategy for elucidating gene regulatory networks.
Collapse
|
11
|
Biomolecular interactions modulate macromolecular structure and dynamics in atomistic model of a bacterial cytoplasm. eLife 2016; 5. [PMID: 27801646 PMCID: PMC5089862 DOI: 10.7554/elife.19274] [Citation(s) in RCA: 193] [Impact Index Per Article: 24.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Accepted: 09/28/2016] [Indexed: 12/24/2022] Open
Abstract
Biological macromolecules function in highly crowded cellular environments. The structure and dynamics of proteins and nucleic acids are well characterized in vitro, but in vivo crowding effects remain unclear. Using molecular dynamics simulations of a comprehensive atomistic model cytoplasm we found that protein-protein interactions may destabilize native protein structures, whereas metabolite interactions may induce more compact states due to electrostatic screening. Protein-protein interactions also resulted in significant variations in reduced macromolecular diffusion under crowded conditions, while metabolites exhibited significant two-dimensional surface diffusion and altered protein-ligand binding that may reduce the effective concentration of metabolites and ligands in vivo. Metabolic enzymes showed weak non-specific association in cellular environments attributed to solvation and entropic effects. These effects are expected to have broad implications for the in vivo functioning of biomolecules. This work is a first step towards physically realistic in silico whole-cell models that connect molecular with cellular biology.
Collapse
|
12
|
Abstract
OBJECTIVE Whole-cell (WC) modeling is a promising tool for biological research, bioengineering, and medicine. However, substantial work remains to create accurate comprehensive models of complex cells. METHODS We organized the 2015 Whole-Cell Modeling Summer School to teach WC modeling and evaluate the need for new WC modeling standards and software by recoding a recently published WC model in the Systems Biology Markup Language. RESULTS Our analysis revealed several challenges to representing WC models using the current standards. CONCLUSION We, therefore, propose several new WC modeling standards, software, and databases. SIGNIFICANCE We anticipate that these new standards and software will enable more comprehensive models.
Collapse
|
13
|
Efficient Analysis of Systems Biology Markup Language Models of Cellular Populations Using Arrays. ACS Synth Biol 2016; 5:835-41. [PMID: 26912276 DOI: 10.1021/acssynbio.5b00242] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The Systems Biology Markup Language (SBML) has been widely used for modeling biological systems. Although SBML has been successful in representing a wide variety of biochemical models, the core standard lacks the structure for representing large complex regular systems in a standard way, such as whole-cell and cellular population models. These models require a large number of variables to represent certain aspects of these types of models, such as the chromosome in the whole-cell model and the many identical cell models in a cellular population. While SBML core is not designed to handle these types of models efficiently, the proposed SBML arrays package can represent such regular structures more easily. However, in order to take full advantage of the package, analysis needs to be aware of the arrays structure. When expanding the array constructs within a model, some of the advantages of using arrays are lost. This paper describes a more efficient way to simulate arrayed models. To illustrate the proposed method, this paper uses a population of repressilator and genetic toggle switch circuits as examples. Results show that there are memory benefits using this approach with a modest cost in runtime.
Collapse
|
14
|
Why Build Whole-Cell Models? Trends Cell Biol 2015; 25:719-722. [PMID: 26471224 DOI: 10.1016/j.tcb.2015.09.004] [Citation(s) in RCA: 41] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2015] [Revised: 09/11/2015] [Accepted: 09/14/2015] [Indexed: 10/22/2022]
Abstract
Our ability to build computational models that account for all known gene functions in a cell has increased dramatically. But why build whole-cell models, and how can they best be used? In this forum, we enumerate several areas in which whole-cell modeling can significantly impact research and technology.
Collapse
|