1
|
Kircher T, Votsmeier M. Machine Learning Surrogate Models for Mechanistic Kinetics: Embedding Atom Balance and Positivity. J Phys Chem Lett 2025; 16:4715-4723. [PMID: 40323851 DOI: 10.1021/acs.jpclett.5c00602] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2025]
Abstract
Multiscale simulations of reactive flows are critical in many fields. However, their application is often hindered by the high computational cost of solving detailed chemical kinetics. Recent advances in surrogate models for reactive chemistry offer promising speedups, but ensuring physical consistency remains challenging. In particular, machine learning models for chemical kinetics must enforce atom balance and guarantee the positivity of predicted concentrations. Here, we introduce a positivity preserving projection and a correction by linear interpolation backtracking which simultaneously guarantee both constraints. We demonstrate this using two practical examples from atmospheric chemistry and heterogeneous catalysis, as well as for a large number of random, synthetically generated reaction systems. In all cases, our approach yields exclusively positive model predictions conforming to the atom balance, without reducing the overall accuracy of the model.
Collapse
Affiliation(s)
- Tim Kircher
- Technische Universität Darmstadt, Darmstadt, 64287, Germany
| | - Martin Votsmeier
- Technische Universität Darmstadt, Darmstadt, 64287, Germany
- Umicore AG & Co. KG, Hanau 63457, Germany
| |
Collapse
|
2
|
Yue T, Nguyen D, Varshney V, Li Y. Assessing the Effectiveness of Neural Networks and Molecular Dynamics Simulations in Predicting Viscosity of Small Organic Molecules. J Phys Chem B 2025; 129:4501-4513. [PMID: 40267179 DOI: 10.1021/acs.jpcb.4c08757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/25/2025]
Abstract
Viscosity is a crucial material property that influences a wide range of applications, including three-dimensional (3D) printing, lubricants, and solvents. However, experimental approaches to measuring viscosity face challenges such as handling multiple samples, high costs, and limited compound availability. To address these limitations, we have developed computational models for viscosity prediction of small organic molecules, utilizing machine learning (ML) and nonequilibrium molecular dynamics (NEMD) simulations. Our ML framework, which includes feed-forward neural networks (FNN) and physics-informed neural networks (PINN), is based on the largest data set of small molecule viscosities compiled from the literature. The PINN model, in particular, incorporates temperature dependence through a four-parameter model, allowing for the direct prediction of continuous temperature-dependent viscosity curves. The ML models demonstrate exceptional prediction accuracy for the viscosity of various organic compounds across a wide range of temperatures. External validation of our models further confirms that the ML prediction models outperform the NEMD approach in predicting viscosity across a diverse range of organic molecules and temperatures. This highlights the potential of ML models to overcome limitations in traditional MD simulations, which often struggle with accuracy for specific molecules or temperature ranges. Our further feature importance analysis revealed a strong correlation between molecular structure and viscosity. We emphasize the key role of substructures in determining viscosity, offering deeper molecular insights for material design with tailored viscosity.
Collapse
Affiliation(s)
- Tianle Yue
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Danh Nguyen
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| | - Vikas Varshney
- Materials and Manufacturing Directorate, Air Force Research Laboratory, Wright-Patterson Air Force Base, Ohio 45433, United States
| | - Ying Li
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, United States
| |
Collapse
|
3
|
Baker JJ, Shi J, Wang S, Mujica EM, Bianco S, Capponi S, Dueber JE. ML-enhanced peroxisome capacity enables compartmentalization of multienzyme pathway. Nat Chem Biol 2025; 21:727-735. [PMID: 39402374 DOI: 10.1038/s41589-024-01759-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 09/20/2024] [Indexed: 11/10/2024]
Abstract
Repurposing an organelle for specialized metabolism provides an avenue for fermentable, unicellular organisms such as Saccharomyces cerevisiae to mimic compartmentalization of metabolic pathways within different plant tissues. Peroxisomes are attractive organelles for repurposing as they are not required for yeast viability when grown on glucose and can efficiently compartmentalize heterologous enzymes to enable physical separation of cytosolic native metabolism and peroxisomal engineered metabolism. However, when not required, peroxisomes are repressed, leading to low functional capacities for heterologous proteins. Here we engineer peroxisomes with enhanced functional capacities, with the goal of compartmentalizing up to eight metabolic enzymes to enhance titers. We implement a machine learning pipeline that allows the identification of factors to overexpress, culminating in a 137% increase in peroxisome functional capacity compared to a wild-type strain. Improved pathway compartmentalization enables an 80% increase in the biosynthesis titers of the monoterpene geraniol, up to 9.5 g L-1.
Collapse
Affiliation(s)
- Jordan J Baker
- Department of Bioengineering, University of California, Berkeley, CA, USA
- UC Berkeley and UCSF Joint Graduate Program in Bioengineering, University of California, Berkeley, CA, USA
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA
| | - Jie Shi
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA, USA
| | - Shangying Wang
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA, USA
- Bay Area Institute of Science, Altos Labs, Redwood City, CA, USA
| | - Elena M Mujica
- Department of Bioengineering, University of California, Berkeley, CA, USA
| | - Simone Bianco
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA, USA
- Bay Area Institute of Science, Altos Labs, Redwood City, CA, USA
| | - Sara Capponi
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA.
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA, USA.
| | - John E Dueber
- Department of Bioengineering, University of California, Berkeley, CA, USA.
- NSF Center for Cellular Construction, University of California, San Francisco, CA, USA.
- Biological Systems and Engineering Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|
4
|
Sun Q, Ren H, Bhat MA, Liu N, Li Z, Li Z, Cheng Q, Ren Y, Yang N, Ma Z. Evaluation of water environmental capacity in a northern river-reservoir continuum using environmental fluid dynamics code. THE SCIENCE OF THE TOTAL ENVIRONMENT 2025; 959:178274. [PMID: 39752985 DOI: 10.1016/j.scitotenv.2024.178274] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 11/18/2024] [Accepted: 12/22/2024] [Indexed: 01/15/2025]
Abstract
The hydrodynamics, water temperature, and water quality model for the Dan River and Renzhuang Reservoir continuum were developed using field monitoring data and the Environmental Fluid Dynamics Code (EFDC). An in-situ water discharge experiment enabled the calculation of water propagation time using a simulated flood progression method and the hydrodynamics module of EFDC. Based on these model results, degradation coefficients for chemical oxygen demand, biochemical oxygen demand, nitrogen (N), phosphorus (P), fluoride, arsenic were determined, revealing significantly higher values when the wetland barrage was opening. According to field monitoring and EFDC model outputs, the water environment capacity (WEC) of total nitrogen (TN) in Dan River was even negative, with the WEC of TN and total phosphorus (TP) in Renzhuang Reservoir being very low at 147 kg/d and 1490 kg/d, respective. Furthermore, the water environment carrying capacity (WECC) was found to be extremely low (~0.2 %) and limited by TN and TP. The results demonstrated that the spatiotemporal variations of water quality presented by the EFDC model facilitate an intuitive and comprehensive observation of water quality compliance rates over time and space, providing valuable references for decision-makers. The WEC and WECC of the study area underscored the urgency of N and P coordination control.
Collapse
Affiliation(s)
- Qingqing Sun
- School of Earth System Science, Tianjin University, Tianjin 300072, China.
| | - Huanlian Ren
- Changzhi Hydrology and Water Resources Survey Station, Changzhi 046011, China
| | - Mohd Aadil Bhat
- State Key Laboratory of Marine Geology, Tongji University, Shanghai 200092, China
| | - Na Liu
- Hydrology and Water Resources Survey Station of Shanxi Province, Taiyuan 030001, China
| | - Zhaolun Li
- Luoyang Hydrology and Water Resources Survey Bureau, Yellow River Conservancy Commission of the Ministry of Water Resources, Luoyang 471013, China
| | - Zechao Li
- Changzhi Hydrology and Water Resources Survey Station, Changzhi 046011, China
| | - Qiliang Cheng
- Changzhi Hydrology and Water Resources Survey Station, Changzhi 046011, China
| | - Yimeng Ren
- Shanxi Water Conservancy Development Center, Taiyuan 030002, China
| | - Ning Yang
- College of Management and Economics, Tianjin University, Tianjin 300072, China; School of Architecture and Urban Planning, Henan University of Urban Construction, Pingdingshan 467036, China; Henan Chengyuan Zhuoyue Comprehensive Design and Research Institute Co, Ltd, Pingdingshan 467036, China
| | - Zhuoni Ma
- School of Earth System Science, Tianjin University, Tianjin 300072, China
| |
Collapse
|
5
|
Garcia JA, Bouchnita A. Exploring the spatial effects influencing the EGFR/ERK pathway dynamics with machine learning surrogate models. Biosystems 2025; 247:105360. [PMID: 39521268 DOI: 10.1016/j.biosystems.2024.105360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2024] [Revised: 09/15/2024] [Accepted: 10/30/2024] [Indexed: 11/16/2024]
Abstract
The fate of cells is regulated by biochemical reactions taking place inside of them, known as intracellular pathways. Cells display a variety of characteristics related to their shape, structure and contained fluid, which influences the diffusion of proteins and their interactions. To gain insights into the spatial effects shaping intracellular regulation, we apply machine learning (ML) to explore a previously developed spatial model of the epidermal growth factor receptor (EGFR) signaling. The model describes the reactions between molecular species inside of cells following the transient activation of EGF receptors. To train our ML models, we conduct 10,000 numerical simulations in parallel where we calculate the cumulative activation of molecules and transcription factors under various conditions such as different diffusion speeds, inactivation rates, and cell structures. We take advantage of the low computational cost of ML algorithms to investigate the effects of cell and nucleus sizes, the diffusion speed of proteins, and the inactivation rate of the Ras molecules on the activation strength of transcription factors. Our results suggest that the predictions by both neural networks and random forests yielded minimal mean square error (MSEs), while linear generalized models displayed a significantly larger MSE. The exploration of the surrogate models has shown that smaller cell and nucleus radii as well, larger diffusion coefficients, and reduced inactivation rates increase the activation of transcription factors. These results are confirmed by numerical simulations. Our ML algorithms can be readily incorporated within multiscale models of tumor growth to embed the spatial effects regulating intracellular pathways, enabling the use of complex cell models within multiscale models while reducing the computational cost.
Collapse
Affiliation(s)
- Juan A Garcia
- Department of Mathematical Sciences, The University of Texas at El Paso, El Paso 79968, TX, USA
| | - Anass Bouchnita
- Department of Mathematical Sciences, The University of Texas at El Paso, El Paso 79968, TX, USA.
| |
Collapse
|
6
|
Ramirez Sierra MA, Sokolowski TR. AI-powered simulation-based inference of a genuinely spatial-stochastic gene regulation model of early mouse embryogenesis. PLoS Comput Biol 2024; 20:e1012473. [PMID: 39541410 PMCID: PMC11614244 DOI: 10.1371/journal.pcbi.1012473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2024] [Revised: 12/03/2024] [Accepted: 09/10/2024] [Indexed: 11/16/2024] Open
Abstract
Understanding how multicellular organisms reliably orchestrate cell-fate decisions is a central challenge in developmental biology, particularly in early mammalian development, where tissue-level differentiation arises from seemingly cell-autonomous mechanisms. In this study, we present a multi-scale, spatial-stochastic simulation framework for mouse embryogenesis, focusing on inner cell mass (ICM) differentiation into epiblast (EPI) and primitive endoderm (PRE) at the blastocyst stage. Our framework models key regulatory and tissue-scale interactions in a biophysically realistic fashion, capturing the inherent stochasticity of intracellular gene expression and intercellular signaling, while efficiently simulating these processes by advancing event-driven simulation techniques. Leveraging the power of Simulation-Based Inference (SBI) through the AI-driven Sequential Neural Posterior Estimation (SNPE) algorithm, we conduct a large-scale Bayesian inferential analysis to identify parameter sets that faithfully reproduce experimentally observed features of ICM specification. Our results reveal mechanistic insights into how the combined action of autocrine and paracrine FGF4 signaling coordinates stochastic gene expression at the cellular scale to achieve robust and reproducible ICM patterning at the tissue scale. We further demonstrate that the ICM exhibits a specific time window of sensitivity to exogenous FGF4, enabling lineage proportions to be adjusted based on timing and dosage, thereby extending current experimental findings and providing quantitative predictions for both mutant and wild-type ICM systems. Notably, FGF4 signaling not only ensures correct EPI-PRE lineage proportions but also enhances ICM resilience to perturbations, reducing fate-proportioning errors by 10-20% compared to a purely cell-autonomous system. Additionally, we uncover a surprising role for variability in intracellular initial conditions, showing that high gene-expression heterogeneity can improve both the accuracy and precision of cell-fate proportioning, which remains robust when fewer than 25% of the ICM population experiences perturbed initial conditions. Our work offers a comprehensive, spatial-stochastic description of the biochemical processes driving ICM differentiation and identifies the necessary conditions for its robust unfolding. It also provides a framework for future exploration of similar spatial-stochastic systems in developmental biology.
Collapse
Affiliation(s)
- Michael Alexander Ramirez Sierra
- Frankfurt Institute for Advanced Studies (FIAS), Frankfurt am Main, Germany
- Faculty of Computer Science and Mathematics, Goethe-Universität Frankfurt am Main, Frankfurt am Main, Germany
| | | |
Collapse
|
7
|
Choudhary R, Mahadevan R. DyMMM-LEAPS: An ML-based framework for modulating evenness and stability in synthetic microbial communities. Biophys J 2024; 123:2974-2995. [PMID: 38733081 PMCID: PMC11427784 DOI: 10.1016/j.bpj.2024.05.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2024] [Revised: 04/22/2024] [Accepted: 05/07/2024] [Indexed: 05/13/2024] Open
Abstract
There have been a growing number of computational strategies to aid in the design of synthetic microbial consortia. A framework to identify regions in parametric space to maximize two essential properties, evenness and stability, is critical. In this study, we introduce DyMMM-LEAPS (dynamic multispecies metabolic modeling-locating evenness and stability in large parametric space), an extension of the DyMMM framework. Our method explores the large parametric space of genetic circuits in synthetic microbial communities to identify regions of evenness and stability. Due to the high computational costs of exhaustive sampling, we utilize adaptive sampling and surrogate modeling to reduce the number of simulations required to map the vast space. Our framework predicts engineering targets and computes their operating ranges to maximize the probability of the engineered community to have high evenness and stability. We demonstrate our approach by simulating five cocultures and one three-strain culture with different social interactions (cooperation, competition, and predation) employing quorum-sensing-based genetic circuits. In addition to guiding circuit tuning, our pipeline gives an opportunity for a detailed analysis of pockets of evenness and stability for the circuit under investigation, which can further help dissect the relationship between the two properties. DyMMM-LEAPS is easily customizable and can be expanded to a larger community with more complex interactions.
Collapse
Affiliation(s)
- Ruhi Choudhary
- University of Toronto, Department of Chemical Engineering and Applied Chemistry, Toronto, ON, Canada
| | - Radhakrishnan Mahadevan
- University of Toronto, Department of Chemical Engineering and Applied Chemistry, Toronto, ON, Canada.
| |
Collapse
|
8
|
Capponi S, Wang S. AI in cellular engineering and reprogramming. Biophys J 2024; 123:2658-2670. [PMID: 38576162 PMCID: PMC11393708 DOI: 10.1016/j.bpj.2024.04.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 03/19/2024] [Accepted: 04/01/2024] [Indexed: 04/06/2024] Open
Abstract
During the last decade, artificial intelligence (AI) has increasingly been applied in biophysics and related fields, including cellular engineering and reprogramming, offering novel approaches to understand, manipulate, and control cellular function. The potential of AI lies in its ability to analyze complex datasets and generate predictive models. AI algorithms can process large amounts of data from single-cell genomics and multiomic technologies, allowing researchers to gain mechanistic insights into the control of cell identity and function. By integrating and interpreting these complex datasets, AI can help identify key molecular events and regulatory pathways involved in cellular reprogramming. This knowledge can inform the design of precision engineering strategies, such as the development of new transcription factor and signaling molecule cocktails, to manipulate cell identity and drive authentic cell fate across lineage boundaries. Furthermore, when used in combination with computational methods, AI can accelerate and improve the analysis and understanding of the intricate relationships between genes, proteins, and cellular processes. In this review article, we explore the current state of AI applications in biophysics with a specific focus on cellular engineering and reprogramming. Then, we showcase a couple of recent applications where we combined machine learning with experimental and computational techniques. Finally, we briefly discuss the challenges and prospects of AI in cellular engineering and reprogramming, emphasizing the potential of these technologies to revolutionize our ability to engineer cells for a variety of applications, from disease modeling and drug discovery to regenerative medicine and biomanufacturing.
Collapse
Affiliation(s)
- Sara Capponi
- IBM Almaden Research Center, San Jose, California; Center for Cellular Construction, San Francisco, California.
| | - Shangying Wang
- Bay Area Institute of Science, Altos Labs, Redwood City, California.
| |
Collapse
|
9
|
Hussain MA, Grill WM, Pelot NA. Highly efficient modeling and optimization of neural fiber responses to electrical stimulation. Nat Commun 2024; 15:7597. [PMID: 39217179 PMCID: PMC11365978 DOI: 10.1038/s41467-024-51709-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2024] [Accepted: 08/13/2024] [Indexed: 09/04/2024] Open
Abstract
Peripheral neuromodulation has emerged as a powerful modality for controlling physiological functions and treating a variety of medical conditions including chronic pain and organ dysfunction. The underlying complexity of the nonlinear responses to electrical stimulation make it challenging to design precise and effective neuromodulation protocols. Computational models have thus become indispensable in advancing our understanding and control of neural responses to electrical stimulation. However, existing approaches suffer from computational bottlenecks, rendering them unsuitable for real-time applications, large-scale parameter sweeps, or sophisticated optimization. In this work, we introduce an approach for massively parallel estimation and optimization of neural fiber responses to electrical stimulation using machine learning techniques. By leveraging advances in high-performance computing and parallel programming, we present a surrogate fiber model that generates spatiotemporal responses to a wide variety of cuff-based electrical peripheral nerve stimulation protocols. We used our surrogate fiber model to design stimulation parameters for selective stimulation of pig and human vagus nerves. Our approach yields a several-orders-of-magnitude improvement in computational efficiency while retaining generality and high predictive accuracy, demonstrating its robustness and potential to enhance the design and optimization of peripheral neuromodulation therapies.
Collapse
Affiliation(s)
- Minhaj A Hussain
- Department of Biomedical Engineering, Duke University, Durham, NC, 27708, USA
| | - Warren M Grill
- Department of Biomedical Engineering, Duke University, Durham, NC, 27708, USA
- Department of Electrical and Computer Engineering, Duke University, Durham, NC, 27708, USA
- Department of Neurobiology, Duke University, Durham, NC, 27708, USA
- Department of Neurosurgery, Duke University, Durham, NC, 27708, USA
| | - Nicole A Pelot
- Department of Biomedical Engineering, Duke University, Durham, NC, 27708, USA.
| |
Collapse
|
10
|
Karimi Alavijeh M, Lee YY, Gras SL. A perspective-driven and technical evaluation of machine learning in bioreactor scale-up: A case-study for potential model developments. Eng Life Sci 2024; 24:e2400023. [PMID: 38975020 PMCID: PMC11223373 DOI: 10.1002/elsc.202400023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Accepted: 03/01/2024] [Indexed: 07/09/2024] Open
Abstract
Bioreactor scale-up and scale-down have always been a topical issue for the biopharmaceutical industry and despite considerable effort, the identification of a fail-safe strategy for bioprocess development across scales remains a challenge. With the ubiquitous growth of digital transformation technologies, new scaling methods based on computer models may enable more effective scaling. This study aimed to evaluate the potential application of machine learning (ML) algorithms for bioreactor scale-up, with a specific focus on the prediction of scaling parameters. Factors critical to the development of such models were identified and data for bioreactor scale-up studies involving CHO cell-generated mAb products collated from the literature and public sources for the development of unsupervised and supervised ML models. Comparison of bioreactor performance across scales identified similarities between the different processes and primary differences between small- and large-scale bioreactors. A series of three case studies were developed to assess the relationship between cell growth and scale-sensitive bioreactor features. An embedding layer improved the capability of artificial neural network models to predict cell growth at a large-scale, as this approach captured similarities between the processes. Further models constructed to predict scaling parameters demonstrated how ML models may be applied to assist the scaling process. The development of data sets that include more characterization data with greater variability under different gassing and agitation regimes will also assist the future development of ML tools for bioreactor scaling.
Collapse
Affiliation(s)
- Masih Karimi Alavijeh
- Department of Chemical EngineeringThe University of MelbourneParkvilleVictoriaAustralia
- The Bio21 Molecular Science and Biotechnology InstituteThe University of MelbourneParkvilleVictoriaAustralia
| | | | - Sally L. Gras
- Department of Chemical EngineeringThe University of MelbourneParkvilleVictoriaAustralia
- The Bio21 Molecular Science and Biotechnology InstituteThe University of MelbourneParkvilleVictoriaAustralia
| |
Collapse
|
11
|
Mann J, Meshkin H, Zirkle J, Han X, Thrasher B, Chaturbedi A, Arabidarrehdor G, Li Z. Mechanism-based organization of neural networks to emulate systems biology and pharmacology models. Sci Rep 2024; 14:12082. [PMID: 38802422 PMCID: PMC11130269 DOI: 10.1038/s41598-024-59378-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 04/10/2024] [Indexed: 05/29/2024] Open
Abstract
Deep learning neural networks are often described as black boxes, as it is difficult to trace model outputs back to model inputs due to a lack of clarity over the internal mechanisms. This is even true for those neural networks designed to emulate mechanistic models, which simply learn a mapping between the inputs and outputs of mechanistic models, ignoring the underlying processes. Using a mechanistic model studying the pharmacological interaction between opioids and naloxone as a proof-of-concept example, we demonstrated that by reorganizing the neural networks' layers to mimic the structure of the mechanistic model, it is possible to achieve better training rates and prediction accuracy relative to the previously proposed black-box neural networks, while maintaining the interpretability of the mechanistic simulations. Our framework can be used to emulate mechanistic models in a large parameter space and offers an example on the utility of increasing the interpretability of deep learning networks.
Collapse
Affiliation(s)
- John Mann
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Hamed Meshkin
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Joel Zirkle
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Xiaomei Han
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Bradlee Thrasher
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Anik Chaturbedi
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Ghazal Arabidarrehdor
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA
| | - Zhihua Li
- Division of Applied Regulatory Science, Office of Clinical Pharmacology, Office of Translational Sciences, Center for Drug Evaluation and Research, U.S. Food and Drug Administration, WO Bldg 64 Rm 2084, 10903 New Hampshire Ave, Silver Spring, MD, 20993, USA.
| |
Collapse
|
12
|
Cain JY, Evarts JI, Yu JS, Bagheri N. Incorporating temporal information during feature engineering bolsters emulation of spatio-temporal emergence. Bioinformatics 2024; 40:btae131. [PMID: 38444088 PMCID: PMC10957516 DOI: 10.1093/bioinformatics/btae131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 02/08/2024] [Accepted: 03/01/2024] [Indexed: 03/07/2024] Open
Abstract
MOTIVATION Emergent biological dynamics derive from the evolution of lower-level spatial and temporal processes. A long-standing challenge for scientists and engineers is identifying simple low-level rules that give rise to complex higher-level dynamics. High-resolution biological data acquisition enables this identification and has evolved at a rapid pace for both experimental and computational approaches. Simultaneously harnessing the resolution and managing the expense of emerging technologies-e.g. live cell imaging, scRNAseq, agent-based models-requires a deeper understanding of how spatial and temporal axes impact biological systems. Effective emulation is a promising solution to manage the expense of increasingly complex high-resolution computational models. In this research, we focus on the emulation of a tumor microenvironment agent-based model to examine the relationship between spatial and temporal environment features, and emergent tumor properties. RESULTS Despite significant feature engineering, we find limited predictive capacity of tumor properties from initial system representations. However, incorporating temporal information derived from intermediate simulation states dramatically improves the predictive performance of machine learning models. We train a deep-learning emulator on intermediate simulation states and observe promising enhancements over emulators trained solely on initial conditions. Our results underscore the importance of incorporating temporal information in the evaluation of spatio-temporal emergent behavior. Nevertheless, the emulators exhibit inconsistent performance, suggesting that the underlying model characterizes unique cell populations dynamics that are not easily replaced. AVAILABILITY AND IMPLEMENTATION All source codes for the agent-based model, emulation, and analyses are publicly available at the corresponding DOIs: 10.5281/zenodo.10622155, 10.5281/zenodo.10611675, 10.5281/zenodo.10621244, respectively.
Collapse
Affiliation(s)
- Jason Y Cain
- Department of Chemical Engineering, University of Washington, Seattle, WA 98195, United States
| | - Jacob I Evarts
- Department of Biology, University of Washington, Seattle, WA 98195, United States
| | - Jessica S Yu
- Department of Biology, University of Washington, Seattle, WA 98195, United States
| | - Neda Bagheri
- Department of Chemical Engineering, University of Washington, Seattle, WA 98195, United States
- Department of Biology, University of Washington, Seattle, WA 98195, United States
| |
Collapse
|
13
|
Marković S, Salom I, Djordjevic M. Systems Biology Approaches to Understanding COVID-19 Spread in the Population. Methods Mol Biol 2024; 2745:233-253. [PMID: 38060190 DOI: 10.1007/978-1-0716-3577-3_15] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/08/2023]
Abstract
In essence, the COVID-19 pandemic can be regarded as a systems biology problem, with the entire world as the system, and the human population as the element transitioning from one state to another with certain transition rates. While capturing all the relevant features of such a complex system is hardly possible, compartmental epidemiological models can be used as an appropriate simplification to model the system's dynamics and infer its important characteristics, such as basic and effective reproductive numbers of the virus. These measures can later be used as response variables in feature selection methods to uncover the main factors contributing to disease transmissibility. We here demonstrate that a combination of dynamic modeling and machine learning approaches can represent a powerful tool in understanding the spread, not only of COVID-19, but of any infectious disease of epidemiological proportions.
Collapse
Affiliation(s)
- Sofija Marković
- Quantitative Biology Group, Faculty of Biology, University of Belgrade, Belgrade, Serbia
| | - Igor Salom
- Institute of Physics Belgrade, National Institute of the Republic of Serbia, Belgrade, Serbia
| | - Marko Djordjevic
- Quantitative Biology Group, Faculty of Biology, University of Belgrade, Belgrade, Serbia.
| |
Collapse
|
14
|
Khaireh-Walieh A, Langevin D, Bennet P, Teytaud O, Moreau A, Wiecha PR. A newcomer's guide to deep learning for inverse design in nano-photonics. NANOPHOTONICS (BERLIN, GERMANY) 2023; 12:4387-4414. [PMID: 39634708 PMCID: PMC11501815 DOI: 10.1515/nanoph-2023-0527] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 11/18/2023] [Indexed: 12/07/2024]
Abstract
Nanophotonic devices manipulate light at sub-wavelength scales, enabling tasks such as light concentration, routing, and filtering. Designing these devices to achieve precise light-matter interactions using structural parameters and materials is a challenging task. Traditionally, solving this problem has relied on computationally expensive, iterative methods. In recent years, deep learning techniques have emerged as promising tools for tackling the inverse design of nanophotonic devices. While several review articles have provided an overview of the progress in this rapidly evolving field, there is a need for a comprehensive tutorial that specifically targets newcomers without prior experience in deep learning. Our goal is to address this gap and provide practical guidance for applying deep learning to individual scientific problems. We introduce the fundamental concepts of deep learning and critically discuss the potential benefits it offers for various inverse design problems in nanophotonics. We present a suggested workflow and detailed, practical design guidelines to help newcomers navigate the challenges they may encounter. By following our guide, newcomers can avoid frustrating roadblocks commonly experienced when venturing into deep learning for the first time. In a second part, we explore different iterative and direct deep learning-based techniques for inverse design, and evaluate their respective advantages and limitations. To enhance understanding and facilitate implementation, we supplement the manuscript with detailed Python notebook examples, illustrating each step of the discussed processes. While our tutorial primarily focuses on researchers in (nano-)photonics, it is also relevant for those working with deep learning in other research domains. We aim at providing a solid starting point to empower researchers to leverage the potential of deep learning in their scientific pursuits.
Collapse
Affiliation(s)
| | - Denis Langevin
- Université Clermont Auvergne, Clermont Auvergne INP, CNRS, Institut Pascal, F-63000Clermont-Ferrand, France
| | - Pauline Bennet
- Université Clermont Auvergne, Clermont Auvergne INP, CNRS, Institut Pascal, F-63000Clermont-Ferrand, France
| | | | - Antoine Moreau
- Université Clermont Auvergne, Clermont Auvergne INP, CNRS, Institut Pascal, F-63000Clermont-Ferrand, France
| | | |
Collapse
|
15
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. Cell Syst 2023; 14:822-843.e22. [PMID: 37751736 PMCID: PMC10725240 DOI: 10.1016/j.cels.2023.08.004] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Revised: 08/16/2023] [Accepted: 08/25/2023] [Indexed: 09/28/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA 91125, USA
| | - John J Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA 02115, USA
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA; Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.
| |
Collapse
|
16
|
Klumpe HE, Lugagne JB, Khalil AS, Dunlop MJ. Deep Neural Networks for Predicting Single-Cell Responses and Probability Landscapes. ACS Synth Biol 2023; 12:2367-2381. [PMID: 37467372 PMCID: PMC11976981 DOI: 10.1021/acssynbio.3c00203] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Engineering biology relies on the accurate prediction of cell responses. However, making these predictions is challenging for a variety of reasons, including the stochasticity of biochemical reactions, variability between cells, and incomplete information about underlying biological processes. Machine learning methods, which can model diverse input-output relationships without requiring a priori mechanistic knowledge, are an ideal tool for this task. For example, such approaches can be used to predict gene expression dynamics given time-series data of past expression history. To explore this application, we computationally simulated single-cell responses, incorporating different sources of noise and alternative genetic circuit designs. We showed that deep neural networks trained on these simulated data were able to correctly infer the underlying dynamics of a cell response even in the presence of measurement noise and stochasticity in the biochemical reactions. The training set size and the amount of past data provided as inputs both affected prediction quality, with cascaded genetic circuits that introduce delays requiring more past data. We also tested prediction performance on a bistable auto-activation circuit, finding that our initial method for predicting a single trajectory was fundamentally ill-suited for multimodal dynamics. To address this, we updated the network architecture to predict the entire distribution of future states, showing it could accurately predict bimodal expression distributions. Overall, these methods can be readily applied to the diverse prediction tasks necessary to predict and control a variety of biological circuits, a key aspect of many synthetic biology applications.
Collapse
Affiliation(s)
- Heidi E. Klumpe
- Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Biological Design Center, Boston University, Boston, MA 02215, USA
| | - Jean-Baptiste Lugagne
- Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Biological Design Center, Boston University, Boston, MA 02215, USA
| | - Ahmad S. Khalil
- Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Biological Design Center, Boston University, Boston, MA 02215, USA
- Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
| | - Mary J. Dunlop
- Biomedical Engineering, Boston University, Boston, MA 02215, USA
- Biological Design Center, Boston University, Boston, MA 02215, USA
| |
Collapse
|
17
|
Karlsen ST, Rau MH, Sánchez BJ, Jensen K, Zeidan AA. From genotype to phenotype: computational approaches for inferring microbial traits relevant to the food industry. FEMS Microbiol Rev 2023; 47:fuad030. [PMID: 37286882 PMCID: PMC10337747 DOI: 10.1093/femsre/fuad030] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2023] [Revised: 05/31/2023] [Accepted: 06/06/2023] [Indexed: 06/09/2023] Open
Abstract
When selecting microbial strains for the production of fermented foods, various microbial phenotypes need to be taken into account to achieve target product characteristics, such as biosafety, flavor, texture, and health-promoting effects. Through continuous advances in sequencing technologies, microbial whole-genome sequences of increasing quality can now be obtained both cheaper and faster, which increases the relevance of genome-based characterization of microbial phenotypes. Prediction of microbial phenotypes from genome sequences makes it possible to quickly screen large strain collections in silico to identify candidates with desirable traits. Several microbial phenotypes relevant to the production of fermented foods can be predicted using knowledge-based approaches, leveraging our existing understanding of the genetic and molecular mechanisms underlying those phenotypes. In the absence of this knowledge, data-driven approaches can be applied to estimate genotype-phenotype relationships based on large experimental datasets. Here, we review computational methods that implement knowledge- and data-driven approaches for phenotype prediction, as well as methods that combine elements from both approaches. Furthermore, we provide examples of how these methods have been applied in industrial biotechnology, with special focus on the fermented food industry.
Collapse
Affiliation(s)
- Signe T Karlsen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Martin H Rau
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Benjamín J Sánchez
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Kristian Jensen
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| | - Ahmad A Zeidan
- Bioinformatics & Modeling, R&D Digital Innovation, Chr. Hansen A/S, Bøge Allé 10-12, 2970 Hørsholm, Denmark
| |
Collapse
|
18
|
Gorin G, Vastola JJ, Pachter L. Studying stochastic systems biology of the cell with single-cell genomics data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.17.541250. [PMID: 37292934 PMCID: PMC10245677 DOI: 10.1101/2023.05.17.541250] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Recent experimental developments in genome-wide RNA quantification hold considerable promise for systems biology. However, rigorously probing the biology of living cells requires a unified mathematical framework that accounts for single-molecule biological stochasticity in the context of technical variation associated with genomics assays. We review models for a variety of RNA transcription processes, as well as the encapsulation and library construction steps of microfluidics-based single-cell RNA sequencing, and present a framework to integrate these phenomena by the manipulation of generating functions. Finally, we use simulated scenarios and biological data to illustrate the implications and applications of the approach.
Collapse
Affiliation(s)
- Gennady Gorin
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA, 91125
| | - John J. Vastola
- Department of Neurobiology, Harvard Medical School, Boston, MA, 02115
| | - Lior Pachter
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, 91125
- Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, 91125
| |
Collapse
|
19
|
Kumar P, Leonardi N. A novel framework for the evaluation of coastal protection schemes through integration of numerical modelling and artificial intelligence into the Sand Engine App. Sci Rep 2023; 13:8610. [PMID: 37244960 DOI: 10.1038/s41598-023-35801-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 05/24/2023] [Indexed: 05/29/2023] Open
Abstract
There is growing interest in the adoption of Engineering with Nature or Nature Based Solutions for coastal protection including large mega-nourishment interventions. However, there are still many unknowns on the variables and design features influencing their functionalities. There are also challenges in the optimization of coastal modelling outputs or information usage in support of decision-making. In this study, more than five hundred numerical simulations with different sandengine designs and different locations along Morecambe Bay (UK) were conducted in Delft3D. Twelve Artificial Neural Networking ensemble models structures were trained on the simulated data to predict the influence of different sand engines on water depth, wave height and sediment transports with good performance. The ensemble models were then packed into a Sand Engine App developed in MATLAB and designed to calculate the impact of different sand engine features on the above variables based on users' inputs of sandengine designs.
Collapse
Affiliation(s)
- Pavitra Kumar
- Department of Geography and Planning, School of Environmental Sciences, University of Liverpool, Chatham Street, Liverpool, L69 7ZT, UK.
| | - Nicoletta Leonardi
- Department of Geography and Planning, School of Environmental Sciences, University of Liverpool, Chatham Street, Liverpool, L69 7ZT, UK
| |
Collapse
|
20
|
Daniels KG, Wang S, Simic MS, Bhargava HK, Capponi S, Tonai Y, Yu W, Bianco S, Lim WA. Decoding CAR T cell phenotype using combinatorial signaling motif libraries and machine learning. Science 2022; 378:1194-1200. [PMID: 36480602 PMCID: PMC10026561 DOI: 10.1126/science.abq0225] [Citation(s) in RCA: 60] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Chimeric antigen receptor (CAR) costimulatory domains derived from native immune receptors steer the phenotypic output of therapeutic T cells. We constructed a library of CARs containing ~2300 synthetic costimulatory domains, built from combinations of 13 signaling motifs. These CARs promoted diverse human T cell fates, which were sensitive to motif combinations and configurations. Neural networks trained to decode the combinatorial grammar of CAR signaling motifs allowed extraction of key design rules. For example, non-native combinations of motifs that bind tumor necrosis factor receptor-associated factors (TRAFs) and phospholipase C gamma 1 (PLCγ1) enhanced cytotoxicity and stemness associated with effective tumor killing. Thus, libraries built from minimal building blocks of signaling, combined with machine learning, can efficiently guide engineering of receptors with desired phenotypes.
Collapse
Affiliation(s)
- Kyle G Daniels
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Shangying Wang
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA 95120, USA
- Center for Cellular Construction, San Francisco, CA 94158, USA
| | - Milos S Simic
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Hersh K Bhargava
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Sara Capponi
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA 95120, USA
- Center for Cellular Construction, San Francisco, CA 94158, USA
| | - Yurie Tonai
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Wei Yu
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Simone Bianco
- Department of Functional Genomics and Cellular Engineering, IBM Almaden Research Center, San Jose, CA 95120, USA
- Center for Cellular Construction, San Francisco, CA 94158, USA
| | - Wendell A Lim
- Cell Design Institute, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Cellular and Molecular Pharmacology, University of California, San Francisco, San Francisco, CA 94158, USA
- Center for Cellular Construction, San Francisco, CA 94158, USA
| |
Collapse
|
21
|
Application of a Machine Learning Algorithm for Evaluation of Stiff Fractional Modeling of Polytropic Gas Spheres and Electric Circuits. Symmetry (Basel) 2022. [DOI: 10.3390/sym14122482] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022] Open
Abstract
Fractional polytropic gas sphere problems and electrical engineering models typically simulated with interconnected circuits have numerous applications in physical, astrophysical phenomena, and thermionic currents. Generally, most of these models are singular-nonlinear, symmetric, and include time delay, which has increased attention to them among researchers. In this work, we explored deep neural networks (DNNs) with an optimization algorithm to calculate the approximate solutions for nonlinear fractional differential equations (NFDEs). The target data-driven design of the DNN-LM algorithm was further implemented on the fractional models to study the rigorous impact and symmetry of different parameters on RL, RC circuits, and polytropic gas spheres. The targeted data generated from the analytical and numerical approaches in the literature for different cases were utilized by the deep neural networks to predict the numerical solutions by minimizing the differences in mean square error using the Levenberg–Marquardt algorithm. The numerical solutions obtained by the designed technique were contrasted with the multi-step reproducing kernel Hilbert space method (MS-RKM), Laplace transformation method (LTM), and Padé approximations. The results demonstrate the accuracy of the design technique as the DNN-LM algorithm overlaps with the actual results with minimum percentage absolute errors that lie between 10−8 and 10−12. The extensive graphical and statistical analysis of the designed technique showed that the DNN-LM algorithm is dependable and facilitates the examination of higher-order nonlinear complex problems due to the flexibility of the DNN architecture and the effectiveness of the optimization procedure.
Collapse
|
22
|
Toward predictive engineering of gene circuits. Trends Biotechnol 2022; 41:760-768. [PMID: 36435671 DOI: 10.1016/j.tibtech.2022.11.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2022] [Revised: 10/26/2022] [Accepted: 11/02/2022] [Indexed: 11/25/2022]
Abstract
Many synthetic biology applications rely on programming living cells using gene circuits - the assembly and wiring of genetic elements to control cellular behaviors. Extensive progress has been made in constructing gene circuits with diverse functions and applications. For many circuit functions, however, it remains challenging to ensure that the circuits operate in a predictable manner. Although the notion of predictability may appear intuitive, close inspection suggests that it is not always clear what constitutes predictability. We dissect this concept and how it can be confounded by the complexity of a circuit, the complexity of the context, and the interplay between the two. We discuss circuit engineering strategies, in both computation and experiment, that have been used to improve the predictability of gene circuits.
Collapse
|
23
|
Calibrating spatiotemporal models of microbial communities to microscopy data: A review. PLoS Comput Biol 2022; 18:e1010533. [PMID: 36227846 PMCID: PMC9560168 DOI: 10.1371/journal.pcbi.1010533] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Spatiotemporal models that account for heterogeneity within microbial communities rely on single-cell data for calibration and validation. Such data, commonly collected via microscopy and flow cytometry, have been made more accessible by recent advances in microfluidics platforms and data processing pipelines. However, validating models against such data poses significant challenges. Validation practices vary widely between modelling studies; systematic and rigorous methods have not been widely adopted. Similar challenges are faced by the (macrobial) ecology community, in which systematic calibration approaches are often employed to improve quantitative predictions from computational models. Here, we review single-cell observation techniques that are being applied to study microbial communities and the calibration strategies that are being employed for accompanying spatiotemporal models. To facilitate future calibration efforts, we have compiled a list of summary statistics relevant for quantifying spatiotemporal patterns in microbial communities. Finally, we highlight some recently developed techniques that hold promise for improved model calibration, including algorithmic guidance of summary statistic selection and machine learning approaches for efficient model simulation.
Collapse
|
24
|
Sukys A, Öcal K, Grima R. Approximating solutions of the Chemical Master equation using neural networks. iScience 2022; 25:105010. [PMID: 36117994 PMCID: PMC9474291 DOI: 10.1016/j.isci.2022.105010] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Revised: 06/13/2022] [Accepted: 08/18/2022] [Indexed: 10/27/2022] Open
Abstract
The Chemical Master Equation (CME) provides an accurate description of stochastic biochemical reaction networks in well-mixed conditions, but it cannot be solved analytically for most systems of practical interest. Although Monte Carlo methods provide a principled means to probe system dynamics, the large number of simulations typically required can render the estimation of molecule number distributions and other quantities infeasible. In this article, we aim to leverage the representational power of neural networks to approximate the solutions of the CME and propose a framework for the Neural Estimation of Stochastic Simulations for Inference and Exploration (Nessie). Our approach is based on training neural networks to learn the distributions predicted by the CME from relatively few stochastic simulations. We show on biologically relevant examples that simple neural networks with one hidden layer can capture highly complex distributions across parameter space, thereby accelerating computationally intensive tasks such as parameter exploration and inference.
Collapse
Affiliation(s)
- Augustinas Sukys
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
- The Alan Turing Institute, London NW1 2DB, UK
| | - Kaan Öcal
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
- School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
| | - Ramon Grima
- School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3JH, UK
| |
Collapse
|
25
|
Yu LY, Ren GP, Hou XJ, Wu KJ, He Y. Transition State Theory-Inspired Neural Network for Estimating the Viscosity of Deep Eutectic Solvents. ACS CENTRAL SCIENCE 2022; 8:983-995. [PMID: 35912349 PMCID: PMC9335917 DOI: 10.1021/acscentsci.2c00157] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Indexed: 06/15/2023]
Abstract
The lack of accurate methods for predicting the viscosity of solvent materials, especially those with complex interactions, remains unresolved. Deep eutectic solvents (DESs), an emerging class of green solvents, have a severe lack of viscosity data, resulting in their application still staying at the stage of random trial and error, and it is difficult for them to be implemented on an industrial scale. In this work, we demonstrate the successful prediction of the viscosity of DESs based on the transition state theory-inspired neural network (TSTiNet). The TSTiNet adopts multilayer perceptron (MLP) for the transition state theory-inspired equation (TSTiEq) parameters calculation and verification using the most comprehensive DESs viscosity data set to date. For the energy parameters of the TSTiEq, the constant assumption and the fast iteration with the help of MLP can allow TSTiNet to achieve the best performance (the average absolute relative deviation on the test set of 6.84% and R 2 of 0.9805). Compared with the traditional machine learning methods, the TSTiNet has better generalization ability and dramatically reduces the maximum relative deviation of prediction under the constraints of the thermodynamic formulation. It requires only the structural information on DESs and is the most accurate and reliable model available for DESs viscosity prediction.
Collapse
Affiliation(s)
- Liu-Ying Yu
- Zhejiang
Provincial Key Laboratory of Advanced Chemical Engineering Manufacture
Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, China
| | - Gao-Peng Ren
- Zhejiang
Provincial Key Laboratory of Advanced Chemical Engineering Manufacture
Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
| | - Xiao-Jing Hou
- Zhejiang
Provincial Key Laboratory of Advanced Chemical Engineering Manufacture
Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, China
| | - Ke-Jun Wu
- Zhejiang
Provincial Key Laboratory of Advanced Chemical Engineering Manufacture
Technology, College of Chemical and Biological Engineering, Zhejiang University, Hangzhou 310027, China
- Institute
of Zhejiang University-Quzhou, Quzhou 324000, China
- School
of Chemical and Process Engineering, University
of Leeds, Leeds LS2 9JT, U.K.
| | - Yuchen He
- State
Key Laboratory of Industrial Control Technology, College of Control
Science and Engineering, Zhejiang University, Hangzhou 310027, China
| |
Collapse
|
26
|
Liu C, Yu H, Zhang B, Liu S, Liu CG, Li F, Song H. Engineering whole-cell microbial biosensors: Design principles and applications in monitoring and treatment of heavy metals and organic pollutants. Biotechnol Adv 2022; 60:108019. [PMID: 35853551 DOI: 10.1016/j.biotechadv.2022.108019] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Revised: 07/13/2022] [Accepted: 07/13/2022] [Indexed: 01/18/2023]
Abstract
Biosensors have been widely used as cost-effective, rapid, in situ, and real-time analytical tools for monitoring environments. The development of synthetic biology has enabled emergence of genetically engineered whole-cell microbial biosensors. This review updates the design and optimization principles for a diverse array of whole-cell biosensors based on transcription factors (TF) including activators or repressors derived from heavy metal resistance systems, alkanes, and aromatics metabolic pathways of bacteria. By designing genetic circuits, the whole-cell biosensors could be engineered to intelligently sense heavy metals (Hg2+, Zn2+, Pb2+, Au3+, Cd2+, As3+, Ni2+, Cu2+, and UO22+) or organic compounds (alcohols, alkanes, phenols, and benzenes) through one-component or two-component system-based TFs, transduce signals through genetic amplifiers, and response as various outputs such as cell fluorescence and bioelectricity for monitoring heavy metals and organic pollutants in real conditions, synthetic curli and surface metal-binding peptides for in situ bio-sorption of heavy metals. We further review strategies that have been implemented to optimize the selectivity and correlation between ligand concentration and output signal of the TF-based biosensors, so as to meet requirements of practical applications. The optimization strategies include protein engineering to change specificities, promoter engineering to improve sensitivities, and genetic circuit-based amplification to enhance dynamic ranges via designing transcriptional amplifiers, logic gates, and feedback loops. At last, we outlook future trends in developing novel forms of biosensors.
Collapse
Affiliation(s)
- Changjiang Liu
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Key Laboratory of Systems Bioengineering, Tianjin University, Tianjin 300072, China; Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Huan Yu
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Key Laboratory of Systems Bioengineering, Tianjin University, Tianjin 300072, China; Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Baocai Zhang
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Key Laboratory of Systems Bioengineering, Tianjin University, Tianjin 300072, China; Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Shilin Liu
- Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Chen-Guang Liu
- State Key Laboratory of Microbial Metabolism, Joint International Research Laboratory of Metabolic & Developmental Sciences of Ministry of Education, School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China
| | - Feng Li
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Key Laboratory of Systems Bioengineering, Tianjin University, Tianjin 300072, China; Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.
| | - Hao Song
- Frontiers Science Center for Synthetic Biology (Ministry of Education), Key Laboratory of Systems Bioengineering, Tianjin University, Tianjin 300072, China; Collaborative Innovation Center of Chemical Science and Engineering, School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China.
| |
Collapse
|
27
|
Baranwal M, Clark RL, Thompson J, Sun Z, Hero AO, Venturelli OS. Recurrent neural networks enable design of multifunctional synthetic human gut microbiome dynamics. eLife 2022; 11:e73870. [PMID: 35736613 PMCID: PMC9225007 DOI: 10.7554/elife.73870] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2021] [Accepted: 05/22/2022] [Indexed: 12/26/2022] Open
Abstract
Predicting the dynamics and functions of microbiomes constructed from the bottom-up is a key challenge in exploiting them to our benefit. Current models based on ecological theory fail to capture complex community behaviors due to higher order interactions, do not scale well with increasing complexity and in considering multiple functions. We develop and apply a long short-term memory (LSTM) framework to advance our understanding of community assembly and health-relevant metabolite production using a synthetic human gut community. A mainstay of recurrent neural networks, the LSTM learns a high dimensional data-driven non-linear dynamical system model. We show that the LSTM model can outperform the widely used generalized Lotka-Volterra model based on ecological theory. We build methods to decipher microbe-microbe and microbe-metabolite interactions from an otherwise black-box model. These methods highlight that Actinobacteria, Firmicutes and Proteobacteria are significant drivers of metabolite production whereas Bacteroides shape community dynamics. We use the LSTM model to navigate a large multidimensional functional landscape to design communities with unique health-relevant metabolite profiles and temporal behaviors. In sum, the accuracy of the LSTM model can be exploited for experimental planning and to guide the design of synthetic microbiomes with target dynamic functions.
Collapse
Affiliation(s)
- Mayank Baranwal
- Department of Systems and Control Engineering, Indian Institute of TechnologyBombayIndia
- Division of Data & Decision Sciences, Tata Consultancy Services ResearchMumbaiIndia
| | - Ryan L Clark
- Department of Biochemistry, University of Wisconsin-MadisonMadisonUnited States
| | - Jaron Thompson
- Department of Chemical & Biological Engineering, University of Wisconsin-MadisonMadisonUnited States
| | - Zeyu Sun
- Department of Electrical Engineering & Computer Science, University of MichiganAnn ArborUnited States
| | - Alfred O Hero
- Department of Electrical Engineering & Computer Science, University of MichiganAnn ArborUnited States
- Department of Biomedical Engineering, University of MichiganAnn ArborUnited States
- Department of Statistics, University of MichiganAnn ArborUnited States
| | - Ophelia S Venturelli
- Department of Biochemistry, University of Wisconsin-MadisonMadisonUnited States
- Department of Chemical & Biological Engineering, University of Wisconsin-MadisonMadisonUnited States
- Department of Bacteriology, University of Wisconsin-MadisonMadisonUnited States
| |
Collapse
|
28
|
Egberts G, Schaaphok M, Vermolen F, Zuijlen PV. A Bayesian finite-element trained machine learning approach for predicting post-burn contraction. Neural Comput Appl 2022; 34:8635-8642. [PMID: 35125668 PMCID: PMC8801043 DOI: 10.1007/s00521-021-06772-3] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2021] [Accepted: 11/21/2021] [Indexed: 11/29/2022]
Abstract
Burn injuries can decrease the quality of life of a patient tremendously, because of esthetic reasons and because of contractions that result from them. In severe case, skin contraction takes place at such a large extent that joint mobility of a patient is significantly inhibited. In these cases, one refers to a contracture. In order to predict the evolution of post-wounding skin, several mathematical model frameworks have been set up. These frameworks are based on complicated systems of partial differential equations that need finite element-like discretizations for the approximation of the solution. Since these computational frameworks can be expensive in terms of computation time and resources, we study the applicability of neural networks to reproduce the finite element results. Our neural network is able to simulate the evolution of skin in terms of contraction for over one year. The simulations are based on 25 input parameters that are characteristic for the patient and the injury. One of such input parameters is the stiffness of the skin. The neural network results have yielded an average goodness of fit (\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{mathrsfs}
\usepackage{upgreek}
\setlength{\oddsidemargin}{-69pt}
\begin{document}$$R^2$$\end{document}R2) of 0.9928 (± 0.0013). Further, a tremendous speed-up of 19354X was obtained with the neural network. We illustrate the applicability by an online medical App that takes into account the age of the patient and the length of the burn.
Collapse
Affiliation(s)
- Ginger Egberts
- Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands
- Research Group Computational Mathematics(CMAT),Department of Mathematics and Statistics, University of Hasselt, Hasselt, Belgium
| | - Marianne Schaaphok
- Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands
| | - Fred Vermolen
- Research Group Computational Mathematics(CMAT),Department of Mathematics and Statistics, University of Hasselt, Hasselt, Belgium
| | - Paul van Zuijlen
- Burn Centre and Department of Plastic,Reconstructive & Hand Surgery, Red Cross Hospital, Beverwijk, Netherlands
- Department of Plastic, Reconstructive & Hand Surgery, Amsterdam UMC, location VUmc, Amsterdam Movement Sciences, Amsterdam, The Netherlands
- Pediatric Surgical Centre, Emma Children’s Hospital, Amsterdam UMC, location AMC and VUmc, Amsterdam, Netherlands
| |
Collapse
|
29
|
A Computational Platform Integrating a Mechanistic Model of Crohn's Disease for Predicting Temporal Progression of Mucosal Damage and Healing. Adv Ther 2022; 39:3225-3247. [PMID: 35581423 PMCID: PMC9239932 DOI: 10.1007/s12325-022-02144-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Accepted: 03/24/2022] [Indexed: 12/16/2022]
Abstract
INTRODUCTION Physicians are often required to make treatment decisions for patients with Crohn's disease on the basis of limited objective information about the state of the patient's gastrointestinal tissue while aiming to achieve mucosal healing. Tools to predict changes in mucosal health with treatment are needed. We evaluated a computational approach integrating a mechanistic model of Crohn's disease with a responder classifier to predict temporal changes in mucosal health. METHODS A hybrid mechanistic-statistical platform was developed to predict biomarker and tissue health time courses in patients with Crohn's disease. Eligible patients from the VERSIFY study (n = 69) were classified into archetypical response cohorts using a decision tree based on early treatment data and baseline characteristics. A virtual patient matching algorithm assigned a digital twin to each patient from their corresponding response cohort. The digital twin was used to forecast response to treatment using the mechanistic model. RESULTS The responder classifier predicted endoscopic remission and mucosal healing for treatment with vedolizumab over 26 weeks, with overall sensitivities of 80% and 75% and overall specificities of 69% and 70%, respectively. Predictions for changes in tissue damage over time in the validation set (n = 31), a measure of the overall performance of the platform, were considered good (at least 70% of data points matched), fair (at least 50%), and poor (less than 50%) for 71%, 23%, and 6% of patients, respectively. CONCLUSION Hybrid computational tools including mechanistic components represent a promising form of decision support that can predict outcomes and patient progress in Crohn's disease.
Collapse
|
30
|
Ye G, Balasubramanian V, Li JKJ, Kaya M. Machine Learning-Based Continuous Intracranial Pressure Prediction for Traumatic Injury Patients. IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE 2022; 10:4901008. [PMID: 35795876 PMCID: PMC9252333 DOI: 10.1109/jtehm.2022.3179874] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2021] [Revised: 04/06/2022] [Accepted: 05/24/2022] [Indexed: 11/18/2022]
Abstract
Structured Abstract—Objective: Abnormal elevation of intracranial pressure (ICP) can cause dangerous or even fatal outcomes. The early detection of high intracranial pressure events can be crucial in saving lives in an intensive care unit (ICU). Despite many applications of machine learning (ML) techniques related to clinical diagnosis, ML applications for continuous ICP detection or short-term predictions have been rarely reported. This study proposes an efficient method of applying an artificial recurrent neural network on the early prediction of ICP evaluation continuously for TBI patients. Methods: After ICP data preprocessing, the learning model is generated for thirteen patients to continuously predict the ICP signal occurrence and classify events for the upcoming 10 minutes by inputting the previous 20-minutes of the ICP signal. Results: As the overall model performance, the average accuracy is 94.62%, the average sensitivity is 74.91%, the average specificity is 94.83%, and the average root mean square error is approximately 2.18 mmHg. Conclusion: This research addresses a significant clinical problem with the management of traumatic brain injury patients. The machine learning model data enables early prediction of ICP continuously in a real-time fashion, which is crucial for appropriate clinical interventions. The results show that our machine learning-based model has high adaptive performance, accuracy, and efficiency.
Collapse
Affiliation(s)
- Guochang Ye
- Department of Biomedical and Chemical Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USA
| | - Vignesh Balasubramanian
- Department of Biomedical and Chemical Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USA
| | - John K-J. Li
- Department of Biomedical Engineering, Rutgers University, New Brunswick, NJ, USA
| | - Mehmet Kaya
- Department of Biomedical and Chemical Engineering and Sciences, Florida Institute of Technology, Melbourne, FL, USA
| |
Collapse
|
31
|
Larie D, An G, Cockrell RC. The Use of Artificial Neural Networks to Forecast the Behavior of Agent-Based Models of Pathophysiology: An Example Utilizing an Agent-Based Model of Sepsis. Front Physiol 2021; 12:716434. [PMID: 34721057 PMCID: PMC8552109 DOI: 10.3389/fphys.2021.716434] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 09/24/2021] [Indexed: 11/13/2022] Open
Abstract
Introduction: Disease states are being characterized at finer and finer levels of resolution via biomarker or gene expression profiles, while at the same time. Machine learning (ML) is increasingly used to analyze and potentially classify or predict the behavior of biological systems based on such characterization. As ML applications are extremely data-intensive, given the relative sparsity of biomedical data sets ML training of artificial neural networks (ANNs) often require the use of synthetic training data. Agent-based models (ABMs) that incorporate known biological mechanisms and their associated stochastic properties are a potential means of generating synthetic data. Herein we present an example of ML used to train an artificial neural network (ANN) as a surrogate system used to predict the time evolution of an ABM focusing on the clinical condition of sepsis. Methods: The disease trajectories for clinical sepsis, in terms of temporal cytokine and phenotypic dynamics, can be interpreted as a random dynamical system. The Innate Immune Response Agent-based Model (IIRABM) is a well-established model that utilizes known cellular and molecular rules to simulate disease trajectories corresponding to clinical sepsis. We have utilized two distinct neural network architectures, Long Short-Term Memory and Multi-Layer Perceptron, to take a time sequence of five measurements of eleven IIRABM simulated serum cytokine concentrations as input and to return both the future cytokine trajectories as well as an aggregate metric representing the patient's state of health. Results: The ANNs predicted model trajectories with the expected amount of error, due to stochasticity in the simulation, and recognizing that the mapping from a specific cytokine profile to a state-of-health is not unique. The Multi-Layer Perceptron neural network, generated predictions with a more accurate forecasted trajectory cone. Discussion: This work serves as a proof-of-concept for the use of ANNs to predict disease progression in sepsis as represented by an ABM. The findings demonstrate that multicellular systems with intrinsic stochasticity can be approximated with an ANN, but that forecasting a specific trajectory of the system requires sequential updating of the system state to provide a rolling forecast horizon.
Collapse
Affiliation(s)
| | | | - R. Chase Cockrell
- Department of Surgery, Larner College of Medicine, University of Vermont, Burlington, VT, United States
| |
Collapse
|
32
|
Capponi S, Wang S, Navarro EJ, Bianco S. AI-driven prediction of SARS-CoV-2 variant binding trends from atomistic simulations. THE EUROPEAN PHYSICAL JOURNAL. E, SOFT MATTER 2021; 44:123. [PMID: 34613523 PMCID: PMC8493367 DOI: 10.1140/epje/s10189-021-00119-5] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 08/24/2021] [Indexed: 05/02/2023]
Abstract
We present a novel technique to predict binding affinity trends between two molecules from atomistic molecular dynamics simulations. The technique uses a neural network algorithm applied to a series of images encoding the distance between two molecules in time. We demonstrate that our algorithm is capable of separating with high accuracy non-hydrophobic mutations with low binding affinity from those with high binding affinity. Moreover, we show high accuracy in prediction using a small subset of the simulation, therefore requiring a much shorter simulation time. We apply our algorithm to the binding between several variants of the SARS-CoV-2 spike protein and the human receptor ACE2.
Collapse
Affiliation(s)
- Sara Capponi
- IBM Almaden Research Center, 650 Harry Rd, San Jose, CA, 95120, USA
- Center for Cellular Construction, San Francisco, CA, 94158, USA
| | - Shangying Wang
- IBM Almaden Research Center, 650 Harry Rd, San Jose, CA, 95120, USA
- Center for Cellular Construction, San Francisco, CA, 94158, USA
| | - Erik J Navarro
- IBM Almaden Research Center, 650 Harry Rd, San Jose, CA, 95120, USA
- Center for Cellular Construction, San Francisco, CA, 94158, USA
- Graduate Program in Biophysics, University of California, San Francisco, San Francisco, CA, 94158, USA
| | - Simone Bianco
- IBM Almaden Research Center, 650 Harry Rd, San Jose, CA, 95120, USA.
- Center for Cellular Construction, San Francisco, CA, 94158, USA.
| |
Collapse
|
33
|
Burgos-Morales O, Gueye M, Lacombe L, Nowak C, Schmachtenberg R, Hörner M, Jerez-Longres C, Mohsenin H, Wagner H, Weber W. Synthetic biology as driver for the biologization of materials sciences. Mater Today Bio 2021; 11:100115. [PMID: 34195591 PMCID: PMC8237365 DOI: 10.1016/j.mtbio.2021.100115] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Revised: 05/16/2021] [Accepted: 05/18/2021] [Indexed: 01/16/2023] Open
Abstract
Materials in nature have fascinating properties that serve as a continuous source of inspiration for materials scientists. Accordingly, bio-mimetic and bio-inspired approaches have yielded remarkable structural and functional materials for a plethora of applications. Despite these advances, many properties of natural materials remain challenging or yet impossible to incorporate into synthetic materials. Natural materials are produced by living cells, which sense and process environmental cues and conditions by means of signaling and genetic programs, thereby controlling the biosynthesis, remodeling, functionalization, or degradation of the natural material. In this context, synthetic biology offers unique opportunities in materials sciences by providing direct access to the rational engineering of how a cell senses and processes environmental information and translates them into the properties and functions of materials. Here, we identify and review two main directions by which synthetic biology can be harnessed to provide new impulses for the biologization of the materials sciences: first, the engineering of cells to produce precursors for the subsequent synthesis of materials. This includes materials that are otherwise produced from petrochemical resources, but also materials where the bio-produced substances contribute unique properties and functions not existing in traditional materials. Second, engineered living materials that are formed or assembled by cells or in which cells contribute specific functions while remaining an integral part of the living composite material. We finally provide a perspective of future scientific directions of this promising area of research and discuss science policy that would be required to support research and development in this field.
Collapse
Affiliation(s)
- O. Burgos-Morales
- École Supérieure de Biotechnologie de Strasbourg - ESBS, University of Strasbourg, Illkirch, 67412, France
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
| | - M. Gueye
- École Supérieure de Biotechnologie de Strasbourg - ESBS, University of Strasbourg, Illkirch, 67412, France
| | - L. Lacombe
- École Supérieure de Biotechnologie de Strasbourg - ESBS, University of Strasbourg, Illkirch, 67412, France
| | - C. Nowak
- École Supérieure de Biotechnologie de Strasbourg - ESBS, University of Strasbourg, Illkirch, 67412, France
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
| | - R. Schmachtenberg
- École Supérieure de Biotechnologie de Strasbourg - ESBS, University of Strasbourg, Illkirch, 67412, France
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
| | - M. Hörner
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, 79104, Germany
| | - C. Jerez-Longres
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, 79104, Germany
- Spemann Graduate School of Biology and Medicine - SGBM, University of Freiburg, Freiburg, 79104, Germany
| | - H. Mohsenin
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, 79104, Germany
| | - H.J. Wagner
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, 79104, Germany
- Department of Biosystems Science and Engineering - D-BSSE, ETH Zurich, Basel, 4058, Switzerland
| | - W. Weber
- Faculty of Biology, University of Freiburg, Freiburg, 79104, Germany
- Signalling Research Centres BIOSS and CIBSS, University of Freiburg, Freiburg, 79104, Germany
- Spemann Graduate School of Biology and Medicine - SGBM, University of Freiburg, Freiburg, 79104, Germany
| |
Collapse
|
34
|
Luo N, Wang S, Lu J, Ouyang X, You L. Collective colony growth is optimized by branching pattern formation in Pseudomonas aeruginosa. Mol Syst Biol 2021; 17:e10089. [PMID: 33900031 PMCID: PMC8073002 DOI: 10.15252/msb.202010089] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 03/13/2021] [Accepted: 03/15/2021] [Indexed: 01/11/2023] Open
Abstract
Branching pattern formation is common in many microbes. Extensive studies have focused on addressing how such patterns emerge from local cell-cell and cell-environment interactions. However, little is known about whether and to what extent these patterns play a physiological role. Here, we consider the colonization of bacteria as an optimization problem to find the colony patterns that maximize colony growth efficiency under different environmental conditions. We demonstrate that Pseudomonas aeruginosa colonies develop branching patterns with characteristics comparable to the prediction of modeling; for example, colonies form thin branches in a nutrient-poor environment. Hence, the formation of branching patterns represents an optimal strategy for the growth of Pseudomonas aeruginosa colonies. The quantitative relationship between colony patterns and growth conditions enables us to develop a coarse-grained model to predict diverse colony patterns under more complex conditions, which we validated experimentally. Our results offer new insights into branching pattern formation as a problem-solving social behavior in microbes and enable fast and accurate predictions of complex spatial patterns in branching colonies.
Collapse
Affiliation(s)
- Nan Luo
- Department of Biomedical EngineeringDuke UniversityDurhamNCUSA
| | - Shangying Wang
- Department of Biomedical EngineeringDuke UniversityDurhamNCUSA
| | - Jia Lu
- Department of Biomedical EngineeringDuke UniversityDurhamNCUSA
| | | | - Lingchong You
- Department of Biomedical EngineeringDuke UniversityDurhamNCUSA
- Center for Genomic and Computational BiologyDuke UniversityDurhamNCUSA
- Department of Molecular Genetics and MicrobiologyDuke University School of MedicineDurhamNCUSA
| |
Collapse
|
35
|
Burzawa L, Li L, Wang X, Buganza-Tepole A, Umulis DM. Acceleration of PDE-Based Biological Simulation Through the Development of Neural Network Metamodels. CURRENT PATHOBIOLOGY REPORTS 2020; 8:121-131. [PMID: 33968495 PMCID: PMC8104327 DOI: 10.1007/s40139-020-00216-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Accepted: 10/12/2020] [Indexed: 10/23/2022]
Abstract
PURPOSE OF REVIEW Partial differential equation (PDE) mathematical models of biological systems and the simulation approaches used to solve them are widely used to test hypotheses and infer regulatory interactions based on optimization of the PDE model against the observed data. In this review, we discuss the ability of powerful machine learning methods to accelerate the parametric screening of biophysical informed- PDE systems. RECENT FINDINGS A major shortcoming in more broad adaptation of PDE-based models is the high computational complexity required to solve and optimize the models and it requires many simulations to traverse the very high-dimensional parameter spaces during model calibration and inference tasks. For instance, when scaling up to tens of millions of simulations for optimization and sensitivity analysis of the PDE models, compute times quickly extend from months to years for sufficient coverage to solve the problems. For many systems, this brute-force approach is simply not feasible. Recently, neural network metamodels have been shown to be an efficient way to accelerate PDE model calibration and here we look at the benefits and limitations in extending the PDE acceleration methods to improve optimization and sensitivity analysis. SUMMARY We use an example simulation to quantitatively and qualitatively show how neural network metamodels can be accurate and fast and demonstrate their potential for optimization of complex spatiotemporal problems in biology. We expect these approaches will be broadly applied to speed up scientific research and discovery in biology and other systems that can be described by complex PDE systems.
Collapse
Affiliation(s)
- Lukasz Burzawa
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907
- School of Electrical and Computer Engineering, Purdue University, West Lafayette, IN 47907
| | - Linlin Li
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907
| | - Xu Wang
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907
| | - Adrian Buganza-Tepole
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907
- School of Mechanical Engineering, Purdue University, West Lafayette, IN 47907
| | - David M Umulis
- Weldon School of Biomedical Engineering, Purdue University, West Lafayette, IN 47907
- Department of Ag. and Biological Engineering, Purdue University, West Lafayette, IN 47907
| |
Collapse
|
36
|
Groaz A, Moghimianavval H, Tavella F, Giessen TW, Vecchiarelli AG, Yang Q, Liu AP. Engineering spatiotemporal organization and dynamics in synthetic cells. WILEY INTERDISCIPLINARY REVIEWS-NANOMEDICINE AND NANOBIOTECHNOLOGY 2020; 13:e1685. [PMID: 33219745 DOI: 10.1002/wnan.1685] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 10/13/2020] [Accepted: 10/30/2020] [Indexed: 12/28/2022]
Abstract
Constructing synthetic cells has recently become an appealing area of research. Decades of research in biochemistry and cell biology have amassed detailed part lists of components involved in various cellular processes. Nevertheless, recreating any cellular process in vitro in cell-sized compartments remains ambitious and challenging. Two broad features or principles are key to the development of synthetic cells-compartmentalization and self-organization/spatiotemporal dynamics. In this review article, we discuss the current state of the art and research trends in the engineering of synthetic cell membranes, development of internal compartmentalization, reconstitution of self-organizing dynamics, and integration of activities across scales of space and time. We also identify some research areas that could play a major role in advancing the impact and utility of engineered synthetic cells. This article is categorized under: Biology-Inspired Nanomaterials > Lipid-Based Structures Biology-Inspired Nanomaterials > Protein and Virus-Based Structures.
Collapse
Affiliation(s)
| | | | | | | | | | - Qiong Yang
- University of Michigan, Ann Arbor, Michigan, USA
| | - Allen P Liu
- University of Michigan, Ann Arbor, Michigan, USA
| |
Collapse
|
37
|
Application of artificial intelligence to the in silico assessment of antimicrobial resistance and risks to human and animal health presented by priority enteric bacterial pathogens. ACTA ACUST UNITED AC 2020; 46:180-185. [PMID: 32673383 DOI: 10.14745/ccdr.v46i06a05] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Each year, approximately one in eight Canadians are affected by foodborne illness, either through outbreaks or sporadic illness, with animals being the major reservoir for the pathogens. Whole genome sequence analyses are now routinely implemented by public and animal health laboratories to define epidemiological disease clusters and to identify potential sources of infection. Similarly, a number of bioinformatics tools can be used to identify virulence and antimicrobial resistance (AMR) determinants in the genomes of pathogenic strains. Many important clinical and phenotypic characteristics of these pathogens can now be predicted using machine learning algorithms applied to whole genome sequence data. In this overview, we compare the ability of support vector machines, gradient-boosted decision trees and artificial neural networks to predict the levels of AMR within Salmonella enterica and extended-spectrum β-lactamase (ESBL) producing Escherichia coli. We show that minimum inhibitory concentrations (MIC) for each of 13 antimicrobials for S. enterica strains can be accurately determined, and that ESBL-producing E. coli strains can be accurately classified as susceptible, intermediate or resistant for each of seven antimicrobials. In addition to AMR and bacterial populations of greatest risk to human health, artificial intelligence algorithms hold promise as tools to predict other clinically and epidemiologically important phenotypes of enteric pathogens.
Collapse
|
38
|
Wiecha PR, Muskens OL. Deep Learning Meets Nanophotonics: A Generalized Accurate Predictor for Near Fields and Far Fields of Arbitrary 3D Nanostructures. NANO LETTERS 2020; 20:329-338. [PMID: 31825227 DOI: 10.1021/acs.nanolett.9b03971] [Citation(s) in RCA: 43] [Impact Index Per Article: 8.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Deep artificial neural networks are powerful tools with many possible applications in nanophotonics. Here, we demonstrate how a deep neural network can be used as a fast, general purpose predictor of the full near-field and far-field response of plasmonic and dielectric nanostructures. A trained neural network is shown to infer the internal fields of arbitrary three-dimensional nanostructures many orders of magnitude faster compared to conventional numerical simulations. Secondary physical quantities are derived from the deep learning predictions and faithfully reproduce a wide variety of physical effects without requiring specific training. We discuss the strengths and limitations of the neural network approach using a number of model studies of single particles and their near-field interactions. Our approach paves the way for fast, yet universal, methods for design and analysis of nanophotonic systems.
Collapse
Affiliation(s)
- Peter R Wiecha
- Physics and Astronomy, Faculty of Engineering and Physical Sciences , University of Southampton , SO 17 1BJ Southampton , United Kingdom
| | - Otto L Muskens
- Physics and Astronomy, Faculty of Engineering and Physical Sciences , University of Southampton , SO 17 1BJ Southampton , United Kingdom
| |
Collapse
|