1
|
Li Z, Barahona M, Thomas P. Moment-based parameter inference with error guarantees for stochastic reaction networks. J Chem Phys 2025; 162:135105. [PMID: 40183299 DOI: 10.1063/5.0251744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/04/2024] [Accepted: 01/10/2025] [Indexed: 04/05/2025] Open
Abstract
Inferring parameters of biochemical kinetic models from single-cell data remains challenging because of the uncertainty arising from the intractability of the likelihood function of stochastic reaction networks. Such uncertainty falls beyond current error quantification measures, which focus on the effects of finite sample size and identifiability but lack theoretical guarantees when likelihood approximations are needed. Here, we propose a method for the inference of parameters of stochastic reaction networks that works for both steady-state and time-resolved data and is applicable to networks with non-linear and rational propensities. Our approach provides bounds on the parameters via convex optimization over sets constrained by moment equations and moment matrices by taking observations to form moment intervals, which are then used to constrain parameters through convex sets. The bounds on the parameters contain the true parameters under the condition that the moment intervals contain the true moments, thus providing uncertainty quantification and error guarantees. Our approach does not need to predict moments and distributions for given parameters (i.e., it avoids solving or simulating the forward problem) and hence circumvents intractable likelihood computations or computationally expensive simulations. We demonstrate its use for uncertainty quantification, data integration, and prediction of latent species statistics through synthetic data from common non-linear biochemical models including the Schlögl model and the toggle switch, a model of post-transcriptional regulation at steady state, and a birth-death model with time-dependent data.
Collapse
Affiliation(s)
- Zekai Li
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| | - Mauricio Barahona
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| | - Philipp Thomas
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
2
|
Sinzger-D’Angelo M, Startceva S, Koeppl H. Bye bye, linearity, bye: quantification of the mean for linear CRNs in a random environment. J Math Biol 2023; 87:43. [PMID: 37573263 PMCID: PMC10423146 DOI: 10.1007/s00285-023-01973-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2022] [Revised: 07/04/2023] [Accepted: 07/22/2023] [Indexed: 08/14/2023]
Abstract
Molecular reactions within a cell are inherently stochastic, and cells often differ in morphological properties or interact with a heterogeneous environment. Consequently, cell populations exhibit heterogeneity both due to these intrinsic and extrinsic causes. Although state-of-the-art studies that focus on dissecting this heterogeneity use single-cell measurements, the bulk data that shows only the mean expression levels is still in routine use. The fingerprint of the heterogeneity is present also in bulk data, despite being hidden from direct measurement. In particular, this heterogeneity can affect the mean expression levels via bimolecular interactions with low-abundant environment species. We make this statement rigorous for the class of linear reaction systems that are embedded in a discrete state Markov environment. The analytic expression that we provide for the stationary mean depends on the reaction rate constants of the linear subsystem, as well as the generator and stationary distribution of the Markov environment. We demonstrate the effect of the environment on the stationary mean. Namely, we show how the heterogeneous case deviates from the quasi-steady state (Q.SS) case when the embedded system is fast compared to the environment.
Collapse
Affiliation(s)
- Mark Sinzger-D’Angelo
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| | - Sofia Startceva
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| | - Heinz Koeppl
- Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
- Centre for Synthetic Biology, Technische Universität Darmstadt, Darmstadt, Germany
| |
Collapse
|
3
|
Wagner V, Radde N. The impossible challenge of estimating non-existent moments of the Chemical Master Equation. Bioinformatics 2023; 39:i440-i447. [PMID: 37387158 PMCID: PMC10311328 DOI: 10.1093/bioinformatics/btad205] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open
Abstract
MOTIVATION The Chemical Master Equation (CME) is a set of linear differential equations that describes the evolution of the probability distribution on all possible configurations of a (bio-)chemical reaction system. Since the number of configurations and therefore the dimension of the CME rapidly increases with the number of molecules, its applicability is restricted to small systems. A widely applied remedy for this challenge is moment-based approaches which consider the evolution of the first few moments of the distribution as summary statistics for the complete distribution. Here, we investigate the performance of two moment-estimation methods for reaction systems whose equilibrium distributions encounter fat-tailedness and do not possess statistical moments. RESULTS We show that estimation via stochastic simulation algorithm (SSA) trajectories lose consistency over time and estimated moment values span a wide range of values even for large sample sizes. In comparison, the method of moments returns smooth moment estimates but is not able to indicate the non-existence of the allegedly predicted moments. We furthermore analyze the negative effect of a CME solution's fat-tailedness on SSA run times and explain inherent difficulties. While moment-estimation techniques are a commonly applied tool in the simulation of (bio-)chemical reaction networks, we conclude that they should be used with care, as neither the system definition nor the moment-estimation techniques themselves reliably indicate the potential fat-tailedness of the CME's solution.
Collapse
Affiliation(s)
- Vincent Wagner
- Institute for Systems Theory and Automatic Control, University of Stuttgart, Stuttgart 70569, Germany
| | - Nicole Radde
- Institute for Systems Theory and Automatic Control, University of Stuttgart, Stuttgart 70569, Germany
- Stuttgart Center for Simulation Science, University of Stuttgart, Stuttgart 70569, Germany
| |
Collapse
|
4
|
Lunz D, Bonnans JF, Ruess J. Revisiting moment-closure methods with heterogeneous multiscale population models. Math Biosci 2022; 350:108866. [PMID: 35753520 DOI: 10.1016/j.mbs.2022.108866] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 04/10/2022] [Accepted: 06/08/2022] [Indexed: 11/29/2022]
Abstract
Stochastic chemical kinetics at the single-cell level give rise to heterogeneous populations of cells even when all individuals are genetically identical. This heterogeneity can lead to nonuniform behaviour within populations, including different growth characteristics, cell-fate dynamics, and response to stimuli. Ultimately, these diverse behaviours lead to intricate population dynamics that are inherently multiscale: the population composition evolves based on population-level processes that interact with stochastically distributed single-cell states. Therefore, descriptions that account for this heterogeneity are essential to accurately model and control chemical processes. However, for real-world systems such models are computationally expensive to simulate, which can make optimisation problems, such as optimal control or parameter inference, prohibitively challenging. Here, we consider a class of multiscale population models that incorporate population-level mechanisms while remaining faithful to the underlying stochasticity at the single-cell level and the interplay between these two scales. To address the complexity, we study an order-reduction approximations based on the distribution moments. Since previous moment-closure work has focused on the single-cell kinetics, extending these techniques to populations models prompts us to revisit old observations as well as tackle new challenges. In this extended multiscale context, we encounter the previously established observation that the simplest closure techniques can lead to non-physical system trajectories. Despite their poor performance in some systems, we provide an example where these simple closures outperform more sophisticated closure methods in accurately, efficiently, and robustly solving the problem of optimal control of bioproduction in a microbial consortium model.
Collapse
Affiliation(s)
- Davin Lunz
- Inria Paris, 2 rue Simone Iff, 75012 Paris, France; Institut Pasteur, 28 rue du Docteur Roux, 75015 Paris, France.
| | - J Frédéric Bonnans
- Université Paris-Saclay, CNRS, CentraleSupélec, Inria, Laboratory of signals and systems, 91190, Gif-sur-Yvette, France
| | - Jakob Ruess
- Inria Paris, 2 rue Simone Iff, 75012 Paris, France; Institut Pasteur, 28 rue du Docteur Roux, 75015 Paris, France
| |
Collapse
|
5
|
Thomas P, Shahrezaei V. Coordination of gene expression noise with cell size: analytical results for agent-based models of growing cell populations. J R Soc Interface 2021; 18:20210274. [PMID: 34034535 DOI: 10.1098/rsif.2021.0274] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022] Open
Abstract
The chemical master equation and the Gillespie algorithm are widely used to model the reaction kinetics inside living cells. It is thereby assumed that cell growth and division can be modelled through effective dilution reactions and extrinsic noise sources. We here re-examine these paradigms through developing an analytical agent-based framework of growing and dividing cells accompanied by an exact simulation algorithm, which allows us to quantify the dynamics of virtually any intracellular reaction network affected by stochastic cell size control and division noise. We find that the solution of the chemical master equation-including static extrinsic noise-exactly agrees with the agent-based formulation when the network under study exhibits stochastic concentration homeostasis, a novel condition that generalizes concentration homeostasis in deterministic systems to higher order moments and distributions. We illustrate stochastic concentration homeostasis for a range of common gene expression networks. When this condition is not met, we demonstrate by extending the linear noise approximation to agent-based models that the dependence of gene expression noise on cell size can qualitatively deviate from the chemical master equation. Surprisingly, the total noise of the agent-based approach can still be well approximated by extrinsic noise models.
Collapse
Affiliation(s)
- Philipp Thomas
- Department of Mathematics, Imperial College London, London, UK
| | | |
Collapse
|
6
|
Thomas P. Stochastic Modeling Approaches for Single-Cell Analyses. SYSTEMS MEDICINE 2021. [DOI: 10.1016/b978-0-12-801238-3.11539-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022] Open
|
7
|
Kim J, Dark J, Enciso G, Sindi S. Slack reactants: A state-space truncation framework to estimate quantitative behavior of the chemical master equation. J Chem Phys 2020; 153:054117. [PMID: 32770905 PMCID: PMC8729884 DOI: 10.1063/5.0013457] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
State space truncation methods are widely used to approximate solutions of the chemical
master equation. While most methods of this kind focus on truncating the state space
directly, in this work, we propose modifying the underlying chemical reaction network by
introducing slack reactants that indirectly truncate the state space.
More specifically, slack reactants introduce an expanded chemical reaction network and
impose a truncation scheme based on desired mass conservation laws. This network structure
also allows us to prove inheritance of special properties of the original model, such as
irreducibility and complex balancing. We use the network structure imposed by slack
reactants to prove the convergence of the stationary distribution and first arrival times.
We then provide examples comparing our method with the stationary finite state projection
and finite buffer methods. Our slack reactant system appears to be more robust than some
competing methods with respect to calculating first arrival times.
Collapse
Affiliation(s)
- Jinsu Kim
- Department of Mathematics, University of California, Irvine, California 92697, USA
| | - Jason Dark
- Department of Mathematics, University of California, Irvine, California 92697, USA
- Department of Applied Mathematics, University of California, Merced, California 95343, USA
| | - German Enciso
- Department of Mathematics, University of California, Irvine, California 92697, USA
- Department of Developmental and Cell Biology, University of California, Irvine, California 92617, USA
| | - Suzanne Sindi
- Department of Applied Mathematics, University of California, Merced, California 95343, USA
| |
Collapse
|