1
|
Diamataris IG, Peristeras LD, Papavasileiou KD, Melissas VS, Boulougouris GC. Statistical Inference of Rate Constants in Chemical and Biochemical Reaction Networks Using an "Inverse" Event-Driven Kinetic Monte Carlo Method. J Phys Chem B 2023; 127:9132-9143. [PMID: 37823789 DOI: 10.1021/acs.jpcb.3c03649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
The use of rate models for networks of stochastic reactions is frequently used to comprehend the macroscopically observed dynamic properties of finite size reactive systems as well as their relationship to the underlying molecular events. Τhis particular approach usually stumbles on parameter derivation associated with stochastic kinetics, a quite demanding procedure. The present study incorporates a novel algorithm, which infers kinetic parameters from the system's time evolution, manifested as changes in molecular species populations. The proposed methodology reconstructs distributions required to infer kinetic parameters of a stochastic process pertaining to either a simulation or experimental data. The suggested approach accurately replicates rate constants of the stochastic reaction networks, which have evolved over time by event-driven Monte Carlo (MC) simulations using the Gillespie algorithm. Furthermore, our approach has been successfully used to estimate rate constants of association and dissociation events between molecular species developing during molecular dynamics (MD) simulations. We certainly believe that our method will be remarkably helpful for considering the macroscopic characteristic molecular roots related to stochastic physical and biological processes.
Collapse
Affiliation(s)
- Ioannis G Diamataris
- Laboratory of Computational Physical-Chemistry, Department of Molecular Biology and Genetics, University of Thrace, Alexandroupoulis GR-681 00, Greece
| | - Loukas D Peristeras
- Institute of Nanoscience and Nanotechnology, Molecular Thermodynamics and Modelling of Materials Laboratory, National Center for Scientific Research "Demokritos", Attikis, Agia Paraskevi GR-153 10, Greece
| | - Konstantinos D Papavasileiou
- Department of ChemoInformatics, NovaMechanics Ltd., Nicosia CY-1070, Cyprus
- Division of Data Driven Innovation, Entelos Institute, Larnaca CY-6059, Cyprus
- Department of ChemoInformatics, NovaMechanics MIKE., Piraeus GR-185 45, Greece
| | | | - Georgios C Boulougouris
- Laboratory of Computational Physical-Chemistry, Department of Molecular Biology and Genetics, University of Thrace, Alexandroupoulis GR-681 00, Greece
| |
Collapse
|
2
|
Wanduku D. The multilevel hierarchical data EM-algorithm. Applications to discrete-time Markov chain epidemic models. Heliyon 2022; 8:e12622. [PMID: 36643325 PMCID: PMC9834773 DOI: 10.1016/j.heliyon.2022.e12622] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2021] [Revised: 06/21/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
The theory of multilevel hierarchical data Expectation Maximization (EM)-algorithm is introduced via discrete time Markov chain (DTMC) epidemic models. A general model for a multilevel hierarchical discrete data is derived. The observed sample Y in the system is a stochastic incomplete data, and the missing data Z exhibits a multilevel hierarchical data structure. The EM-algorithm to find ML-estimates for parameters in the stochastic system is derived. Applications of the EM-algorithm are exhibited in the two DTMC models, to find ML-estimates of the system parameters. Numerical results are given for influenza epidemics in the state of Georgia (GA), USA.
Collapse
|
3
|
Chkrebtii OA, García YE, Capistrán MA, Noyola DE. Inference for stochastic kinetic models from multiple data sources for joint estimation of infection dynamics from aggregate reports and virological data. Ann Appl Stat 2022. [DOI: 10.1214/21-aoas1527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022]
Affiliation(s)
| | - Yury E. García
- Área de Matemáticas Básicas, Centro de Investigación en Matemáticas
| | | | - Daniel E. Noyola
- Department of Microbiology, Faculty of Medicine, Universidad Autónoma de San Luis Potosí
| |
Collapse
|
4
|
Ganesh M, Hawkins SC. A surrogate Bayesian framework for a SARS-CoV-2 data driven stochastic model. COMPUTATIONAL AND MATHEMATICAL BIOPHYSICS 2022. [DOI: 10.1515/cmb-2022-0131] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Abstract
Dynamic compartmentalized data (DCD) and compartmentalized differential equations (CDEs) are key instruments for modeling transmission of pathogens such as the SARS-CoV-2 virus. We describe an effi-cient nowcasting algorithm for modeling transmission of SARS-CoV-2 with uncertainty quantification for the COVID-19 impact. A key concern for transmission of SARS-CoV-2 is under-reporting of cases, and this is addressed in our data-driven model by providing an estimate for the detection rate. Our novel top-down model is based on CDEs with stochastic constitutive parameters obtained from the DCD using Bayesian inference. We demonstrate the robustness of our algorithm for simulation studies using synthetic DCD, and nowcasting COVID-19 using real DCD from several regions across five continents.
Collapse
Affiliation(s)
- M. Ganesh
- Department of Applied Mathematics and Statistics , Colorado School of Mines , Golden ,
| | - S. C. Hawkins
- School of Mathematical and Physical Sciences , Macquarie University , Sydney , , Australia
| |
Collapse
|
5
|
Matabuena M, Rodríguez-Mier P, García-Meixide C, Leborán V. COVID-19: Estimation of the transmission dynamics in Spain using a stochastic simulator and black-box optimization techniques. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE 2021; 211:106399. [PMID: 34607036 PMCID: PMC8418989 DOI: 10.1016/j.cmpb.2021.106399] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Accepted: 08/31/2021] [Indexed: 06/13/2023]
Abstract
BACKGROUND AND OBJECTIVES Epidemiological models of epidemic spread are an essential tool for optimizing decision-making. The current literature is very extensive and covers a wide variety of deterministic and stochastic models. However, with the increase in computing resources, new, more general, and flexible procedures based on simulation models can assess the effectiveness of measures and quantify the current state of the epidemic. This paper illustrates the potential of this approach to build a new dynamic probabilistic model to estimate the prevalence of SARS-CoV-2 infections in different compartments. METHODS We propose a new probabilistic model in which, for the first time in the epidemic literature, parameter learning is carried out using gradient-free stochastic black-box optimization techniques simulating multiple trajectories of the infection dynamics in a general way, solving an inverse problem that is defined employing the daily information from mortality records. RESULTS After the application of the new proposal in Spain in the first and successive waves, the result of the model confirms the accuracy to estimate the seroprevalence and allows us to know the real dynamics of the pandemic a posteriori to assess the impact of epidemiological measures by the Spanish government and to plan more efficiently the subsequent decisions with the prior knowledge obtained. CONCLUSIONS The model results allow us to estimate the daily patterns of COVID-19 infections in Spain retrospectively and examine the population's exposure to the virus dynamically in contrast to seroprevalence surveys. Furthermore, given the flexibility of our simulation framework, we can model situations -even using non-parametric distributions between the different compartments in the model- that other models in the existing literature cannot. Our general optimization strategy remains valid in these cases, and we can easily create other non-standard simulation epidemic models that incorporate more complex and dynamic structures.
Collapse
Affiliation(s)
- Marcos Matabuena
- CiTIUS (Centro Singular de Investigación en Tecnoloxías Intelixentes), Universidade de Santiago of Compostela, Santiago de Compostela, Spain.
| | - Pablo Rodríguez-Mier
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRAE, ENVT, INP-Purpan, UPS, Toulouse 31300, France
| | | | - Victor Leborán
- CiTIUS (Centro Singular de Investigación en Tecnoloxías Intelixentes), Universidade de Santiago of Compostela, Santiago de Compostela, Spain
| |
Collapse
|
6
|
Braunstein A, Ingrosso A, Muntoni AP. Network reconstruction from infection cascades. J R Soc Interface 2020; 16:20180844. [PMID: 30958195 DOI: 10.1098/rsif.2018.0844] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
Accessing the network through which a propagation dynamics diffuses is essential for understanding and controlling it. In a few cases, such information is available through direct experiments or thanks to the very nature of propagation data. In a majority of cases however, available information about the network is indirect and comes from partial observations of the dynamics, rendering the network reconstruction a fundamental inverse problem. Here we show that it is possible to reconstruct the whole structure of an interaction network and to simultaneously infer the complete time course of activation spreading, relying just on single epoch (i.e. snapshot) or time-scattered observations of a small number of activity cascades. The method that we present is built on a belief propagation approximation, that has shown impressive accuracy in a wide variety of relevant cases, and is able to infer interactions in the presence of incomplete time-series data by providing a detailed modelling of the posterior distribution of trajectories conditioned to the observations. Furthermore, we show by experiments that the information content of full cascades is relatively smaller than that of sparse observations or single snapshots.
Collapse
Affiliation(s)
- Alfredo Braunstein
- 1 DISAT, Politecnico di Torino , Corso Duca Degli Abruzzi 24, 10129 Torino , Italy.,2 Italian Institute for Genomic Medicine , Via Nizza 52, 10124 Torino , Italy.,3 Collegio Carlo Alberto , Piazza Arbarello 8, 10122 Torino , Italy.,4 INFN Sezione di Torino , Via P. Giuria 1, 10125 Torino , Italy
| | - Alessandro Ingrosso
- 5 Center for Theoretical Neuroscience, Columbia University , New York, NY , USA
| | - Anna Paola Muntoni
- 1 DISAT, Politecnico di Torino , Corso Duca Degli Abruzzi 24, 10129 Torino , Italy.,6 Laboratoire de Physique de l'Ecole normale supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université Paris-Diderot, Sorbonne Paris Cité , Paris , France.,7 Sorbonne Université, CNRS, Institut de Biologie Paris-Seine, Laboratory of Computational and Quantitative Biology , 75005 Paris , France
| |
Collapse
|
7
|
Menale M, Carbonaro B. The mathematical analysis towards the dependence on the initial data for a discrete thermostatted kinetic framework for biological systems composed of interacting entities. AIMS BIOPHYSICS 2020. [DOI: 10.3934/biophy.2020016] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
|
8
|
Panchal V, Linder DF. Reverse engineering gene networks using global-local shrinkage rules. Interface Focus 2019; 10:20190049. [PMID: 31897291 DOI: 10.1098/rsfs.2019.0049] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/13/2019] [Indexed: 12/26/2022] Open
Abstract
Inferring gene regulatory networks from high-throughput 'omics' data has proven to be a computationally demanding task of critical importance. Frequently, the classical methods break down owing to the curse of dimensionality, and popular strategies to overcome this are typically based on regularized versions of the classical methods. However, these approaches rely on loss functions that may not be robust and usually do not allow for the incorporation of prior information in a straightforward way. Fully Bayesian methods are equipped to handle both of these shortcomings quite naturally, and they offer the potential for improvements in network structure learning. We propose a Bayesian hierarchical model to reconstruct gene regulatory networks from time-series gene expression data, such as those common in perturbation experiments of biological systems. The proposed methodology uses global-local shrinkage priors for posterior selection of regulatory edges and relaxes the common normal likelihood assumption in order to allow for heavy-tailed data, which were shown in several of the cited references to severely impact network inference. We provide a sufficient condition for posterior propriety and derive an efficient Markov chain Monte Carlo via Gibbs sampling in the electronic supplementary material. We describe a novel way to detect multiple scales based on the corresponding posterior quantities. Finally, we demonstrate the performance of our approach in a simulation study and compare it with existing methods on real data from a T-cell activation study.
Collapse
Affiliation(s)
- Viral Panchal
- Department of Mathematics and Statistics, University of North Carolina Wilmington, Wilmington, NC 28403, USA
| | - Daniel F Linder
- Medical College of Georgia, Augusta University, Augusta, GA 30912, USA
| |
Collapse
|
9
|
KhudaBukhsh WR, Choi B, Kenah E, Rempała GA. Survival dynamical systems: individual-level survival analysis from population-level epidemic models. Interface Focus 2019; 10:20190048. [PMID: 31897290 PMCID: PMC6936005 DOI: 10.1098/rsfs.2019.0048] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 10/07/2019] [Indexed: 12/20/2022] Open
Abstract
In this paper, we show that solutions to ordinary differential equations describing the large-population limits of Markovian stochastic epidemic models can be interpreted as survival or cumulative hazard functions when analysing data on individuals sampled from the population. We refer to the individual-level survival and hazard functions derived from population-level equations as a survival dynamical system (SDS). To illustrate how population-level dynamics imply probability laws for individual-level infection and recovery times that can be used for statistical inference, we show numerical examples based on synthetic data. In these examples, we show that an SDS analysis compares favourably with a complete-data maximum-likelihood analysis. Finally, we use the SDS approach to analyse data from a 2009 influenza A(H1N1) outbreak at Washington State University.
Collapse
Affiliation(s)
- Wasiur R KhudaBukhsh
- Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, USA
| | - Boseung Choi
- Division of Economics and Statistics, Department of National Statistics, Korea University Sejong campus, Sejong Special Autonomous City, Republic of Korea
| | - Eben Kenah
- Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, OH, USA
| | - Grzegorz A Rempała
- Division of Biostatistics, College of Public Health and Mathematical Biosciences Institute, The Ohio State University, Columbus, OH, USA
| |
Collapse
|
10
|
Choi B, Cheng YY, Cinar S, Ott W, Bennett MR, Josić K, Kim JK. Bayesian inference of distributed time delay in transcriptional and translational regulation. Bioinformatics 2019; 36:586-593. [PMID: 31347688 PMCID: PMC7868000 DOI: 10.1093/bioinformatics/btz574] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2019] [Revised: 06/07/2019] [Accepted: 07/17/2019] [Indexed: 01/31/2023] Open
Abstract
MOTIVATION Advances in experimental and imaging techniques have allowed for unprecedented insights into the dynamical processes within individual cells. However, many facets of intracellular dynamics remain hidden, or can be measured only indirectly. This makes it challenging to reconstruct the regulatory networks that govern the biochemical processes underlying various cell functions. Current estimation techniques for inferring reaction rates frequently rely on marginalization over unobserved processes and states. Even in simple systems this approach can be computationally challenging, and can lead to large uncertainties and lack of robustness in parameter estimates. Therefore we will require alternative approaches to efficiently uncover the interactions in complex biochemical networks. RESULTS We propose a Bayesian inference framework based on replacing uninteresting or unobserved reactions with time delays. Although the resulting models are non-Markovian, recent results on stochastic systems with random delays allow us to rigorously obtain expressions for the likelihoods of model parameters. In turn, this allows us to extend MCMC methods to efficiently estimate reaction rates, and delay distribution parameters, from single-cell assays. We illustrate the advantages, and potential pitfalls, of the approach using a birth-death model with both synthetic and experimental data, and show that we can robustly infer model parameters using a relatively small number of measurements. We demonstrate how to do so even when only the relative molecule count within the cell is measured, as in the case of fluorescence microscopy. AVAILABILITY AND IMPLEMENTATION Accompanying code in R is available at https://github.com/cbskust/DDE_BD. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Boseung Choi
- Department of National Statistics, Korea University Sejong Campus, Sejong 30019, Korea
| | - Yu-Yu Cheng
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Selahattin Cinar
- Department of Mathematics, University of Houston, Houston, TX 77204, USA
| | - William Ott
- Department of Mathematics, University of Houston, Houston, TX 77204, USA
| | - Matthew R Bennett
- Department of Biosciences, Rice University, Houston, TX 77005, USA,Department of Bioengineering, Rice University, Houston, TX 77005, USA
| | | | | |
Collapse
|
11
|
Getz WM, Salter R, Muellerklein O, Yoon HS, Tallam K. Modeling epidemics: A primer and Numerus Model Builder implementation. Epidemics 2018; 25:9-19. [PMID: 30017895 DOI: 10.1016/j.epidem.2018.06.001] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2017] [Revised: 05/30/2018] [Accepted: 06/01/2018] [Indexed: 01/09/2023] Open
Abstract
Epidemiological models are dominated by compartmental models, of which SIR formulations are the most commonly used. These formulations can be continuous or discrete (in either the state-variable values or time), deterministic or stochastic, or spatially homogeneous or heterogeneous, the latter often embracing a network formulation. Here we review the continuous and discrete deterministic and discrete stochastic formulations of the SIR dynamical systems models, and we outline how they can be easily and rapidly constructed using Numerus Model Builder, a graphically-driven coding platform. We also demonstrate how to extend these models to a metapopulation setting using NMB network and mapping tools.
Collapse
Affiliation(s)
- Wayne M Getz
- Dept. ESPM, UC Berkeley, CA 94720-3114, USA; School of Mathematical Sciences, University of KwaZulu-Natal, Private Bag X54001, Durban 4000, South Africa; Numerus, 850 Iron Point Rd., Folsom, CA 95630, USA.
| | - Richard Salter
- Numerus, 850 Iron Point Rd., Folsom, CA 95630, USA; Computer Science Dept., Oberlin College, Oberlin, OH 44074, USA
| | | | | | | |
Collapse
|
12
|
Choi B, Rempala GA, Kim JK. Beyond the Michaelis-Menten equation: Accurate and efficient estimation of enzyme kinetic parameters. Sci Rep 2017; 7:17018. [PMID: 29208922 PMCID: PMC5717222 DOI: 10.1038/s41598-017-17072-z] [Citation(s) in RCA: 75] [Impact Index Per Article: 9.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2017] [Accepted: 11/22/2017] [Indexed: 11/09/2022] Open
Abstract
Examining enzyme kinetics is critical for understanding cellular systems and for using enzymes in industry. The Michaelis-Menten equation has been widely used for over a century to estimate the enzyme kinetic parameters from reaction progress curves of substrates, which is known as the progress curve assay. However, this canonical approach works in limited conditions, such as when there is a large excess of substrate over enzyme. Even when this condition is satisfied, the identifiability of parameters is not always guaranteed, and often not verifiable in practice. To overcome such limitations of the canonical approach for the progress curve assay, here we propose a Bayesian approach based on an equation derived with the total quasi-steady-state approximation. In contrast to the canonical approach, estimates obtained with this proposed approach exhibit little bias for any combination of enzyme and substrate concentrations. Importantly, unlike the canonical approach, an optimal experiment to identify parameters with certainty can be easily designed without any prior information. Indeed, with this proposed design, the kinetic parameters of diverse enzymes with disparate catalytic efficiencies, such as chymotrypsin, fumarase, and urease, can be accurately and precisely estimated from a minimal amount of timecourse data. A publicly accessible computational package performing such accurate and efficient Bayesian inference for enzyme kinetics is provided.
Collapse
Affiliation(s)
- Boseung Choi
- Korea University Sejong campus, Division of Economics and Statistics, Department of National Statistics, Sejong, 30019, Korea
| | - Grzegorz A Rempala
- The Ohio State University, Division of Biostatistics and Mathematical Biosciences Institute, Columbus, OH, 43210, USA
| | - Jae Kyoung Kim
- Korea Advanced Institute of Science and Technology, Department of Mathematical Sciences, Daejeon, 34141, Korea.
| |
Collapse
|
13
|
Burch MG, Jacobsen KA, Tien JH, Rempala GA. Network-based analysis of a small Ebola outbreak. MATHEMATICAL BIOSCIENCES AND ENGINEERING : MBE 2017; 14:67-77. [PMID: 27879120 DOI: 10.3934/mbe.2017005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/06/2023]
Abstract
We present a method for estimating epidemic parameters in network-based stochastic epidemic models when the total number of infections is assumed to be small. We illustrate the method by reanalyzing the data from the 2014 Democratic Republic of the Congo (DRC) Ebola outbreak described in Maganga et al. (2014).
Collapse
Affiliation(s)
- Mark G Burch
- College of Public Health, The Ohio State University, Columbus, OH 43210, United States.
| | | | | | | |
Collapse
|
14
|
Zimmer C, Yaesoubi R, Cohen T. A Likelihood Approach for Real-Time Calibration of Stochastic Compartmental Epidemic Models. PLoS Comput Biol 2017; 13:e1005257. [PMID: 28095403 PMCID: PMC5240920 DOI: 10.1371/journal.pcbi.1005257] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2016] [Accepted: 11/21/2016] [Indexed: 12/03/2022] Open
Abstract
Stochastic transmission dynamic models are especially useful for studying the early emergence of novel pathogens given the importance of chance events when the number of infectious individuals is small. However, methods for parameter estimation and prediction for these types of stochastic models remain limited. In this manuscript, we describe a calibration and prediction framework for stochastic compartmental transmission models of epidemics. The proposed method, Multiple Shooting for Stochastic systems (MSS), applies a linear noise approximation to describe the size of the fluctuations, and uses each new surveillance observation to update the belief about the true epidemic state. Using simulated outbreaks of a novel viral pathogen, we evaluate the accuracy of MSS for real-time parameter estimation and prediction during epidemics. We assume that weekly counts for the number of new diagnosed cases are available and serve as an imperfect proxy of incidence. We show that MSS produces accurate estimates of key epidemic parameters (i.e. mean duration of infectiousness, R0, and Reff) and can provide an accurate estimate of the unobserved number of infectious individuals during the course of an epidemic. MSS also allows for accurate prediction of the number and timing of future hospitalizations and the overall attack rate. We compare the performance of MSS to three state-of-the-art benchmark methods: 1) a likelihood approximation with an assumption of independent Poisson observations; 2) a particle filtering method; and 3) an ensemble Kalman filter method. We find that MSS significantly outperforms each of these three benchmark methods in the majority of epidemic scenarios tested. In summary, MSS is a promising method that may improve on current approaches for calibration and prediction using stochastic models of epidemics.
Collapse
Affiliation(s)
- Christoph Zimmer
- Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, Connecticut, United States of America
| | - Reza Yaesoubi
- Health Policy and Management, Yale School of Public Health, New Haven, Connecticut, United States of America
| | - Ted Cohen
- Epidemiology of Microbial Diseases, Yale School of Public Health, New Haven, Connecticut, United States of America
| |
Collapse
|
15
|
Schwartz EJ, Choi B, Rempala GA. Estimating epidemic parameters: Application to H1N1 pandemic data. Math Biosci 2015; 270:198-203. [DOI: 10.1016/j.mbs.2015.03.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 03/20/2015] [Accepted: 03/23/2015] [Indexed: 11/16/2022]
|
16
|
Bronstein L, Zechner C, Koeppl H. Bayesian inference of reaction kinetics from single-cell recordings across a heterogeneous cell population. Methods 2015; 85:22-35. [PMID: 25986935 DOI: 10.1016/j.ymeth.2015.05.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2015] [Revised: 04/19/2015] [Accepted: 05/10/2015] [Indexed: 11/30/2022] Open
Abstract
Single-cell experimental techniques provide informative data to help uncover dynamical processes inside a cell. Making full use of such data requires dedicated computational methods to estimate biophysical process parameters and states in a model-based manner. In particular, the treatment of heterogeneity or cell-to-cell variability deserves special attention. The present article provides an introduction to one particular class of algorithms which employ marginalization in order to take heterogeneity into account. An overview of alternative approaches is provided for comparison. We treat two frequently encountered scenarios in single-cell experiments, namely, single-cell trajectory data and single-cell distribution data.
Collapse
Affiliation(s)
- L Bronstein
- Department of Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany
| | - C Zechner
- Department of Biosystems Sciences and Engineering, ETH Zürich, Basel, Switzerland
| | - H Koeppl
- Department of Electrical Engineering and Information Technology, Technische Universität Darmstadt, Darmstadt, Germany.
| |
Collapse
|
17
|
Hey KL, Momiji H, Featherstone K, Davis JRE, White MRH, Rand DA, Finkenstädt B. A stochastic transcriptional switch model for single cell imaging data. Biostatistics 2015; 16:655-69. [PMID: 25819987 PMCID: PMC4570576 DOI: 10.1093/biostatistics/kxv010] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2014] [Accepted: 02/21/2015] [Indexed: 12/03/2022] Open
Abstract
Gene expression is made up of inherently stochastic processes within single cells and can be modeled through stochastic reaction networks (SRNs). In particular, SRNs capture the features of intrinsic variability arising from intracellular biochemical processes. We extend current models for gene expression to allow the transcriptional process within an SRN to follow a random step or switch function which may be estimated using reversible jump Markov chain Monte Carlo (MCMC). This stochastic switch model provides a generic framework to capture many different dynamic features observed in single cell gene expression. Inference for such SRNs is challenging due to the intractability of the transition densities. We derive a model-specific birth–death approximation and study its use for inference in comparison with the linear noise approximation where both approximations are considered within the unifying framework of state-space models. The methodology is applied to synthetic as well as experimental single cell imaging data measuring expression of the human prolactin gene in pituitary cells.
Collapse
Affiliation(s)
- Kirsty L Hey
- Department of Statistics, University of Warwick, Coventry CV4 7AL, UK
| | - Hiroshi Momiji
- Warwick Systems Biology, University of Warwick, Coventry CV4 7AL, UK
| | - Karen Featherstone
- Centre for Endocrinology and Diabetes, University of Manchester, Manchester M13 9PT, UK
| | - Julian R E Davis
- Centre for Endocrinology and Diabetes, University of Manchester, Manchester M13 9PT, UK
| | - Michael R H White
- Systems Biology Centre, University of Manchester, Manchester M13 9PL, UK
| | - David A Rand
- Warwick Systems Biology, University of Warwick, Coventry CV4 7AL, UK
| | | |
Collapse
|
18
|
Gupta A, Rawlings JB. Comparison of Parameter Estimation Methods in Stochastic Chemical Kinetic Models: Examples in Systems Biology. AIChE J 2014; 60:1253-1268. [PMID: 27429455 DOI: 10.1002/aic.14409] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Stochastic chemical kinetics has become a staple for mechanistically modeling various phenomena in systems biology. These models, even more so than their deterministic counterparts, pose a challenging problem in the estimation of kinetic parameters from experimental data. As a result of the inherent randomness involved in stochastic chemical kinetic models, the estimation methods tend to be statistical in nature. Three classes of estimation methods are implemented and compared in this paper. The first is the exact method, which uses the continuous-time Markov chain representation of stochastic chemical kinetics and is tractable only for a very restricted class of problems. The next class of methods is based on Markov chain Monte Carlo (MCMC) techniques. The third method, termed conditional density importance sampling (CDIS), is a new method introduced in this paper. The use of these methods is demonstrated on two examples taken from systems biology, one of which is a new model of single-cell viral infection. The applicability, strengths and weaknesses of the three classes of estimation methods are discussed. Using simulated data for the two examples, some guidelines are provided on experimental design to obtain more information from a limited number of measurements.
Collapse
Affiliation(s)
- Ankur Gupta
- Dept. of Chemical and Biological Engineering; University of Wisconsin-Madison; 1415 Engineering Drive Madison WI 53705
| | - James B. Rawlings
- Dept. of Chemical and Biological Engineering; University of Wisconsin-Madison; 1415 Engineering Drive Madison WI 53705
| |
Collapse
|
19
|
Least squares estimation in stochastic biochemical networks. Bull Math Biol 2012; 74:1938-55. [PMID: 22752340 DOI: 10.1007/s11538-012-9744-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2011] [Accepted: 06/11/2012] [Indexed: 02/04/2023]
Abstract
The paper presents results on the asymptotic properties of the least-squares estimates (LSEs) of the reaction constants in mass-action, stochastic, biochemical network models. LSEs are assumed to be based on the longitudinal data from partially observed trajectories of a stochastic dynamical system, modeled as a continuous-time, pure jump Markov process. Under certain regularity conditions on such a process, it is shown that the vector of LSEs is jointly consistent and asymptotically normal, with the asymptotic covariance structure given in terms of a system of ordinary differential equations (ODE). The derived asymptotic properties hold true as the biochemical network size (the total species number) increases, in which case the stochastic dynamical system converges to the deterministic mass-action ODE. An example is provided, based on synthetic as well as RT-PCR data from the retro-transcription network of the LINE1 gene.
Collapse
|