1
|
Prokop B, Gelens L. From biological data to oscillator models using SINDy. iScience 2024; 27:109316. [PMID: 38523784 PMCID: PMC10959654 DOI: 10.1016/j.isci.2024.109316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 01/18/2024] [Accepted: 02/18/2024] [Indexed: 03/26/2024] Open
Abstract
Periodic changes in the concentration or activity of different molecules regulate vital cellular processes such as cell division and circadian rhythms. Developing mathematical models is essential to better understand the mechanisms underlying these oscillations. Recent data-driven methods like SINDy have fundamentally changed model identification, yet their application to experimental biological data remains limited. This study investigates SINDy's constraints by directly applying it to biological oscillatory data. We identify insufficient resolution, noise, dimensionality, and limited prior knowledge as primary limitations. Using various generic oscillator models of different complexity and/or dimensionality, we systematically analyze these factors. We then propose a comprehensive guide for inferring models from biological data, addressing these challenges step by step. Our approach is validated using glycolytic oscillation data from yeast.
Collapse
Affiliation(s)
- Bartosz Prokop
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
| | - Lendert Gelens
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Herestraat 49, 3000 Leuven, Belgium
| |
Collapse
|
2
|
Liu A, Axås J, Haller G. Data-driven modeling and forecasting of chaotic dynamics on inertial manifolds constructed as spectral submanifolds. CHAOS (WOODBURY, N.Y.) 2024; 34:033140. [PMID: 38531092 DOI: 10.1063/5.0179741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/04/2023] [Accepted: 03/06/2024] [Indexed: 03/28/2024]
Abstract
We present a data-driven and interpretable approach for reducing the dimensionality of chaotic systems using spectral submanifolds (SSMs). Emanating from fixed points or periodic orbits, these SSMs are low-dimensional inertial manifolds containing the chaotic attractor of the underlying high-dimensional system. The reduced dynamics on the SSMs turn out to predict chaotic dynamics accurately over a few Lyapunov times and also reproduce long-term statistical features, such as the largest Lyapunov exponents and probability distributions, of the chaotic attractor. We illustrate this methodology on numerical data sets including delay-embedded Lorenz and Rössler attractors, a nine-dimensional Lorenz model, a periodically forced Duffing oscillator chain, and the Kuramoto-Sivashinsky equation. We also demonstrate the predictive power of our approach by constructing an SSM-reduced model from unforced trajectories of a buckling beam and then predicting its periodically forced chaotic response without using data from the forced beam.
Collapse
Affiliation(s)
- Aihui Liu
- Institute for Mechanical Systems, ETH Zürich, Leonhardstrasse 21, 8092 Zürich, Switzerland
| | - Joar Axås
- Institute for Mechanical Systems, ETH Zürich, Leonhardstrasse 21, 8092 Zürich, Switzerland
| | - George Haller
- Institute for Mechanical Systems, ETH Zürich, Leonhardstrasse 21, 8092 Zürich, Switzerland
| |
Collapse
|
3
|
Wand T, Wiedemann T, Harren J, Kamps O. Estimating stable fixed points and Langevin potentials for financial dynamics. Phys Rev E 2024; 109:024226. [PMID: 38491616 DOI: 10.1103/physreve.109.024226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2023] [Accepted: 12/21/2023] [Indexed: 03/18/2024]
Abstract
The geometric Brownian motion (GBM) is a standard model in quantitative finance, but the potential function of its stochastic differential equation (SDE) cannot include stable nonzero prices. This article generalizes the GBM to an SDE with polynomial drift of order q and shows via model selection that q=2 is most frequently the optimal model to describe the data. Moreover, Markov chain Monte Carlo ensembles of the accompanying potential functions show a clear and pronounced potential well, indicating the existence of a stable price.
Collapse
Affiliation(s)
- Tobias Wand
- Institut für Theoretische Physik, Universität Münster, Wilhelm-Klemm-Str. 9, 48149 Münster, Germany and Center for Nonlinear Science, Universität Münster, Corrensstr. 2, 48149 Münster, Germany
| | - Timo Wiedemann
- Finance Center Münster, Universität Münster, Universitätsstr. 14-16, 48143 Münster, Germany
| | - Jan Harren
- Finance Center Münster, Universität Münster, Universitätsstr. 14-16, 48143 Münster, Germany
| | - Oliver Kamps
- Center for Nonlinear Science Münster, Universität Münster, Corrensstr. 2, 48149 Münster, Germany
| |
Collapse
|
4
|
Prabhu S, Rangarajan S, Kothare M. Data-driven discovery of sparse dynamical model of cardiovascular system for model predictive control. Comput Biol Med 2023; 166:107513. [PMID: 37839218 PMCID: PMC10982123 DOI: 10.1016/j.compbiomed.2023.107513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2022] [Revised: 08/11/2023] [Accepted: 09/19/2023] [Indexed: 10/17/2023]
Abstract
Cardiovascular diseases remain the leading cause of death globally. In recent years, vagal nerve stimulation (VNS) has shown promising results in the treatment of a number of cardiovascular diseases. In this approach, mild electrical pulses are sent to the brain via the vagus nerve. This open-loop neurostimulation, however, leads to various side effects due to physiological and inter-patient variability and therefore a closed-loop delivery strategy of electrical pulses that accounts for this variability is desired. In this context, we envision data-driven sparse dynamical model parameterized by patient-specific data as appropriate for use in closed loop controller design. In this work, we build a dynamical model for mean arterial pressure and heart rate using the method sparse identification of nonlinear dynamics (SINDy). As a proxy for real datasets or measurements from a patient, we simulate a mechanistic model from the literature and then discover a data-driven model for predicting mean arterial pressure and heart rate in response to neural stimulus. This discovered model is then used to design a controller to be implemented in closed-loop via model predictive control. We observe that this data-driven model is interpretable, consistent with experiments, provides insights on the sensitivity of different stimulation locations and simplifies the formulation of the optimal control problem. Noting the set-point tracking performance of this closed-loop model-based controller that uses this discovered model, we conclude that the model is adequate in capturing the dynamics of a highly nonlinear cardiovascular system for the purpose of optimal predictive controller design.
Collapse
Affiliation(s)
- Siddharth Prabhu
- Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, PA, USA.
| | - Srinivas Rangarajan
- Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, PA, USA.
| | - Mayuresh Kothare
- Chemical and Biomolecular Engineering, Lehigh University, Bethlehem, PA, USA.
| |
Collapse
|
5
|
Inoue Y, Koga Y, Takeshige R, Murakami H, Takamura T. Statistical Post-Processing Method for Evaluating Bioaccumulation in Fish Due to Dietary Exposure in Japan. ARCHIVES OF ENVIRONMENTAL CONTAMINATION AND TOXICOLOGY 2023; 85:404-416. [PMID: 37898995 DOI: 10.1007/s00244-023-01035-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Accepted: 09/19/2023] [Indexed: 10/31/2023]
Abstract
In 2018, the dietary exposure bioaccumulation fish test of the Organization for Economic Co-operation and Development Test Guideline No. 305 was introduced into Japan's Chemical Substances Control Law. The Japanese government has adopted a single definitive testing criterion for the absence of high bioaccumulation: the growth-corrected kinetic dietary magnification factor (BMFKg) must be less than 0.007. The aim of this study was to decrease regulatory restrictions in order to increase newly developed chemical substances and their subsequent approval of their manufacture and import, i.e., the present study was motivated by concerns over the criterion being too restrictive, rather than scientific concerns, such as uncertainty in criterion. We used statistical post-processing to assess the possibility of expanding the criteria for not being highly bioaccumulative. Based on our results, we proposed the criterion that the test substance should be considered not highly bioaccumulative if the following two conditions are met: (1) The ratio of the maximum to the minimum measured 5% lipid-standardized biomagnification factor at the end of the uptake phase (BMF5%, n = 5) for the test substance and reference substance should be less than 3.0, and (2) For the measured BMF5% of the test substance (n = 5), the probability that the next (the sixth) BMF5% is below 0.0334 should exceed 95% based on statistical post-processing. It is worth noting that the BMF5% values should only be applied for non-ionizable lipid soluble compounds. Application of our suggested approach to Japan implies that the criterion for chemical substances that are not highly bioaccumulative in the dietary exposure bioaccumulation fish test would be increased from 0.007 to 0.0149.
Collapse
Affiliation(s)
- Yoshiyuki Inoue
- Chemicals Evaluation and Research Institute, Japan, CERI Kurume, 3-2-7 Miyanojin, Kurume-Shi, Fukuoka, 839-0801, Japan.
| | - Yuka Koga
- Chemicals Evaluation and Research Institute, Japan, CERI Kurume, 3-2-7 Miyanojin, Kurume-Shi, Fukuoka, 839-0801, Japan
| | - Ryoko Takeshige
- Chemicals Evaluation and Research Institute, Japan, CERI Kurume, 3-2-7 Miyanojin, Kurume-Shi, Fukuoka, 839-0801, Japan
| | - Hidekazu Murakami
- Chemicals Evaluation and Research Institute, Japan, CERI Kurume, 3-2-7 Miyanojin, Kurume-Shi, Fukuoka, 839-0801, Japan
| | - Takumi Takamura
- Chemicals Evaluation and Research Institute, Japan, CERI Kurume, 3-2-7 Miyanojin, Kurume-Shi, Fukuoka, 839-0801, Japan
| |
Collapse
|
6
|
Bortz DM, Messenger DA, Dukic V. Direct Estimation of Parameters in ODE Models Using WENDy: Weak-Form Estimation of Nonlinear Dynamics. Bull Math Biol 2023; 85:110. [PMID: 37796411 PMCID: PMC10556152 DOI: 10.1007/s11538-023-01208-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2023] [Accepted: 08/30/2023] [Indexed: 10/06/2023]
Abstract
We introduce the Weak-form Estimation of Nonlinear Dynamics (WENDy) method for estimating model parameters for non-linear systems of ODEs. Without relying on any numerical differential equation solvers, WENDy computes accurate estimates and is robust to large (biologically relevant) levels of measurement noise. For low dimensional systems with modest amounts of data, WENDy is competitive with conventional forward solver-based nonlinear least squares methods in terms of speed and accuracy. For both higher dimensional systems and stiff systems, WENDy is typically both faster (often by orders of magnitude) and more accurate than forward solver-based approaches. The core mathematical idea involves an efficient conversion of the strong form representation of a model to its weak form, and then solving a regression problem to perform parameter inference. The core statistical idea rests on the Errors-In-Variables framework, which necessitates the use of the iteratively reweighted least squares algorithm. Further improvements are obtained by using orthonormal test functions, created from a set of [Formula: see text] bump functions of varying support sizes.We demonstrate the high robustness and computational efficiency by applying WENDy to estimate parameters in some common models from population biology, neuroscience, and biochemistry, including logistic growth, Lotka-Volterra, FitzHugh-Nagumo, Hindmarsh-Rose, and a Protein Transduction Benchmark model. Software and code for reproducing the examples is available at https://github.com/MathBioCU/WENDy .
Collapse
Affiliation(s)
- David M Bortz
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA.
| | - Daniel A Messenger
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| | - Vanja Dukic
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| |
Collapse
|
7
|
Massonis G, Villaverde AF, Banga JR. Distilling identifiable and interpretable dynamic models from biological data. PLoS Comput Biol 2023; 19:e1011014. [PMID: 37851682 PMCID: PMC10615316 DOI: 10.1371/journal.pcbi.1011014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 10/30/2023] [Accepted: 10/03/2023] [Indexed: 10/20/2023] Open
Abstract
Mechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open problem in computational biology. Currently, many research efforts are focused on model discovery, i.e. automating the development of interpretable models from data. One of the main frameworks is sparse regression, where the sparse identification of nonlinear dynamics (SINDy) algorithm and its variants have enjoyed great success. SINDy-PI is an extension which allows the discovery of rational nonlinear terms, thus enabling the identification of kinetic functions common in biochemical networks, such as Michaelis-Menten. SINDy-PI also pays special attention to the recovery of parsimonious models (Occam's razor). Here we focus on biological models composed of sets of deterministic nonlinear ordinary differential equations. We present a methodology that, combined with SINDy-PI, allows the automatic discovery of structurally identifiable and observable models which are also mechanistically interpretable. The lack of structural identifiability and observability makes it impossible to uniquely infer parameter and state variables, which can compromise the usefulness of a model by distorting its mechanistic significance and hampering its ability to produce biological insights. We illustrate the performance of our method with six case studies. We find that, despite enforcing sparsity, SINDy-PI sometimes yields models that are unidentifiable. In these cases we show how our method transforms their equations in order to obtain a structurally identifiable and observable model which is also interpretable.
Collapse
Affiliation(s)
- Gemma Massonis
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Galicia, Spain
| | - Alejandro F. Villaverde
- CITMAga, Santiago de Compostela, Galicia, Spain
- Universidade de Vigo, Department of Systems and Control Engineering, Vigo, Galicia, Spain
| | - Julio R. Banga
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Galicia, Spain
| |
Collapse
|
8
|
Course K, Nair PB. State estimation of a physical system with unknown governing equations. Nature 2023; 622:261-267. [PMID: 37821594 PMCID: PMC10567554 DOI: 10.1038/s41586-023-06574-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 08/25/2023] [Indexed: 10/13/2023]
Abstract
State estimation is concerned with reconciling noisy observations of a physical system with the mathematical model believed to predict its behaviour for the purpose of inferring unmeasurable states and denoising measurable ones1,2. Traditional state-estimation techniques rely on strong assumptions about the form of uncertainty in mathematical models, typically that it manifests as an additive stochastic perturbation or is parametric in nature3. Here we present a reparametrization trick for stochastic variational inference with Markov Gaussian processes that enables an approximate Bayesian approach for state estimation in which the equations governing how the system evolves over time are partially or completely unknown. In contrast to classical state-estimation techniques, our method learns the missing terms in the mathematical model and a state estimate simultaneously from an approximate Bayesian perspective. This development enables the application of state-estimation methods to problems that have so far proved to be beyond reach. Finally, although we focus on state estimation, the advancements to stochastic variational inference made here are applicable to a broader class of problems in machine learning.
Collapse
Affiliation(s)
- Kevin Course
- Institute for Aerospace Studies, University of Toronto, Toronto, Ontario, Canada
| | - Prasanth B Nair
- Institute for Aerospace Studies, University of Toronto, Toronto, Ontario, Canada.
| |
Collapse
|
9
|
Chen C, Chen N, Wu JL. CEBoosting: Online sparse identification of dynamical systems with regime switching by causation entropy boosting. CHAOS (WOODBURY, N.Y.) 2023; 33:083114. [PMID: 37549116 DOI: 10.1063/5.0154777] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/15/2023] [Accepted: 07/19/2023] [Indexed: 08/09/2023]
Abstract
Regime switching is ubiquitous in many complex dynamical systems with multiscale features, chaotic behavior, and extreme events. In this paper, a causation entropy boosting (CEBoosting) strategy is developed to facilitate the detection of regime switching and the discovery of the dynamics associated with the new regime via online model identification. The causation entropy, which can be efficiently calculated, provides a logic value of each candidate function in a pre-determined library. The reversal of one or a few such causation entropy indicators associated with the model calibrated for the current regime implies the detection of regime switching. Despite the short length of each batch formed by the sequential data, the accumulated value of causation entropy corresponding to a sequence of data batches leads to a robust indicator. With the detected rectification of the model structure, the subsequent parameter estimation becomes a quadratic optimization problem, which is solved using closed analytic formulas. Using the Lorenz 96 model, it is shown that the causation entropy indicator can be efficiently calculated, and the method applies to moderately large dimensional systems. The CEBoosting algorithm is also adaptive to the situation with partial observations. It is shown via a stochastic parameterized model that the CEBoosting strategy can be combined with data assimilation to identify regime switching triggered by the unobserved latent processes. In addition, the CEBoosting method is applied to a nonlinear paradigm model for topographic mean flow interaction, demonstrating the online detection of regime switching in the presence of strong intermittency and extreme events.
Collapse
Affiliation(s)
- Chuanqi Chen
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Nan Chen
- Department of Mathematics, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Jin-Long Wu
- Department of Mechanical Engineering, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| |
Collapse
|
10
|
Goyal P, Benner P. Neural ordinary differential equations with irregular and noisy data. ROYAL SOCIETY OPEN SCIENCE 2023; 10:221475. [PMID: 37476515 PMCID: PMC10354476 DOI: 10.1098/rsos.221475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 06/23/2023] [Indexed: 07/22/2023]
Abstract
Measurement noise is an integral part of collecting data of a physical process. Thus, noise removal is necessary to draw conclusions from these data, and it often becomes essential to construct dynamical models using these data. We discuss a methodology to learn differential equation(s) using noisy and irregularly sampled measurements. In our methodology, the main innovation can be seen in the integration of deep neural networks with the neural ordinary differential equations (ODEs) approach. Precisely, we aim at learning a neural network that provides (approximately) an implicit representation of the data and an additional neural network that models the vector fields of the dependent variables. We combine these two networks by constraints using neural ODEs. The proposed framework to learn a model describing the vector field is highly effective under noisy measurements. The approach can handle scenarios where dependent variables are unavailable at the same temporal grid. Moreover, a particular structure, e.g. second order with respect to time, can easily be incorporated. We demonstrate the effectiveness of the proposed method for learning models using data obtained from various differential equations and present a comparison with the neural ODE method that does not make any special treatment to noise. Additionally, we discuss an ensemble approach to improve the performance of the proposed approach further.
Collapse
Affiliation(s)
- Pawan Goyal
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstrasse 1, 39106 Magdeburg, Germany
| | - Peter Benner
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstrasse 1, 39106 Magdeburg, Germany
| |
Collapse
|
11
|
Botvinick-Greenhouse J, Martin R, Yang Y. Learning dynamics on invariant measures using PDE-constrained optimization. CHAOS (WOODBURY, N.Y.) 2023; 33:063152. [PMID: 37368043 DOI: 10.1063/5.0149673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Accepted: 06/02/2023] [Indexed: 06/28/2023]
Abstract
We extend the methodology in Yang et al. [SIAM J. Appl. Dyn. Syst. 22, 269-310 (2023)] to learn autonomous continuous-time dynamical systems from invariant measures. The highlight of our approach is to reformulate the inverse problem of learning ODEs or SDEs from data as a PDE-constrained optimization problem. This shift in perspective allows us to learn from slowly sampled inference trajectories and perform uncertainty quantification for the forecasted dynamics. Our approach also yields a forward model with better stability than direct trajectory simulation in certain situations. We present numerical results for the Van der Pol oscillator and the Lorenz-63 system, together with real-world applications to Hall-effect thruster dynamics and temperature prediction, to demonstrate the effectiveness of the proposed approach.
Collapse
Affiliation(s)
| | - Robert Martin
- DEVCOM Army Research Laboratory, Research Triangle Park, Durham, North Carolina 27709, USA
| | - Yunan Yang
- Institute for Theoretical Studies, ETH Zürich, Zürich 8092, Switzerland
| |
Collapse
|
12
|
Prokop B, Gelens L, Pelz PF, Friesen J. Challenges in identifying simple pattern-forming mechanisms in the development of settlements using demographic data. Phys Rev E 2023; 107:064305. [PMID: 37464706 DOI: 10.1103/physreve.107.064305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2021] [Accepted: 05/09/2023] [Indexed: 07/20/2023]
Abstract
The rapid increase of population and settlement structures in the Global South during recent decades has motivated the development of suitable models to describe their formation and evolution. Such settlement formation has been previously suggested to be dynamically driven by simple pattern-forming mechanisms. Here, we explore the use of a data-driven white-box approach, called SINDy, to discover differential equation models directly from available spatiotemporal demographic data for three representative regions of the Global South. We show that the current resolution and observation time of the available data are insufficient to uncover relevant pattern-forming mechanisms in settlement development. Using synthetic data generated with a generic pattern-forming model, the Allen-Cahn equation, we characterize what the requirements are for spatial and temporal resolution, as well as observation time, to successfully identify possible model system equations. Overall, the study provides a theoretical framework for the analysis of large-scale geographical and/or ecological systems, and it motivates further improvements in optimization approaches and data collection.
Collapse
Affiliation(s)
- Bartosz Prokop
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Leuven 3000, Belgium
| | - Lendert Gelens
- Laboratory of Dynamics in Biological Systems, Department of Cellular and Molecular Medicine, KU Leuven, Leuven 3000, Belgium
| | - Peter F Pelz
- Chair of Fluid Systems, TU Darmstadt, 64287 Darmstadt, Germany
| | - John Friesen
- Chair of Fluid Systems, TU Darmstadt, 64287 Darmstadt, Germany
| |
Collapse
|
13
|
Bortz DM, Messenger DA, Dukic V. Direct Estimation of Parameters in ODE Models Using WENDy: Weak-form Estimation of Nonlinear Dynamics. ARXIV 2023:arXiv:2302.13271v3. [PMID: 36911272 PMCID: PMC10002818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/13/2023]
Abstract
We introduce the Weak-form Estimation of Nonlinear Dynamics (WENDy) method for estimating model parameters for non-linear systems of ODEs. Without relying on any numerical differential equation solvers, WENDy computes accurate estimates and is robust to large (biologically relevant) levels of measurement noise. For low dimensional systems with modest amounts of data, WENDy is competitive with conventional forward solver-based nonlinear least squares methods in terms of speed and accuracy. For both higher dimensional systems and stiff systems, WENDy is typically both faster (often by orders of magnitude) and more accurate than forward solver-based approaches. The core mathematical idea involves an efficient conversion of the strong form representation of a model to its weak form, and then solving a regression problem to perform parameter inference. The core statistical idea rests on the Errors-In-Variables framework, which necessitates the use of the iteratively reweighted least squares algorithm. Further improvements are obtained by using orthonormal test functions, created from a set of C-infinity bump functions of varying support sizes. We demonstrate the high robustness and computational efficiency by applying WENDy to estimate parameters in some common models from population biology, neuroscience, and biochemistry, including logistic growth, Lotka-Volterra, FitzHugh-Nagumo, Hindmarsh-Rose, and a Protein Transduction Benchmark model. Software and code for reproducing the examples is available at (https://github.com/MathBioCU/WENDy).
Collapse
Affiliation(s)
- David M Bortz
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526
| | | | | |
Collapse
|
14
|
Deep kernel learning of dynamical models from high-dimensional noisy data. Sci Rep 2022; 12:21530. [PMID: 36513711 PMCID: PMC9747975 DOI: 10.1038/s41598-022-25362-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2022] [Accepted: 11/29/2022] [Indexed: 12/15/2022] Open
Abstract
This work proposes a stochastic variational deep kernel learning method for the data-driven discovery of low-dimensional dynamical models from high-dimensional noisy data. The framework is composed of an encoder that compresses high-dimensional measurements into low-dimensional state variables, and a latent dynamical model for the state variables that predicts the system evolution over time. The training of the proposed model is carried out in an unsupervised manner, i.e., not relying on labeled data. Our learning method is evaluated on the motion of a pendulum-a well studied baseline for nonlinear model identification and control with continuous states and control inputs-measured via high-dimensional noisy RGB images. Results show that the method can effectively denoise measurements, learn compact state representations and latent dynamical models, as well as identify and quantify modeling uncertainties.
Collapse
|
15
|
Dong X, Bai YL, Lu Y, Fan M. An improved sparse identification of nonlinear dynamics with Akaike information criterion and group sparsity. NONLINEAR DYNAMICS 2022; 111:1485-1510. [PMID: 36246669 PMCID: PMC9552166 DOI: 10.1007/s11071-022-07875-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/03/2022] [Indexed: 06/16/2023]
Abstract
A crucial challenge encountered in diverse areas of engineering applications involves speculating the governing equations based upon partial observations. On this basis, a variant of the sparse identification of nonlinear dynamics (SINDy) algorithm is developed. First, the Akaike information criterion (AIC) is integrated to enforce model selection by hierarchically ranking the most informative model from several manageable candidate models. This integration avoids restricting the number of candidate models, which is a disadvantage of the traditional methods for model selection. The subsequent procedure expands the structure of dynamics from ordinary differential equations (ODEs) to partial differential equations (PDEs), while group sparsity is employed to identify the nonconstant coefficients of partial differential equations. Of practical consideration within an integrated frame is data processing, which tends to treat noise separate from signals and tends to parametrize the noise probability distribution. In particular, the coefficients of a species of canonical ODEs and PDEs, such as the Van der Pol, Rössler, Burgers' and Kuramoto-Sivashinsky equations, can be identified correctly with the introduction of noise. Furthermore, except for normal noise, the proposed approach is able to capture the distribution of uniform noise. In accordance with the results of the experiments, the computational speed is markedly advanced and possesses robustness.
Collapse
Affiliation(s)
- Xin Dong
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Yu-Long Bai
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Yani Lu
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Manhong Fan
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| |
Collapse
|
16
|
Messenger DA, Wheeler GE, Liu X, Bortz DM. Learning anisotropic interaction rules from individual trajectories in a heterogeneous cellular population. J R Soc Interface 2022; 19:20220412. [PMCID: PMC9554727 DOI: 10.1098/rsif.2022.0412] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Interacting particle system (IPS) models have proven to be highly successful for describing the spatial movement of organisms. However, it is challenging to infer the interaction rules directly from data. In the field of equation discovery, the weak-form sparse identification of nonlinear dynamics (WSINDy) methodology has been shown to be computationally efficient for identifying the governing equations of complex systems from noisy data. Motivated by the success of IPS models to describe the spatial movement of organisms, we develop WSINDy for the second-order IPS to learn equations for communities of cells. Our approach learns the directional interaction rules for each individual cell that in aggregate govern the dynamics of a heterogeneous population of migrating cells. To sort a cell according to the active classes present in its model, we also develop a novel ad hoc classification scheme (which accounts for the fact that some cells do not have enough evidence to accurately infer a model). Aggregated models are then constructed hierarchically to simultaneously identify different species of cells present in the population and determine best-fit models for each species. We demonstrate the efficiency and proficiency of the method on several test scenarios, motivated by common cell migration experiments.
Collapse
Affiliation(s)
- Daniel A. Messenger
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| | - Graycen E. Wheeler
- Department of Biochemistry, University of Colorado, Boulder, CO 80309-0526, USA
| | - Xuedong Liu
- Department of Biochemistry, University of Colorado, Boulder, CO 80309-0526, USA
| | - David M. Bortz
- Department of Applied Mathematics, University of Colorado, Boulder, CO 80309-0526, USA
| |
Collapse
|
17
|
Process Model Inversion in the Data-Driven Engineering Context for Improved Parameter Sensitivities. Processes (Basel) 2022. [DOI: 10.3390/pr10091764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022] Open
Abstract
Industry 4.0 has embraced process models in recent years, and the use of model-based digital twins has become even more critical in process systems engineering, monitoring, and control. However, the reliability of these models depends on the model parameters available. The accuracy of the estimated parameters is, in turn, determined by the amount and quality of the measurement data and the algorithm used for parameter identification. For the definition of the parameter identification problem, the ordinary least squares framework is still state-of-the-art in the literature, and better parameter estimates are only possible with additional data. In this work, we present an alternative strategy to identify model parameters by incorporating differential flatness for model inversion and neural ordinary differential equations for surrogate modeling. The novel concept results in an input-least-squares-based parameter identification problem with significant parameter sensitivity changes. To study these sensitivity effects, we use a classic one-dimensional diffusion-type problem, i.e., an omnipresent equation in process systems engineering and transport phenomena. As shown, the proposed concept ensures higher parameter sensitivities for two relevant scenarios. Based on the results derived, we also discuss general implications for data-driven engineering concepts used to identify process model parameters in the recent literature.
Collapse
|
18
|
Data-driven model discovery of ideal four-wave mixing in nonlinear fibre optics. Sci Rep 2022; 12:12711. [PMID: 35882898 PMCID: PMC9325870 DOI: 10.1038/s41598-022-16586-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Accepted: 07/12/2022] [Indexed: 02/08/2023] Open
Abstract
We show using numerical simulations that data driven discovery using sparse regression can be used to extract the governing differential equation model of ideal four-wave mixing in a nonlinear Schrödinger equation optical fibre system. Specifically, we consider the evolution of a strong single frequency pump interacting with two frequency detuned sidebands where the dynamics are governed by a reduced Hamiltonian system describing pump-sideband coupling. Based only on generated dynamical data from this system, sparse regression successfully recovers the underlying physical model, fully capturing the dynamical landscape on both sides of the system separatrix. We also discuss how analysing an ensemble over different initial conditions allows us to reliably identify the governing model in the presence of noise. These results extend the use of data driven discovery to ideal four-wave mixing in nonlinear Schrödinger equation systems.
Collapse
|
19
|
Data-driven discovery of the governing equations of dynamical systems via moving horizon optimization. Sci Rep 2022; 12:11836. [PMID: 35821394 PMCID: PMC9276674 DOI: 10.1038/s41598-022-13644-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2022] [Accepted: 05/26/2022] [Indexed: 12/03/2022] Open
Abstract
Discovering the governing laws underpinning physical and chemical phenomena entirely from data is a key step towards understanding and ultimately controlling systems in science and engineering. Noisy measurements and complex, highly nonlinear underlying dynamics hinder the identification of such governing laws. In this work, we introduce a machine learning framework rooted in moving horizon nonlinear optimization for identifying governing equations in the form of ordinary differential equations from noisy experimental data sets. Our approach evaluates sequential subsets of measurement data, and exploits statistical arguments to learn truly parsimonious governing equations from a large dictionary of basis functions. The proposed framework reduces gradient approximation errors by implicitly embedding an advanced numerical discretization scheme, which improves robustness to noise as well as to model stiffness. Canonical nonlinear dynamical system examples are used to demonstrate that our approach can accurately recover parsimonious governing laws under increasing levels of measurement noise, and outperform state of the art frameworks in the literature. Further, we consider a non-isothermal chemical reactor example to demonstrate that the proposed framework can cope with basis functions that have nonlinear (unknown) parameterizations.
Collapse
|
20
|
Goyal P, Benner P. Discovery of nonlinear dynamical systems using a Runge-Kutta inspired dictionary-based sparse regression approach. Proc Math Phys Eng Sci 2022; 478:20210883. [PMID: 35756880 PMCID: PMC9215218 DOI: 10.1098/rspa.2021.0883] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2021] [Accepted: 05/16/2022] [Indexed: 11/12/2022] Open
Abstract
In this work, we blend machine learning and dictionary-based learning with numerical analysis tools to discover differential equations from noisy and sparsely sampled measurement data of time-dependent processes. We use the fact that given a dictionary containing large candidate nonlinear functions, dynamical models can often be described by a few appropriately chosen basis functions. As a result, we obtain parsimonious models that can be better interpreted by practitioners, and potentially generalize better beyond the sampling regime than black-box modelling. In this work, we integrate a numerical integration framework with dictionary learning that yields differential equations without requiring or approximating derivative information at any stage. Hence, it is utterly effective for corrupted and sparsely sampled data. We discuss its extension to governing equations, containing rational nonlinearities that typically appear in biological networks. Moreover, we generalized the method to governing equations subject to parameter variations and externally controlled inputs. We demonstrate the efficiency of the method to discover a number of diverse differential equations using noisy measurements, including a model describing neural dynamics, chaotic Lorenz model, Michaelis-Menten kinetics and a parameterized Hopf normal form.
Collapse
Affiliation(s)
- Pawan Goyal
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstraße 1, 39106 Magdeburg, Germany
| | - Peter Benner
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstraße 1, 39106 Magdeburg, Germany
| |
Collapse
|