1
|
Brückner DB, Broedersz CP. Learning dynamical models of single and collective cell migration: a review. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2024; 87:056601. [PMID: 38518358 DOI: 10.1088/1361-6633/ad36d2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 03/22/2024] [Indexed: 03/24/2024]
Abstract
Single and collective cell migration are fundamental processes critical for physiological phenomena ranging from embryonic development and immune response to wound healing and cancer metastasis. To understand cell migration from a physical perspective, a broad variety of models for the underlying physical mechanisms that govern cell motility have been developed. A key challenge in the development of such models is how to connect them to experimental observations, which often exhibit complex stochastic behaviours. In this review, we discuss recent advances in data-driven theoretical approaches that directly connect with experimental data to infer dynamical models of stochastic cell migration. Leveraging advances in nanofabrication, image analysis, and tracking technology, experimental studies now provide unprecedented large datasets on cellular dynamics. In parallel, theoretical efforts have been directed towards integrating such datasets into physical models from the single cell to the tissue scale with the aim of conceptualising the emergent behaviour of cells. We first review how this inference problem has been addressed in both freely migrating and confined cells. Next, we discuss why these dynamics typically take the form of underdamped stochastic equations of motion, and how such equations can be inferred from data. We then review applications of data-driven inference and machine learning approaches to heterogeneity in cell behaviour, subcellular degrees of freedom, and to the collective dynamics of multicellular systems. Across these applications, we emphasise how data-driven methods can be integrated with physical active matter models of migrating cells, and help reveal how underlying molecular mechanisms control cell behaviour. Together, these data-driven approaches are a promising avenue for building physical models of cell migration directly from experimental data, and for providing conceptual links between different length-scales of description.
Collapse
Affiliation(s)
- David B Brückner
- Institute of Science and Technology Austria, Am Campus 1, 3400 Klosterneuburg, Austria
| | - Chase P Broedersz
- Department of Physics and Astronomy, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
- Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Department of Physics, Ludwig-Maximilian-University Munich, Theresienstr. 37, D-80333 Munich, Germany
| |
Collapse
|
2
|
Goyal P, Benner P. Neural ordinary differential equations with irregular and noisy data. ROYAL SOCIETY OPEN SCIENCE 2023; 10:221475. [PMID: 37476515 PMCID: PMC10354476 DOI: 10.1098/rsos.221475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Accepted: 06/23/2023] [Indexed: 07/22/2023]
Abstract
Measurement noise is an integral part of collecting data of a physical process. Thus, noise removal is necessary to draw conclusions from these data, and it often becomes essential to construct dynamical models using these data. We discuss a methodology to learn differential equation(s) using noisy and irregularly sampled measurements. In our methodology, the main innovation can be seen in the integration of deep neural networks with the neural ordinary differential equations (ODEs) approach. Precisely, we aim at learning a neural network that provides (approximately) an implicit representation of the data and an additional neural network that models the vector fields of the dependent variables. We combine these two networks by constraints using neural ODEs. The proposed framework to learn a model describing the vector field is highly effective under noisy measurements. The approach can handle scenarios where dependent variables are unavailable at the same temporal grid. Moreover, a particular structure, e.g. second order with respect to time, can easily be incorporated. We demonstrate the effectiveness of the proposed method for learning models using data obtained from various differential equations and present a comparison with the neural ODE method that does not make any special treatment to noise. Additionally, we discuss an ensemble approach to improve the performance of the proposed approach further.
Collapse
Affiliation(s)
- Pawan Goyal
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstrasse 1, 39106 Magdeburg, Germany
| | - Peter Benner
- Max Planck Institute for Dynamics of Complex Technical Systems, Standtorstrasse 1, 39106 Magdeburg, Germany
| |
Collapse
|
3
|
Sachak-Patwa R, Lafferty EI, Schmit CJ, Thompson RN, Byrne HM. A target-cell limited model can reproduce influenza infection dynamics in hosts with differing immune responses. J Theor Biol 2023; 567:111491. [PMID: 37044357 DOI: 10.1016/j.jtbi.2023.111491] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2022] [Revised: 03/02/2023] [Accepted: 04/05/2023] [Indexed: 04/14/2023]
Abstract
We consider a hierarchy of ordinary differential equation models that describe the within-host viral kinetics of influenza infections: the IR model explicitly accounts for an immune response to the virus, while the simpler, target-cell limited TEIV and TV models do not. We show that when the IR model is fitted to pooled experimental murine data of the viral load, fraction of dead cells, and immune response levels, its parameters values can be determined. However, if, as is common, only viral load data are available, we can estimate parameters of the TEIV and TV models but not the IR model. These results are substantiated by a structural and practical identifiability analysis. We then use the IR model to generate synthetic data representing infections in hosts whose immune responses differ. We fit the TV model to these synthetic datasets and show that it can reproduce the characteristic exponential increase and decay of viral load generated by the IR model. Furthermore, the values of the fitted parameters of the TV model can be mapped from the immune response parameters in the IR model. We conclude that, if only viral load data are available, a simple target-cell limited model can reproduce influenza infection dynamics and distinguish between hosts with differing immune responses.
Collapse
Affiliation(s)
- Rahil Sachak-Patwa
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford, OX2 6GG, UK.
| | - Erin I Lafferty
- Biosensors Beyond Borders Limited, 9 Bedford Square, London, WC1B 3RE, UK
| | - Claude J Schmit
- Biosensors Beyond Borders Limited, 9 Bedford Square, London, WC1B 3RE, UK
| | - Robin N Thompson
- Mathematics Institute, University of Warwick, Zeeman Building, Coventry, CV4 7AL, UK; Zeeman Institute for Systems Biology and Infectious Disease Epidemiology Research, University of Warwick, Coventry, CV4 7AL, UK
| | - Helen M Byrne
- Mathematical Institute, University of Oxford, Andrew Wiles Building, Radcliffe Observatory Quarter, Woodstock Road, Oxford, OX2 6GG, UK
| |
Collapse
|
4
|
Roman A, Palanski K, Nemenman I, Ryu WS. A dynamical model of C. elegans thermal preference reveals independent excitatory and inhibitory learning pathways. Proc Natl Acad Sci U S A 2023; 120:e2215191120. [PMID: 36940330 PMCID: PMC10068832 DOI: 10.1073/pnas.2215191120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 02/19/2023] [Indexed: 03/22/2023] Open
Abstract
Caenorhabditis elegans is capable of learning and remembering behaviorally relevant cues such as smells, tastes, and temperature. This is an example of associative learning, a process in which behavior is modified by making associations between various stimuli. Since the mathematical theory of conditioning does not account for some of its salient aspects, such as spontaneous recovery of extinguished associations, accurate modeling of behavior of real animals during conditioning has turned out difficult. Here, we do this in the context of the dynamics of the thermal preference of C. elegans. We quantify C. elegans thermotaxis in response to various conditioning temperatures, starvation durations, and genetic perturbations using a high-resolution microfluidic droplet assay. We model these data comprehensively, within a biologically interpretable, multi-modal framework. We find that the strength of the thermal preference is composed of two independent, genetically separable contributions and requires a model with at least four dynamical variables. One pathway positively associates the experienced temperature independently of food and the other negatively associates with the temperature when food is absent. The multidimensional structure of the association strength provides an explanation for the apparent classical temperature-food association of C. elegans thermal preference and a number of longstanding questions in animal learning, including spontaneous recovery, asymmetric response to appetitive vs. aversive cues, latent inhibition, and generalization among similar cues.
Collapse
Affiliation(s)
- Ahmed Roman
- Department of Physics, Emory University, Atlanta, GA30322
| | | | - Ilya Nemenman
- Department of Physics, Emory University, Atlanta, GA30322
- Department of Biology, Emory University, Atlanta, GA30322
- Initiative in Theory and Modeling of Living Systems, Emory University, Atlanta, GA30322
| | - William S. Ryu
- Department of Physics, University of Toronto, Toronto, ON M5S 1A7, Canada
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| |
Collapse
|
5
|
Teichner R, Shomar A, Barak O, Brenner N, Marom S, Meir R, Eytan D. Identifying regulation with adversarial surrogates. Proc Natl Acad Sci U S A 2023; 120:e2216805120. [PMID: 36920920 PMCID: PMC10041131 DOI: 10.1073/pnas.2216805120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2022] [Accepted: 02/15/2023] [Indexed: 03/16/2023] Open
Abstract
Homeostasis, the ability to maintain a relatively constant internal environment in the face of perturbations, is a hallmark of biological systems. It is believed that this constancy is achieved through multiple internal regulation and control processes. Given observations of a system, or even a detailed model of one, it is both valuable and extremely challenging to extract the control objectives of the homeostatic mechanisms. In this work, we develop a robust data-driven method to identify these objectives, namely to understand: "what does the system care about?". We propose an algorithm, Identifying Regulation with Adversarial Surrogates (IRAS), that receives an array of temporal measurements of the system and outputs a candidate for the control objective, expressed as a combination of observed variables. IRAS is an iterative algorithm consisting of two competing players. The first player, realized by an artificial deep neural network, aims to minimize a measure of invariance we refer to as the coefficient of regulation. The second player aims to render the task of the first player more difficult by forcing it to extract information about the temporal structure of the data, which is absent from similar "surrogate" data. We test the algorithm on four synthetic and one natural data set, demonstrating excellent empirical results. Interestingly, our approach can also be used to extract conserved quantities, e.g., energy and momentum, in purely physical systems, as we demonstrate empirically.
Collapse
Affiliation(s)
- Ron Teichner
- Viterbi Department of Electrical & Computer Engineering, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Aseel Shomar
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Department of Chemical Engineering, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Omri Barak
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Rappaport Faculty of Medicine, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Naama Brenner
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Department of Chemical Engineering, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Shimon Marom
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Rappaport Faculty of Medicine, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Ron Meir
- Viterbi Department of Electrical & Computer Engineering, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| | - Danny Eytan
- Network Biology Research Lab, Technion, Israel Institute of Technology, 32000 Haifa, Israel
- Rappaport Faculty of Medicine, Technion, Israel Institute of Technology, 32000 Haifa, Israel
| |
Collapse
|
6
|
Wang J, Xu B, Lai J, Wang Y, Hu C, Li H, Song A. An Improved Koopman-MPC Framework for Data-Driven Modeling and Control of Soft Actuators. IEEE Robot Autom Lett 2023. [DOI: 10.1109/lra.2022.3229235] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
Affiliation(s)
- Jiajin Wang
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Baoguo Xu
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Jianwei Lai
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Yifei Wang
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Cong Hu
- Guangxi Key Laboratory of Automatic Detecting Technology and Instruments, Guilin University of Electronic Technology, Guilin, China
| | - Huijun Li
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| | - Aiguo Song
- State Key Laboratory of Bioelectronics, Jiangsu Key Laboratory of Remote Measurement and Control, School of Instrument Science and Engineering, Southeast University, Nanjing, China
| |
Collapse
|
7
|
Zhao Y, Nassar J, Jordan I, Bugallo M, Park IM. Streaming Variational Monte Carlo. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2023; 45:1150-1161. [PMID: 35201981 PMCID: PMC10082974 DOI: 10.1109/tpami.2022.3153225] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Nonlinear state-space models are powerful tools to describe dynamical structures in complex time series. In a streaming setting where data are processed one sample at a time, simultaneous inference of the state and its nonlinear dynamics has posed significant challenges in practice. We develop a novel online learning framework, leveraging variational inference and sequential Monte Carlo, which enables flexible and accurate Bayesian joint filtering. Our method provides an approximation of the filtering posterior which can be made arbitrarily close to the true filtering distribution for a wide class of dynamics models and observation models. Specifically, the proposed framework can efficiently approximate a posterior over the dynamics using sparse Gaussian processes, allowing for an interpretable model of the latent dynamics. Constant time complexity per sample makes our approach amenable to online learning scenarios and suitable for real-time applications.
Collapse
|
8
|
Quinn KN, Abbott MC, Transtrum MK, Machta BB, Sethna JP. Information geometry for multiparameter models: new perspectives on the origin of simplicity. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 86:10.1088/1361-6633/aca6f8. [PMID: 36576176 PMCID: PMC10018491 DOI: 10.1088/1361-6633/aca6f8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Accepted: 11/29/2022] [Indexed: 05/20/2023]
Abstract
Complex models in physics, biology, economics, and engineering are oftensloppy, meaning that the model parameters are not well determined by the model predictions for collective behavior. Many parameter combinations can vary over decades without significant changes in the predictions. This review uses information geometry to explore sloppiness and its deep relation to emergent theories. We introduce themodel manifoldof predictions, whose coordinates are the model parameters. Itshyperribbonstructure explains why only a few parameter combinations matter for the behavior. We review recent rigorous results that connect the hierarchy of hyperribbon widths to approximation theory, and to the smoothness of model predictions under changes of the control variables. We discuss recent geodesic methods to find simpler models on nearby boundaries of the model manifold-emergent theories with fewer parameters that explain the behavior equally well. We discuss a Bayesian prior which optimizes the mutual information between model parameters and experimental data, naturally favoring points on the emergent boundary theories and thus simpler models. We introduce a 'projected maximum likelihood' prior that efficiently approximates this optimal prior, and contrast both to the poor behavior of the traditional Jeffreys prior. We discuss the way the renormalization group coarse-graining in statistical mechanics introduces a flow of the model manifold, and connect stiff and sloppy directions along the model manifold with relevant and irrelevant eigendirections of the renormalization group. Finally, we discuss recently developed 'intensive' embedding methods, allowing one to visualize the predictions of arbitrary probabilistic models as low-dimensional projections of an isometric embedding, and illustrate our method by generating the model manifold of the Ising model.
Collapse
Affiliation(s)
- Katherine N Quinn
- Center for the Physics of Biological Function, Princeton University, Princeton, NJ, United States of America
| | - Michael C Abbott
- Department of Physics, Yale University, New Haven, CT, United States of America
| | - Mark K Transtrum
- Department of Physics and Astronomy, Brigham Young University, Provo, UT, United States of America
| | - Benjamin B Machta
- Department of Physics and Systems Biology Institute, Yale University, New Haven, CT, United States of America
| | - James P Sethna
- Department of Physics, Cornell University, Ithaca, NY, United States of America
| |
Collapse
|
9
|
Yoshinaga N, Tokuda S. Bayesian modeling of pattern formation from one snapshot of pattern. Phys Rev E 2022; 106:065301. [PMID: 36671103 DOI: 10.1103/physreve.106.065301] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Accepted: 11/07/2022] [Indexed: 06/17/2023]
Abstract
Partial differential equations (PDEs) have been widely used to reproduce patterns in nature and to give insight into the mechanism underlying pattern formation. Although many PDE models have been proposed, they rely on the pre-request knowledge of physical laws and symmetries, and developing a model to reproduce a given desired pattern remains difficult. We propose a method, referred to as Bayesian modeling of PDEs (BM-PDEs), to estimate the best dynamical PDE for one snapshot of a objective pattern under the stationary state without ground truth. We apply BM-PDEs to nontrivial patterns, such as quasicrystals (QCs), a double gyroid, and Frank-Kasper structures. We also generate three-dimensional dodecagonal QCs from a PDE model. This is done by using the estimated parameters for the Frank-Kasper A15 structure, which closely approximates the local structures of QCs. Our method works for noisy patterns and the pattern synthesized without the ground-truth parameters, which are required for the application toward experimental data.
Collapse
Affiliation(s)
- Natsuhiko Yoshinaga
- WPI-Advanced Institute for Materials Research, Tohoku University, Sendai 980-8577, Japan
- MathAM-OIL, AIST, Sendai 980-8577, Japan
| | - Satoru Tokuda
- MathAM-OIL, AIST, Sendai 980-8577, Japan
- Research Institute for Information Technology, Kyushu University, Kasuga 816-8580, Japan
| |
Collapse
|
10
|
Dong X, Bai YL, Lu Y, Fan M. An improved sparse identification of nonlinear dynamics with Akaike information criterion and group sparsity. NONLINEAR DYNAMICS 2022; 111:1485-1510. [PMID: 36246669 PMCID: PMC9552166 DOI: 10.1007/s11071-022-07875-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/31/2022] [Accepted: 09/03/2022] [Indexed: 06/16/2023]
Abstract
A crucial challenge encountered in diverse areas of engineering applications involves speculating the governing equations based upon partial observations. On this basis, a variant of the sparse identification of nonlinear dynamics (SINDy) algorithm is developed. First, the Akaike information criterion (AIC) is integrated to enforce model selection by hierarchically ranking the most informative model from several manageable candidate models. This integration avoids restricting the number of candidate models, which is a disadvantage of the traditional methods for model selection. The subsequent procedure expands the structure of dynamics from ordinary differential equations (ODEs) to partial differential equations (PDEs), while group sparsity is employed to identify the nonconstant coefficients of partial differential equations. Of practical consideration within an integrated frame is data processing, which tends to treat noise separate from signals and tends to parametrize the noise probability distribution. In particular, the coefficients of a species of canonical ODEs and PDEs, such as the Van der Pol, Rössler, Burgers' and Kuramoto-Sivashinsky equations, can be identified correctly with the introduction of noise. Furthermore, except for normal noise, the proposed approach is able to capture the distribution of uniform noise. In accordance with the results of the experiments, the computational speed is markedly advanced and possesses robustness.
Collapse
Affiliation(s)
- Xin Dong
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Yu-Long Bai
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Yani Lu
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| | - Manhong Fan
- College of Physics and Electrical Engineering, Northwest Normal University, Lanzhou, 730070 Gansu China
| |
Collapse
|
11
|
Ma H, Haluszczynski A, Prosperino D, Räth C. Identifying causality drivers and deriving governing equations of nonlinear complex systems. CHAOS (WOODBURY, N.Y.) 2022; 32:103128. [PMID: 36319303 DOI: 10.1063/5.0102250] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/08/2022] [Accepted: 10/01/2022] [Indexed: 06/16/2023]
Abstract
Identifying and describing the dynamics of complex systems is a central challenge in various areas of science, such as physics, finance, or climatology. While machine learning algorithms are increasingly overtaking traditional approaches, their inner workings and, thus, the drivers of causality remain elusive. In this paper, we analyze the causal structure of chaotic systems using Fourier transform surrogates and three different inference techniques: While we confirm that Granger causality is exclusively able to detect linear causality, transfer entropy and convergent cross-mapping indicate that causality is determined to a significant extent by nonlinear properties. For the Lorenz and Halvorsen systems, we find that their contribution is independent of the strength of the nonlinear coupling. Furthermore, we show that a simple rationale and calibration algorithm are sufficient to extract the governing equations directly from the causal structure of the data. Finally, we illustrate the applicability of the framework to real-world dynamical systems using financial data before and after the COVID-19 outbreak. It turns out that the pandemic triggered a fundamental rupture in the world economy, which is reflected in the causal structure and the resulting equations.
Collapse
Affiliation(s)
- Haochun Ma
- Ludwig-Maximilians-Universität München, Department of Physics, Schellingstraße 4, 80799 Munich, Germany
| | | | - Davide Prosperino
- Allianz Global Investors, risklab, Seidlstraße 24, 80335 Munich, Germany
| | - Christoph Räth
- Deutsches Zentrum für Luft- und Raumfahrt (DLR), Institut für KI Sicherheit, Wilhelm-Runge-Straße 10, 89081 Ulm, Germany
| |
Collapse
|
12
|
Cárdenas SD, Reznik CJ, Ranaweera R, Song F, Chung CH, Fertig EJ, Gevertz JL. Model-informed experimental design recommendations for distinguishing intrinsic and acquired targeted therapeutic resistance in head and neck cancer. NPJ Syst Biol Appl 2022; 8:32. [PMID: 36075912 PMCID: PMC9458753 DOI: 10.1038/s41540-022-00244-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2022] [Accepted: 08/05/2022] [Indexed: 11/09/2022] Open
Abstract
The promise of precision medicine has been limited by the pervasive resistance to many targeted therapies for cancer. Inferring the timing (i.e., pre-existing or acquired) and mechanism (i.e., drug-induced) of such resistance is crucial for designing effective new therapeutics. This paper studies cetuximab resistance in head and neck squamous cell carcinoma (HNSCC) using tumor volume data obtained from patient-derived tumor xenografts. We ask if resistance mechanisms can be determined from this data alone, and if not, what data would be needed to deduce the underlying mode(s) of resistance. To answer these questions, we propose a family of mathematical models, with each member of the family assuming a different timing and mechanism of resistance. We present a method for fitting these models to individual volumetric data, and utilize model selection and parameter sensitivity analyses to ask: which member(s) of the family of models best describes HNSCC response to cetuximab, and what does that tell us about the timing and mechanisms driving resistance? We find that along with time-course volumetric data to a single dose of cetuximab, the initial resistance fraction and, in some instances, dose escalation volumetric data are required to distinguish among the family of models and thereby infer the mechanisms of resistance. These findings can inform future experimental design so that we can best leverage the synergy of wet laboratory experimentation and mathematical modeling in the study of novel targeted cancer therapeutics.
Collapse
Affiliation(s)
- Santiago D Cárdenas
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA
| | - Constance J Reznik
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA
- Datacor, Inc., Florham Park, NJ, USA
| | - Ruchira Ranaweera
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Feifei Song
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Christine H Chung
- Department of Head and Neck-Endocrine Oncology, Moffitt Cancer Center, Tampa, FL, USA
| | - Elana J Fertig
- Convergence Institute, Department of Oncology, Department of Biomedical Engineering, Department of Applied Mathematics and Statistics, Johns Hopkins University, Baltimore, MD, USA.
| | - Jana L Gevertz
- Department of Mathematics and Statistics, The College of New Jersey, Ewing, NJ, USA.
| |
Collapse
|
13
|
Yu W, Cho J, Burton JC. Extracting forces from noisy dynamics in dusty plasmas. Phys Rev E 2022; 106:035303. [PMID: 36266844 DOI: 10.1103/physreve.106.035303] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 08/26/2022] [Indexed: 06/16/2023]
Abstract
Extracting environmental forces from noisy data is a common yet challenging task in complex physical systems. Machine learning (ML) represents a robust approach to this problem, yet is mostly tested on simulated data with known parameters. Here we use supervised ML to extract the electrostatic, dissipative, and stochastic forces acting on micron-sized charged particles levitated in an argon plasma (dusty plasma). By tracking the subpixel motion of particles in subsequent images, we successfully estimated these forces from their random motion. The experiments contained important sources of non-Gaussian noise, such as drift and pixel locking, representing a data mismatch from methods used to analyze simulated data with purely Gaussian noise. Our model was trained on simulated particle trajectories that included all of these artifacts, and used more than 100 dynamical and statistical features, resulting in a prediction with 50% better accuracy than conventional methods. Finally, in systems with two interacting particles, the model provided noncontact measurements of the particle charge and Debye length in the plasma environment.
Collapse
Affiliation(s)
- Wentao Yu
- Department of Physics, Emory University, Atlanta, Georgia 30322, USA
| | - Jonathan Cho
- Department of Physics, Emory University, Atlanta, Georgia 30322, USA
| | - Justin C Burton
- Department of Physics, Emory University, Atlanta, Georgia 30322, USA
| |
Collapse
|
14
|
Yuan AE, Shou W. Data-driven causal analysis of observational biological time series. eLife 2022; 11:e72518. [PMID: 35983746 PMCID: PMC9391047 DOI: 10.7554/elife.72518] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Accepted: 01/23/2022] [Indexed: 11/28/2022] Open
Abstract
Complex systems are challenging to understand, especially when they defy manipulative experiments for practical or ethical reasons. Several fields have developed parallel approaches to infer causal relations from observational time series. Yet, these methods are easy to misunderstand and often controversial. Here, we provide an accessible and critical review of three statistical causal discovery approaches (pairwise correlation, Granger causality, and state space reconstruction), using examples inspired by ecological processes. For each approach, we ask what it tests for, what causal statement it might imply, and when it could lead us astray. We devise new ways of visualizing key concepts, describe some novel pathologies of existing methods, and point out how so-called 'model-free' causality tests are not assumption-free. We hope that our synthesis will facilitate thoughtful application of methods, promote communication across different fields, and encourage explicit statements of assumptions. A video walkthrough is available (Video 1 or https://youtu.be/AIV0ttQrjK8).
Collapse
Affiliation(s)
- Alex Eric Yuan
- Molecular and Cellular Biology PhD program, University of WashingtonSeattleUnited States
- Basic Sciences Division, Fred Hutchinson Cancer Research CenterSeattleUnited States
| | - Wenying Shou
- Centre for Life’s Origins and Evolution, Department of Genetics, Evolution and Environment, University College LondonLondonUnited Kingdom
| |
Collapse
|
15
|
Nicoletti G, Maritan A, Busiello DM. Information-driven transitions in projections of underdamped dynamics. Phys Rev E 2022; 106:014118. [PMID: 35974569 DOI: 10.1103/physreve.106.014118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2022] [Accepted: 07/05/2022] [Indexed: 06/15/2023]
Abstract
Low-dimensional representations of underdamped systems often provide useful insights and analytical tractability. Here, we build such representations via information projections, obtaining an optimal model that captures the most information on observed spatial trajectories. We show that, in paradigmatic systems, the minimization of the information loss drives the appearance of a discontinuous transition in the optimal model parameters. Our results raise serious warnings for general inference approaches, and they unravel fundamental properties of effective dynamical representations impacting several fields, from biophysics to dimensionality reduction.
Collapse
Affiliation(s)
- Giorgio Nicoletti
- Department of Physics and Astronomy "G. Galilei," University of Padova, Padova, Italy
| | - Amos Maritan
- Department of Physics and Astronomy "G. Galilei," University of Padova, Padova, Italy
| | - Daniel Maria Busiello
- Institute of Physics, École Polytechnique Fédérale de Lausanne-EPFL, 1015 Lausanne, Switzerland
| |
Collapse
|
16
|
Chen B, Huang K, Raghupathi S, Chandratreya I, Du Q, Lipson H. Automated discovery of fundamental variables hidden in experimental data. NATURE COMPUTATIONAL SCIENCE 2022; 2:433-442. [PMID: 38177869 DOI: 10.1038/s43588-022-00281-6] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Accepted: 06/21/2022] [Indexed: 01/06/2024]
Abstract
All physical laws are described as mathematical relationships between state variables. These variables give a complete and non-redundant description of the relevant system. However, despite the prevalence of computing power and artificial intelligence, the process of identifying the hidden state variables themselves has resisted automation. Most data-driven methods for modelling physical phenomena still rely on the assumption that the relevant state variables are already known. A longstanding question is whether it is possible to identify state variables from only high-dimensional observational data. Here we propose a principle for determining how many state variables an observed system is likely to have, and what these variables might be. We demonstrate the effectiveness of this approach using video recordings of a variety of physical dynamical systems, ranging from elastic double pendulums to fire flames. Without any prior knowledge of the underlying physics, our algorithm discovers the intrinsic dimension of the observed dynamics and identifies candidate sets of state variables.
Collapse
Affiliation(s)
- Boyuan Chen
- Department of Computer Science, Columbia University, New York, USA.
| | - Kuang Huang
- Department of Applied Physics and Applied Mathematics, Columbia University, New York, USA
| | - Sunand Raghupathi
- Department of Applied Physics and Applied Mathematics, Columbia University, New York, USA
| | | | - Qiang Du
- Department of Applied Physics and Applied Mathematics, Columbia University, New York, USA
- Data Science Institute, Columbia University, New York, USA
| | - Hod Lipson
- Data Science Institute, Columbia University, New York, USA
- Department of Mechanical Engineering, Columbia University, New York, USA
| |
Collapse
|
17
|
Latent space of a small genetic network: Geometry of dynamics and information. Proc Natl Acad Sci U S A 2022; 119:e2113651119. [PMID: 35737842 PMCID: PMC9245618 DOI: 10.1073/pnas.2113651119] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
The high-dimensional character of most biological systems presents genuine challenges for modeling and prediction. Here we propose a neural network-based approach for dimensionality reduction and analysis of biological gene expression data, using, as a case study, a well-known genetic network in the early Drosophila embryo, the gap gene patterning system. We build an autoencoder compressing the dynamics of spatial gap gene expression into a two-dimensional (2D) latent map. The resulting 2D dynamics suggests an almost linear model, with a small bare set of essential interactions. Maternally defined spatial modes control gap genes positioning, without the classically assumed intricate set of repressive gap gene interactions. This, surprisingly, predicts minimal changes of neighboring gap domains when knocking out gap genes, consistent with previous observations. Latent space geometries in maternal mutants are also consistent with the existence of such spatial modes. Finally, we show how positional information is well defined and interpretable as a polar angle in latent space. Our work illustrates how optimization of small neural networks on medium-sized biological datasets is sufficiently informative to capture essential underlying mechanisms of network function.
Collapse
|
18
|
Ribera H, Shirman S, Nguyen AV, Mangan NM. Model selection of chaotic systems from data with hidden variables using sparse data assimilation. CHAOS (WOODBURY, N.Y.) 2022; 32:063101. [PMID: 35778121 DOI: 10.1063/5.0066066] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 05/06/2022] [Indexed: 06/15/2023]
Abstract
Many natural systems exhibit chaotic behavior, including the weather, hydrology, neuroscience, and population dynamics. Although many chaotic systems can be described by relatively simple dynamical equations, characterizing these systems can be challenging due to sensitivity to initial conditions and difficulties in differentiating chaotic behavior from noise. Ideally, one wishes to find a parsimonious set of equations that describe a dynamical system. However, model selection is more challenging when only a subset of the variables are experimentally accessible. Manifold learning methods using time-delay embeddings can successfully reconstruct the underlying structure of the system from data with hidden variables, but not the equations. Recent work in sparse-optimization based model selection has enabled model discovery given a library of possible terms, but regression-based methods require measurements of all state variables. We present a method combining variational annealing-a technique previously used for parameter estimation in chaotic systems with hidden variables-with sparse-optimization methods to perform model identification for chaotic systems with unmeasured variables. We applied the method to ground-truth time-series simulated from the classic Lorenz system and experimental data from an electrical circuit with Lorenz-system like behavior. In both cases, we successfully recover the expected equations with two measured and one hidden variable. Application to simulated data from the Colpitts oscillator demonstrates successful model selection of terms within nonlinear functions. We discuss the robustness of our method to varying noise.
Collapse
Affiliation(s)
- H Ribera
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - S Shirman
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - A V Nguyen
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - N M Mangan
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| |
Collapse
|
19
|
Achar SR, Bourassa FXP, Rademaker TJ, Lee A, Kondo T, Salazar-Cavazos E, Davies JS, Taylor N, François P, Altan-Bonnet G. Universal antigen encoding of T cell activation from high-dimensional cytokine dynamics. Science 2022; 376:880-884. [PMID: 35587980 DOI: 10.1126/science.abl5311] [Citation(s) in RCA: 33] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Systems immunology lacks a framework with which to derive theoretical understanding from high-dimensional datasets. We combined a robotic platform with machine learning to experimentally measure and theoretically model CD8+ T cell activation. High-dimensional cytokine dynamics could be compressed onto a low-dimensional latent space in an antigen-specific manner (so-called "antigen encoding"). We used antigen encoding to model and reconstruct patterns of T cell immune activation. The model delineated six classes of antigens eliciting distinct T cell responses. We generalized antigen encoding to multiple immune settings, including drug perturbations and activation of chimeric antigen receptor T cells. Such universal antigen encoding for T cell activation may enable further modeling of immune responses and their rational manipulation to optimize immunotherapies.
Collapse
Affiliation(s)
- Sooraj R Achar
- Immunodynamics Group, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | | | | | - Angela Lee
- Immunodynamics Group, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Taisuke Kondo
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Emanuel Salazar-Cavazos
- Immunodynamics Group, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - John S Davies
- Genitourinary Malignancies Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Naomi Taylor
- Pediatric Oncology Branch, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| | - Paul François
- Department of Physics, McGill University, Montréal, Québec, Canada
| | - Grégoire Altan-Bonnet
- Immunodynamics Group, Laboratory of Integrative Cancer Immunology, Center for Cancer Research, National Cancer Institute, Bethesda, MD, USA
| |
Collapse
|
20
|
Fasel U, Kutz JN, Brunton BW, Brunton SL. Ensemble-SINDy: Robust sparse model discovery in the low-data, high-noise limit, with active learning and control. Proc Math Phys Eng Sci 2022; 478:20210904. [PMID: 35450025 PMCID: PMC9006119 DOI: 10.1098/rspa.2021.0904] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Accepted: 03/10/2022] [Indexed: 12/17/2022] Open
Abstract
Sparse model identification enables the discovery of nonlinear dynamical systems purely from data; however, this approach is sensitive to noise, especially in the low-data limit. In this work, we leverage the statistical approach of bootstrap aggregating (bagging) to robustify the sparse identification of the nonlinear dynamics (SINDy) algorithm. First, an ensemble of SINDy models is identified from subsets of limited and noisy data. The aggregate model statistics are then used to produce inclusion probabilities of the candidate functions, which enables uncertainty quantification and probabilistic forecasts. We apply this ensemble-SINDy (E-SINDy) algorithm to several synthetic and real-world datasets and demonstrate substantial improvements to the accuracy and robustness of model discovery from extremely noisy and limited data. For example, E-SINDy uncovers partial differential equations models from data with more than twice as much measurement noise as has been previously reported. Similarly, E-SINDy learns the Lotka Volterra dynamics from remarkably limited data of yearly lynx and hare pelts collected from 1900 to 1920. E-SINDy is computationally efficient, with similar scaling as standard SINDy. Finally, we show that ensemble statistics from E-SINDy can be exploited for active learning and improved model predictive control.
Collapse
Affiliation(s)
- U. Fasel
- Department of Mechanical Engineering, University of Washington, Seattle, WA, USA
| | - J. N. Kutz
- Department of Applied Mathematics, University of Washington, Seattle, WA, USA
| | - B. W. Brunton
- Department of Biology, University of Washington, Seattle, WA, USA
| | - S. L. Brunton
- Department of Mechanical Engineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
21
|
Kaheman K, Brunton SL, Nathan Kutz J. Automatic differentiation to simultaneously identify nonlinear dynamics and extract noise probability distributions from data. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2022. [DOI: 10.1088/2632-2153/ac567a] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
The sparse identification of nonlinear dynamics (SINDy) is a regression framework for the discovery of parsimonious dynamic models and governing equations from time-series data. As with all system identification methods, noisy measurements compromise the accuracy and robustness of the model discovery procedure. In this work we develop a variant of the SINDy algorithm that integrates automatic differentiation and recent time-stepping constrained motivated by Rudy et al (2019 J. Computat. Phys.
396 483–506) for simultaneously (1) denoising the data, (2) learning and parametrizing the noise probability distribution, and (3) identifying the underlying parsimonious dynamical system responsible for generating the time-series data. Thus within an integrated optimization framework, noise can be separated from signal, resulting in an architecture that is approximately twice as robust to noise as state-of-the-art methods, handling as much as 40% noise on a given time-series signal and explicitly parametrizing the noise probability distribution. We demonstrate this approach on several numerical examples, from Lotka-Volterra models to the spatio-temporal Lorenz 96 model. Further, we show the method can learn a diversity of probability distributions for the measurement noise, including Gaussian, uniform, Gamma, and Rayleigh distributions.
Collapse
|
22
|
Genomic structure predicts metabolite dynamics in microbial communities. Cell 2022; 185:530-546.e25. [PMID: 35085485 DOI: 10.1016/j.cell.2021.12.036] [Citation(s) in RCA: 59] [Impact Index Per Article: 19.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2021] [Revised: 07/16/2021] [Accepted: 12/21/2021] [Indexed: 12/21/2022]
Abstract
The metabolic activities of microbial communities play a defining role in the evolution and persistence of life on Earth, driving redox reactions that give rise to global biogeochemical cycles. Community metabolism emerges from a hierarchy of processes, including gene expression, ecological interactions, and environmental factors. In wild communities, gene content is correlated with environmental context, but predicting metabolite dynamics from genomes remains elusive. Here, we show, for the process of denitrification, that metabolite dynamics of a community are predictable from the genes each member of the community possesses. A simple linear regression reveals a sparse and generalizable mapping from gene content to metabolite dynamics for genomically diverse bacteria. A consumer-resource model correctly predicts community metabolite dynamics from single-strain phenotypes. Our results demonstrate that the conserved impacts of metabolic genes can predict community metabolite dynamics, enabling the prediction of metabolite dynamics from metagenomes, designing denitrifying communities, and discovering how genome evolution impacts metabolism.
Collapse
|
23
|
Mamakoukas G, Castano ML, Tan X, Murphey TD. Derivative-Based Koopman Operators for Real-Time Control of Robotic Systems. IEEE T ROBOT 2021. [DOI: 10.1109/tro.2021.3076581] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
24
|
Chen Z, Liu Y, Sun H. Physics-informed learning of governing equations from scarce data. Nat Commun 2021; 12:6136. [PMID: 34675223 PMCID: PMC8531004 DOI: 10.1038/s41467-021-26434-1] [Citation(s) in RCA: 41] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Accepted: 10/04/2021] [Indexed: 11/23/2022] Open
Abstract
Harnessing data to discover the underlying governing laws or equations that describe the behavior of complex physical systems can significantly advance our modeling, simulation and understanding of such systems in various science and engineering disciplines. This work introduces a novel approach called physics-informed neural network with sparse regression to discover governing partial differential equations from scarce and noisy data for nonlinear spatiotemporal systems. In particular, this discovery approach seamlessly integrates the strengths of deep neural networks for rich representation learning, physics embedding, automatic differentiation and sparse regression to approximate the solution of system variables, compute essential derivatives, as well as identify the key derivative terms and parameters that form the structure and explicit expression of the equations. The efficacy and robustness of this method are demonstrated, both numerically and experimentally, on discovering a variety of partial differential equation systems with different levels of data scarcity and noise accounting for different initial/boundary conditions. The resulting computational framework shows the potential for closed-form model discovery in practical applications where large and accurate datasets are intractable to capture.
Collapse
Affiliation(s)
- Zhao Chen
- grid.261112.70000 0001 2173 3359Department of Civil and Environmental Engineering, Northeastern University, Boston, MA 02115 USA
| | - Yang Liu
- grid.261112.70000 0001 2173 3359Department of Mechanical and Industrial Engineering, Northeastern University, Boston, MA 02115 USA
| | - Hao Sun
- grid.24539.390000 0004 0368 8103Gaoling School of Artificial Intelligence, Renmin University of China, 100872 Beijing, China ,grid.24539.390000 0004 0368 8103Beijing Key Laboratory of Big Data Management and Analysis Methods, 100872 Beijing, China ,grid.116068.80000 0001 2341 2786Department of Civil and Environmental Engineering, MIT, Cambridge, MA 02139 USA
| |
Collapse
|
25
|
Genkin M, Hughes O, Engel TA. Learning non-stationary Langevin dynamics from stochastic observations of latent trajectories. Nat Commun 2021; 12:5986. [PMID: 34645828 PMCID: PMC8514604 DOI: 10.1038/s41467-021-26202-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 09/22/2021] [Indexed: 11/09/2022] Open
Abstract
Many complex systems operating far from the equilibrium exhibit stochastic dynamics that can be described by a Langevin equation. Inferring Langevin equations from data can reveal how transient dynamics of such systems give rise to their function. However, dynamics are often inaccessible directly and can be only gleaned through a stochastic observation process, which makes the inference challenging. Here we present a non-parametric framework for inferring the Langevin equation, which explicitly models the stochastic observation process and non-stationary latent dynamics. The framework accounts for the non-equilibrium initial and final states of the observed system and for the possibility that the system's dynamics define the duration of observations. Omitting any of these non-stationary components results in incorrect inference, in which erroneous features arise in the dynamics due to non-stationary data distribution. We illustrate the framework using models of neural dynamics underlying decision making in the brain.
Collapse
Affiliation(s)
- Mikhail Genkin
- Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA
| | | | | |
Collapse
|
26
|
Fernex D, Noack BR, Semaan R. Cluster-based network modeling-From snapshots to complex dynamical systems. SCIENCE ADVANCES 2021; 7:7/25/eabf5006. [PMID: 34134987 PMCID: PMC8208726 DOI: 10.1126/sciadv.abf5006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/30/2020] [Accepted: 05/03/2021] [Indexed: 06/12/2023]
Abstract
We propose a universal method for data-driven modeling of complex nonlinear dynamics from time-resolved snapshot data without prior knowledge. Complex nonlinear dynamics govern many fields of science and engineering. Data-driven dynamic modeling often assumes a low-dimensional subspace or manifold for the state. We liberate ourselves from this assumption by proposing cluster-based network modeling (CNM) bridging machine learning, network science, and statistical physics. CNM describes short- and long-term behavior and is fully automatable, as it does not rely on application-specific knowledge. CNM is demonstrated for the Lorenz attractor, ECG heartbeat signals, Kolmogorov flow, and a high-dimensional actuated turbulent boundary layer. Even the notoriously difficult modeling benchmark of rare events in the Kolmogorov flow is solved. This automatable universal data-driven representation of complex nonlinear dynamics complements and expands network connectivity science and promises new fast-track avenues to understand, estimate, predict, and control complex systems in all scientific fields.
Collapse
Affiliation(s)
- Daniel Fernex
- Institut für Strömungsmechanik, Technische Universität Braunschweig, Hermann-Blenk-Str. 37, 38108 Braunschweig, Germany
| | - Bernd R Noack
- Center for Turbulence Control, Harbin Institute of Technology, Shenzhen 518058, People's Republic of China.
- Hermann-Föttinger-Institut, Technische Universität Berlin, Müller-Breslau-Str. 8, 10623 Berlin, Germany
| | - Richard Semaan
- Institut für Strömungsmechanik, Technische Universität Braunschweig, Hermann-Blenk-Str. 37, 38108 Braunschweig, Germany.
| |
Collapse
|
27
|
Udrescu SM, Tegmark M. Symbolic pregression: Discovering physical laws from distorted video. Phys Rev E 2021; 103:043307. [PMID: 34005960 DOI: 10.1103/physreve.103.043307] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 03/31/2021] [Indexed: 11/07/2022]
Abstract
We present a method for unsupervised learning of equations of motion for objects in raw and optionally distorted unlabeled synthetic video (or, more generally, for discovering and modeling predictable features in time-series data). We first train an autoencoder that maps each video frame into a low-dimensional latent space where the laws of motion are as simple as possible, by minimizing a combination of nonlinearity, acceleration, and prediction error. Differential equations describing the motion are then discovered using Pareto-optimal symbolic regression. We find that our pre-regression ("pregression") step is able to rediscover Cartesian coordinates of unlabeled moving objects even when the video is distorted by a generalized lens. Using intuition from multidimensional knot theory, we find that the pregression step is facilitated by first adding extra latent space dimensions to avoid topological problems during training and then removing these extra dimensions via principal component analysis. An inertial frame is autodiscovered by minimizing the combined equation complexity for multiple experiments.
Collapse
Affiliation(s)
- Silviu-Marian Udrescu
- Department of Physics, Institute for AI & Fundamental Interactions, and Center for Brains, Minds, & Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Max Tegmark
- Department of Physics, Institute for AI & Fundamental Interactions, and Center for Brains, Minds, & Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
28
|
Inferring phenomenological models of first passage processes. PLoS Comput Biol 2021; 17:e1008740. [PMID: 33667218 PMCID: PMC7968746 DOI: 10.1371/journal.pcbi.1008740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2020] [Revised: 03/17/2021] [Accepted: 01/25/2021] [Indexed: 11/19/2022] Open
Abstract
Biochemical processes in cells are governed by complex networks of many chemical species interacting stochastically in diverse ways and on different time scales. Constructing microscopically accurate models of such networks is often infeasible. Instead, here we propose a systematic framework for building phenomenological models of such networks from experimental data, focusing on accurately approximating the time it takes to complete the process, the First Passage (FP) time. Our phenomenological models are mixtures of Gamma distributions, which have a natural biophysical interpretation. The complexity of the models is adapted automatically to account for the amount of available data and its temporal resolution. The framework can be used for predicting behavior of FP systems under varying external conditions. To demonstrate the utility of the approach, we build models for the distribution of inter-spike intervals of a morphologically complex neuron, a Purkinje cell, from experimental and simulated data. We demonstrate that the developed models can not only fit the data, but also make nontrivial predictions. We demonstrate that our coarse-grained models provide constraints on more mechanistically accurate models of the involved phenomena.
Collapse
|
29
|
Su WH, Chou CS, Xiu D. Deep Learning of Biological Models from Data: Applications to ODE Models. Bull Math Biol 2021; 83:19. [PMID: 33452931 DOI: 10.1007/s11538-020-00851-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2020] [Accepted: 12/21/2020] [Indexed: 10/22/2022]
Abstract
Mathematical equations are often used to model biological processes. However, for many systems, determining analytically the underlying equations is highly challenging due to the complexity and unknown factors involved in the biological processes. In this work, we present a numerical procedure to discover dynamical physical laws behind biological data. The method utilizes deep learning methods based on neural networks, particularly residual networks. It is also based on recently developed mathematical tools of flow-map learning for dynamical systems. We demonstrate that with the proposed method, one can accurately construct numerical biological models for unknown governing equations behind measurement data. Moreover, the deep learning model can also incorporate unknown parameters in the biological process. A successfully trained deep neural network model can then be used as a predictive tool to produce system predictions of different settings and allows one to conduct detailed analysis of the underlying biological process. In this paper, we use three biological models-SEIR model, Morris-Lecar model and the Hodgkin-Huxley model-to show the capability of our proposed method.
Collapse
Affiliation(s)
- Wei-Hung Su
- Department of Mathematics, The Ohio State University, Columbus, OH, 43221, USA
| | - Ching-Shan Chou
- Department of Mathematics, The Ohio State University, Columbus, OH, 43221, USA
| | - Dongbin Xiu
- Department of Mathematics, The Ohio State University, Columbus, OH, 43221, USA.
| |
Collapse
|
30
|
Chvykov P, Hoel E. Causal Geometry. ENTROPY (BASEL, SWITZERLAND) 2020; 23:E24. [PMID: 33375321 PMCID: PMC7824647 DOI: 10.3390/e23010024] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/15/2020] [Revised: 12/04/2020] [Accepted: 12/21/2020] [Indexed: 11/16/2022]
Abstract
Information geometry has offered a way to formally study the efficacy of scientific models by quantifying the impact of model parameters on the predicted effects. However, there has been little formal investigation of causation in this framework, despite causal models being a fundamental part of science and explanation. Here, we introduce causal geometry, which formalizes not only how outcomes are impacted by parameters, but also how the parameters of a model can be intervened upon. Therefore, we introduce a geometric version of "effective information"-a known measure of the informativeness of a causal relationship. We show that it is given by the matching between the space of effects and the space of interventions, in the form of their geometric congruence. Therefore, given a fixed intervention capability, an effective causal model is one that is well matched to those interventions. This is a consequence of "causal emergence," wherein macroscopic causal relationships may carry more information than "fundamental" microscopic ones. We thus argue that a coarse-grained model may, paradoxically, be more informative than the microscopic one, especially when it better matches the scale of accessible interventions-as we illustrate on toy examples.
Collapse
Affiliation(s)
- Pavel Chvykov
- Physics of Living Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Erik Hoel
- Allen Discovery Center, Tufts University, Medford, MA 02155, USA;
| |
Collapse
|
31
|
Yang Y, Aziz Bhouri M, Perdikaris P. Bayesian differential programming for robust systems identification under uncertainty. Proc Math Phys Eng Sci 2020; 476:20200290. [PMID: 33362409 PMCID: PMC7735302 DOI: 10.1098/rspa.2020.0290] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2020] [Accepted: 10/23/2020] [Indexed: 11/12/2022] Open
Abstract
This paper presents a machine learning framework for Bayesian systems identification from noisy, sparse and irregular observations of nonlinear dynamical systems. The proposed method takes advantage of recent developments in differentiable programming to propagate gradient information through ordinary differential equation solvers and perform Bayesian inference with respect to unknown model parameters using Hamiltonian Monte Carlo sampling. This allows an efficient inference of the posterior distributions over plausible models with quantified uncertainty, while the use of sparsity-promoting priors enables the discovery of interpretable and parsimonious representations for the underlying latent dynamics. A series of numerical studies is presented to demonstrate the effectiveness of the proposed methods, including nonlinear oscillators, predator-prey systems and examples from systems biology. Taken together, our findings put forth a flexible and robust workflow for data-driven model discovery under uncertainty. All codes and data accompanying this article are available at https://bit.ly/34FOJMj.
Collapse
Affiliation(s)
| | | | - Paris Perdikaris
- Department of Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
32
|
van Breugel F, Kutz JN, Brunton BW. Numerical differentiation of noisy data: A unifying multi-objective optimization framework. IEEE ACCESS : PRACTICAL INNOVATIONS, OPEN SOLUTIONS 2020; 8:196865-196877. [PMID: 33623728 PMCID: PMC7899139 DOI: 10.1109/access.2020.3034077] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Computing derivatives of noisy measurement data is ubiquitous in the physical, engineering, and biological sciences, and it is often a critical step in developing dynamic models or designing control. Unfortunately, the mathematical formulation of numerical differentiation is typically ill-posed, and researchers often resort to an ad hoc process for choosing one of many computational methods and its parameters. In this work, we take a principled approach and propose a multi-objective optimization framework for choosing parameters that minimize a loss function to balance the faithfulness and smoothness of the derivative estimate. Our framework has three significant advantages. First, the task of selecting multiple parameters is reduced to choosing a single hyper-parameter. Second, where ground-truth data is unknown, we provide a heuristic for selecting this hyper-parameter based on the power spectrum and temporal resolution of the data. Third, the optimal value of the hyper-parameter is consistent across different differentiation methods, thus our approach unifies vastly different numerical differentiation methods and facilitates unbiased comparison of their results. Finally, we provide an extensive open-source Python library pynumdiff to facilitate easy application to diverse datasets (https://github.com/florisvb/PyNumDiff).
Collapse
Affiliation(s)
- Floris van Breugel
- Department of Mechanical Engineering, University of Nevada, Reno, NV 89557
| | - J. Nathan Kutz
- Department of Applied Math, University of Washington, Seattle, WA, 98195, USA
| | | |
Collapse
|
33
|
Genkin M, Engel TA. Moving beyond generalization to accurate interpretation of flexible models. NAT MACH INTELL 2020; 2:674-683. [DOI: 10.1038/s42256-020-00242-6] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022]
|
34
|
Kaheman K, Kutz JN, Brunton SL. SINDy-PI: a robust algorithm for parallel implicit sparse identification of nonlinear dynamics. Proc Math Phys Eng Sci 2020; 476:20200279. [PMID: 33214760 PMCID: PMC7655768 DOI: 10.1098/rspa.2020.0279] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2020] [Accepted: 09/10/2020] [Indexed: 12/15/2022] Open
Abstract
Accurately modelling the nonlinear dynamics of a system from measurement data is a challenging yet vital topic. The sparse identification of nonlinear dynamics (SINDy) algorithm is one approach to discover dynamical systems models from data. Although extensions have been developed to identify implicit dynamics, or dynamics described by rational functions, these extensions are extremely sensitive to noise. In this work, we develop SINDy-PI (parallel, implicit), a robust variant of the SINDy algorithm to identify implicit dynamics and rational nonlinearities. The SINDy-PI framework includes multiple optimization algorithms and a principled approach to model selection. We demonstrate the ability of this algorithm to learn implicit ordinary and partial differential equations and conservation laws from limited and noisy data. In particular, we show that the proposed approach is several orders of magnitude more noise robust than previous approaches, and may be used to identify a class of ODE and PDE dynamics that were previously unattainable with SINDy, including for the double pendulum dynamics and simplified model for the Belousov-Zhabotinsky (BZ) reaction.
Collapse
Affiliation(s)
- Kadierdan Kaheman
- Department of Mechanical Engineering, University of Washington, Seattle, WA 98195, USA
| | - J Nathan Kutz
- Department of Applied Mathematics, University of Washington, Seattle, WA 98195, USA
| | - Steven L Brunton
- Department of Mechanical Engineering, University of Washington, Seattle, WA 98195, USA
| |
Collapse
|
35
|
Johnson JK, Geng S, Hoffman MW, Adesnik H, Wessel R. Precision multidimensional neural population code recovered from single intracellular recordings. Sci Rep 2020; 10:15997. [PMID: 32994474 PMCID: PMC7524839 DOI: 10.1038/s41598-020-72936-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2020] [Accepted: 08/20/2020] [Indexed: 11/08/2022] Open
Abstract
Neurons in sensory cortices are more naturally and deeply integrated than any current neural population recording tools (e.g. electrode arrays, fluorescence imaging). Two concepts facilitate efforts to observe population neural code with single-cell recordings. First, even the highest quality single-cell recording studies find a fraction of the stimulus information in high-dimensional population recordings. Finding any of this missing information provides proof of principle. Second, neurons and neural populations are understood as coupled nonlinear differential equations. Therefore, fitted ordinary differential equations provide a basis for single-trial single-cell stimulus decoding. We obtained intracellular recordings of fluctuating transmembrane current and potential in mouse visual cortex during stimulation with drifting gratings. We use mean deflection from baseline when comparing to prior single-cell studies because action potentials are too sparse and the deflection response to drifting grating stimuli (e.g. tuning curves) are well studied. Equation-based decoders allowed more precise single-trial stimulus discrimination than tuning-curve-base decoders. Performance varied across recorded signal types in a manner consistent with population recording studies and both classification bases evinced distinct stimulus-evoked phases of population dynamics, providing further corroboration. Naturally and deeply integrated observations of population dynamics would be invaluable. We offer proof of principle and a versatile framework.
Collapse
Affiliation(s)
| | | | | | | | - Ralf Wessel
- Washington University in St. Louis, St. Louis, USA
| |
Collapse
|
36
|
Yang W, Peng L, Zhu Y, Hong L. When machine learning meets multiscale modeling in chemical reactions. J Chem Phys 2020; 153:094117. [DOI: 10.1063/5.0015779] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Affiliation(s)
- Wuyue Yang
- Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, People’s Republic of China
| | - Liangrong Peng
- College of Mathematics and Data Science, Minjiang University, Fuzhou 350108, People’s Republic of China
| | - Yi Zhu
- Yau Mathematical Sciences Center, Tsinghua University, Beijing 100084, People’s Republic of China
| | - Liu Hong
- School of Mathematics, Sun Yat-sen University, Guangzhou 510275, People’s Republic of China
| |
Collapse
|
37
|
Brückner DB, Ronceray P, Broedersz CP. Inferring the Dynamics of Underdamped Stochastic Systems. PHYSICAL REVIEW LETTERS 2020; 125:058103. [PMID: 32794851 DOI: 10.1103/physrevlett.125.058103] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/26/2020] [Accepted: 06/24/2020] [Indexed: 06/11/2023]
Abstract
Many complex systems, ranging from migrating cells to animal groups, exhibit stochastic dynamics described by the underdamped Langevin equation. Inferring such an equation of motion from experimental data can provide profound insight into the physical laws governing the system. Here, we derive a principled framework to infer the dynamics of underdamped stochastic systems from realistic experimental trajectories, sampled at discrete times and subject to measurement errors. This framework yields an operational method, Underdamped Langevin Inference, which performs well on experimental trajectories of single migrating cells and in complex high-dimensional systems, including flocks with Viscek-like alignment interactions. Our method is robust to experimental measurement errors, and includes a self-consistent estimate of the inference error.
Collapse
Affiliation(s)
- David B Brückner
- Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Department of Physics, Ludwig-Maximilian-University Munich, Theresienstr. 37, D-80333 Munich, Germany
| | - Pierre Ronceray
- Center for the Physics of Biological Function, Princeton University, Princeton, New Jersey 08544, USA
| | - Chase P Broedersz
- Arnold Sommerfeld Center for Theoretical Physics and Center for NanoScience, Department of Physics, Ludwig-Maximilian-University Munich, Theresienstr. 37, D-80333 Munich, Germany
- Department of Physics and Astronomy, Vrije Universiteit Amsterdam, 1081 HV Amsterdam, The Netherlands
| |
Collapse
|
38
|
Horrocks J, Bauch CT. Algorithmic discovery of dynamic models from infectious disease data. Sci Rep 2020; 10:7061. [PMID: 32341374 PMCID: PMC7184751 DOI: 10.1038/s41598-020-63877-w] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Accepted: 04/07/2020] [Indexed: 11/09/2022] Open
Abstract
Theoretical models are typically developed through a deductive process where a researcher formulates a system of dynamic equations from hypothesized mechanisms. Recent advances in algorithmic methods can discover dynamic models inductively-directly from data. Most previous research has tested these methods by rediscovering models from synthetic data generated by the already known model. Here we apply Sparse Identification of Nonlinear Dynamics (SINDy) to discover mechanistic equations for disease dynamics from case notification data for measles, chickenpox, and rubella. The discovered models provide a good qualitative fit to the observed dynamics for all three diseases, However, the SINDy chickenpox model appears to overfit the empirical data, and recovering qualitatively correct rubella dynamics requires using power spectral density in the goodness-of-fit criterion. When SINDy uses a library of second-order functions, the discovered models tend to include mass action incidence and a seasonally varying transmission rate-a common feature of existing epidemiological models for childhood infectious diseases. We also find that the SINDy measles model is capable of out-of-sample prediction of a dynamical regime shift in measles case notification data. These results demonstrate the potential for algorithmic model discovery to enrich scientific understanding by providing a complementary approach to developing theoretical models.
Collapse
Affiliation(s)
- Jonathan Horrocks
- Department of Applied Mathematics, University of Waterloo, Waterloo, N2L 3G1, Canada
| | - Chris T Bauch
- Department of Applied Mathematics, University of Waterloo, Waterloo, N2L 3G1, Canada.
| |
Collapse
|
39
|
Mujica-Parodi LR, Strey HH. Making Sense of Computational Psychiatry. Int J Neuropsychopharmacol 2020; 23:339-347. [PMID: 32219396 PMCID: PMC7251632 DOI: 10.1093/ijnp/pyaa013] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 02/24/2020] [Indexed: 12/26/2022] Open
Abstract
In psychiatry we often speak of constructing "models." Here we try to make sense of what such a claim might mean, starting with the most fundamental question: "What is (and isn't) a model?" We then discuss, in a concrete measurable sense, what it means for a model to be useful. In so doing, we first identify the added value that a computational model can provide in the context of accuracy and power. We then present limitations of standard statistical methods and provide suggestions for how we can expand the explanatory power of our analyses by reconceptualizing statistical models as dynamical systems. Finally, we address the problem of model building-suggesting ways in which computational psychiatry can escape the potential for cognitive biases imposed by classical hypothesis-driven research, exploiting deep systems-level information contained within neuroimaging data to advance our understanding of psychiatric neuroscience.
Collapse
Affiliation(s)
- Lilianne R Mujica-Parodi
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, New York,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York,Correspondence: Lilianne R. Mujica-Parodi, PhD, Director, Laboratory for Computational Neurodiagnostics, Professor, Department of Biomedical Engineering, Renaissance School of Medicine, Stony Brook, NY 11794-5281 () or Helmut H. Strey, PhD, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794-5281 ()
| | - Helmut H Strey
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, New York,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York,Correspondence: Lilianne R. Mujica-Parodi, PhD, Director, Laboratory for Computational Neurodiagnostics, Professor, Department of Biomedical Engineering, Renaissance School of Medicine, Stony Brook, NY 11794-5281 () or Helmut H. Strey, PhD, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794-5281 ()
| |
Collapse
|
40
|
Favela LH. Cognitive science as complexity science. WILEY INTERDISCIPLINARY REVIEWS. COGNITIVE SCIENCE 2020; 11:e1525. [PMID: 32043728 DOI: 10.1002/wcs.1525] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Revised: 01/02/2020] [Accepted: 01/17/2020] [Indexed: 11/06/2022]
Abstract
It is uncontroversial to claim that cognitive science studies many complex phenomena. What is less acknowledged are the contradictions among many traditional commitments of its investigative approaches and the nature of cognitive systems. Consider, for example, methodological tensions that arise due to the fact that like most natural systems, cognitive systems are nonlinear; and yet, traditionally cognitive science has relied on linear statistical data analyses. Cognitive science as complexity science is offered as an interdisciplinary framework for the investigation of cognition that can dissolve such contradictions and tensions. Here, cognition is treated as exhibiting the following four key features: emergence, nonlinearity, self-organization, and universality. This framework integrates concepts, methods, and theories from such disciplines as systems theory, nonlinear dynamical systems theory, and synergetics. By adopting this approach, the cognitive sciences benefit from a common set of practices to investigate, explain, and understand cognition in its varied and complex forms. This article is categorized under: Computer Science > Neural Networks Psychology > Theory and Methods Philosophy > Foundations of Cognitive Science Neuroscience > Cognition.
Collapse
Affiliation(s)
- Luis H Favela
- Department of Philosophy and Cognitive Sciences Program, University of Central Florida, Orlando, Florida
| |
Collapse
|
41
|
Numerical Parameter Space Compression and Its Application to Biophysical Models. Biophys J 2020; 118:1455-1465. [PMID: 32070477 DOI: 10.1016/j.bpj.2020.01.023] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2019] [Revised: 12/07/2019] [Accepted: 01/14/2020] [Indexed: 11/20/2022] Open
Abstract
Physical models of biological systems can become difficult to interpret when they have a large number of parameters. But the models themselves actually depend on (i.e., are sensitive to) only a subset of those parameters. This phenomenon is due to parameter space compression (PSC), in which a subset of parameters emerges as "stiff" as a function of time or space. PSC has only been used to explain analytically solvable physics models. We have generalized this result by developing a numerical approach to PSC that can be applied to any computational model. We validated our method against analytically solvable models of a random walk with drift and protein production and degradation. We then applied our method to a simple computational model of microtubule dynamic instability. We propose that numerical PSC has the potential to identify the low-dimensional structure of many computational models in biophysics. The low-dimensional structure of a model is easier to interpret and identifies the mechanisms and experiments that best characterize the system.
Collapse
|
42
|
Iten R, Metger T, Wilming H, Del Rio L, Renner R. Discovering Physical Concepts with Neural Networks. PHYSICAL REVIEW LETTERS 2020; 124:010508. [PMID: 31976717 DOI: 10.1103/physrevlett.124.010508] [Citation(s) in RCA: 84] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Indexed: 05/10/2023]
Abstract
Despite the success of neural networks at solving concrete physics problems, their use as a general-purpose tool for scientific discovery is still in its infancy. Here, we approach this problem by modeling a neural network architecture after the human physical reasoning process, which has similarities to representation learning. This allows us to make progress towards the long-term goal of machine-assisted scientific discovery from experimental data without making prior assumptions about the system. We apply this method to toy examples and show that the network finds the physically relevant parameters, exploits conservation laws to make predictions, and can help to gain conceptual insights, e.g., Copernicus' conclusion that the solar system is heliocentric.
Collapse
Affiliation(s)
- Raban Iten
- ETH Zürich, Wolfgang-Pauli-Strasse 27, 8093 Zürich, Switzerland
| | - Tony Metger
- ETH Zürich, Wolfgang-Pauli-Strasse 27, 8093 Zürich, Switzerland
| | - Henrik Wilming
- ETH Zürich, Wolfgang-Pauli-Strasse 27, 8093 Zürich, Switzerland
| | - Lídia Del Rio
- ETH Zürich, Wolfgang-Pauli-Strasse 27, 8093 Zürich, Switzerland
| | - Renato Renner
- ETH Zürich, Wolfgang-Pauli-Strasse 27, 8093 Zürich, Switzerland
| |
Collapse
|
43
|
Brummitt CD, Gómez-Liévano A, Hausmann R, Bonds MH. Machine-learned patterns suggest that diversification drives economic development. J R Soc Interface 2020; 17:20190283. [PMID: 31910774 DOI: 10.1098/rsif.2019.0283] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
We combine a sequence of machine-learning techniques, together called Principal Smooth-Dynamics Analysis (PriSDA), to identify patterns in the dynamics of complex systems. Here, we deploy this method on the task of automating the development of new theory of economic growth. Traditionally, economic growth is modelled with a few aggregate quantities derived from simplified theoretical models. PriSDA, by contrast, identifies important quantities. Applied to 55 years of data on countries' exports, PriSDA finds that what most distinguishes countries' export baskets is their diversity, with extra weight assigned to more sophisticated products. The weights are consistent with previous measures of product complexity. The second dimension of variation is proficiency in machinery relative to agriculture. PriSDA then infers the dynamics of these two quantities and of per capita income. The inferred model predicts that diversification drives growth in income, that diversified middle-income countries will grow the fastest, and that countries will converge onto intermediate levels of income and specialization. PriSDA is generalizable and may illuminate dynamics of elusive quantities such as diversity and complexity in other natural and social systems.
Collapse
Affiliation(s)
- Charles D Brummitt
- Global Health and Social Medicine, Harvard Medical School, Boston, MA 02115, USA
| | | | - Ricardo Hausmann
- Growth Lab at Harvard University, Cambridge, MA, USA.,Center for International Development, Harvard Kennedy School, Cambridge, MA 02138, USA.,Santa Fe Institute, Santa Fe, NM 87501, USA
| | - Matthew H Bonds
- Global Health and Social Medicine, Harvard Medical School, Boston, MA 02115, USA
| |
Collapse
|
44
|
Guimerà R, Reichardt I, Aguilar-Mogas A, Massucci FA, Miranda M, Pallarès J, Sales-Pardo M. A Bayesian machine scientist to aid in the solution of challenging scientific problems. SCIENCE ADVANCES 2020; 6:eaav6971. [PMID: 32064326 PMCID: PMC6994216 DOI: 10.1126/sciadv.aav6971] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/11/2018] [Accepted: 11/20/2019] [Indexed: 05/06/2023]
Abstract
Closed-form, interpretable mathematical models have been instrumental for advancing our understanding of the world; with the data revolution, we may now be in a position to uncover new such models for many systems from physics to the social sciences. However, to deal with increasing amounts of data, we need "machine scientists" that are able to extract these models automatically from data. Here, we introduce a Bayesian machine scientist, which establishes the plausibility of models using explicit approximations to the exact marginal posterior over models and establishes its prior expectations about models by learning from a large empirical corpus of mathematical expressions. It explores the space of models using Markov chain Monte Carlo. We show that this approach uncovers accurate models for synthetic and real data and provides out-of-sample predictions that are more accurate than those of existing approaches and of other nonparametric methods.
Collapse
Affiliation(s)
- Roger Guimerà
- ICREA, Barcelona 08010, Catalonia, Spain
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
- Corresponding author.
| | - Ignasi Reichardt
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
| | - Antoni Aguilar-Mogas
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
- Division of Research, Economic Development and Engagement, East Carolina University, Greenville, NC 27858, USA
| | - Francesco A. Massucci
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
- SIRIS Lab, Research Division of SIRIS Academic, Barcelona 08003, Catalonia, Spain
| | - Manuel Miranda
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
| | - Jordi Pallarès
- Department of Mechanical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
| | - Marta Sales-Pardo
- Department of Chemical Engineering, Universitat Rovira i Virgili, Tarragona 43007, Catalonia, Spain
| |
Collapse
|
45
|
Machine learning-based adaptive model identification of systems: Application to a chemical process. Chem Eng Res Des 2019. [DOI: 10.1016/j.cherd.2019.09.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
|
46
|
Wu T, Tegmark M. Toward an artificial intelligence physicist for unsupervised learning. Phys Rev E 2019; 100:033311. [PMID: 31639888 DOI: 10.1103/physreve.100.033311] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2019] [Indexed: 12/30/2022]
Abstract
We investigate opportunities and challenges for improving unsupervised machine learning using four common strategies with a long history in physics: divide and conquer, Occam's razor, unification, and lifelong learning. Instead of using one model to learn everything, we propose a paradigm centered around the learning and manipulation of theories, which parsimoniously predict both aspects of the future (from past observations) and the domain in which these predictions are accurate. Specifically, we propose a generalized mean loss to encourage each theory to specialize in its comparatively advantageous domain, and a differentiable description length objective to downweight bad data and "snap" learned theories into simple symbolic formulas. Theories are stored in a "theory hub," which continuously unifies learned theories and can propose theories when encountering new environments. We test our implementation, the toy "artificial intelligence physicist" learning agent, on a suite of increasingly complex physics environments. From unsupervised observation of trajectories through worlds involving random combinations of gravity, electromagnetism, harmonic motion, and elastic bounces, our agent typically learns faster and produces mean-squared prediction errors about a billion times smaller than a standard feedforward neural net of comparable complexity, typically recovering integer and rational theory parameters exactly. Our agent successfully identifies domains with different laws of motion also for a nonlinear chaotic double pendulum in a piecewise constant force field.
Collapse
Affiliation(s)
- Tailin Wu
- Department of Physics and Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA and Theiss Research, La Jolla, California 92037, USA
| | - Max Tegmark
- Department of Physics and Center for Brains, Minds, and Machines, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA and Theiss Research, La Jolla, California 92037, USA
| |
Collapse
|
47
|
Pantazis Y, Tsamardinos I. A unified approach for sparse dynamical system inference from temporal measurements. Bioinformatics 2019; 35:3387-3396. [PMID: 30715136 PMCID: PMC6748758 DOI: 10.1093/bioinformatics/btz065] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2017] [Revised: 01/17/2019] [Accepted: 01/28/2019] [Indexed: 11/12/2022] Open
Abstract
Motivation Temporal variations in biological systems and more generally in natural sciences are typically modeled as a set of ordinary, partial or stochastic differential or difference equations. Algorithms for learning the structure and the parameters of a dynamical system are distinguished based on whether time is discrete or continuous, observations are time-series or time-course and whether the system is deterministic or stochastic, however, there is no approach able to handle the various types of dynamical systems simultaneously. Results In this paper, we present a unified approach to infer both the structure and the parameters of non-linear dynamical systems of any type under the restriction of being linear with respect to the unknown parameters. Our approach, which is named Unified Sparse Dynamics Learning (USDL), constitutes of two steps. First, an atemporal system of equations is derived through the application of the weak formulation. Then, assuming a sparse representation for the dynamical system, we show that the inference problem can be expressed as a sparse signal recovery problem, allowing the application of an extensive body of algorithms and theoretical results. Results on simulated data demonstrate the efficacy and superiority of the USDL algorithm under multiple interventions and/or stochasticity. Additionally, USDL’s accuracy significantly correlates with theoretical metrics such as the exact recovery coefficient. On real single-cell data, the proposed approach is able to induce high-confidence subgraphs of the signaling pathway. Availability and implementation Source code is available at Bioinformatics online. USDL algorithm has been also integrated in SCENERY (http://scenery.csd.uoc.gr/); an online tool for single-cell mass cytometry analytics. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Yannis Pantazis
- Institute of Applied and Computational Mathematics, Foundation for Research and Technology - Hellas (FORTH), Heraklion, Greece
- To whom correspondence should be addressed. E-mail: or
| | - Ioannis Tsamardinos
- Institute of Applied and Computational Mathematics, Foundation for Research and Technology - Hellas (FORTH), Heraklion, Greece
- Department of Computer Science, University of Crete, Heraklion, Greece
- Gnosis Data Analysis PC, Heraklion, Greece
- To whom correspondence should be addressed. E-mail: or
| |
Collapse
|
48
|
Heras FJH, Romero-Ferrero F, Hinz RC, de Polavieja GG. Deep attention networks reveal the rules of collective motion in zebrafish. PLoS Comput Biol 2019; 15:e1007354. [PMID: 31518357 PMCID: PMC6760814 DOI: 10.1371/journal.pcbi.1007354] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Revised: 09/25/2019] [Accepted: 08/21/2019] [Indexed: 12/01/2022] Open
Abstract
A variety of simple models has been proposed to understand the collective motion of animals. These models can be insightful but may lack important elements necessary to predict the motion of each individual in the collective. Adding more detail increases predictability but can make models too complex to be insightful. Here we report that deep attention networks can obtain a model of collective behavior that is simultaneously predictive and insightful thanks to an organization in modules. When using simulated trajectories, the model recovers the ground-truth interaction rule used to generate them, as well as the number of interacting neighbours. For experimental trajectories of large groups of 60-100 zebrafish, Danio rerio, the model obtains that interactions between pairs can approximately be described as repulsive, attractive or as alignment, but only when moving slowly. At high velocities, interactions correspond only to alignment or alignment mixed with repulsion at close distances. The model also shows that each zebrafish decides where to move by aggregating information from the group as a weighted average over neighbours. Weights are higher for neighbours that are close, in a collision path or moving faster in frontal and lateral locations. The network also extracts that the number of interacting individuals is dynamical and typically in the range 8–22, with 1–10 more important ones. Our results suggest that each animal decides by dynamically selecting information from the collective. Simple models have traditionally been very successful, because they usually provide more insight than complicated models. This is particularly true in physics, where simple models can often give highly precise quantitative predictions. However, biology is fundamentally complex and thus it is difficult to find simple models that give precise predictions. To create models that are both precise and insightful, we propose to harness the power of deep neural networks but to confine them into modules with a low number of inputs and outputs. We trained one such model to predict the future turning side of a fish in a collective. By plotting the different modules we obtain insight about how fish interact and how they aggregate information from different neighbours. This aggregation is dynamical and shows that fish can interact with approximately 20 neighbours but can also focus on fewer neighbours, down to 1-2, when some move at higher speed in front or to the sides, are very close or are in a collision path.
Collapse
Affiliation(s)
- Francisco J. H. Heras
- Champalimaud Research, Champalimaud Centre for the Unknown, Lisbon, Portugal
- * E-mail: (FJHH); (GGP)
| | | | - Robert C. Hinz
- Champalimaud Research, Champalimaud Centre for the Unknown, Lisbon, Portugal
| | - Gonzalo G. de Polavieja
- Champalimaud Research, Champalimaud Centre for the Unknown, Lisbon, Portugal
- * E-mail: (FJHH); (GGP)
| |
Collapse
|
49
|
Automated, predictive, and interpretable inference of Caenorhabditis elegans escape dynamics. Proc Natl Acad Sci U S A 2019; 116:7226-7231. [PMID: 30902894 DOI: 10.1073/pnas.1816531116] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
The roundworm Caenorhabditis elegans exhibits robust escape behavior in response to rapidly rising temperature. The behavior lasts for a few seconds, shows history dependence, involves both sensory and motor systems, and is too complicated to model mechanistically using currently available knowledge. Instead we model the process phenomenologically, and we use the Sir Isaac dynamical inference platform to infer the model in a fully automated fashion directly from experimental data. The inferred model requires incorporation of an unobserved dynamical variable and is biologically interpretable. The model makes accurate predictions about the dynamics of the worm behavior, and it can be used to characterize the functional logic of the dynamical system underlying the escape response. This work illustrates the power of modern artificial intelligence to aid in discovery of accurate and interpretable models of complex natural systems.
Collapse
|
50
|
Dale R, Bhat HS. Equations of mind: Data science for inferring nonlinear dynamics of socio-cognitive systems. COGN SYST RES 2018. [DOI: 10.1016/j.cogsys.2018.06.020] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|