1
|
Massonis G, Villaverde AF, Banga JR. Distilling identifiable and interpretable dynamic models from biological data. PLoS Comput Biol 2023; 19:e1011014. [PMID: 37851682 PMCID: PMC10615316 DOI: 10.1371/journal.pcbi.1011014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Revised: 10/30/2023] [Accepted: 10/03/2023] [Indexed: 10/20/2023] Open
Abstract
Mechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open problem in computational biology. Currently, many research efforts are focused on model discovery, i.e. automating the development of interpretable models from data. One of the main frameworks is sparse regression, where the sparse identification of nonlinear dynamics (SINDy) algorithm and its variants have enjoyed great success. SINDy-PI is an extension which allows the discovery of rational nonlinear terms, thus enabling the identification of kinetic functions common in biochemical networks, such as Michaelis-Menten. SINDy-PI also pays special attention to the recovery of parsimonious models (Occam's razor). Here we focus on biological models composed of sets of deterministic nonlinear ordinary differential equations. We present a methodology that, combined with SINDy-PI, allows the automatic discovery of structurally identifiable and observable models which are also mechanistically interpretable. The lack of structural identifiability and observability makes it impossible to uniquely infer parameter and state variables, which can compromise the usefulness of a model by distorting its mechanistic significance and hampering its ability to produce biological insights. We illustrate the performance of our method with six case studies. We find that, despite enforcing sparsity, SINDy-PI sometimes yields models that are unidentifiable. In these cases we show how our method transforms their equations in order to obtain a structurally identifiable and observable model which is also interpretable.
Collapse
Affiliation(s)
- Gemma Massonis
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Galicia, Spain
| | - Alejandro F. Villaverde
- CITMAga, Santiago de Compostela, Galicia, Spain
- Universidade de Vigo, Department of Systems and Control Engineering, Vigo, Galicia, Spain
| | - Julio R. Banga
- Computational Biology Lab, MBG-CSIC (Spanish National Research Council), Pontevedra, Galicia, Spain
| |
Collapse
|
2
|
Roman A, Palanski K, Nemenman I, Ryu WS. A dynamical model of C. elegans thermal preference reveals independent excitatory and inhibitory learning pathways. Proc Natl Acad Sci U S A 2023; 120:e2215191120. [PMID: 36940330 PMCID: PMC10068832 DOI: 10.1073/pnas.2215191120] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 02/19/2023] [Indexed: 03/22/2023] Open
Abstract
Caenorhabditis elegans is capable of learning and remembering behaviorally relevant cues such as smells, tastes, and temperature. This is an example of associative learning, a process in which behavior is modified by making associations between various stimuli. Since the mathematical theory of conditioning does not account for some of its salient aspects, such as spontaneous recovery of extinguished associations, accurate modeling of behavior of real animals during conditioning has turned out difficult. Here, we do this in the context of the dynamics of the thermal preference of C. elegans. We quantify C. elegans thermotaxis in response to various conditioning temperatures, starvation durations, and genetic perturbations using a high-resolution microfluidic droplet assay. We model these data comprehensively, within a biologically interpretable, multi-modal framework. We find that the strength of the thermal preference is composed of two independent, genetically separable contributions and requires a model with at least four dynamical variables. One pathway positively associates the experienced temperature independently of food and the other negatively associates with the temperature when food is absent. The multidimensional structure of the association strength provides an explanation for the apparent classical temperature-food association of C. elegans thermal preference and a number of longstanding questions in animal learning, including spontaneous recovery, asymmetric response to appetitive vs. aversive cues, latent inhibition, and generalization among similar cues.
Collapse
Affiliation(s)
- Ahmed Roman
- Department of Physics, Emory University, Atlanta, GA30322
| | | | - Ilya Nemenman
- Department of Physics, Emory University, Atlanta, GA30322
- Department of Biology, Emory University, Atlanta, GA30322
- Initiative in Theory and Modeling of Living Systems, Emory University, Atlanta, GA30322
| | - William S. Ryu
- Department of Physics, University of Toronto, Toronto, ON M5S 1A7, Canada
- The Donnelly Centre, University of Toronto, Toronto, ON M5S 3E1, Canada
| |
Collapse
|
3
|
Ribera H, Shirman S, Nguyen AV, Mangan NM. Model selection of chaotic systems from data with hidden variables using sparse data assimilation. CHAOS (WOODBURY, N.Y.) 2022; 32:063101. [PMID: 35778121 DOI: 10.1063/5.0066066] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/07/2021] [Accepted: 05/06/2022] [Indexed: 06/15/2023]
Abstract
Many natural systems exhibit chaotic behavior, including the weather, hydrology, neuroscience, and population dynamics. Although many chaotic systems can be described by relatively simple dynamical equations, characterizing these systems can be challenging due to sensitivity to initial conditions and difficulties in differentiating chaotic behavior from noise. Ideally, one wishes to find a parsimonious set of equations that describe a dynamical system. However, model selection is more challenging when only a subset of the variables are experimentally accessible. Manifold learning methods using time-delay embeddings can successfully reconstruct the underlying structure of the system from data with hidden variables, but not the equations. Recent work in sparse-optimization based model selection has enabled model discovery given a library of possible terms, but regression-based methods require measurements of all state variables. We present a method combining variational annealing-a technique previously used for parameter estimation in chaotic systems with hidden variables-with sparse-optimization methods to perform model identification for chaotic systems with unmeasured variables. We applied the method to ground-truth time-series simulated from the classic Lorenz system and experimental data from an electrical circuit with Lorenz-system like behavior. In both cases, we successfully recover the expected equations with two measured and one hidden variable. Application to simulated data from the Colpitts oscillator demonstrates successful model selection of terms within nonlinear functions. We discuss the robustness of our method to varying noise.
Collapse
Affiliation(s)
- H Ribera
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - S Shirman
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - A V Nguyen
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| | - N M Mangan
- Department of Engineering Sciences and Applied Mathematics, Northwestern University, Evanston, Illinois 60208, USA
| |
Collapse
|
4
|
Reddy G, Desban L, Tanaka H, Roussel J, Mirat O, Wyart C. A lexical approach for identifying behavioural action sequences. PLoS Comput Biol 2022; 18:e1009672. [PMID: 35007275 PMCID: PMC8782473 DOI: 10.1371/journal.pcbi.1009672] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2020] [Revised: 01/21/2022] [Accepted: 11/16/2021] [Indexed: 12/14/2022] Open
Abstract
Animals display characteristic behavioural patterns when performing a task, such as the spiraling of a soaring bird or the surge-and-cast of a male moth searching for a female. Identifying such recurring sequences occurring rarely in noisy behavioural data is key to understanding the behavioural response to a distributed stimulus in unrestrained animals. Existing models seek to describe the dynamics of behaviour or segment individual locomotor episodes rather than to identify the rare and transient sequences of locomotor episodes that make up the behavioural response. To fill this gap, we develop a lexical, hierarchical model of behaviour. We designed an unsupervised algorithm called "BASS" to efficiently identify and segment recurring behavioural action sequences transiently occurring in long behavioural recordings. When applied to navigating larval zebrafish, BASS extracts a dictionary of remarkably long, non-Markovian sequences consisting of repeats and mixtures of slow forward and turn bouts. Applied to a novel chemotaxis assay, BASS uncovers chemotactic strategies deployed by zebrafish to avoid aversive cues consisting of sequences of fast large-angle turns and burst swims. In a simulated dataset of soaring gliders climbing thermals, BASS finds the spiraling patterns characteristic of soaring behaviour. In both cases, BASS succeeds in identifying rare action sequences in the behaviour deployed by freely moving animals. BASS can be easily incorporated into the pipelines of existing behavioural analyses across diverse species, and even more broadly used as a generic algorithm for pattern recognition in low-dimensional sequential data.
Collapse
Affiliation(s)
- Gautam Reddy
- NSF-Simons Center for Mathematical & Statistical Analysis of Biology, Harvard University, Cambridge, Massachusetts, United States of America
| | - Laura Desban
- Sorbonne Université, Institut du Cerveau (ICM), Inserm U 1127, CNRS UMR 7225, Paris, France
| | - Hidenori Tanaka
- Physics & Informatics Laboratories, NTT Research, Inc., East Palo Alto, California, United States of America
- Department of Applied Physics, Stanford University, Stanford, California, United States of America
| | - Julian Roussel
- Sorbonne Université, Institut du Cerveau (ICM), Inserm U 1127, CNRS UMR 7225, Paris, France
| | - Olivier Mirat
- Sorbonne Université, Institut du Cerveau (ICM), Inserm U 1127, CNRS UMR 7225, Paris, France
| | - Claire Wyart
- Sorbonne Université, Institut du Cerveau (ICM), Inserm U 1127, CNRS UMR 7225, Paris, France
| |
Collapse
|
5
|
Ranawade A, Levine E. Primer on Mathematical Modeling in C. elegans. Methods Mol Biol 2022; 2468:375-386. [PMID: 35320577 DOI: 10.1007/978-1-0716-2181-3_21] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Recently, applications of mathematical and computational models to biological processes have helped investigators to systematically interpret data, test hypotheses built on experimental data, generate new hypotheses, and guide the design of new experiments, protocols, and synthetic biological systems. Availability of diverse quantitative data is a prerequisite for successful mathematical modeling. The ability to acquire high-quality quantitative data for a broad range of biological processes and perform precise perturbation makes C. elegans an ideal model system for such studies. In this primer, we examine the general procedure of modeling biological systems and demonstrate this process using the heat-shock response in C. elegans as a case study. Our goal is to facilitate the initial discussion between worm biologists and their potential collaborators from quantitative disciplines.
Collapse
Affiliation(s)
- Ayush Ranawade
- Department of Bioengineering, Northeastern University, Boston, MA, USA
| | - Erel Levine
- Department of Bioengineering, Northeastern University, Boston, MA, USA.
| |
Collapse
|
6
|
Tracking changes in behavioural dynamics using prediction error. PLoS One 2021; 16:e0251053. [PMID: 33979384 PMCID: PMC8115816 DOI: 10.1371/journal.pone.0251053] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 04/20/2021] [Indexed: 11/19/2022] Open
Abstract
Automated analysis of video can now generate extensive time series of pose and motion in freely-moving organisms. This requires new quantitative tools to characterise behavioural dynamics. For the model roundworm Caenorhabditis elegans, body pose can be accurately quantified from video as coordinates in a single low-dimensional space. We focus on this well-established case as an illustrative example and propose a method to reveal subtle variations in behaviour at high time resolution. Our data-driven method, based on empirical dynamic modeling, quantifies behavioural change as prediction error with respect to a time-delay-embedded ‘attractor’ of behavioural dynamics. Because this attractor is constructed from a user-specified reference data set, the approach can be tailored to specific behaviours of interest at the individual or group level. We validate the approach by detecting small changes in the movement dynamics of C. elegans at the initiation and completion of delta turns. We then examine an escape response initiated by an aversive stimulus and find that the method can track return to baseline behaviour in individual worms and reveal variations in the escape response between worms. We suggest that this general approach—defining dynamic behaviours using reference attractors and quantifying dynamic changes using prediction error—may be of broad interest and relevance to behavioural researchers working with video-derived time series.
Collapse
|
7
|
Haggerty RA, Purvis JE. Inferring the structures of signaling motifs from paired dynamic traces of single cells. PLoS Comput Biol 2021; 17:e1008657. [PMID: 33539338 PMCID: PMC7889133 DOI: 10.1371/journal.pcbi.1008657] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2019] [Revised: 02/17/2021] [Accepted: 12/26/2020] [Indexed: 11/18/2022] Open
Abstract
Individual cells show variability in their signaling dynamics that often correlates with phenotypic responses, indicating that cell-to-cell variability is not merely noise but can have functional consequences. Based on this observation, we reasoned that cell-to-cell variability under the same treatment condition could be explained in part by a single signaling motif that maps different upstream signals into a corresponding set of downstream responses. If this assumption holds, then repeated measurements of upstream and downstream signaling dynamics in a population of cells could provide information about the underlying signaling motif for a given pathway, even when no prior knowledge of that motif exists. To test these two hypotheses, we developed a computer algorithm called MISC (Motif Inference from Single Cells) that infers the underlying signaling motif from paired time-series measurements from individual cells. When applied to measurements of transcription factor and reporter gene expression in the yeast stress response, MISC predicted signaling motifs that were consistent with previous mechanistic models of transcription. The ability to detect the underlying mechanism became less certain when a cell's upstream signal was randomly paired with another cell's downstream response, demonstrating how averaging time-series measurements across a population obscures information about the underlying signaling mechanism. In some cases, motif predictions improved as more cells were added to the analysis. These results provide evidence that mechanistic information about cellular signaling networks can be systematically extracted from the dynamical patterns of single cells.
Collapse
Affiliation(s)
- Raymond A. Haggerty
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Computational Medicine Program, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Curriculum for Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
| | - Jeremy E. Purvis
- Department of Genetics, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Computational Medicine Program, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Curriculum for Bioinformatics and Computational Biology, University of North Carolina, Chapel Hill, North Carolina, United States of America
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, United States of America
- * E-mail:
| |
Collapse
|
8
|
Mujica-Parodi LR, Strey HH. Making Sense of Computational Psychiatry. Int J Neuropsychopharmacol 2020; 23:339-347. [PMID: 32219396 PMCID: PMC7251632 DOI: 10.1093/ijnp/pyaa013] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Accepted: 02/24/2020] [Indexed: 12/26/2022] Open
Abstract
In psychiatry we often speak of constructing "models." Here we try to make sense of what such a claim might mean, starting with the most fundamental question: "What is (and isn't) a model?" We then discuss, in a concrete measurable sense, what it means for a model to be useful. In so doing, we first identify the added value that a computational model can provide in the context of accuracy and power. We then present limitations of standard statistical methods and provide suggestions for how we can expand the explanatory power of our analyses by reconceptualizing statistical models as dynamical systems. Finally, we address the problem of model building-suggesting ways in which computational psychiatry can escape the potential for cognitive biases imposed by classical hypothesis-driven research, exploiting deep systems-level information contained within neuroimaging data to advance our understanding of psychiatric neuroscience.
Collapse
Affiliation(s)
- Lilianne R Mujica-Parodi
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, New York,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York,Correspondence: Lilianne R. Mujica-Parodi, PhD, Director, Laboratory for Computational Neurodiagnostics, Professor, Department of Biomedical Engineering, Renaissance School of Medicine, Stony Brook, NY 11794-5281 () or Helmut H. Strey, PhD, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794-5281 ()
| | - Helmut H Strey
- Department of Biomedical Engineering, Stony Brook University, Stony Brook, New York,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York,Correspondence: Lilianne R. Mujica-Parodi, PhD, Director, Laboratory for Computational Neurodiagnostics, Professor, Department of Biomedical Engineering, Renaissance School of Medicine, Stony Brook, NY 11794-5281 () or Helmut H. Strey, PhD, Department of Biomedical Engineering, Stony Brook University, Stony Brook, NY 11794-5281 ()
| |
Collapse
|
9
|
Byrne Rodgers J, Ryu WS. Targeted thermal stimulation and high-content phenotyping reveal that the C. elegans escape response integrates current behavioral state and past experience. PLoS One 2020; 15:e0229399. [PMID: 32218560 PMCID: PMC7100941 DOI: 10.1371/journal.pone.0229399] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2019] [Accepted: 02/05/2020] [Indexed: 12/03/2022] Open
Abstract
The ability to avoid harmful or potentially harmful stimuli can help an organism escape predators and injury, and certain avoidance mechanisms are conserved across the animal kingdom. However, how the need to avoid an imminent threat is balanced with current behavior and modified by past experience is not well understood. In this work we focused on rapidly increasing temperature, a signal that triggers an escape response in a variety of animals, including the nematode Caenorhabditis elegans. We have developed a noxious thermal response assay using an infrared laser that can be automatically controlled and targeted in order to investigate how C. elegans responds to noxious heat over long timescales and to repeated stimuli in various behavioral and sensory contexts. High-content phenotyping of behavior in individual animals revealed that the C. elegans escape response is multidimensional, with some features that extend for several minutes, and can be modulated by (i) stimulus amplitude; (ii) other sensory inputs, such as food context; (iii) long and short-term thermal experience; and (iv) the animal's current behavioral state.
Collapse
Affiliation(s)
- Jarlath Byrne Rodgers
- Department of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
| | - William S. Ryu
- Department of Cell & Systems Biology, University of Toronto, Toronto, Ontario, Canada
- Donnelly Centre, University of Toronto, Toronto, Ontario, Canada
- Department of Physics, University of Toronto, Toronto, Ontario, Canada
| |
Collapse
|