1
|
Wang S, Al-Radhawi MA, Lauffenburger DA, Sontag ED. Recovering biomolecular network dynamics from single-cell omics data requires three time points. NPJ Syst Biol Appl 2024; 10:97. [PMID: 39191787 DOI: 10.1038/s41540-024-00424-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2024] [Accepted: 08/10/2024] [Indexed: 08/29/2024] Open
Abstract
Single-cell omics technologies can measure millions of cells for up to thousands of biomolecular features, enabling data-driven studies of complex biological networks. However, these high-throughput experimental techniques often cannot track individual cells over time, thus complicating the understanding of dynamics such as time trajectories of cell states. These "dynamical phenotypes" are key to understanding biological phenomena such as differentiation fates. We show by mathematical analysis that, in spite of high dimensionality and lack of individual cell traces, three time-points of single-cell omics data are theoretically necessary and sufficient to uniquely determine the network interaction matrix and associated dynamics. Moreover, we show through numerical simulations that an interaction matrix can be accurately determined with three or more time-points even in the presence of sampling and measurement noise typical of single-cell omics. Our results can guide the design of single-cell omics time-course experiments, and provide a tool for data-driven phase-space analysis.
Collapse
Affiliation(s)
- Shu Wang
- Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Muhammad Ali Al-Radhawi
- Departments of Bioengineering and Electrical & Computer Engineering, Northeastern University, Boston, MA, USA
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
| | - Eduardo D Sontag
- Departments of Bioengineering and Electrical & Computer Engineering, Northeastern University, Boston, MA, USA.
| |
Collapse
|
2
|
Wang S, Sontag ED, Lauffenburger DA. What cannot be seen correctly in 2D visualizations of single-cell 'omics data? Cell Syst 2023; 14:723-731. [PMID: 37734322 PMCID: PMC10863674 DOI: 10.1016/j.cels.2023.07.002] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2022] [Revised: 05/07/2023] [Accepted: 07/14/2023] [Indexed: 09/23/2023]
Abstract
A common strategy for exploring single-cell 'omics data is visualizing 2D nonlinear projections that aim to preserve high-dimensional data properties such as neighborhoods. Alternatively, mathematical theory and other computational tools can directly describe data geometry, while also showing that neighborhoods and other properties cannot be well-preserved in any 2D projection.
Collapse
Affiliation(s)
- Shu Wang
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Eduardo D Sontag
- Departments of Bioengineering and Electrical & Computer Engineering, Northeastern University, Boston, MA 02115, USA.
| | - Douglas A Lauffenburger
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| |
Collapse
|
3
|
Marsh L, Dufresne E, Byrne HM, Harrington HA. Algebra, Geometry and Topology of ERK Kinetics. Bull Math Biol 2022; 84:137. [PMID: 36273372 PMCID: PMC9588486 DOI: 10.1007/s11538-022-01088-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2022] [Accepted: 09/16/2022] [Indexed: 12/01/2022]
Abstract
The MEK/ERK signalling pathway is involved in cell division, cell specialisation, survival and cell death (Shaul and Seger in Biochim Biophys Acta (BBA)-Mol Cell Res 1773(8):1213-1226, 2007). Here we study a polynomial dynamical system describing the dynamics of MEK/ERK proposed by Yeung et al. (Curr Biol 2019, https://doi.org/10.1016/j.cub.2019.12.052 ) with their experimental setup, data and known biological information. The experimental dataset is a time-course of ERK measurements in different phosphorylation states following activation of either wild-type MEK or MEK mutations associated with cancer or developmental defects. We demonstrate how methods from computational algebraic geometry, differential algebra, Bayesian statistics and computational algebraic topology can inform the model reduction, identification and parameter inference of MEK variants, respectively. Throughout, we show how this algebraic viewpoint offers a rigorous and systematic analysis of such models.
Collapse
Affiliation(s)
- Lewis Marsh
- Mathematical Institute, University of Oxford, Oxford, UK.
- Ludwig Institute for Cancer Research, University of Oxford, Oxford, UK.
| | | | - Helen M Byrne
- Mathematical Institute, University of Oxford, Oxford, UK
- Ludwig Institute for Cancer Research, University of Oxford, Oxford, UK
| | | |
Collapse
|
4
|
Maudsley S, Leysen H, van Gastel J, Martin B. Systems Pharmacology: Enabling Multidimensional Therapeutics. COMPREHENSIVE PHARMACOLOGY 2022:725-769. [DOI: 10.1016/b978-0-12-820472-6.00017-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2025]
|
5
|
Yeung E, Kim J, Yuan Y, Gonçalves J, Murray RM. Data-driven network models for genetic circuits from time-series data with incomplete measurements. J R Soc Interface 2021; 18:20210413. [PMID: 34493091 PMCID: PMC8424335 DOI: 10.1098/rsif.2021.0413] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Accepted: 08/12/2021] [Indexed: 12/23/2022] Open
Abstract
Synthetic gene networks are frequently conceptualized and visualized as static graphs. This view of biological programming stands in stark contrast to the transient nature of biomolecular interaction, which is frequently enacted by labile molecules that are often unmeasured. Thus, the network topology and dynamics of synthetic gene networks can be difficult to verify in vivo or in vitro, due to the presence of unmeasured biological states. Here we introduce the dynamical structure function as a new mesoscopic, data-driven class of models to describe gene networks with incomplete measurements of state dynamics. We develop a network reconstruction algorithm and a code base for reconstructing the dynamical structure function from data, to enable discovery and visualization of graphical relationships in a genetic circuit diagram as time-dependent functions rather than static, unknown weights. We prove a theorem, showing that dynamical structure functions can provide a data-driven estimate of the size of crosstalk fluctuations from an idealized model. We illustrate this idea with numerical examples. Finally, we show how data-driven estimation of dynamical structure functions can explain failure modes in two experimentally implemented genetic circuits, a previously reported in vitro genetic circuit and a new E. coli-based transcriptional event detector.
Collapse
Affiliation(s)
- Enoch Yeung
- Center for Biological Engineering, Biomolecular Science and Engineering Program, Department of Mechanical Engineering, Center for Control, Dynamical Systems, and Computation, University of California, Santa Barbara, CA, USA
| | - Jongmin Kim
- Department of Life Sciences, POSTECH, Pohang, South Korea
| | - Ye Yuan
- School of Artificial Intelligence and Automation, Hua Zhong University of Science and Technology, Wuhan, People’s Republic of China
| | - Jorge Gonçalves
- Systems Biology Research Group, University of Luxembourg, Belvaux, Luxembourg
| | - Richard M. Murray
- Control and Dynamical Systems, California Institute of Technology, Pasadena, CA, USA
- Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| |
Collapse
|
6
|
Sritharan D, Wang S, Hormoz S. Computing the Riemannian curvature of image patch and single-cell RNA sequencing data manifolds using extrinsic differential geometry. Proc Natl Acad Sci U S A 2021; 118:e2100473118. [PMID: 34272279 PMCID: PMC8307776 DOI: 10.1073/pnas.2100473118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Most high-dimensional datasets are thought to be inherently low-dimensional-that is, data points are constrained to lie on a low-dimensional manifold embedded in a high-dimensional ambient space. Here, we study the viability of two approaches from differential geometry to estimate the Riemannian curvature of these low-dimensional manifolds. The intrinsic approach relates curvature to the Laplace-Beltrami operator using the heat-trace expansion and is agnostic to how a manifold is embedded in a high-dimensional space. The extrinsic approach relates the ambient coordinates of a manifold's embedding to its curvature using the Second Fundamental Form and the Gauss-Codazzi equation. We found that the intrinsic approach fails to accurately estimate the curvature of even a two-dimensional constant-curvature manifold, whereas the extrinsic approach was able to handle more complex toy models, even when confounded by practical constraints like small sample sizes and measurement noise. To test the applicability of the extrinsic approach to real-world data, we computed the curvature of a well-studied manifold of image patches and recapitulated its topological classification as a Klein bottle. Lastly, we applied the extrinsic approach to study single-cell transcriptomic sequencing (scRNAseq) datasets of blood, gastrulation, and brain cells to quantify the Riemannian curvature of scRNAseq manifolds.
Collapse
Affiliation(s)
- Duluxan Sritharan
- Harvard Graduate Program in Biophysics, Harvard University, Boston, MA 02115
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02215
| | - Shu Wang
- Harvard Graduate Program in Biophysics, Harvard University, Boston, MA 02115
- Laboratory of Systems Pharmacology, Harvard Medical School, Boston, MA 02115
| | - Sahand Hormoz
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA 02215;
- Department of Systems Biology, Harvard Medical School, Boston, MA 02115
- Broad Institute of MIT and Harvard, Cambridge, MA 02142
| |
Collapse
|
7
|
Prugger M, Einkemmer L, Beik SP, Wasdin PT, Harris LA, Lopez CF. Unsupervised logic-based mechanism inference for network-driven biological processes. PLoS Comput Biol 2021; 17:e1009035. [PMID: 34077417 PMCID: PMC8202945 DOI: 10.1371/journal.pcbi.1009035] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Revised: 06/14/2021] [Accepted: 05/03/2021] [Indexed: 01/21/2023] Open
Abstract
Modern analytical techniques enable researchers to collect data about cellular states, before and after perturbations. These states can be characterized using analytical techniques, but the inference of regulatory interactions that explain and predict changes in these states remains a challenge. Here we present a generalizable, unsupervised approach to generate parameter-free, logic-based models of cellular processes, described by multiple discrete states. Our algorithm employs a Hamming-distance based approach to formulate, test, and identify optimized logic rules that link two states. Our approach comprises two steps. First, a model with no prior knowledge except for the mapping between initial and attractor states is built. We then employ biological constraints to improve model fidelity. Our algorithm automatically recovers the relevant dynamics for the explored models and recapitulates key aspects of the biochemical species concentration dynamics in the original model. We present the advantages and limitations of our work and discuss how our approach could be used to infer logic-based mechanisms of signaling, gene-regulatory, or other input-output processes describable by the Boolean formalism.
Collapse
Affiliation(s)
- Martina Prugger
- Department of Biochemistry, University of Innsbruck, Innsbruck, Austria
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Lukas Einkemmer
- Department of Mathematics, University of Innsbruck, Innsbruck, Austria
| | - Samantha P. Beik
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Perry T. Wasdin
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
| | - Leonard A. Harris
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Biomedical Engineering, University of Arkansas, Fayetteville, Arkansas, United States of America
- Interdisciplinary Graduate Program in Cell and Molecular Biology, University of Arkansas, Fayetteville, Arkansas, United States of America
- Cancer Biology Program, Winthrop P. Rockefeller Cancer Institute, University of Arkansas for Medical Sciences, Little Rock, Arkansas, United States of America
| | - Carlos F. Lopez
- Department of Biochemistry, Vanderbilt University School of Medicine, Nashville, Tennessee, United States of America
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| |
Collapse
|