1
|
Wang C, Choi HJ, Woodbury L, Lee K. Interpretable Fine-Grained Phenotypes of Subcellular Dynamics via Unsupervised Deep Learning. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2403547. [PMID: 39239705 PMCID: PMC11538677 DOI: 10.1002/advs.202403547] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2024] [Revised: 08/09/2024] [Indexed: 09/07/2024]
Abstract
Uncovering fine-grained phenotypes of live cell dynamics is pivotal for a comprehensive understanding of the heterogeneity in healthy and diseased biological processes. However, this endeavor poses significant technical challenges for unsupervised machine learning, requiring the extraction of features that not only faithfully preserve this heterogeneity but also effectively discriminate between established biological states, all while remaining interpretable. To tackle these challenges, a self-training deep learning framework designed for fine-grained and interpretable phenotyping is presented. This framework incorporates an unsupervised teacher model with interpretable features to facilitate feature learning in a student deep neural network (DNN). Significantly, an autoencoder-based regularizer is designed to encourage the student DNN to maximize the heterogeneity associated with molecular perturbations. This method enables the acquisition of features with enhanced discriminatory power, while simultaneously preserving the heterogeneity associated with molecular perturbations. This study successfully delineated fine-grained phenotypes within the heterogeneous protrusion dynamics of migrating epithelial cells, revealing specific responses to pharmacological perturbations. Remarkably, this framework adeptly captured a concise set of highly interpretable features uniquely linked to these fine-grained phenotypes, each corresponding to specific temporal intervals crucial for their manifestation. This unique capability establishes it as a valuable tool for investigating diverse cellular dynamics and their heterogeneity.
Collapse
Affiliation(s)
- Chuangqi Wang
- Department of Immunology and MicrobiologyUniversity of Colorado Anschutz Medical CampusAuroraCO80045USA
- Department of Biomedical EngineeringWorcester Polytechnic InstituteWorcesterMA01609USA
| | - Hee June Choi
- Department of Biomedical EngineeringWorcester Polytechnic InstituteWorcesterMA01609USA
- Vascular Biology Program and Department of SurgeryBoston Children's HospitalHarvard Medical SchoolBostonMA02115USA
| | - Lucy Woodbury
- Department of Biomedical EngineeringWorcester Polytechnic InstituteWorcesterMA01609USA
- Department of Biomedical EngineeringUniversity of ArkansasFayettevilleAR72701USA
| | - Kwonmoo Lee
- Department of Biomedical EngineeringWorcester Polytechnic InstituteWorcesterMA01609USA
- Vascular Biology Program and Department of SurgeryBoston Children's HospitalHarvard Medical SchoolBostonMA02115USA
| |
Collapse
|
2
|
Panikov NS. Genome-Scale Reconstruction of Microbial Dynamic Phenotype: Successes and Challenges. Microorganisms 2021; 9:2352. [PMID: 34835477 PMCID: PMC8621822 DOI: 10.3390/microorganisms9112352] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2021] [Revised: 10/18/2021] [Accepted: 10/27/2021] [Indexed: 12/04/2022] Open
Abstract
This review is a part of the SI 'Genome-Scale Modeling of Microorganisms in the Real World'. The goal of GEM is the accurate prediction of the phenotype from its respective genotype under specified environmental conditions. This review focuses on the dynamic phenotype; prediction of the real-life behaviors of microorganisms, such as cell proliferation, dormancy, and mortality; balanced and unbalanced growth; steady-state and transient processes; primary and secondary metabolism; stress responses; etc. Constraint-based metabolic reconstructions were successfully started two decades ago as FBA, followed by more advanced models, but this review starts from the earlier nongenomic predecessors to show that some GEMs inherited the outdated biokinetic frameworks compromising their performances. The most essential deficiencies are: (i) an inadequate account of environmental conditions, such as various degrees of nutrients limitation and other factors shaping phenotypes; (ii) a failure to simulate the adaptive changes of MMCC (MacroMolecular Cell Composition) in response to the fluctuating environment; (iii) the misinterpretation of the SGR (Specific Growth Rate) as either a fixed constant parameter of the model or independent factor affecting the conditional expression of macromolecules; (iv) neglecting stress resistance as an important objective function; and (v) inefficient experimental verification of GEM against simple growth (constant MMCC and SGR) data. Finally, we propose several ways to improve GEMs, such as replacing the outdated Monod equation with the SCM (Synthetic Chemostat Model) that establishes the quantitative relationships between primary and secondary metabolism, growth rate and stress resistance, process kinetics, and cell composition.
Collapse
Affiliation(s)
- Nicolai S Panikov
- Department of Chemistry and Chemical Biology, Northeastern University, 360 Huntington Ave., Boston, MA 02115, USA
| |
Collapse
|
3
|
Choi HJ, Wang C, Pan X, Jang J, Cao M, Brazzo JA, Bae Y, Lee K. Emerging machine learning approaches to phenotyping cellular motility and morphodynamics. Phys Biol 2021; 18:10.1088/1478-3975/abffbe. [PMID: 33971636 PMCID: PMC9131244 DOI: 10.1088/1478-3975/abffbe] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 05/10/2021] [Indexed: 12/22/2022]
Abstract
Cells respond heterogeneously to molecular and environmental perturbations. Phenotypic heterogeneity, wherein multiple phenotypes coexist in the same conditions, presents challenges when interpreting the observed heterogeneity. Advances in live cell microscopy allow researchers to acquire an unprecedented amount of live cell image data at high spatiotemporal resolutions. Phenotyping cellular dynamics, however, is a nontrivial task and requires machine learning (ML) approaches to discern phenotypic heterogeneity from live cell images. In recent years, ML has proven instrumental in biomedical research, allowing scientists to implement sophisticated computation in which computers learn and effectively perform specific analyses with minimal human instruction or intervention. In this review, we discuss how ML has been recently employed in the study of cell motility and morphodynamics to identify phenotypes from computer vision analysis. We focus on new approaches to extract and learn meaningful spatiotemporal features from complex live cell images for cellular and subcellular phenotyping.
Collapse
Affiliation(s)
- Hee June Choi
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
- Vascular Biology Program and Department of Surgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, United States of America
| | - Chuangqi Wang
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
- Present address. Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Xiang Pan
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
- Vascular Biology Program and Department of Surgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, United States of America
| | - Junbong Jang
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
- Vascular Biology Program and Department of Surgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, United States of America
| | - Mengzhi Cao
- Data Science Program, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
| | - Joseph A Brazzo
- Department of Pathology and Anatomical Sciences, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, State University of New York, Buffalo, NY 14203, United States of America
| | - Yongho Bae
- Department of Pathology and Anatomical Sciences, Jacobs School of Medicine and Biomedical Sciences, University at Buffalo, State University of New York, Buffalo, NY 14203, United States of America
| | - Kwonmoo Lee
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, MA 01609, United States of America
- Vascular Biology Program and Department of Surgery, Boston Children’s Hospital, Harvard Medical School, Boston, MA 02115, United States of America
| |
Collapse
|
4
|
Germine L, Strong RW, Singh S, Sliwinski MJ. Toward dynamic phenotypes and the scalable measurement of human behavior. Neuropsychopharmacology 2021; 46:209-216. [PMID: 32629456 PMCID: PMC7689489 DOI: 10.1038/s41386-020-0757-1] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/01/2020] [Revised: 06/18/2020] [Accepted: 06/25/2020] [Indexed: 12/24/2022]
Abstract
Precision psychiatry demands the rapid, efficient, and temporally dense collection of large scale and multi-omic data across diverse samples, for better diagnosis and treatment of dynamic clinical phenomena. To achieve this, we need approaches for measuring behavior that are readily scalable, both across participants and over time. Efforts to quantify behavior at scale are impeded by the fact that our methods for measuring human behavior are typically developed and validated for single time-point assessment, in highly controlled settings, and with relatively homogeneous samples. As a result, when taken to scale, these measures often suffer from poor reliability, generalizability, and participant engagement. In this review, we attempt to bridge the gap between gold standard behavioral measurements in the lab or clinic and the large-scale, high frequency assessments needed for precision psychiatry. To do this, we introduce and integrate two frameworks for the translation and validation of behavioral measurements. First, borrowing principles from computer science, we lay out an approach for iterative task development that can optimize behavioral measures based on psychometric, accessibility, and engagement criteria. Second, we advocate for a participatory research framework (e.g., citizen science) that can accelerate task development as well as make large-scale behavioral research more equitable and feasible. Finally, we suggest opportunities enabled by scalable behavioral research to move beyond single time-point assessment and toward dynamic models of behavior that more closely match clinical phenomena.
Collapse
Affiliation(s)
- Laura Germine
- Institute for Technology in Psychiatry, McLean Hospital, Belmont, MA, USA.
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA.
| | - Roger W Strong
- Institute for Technology in Psychiatry, McLean Hospital, Belmont, MA, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA
| | - Shifali Singh
- Institute for Technology in Psychiatry, McLean Hospital, Belmont, MA, USA
- Department of Psychiatry, Harvard Medical School, Boston, MA, USA
- Department of Psychiatry and Behavioral Sciences, Rush University Medical Center, Chicago, IL, USA
| | - Martin J Sliwinski
- Center for Healthy Aging, Pennsylvania State University, State College, PA, USA
| |
Collapse
|
5
|
Golovenkin SE, Bac J, Chervov A, Mirkes EM, Orlova YV, Barillot E, Gorban AN, Zinovyev A. Trajectories, bifurcations, and pseudo-time in large clinical datasets: applications to myocardial infarction and diabetes data. Gigascience 2020; 9:giaa128. [PMID: 33241287 PMCID: PMC7688475 DOI: 10.1093/gigascience/giaa128] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2020] [Revised: 09/30/2020] [Accepted: 10/22/2020] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND Large observational clinical datasets are becoming increasingly available for mining associations between various disease traits and administered therapy. These datasets can be considered as representations of the landscape of all possible disease conditions, in which a concrete disease state develops through stereotypical routes, characterized by "points of no return" and "final states" (such as lethal or recovery states). Extracting this information directly from the data remains challenging, especially in the case of synchronic (with a short-term follow-up) observations. RESULTS Here we suggest a semi-supervised methodology for the analysis of large clinical datasets, characterized by mixed data types and missing values, through modeling the geometrical data structure as a bouquet of bifurcating clinical trajectories. The methodology is based on application of elastic principal graphs, which can address simultaneously the tasks of dimensionality reduction, data visualization, clustering, feature selection, and quantifying the geodesic distances (pseudo-time) in partially ordered sequences of observations. The methodology allows a patient to be positioned on a particular clinical trajectory (pathological scenario) and the degree of progression along it to be characterized with a qualitative estimate of the uncertainty of the prognosis. We developed a tool ClinTrajan for clinical trajectory analysis implemented in the Python programming language. We test the methodology in 2 large publicly available datasets: myocardial infarction complications and readmission of diabetic patients data. CONCLUSIONS Our pseudo-time quantification-based approach makes it possible to apply the methods developed for dynamical disease phenotyping and illness trajectory analysis (diachronic data analysis) to synchronic observational data.
Collapse
Affiliation(s)
- Sergey E Golovenkin
- Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University, 660022 Krasnoyarsk, Russia
| | - Jonathan Bac
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Alexander Chervov
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Evgeny M Mirkes
- Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester, LE1 7RH Leicester, UK
- Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University, 603000 Nizhny Novgorod, Russia
| | - Yuliya V Orlova
- Prof. V.F. Voino-Yasenetsky Krasnoyarsk State Medical University, 660022 Krasnoyarsk, Russia
| | - Emmanuel Barillot
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| | - Alexander N Gorban
- Centre for Artificial Intelligence, Data Analytics and Modelling, University of Leicester, LE1 7RH Leicester, UK
- Laboratory of advanced methods for high-dimensional data analysis, Lobachevsky University, 603000 Nizhny Novgorod, Russia
| | - Andrei Zinovyev
- Institut Curie, PSL Research University, F-75005 Paris, France
- INSERM, U900, F-75005 Paris, France
- CBIO-Centre for Computational Biology, Mines ParisTech, PSL Research University, 75006 Paris, France
| |
Collapse
|
6
|
Live cell analysis at sea reveals divergent thermal performance between photosynthetic ocean microbial eukaryote populations. ISME JOURNAL 2019; 13:1374-1378. [PMID: 30705412 DOI: 10.1038/s41396-019-0355-6] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/01/2018] [Revised: 12/31/2018] [Accepted: 01/14/2019] [Indexed: 11/08/2022]
Abstract
Experimentation at sea provides insight into which traits of ocean microbes are linked to performance in situ. Here we show distinct patterns in thermal tolerance of microbial phototrophs from adjacent water masses sampled in the south-west Pacific Ocean, determined using a fluorescent marker for reactive oxygen species (ROS). ROS content of pico-eukaryotes was assessed after 1, 5 and 25 h of incubation along a temperature gradient (15.6-32.1 °C). Pico-eukaryotes from the East Australian Current (EAC) had relatively constant ROS and showed greatest mortality after 25 h at 7 °C below ambient, whereas those from the Tasman Sea had elevated ROS in both warm and cool temperature extremes and greatest mortality at temperatures 6-10 °C above ambient, interpreted as the outcome of thermal stress. Tracking of water masses within an oceanographic circulation model showed populations had distinct thermal histories, with EAC pico-eukaryotes experiencing higher average temperatures for at least 1 week prior to sampling. While acclimatization and community assembly could both influence biological responses, this study clearly demonstrates that phenotypic divergence occurs along planktonic drift trajectories.
Collapse
|
7
|
The future is now: cutting edge science and understanding toxicology. Cell Biol Toxicol 2018; 34:79-85. [DOI: 10.1007/s10565-018-9421-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2017] [Accepted: 01/09/2018] [Indexed: 12/13/2022]
|
8
|
Can the Single Cell Make Biomedicine Different? ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1068:1-6. [PMID: 29943291 DOI: 10.1007/978-981-13-0502-3_1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The single-cell as the basic unit of biological organs and tissues has recently been considered an important window to furthermore understand molecular mechanisms of organ function and biology. The current issue with a special focus on single cell biomedicine is the first effort to collect the evidence of disease-associated single cell research, define the significance of single cell biomedicine in the pathogenesis of diseases, value the correlation of single cell gene sequencing with disease-specific biomarkers, and monitor the dynamics of RNA processes and responses to microenvironmental changes and drug resistances.
Collapse
|