1
|
Chen CM, Yu R. A multi-step completion process model of cell plasticity. Brief Bioinform 2025; 26:bbaf165. [PMID: 40223810 PMCID: PMC11995008 DOI: 10.1093/bib/bbaf165] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2024] [Revised: 02/11/2025] [Accepted: 03/25/2025] [Indexed: 04/15/2025] Open
Abstract
Plasticity is the potential for cells or cell populations to change their phenotypes and behaviors in response to internal or external cues. Plasticity is fundamental to many complex biological processes, yet to date there remains a lack of mathematical models that can elucidate and predict molecular behaviors in a plasticity program. Here, we report a new mathematical framework that models cell plasticity as a multi-step completion process, where the system moves from the initial state along a path guided by multiple intermediate attractors until the final state (i.e. a new homeostasis) is reached. Using omics time-series data as model input, we show that our method fits data well; identifies attractor states by their timing and molecular markers which are well-aligned with domain knowledge; and can make quantitative and time-resolved predictions such as the molecular outcomes of blocking a plasticity program from reaching completion, to an R2 of 0.53-0.63. We demonstrate that application of our model to primary patient-derived data can provide quantitative insights and predictions that may be useful in guiding further research and potential biomedical interventions.
Collapse
Affiliation(s)
- Chen M Chen
- Department of Molecular Developmental Biology, Radboud Institute for Molecular Life Sciences, Faculty of Science, Radboud University, Geert Grooteplein-Zuid 26-28, Nijmegen, 6525GA, The Netherlands
| | - Rosemary Yu
- Department of Molecular Developmental Biology, Radboud Institute for Molecular Life Sciences, Faculty of Science, Radboud University, Geert Grooteplein-Zuid 26-28, Nijmegen, 6525GA, The Netherlands
| |
Collapse
|
2
|
Delbare SYN, Venkatraman S, Scuderi K, Wells MT, Wolfner MF, Basu S, Clark AG. Time series transcriptome analysis implicates the circadian clock in the Drosophila melanogaster female's response to sex peptide. Proc Natl Acad Sci U S A 2023; 120:e2214883120. [PMID: 36706221 PMCID: PMC9945991 DOI: 10.1073/pnas.2214883120] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2022] [Accepted: 12/28/2022] [Indexed: 01/28/2023] Open
Abstract
Sex peptide (SP), a seminal fluid protein of Drosophila melanogaster males, has been described as driving a virgin-to-mated switch in females, through eliciting an array of responses including increased egg laying, activity, and food intake and a decreased remating rate. While it is known that SP achieves this, at least in part, by altering neuronal signaling in females, the genetic architecture and temporal dynamics of the female's response to SP remain elusive. We used a high-resolution time series RNA-sequencing dataset of female heads at 10 time points within the first 24 h after mating to learn about the genetic architecture, at the gene and exon levels, of the female's response to SP. We find that SP is not essential to trigger early aspects of a virgin-to-mated transcriptional switch, which includes changes in a metabolic gene regulatory network. However, SP is needed to maintain and diversify metabolic changes and to trigger changes in a neuronal gene regulatory network. We further find that SP alters rhythmic gene expression in females and suggests that SP's disruption of the female's circadian rhythm might be key to its widespread effects.
Collapse
Affiliation(s)
- Sofie Y. N. Delbare
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY14853
- Department of Statistics & Data Science, Cornell University, Ithaca, NY14853
| | - Sara Venkatraman
- Department of Statistics & Data Science, Cornell University, Ithaca, NY14853
| | - Kate Scuderi
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY14853
| | - Martin T. Wells
- Department of Statistics & Data Science, Cornell University, Ithaca, NY14853
| | - Mariana F. Wolfner
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY14853
| | - Sumanta Basu
- Department of Statistics & Data Science, Cornell University, Ithaca, NY14853
| | - Andrew G. Clark
- Department of Molecular Biology & Genetics, Cornell University, Ithaca, NY14853
| |
Collapse
|
3
|
Devi Priya R, Sivaraj R, Abraham A, Pravin T, Sivasankar P, Anitha N. Multi-Objective Particle Swarm Optimization Based Preprocessing of Multi-Class Extremely Imbalanced Datasets. INT J UNCERTAIN FUZZ 2022. [DOI: 10.1142/s0218488522500209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
Today’s datasets are usually very large with many features and making analysis on such datasets is really a tedious task. Especially when performing classification, selecting attributes that are salient for the process is a brainstorming task. It is more difficult when there are many class labels for the target class attribute and hence many researchers have introduced methods to select features for performing classification on multi-class attributes. The process becomes more tedious when the attribute values are imbalanced for which researchers have contributed many methods. But, there is no sufficient research to handle extreme imbalance and feature selection together and hence this paper aims to bridge this gap. Here Particle Swarm Optimization (PSO), an efficient evolutionary algorithm is used to handle imbalanced dataset and feature selection process is also enhanced with the required functionalities. First, Multi-objective Particle Swarm Optimization is used to transform the imbalanced datasets into balanced one and then another version of Multi-objective Particle Swarm Optimization is used to select the significant features. The proposed methodology is applied on eight multi-class extremely imbalanced datasets and the experimental results are found to be better than other existing methods in terms of classification accuracy, G mean, F measure. The results validated by using Friedman test also confirm that the proposed methodology effectively balances the dataset with less number of features than other methods.
Collapse
Affiliation(s)
- R. Devi Priya
- Department of Computer Science and Engineering, Centre for IoT and Artificial Intelligence, KPR Institute of Engineering and Technology, Coimbatore, TamilNadu, India
| | - R. Sivaraj
- Department of Computer Science and Engineering, Nandha Engineering College, Erode, TamilNadu, India
| | - Ajith Abraham
- Center for Artificial Intelligence, Innopolis University, Innopolis, Russia
- Machine Intelligence Research Labs (MIR Labs), Auburn, Washington 98071, USA
| | - T. Pravin
- Department of Mechanical Engineering, SNS College of Engineering, Coimbatore, India
| | - P. Sivasankar
- Department of Petroleum Engineering & Earth Sciences, Indian Institute of Petroleum and Energy, Visakhapatnam, India
| | - N. Anitha
- Department of Information Technology, Kongu Engineering College, Erode, TamilNadu, India
| |
Collapse
|
4
|
Secondary Metabolism Gene Clusters Exhibit Increasingly Dynamic and Differential Expression during Asexual Growth, Conidiation, and Sexual Development in Neurospora crassa. mSystems 2022; 7:e0023222. [PMID: 35638725 PMCID: PMC9239088 DOI: 10.1128/msystems.00232-22] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Secondary metabolite clusters (SMCs) encode the machinery for fungal toxin production. However, understanding their function and analyzing their products requires investigation of the developmental and environmental conditions in which they are expressed. Gene expression is often restricted to specific and unexamined stages of the life cycle. Therefore, we applied comparative genomics analyses to identify SMCs in Neurospora crassa and analyzed extensive transcriptomic data spanning nine independent experiments from diverse developmental and environmental conditions to reveal their life cycle-specific gene expression patterns. We reported 20 SMCs comprising 177 genes-a manageable set for investigation of the roles of SMCs across the life cycle of the fungal model N. crassa-as well as gene sets coordinately expressed in 18 predicted SMCs during asexual and sexual growth under three nutritional and two temperature conditions. Divergent activity of SMCs between asexual and sexual development was reported. Of 126 SMC genes that we examined for knockout phenotypes, al-2 and al-3 exhibited phenotypes in asexual growth and conidiation, whereas os-5, poi-2, and pmd-1 exhibited phenotypes in sexual development. SMCs with annotated function in mating and crossing were actively regulated during the switch between asexual and sexual growth. Our discoveries call for attention to roles that SMCs may play in the regulatory switches controlling mode of development, as well as the ecological associations of those developmental stages that may influence expression of SMCs. IMPORTANCE Secondary metabolites (SMs) are low-molecular-weight compounds that often mediate interactions between fungi and their environments. Fungi enriched with SMs are of significant research interest to agriculture and medicine, especially from the aspects of pathogen ecology and environmental epidemiology. However, SM clusters (SMCs) that have been predicted by comparative genomics alone have typically been poorly defined and insufficiently functionally annotated. Therefore, we have investigated coordinate expression in SMCs in the model system N. crassa, and our results suggest that SMCs respond to environmental signals and to stress that are associated with development. This study examined SMC regulation at the level of RNA to integrate observations and knowledge of these genes in various growth and development conditions, supporting combining comparative genomics and inclusive transcriptomics to improve computational annotation of SMCs. Our findings call for detailed study of the function of SMCs during the asexual-sexual switch, a key, often-overlooked developmental stage.
Collapse
|
5
|
Sharma A, Johnson KB, Bie B, Rhoades EE, Sen A, Kida Y, Hockings J, Gatta A, Davenport J, Arcangelini C, Ritzu J, DeVecchio J, Hughen R, Wei M, Thomas Budd G, Lynn Henry N, Eng C, Foss J, Rotroff DM. A Multimodal Approach to Discover Biomarkers for Taxane-Induced Peripheral Neuropathy (TIPN): A Study Protocol. Technol Cancer Res Treat 2022; 21:15330338221127169. [PMID: 36172750 PMCID: PMC9523841 DOI: 10.1177/15330338221127169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022] Open
Abstract
Introduction: Taxanes are a class of chemotherapeutics commonly used to treat various solid tumors, including breast and ovarian cancers. Taxane-induced peripheral neuropathy (TIPN) occurs in up to 70% of patients, impacting quality of life both during and after treatment. TIPN typically manifests as tingling and numbness in the hands and feet and can cause irreversible loss of function of peripheral nerves. TIPN can be dose-limiting, potentially impacting clinical outcomes. The mechanisms underlying TIPN are poorly understood. As such, there are limited treatment options and no tools to provide early detection of those who will develop TIPN. Although some patients may have a genetic predisposition, genetic biomarkers have been inconsistent in predicting chemotherapy-induced peripheral neuropathy (CIPN). Moreover, other molecular markers (eg, metabolites, mRNA, miRNA, proteins) may be informative for predicting CIPN, but remain largely unexplored. We anticipate that combinations of multiple biomarkers will be required to consistently predict those who will develop TIPN. Methods: To address this clinical gap of identifying patients at risk of TIPN, we initiated the Genetics and Inflammatory Markers for CIPN (GENIE) study. This longitudinal multicenter observational study uses a novel, multimodal approach to evaluate genomic variation, metabolites, DNA methylation, gene expression, and circulating cytokines/chemokines prior to, during, and after taxane treatment in 400 patients with breast cancer. Molecular and patient reported data will be collected prior to, during, and after taxane therapy. Multi-modal data will be used to develop a set of comprehensive predictive biomarker signatures of TIPN. Conclusion: The goal of this study is to enable early detection of patients at risk of developing TIPN, provide a tool to modify taxane treatment to minimize morbidity from TIPN, and improved patient quality of life. Here we provide a brief review of the current state of research into CIPN and TIPN and introduce the GENIE study design.
Collapse
Affiliation(s)
- Anukriti Sharma
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, OH, USA
| | - Ken B. Johnson
- Department of Anesthesiology, University of Utah, UT, USA
| | - Bihua Bie
- Department of Anesthesiology, Cleveland Clinic, OH, USA
| | | | - Alper Sen
- Department of Anesthesiology, University of Utah, UT, USA
| | - Yuri Kida
- Department of Anesthesiology, University of Utah, UT, USA
| | - Jennifer Hockings
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
- Department of Pharmacy, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
| | - Alycia Gatta
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
| | | | | | | | - Jennifer DeVecchio
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
| | - Ron Hughen
- Department of Anesthesiology, University of Utah, UT, USA
| | - Mei Wei
- Huntsman Cancer Institute, University of Utah, Salt Lake City, UT
| | - G. Thomas Budd
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - N. Lynn Henry
- University of Michigan Rogel Cancer Center, Ann Arbor, MI, USA
| | - Charis Eng
- Taussig Cancer Institute, Cleveland Clinic, OH, USA
- Genomic Medicine Institute, Lerner Research Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Department of Genetics and Genome Sciences, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Joseph Foss
- Department of Anesthesiology, Cleveland Clinic, OH, USA
| | - Daniel M. Rotroff
- Department of Quantitative Health Sciences, Lerner Research Institute, Cleveland Clinic, OH, USA
- Cleveland Clinic Lerner College of Medicine, Case Western Reserve University, Cleveland, OH, USA
- Case Comprehensive Cancer Center, Case Western Reserve University School of Medicine, Cleveland, OH, USA
- Endocrinology and Metabolism Institute, Cleveland Clinic, Cleveland, OH, USA
| |
Collapse
|
6
|
Oh VKS, Li RW. Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data. Genes (Basel) 2021; 12:352. [PMID: 33673721 PMCID: PMC7997275 DOI: 10.3390/genes12030352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 02/19/2021] [Accepted: 02/22/2021] [Indexed: 02/06/2023] Open
Abstract
Dynamic studies in time course experimental designs and clinical approaches have been widely used by the biomedical community. These applications are particularly relevant in stimuli-response models under environmental conditions, characterization of gradient biological processes in developmental biology, identification of therapeutic effects in clinical trials, disease progressive models, cell-cycle, and circadian periodicity. Despite their feasibility and popularity, sophisticated dynamic methods that are well validated in large-scale comparative studies, in terms of statistical and computational rigor, are less benchmarked, comparing to their static counterparts. To date, a number of novel methods in bulk RNA-Seq data have been developed for the various time-dependent stimuli, circadian rhythms, cell-lineage in differentiation, and disease progression. Here, we comprehensively review a key set of representative dynamic strategies and discuss current issues associated with the detection of dynamically changing genes. We also provide recommendations for future directions for studying non-periodical, periodical time course data, and meta-dynamic datasets.
Collapse
Affiliation(s)
- Vera-Khlara S. Oh
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
- Department of Computer Science and Statistics, College of Natural Sciences, Jeju National University, Jeju City 63243, Korea
| | - Robert W. Li
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
| |
Collapse
|
7
|
Eicher T, Kinnebrew G, Patt A, Spencer K, Ying K, Ma Q, Machiraju R, Mathé EA. Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources. Metabolites 2020; 10:E202. [PMID: 32429287 PMCID: PMC7281435 DOI: 10.3390/metabo10050202] [Citation(s) in RCA: 72] [Impact Index Per Article: 14.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 05/07/2020] [Accepted: 05/13/2020] [Indexed: 02/06/2023] Open
Abstract
As researchers are increasingly able to collect data on a large scale from multiple clinical and omics modalities, multi-omics integration is becoming a critical component of metabolomics research. This introduces a need for increased understanding by the metabolomics researcher of computational and statistical analysis methods relevant to multi-omics studies. In this review, we discuss common types of analyses performed in multi-omics studies and the computational and statistical methods that can be used for each type of analysis. We pinpoint the caveats and considerations for analysis methods, including required parameters, sample size and data distribution requirements, sources of a priori knowledge, and techniques for the evaluation of model accuracy. Finally, for the types of analyses discussed, we provide examples of the applications of corresponding methods to clinical and basic research. We intend that our review may be used as a guide for metabolomics researchers to choose effective techniques for multi-omics analyses relevant to their field of study.
Collapse
Affiliation(s)
- Tara Eicher
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
- Computer Science and Engineering Department, The Ohio State University College of Engineering, Columbus, OH 43210, USA
| | - Garrett Kinnebrew
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
- Comprehensive Cancer Center, The Ohio State University and James Cancer Hospital, Columbus, OH 43210, USA;
- Bioinformatics Shared Resource Group, The Ohio State University, Columbus, OH 43210, USA
| | - Andrew Patt
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, NIH, 9800 Medical Center Dr., Rockville, MD, 20892, USA;
- Biomedical Sciences Graduate Program, The Ohio State University, Columbus, OH 43210, USA
| | - Kyle Spencer
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
- Biomedical Sciences Graduate Program, The Ohio State University, Columbus, OH 43210, USA
- Nationwide Children’s Research Hospital, Columbus, OH 43210, USA
| | - Kevin Ying
- Comprehensive Cancer Center, The Ohio State University and James Cancer Hospital, Columbus, OH 43210, USA;
- Molecular, Cellular and Developmental Biology Program, The Ohio State University, Columbus, OH 43210, USA
| | - Qin Ma
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
| | - Raghu Machiraju
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
- Computer Science and Engineering Department, The Ohio State University College of Engineering, Columbus, OH 43210, USA
- Department of Pathology, Wexner Medical Center, The Ohio State University, Columbus, OH 43210, USA
- Translational Data Analytics Institute, The Ohio State University, Columbus, OH 43210, USA
| | - Ewy A. Mathé
- Biomedical Informatics Department, The Ohio State University College of Medicine, Columbus, OH 43210, USA; (T.E.); (G.K.); (K.S.); (Q.M.); (R.M.)
- Division of Preclinical Innovation, National Center for Advancing Translational Sciences, NIH, 9800 Medical Center Dr., Rockville, MD, 20892, USA;
| |
Collapse
|