1
|
Kim C, Zhu Z, Barbazuk WB, Bacher RL, Vulpe CD. Time-course characterization of whole-transcriptome dynamics of HepG2/C3A spheroids and its toxicological implications. Toxicol Lett 2024; 401:125-138. [PMID: 39368564 DOI: 10.1016/j.toxlet.2024.10.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2024] [Revised: 09/10/2024] [Accepted: 10/02/2024] [Indexed: 10/07/2024]
Abstract
Physiologically relevant in vitro models are a priority in predictive toxicology to replace and/or reduce animal experiments. The compromised toxicant metabolism of many immortalized human liver cell lines grown as monolayers as compared to in vivo metabolism limits their physiological relevance. However, recent efforts to culture liver cells in a 3D environment, such as spheroids, to better mimic the in vivo conditions, may enhance the toxicant metabolism of human liver cell lines. In this study, we characterized the dynamic changes in the transcriptome of HepG2/C3A hepatocarcinoma cell spheroids maintained in a clinostat system (CelVivo) to gain insight into the metabolic capacity of this model as a function of spheroid size and culture time. We assessed morphological changes (size, necrotic core), cell health, and proliferation rate from initial spheroid seeding to 35 days of continuous culture in conjunction with a time-course (0, 3, 7, 10, 14, 21, 28 days) of the transcriptome (TempO-Seq, BioSpyder). The phenotypic characteristics of HepG2/C3A growing in spheroids were comparable to monolayer growth until ∼Day 12 (Day 10-14) when a significant decrease in cell doubling rate was noted which was concurrent with down-regulation of cell proliferation and cell cycle pathways over this time period. Principal component analysis of the transcriptome data suggests that the Day 3, 7, and 10 spheroids are pronouncedly different from the Day 14, 21, and 28 spheroids in support of a biological transition time point during the long-term 3D spheroid cultures. The expression of genes encoding cellular components involved in toxicant metabolism and transport rapidly increased during the early time points of spheroids to peak at Day 7 or Day 10 as compared to monolayer cultures with a gradual decrease in expression with further culture, suggesting the most metabolically responsive time window for exposure studies. Overall, we provide baseline information on the cellular and molecular characterization, with a particular focus on toxicant metabolic capacity dynamics and cell growth, of HepG2/C3A 3D spheroid cultures over time.
Collapse
Affiliation(s)
- Chanhee Kim
- Center for Human and Environmental Toxicology, Department of Physiological Sciences, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States
| | - Zhaohan Zhu
- Department of Biostatistics, University of Florida, Gainesville, FL, United States
| | - W Brad Barbazuk
- Department of Biology, University of Florida, Gainesville, FL, United States; University of Florida Genetics Institute, University of Florida, Gainesville, FL, United States
| | - Rhonda L Bacher
- Department of Biostatistics, University of Florida, Gainesville, FL, United States
| | - Christopher D Vulpe
- Center for Human and Environmental Toxicology, Department of Physiological Sciences, College of Veterinary Medicine, University of Florida, Gainesville, FL, United States.
| |
Collapse
|
2
|
Scott MA, Harvey KM, Karisch BB, Woolums AR, Tracy RM, Russell JR, Engel CL. Integrated Blood Transcriptome and Multi-Tissue Trace Mineral Analyses of Healthy Stocker Cattle Fed Complexed or Inorganic Trace Mineral Supplement. Animals (Basel) 2024; 14:2186. [PMID: 39123712 PMCID: PMC11311009 DOI: 10.3390/ani14152186] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 07/22/2024] [Accepted: 07/23/2024] [Indexed: 08/12/2024] Open
Abstract
Supplementing trace minerals is common in managing bovine respiratory disease (BRD) in post-weaned cattle; however, its influence on host immunity and metabolism in high-risk cattle remains unclear. We aimed to assess the impact of three supplementation programs on liver and serum trace element concentrations and blood gene expression. Fifty-six high-risk beef steers were randomly assigned to one of three groups over 60 days: (1) sulfate-sourced Cu, Co, Mn, and Zn (INR), (2) amino acid-complexed Cu, Mn, Co, and Zn (AAC), or (3) AAC plus trace mineral and vitamin drench (COMBO). Serum and liver biopsies for Cu, Co, Mn, and Zn at d0, d28, and d60 were analyzed from cattle free of BRD (n = 9 INR; n = 6 AAC; n = 10 COMBO). Differences and correlations of mineral concentrations were analyzed via generalized linear mixed models and Spearman's rank coefficients, respectively (p < 0.05). Whole blood RNA samples from healthy cattle (n = 4 INR; n = 4 AAC; n = 4 COMBO) at d0, d13, d28, d45, and d60 were sequenced and analyzed for differentially expressed genes (DEGs) via glmmSeq (FDR < 0.05), edgeR (FDR < 0.10), and Trendy (p < 0.10). Serum and liver Cu and Co concentrations increased over time in all groups, with higher liver Cu in COMBO (487.985 μg/g) versus AAC (392.043 μg/g) at d60 (p = 0.013). Serum and liver Cu concentrations (ρ = 0.579, p = 6.59 × 10-8) and serum and liver Co concentrations (ρ = 0.466, p = 2.80 × 10-5) were linearly correlated. Minimal gene expression differences were found between AAC versus COMBO (n = 2 DEGs) and INR versus COMBO (n = 0 DEGs) over time. AAC versus INR revealed 107 DEGs (d13-d60) with increased traits in AAC including metabolism of carbohydrates/fat-soluble vitamins, antigen presentation, ATPase activity, and B- and T-cell activation, while osteoclast differentiation and neutrophil degranulation decreased in AAC compared to INR. Our study identifies gene expression differences in high-risk cattle fed inorganic or amino acid-complexed mineral supplements, revealing adaptive immune and metabolic mechanisms that may be improved by organically sourced supplementation.
Collapse
Affiliation(s)
- Matthew A. Scott
- Veterinary Education, Research, and Outreach Program, Texas A&M University, Canyon, TX 79015, USA
| | - Kelsey M. Harvey
- Prairie Research Unit, Mississippi State University, Prairie, MS 39756, USA
| | - Brandi B. Karisch
- Department of Animal and Dairy Sciences, Mississippi State University, Starkville, MS 39762, USA
| | - Amelia R. Woolums
- Department of Pathobiology and Population Medicine, Mississippi State University, Starkville, MS 39762, USA
| | - Rebecca M. Tracy
- Veterinary Education, Research, and Outreach Program, Texas A&M University, Canyon, TX 79015, USA
| | | | | |
Collapse
|
3
|
Kose C, Lindsey-Boltz LA, Sancar A, Jiang Y. Genome-wide analysis of transcription-coupled repair reveals novel transcription events in Caenorhabditis elegans. PLoS Genet 2024; 20:e1011365. [PMID: 39028758 PMCID: PMC11290646 DOI: 10.1371/journal.pgen.1011365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Revised: 07/31/2024] [Accepted: 07/08/2024] [Indexed: 07/21/2024] Open
Abstract
Bulky DNA adducts such as those induced by ultraviolet light are removed from the genomes of multicellular organisms by nucleotide excision repair, which occurs through two distinct mechanisms, global repair, requiring the DNA damage recognition-factor XPC (xeroderma pigmentosum complementation group C), and transcription-coupled repair (TCR), which does not. TCR is initiated when elongating RNA polymerase II encounters DNA damage, and thus analysis of genome-wide excision repair in XPC-mutants only repairing by TCR provides a unique opportunity to map transcription events missed by methods dependent on capturing RNA transcription products and thus limited by their stability and/or modifications (5'-capping or 3'-polyadenylation). Here, we have performed eXcision Repair-sequencing (XR-seq) in the model organism Caenorhabditis elegans to generate genome-wide repair maps in a wild-type strain with normal excision repair, a strain lacking TCR (csb-1), and a strain that only repairs by TCR (xpc-1). Analysis of the intersections between the xpc-1 XR-seq repair maps with RNA-mapping datasets (RNA-seq, long- and short-capped RNA-seq) reveal previously unrecognized sites of transcription and further enhance our understanding of the genome of this important model organism.
Collapse
Affiliation(s)
- Cansu Kose
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, United States of America
| | - Laura A. Lindsey-Boltz
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, United States of America
| | - Aziz Sancar
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, United States of America
| | - Yuchao Jiang
- Department of Statistics, College of Arts and Sciences, Texas A&M University, College Station, Texas, United States of America
- Department of Biology, College of Arts and Sciences, Texas A&M University, College Station, Texas, United States of America
- Department of Biomedical Engineering, College of Engineering, Texas A&M University, College Station, Texas, United States of America
| |
Collapse
|
4
|
Latorraca LB, Galvão A, Rabaglino MB, D'Augero JM, Kelsey G, Fair T. Single-cell profiling reveals transcriptome dynamics during bovine oocyte growth. BMC Genomics 2024; 25:335. [PMID: 38580918 PMCID: PMC10998374 DOI: 10.1186/s12864-024-10234-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 03/18/2024] [Indexed: 04/07/2024] Open
Abstract
BACKGROUND Mammalian follicle development is characterized by extensive changes in morphology, endocrine responsiveness, and function, providing the optimum environment for oocyte growth, development, and resumption of meiosis. In cattle, the first signs of transcription activation in the oocyte are observed in the secondary follicle, later than during mouse and human oogenesis. While many studies have generated extensive datasets characterizing gene expression in bovine oocytes, they are mostly limited to the analysis of fully grown and matured oocytes. The aim of the present study was to apply single-cell RNA sequencing to interrogate the transcriptome of the growing bovine oocyte from the secondary follicle stage through to the mid-antral follicle stage. RESULTS Single-cell RNA-seq libraries were generated from oocytes of known diameters (< 60 to > 120 μm), and datasets were binned into non-overlapping size groups for downstream analysis. Combining the results of weighted gene co-expression network and Trendy analyses, and differently expressed genes (DEGs) between size groups, we identified a decrease in oxidative phosphorylation and an increase in maternal -genes and transcription regulators across the bovine oocyte growth phase. In addition, around 5,000 genes did not change in expression, revealing a cohort of stable genes. An interesting switch in gene expression profile was noted in oocytes greater than 100 μm in diameter, when the expression of genes related to cytoplasmic activities was replaced by genes related to nuclear activities (e.g., chromosome segregation). The highest number of DEGs were detected in the comparison of oocytes 100-109 versus 110-119 μm in diameter, revealing a profound change in the molecular profile of oocytes at the end of their growth phase. CONCLUSIONS The current study provides a unique dataset of the key genes and pathways characteristic of each stage of oocyte development, contributing an important resource for a greater understanding of bovine oogenesis.
Collapse
Affiliation(s)
| | - António Galvão
- Epigenetics Programme, The Babraham Institute, Cambridge, UK
- Department of Comparative Biomedical Sciences, Royal Veterinary College, London, UK
| | - Maria Belen Rabaglino
- Department of Population Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Yalelaan 7, 3584 CL, Utrecht, The Netherlands
| | | | - Gavin Kelsey
- Epigenetics Programme, The Babraham Institute, Cambridge, UK
- Centre for Trophoblast Research, University of Cambridge, Cambridge, UK
- Wellcome-MRC Institute of Metabolic Science-Metabolic Research Laboratories, Cambridge, UK
| | - Trudee Fair
- School of Agriculture and Food Science, University College Dublin, Dublin, Ireland.
| |
Collapse
|
5
|
Toh H, Smolentsev A, Sadjadi R, Clegg D, Yan J, Stewart R, Thomson JA, Jiang P. Transcriptomic clock predicts vascular changes of prodromal diabetic retinopathy. Sci Rep 2023; 13:12968. [PMID: 37563287 PMCID: PMC10415264 DOI: 10.1038/s41598-023-40328-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2023] [Accepted: 08/08/2023] [Indexed: 08/12/2023] Open
Abstract
Diabetic retinopathy is a common complication of long-term diabetes and that could lead to vision loss. Unfortunately, early diabetic retinopathy remains poorly understood. There is no effective way to prevent or treat early diabetic retinopathy until patients develop later stages of diabetic retinopathy. Elevated acellular capillary density is considered a reliable quantitative trait present in the early development of retinopathy. Hence, in this study, we interrogated whole retinal vascular transcriptomic changes via a Nile rat model to better understand the early pathogenesis of diabetic retinopathy. We uncovered the complexity of associations between acellular capillary density and the joint factors of blood glucose, diet, and sex, which was modeled through a Bayesian network. Using segmented regressions, we have identified different gene expression patterns and enriched Gene Ontology (GO) terms associated with acellular capillary density increasing. We developed a random forest regression model based on expression patterns of 14 genes to predict the acellular capillary density. Since acellular capillary density is a reliable quantitative trait in early diabetic retinopathy, and thus our model can be used as a transcriptomic clock to measure the severity of the progression of early retinopathy. We also identified NVP-TAE684, geldanamycin, and NVP-AUY922 as the top three potential drugs which can potentially attenuate the early DR. Although we need more in vivo studies in the future to support our re-purposed drugs, we have provided a data-driven approach to drug discovery.
Collapse
Affiliation(s)
- Huishi Toh
- Neuroscience Research Institute, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Alexander Smolentsev
- Neuroscience Research Institute, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Ryan Sadjadi
- Neuroscience Research Institute, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Dennis Clegg
- Neuroscience Research Institute, University of California Santa Barbara, Santa Barbara, CA, USA
- Department of Molecular, Cellular and Developmental Biology, University of California Santa Barbara, Santa Barbara, CA, USA
| | - Jingqi Yan
- Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH, 44115, USA
- Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, 44115, USA
| | - Ron Stewart
- Morgridge Institute For Research, Madison, WI, 53706, USA
| | - James A Thomson
- Neuroscience Research Institute, University of California Santa Barbara, Santa Barbara, CA, USA
- Department of Molecular, Cellular and Developmental Biology, University of California Santa Barbara, Santa Barbara, CA, USA
- Morgridge Institute For Research, Madison, WI, 53706, USA
| | - Peng Jiang
- Department of Biological, Geological and Environmental Sciences, Cleveland State University, Cleveland, OH, 44115, USA.
- Center for Gene Regulation in Health and Disease, Cleveland State University, Cleveland, OH, 44115, USA.
- Center for RNA Science and Therapeutics, School of Medicine, Case Western Reserve University, 10900 Euclid Avenue, Cleveland, OH, 44106, USA.
| |
Collapse
|
6
|
Dong X, Tree J, Banadyga L, He S, Zhu W, Tipton T, Gouriet J, Qiu X, Elmore MJ, Hall Y, Carroll M, Hiscox JA. Linked Mutations in the Ebola Virus Polymerase Are Associated with Organ Specific Phenotypes. Microbiol Spectr 2023; 11:e0415422. [PMID: 36946725 PMCID: PMC10101120 DOI: 10.1128/spectrum.04154-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 02/20/2023] [Indexed: 03/23/2023] Open
Abstract
Ebola virus (EBOV) causes a severe infection called Ebola virus disease (EVD). The pathogenesis of EBOV infection is complex, and outcome has been associated with a variety of immunological and cellular factors. Disease can result from several mechanisms, including direct organ and endothelial cell damage as a result of viral replication. During the2013 to 2016 Western Africa EBOV outbreak, several mutants emerged, with changes in the genes of nucleoprotein (NP), glycoprotein (GP), and the large (L) protein. Reverse genetic analysis has been used to investigate whether these mutations played any role in pathogenesis with mixed results depending on the experimental system used. Previous studies investigated the impact of three single nonsynonymous mutations (GP-A82V, NP-R111C, and L-D759G) on the fatality rate of mouse and ferret models and suggested that the L-D759G mutation decreased the virulence of EBOV. In this study, the effect of these three mutations was further evaluated by deep sequencing to determine viral population genetics and the host response in longitudinal samples of blood, liver, kidney, spleen, and lung tissues taken from the previous ferret model. The data indicated that the mutations were maintained in the different tissues, but the frequency of minor genomic mutations were different. In addition, compared to wild-type virus, the recombinant mutants had different within host effects, where the D759G (and accompanying Q986H) substitution in the L protein resulted in an upregulation of the immune response in the kidney, liver, spleen, and lungs. Together these studies provide insights into the biology of EBOV mutants both between and within hosts. IMPORTANCE Ebola virus infection can have dramatic effects on the human body which manifest in Ebola virus disease. The outcome of infection is either survival or death and in the former group with the potential of longer-term health consequences and persistent infection. Disease severity is undoubtedly associated with the host response, often with overt inflammatory responses correlated with poorer outcomes. The scale of the2013 to 2016 Western African Ebola virus outbreak revealed new aspects of viral biology. This included the emergence of mutants with potentially altered virulence. Biobanked tissue from ferret models of EBOV infected with different mutants that emerged in the Western Africa outbreak was used to investigate the effect of EBOV genomic variation in different tissues. Overall, the work provided insights into the population genetics of EBOV and showed that different organs in an animal model can respond differently to variants of EBOV.
Collapse
Affiliation(s)
- Xiaofeng Dong
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
| | - Julia Tree
- UK-Health Security Agency, Salisbury, United Kingdom
| | - Logan Banadyga
- Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada
| | - Shihua He
- Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada
| | - Wenjun Zhu
- Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada
| | - Tom Tipton
- UK-Health Security Agency, Salisbury, United Kingdom
| | - Jade Gouriet
- UK-Health Security Agency, Salisbury, United Kingdom
| | - Xiangguo Qiu
- Special Pathogens Program, National Microbiology Laboratory, Public Health Agency of Canada, Winnipeg, Canada
| | | | - Yper Hall
- UK-Health Security Agency, Salisbury, United Kingdom
| | - Miles Carroll
- Wellcome Centre for Human Genetics, Nuffield Department of Medicine, Oxford University, Oxford, United Kingdom
- Pandemic Sciences Institute, Nuffield Department of Medicine, Oxford University, Oxford, United Kingdom
| | - Julian A. Hiscox
- Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, United Kingdom
- Infectious Diseases Horizontal Technology Centre (ID HTC), A*STAR, Singapore, Singapore
| |
Collapse
|
7
|
Feigin C, Li S, Moreno J, Mallarino R. The GRN concept as a guide for evolutionary developmental biology. JOURNAL OF EXPERIMENTAL ZOOLOGY. PART B, MOLECULAR AND DEVELOPMENTAL EVOLUTION 2023; 340:92-104. [PMID: 35344632 PMCID: PMC9515236 DOI: 10.1002/jez.b.23132] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 03/08/2022] [Accepted: 03/11/2022] [Indexed: 12/13/2022]
Abstract
Organismal phenotypes result largely from inherited developmental programs, usually executed during embryonic and juvenile life stages. These programs are not blank slates onto which natural selection can draw arbitrary forms. Rather, the mechanisms of development play an integral role in shaping phenotypic diversity and help determine the evolutionary trajectories of species. Modern evolutionary biology must, therefore, account for these mechanisms in both theory and in practice. The gene regulatory network (GRN) concept represents a potent tool for achieving this goal whose utility has grown in tandem with advances in "omic" technologies and experimental techniques. However, while the GRN concept is widely utilized, it is often less clear what practical implications it has for conducting research in evolutionary developmental biology. In this Perspective, we attempt to provide clarity by discussing how experiments and projects can be designed in light of the GRN concept. We first map familiar biological notions onto the more abstract components of GRN models. We then review how diverse functional genomic approaches can be directed toward the goal of constructing such models and discuss current methods for functionally testing evolutionary hypotheses that arise from them. Finally, we show how the major steps of GRN model construction and experimental validation suggest generalizable workflows that can serve as a scaffold for project design. Taken together, the practical implications that we draw from the GRN concept provide a set of guideposts for studies aiming at unraveling the molecular basis of phenotypic diversity.
Collapse
Affiliation(s)
- Charles Feigin
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA,School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
| | - Sha Li
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA
| | - Jorge Moreno
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA
| | - Ricardo Mallarino
- Department of Molecular Biology, Princeton University, Princeton, New Jersey, USA
| |
Collapse
|
8
|
Ma C, Chitra U, Zhang S, Raphael BJ. Belayer: Modeling discrete and continuous spatial variation in gene expression from spatially resolved transcriptomics. Cell Syst 2022; 13:786-797.e13. [PMID: 36265465 PMCID: PMC9814896 DOI: 10.1016/j.cels.2022.09.002] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2022] [Revised: 07/13/2022] [Accepted: 09/06/2022] [Indexed: 01/26/2023]
Abstract
Spatially resolved transcriptomics (SRT) technologies measure gene expression at known locations in a tissue slice, enabling the identification of spatially varying genes or cell types. Current approaches for these tasks assume either that gene expression varies continuously across a tissue or that a tissue contains a small number of regions with distinct cellular composition. We propose a model for SRT data from layered tissues that includes both continuous and discrete spatial variation in expression and an algorithm, Belayer, to learn the parameters of this model. Belayer models gene expression as a piecewise linear function of the relative depth of a tissue layer with possible discontinuities at layer boundaries. We use conformal maps to model relative depth and derive a dynamic programming algorithm to infer layer boundaries and gene expression functions. Belayer accurately identifies tissue layers and biologically meaningful spatially varying genes in SRT data from the brain and skin.
Collapse
Affiliation(s)
- Cong Ma
- Department of Computer Science, Princeton University, 35 Olden St, Princeton, NJ 08540, USA
| | - Uthsav Chitra
- Department of Computer Science, Princeton University, 35 Olden St, Princeton, NJ 08540, USA
| | - Shirley Zhang
- Department of Computer Science, Princeton University, 35 Olden St, Princeton, NJ 08540, USA
| | - Benjamin J Raphael
- Department of Computer Science, Princeton University, 35 Olden St, Princeton, NJ 08540, USA.
| |
Collapse
|
9
|
Cui EH, Song D, Wong WK, Li JJ. Single-cell generalized trend model (scGTM): a flexible and interpretable model of gene expression trend along cell pseudotime. Bioinformatics 2022; 38:3927-3934. [PMID: 35758616 PMCID: PMC9991897 DOI: 10.1093/bioinformatics/btac423] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2021] [Revised: 06/17/2022] [Accepted: 06/23/2022] [Indexed: 02/07/2023] Open
Abstract
MOTIVATION Modeling single-cell gene expression trends along cell pseudotime is a crucial analysis for exploring biological processes. Most existing methods rely on nonparametric regression models for their flexibility; however, nonparametric models often provide trends too complex to interpret. Other existing methods use interpretable but restrictive models. Since model interpretability and flexibility are both indispensable for understanding biological processes, the single-cell field needs a model that improves the interpretability and largely maintains the flexibility of nonparametric regression models. RESULTS Here, we propose the single-cell generalized trend model (scGTM) for capturing a gene's expression trend, which may be monotone, hill-shaped or valley-shaped, along cell pseudotime. The scGTM has three advantages: (i) it can capture non-monotonic trends that are easy to interpret, (ii) its parameters are biologically interpretable and trend informative, and (iii) it can flexibly accommodate common distributions for modeling gene expression counts. To tackle the complex optimization problems, we use the particle swarm optimization algorithm to find the constrained maximum likelihood estimates for the scGTM parameters. As an application, we analyze several single-cell gene expression datasets using the scGTM and show that scGTM can capture interpretable gene expression trends along cell pseudotime and reveal molecular insights underlying biological processes. AVAILABILITY AND IMPLEMENTATION The Python package scGTM is open-access and available at https://github.com/ElvisCuiHan/scGTM. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Elvis Han Cui
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772, USA
| | - Dongyuan Song
- Bioinformatics Interdepartmental Ph.D. Program, University of California, Los Angeles, CA 90095-7246, USA
| | - Weng Kee Wong
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772, USA
| | - Jingyi Jessica Li
- Department of Biostatistics, University of California, Los Angeles, CA 90095-1772, USA.,Bioinformatics Interdepartmental Ph.D. Program, University of California, Los Angeles, CA 90095-7246, USA.,Department of Statistics, University of California, Los Angeles, CA 90095-1554, USA.,Department of Computational Medicine, University of California, Los Angeles, CA 90095-1766, USA.,Department of Human Genetics, University of California, Los Angeles, CA 90095-7088, USA
| |
Collapse
|
10
|
Buisan R, Moriano J, Andirkó A, Boeckx C. A Brain Region-Specific Expression Profile for Genes Within Large Introgression Deserts and Under Positive Selection in Homo sapiens. Front Cell Dev Biol 2022; 10:824740. [PMID: 35557944 PMCID: PMC9086289 DOI: 10.3389/fcell.2022.824740] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Accepted: 04/04/2022] [Indexed: 11/13/2022] Open
Abstract
Analyses of ancient DNA from extinct hominins have provided unique insights into the complex evolutionary history of Homo sapiens, intricately related to that of the Neanderthals and the Denisovans as revealed by several instances of admixture events. These analyses have also allowed the identification of introgression deserts: genomic regions in our species that are depleted of "archaic" haplotypes. The presence of genes like FOXP2 in these deserts has been taken to be suggestive of brain-related functional differences between Homo species. Here, we seek a deeper characterization of these regions and the specific expression trajectories of genes within them, taking into account signals of positive selection in our lineage. Analyzing publicly available transcriptomic data from the human brain at different developmental stages, we found that structures outside the cerebral neocortex, in particular the cerebellum, the striatum and the mediodorsal nucleus of the thalamus show the most divergent transcriptomic profiles when considering genes within large introgression deserts and under positive selection.
Collapse
Affiliation(s)
| | - Juan Moriano
- Universitat de Barcelona, Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems, Barcelona, Spain
| | - Alejandro Andirkó
- Universitat de Barcelona, Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems, Barcelona, Spain
| | - Cedric Boeckx
- Universitat de Barcelona, Barcelona, Spain
- Universitat de Barcelona Institute of Complex Systems, Barcelona, Spain
- Catalan Institute for Research and Advanced Studies (ICREA), Barcelona, Spain
| |
Collapse
|
11
|
Gao Y, Selee B, Schnabel EL, Poehlman WL, Chavan SA, Frugoli JA, Feltus FA. Time Series Transcriptome Analysis in Medicago truncatula Shoot and Root Tissue During Early Nodulation. FRONTIERS IN PLANT SCIENCE 2022; 13:861639. [PMID: 35463395 PMCID: PMC9021838 DOI: 10.3389/fpls.2022.861639] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/24/2022] [Accepted: 02/28/2022] [Indexed: 06/14/2023]
Abstract
In response to colonization by rhizobia bacteria, legumes are able to form nitrogen-fixing nodules in their roots, allowing the plants to grow efficiently in nitrogen-depleted environments. Legumes utilize a complex, long-distance signaling pathway to regulate nodulation that involves signals in both roots and shoots. We measured the transcriptional response to treatment with rhizobia in both the shoots and roots of Medicago truncatula over a 72-h time course. To detect temporal shifts in gene expression, we developed GeneShift, a novel computational statistics and machine learning workflow that addresses the time series replicate the averaging issue for detecting gene expression pattern shifts under different conditions. We identified both known and novel genes that are regulated dynamically in both tissues during early nodulation including leginsulin, defensins, root transporters, nodulin-related, and circadian clock genes. We validated over 70% of the expression patterns that GeneShift discovered using an independent M. truncatula RNA-Seq study. GeneShift facilitated the discovery of condition-specific temporally differentially expressed genes in the symbiotic nodulation biological system. In principle, GeneShift should work for time-series gene expression profiling studies from other systems.
Collapse
Affiliation(s)
- Yueyao Gao
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Bradley Selee
- Department of Electrical and Computer Engineering, Clemson University, Clemson, SC, United States
| | - Elise L. Schnabel
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - William L. Poehlman
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
- Sage Bionetworks, Seattle, WA, United States
| | - Suchitra A. Chavan
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Julia A. Frugoli
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
| | - Frank Alex Feltus
- Department of Genetics and Biochemistry, Clemson University, Clemson, SC, United States
- Biomedical Data Science and Informatics Program, Clemson University, Clemson, SC, United States
- Clemson Center for Human Genetics, Greenwood, SC, United States
| |
Collapse
|
12
|
Jang J, Hwang I, Jung I. TimesVector-Web: A Web Service for Analysing Time Course Transcriptome Data with Multiple Conditions. Genes (Basel) 2021; 13:73. [PMID: 35052413 PMCID: PMC8775016 DOI: 10.3390/genes13010073] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2021] [Revised: 12/19/2021] [Accepted: 12/22/2021] [Indexed: 11/27/2022] Open
Abstract
From time course gene expression data, we may identify genes that modulate in a certain pattern across time. Such patterns are advantageous to investigate the transcriptomic response to a certain condition. Especially, it is of interest to compare two or more conditions to detect gene expression patterns that significantly differ between them. Time course analysis can become difficult using traditional differentially expressed gene (DEG) analysis methods since they are based on pair-wise sample comparison instead of a series of time points. Most importantly, the related tools are mostly available as local Software, requiring technical expertise. Here, we present TimesVector-web, which is an easy to use web service for analysing time course gene expression data with multiple conditions. The web-service was developed to (1) alleviate the burden for analyzing multi-class time course data and (2) provide downstream analysis on the results for biological interpretation including TF, miRNA target, gene ontology and pathway analysis. TimesVector-web was validated using three case studies that use both microarray and RNA-seq time course data and showed that the results captured important biological findings from the original studies.
Collapse
Affiliation(s)
| | | | - Inuk Jung
- Department of Computer Science and Engineering, Kyungpook National University, Buk-gu, Deagu 41566, Korea; (J.J.); (I.H.)
| |
Collapse
|
13
|
Thind AS, Monga I, Thakur PK, Kumari P, Dindhoria K, Krzak M, Ranson M, Ashford B. Demystifying emerging bulk RNA-Seq applications: the application and utility of bioinformatic methodology. Brief Bioinform 2021; 22:6330938. [PMID: 34329375 DOI: 10.1093/bib/bbab259] [Citation(s) in RCA: 47] [Impact Index Per Article: 11.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Revised: 06/14/2021] [Accepted: 06/18/2021] [Indexed: 12/13/2022] Open
Abstract
Significant innovations in next-generation sequencing techniques and bioinformatics tools have impacted our appreciation and understanding of RNA. Practical RNA sequencing (RNA-Seq) applications have evolved in conjunction with sequence technology and bioinformatic tools advances. In most projects, bulk RNA-Seq data is used to measure gene expression patterns, isoform expression, alternative splicing and single-nucleotide polymorphisms. However, RNA-Seq holds far more hidden biological information including details of copy number alteration, microbial contamination, transposable elements, cell type (deconvolution) and the presence of neoantigens. Recent novel and advanced bioinformatic algorithms developed the capacity to retrieve this information from bulk RNA-Seq data, thus broadening its scope. The focus of this review is to comprehend the emerging bulk RNA-Seq-based analyses, emphasizing less familiar and underused applications. In doing so, we highlight the power of bulk RNA-Seq in providing biological insights.
Collapse
Affiliation(s)
- Amarinder Singh Thind
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Isha Monga
- Columbia University, New York City, NY, USA
| | | | - Pallawi Kumari
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Kiran Dindhoria
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | | | - Marie Ranson
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| | - Bruce Ashford
- University of Wollongong, Wollongong, Australia.,Illawarra Health and Medical Research Institute, Wollongong, Australia
| |
Collapse
|
14
|
Ferreira M, Francisco S, Soares AR, Nobre A, Pinheiro M, Reis A, Neto S, Rodrigues AJ, Sousa N, Moura G, Santos MAS. Integration of segmented regression analysis with weighted gene correlation network analysis identifies genes whose expression is remodeled throughout physiological aging in mouse tissues. Aging (Albany NY) 2021; 13:18150-18190. [PMID: 34330884 PMCID: PMC8351669 DOI: 10.18632/aging.203379] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 07/21/2021] [Indexed: 02/06/2023]
Abstract
Gene expression alterations occurring with aging have been described for a multitude of species, organs, and cell types. However, most of the underlying studies rely on static comparisons of mean gene expression levels between age groups and do not account for the dynamics of gene expression throughout the lifespan. These studies also tend to disregard the pairwise relationships between gene expression profiles, which may underlie commonly altered pathways and regulatory mechanisms with age. To overcome these limitations, we have combined segmented regression analysis with weighted gene correlation network analysis (WGCNA) to identify high-confidence signatures of aging in the brain, heart, liver, skeletal muscle, and pancreas of C57BL/6 mice in a publicly available RNA-Seq dataset (GSE132040). Functional enrichment analysis of the overlap of genes identified in both approaches showed that immune- and inflammation-related responses are prominently altered in the brain and the liver, while in the heart and the muscle, aging affects amino and fatty acid metabolism, and tissue regeneration, respectively, which reflects an age-related global loss of tissue function. We also explored sexual dimorphism in the aging mouse transcriptome and found the liver and the muscle to have the most pronounced gender differences in gene expression throughout the lifespan, particularly in proteostasis-related pathways. While the data showed little overlap among the age-dysregulated genes between tissues, aging triggered common biological processes in distinct tissues, which we highlight as important features of murine tissue physiological aging.
Collapse
Affiliation(s)
- Margarida Ferreira
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Stephany Francisco
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Ana R. Soares
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Ana Nobre
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Miguel Pinheiro
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Andreia Reis
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Sonya Neto
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Ana João Rodrigues
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga 4710-057, Portugal
- ICVS/3B’s–PT Government Associate Laboratory, Braga/Guimarães, Portugal
| | - Nuno Sousa
- Life and Health Sciences Research Institute (ICVS), School of Medicine, University of Minho, Braga 4710-057, Portugal
- ICVS/3B’s–PT Government Associate Laboratory, Braga/Guimarães, Portugal
| | - Gabriela Moura
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| | - Manuel A. S. Santos
- Institute of Biomedicine – iBiMED, Department of Medical Sciences, University of Aveiro, Aveiro 3810-193, Portugal
| |
Collapse
|
15
|
Brown J, Barry C, Schmitz MT, Argus C, Bolin JM, Schwartz MP, Van Aartsen A, Steill J, Swanson S, Stewart R, Thomson JA, Kendziorski C. Interspecies chimeric conditions affect the developmental rate of human pluripotent stem cells. PLoS Comput Biol 2021; 17:e1008778. [PMID: 33647016 PMCID: PMC7951976 DOI: 10.1371/journal.pcbi.1008778] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2020] [Revised: 03/11/2021] [Accepted: 02/08/2021] [Indexed: 12/17/2022] Open
Abstract
Human pluripotent stem cells hold significant promise for regenerative medicine. However, long differentiation protocols and immature characteristics of stem cell-derived cell types remain challenges to the development of many therapeutic applications. In contrast to the slow differentiation of human stem cells in vitro that mirrors a nine-month gestation period, mouse stem cells develop according to a much faster three-week gestation timeline. Here, we tested if co-differentiation with mouse pluripotent stem cells could accelerate the differentiation speed of human embryonic stem cells. Following a six-week RNA-sequencing time course of neural differentiation, we identified 929 human genes that were upregulated earlier and 535 genes that exhibited earlier peaked expression profiles in chimeric cell cultures than in human cell cultures alone. Genes with accelerated upregulation were significantly enriched in Gene Ontology terms associated with neurogenesis, neuron differentiation and maturation, and synapse signaling. Moreover, chimeric mixed samples correlated with in utero human embryonic samples earlier than human cells alone, and acceleration was dose-dependent on human-mouse co-culture ratios. The altered gene expression patterns and developmental rates described in this report have implications for accelerating human stem cell differentiation and the use of interspecies chimeric embryos in developing human organs for transplantation.
Collapse
Affiliation(s)
- Jared Brown
- Department of Statistics, University of Wisconsin-Madison, Wisconsin, United States of America
- * E-mail: (JB); (CK)
| | - Christopher Barry
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Matthew T. Schmitz
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Cara Argus
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Jennifer M. Bolin
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Michael P. Schwartz
- NSF Center for Sustainable Nanotechnology, Department of Chemistry, University of Wisconsin-Madison, Wisconsin, United States of America
| | - Amy Van Aartsen
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - John Steill
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Scott Swanson
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - Ron Stewart
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
| | - James A. Thomson
- Morgridge Institute for Research, Madison, Wisconsin, United States of America
- Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, Wisconsin, United States of America
- Department of Molecular, Cellular, and Developmental Biology, University of California, Santa Barbara, California, United States of America
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Wisconsin, United States of America
- * E-mail: (JB); (CK)
| |
Collapse
|
16
|
Oh VKS, Li RW. Temporal Dynamic Methods for Bulk RNA-Seq Time Series Data. Genes (Basel) 2021; 12:352. [PMID: 33673721 PMCID: PMC7997275 DOI: 10.3390/genes12030352] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2021] [Revised: 02/19/2021] [Accepted: 02/22/2021] [Indexed: 02/06/2023] Open
Abstract
Dynamic studies in time course experimental designs and clinical approaches have been widely used by the biomedical community. These applications are particularly relevant in stimuli-response models under environmental conditions, characterization of gradient biological processes in developmental biology, identification of therapeutic effects in clinical trials, disease progressive models, cell-cycle, and circadian periodicity. Despite their feasibility and popularity, sophisticated dynamic methods that are well validated in large-scale comparative studies, in terms of statistical and computational rigor, are less benchmarked, comparing to their static counterparts. To date, a number of novel methods in bulk RNA-Seq data have been developed for the various time-dependent stimuli, circadian rhythms, cell-lineage in differentiation, and disease progression. Here, we comprehensively review a key set of representative dynamic strategies and discuss current issues associated with the detection of dynamically changing genes. We also provide recommendations for future directions for studying non-periodical, periodical time course data, and meta-dynamic datasets.
Collapse
Affiliation(s)
- Vera-Khlara S. Oh
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
- Department of Computer Science and Statistics, College of Natural Sciences, Jeju National University, Jeju City 63243, Korea
| | - Robert W. Li
- Animal Genomics and Improvement Laboratory, United States Department of Agriculture, Agricultural Research Service, Beltsville, MD 20705, USA;
| |
Collapse
|
17
|
Identification and characterization of dynamically regulated hepatitis-related genes in a concanavalin A-induced liver injury model. Aging (Albany NY) 2020; 12:23187-23199. [PMID: 33221747 PMCID: PMC7746381 DOI: 10.18632/aging.104089] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 08/31/2020] [Indexed: 12/11/2022]
Abstract
Background: Concanavalin A (ConA)-induced liver damage of mice is a well-established murine model mimicking the human autoimmune hepatitis (AIH). However, the pathogenic genes of the liver injury remain to be revealed. Methods: Using time-series liver transcriptome, top dynamic genes were inferred from a set of segmented regression models, and cross-checked by weighted correlation network analysis (WGCNA). AIH murine models created by ConA were used to verify the in vivo effect of these genes. Results: We identified 115 top dynamic genes, of which most were overlapped with the hub genes determined by WGCNA. The expression of several top dynamic genes including Cd63, Saa3, Slc10a1, Nrxn1, Ugt2a3, were verified in vivo. Further, Cluster determinant 63 (Cd63) knockdown in mice treated with ConA showed significantly less liver pathology and inflammation as well as higher survival rates than the corresponding controls. Conclusion: We have identified the top dynamic genes related to the process of acute liver injury, and highlighted a targeted strategy for Cd63 might have utility for the protection of hepatocellular damage.
Collapse
|
18
|
Jiang P, Chamberlain CS, Vanderby R, Thomson JA, Stewart R. TimeMeter assesses temporal gene expression similarity and identifies differentially progressing genes. Nucleic Acids Res 2020; 48:e51. [PMID: 32123905 PMCID: PMC7229845 DOI: 10.1093/nar/gkaa142] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Revised: 02/03/2020] [Accepted: 02/26/2020] [Indexed: 01/02/2023] Open
Abstract
Comparative time series transcriptome analysis is a powerful tool to study development, evolution, aging, disease progression and cancer prognosis. We develop TimeMeter, a statistical method and tool to assess temporal gene expression similarity, and identify differentially progressing genes where one pattern is more temporally advanced than the other. We apply TimeMeter to several datasets, and show that TimeMeter is capable of characterizing complicated temporal gene expression associations. Interestingly, we find: (i) the measurement of differential progression provides a novel feature in addition to pattern similarity that can characterize early developmental divergence between two species; (ii) genes exhibiting similar temporal patterns between human and mouse during neural differentiation are under strong negative (purifying) selection during evolution; (iii) analysis of genes with similar temporal patterns in mouse digit regeneration and axolotl blastema differentiation reveals common gene groups for appendage regeneration with potential implications in regenerative medicine.
Collapse
Affiliation(s)
- Peng Jiang
- Regenerative Biology Laboratory, Morgridge Institute for Research, Madison, WI 53707, USA
| | - Connie S Chamberlain
- Department of Orthopedics and Rehabilitation, University of Wisconsin, Madison, WI 53706, USA
| | - Ray Vanderby
- Department of Orthopedics and Rehabilitation, University of Wisconsin, Madison, WI 53706, USA.,Department of Biomedical Engineering, University of Wisconsin, Madison, WI 53706, USA
| | - James A Thomson
- Regenerative Biology Laboratory, Morgridge Institute for Research, Madison, WI 53707, USA.,Department of Molecular, Cellular and Developmental Biology, University of California, Santa Barbara, CA 93106, USA
| | - Ron Stewart
- Regenerative Biology Laboratory, Morgridge Institute for Research, Madison, WI 53707, USA
| |
Collapse
|
19
|
An In Vitro Human Segmentation Clock Model Derived from Embryonic Stem Cells. Cell Rep 2020; 28:2247-2255.e5. [PMID: 31461642 PMCID: PMC6814198 DOI: 10.1016/j.celrep.2019.07.090] [Citation(s) in RCA: 46] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2019] [Revised: 07/09/2019] [Accepted: 07/24/2019] [Indexed: 11/22/2022] Open
Abstract
Defects in somitogenesis result in vertebral malformations at birth known as spondylocostal dysostosis (SCDO). Somites are formed with a species-specific periodicity controlled by the “segmentation clock,” which comprises a group of oscillatory genes in the presomitic mesoderm. Here, we report that a segmentation clock model derived from human embryonic stem cells shows many hallmarks of the mammalian segmentation clock in vivo, including a dependence on the NOTCH and WNT signaling pathways. The gene expression oscillations are highly synchronized, displaying a periodicity specific to the human clock. Introduction of a point of mutation into HES7, a specific mutation previously associated with clinical SCDO, eliminated clock gene oscillations, successfully reproducing the defects in the segmentation clock. Thus, we provide a model for studying the previously inaccessible human segmentation clock to better understand the mechanisms contributing to congenital skeletal defects. The segmentation clock is a molecular oscillator regulating the tempo of somite formation in a species-specific manner. Chu et al. report an embryonic-stem-cell-derived model system displaying a human-specific gene oscillation periodicity, which can shed light on human somitogenesis and model skeletal developmental disorders.
Collapse
|
20
|
Van den Berge K, Roux de Bézieux H, Street K, Saelens W, Cannoodt R, Saeys Y, Dudoit S, Clement L. Trajectory-based differential expression analysis for single-cell sequencing data. Nat Commun 2020; 11:1201. [PMID: 32139671 PMCID: PMC7058077 DOI: 10.1038/s41467-020-14766-3] [Citation(s) in RCA: 363] [Impact Index Per Article: 72.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2019] [Accepted: 01/14/2020] [Indexed: 12/31/2022] Open
Abstract
Trajectory inference has radically enhanced single-cell RNA-seq research by enabling the study of dynamic changes in gene expression. Downstream of trajectory inference, it is vital to discover genes that are (i) associated with the lineages in the trajectory, or (ii) differentially expressed between lineages, to illuminate the underlying biological processes. Current data analysis procedures, however, either fail to exploit the continuous resolution provided by trajectory inference, or fail to pinpoint the exact types of differential expression. We introduce tradeSeq, a powerful generalized additive model framework based on the negative binomial distribution that allows flexible inference of both within-lineage and between-lineage differential expression. By incorporating observation-level weights, the model additionally allows to account for zero inflation. We evaluate the method on simulated datasets and on real datasets from droplet-based and full-length protocols, and show that it yields biological insights through a clear interpretation of the data.
Collapse
Affiliation(s)
- Koen Van den Berge
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium
- Department of Statistics, University of California, Berkeley, CA, USA
| | - Hector Roux de Bézieux
- Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, USA
- Center for Computational Biology, University of California, Berkeley, CA, USA
| | - Kelly Street
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
- Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Wouter Saelens
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
| | - Robrecht Cannoodt
- Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
- Center for Medical Genetics, Ghent University Hospital, Ghent, Belgium
- Department of Biomolecular Medicine, Ghent University, Ghent, Belgium
| | - Yvan Saeys
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium
- Data mining and Modelling for Biomedicine, VIB Center for Inflammation Research, Ghent, Belgium
| | - Sandrine Dudoit
- Department of Statistics, University of California, Berkeley, CA, USA.
- Division of Biostatistics, School of Public Health, University of California, Berkeley, CA, USA.
- Center for Computational Biology, University of California, Berkeley, CA, USA.
| | - Lieven Clement
- Department of Applied Mathematics, Computer Science and Statistics, Ghent University, Ghent, Belgium.
- Bioinformatics Institute Ghent, Ghent University, Ghent, Belgium.
| |
Collapse
|
21
|
Ray S, Valekunja UK, Stangherlin A, Howell SA, Snijders AP, Damodaran G, Reddy AB. Circadian rhythms in the absence of the clock gene Bmal1. Science 2020; 367:800-806. [PMID: 32054765 DOI: 10.1126/science.aaw7365] [Citation(s) in RCA: 135] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2019] [Revised: 07/18/2019] [Accepted: 01/14/2020] [Indexed: 12/14/2022]
Abstract
Circadian (~24 hour) clocks have a fundamental role in regulating daily physiology. The transcription factor BMAL1 is a principal driver of a molecular clock in mammals. Bmal1 deletion abolishes 24-hour activity patterning, one measure of clock output. We determined whether Bmal1 function is necessary for daily molecular oscillations in skin fibroblasts and liver slices. Unexpectedly, in Bmal1 knockout mice, both tissues exhibited 24-hour oscillations of the transcriptome, proteome, and phosphoproteome over 2 to 3 days in the absence of any exogenous drivers such as daily light or temperature cycles. This demonstrates a competent 24-hour molecular pacemaker in Bmal1 knockouts. We suggest that such oscillations might be underpinned by transcriptional regulation by the recruitment of ETS family transcription factors, and nontranscriptionally by co-opting redox oscillations.
Collapse
Affiliation(s)
- Sandipan Ray
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Utham K Valekunja
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| | - Alessandra Stangherlin
- Institute of Metabolic Science, University of Cambridge, Addenbrooke's Hospital, Cambridge CB2 0QQ, UK
| | | | | | | | - Akhilesh B Reddy
- Department of Systems Pharmacology and Translational Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA.
- Institute for Translational Medicine and Therapeutics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
- Institute for Diabetes, Obesity, and Metabolism, University of Pennsylvania Perelman School of Medicine, Philadelphia, PA 19104, USA
- Chronobiology and Sleep institute (CSI), Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA 19104, USA
| |
Collapse
|
22
|
Chandereng T, Gitter A. Lag penalized weighted correlation for time series clustering. BMC Bioinformatics 2020; 21:21. [PMID: 31948388 PMCID: PMC6966853 DOI: 10.1186/s12859-019-3324-1] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2019] [Accepted: 12/16/2019] [Indexed: 11/17/2022] Open
Abstract
BACKGROUND The similarity or distance measure used for clustering can generate intuitive and interpretable clusters when it is tailored to the unique characteristics of the data. In time series datasets generated with high-throughput biological assays, measurements such as gene expression levels or protein phosphorylation intensities are collected sequentially over time, and the similarity score should capture this special temporal structure. RESULTS We propose a clustering similarity measure called Lag Penalized Weighted Correlation (LPWC) to group pairs of time series that exhibit closely-related behaviors over time, even if the timing is not perfectly synchronized. LPWC aligns time series profiles to identify common temporal patterns. It down-weights aligned profiles based on the length of the temporal lags that are introduced. We demonstrate the advantages of LPWC versus existing time series and general clustering algorithms. In a simulated dataset based on the biologically-motivated impulse model, LPWC is the only method to recover the true clusters for almost all simulated genes. LPWC also identifies clusters with distinct temporal patterns in our yeast osmotic stress response and axolotl limb regeneration case studies. CONCLUSIONS LPWC achieves both of its time series clustering goals. It groups time series with correlated changes over time, even if those patterns occur earlier or later in some of the time series. In addition, it refrains from introducing large shifts in time when searching for temporal patterns by applying a lag penalty. The LPWC R package is available at https://github.com/gitter-lab/LPWC and CRAN under a MIT license.
Collapse
Affiliation(s)
- Thevaa Chandereng
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI USA
- Morgridge Institute of Research, Madison, WI USA
- Department of Statistics, University of Wisconsin-Madison, Madison, WI USA
| | - Anthony Gitter
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI USA
- Morgridge Institute of Research, Madison, WI USA
| |
Collapse
|
23
|
Barry C, Schmitz MT, Argus C, Bolin JM, Probasco MD, Leng N, Duffin BM, Steill J, Swanson S, McIntosh BE, Stewart R, Kendziorski C, Thomson JA, Bacher R. Automated minute scale RNA-seq of pluripotent stem cell differentiation reveals early divergence of human and mouse gene expression kinetics. PLoS Comput Biol 2019; 15:e1007543. [PMID: 31815944 PMCID: PMC6922475 DOI: 10.1371/journal.pcbi.1007543] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2019] [Revised: 12/19/2019] [Accepted: 11/12/2019] [Indexed: 12/22/2022] Open
Abstract
Pluripotent stem cells retain the developmental timing of their species of origin in vitro, an observation that suggests the existence of a cell-intrinsic developmental clock, yet the nature and machinery of the clock remain a mystery. We hypothesize that one possible component may lie in species-specific differences in the kinetics of transcriptional responses to differentiation signals. Using a liquid-handling robot, mouse and human pluripotent stem cells were exposed to identical neural differentiation conditions and sampled for RNA-sequencing at high frequency, every 4 or 10 minutes, for the first 10 hours of differentiation to test for differences in transcriptomic response rates. The majority of initial transcriptional responses occurred within a rapid window in the first minutes of differentiation for both human and mouse stem cells. Despite similarly early onsets of gene expression changes, we observed shortened and condensed gene expression patterns in mouse pluripotent stem cells compared to protracted trends in human pluripotent stem cells. Moreover, the speed at which individual genes were upregulated, as measured by the slopes of gene expression changes over time, was significantly faster in mouse compared to human cells. These results suggest that downstream transcriptomic response kinetics to signaling cues are faster in mouse versus human cells, and may offer a partial account for the vast differences in developmental rates across species.
Collapse
Affiliation(s)
- Christopher Barry
- Morgridge Institute for Research, Madison, WI, United States of America
| | | | - Cara Argus
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Jennifer M. Bolin
- Morgridge Institute for Research, Madison, WI, United States of America
| | | | - Ning Leng
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Bret M. Duffin
- Morgridge Institute for Research, Madison, WI, United States of America
| | - John Steill
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Scott Swanson
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Brian E. McIntosh
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Ron Stewart
- Morgridge Institute for Research, Madison, WI, United States of America
| | - Christina Kendziorski
- Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI, United States of America
| | - James A. Thomson
- Morgridge Institute for Research, Madison, WI, United States of America
- Department of Cell and Regenerative Biology, University of Wisconsin School of Medicine and Public Health, Madison, WI, United States of America
- Department of Molecular, Cellular, and Developmental Biology, University of California, Santa Barbara, CA, United States of America
| | - Rhonda Bacher
- Department of Biostatistics, University of Florida, Gainesville, FL, United States of America
| |
Collapse
|
24
|
Chasman D, Iyer N, Fotuhi Siahpirani A, Estevez Silva M, Lippmann E, McIntosh B, Probasco MD, Jiang P, Stewart R, Thomson JA, Ashton RS, Roy S. Inferring Regulatory Programs Governing Region Specificity of Neuroepithelial Stem Cells during Early Hindbrain and Spinal Cord Development. Cell Syst 2019; 9:167-186.e12. [PMID: 31302154 DOI: 10.1016/j.cels.2019.05.012] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2018] [Revised: 05/05/2019] [Accepted: 05/30/2019] [Indexed: 12/19/2022]
Abstract
Neuroepithelial stem cells (NSC) from different anatomical regions of the embryonic neural tube's rostrocaudal axis can differentiate into diverse central nervous system tissues, but the transcriptional regulatory networks governing these processes are incompletely understood. Here, we measure region-specific NSC gene expression along the rostrocaudal axis in a human pluripotent stem cell model of early central nervous system development over a 72-h time course, spanning the hindbrain to cervical spinal cord. We introduce Escarole, a probabilistic clustering algorithm for non-stationary time series, and combine it with prior-based regulatory network inference to identify genes that are regulated dynamically and predict their upstream regulators. We identify known regulators of patterning and neural development, including the HOX genes, and predict a direct regulatory connection between the transcription factor POU3F2 and target gene STMN2. We demonstrate that POU3F2 is required for expression of STMN2, suggesting that this regulatory connection is important for region specificity of NSCs.
Collapse
Affiliation(s)
- Deborah Chasman
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA
| | - Nisha Iyer
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Biomedical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Alireza Fotuhi Siahpirani
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Maria Estevez Silva
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Biomedical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Ethan Lippmann
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Biomedical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Brian McIntosh
- Regenerative Biology Theme, Morgridge Institute for Research, Madison, WI 53715, USA
| | - Mitchell D Probasco
- Regenerative Biology Theme, Morgridge Institute for Research, Madison, WI 53715, USA
| | - Peng Jiang
- Regenerative Biology Theme, Morgridge Institute for Research, Madison, WI 53715, USA
| | - Ron Stewart
- Regenerative Biology Theme, Morgridge Institute for Research, Madison, WI 53715, USA
| | - James A Thomson
- Regenerative Biology Theme, Morgridge Institute for Research, Madison, WI 53715, USA
| | - Randolph S Ashton
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Biomedical Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | - Sushmita Roy
- Wisconsin Institute for Discovery, University of Wisconsin-Madison, Madison, WI 53715, USA; Department of Biostatistics and Medical Informatics, University of Wisconsin-Madison, Madison, WI 53792, USA.
| |
Collapse
|