1
|
Cui EH, Zhang Z, Chen CJ, Wong WK. Applications of nature-inspired metaheuristic algorithms for tackling optimization problems across disciplines. Sci Rep 2024; 14:9403. [PMID: 38658593 PMCID: PMC11043462 DOI: 10.1038/s41598-024-56670-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2023] [Accepted: 03/08/2024] [Indexed: 04/26/2024] Open
Abstract
Nature-inspired metaheuristic algorithms are important components of artificial intelligence, and are increasingly used across disciplines to tackle various types of challenging optimization problems. This paper demonstrates the usefulness of such algorithms for solving a variety of challenging optimization problems in statistics using a nature-inspired metaheuristic algorithm called competitive swarm optimizer with mutated agents (CSO-MA). This algorithm was proposed by one of the authors and its superior performance relative to many of its competitors had been demonstrated in earlier work and again in this paper. The main goal of this paper is to show a typical nature-inspired metaheuristic algorithmi, like CSO-MA, is efficient for tackling many different types of optimization problems in statistics. Our applications are new and include finding maximum likelihood estimates of parameters in a single cell generalized trend model to study pseudotime in bioinformatics, estimating parameters in the commonly used Rasch model in education research, finding M-estimates for a Cox regression in a Markov renewal model, performing matrix completion tasks to impute missing data for a two compartment model, and selecting variables optimally in an ecology problem in China. To further demonstrate the flexibility of metaheuristics, we also find an optimal design for a car refueling experiment in the auto industry using a logistic model with multiple interacting factors. In addition, we show that metaheuristics can sometimes outperform optimization algorithms commonly used in statistics.
Collapse
Affiliation(s)
- Elvis Han Cui
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA.
| | - Zizhao Zhang
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA
- Alibaba Group, Alibaba, Hangzhou, 310099, China
| | - Culsome Junwen Chen
- Department of Environmental Science, Tsinghua University, Beijing, 100084, China
| | - Weng Kee Wong
- Department of Biostatistics, University of California, Los Angeles, CA, 90095, USA.
- The Department of Statistics, National Cheng Kung University, Tainan, Taiwan.
| |
Collapse
|
2
|
Ng-Blichfeldt JP, Stewart BJ, Clatworthy MR, Williams JM, Röper K. Identification of a core transcriptional program driving the human renal mesenchymal-to-epithelial transition. Dev Cell 2024; 59:595-612.e8. [PMID: 38340720 PMCID: PMC7616043 DOI: 10.1016/j.devcel.2024.01.011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 11/28/2023] [Accepted: 01/17/2024] [Indexed: 02/12/2024]
Abstract
During kidney development, nephron epithelia arise de novo from fate-committed mesenchymal progenitors through a mesenchymal-to-epithelial transition (MET). Downstream of fate specification, transcriptional mechanisms that drive establishment of epithelial morphology are poorly understood. We used human iPSC-derived renal organoids, which recapitulate nephrogenesis, to investigate mechanisms controlling renal MET. Multi-ome profiling via snRNA-seq and ATAC-seq of organoids identified dynamic changes in gene expression and chromatin accessibility driven by activators and repressors throughout MET. CRISPR interference identified that paired box 8 (PAX8) is essential for initiation of MET in human renal organoids, contrary to in vivo mouse studies, likely by activating a cell-adhesion program. While Wnt/β-catenin signaling specifies nephron fate, we find that it must be attenuated to allow hepatocyte nuclear factor 1-beta (HNF1B) and TEA-domain (TEAD) transcription factors to drive completion of MET. These results identify the interplay between fate commitment and morphogenesis in the developing human kidney, with implications for understanding both developmental kidney diseases and aberrant epithelial plasticity following adult renal tubular injury.
Collapse
Affiliation(s)
- John-Poul Ng-Blichfeldt
- MRC-Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, UK
| | - Benjamin J Stewart
- Molecular Immunity Unit, Department of Medicine, MRC-Laboratory of Molecular Biology, University of Cambridge, Cambridge, UK; Cellular Genetics, Wellcome Sanger Institute, Hinxton, UK
| | - Menna R Clatworthy
- Molecular Immunity Unit, Department of Medicine, MRC-Laboratory of Molecular Biology, University of Cambridge, Cambridge, UK; Cambridge Institute of Therapeutic Immunology and Infectious Diseases, University of Cambridge, Cambridge, UK
| | - Julie M Williams
- Bioscience Renal, Research and Early Development, Cardiovascular, Renal and Metabolism, BioPharmaceuticals R&D, AstraZeneca, Gothenburg, Sweden
| | - Katja Röper
- MRC-Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge Biomedical Campus, Cambridge CB2 0QH, UK.
| |
Collapse
|
3
|
Weatherbee BAT, Gantner CW, Iwamoto-Stohl LK, Daza RM, Hamazaki N, Shendure J, Zernicka-Goetz M. Pluripotent stem cell-derived model of the post-implantation human embryo. Nature 2023; 622:584-593. [PMID: 37369347 PMCID: PMC10584688 DOI: 10.1038/s41586-023-06368-y] [Citation(s) in RCA: 117] [Impact Index Per Article: 58.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Accepted: 06/23/2023] [Indexed: 06/29/2023]
Abstract
The human embryo undergoes morphogenetic transformations following implantation into the uterus, but our knowledge of this crucial stage is limited by the inability to observe the embryo in vivo. Models of the embryo derived from stem cells are important tools for interrogating developmental events and tissue-tissue crosstalk during these stages1. Here we establish a model of the human post-implantation embryo, a human embryoid, comprising embryonic and extraembryonic tissues. We combine two types of extraembryonic-like cell generated by overexpression of transcription factors with wild-type embryonic stem cells and promote their self-organization into structures that mimic several aspects of the post-implantation human embryo. These self-organized aggregates contain a pluripotent epiblast-like domain surrounded by extraembryonic-like tissues. Our functional studies demonstrate that the epiblast-like domain robustly differentiates into amnion, extraembryonic mesenchyme and primordial germ cell-like cells in response to bone morphogenetic protein cues. In addition, we identify an inhibitory role for SOX17 in the specification of anterior hypoblast-like cells2. Modulation of the subpopulations in the hypoblast-like compartment demonstrates that extraembryonic-like cells influence epiblast-like domain differentiation, highlighting functional tissue-tissue crosstalk. In conclusion, we present a modular, tractable, integrated3 model of the human embryo that will enable us to probe key questions of human post-implantation development, a critical window during which substantial numbers of pregnancies fail.
Collapse
Affiliation(s)
- Bailey A T Weatherbee
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Carlos W Gantner
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Lisa K Iwamoto-Stohl
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK
| | - Riza M Daza
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Nobuhiko Hamazaki
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
| | - Jay Shendure
- Department of Genome Sciences, University of Washington School of Medicine, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, Seattle, WA, USA
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA
| | - Magdalena Zernicka-Goetz
- Department of Physiology, Development and Neuroscience, University of Cambridge, Cambridge, UK.
- Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA.
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA.
| |
Collapse
|
4
|
Kannan S, Miyamoto M, Zhu R, Lynott M, Guo J, Chen EZ, Colas AR, Lin BL, Kwon C. Trajectory reconstruction identifies dysregulation of perinatal maturation programs in pluripotent stem cell-derived cardiomyocytes. Cell Rep 2023; 42:112330. [PMID: 37014753 PMCID: PMC10545814 DOI: 10.1016/j.celrep.2023.112330] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 02/12/2023] [Accepted: 03/20/2023] [Indexed: 04/05/2023] Open
Abstract
A limitation in the application of pluripotent stem cell-derived cardiomyocytes (PSC-CMs) is the failure of these cells to achieve full functional maturity. The mechanisms by which directed differentiation differs from endogenous development, leading to consequent PSC-CM maturation arrest, remain unclear. Here, we generate a single-cell RNA sequencing (scRNA-seq) reference of mouse in vivo CM maturation with extensive sampling of previously difficult-to-isolate perinatal time periods. We subsequently generate isogenic embryonic stem cells to create an in vitro scRNA-seq reference of PSC-CM-directed differentiation. Through trajectory reconstruction, we identify an endogenous perinatal maturation program that is poorly recapitulated in vitro. By comparison with published human datasets, we identify a network of nine transcription factors (TFs) whose targets are consistently dysregulated in PSC-CMs across species. Notably, these TFs are only partially activated in common ex vivo approaches to engineer PSC-CM maturation. Our study can be leveraged toward improving the clinical viability of PSC-CMs.
Collapse
Affiliation(s)
- Suraj Kannan
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Matthew Miyamoto
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Renjun Zhu
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Michaela Lynott
- Sanford Burham Prebys Medical Discovery Institute, San Diego, CA, USA
| | - Jason Guo
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Elaine Zhelan Chen
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Alexandre R Colas
- Sanford Burham Prebys Medical Discovery Institute, San Diego, CA, USA
| | - Brian Leei Lin
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA
| | - Chulan Kwon
- Division of Cardiology, Department of Medicine, Johns Hopkins School of Medicine, Baltimore, MD, USA; Department of Biomedical Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA; Institute for Cell Engineering, Johns Hopkins School of Medicine, Baltimore, MD, USA.
| |
Collapse
|
5
|
Zheng Z, Qiu X, Wu H, Chang L, Tang X, Zou L, Li J, Wu Y, Zhou J, Jiang S, Wan Y, Ni Q. TIPS: trajectory inference of pathway significance through pseudotime comparison for functional assessment of single-cell RNAseq data. Brief Bioinform 2021; 22:6255997. [PMID: 34370020 PMCID: PMC8425418 DOI: 10.1093/bib/bbab124] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Revised: 03/13/2021] [Accepted: 03/16/2021] [Indexed: 01/18/2023] Open
Abstract
Recent advances in bioinformatics analyses have led to the development of novel tools enabling the capture and trajectory mapping of single-cell RNA sequencing (scRNAseq) data. However, there is a lack of methods to assess the contributions of biological pathways and transcription factors to an overall developmental trajectory mapped from scRNAseq data. In this manuscript, we present a simplified approach for trajectory inference of pathway significance (TIPS) that leverages existing knowledgebases of functional pathways and other gene lists to provide further mechanistic insights into a biological process. TIPS identifies key pathways which contribute to a process of interest, as well as the individual genes that best reflect these changes. TIPS also provides insight into the relative timing of pathway changes, as well as a suite of visualizations to enable simplified data interpretation of scRNAseq libraries generated using a wide range of techniques. The TIPS package can be run through either a web server or downloaded as a user-friendly GUI run in R, and may serve as a useful tool to help biologists perform deeper functional analyses and visualization of their single-cell data.
Collapse
Affiliation(s)
- Zihan Zheng
- Biowavelet Ltd., Chongqing, China.,Chongqing International Institute for Immunology, Chongqing, China
| | - Xin Qiu
- R&D Department, TCRCure Ltd., Chongqing, China
| | - Haiyang Wu
- R&D Department, TCRCure Ltd., Chongqing, China
| | - Ling Chang
- Department of Immunology, Army Medical University, Chongqing, China
| | - Xiangyu Tang
- Biomedical Analysis Center, Army Medical University, Chongqing, China
| | - Liyun Zou
- Department of Immunology, Army Medical University, Chongqing, China
| | - Jingyi Li
- Chongqing International Institute for Immunology, Chongqing, China.,Department of Rheumatology and Immunology, First Affiliated Hospital of Army Medical University, Chongqing, China
| | - Yuzhang Wu
- Department of Immunology, Army Medical University, Chongqing, China
| | | | - Shan Jiang
- Institute for Advanced Study, Shenzhen University, Shenzhen, China
| | - Ying Wan
- Biomedical Analysis Center, Army Medical University, Chongqing, China
| | - Qingshan Ni
- Biomedical Analysis Center, Army Medical University, Chongqing, China
| |
Collapse
|
6
|
Grønning AGB, Oubounyt M, Kanev K, Lund J, Kacprowski T, Zehn D, Röttger R, Baumbach J. Enabling single-cell trajectory network enrichment. NATURE COMPUTATIONAL SCIENCE 2021; 1:153-163. [PMID: 38217228 DOI: 10.1038/s43588-021-00025-y] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 01/15/2021] [Indexed: 01/15/2024]
Abstract
Single-cell sequencing (scRNA-seq) technologies allow the investigation of cellular differentiation processes with unprecedented resolution. Although powerful software packages for scRNA-seq data analysis exist, systems biology-based tools for trajectory analysis are rare and typically difficult to handle. This hampers biological exploration and prevents researchers from gaining deeper insights into the molecular control of developmental processes. Here, to address this, we have developed Scellnetor; a network-constraint time-series clustering algorithm. It allows extraction of temporal differential gene expression network patterns (modules) that explain the difference in regulation of two developmental trajectories. Using well-characterized experimental model systems, we demonstrate the capacity of Scellnetor as a hypothesis generator to identify putative mechanisms driving haematopoiesis or mechanistically interpretable subnetworks driving dysfunctional CD8 T-cell development in chronic infections. Altogether, Scellnetor allows for single-cell trajectory network enrichment, which effectively lifts scRNA-seq data analysis to a systems biology level.
Collapse
Affiliation(s)
- Alexander G B Grønning
- Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark.
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark.
| | - Mhaned Oubounyt
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
- Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany
| | - Kristiyan Kanev
- Division of Animal Physiology and Immunology, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Jesper Lund
- Department of Biostatistics and Epidemiology, University of Southern Denmark, Odense, Denmark
| | - Tim Kacprowski
- Division Data Science in Biomedicine, Peter L. Reichertz Institute for Medical Informatics of TU Braunschweig and Hannover Medical School, Brunswick, Germany
| | - Dietmar Zehn
- Division of Animal Physiology and Immunology, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany
| | - Richard Röttger
- Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark
| | - Jan Baumbach
- Department of Mathematics and Computer Science, University of Southern Denmark, Odense, Denmark.
- Chair of Experimental Bioinformatics, TUM School of Life Sciences Weihenstephan, Technical University of Munich, Freising, Germany.
- Chair of Computational Systems Biology, University of Hamburg, Hamburg, Germany.
| |
Collapse
|
7
|
Klimm F. Functional change along cellular trajectories. NATURE COMPUTATIONAL SCIENCE 2021; 1:102-103. [PMID: 38217227 DOI: 10.1038/s43588-021-00026-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/15/2024]
Affiliation(s)
- Florian Klimm
- Department of Mathematics, Imperial College London, London, UK.
- Medical Research Council Mitochondrial Biology Unit, University of Cambridge, Cambridge, UK.
| |
Collapse
|
8
|
Li X, Liu L, Goodall GJ, Schreiber A, Xu T, Li J, Le TD. A novel single-cell based method for breast cancer prognosis. PLoS Comput Biol 2020; 16:e1008133. [PMID: 32833968 PMCID: PMC7470419 DOI: 10.1371/journal.pcbi.1008133] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 09/03/2020] [Accepted: 07/09/2020] [Indexed: 12/12/2022] Open
Abstract
Breast cancer prognosis is challenging due to the heterogeneity of the disease. Various computational methods using bulk RNA-seq data have been proposed for breast cancer prognosis. However, these methods suffer from limited performances or ambiguous biological relevance, as a result of the neglect of intra-tumor heterogeneity. Recently, single cell RNA-sequencing (scRNA-seq) has emerged for studying tumor heterogeneity at cellular levels. In this paper, we propose a novel method, scPrognosis, to improve breast cancer prognosis with scRNA-seq data. scPrognosis uses the scRNA-seq data of the biological process Epithelial-to-Mesenchymal Transition (EMT). It firstly infers the EMT pseudotime and a dynamic gene co-expression network, then uses an integrative model to select genes important in EMT based on their expression variation and differentiation in different stages of EMT, and their roles in the dynamic gene co-expression network. To validate and apply the selected signatures to breast cancer prognosis, we use them as the features to build a prediction model with bulk RNA-seq data. The experimental results show that scPrognosis outperforms other benchmark breast cancer prognosis methods that use bulk RNA-seq data. Moreover, the dynamic changes in the expression of the selected signature genes in EMT may provide clues to the link between EMT and clinical outcomes of breast cancer. scPrognosis will also be useful when applied to scRNA-seq datasets of different biological processes other than EMT.
Collapse
Affiliation(s)
- Xiaomei Li
- UniSA STEM, University of South Australia, Mawson Lakes, SA, Australia
| | - Lin Liu
- UniSA STEM, University of South Australia, Mawson Lakes, SA, Australia
| | - Gregory J. Goodall
- Centre for Cancer Biology, an alliance of SA Pathology and University of South Australia, Adelaide, SA, Australia
- School of Medicine, Discipline of Medicine, University of Adelaide, SA, Australia
| | - Andreas Schreiber
- Centre for Cancer Biology, an alliance of SA Pathology and University of South Australia, Adelaide, SA, Australia
| | - Taosheng Xu
- Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, 230031, China
| | - Jiuyong Li
- UniSA STEM, University of South Australia, Mawson Lakes, SA, Australia
| | - Thuc D. Le
- UniSA STEM, University of South Australia, Mawson Lakes, SA, Australia
- * E-mail:
| |
Collapse
|
9
|
Amezquita RA, Lun ATL, Becht E, Carey VJ, Carpp LN, Geistlinger L, Marini F, Rue-Albrecht K, Risso D, Soneson C, Waldron L, Pagès H, Smith ML, Huber W, Morgan M, Gottardo R, Hicks SC. Orchestrating single-cell analysis with Bioconductor. Nat Methods 2020; 17:137-145. [PMID: 31792435 PMCID: PMC7358058 DOI: 10.1038/s41592-019-0654-x] [Citation(s) in RCA: 498] [Impact Index Per Article: 99.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Revised: 09/13/2019] [Accepted: 10/14/2019] [Indexed: 12/24/2022]
Abstract
Recent technological advancements have enabled the profiling of a large number of genome-wide features in individual cells. However, single-cell data present unique challenges that require the development of specialized methods and software infrastructure to successfully derive biological insights. The Bioconductor project has rapidly grown to meet these demands, hosting community-developed open-source software distributed as R packages. Featuring state-of-the-art computational methods, standardized data infrastructure and interactive data visualization tools, we present an overview and online book (https://osca.bioconductor.org) of single-cell methods for prospective users.
Collapse
Affiliation(s)
| | - Aaron T L Lun
- Cancer Research UK Cambridge Institute, University of Cambridge, Cambridge, UK
- Bioinformatics and Computational Biology, Genentech Inc., San Francisco, CA, USA
| | - Etienne Becht
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Vince J Carey
- Channing Division of Network Medicine, Brigham And Women's Hospital, Boston, MA, USA
| | | | - Ludwig Geistlinger
- Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA
- Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA
| | - Federico Marini
- Center for Thrombosis and Hemostasis, Mainz, Germany
- Institute of Medical Biostatistics, Epidemiology and Informatics, Mainz, Germany
| | | | - Davide Risso
- Department of Statistical Sciences, University of Padua, Padua, Italy
- Division of Biostatistics and Epidemiology, Department of Healthcare Policy and Research, Weill Cornell Medicine, New York, NY, USA
| | - Charlotte Soneson
- Friedrich Miescher Institute for Biomedical Research, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Levi Waldron
- Graduate School of Public Health and Health Policy, City University of New York, New York, NY, USA
- Institute for Implementation Science in Population Health, City University of New York, New York, NY, USA
| | - Hervé Pagès
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Mike L Smith
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Wolfgang Huber
- European Molecular Biology Laboratory, Genome Biology Unit, Heidelberg, Germany
| | - Martin Morgan
- Biostatistics and Bioinformatics, Roswell Park Comprehensive Cancer Center, Buffalo, NY, USA
| | | | - Stephanie C Hicks
- Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA.
| |
Collapse
|
10
|
Miragaia RJ, Gomes T, Chomka A, Jardine L, Riedel A, Hegazy AN, Whibley N, Tucci A, Chen X, Lindeman I, Emerton G, Krausgruber T, Shields J, Haniffa M, Powrie F, Teichmann SA. Single-Cell Transcriptomics of Regulatory T Cells Reveals Trajectories of Tissue Adaptation. Immunity 2019; 50:493-504.e7. [PMID: 30737144 PMCID: PMC6382439 DOI: 10.1016/j.immuni.2019.01.001] [Citation(s) in RCA: 346] [Impact Index Per Article: 57.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2017] [Revised: 09/28/2018] [Accepted: 12/31/2018] [Indexed: 02/07/2023]
Abstract
Non-lymphoid tissues (NLTs) harbor a pool of adaptive immune cells with largely unexplored phenotype and development. We used single-cell RNA-seq to characterize 35,000 CD4+ regulatory (Treg) and memory (Tmem) T cells in mouse skin and colon, their respective draining lymph nodes (LNs) and spleen. In these tissues, we identified Treg cell subpopulations with distinct degrees of NLT phenotype. Subpopulation pseudotime ordering and gene kinetics were consistent in recruitment to skin and colon, yet the initial NLT-priming in LNs and the final stages of NLT functional adaptation reflected tissue-specific differences. Predicted kinetics were recapitulated using an in vivo melanoma-induction model, validating key regulators and receptors. Finally, we profiled human blood and NLT Treg and Tmem cells, and identified cross-mammalian conserved tissue signatures. In summary, we describe the relationship between Treg cell heterogeneity and recruitment to NLTs through the combined use of computational prediction and in vivo validation.
Collapse
Affiliation(s)
- Ricardo J Miragaia
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK; Centre of Biological Engineering, University of Minho, Braga, Portugal
| | - Tomás Gomes
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Agnieszka Chomka
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK; Translational Gastroenterology Unit, Experimental Medicine Division Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Laura Jardine
- Institute of Cellular Medicine, Newcastle University, Newcastle-Upon-Tyne, UK
| | - Angela Riedel
- MRC Cancer Unit, University of Cambridge, Cambridge, UK
| | - Ahmed N Hegazy
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK; Translational Gastroenterology Unit, Experimental Medicine Division Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Natasha Whibley
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Andrea Tucci
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK
| | - Xi Chen
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Ida Lindeman
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK; Centre for Immune Regulation and Department of Immunology, University of Oslo and Oslo University Hospital, 0372 Oslo, Norway
| | - Guy Emerton
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK
| | - Thomas Krausgruber
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK; Translational Gastroenterology Unit, Experimental Medicine Division Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | | | - Muzlifah Haniffa
- Institute of Cellular Medicine, Newcastle University, Newcastle-Upon-Tyne, UK
| | - Fiona Powrie
- Kennedy Institute of Rheumatology, Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, University of Oxford, Oxford, UK; Translational Gastroenterology Unit, Experimental Medicine Division Nuffield Department of Clinical Medicine, University of Oxford, John Radcliffe Hospital, Oxford, UK
| | - Sarah A Teichmann
- Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, UK; Theory of Condensed Matter, Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge, UK; European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, UK.
| |
Collapse
|
11
|
Yau C, Campbell K. Bayesian statistical learning for big data biology. Biophys Rev 2019; 11:95-102. [PMID: 30729409 PMCID: PMC6381359 DOI: 10.1007/s12551-019-00499-1] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Accepted: 01/08/2019] [Indexed: 11/10/2022] Open
Abstract
Bayesian statistical learning provides a coherent probabilistic framework for modelling uncertainty in systems. This review describes the theoretical foundations underlying Bayesian statistics and outlines the computational frameworks for implementing Bayesian inference in practice. We then describe the use of Bayesian learning in single-cell biology for the analysis of high-dimensional, large data sets.
Collapse
Affiliation(s)
- Christopher Yau
- Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, UK.
- The Alan Turing Institute, London, UK.
| | - Kieran Campbell
- Department of Statistics, University of British Columbia, Vancouver, Canada
- Department of Molecular Oncology, BC Cancer Agency, Vancouver, Canada
| |
Collapse
|
12
|
Lang C, Campbell KR, Ryan BJ, Carling P, Attar M, Vowles J, Perestenko OV, Bowden R, Baig F, Kasten M, Hu MT, Cowley SA, Webber C, Wade-Martins R. Single-Cell Sequencing of iPSC-Dopamine Neurons Reconstructs Disease Progression and Identifies HDAC4 as a Regulator of Parkinson Cell Phenotypes. Cell Stem Cell 2019; 24:93-106.e6. [PMID: 30503143 PMCID: PMC6327112 DOI: 10.1016/j.stem.2018.10.023] [Citation(s) in RCA: 106] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2018] [Revised: 07/13/2018] [Accepted: 10/23/2018] [Indexed: 11/29/2022]
Abstract
Induced pluripotent stem cell (iPSC)-derived dopamine neurons provide an opportunity to model Parkinson's disease (PD), but neuronal cultures are confounded by asynchronous and heterogeneous appearance of disease phenotypes in vitro. Using high-resolution, single-cell transcriptomic analyses of iPSC-derived dopamine neurons carrying the GBA-N370S PD risk variant, we identified a progressive axis of gene expression variation leading to endoplasmic reticulum stress. Pseudotime analysis of genes differentially expressed (DE) along this axis identified the transcriptional repressor histone deacetylase 4 (HDAC4) as an upstream regulator of disease progression. HDAC4 was mislocalized to the nucleus in PD iPSC-derived dopamine neurons and repressed genes early in the disease axis, leading to late deficits in protein homeostasis. Treatment of iPSC-derived dopamine neurons with HDAC4-modulating compounds upregulated genes early in the DE axis and corrected PD-related cellular phenotypes. Our study demonstrates how single-cell transcriptomics can exploit cellular heterogeneity to reveal disease mechanisms and identify therapeutic targets.
Collapse
Affiliation(s)
- Charmaine Lang
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK
| | - Kieran R Campbell
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK; The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Brent J Ryan
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK
| | - Phillippa Carling
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK
| | - Moustafa Attar
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Jane Vowles
- Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK
| | - Olga V Perestenko
- Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK
| | - Rory Bowden
- The Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Fahd Baig
- Oxford Parkinson's Disease Centre, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Meike Kasten
- Department of Psychiatry and Psychotherapy and Institute of Neurogenetics, University of Lübeck, Lübeck, Germany
| | - Michele T Hu
- Oxford Parkinson's Disease Centre, Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, UK
| | - Sally A Cowley
- Sir William Dunn School of Pathology, University of Oxford, South Parks Road, Oxford OX1 3RE, UK
| | - Caleb Webber
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK.
| | - Richard Wade-Martins
- Oxford Parkinson's Disease Centre, Department of Physiology, Anatomy and Genetics, University of Oxford, South Parks Road, Oxford, UK.
| |
Collapse
|
13
|
Campbell KR, Yau C. A descriptive marker gene approach to single-cell pseudotime inference. Bioinformatics 2019; 35:28-35. [PMID: 29939207 PMCID: PMC6298060 DOI: 10.1093/bioinformatics/bty498] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2017] [Revised: 02/05/2018] [Accepted: 06/20/2018] [Indexed: 12/25/2022] Open
Abstract
Motivation Pseudotime estimation from single-cell gene expression data allows the recovery of temporal information from otherwise static profiles of individual cells. Conventional pseudotime inference methods emphasize an unsupervised transcriptome-wide approach and use retrospective analysis to evaluate the behaviour of individual genes. However, the resulting trajectories can only be understood in terms of abstract geometric structures and not in terms of interpretable models of gene behaviour. Results Here we introduce an orthogonal Bayesian approach termed 'Ouija' that learns pseudotimes from a small set of marker genes that might ordinarily be used to retrospectively confirm the accuracy of unsupervised pseudotime algorithms. Crucially, we model these genes in terms of switch-like or transient behaviour along the trajectory, allowing us to understand why the pseudotimes have been inferred and learn informative parameters about the behaviour of each gene. Since each gene is associated with a switch or peak time the genes are effectively ordered along with the cells, allowing each part of the trajectory to be understood in terms of the behaviour of certain genes. We demonstrate that this small panel of marker genes can recover pseudotimes that are consistent with those obtained using the entire transcriptome. Furthermore, we show that our method can detect differences in the regulation timings between two genes and identify 'metastable' states-discrete cell types along the continuous trajectories-that recapitulate known cell types. Availability and implementation An open source implementation is available as an R package at http://www.github.com/kieranrcampbell/ouija and as a Python/TensorFlow package at http://www.github.com/kieranrcampbell/ouijaflow. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kieran R Campbell
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, UK
- Wellcome Trust Centre for Human Genetics University of Oxford, Oxford, UK
| | - Christopher Yau
- Wellcome Trust Centre for Human Genetics University of Oxford, Oxford, UK
- Department of Statistics, University of Oxford, Oxford, UK
- Institute of Cancer and Genomic Sciences, Centre for Computational Biology, University of Birmingham, Birmingham, UK
| |
Collapse
|
14
|
Hon CC, Shin JW, Carninci P, Stubbington MJT. The Human Cell Atlas: Technical approaches and challenges. Brief Funct Genomics 2018; 17:283-294. [PMID: 29092000 PMCID: PMC6063304 DOI: 10.1093/bfgp/elx029] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
The Human Cell Atlas is a large, international consortium that aims to identify and describe every cell type in the human body. The comprehensive cellular maps that arise from this ambitious effort have the potential to transform many aspects of fundamental biology and clinical practice. Here, we discuss the technical approaches that could be used today to generate such a resource and also the technical challenges that will be encountered.
Collapse
Affiliation(s)
- Chung-Chau Hon
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa, Japan
| | - Jay W Shin
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa, Japan
| | - Piero Carninci
- RIKEN Center for Life Science Technologies, Division of Genomic Technologies, Yokohama, Kanagawa, Japan
| | | |
Collapse
|
15
|
Rostom R, Svensson V, Teichmann SA, Kar G. Computational approaches for interpreting scRNA-seq data. FEBS Lett 2017; 591:2213-2225. [PMID: 28524227 PMCID: PMC5575496 DOI: 10.1002/1873-3468.12684] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2017] [Revised: 04/30/2017] [Accepted: 05/16/2017] [Indexed: 11/18/2022]
Abstract
The recent developments in high‐throughput single‐cell RNA sequencing technology (scRNA‐seq) have enabled the generation of vast amounts of transcriptomic data at cellular resolution. With these advances come new modes of data analysis, building on high‐dimensional data mining techniques. Here, we consider biological questions for which scRNA‐seq data is used, both at a cell and gene level, and describe tools available for these types of analyses. This is an exciting and rapidly evolving field, where clustering, pseudotime inference, branching inference and gene‐level analyses are particularly informative areas of computational analysis.
Collapse
Affiliation(s)
| | | | | | - Gozde Kar
- The European Bioinformatics Institute (EMBL-EBI), Cambridge, UK
| |
Collapse
|
16
|
Campbell KR, Yau C. Probabilistic modeling of bifurcations in single-cell gene expression data using a Bayesian mixture of factor analyzers. Wellcome Open Res 2017; 2:19. [PMID: 28503665 PMCID: PMC5428745 DOI: 10.12688/wellcomeopenres.11087.1] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Modeling bifurcations in single-cell transcriptomics data has become an increasingly popular field of research. Several methods have been proposed to infer bifurcation structure from such data, but all rely on heuristic non-probabilistic inference. Here we propose the first generative, fully probabilistic model for such inference based on a Bayesian hierarchical mixture of factor analyzers. Our model exhibits competitive performance on large datasets despite implementing full Markov-Chain Monte Carlo sampling, and its unique hierarchical prior structure enables automatic determination of genes driving the bifurcation process. We additionally propose an Empirical-Bayes like extension that deals with the high levels of zero-inflation in single-cell RNA-seq data and quantify when such models are useful. We apply or model to both real and simulated single-cell gene expression data and compare the results to existing pseudotime methods. Finally, we discuss both the merits and weaknesses of such a unified, probabilistic approach in the context practical bioinformatics analyses.
Collapse
Affiliation(s)
- Kieran R Campbell
- Department of Physiology, Anatomy and Genetics, University of Oxford, Oxford, OX1 3QX, UK.,Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK
| | - Christopher Yau
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, OX3 7BN, UK.,Centre for Computational Biology, Institute of Cancer and Genomic Sciences, University of Birmingham, Birmingham, B15 2TT, UK
| |
Collapse
|