1
|
Herbst E, Mandel-Gutfreund Y, Yakhini Z, Biran H. Inferring single-cell and spatial microRNA activity from transcriptomics data. Commun Biol 2025; 8:87. [PMID: 39827321 PMCID: PMC11743151 DOI: 10.1038/s42003-025-07454-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 01/02/2025] [Indexed: 01/22/2025] Open
Abstract
The activity of miRNA varies across different cell populations and systems, as part of the mechanisms that distinguish cell types and roles in living organisms and in human health and disease. Typically, miRNA regulation drives changes in the composition and levels of protein-coding RNA and of lncRNA, with targets being down-regulated when miRNAs are active. The term "miRNA activity" is used to refer to this transcriptional effect of miRNAs. This study introduces miTEA-HiRes, a method designed to facilitate the evaluation of miRNA activity at high resolution. The method applies to single-cell transcriptomics, type-specific single-cell populations, and spatial transcriptomics data. By comparing different conditions, differential miRNA activity is inferred. For instance, miTEA-HiRes analysis of peripheral blood mononuclear cells comparing Multiple Sclerosis patients to control groups revealed differential activity of miR-20a-5p and others, consistent with the literature on miRNA underexpression in Multiple Sclerosis. We also show miR-519a-3p differential activity in specific cell populations.
Collapse
Affiliation(s)
- Efrat Herbst
- Arazi School of Computer Science, Reichman University, Herzliya, Israel.
| | - Yael Mandel-Gutfreund
- Computer Science Department, Technion - Israel Institute of Technology, Haifa, Israel
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa, Israel
| | - Zohar Yakhini
- Arazi School of Computer Science, Reichman University, Herzliya, Israel
- Computer Science Department, Technion - Israel Institute of Technology, Haifa, Israel
| | - Hadas Biran
- Computer Science Department, Technion - Israel Institute of Technology, Haifa, Israel
| |
Collapse
|
2
|
Sengupta P, Dutta S, Liew F, Samrot A, Dasgupta S, Rajput MA, Slama P, Kolesarova A, Roychoudhury S. Reproductomics: Exploring the Applications and Advancements of Computational Tools. Physiol Res 2024; 73:687-702. [PMID: 39530905 PMCID: PMC11629954 DOI: 10.33549/physiolres.935389] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Accepted: 06/25/2024] [Indexed: 12/13/2024] Open
Abstract
Over recent decades, advancements in omics technologies, such as proteomics, genomics, epigenomics, metabolomics, transcriptomics, and microbiomics, have significantly enhanced our understanding of the molecular mechanisms underlying various physiological and pathological processes. Nonetheless, the analysis and interpretation of vast omics data concerning reproductive diseases are complicated by the cyclic regulation of hormones and multiple other factors, which, in conjunction with a genetic makeup of an individual, lead to diverse biological responses. Reproductomics investigates the interplay between a hormonal regulation of an individual, environmental factors, genetic predisposition (DNA composition and epigenome), health effects, and resulting biological outcomes. It is a rapidly emerging field that utilizes computational tools to analyze and interpret reproductive data, with the aim of improving reproductive health outcomes. It is time to explore the applications of reproductomics in understanding the molecular mechanisms underlying infertility, identification of potential biomarkers for diagnosis and treatment, and in improving assisted reproductive technologies (ARTs). Reproductomics tools include machine learning algorithms for predicting fertility outcomes, gene editing technologies for correcting genetic abnormalities, and single cell sequencing techniques for analyzing gene expression patterns at the individual cell level. However, there are several challenges, limitations and ethical issues involved with the use of reproductomics, such as the applications of gene editing technologies and their potential impact on future generations are discussed. The review comprehensively covers the applications and advancements of reproductomics, highlighting its potential to improve reproductive health outcomes and deepen our understanding of reproductive molecular mechanisms.
Collapse
Affiliation(s)
- P Sengupta
- Department of Biomedical Sciences, College of Medicine, Gulf Medical University, Ajman, UAE; Department of Life Science and Bioinformatics, Assam University, Silchar, India.
| | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Sweeney MD, Torre-Healy LA, Ma VL, Hall MA, Chrastecka L, Yurovsky A, Moffitt RA. FaStaNMF: A Fast and Stable Non-Negative Matrix Factorization for Gene Expression. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1633-1644. [PMID: 37467096 DOI: 10.1109/tcbb.2023.3296979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/21/2023]
Abstract
Gene expression analysis of samples with mixed cell types only provides limited insight to the characteristics of specific tissues. In silico deconvolution can be applied to extract cell type specific expression, thus avoiding prohibitively expensive techniques such as cell sorting or single-cell sequencing. Non-negative matrix factorization (NMF) is a deconvolution method shown to be useful for gene expression data, in part due to its constraint of non-negativity. Unlike other methods, NMF provides the capability to deconvolve without prior knowledge of the components of the model. However, NMF is not guaranteed to provide a globally unique solution. In this work, we present FaStaNMF, a method that balances achieving global stability of the NMF results, which is essential for inter-experiment and inter-lab reproducibility, with accuracy and speed. Results: FaStaNMF was applied to four datasets with known ground truth, created based on publicly available data or by using our simulation infrastructure, RNAGinesis. We assessed FaStaNMF on three criteria - speed, accuracy, and stability, and it favorably compared to the standard approach of achieving reproduceable results with NMF. We expect that FaStaNMF can be applied successfully to a wide array of biological data, such as different tumor/immune and other disease microenvironments.
Collapse
|
4
|
Zhang L, Cavallini M, Wang J, Xin R, Zhang Q, Feng G, Sanes JR, Peng YR. Evolutionary and developmental specialization of foveal cell types in the marmoset. Proc Natl Acad Sci U S A 2024; 121:e2313820121. [PMID: 38598343 PMCID: PMC11032471 DOI: 10.1073/pnas.2313820121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2023] [Accepted: 03/13/2024] [Indexed: 04/12/2024] Open
Abstract
In primates, high-acuity vision is mediated by the fovea, a small specialized central region of the retina. The fovea, unique to the anthropoid lineage among mammals, undergoes notable neuronal morphological changes during postnatal maturation. However, the extent of cellular similarity across anthropoid foveas and the molecular underpinnings of foveal maturation remain unclear. Here, we used high-throughput single-cell RNA sequencing to profile retinal cells of the common marmoset (Callithrix jacchus), an early divergent in anthropoid evolution from humans, apes, and macaques. We generated atlases of the marmoset fovea and peripheral retina for both neonates and adults. Our comparative analysis revealed that marmosets share almost all their foveal types with both humans and macaques, highlighting a conserved cellular structure among primate foveas. Furthermore, by tracing the developmental trajectory of cell types in the foveal and peripheral retina, we found distinct maturation paths for each. In-depth analysis of gene expression differences demonstrated that cone photoreceptors and Müller glia (MG), among others, show the greatest molecular divergence between these two regions. Utilizing single-cell ATAC-seq and gene-regulatory network inference, we uncovered distinct transcriptional regulations differentiating foveal cones from their peripheral counterparts. Further analysis of predicted ligand-receptor interactions suggested a potential role for MG in supporting the maturation of foveal cones. Together, these results provide valuable insights into foveal development, structure, and evolution.
Collapse
Affiliation(s)
- Lin Zhang
- Department of Ophthalmology and Stein Eye Institute, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, CA90095
| | - Martina Cavallini
- Department of Ophthalmology and Stein Eye Institute, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, CA90095
| | - Junqiang Wang
- Department of Ophthalmology and Stein Eye Institute, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, CA90095
| | - Ruiqi Xin
- Department of Ophthalmology and Stein Eye Institute, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, CA90095
| | - Qiangge Zhang
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA02139
| | - Guoping Feng
- McGovern Institute for Brain Research, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA02139
| | - Joshua R. Sanes
- Center for Brain Science and Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA02138
| | - Yi-Rong Peng
- Department of Ophthalmology and Stein Eye Institute, David Geffen School of Medicine at University of California Los Angeles, Los Angeles, CA90095
| |
Collapse
|
5
|
Vaknin I, Willinger O, Mandl J, Heuberger H, Ben-Ami D, Zeng Y, Goldberg S, Orenstein Y, Amit R. A universal system for boosting gene expression in eukaryotic cell-lines. Nat Commun 2024; 15:2394. [PMID: 38493141 PMCID: PMC10944472 DOI: 10.1038/s41467-024-46573-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2023] [Accepted: 03/04/2024] [Indexed: 03/18/2024] Open
Abstract
We demonstrate a transcriptional regulatory design algorithm that can boost expression in yeast and mammalian cell lines. The system consists of a simplified transcriptional architecture composed of a minimal core promoter and a synthetic upstream regulatory region (sURS) composed of up to three motifs selected from a list of 41 motifs conserved in the eukaryotic lineage. The sURS system was first characterized using an oligo-library containing 189,990 variants. We validate the resultant expression model using a set of 43 unseen sURS designs. The validation sURS experiments indicate that a generic set of grammar rules for boosting and attenuation may exist in yeast cells. Finally, we demonstrate that this generic set of grammar rules functions similarly in mammalian CHO-K1 and HeLa cells. Consequently, our work provides a design algorithm for boosting the expression of promoters used for expressing industrially relevant proteins in yeast and mammalian cell lines.
Collapse
Affiliation(s)
- Inbal Vaknin
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Or Willinger
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Jonathan Mandl
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
| | - Hadar Heuberger
- School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Dan Ben-Ami
- School of Electrical and Computer Engineering, Ben-Gurion University of the Negev, Beer Sheva, Israel
| | - Yi Zeng
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Sarah Goldberg
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel
| | - Yaron Orenstein
- Department of Computer Science, Bar-Ilan University, Ramat Gan, Israel
- The Mina and Everard Goodman Faculty of Life Sciences, Bar-Ilan University, Ramat Gan, Israel
| | - Roee Amit
- Department of Biotechnology and Food Engineering, Technion, Haifa, Israel.
- The Russell Berrie Nanotechnology Institute, Technion, Haifa, Israel.
| |
Collapse
|
6
|
Najm M, Cornet M, Albergante L, Zinovyev A, Sermet-Gaudelus I, Stoven V, Calzone L, Martignetti L. Representation and quantification of module activity from omics data with rROMA. NPJ Syst Biol Appl 2024; 10:8. [PMID: 38242871 PMCID: PMC10799004 DOI: 10.1038/s41540-024-00331-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2023] [Accepted: 01/03/2024] [Indexed: 01/21/2024] Open
Abstract
The efficiency of analyzing high-throughput data in systems biology has been demonstrated in numerous studies, where molecular data, such as transcriptomics and proteomics, offers great opportunities for understanding the complexity of biological processes. One important aspect of data analysis in systems biology is the shift from a reductionist approach that focuses on individual components to a more integrative perspective that considers the system as a whole, where the emphasis shifted from differential expression of individual genes to determining the activity of gene sets. Here, we present the rROMA software package for fast and accurate computation of the activity of gene sets with coordinated expression. The rROMA package incorporates significant improvements in the calculation algorithm, along with the implementation of several functions for statistical analysis and visualizing results. These additions greatly expand the package's capabilities and offer valuable tools for data analysis and interpretation. It is an open-source package available on github at: www.github.com/sysbio-curie/rROMA . Based on publicly available transcriptomic datasets, we applied rROMA to cystic fibrosis, highlighting biological mechanisms potentially involved in the establishment and progression of the disease and the associated genes. Results indicate that rROMA can detect disease-related active signaling pathways using transcriptomic and proteomic data. The results notably identified a significant mechanism relevant to cystic fibrosis, raised awareness of a possible bias related to cell culture, and uncovered an intriguing gene that warrants further investigation.
Collapse
Affiliation(s)
- Matthieu Najm
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Matthieu Cornet
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Luca Albergante
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Andrei Zinovyev
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Isabelle Sermet-Gaudelus
- Faculté de Médecine, Université de Paris, Paris, France
- Institut Necker Enfants Malades, INSERM U1151, Paris, France
- AP-HP. Centre - Université Paris Cité; Hôpital Necker Enfants Malades, Centre de Référence Maladie Rare - Mucoviscidose, Paris, France
| | - Véronique Stoven
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Laurence Calzone
- INSERM U900, 75428, Paris, France
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France
- Institut Curie, PSL Research University, 75248, Paris, France
| | - Loredana Martignetti
- INSERM U900, 75428, Paris, France.
- Center for Computational Biology, Mines ParisTech, PSL Research University, 75006, Paris, France.
- Institut Curie, PSL Research University, 75248, Paris, France.
| |
Collapse
|
7
|
Zhang L, Cavallini M, Wang J, Xin R, Zhang Q, Feng G, Sanes JR, Peng YR. Evolutionary and Developmental Specialization of Foveal Cell Types in the Marmoset. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.10.570996. [PMID: 38106142 PMCID: PMC10723441 DOI: 10.1101/2023.12.10.570996] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
In primates, high-acuity vision is mediated by the fovea, a small specialized central region of the retina. The fovea, unique to the anthropoid lineage among mammals, undergoes notable neuronal morphological changes during postnatal maturation. However, the extent of cellular similarity across anthropoid foveas and the molecular underpinnings of foveal maturation remain unclear. Here, we used high throughput single cell RNA sequencing to profile retinal cells of the common marmoset ( Callithrix jacchus ), an early divergent in anthropoid evolution from humans, apes, and macaques. We generated atlases of the marmoset fovea and peripheral retina for both neonates and adults. Our comparative analysis revealed that marmosets share almost all its foveal types with both humans and macaques, highlighting a conserved cellular structure among primate foveas. Furthermore, by tracing the developmental trajectory of cell types in the foveal and peripheral retina, we found distinct maturation paths for each. In-depth analysis of gene expression differences demonstrated that cone photoreceptors and Müller glia, among others, show the greatest molecular divergence between these two regions. Utilizing single-cell ATAC-seq and gene-regulatory network inference, we uncovered distinct transcriptional regulations differentiating foveal cones from their peripheral counterparts. Further analysis of predicted ligand-receptor interactions suggested a potential role for Müller glia in supporting the maturation of foveal cones. Together, these results provide valuable insights into foveal development, structure, and evolution. Significance statement The sharpness of our eyesight hinges on a tiny retinal region known as the fovea. The fovea is pivotal for primate vision and is susceptible to diseases like age-related macular degeneration. We studied the fovea in the marmoset-a primate with ancient evolutionary ties. Our data illustrated the cellular and molecular composition of its fovea across different developmental ages. Our findings highlighted a profound cellular consistency among marmosets, humans, and macaques, emphasizing the value of marmosets in visual research and the study of visual diseases.
Collapse
|
8
|
Li M, Yao T, Lin W, Hinckley WE, Galli M, Muchero W, Gallavotti A, Chen JG, Huang SSC. Double DAP-seq uncovered synergistic DNA binding of interacting bZIP transcription factors. Nat Commun 2023; 14:2600. [PMID: 37147307 PMCID: PMC10163045 DOI: 10.1038/s41467-023-38096-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2022] [Accepted: 04/15/2023] [Indexed: 05/07/2023] Open
Abstract
Many eukaryotic transcription factors (TF) form homodimer or heterodimer complexes to regulate gene expression. Dimerization of BASIC LEUCINE ZIPPER (bZIP) TFs are critical for their functions, but the molecular mechanism underlying the DNA binding and functional specificity of homo- versus heterodimers remains elusive. To address this gap, we present the double DNA Affinity Purification-sequencing (dDAP-seq) technique that maps heterodimer binding sites on endogenous genomic DNA. Using dDAP-seq we profile twenty pairs of C/S1 bZIP heterodimers and S1 homodimers in Arabidopsis and show that heterodimerization significantly expands the DNA binding preferences of these TFs. Analysis of dDAP-seq binding sites reveals the function of bZIP9 in abscisic acid response and the role of bZIP53 heterodimer-specific binding in seed maturation. The C/S1 heterodimers show distinct preferences for the ACGT elements recognized by plant bZIPs and motifs resembling the yeast GCN4 cis-elements. This study demonstrates the potential of dDAP-seq in deciphering the DNA binding specificities of interacting TFs that are key for combinatorial gene regulation.
Collapse
Affiliation(s)
- Miaomiao Li
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA
| | - Tao Yao
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Wanru Lin
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA
| | - Will E Hinckley
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA
| | - Mary Galli
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
| | - Wellington Muchero
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Andrea Gallavotti
- Waksman Institute of Microbiology, Rutgers University, Piscataway, NJ, 08854-8020, USA
| | - Jin-Gui Chen
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Shao-Shan Carol Huang
- Center for Genomics and Systems Biology, Department of Biology, New York University, New York, NY, 10003, USA.
| |
Collapse
|
9
|
Molenaar MR, Haaker MW, Vaandrager AB, Houweling M, Helms JB. Lipidomic profiling of rat hepatic stellate cells during activation reveals a two-stage process accompanied by increased levels of lysosomal lipids. J Biol Chem 2023; 299:103042. [PMID: 36803964 PMCID: PMC10033282 DOI: 10.1016/j.jbc.2023.103042] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Revised: 01/30/2023] [Accepted: 02/07/2023] [Indexed: 02/19/2023] Open
Abstract
Hepatic stellate cells (HSCs) are liver-resident cells best known for their role in vitamin A storage under physiological conditions. Upon liver injury, HSCs activate into myofibroblast-like cells, a key process in the onset of liver fibrosis. Lipids play an important role during HSC activation. Here, we provide a comprehensive characterization of the lipidomes of primary rat HSCs during 17 days of activation in vitro. For lipidomic data interpretation, we expanded our previously described Lipid Ontology (LION) and associated web application (LION/Web) with the LION-PCA heatmap module, which generates heatmaps of the most typical LION-signatures in lipidomic datasets. Furthermore, we used LION to perform pathway analysis to determine the significant metabolic conversions in lipid pathways. Together, we identify two distinct stages of HSC activation. In the first stage, we observe a decrease of saturated phosphatidylcholine, sphingomyelin, and phosphatidic acid and an increase in phosphatidylserine and polyunsaturated bis(monoacylglycero)phosphate (BMP), a lipid class typically localized at endosomes and lysosomes. In the second activation stage, BMPs, hexosylceramides, and ether-linked phosphatidylcholines are elevated, resembling a lysosomal lipid storage disease profile. The presence of isomeric structures of BMP in HSCs was confirmed ex vivo in MS-imaging datasets of steatosed liver sections. Finally, treatment with pharmaceuticals targeting the lysosomal integrity led to cell death in primary HSCs but not in HeLa cells. In summary, our combined data suggest that lysosomes play a critical role during a two-stage activation process of HSCs.
Collapse
Affiliation(s)
- Martijn R Molenaar
- Department of Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands
| | - Maya W Haaker
- Department of Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands
| | - A Bas Vaandrager
- Department of Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands
| | - Martin Houweling
- Department of Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands
| | - J Bernd Helms
- Department of Biomolecular Health Sciences, Faculty of Veterinary Medicine, Utrecht University, Utrecht, The Netherlands.
| |
Collapse
|
10
|
Battistella E, Vakalopoulou M, Sun R, Estienne T, Lerousseau M, Nikolaev S, Andres EA, Carre A, Niyoteka S, Robert C, Paragios N, Deutsch E. COMBING: Clustering in Oncology for Mathematical and Biological Identification of Novel Gene Signatures. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2022; 19:3317-3331. [PMID: 34714749 DOI: 10.1109/tcbb.2021.3123910] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Precision medicine is a paradigm shift in healthcare relying heavily on genomics data. However, the complexity of biological interactions, the large number of genes as well as the lack of comparisons on the analysis of data, remain a tremendous bottleneck regarding clinical adoption. In this paper, we introduce a novel, automatic and unsupervised framework to discover low-dimensional gene biomarkers. Our method is based on the LP-Stability algorithm, a high dimensional center-based unsupervised clustering algorithm. It offers modularity as concerns metric functions and scalability, while being able to automatically determine the best number of clusters. Our evaluation includes both mathematical and biological criteria to define a quantitative metric. The recovered signature is applied to a variety of biological tasks, including screening of biological pathways and functions, and characterization relevance on tumor types and subtypes. Quantitative comparisons among different distance metrics, commonly used clustering methods and a referential gene signature used in the literature, confirm state of the art performance of our approach. In particular, our signature, based on 27 genes, reports at least 30 times better mathematical significance (average Dunn's Index) and 25% better biological significance (average Enrichment in Protein-Protein Interaction) than those produced by other referential clustering methods. Finally, our signature reports promising results on distinguishing immune inflammatory and immune desert tumors, while reporting a high balanced accuracy of 92% on tumor types classification and averaged balanced accuracy of 68% on tumor subtypes classification, which represents, respectively 7% and 9% higher performance compared to the referential signature.
Collapse
|
11
|
Adipocyte lysoplasmalogenase TMEM86A regulates plasmalogen homeostasis and protein kinase A-dependent energy metabolism. Nat Commun 2022; 13:4084. [PMID: 35835749 PMCID: PMC9283435 DOI: 10.1038/s41467-022-31805-3] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2022] [Accepted: 07/01/2022] [Indexed: 02/06/2023] Open
Abstract
Dysregulation of adipose tissue plasmalogen metabolism is associated with obesity-related metabolic diseases. We report that feeding mice a high-fat diet reduces adipose tissue lysoplasmalogen levels and increases transmembrane protein 86 A (TMEM86A), a putative lysoplasmalogenase. Untargeted lipidomic analysis demonstrates that adipocyte-specific TMEM86A-knockout (AKO) increases lysoplasmalogen content in adipose tissue, including plasmenyl lysophosphatidylethanolamine 18:0 (LPE P-18:0). Surprisingly, TMEM86A AKO increases protein kinase A signalling pathways owing to inhibition of phosphodiesterase 3B and elevation of cyclic adenosine monophosphate. TMEM86A AKO upregulates mitochondrial oxidative metabolism, elevates energy expenditure, and protects mice from metabolic dysfunction induced by high-fat feeding. Importantly, the effects of TMEM86A AKO are largely reproduced in vitro and in vivo by LPE P-18:0 supplementation. LPE P-18:0 levels are significantly lower in adipose tissue of human patients with obesity, suggesting that TMEM86A inhibition or lysoplasmalogen supplementation might be therapeutic approaches for preventing or treating obesity-related metabolic diseases.
Collapse
|
12
|
Koh JH, Yoon SJ, Kim M, Cho S, Lim J, Park Y, Kim HS, Kwon SW, Kim WU. Lipidome profile predictive of disease evolution and activity in rheumatoid arthritis. Exp Mol Med 2022; 54:143-155. [PMID: 35169224 PMCID: PMC8894401 DOI: 10.1038/s12276-022-00725-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2021] [Revised: 10/26/2021] [Accepted: 11/04/2021] [Indexed: 12/14/2022] Open
Abstract
Lipid mediators are crucial for the pathogenesis of rheumatoid arthritis (RA); however, global analyses have not been undertaken to systematically define the lipidome underlying the dynamics of disease evolution, activation, and resolution. Here, we performed untargeted lipidomics analysis of synovial fluid and serum from RA patients at different disease activities and clinical phases (preclinical phase to active phase to sustained remission). We found that the lipidome profile in RA joint fluid was severely perturbed and that this correlated with the extent of inflammation and severity of synovitis on ultrasonography. The serum lipidome profile of active RA, albeit less prominent than the synovial lipidome, was also distinguishable from that of RA in the sustained remission phase and from that of noninflammatory osteoarthritis. Of note, the serum lipidome profile at the preclinical phase of RA closely mimicked that of active RA. Specifically, alterations in a set of lysophosphatidylcholine, phosphatidylcholine, ether-linked phosphatidylethanolamine, and sphingomyelin subclasses correlated with RA activity, reflecting treatment responses to anti-rheumatic drugs when monitored serially. Collectively, these results suggest that analysis of lipidome profiles is useful for identifying biomarker candidates that predict the evolution of preclinical to definitive RA and could facilitate the assessment of disease activity and treatment outcomes.
Collapse
Affiliation(s)
- Jung Hee Koh
- Division of Rheumatology, Department of Internal Medicine, the Catholic University of Korea, Seoul, 06591, Republic of Korea.,Center for Integrative Rheumatoid Transcriptomics and Dynamics, the Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Sang Jun Yoon
- College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea
| | - Mina Kim
- College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea
| | - Seonghun Cho
- Department of Statistics, Seoul National University, Seoul, 08826, Republic of Korea
| | - Johan Lim
- Department of Statistics, Seoul National University, Seoul, 08826, Republic of Korea
| | - Youngjae Park
- Division of Rheumatology, Department of Internal Medicine, the Catholic University of Korea, Seoul, 06591, Republic of Korea
| | - Hyun-Sook Kim
- Department of Internal Medicine, Soonchunhyang University College of Medicine, Seoul, 04401, Republic of Korea
| | - Sung Won Kwon
- College of Pharmacy, Seoul National University, Seoul, 08826, Republic of Korea.
| | - Wan-Uk Kim
- Division of Rheumatology, Department of Internal Medicine, the Catholic University of Korea, Seoul, 06591, Republic of Korea. .,Center for Integrative Rheumatoid Transcriptomics and Dynamics, the Catholic University of Korea, Seoul, 06591, Republic of Korea.
| |
Collapse
|
13
|
Histone Deacetylase Inhibition Regulates Lipid Homeostasis in a Mouse Model of Amyotrophic Lateral Sclerosis. Int J Mol Sci 2021; 22:ijms222011224. [PMID: 34681883 PMCID: PMC8541517 DOI: 10.3390/ijms222011224] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2021] [Revised: 10/08/2021] [Accepted: 10/15/2021] [Indexed: 12/19/2022] Open
Abstract
Amyotrophic lateral sclerosis (ALS) is an incurable and fatal neurodegenerative disorder of the motor system. While the etiology is still incompletely understood, defects in metabolism act as a major contributor to the disease progression. Recently, histone deacetylase (HDAC) inhibition using ACY-738 has been shown to restore metabolic alterations in the spinal cord of a FUS mouse model of ALS, which was accompanied by a beneficial effect on the motor phenotype and survival. In this study, we investigated the specific effects of HDAC inhibition on lipid metabolism using untargeted lipidomic analysis combined with transcriptomic analysis in the spinal cord of FUS mice. We discovered that symptomatic FUS mice recapitulate lipid alterations found in ALS patients and in the SOD1 mouse model. Glycerophospholipids, sphingolipids, and cholesterol esters were most affected. Strikingly, HDAC inhibition mitigated lipid homeostasis defects by selectively targeting glycerophospholipid metabolism and reducing cholesteryl esters accumulation. Therefore, our data suggest that HDAC inhibition is a potential new therapeutic strategy to modulate lipid metabolism defects in ALS and potentially other neurodegenerative diseases.
Collapse
|
14
|
Dvir S, Argoetti A, Lesnik C, Roytblat M, Shriki K, Amit M, Hashimshony T, Mandel-Gutfreund Y. Uncovering the RNA-binding protein landscape in the pluripotency network of human embryonic stem cells. Cell Rep 2021; 35:109198. [PMID: 34077720 DOI: 10.1016/j.celrep.2021.109198] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2020] [Revised: 03/11/2021] [Accepted: 05/11/2021] [Indexed: 12/18/2022] Open
Abstract
Embryonic stem cell (ESC) self-renewal and cell fate decisions are driven by a broad array of molecular signals. While transcriptional regulators have been extensively studied in human ESCs (hESCs), the extent to which RNA-binding proteins (RBPs) contribute to human pluripotency remains unclear. Here, we carry out a proteome-wide screen and identify 810 proteins that bind RNA in hESCs. We reveal that RBPs are preferentially expressed in hESCs and dynamically regulated during early stem cell differentiation. Notably, many RBPs are affected by knockdown of OCT4, a master regulator of pluripotency, several dozen of which are directly targeted by this factor. Using cross-linking and immunoprecipitation (CLIP-seq), we find that the pluripotency-associated STAT3 and OCT4 transcription factors interact with RNA in hESCs and confirm the binding of STAT3 to the conserved NORAD long-noncoding RNA. Our findings indicate that RBPs have a more widespread role in human pluripotency than previously appreciated.
Collapse
Affiliation(s)
- Shlomi Dvir
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Amir Argoetti
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Chen Lesnik
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | | | | | - Michal Amit
- Accellta LTD, Haifa 320003, Israel; Ephraim Katzir Department of Biotechnology Engineering, ORT Braude College, Karmiel 2161002, Israel
| | - Tamar Hashimshony
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel
| | - Yael Mandel-Gutfreund
- Faculty of Biology, Technion - Israel Institute of Technology, Haifa 320003, Israel; Computer Science Department, Technion - Israel Institute of Technology, Haifa 320003, Israel.
| |
Collapse
|
15
|
Bories P, Rikardsen AH, Leonards P, Fisk AT, Tartu S, Vogel EF, Bytingsvik J, Blévin P. A deep dive into fat: Investigating blubber lipidomic fingerprint of killer whales and humpback whales in northern Norway. Ecol Evol 2021; 11:6716-6729. [PMID: 34141252 PMCID: PMC8207449 DOI: 10.1002/ece3.7523] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2021] [Revised: 03/15/2021] [Accepted: 03/19/2021] [Indexed: 11/10/2022] Open
Abstract
In cetaceans, blubber is the primary and largest lipid body reservoir. Our current understanding about lipid stores and uses in cetaceans is still limited, and most studies only focused on a single narrow snapshot of the lipidome. We documented an extended lipidomic fingerprint in two cetacean species present in northern Norway during wintertime. We were able to detect 817 molecular lipid species in blubber of killer whales (Orcinus orca) and humpback whales (Megaptera novaeangliae). The profiles were largely dominated by triradylglycerols in both species and, to a lesser extent, by other constituents including glycerophosphocholines, phosphosphingolipids, glycerophosphoethanolamines, and diradylglycerols. Through a unique combination of traditional statistical approaches, together with a novel bioinformatic tool (LION/web), we showed contrasting fingerprint composition between species. The higher content of triradylglycerols in humpback whales is necessary to fuel their upcoming half a year fasting and energy-demanding migration between feeding and breeding grounds. In adipocytes, we assume that the intense feeding rate of humpback whales prior to migration translates into an important accumulation of triacylglycerol content in lipid droplets. Upstream, the endoplasmic reticulum is operating at full capacity to supply acute lipid storage, consistent with the reported enrichment of glycerophosphocholines in humpback whales, major components of the endoplasmic reticulum. There was also an enrichment of membrane components, which translates into higher sphingolipid content in the lipidome of killer whales, potentially as a structural adaptation for their higher hydrodynamic performance. Finally, the presence of both lipid-enriched and lipid-depleted individuals within the killer whale population in Norway suggests dietary specialization, consistent with significant differences in δ15N and δ13C isotopic ratios in skin between the two groups, with higher values and a wider niche for the lipid-enriched individuals. Results suggest the lipid-depleted killer whales were herring specialists, while the lipid-enriched individuals might feed on both herrings and seals.
Collapse
Affiliation(s)
| | - Audun H. Rikardsen
- Department of Arctic and Marine BiologyUiT ‐ The Arctic University of NorwayTromsøNorway
| | - Pim Leonards
- Department of Environment and HealthVrije UniversiteitAmsterdamThe Netherlands
| | - Aaron T. Fisk
- School of the EnvironmentUniversity of WindsorWindsorONCanada
| | - Sabrina Tartu
- Centre d'Etudes Biologiques de ChizéVilliers en BoisFrance
| | - Emma F. Vogel
- Department of Arctic and Marine BiologyUiT ‐ The Arctic University of NorwayTromsøNorway
| | | | | |
Collapse
|
16
|
Delaney C, Schnell A, Cammarata LV, Yao-Smith A, Regev A, Kuchroo VK, Singer M. Combinatorial prediction of marker panels from single-cell transcriptomic data. Mol Syst Biol 2020; 15:e9005. [PMID: 31657111 PMCID: PMC6811728 DOI: 10.15252/msb.20199005] [Citation(s) in RCA: 63] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Revised: 09/18/2019] [Accepted: 09/20/2019] [Indexed: 01/10/2023] Open
Abstract
Single‐cell transcriptomic studies are identifying novel cell populations with exciting functional roles in various in vivo contexts, but identification of succinct gene marker panels for such populations remains a challenge. In this work, we introduce COMET, a computational framework for the identification of candidate marker panels consisting of one or more genes for cell populations of interest identified with single‐cell RNA‐seq data. We show that COMET outperforms other methods for the identification of single‐gene panels and enables, for the first time, prediction of multi‐gene marker panels ranked by relevance. Staining by flow cytometry assay confirmed the accuracy of COMET's predictions in identifying marker panels for cellular subtypes, at both the single‐ and multi‐gene levels, validating COMET's applicability and accuracy in predicting favorable marker panels from transcriptomic input. COMET is a general non‐parametric statistical framework and can be used as‐is on various high‐throughput datasets in addition to single‐cell RNA‐sequencing data. COMET is available for use via a web interface (http://www.cometsc.com/) or a stand‐alone software package (https://github.com/MSingerLab/COMETSC).
Collapse
Affiliation(s)
- Conor Delaney
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Alexandra Schnell
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA
| | | | - Aaron Yao-Smith
- Department of Computer Science, Cornell University, Ithaca, NY, USA
| | - Aviv Regev
- Department of Biology and Koch Institute of Integrative Cancer Research, Howard Hughes Medical Institute, Massachusetts Institute of Technology, Cambridge, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Vijay K Kuchroo
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA
| | - Meromit Singer
- Department of Data Sciences, Dana-Farber Cancer Institute, Boston, MA, USA.,Broad Institute of MIT and Harvard, Cambridge, MA, USA.,Department of Immunology, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
17
|
Tsuyuzaki K, Sato H, Sato K, Nikaido I. Benchmarking principal component analysis for large-scale single-cell RNA-sequencing. Genome Biol 2020; 21:9. [PMID: 31955711 PMCID: PMC6970290 DOI: 10.1186/s13059-019-1900-3] [Citation(s) in RCA: 57] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2019] [Accepted: 11/26/2019] [Indexed: 02/02/2023] Open
Abstract
BACKGROUND Principal component analysis (PCA) is an essential method for analyzing single-cell RNA-seq (scRNA-seq) datasets, but for large-scale scRNA-seq datasets, computation time is long and consumes large amounts of memory. RESULTS In this work, we review the existing fast and memory-efficient PCA algorithms and implementations and evaluate their practical application to large-scale scRNA-seq datasets. Our benchmark shows that some PCA algorithms based on Krylov subspace and randomized singular value decomposition are fast, memory-efficient, and more accurate than the other algorithms. CONCLUSION We develop a guideline to select an appropriate PCA implementation based on the differences in the computational environment of users and developers.
Collapse
Affiliation(s)
- Koki Tsuyuzaki
- Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research, Wako, Saitama, 351-0198 Japan
- Japan Science and Technology Agency, PRESTO, 5-3, Yonbancho, Chiyoda-ku, Tokyo, 102-8666 Japan
| | - Hiroyuki Sato
- Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida-honmachi, Sakyo-ku, Kyoto, 606-8501 Japan
| | - Kenta Sato
- Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research, Wako, Saitama, 351-0198 Japan
- Department of Biotechnology, Graduate School of Agricultural and Life Sciences, The University of Tokyo, Bunkyo-ku, Tokyo, 113-8657 Japan
| | - Itoshi Nikaido
- Laboratory for Bioinformatics Research, RIKEN Center for Biosystems Dynamics Research, Wako, Saitama, 351-0198 Japan
- Bioinformatics Course, Master’s/Doctoral Program in Life Science Innovation (T-LSI), School of Integrative and Global Majors (SIGMA), University of Tsukuba, 1-1-1, Tennodai, Tsukuba, Ibaraki, 305-8577 Japan
| |
Collapse
|
18
|
Mi D, Li Z, Lim L, Li M, Moissidis M, Yang Y, Gao T, Hu TX, Pratt T, Price DJ, Sestan N, Marín O. Early emergence of cortical interneuron diversity in the mouse embryo. Science 2018; 360:81-85. [PMID: 29472441 PMCID: PMC6195193 DOI: 10.1126/science.aar6821] [Citation(s) in RCA: 152] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Accepted: 02/14/2018] [Indexed: 12/18/2022]
Abstract
GABAergic interneurons (GABA, γ-aminobutyric acid) regulate neural-circuit activity in the mammalian cerebral cortex. These cortical interneurons are structurally and functionally diverse. Here, we use single-cell transcriptomics to study the origins of this diversity in the mouse. We identify distinct types of progenitor cells and newborn neurons in the ganglionic eminences, the embryonic proliferative regions that give rise to cortical interneurons. These embryonic precursors show temporally and spatially restricted transcriptional patterns that lead to different classes of interneurons in the adult cerebral cortex. Our findings suggest that shortly after the interneurons become postmitotic, their diversity is already patent in their diverse transcriptional programs, which subsequently guide further differentiation in the developing cortex.
Collapse
Affiliation(s)
- Da Mi
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London SE1 1UL, UK
- Medical Research Council Centre for Neurodevelopmental Disorders, King's College London, London SE1 1UL, UK
| | - Zhen Li
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | - Lynette Lim
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London SE1 1UL, UK
- Medical Research Council Centre for Neurodevelopmental Disorders, King's College London, London SE1 1UL, UK
| | - Mingfeng Li
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | - Monika Moissidis
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London SE1 1UL, UK
- Medical Research Council Centre for Neurodevelopmental Disorders, King's College London, London SE1 1UL, UK
| | - Yifei Yang
- Biomedical Sciences, University of Edinburgh, Edinburgh EH8 9XD, UK
| | - Tianliuyun Gao
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA
| | - Tim Xiaoming Hu
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02446, USA
- Department of Physiology, Anatomy, and Genetics, University of Oxford, Oxford, UK
| | - Thomas Pratt
- Biomedical Sciences, University of Edinburgh, Edinburgh EH8 9XD, UK
| | - David J Price
- Biomedical Sciences, University of Edinburgh, Edinburgh EH8 9XD, UK
| | - Nenad Sestan
- Department of Neuroscience and Kavli Institute for Neuroscience, Yale School of Medicine, New Haven, CT 06510, USA.
| | - Oscar Marín
- Centre for Developmental Neurobiology, Institute of Psychiatry, Psychology, and Neuroscience, King's College London, London SE1 1UL, UK.
- Medical Research Council Centre for Neurodevelopmental Disorders, King's College London, London SE1 1UL, UK
| |
Collapse
|
19
|
Nepomuceno JA, Troncoso A, Nepomuceno-Chamorro IA, Aguilar-Ruiz JS. Pairwise gene GO-based measures for biclustering of high-dimensional expression data. BioData Min 2018; 11:4. [PMID: 29610579 PMCID: PMC5872503 DOI: 10.1186/s13040-018-0165-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Accepted: 03/01/2018] [Indexed: 11/15/2022] Open
Abstract
Background Biclustering algorithms search for groups of genes that share the same behavior under a subset of samples in gene expression data. Nowadays, the biological knowledge available in public repositories can be used to drive these algorithms to find biclusters composed of groups of genes functionally coherent. On the other hand, a distance among genes can be defined according to their information stored in Gene Ontology (GO). Gene pairwise GO semantic similarity measures report a value for each pair of genes which establishes their functional similarity. A scatter search-based algorithm that optimizes a merit function that integrates GO information is studied in this paper. This merit function uses a term that addresses the information through a GO measure. Results The effect of two possible different gene pairwise GO measures on the performance of the algorithm is analyzed. Firstly, three well known yeast datasets with approximately one thousand of genes are studied. Secondly, a group of human datasets related to clinical data of cancer is also explored by the algorithm. Most of these data are high-dimensional datasets composed of a huge number of genes. The resultant biclusters reveal groups of genes linked by a same functionality when the search procedure is driven by one of the proposed GO measures. Furthermore, a qualitative biological study of a group of biclusters show their relevance from a cancer disease perspective. Conclusions It can be concluded that the integration of biological information improves the performance of the biclustering process. The two different GO measures studied show an improvement in the results obtained for the yeast dataset. However, if datasets are composed of a huge number of genes, only one of them really improves the algorithm performance. This second case constitutes a clear option to explore interesting datasets from a clinical point of view.
Collapse
Affiliation(s)
- Juan A Nepomuceno
- 1Departamento de Lenguajes y Sistemas Informáticos, Universidad de Sevilla, Avd. Reina Mercedes s/n, Seville, 41012 Spain
| | - Alicia Troncoso
- 2Área de Informática, Universidad Pablo de Olavide, Ctra. Utrera km. 1, Seville, 41013 Spain
| | - Isabel A Nepomuceno-Chamorro
- 1Departamento de Lenguajes y Sistemas Informáticos, Universidad de Sevilla, Avd. Reina Mercedes s/n, Seville, 41012 Spain
| | - Jesús S Aguilar-Ruiz
- 2Área de Informática, Universidad Pablo de Olavide, Ctra. Utrera km. 1, Seville, 41013 Spain
| |
Collapse
|
20
|
Sasagawa Y, Danno H, Takada H, Ebisawa M, Tanaka K, Hayashi T, Kurisaki A, Nikaido I. Quartz-Seq2: a high-throughput single-cell RNA-sequencing method that effectively uses limited sequence reads. Genome Biol 2018; 19:29. [PMID: 29523163 PMCID: PMC5845169 DOI: 10.1186/s13059-018-1407-3] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2017] [Accepted: 02/14/2018] [Indexed: 12/14/2022] Open
Abstract
High-throughput single-cell RNA-seq methods assign limited unique molecular identifier (UMI) counts as gene expression values to single cells from shallow sequence reads and detect limited gene counts. We thus developed a high-throughput single-cell RNA-seq method, Quartz-Seq2, to overcome these issues. Our improvements in the reaction steps make it possible to effectively convert initial reads to UMI counts, at a rate of 30-50%, and detect more genes. To demonstrate the power of Quartz-Seq2, we analyzed approximately 10,000 transcriptomes from in vitro embryonic stem cells and an in vivo stromal vascular fraction with a limited number of reads.
Collapse
Affiliation(s)
- Yohei Sasagawa
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
| | - Hiroki Danno
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
| | - Hitomi Takada
- Laboratory of Stem Cell Technology, Graduate School of Biological Sciences, Nara Institute of Science and Technology, Takayama-cho 8916-5, Ikoma, Nara, Japan
| | - Masashi Ebisawa
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
| | - Kaori Tanaka
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
| | - Tetsutaro Hayashi
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
| | - Akira Kurisaki
- Laboratory of Stem Cell Technology, Graduate School of Biological Sciences, Nara Institute of Science and Technology, Takayama-cho 8916-5, Ikoma, Nara, Japan
| | - Itoshi Nikaido
- Bioinformatics Research Unit, Advanced Center for Computing and Communication, RIKEN, Hirosawa 2-1, Wako, Saitama, Japan
- Single-cell Omics Research Unit, RIKEN Center for Developmental Biology, 2-2-3 Minatojima-minamimachi, Chuo-ku, Kobe, Japan
| |
Collapse
|
21
|
D'Erchia AM, Gallo A, Manzari C, Raho S, Horner DS, Chiara M, Valletti A, Aiello I, Mastropasqua F, Ciaccia L, Locatelli F, Pisani F, Nicchia GP, Svelto M, Pesole G, Picardi E. Massive transcriptome sequencing of human spinal cord tissues provides new insights into motor neuron degeneration in ALS. Sci Rep 2017; 7:10046. [PMID: 28855684 PMCID: PMC5577269 DOI: 10.1038/s41598-017-10488-7] [Citation(s) in RCA: 84] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 08/09/2017] [Indexed: 12/13/2022] Open
Abstract
ALS is a devastating and debilitating human disease characterized by the progressive death of upper and lower motor neurons. Although much effort has been made to elucidate molecular determinants underlying the onset and progression of the disorder, the causes of ALS remain largely unknown. In the present work, we have deeply sequenced whole transcriptome from spinal cord ventral horns of post-mortem ALS human donors affected by the sporadic form of the disease (which comprises ~90% of the cases but which is less investigated than the inherited form of the disease). We observe 1160 deregulated genes including 18 miRNAs and show that down regulated genes are mainly of neuronal derivation while up regulated genes have glial origin and tend to be involved in neuroinflammation or cell death. Remarkably, we find strong deregulation of SNAP25 and STX1B at both mRNA and protein levels suggesting impaired synaptic function through SNAP25 reduction as a possible cause of calcium elevation and glutamate excitotoxicity. We also note aberrant alternative splicing but not disrupted RNA editing.
Collapse
Affiliation(s)
- Anna Maria D'Erchia
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy.,Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy
| | - Angela Gallo
- Department of Pediatric Oncohaematology, Bambino Gesù Children's Hospital IRCCS, Piazza S. Onofrio 4, 00165, Rome, Italy
| | - Caterina Manzari
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy
| | - Susanna Raho
- Department of Pediatric Oncohaematology, Bambino Gesù Children's Hospital IRCCS, Piazza S. Onofrio 4, 00165, Rome, Italy
| | - David S Horner
- Department of Biosciences, University of Milan, Via Celoria 26, 20133, Milan, Italy
| | - Matteo Chiara
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy
| | - Alessio Valletti
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy
| | - Italia Aiello
- Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy
| | - Francesca Mastropasqua
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy
| | - Loredana Ciaccia
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy
| | - Franco Locatelli
- Department of Pediatric Oncohaematology, Bambino Gesù Children's Hospital IRCCS, Piazza S. Onofrio 4, 00165, Rome, Italy
| | - Francesco Pisani
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy
| | - Grazia Paola Nicchia
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy
| | - Maria Svelto
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy.,National Institute of Biostructures and Biosystems (INBB), Viale Medaglie D'Oro 305, 00136, Rome, Italy.,Center of Excellence in Comparative Genomics, University of Bari, Piazza Umberto I, 70121, Bari, Italy
| | - Graziano Pesole
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy. .,Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy. .,National Institute of Biostructures and Biosystems (INBB), Viale Medaglie D'Oro 305, 00136, Rome, Italy. .,Center of Excellence in Comparative Genomics, University of Bari, Piazza Umberto I, 70121, Bari, Italy.
| | - Ernesto Picardi
- Department of Biosciences, Biotechnology and Biopharmaceutics, University of Bari, Via Orabona 4, 70126, Bari, Italy. .,Institute of Biomembranes, Bioenergetics and Molecular Biotechnologies (IBIOM), National Research Council, Via Amendola 165/A, 70126, Bari, Italy. .,National Institute of Biostructures and Biosystems (INBB), Viale Medaglie D'Oro 305, 00136, Rome, Italy.
| |
Collapse
|
22
|
Wagner A, Regev A, Yosef N. Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol 2017; 34:1145-1160. [PMID: 27824854 DOI: 10.1038/nbt.3711] [Citation(s) in RCA: 395] [Impact Index Per Article: 49.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Abstract
Single-cell genomics has now made it possible to create a comprehensive atlas of human cells. At the same time, it has reopened definitions of a cell's identity and of the ways in which identity is regulated by the cell's molecular circuitry. Emerging computational analysis methods, especially in single-cell RNA sequencing (scRNA-seq), have already begun to reveal, in a data-driven way, the diverse simultaneous facets of a cell's identity, from discrete cell types to continuous dynamic transitions and spatial locations. These developments will eventually allow a cell to be represented as a superposition of 'basis vectors', each determining a different (but possibly dependent) aspect of cellular organization and function. However, computational methods must also overcome considerable challenges-from handling technical noise and data scale to forming new abstractions of biology. As the scale of single-cell experiments continues to increase, new computational approaches will be essential for constructing and characterizing a reference map of cell identities.
Collapse
Affiliation(s)
- Allon Wagner
- Department of Electrical Engineering and Computer Science and the Center for Computational Biology, University of California, Berkeley, California, USA
| | - Aviv Regev
- Howard Hughes Medical Institute, Department of Biology, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA.,Broad Institute of MIT and Harvard, Cambridge, Massachusetts, USA
| | - Nir Yosef
- Department of Electrical Engineering and Computer Science and the Center for Computational Biology, University of California, Berkeley, California, USA.,Ragon Institute of Massachusetts General Hospital, Massachusetts Institute of Technology, and Harvard University, Boston, Massachusetts, USA
| |
Collapse
|
23
|
Shekhar K, Lapan SW, Whitney IE, Tran NM, Macosko EZ, Kowalczyk M, Adiconis X, Levin JZ, Nemesh J, Goldman M, McCarroll SA, Cepko CL, Regev A, Sanes JR. Comprehensive Classification of Retinal Bipolar Neurons by Single-Cell Transcriptomics. Cell 2016; 166:1308-1323.e30. [PMID: 27565351 DOI: 10.1016/j.cell.2016.07.054] [Citation(s) in RCA: 771] [Impact Index Per Article: 85.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2016] [Revised: 06/10/2016] [Accepted: 07/28/2016] [Indexed: 12/15/2022]
Abstract
Patterns of gene expression can be used to characterize and classify neuronal types. It is challenging, however, to generate taxonomies that fulfill the essential criteria of being comprehensive, harmonizing with conventional classification schemes, and lacking superfluous subdivisions of genuine types. To address these challenges, we used massively parallel single-cell RNA profiling and optimized computational methods on a heterogeneous class of neurons, mouse retinal bipolar cells (BCs). From a population of ∼25,000 BCs, we derived a molecular classification that identified 15 types, including all types observed previously and two novel types, one of which has a non-canonical morphology and position. We validated the classification scheme and identified dozens of novel markers using methods that match molecular expression to cell morphology. This work provides a systematic methodology for achieving comprehensive molecular classification of neurons, identifies novel neuronal types, and uncovers transcriptional differences that distinguish types within a class.
Collapse
Affiliation(s)
- Karthik Shekhar
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Sylvain W Lapan
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| | - Irene E Whitney
- Center for Brain Science and Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02130, USA
| | - Nicholas M Tran
- Center for Brain Science and Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02130, USA
| | - Evan Z Macosko
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | | | - Xian Adiconis
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Joshua Z Levin
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - James Nemesh
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Melissa Goldman
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Steven A McCarroll
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Stanley Center for Psychiatric Research, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Program in Medical and Population Genetics, Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA
| | - Constance L Cepko
- Department of Genetics, Harvard Medical School, Boston, MA 02115, USA; Department of Ophthalmology, Harvard Medical School, Boston, MA 02115, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.
| | - Aviv Regev
- Broad Institute of Harvard and MIT, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA; Department of Biology and Koch Institute, MIT, Cambridge, MA 02139, USA.
| | - Joshua R Sanes
- Center for Brain Science and Department of Molecular and Cellular Biology, Harvard University, Cambridge, MA 02130, USA.
| |
Collapse
|
24
|
Singer M, Wang C, Cong L, Marjanovic ND, Kowalczyk MS, Zhang H, Nyman J, Sakuishi K, Kurtulus S, Gennert D, Xia J, Kwon JYH, Nevin J, Herbst RH, Yanai I, Rozenblatt-Rosen O, Kuchroo VK, Regev A, Anderson AC. A Distinct Gene Module for Dysfunction Uncoupled from Activation in Tumor-Infiltrating T Cells. Cell 2016; 166:1500-1511.e9. [PMID: 27610572 DOI: 10.1016/j.cell.2016.08.052] [Citation(s) in RCA: 271] [Impact Index Per Article: 30.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 08/14/2016] [Accepted: 08/23/2016] [Indexed: 02/01/2023]
Abstract
Reversing the dysfunctional T cell state that arises in cancer and chronic viral infections is the focus of therapeutic interventions; however, current therapies are effective in only some patients and some tumor types. To gain a deeper molecular understanding of the dysfunctional T cell state, we analyzed population and single-cell RNA profiles of CD8(+) tumor-infiltrating lymphocytes (TILs) and used genetic perturbations to identify a distinct gene module for T cell dysfunction that can be uncoupled from T cell activation. This distinct dysfunction module is downstream of intracellular metallothioneins that regulate zinc metabolism and can be identified at single-cell resolution. We further identify Gata-3, a zinc-finger transcription factor in the dysfunctional module, as a regulator of dysfunction, and we use CRISPR-Cas9 genome editing to show that it drives a dysfunctional phenotype in CD8(+) TILs. Our results open novel avenues for targeting dysfunctional T cell states while leaving activation programs intact.
Collapse
Affiliation(s)
- Meromit Singer
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Chao Wang
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Le Cong
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Nemanja D Marjanovic
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biology, Koch Institute and Ludwig Center, Massachusetts Institute of Technology, Cambridge, MA 02142, USA
| | | | - Huiyuan Zhang
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Jackson Nyman
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Kaori Sakuishi
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | - Sema Kurtulus
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | - David Gennert
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - Junrong Xia
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | - John Y H Kwon
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
| | - James Nevin
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA
| | | | - Itai Yanai
- Institute for Computational Medicine and Department of Biochemistry and Molecular Pharmacology, New York University School of Medicine, New York, NY 10016, USA
| | | | - Vijay K Kuchroo
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA.
| | - Aviv Regev
- Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Biology, Koch Institute and Ludwig Center, Massachusetts Institute of Technology, Cambridge, MA 02142, USA; Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA.
| | - Ana C Anderson
- Evergrande Center for Immunologic Diseases and Ann Romney Center for Neurologic Diseases, Harvard Medical School and Brigham and Women's Hospital, Boston, MA 02115, USA.
| |
Collapse
|