1
|
Tzaferis C, Karatzas E, Baltoumas FA, Pavlopoulos GA, Kollias G, Konstantopoulos D. SCALA: A complete solution for multimodal analysis of single-cell Next Generation Sequencing data. Comput Struct Biotechnol J 2023; 21:5382-5393. [PMID: 38022693 PMCID: PMC10651449 DOI: 10.1016/j.csbj.2023.10.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Revised: 10/16/2023] [Accepted: 10/16/2023] [Indexed: 12/01/2023] Open
Abstract
Analysis and interpretation of high-throughput transcriptional and chromatin accessibility data at single-cell (sc) resolution are still open challenges in the biomedical field. The existence of countless bioinformatics tools, for the different analytical steps, increases the complexity of data interpretation and the difficulty to derive biological insights. In this article, we present SCALA, a bioinformatics tool for analysis and visualization of single-cell RNA sequencing (scRNA-seq) and Assay for Transposase-Accessible Chromatin using sequencing (scATAC-seq) datasets, enabling either independent or integrative analysis of the two modalities. SCALA combines standard types of analysis by integrating multiple software packages varying from quality control to the identification of distinct cell populations and cell states. Additional analysis options enable functional enrichment, cellular trajectory inference, ligand-receptor analysis, and regulatory network reconstruction. SCALA is fully parameterizable, presenting data in tabular format and producing publication-ready visualizations. The different available analysis modules can aid biomedical researchers in exploring, analyzing, and visualizing their data without any prior experience in coding. We demonstrate the functionality of SCALA through two use-cases related to TNF-driven arthritic mice, handling both scRNA-seq and scATAC-seq datasets. SCALA is developed in R, Shiny and JavaScript and is mainly available as a standalone version, while an online service of more limited capacity can be found at http://scala.pavlopouloslab.info or https://scala.fleming.gr.
Collapse
Affiliation(s)
- Christos Tzaferis
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Evangelos Karatzas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Fotis A. Baltoumas
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| | - Georgios A. Pavlopoulos
- Institute for Fundamental Biomedical Research, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
| | - George Kollias
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
- Research Institute of New Biotechnologies and Precision Medicine, National and Kapodistrian University of Athens, Greece
- Department of Physiology, Medical School, National and Kapodistrian University of Athens, Greece
| | - Dimitris Konstantopoulos
- Institute for Bioinnovation, Biomedical Sciences Research Center "Alexander Fleming", Vari, Greece
| |
Collapse
|
2
|
Nassiri I, Fairfax B, Lee A, Wu Y, Buck D, Piazza P. scQCEA: a framework for annotation and quality control report of single-cell RNA-sequencing data. BMC Genomics 2023; 24:381. [PMID: 37415108 DOI: 10.1186/s12864-023-09447-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2022] [Accepted: 06/13/2023] [Indexed: 07/08/2023] Open
Abstract
BACKGROUND Systematic description of library quality and sequencing performance of single-cell RNA sequencing (scRNA-seq) data is imperative for subsequent downstream modules, including re-pooling libraries. While several packages have been developed to visualise quality control (QC) metrics for scRNA-seq data, they do not include expression-based QC to discriminate between true variation and background noise. RESULTS We present scQCEA (acronym of the single-cell RNA sequencing Quality Control and Enrichment Analysis), an R package to generate reports of process optimisation metrics for comparing sets of samples and visual evaluation of quality scores. scQCEA can import data from 10X or other single-cell platforms and includes functions for generating an interactive report of QC metrics for multi-omics data. In addition, scQCEA provides automated cell type annotation on scRNA-seq data using differential gene expression patterns for expression-based quality control. We provide a repository of reference gene sets, including 2348 marker genes, which are exclusively expressed in 95 human and mouse cell types. Using scRNA-seq data from 56 gene expressions and V(D)J T cell replicates, we show how scQCEA can be applied for the visual evaluation of quality scores for sets of samples. In addition, we use the summary of QC measures from 342 human and mouse shallow-sequenced gene expression profiles to specify optimal sequencing requirements to run a cell-type enrichment analysis function. CONCLUSIONS The open-source R tool will allow examining biases and outliers over biological and technical measures, and objective selection of optimal cluster numbers before downstream analysis. scQCEA is available at https://isarnassiri.github.io/scQCEA/ as an R package. Full documentation, including an example, is provided on the package website.
Collapse
Affiliation(s)
- Isar Nassiri
- Oxford Genomics Centre, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| | - Benjamin Fairfax
- MRC-Weatherall Institute of Molecular Medicine, University of Oxford, Oxford, UK
- Department of Oncology, University of Oxford & Oxford Cancer Centre, Churchill Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, UK
| | - Angela Lee
- Oxford Genomics Centre, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Yanxia Wu
- Oxford Genomics Centre, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - David Buck
- Oxford Genomics Centre, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Paolo Piazza
- Oxford Genomics Centre, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK.
| |
Collapse
|
3
|
Ma A, Wang X, Li J, Wang C, Xiao T, Liu Y, Cheng H, Wang J, Li Y, Chang Y, Li J, Wang D, Jiang Y, Su L, Xin G, Gu S, Li Z, Liu B, Xu D, Ma Q. Single-cell biological network inference using a heterogeneous graph transformer. Nat Commun 2023; 14:964. [PMID: 36810839 PMCID: PMC9944243 DOI: 10.1038/s41467-023-36559-0] [Citation(s) in RCA: 47] [Impact Index Per Article: 23.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2022] [Accepted: 02/06/2023] [Indexed: 02/23/2023] Open
Abstract
Single-cell multi-omics (scMulti-omics) allows the quantification of multiple modalities simultaneously to capture the intricacy of complex molecular mechanisms and cellular heterogeneity. Existing tools cannot effectively infer the active biological networks in diverse cell types and the response of these networks to external stimuli. Here we present DeepMAPS for biological network inference from scMulti-omics. It models scMulti-omics in a heterogeneous graph and learns relations among cells and genes within both local and global contexts in a robust manner using a multi-head graph transformer. Benchmarking results indicate DeepMAPS performs better than existing tools in cell clustering and biological network construction. It also showcases competitive capability in deriving cell-type-specific biological networks in lung tumor leukocyte CITE-seq data and matched diffuse small lymphocytic lymphoma scRNA-seq and scATAC-seq data. In addition, we deploy a DeepMAPS webserver equipped with multiple functionalities and visualizations to improve the usability and reproducibility of scMulti-omics data analysis.
Collapse
Affiliation(s)
- Anjun Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Xiaoying Wang
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Jingxian Li
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Cankun Wang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Tong Xiao
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Yuntao Liu
- School of Mathematics, Shandong University, Jinan, Shandong, China
| | - Hao Cheng
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Juexin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Yang Li
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Yuzhou Chang
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Jinpu Li
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Duolin Wang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Yuexu Jiang
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
| | - Li Su
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA
| | - Gang Xin
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Shaopeng Gu
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA
| | - Zihai Li
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, Shandong, China.
| | - Dong Xu
- Department of Electrical Engineering and Computer Science, University of Missouri, Columbia, MO, USA.
- Christopher S. Bond Life Sciences Center, University of Missouri, Columbia, MO, USA.
- Institute for Data Science and Informatics, University of Missouri, Columbia, MO, USA.
| | - Qin Ma
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH, USA.
- Pelotonia Institute for Immuno-Oncology, The James Comprehensive Cancer Center, The Ohio State University, Columbus, OH, USA.
| |
Collapse
|
4
|
scWizard: a web-based automated tool for classifying and annotating single cells and downstream analysis of single-cell RNA-seq data in cancers. Comput Struct Biotechnol J 2022; 20:4902-4909. [PMID: 36147672 PMCID: PMC9474308 DOI: 10.1016/j.csbj.2022.08.028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 07/27/2022] [Accepted: 08/12/2022] [Indexed: 11/22/2022] Open
Abstract
scWizard provides comprehensive analysis pipeline for integration strategies of cancer scRNA-seq data. scWizard enables classification of 47 cell subtypes within the TME based on hierarchical model by deep neural network. scWizard gives a higher accuracy for annotation cell subtypes within the TME compared with five methods. scWizard packages is a point-and-click tool helping for researchers without proficient programming skills.
The emerging number of single-cell RNA-seq (scRNA-Seq) datasets allows the characterization of cell types across various cancer types. However, there is still lack of effective tools to integrate the various analysis of single-cells, especially for making fine annotation on subtype cells within the tumor microenvironment (TME). We developed scWizard, a point-and-click tool packaging automated process including our developed cell annotation method based on deep neural network learning and 11 downstream analyses methods. scWizard used 113,976 cells across 13 cancer types as a built-in reference dataset for training the hierarchical model enabling to automatedly classify and annotate 7 major cell types and 47 cell subtypes in the TME. scWizard provides a built-in pre-training set for user’s flexible choice, and gives a higher accuracy for annotation subtypes of tumor-derived T-lymphocytes/natural killer cells (T/NK) and myeloid cells from different cancer types compared with the existing five methods. scWizard has good robustness in three independent cancer datasets, with an accuracy of 0.98 in annotating major cell types, 0.85 in annotating myeloid cell subtypes and 0.79 in annotating T/NK cell subtypes, indicting the wide applicability of scWizard in different cell types of cancers. Finally, the automatic analysis and visualization function of scWizard are presented by using the intrahepatic cholangiocarcinoma (ICC) scRNA-Seq dataset as a case. scWizard focuses on decoding TME and covers various analysis flows for cancer scRNA-Seq study, and provides an easy-to-use tool and a user-friendly interface for researchers widely, to further accelerate the biological discovery of cancer research.
Collapse
|
5
|
Lin X, Chi D, Meng Q, Gong Q, Tong Z. Single-Cell Sequencing Unveils the Heterogeneity of Nonimmune Cells in Chronic Apical Periodontitis. Front Cell Dev Biol 2022; 9:820274. [PMID: 35237614 PMCID: PMC8883837 DOI: 10.3389/fcell.2021.820274] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Accepted: 12/24/2021] [Indexed: 12/12/2022] Open
Abstract
Chronic apical periodontitis (CAP) is a unique dynamic interaction between microbial invasions and host defense mechanisms, resulting in infiltration of immune cells, bone absorption, and periapical granuloma formation. To help to understand periapical tissue pathophysiology, we constituted a single-cell atlas for 26,737 high-quality cells from inflammatory periapical tissue and uncovered the complex cellular landscape. The eight types of cells, including nonimmune cells and immune cells, were identified in the periapical tissue of CAP. Considering the key roles of nonimmune cells in CAP, we emphasized osteo-like cells, basal/stromal cells, endothelial cells, and epithelial cells, and discovered their diversity and heterogeneity. The temporal profiling of genomic alterations from common CAP to typical periapical granuloma provided predictions for transcription factors and biological processes. Our study presented potential clues that the shift of inflammatory cytokines, chemokines, proteases, and growth factors initiated polymorphic cell differentiation, lymphangiogenesis, and angiogenesis during CAP.
Collapse
Affiliation(s)
- Xinwei Lin
- Department of Operative Dentistry and Endodontics, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
| | - Danlu Chi
- Department of Operative Dentistry and Endodontics, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
| | - Qingzhen Meng
- Department of Operative Dentistry and Endodontics, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
| | - Qimei Gong
- Department of Operative Dentistry and Endodontics, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- *Correspondence: Qimei Gong, ; Zhongchun Tong,
| | - Zhongchun Tong
- Department of Operative Dentistry and Endodontics, Hospital of Stomatology, Sun Yat-sen University, Guangzhou, China
- Guangdong Provincial Key Laboratory of Stomatology, Sun Yat-sen University, Guangzhou, China
- *Correspondence: Qimei Gong, ; Zhongchun Tong,
| |
Collapse
|
6
|
Jagla B, Libri V, Chica C, Rouilly V, Mella S, Puceat M, Hasan M. SCHNAPPs - Single Cell sHiNy APPlication(s). J Immunol Methods 2021; 499:113176. [PMID: 34742775 DOI: 10.1016/j.jim.2021.113176] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2020] [Revised: 10/21/2021] [Accepted: 10/25/2021] [Indexed: 11/30/2022]
Abstract
Single-cell RNA-sequencing (scRNAseq) experiments are becoming a standard tool for bench-scientists to explore the cellular diversity present in all tissues. Data produced by scRNAseq is technically complex and requires analytical workflows that are an active field of bioinformatics research, whereas a wealth of biological background knowledge is needed to guide the investigation. Thus, there is an increasing need to develop applications geared towards bench-scientists to help them abstract the technical challenges of the analysis so that they can focus on the science at play. It is also expected that such applications should support closer collaboration between bioinformaticians and bench-scientists by providing reproducible science tools. We present SCHNAPPs, a Graphical User Interface (GUI), designed to enable bench-scientists to autonomously explore and interpret scRNAseq data and associated annotations. The R/Shiny-based application allows following different steps of scRNAseq analysis workflows from Seurat or Scran packages: performing quality control on cells and genes, normalizing the expression matrix, integrating different samples, dimension reduction, clustering, and differential gene expression analysis. Visualization tools for exploring each step of the process include violin plots, 2D projections, Box-plots, alluvial plots, and histograms. An R-markdown report can be generated that tracks modifications and selected visualizations. The modular design of the tool allows it to easily integrate new visualizations and analyses by bioinformaticians. We illustrate the main features of the tool by applying it to the characterization of T cells in a scRNAseq and Cellular Indexing of Transcriptomes and Epitopes by Sequencing (CITE-Seq) experiment of two healthy individuals.
Collapse
Affiliation(s)
- Bernd Jagla
- Institut Pasteur, Université de Paris, Cytometry and Biomarkers UTechS, F-75015 Paris, France; Institut Pasteur, Université de Paris, Bioinformatics and Biostatistics Hub, F-75015 Paris, France.
| | - Valentina Libri
- Institut Pasteur, Université de Paris, Cytometry and Biomarkers UTechS, F-75015 Paris, France
| | - Claudia Chica
- Institut Pasteur, Université de Paris, Bioinformatics and Biostatistics Hub, F-75015 Paris, France
| | | | - Sebastien Mella
- Institut Pasteur, Université de Paris, Cytometry and Biomarkers UTechS, F-75015 Paris, France; Institut Pasteur, Université de Paris, Bioinformatics and Biostatistics Hub, F-75015 Paris, France
| | - Michel Puceat
- Aix-Marseille University, INSERM U-1251, MMG, France
| | - Milena Hasan
- Institut Pasteur, Université de Paris, Cytometry and Biomarkers UTechS, F-75015 Paris, France
| |
Collapse
|
7
|
Gautam V, Mittal A, Kalra S, Mohanty SK, Gupta K, Rani K, Naidu S, Mishra T, Sengupta D, Ahuja G. EcTracker: Tracking and elucidating ectopic expression leveraging large-scale scRNA-seq studies. Brief Bioinform 2021; 22:6309926. [PMID: 34184038 DOI: 10.1093/bib/bbab237] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 05/31/2021] [Accepted: 06/01/2021] [Indexed: 11/13/2022] Open
Abstract
Dramatic genomic alterations, either inducible or in a pathological state, dismantle the core regulatory networks, leading to the activation of normally silent genes. Despite possessing immense therapeutic potential, accurate detection of these transcripts is an ever-challenging task, as it requires prior knowledge of the physiological gene expression levels. Here, we introduce EcTracker, an R-/Shiny-based single-cell data analysis web server that bestows a plethora of functionalities that collectively enable the quantitative and qualitative assessments of bona fide cell types or tissue-specific transcripts and, conversely, the ectopically expressed genes in the single-cell ribonucleic acid sequencing datasets. Moreover, it also allows regulon analysis to identify the key transcriptional factors regulating the user-selected gene signatures. To demonstrate the EcTracker functionality, we reanalyzed the CRISPR interference (CRISPRi) dataset of the human embryonic stem cells differentiated into endoderm lineage and identified the prominent enrichment of a specific gene signature in the SMAD2 knockout cells whose identity was ambiguous in the original study. The key distinguishing features of EcTracker lie within its processing speed, availability of multiple add-on modules, interactive graphical user interface and comprehensiveness. In summary, EcTracker provides an easy-to-perform, integrative and end-to-end single-cell data analysis platform that allows decoding of cellular identities, identification of ectopically expressed genes and their regulatory networks, and therefore, collectively imparts a novel dimension for analyzing single-cell datasets.
Collapse
Affiliation(s)
- Vishakha Gautam
- Indraprastha Institute of Information Technology, Delhi, India
| | - Aayushi Mittal
- Indraprastha Institute of Information Technology, Delhi, India
| | - Siddhant Kalra
- Indraprastha Institute of Information Technology, Delhi, India
| | | | - Krishan Gupta
- Indraprastha Institute of Information Technology, Delhi, India
| | - Komal Rani
- Indraprastha Institute of Information Technology, Delhi, India
| | - Srivatsava Naidu
- Department of Biomedical Engineering, Indian Institute of Technology Ropar, India
| | | | - Debarka Sengupta
- Department of Computational Biology and Department of Computer Science at the Indraprastha Institute of Information Technology, India
| | - Gaurav Ahuja
- Department of Computational Biology at the Indraprastha Institute of Information Technology-Delhi (IIIT-Delhi), India
| |
Collapse
|
8
|
Singh R. Single-Cell Sequencing in Human Genital Infections. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2020; 1255:203-220. [PMID: 32949402 DOI: 10.1007/978-981-15-4494-1_17] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/03/2023]
Abstract
Human genital infections are one of the most concerning issues worldwide and can be categorized into sexually transmitted, urinary tract and vaginal infections. These infections, if left untreated, can disseminate to the other parts of the body and cause more complicated illnesses such as pelvic inflammatory disease, urethritis, and anogenital cancers. The effective treatment against these infections is further complicated by the emergence of antimicrobial resistance in the genital infection causing pathogens. Furthermore, the development and applications of single-cell sequencing technologies have open new possibilities to study the drug resistant clones, cell to cell variations, the discovery of acquired drug resistance mutations, transcriptional diversity of a pathogen across different infection stages, to identify rare cell types and investigate different cellular states of genital infection causing pathogens, and to develop novel therapeutical strategies. In this chapter, I will provide a complete review of the applications of single-cell sequencing in human genital infections before discussing their limitations and challenges.
Collapse
Affiliation(s)
- Reema Singh
- Department of Biochemistry, Microbiology and Immunology, College of Medicine, University of Saskatchewan, Saskatoon, SK, Canada. .,Vaccine and Infectious Disease Organization-International Vaccine Centre, Saskatoon, SK, Canada.
| |
Collapse
|
9
|
Voigt AP, Whitmore SS, Lessing ND, DeLuca AP, Tucker BA, Stone EM, Mullins RF, Scheetz TE. Spectacle: An interactive resource for ocular single-cell RNA sequencing data analysis. Exp Eye Res 2020; 200:108204. [PMID: 32910939 DOI: 10.1016/j.exer.2020.108204] [Citation(s) in RCA: 68] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2020] [Revised: 08/06/2020] [Accepted: 08/25/2020] [Indexed: 12/11/2022]
Abstract
Single-cell RNA sequencing has revolutionized ocular gene expression studies. This technology has enabled researchers to identify expression signatures for rare cell types and characterize how gene expression changes across biological conditions, such as topographic region or disease status. However, sharing single-cell RNA sequencing results remains a major obstacle, particular for individuals without a computational background. To address these limitations, we developed Spectacle, an interactive web-based resource for exploring previously published single-cell RNA sequencing data from ocular studies. Spectacle is powered by a locally developed R package, cellcuratoR, which utilizes the Shiny framework in R to generate interactive visualizations for single-cell expression data. Spectacle contains five pre-processed ocular single-cell RNA sequencing data sets and is accessible via the web at OcularGeneExpression.org/singlecell. With Spectacle, users can interactively identify which cell types express a gene of interest, detect transcriptomic subpopulations within a cell type, and perform highly flexible differential expression analyses. The freely-available Spectacle system reduces the bioinformatic barrier for interacting with rich single-cell RNA sequencing studies from ocular tissues, making it easy to quickly identify cell types that express a gene of interest.
Collapse
Affiliation(s)
- Andrew P Voigt
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - S Scott Whitmore
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Nicholas D Lessing
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Adam P DeLuca
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Budd A Tucker
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Edwin M Stone
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Robert F Mullins
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA
| | - Todd E Scheetz
- Departments of Ophthalmology and Visual Sciences, the University of Iowa Carver College of Medicine, Iowa City, IA, 52242, USA; Institute for Vision Research, the University of Iowa, Iowa City, IA, 52242, USA.
| |
Collapse
|
10
|
NASQAR: a web-based platform for high-throughput sequencing data analysis and visualization. BMC Bioinformatics 2020; 21:267. [PMID: 32600310 PMCID: PMC7322916 DOI: 10.1186/s12859-020-03577-4] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2019] [Accepted: 06/01/2020] [Indexed: 01/23/2023] Open
Abstract
Background As high-throughput sequencing applications continue to evolve, the rapid growth in quantity and variety of sequence-based data calls for the development of new software libraries and tools for data analysis and visualization. Often, effective use of these tools requires computational skills beyond those of many researchers. To ease this computational barrier, we have created a dynamic web-based platform, NASQAR (Nucleic Acid SeQuence Analysis Resource). Results NASQAR offers a collection of custom and publicly available open-source web applications that make extensive use of a variety of R packages to provide interactive data analysis and visualization. The platform is publicly accessible at http://nasqar.abudhabi.nyu.edu/. Open-source code is on GitHub at https://github.com/nasqar/NASQAR, and the system is also available as a Docker image at https://hub.docker.com/r/aymanm/nasqarall. NASQAR is a collaboration between the core bioinformatics teams of the NYU Abu Dhabi and NYU New York Centers for Genomics and Systems Biology. Conclusions NASQAR empowers non-programming experts with a versatile and intuitive toolbox to easily and efficiently explore, analyze, and visualize their Transcriptomics data interactively. Popular tools for a variety of applications are currently available, including Transcriptome Data Preprocessing, RNA-seq Analysis (including Single-cell RNA-seq), Metagenomics, and Gene Enrichment.
Collapse
|
11
|
Franzén O, Björkegren JLM. alona: a web server for single-cell RNA-seq analysis. Bioinformatics 2020; 36:3910-3912. [PMID: 32324845 PMCID: PMC7320629 DOI: 10.1093/bioinformatics/btaa269] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2020] [Revised: 03/27/2020] [Accepted: 04/16/2020] [Indexed: 01/01/2023] Open
Abstract
SUMMARY Single-cell RNA sequencing (scRNA-seq) is a technology to measure gene expression in single cells. It has enabled discovery of new cell types and established cell type atlases of tissues and organs. The widespread adoption of scRNA-seq has created a need for user-friendly software for data analysis. We have developed a web server, alona that incorporates several of the most popular single-cell analysis algorithms into a flexible pipeline. alona can perform quality filtering, normalization, batch correction, clustering, cell type annotation and differential gene expression analysis. Data are visualized in the web browser using an interface based on JavaScript, allowing the user to query genes of interest and visualize the cluster structure. alona accepts a compressed gene expression matrix and identifies cell clusters with a graph-based clustering strategy. Cell types are identified from a comprehensive collection of marker genes or by specifying a custom set of marker genes. AVAILABILITY AND IMPLEMENTATION The service runs at https://alona.panglaodb.se and the Python package can be downloaded from https://oscar-franzen.github.io/adobo/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Oscar Franzén
- Department of Medicine, Integrated Cardio Metabolic Centre, Karolinska Institutet, Huddinge 14157, Sweden
| | - Johan L M Björkegren
- Department of Medicine, Integrated Cardio Metabolic Centre, Karolinska Institutet, Huddinge 14157, Sweden
- Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, New York, NY 10029, USA
| |
Collapse
|
12
|
Feng D, Whitehurst CE, Shan D, Hill JD, Yue YG. Single Cell Explorer, collaboration-driven tools to leverage large-scale single cell RNA-seq data. BMC Genomics 2019; 20:676. [PMID: 31455220 PMCID: PMC6712711 DOI: 10.1186/s12864-019-6053-y] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2019] [Accepted: 08/21/2019] [Indexed: 12/13/2022] Open
Abstract
Background Single cell transcriptome sequencing has become an increasingly valuable technology for dissecting complex biology at a resolution impossible with bulk sequencing. However, the gap between the technical expertise required to effectively work with the resultant high dimensional data and the biological expertise required to interpret the results in their biological context remains incompletely addressed by the currently available tools. Results Single Cell Explorer is a Python-based web server application we developed to enable computational and experimental scientists to iteratively and collaboratively annotate cell expression phenotypes within a user-friendly and visually appealing platform. These annotations can be modified and shared by multiple users to allow easy collaboration between computational scientists and experimental biologists. Data processing and analytic workflows can be integrated into the system using Jupyter notebooks. The application enables powerful yet accessible features such as the identification of differential gene expression patterns for user-defined cell populations and convenient annotation of cell types using marker genes or differential gene expression patterns. Users are able to produce plots without needing Python or R coding skills. As such, by making single cell RNA-seq data sharing and querying more user-friendly, the software promotes deeper understanding and innovation by research teams applying single cell transcriptomic approaches. Conclusions Single cell explorer is a freely-available single cell transcriptomic analysis tool that enables computational and experimental biologists to collaboratively explore, annotate, and share results in a flexible software environment and a centralized database server that supports data portal functionality.
Collapse
Affiliation(s)
- Di Feng
- Computational Biology, Boehringer Ingelheim Pharmaceuticals, Inc., 900 Ridgebury Road, Ridgefield, CT, 06877, USA.
| | - Charles E Whitehurst
- Immunology and Respiratory Disease Research, Boehringer Ingelheim Pharmaceuticals, Inc., 900 Ridgebury Road, Ridgefield, CT, 06877, USA
| | - Dechao Shan
- Computational Biology, Boehringer Ingelheim Pharmaceuticals, Inc., 900 Ridgebury Road, Ridgefield, CT, 06877, USA
| | - Jon D Hill
- Computational Biology, Boehringer Ingelheim Pharmaceuticals, Inc., 900 Ridgebury Road, Ridgefield, CT, 06877, USA
| | - Yong G Yue
- Computational Biology, Boehringer Ingelheim Pharmaceuticals, Inc., 900 Ridgebury Road, Ridgefield, CT, 06877, USA.,Present Address: Data Science, Camp4 Therapeutics Corp, One Kendall Square, Cambridge, MA, 02139, USA
| |
Collapse
|
13
|
Luecken MD, Theis FJ. Current best practices in single-cell RNA-seq analysis: a tutorial. Mol Syst Biol 2019; 15:e8746. [PMID: 31217225 PMCID: PMC6582955 DOI: 10.15252/msb.20188746] [Citation(s) in RCA: 1142] [Impact Index Per Article: 190.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 03/15/2019] [Accepted: 04/03/2019] [Indexed: 12/21/2022] Open
Abstract
Single-cell RNA-seq has enabled gene expression to be studied at an unprecedented resolution. The promise of this technology is attracting a growing user base for single-cell analysis methods. As more analysis tools are becoming available, it is becoming increasingly difficult to navigate this landscape and produce an up-to-date workflow to analyse one's data. Here, we detail the steps of a typical single-cell RNA-seq analysis, including pre-processing (quality control, normalization, data correction, feature selection, and dimensionality reduction) and cell- and gene-level downstream analysis. We formulate current best-practice recommendations for these steps based on independent comparison studies. We have integrated these best-practice recommendations into a workflow, which we apply to a public dataset to further illustrate how these steps work in practice. Our documented case study can be found at https://www.github.com/theislab/single-cell-tutorial This review will serve as a workflow tutorial for new entrants into the field, and help established users update their analysis pipelines.
Collapse
Affiliation(s)
- Malte D Luecken
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
| | - Fabian J Theis
- Institute of Computational Biology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- Department of Mathematics, Technische Universität München, Garching bei München, Germany
| |
Collapse
|