Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Yang L, Liu J, Lu Q, Riggs AD, Wu X. SAIC: an iterative clustering approach for analysis of single cell RNA-seq data. BMC Genomics 2017;18:689. [PMID: 28984204 PMCID: PMC5629617 DOI: 10.1186/s12864-017-4019-5] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.3] [Indexed: 12/14/2022] Open

Cited by Other Article(s)

Xie J, Ruan S, Tu M, Yuan Z, Hu J, Li H, Li S. Clustering single-cell RNA sequencing data via iterative smoothing and self-supervised discriminative embedding. Oncogene 2024:10.1038/s41388-024-03074-5. [PMID: 38834657 DOI: 10.1038/s41388-024-03074-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/21/2024] [Revised: 05/22/2024] [Accepted: 05/28/2024] [Indexed: 06/06/2024]

Gong Y, Haeri M, Zhang X, Li Y, Liu A, Wu D, Zhang Q, Jazwinski SM, Zhou X, Wang X, Jiang L, Chen YP, Yan X, Swerdlow RH, Shen H, Deng HW. Spatial Dissection of the Distinct Cellular Responses to Normal Aging and Alzheimer's Disease in Human Prefrontal Cortex at Single-Nucleus Resolution. medRxiv: The Preprint Server for Health Sciences 2024:2024.05.21.24306783. [PMID: 38826275 PMCID: PMC11142279 DOI: 10.1101/2024.05.21.24306783] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 06/04/2024]

Abstract

Aging significantly elevates the risk for Alzheimer's disease (AD), contributing to the accumulation of AD pathologies, such as amyloid-β (Aβ), inflammation, and oxidative stress. The human prefrontal cortex (PFC) is highly vulnerable to the impacts of both aging and AD. Unveiling and understanding the molecular alterations in PFC associated with normal aging (NA) and AD is essential for elucidating the mechanisms of AD progression and developing novel therapeutics for this devastating disease. In this study, for the first time, we employed a cutting-edge spatial transcriptome platform, STOmics® SpaTial Enhanced Resolution Omics-sequencing (Stereo-seq), to generate the first comprehensive, subcellular resolution spatial transcriptome atlas of the human PFC from six AD cases at various neuropathological stages and six age, sex, and ethnicity matched controls. Our analyses revealed distinct transcriptional alterations across six neocortex layers, highlighted the AD-associated disruptions in laminar architecture, and identified changes in layer-to-layer interactions as AD progresses. Further, throughout the progression from NA to various stages of AD, we discovered specific genes that were significantly upregulated in neurons experiencing high stress and in nearby non-neuronal cells, compared to cells distant from the source of stress. Notably, the cell-cell interactions between the neurons under the high stress and adjacent glial cells that promote Aβ clearance and neuroprotection were diminished in AD in response to stressors compared to NA. Through cell-type specific gene co-expression analysis, we identified three modules in excitatory and inhibitory neurons associated with neuronal protection, protein dephosphorylation, and negative regulation of Aβ plaque formation. These modules negatively correlated with AD progression, indicating a reduced capacity for toxic substance clearance in AD subject samples. 
Moreover, we have discovered a novel transcription factor, ZNF460, that regulates all three modules, establishing it as a potential new therapeutic target for AD. Overall, utilizing the latest spatial transcriptome platform, our study developed the first transcriptome-wide atlas with subcellular resolution for assessing the molecular alterations in the human PFC due to AD. This atlas sheds light on the potential mechanisms underlying the progression from NA to AD.

Affiliation(s)

Yun Gong Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Mohammad Haeri Department of Pathology & Laboratory Medicine, University of Kansas Medical Center, Kansas City, MO, 66160, USA
Xiao Zhang Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Yisu Li Department of Cell and Molecular Biology, School of Science and Engineering, Tulane University, New Orleans, LA, 70118, USA
Anqi Liu Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Di Wu Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Qilei Zhang School of Basic Medical Sciences, Central South University, Changsha, Hunan, 410008, China
S. Michal Jazwinski Tulane Center for Aging, Deming Department of Medicine, Tulane University School of Medicine, New Orleans, LA 70112, USA
Xiang Zhou Department of Biostatistics, University of Michigan, Ann Arbor, MI, 48109, USA
Xiaoying Wang Clinical Neuroscience Research Center, Departments of Neurosurgery and Neurology, Tulane University School of Medicine, New Orleans, LA 70112, USA
Lindong Jiang Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Yi-Ping Chen Department of Cell and Molecular Biology, School of Science and Engineering, Tulane University, New Orleans, LA, 70118, USA
Xiaoxin Yan School of Basic Medical Sciences, Central South University, Changsha, Hunan, 410008, China
Russell H. Swerdlow Department of Neurology, University of Kansas Medical Center, Kansas City, MO, 66160, USA
Hui Shen Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA
Hong-Wen Deng Tulane Center for Biomedical Informatics and Genomics, Deming Department of Medicine, School of Medicine, Tulane University, New Orleans, LA, 70112, USA

Zou X, Liu Y, Wang M, Zou J, Shi Y, Su X, Xu J, Tong HHY, Ji Y, Gui L, Hao J. scCURE identifies cell types responding to immunotherapy and enables outcome prediction. Cell Reports Methods 2023;3:100643. [PMID: 37989083 PMCID: PMC10694528 DOI: 10.1016/j.crmeth.2023.100643] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/17/2023] [Revised: 07/17/2023] [Accepted: 10/23/2023] [Indexed: 11/23/2023]

Cui T, Wang T. A comprehensive assessment of hurdle and zero-inflated models for single cell RNA-sequencing analysis. Brief Bioinform 2023;24:bbad272. [PMID: 37507115 PMCID: PMC10516395 DOI: 10.1093/bib/bbad272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/18/2023] [Revised: 06/17/2023] [Accepted: 07/06/2023] [Indexed: 07/30/2023] Open

Erfanian N, Heydari AA, Feriz AM, Iañez P, Derakhshani A, Ghasemigol M, Farahpour M, Razavi SM, Nasseri S, Safarpour H, Sahebkar A. Deep learning applications in single-cell genomics and transcriptomics data analysis. Biomed Pharmacother 2023;165:115077. [PMID: 37393865 DOI: 10.1016/j.biopha.2023.115077] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Received: 04/24/2023] [Revised: 06/22/2023] [Accepted: 06/23/2023] [Indexed: 07/04/2023] Open

Zhang S, Li X, Lin J, Lin Q, Wong KC. Review of single-cell RNA-seq data clustering for cell-type identification and characterization. RNA (New York, N.Y.) 2023;29:517-530. [PMID: 36737104 PMCID: PMC10158997 DOI: 10.1261/rna.078965.121] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Received: 03/27/2022] [Accepted: 01/03/2023] [Indexed: 05/06/2023]

Multi-Objective Genetic Algorithm for Cluster Analysis of Single-Cell Transcriptomes. J Pers Med 2023;13:jpm13020183. [PMID: 36836417 PMCID: PMC9960600 DOI: 10.3390/jpm13020183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/21/2022] [Revised: 01/15/2023] [Accepted: 01/16/2023] [Indexed: 01/22/2023] Open

Xu L, Xue T, Ding W, Shen L. Comparison of scRNA-seq data analysis method combinations. Brief Funct Genomics 2022;21:433-440. [PMID: 36124658 DOI: 10.1093/bfgp/elac027] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/13/2022] [Revised: 07/29/2022] [Accepted: 07/29/2022] [Indexed: 12/14/2022] Open

Abstract

Single-cell ribonucleic acid (RNA)-sequencing (scRNA-seq) data analysis refers to the use of appropriate methods to analyze the dataset generated by RNA-sequencing performed on the single-cell transcriptome. It usually contains three steps: normalization to eliminate the technical noise, dimensionality reduction to facilitate visual understanding and data compression and clustering to divide the data into several similarity-based clusters. In addition, the gene expression data contain a large number of zero counts. These zero counts are considered relevant to random dropout events induced by multiple factors in the sequencing experiments, such as low RNA input, and the stochastic nature of the gene expression pattern at the single-cell level. The zero counts can be eliminated only through the analysis of the scRNA-seq data, and although many methods have been proposed to this end, there is still a lack of research on the combined effect of existing methods. In this paper, we summarize the two kinds of normalization, two kinds of dimension reduction and three kinds of clustering methods widely used in the current mainstream scRNA-seq data analysis. Furthermore, we propose to combine these methods into 12 technology combinations, each with a whole set of scRNA-seq data analysis processes. We evaluated the proposed combinations using Goolam, a publicly available scRNA-seq, by comparing the final clustering results and found the most suitable collection scheme of these classic methods. Our results showed that using appropriate technology combinations can improve the efficiency and accuracy of the scRNA-seq data analysis. The combinations not only satisfy the basic requirements of noise reduction, dimension reduction and cell clustering but also ensure preserving the heterogeneity of cells in downstream analysis. The dataset, Goolam, used in the study can be obtained from the ArrayExpress database under the accession number E-MTAB-3321.

Zeng Y, Wei Z, Zhong F, Pan Z, Lu Y, Yang Y. A parameter-free deep embedded clustering method for single-cell RNA-seq data. Brief Bioinform 2022;23:6582003. [PMID: 35524494 DOI: 10.1093/bib/bbac172] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 12/21/2021] [Revised: 03/25/2022] [Accepted: 04/18/2022] [Indexed: 11/12/2022] Open

He S, Dou L, Li X, Zhang Y. Review of bioinformatics in Azheimer's Disease Research. Comput Biol Med 2022;143:105269. [PMID: 35158118 DOI: 10.1016/j.compbiomed.2022.105269] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Received: 01/06/2022] [Revised: 01/21/2022] [Accepted: 01/23/2022] [Indexed: 01/05/2023]

Single Cell Self-Paced Clustering with Transcriptome Sequencing Data. Int J Mol Sci 2022;23:ijms23073900. [PMID: 35409258 PMCID: PMC8999118 DOI: 10.3390/ijms23073900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2022] [Revised: 03/28/2022] [Accepted: 03/29/2022] [Indexed: 11/17/2022] Open

Simultaneous Learning the Dimension and Parameter of a Statistical Model with Big Data. Statistics in Biosciences 2021. [DOI: 10.1007/s12561-021-09324-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/27/2022]

Su K, Yu T, Wu H. Accurate feature selection improves single-cell RNA-seq cell clustering. Brief Bioinform 2021;22:6145899. [PMID: 33611426 DOI: 10.1093/bib/bbab034] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Received: 09/23/2020] [Revised: 01/06/2021] [Accepted: 01/22/2021] [Indexed: 02/04/2023] Open

Liu Z. Clustering Single-Cell RNA-Seq Data with Regularized Gaussian Graphical Model. Genes (Basel) 2021;12:311. [PMID: 33671799 PMCID: PMC7927011 DOI: 10.3390/genes12020311] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 12/27/2020] [Revised: 02/07/2021] [Accepted: 02/15/2021] [Indexed: 11/20/2022] Open

Nayak R, Hasija Y. A hitchhiker's guide to single-cell transcriptomics and data analysis pipelines. Genomics 2021;113:606-619. [PMID: 33485955 DOI: 10.1016/j.ygeno.2021.01.007] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Received: 08/09/2020] [Revised: 12/30/2020] [Accepted: 01/18/2021] [Indexed: 12/20/2022]

Decoding myofibroblast origins in human kidney fibrosis. Nature 2021;589:281-286. [PMID: 33176333 PMCID: PMC7611626 DOI: 10.1038/s41586-020-2941-1] [Citation(s) in RCA: 343] [Impact Index Per Article: 114.3] [Received: 01/30/2020] [Accepted: 10/19/2020] [Indexed: 01/29/2023]

Xie K, Huang Y, Zeng F, Liu Z, Chen T. scAIDE: clustering of large-scale single-cell RNA-seq data reveals putative and rare cell types. NAR Genom Bioinform 2020;2:lqaa082. [PMID: 33575628 PMCID: PMC7671411 DOI: 10.1093/nargab/lqaa082] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Received: 05/09/2020] [Revised: 08/20/2020] [Accepted: 09/18/2020] [Indexed: 02/07/2023] Open

Cao W, Lee H, Wu W, Zaman A, McCorkle S, Yan M, Chen J, Xing Q, Sinnott-Armstrong N, Xu H, Sailani MR, Tang W, Cui Y, Liu J, Guan H, Lv P, Sun X, Sun L, Han P, Lou Y, Chang J, Wang J, Gao Y, Guo J, Schenk G, Shain AH, Biddle FG, Collisson E, Snyder M, Bivona TG. Multi-faceted epigenetic dysregulation of gene expression promotes esophageal squamous cell carcinoma. Nat Commun 2020;11:3675. [PMID: 32699215 PMCID: PMC7376194 DOI: 10.1038/s41467-020-17227-z] [Citation(s) in RCA: 59] [Impact Index Per Article: 14.8] [Received: 12/20/2019] [Accepted: 06/17/2020] [Indexed: 12/20/2022] Open

Affiliation(s)

Wei Cao Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China.
Hayan Lee Department of Genetics, School of Medicine, Stanford University, CA, USA
Wei Wu Department of Medicine, University of California San Francisco, San Francisco, CA, USA. Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA, USA.
Aubhishek Zaman Department of Medicine, University of California San Francisco, San Francisco, CA, USA. Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA, USA
Sean McCorkle Computational Science Initiative, Brookhaven National Laboratory, Upton, NY, USA
Ming Yan Basic Medical College, Zhengzhou University, Zhengzhou, China
Justin Chen Department of Genetics, School of Medicine, Stanford University, CA, USA
Qinghe Xing Institutes of Biomedical Sciences and Children's Hospital, Fudan University, Shanghai, China
Nasa Sinnott-Armstrong Department of Genetics, School of Medicine, Stanford University, CA, USA
Hongen Xu Precision Medicine Center, The Academy of Medical Sciences, Zhengzhou University, Zhengzhou, China
M Reza Sailani Department of Genetics, School of Medicine, Stanford University, CA, USA
Wenxue Tang Precision Medicine Center, The Academy of Medical Sciences, Zhengzhou University, Zhengzhou, China
Yuanbo Cui Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Jia Liu Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Hongyan Guan Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Pengju Lv Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Xiaoyan Sun Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Lei Sun Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Pengli Han Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Yanan Lou Translational Medical Center, Zhengzhou Central Hospital Affiliated Zhengzhou University, Zhengzhou, China
Jing Chang Jiangsu Mai Jian Biotechnology Development Company, Wuxi, China
Jinwu Wang Department of Pathology, Linzhou Cancer Hospital, Linzhou, China
Yuchi Gao Annoroad Gene Company, Beijing, China
Jiancheng Guo Precision Medicine Center, The Academy of Medical Sciences, Zhengzhou University, Zhengzhou, China
Gundolf Schenk Bakar Computational Health Sciences Institute, University of California San Francisco, San Francisco, CA, USA
Alan Hunter Shain Department of Dermatology, University of California San Francisco, San Francisco, CA, USA
Fred G Biddle Department of Biological Sciences, University of Calgary, Calgary, Canada
Eric Collisson Department of Medicine, University of California San Francisco, San Francisco, CA, USA. Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA, USA
Michael Snyder Department of Genetics, School of Medicine, Stanford University, CA, USA.
Trever G Bivona Department of Medicine, University of California San Francisco, San Francisco, CA, USA. Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA, USA.

Dimension Reduction and Clustering Models for Single-Cell RNA Sequencing Data: A Comparative Study. Int J Mol Sci 2020;21:ijms21062181. [PMID: 32235704 PMCID: PMC7139673 DOI: 10.3390/ijms21062181] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Received: 01/20/2020] [Revised: 03/09/2020] [Accepted: 03/20/2020] [Indexed: 12/30/2022] Open

Mieth B, Hockley JRF, Görnitz N, Vidovic MMC, Müller KR, Gutteridge A, Ziemek D. Using transfer learning from prior reference knowledge to improve the clustering of single-cell RNA-Seq data. Sci Rep 2019;9:20353. [PMID: 31889137 PMCID: PMC6937257 DOI: 10.1038/s41598-019-56911-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Received: 04/29/2019] [Accepted: 12/13/2019] [Indexed: 01/21/2023] Open

Petegrosso R, Li Z, Kuang R. Machine learning and statistical methods for clustering single-cell RNA-sequencing data. Brief Bioinform 2019;21:1209-1223. [DOI: 10.1093/bib/bbz063] [Citation(s) in RCA: 65] [Impact Index Per Article: 13.0] [Received: 01/25/2019] [Revised: 04/04/2019] [Accepted: 04/29/2019] [Indexed: 01/08/2023] Open

Abstract

Single-cell RNA sequencing (scRNA-seq) technologies have enabled the large-scale whole-transcriptome profiling of each individual single cell in a cell population. A core analysis of the scRNA-seq transcriptome profiles is to cluster the single cells to reveal cell subtypes and infer cell lineages based on the relations among the cells. This article reviews the machine learning and statistical methods for clustering scRNA-seq transcriptomes developed in the past few years. The review focuses on how conventional clustering techniques such as hierarchical clustering, graph-based clustering, mixture models, k-means, ensemble learning, neural networks and density-based clustering are modified or customized to tackle the unique challenges in scRNA-seq data analysis, such as the dropout of low-expression genes, low and uneven read coverage of transcripts, highly variable total mRNAs from single cells and ambiguous cell markers in the presence of technical biases and irrelevant confounding biological variations. We review how cell-specific normalization, the imputation of dropouts and dimension reduction methods can be applied with new statistical or optimization strategies to improve the clustering of single cells. We will also introduce those more advanced approaches to cluster scRNA-seq transcriptomes in time series data and multiple cell populations and to detect rare cell types. Several software packages developed to support the cluster analysis of scRNA-seq data are also reviewed and experimentally compared to evaluate their performance and efficiency. Finally, we conclude with useful observations and possible future directions in scRNA-seq data analytics.

Availability: All the source code and data are available at https://github.com/kuanglab/single-cell-review.

Li X, Zhang S, Wong KC. Single-cell RNA-seq interpretations using evolutionary multiobjective ensemble pruning. Bioinformatics 2018;35:2809-2817. [DOI: 10.1093/bioinformatics/bty1056] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Received: 07/26/2018] [Revised: 10/31/2018] [Accepted: 12/21/2018] [Indexed: 11/14/2022] Open

Abstract

Motivation: In recent years, single-cell RNA sequencing enables us to discover cell types or even subtypes. Its increasing availability provides opportunities to identify cell populations from single-cell RNA-seq data. Computational methods have been employed to reveal the gene expression variations among multiple cell populations. Unfortunately, the existing ones can suffer from realistic restrictions such as experimental noises, numerical instability, high dimensionality and computational scalability.

Results: We propose an evolutionary multiobjective ensemble pruning algorithm (EMEP) that addresses those realistic restrictions. Our EMEP algorithm first applies the unsupervised dimensionality reduction to project data from the original high dimensions to low-dimensional subspaces; basic clustering algorithms are applied in those new subspaces to generate different clustering results to form cluster ensembles. However, most of those cluster ensembles are unnecessarily bulky with the expense of extra time costs and memory consumption. To overcome that problem, EMEP is designed to dynamically select the suitable clustering results from the ensembles. Moreover, to guide the multiobjective ensemble evolution, three cluster validity indices including the overall cluster deviation, the within-cluster compactness and the number of basic partition clusters are formulated as the objective functions to unleash its cell type discovery performance using evolutionary multiobjective optimization. We applied EMEP to 55 simulated datasets and seven real single-cell RNA-seq datasets, including six single-cell RNA-seq datasets and one large-scale dataset with 3005 cells and 4412 genes. Two case studies are also conducted to reveal mechanistic insights into the biological relevance of EMEP. We found that EMEP can achieve superior performance over the other clustering algorithms, demonstrating that EMEP can identify cell populations clearly.

Availability and implementation: EMEP is written in Matlab and available at https://github.com/lixt314/EMEP

Supplementary information: Supplementary data are available at Bioinformatics online.

The International Conference on Intelligent Biology and Medicine (ICIBM) 2016: summary and innovation in genomics. BMC Genomics 2017;18:703. [PMID: 28984207 PMCID: PMC5629612 DOI: 10.1186/s12864-017-4018-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Indexed: 01/06/2023] Open