Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Sun T, Song D, Li WV, Li JJ. scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. Genome Biol 2021;22:163. [PMID: 34034771 PMCID: PMC8147071 DOI: 10.1186/s13059-021-02367-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 04/27/2021] [Indexed: 12/13/2022] Open

For:	Sun T, Song D, Li WV, Li JJ. scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. Genome Biol 2021;22:163. [PMID: 34034771 PMCID: PMC8147071 DOI: 10.1186/s13059-021-02367-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2020] [Accepted: 04/27/2021] [Indexed: 12/13/2022] Open

Number

Cited by Other Article(s)

González-Velasco O, Simon M, Yilmaz R, Parlato R, Weishaupt J, Imbusch C, Brors B. Identifying similar populations across independent single cell studies without data integration. NAR Genom Bioinform 2025;7:lqaf042. [PMID: 40276039 PMCID: PMC12019640 DOI: 10.1093/nargab/lqaf042] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2024] [Revised: 03/13/2025] [Accepted: 03/26/2025] [Indexed: 04/26/2025] Open

Gao S, Li H, Wu Z, Mizumaki H, Kajigaya S, Young NS. GSNCASCR: An R Package to Identify Differentially Co-Expressed Curated Gene Sets with Single-Cell RNA-Seq Data. Int J Mol Sci 2025;26:4771. [PMID: 40429912 PMCID: PMC12112291 DOI: 10.3390/ijms26104771] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2025] [Revised: 05/06/2025] [Accepted: 05/13/2025] [Indexed: 05/29/2025] Open

Abstract

(1) Differential co-expression analysis between two phenotypes with a known gene set helps to uncover gene regulation alterations. (2) GSNCASCR uses CSCORE to estimate the gene pair correlations for network reconstruction and GSNCA to quantify the structure changes of co-expression networks of the predefined gene sets. It also ranks genes based on their "importance" in the weighted network. The method is implemented with free R software (version 0.1.0, available on GitHub), allowing users to analyze their data with the help of demo vignettes included in the package. (3) With analysis of both simulated and real datasets, we demonstrate that the statistical tests performed with GSNCASCR are able to identify differentially co-expressed gene sets with higher precision than tests with Gene Set Co-Expression Analysis (GSCA, version 1.1.1) and Gene Sets Net Correlations Analysis (GSNCA, version 1.42.0). Specifically, GSNCASCR achieved an AUC value of 0.985, while GSNCA and GSCA achieved 0.817 and 0.893, respectively, when positive and negative pathways are defined as having more than 40% and less than 20% co-expressed gene pairs in the simulated data, respectively. Furthermore, across simulated data with varying noise levels, pathway sizes, and positive/negative pathway definitions, GSNCASCR consistently performs best in over 90% of scenarios, as evaluated by AUC values. With an available COVID-19 dataset, we show CD4+ T cell dysfunction in severe COVID-19 as TNF-α/TNF receptor 1-dependent immune pathways. In the weighted network of a gene set of IFN-γ, IFITM3 was identified as a hub gene, which has been evidenced by a genome-wide association study and functional studies. (4) We developed a bioinformatics tool, GSNCASCR, that analyzes differentially co-expressed pathways with single-cell RNA-sequencing data and also evaluates the importance of the genes within pathways. This tool combines the advantages of two algorithms, enabling the quantification and examination of cell type-specific co-expression changes within pathways. The package allows for the analysis of shared and unique disease-affected pathways across different cell types.

Collapse

Fu S, Li WV. Predicting and comparing transcription start sites in single cell populations. PLoS Comput Biol 2025;21:e1012878. [PMID: 40179341 PMCID: PMC11968111 DOI: 10.1371/journal.pcbi.1012878] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 02/15/2025] [Indexed: 04/05/2025] Open

Liang X, Torkel M, Cao Y, Yang JYH. Multi-task benchmarking of spatially resolved gene expression simulation models. Genome Biol 2025;26:57. [PMID: 40098171 PMCID: PMC11912772 DOI: 10.1186/s13059-025-03505-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Accepted: 02/12/2025] [Indexed: 03/19/2025] Open

Song B, Liu D, Dai W, McMyn NF, Wang Q, Yang D, Krejci A, Vasilyev A, Untermoser N, Loregger A, Song D, Williams B, Rosen B, Cheng X, Chao L, Kale HT, Zhang H, Diao Y, Bürckstümmer T, Siliciano JD, Li JJ, Siliciano RF, Huangfu D, Li W. Decoding heterogeneous single-cell perturbation responses. Nat Cell Biol 2025;27:493-504. [PMID: 40011559 PMCID: PMC11906366 DOI: 10.1038/s41556-025-01626-9] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Accepted: 01/20/2025] [Indexed: 02/28/2025]

Affiliation(s)

Bicna Song Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA Department of Genomics and Precision Medicine, George Washington University, Washington, DC, USA
Dingyu Liu Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA Louis V. Gerstner Jr. Graduate School of Biomedical Sciences, Memorial Sloan Kettering Cancer Center, New York City, NY, USA
Weiwei Dai Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Natalie F McMyn Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Qingyang Wang Department of Statistics and Data Science, University of California, Los Angeles, CA, USA
Dapeng Yang Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA
Adam Krejci Myllia Biotechnology GmbH, Vienna, Austria
Anatoly Vasilyev Myllia Biotechnology GmbH, Vienna, Austria
Nicole Untermoser Myllia Biotechnology GmbH, Vienna, Austria
Anke Loregger Myllia Biotechnology GmbH, Vienna, Austria
Dongyuan Song Bioinformatics Interdepartmental PhD Program, University of California, Los Angeles, CA, USA Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT, USA
Breanna Williams Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA
Bess Rosen Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA Weill Cornell Graduate School of Medical Sciences, Weill Cornell Medicine, New York, NY, USA
Xiaolong Cheng Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA Department of Genomics and Precision Medicine, George Washington University, Washington, DC, USA
Lumen Chao Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA Department of Genomics and Precision Medicine, George Washington University, Washington, DC, USA
Hanuman T Kale Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA
Hao Zhang Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Yarui Diao Department of Cell Biology, Duke University Medical Center, Durham, NC, USA
Tilmann Bürckstümmer Myllia Biotechnology GmbH, Vienna, Austria
Janet D Siliciano Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Jingyi Jessica Li Department of Statistics and Data Science, University of California, Los Angeles, CA, USA Bioinformatics Interdepartmental PhD Program, University of California, Los Angeles, CA, USA Department of Human Genetics, University of California, Los Angeles, CA, USA Department of Biostatistics, University of California, Los Angeles, CA, USA Department of Computational Medicine, University of California, Los Angeles, CA, USA
Robert F Siliciano Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, MD, USA Howard Hughes Medical Institute, Johns Hopkins University School of Medicine, Baltimore, MD, USA
Danwei Huangfu Developmental Biology Program, Sloan Kettering Institute, New York City, NY, USA
Wei Li Center for Genetic Medicine Research, Children's National Hospital, Washington, DC, USA. Department of Genomics and Precision Medicine, George Washington University, Washington, DC, USA.

Collapse

Dong S, Cui Z, Liu D, Lei J. scRDiT: Generating Single-cell RNA-seq Data by Diffusion Transformers and Accelerating Sampling. Interdiscip Sci 2025:10.1007/s12539-025-00688-5. [PMID: 39982678 DOI: 10.1007/s12539-025-00688-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2024] [Revised: 01/07/2025] [Accepted: 01/08/2025] [Indexed: 02/22/2025]

Yang J, Grant GR, Brooks TG. Generating Correlated Data for Omics Simulation. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.31.634335. [PMID: 39975030 PMCID: PMC11838456 DOI: 10.1101/2025.01.31.634335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]

Van Hecke M, Beerenwinkel N, Lootens T, Fostier J, Raedt R, Marchal K. ELLIPSIS: robust quantification of splicing in scRNA-seq. Bioinformatics 2025;41:btaf028. [PMID: 39936571 PMCID: PMC11878791 DOI: 10.1093/bioinformatics/btaf028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Revised: 12/09/2024] [Accepted: 02/10/2025] [Indexed: 02/13/2025] Open

Jiang H, Miao X, Thairu MW, Beebe M, Grupe DW, Davidson RJ, Handelsman J, Sankaran K. Multimedia: multimodal mediation analysis of microbiome data. Microbiol Spectr 2025;13:e0113124. [PMID: 39688588 PMCID: PMC11792470 DOI: 10.1128/spectrum.01131-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2024] [Accepted: 10/30/2024] [Indexed: 12/18/2024] Open

Abstract

Mediation analysis has emerged as a versatile tool for answering mechanistic questions in microbiome research because it provides a statistical framework for attributing treatment effects to alternative causal pathways. Using a series of linked regressions, this analysis quantifies how complementary data relate to one another and respond to treatments. Despite these advances, existing software's rigid assumptions often result in users viewing mediation analysis as a black box. We designed the multimedia R package to make advanced mediation analysis techniques accessible, ensuring that statistical components are interpretable and adaptable. The package provides a uniform interface to direct and indirect effect estimation, synthetic null hypothesis testing, bootstrap confidence interval construction, and sensitivity analysis, enabling experimentation with various mediator and outcome models while maintaining a simple overall workflow. The software includes modules for regularized linear, compositional, random forest, hierarchical, and hurdle modeling, making it well-suited to microbiome data. We illustrate the package through two case studies. The first re-analyzes a study of the microbiome and metabolome of Inflammatory Bowel Disease patients, uncovering potential mechanistic interactions between the microbiome and disease-associated metabolites, not found in the original study. The second analyzes new data about the influence of mindfulness practice on the microbiome. The mediation analysis highlights shifts in taxa previously associated with depression that cannot be explained indirectly by diet or sleep behaviors alone. A gallery of examples and further documentation can be found at https://go.wisc.edu/830110.

IMPORTANCE

Microbiome studies routinely gather complementary data to capture different aspects of a microbiome's response to a change, such as the introduction of a therapeutic. Mediation analysis clarifies the extent to which responses occur sequentially via mediators, thereby supporting causal, rather than purely descriptive, interpretation. Multimedia is a modular R package with close ties to the wider microbiome software ecosystem that makes statistically rigorous, flexible mediation analysis easily accessible, setting the stage for precise and causally informed microbiome engineering.

Collapse

Tian R, Yu Z, Xue Z, Wu J, Wu L, Cai S, Gao B, He B, Zhao Y, Yao J, Lu L, Liu W. Evaluation of T Cell Receptor Construction Methods from scRNA-Seq Data. GENOMICS, PROTEOMICS & BIOINFORMATICS 2025;22:qzae086. [PMID: 39666949 PMCID: PMC11846667 DOI: 10.1093/gpbjnl/qzae086] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/08/2023] [Revised: 11/26/2024] [Accepted: 12/09/2024] [Indexed: 12/14/2024]

Affiliation(s)

Ruonan Tian Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China
Zhejian Yu Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China
Ziwei Xue Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China
Jiaxin Wu Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China
Lize Wu Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China Institute of Immunology and Department of Dermatology and Rheumatology at Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China
Shuo Cai Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China
Bing Gao Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China
Bing He AI Lab, Tencent, Shenzhen 518000, China
Yu Zhao AI Lab, Tencent, Shenzhen 518000, China
Jianhua Yao AI Lab, Tencent, Shenzhen 518000, China
Linrong Lu Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China Institute of Immunology and Department of Dermatology and Rheumatology at Sir Run Run Shaw Hospital, Zhejiang University School of Medicine, Hangzhou 310058, China Shanghai Immune Therapy Institute, Shanghai Jiao Tong University School of Medicine Affiliated Renji Hospital, Shanghai 200025, China
Wanlu Liu Department of Rheumatology and Immunology of the Second Affiliated Hospital, and Centre of Biomedical Systems and Informatics of Zhejiang University-University of Edinburgh Institute, Zhejiang University School of Medicine, Hangzhou 310003, China Future Health Laboratory, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing 314100, China

Collapse

Song X, Chavez-Fuentes JC, Ma W, Fu W, Wang P, Yuan GC. sCCIgen: A high-fidelity spatially resolved transcriptomics data simulator for cell-cell interaction studies. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.07.631830. [PMID: 39829773 PMCID: PMC11741276 DOI: 10.1101/2025.01.07.631830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]

Sun F, Li H, Sun D, Fu S, Gu L, Shao X, Wang Q, Dong X, Duan B, Xing F, Wu J, Xiao M, Zhao F, Han JDJ, Liu Q, Fan X, Li C, Wang C, Shi T. Single-cell omics: experimental workflow, data analyses and applications. SCIENCE CHINA. LIFE SCIENCES 2025;68:5-102. [PMID: 39060615 DOI: 10.1007/s11427-023-2561-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Accepted: 04/18/2024] [Indexed: 07/28/2024]

Affiliation(s)

Fengying Sun Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China
Haoyan Li Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
Dongqing Sun Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
Shaliu Fu Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
Lei Gu Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
Xin Shao Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China
Qinqin Wang Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China
Xin Dong Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
Bin Duan Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China
Feiyang Xing Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China
Jun Wu Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China
Minmin Xiao Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China.
Fangqing Zhao Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, 100101, China.
Jing-Dong J Han Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Center for Quantitative Biology (CQB), Peking University, Beijing, 100871, China.
Qi Liu Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China. Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China. Research Institute of Intelligent Computing, Zhejiang Lab, Hangzhou, 311121, China. Shanghai Research Institute for Intelligent Autonomous Systems, Shanghai, 201210, China.
Xiaohui Fan Pharmaceutical Informatics Institute, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China. National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, Jiaxing, 314103, China. Zhejiang Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
Chen Li Center for Single-cell Omics, School of Public Health, Shanghai Jiao Tong University School of Medicine, Shanghai, 200025, China.
Chenfei Wang Key Laboratory of Spine and Spinal Cord Injury Repair and Regeneration (Tongji University), Ministry of Education, Orthopaedic Department, Tongji Hospital, Bioinformatics Department, School of Life Sciences and Technology, Tongji University, Shanghai, 200082, China. Frontier Science Center for Stem Cells, School of Life Sciences and Technology, Tongji University, Shanghai, 200092, China.
Tieliu Shi Department of Clinical Laboratory, the Affiliated Wuhu Hospital of East China Normal University (The Second People's Hospital of Wuhu City), Wuhu, 241000, China. Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, the Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241, China. Key Laboratory of Advanced Theory and Application in Statistics and Data Science-MOE, School of Statistics, East China Normal University, Shanghai, 200062, China.

Collapse

Song D, Chen S, Lee C, Li K, Ge X, Li JJ. Synthetic control removes spurious discoveries from double dipping in single-cell and spatial transcriptomics data analyses. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.21.550107. [PMID: 37546812 PMCID: PMC10401959 DOI: 10.1101/2023.07.21.550107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/08/2023]

Sankaran K, Kodikara S, Li JJ, Cao KAL. Semisynthetic simulation for microbiome data analysis. Brief Bioinform 2024;26:bbaf051. [PMID: 39927858 PMCID: PMC11808806 DOI: 10.1093/bib/bbaf051] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2024] [Revised: 12/19/2024] [Accepted: 01/23/2025] [Indexed: 02/11/2025] Open

Shan X, Zhao H. Inferring Cell-Type-Specific Co-Expressed Genes from Single Cell Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.08.622700. [PMID: 39605403 PMCID: PMC11601408 DOI: 10.1101/2024.11.08.622700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]

Jiang H, Miao X, Thairu MW, Beebe M, Grupe DW, Davidson RJ, Handelsman J, Sankaran K. multimedia: Multimodal Mediation Analysis of Microbiome Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.27.587024. [PMID: 38585817 PMCID: PMC10996591 DOI: 10.1101/2024.03.27.587024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/09/2024]

Subedi S, Sumida TS, Park YP. A scalable approach to topic modelling in single-cell data by approximate pseudobulk projection. Life Sci Alliance 2024;7:e202402713. [PMID: 39107066 PMCID: PMC11303850 DOI: 10.26508/lsa.202402713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 07/29/2024] [Accepted: 07/30/2024] [Indexed: 08/09/2024] Open

Stomma P, Rudnicki WR. HCS-hierarchical algorithm for simulation of omics datasets. Bioinformatics 2024;40:ii98-ii104. [PMID: 39230692 PMCID: PMC11373347 DOI: 10.1093/bioinformatics/btae392] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024] Open

Zhang J, Larschan E, Bigness J, Singh R. scNODE : generative model for temporal single cell transcriptomic data prediction. Bioinformatics 2024;40:ii146-ii154. [PMID: 39230694 PMCID: PMC11373355 DOI: 10.1093/bioinformatics/btae393] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/05/2024] Open

Chen Z, Wang C, Huang S, Shi Y, Xi R. Directly selecting cell-type marker genes for single-cell clustering analyses. CELL REPORTS METHODS 2024;4:100810. [PMID: 38981475 PMCID: PMC11294843 DOI: 10.1016/j.crmeth.2024.100810] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 03/16/2024] [Accepted: 06/12/2024] [Indexed: 07/11/2024]

Sarkar H, Chitra U, Gold J, Raphael BJ. A count-based model for delineating cell-cell interactions in spatial transcriptomics data. Bioinformatics 2024;40:i481-i489. [PMID: 38940134 PMCID: PMC11211854 DOI: 10.1093/bioinformatics/btae219] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open

Qian J, Bao H, Shao X, Fang Y, Liao J, Chen Z, Li C, Guo W, Hu Y, Li A, Yao Y, Fan X, Cheng Y. Simulating multiple variability in spatially resolved transcriptomics with scCube. Nat Commun 2024;15:5021. [PMID: 38866768 PMCID: PMC11169532 DOI: 10.1038/s41467-024-49445-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 06/03/2024] [Indexed: 06/14/2024] Open

Affiliation(s)

Jingyang Qian College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Hudong Bao College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China
Xin Shao College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Yin Fang College of Computer Science and Technology, Zhejiang University, Hangzhou, 310013, China
Jie Liao College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Zhuo Chen College of Computer Science and Technology, Zhejiang University, Hangzhou, 310013, China
Chengyu Li College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Wenbo Guo College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Yining Hu College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Anyao Li College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Yue Yao College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China
Xiaohui Fan College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China. National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China. Zhejiang Key Laboratory of Precision Diagnosis and Therapy for Major Gynecological Diseases, Women's Hospital, Zhejiang University School of Medicine, Hangzhou, 310006, China.
Yiyu Cheng College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, China. National Key Laboratory of Chinese Medicine Modernization, Innovation Center of Yangtze River Delta, Zhejiang University, 314100, Jiaxing, China.

Collapse

Wang W, Cen Y, Lu Z, Xu Y, Sun T, Xiao Y, Liu W, Li JJ, Wang C. scCDC: a computational method for gene-specific contamination detection and correction in single-cell and single-nucleus RNA-seq data. Genome Biol 2024;25:136. [PMID: 38783325 PMCID: PMC11112958 DOI: 10.1186/s13059-024-03284-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2023] [Accepted: 05/16/2024] [Indexed: 05/25/2024] Open

Kim H, Chang W, Chae SJ, Park JE, Seo M, Kim JK. scLENS: data-driven signal detection for unbiased scRNA-seq data analysis. Nat Commun 2024;15:3575. [PMID: 38678050 PMCID: PMC11519519 DOI: 10.1038/s41467-024-47884-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Accepted: 04/14/2024] [Indexed: 04/29/2024] Open

Peng M, Lin B, Zhang J, Zhou Y, Lin B. scFSNN: a feature selection method based on neural network for single-cell RNA-seq data. BMC Genomics 2024;25:264. [PMID: 38459442 PMCID: PMC10924397 DOI: 10.1186/s12864-024-10160-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Accepted: 02/25/2024] [Indexed: 03/10/2024] Open

Singhal V, Chou N, Lee J, Yue Y, Liu J, Chock WK, Lin L, Chang YC, Teo EML, Aow J, Lee HK, Chen KH, Prabhakar S. BANKSY unifies cell typing and tissue domain segmentation for scalable spatial omics data analysis. Nat Genet 2024;56:431-441. [PMID: 38413725 PMCID: PMC10937399 DOI: 10.1038/s41588-024-01664-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2023] [Accepted: 01/16/2024] [Indexed: 02/29/2024]

Affiliation(s)

Vipul Singhal Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Nigel Chou Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Joseph Lee Faculty of Science, National University of Singapore, Singapore, Republic of Singapore
Yifei Yue Department of Chemical and Biomolecular Engineering, National University of Singapore, Singapore, Republic of Singapore
Jinyue Liu Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Wan Kee Chock Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Li Lin Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Yun-Ching Chang Veranome Biosystems, Mountain View, CA, USA
Erica Mei Ling Teo Veranome Biosystems, Mountain View, CA, USA
Jonathan Aow Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore
Hwee Kuan Lee Bioinformatics Institute (BII), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore School of Computing, National University of Singapore, Singapore, Republic of Singapore Singapore Eye Research Institute, Singapore, Republic of Singapore International Research Laboratory on Artificial Intelligence, Singapore, Republic of Singapore School of Biological Sciences, Nanyang Technological University, Singapore, Republic of Singapore Singapore Institute for Clinical Sciences, Agency for Science, Technology and Research, Singapore, Republic of Singapore
Kok Hao Chen Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore.
Shyam Prabhakar Spatial and Single Cell Systems Domain, Genome Institute of Singapore (GIS), Agency for Science, Technology and Research (A*STAR), Singapore, Republic of Singapore. Population and Global Health, Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore, Republic of Singapore. Cancer Science Institute of Singapore, National University of Singapore, Singapore, Republic of Singapore.

Collapse

Song D, Wang Q, Yan G, Liu T, Sun T, Li JJ. scDesign3 generates realistic in silico data for multimodal single-cell and spatial omics. Nat Biotechnol 2024;42:247-252. [PMID: 37169966 PMCID: PMC11182337 DOI: 10.1038/s41587-023-01772-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2022] [Accepted: 03/30/2023] [Indexed: 05/13/2023]

Su C, Zhang J, Zhao H. Estimating cell-type-specific gene co-expression networks from bulk gene expression data with an application to Alzheimer's disease. J Am Stat Assoc 2024;119:811-824. [PMID: 39280354 PMCID: PMC11394578 DOI: 10.1080/01621459.2023.2297467] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 11/20/2023] [Accepted: 12/13/2023] [Indexed: 09/18/2024]

Yang Y, Wang K, Lu Z, Wang T, Wang X. Cytomulate: accurate and efficient simulation of CyTOF data. Genome Biol 2023;24:262. [PMID: 37974276 PMCID: PMC10652542 DOI: 10.1186/s13059-023-03099-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Accepted: 10/24/2023] [Indexed: 11/19/2023] Open

Liu J, Kreimer A, Li WV. Differential variability analysis of single-cell gene expression data. Brief Bioinform 2023;24:bbad294. [PMID: 37598422 PMCID: PMC10516347 DOI: 10.1093/bib/bbad294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2023] [Revised: 07/18/2023] [Accepted: 07/29/2023] [Indexed: 08/22/2023] Open

Li H, Zhang Z, Squires M, Chen X, Zhang X. scMultiSim: simulation of single cell multi-omics and spatial data guided by gene regulatory networks and cell-cell interactions. RESEARCH SQUARE 2023:rs.3.rs-3301625. [PMID: 37790516 PMCID: PMC10543280 DOI: 10.21203/rs.3.rs-3301625/v1] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/05/2023]

Ma Y, Deng C, Zhou Y, Zhang Y, Qiu F, Jiang D, Zheng G, Li J, Shuai J, Zhang Y, Yang J, Su J. Polygenic regression uncovers trait-relevant cellular contexts through pathway activation transformation of single-cell RNA sequencing data. CELL GENOMICS 2023;3:100383. [PMID: 37719150 PMCID: PMC10504677 DOI: 10.1016/j.xgen.2023.100383] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Revised: 05/26/2023] [Accepted: 07/25/2023] [Indexed: 09/19/2023]

Affiliation(s)

Yunlong Ma School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang 325101, China
Chunyu Deng School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150080, China
Yijun Zhou School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang 325101, China
Yaru Zhang School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang 325101, China
Fei Qiu School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
Dingping Jiang School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
Gongwei Zheng School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
Jingjing Li School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
Jianwei Shuai Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang 325101, China
Yan Zhang School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang 150080, China
Jian Yang School of Life Sciences, Westlake University, Hangzhou, Zhejiang 310012, China Westlake Laboratory of Life Sciences and Biomedicine, Hangzhou, Zhejiang 310024, China
Jianzhong Su School of Biomedical Engineering, School of OphthalmoFlogy & Optometry and Eye Hospital, Wenzhou Medical University, Wenzhou, Zhejiang 325027, China Oujiang Laboratory, Zhejiang Lab for Regenerative Medicine, Vision and Brain Health, Wenzhou, Zhejiang 325101, China

Collapse

He X, Qian K, Wang Z, Zeng S, Li H, Li WV. scAce: an adaptive embedding and clustering method for single-cell gene expression data. Bioinformatics 2023;39:btad546. [PMID: 37672035 PMCID: PMC10500084 DOI: 10.1093/bioinformatics/btad546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 08/01/2023] [Accepted: 09/05/2023] [Indexed: 09/07/2023] Open

Song D, Li K, Ge X, Li JJ. ClusterDE: a post-clustering differential expression (DE) method robust to false-positive inflation caused by double dipping. RESEARCH SQUARE 2023:rs.3.rs-3211191. [PMID: 37577698 PMCID: PMC10418557 DOI: 10.21203/rs.3.rs-3211191/v1] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/15/2023]

Li C, Chen X, Chen S, Jiang R, Zhang X. simCAS: an embedding-based method for simulating single-cell chromatin accessibility sequencing data. Bioinformatics 2023;39:btad453. [PMID: 37494428 PMCID: PMC10394124 DOI: 10.1093/bioinformatics/btad453] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2023] [Revised: 06/25/2023] [Accepted: 07/25/2023] [Indexed: 07/28/2023] Open

Mohammad-Taheri S, Tewari V, Kapre R, Rahiminasab E, Sachs K, Tapley Hoyt C, Zucker J, Vitek O. Optimal adjustment sets for causal query estimation in partially observed biomolecular networks. Bioinformatics 2023;39:i494-i503. [PMID: 37387179 PMCID: PMC10311316 DOI: 10.1093/bioinformatics/btad270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023] Open

Lu S, Keleş S. Debiased personalized gene coexpression networks for population-scale scRNA-seq data. Genome Res 2023;33:932-947. [PMID: 37295843 PMCID: PMC10519377 DOI: 10.1101/gr.277363.122] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 06/07/2023] [Indexed: 06/12/2023]

Abstract

Population-scale single-cell RNA-seq (scRNA-seq) data sets create unique opportunities for quantifying expression variation across individuals at the gene coexpression network level. Estimation of coexpression networks is well established for bulk RNA-seq; however, single-cell measurements pose novel challenges owing to technical limitations and noise levels of this technology. Gene-gene correlation estimates from scRNA-seq tend to be severely biased toward zero for genes with low and sparse expression. Here, we present Dozer to debias gene-gene correlation estimates from scRNA-seq data sets and accurately quantify network-level variation across individuals. Dozer corrects correlation estimates in the general Poisson measurement model and provides a metric to quantify genes measured with high noise. Computational experiments establish that Dozer estimates are robust to mean expression levels of the genes and the sequencing depths of the data sets. Compared with alternatives, Dozer results in fewer false-positive edges in the coexpression networks, yields more accurate estimates of network centrality measures and modules, and improves the faithfulness of networks estimated from separate batches of the data sets. We showcase unique analyses enabled by Dozer in two population-scale scRNA-seq applications. Coexpression network-based centrality analysis of multiple differentiating human induced pluripotent stem cell (iPSC) lines yields biologically coherent gene groups that are associated with iPSC differentiation efficiency. Application with population-scale scRNA-seq of oligodendrocytes from postmortem human tissues of Alzheimer's disease and controls uniquely reveals coexpression modules of innate immune response with distinct coexpression levels between the diagnoses. Dozer represents an important advance in estimating personalized coexpression networks from scRNA-seq data.

Collapse

Lu S, Keleş S. Dozer: Debiased personalized gene co-expression networks for population-scale scRNA-seq data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.04.25.538290. [PMID: 37163070 PMCID: PMC10168282 DOI: 10.1101/2023.04.25.538290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/11/2023]

Abstract

Population-scale single cell RNA-seq (scRNA-seq) datasets create unique opportunities for quantifying expression variation across individuals at the gene co-expression network level. Estimation of co-expression networks is well-established for bulk RNA-seq; however, single-cell measurements pose novel challenges due to technical limitations and noise levels of this technology. Gene-gene correlation estimates from scRNA-seq tend to be severely biased towards zero for genes with low and sparse expression. Here, we present Dozer to debias gene-gene correlation estimates from scRNA-seq datasets and accurately quantify network level variation across individuals. Dozer corrects correlation estimates in the general Poisson measurement model and provides a metric to quantify genes measured with high noise. Computational experiments establish that Dozer estimates are robust to mean expression levels of the genes and the sequencing depths of the datasets. Compared to alternatives, Dozer results in fewer false positive edges in the co-expression networks, yields more accurate estimates of network centrality measures and modules, and improves the faithfulness of networks estimated from separate batches of the datasets. We showcase unique analyses enabled by Dozer in two population-scale scRNA-seq applications. Co-expression network-based centrality analysis of multiple differentiating human induced pluripotent stem cell (iPSC) lines yields biologically coherent gene groups that are associated with iPSC differentiation efficiency. Application with population-scale scRNA-seq of oligodendrocytes from postmortem human tissues of Alzheimer disease and controls uniquely reveals co-expression modules of innate immune response with markedly different co-expression levels between the diagnoses. Dozer represents an important advance in estimating personalized co-expression networks from scRNA-seq data.

Collapse

Crowell HL, Morillo Leonardo SX, Soneson C, Robinson MD. The shaky foundations of simulating single-cell RNA sequencing data. Genome Biol 2023;24:62. [PMID: 36991470 PMCID: PMC10061781 DOI: 10.1186/s13059-023-02904-1] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 03/20/2023] [Indexed: 03/31/2023] Open

Li H, Zhang Z, Squires M, Chen X, Zhang X. scMultiSim: simulation of multi-modality single cell data guided by cell-cell interactions and gene regulatory networks. RESEARCH SQUARE 2023:rs.3.rs-2675530. [PMID: 36993284 PMCID: PMC10055660 DOI: 10.21203/rs.3.rs-2675530/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]

De Falco A, Caruso F, Su XD, Iavarone A, Ceccarelli M. A variational algorithm to detect the clonal copy number substructure of tumors from scRNA-seq data. Nat Commun 2023;14:1074. [PMID: 36841879 PMCID: PMC9968345 DOI: 10.1038/s41467-023-36790-9] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 02/16/2023] [Indexed: 02/27/2023] Open

Sun T, Song D, Li W, Li J. Author Correction: scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. Genome Biol 2023;24:32. [PMID: 36814256 PMCID: PMC9945685 DOI: 10.1186/s13059-023-02884-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/24/2023] Open

Sun L, Wang G, Zhang Z. SimCH: simulation of single-cell RNA sequencing data by modeling cellular heterogeneity at gene expression level. Brief Bioinform 2023;24:6961608. [PMID: 36575569 DOI: 10.1093/bib/bbac590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Revised: 11/08/2022] [Accepted: 12/02/2022] [Indexed: 12/29/2022] Open

Su C, Xu Z, Shan X, Cai B, Zhao H, Zhang J. Cell-type-specific co-expression inference from single cell RNA-sequencing data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2022:2022.12.13.520181. [PMID: 36561173 DOI: 10.1101/2022.04.07.487499] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/26/2023]

Abstract

The inference of gene co-expressions from microarray and RNA-sequencing data has led to rich insights on biological processes and disease mechanisms. However, the bulk samples analyzed in most studies are a mixture of different cell types. As a result, the inferred co-expressions are confounded by varying cell type compositions across samples and only offer an aggregated view of gene regulations that may be distinct across different cell types. The advancement of single cell RNA-sequencing (scRNA-seq) technology has enabled the direct inference of co-expressions in specific cell types, facilitating our understanding of cell-type-specific biological functions. However, the high sequencing depth variations and measurement errors in scRNA-seq data present significant challenges in inferring cell-type-specific gene co-expressions, and these issues have not been adequately addressed in the existing methods. We propose a statistical approach, CS-CORE, for estimating and testing cell-type-specific co-expressions, built on a general expression-measurement model that explicitly accounts for sequencing depth variations and measurement errors in the observed single cell data. Systematic evaluations show that most existing methods suffer from inflated false positives and biased co-expression estimates and clustering analysis, whereas CS-CORE has appropriate false positive control, unbiased co-expression estimates, good statistical power and satisfactory performance in downstream co-expression analysis. When applied to analyze scRNA-seq data from postmortem brain samples from Alzheimerâ€™s disease patients and controls and blood samples from COVID-19 patients and controls, CS-CORE identified cell-type-specific co-expressions and differential co-expressions that were more reproducible and/or more enriched for relevant biological pathways than those inferred from other methods.

Collapse

Regan C, Preall J. Practical Considerations for Single-Cell Genomics. Curr Protoc 2022;2:e498. [PMID: 35926125 PMCID: PMC9479272 DOI: 10.1002/cpz1.498] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]

Wu G, Li Y. Distinct characteristics of correlation analysis at the single-cell and the population level. Stat Appl Genet Mol Biol 2022;21:sagmb-2022-0015. [PMID: 35918809 DOI: 10.1515/sagmb-2022-0015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 06/13/2022] [Indexed: 11/15/2022]

Abstract

Correlation analysis is widely used in biological studies to infer molecular relationships within biological networks. Recently, single-cell analysis has drawn tremendous interests, for its ability to obtain high-resolution molecular phenotypes. It turns out that there is little overlap of co-expressed genes identified in single-cell level investigations with that of population level investigations. However, the nature of the relationship of correlations between single-cell and population levels remains unclear. In this manuscript, we aimed to unveil the origin of the differences between the correlation coefficients at the single-cell level and that at the population level, and bridge the gap between them. Through developing formulations to link correlations at the single-cell and the population level, we illustrated that aggregated correlations could be stronger, weaker or equal to the corresponding individual correlations, depending on the variations and the correlations within the population. When the correlation within the population is weaker than the individual correlation, the aggregated correlation is stronger than the corresponding individual correlation. Besides, our data indicated that aggregated correlation is more likely to be stronger than the corresponding individual correlation, and it was rare to find gene-pairs exclusively strongly correlated at the single-cell level. Through a bottom-up approach to model interactions between molecules in a signaling cascade or a multi-regulator-controlled gene expression, we surprisingly found that the existence of interaction between two components could not be excluded simply based on their low correlation coefficients, suggesting a reconsideration of connectivity within biological networks which was derived solely from correlation analysis. We also investigated the impact of technical random measurement errors on the correlation coefficients for the single-cell level and the population level. The results indicate that the aggregated correlation is relatively robust and less affected. Because of the heterogeneity among single cells, correlation coefficients calculated based on data of the single-cell level might be different from that of the population level. Depending on the specific question we are asking, proper sampling and normalization procedure should be done before we draw any conclusions.

Collapse

Cao Y, Yang P, Yang JYH. A benchmark study of simulation methods for single-cell RNA sequencing data. Nat Commun 2021;12:6911. [PMID: 34824223 PMCID: PMC8617278 DOI: 10.1038/s41467-021-27130-w] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2021] [Accepted: 10/26/2021] [Indexed: 11/09/2022] Open

Ge X, Chen YE, Song D, McDermott M, Woyshner K, Manousopoulou A, Wang N, Li W, Wang LD, Li JJ. Clipper: p-value-free FDR control on high-throughput data from two conditions. Genome Biol 2021;22:288. [PMID: 34635147 PMCID: PMC8504070 DOI: 10.1186/s13059-021-02506-9] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2021] [Accepted: 09/21/2021] [Indexed: 12/12/2022] Open

Song D, Li K, Hemminger Z, Wollman R, Li JJ. scPNMF: sparse gene encoding of single cells to facilitate gene selection for targeted gene profiling. Bioinformatics 2021;37:i358-i366. [PMID: 34252925 PMCID: PMC8275345 DOI: 10.1093/bioinformatics/btab273] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open

Abstract

Motivation

Single-cell RNA sequencing (scRNA-seq) captures whole transcriptome information of individual cells. While scRNA-seq measures thousands of genes, researchers are often interested in only dozens to hundreds of genes for a closer study. Then, a question is how to select those informative genes from scRNA-seq data. Moreover, single-cell targeted gene profiling technologies are gaining popularity for their low costs, high sensitivity and extra (e.g. spatial) information; however, they typically can only measure up to a few hundred genes. Then another challenging question is how to select genes for targeted gene profiling based on existing scRNA-seq data.

Results

Here, we develop the single-cell Projective Non-negative Matrix Factorization (scPNMF) method to select informative genes from scRNA-seq data in an unsupervised way. Compared with existing gene selection methods, scPNMF has two advantages. First, its selected informative genes can better distinguish cell types. Second, it enables the alignment of new targeted gene profiling data with reference data in a low-dimensional space to facilitate the prediction of cell types in the new data. Technically, scPNMF modifies the PNMF algorithm for gene selection by changing the initialization and adding a basis selection step, which selects informative bases to distinguish cell types. We demonstrate that scPNMF outperforms the state-of-the-art gene selection methods on diverse scRNA-seq datasets. Moreover, we show that scPNMF can guide the design of targeted gene profiling experiments and the cell-type annotation on targeted gene profiling data.

Availability and implementation

The R package is open-access and available at https://github.com/JSB-UCLA/scPNMF. The data used in this work are available at Zenodo: https://doi.org/10.5281/zenodo.4797997.

Supplementary information

Supplementary data are available at Bioinformatics online.

Collapse

Sun T, Song D, Li WV, Li JJ. Publisher Correction: scDesign2: a transparent simulator that generates high-fidelity single-cell gene expression count data with gene correlations captured. Genome Biol 2021;22:177. [PMID: 34108038 PMCID: PMC8191178 DOI: 10.1186/s13059-021-02394-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open