Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Seninge L, Anastopoulos I, Ding H, Stuart J. VEGA is an interpretable generative model for inferring biological network activity in single-cell transcriptomics. Nat Commun 2021;12:5684. [PMID: 34584103 PMCID: PMC8478947 DOI: 10.1038/s41467-021-26017-0] [Citation(s) in RCA: 38] [Impact Index Per Article: 9.5] [Received: 01/11/2021] [Accepted: 09/13/2021] [Indexed: 02/03/2023] Open

Cited by Other Article(s)

Selby DA, Sprang M, Ewald J, Vollmer SJ. Beyond the black box with biologically informed neural networks. Nat Rev Genet 2025;26:371-372. [PMID: 40038452 DOI: 10.1038/s41576-025-00826-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 03/06/2025]

Jiang Y, Immadi MS, Wang D, Zeng S, On Chan Y, Zhou J, Xu D, Joshi T. IRnet: Immunotherapy response prediction using pathway knowledge-informed graph neural network. J Adv Res 2025;72:319-331. [PMID: 39097091 DOI: 10.1016/j.jare.2024.07.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/26/2024] [Revised: 07/10/2024] [Accepted: 07/30/2024] [Indexed: 08/05/2024] Open

Affiliation(s)

Yuexu Jiang Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA
Manish Sridhar Immadi Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA
Duolin Wang Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA
Shuai Zeng Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA
Yen On Chan Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, USA
Jing Zhou Department of Surgery, University of Missouri-Columbia, Columbia, MO, USA
Dong Xu Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA; MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, USA
Trupti Joshi Department of Electrical Engineering and Computer Science, University of Missouri-Columbia, Columbia, MO, USA; Christopher S. Bond Life Sciences Center, University of Missouri-Columbia, Columbia, MO, USA; MU Institute for Data Science and Informatics, University of Missouri-Columbia, Columbia, MO, USA; Department of Biomedical Informatics, Biostatistics and Medical Epidemiology, University of Missouri-Columbia, Columbia, MO, USA.

He Y, Li S, Lan H, Long W, Zhai S, Li M, Wen Z. A Transfer Learning Framework for Predicting and Interpreting Drug Responses via Single-Cell RNA-Seq Data. Int J Mol Sci 2025;26:4365. [PMID: 40362602 PMCID: PMC12072357 DOI: 10.3390/ijms26094365] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/14/2025] [Revised: 04/29/2025] [Accepted: 05/02/2025] [Indexed: 05/15/2025] Open

Wang J, Ye F, Chai H, Jiang Y, Wang T, Ran X, Xia Q, Xu Z, Fu Y, Zhang G, Wu H, Guo G, Guo H, Ruan Y, Wang Y, Xing D, Xu X, Zhang Z. Advances and applications in single-cell and spatial genomics. Sci China Life Sci 2025;68:1226-1282. [PMID: 39792333 DOI: 10.1007/s11427-024-2770-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/20/2024] [Accepted: 10/10/2024] [Indexed: 01/12/2025]

Affiliation(s)

Jingjing Wang Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
Fang Ye Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
Haoxi Chai Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China
Yujia Jiang BGI Research, Shenzhen, 518083, China; BGI Research, Hangzhou, 310030, China
Teng Wang Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China; Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, China
Xia Ran Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China; Institute of Hematology, Zhejiang University, Hangzhou, 310000, China
Qimin Xia Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China
Ziye Xu Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China
Yuting Fu Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China; Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
Guodong Zhang Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China; Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
Hanyu Wu Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China; Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China
Guoji Guo Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China. Center for Stem Cell and Regenerative Medicine, Zhejiang University School of Medicine, Hangzhou, 310058, China. Zhejiang Provincial Key Lab for Tissue Engineering and Regenerative Medicine, Dr. Li Dak Sum & Yip Yio Chin Center for Stem Cell and Regenerative Medicine, Hangzhou, 310058, China. Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
Hongshan Guo Bone Marrow Transplantation Center of the First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China. Institute of Hematology, Zhejiang University, Hangzhou, 310000, China.
Yijun Ruan Life Sciences Institute and The Second Affiliated Hospital, Zhejiang University, Hangzhou, 310058, China.
Yongcheng Wang Department of Laboratory Medicine of The First Affiliated Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 310058, China.
Dong Xing Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China. Beijing Advanced Innovation Center for Genomics (ICG), Peking University, Beijing, 100871, China.
Xun Xu BGI Research, Shenzhen, 518083, China. BGI Research, Hangzhou, 310030, China. Guangdong Provincial Key Laboratory of Genome Read and Write, BGI Research, Shenzhen, 518083, China.
Zemin Zhang Biomedical Pioneering Innovation Center (BIOPIC) and School of Life Sciences, Peking University, Beijing, 100871, China.

Birk S, Bonafonte-Pardàs I, Feriz AM, Boxall A, Agirre E, Memi F, Maguza A, Yadav A, Armingol E, Fan R, Castelo-Branco G, Theis FJ, Bayraktar OA, Talavera-López C, Lotfollahi M. Quantitative characterization of cell niches in spatially resolved omics data. Nat Genet 2025;57:897-909. [PMID: 40102688 PMCID: PMC11985353 DOI: 10.1038/s41588-025-02120-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/21/2024] [Accepted: 02/05/2025] [Indexed: 03/20/2025]

Affiliation(s)

Sebastian Birk Institute of AI for Health, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Computation, Information and Technology, Technical University of Munich, Munich, Germany; Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany; Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Irene Bonafonte-Pardàs Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany; Biomedical Center (BMC), Physiological Chemistry, Faculty of Medicine, Ludwig Maximilian University of Munich, Planegg-Martinsried, Germany
Adib Miraki Feriz Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Adam Boxall Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Eneritz Agirre Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden
Fani Memi Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Anna Maguza Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany; Faculty of Medicine, University of Würzburg, Würzburg, Germany
Anamika Yadav Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Erick Armingol Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Rong Fan Department of Biomedical Engineering, Yale University, New Haven, CT, USA; Yale Stem Cell Center and Yale Cancer Center, Yale University School of Medicine, New Haven, CT, USA; Department of Pathology, Yale University School of Medicine, New Haven, CT, USA; Human and Translational Immunology Program, Yale University School of Medicine, New Haven, CT, USA
Gonçalo Castelo-Branco Laboratory of Molecular Neurobiology, Department of Medical Biochemistry and Biophysics, Karolinska Institutet, Stockholm, Sweden; Ming Wai Lau Centre for Reparative Medicine, Stockholm Node, Karolinska Institutet, Stockholm, Sweden
Fabian J Theis School of Computation, Information and Technology, Technical University of Munich, Munich, Germany; Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK; Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany; School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Omer Ali Bayraktar Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
Carlos Talavera-López Würzburg Institute of Systems Immunology (WüSI), University of Würzburg, Würzburg, Germany. Faculty of Medicine, University of Würzburg, Würzburg, Germany.
Mohammad Lotfollahi Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK. Institute of Computational Biology, Helmholtz Center Munich-German Research Center for Environmental Health, Neuherberg, Germany.

Sadria M, Layton A. scVAEDer: integrating deep diffusion models and variational autoencoders for single-cell transcriptomics analysis. Genome Biol 2025;26:64. [PMID: 40119479 PMCID: PMC11927372 DOI: 10.1186/s13059-025-03519-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/27/2023] [Accepted: 02/27/2025] [Indexed: 03/24/2025] Open

Thapa K, Kinali M, Pei S, Luna A, Babur Ö. Strategies to include prior knowledge in omics analysis with deep neural networks. Patterns (N Y) 2025;6:101203. [PMID: 40182174 PMCID: PMC11963003 DOI: 10.1016/j.patter.2025.101203] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 04/05/2025]

Monzó C, Aguerralde-Martin M, Martínez-Mira C, Arzalluz-Luque Á, Conesa A, Tarazona S. MOSim: bulk and single-cell multilayer regulatory network simulator. Brief Bioinform 2025;26:bbaf110. [PMID: 40116657 PMCID: PMC11926980 DOI: 10.1093/bib/bbaf110] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/05/2024] [Revised: 02/13/2025] [Accepted: 02/21/2025] [Indexed: 03/23/2025] Open

Rodov A, Baniadam H, Zeiser R, Amit I, Yosef N, Wertheimer T, Ingelfinger F. Towards the Next Generation of Data-Driven Therapeutics Using Spatially Resolved Single-Cell Technologies and Generative AI. Eur J Immunol 2025;55:e202451234. [PMID: 39964048 PMCID: PMC11834372 DOI: 10.1002/eji.202451234] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/26/2024] [Revised: 01/28/2025] [Accepted: 02/03/2025] [Indexed: 02/21/2025]

Davidson NR, Zhang F, Greene CS. BuDDI: Bulk Deconvolution with Domain Invariance to predict cell-type-specific perturbations from bulk. PLoS Comput Biol 2025;21:e1012742. [PMID: 39823522 PMCID: PMC11790236 DOI: 10.1371/journal.pcbi.1012742] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/20/2024] [Revised: 02/03/2025] [Accepted: 12/20/2024] [Indexed: 01/19/2025] Open

Abstract

While single-cell experiments provide deep cellular resolution within a single sample, some single-cell experiments are inherently more challenging than bulk experiments due to dissociation difficulties, cost, or limited tissue availability. This creates a situation where we have deep cellular profiles of one sample or condition, and bulk profiles across multiple samples and conditions. To bridge this gap, we propose BuDDI (BUlk Deconvolution with Domain Invariance). BuDDI utilizes domain adaptation techniques to effectively integrate available corpora of case-control bulk and reference scRNA-seq observations to infer cell-type-specific perturbation effects. BuDDI achieves this by learning independent latent spaces within a single variational autoencoder (VAE) encompassing at least four sources of variability: 1) cell type proportion, 2) perturbation effect, 3) structured experimental variability, and 4) remaining variability. Since each latent space is encouraged to be independent, we simulate perturbation responses by independently composing each latent space to simulate cell-type-specific perturbation responses. We evaluated BuDDI's performance on simulated and real data with experimental designs of increasing complexity. We first validated that BuDDI could learn domain invariant latent spaces on data with matched samples across each source of variability. Then we validated that BuDDI could accurately predict cell-type-specific perturbation response when no single-cell perturbed profiles were used during training; instead, only bulk samples had both perturbed and non-perturbed observations. Finally, we validated BuDDI on predicting sex-specific differences, an experimental design where it is not possible to have matched samples. In each experiment, BuDDI outperformed all other comparative methods and baselines. 
As more reference atlases are completed, BuDDI provides a path to combine these resources with bulk-profiled treatment or disease signatures to study perturbations, sex differences, or other factors at single-cell resolution.

Gomez C, Uhrig L, Frouin V, Duchesnay E, Jarraya B, Grigis A. Deep learning models reveal the link between dynamic brain connectivity patterns and states of consciousness. Sci Rep 2024;14:31606. [PMID: 39738114 PMCID: PMC11686193 DOI: 10.1038/s41598-024-76695-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/08/2024] [Accepted: 10/16/2024] [Indexed: 01/01/2025] Open

Abstract

Decoding states of consciousness from brain activity is a central challenge in neuroscience. Dynamic functional connectivity (dFC) allows the study of short-term temporal changes in functional connectivity (FC) between distributed brain areas. By clustering dFC matrices from resting-state fMRI, we previously described "brain patterns" that underlie different functional configurations of the brain at rest. The networks associated with these patterns have been extensively analyzed. However, the overall dynamic organization and how it relates to consciousness remain unclear. We hypothesized that deep learning networks would help to model this relationship. Recent studies have used low-dimensional variational autoencoders (VAE) to learn meaningful representations that can help explain consciousness. Here, we investigated the complexity of selecting such a generative model to study brain dynamics, and extended the available methods for latent space characterization and modeling. Therefore, our contributions are threefold. First, compared with probabilistic principal component analysis and sparse VAE, we showed that the selected low-dimensional VAE exhibits balanced performance in reconstructing dFCs and classifying brain patterns. We then explored the organization of the obtained low-dimensional dFC latent representations. We showed how these representations stratify the dynamic organization of the brain patterns as well as the experimental conditions. Finally, we proposed to delve into the proposed brain computational model. We first applied a receptive field analysis to identify preferred directions in the latent space to move from one brain pattern to another. Then, an ablation study was performed in which we virtually inactivated specific brain areas. We demonstrated the model's efficiency in summarizing consciousness-specific information encoded in key inter-areal connections, as described in the global neuronal workspace theory of consciousness.
The proposed framework advocates the possibility of developing an interpretable computational brain model of interest for disorders of consciousness, paving the way for a dynamic diagnostic support tool.

Gavriilidis GI, Vasileiou V, Orfanou A, Ishaque N, Psomopoulos F. A mini-review on perturbation modelling across single-cell omic modalities. Comput Struct Biotechnol J 2024;23:1886-1896. [PMID: 38721585 PMCID: PMC11076269 DOI: 10.1016/j.csbj.2024.04.058] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Received: 02/01/2024] [Revised: 04/23/2024] [Accepted: 04/23/2024] [Indexed: 01/06/2025] Open

Abstract

Recent advances in single-cell omics technology have transformed the landscape of cellular and molecular research, enriching the scope and intricacy of cellular characterisation. Perturbation modelling seeks to comprehensively grasp the effects of external influences like disease onset or molecular knock-outs or external stimulants on cellular physiology, specifically on transcription factors, signal transducers, biological pathways, and dynamic cell states. Machine and deep learning tools transform complex perturbational phenomena into algorithmically tractable tasks to formulate predictions based on various types of single-cell datasets. However, the recent surge in tools and datasets makes it challenging for experimental biologists and computational scientists to keep track of the recent advances in this rapidly expanding field of single-cell modelling. Here, we recapitulate the main objectives of perturbation modelling and summarise novel single-cell perturbation technologies based on genetic manipulation like CRISPR or compounds, spanning across omic modalities. We then concisely review a burgeoning group of computational methods extending from classical statistical inference methodologies to various machine and deep learning architectures like shallow models or autoencoders, to biologically informed approaches based on gene regulatory networks, and to combinatorial efforts reminiscent of ensemble learning. We also discuss the rising trend of large foundational models in single-cell perturbation modelling inspired by large language models. Lastly, we critically assess the challenges that underlie single-cell perturbation modelling while pointing towards relevant future perspectives like perturbation atlases, multi-omics and spatial datasets, causal machine learning for interpretability, multi-task learning for performance and explainability as well as prospects for solving interoperability and benchmarking pitfalls.

Wang FA, Li Y, Zeng T. Deep Learning of radiology-genomics integration for computational oncology: A mini review. Comput Struct Biotechnol J 2024;23:2708-2716. [PMID: 39035833 PMCID: PMC11260400 DOI: 10.1016/j.csbj.2024.06.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2024] [Revised: 06/18/2024] [Accepted: 06/18/2024] [Indexed: 07/23/2024] Open

Hsieh KL, Zhang K, Chu Y, Yu L, Li X, Hu N, Kawosa I, Pilié PG, Bhattacharya PK, Zhi D, Jiang X, Zhao Z, Dai Y. iGTP: Learning interpretable cellular embedding for inferring biological mechanisms underlying single-cell transcriptomics. medRxiv 2024:2024.03.29.24305092. [PMID: 39649598 PMCID: PMC11623718 DOI: 10.1101/2024.03.29.24305092] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Indexed: 12/11/2024]

de Weerd HA, Guala D, Gustafsson M, Synnergren J, Tegnér J, Lubovac-Pilav Z, Magnusson R. Latent space arithmetic on data embeddings from healthy multi-tissue human RNA-seq decodes disease modules. Patterns (N Y) 2024;5:101093. [PMID: 39568475 PMCID: PMC11573900 DOI: 10.1016/j.patter.2024.101093] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/24/2024] [Revised: 08/26/2024] [Accepted: 10/11/2024] [Indexed: 11/22/2024]

Almet AA, Tsai YC, Watanabe M, Nie Q. Inferring pattern-driving intercellular flows from single-cell and spatial transcriptomics. Nat Methods 2024;21:1806-1817. [PMID: 39187683 PMCID: PMC11466815 DOI: 10.1038/s41592-024-02380-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/04/2023] [Accepted: 07/23/2024] [Indexed: 08/28/2024]

Joy MT, Carmichael ST. Activity-dependent transcriptional programs in memory regulate motor recovery after stroke. Commun Biol 2024;7:1048. [PMID: 39183218 PMCID: PMC11345429 DOI: 10.1038/s42003-024-06723-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/15/2023] [Accepted: 08/12/2024] [Indexed: 08/27/2024] Open

Maruhashi K, Kashima H, Miyano S, Park H. Meta graphical lasso: uncovering hidden interactions among latent mechanisms. Sci Rep 2024;14:18105. [PMID: 39103384 PMCID: PMC11300637 DOI: 10.1038/s41598-024-68959-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/07/2024] [Accepted: 07/30/2024] [Indexed: 08/07/2024] Open

van Hilten A, Katz S, Saccenti E, Niessen WJ, Roshchupkin GV. Designing interpretable deep learning applications for functional genomics: a quantitative analysis. Brief Bioinform 2024;25:bbae449. [PMID: 39293804 PMCID: PMC11410376 DOI: 10.1093/bib/bbae449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 04/05/2024] [Revised: 08/07/2024] [Accepted: 08/28/2024] [Indexed: 09/20/2024] Open

Wagle MM, Long S, Chen C, Liu C, Yang P. Interpretable deep learning in single-cell omics. Bioinformatics 2024;40:btae374. [PMID: 38889275 PMCID: PMC11211213 DOI: 10.1093/bioinformatics/btae374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/25/2024] [Revised: 05/11/2024] [Accepted: 06/12/2024] [Indexed: 06/20/2024] Open

Rivero-Garcia I, Torres M, Sánchez-Cabo F. Deep generative models in single-cell omics. Comput Biol Med 2024;176:108561. [PMID: 38749321 DOI: 10.1016/j.compbiomed.2024.108561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/26/2024] [Revised: 04/30/2024] [Accepted: 05/05/2024] [Indexed: 05/31/2024]

Yang Y, Seninge L, Wang Z, Oro A, Stuart JM, Ding H. The manatee variational autoencoder model for predicting gene expression alterations caused by transcription factor perturbations. Sci Rep 2024;14:11794. [PMID: 38782963 PMCID: PMC11116378 DOI: 10.1038/s41598-024-62620-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/11/2023] [Accepted: 05/20/2024] [Indexed: 05/25/2024] Open

Chen H, Lu Y, Dai Z, Yang Y, Li Q, Rao Y. Comprehensive single-cell RNA-seq analysis using deep interpretable generative modeling guided by biological hierarchy knowledge. Brief Bioinform 2024;25:bbae314. [PMID: 38960404 PMCID: PMC11221887 DOI: 10.1093/bib/bbae314] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/13/2023] [Revised: 12/13/2023] [Accepted: 06/20/2024] [Indexed: 07/05/2024] Open

Luo X, Niyakan S, Johnstone P, McCorkle S, Park G, López-Marrero V, Yoo S, Dougherty ER, Qian X, Alexander FJ, Jha S, Yoon BJ. Pathway-based analyses of gene expression profiles at low doses of ionizing radiation. Front Bioinform 2024;4:1280971. [PMID: 38812660 PMCID: PMC11135168 DOI: 10.3389/fbinf.2024.1280971] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/21/2023] [Accepted: 04/16/2024] [Indexed: 05/31/2024] Open

Pancotti C, Rollo C, Codicè F, Birolo G, Fariselli P, Sanavia T. MUSE-XAE: MUtational Signature Extraction with eXplainable AutoEncoder enhances tumour types classification. Bioinformatics 2024;40:btae320. [PMID: 38754097 PMCID: PMC11139523 DOI: 10.1093/bioinformatics/btae320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/28/2023] [Revised: 04/08/2024] [Accepted: 05/15/2024] [Indexed: 05/18/2024] Open

Davidson NR, Zhang F, Greene CS. BuDDI: Bulk Deconvolution with Domain Invariance to predict cell-type-specific perturbations from bulk. bioRxiv 2024:2023.07.20.549951. [PMID: 37503097 PMCID: PMC10370205 DOI: 10.1101/2023.07.20.549951] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 07/29/2023]

Wei Z, Chenjun W, Feiyang X, Mingfeng J, Yixuan Z, Qi L, Zhuoxing S, Qi D. scHybridBERT: integrating gene regulation and cell graph for spatiotemporal dynamics in single-cell clustering. Brief Bioinform 2024;25:bbae018. [PMID: 38517692 PMCID: PMC10959234 DOI: 10.1093/bib/bbae018] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 08/14/2023] [Revised: 12/19/2023] [Accepted: 01/09/2024] [Indexed: 03/24/2024] Open

Hu T, Allam M, Cai S, Henderson W, Yueh B, Garipcan A, Ievlev AV, Afkarian M, Beyaz S, Coskun AF. Single-cell spatial metabolomics with cell-type specific protein profiling for tissue systems biology. Nat Commun 2023;14:8260. [PMID: 38086839 PMCID: PMC10716522 DOI: 10.1038/s41467-023-43917-5] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Received: 01/06/2023] [Accepted: 11/23/2023] [Indexed: 12/18/2023] Open

Baig Y, Ma HR, Xu H, You L. Autoencoder neural networks enable low dimensional structure analyses of microbial growth dynamics. Nat Commun 2023;14:7937. [PMID: 38049401 PMCID: PMC10696002 DOI: 10.1038/s41467-023-43455-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/30/2023] [Accepted: 11/09/2023] [Indexed: 12/06/2023] Open

Toussaint PA, Leiser F, Thiebes S, Schlesner M, Brors B, Sunyaev A. Explainable artificial intelligence for omics data: a systematic mapping study. Brief Bioinform 2023;25:bbad453. [PMID: 38113073 PMCID: PMC10729786 DOI: 10.1093/bib/bbad453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/23/2022] [Revised: 07/28/2023] [Accepted: 11/08/2023] [Indexed: 12/21/2023] Open

Li S, Guo H, Zhang S, Li Y, Li M. Attention-based deep clustering method for scRNA-seq cell type identification. PLoS Comput Biol 2023;19:e1011641. [PMID: 37948464 PMCID: PMC10703402 DOI: 10.1371/journal.pcbi.1011641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/28/2023] [Revised: 12/07/2023] [Accepted: 10/30/2023] [Indexed: 11/12/2023] Open

Abstract

Single-cell RNA sequencing (scRNA-seq) technology provides higher resolution of cellular differences than bulk RNA sequencing and reveals heterogeneity in biological research. The analysis of scRNA-seq datasets is premised on subpopulation assignment. When an appropriate reference is not available, such as specific marker genes or a single-cell reference atlas, unsupervised clustering approaches become the predominant option. However, the inherent sparsity and high dimensionality of scRNA-seq datasets pose specific analytical challenges to traditional clustering methods. Therefore, various deep learning-based methods have been proposed to address these challenges. Because each method improves only partially, a comprehensive method is needed. In this article, we propose a novel scRNA-seq data clustering method named AttentionAE-sc (Attention fusion AutoEncoder for single-cell). It combines two different scRNA-seq clustering strategies through an attention mechanism: zero-inflated negative binomial (ZINB)-based methods, which deal with the impact of dropout events, and graph autoencoder (GAE)-based methods, which rely on information from neighbors to guide the dimension reduction. Based on an iterative fusion between denoising and topological embeddings, AttentionAE-sc can easily acquire clustering-friendly cell representations in which similar cells are closer in the hidden embedding. Compared with several state-of-the-art baseline methods, AttentionAE-sc demonstrated excellent clustering performance on 16 real scRNA-seq datasets without the need to specify the number of groups. Additionally, AttentionAE-sc learned improved cell representations and exhibited enhanced stability and robustness. Furthermore, AttentionAE-sc achieved remarkable identification in a breast cancer single-cell atlas dataset and provided valuable insights into the heterogeneity among different cell subtypes.


Yang Y, McCullough CG, Seninge L, Guo L, Kwon WJ, Zhang Y, Li NY, Gaddam S, Pan C, Zhen H, Torkelson J, Glass IA, Charville G, Que J, Stuart J, Ding H, Oro A. A Spatiotemporal and Machine-Learning Platform Accelerates the Manufacturing of hPSC-derived Esophageal Mucosa. bioRxiv 2023:2023.10.24.563664. [PMID: 37961271 PMCID: PMC10634774 DOI: 10.1101/2023.10.24.563664] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]

Yelmen B, Decelle A, Boulos LL, Szatkownik A, Furtlehner C, Charpiat G, Jay F. Deep convolutional and conditional neural networks for large-scale genomic data generation. PLoS Comput Biol 2023;19:e1011584. [PMID: 37903158 PMCID: PMC10635570 DOI: 10.1371/journal.pcbi.1011584] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 11/09/2023] [Accepted: 10/09/2023] [Indexed: 11/01/2023] Open

Abstract

Applications of generative models for genomic data have gained significant momentum in the past few years, with scopes ranging from data characterization to generation of genomic segments and functional sequences. In our previous study, we demonstrated that generative adversarial networks (GANs) and restricted Boltzmann machines (RBMs) can be used to create novel high-quality artificial genomes (AGs) which can preserve the complex characteristics of real genomes such as population structure, linkage disequilibrium and selection signals. However, a major drawback of these models is scalability, since the large feature space of genome-wide data increases computational complexity vastly. To address this issue, we implemented a novel convolutional Wasserstein GAN (WGAN) model along with a novel conditional RBM (CRBM) framework for generating AGs with high SNP number. These networks implicitly learn the varying landscape of haplotypic structure in order to capture complex correlation patterns along the genome and generate a wide diversity of plausible haplotypes. We performed comparative analyses to assess both the quality of these generated haplotypes and the amount of possible privacy leakage from the training data. As the importance of genetic privacy becomes more prevalent, the need for effective privacy protection measures for genomic data increases. We used generative neural networks to create large artificial genome segments which possess many characteristics of real genomes without substantial privacy leakage from the training dataset. In the near future, with further improvements in haplotype quality and privacy preservation, large-scale artificial genome databases can be assembled to provide easily accessible surrogates of real databases, allowing researchers to conduct studies with diverse genomic data within a safe ethical framework in terms of donor privacy.


Martínez-Enguita D, Dwivedi SK, Jörnsten R, Gustafsson M. NCAE: data-driven representations using a deep network-coherent DNA methylation autoencoder identify robust disease and risk factor signatures. Brief Bioinform 2023;24:bbad293. [PMID: 37587790 PMCID: PMC10516364 DOI: 10.1093/bib/bbad293] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 07/25/2023] [Accepted: 07/29/2023] [Indexed: 08/18/2023] Open

Schuster V, Krogh A. The Deep Generative Decoder: MAP estimation of representations improves modelling of single-cell RNA data. Bioinformatics 2023;39:btad497. [PMID: 37572301 PMCID: PMC10483129 DOI: 10.1093/bioinformatics/btad497] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2022] [Revised: 07/12/2023] [Accepted: 08/10/2023] [Indexed: 08/14/2023] Open

Almet AA, Yuan H, Annusver K, Ramos R, Liu Y, Wiedemann J, Sorkin DH, Landén NX, Sonkoly E, Haniffa M, Nie Q, Lichtenberger BM, Luecken MD, Andersen B, Tsoi LC, Watt FM, Gudjonsson JE, Plikus MV, Kasper M. A Roadmap for a Consensus Human Skin Cell Atlas and Single-Cell Data Standardization. J Invest Dermatol 2023;143:1667-1677. [PMID: 37612031 PMCID: PMC10610458 DOI: 10.1016/j.jid.2023.03.1679] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Revised: 03/24/2023] [Accepted: 03/29/2023] [Indexed: 08/25/2023]

Affiliation(s)

Axel A Almet: Department of Mathematics, University of California, Irvine, Irvine, California, USA; NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, California, USA
Hao Yuan: Department of Cell and Molecular Biology, Karolinska Institute, Stockholm, Sweden
Karl Annusver: Department of Cell and Molecular Biology, Karolinska Institute, Stockholm, Sweden
Raul Ramos: NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, California, USA; Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, California, USA; Sue and Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, California, USA
Yingzi Liu: Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, California, USA; Sue and Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, California, USA
Julie Wiedemann: Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, California, USA; Mathematical, Computational & Systems Biology, Department of Medicine, University of California, Irvine, Irvine, California, USA
Dara H Sorkin: Institute for Clinical & Translational Science, University of California, Irvine, Irvine, California, USA; Department of Medicine, School of Medicine, University of California, Irvine, Irvine, California, USA
Ning Xu Landén: Dermatology and Venereology Division, Department of Medicine, Solna, Karolinska Institute, Stockholm, Sweden; Center for Molecular Medicine, Karolinska Institute, Stockholm, Sweden; Ming Wai Lau Centre for Reparative Medicine, Karolinska Institute, Stockholm, Sweden
Enikö Sonkoly: Dermatology and Venereology Division, Department of Medicine, Solna, Karolinska Institute, Stockholm, Sweden; Center for Molecular Medicine, Karolinska Institute, Stockholm, Sweden; Dermatology and Venereology, Department of Medical Sciences, Uppsala University, Uppsala, Sweden
Muzlifah Haniffa: Wellcome Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridge, United Kingdom; Biosciences Institute, Newcastle University, Newcastle Upon Tyne, United Kingdom; Department of Dermatology and NIHR Newcastle Biomedical Research Centre, Newcastle Hospitals NHS Foundation Trust, Newcastle Upon Tyne, United Kingdom
Qing Nie: Department of Mathematics, University of California, Irvine, Irvine, California, USA; NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, California, USA; Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, California, USA
Beate M Lichtenberger: Skin & Endothelium Research Division (SERD), Department of Dermatology, Medical University of Vienna, Vienna, Austria
Malte D Luecken: Institute of Computational Biology, Helmholtz Munich, Neuherberg, Germany; Institute of Lung Health and Immunity, Helmholtz Munich, Member of the German Center for Lung Research (DZL), Munich, Germany
Bogi Andersen: NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, California, USA; Sue and Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, California, USA; Department of Medicine, School of Medicine, University of California, Irvine, Irvine, California, USA; Department of Biological Chemistry, School of Medicine, University of California, Irvine, Irvine, California, USA
Lam C Tsoi: Department of Dermatology, University of Michigan, Ann Arbor, Michigan, USA; Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, USA; Department of Biostatistics, School of Public Health, University of Michigan, Ann Arbor, Michigan, USA; Center for Statistical Genetics, School of Public Health, University of Michigan, Ann Arbor, Michigan, USA
Fiona M Watt: Centre for Gene Therapy & Regenerative Medicine, Faculty of Life Sciences & Medicine, School of Basic & Medical Biosciences, King's College London, London, United Kingdom; Directors' Research Unit, European Molecular Biology Laboratory, Heidelberg, Germany
Johann E Gudjonsson: Department of Dermatology, University of Michigan, Ann Arbor, Michigan, USA
Maksim V Plikus: NSF-Simons Center for Multiscale Cell Fate Research, University of California, Irvine, Irvine, California, USA; Department of Developmental and Cell Biology, School of Biological Sciences, University of California, Irvine, Irvine, California, USA; Sue and Bill Gross Stem Cell Research Center, University of California, Irvine, Irvine, California, USA
Maria Kasper: Department of Cell and Molecular Biology, Karolinska Institute, Stockholm, Sweden


Yelmen B, Jay F. An Overview of Deep Generative Models in Functional and Evolutionary Genomics. Annu Rev Biomed Data Sci 2023;6:173-189. [PMID: 37137168 DOI: 10.1146/annurev-biodatasci-020722-115651] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]

Wysocka M, Wysocki O, Zufferey M, Landers D, Freitas A. A systematic review of biologically-informed deep learning models for cancer: fundamental trends for encoding and interpreting oncology data. BMC Bioinformatics 2023;24:198. [PMID: 37189058 PMCID: PMC10186658 DOI: 10.1186/s12859-023-05262-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 03/30/2023] [Indexed: 05/17/2023] Open

Abstract

BACKGROUND

There is an increasing interest in the use of Deep Learning (DL) based methods as a supporting analytical framework in oncology. However, most direct applications of DL will deliver models with limited transparency and explainability, which constrain their deployment in biomedical settings.

METHODS

This systematic review discusses DL models used to support inference in cancer biology with a particular emphasis on multi-omics analysis. It focuses on how existing models address the need for better dialogue with prior knowledge, biological plausibility and interpretability, fundamental properties in the biomedical domain. For this, we retrieved and analyzed 42 studies focusing on emerging architectural and methodological advances, the encoding of biological domain knowledge and the integration of explainability methods.

RESULTS

We discuss the recent evolutionary arc of DL models in the direction of integrating prior biological relational and network knowledge (e.g. pathways or Protein-Protein-Interaction networks) to support better generalisation and interpretability. This represents a fundamental functional shift towards models which can integrate mechanistic and statistical inference aspects. We introduce a concept of bio-centric interpretability and, according to its taxonomy, discuss representational methodologies for the integration of domain prior knowledge in such models.

CONCLUSIONS

The paper provides a critical outlook into contemporary methods for explainability and interpretability used in DL for cancer. The analysis points in the direction of a convergence between encoding prior knowledge and improved interpretability. We introduce bio-centric interpretability which is an important step towards formalisation of biological interpretability of DL models and developing methods that are less problem- or application-specific.


Paylar B, Längkvist M, Jass J, Olsson PE. Utilization of Computer Classification Methods for Exposure Prediction and Gene Selection in Daphnia magna Toxicogenomics. Biology 2023;12:biology12050692. [PMID: 37237504 DOI: 10.3390/biology12050692] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Revised: 05/02/2023] [Accepted: 05/06/2023] [Indexed: 05/28/2023]

Janizek JD, Spiro A, Celik S, Blue BW, Russell JC, Lee TI, Kaeberlein M, Lee SI. PAUSE: principled feature attribution for unsupervised gene expression analysis. Genome Biol 2023;24:81. [PMID: 37076856 PMCID: PMC10114348 DOI: 10.1186/s13059-023-02901-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Accepted: 03/17/2023] [Indexed: 04/21/2023] Open

Utriainen M, Morris JH. clusterMaker2: a major update to clusterMaker, a multi-algorithm clustering app for Cytoscape. BMC Bioinformatics 2023;24:134. [PMID: 37020209 PMCID: PMC10074866 DOI: 10.1186/s12859-023-05225-z] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2022] [Accepted: 03/11/2023] [Indexed: 04/07/2023] Open

Abstract

BACKGROUND

Since the initial publication of clusterMaker, the need for tools to analyze large biological datasets has only increased. New datasets are significantly larger than a decade ago, and new experimental techniques such as single-cell transcriptomics continue to drive the need for clustering or classification techniques to focus on portions of datasets of interest. While many libraries and packages exist that implement various algorithms, there remains the need for clustering packages that are easy to use, integrated with visualization of the results, and integrated with other commonly used tools for biological data analysis. clusterMaker2 has added several new algorithms, including two entirely new categories of analyses: node ranking and dimensionality reduction. Furthermore, many of the new algorithms have been implemented using the Cytoscape jobs API, which provides a mechanism for executing remote jobs from within Cytoscape. Together, these advances facilitate meaningful analyses of modern biological datasets despite their ever-increasing size and complexity.

RESULTS

The use of clusterMaker2 is exemplified by reanalyzing the yeast heat shock expression experiment that was included in our original paper; however, here we explored this dataset in significantly more detail. Combining this dataset with the yeast protein-protein interaction network from STRING, we were able to perform a variety of analyses and visualizations from within clusterMaker2, including Leiden clustering to break the entire network into smaller clusters, hierarchical clustering to look at the overall expression dataset, dimensionality reduction using UMAP to find correlations between our hierarchical visualization and the UMAP plot, fuzzy clustering, and cluster ranking. Using these techniques, we were able to explore the highest-ranking cluster and determine that it represents a strong contender for proteins working together in response to heat shock. We found a series of clusters that, when re-explored as fuzzy clusters, provide a better presentation of mitochondrial processes.

CONCLUSIONS

clusterMaker2 represents a significant advance over the previously published version, and most importantly, provides an easy-to-use tool to perform clustering and to visualize clusters within the Cytoscape network context. The new algorithms should be welcome to the large population of Cytoscape users, particularly the new dimensionality reduction and fuzzy clustering techniques.


Choi Y, Li R, Quon G. siVAE: interpretable deep generative models for single-cell transcriptomes. Genome Biol 2023;24:29. [PMID: 36803416 PMCID: PMC9940350 DOI: 10.1186/s13059-023-02850-y] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2022] [Accepted: 01/06/2023] [Indexed: 02/22/2023] Open

Lotfollahi M, Rybakov S, Hrovatin K, Hediyeh-Zadeh S, Talavera-López C, Misharin AV, Theis FJ. Biologically informed deep learning to query gene programs in single-cell atlases. Nat Cell Biol 2023;25:337-350. [PMID: 36732632 PMCID: PMC9928587 DOI: 10.1038/s41556-022-01072-x] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2022] [Accepted: 12/08/2022] [Indexed: 02/04/2023]

Zhang Y, Wang M, Wang Z, Liu Y, Xiong S, Zou Q. MetaSEM: Gene Regulatory Network Inference from Single-Cell RNA Data by Meta-Learning. Int J Mol Sci 2023;24:2595. [PMID: 36768917 PMCID: PMC9916710 DOI: 10.3390/ijms24032595] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Revised: 01/23/2023] [Accepted: 01/26/2023] [Indexed: 01/31/2023] Open

Wang L, Nie R, Zhang J, Cai J. scCapsNet-mask: an updated version of scCapsNet with extended applicability in functional analysis related to scRNA-seq data. BMC Bioinformatics 2022;23:539. [PMID: 36510124 PMCID: PMC9743530 DOI: 10.1186/s12859-022-05098-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Accepted: 12/03/2022] [Indexed: 12/14/2022] Open

Abstract

BACKGROUND

With the rapid accumulation of scRNA-seq data, more and more automatic cell type identification methods have been developed, especially those based on deep learning. Although these methods have reached relatively high prediction accuracy, many issues still exist. One is interpretability. The second is how to deal with non-standard test samples that are not encountered in the training process.

RESULTS

Here we introduce scCapsNet-mask, an updated version of scCapsNet. The scCapsNet-mask provides a reasonable solution to the issues of interpretability and non-standard test samples. Firstly, the scCapsNet-mask utilizes a mask to ease the task of model interpretation in the original scCapsNet. The results show that scCapsNet-mask could constrain the coupling coefficients and establish a one-to-one correspondence between the primary capsules and type capsules. Secondly, the scCapsNet-mask can process non-standard samples more reasonably. In one example, the scCapsNet-mask was trained on committed cells and then tested on less differentiated cells as the non-standard samples. It could not only estimate the lineage bias of less differentiated cells, but also distinguish the development stages more accurately than traditional machine learning models. Therefore, the pseudo-temporal order of cells for each lineage could be established. Following this pseudo-temporal order, lineage-specific genes exhibit a gradually increasing expression pattern and stem cell-associated genes exhibit a gradually decreasing expression pattern. In another example, the scCapsNet-mask was trained on scRNA-seq data and then used to assign cell types in spatial transcriptomics data that may contain non-standard samples of doublets. The results show that the scCapsNet-mask not only restored the spatial map but also identified several non-standard samples of doublets.

CONCLUSIONS

The scCapsNet-mask offers a suitable solution to the challenge of interpretability and non-standard test samples. By adding a mask, it has the advantages of automatic processing and easy interpretation compared with the original scCapsNet. In addition, the scCapsNet-mask could more accurately reflect the composition of non-standard test samples than traditional machine learning methods. Therefore, it can extend its applicability in functional analysis, such as fate bias prediction in less differentiated cells and cell type assignment in spatial transcriptomics.
