Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Lopez R, Gayoso A, Yosef N. Enhancing scientific discoveries in molecular biology with deep generative models. Mol Syst Biol 2020;16:e9198. [PMID: 32975352 PMCID: PMC7517326 DOI: 10.15252/msb.20199198] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 04/10/2020] [Accepted: 07/09/2020] [Indexed: 12/15/2022] Open

For:	Lopez R, Gayoso A, Yosef N. Enhancing scientific discoveries in molecular biology with deep generative models. Mol Syst Biol 2020;16:e9198. [PMID: 32975352 PMCID: PMC7517326 DOI: 10.15252/msb.20199198] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 04/10/2020] [Accepted: 07/09/2020] [Indexed: 12/15/2022] Open

Number

Cited by Other Article(s)

Plata G, Srinivasan K, Krishnamurthy M, Herron L, Dixit P. Designing host-associated microbiomes using the consumer/resource model. mSystems 2025;10:e0106824. [PMID: 39651880 PMCID: PMC11748559 DOI: 10.1128/msystems.01068-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2024] [Accepted: 11/06/2024] [Indexed: 12/18/2024] Open

Abstract

A key step toward rational microbiome engineering is in silico sampling of realistic microbial communities that correspond to desired host phenotypes, and vice versa. This remains challenging due to a lack of generative models that simultaneously capture compositions of host-associated microbiomes and host phenotypes. To that end, we present a generative model based on the mechanistic consumer/resource (C/R) framework. In the model, variation in microbial ecosystem composition arises due to differences in the availability of effective resources (inferred latent variables), while species' resource preferences remain conserved. Simultaneously, the latent variables are used to model phenotypic states of hosts. In silico microbiomes generated by our model accurately reproduce universal and dataset-specific statistics of bacterial communities. The model allows us to address three salient questions in host-associated microbial ecologies: (i) which host phenotypes maximally constrain the composition of the host-associated microbiomes? (ii) how context-specific are phenotype/microbiome associations, and (iii) what are plausible microbiome compositions that correspond to desired host phenotypes? Our approach aids the analysis and design of microbial communities associated with host phenotypes of interest.

IMPORTANCE

Generative models are extremely popular in modern biology. They have been used to model the variation of protein sequences, entire genomes, and RNA sequencing profiles. Importantly, generative models have been used to extrapolate and interpolate to unobserved regimes of data to design biological systems with desired properties. For example, there has been a boom in machine-learning models aiding in the design of proteins with user-specified structures or functions. Host-associated microbiomes play important roles in animal health and disease, as well as the productivity and environmental footprint of livestock species. However, there are no generative models of host-associated microbiomes. One chief reason is that off-the-shelf machine-learning models are data hungry, and microbiome studies usually deal with large variability and small sample sizes. Moreover, microbiome compositions are heavily context dependent, with characteristics of the host and the abiotic environment leading to distinct patterns in host-microbiome associations. Consequently, off-the-shelf generative modeling has not been successfully applied to microbiomes.To address these challenges, we develop a generative model for host-associated microbiomes derived from the consumer/resource (C/R) framework. This derivation allows us to fit the model to readily available cross-sectional microbiome profile data. Using data from three animal hosts, we show that this mechanistic generative model has several salient features: the model identifies a latent space that represents variables that determine the growth and, therefore, relative abundances of microbial species. Probabilistic modeling of variation in this latent space allows us to generate realistic in silico microbial communities. The model can assign probabilities to microbiomes, thereby allowing us to discriminate between dissimilar ecosystems. Importantly, the model predictively captures host-associated microbiomes and the corresponding hosts' phenotypes, enabling the design of microbial communities associated with user-specified host characteristics.

Collapse

Jin L, Zhou Y, Zhang S, Chen SJ. mRNA vaccine sequence and structure design and optimization: Advances and challenges. J Biol Chem 2025;301:108015. [PMID: 39608721 PMCID: PMC11728972 DOI: 10.1016/j.jbc.2024.108015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Revised: 11/13/2024] [Accepted: 11/16/2024] [Indexed: 11/30/2024] Open

Schuster V, Dann E, Krogh A, Teichmann SA. multiDGD: A versatile deep generative model for multi-omics data. Nat Commun 2024;15:10031. [PMID: 39567490 PMCID: PMC11579284 DOI: 10.1038/s41467-024-53340-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2024] [Accepted: 10/03/2024] [Indexed: 11/22/2024] Open

Boyeau P, Bates S, Ergen C, Jordan MI, Yosef N. VI-VS: calibrated identification of feature dependencies in single-cell multiomics. Genome Biol 2024;25:294. [PMID: 39548591 PMCID: PMC11566124 DOI: 10.1186/s13059-024-03419-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 10/08/2024] [Indexed: 11/18/2024] Open

He Z, Hu S, Chen Y, An S, Zhou J, Liu R, Shi J, Wang J, Dong G, Shi J, Zhao J, Ou-Yang L, Zhu Y, Bo X, Ying X. Mosaic integration and knowledge transfer of single-cell multimodal data with MIDAS. Nat Biotechnol 2024;42:1594-1605. [PMID: 38263515 PMCID: PMC11471558 DOI: 10.1038/s41587-023-02040-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2022] [Accepted: 10/23/2023] [Indexed: 01/25/2024]

Luo E, Hao M, Wei L, Zhang X. scDiffusion: conditional generation of high-quality single-cell data using diffusion model. Bioinformatics 2024;40:btae518. [PMID: 39171840 PMCID: PMC11368386 DOI: 10.1093/bioinformatics/btae518] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2024] [Revised: 08/10/2024] [Accepted: 08/20/2024] [Indexed: 08/23/2024] Open

Abstract

MOTIVATION

Single-cell RNA sequencing (scRNA-seq) data are important for studying the laws of life at single-cell level. However, it is still challenging to obtain enough high-quality scRNA-seq data. To mitigate the limited availability of data, generative models have been proposed to computationally generate synthetic scRNA-seq data. Nevertheless, the data generated with current models are not very realistic yet, especially when we need to generate data with controlled conditions. In the meantime, diffusion models have shown their power in generating data with high fidelity, providing a new opportunity for scRNA-seq generation.

RESULTS

In this study, we developed scDiffusion, a generative model combining the diffusion model and foundation model to generate high-quality scRNA-seq data with controlled conditions. We designed multiple classifiers to guide the diffusion process simultaneously, enabling scDiffusion to generate data under multiple condition combinations. We also proposed a new control strategy called Gradient Interpolation. This strategy allows the model to generate continuous trajectories of cell development from a given cell state. Experiments showed that scDiffusion could generate single-cell gene expression data closely resembling real scRNA-seq data. Also, scDiffusion can conditionally produce data on specific cell types including rare cell types. Furthermore, we could use the multiple-condition generation of scDiffusion to generate cell type that was out of the training data. Leveraging the Gradient Interpolation strategy, we generated a continuous developmental trajectory of mouse embryonic cells. These experiments demonstrate that scDiffusion is a powerful tool for augmenting the real scRNA-seq data and can provide insights into cell fate research.

AVAILABILITY AND IMPLEMENTATION

scDiffusion is openly available at the GitHub repository https://github.com/EperLuo/scDiffusion or Zenodo https://zenodo.org/doi/10.5281/zenodo.13268742.

Collapse

Huang W, Liu H. Predicting single-cell cellular responses to perturbations using cycle consistency learning. Bioinformatics 2024;40:i462-i470. [PMID: 38940153 PMCID: PMC11256949 DOI: 10.1093/bioinformatics/btae248] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/29/2024] Open

Rivero-Garcia I, Torres M, Sánchez-Cabo F. Deep generative models in single-cell omics. Comput Biol Med 2024;176:108561. [PMID: 38749321 DOI: 10.1016/j.compbiomed.2024.108561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Revised: 04/30/2024] [Accepted: 05/05/2024] [Indexed: 05/31/2024]

Maizels RJ. A dynamical perspective: moving towards mechanism in single-cell transcriptomics. Philos Trans R Soc Lond B Biol Sci 2024;379:20230049. [PMID: 38432314 PMCID: PMC10909508 DOI: 10.1098/rstb.2023.0049] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/31/2023] [Indexed: 03/05/2024] Open

Cusworth S, Gkoutos GV, Acharjee A. A novel generative adversarial networks modelling for the class imbalance problem in high dimensional omics data. BMC Med Inform Decis Mak 2024;24:90. [PMID: 38549123 PMCID: PMC10979623 DOI: 10.1186/s12911-024-02487-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/02/2023] [Accepted: 03/22/2024] [Indexed: 04/01/2024] Open

Gayoso A, Weiler P, Lotfollahi M, Klein D, Hong J, Streets A, Theis FJ, Yosef N. Deep generative modeling of transcriptional dynamics for RNA velocity analysis in single cells. Nat Methods 2024;21:50-59. [PMID: 37735568 PMCID: PMC10776389 DOI: 10.1038/s41592-023-01994-w] [Citation(s) in RCA: 30] [Impact Index Per Article: 30.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Accepted: 08/08/2023] [Indexed: 09/23/2023]

Plattner C, Lamberti G, Blattmann P, Kirchmair A, Rieder D, Loncova Z, Sturm G, Scheidl S, Ijsselsteijn M, Fotakis G, Noureen A, Lisandrelli R, Böck N, Nemati N, Krogsdam A, Daum S, Finotello F, Somarakis A, Schäfer A, Wilflingseder D, Gonzalez Acera M, Öfner D, Huber LA, Clevers H, Becker C, Farin HF, Greten FR, Aebersold R, de Miranda NF, Trajanoski Z. Functional and spatial proteomics profiling reveals intra- and intercellular signaling crosstalk in colorectal cancer. iScience 2023;26:108399. [PMID: 38047086 PMCID: PMC10692669 DOI: 10.1016/j.isci.2023.108399] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2022] [Revised: 04/21/2023] [Accepted: 11/02/2023] [Indexed: 12/05/2023] Open

Affiliation(s)

Christina Plattner Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Giorgia Lamberti Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Peter Blattmann Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8092 Zurich, Switzerland
Alexander Kirchmair Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Dietmar Rieder Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Zuzana Loncova Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Gregor Sturm Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Stefan Scheidl Department of Visceral, Transplant and Thoracic Surgery, Medical University of Innsbruck, 6020 Innsbruck, Austria
Marieke Ijsselsteijn Department of Pathology, Leiden University Medical Center, 2333 ZA Leiden, the Netherlands
Georgios Fotakis Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Asma Noureen Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Rebecca Lisandrelli Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Nina Böck Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Niloofar Nemati Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Anne Krogsdam Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Sophia Daum Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Francesca Finotello Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria
Antonios Somarakis Department of Radiology, Leiden University Medical Center, 2333 ZA Leiden, the Netherlands
Alexander Schäfer Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8092 Zurich, Switzerland
Doris Wilflingseder Institute of Hygiene and Medical Microbiology, Medical University of Innsbruck, 6020 Innsbruck, Austria
Miguel Gonzalez Acera Department of Medicine 1, Friedrich-Alexander Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, 91054 Erlangen, Germany
Dietmar Öfner Department of Visceral, Transplant and Thoracic Surgery, Medical University of Innsbruck, 6020 Innsbruck, Austria
Lukas A. Huber Biocenter, Institute of Cell Biology, Medical University of Innsbruck, 6020 Innsbruck, Austria
Hans Clevers Hubrecht Institute, 3584 CT Utrecht, the Netherlands
Christoph Becker Department of Medicine 1, Friedrich-Alexander Universität Erlangen-Nürnberg (FAU) and Universitätsklinikum Erlangen, 91054 Erlangen, Germany
Henner F. Farin Institute for Tumor Biology and Experimental Therapy, Georg-Speyer-Haus, 60596 Frankfurt am Main, Germany Frankfurt Cancer Institute, Goethe University, 60596 Frankfurt am Main, Germany German Cancer Consortium (DKTK), partner site Frankfurt/Mainz, a partnership with DKFZ Heidelberg, Frankfurt/Mainz, Germany
Florian R. Greten Institute for Tumor Biology and Experimental Therapy, Georg-Speyer-Haus, 60596 Frankfurt am Main, Germany Frankfurt Cancer Institute, Goethe University, 60596 Frankfurt am Main, Germany German Cancer Consortium (DKTK), partner site Frankfurt/Mainz, a partnership with DKFZ Heidelberg, Frankfurt/Mainz, Germany
Ruedi Aebersold Department of Biology, Institute of Molecular Systems Biology, ETH Zurich, 8092 Zurich, Switzerland
Noel F.C.C. de Miranda Department of Pathology, Leiden University Medical Center, 2333 ZA Leiden, the Netherlands
Zlatko Trajanoski Biocenter, Institute of Bioinformatics, Medical University of Innsbruck, 6020 Innsbruck, Austria

Collapse

Barghout RA, Xu Z, Betala S, Mahadevan R. Advances in generative modeling methods and datasets to design novel enzymes for renewable chemicals and fuels. Curr Opin Biotechnol 2023;84:103007. [PMID: 37931573 DOI: 10.1016/j.copbio.2023.103007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 09/12/2023] [Accepted: 09/13/2023] [Indexed: 11/08/2023]

Alexandrov T, Saez‐Rodriguez J, Saka SK. Enablers and challenges of spatial omics, a melting pot of technologies. Mol Syst Biol 2023;19:e10571. [PMID: 37842805 PMCID: PMC10632737 DOI: 10.15252/msb.202110571] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 07/31/2023] [Accepted: 08/03/2023] [Indexed: 10/17/2023] Open

Michoel T, Zhang JD. Causal inference in drug discovery and development. Drug Discov Today 2023;28:103737. [PMID: 37591410 DOI: 10.1016/j.drudis.2023.103737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 07/31/2023] [Accepted: 08/10/2023] [Indexed: 08/19/2023]

Valeri JA, Soenksen LR, Collins KM, Ramesh P, Cai G, Powers R, Angenent-Mari NM, Camacho DM, Wong F, Lu TK, Collins JJ. BioAutoMATED: An end-to-end automated machine learning tool for explanation and design of biological sequences. Cell Syst 2023;14:525-542.e9. [PMID: 37348466 PMCID: PMC10700034 DOI: 10.1016/j.cels.2023.05.007] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 02/17/2023] [Accepted: 05/22/2023] [Indexed: 06/24/2023]

Affiliation(s)

Jacqueline A Valeri Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Luis R Soenksen Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA
Katherine M Collins Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Department of Engineering, University of Cambridge, Trumpington St, Cambridge CB2 1PZ, UK
Pradeep Ramesh Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
George Cai Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
Rani Powers Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Pluto Biosciences, Golden, CO 80402, USA
Nicolaas M Angenent-Mari Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
Diogo M Camacho Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA
Felix Wong Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA
Timothy K Lu Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Synthetic Biology Group, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
James J Collins Department of Biological Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Institute for Medical Engineering and Science, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA 02139, USA; Wyss Institute for Biologically Inspired Engineering, Harvard University, Boston, MA 02115, USA; Broad Institute of MIT and Harvard, Cambridge, MA 02142, USA; Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA 02139, USA; Abdul Latif Jameel Clinic for Machine Learning in Health, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.

Collapse

Boyeau P, Regier J, Gayoso A, Jordan MI, Lopez R, Yosef N. An empirical Bayes method for differential expression analysis of single cells with deep generative models. Proc Natl Acad Sci U S A 2023;120:e2209124120. [PMID: 37192164 PMCID: PMC10214125 DOI: 10.1073/pnas.2209124120] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Accepted: 01/23/2023] [Indexed: 05/18/2023] Open

Lotfollahi M, Klimovskaia Susmelj A, De Donno C, Hetzel L, Ji Y, Ibarra IL, Srivatsan SR, Naghipourfar M, Daza RM, Martin B, Shendure J, McFaline-Figueroa JL, Boyeau P, Wolf FA, Yakubova N, Günnemann S, Trapnell C, Lopez-Paz D, Theis FJ. Predicting cellular responses to complex perturbations in high-throughput screens. Mol Syst Biol 2023:e11517. [PMID: 37154091 DOI: 10.15252/msb.202211517] [Citation(s) in RCA: 77] [Impact Index Per Article: 38.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2022] [Revised: 03/23/2023] [Accepted: 03/31/2023] [Indexed: 05/10/2023] Open

Affiliation(s)

Mohammad Lotfollahi Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
Anna Klimovskaia Susmelj Meta AI, Paris, France Swiss Data Science Center, Zurich, Switzerland
Carlo De Donno Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Leon Hetzel Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany Department of Mathematics, Technical University of Munich, Munich, Germany
Yuge Ji Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany
Ignacio L Ibarra Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany
Sanjay R Srivatsan Department of Genome Sciences, University of Washington, Seattle, WA, USA
Mohsen Naghipourfar Department of Bioengineering, University of California, Berkeley, CA, USA
Riza M Daza Department of Genome Sciences, University of Washington, Seattle, WA, USA
Beth Martin Department of Genome Sciences, University of Washington, Seattle, WA, USA
Jay Shendure Department of Genome Sciences, University of Washington, Seattle, WA, USA Howard Hughes Medical Institute, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA
Jose L McFaline-Figueroa Department of Biomedical Engineering, Columbia University, New York, NY, USA
Pierre Boyeau Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA, USA
F Alexander Wolf Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany
Nafissa Yakubova Meta AI, Paris, France
Stephan Günnemann Department of Computer Science, Technical University of Munich, Munich, Germany
Cole Trapnell Department of Genome Sciences, University of Washington, Seattle, WA, USA Brotman Baty Institute for Precision Medicine, Seattle, WA, USA Allen Discovery Center for Cell Lineage Tracing, Seattle, WA, USA
David Lopez-Paz Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK
Fabian J Theis Helmholtz Center Munich - German Research Center for Environmental Health, Institute of Computational Biology, Munich, Germany Wellcome Trust Sanger Institute, Wellcome Genome Campus, Hinxton, Cambridgeshire, UK School of Life Sciences Weihenstephan, Technical University of Munich, Munich, Germany Department of Mathematics, Technical University of Munich, Munich, Germany

Collapse

Dou Z, Sun Y, Jiang X, Wu X, Li Y, Gong B, Wang L. Data-driven strategies for the computational design of enzyme thermal stability: trends, perspectives, and prospects. Acta Biochim Biophys Sin (Shanghai) 2023;55:343-355. [PMID: 37143326 PMCID: PMC10160227 DOI: 10.3724/abbs.2023033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2022] [Accepted: 11/23/2022] [Indexed: 03/05/2023] Open

Jirsa V, Wang H, Triebkorn P, Hashemi M, Jha J, Gonzalez-Martinez J, Guye M, Makhalova J, Bartolomei F. Personalised virtual brain models in epilepsy. Lancet Neurol 2023;22:443-454. [PMID: 36972720 DOI: 10.1016/s1474-4422(23)00008-x] [Citation(s) in RCA: 49] [Impact Index Per Article: 24.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2021] [Revised: 12/20/2022] [Accepted: 01/04/2023] [Indexed: 03/29/2023]

Affiliation(s)

Viktor Jirsa Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France.
Huifang Wang Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France
Paul Triebkorn Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France
Meysam Hashemi Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France
Jayant Jha Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France
Jorge Gonzalez-Martinez School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
Maxime Guye Centre National de la Recherche Scientifique, Center for Magnetic Resonance in Biology and Medicine, Aix Marseille Université, Marseille, France; Centre d'Exploration Métabolique par Résonance Magnétique, Assistance Publique - Hôpitaux de Marseille, La Timone University Hospital, Marseille, France
Julia Makhalova Centre National de la Recherche Scientifique, Center for Magnetic Resonance in Biology and Medicine, Aix Marseille Université, Marseille, France; Centre d'Exploration Métabolique par Résonance Magnétique, Assistance Publique - Hôpitaux de Marseille, La Timone University Hospital, Marseille, France; Epileptology and Clinical Neurophysiology Department, Assistance Publique - Hôpitaux de Marseille, La Timone University Hospital, Marseille, France
Fabrice Bartolomei Institut National de la Santé et de la Recherche Médicale, Institut de Neurosciences des Systèmes (INS) UMR1106, Aix Marseille Université, Marseille, France; Epileptology and Clinical Neurophysiology Department, Assistance Publique - Hôpitaux de Marseille, La Timone University Hospital, Marseille, France

Collapse

Brombacher E, Hackenberg M, Kreutz C, Binder H, Treppner M. The performance of deep generative models for learning joint embeddings of single-cell multi-omics data. Front Mol Biosci 2022;9:962644. [PMID: 36387277 PMCID: PMC9643784 DOI: 10.3389/fmolb.2022.962644] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Accepted: 10/12/2022] [Indexed: 11/07/2023] Open

Yeo HC, Selvarajoo K. Machine learning alternative to systems biology should not solely depend on data. Brief Bioinform 2022;23:6731718. [PMID: 36184188 PMCID: PMC9677488 DOI: 10.1093/bib/bbac436] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Revised: 08/24/2022] [Accepted: 09/09/2022] [Indexed: 12/14/2022] Open

Treppner M, Binder H, Hess M. Interpretable generative deep learning: an illustration with single cell gene expression data. Hum Genet 2022;141:1481-1498. [PMID: 34988661 PMCID: PMC9360114 DOI: 10.1007/s00439-021-02417-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 08/06/2021] [Indexed: 11/26/2022]

Lopez R, Li B, Keren-Shaul H, Boyeau P, Kedmi M, Pilzer D, Jelinski A, Yofe I, David E, Wagner A, Ergen C, Addadi Y, Golani O, Ronchese F, Jordan MI, Amit I, Yosef N. DestVI identifies continuums of cell types in spatial transcriptomics data. Nat Biotechnol 2022;40:1360-1369. [PMID: 35449415 PMCID: PMC9756396 DOI: 10.1038/s41587-022-01272-8] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 03/07/2022] [Indexed: 11/09/2022]

Affiliation(s)

Romain Lopez Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley CA, USA
Baoguo Li Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
Hadas Keren-Shaul Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
Pierre Boyeau Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley CA, USA
Merav Kedmi Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
David Pilzer Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
Adam Jelinski Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
Ido Yofe Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
Eyal David Department of Immunology, Weizmann Institute of Science, Rehovot, Israel
Allon Wagner Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley CA, USA
Can Ergen Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley CA, USA
Yoseph Addadi Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
Ofra Golani Department of Life Sciences Core Facilities, Weizmann Institute of Science, Rehovot, Israel
Franca Ronchese Malaghan Institute of Medical Research, Wellington, New Zealand
Michael I Jordan Department of Immunology, Weizmann Institute of Science, Rehovot, Israel Department of Statistics, University of California, Berkeley, Berkeley CA, USA
Ido Amit Department of Immunology, Weizmann Institute of Science, Rehovot, Israel.
Nir Yosef Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley CA, USA. Center for Computational Biology, University of California, Berkeley, Berkeley CA, USA. Chan Zuckerberg Biohub, San Francisco CA, USA. Ragon Institute of MGH, MIT and Harvard, Cambridge MA, USA.

Collapse

Martinelli DD. Generative machine learning for de novo drug discovery: A systematic review. Comput Biol Med 2022;145:105403. [PMID: 35339849 DOI: 10.1016/j.compbiomed.2022.105403] [Citation(s) in RCA: 39] [Impact Index Per Article: 13.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2022] [Revised: 03/10/2022] [Accepted: 03/11/2022] [Indexed: 02/08/2023]

Abstract

Recent research on artificial intelligence indicates that machine learning algorithms can auto-generate novel drug-like molecules. Generative models have revolutionized de novo drug discovery, rendering the explorative process more efficient. Several model frameworks and input formats have been proposed to enhance the performance of intelligent algorithms in generative molecular design. In this systematic literature review of experimental articles and reviews over the last five years, machine learning models, challenges associated with computational molecule design along with proposed solutions, and molecular encoding methods are discussed. A query-based search of the PubMed, ScienceDirect, Springer, Wiley Online Library, arXiv, MDPI, bioRxiv, and IEEE Xplore databases yielded 87 studies. Twelve additional studies were identified via citation searching. Of the articles in which machine learning was implemented, six prominent algorithms were identified: long short-term memory recurrent neural networks (LSTM-RNNs), variational autoencoders (VAEs), generative adversarial networks (GANs), adversarial autoencoders (AAEs), evolutionary algorithms, and gated recurrent unit (GRU-RNNs). Furthermore, eight central challenges were designated: homogeneity of generated molecular libraries, deficient synthesizability, limited assay data, model interpretability, incapacity for multi-property optimization, incomparability, restricted molecule size, and uncertainty in model evaluation. Molecules were encoded either as strings, which were occasionally augmented using randomization, as 2D graphs, or as 3D graphs. Statistical analysis and visualization are performed to illustrate how approaches to machine learning in de novo drug design have evolved over the past five years. Finally, future opportunities and reservations are discussed.

Collapse

Benegas G, Fischer J, Song YS. Robust and annotation-free analysis of alternative splicing across diverse cell types in mice. eLife 2022;11:73520. [PMID: 35229721 PMCID: PMC8975553 DOI: 10.7554/elife.73520] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 02/27/2022] [Indexed: 11/13/2022] Open

Spatial components of molecular tissue biology. Nat Biotechnol 2022;40:308-318. [PMID: 35132261 DOI: 10.1038/s41587-021-01182-1] [Citation(s) in RCA: 152] [Impact Index Per Article: 50.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2020] [Accepted: 12/03/2021] [Indexed: 02/06/2023]

Gayoso A, Lopez R, Xing G, Boyeau P, Valiollah Pour Amiri V, Hong J, Wu K, Jayasuriya M, Mehlman E, Langevin M, Liu Y, Samaran J, Misrachi G, Nazaret A, Clivio O, Xu C, Ashuach T, Gabitto M, Lotfollahi M, Svensson V, da Veiga Beltrame E, Kleshchevnikov V, Talavera-López C, Pachter L, Theis FJ, Streets A, Jordan MI, Regier J, Yosef N. A Python library for probabilistic analysis of single-cell omics data. Nat Biotechnol 2022;40:163-166. [DOI: 10.1038/s41587-021-01206-w] [Citation(s) in RCA: 37] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]

Greener JG, Kandathil SM, Moffat L, Jones DT. A guide to machine learning for biologists. Nat Rev Mol Cell Biol 2022;23:40-55. [PMID: 34518686 DOI: 10.1038/s41580-021-00407-0] [Citation(s) in RCA: 790] [Impact Index Per Article: 263.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/23/2021] [Indexed: 02/08/2023]

Interpretable Autoencoders Trained on Single Cell Sequencing Data Can Transfer Directly to Data from Unseen Tissues. Cells 2021;11:cells11010085. [PMID: 35011647 PMCID: PMC8750521 DOI: 10.3390/cells11010085] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 12/17/2021] [Accepted: 12/21/2021] [Indexed: 01/04/2023] Open

Kitano H. Nobel Turing Challenge: creating the engine for scientific discovery. NPJ Syst Biol Appl 2021;7:29. [PMID: 34145287 PMCID: PMC8213706 DOI: 10.1038/s41540-021-00189-3] [Citation(s) in RCA: 34] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 06/03/2021] [Indexed: 12/15/2022] Open

Osadchy M, Kolodny R. How Deep Learning Tools Can Help Protein Engineers Find Good Sequences. J Phys Chem B 2021;125:6440-6450. [PMID: 34105961 DOI: 10.1021/acs.jpcb.1c02449] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]

Yáñez Feliú G, Earle Gómez B, Codoceo Berrocal V, Muñoz Silva M, Nuñez IN, Matute TF, Arce Medina A, Vidal G, Vitalis C, Dahlin J, Federici F, Rudge TJ. Flapjack: Data Management and Analysis for Genetic Circuit Characterization. ACS Synth Biol 2021;10:183-191. [PMID: 33382586 DOI: 10.1021/acssynbio.0c00554] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Affiliation(s)

Guillermo Yáñez Feliú Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Benjamín Earle Gómez Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Verner Codoceo Berrocal Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Macarena Muñoz Silva Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Isaac N Nuñez Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
Tamara F Matute Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
Anibal Arce Medina ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
Gonzalo Vidal Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Carlos Vitalis Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile
Jonathan Dahlin The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, 2800 Kgs. Lyngby, Denmark
Fernán Federici Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile ANID - Millennium Science Initiative Program - Millennium Institute for Integrative Biology (iBio), Pontificia Universidad Católica de Chile, Santiago 8330005, Chile FONDAP, Center for Genome Regulation, Pontificia Universidad Católica de Chile, Santiago 8330005, Chile
Timothy J Rudge Department of Chemical and Bioprocess Engineering, School of Engineering, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile Institute for Biological and Medical Engineering, Schools of Engineering, Biology and Medicine, Pontificia Universidad Católica de Chile, Santiago 7820244, Chile

Collapse

Kell DB, Samanta S, Swainston N. Deep learning and generative methods in cheminformatics and chemical biology: navigating small molecule space intelligently. Biochem J 2020;477:4559-4580. [PMID: 33290527 PMCID: PMC7733676 DOI: 10.1042/bcj20200781] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2020] [Revised: 11/11/2020] [Accepted: 11/12/2020] [Indexed: 12/15/2022]

Lopez R, Gayoso A, Yosef N. Enhancing scientific discoveries in molecular biology with deep generative models. Mol Syst Biol 2020;16:e9198. [PMID: 32975352 PMCID: PMC7517326 DOI: 10.15252/msb.20199198] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2019] [Revised: 04/10/2020] [Accepted: 07/09/2020] [Indexed: 12/15/2022] Open