Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Skinnider MA, Stacey RG, Wishart DS, Foster LJ. Chemical language models enable navigation in sparsely populated chemical space. Nat Mach Intell 2021;3:759-70. [DOI: 10.1038/s42256-021-00368-1] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]

Cited by Other Article(s)

Wu JN, Wang T, Chen Y, Tang LJ, Wu HL, Yu RQ. t-SMILES: a fragment-based molecular representation framework for de novo ligand design. Nat Commun 2024;15:4993. [PMID: 38862578 PMCID: PMC11167009 DOI: 10.1038/s41467-024-49388-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2022] [Accepted: 06/04/2024] [Indexed: 06/13/2024] Open

van Tilborg D, Brinkmann H, Criscuolo E, Rossen L, Özçelik R, Grisoni F. Deep learning for low-data drug discovery: Hurdles and opportunities. Curr Opin Struct Biol 2024;86:102818. [PMID: 38669740 DOI: 10.1016/j.sbi.2024.102818] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2024] [Revised: 03/27/2024] [Accepted: 03/29/2024] [Indexed: 04/28/2024]

Das M, Ghosh A, Sunoj RB. Advances in machine learning with chemical language models in molecular property and reaction outcome predictions. J Comput Chem 2024;45:1160-1176. [PMID: 38299229 DOI: 10.1002/jcc.27315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 01/06/2024] [Accepted: 01/09/2024] [Indexed: 02/02/2024]

Abstract

Molecular properties and reactions form the foundation of chemical space. Over the years, innumerable molecules have been synthesized; a smaller fraction of them found immediate applications, while a larger proportion served as a testimony to the creative and empirical nature of chemical science. With increasing emphasis on sustainable practices, it is desirable that a target set of molecules be synthesized through fewer empirical attempts, rather than through a larger library, to realize an active candidate. On this front, predictive endeavors using machine learning (ML) models built on available data acquire timely significance. Prediction of molecular properties and reaction outcomes remains one of the burgeoning applications of ML in chemical science. Among several methods of encoding molecular samples for ML models, the ones that employ language-like representations are gaining steady popularity. Such representations would additionally help adopt well-developed natural language processing (NLP) models for chemical applications. Given this advantageous background, herein we describe several successful chemical applications of NLP focusing on molecular property and reaction outcome predictions. From relatively simpler recurrent neural networks (RNNs) to complex models like transformers, different network architectures have been leveraged for tasks such as de novo drug design, catalyst generation, and forward and retrosynthesis predictions. The chemical language model (CLM) provides promising avenues toward a broad range of applications in a time- and cost-effective manner. While we showcase an optimistic outlook of CLMs, attention is also placed on the persisting challenges in the reaction domain, which would optimistically be addressed by advanced algorithms tailored to chemical language and with increased availability of high-quality datasets.

Zhang G, Zhang Y, Li L, Zhou J, Chen H, Ji J, Li Y, Cao Y, Xu Z, Pian C. Exploring Novel Fentanyl Analogues Using a Graph-Based Transformer Model. Interdiscip Sci 2024:10.1007/s12539-024-00623-0. [PMID: 38683279 DOI: 10.1007/s12539-024-00623-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2023] [Revised: 02/23/2024] [Accepted: 02/25/2024] [Indexed: 05/01/2024]

Atz K, Cotos L, Isert C, Håkansson M, Focht D, Hilleke M, Nippa DF, Iff M, Ledergerber J, Schiebroek CCG, Romeo V, Hiss JA, Merk D, Schneider P, Kuhn B, Grether U, Schneider G. Prospective de novo drug design with deep interactome learning. Nat Commun 2024;15:3408. [PMID: 38649351 PMCID: PMC11035696 DOI: 10.1038/s41467-024-47613-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Accepted: 04/02/2024] [Indexed: 04/25/2024] Open

Affiliation(s)

Kenneth Atz ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Leandro Cotos ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Clemens Isert ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Maria Håkansson SARomics Biostructures AB, Medicon Village, SE-223 81, Lund, Sweden
Dorota Focht SARomics Biostructures AB, Medicon Village, SE-223 81, Lund, Sweden
Mattis Hilleke ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
David F Nippa Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland Department of Pharmacy, Ludwig-Maximilians-Universität München, Butenandtstrasse 5, 81377, Munich, Germany
Michael Iff ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Jann Ledergerber ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Carl C G Schiebroek ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Valentina Romeo Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
Jan A Hiss ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Daniel Merk Department of Pharmacy, Ludwig-Maximilians-Universität München, Butenandtstrasse 5, 81377, Munich, Germany
Petra Schneider ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland
Bernd Kuhn Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
Uwe Grether Roche Pharma Research and Early Development (pRED), Roche Innovation Center Basel, F. Hoffmann-La Roche Ltd., Grenzacherstrasse 124, CH-4070, Basel, Switzerland
Gisbert Schneider ETH Zurich, Department of Chemistry and Applied Biosciences, Vladimir-Prelog-Weg 4, 8093, Zurich, Switzerland.

Yao S, Song J, Jia L, Cheng L, Zhong Z, Song M, Feng Z. Fast and effective molecular property prediction with transferability map. Commun Chem 2024;7:85. [PMID: 38632308 PMCID: PMC11024153 DOI: 10.1038/s42004-024-01169-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2023] [Accepted: 04/05/2024] [Indexed: 04/19/2024] Open

Xu T, Gao W, Zhu L, Chen W, Niu C, Yin W, Ma L, Zhu X, Ling Y, Gao S, Liu L, Jiao N, Chen W, Zhang G, Zhu R, Wu D. NAFLDkb: A Knowledge Base and Platform for Drug Development against Nonalcoholic Fatty Liver Disease. J Chem Inf Model 2024;64:2817-2828. [PMID: 37167092 DOI: 10.1021/acs.jcim.3c00395] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]

Affiliation(s)

Tingjun Xu Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 LingLing Road, Shanghai 200032, P. R. China
Wenxing Gao Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Lixin Zhu Guangdong Institute of Gastroenterology; Guangdong Provincial Key Laboratory of Colorectal and Pelvic Floor Diseases; Biomedical Innovation Center, Sun Yat-sen University, Guangzhou 510655, P. R. China Department of General Surgery, The Sixth Affiliated Hospital of Sun Yat-sen University, Guangzhou 510655, P. R. China
Wanning Chen Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Chaoqun Niu Chinese Academy of Sciences Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of the Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P. R. China
Wenjing Yin Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Liangxiao Ma Chinese Academy of Sciences Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of the Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P. R. China
Xinyue Zhu Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Yunchao Ling Chinese Academy of Sciences Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of the Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P. R. China
Sheng Gao Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Lei Liu Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Na Jiao National Clinical Research Center for Child Health, the Children's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, Zhejiang, P. R. China
Weiming Chen Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences, 345 LingLing Road, Shanghai 200032, P. R. China
Guoqing Zhang Chinese Academy of Sciences Key Laboratory of Computational Biology, Bio-Med Big Data Center, Shanghai Institute of Nutrition and Health, University of the Chinese Academy of Sciences, Chinese Academy of Sciences, Shanghai 200031, P. R. China
Ruixin Zhu Putuo People's Hospital, School of Life Sciences and Technology, Tongji University, Shanghai 200060, P. R. China
Dingfeng Wu National Clinical Research Center for Child Health, the Children's Hospital, Zhejiang University School of Medicine, Hangzhou 310058, Zhejiang, P. R. China

Vogt M. Chemoinformatic approaches for navigating large chemical spaces. Expert Opin Drug Discov 2024;19:403-414. [PMID: 38300511 DOI: 10.1080/17460441.2024.2313475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/12/2023] [Accepted: 01/30/2024] [Indexed: 02/02/2024]

Loeffler HH, He J, Tibo A, Janet JP, Voronov A, Mervin LH, Engkvist O. Reinvent 4: Modern AI-driven generative molecule design. J Cheminform 2024;16:20. [PMID: 38383444 PMCID: PMC10882833 DOI: 10.1186/s13321-024-00812-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2023] [Accepted: 02/09/2024] [Indexed: 02/23/2024] Open

Gangwal A, Ansari A, Ahmad I, Azad AK, Kumarasamy V, Subramaniyan V, Wong LS. Generative artificial intelligence in drug discovery: basic framework, recent advances, challenges, and opportunities. Front Pharmacol 2024;15:1331062. [PMID: 38384298 PMCID: PMC10879372 DOI: 10.3389/fphar.2024.1331062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2023] [Accepted: 01/17/2024] [Indexed: 02/23/2024] Open

Abstract

There are two main ways to discover or design small drug molecules. The first involves fine-tuning existing molecules or commercially successful drugs through quantitative structure-activity relationships and virtual screening. The second approach involves generating new molecules through de novo drug design or inverse quantitative structure-activity relationship. Both methods aim to get a drug molecule with the best pharmacokinetic and pharmacodynamic profiles. However, bringing a new drug to market is an expensive and time-consuming endeavor, with the average cost being estimated at around $2.5 billion. One of the biggest challenges is screening the vast number of potential drug candidates to find one that is both safe and effective. The development of artificial intelligence in recent years has been phenomenal, ushering in a revolution in many fields. The field of pharmaceutical sciences has also significantly benefited from multiple applications of artificial intelligence, especially drug discovery projects. Artificial intelligence models are finding use in molecular property prediction, molecule generation, virtual screening, synthesis planning, repurposing, among others. Lately, generative artificial intelligence has gained popularity across domains for its ability to generate entirely new data, such as images, sentences, audios, videos, novel chemical molecules, etc. Generative artificial intelligence has also delivered promising results in drug discovery and development. This review article delves into the fundamentals and framework of various generative artificial intelligence models in the context of drug discovery via de novo drug design approach. Various basic and advanced models have been discussed, along with their recent applications. 
The review also explores recent examples and advances in the generative artificial intelligence approach, as well as the challenges and ongoing efforts to fully harness the potential of generative artificial intelligence in generating novel drug molecules in a faster and more affordable manner. Some clinical-level assets generated from generative artificial intelligence have also been discussed in this review to show the ever-increasing application of artificial intelligence in drug discovery through commercial partnerships.

Bajorath J. Chemical language models for molecular design. Mol Inform 2024;43:e202300288. [PMID: 38010610 DOI: 10.1002/minf.202300288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Revised: 11/22/2023] [Accepted: 11/23/2023] [Indexed: 11/29/2023]

Wang F, Pasin D, Skinnider MA, Liigand J, Kleis JN, Brown D, Oler E, Sajed T, Gautam V, Harrison S, Greiner R, Foster LJ, Dalsgaard PW, Wishart DS. Deep Learning-Enabled MS/MS Spectrum Prediction Facilitates Automated Identification Of Novel Psychoactive Substances. Anal Chem 2023;95:18326-18334. [PMID: 38048435 PMCID: PMC10733899 DOI: 10.1021/acs.analchem.3c02413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2023] [Revised: 11/10/2023] [Accepted: 11/13/2023] [Indexed: 12/06/2023]

Affiliation(s)

Fei Wang Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada Alberta Machine Intelligence Institute, Edmonton, Alberta T5J 3B1, Canada
Daniel Pasin Section of Forensic Chemistry, Department of Forensic Medicine, University of Copenhagen, Copenhagen 2100, Denmark
Michael A. Skinnider Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, New Jersey 08544, United States Ludwig Institute for Cancer Research, Princeton University, Princeton, New Jersey 08544, United States
Jaanus Liigand Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada Institute of Chemistry, University of Tartu, Tartu 50411, Estonia
Jan-Niklas Kleis Institute of Forensic Medicine, Forensic Toxicology, Johannes Gutenberg University Mainz, Mainz 55131, Germany
David Brown Forensic Science Laboratory, ChemCentre, Bentley, Western Australia 6102, Australia School of Molecular and Life Sciences, Curtin University, Bentley, Western Australia 6009, Australia
Eponine Oler Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
Tanvir Sajed Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
Vasuk Gautam Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada
Stephen Harrison Forensic Science Laboratory, ChemCentre, Bentley, Western Australia 6102, Australia
Russell Greiner Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada Alberta Machine Intelligence Institute, Edmonton, Alberta T5J 3B1, Canada
Leonard J. Foster Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada Department of Biochemistry and Molecular Biology, University of British Columbia, Vancouver, British Columbia V6T 2A1, Canada
Petur Weihe Dalsgaard Section of Forensic Chemistry, Department of Forensic Medicine, University of Copenhagen, Copenhagen 2100, Denmark
David S. Wishart Department of Computing Science, University of Alberta, Edmonton, Alberta T6G 2E8, Canada Department of Biological Sciences, University of Alberta, Edmonton, Alberta T6G 2E9, Canada Department of Laboratory Medicine and Pathology, University of Alberta, Edmonton, Alberta T6G 1C9, Canada Faculty of Pharmacy and Pharmaceutical Sciences, University of Alberta, Edmonton, Alberta T6G 2C8, Canada Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99354, United States

Skinnider MA, Mérette SAM, Pasin D, Rogalski J, Foster LJ, Scheuermeyer F, Shapiro AM. Identification of Emerging Novel Psychoactive Substances by Retrospective Analysis of Population-Scale Mass Spectrometry Data Sets. Anal Chem 2023;95:17300-17310. [PMID: 37966487 DOI: 10.1021/acs.analchem.3c03451] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2023]

Abstract

Over the last two decades, hundreds of new psychoactive substances (NPSs), also known as "designer drugs", have emerged on the illicit drug market. The toxic and potentially fatal effects of these compounds oblige laboratories around the world to screen for NPS in seized materials and biological samples, commonly using high-resolution mass spectrometry. However, unambiguous identification of a NPS by mass spectrometry requires comparison to data from analytical reference materials, acquired on the same instrument. The sheer number of NPSs that are available on the illicit market, and the pace at which new compounds are introduced, means that forensic laboratories must make difficult decisions about which reference materials to acquire. Here, we asked whether retrospective suspect screening of population-scale mass spectrometry data could provide a data-driven platform to prioritize emerging NPSs for assay development. We curated a suspect database of precursor and diagnostic fragment ion masses for 83 emerging NPSs and used this database to retrospectively screen mass spectrometry data from 12,727 urine drug screens from one Canadian province. We developed integrative computational strategies to prioritize the most reliable identifications and tracked the frequency of these identifications over a 3 year study period between August 2019 and August 2022. The resulting data were used to guide the acquisition of new reference materials, which were in turn used to validate a subset of the retrospective identifications. Last, we took advantage of matching clinical reports for all 12,727 samples to systematically benchmark the accuracy of our retrospective data analysis approach. Our work opens up new avenues to enable the rapid detection of emerging illicit drugs through large-scale reanalysis of mass spectrometry data.

Ochiai T, Inukai T, Akiyama M, Furui K, Ohue M, Matsumori N, Inuki S, Uesugi M, Sunazuka T, Kikuchi K, Kakeya H, Sakakibara Y. Variational autoencoder-based chemical latent space for large molecular structures with 3D complexity. Commun Chem 2023;6:249. [PMID: 37973971 PMCID: PMC10654724 DOI: 10.1038/s42004-023-01054-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2023] [Accepted: 11/06/2023] [Indexed: 11/19/2023] Open

Xia S, Chen E, Zhang Y. Integrated Molecular Modeling and Machine Learning for Drug Design. J Chem Theory Comput 2023;19:7478-7495. [PMID: 37883810 PMCID: PMC10653122 DOI: 10.1021/acs.jctc.3c00814] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 10/10/2023] [Accepted: 10/11/2023] [Indexed: 10/28/2023]

Kim GB, Kim JY, Lee JA, Norsigian CJ, Palsson BO, Lee SY. Functional annotation of enzyme-encoding genes using deep learning with transformer layers. Nat Commun 2023;14:7370. [PMID: 37963869 PMCID: PMC10645960 DOI: 10.1038/s41467-023-43216-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 11/03/2023] [Indexed: 11/16/2023] Open

Affiliation(s)

Gi Bae Kim Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Ji Yeon Kim Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Jong An Lee Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea
Charles J Norsigian Division of Biological Sciences, University of California San Diego, La Jolla, CA, 92093, USA Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA
Bernhard O Palsson Department of Bioengineering, University of California San Diego, La Jolla, CA, 92093, USA Bioinformatics and Systems Biology Program, University of California San Diego, La Jolla, CA, 92093, USA Novo Nordisk Foundation Center for Biosustainability, 2800, Kongens Lyngby, Denmark
Sang Yup Lee Metabolic and Biomolecular Engineering National Research Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea. Systems Metabolic Engineering and Systems Healthcare Cross-Generation Collaborative Laboratory, Department of Chemical and Biomolecular Engineering (BK21 four), KAIST, Daejeon, 34141, Republic of Korea. KAIST Institute for the BioCentury and KAIST Institute for Artificial Intelligence, KAIST, Daejeon, 34141, Republic of Korea. BioProcess Engineering Research Center and BioInformatics Research Center, KAIST, Daejeon, 34141, Republic of Korea.

Skinnider MA. Hallucinating hallucinogens. Science 2023;382:656-657. [PMID: 37943903 DOI: 10.1126/science.adk8626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]

Wang H, Fu T, Du Y, Gao W, Huang K, Liu Z, Chandak P, Liu S, Van Katwyk P, Deac A, Anandkumar A, Bergen K, Gomes CP, Ho S, Kohli P, Lasenby J, Leskovec J, Liu TY, Manrai A, Marks D, Ramsundar B, Song L, Sun J, Tang J, Veličković P, Welling M, Zhang L, Coley CW, Bengio Y, Zitnik M. Scientific discovery in the age of artificial intelligence. Nature 2023;620:47-60. [PMID: 37532811 DOI: 10.1038/s41586-023-06221-2] [Citation(s) in RCA: 69] [Impact Index Per Article: 69.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Accepted: 05/16/2023] [Indexed: 08/04/2023]

Affiliation(s)

Hanchen Wang Department of Engineering, University of Cambridge, Cambridge, UK Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA Department of Research and Early Development, Genentech Inc, South San Francisco, CA, USA Department of Computer Science, Stanford University, Stanford, CA, USA
Tianfan Fu Department of Computational Science and Engineering, Georgia Institute of Technology, Atlanta, GA, USA
Yuanqi Du Department of Computer Science, Cornell University, Ithaca, NY, USA
Wenhao Gao Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
Kexin Huang Department of Computer Science, Stanford University, Stanford, CA, USA
Ziming Liu Department of Physics, Massachusetts Institute of Technology, Cambridge, MA, USA
Payal Chandak Harvard-MIT Program in Health Sciences and Technology, Cambridge, MA, USA
Shengchao Liu Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Peter Van Katwyk Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA Data Science Institute, Brown University, Providence, RI, USA
Andreea Deac Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Anima Anandkumar Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA, USA NVIDIA, Santa Clara, CA, USA
Karianne Bergen Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, RI, USA Data Science Institute, Brown University, Providence, RI, USA
Carla P Gomes Department of Computer Science, Cornell University, Ithaca, NY, USA
Shirley Ho Center for Computational Astrophysics, Flatiron Institute, New York, NY, USA Department of Astrophysical Sciences, Princeton University, Princeton, NJ, USA Department of Physics, Carnegie Mellon University, Pittsburgh, PA, USA Department of Physics and Center for Data Science, New York University, New York, NY, USA
Pushmeet Kohli Google DeepMind, London, UK
Joan Lasenby Department of Engineering, University of Cambridge, Cambridge, UK
Jure Leskovec Department of Computer Science, Stanford University, Stanford, CA, USA
Tie-Yan Liu Microsoft Research, Beijing, China
Arjun Manrai Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
Debora Marks Department of Systems Biology, Harvard Medical School, Boston, MA, USA Broad Institute of MIT and Harvard, Cambridge, MA, USA
Bharath Ramsundar Deep Forest Sciences, Palo Alto, CA, USA
Le Song BioMap, Beijing, China Mohamed bin Zayed University of Artificial Intelligence, Abu Dhabi, United Arab Emirates
Jimeng Sun University of Illinois at Urbana-Champaign, Champaign, IL, USA
Jian Tang Mila - Quebec AI Institute, Montreal, Quebec, Canada HEC Montréal, Montreal, Quebec, Canada CIFAR AI Chair, Toronto, Ontario, Canada
Petar Veličković Google DeepMind, London, UK Department of Computer Science and Technology, University of Cambridge, Cambridge, UK
Max Welling University of Amsterdam, Amsterdam, Netherlands Microsoft Research Amsterdam, Amsterdam, Netherlands
Linfeng Zhang DP Technology, Beijing, China AI for Science Institute, Beijing, China
Connor W Coley Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
Yoshua Bengio Mila - Quebec AI Institute, Montreal, Quebec, Canada Université de Montréal, Montreal, Quebec, Canada
Marinka Zitnik Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA. Broad Institute of MIT and Harvard, Cambridge, MA, USA. Harvard Data Science Initiative, Cambridge, MA, USA. Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA, USA.

Tay DWP, Yeo NZX, Adaikkappan K, Lim YH, Ang SJ. 67 million natural product-like compound database generated via molecular language processing. Sci Data 2023;10:296. [PMID: 37208372 DOI: 10.1038/s41597-023-02207-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 04/21/2023] [Indexed: 05/21/2023] Open

Yoshimori A, Bajorath J. Motif2Mol: Prediction of New Active Compounds Based on Sequence Motifs of Ligand Binding Sites in Proteins Using a Biochemical Language Model. Biomolecules 2023;13:biom13050833. [PMID: 37238703 DOI: 10.3390/biom13050833] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2023] [Revised: 05/05/2023] [Accepted: 05/12/2023] [Indexed: 05/28/2023] Open

Chen H, Bajorath J. Designing highly potent compounds using a chemical language model. Sci Rep 2023;13:7412. [PMID: 37150793 PMCID: PMC10164739 DOI: 10.1038/s41598-023-34683-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2023] [Accepted: 05/05/2023] [Indexed: 05/09/2023] Open

Grisoni F. Chemical language models for de novo drug design: Challenges and opportunities. Curr Opin Struct Biol 2023;79:102527. [PMID: 36738564 DOI: 10.1016/j.sbi.2023.102527] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Revised: 12/07/2022] [Accepted: 12/20/2022] [Indexed: 02/05/2023]

Seo S, Lim J, Kim WY. Molecular Generative Model via Retrosynthetically Prepared Chemical Building Block Assembly. Adv Sci (Weinh) 2023;10:e2206674. [PMID: 36596675 PMCID: PMC10015872 DOI: 10.1002/advs.202206674] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2022] [Indexed: 06/17/2023]

Bajorath J. Generative kinase inhibitor modeling viewed from a medicinal chemistry perspective. Future Med Chem 2023;15:313-315. [PMID: 36892087 DOI: 10.4155/fmc-2023-0029] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/10/2023] Open

Molecular Blueprinting by Word Processing. Symmetry (Basel) 2023. [DOI: 10.3390/sym15020357] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open

Moret M, Pachon Angona I, Cotos L, Yan S, Atz K, Brunner C, Baumgartner M, Grisoni F, Schneider G. Leveraging molecular structure and bioactivity with chemical language models for de novo drug design. Nat Commun 2023;14:114. [PMID: 36611029 PMCID: PMC9825622 DOI: 10.1038/s41467-022-35692-6] [Citation(s) in RCA: 24] [Impact Index Per Article: 24.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 12/19/2022] [Indexed: 01/09/2023] Open

Atz K, Guba W, Grether U, Schneider G. Machine Learning and Computational Chemistry for the Endocannabinoid System. Methods Mol Biol 2023;2576:477-493. [PMID: 36152211 DOI: 10.1007/978-1-0716-2728-0_39] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]

Chemical language models for applications in medicinal chemistry. Future Med Chem 2023;15:119-121. [PMID: 36727442 DOI: 10.4155/fmc-2022-0315] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open

Li C, Wang C, Sun M, Zeng Y, Yuan Y, Gou Q, Wang G, Guo Y, Pu X. Correlated RNN Framework to Quickly Generate Molecules with Desired Properties for Energetic Materials in the Low Data Regime. J Chem Inf Model 2022;62:4873-4887. [PMID: 35998331 DOI: 10.1021/acs.jcim.2c00997] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]

Woodward DJ, Bradley AR, van Hoorn WP. Coverage Score: A Model Agnostic Method to Efficiently Explore Chemical Space. J Chem Inf Model 2022;62:4391-4402. [PMID: 35867814 DOI: 10.1021/acs.jcim.2c00258] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Blay V, Radivojevic T, Allen JE, Hudson CM, Garcia Martin H. MACAW: An Accessible Tool for Molecular Embedding and Inverse Molecular Design. J Chem Inf Model 2022;62:3551-3564. [PMID: 35857932 PMCID: PMC9364320 DOI: 10.1021/acs.jcim.2c00229] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]

Yoshimori A, Bajorath J. DeepAS - Chemical language model for the extension of active analogue series. Bioorg Med Chem 2022;66:116808. [PMID: 35567984 DOI: 10.1016/j.bmc.2022.116808] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2022] [Revised: 04/28/2022] [Accepted: 05/04/2022] [Indexed: 11/30/2022]

Yoshimori A, Bajorath J. Computational analysis, alignment and extension of analogue series from medicinal chemistry. Future Sci OA 2022;8:FSO804. [PMID: 36248066 PMCID: PMC9540237 DOI: 10.2144/fsoa-2022-0033] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/09/2022] [Accepted: 06/10/2022] [Indexed: 11/23/2022] Open

Singh S, Sunoj RB. A Transfer Learning Approach for Reaction Discovery in Small Data Situations Using Generative Model. iScience 2022;25:104661. [PMID: 35832891 PMCID: PMC9272387 DOI: 10.1016/j.isci.2022.104661] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2022] [Revised: 05/20/2022] [Accepted: 06/16/2022] [Indexed: 11/01/2022] Open

Moret M, Grisoni F, Katzberger P, Schneider G. Perplexity-Based Molecule Ranking and Bias Estimation of Chemical Language Models. J Chem Inf Model 2022;62:1199-1206. [PMID: 35191696 PMCID: PMC8924923 DOI: 10.1021/acs.jcim.2c00079] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Geometric deep learning on molecular representations. Nat Mach Intell 2021. [DOI: 10.1038/s42256-021-00418-8] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]

A deep generative model enables automated structure elucidation of novel psychoactive substances. Nat Mach Intell 2021. [DOI: 10.1038/s42256-021-00407-x] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

High-confidence structural annotation of metabolites absent from spectral libraries. Nat Biotechnol 2021;40:411-421. [PMID: 34650271 PMCID: PMC8926923 DOI: 10.1038/s41587-021-01045-9] [Citation(s) in RCA: 89] [Impact Index Per Article: 29.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2021] [Accepted: 08/04/2021] [Indexed: 12/14/2022]

Chen JM, Zovko M, Šimurina N, Zovko V. Fear in a Handful of Dust: The Epidemiological, Environmental, and Economic Drivers of Death by PM2.5 Pollution. Int J Environ Res Public Health 2021;18:8688. [PMID: 34444435 PMCID: PMC8393768 DOI: 10.3390/ijerph18168688] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 08/03/2021] [Accepted: 08/14/2021] [Indexed: 01/13/2023]