1
|
Cheng P, Jin Y, Wang D, Tao S. Design and computational screening of high-energy, low-sensitivity bistetrazole-based energetic molecules. RSC Adv 2025; 15:11645-11654. [PMID: 40230632 PMCID: PMC11995157 DOI: 10.1039/d5ra01604e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2025] [Accepted: 04/02/2025] [Indexed: 04/16/2025] Open
Abstract
Bistetrazole-based compounds are novel high-nitrogen energetic molecules that have garnered attention in recent years. They possess a higher energy density and lower sensitivity, and are less challenging to synthesize than complex cage-like molecules. This study employed a molecular auto-generation mechanism to generate 35 322 bistetrazole-based molecules with 20 bridgeheads and 29 side substituents. A combination of quantum chemical calculations and machine learning models was used to sequentially screen the molecules based on their oxygen balance index, synthesis difficulty, density, and detonation pressure, thus rapidly narrowing the search scope. Three bistetrazole-based energetic molecules with high potential were identified. The theoretical enthalpy of the formation of the designed molecules was as high as 854.76 kJ mol-1 and their detonation velocity reached 9.58 km s-1. Further calculations also demonstrated that these molecules have better macroscopic stability than trinitrotoluene, making them promising candidates for practical applications in developing energetic materials.
Collapse
Affiliation(s)
- Peihao Cheng
- School of Chemistry, Dalian University of Technology Dalian 116024 Liaoning China
| | - Yunhe Jin
- School of Chemistry, Dalian University of Technology Dalian 116024 Liaoning China
| | - Dongqi Wang
- School of Chemistry, Dalian University of Technology Dalian 116024 Liaoning China
| | - Shengyang Tao
- School of Chemistry, Dalian University of Technology Dalian 116024 Liaoning China
| |
Collapse
|
2
|
Zhou G, Li T, Du J, He C, Yang Y, Chen G, Li J, Shen B, Pu W, Zhang J, Gu Z. OmicsCam Enables Trimodal Profiling of Mitochondrial Genome Editing. Anal Chem 2025; 97:7047-7054. [PMID: 40132106 DOI: 10.1021/acs.analchem.4c05251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/27/2025]
Abstract
Mitochondrial DNA (mtDNA) editing can generate cellular and animal models of mitochondrial genetic disorders and holds promise for future ex vivo and in vivo therapeutic applications. However, due to the quantitative nature of mitochondrion genetics, as more base-editing tools evolve, it is crucial to evaluate not only their efficiency and specificity on the sequence level but also the resulting molecular phenotypes. Here, we devised a novel Omics Carrier microcapsule, abbreviated as OmicsCam, that achieves homogeneous reactions within a heterogeneous carrier membrane, enabling highly efficient multistep biochemistry workflows. Incorporating magnetic beads into the carrier enables high-throughput automation. We demonstrated simultaneous trimodal assessment of mtDNA editing efficiency, postediting cellular transcriptome, and chromatin accessibility in minute cell samples containing as few as 25,000 cells. Applying OmicsCam to two TALE-DdCBE-edited human cell lines revealed that ND4 gene knockdown led to the downregulation of the mitochondrial oxidative phosphorylation pathway and changes in NF-Y transcription factor-associated histone modification pathways in the cell nucleus. Our study provides the most comprehensive analysis of mitochondrial gene editing efficiency and molecular phenotypes to date, which not only facilitates the establishment of mitochondrial genotype-molecular phenotype relationships but also helps assess the global safety of mitochondrial genome nucleases prior to clinical use.
Collapse
Affiliation(s)
- Guoqiang Zhou
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
- HKUST Fok Ying Tung Research Institute, Guangzhou 511458, China
| | - Ting Li
- Human Phenome Institute, Fudan University, Shanghai 200438, China
| | - Jingjing Du
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
| | - Chengpeng He
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
| | - Yu Yang
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
| | - Guanju Chen
- School of Life Sciences, Fudan University, Shanghai 200438, China
| | - Jie Li
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
| | - Bin Shen
- State Key Laboratory of Reproductive Medicine, Center for Global Health, Nanjing Medical University, Nanjing 211166, China
| | - Weilin Pu
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
| | - Jingwei Zhang
- School of Life Sciences, Fudan University, Shanghai 200438, China
- Zhejiang Lab, Hangzhou 310000, China
| | - Zhenglong Gu
- Center for Mitochondrial Genetics and Health, Greater Bay Area Institute of Precision Medicine (Guangzhou), Fudan University, Guangzhou 511458, China
- School of Life Sciences, Fudan University, Shanghai 200438, China
| |
Collapse
|
3
|
Fan L, Shen Y, Lou D, Gu N. Progress in the Computer-Aided Analysis in Multiple Aspects of Nanocatalysis Research. Adv Healthc Mater 2025; 14:e2401576. [PMID: 38936401 DOI: 10.1002/adhm.202401576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 06/08/2024] [Indexed: 06/29/2024]
Abstract
Making the utmost of the differences and advantages of multiple disciplines, interdisciplinary integration breaks the science boundaries and accelerates the progress in mutual quests. As an organic connection of material science, enzymology, and biomedicine, nanozyme-related research is further supported by computer technology, which injects in new vitality, and contributes to in-depth understanding, unprecedented insights, and broadened application possibilities. Utilizing computer-aided first-principles method, high-speed and high-throughput mathematic, physic, and chemic models are introduced to perform atomic-level kinetic analysis for nanocatalytic reaction process, and theoretically illustrate the underlying nanozymetic mechanism and structure-function relationship. On this basis, nanozymes with desirable properties can be designed and demand-oriented synthesized without repeated trial-and-error experiments. Besides that, computational analysis and device also play an indispensable role in nanozyme-based detecting methods to realize automatic readouts with improved accuracy and reproducibility. Here, this work focuses on the crossing of nanocatalysis research and computational technology, to inspire the research in computer-aided analysis in nanozyme field to a greater extent.
Collapse
Affiliation(s)
- Lin Fan
- Medical School of Nanjing University, Nanjing, 210093, P. R. China
- School of Integrated Circuit Science and Engineering (Industry-Education Integration School), Nanjing University of Posts and Telecommunications, Nanjing, 210023, P. R. China
| | - Yilei Shen
- School of Integrated Circuit Science and Engineering (Industry-Education Integration School), Nanjing University of Posts and Telecommunications, Nanjing, 210023, P. R. China
| | - Doudou Lou
- Nanjing Institute for Food and Drug Control, Nanjing, 211198, P. R. China
| | - Ning Gu
- Medical School of Nanjing University, Nanjing, 210093, P. R. China
| |
Collapse
|
4
|
Gabrieli G, Manica M, Cadow‐Gossweiler J, Ruch PW. Digital Fingerprinting of Complex Liquids Using a Reconfigurable Multi-Sensor System with Foundation Models. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2407513. [PMID: 39373824 PMCID: PMC11600221 DOI: 10.1002/advs.202407513] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 08/27/2024] [Indexed: 10/08/2024]
Abstract
Combining chemical sensor arrays with machine learning enables designing intelligent systems to perform complex sensing tasks and unveil properties that are not directly accessible through conventional analytical chemistry. However, personalized and portable sensor systems are typically unsuitable for the generation of extensive data sets, thereby limiting the ability to train large models in the chemical sensing realm. Foundation models have demonstrated unprecedented zero-shot learning capabilities on various data structures and modalities, in particular for language and vision. Transfer learning from such models is explored by providing a framework to create effective data representations for chemical sensors and ultimately describe a novel, generalizable approach for AI-assisted chemical sensing. The translation of signals produced by remarkably simple and portable multi-sensor systems into visual fingerprints of liquid samples under test is demonstrated, and it is illustrated that how a pipeline incorporating pretrained vision models yields> 95 % $>95\%$ average classification accuracy in four unrelated chemical sensing tasks with limited domain-specific training measurements. This approach matches or outperforms expert-curated sensor signal features, thereby providing a generalization of data processing for ultimate ease-of-use and broad applicability to enable interpretation of multi-signal outputs for generic sensing applications.
Collapse
Affiliation(s)
| | - Matteo Manica
- IBM Research EuropeSäumerstrasse 4Rüschlikon8803Switzerland
| | | | | |
Collapse
|
5
|
Lu JM, Wang HF, Guo QH, Wang JW, Li TT, Chen KX, Zhang MT, Chen JB, Shi QN, Huang Y, Shi SW, Chen GY, Pan JZ, Lu Z, Fang Q. Roboticized AI-assisted microfluidic photocatalytic synthesis and screening up to 10,000 reactions per day. Nat Commun 2024; 15:8826. [PMID: 39396057 PMCID: PMC11470948 DOI: 10.1038/s41467-024-53204-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 10/04/2024] [Indexed: 10/14/2024] Open
Abstract
The current throughput of conventional organic chemical synthesis is usually a few experiments for each operator per day. We develop a robotic system for ultra-high-throughput chemical synthesis, online characterization, and large-scale condition screening of photocatalytic reactions, based on the liquid-core waveguide, microfluidic liquid-handling, and artificial intelligence techniques. The system is capable of performing automated reactant mixture preparation, changing, introduction, ultra-fast photocatalytic reactions in seconds, online spectroscopic detection of the reaction product, and screening of different reaction conditions. We apply the system in large-scale screening of 12,000 reaction conditions of a photocatalytic [2 + 2] cycloaddition reaction including multiple continuous and discrete variables, reaching an ultra-high throughput up to 10,000 reaction conditions per day. Based on the data, AI-assisted cross-substrate/photocatalyst prediction is conducted.
Collapse
Affiliation(s)
- Jia-Min Lu
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Hui-Feng Wang
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Qi-Hang Guo
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Jian-Wei Wang
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Tong-Tong Li
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Ke-Xin Chen
- The Research Center for Life Sciences Computing, Zhejiang Lab, Hangzhou, China
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, New Territories, Hong Kong, China
| | - Meng-Ting Zhang
- Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Jian-Bo Chen
- Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Qian-Nuan Shi
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Yi Huang
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Shao-Wen Shi
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Guang-Yong Chen
- The Research Center for Life Sciences Computing, Zhejiang Lab, Hangzhou, China.
| | - Jian-Zhang Pan
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.
| | - Zhan Lu
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China.
| | - Qun Fang
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.
- Key Laboratory of Excited-State Materials of Zhejiang Province, Zhejiang University, Hangzhou, China.
| |
Collapse
|
6
|
Liang W, Zheng S, Shu Y, Huang J. Machine Learning Optimizing Enzyme/ZIF Biocomposites for Enhanced Encapsulation Efficiency and Bioactivity. JACS AU 2024; 4:3170-3182. [PMID: 39211601 PMCID: PMC11350574 DOI: 10.1021/jacsau.4c00485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 09/04/2024]
Abstract
In this study, we present the first example of using a machine learning (ML)-assisted design strategy to optimize the synthesis formulation of enzyme/ZIFs (zeolitic imidazolate framework) for enhanced performance. Glucose oxidase (GOx) and horseradish peroxidase (HRP) were chosen as model enzymes, while Zn(eIM)2 (eIM = 2-ethylimidazolate) was selected as the model ZIF to test our ML-assisted workflow paradigm. Through an iterative ML-driven training-design-synthesis-measurement workflow, we efficiently discovered GOx/ZIF (G151) and HRP/ZIF (H150) with their overall performance index (OPI) values (OPI represents the product of encapsulation efficiency (E in %), retained enzymatic activity (A in %), and thermal stability (T in %)) at least 1.3 times higher than those in systematic seed data studies. Furthermore, advanced statistical methods derived from the trained random forest model qualitatively and quantitatively reveal the relationship among synthesis, structure, and performance in the enzyme/ZIF system, offering valuable guidance for future studies on enzyme/ZIFs. Overall, our proposed ML-assisted design strategy holds promise for accelerating the development of enzyme/ZIFs and other enzyme immobilization systems for biocatalysis applications and beyond, including drug delivery and sensing, among others.
Collapse
Affiliation(s)
- Weibin Liang
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| | | | - Ying Shu
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| | - Jun Huang
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| |
Collapse
|
7
|
Wu NP, Wang W, Gadiagellan D, Counsell M, Hamidi NK, Koike Y, Nguyen HQ. An Automated Robotic Interface for Assays: Facilitating Machine Learning in Drug Discovery by the Automation of Physicochemical Property Assays. ACS OMEGA 2024; 9:24948-24958. [PMID: 38882107 PMCID: PMC11170699 DOI: 10.1021/acsomega.4c02003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 05/14/2024] [Accepted: 05/17/2024] [Indexed: 06/18/2024]
Abstract
Measuring the physicochemical properties of molecules is an iterative but integral process in the drug development process. A strategy to overcome the challenges in maximizing assay throughput relies on the usage of in silico machine learning (ML) prediction models trained on experimental data. Consequently, the performance of these in silico models are dependent on the quality of the utilized experimental data. To improve the data quality, we have designed and implemented an automated robotic system to prepare and run physicochemical property assays (Automated Robotic Interface for Assays, ARIA) with an increase in sample throughput of 6 to10-fold. Through this process, we overcame major challenges and achieved consistent reproducible assay data compared to semiautomated assay preparation.
Collapse
Affiliation(s)
- Newton P Wu
- Analytical Research, Discovery Chemistry Department, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Wenyi Wang
- Drug Metabolism and Pharmacokinetics, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | | | - Mike Counsell
- UK Robotics, Inc., Manchester BL5 3EH, United Kingdom
| | - Nikkia K Hamidi
- Analytical Research, Discovery Chemistry Department, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Yuko Koike
- Analytical Research, Discovery Chemistry Department, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| | - Huy Q Nguyen
- Analytical Research, Discovery Chemistry Department, Genentech, Inc., 1 DNA Way, South San Francisco, California 94080, United States
| |
Collapse
|
8
|
Xerxa E, Bajorath J. Data-oriented protein kinase drug discovery. Eur J Med Chem 2024; 271:116413. [PMID: 38636127 DOI: 10.1016/j.ejmech.2024.116413] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 04/06/2024] [Accepted: 04/11/2024] [Indexed: 04/20/2024]
Abstract
The continued growth of data from biological screening and medicinal chemistry provides opportunities for data-driven experimental design and decision making in early-phase drug discovery. Approaches adopted from data science help to integrate internal and public domain data and extract knowledge from historical in-house data. Protein kinase (PK) drug discovery is an exemplary area where large amounts of data are accumulating, providing a valuable knowledge base for discovery projects. Herein, the evolution of PK drug discovery and development of small molecular PK inhibitors (PKIs) is reviewed, highlighting milestone developments in the field and discussing exemplary studies providing a basis for increasing data orientation of PK discovery efforts.
Collapse
Affiliation(s)
- Elena Xerxa
- Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Lamarr Institute for Machine Learning and Artificial Intelligence, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, D-53115, Bonn, Germany
| | - Jürgen Bajorath
- Department of Life Science Informatics and Data Science, B-IT, LIMES Program Unit Chemical Biology and Medicinal Chemistry, Lamarr Institute for Machine Learning and Artificial Intelligence, Rheinische Friedrich-Wilhelms-Universität, Friedrich-Hirzebruch-Allee 5/6, D-53115, Bonn, Germany.
| |
Collapse
|
9
|
Mahjour BA, Coley CW. Automation of air-free synthesis. Nat Rev Chem 2024; 8:300-301. [PMID: 38605148 DOI: 10.1038/s41570-024-00599-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]
Affiliation(s)
- Babak A Mahjour
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Connor W Coley
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
10
|
Leonov AI, Hammer AJS, Lach S, Mehr SHM, Caramelli D, Angelone D, Khan A, O'Sullivan S, Craven M, Wilbraham L, Cronin L. An integrated self-optimizing programmable chemical synthesis and reaction engine. Nat Commun 2024; 15:1240. [PMID: 38336880 PMCID: PMC10858227 DOI: 10.1038/s41467-024-45444-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 01/22/2024] [Indexed: 02/12/2024] Open
Abstract
Robotic platforms for chemistry are developing rapidly but most systems are not currently able to adapt to changing circumstances in real-time. We present a dynamically programmable system capable of making, optimizing, and discovering new molecules which utilizes seven sensors that continuously monitor the reaction. By developing a dynamic programming language, we demonstrate the 10-fold scale-up of a highly exothermic oxidation reaction, end point detection, as well as detecting critical hardware failures. We also show how the use of in-line spectroscopy such as HPLC, Raman, and NMR can be used for closed-loop optimization of reactions, exemplified using Van Leusen oxazole synthesis, a four-component Ugi condensation and manganese-catalysed epoxidation reactions, as well as two previously unreported reactions, discovered from a selected chemical space, providing up to 50% yield improvement over 25-50 iterations. Finally, we demonstrate an experimental pipeline to explore a trifluoromethylations reaction space, that discovers new molecules.
Collapse
Affiliation(s)
- Artem I Leonov
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Alexander J S Hammer
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Slawomir Lach
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - S Hessam M Mehr
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Dario Caramelli
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Davide Angelone
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Aamir Khan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Steven O'Sullivan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Matthew Craven
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Liam Wilbraham
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Leroy Cronin
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK.
| |
Collapse
|
11
|
Bai J, Mosbach S, Taylor CJ, Karan D, Lee KF, Rihm SD, Akroyd J, Lapkin AA, Kraft M. A dynamic knowledge graph approach to distributed self-driving laboratories. Nat Commun 2024; 15:462. [PMID: 38263405 PMCID: PMC10805810 DOI: 10.1038/s41467-023-44599-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 12/21/2023] [Indexed: 01/25/2024] Open
Abstract
The ability to integrate resources and share knowledge across organisations empowers scientists to expedite the scientific discovery process. This is especially crucial in addressing emerging global challenges that require global solutions. In this work, we develop an architecture for distributed self-driving laboratories within The World Avatar project, which seeks to create an all-encompassing digital twin based on a dynamic knowledge graph. We employ ontologies to capture data and material flows in design-make-test-analyse cycles, utilising autonomous agents as executable knowledge components to carry out the experimentation workflow. Data provenance is recorded to ensure its findability, accessibility, interoperability, and reusability. We demonstrate the practical application of our framework by linking two robots in Cambridge and Singapore for a collaborative closed-loop optimisation for a pharmaceutically-relevant aldol condensation reaction in real-time. The knowledge graph autonomously evolves toward the scientist's research goals, with the two robots effectively generating a Pareto front for cost-yield optimisation in three days.
Collapse
Affiliation(s)
- Jiaru Bai
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
| | - Sebastian Mosbach
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore
| | - Connor J Taylor
- Astex Pharmaceuticals, 436 Cambridge Science Park Milton Road, Cambridge, CB4 0QA, UK
- Innovation Centre in Digital Molecular Technologies, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
- Faculty of Engineering, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | - Dogancan Karan
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore
| | - Kok Foong Lee
- CMCL Innovations, Sheraton House, Cambridge, CB3 0AX, UK
| | - Simon D Rihm
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore
| | - Jethro Akroyd
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore
| | - Alexei A Lapkin
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore
- Innovation Centre in Digital Molecular Technologies, Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Markus Kraft
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge, CB3 0AS, UK.
- Cambridge Centre for Advanced Research and Education in Singapore (CARES), 1 Create Way, CREATE Tower, #05-05, Singapore, 138602, Singapore.
- School of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, 637459, Singapore, Singapore.
- The Alan Turing Institute, London, NW1 2DB, UK.
| |
Collapse
|
12
|
Foote S, Sinhadc P, Mathis C, Walker SI. False Positives and the Challenge of Testing the Alien Hypothesis. ASTROBIOLOGY 2023; 23:1189-1201. [PMID: 37962842 DOI: 10.1089/ast.2023.0005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
The origin of life and the detection of alien life have historically been treated as separate scientific research problems. However, they are not strictly independent. Here, we discuss the need for a better integration of the sciences of life detection and origins of life. Framing these dual problems within the formalism of Bayesian hypothesis testing, we demonstrate via simple examples how high confidence in life detection claims require either (1) a strong prior hypothesis about the existence of life in a particular alien environment, or conversely, (2) signatures of life that are not susceptible to false positives. As a case study, we discuss the role of priors and hypothesis testing in recent results reporting potential detection of life in the venusian atmosphere and in the icy plumes of Enceladus. While many current leading biosignature candidates are subject to false positives because they are not definitive of life, our analyses demonstrate why it is necessary to shift focus to candidate signatures that are definitive. This indicates a necessity to develop methods that lack substantial false positives, by using observables for life that rely on prior hypotheses with strong theoretical and empirical support in identifying defining features of life. Abstract theories developed in pursuit of understanding universal features of life are more likely to be definitive and to apply to life-as-we-don't-know-it. We discuss Molecular Assembly theory as an example of such an observable which is applicable to life detection within the solar system. In the absence of alien examples these are best validated in origin of life experiments, substantiating the need for better integration between origins of life and biosignature science research communities. This leads to a conclusion that extraordinary claims in astrobiology (e.g., definitive detection of alien life) require extraordinary explanations, whereas the evidence itself could be quite ordinary.
Collapse
Affiliation(s)
- Searra Foote
- School of Earth and Space Exploration, Arizona State University, Tempe, Arizona, USA
| | - Pritvik Sinhadc
- Beyond Center for Fundamental Concepts in Science, Arizona State University, Tempe, Arizona, USA
- Dubai College, Dubai, UAE
| | - Cole Mathis
- Beyond Center for Fundamental Concepts in Science, Arizona State University, Tempe, Arizona, USA
- Santa Fe Institute, Santa Fe, New Mexico, USA
| | - Sara Imari Walker
- School of Earth and Space Exploration, Arizona State University, Tempe, Arizona, USA
- Beyond Center for Fundamental Concepts in Science, Arizona State University, Tempe, Arizona, USA
- Santa Fe Institute, Santa Fe, New Mexico, USA
- Blue Marble Space Institute for Science, Seattle, Washington, USA
- ASU-SFI Center for Biosocial Complex Systems, Arizona State University, Tempe, Arizona, USA
| |
Collapse
|
13
|
Du MH, Dai Y, Jiang LP, Su YM, Qi MQ, Wang C, Long LS, Zheng LS, Kong XJ. Exploration and Insights on Topology Adjustment of Giant Heterometallic Cages Featuring Inorganic Skeletons Assisted by Machine Learning. J Am Chem Soc 2023; 145:23188-23195. [PMID: 37820275 DOI: 10.1021/jacs.3c07635] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/13/2023]
Abstract
Inorganic molecular cages are emerging multifunctional molecular-based platforms with the unique merits of rigid skeletons and inherited properties from constituent metal ions. However, the sensitive coordination bonds and vast synthetic space have limited their systematic exploration. Herein, two giant cage-like clusters featuring the organic ligand-directed inorganic skeletons of Ni4[La74Ni104(IDA)96(OH)184(C2O4)12(H2O)76]·(NO3)38·(H2O)120 (La74Ni104, 5 × 5 × 3 - C2O4) and [La84Ni132(IDA)108(OH)168(C2O4)24(NO3)12(H2O)116]·(NO3)72·(H2O)296 (La84Ni132, 5 × 5 × 5 - C2O4) were discovered by a high-throughput synthetic search. With the assistance of machine learning analysis of the experimental data, phase diagrams of the two clusters in a four-parameter synthetic space were depicted. The effect of alkali, oxalate, and other parameters on the formation of clusters and the mechanism regulating the size of two n × m × l clusters were elucidated. This work uses high-throughput synthesis and machine learning methods to improve the efficiency of 3d-4f cluster discovery and finds the highest-nuclearity 3d-4f cluster to date by regulating the size of the n × m × l inorganic cages through oxalate ions, which pushes the synthetic methodology study on elusive inorganic giant cages in a significantly systematic way.
Collapse
Affiliation(s)
- Ming-Hao Du
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yiheng Dai
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Lin-Peng Jiang
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Yu-Ming Su
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Ming-Qiang Qi
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Cheng Wang
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - La-Sheng Long
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Lan-Sun Zheng
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| | - Xiang-Jian Kong
- Collaborative Innovation Center of Chemistry for Energy Materials, State Key Laboratory of Physical Chemistry of Solid Surfaces and Department of Chemistry, College of Chemistry and Chemical Engineering, Xiamen University, Xiamen 361005, China
| |
Collapse
|
14
|
Salley D, Manzano JS, Kitson PJ, Cronin L. Robotic Modules for the Programmable Chemputation of Molecules and Materials. ACS CENTRAL SCIENCE 2023; 9:1525-1537. [PMID: 37637738 PMCID: PMC10450877 DOI: 10.1021/acscentsci.3c00304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Indexed: 08/29/2023]
Abstract
Before leveraging big data methods like machine learning and artificial intelligence (AI) in chemistry, there is an imperative need for an affordable, universal digitization standard. This mirrors the foundational requisites of the digital revolution, which demanded standard architectures with precise specifications. Recently, we have developed automated platforms tailored for chemical AI-driven exploration, including the synthesis of molecules, materials, nanomaterials, and formulations. Our focus has been on designing and constructing affordable standard hardware and software modules that serve as a blueprint for chemistry digitization across varied fields. Our platforms can be categorized into four types based on their applications: (i) discovery systems for the exploration of chemical space and novel reactivity, (ii) systems for the synthesis and manufacture of fine chemicals, (iii) platforms for formulation discovery and exploration, and (iv) systems for materials discovery and synthesis. We also highlight the convergent evolution of these platforms through shared hardware, firmware, and software alongside the creation of a unique programming language for chemical and material systems. This programming approach is essential for reliable synthesis, designing experiments, discovery, optimization, and establishing new collaboration standards. Furthermore, it is crucial for verifying literature findings, enhancing experimental outcome reliability, and fostering collaboration and sharing of unsuccessful experiments across different research labs.
Collapse
Affiliation(s)
- Daniel Salley
- School of Chemistry, University
of Glasgow, University Avenue, Glasgow G12 8QQ, U.K.
| | - J. Sebastián Manzano
- School of Chemistry, University
of Glasgow, University Avenue, Glasgow G12 8QQ, U.K.
| | - Philip J. Kitson
- School of Chemistry, University
of Glasgow, University Avenue, Glasgow G12 8QQ, U.K.
| | - Leroy Cronin
- School of Chemistry, University
of Glasgow, University Avenue, Glasgow G12 8QQ, U.K.
| |
Collapse
|
15
|
Zhang XE, Liu C, Dai J, Yuan Y, Gao C, Feng Y, Wu B, Wei P, You C, Wang X, Si T. Enabling technology and core theory of synthetic biology. SCIENCE CHINA. LIFE SCIENCES 2023; 66:1742-1785. [PMID: 36753021 PMCID: PMC9907219 DOI: 10.1007/s11427-022-2214-2] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 10/04/2022] [Indexed: 02/09/2023]
Abstract
Synthetic biology provides a new paradigm for life science research ("build to learn") and opens the future journey of biotechnology ("build to use"). Here, we discuss advances of various principles and technologies in the mainstream of the enabling technology of synthetic biology, including synthesis and assembly of a genome, DNA storage, gene editing, molecular evolution and de novo design of function proteins, cell and gene circuit engineering, cell-free synthetic biology, artificial intelligence (AI)-aided synthetic biology, as well as biofoundries. We also introduce the concept of quantitative synthetic biology, which is guiding synthetic biology towards increased accuracy and predictability or the real rational design. We conclude that synthetic biology will establish its disciplinary system with the iterative development of enabling technologies and the maturity of the core theory.
Collapse
Affiliation(s)
- Xian-En Zhang
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, 518055, China.
- National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Chenli Liu
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, 518055, China.
- Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| | - Junbiao Dai
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, 518055, China.
- Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| | - Yingjin Yuan
- Frontiers Science Center for Synthetic Biology and Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300072, China.
| | - Caixia Gao
- State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Yan Feng
- State Key Laboratory of Microbial Metabolism, Shanghai Jiao Tong University, Shanghai, 200240, China.
| | - Bian Wu
- State Key Laboratory of Microbial Resources, Institute of Microbiology, Chinese Academy of Sciences, Beijing, 100101, China.
| | - Ping Wei
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, 518055, China.
- Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| | - Chun You
- Tianjin Institute of Industrial Biotechnology, Chinese Academy of Sciences, Tianjin, 300308, China.
| | - Xiaowo Wang
- Ministry of Education Key Laboratory of Bioinformatics; Center for Synthetic and Systems Biology; Bioinformatics Division, Beijing National Research Center for Information Science and Technology; Department of Automation, Tsinghua University, Beijing, 100084, China.
| | - Tong Si
- Faculty of Synthetic Biology, Shenzhen Institute of Advanced Technology, Shenzhen, 518055, China.
- Institute of Synthetic Biology, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, China.
| |
Collapse
|
16
|
Tay DWP, Yeo NZX, Adaikkappan K, Lim YH, Ang SJ. 67 million natural product-like compound database generated via molecular language processing. Sci Data 2023; 10:296. [PMID: 37208372 DOI: 10.1038/s41597-023-02207-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2023] [Accepted: 04/21/2023] [Indexed: 05/21/2023] Open
Abstract
Natural products are a rich resource of bioactive compounds for valuable applications across multiple fields such as food, agriculture, and medicine. For natural product discovery, high throughput in silico screening offers a cost-effective alternative to traditional resource-heavy assay-guided exploration of structurally novel chemical space. In this data descriptor, we report a characterized database of 67,064,204 natural product-like molecules generated using a recurrent neural network trained on known natural products, demonstrating a significant 165-fold expansion in library size over the approximately 400,000 known natural products. This study highlights the potential of using deep generative models to explore novel natural product chemical space for high throughput in silico discovery.
Collapse
Affiliation(s)
- Dillon W P Tay
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore.
| | - Naythan Z X Yeo
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore
- Hwa Chong Institution, 661 Bukit Timah Road, Singapore, 269734, Republic of Singapore
| | - Krishnan Adaikkappan
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore
- National Junior College, 37 Hillcrest Road, Singapore, 288913, Republic of Singapore
| | - Yee Hwee Lim
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore
- Synthetic Biology Translational Research Program, Yong Loo Lin School of Medicine, National University of Singapore, 10 Medical Drive, Singapore, 117597, Republic of Singapore
| | - Shi Jun Ang
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science, Technology and Research (A*STAR), 8 Biomedical Grove, #07-01 Neuros Building, Singapore, 138665, Republic of Singapore.
- Institute of High Performance Computing (IHPC), Agency for Science, Technology and Research (A*STAR), 1 Fusionopolis Way, #16-16 Connexis, Singapore, 138632, Republic of Singapore.
| |
Collapse
|
17
|
Jablonka K, Rosen AS, Krishnapriyan AS, Smit B. An Ecosystem for Digital Reticular Chemistry. ACS CENTRAL SCIENCE 2023; 9:563-581. [PMID: 37122448 PMCID: PMC10141625 DOI: 10.1021/acscentsci.2c01177] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The vastness of the materials design space makes it impractical to explore using traditional brute-force methods, particularly in reticular chemistry. However, machine learning has shown promise in expediting and guiding materials design. Despite numerous successful applications of machine learning to reticular materials, progress in the field has stagnated, possibly because digital chemistry is more an art than a science and its limited accessibility to inexperienced researchers. To address this issue, we present mofdscribe, a software ecosystem tailored to novice and seasoned digital chemists that streamlines the ideation, modeling, and publication process. Though optimized for reticular chemistry, our tools are versatile and can be used in nonreticular materials research. We believe that mofdscribe will enable a more reliable, efficient, and comparable field of digital chemistry.
Collapse
Affiliation(s)
- Kevin
Maik Jablonka
- Laboratory of molecular simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Rue de l’Industrie 17, CH-1951 Sion, Switzerland
| | - Andrew S. Rosen
- Department of Materials
Science and Engineering, University of California, Berkeley, California 94720, United States
- Miller Institute for Basic Research in Science, University of California, Berkeley, California 94720, United States
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Aditi S. Krishnapriyan
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
- Department of Electrical Engineering and
Computer Science, University of California, Berkeley, California 94720, United States
- Computational
Research Division, Lawrence Berkeley National
Laboratory, Berkeley, California 94720, United States
| | - Berend Smit
- Laboratory of molecular simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Rue de l’Industrie 17, CH-1951 Sion, Switzerland
| |
Collapse
|
18
|
Volk AA, Epps RW, Yonemoto DT, Masters BS, Castellano FN, Reyes KG, Abolhasani M. AlphaFlow: autonomous discovery and optimization of multi-step chemistry using a self-driven fluidic lab guided by reinforcement learning. Nat Commun 2023; 14:1403. [PMID: 36918561 PMCID: PMC10015005 DOI: 10.1038/s41467-023-37139-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 03/02/2023] [Indexed: 03/16/2023] Open
Abstract
Closed-loop, autonomous experimentation enables accelerated and material-efficient exploration of large reaction spaces without the need for user intervention. However, autonomous exploration of advanced materials with complex, multi-step processes and data sparse environments remains a challenge. In this work, we present AlphaFlow, a self-driven fluidic lab capable of autonomous discovery of complex multi-step chemistries. AlphaFlow uses reinforcement learning integrated with a modular microdroplet reactor capable of performing reaction steps with variable sequence, phase separation, washing, and continuous in-situ spectral monitoring. To demonstrate the power of reinforcement learning toward high dimensionality multi-step chemistries, we use AlphaFlow to discover and optimize synthetic routes for shell-growth of core-shell semiconductor nanoparticles, inspired by colloidal atomic layer deposition (cALD). Without prior knowledge of conventional cALD parameters, AlphaFlow successfully identified and optimized a novel multi-step reaction route, with up to 40 parameters, that outperformed conventional sequences. Through this work, we demonstrate the capabilities of closed-loop, reinforcement learning-guided systems in exploring and solving challenges in multi-step nanoparticle syntheses, while relying solely on in-house generated data from a miniaturized microfluidic platform. Further application of AlphaFlow in multi-step chemistries beyond cALD can lead to accelerated fundamental knowledge generation as well as synthetic route discoveries and optimization.
Collapse
Affiliation(s)
- Amanda A Volk
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA
| | - Robert W Epps
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA
| | - Daniel T Yonemoto
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Benjamin S Masters
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Felix N Castellano
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Kristofer G Reyes
- Department of Materials Design and Innovation, University at Buffalo, Buffalo, NY, 14260, USA
| | - Milad Abolhasani
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA.
| |
Collapse
|
19
|
Tan B, Zhang J, Xiao C, Liu Y, Yang X, Wang W, Li Y, Liu N. Progress of Artificial Intelligence in Drug Synthesis and Prospect of Its Application in Nitrification of Energetic Materials. Molecules 2023; 28:1900. [PMID: 36838887 PMCID: PMC9963094 DOI: 10.3390/molecules28041900] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2022] [Revised: 02/12/2023] [Accepted: 02/13/2023] [Indexed: 02/19/2023] Open
Abstract
Artificial intelligence technology shows the advantages of improving efficiency, reducing costs, shortening time, reducing the number of staff on site and achieving precise operations, making impressive research progress in the fields of drug discovery and development, but there are few reports on application in energetic materials. This paper addresses the high safety risks in the current nitrification process of energetic materials, comprehensively analyses and summarizes the main safety risks and their control elements in the nitrification process, proposes possibilities and suggestions for using artificial intelligence technology to enhance the "essential safety" of the nitrification process in energetic materials, reviews the research progress of artificial intelligence in the field of drug synthesis, looks forward to the application prospects of artificial intelligence technology in the nitrification of energetic materials and provides support and guidance for the safe processing of nitrification in the propellants and explosives industry.
Collapse
Affiliation(s)
- Bojun Tan
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Jing Zhang
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Chuan Xiao
- Academy of Ordnance Science, Beijing 100089, China
| | - Yingzhe Liu
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Xiong Yang
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Wei Wang
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Yanan Li
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| | - Ning Liu
- Xi’an Modern Chemistry Research Institute, Xi’an 710065, China
| |
Collapse
|
20
|
Davies JC, Pattison D, Hirst JD. Machine learning for yield prediction for chemical reactions using in situ sensors. J Mol Graph Model 2023; 118:108356. [PMID: 36272195 DOI: 10.1016/j.jmgm.2022.108356] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2022] [Revised: 09/30/2022] [Accepted: 09/30/2022] [Indexed: 11/28/2022]
Abstract
Machine learning models were developed to predict product formation from time-series reaction data for ten Buchwald-Hartwig coupling reactions. The data was provided by DeepMatter and was collected in their DigitalGlassware cloud platform. The reaction probe has 12 sensors to measure properties of interest, including temperature, pressure, and colour. Colour was a good predictor of product formation for this reaction and machine learning models were able to learn which of the properties were important. Predictions for the current product formation (in terms of % yield) had a mean absolute error of 1.2%. For predicting 30, 60 and 120 min ahead the error rose to 3.4, 4.1 and 4.6%, respectively. The work here presents an example into the insight that can be obtained from applying machine learning methods to sensor data in synthetic chemistry.
Collapse
Affiliation(s)
- Joseph C Davies
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK
| | | | - Jonathan D Hirst
- School of Chemistry, University of Nottingham, University Park, Nottingham, NG7 2RD, UK.
| |
Collapse
|
21
|
Wang W, Liu Y, Wang Z, Hao G, Song B. The way to AI-controlled synthesis: how far do we need to go? Chem Sci 2022; 13:12604-12615. [PMID: 36519036 PMCID: PMC9645373 DOI: 10.1039/d2sc04419f] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/26/2022] [Indexed: 09/08/2024] Open
Abstract
Chemical synthesis always plays an irreplaceable role in chemical, materials, and pharmacological fields. Meanwhile, artificial intelligence (AI) is causing a rapid technological revolution in many fields by replacing manual chemical synthesis and has exhibited a much more economical and time-efficient manner. However, the rate-determining step of AI-controlled synthesis systems is rarely mentioned, which makes it difficult to apply them in general laboratories. Here, the history of developing AI-aided synthesis has been overviewed and summarized. We propose that the hardware of AI-controlled synthesis systems should be more adaptive to execute reactions with different phase reagents and under different reaction conditions, and the software of AI-controlled synthesis systems should have richer kinds of reaction prediction modules. An updated system will better address more different kinds of syntheses. Our viewpoint could help scientists advance the revolution that combines AI and synthesis to achieve more progress in complicated systems.
Collapse
Affiliation(s)
- Wei Wang
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University Guiyang 550025 P. R. China
| | - Yingwei Liu
- State Key Laboratory of Public Big Data, Guizhou University Guiyang 550025 P. R. China
| | - Zheng Wang
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University Guiyang 550025 P. R. China
| | - Gefei Hao
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University Guiyang 550025 P. R. China
| | - Baoan Song
- State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University Guiyang 550025 P. R. China
| |
Collapse
|
22
|
Harris Y, Sason H, Niezni D, Shamay Y. Automated discovery of nanomaterials via drug aggregation induced emission. Biomaterials 2022; 289:121800. [PMID: 36166893 DOI: 10.1016/j.biomaterials.2022.121800] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2022] [Revised: 08/30/2022] [Accepted: 09/07/2022] [Indexed: 12/19/2022]
Abstract
Nanoformulations of small molecule drugs are essential to effectively deliver them and treat a wide range of diseases. They are normally complex to develop, lack predictability, and exhibit low drug loading. Recently, nanoparticles made via co-assembly of hydrophobic drugs and organic dyes, exhibited drug-loading of up to 90% with high predictability from the drug structure. However, these particles have relatively short stability and can formulate only a small fraction of the drug space. Here, we developed an automated workflow to synthesize and select novel dye stabilizers, based on their ability to inhibit drug aggregation-induced emission (AIE). We first screened and identified 10 drugs with previously unknown strong AIE activity and exploited this trait to automatically synthesize and select a new ultra-stabilizer named R595. Interestingly, it shares several synthetic similarities and advantages with polydopamine. We found that R595 is superior to myriad types of excipients and solubilizers such as cyclodextrins, poloxamers, albumin, and previously published organic dyes, in both long-term stability and drug compatibility. We investigated the biodistribution, pharmacokinetics, safety and efficacy of the AIEgenic MEK inhibitor trametinib-R595 nanoparticles in vitro and in vivo and demonstrated that they are non-toxic and effective in KRAS driven colon and lung cancer models.
Collapse
Affiliation(s)
- Yuval Harris
- Faculty of Biomedical Engineering, Technion - Israel Institute of Technology, Haifa, Israel
| | - Hagit Sason
- Faculty of Biomedical Engineering, Technion - Israel Institute of Technology, Haifa, Israel
| | - Danna Niezni
- Faculty of Biomedical Engineering, Technion - Israel Institute of Technology, Haifa, Israel
| | - Yosi Shamay
- Faculty of Biomedical Engineering, Technion - Israel Institute of Technology, Haifa, Israel.
| |
Collapse
|
23
|
Seifrid M, Pollice R, Aguilar-Granda A, Morgan Chan Z, Hotta K, Ser CT, Vestfrid J, Wu TC, Aspuru-Guzik A. Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving Lab. Acc Chem Res 2022; 55:2454-2466. [PMID: 35948428 PMCID: PMC9454899 DOI: 10.1021/acs.accounts.2c00220] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Indexed: 01/19/2023]
Abstract
We must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take months to years. However, data-driven automated laboratories, or self-driving laboratories, can significantly accelerate molecular and materials discovery. Recently, substantial advancements have been made in the areas of machine learning and optimization algorithms that have allowed researchers to extract valuable knowledge from multidimensional data sets. Machine learning models can be trained on large data sets from the literature or databases, but their performance can often be hampered by a lack of negative results or metadata. In contrast, data generated by self-driving laboratories can be information-rich, containing precise details of the experimental conditions and metadata. Consequently, much larger amounts of high-quality data are gathered in self-driving laboratories. When placed in open repositories, this data can be used by the research community to reproduce experiments, for more in-depth analysis, or as the basis for further investigation. Accordingly, high-quality open data sets will increase the accessibility and reproducibility of science, which is sorely needed.In this Account, we describe our efforts to build a self-driving lab for the development of a new class of materials: organic semiconductor lasers (OSLs). Since they have only recently been demonstrated, little is known about the molecular and material design rules for thin-film, electrically-pumped OSL devices as compared to other technologies such as organic light-emitting diodes or organic photovoltaics. To realize high-performing OSL materials, we are developing a flexible system for automated synthesis via iterative Suzuki-Miyaura cross-coupling reactions. This automated synthesis platform is directly coupled to the analysis and purification capabilities. Subsequently, the molecules of interest can be transferred to an optical characterization setup. We are currently limited to optical measurements of the OSL molecules in solution. However, material properties are ultimately most important in the solid state (e.g., as a thin-film device). To that end and for a different scientific goal, we are developing a self-driving lab for inorganic thin-film materials focused on the oxygen evolution reaction.While the future of self-driving laboratories is very promising, numerous challenges still need to be overcome. These challenges can be split into cognition and motor function. Generally, the cognitive challenges are related to optimization with constraints or unexpected outcomes for which general algorithmic solutions have yet to be developed. A more practical challenge that could be resolved in the near future is that of software control and integration because few instrument manufacturers design their products with self-driving laboratories in mind. Challenges in motor function are largely related to handling heterogeneous systems, such as dispensing solids or performing extractions. As a result, it is critical to understand that adapting experimental procedures that were designed for human experimenters is not as simple as transferring those same actions to an automated system, and there may be more efficient ways to achieve the same goal in an automated fashion. Accordingly, for self-driving laboratories, we need to carefully rethink the translation of manual experimental protocols.
Collapse
Affiliation(s)
- Martin Seifrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Robert Pollice
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | | | - Zamyla Morgan Chan
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Acceleration
Consortium, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Kazuhiro Hotta
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Science
& Innovation Center, Mitsubishi Chemical
Corporation, 1000 Kamoshidacho, Aoba, Yokohama, Kanagawa 227-8502, Japan
| | - Cher Tian Ser
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Jenya Vestfrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Tony C. Wu
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Vector
Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research, Toronto, Ontario M5S 1M1, Canada
| |
Collapse
|
24
|
High-throughput discovery of chemical structure-polarity relationships combining automation and machine-learning techniques. Chem 2022. [DOI: 10.1016/j.chempr.2022.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
|
25
|
Rohrbach S, Šiaučiulis M, Chisholm G, Pirvan PA, Saleeb M, Mehr SHM, Trushina E, Leonov AI, Keenan G, Khan A, Hammer A, Cronin L. Digitization and validation of a chemical synthesis literature database in the ChemPU. Science 2022; 377:172-180. [DOI: 10.1126/science.abo0058] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Despite huge potential, automation of synthetic chemistry has only made incremental progress over the past few decades. We present an automatically executable chemical reaction database of 100 molecules representative of the range of reactions found in contemporary organic synthesis. These reactions include transition metal–catalyzed coupling reactions, heterocycle formations, functional group interconversions, and multicomponent reactions. The chemical reaction codes or χDLs for the reactions have been stored in a database for version control, validation, collaboration, and data mining. Of these syntheses, more than 50 entries from the database have been downloaded and robotically run in seven modular ChemPU’s with yields and purities comparable to those achieved by an expert chemist. We also demonstrate the automatic purification of a range of compounds using a chromatography module seamlessly coupled to the platform and programmed with the same language.
Collapse
Affiliation(s)
- Simon Rohrbach
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Mindaugas Šiaučiulis
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Greig Chisholm
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Petrisor-Alin Pirvan
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Michael Saleeb
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - S. Hessam M. Mehr
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Ekaterina Trushina
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Artem I. Leonov
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Graham Keenan
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Aamir Khan
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Alexander Hammer
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| | - Leroy Cronin
- School of Chemistry, the University of Glasgow, University Avenue, Glasgow G12 8QQ, UK
| |
Collapse
|
26
|
Otović E, Njirjak M, Kalafatovic D, Mauša G. Sequential Properties Representation Scheme for Recurrent Neural Network-Based Prediction of Therapeutic Peptides. J Chem Inf Model 2022; 62:2961-2972. [PMID: 35704881 DOI: 10.1021/acs.jcim.2c00526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The discovery of therapeutic peptides is often accelerated by means of virtual screening supported by machine learning-based predictive models. The predictive performance of such models is sensitive to the choice of data and its representation scheme. While the peptide physicochemical and compositional representations fail to distinguish sequence permutations, the amino acid arrangement within the sequence lacks the important information contained in physicochemical, conformational, topological, and geometrical properties. In this paper, we propose a solution to the identified information gap by implementing a hybrid scheme that complements the best traits from both approaches with the aim of predicting antimicrobial and antiviral activities based on experimental data from DRAMP 2.0, AVPdb, and Uniprot data repositories. Using the Friedman test of statistical significance, we compared our hybrid, sequential properties approach to peptide properties, one-hot vector encoding, and word embedding schemes in the 10-fold cross-validation setting, with respect to the F1 score, Matthews correlation coefficient, geometric mean, recall, and precision evaluation metrics. Moreover, the sequence modeling neural network was employed to gain insight into the synergic effect of both properties- and amino acid order-based predictions. The results suggest that sequential properties significantly (P < 0.01) surpasses the aforementioned state-of-the-art representation schemes. This makes it a strong candidate for increasing the predictive power of screening methods based on machine learning, applicable to any category of peptides.
Collapse
Affiliation(s)
- Erik Otović
- University of Rijeka, Faculty of Engineering, 51000 Rijeka, Croatia
| | - Marko Njirjak
- University of Rijeka, Faculty of Engineering, 51000 Rijeka, Croatia
| | - Daniela Kalafatovic
- University of Rijeka, Department of Biotechnology, 51000 Rijeka, Croatia.,University of Rijeka, Center for Artificial Intelligence and Cybersecurity, 51000 Rijeka, Croatia
| | - Goran Mauša
- University of Rijeka, Faculty of Engineering, 51000 Rijeka, Croatia.,University of Rijeka, Center for Artificial Intelligence and Cybersecurity, 51000 Rijeka, Croatia
| |
Collapse
|
27
|
Das A, Weise C, Polack M, Urban RD, Krafft B, Hasan S, Westphal H, Warias R, Schmidt S, Gulder T, Belder D. On-the-Fly Mass Spectrometry in Digital Microfluidics Enabled by a Microspray Hole: Toward Multidimensional Reaction Monitoring in Automated Synthesis Platforms. J Am Chem Soc 2022; 144:10353-10360. [PMID: 35640072 DOI: 10.1021/jacs.2c01651] [Citation(s) in RCA: 22] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We report an approach for the online coupling of digital microfluidics (DMF) with mass spectrometry (MS) using a chip-integrated microspray hole (μSH). The technique uses an adapted electrostatic spray ionization (ESTASI) method to spray a portion of a sample droplet through a microhole in the cover plate, allowing its chemical content to be analyzed by MS. This eliminates the need for chip disassembly or the introduction of capillary emitters for MS analysis, as required by state-of-the-art. For the first time, this allows the essential advantage of a DMF device─free droplet movement─to be retained during MS analysis. The broad applicability of the developed seamless coupling of DMF and mass spectrometry was successfully applied to the study of various on-chip organic syntheses as well as protein and peptide analysis. In the case of a Hantzsch synthesis, we were able to show that the method is very well suited for monitoring even rapid chemical reactions that are completed in a few seconds. In addition, the strength of the low resource consumption in such on-chip microsyntheses was demonstrated by the example of enzymatic brominations, for which only a minute amount of a special haloperoxidase is required in the droplet. The unique selling point of this approach is that the analyzed droplet remains completely movable after the MS measurement and is available for subsequent on-DMF chip processes. This is illustrated here for the example of MS analysis of the starting materials in the corresponding droplets before they are combined to investigate the reaction progress by DMF-MS further. This technology enables the ongoing and almost unlimited tracking of multistep chemical processes in a DMF chip and offers exciting prospects for transforming digital microfluidics into automated synthesis platforms.
Collapse
Affiliation(s)
- Anish Das
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Chris Weise
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Matthias Polack
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Raphael D Urban
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Benjamin Krafft
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Sadat Hasan
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Hannes Westphal
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Rico Warias
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| | - Simon Schmidt
- Institute of Organic Chemistry, Leipzig University, Johannisallee 29, 04103 Leipzig, Germany
| | - Tanja Gulder
- Institute of Organic Chemistry, Leipzig University, Johannisallee 29, 04103 Leipzig, Germany
| | - Detlev Belder
- Institute of Analytical Chemistry, Leipzig University, Linnéstraße 3, 04103 Leipzig, Germany
| |
Collapse
|
28
|
Shim E, Kammeraad JA, Xu Z, Tewari A, Cernak T, Zimmerman PM. Predicting reaction conditions from limited data through active transfer learning. Chem Sci 2022; 13:6655-6668. [PMID: 35756521 PMCID: PMC9172577 DOI: 10.1039/d1sc06932b] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2021] [Accepted: 05/10/2022] [Indexed: 12/30/2022] Open
Abstract
Transfer and active learning have the potential to accelerate the development of new chemical reactions, using prior data and new experiments to inform models that adapt to the target area of interest. This article shows how specifically tuned machine learning models, based on random forest classifiers, can expand the applicability of Pd-catalyzed cross-coupling reactions to types of nucleophiles unknown to the model. First, model transfer is shown to be effective when reaction mechanisms and substrates are closely related, even when models are trained on relatively small numbers of data points. Then, a model simplification scheme is tested and found to provide comparative predictivity on reactions of new nucleophiles that include unseen reagent combinations. Lastly, for a challenging target where model transfer only provides a modest benefit over random selection, an active transfer learning strategy is introduced to improve model predictions. Simple models, composed of a small number of decision trees with limited depths, are crucial for securing generalizability, interpretability, and performance of active transfer learning.
Collapse
Affiliation(s)
- Eunjae Shim
- Department of Chemistry, University of Michigan Ann Arbor MI USA
| | - Joshua A Kammeraad
- Department of Chemistry, University of Michigan Ann Arbor MI USA
- Department of Statistics, University of Michigan Ann Arbor MI USA
| | - Ziping Xu
- Department of Statistics, University of Michigan Ann Arbor MI USA
| | - Ambuj Tewari
- Department of Statistics, University of Michigan Ann Arbor MI USA
- Department of Electrical Engineering and Computer Science, University of Michigan Ann Arbor MI USA
| | - Tim Cernak
- Department of Chemistry, University of Michigan Ann Arbor MI USA
- Department of Medicinal Chemistry, University of Michigan Ann Arbor MI USA
| | - Paul M Zimmerman
- Department of Chemistry, University of Michigan Ann Arbor MI USA
| |
Collapse
|
29
|
|
30
|
Warr WA, Nicklaus MC, Nicolaou CA, Rarey M. Exploration of Ultralarge Compound Collections for Drug Discovery. J Chem Inf Model 2022; 62:2021-2034. [PMID: 35421301 DOI: 10.1021/acs.jcim.2c00224] [Citation(s) in RCA: 75] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Designing new medicines more cheaply and quickly is tightly linked to the quest of exploring chemical space more widely and efficiently. Chemical space is monumentally large, but recent advances in computer software and hardware have enabled researchers to navigate virtual chemical spaces containing billions of chemical structures. This review specifically concerns collections of many millions or even billions of enumerated chemical structures as well as even larger chemical spaces that are not fully enumerated. We present examples of chemical libraries and spaces and the means used to construct them, and we discuss new technologies for searching huge libraries and for searching combinatorially in chemical space. We also cover space navigation techniques and consider new approaches to de novo drug design and the impact of the "autonomous laboratory" on synthesis of designed compounds. Finally, we summarize some other challenges and opportunities for the future.
Collapse
Affiliation(s)
- Wendy A Warr
- Wendy Warr & Associates, 6 Berwick Court, Holmes Chapel, Crewe, Cheshire CW4 7HZ, United Kingdom
| | - Marc C Nicklaus
- NCI, NIH, CADD Group, NCI-Frederick, Frederick, Maryland 21702, United States
| | - Christos A Nicolaou
- Discovery Chemistry, Lilly Research Laboratories, Eli Lilly and Company, Indianapolis, Indiana 46285, United States
| | - Matthias Rarey
- Universität Hamburg, ZBH Center for Bioinformatics, 20146 Hamburg, Germany
| |
Collapse
|
31
|
Bai J, Cao L, Mosbach S, Akroyd J, Lapkin AA, Kraft M. From Platform to Knowledge Graph: Evolution of Laboratory Automation. JACS AU 2022; 2:292-309. [PMID: 35252980 PMCID: PMC8889618 DOI: 10.1021/jacsau.1c00438] [Citation(s) in RCA: 21] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2021] [Indexed: 05/19/2023]
Abstract
High-fidelity computer-aided experimentation is becoming more accessible with the development of computing power and artificial intelligence tools. The advancement of experimental hardware also empowers researchers to reach a level of accuracy that was not possible in the past. Marching toward the next generation of self-driving laboratories, the orchestration of both resources lies at the focal point of autonomous discovery in chemical science. To achieve such a goal, algorithmically accessible data representations and standardized communication protocols are indispensable. In this perspective, we recategorize the recently introduced approach based on Materials Acceleration Platforms into five functional components and discuss recent case studies that focus on the data representation and exchange scheme between different components. Emerging technologies for interoperable data representation and multi-agent systems are also discussed with their recent applications in chemical automation. We hypothesize that knowledge graph technology, orchestrating semantic web technologies and multi-agent systems, will be the driving force to bring data to knowledge, evolving our way of automating the laboratory.
Collapse
Affiliation(s)
- Jiaru Bai
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
| | - Liwei Cao
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
| | - Sebastian Mosbach
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
- Cambridge
Centre for Advanced Research and Education in Singapore (CARES), CREATE Tower #05-05, 1 Create Way, 138602 Singapore
| | - Jethro Akroyd
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
- Cambridge
Centre for Advanced Research and Education in Singapore (CARES), CREATE Tower #05-05, 1 Create Way, 138602 Singapore
| | - Alexei A. Lapkin
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
- Cambridge
Centre for Advanced Research and Education in Singapore (CARES), CREATE Tower #05-05, 1 Create Way, 138602 Singapore
| | - Markus Kraft
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, United Kingdom
- Cambridge
Centre for Advanced Research and Education in Singapore (CARES), CREATE Tower #05-05, 1 Create Way, 138602 Singapore
- School
of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, 637459 Singapore
- The
Alan Turing Institute, London NW1 2DB, United Kingdom
| |
Collapse
|
32
|
Trunschke A. Prospects and challenges for autonomous catalyst discovery viewed from an experimental perspective. Catal Sci Technol 2022. [DOI: 10.1039/d2cy00275b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Autonomous catalysis research requires elaborate integration of operando experiments into automated workflows. Suitable experimental data for analysis by artificial intelligence can be measured more readily according to standard operating procedures.
Collapse
Affiliation(s)
- Annette Trunschke
- Fritz-Haber-Institut der Max-Planck-Gesellschaft, Department of Inorganic Chemistry, Faradayweg 4-6, 14195 Berlin, Germany
| |
Collapse
|
33
|
Zhang L, He M. Prediction of solar cell materials via unsupervised literature learning. JOURNAL OF PHYSICS. CONDENSED MATTER : AN INSTITUTE OF PHYSICS JOURNAL 2021; 34:095902. [PMID: 34844235 DOI: 10.1088/1361-648x/ac3e1e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/23/2021] [Accepted: 11/29/2021] [Indexed: 06/13/2023]
Abstract
Despite the significant advancement of the data-driven studies for physical science, the textual data that are numerous in the literature are not fully embraced by the physics and materials community. In this manuscript, we successfully employ the natural language processing (NLP) technique to unsupervisedly predict the existence of solar cell types including the dye-sensitized solar cells and the perovskite solar cells based on literatures published prior to their first discovery without human annotation. Enlightened by this, we further identify possible solar cell material candidates via NLP starting with a comprehensive training database of 3.2 million paper abstracts published before 2021. The NLP model effectively predicts the existing solar cell materials, while an uncommon solar cell material namely PtSe2is suggested as an appropriate candidate for the future solar cells. Its optoelectronic properties are comprehensive investigated via first-principles calculations to reveal the decent stability and optoelectronic performance of the NLP-predicted candidate. This study demonstrates the viability of the textual data for the data-driven materials prediction and highlights the NLP method as a powerful tool to reliably predict the solar cell materials.
Collapse
Affiliation(s)
- Lei Zhang
- Institute of Advanced Materials and Flexible Electronics (IAMFE), School of Chemistry and Materials Science, Nanjing University of Information Science & Technology, 210044, Nanjing, People's Republic of China
- Department of Materials Physics, School of Chemistry and Materials Science, Nanjing University of Information Science & Technology, 210044, Nanjing, People's Republic of China
| | - Mu He
- Institute of Advanced Materials and Flexible Electronics (IAMFE), School of Chemistry and Materials Science, Nanjing University of Information Science & Technology, 210044, Nanjing, People's Republic of China
| |
Collapse
|
34
|
Szymkuć S, Badowski T, Grzybowski BA. Is Organic Chemistry Really Growing Exponentially? Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202111540] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Sara Szymkuć
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Tomasz Badowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
| | - Bartosz A. Grzybowski
- Institute of Organic Chemistry Polish Academy of Sciences Ul. Kasprzaka 44/52 01-224 Warsaw Poland
- Allchemy, Inc. Highland IN USA
- IBS Center for Soft and Living Matter and Department of Chemistry UNIST 50, UNIST-gil, Eonyang-eup, Ulju-gun Ulsan South Korea
| |
Collapse
|
35
|
Szymkuć S, Badowski T, Grzybowski BA. Is Organic Chemistry Really Growing Exponentially? Angew Chem Int Ed Engl 2021; 60:26226-26232. [PMID: 34558168 DOI: 10.1002/anie.202111540] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Indexed: 11/05/2022]
Abstract
In terms of molecules and specific reaction examples, organic chemistry features an impressive, exponential growth. However, new reaction classes/types that fuel this growth are being discovered at a much slower and only linear (or even sublinear) rate. The proportion of newly discovered reaction types to all reactions being performed keeps decreasing, suggesting that synthetic chemistry becomes more reliant on reusing the well-known methods. The newly discovered chemistries are more complex than decades ago and allow for the rapid construction of complex scaffolds in fewer numbers of steps. We study these and other trends in the function of time, reaction-type popularity and complexity based on the algorithm that extracts generalized reaction class templates. These analyses are useful in the context of computer-assisted synthesis, machine learning (to estimate the numbers of models with sufficient reaction statistics), and identifying erroneous entries in reaction databases.
Collapse
Affiliation(s)
- Sara Szymkuć
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Tomasz Badowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Ul. Kasprzaka 44/52, 01-224, Warsaw, Poland.,Allchemy, Inc., Highland, IN, USA.,IBS Center for Soft and Living Matter and Department of Chemistry, UNIST, 50, UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, South Korea
| |
Collapse
|
36
|
Seierstad M, Tichenor MS, DesJarlais RL, Na J, Bacani GM, Chung DM, Mercado-Marin EV, Steffens HC, Mirzadegan T. Novel Reagent Space: Identifying Unorderable but Readily Synthesizable Building Blocks. ACS Med Chem Lett 2021; 12:1853-1860. [PMID: 34795876 DOI: 10.1021/acsmedchemlett.1c00340] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Accepted: 10/01/2021] [Indexed: 01/14/2023] Open
Abstract
Drug discovery building blocks available commercially or within an internal inventory cover a diverse range of chemical space and yet describe only a tiny fraction of all chemically feasible reagents. Vendors will eagerly provide tools to search the former; there is no straightforward method of mining the latter. We describe a procedure and use case in assembling chemical structures not available for purchase but that could likely be synthesized in one robust chemical transformation starting from readily available building blocks. Accessing this vast virtual chemical space dramatically increases our curated collection of reagents available for medicinal chemistry exploration and novel hit generation, almost tripling the number of those with 10 or fewer atoms.
Collapse
Affiliation(s)
- Mark Seierstad
- Janssen Research and Development, San Diego, California 92121, United States
| | - Mark S. Tichenor
- Janssen Research and Development, San Diego, California 92121, United States
| | - Renee L. DesJarlais
- Janssen Research and Development, Spring House, Pennsylvania 19477, United States
| | - Jim Na
- Janssen Research and Development, San Diego, California 92121, United States
| | - Genesis M. Bacani
- Janssen Research and Development, San Diego, California 92121, United States
| | - De Michael Chung
- Janssen Research and Development, San Diego, California 92121, United States
| | | | - Helena C. Steffens
- Janssen Research and Development, San Diego, California 92121, United States
| | - Taraneh Mirzadegan
- Janssen Research and Development, San Diego, California 92121, United States
| |
Collapse
|
37
|
Munro LJ, Kell DB. Intelligent host engineering for metabolic flux optimisation in biotechnology. Biochem J 2021; 478:3685-3721. [PMID: 34673920 PMCID: PMC8589332 DOI: 10.1042/bcj20210535] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2021] [Revised: 09/22/2021] [Accepted: 09/24/2021] [Indexed: 12/13/2022]
Abstract
Optimising the function of a protein of length N amino acids by directed evolution involves navigating a 'search space' of possible sequences of some 20N. Optimising the expression levels of P proteins that materially affect host performance, each of which might also take 20 (logarithmically spaced) values, implies a similar search space of 20P. In this combinatorial sense, then, the problems of directed protein evolution and of host engineering are broadly equivalent. In practice, however, they have different means for avoiding the inevitable difficulties of implementation. The spare capacity exhibited in metabolic networks implies that host engineering may admit substantial increases in flux to targets of interest. Thus, we rehearse the relevant issues for those wishing to understand and exploit those modern genome-wide host engineering tools and thinking that have been designed and developed to optimise fluxes towards desirable products in biotechnological processes, with a focus on microbial systems. The aim throughput is 'making such biology predictable'. Strategies have been aimed at both transcription and translation, especially for regulatory processes that can affect multiple targets. However, because there is a limit on how much protein a cell can produce, increasing kcat in selected targets may be a better strategy than increasing protein expression levels for optimal host engineering.
Collapse
Affiliation(s)
- Lachlan J. Munro
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Building 220, Kemitorvet, 2800 Kgs. Lyngby, Denmark
| | - Douglas B. Kell
- Novo Nordisk Foundation Centre for Biosustainability, Technical University of Denmark, Building 220, Kemitorvet, 2800 Kgs. Lyngby, Denmark
- Department of Biochemistry and Systems Biology, Institute of Systems, Molecular and Integrative Biology, University of Liverpool, Crown St, Liverpool L69 7ZB, U.K
- Mellizyme Biotechnology Ltd, IC1, Liverpool Science Park, 131 Mount Pleasant, Liverpool L3 5TF, U.K
| |
Collapse
|
38
|
Puthongkham P, Wirojsaengthong S, Suea-Ngam A. Machine learning and chemometrics for electrochemical sensors: moving forward to the future of analytical chemistry. Analyst 2021; 146:6351-6364. [PMID: 34585185 DOI: 10.1039/d1an01148k] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
Electrochemical sensors and biosensors have been successfully used in a wide range of applications, but systematic optimization and nonlinear relationships have been compromised for electrode fabrication and data analysis. Machine learning and experimental designs are chemometric tools that have been proved to be useful in method development and data analysis. This minireview summarizes recent applications of machine learning and experimental designs in electroanalytical chemistry. First, experimental designs, e.g., full factorial, central composite, and Box-Behnken are discussed as systematic approaches to optimize electrode fabrication to consider the effects from individual variables and their interactions. Then, the principles of machine learning algorithms, including linear and logistic regressions, neural network, and support vector machine, are introduced. These machine learning models have been implemented to extract complex relationships between chemical structures and their electrochemical properties and to analyze complicated electrochemical data to improve calibration and analyte classification, such as in electronic tongues. Lastly, the future of machine learning and experimental designs in electrochemical sensors is outlined. These chemometric strategies will accelerate the development and enhance the performance of electrochemical devices for point-of-care diagnostics and commercialization.
Collapse
Affiliation(s)
- Pumidech Puthongkham
- Department of Chemistry, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand. .,Electrochemistry and Optical Spectroscopy Center of Excellence (EOSCE), Chulalongkorn University, Bangkok 10330, Thailand.,Center of Excellence in Responsive Wearable Materials, Chulalongkorn University, Bangkok 10330, Thailand
| | - Supacha Wirojsaengthong
- Department of Chemistry, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand.
| | - Akkapol Suea-Ngam
- Department of Materials, Department of Bioengineering, and Institute of Biomedical Engineering, Imperial College London, London, SW7 2AZ, UK
| |
Collapse
|
39
|
Hammer AS, Leonov AI, Bell NL, Cronin L. Chemputation and the Standardization of Chemical Informatics. JACS AU 2021; 1:1572-1587. [PMID: 34723260 PMCID: PMC8549037 DOI: 10.1021/jacsau.1c00303] [Citation(s) in RCA: 22] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2021] [Indexed: 05/11/2023]
Abstract
The explosion in the use of machine learning for automated chemical reaction optimization is gathering pace. However, the lack of a standard architecture that connects the concept of chemical transformations universally to software and hardware provides a barrier to using the results of these optimizations and could cause the loss of relevant data and prevent reactions from being reproducible or unexpected findings verifiable or explainable. In this Perspective, we describe how the development of the field of digital chemistry or chemputation, that is the universal code-enabled control of chemical reactions using a standard language and ontology, will remove these barriers allowing users to focus on the chemistry and plug in algorithms according to the problem space to be explored or unit function to be optimized. We describe a standard hardware (the chemical processing programming architecture-the ChemPU) to encompass all chemical synthesis, an approach which unifies all chemistry automation strategies, from solid-phase peptide synthesis, to HTE flow chemistry platforms, while at the same time establishing a publication standard so that researchers can exchange chemical code (χDL) to ensure reproducibility and interoperability. Not only can a vast range of different chemistries be plugged into the hardware, but the ever-expanding developments in software and algorithms can also be accommodated. These technologies, when combined will allow chemistry, or chemputation, to follow computation-that is the running of code across many different types of capable hardware to get the same result every time with a low error rate.
Collapse
|
40
|
Bornemann‐Pfeiffer M, Wolf J, Meyer K, Kern S, Angelone D, Leonov A, Cronin L, Emmerling F. Standardisierung und Kontrolle von Grignard‐Reaktionen mittels Online‐NMR in einer universellen chemischen Syntheseplattform. Angew Chem Int Ed Engl 2021. [DOI: 10.1002/ange.202106323] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Affiliation(s)
- Martin Bornemann‐Pfeiffer
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
- Chair of Chemical and Process Engineering Technische Universität Berlin Marchstr. 23 10587 Berlin Germany
| | - Jakob Wolf
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| | - Klas Meyer
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| | - Simon Kern
- S-PACT GmbH Burtscheiderstr. 1 52064 Aachen Deutschland
| | - Davide Angelone
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Artem Leonov
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Leroy Cronin
- School of Chemistry University of Glasgow Glasgow G12 8QQ UK
| | - Franziska Emmerling
- Bundesanstalt für Materialforschung und -prüfung Richard-Willstätter-Straße 11 12489 Berlin Deutschland
| |
Collapse
|
41
|
Bornemann‐Pfeiffer M, Wolf J, Meyer K, Kern S, Angelone D, Leonov A, Cronin L, Emmerling F. Standardization and Control of Grignard Reactions in a Universal Chemical Synthesis Machine using online NMR. Angew Chem Int Ed Engl 2021; 60:23202-23206. [PMID: 34278673 PMCID: PMC8597166 DOI: 10.1002/anie.202106323] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Indexed: 11/17/2022]
Abstract
A big problem with the chemistry literature is that it is not standardized with respect to precise operational parameters, and real time corrections are hard to make without expert knowledge. This lack of context means difficult reproducibility because many steps are ambiguous, and hence depend on tacit knowledge. Here we present the integration of online NMR into an automated chemical synthesis machine (CSM aka. "Chemputer" which is capable of small-molecule synthesis using a universal programming language) to allow automated analysis and adjustment of reactions on the fly. The system was validated and benchmarked by using Grignard reactions which were chosen due to their importance in synthesis. The system was monitored in real time using online-NMR, and spectra were measured continuously during the reactions. This shows that the synthesis being done in the Chemputer can be dynamically controlled in response to feedback optimizing the reaction conditions according to the user requirements.
Collapse
Affiliation(s)
- Martin Bornemann‐Pfeiffer
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
- Chair of Chemical and Process EngineeringTechnische Universität BerlinMarchstr. 2310587BerlinGermany
| | - Jakob Wolf
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| | - Klas Meyer
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| | - Simon Kern
- S-PACT GmbHBurtscheiderstr. 152064AachenGermany
| | | | - Artem Leonov
- School of ChemistryUniversity of GlasgowGlasgowG12 8QQUK
| | - Leroy Cronin
- School of ChemistryUniversity of GlasgowGlasgowG12 8QQUK
| | - Franziska Emmerling
- Department 1: Analytical Chemistry, Reference MaterialsBundesanstalt für Materialforschung und -prüfungRichard-Willstätter-Straße 1112489BerlinGermany
| |
Collapse
|
42
|
Bannigan P, Aldeghi M, Bao Z, Häse F, Aspuru-Guzik A, Allen C. Machine learning directed drug formulation development. Adv Drug Deliv Rev 2021; 175:113806. [PMID: 34019959 DOI: 10.1016/j.addr.2021.05.016] [Citation(s) in RCA: 100] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 03/31/2021] [Accepted: 05/14/2021] [Indexed: 12/12/2022]
Abstract
Machine learning (ML) has enabled ground-breaking advances in the healthcare and pharmaceutical sectors, from improvements in cancer diagnosis, to the identification of novel drugs and drug targets as well as protein structure prediction. Drug formulation is an essential stage in the discovery and development of new medicines. Through the design of drug formulations, pharmaceutical scientists can engineer important properties of new medicines, such as improved bioavailability and targeted delivery. The traditional approach to drug formulation development relies on iterative trial-and-error, requiring a large number of resource-intensive and time-consuming in vitro and in vivo experiments. This review introduces the basic concepts of ML-directed workflows and discusses how these tools can be used to aid in the development of various types of drug formulations. ML-directed drug formulation development offers unparalleled opportunities to fast-track development efforts, uncover new materials, innovative formulations, and generate new knowledge in drug formulation science. The review also highlights the latest artificial intelligence (AI) technologies, such as generative models, Bayesian deep learning, reinforcement learning, and self-driving laboratories, which have been gaining momentum in drug discovery and chemistry and have potential in drug formulation development.
Collapse
Affiliation(s)
- Pauric Bannigan
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada
| | - Matteo Aldeghi
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 3H6, Canada; Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada
| | - Zeqing Bao
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada
| | - Florian Häse
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 3H6, Canada; Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada
| | - Alán Aspuru-Guzik
- Chemical Physics Theory Group, Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 3H6, Canada; Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada; Lebovic Fellow, Canadian Institute for Advanced Research, Toronto, ON M5S 1M1, Canada.
| | - Christine Allen
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada.
| |
Collapse
|
43
|
Häse F, Aldeghi M, Hickman RJ, Roch LM, Christensen M, Liles E, Hein JE, Aspuru-Guzik A. Olympus: a benchmarking framework for noisy optimization and experiment planning. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abedc8] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Abstract
Research challenges encountered across science, engineering, and economics can frequently be formulated as optimization tasks. In chemistry and materials science, recent growth in laboratory digitization and automation has sparked interest in optimization-guided autonomous discovery and closed-loop experimentation. Experiment planning strategies based on off-the-shelf optimization algorithms can be employed in fully autonomous research platforms to achieve desired experimentation goals with the minimum number of trials. However, the experiment planning strategy that is most suitable to a scientific discovery task is a priori unknown while rigorous comparisons of different strategies are highly time and resource demanding. As optimization algorithms are typically benchmarked on low-dimensional synthetic functions, it is unclear how their performance would translate to noisy, higher-dimensional experimental tasks encountered in chemistry and materials science. We introduce Olympus, a software package that provides a consistent and easy-to-use framework for benchmarking optimization algorithms against realistic experiments emulated via probabilistic deep-learning models. Olympus includes a collection of experimentally derived benchmark sets from chemistry and materials science and a suite of experiment planning strategies that can be easily accessed via a user-friendly Python interface. Furthermore, Olympus facilitates the integration, testing, and sharing of custom algorithms and user-defined datasets. In brief, Olympus mitigates the barriers associated with benchmarking optimization algorithms on realistic experimental scenarios, promoting data sharing and the creation of a standard framework for evaluating the performance of experiment planning strategies.
Collapse
|
44
|
Muravyev NV, Luciano G, Ornaghi HL, Svoboda R, Vyazovkin S. Artificial Neural Networks for Pyrolysis, Thermal Analysis, and Thermokinetic Studies: The Status Quo. Molecules 2021; 26:3727. [PMID: 34207246 PMCID: PMC8235697 DOI: 10.3390/molecules26123727] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2021] [Revised: 06/15/2021] [Accepted: 06/15/2021] [Indexed: 11/29/2022] Open
Abstract
Artificial neural networks (ANNs) are a method of machine learning (ML) that is now widely used in physics, chemistry, and material science. ANN can learn from data to identify nonlinear trends and give accurate predictions. ML methods, and ANNs in particular, have already demonstrated their worth in solving various chemical engineering problems, but applications in pyrolysis, thermal analysis, and, especially, thermokinetic studies are still in an initiatory stage. The present article gives a critical overview and summary of the available literature on applying ANNs in the field of pyrolysis, thermal analysis, and thermokinetic studies. More than 100 papers from these research areas are surveyed. Some approaches from the broad field of chemical engineering are discussed as the venues for possible transfer to the field of pyrolysis and thermal analysis studies in general. It is stressed that the current thermokinetic applications of ANNs are yet to evolve significantly to reach the capabilities of the existing isoconversional and model-fitting methods.
Collapse
Affiliation(s)
- Nikita V. Muravyev
- N.N. Semenov Federal Research Center for Chemical Physics, Russian Academy of Sciences, 4 Kosygina Str., 119991 Moscow, Russia
| | - Giorgio Luciano
- CNR, Istituto di Scienze e Tecnologie Chimiche “Giulio Natta”, Via De Marini 6, 16149 Genova, Italy;
| | - Heitor Luiz Ornaghi
- Department of Materials, Federal University for Latin American Integration (UNILA), Silvio Américo Sasdelli Avenue, 1842, Foz do Iguaçu-Paraná 85866-000, Brazil;
| | - Roman Svoboda
- Department of Physical Chemistry, Faculty of Chemical Technology, University of Pardubice, Studentská 95, CZ-53210 Pardubice, Czech Republic;
| | - Sergey Vyazovkin
- Department of Chemistry, University of Alabama at Birmingham, 901 S. 14th Street, Birmingham, AL 35294, USA
| |
Collapse
|
45
|
Seifrid M, Aspuru-Guzik A. You Wouldn't Download a Molecule! Now, ChemSCAD Makes It Possible. ACS CENTRAL SCIENCE 2021; 7:228-230. [PMID: 33655062 PMCID: PMC7908020 DOI: 10.1021/acscentsci.1c00108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/09/2023]
Affiliation(s)
- Martin Seifrid
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Computer
Science, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department of Computer
Science, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Vector
Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, Canada
- Canadian Institute
for Advanced Research (CIFAR) Senior Fellow, Toronto, Ontario M5S 1M1, Canada
| |
Collapse
|