1
|
Zhong H, Liu Y, Sun H, Liu Y, Zhang R, Li B, Yang Y, Huang Y, Yang F, Mak FS, Foo K, Lin S, Yu T, Wang P, Wang X. Towards global reaction feasibility and robustness prediction with high throughput data and bayesian deep learning. Nat Commun 2025; 16:4522. [PMID: 40374636 PMCID: PMC12081921 DOI: 10.1038/s41467-025-59812-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2024] [Accepted: 05/05/2025] [Indexed: 05/17/2025] Open
Abstract
Predicting organic reaction feasibility and robustness against environmental factors is challenging. We address this issue by integrating high throughput experimentation (HTE) and Bayesian deep learning. Diverging from existing HTE studies focused on niche chemical spaces, in this work, our in-house HTE platform conducted 11,669 distinct acid amine coupling reactions in 156 working hours, yielding the most extensive single HTE dataset at a volumetric scale for industrial delivery. Our Bayesian neural network model achieved a benchmark for prediction accuracy of 89.48% for reaction feasibility. Furthermore, our fine-grained uncertainty disentanglement enables efficient active learning, reducing 80% of data requirements. Additionally, our uncertainty analysis effectively identifies out-of-domain reactions and evaluates reaction robustness or reproducibility against environmental factors for scaling up, offering a practical framework for navigating chemical spaces and designing highly robust industrial processes.
Collapse
Affiliation(s)
| | | | | | - Yuru Liu
- ChemLex, Shanghai, Shanghai, China
| | | | | | - Yi Yang
- ChemLex, Shanghai, Shanghai, China
| | - Yuqing Huang
- MegaRobo Technologies Co., Ltd., Shanghai, Shanghai, China
| | - Fei Yang
- Zhejiang Laboratory, Hangzhou, Zhejiang, China
| | - Frankie S Mak
- Experimental Drug Development Centre (EDDC), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Klement Foo
- Experimental Drug Development Centre (EDDC), Agency for Science, Technology and Research (A*STAR), Singapore, Singapore
| | - Sen Lin
- ChemLex, Shanghai, Shanghai, China
| | - Tianshu Yu
- School of Data Science, The Chinese University of Hong Kong - Shenzhen, Shenzhen, Guangdong, China.
| | | | | |
Collapse
|
2
|
Krzyzanowski A, Pickett SD, Pogány P. Exploring BERT for Reaction Yield Prediction: Evaluating the Impact of Tokenization, Molecular Representation, and Pretraining Data Augmentation. J Chem Inf Model 2025; 65:4381-4402. [PMID: 40311104 DOI: 10.1021/acs.jcim.5c00359] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/03/2025]
Abstract
Predicting reaction yields in synthetic chemistry remains a significant challenge. This study systematically evaluates the impact of tokenization, molecular representation, pretraining data, and adversarial training on a BERT-based model for yield prediction of Buchwald-Hartwig and Suzuki-Miyaura coupling reactions using publicly available HTE data sets. We demonstrate that molecular representation choice (SMILES, DeepSMILES, SELFIES, Morgan fingerprint-based notation, IUPAC names) has minimal impact on model performance, while typically BPE and SentencePiece tokenization outperform other methods. WordPiece is strongly discouraged for SELFIES and fingerprint-based notation. Furthermore, pretraining with relatively small data sets (<100 K reactions) achieves comparable performance to larger data sets containing millions of examples. The use of artificially generated domain-specific pretraining data is proposed. The artificially generated sets prove to be a good surrogate for the reaction schemes extracted from reaction data sets such as Pistachio or Reaxys. The best performance was observed for hybrid pretraining sets combining the real and the domain-specific, artificial data. Finally, we show that a novel adversarial training approach, perturbing input embeddings dynamically, improves model robustness and generalizability for yield and reaction success prediction. These findings provide valuable insights for developing robust and practical machine learning models for yield prediction in synthetic chemistry. GSK's BERT training code base is made available to the community with this work.
Collapse
Affiliation(s)
| | - Stephen D Pickett
- GSK Medicines Research Centre, Gunnels Wood Road, Stevenage SG1 2NY, U.K
| | - Peter Pogány
- GSK Medicines Research Centre, Gunnels Wood Road, Stevenage SG1 2NY, U.K
| |
Collapse
|
3
|
Huang KH, Chen K, Morato NM, Sams TC, Dziekonski ET, Cooks RG. High-throughput microdroplet-based synthesis using automated array-to-array transfer. Chem Sci 2025; 16:7544-7550. [PMID: 40171032 PMCID: PMC11955803 DOI: 10.1039/d5sc00638d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2025] [Accepted: 03/20/2025] [Indexed: 04/03/2025] Open
Abstract
Automation of chemical synthesis and high-throughput (HT) screening are important for speeding up drug discovery. Here, we describe an automated HT picomole scale synthesis system which uses desorption electrospray ionization (DESI) to create microdroplets of reaction mixtures at individual positions from a two-dimensional reactant array and transfer them to a corresponding position in an array of collected reaction products. On-the-fly chemical transformations are facilitated by the reaction acceleration phenomenon in microdroplets and high reaction conversions are achieved during the milliseconds droplet flight time from the reactant to the product array. Successful functionalization of bioactive molecules is demonstrated through the generation of 172 analogs (64% success rate) using multiple reaction types. Synthesis throughput is ∼45 seconds/reaction including droplet formation, reaction, and collection steps, all of which occur in an integrated fashion, generating product amounts sufficient for subsequent bioactivity screening (low ng to low μg). Quantitative performance was validated using LC/MS. This system bridges the demonstrated capabilities of HT-DESI for reaction screening and label-free bioassays, allowing consolidation of the key early drug discovery steps around a single synthetic-analytical technology.
Collapse
Affiliation(s)
- Kai-Hung Huang
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
| | - Kitmin Chen
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
| | - Nicolás M Morato
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
- Purdue Institute for Cancer Research, Purdue University West Lafayette Indiana 47907 USA
| | - Thomas C Sams
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
| | - Eric T Dziekonski
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
| | - R Graham Cooks
- Department of Chemistry, Purdue University West Lafayette Indiana 47907 USA
- Purdue Institute for Cancer Research, Purdue University West Lafayette Indiana 47907 USA
| |
Collapse
|
4
|
Song W, Sun H. Local reaction condition optimization via machine learning. J Mol Model 2025; 31:143. [PMID: 40266356 DOI: 10.1007/s00894-025-06365-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Accepted: 03/31/2025] [Indexed: 04/24/2025]
Abstract
CONTEXT Reaction condition optimization addresses shared requirements across academia and industry, particularly in chemistry, pharmaceutical development, and fine chemical engineering. This review examines recent progress and persistent challenges in machine learning-guided optimization of localized reaction conditions, with an emphasis on three core aspects: dataset, condition representation, and optimization methods, as well as the main issues in each related stage. The review explores challenges such as dataset scarcity, data quality, and the "completeness trap" in dataset preparation stage, summarizes the limitations of current molecular representation techniques in condition representation stage, and discusses the search efficiency challenges of optimization methods in optimization stage. METHODS The review analyzes the molecular representation techniques and identifies them as the primary bottleneck in advancing localized reaction condition optimization. It further examines existing optimization methodologies. Among them, Bayesian optimization and active learning emerges as the most commonly applied approaches in this field, utilizing incremental learning mechanisms and human-in-the-loop strategies to minimize experimental data requirements while mitigating molecular representation limitations. The review concludes that advancements in molecular representation techniques are essential for developing more efficient optimization methods in the future.
Collapse
Affiliation(s)
- Wenhuan Song
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264209, China.
| | - Honggang Sun
- School of Mechanical, Electrical & Information Engineering, Shandong University, Weihai, 264209, China
| |
Collapse
|
5
|
Huo Z, Xie X, Tong R. Machine Learning for Developing Sustainable Polymers. Chemistry 2025:e202500718. [PMID: 40266984 DOI: 10.1002/chem.202500718] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2025] [Revised: 04/20/2025] [Accepted: 04/22/2025] [Indexed: 04/25/2025]
Abstract
Sustainable polymers from renewable resources have been gaining importance due to their recyclability and reduced environmental impact. However, their development through conventional trial-and-error methods remains inefficient and resource-intensive. Machine learning (ML) has emerged as a powerful tool in polymer science, enabling rapid prediction, and discovery of new chemicals and materials. In this review, we examine emerging trends in ML applications for sustainable polymer development, focusing on catalyst discovery, property optimization, and new polymer design. We analyze unique challenges in applying ML to sustainable polymers and evaluate proposed solutions, providing insights for future development in this rapidly evolving field.
Collapse
Affiliation(s)
- Ziyu Huo
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, Virginia, 24061, USA
| | - Xiaoyu Xie
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, Virginia, 24061, USA
| | - Rong Tong
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, Virginia, 24061, USA
| |
Collapse
|
6
|
Ghosh J, Morato NM, Feng Y, Cooks RG. High-Throughput Drug Derivatization and Bioassay by Desorption Electrospray Ionization Mass Spectrometry. Chempluschem 2025:e2500164. [PMID: 40095503 DOI: 10.1002/cplu.202500164] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2025] [Accepted: 03/17/2025] [Indexed: 03/19/2025]
Abstract
Adapting high-throughput (HT) synthetic methods to the modification of drugs and to testing of their bioactivity should expedite drug discovery. Herein, the applicability of HT desorption electrospray ionization mass spectrometry (DESI MS) to achieve late-stage functionalization (LSF) and rapidly generate a modified opioid library is demonstrated. Specifically, aza-Michael addition and sulfur (VI) fluoride exchange reactions are used for functionalization. The modified drugs are both synthesized and characterized using an automated HT-DESI MS platform, with the reaction occurring during the droplet flight. Analysis by MS characterizes reaction products at a throughput of >1 reaction per second. With this platform, multiple nor-opioid scaffolds and functionalization reagents are screened and a selection of the hits obtained is subjected to HT label-free bioassays using the same DESI-MS platform. This combination of accelerated LSF reactions to rapidly create a diverse library of functionalized drugs with direct bioassays of the crude reaction mixtures for structure-activity relationship evaluation, both using the same platform, is anticipated to help expedite the early drug discovery process.
Collapse
Affiliation(s)
- Jyotirmoy Ghosh
- Department of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, IN, 47907, USA
| | - Nicolás M Morato
- Department of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, IN, 47907, USA
| | - Yunfei Feng
- Department of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, IN, 47907, USA
| | - R Graham Cooks
- Department of Chemistry, Purdue University, 560 Oval Drive, West Lafayette, IN, 47907, USA
| |
Collapse
|
7
|
Cao Y, Zhang T, Zhao X, Li H. HiRXN: Hierarchical Attention-Based Representation Learning for Chemical Reaction. J Chem Inf Model 2025; 65:1990-2002. [PMID: 39901569 DOI: 10.1021/acs.jcim.4c01787] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/05/2025]
Abstract
In recent years, natural language processing (NLP) techniques, including large language modeling (LLM), have contributed significantly to advancements in organic chemistry research. Chemical reaction representations provide a link between NLP models and chemistry prediction tasks and enable the translation of complex chemical processes into a format that NLP models can understand and learn from. However, previous representation methods fail to adequately consider the hierarchical and structural information inherent in chemical reactions. Here, we propose a tool named HiRXN to learn the comprehensive representation of chemical reactions based on their hierarchical structure. In order to significantly enhance feature engineering for machine learning (ML) models, HiRXN develops an effective tokenization method called RXNTokenizer to capture atomic microenvironment features with multiradius. Then, the hierarchical attention network is used to integrate information from atomic microenvironment-level and molecule-level to accurately understand chemical reactions. The experimental results show that HiRXN is capable of representing chemical reactions and achieves remarkable performance in terms of reaction regression and classification prediction tasks. A web server has been developed to provide a specialized service that accepts Reaction SMILES as input and provides predicted results. The Web site is accessible at http://bdatju.com.
Collapse
Affiliation(s)
- Yahui Cao
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Tao Zhang
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Xin Zhao
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| | - Haotong Li
- School of Electrical and Information Engineering, Tianjin University, Tianjin 300072, China
| |
Collapse
|
8
|
Janssen K, Proppe J. Predicting and Explaining Yields with Machine Learning for Carboxylated Azoles and Beyond. J Chem Inf Model 2025; 65:1862-1872. [PMID: 39916507 PMCID: PMC11863374 DOI: 10.1021/acs.jcim.4c02336] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2024] [Revised: 01/24/2025] [Accepted: 01/28/2025] [Indexed: 02/25/2025]
Abstract
Carbon dioxide (CO2) can be transformed into valuable chemical building blocks, including C2-carboxylated 1,3-azoles, which have potential applications in pharmaceuticals, cosmetics, and pesticides. However, only a small fraction of the millions of available 1,3-azoles are carboxylated at the C2 position, highlighting significant opportunities for further research in the synthesis and application of these compounds. In this study, we utilized a supervised machine learning approach to predict reaction yields for a data set of amide-coupled C2-carboxylated 1,3-azoles. To facilitate molecular design, we integrated an interpretable heat-mapping algorithm named PIXIE (Predictive Insights and Xplainability for Informed chemical space Exploration). PIXIE visualizes the influence of molecular substructures on predicted yields by leveraging fingerprint bit importances, providing synthetic chemists with a powerful tool for the rational design of molecules. While heat mapping is an established technique, its integration with a machine-learning model tailored to the chemical space of C2-carboxylated 1,3-azoles represents a significant advancement. This approach not only enables targeted exploration of this underrepresented chemical space, fostering the discovery of new bioactive compounds, but also demonstrates the potential of combining these methods for broader applications in other chemical domains.
Collapse
Affiliation(s)
- Kerrin Janssen
- TU Braunschweig, Institute of Physical
and Theoretical Chemistry, Gauss Str 17, 38106 Braunschweig, Germany
| | - Jonny Proppe
- TU Braunschweig, Institute of Physical
and Theoretical Chemistry, Gauss Str 17, 38106 Braunschweig, Germany
| |
Collapse
|
9
|
Tavakoli M, Chiu YTT, Carlton AM, Van Vranken D, Baldi P. Chemically Informed Deep Learning for Interpretable Radical Reaction Prediction. J Chem Inf Model 2025; 65:1228-1242. [PMID: 39871741 PMCID: PMC11815866 DOI: 10.1021/acs.jcim.4c01901] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2024] [Revised: 01/14/2025] [Accepted: 01/15/2025] [Indexed: 01/29/2025]
Abstract
Organic radical reactions are crucial in many areas of chemistry, including synthetic, biological, and atmospheric chemistry. We develop a predictive framework based on the interaction of molecular orbitals that operates on mechanistic-level radical reactions. Given our chemistry-aware model, all predictions are provided with different levels of interpretability. Our models are trained and evaluated using the RMechDB database of radical reaction steps. Our model predicts the correct orbital interaction and products for 96% of the test reactions in RMechDB. By chaining these predictions, we perform a pathway search capable of identifying all intermediates and byproducts of a radical reaction. We test the pathway search on two classes of problems in atmospheric and polymerization chemistry. RMechRP is publicly available online at https://deeprxn.ics.uci.edu/rmechrp/.
Collapse
Affiliation(s)
- Mohammadamin Tavakoli
- Department
of Computer Science, University of California,
Irvine, Irvine, California 92697, United States
| | - Yin Ting T. Chiu
- Department
of Chemistry, University of California,
Irvine, Irvine, California 92697, United States
| | - Ann Marie Carlton
- Department
of Chemistry, University of California,
Irvine, Irvine, California 92697, United States
| | - David Van Vranken
- Department
of Chemistry, University of California,
Irvine, Irvine, California 92697, United States
| | - Pierre Baldi
- Department
of Computer Science, University of California,
Irvine, Irvine, California 92697, United States
| |
Collapse
|
10
|
Hua PX, Huang Z, Xu ZY, Zhao Q, Ye CY, Wang YF, Xu YH, Fu Y, Ding H. An active representation learning method for reaction yield prediction with small-scale data. Commun Chem 2025; 8:42. [PMID: 39929993 PMCID: PMC11811124 DOI: 10.1038/s42004-025-01434-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Accepted: 01/27/2025] [Indexed: 02/13/2025] Open
Abstract
Reaction optimization plays an essential role in chemical research and industrial production. To explore a large reaction system, a practical issue is how to reduce the heavy experimental load for finding the high-yield conditions. In this paper, we present an efficient machine learning tool called "RS-Coreset", where the key idea is to take advantage of deep representation learning techniques to guide an interactive procedure for representing the full reaction space. Our proposed tool only uses small-scale data, say 2.5% to 5% of the instances, to predict the yields of the reaction space. We validate the performance on three public datasets and achieve state-of-the-art results. Moreover, we apply this tool to assist the realistic exploration of the Lewis base-boryl radicals enabled dechlorinative coupling reactions in our lab. The tool can help us to effectively predict the yields and even discover several feasible reaction combinations that were overlooked in previous articles.
Collapse
Affiliation(s)
- Peng-Xiang Hua
- School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Zhen Huang
- School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Zhe-Yuan Xu
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Qiang Zhao
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Chen-Yang Ye
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China
| | - Yi-Feng Wang
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China.
| | - Yun-He Xu
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China.
| | - Yao Fu
- Key Laboratory of Precision and Intelligent Chemistry, CAS Key Laboratory of Urban Pollutant Conversion, Anhui Province Key Laboratory of Biomass Clean Energy, Department of Chemistry, University of Science and Technology of China, Hefei, Anhui, 230026, China.
| | - Hu Ding
- School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, 230026, China.
| |
Collapse
|
11
|
Zhang Y, Liu M, Zheng X, Gao L, Wan L, Cheng D, Chen F. Flow chemistry-enabled asymmetric synthesis of cyproterone acetate in a chemo-biocatalytic approach. Nat Commun 2025; 16:1064. [PMID: 39870623 PMCID: PMC11772765 DOI: 10.1038/s41467-025-56371-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Accepted: 01/15/2025] [Indexed: 01/29/2025] Open
Abstract
Flow chemistry has many advantages over batch synthesis of organic small-molecules in terms of environmental compatibility, safety and synthetic efficiency when scale-up is considered. Herein, we report the 10-step chemo-biocatalytic continuous flow asymmetric synthesis of cyproterone acetate (4) in which 10 transformations are combined into a telescoped flow linear sequence from commercially available 4-androstene-3, 17-dione (11). This integrated one-flow synthesis features an engineered 3-ketosteroid-Δ1-dehydrogenase (ReM2)-catalyzed Δ1-dehydrogenation to form the C1, C2-double bond of A ring, a substrate-controlled Co-catalyzed Mukaiyama hydration of 9 to forge the crucial chiral C17α-OH group of D ring with excellent stereoselectivity, and a rapid flow Corey-Chaykovsky cyclopropanation of 7 to build the cyclopropyl core of A ring. By strategic use of these three key reactions and fully continuous-flow operations, cyproterone acetate (4) is produced in an overall yield of 9.6% in 3 h of total reaction time, this is the highest total number of chemical transformation performance in any other continuous-flow synthesis reported to date.
Collapse
Affiliation(s)
- Yajiao Zhang
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Minjie Liu
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Xianjing Zheng
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Liang Gao
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Li Wan
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Dang Cheng
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China
| | - Fener Chen
- Engineering Center of Catalysis and Synthesis for Chiral Molecules, Department of Chemistry, Fudan University, Shanghai, 200433, China.
- Institute of Flow Chemistry and Engineering, School of Chemistry and Materials, Jiangxi Normal University, Nanchang, 330022, China.
| |
Collapse
|
12
|
Wang H, Li F, Yang W, Wang Y, Miskevich AA, Loiko VA, Zhang L, Tao S. Impact of Adding N-hexylamine to Nickel Metallophotoredox C-N Coupling to Form Diarylamines. J Org Chem 2025; 90:1233-1244. [PMID: 39787300 DOI: 10.1021/acs.joc.4c02106] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2025]
Abstract
The mechanistic understanding of how alkylamines impact Ni-metallophotoredox C-N coupling to form diarylamines remains unclear. In this study, 12-alkylamines were evaluated as additives to determine their effects on the synthesis of diarylamines in a flow photochemical system. Notably, n-hexylamine demonstrated the most significant promotional effect. Spectroscopic studies and experimental data reveal n-hexylamine substitutes DABCO as a Ni catalyst ligand, enhancing yields particularly in sterically hindered arylamines.
Collapse
Affiliation(s)
- Haiyang Wang
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| | - Fujun Li
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| | - Wenbo Yang
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| | - Yuchao Wang
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| | - Alexander A Miskevich
- Institute of Physics, National Academy of Sciences of Belarus, 68-2 Niezalezhnastsi avenue, Minsk 220072, Belarus
| | - Valery A Loiko
- Institute of Physics, National Academy of Sciences of Belarus, 68-2 Niezalezhnastsi avenue, Minsk 220072, Belarus
| | - Lijing Zhang
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| | - Shengyang Tao
- School of Chemistry, State Key Laboratory of Fine Chemicals, Frontier Science Center for Smart Materials, Dalian Key Laboratory of Intelligent Chemistry, Dalian University of Technology, Dalian 116024, China
| |
Collapse
|
13
|
Kevlishvili I, St Michel RG, Garrison AG, Toney JW, Adamji H, Jia H, Román-Leshkov Y, Kulik HJ. Leveraging natural language processing to curate the tmCAT, tmPHOTO, tmBIO, and tmSCO datasets of functional transition metal complexes. Faraday Discuss 2025; 256:275-303. [PMID: 39301698 DOI: 10.1039/d4fd00087k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/22/2024]
Abstract
The breadth of transition metal chemical space covered by databases such as the Cambridge Structural Database and the derived computational database tmQM is not conducive to application-specific modeling and the development of structure-property relationships. Here, we employ both supervised and unsupervised natural language processing (NLP) techniques to link experimentally synthesized compounds in the tmQM database to their respective applications. Leveraging NLP models, we curate four distinct datasets: tmCAT for catalysis, tmPHOTO for photophysical activity, tmBIO for biological relevance, and tmSCO for magnetism. Analyzing the chemical substructures within each dataset reveals common chemical motifs in each of the designated applications. We then use these common chemical structures to augment our initial datasets for each application, yielding a total of 21 631 compounds in tmCAT, 4599 in tmPHOTO, 2782 in tmBIO, and 983 in tmSCO. These datasets are expected to accelerate the more targeted computational screening and development of refined structure-property relationships with machine learning.
Collapse
Affiliation(s)
- Ilia Kevlishvili
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Roland G St Michel
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Aaron G Garrison
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Jacob W Toney
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Husain Adamji
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| | - Haojun Jia
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Yuriy Román-Leshkov
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
14
|
Velasco PQ, Hippalgaonkar K, Ramalingam B. Emerging trends in the optimization of organic synthesis through high-throughput tools and machine learning. Beilstein J Org Chem 2025; 21:10-38. [PMID: 39811684 PMCID: PMC11730176 DOI: 10.3762/bjoc.21.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 11/26/2024] [Indexed: 01/16/2025] Open
Abstract
The discovery of the optimal conditions for chemical reactions is a labor-intensive, time-consuming task that requires exploring a high-dimensional parametric space. Historically, the optimization of chemical reactions has been performed by manual experimentation guided by human intuition and through the design of experiments where reaction variables are modified one at a time to find the optimal conditions for a specific reaction outcome. Recently, a paradigm change in chemical reaction optimization has been enabled by advances in lab automation and the introduction of machine learning algorithms. Therein, multiple reaction variables can be synchronously optimized to obtain the optimal reaction conditions, requiring a shorter experimentation time and minimal human intervention. Herein, we review the currently used state-of-the-art high-throughput automated chemical reaction platforms and machine learning algorithms that drive the optimization of chemical reactions, highlighting the limitations and future opportunities of this new field of research.
Collapse
Affiliation(s)
- Pablo Quijano Velasco
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
| | - Kedar Hippalgaonkar
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Department of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Republic of Singapore
- Institute for Functional Intelligent Materials, National University of Singapore, 4 Science Drive 2, Singapore 117544, Republic of Singapore
| | - Balamurugan Ramalingam
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science Technology and Research (A*STAR), 1 Pesek Road, Jurong Island, Singapore 627833, Republic of Singapore
| |
Collapse
|
15
|
Zhang C, Lin Q, Zhu B, Yang H, Lian X, Deng H, Zheng J, Liao K. SynAsk: unleashing the power of large language models in organic synthesis. Chem Sci 2024; 16:43-56. [PMID: 39600494 PMCID: PMC11587532 DOI: 10.1039/d4sc04757e] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2024] [Accepted: 11/11/2024] [Indexed: 11/29/2024] Open
Abstract
The field of natural language processing (NLP) has witnessed a transformative shift with the emergence of large language models (LLMs), revolutionizing various language tasks and applications, and the integration of LLMs into specialized domains enhances their capabilities for domain-specific applications. Notably, NLP has made significant strides in organic chemistry, particularly in predicting synthetic tasks, paving the way for the development of LLMs tailored to the organic chemistry field. In this work, we introduce SynAsk, a comprehensive organic chemistry domain-specific LLM platform developed by AIChemEco Inc. By fine-tuning an LLM with domain-specific data and integrating it with a chain of thought approach, SynAsk seamlessly accesses our knowledge base and advanced chemistry tools in a question-and-answer format. This includes functionalities such as a basic chemistry knowledge base, molecular information retrieval, reaction performance prediction, retrosynthesis prediction, chemical literature acquisition, and more. This novel methodology synergizes fine-tuning techniques with external resource integration, resulting in an organic chemistry-specific model poised to facilitate research and discovery in the field. Accessible at https://synask.aichemeco.com, SynAsk represents a significant advancement in leveraging NLP for synthetic applications.
Collapse
Affiliation(s)
- Chonghuan Zhang
- Guangzhou National Laboratory Guangzhou Guangdong 510005 PR China
| | - Qianghua Lin
- Guangzhou National Laboratory Guangzhou Guangdong 510005 PR China
| | - Biwei Zhu
- AIChemEco Inc. Guangzhou Guangdong 510005 PR China
| | - Haopeng Yang
- AIChemEco Inc. Guangzhou Guangdong 510005 PR China
| | - Xiao Lian
- AIChemEco Inc. Guangzhou Guangdong 510005 PR China
| | - Hao Deng
- AIChemEco Inc. Guangzhou Guangdong 510005 PR China
| | - Jiajun Zheng
- AIChemEco Inc. Guangzhou Guangdong 510005 PR China
| | - Kuangbiao Liao
- Guangzhou National Laboratory Guangzhou Guangdong 510005 PR China
| |
Collapse
|
16
|
Rial-Rodríguez E, Williams JD, Cantillo D, Fuchß T, Sommer A, Eggenweiler HM, Kappe CO, Laudadio G. An Automated Electrochemical Flow Platform to Accelerate Library Synthesis and Reaction Optimization. Angew Chem Int Ed Engl 2024; 63:e202412045. [PMID: 39317660 PMCID: PMC11627123 DOI: 10.1002/anie.202412045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 09/18/2024] [Accepted: 09/23/2024] [Indexed: 09/26/2024]
Abstract
Automated batch and flow setups are well-established for high throughput experimentation in both thermal chemistry and photochemistry. However, the development of automated electrochemical platforms is hindered by cell miniaturization challenges in batch and difficulties in designing effective single-pass flow systems. In order to address these issues, we have designed and implemented a new, slug-based automated electrochemical flow platform. This platform was successfully demonstrated for electrochemical C-N cross-couplings of E3 ligase binders with diverse amines (44 examples), which were subsequently transferred to a continuous-flow mode for confirmation and isolation, showing its applicability for medicinal chemistry purposes. To further validate the versatility of the platform, Design of Experiments (DoE) optimization was performed for an unsuccessful library target. This optimization process, fully automated by the platform, resulted in a remarkable 6-fold increase in reaction yield.
Collapse
Affiliation(s)
- Eduardo Rial-Rodríguez
- Institute of Chemistry, NAWI Graz,. Department, University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW), Research Center Pharmaceutical Engineering GmbH (RCPE), Inffeldgasse 13, 8010, Graz, Austria
| | - Jason D Williams
- Institute of Chemistry, NAWI Graz,. Department, University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW), Research Center Pharmaceutical Engineering GmbH (RCPE), Inffeldgasse 13, 8010, Graz, Austria
| | - David Cantillo
- Institute of Chemistry, NAWI Graz,. Department, University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW), Research Center Pharmaceutical Engineering GmbH (RCPE), Inffeldgasse 13, 8010, Graz, Austria
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, Queensland, 4072, Australia
| | - Thomas Fuchß
- Medicinal Chemistry and Drug Design, Merck Healthcare KGaA, Frankfurter Strasse 250, 64293, Darmstadt, Germany
| | - Alena Sommer
- Medicinal Chemistry and Drug Design, Merck Healthcare KGaA, Frankfurter Strasse 250, 64293, Darmstadt, Germany
| | - Hans-Michael Eggenweiler
- Medicinal Chemistry and Drug Design, Merck Healthcare KGaA, Frankfurter Strasse 250, 64293, Darmstadt, Germany
| | - C Oliver Kappe
- Institute of Chemistry, NAWI Graz,. Department, University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW), Research Center Pharmaceutical Engineering GmbH (RCPE), Inffeldgasse 13, 8010, Graz, Austria
| | - Gabriele Laudadio
- Institute of Chemistry, NAWI Graz,. Department, University of Graz, Heinrichstrasse 28, 8010, Graz, Austria
- Center for Continuous Flow Synthesis and Processing (CCFLOW), Research Center Pharmaceutical Engineering GmbH (RCPE), Inffeldgasse 13, 8010, Graz, Austria
| |
Collapse
|
17
|
Huang KH, Morato N, Feng Y, Toney A, Cooks RG. Rapid Exploration of Chemical Space by High-Throughput Desorption Electrospray Ionization Mass Spectrometry. J Am Chem Soc 2024; 146:33112-33120. [PMID: 39561979 PMCID: PMC11622223 DOI: 10.1021/jacs.4c11037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2024] [Revised: 10/28/2024] [Accepted: 10/29/2024] [Indexed: 11/21/2024]
Abstract
This study leverages accelerated reactions at the solution/air interface of microdroplets generated by desorption electrospray ionization (DESI) to explore the chemical space. DESI is utilized to synthesize drug analogs at an overall rate of 1 reaction mixture per second, working on the low-nanogram scale. Transformations of multiple drug molecules at specific functionalities (phenol, hydroxyl, amino, carbonyl, phenyl, thiophenyl, and alkenyl) are achieved using electrophilic/nucleophilic, redox, C-H functionalization, and coupling reactions. These transformations occur under ambient conditions on the millisecond time scale with direct detection of products being successful in all but three of the reaction types studied. The large scope (22 bioactive compounds, >20 chemical transformations, and >300 functionalization reagents) and high speed (>3000 reactions/hour) provide access to a wide array of drug analogs that can be used for bioactivity testing. A total of ∼6800 unique reactions were examined through a data-driven workflow, and more than 3000 unique derivatives (∼44%) were identified tentatively by the m/z value and signal-to-control ratio in single-stage mass spectrometry (MS) analysis, with over 1000 being further characterized by tandem MS. The speed of the DESI-MS reaction screen provides potential advantages for emerging machine learning-based predictions of organic synthesis, and it sets the stage for future online DESI-MS bioassays and scaled-up microdroplet synthesis before formal characterization of hit compounds is sought using traditional methods of drug discovery.
Collapse
Affiliation(s)
- Kai-Hung Huang
- Department of Chemistry, Purdue University, West Lafayette, Indiana 47907, United States
| | - Nicolás
M. Morato
- Department of Chemistry, Purdue University, West Lafayette, Indiana 47907, United States
| | - Yunfei Feng
- Department of Chemistry, Purdue University, West Lafayette, Indiana 47907, United States
| | - Alexis Toney
- Department of Chemistry, Purdue University, West Lafayette, Indiana 47907, United States
| | - R. Graham Cooks
- Department of Chemistry, Purdue University, West Lafayette, Indiana 47907, United States
| |
Collapse
|
18
|
Ruan Y, Lu C, Xu N, He Y, Chen Y, Zhang J, Xuan J, Pan J, Fang Q, Gao H, Shen X, Ye N, Zhang Q, Mo Y. An automatic end-to-end chemical synthesis development platform powered by large language models. Nat Commun 2024; 15:10160. [PMID: 39580482 PMCID: PMC11585555 DOI: 10.1038/s41467-024-54457-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2024] [Accepted: 11/07/2024] [Indexed: 11/25/2024] Open
Abstract
The rapid emergence of large language model (LLM) technology presents promising opportunities to facilitate the development of synthetic reactions. In this work, we leveraged the power of GPT-4 to build an LLM-based reaction development framework (LLM-RDF) to handle fundamental tasks involved throughout the chemical synthesis development. LLM-RDF comprises six specialized LLM-based agents, including Literature Scouter, Experiment Designer, Hardware Executor, Spectrum Analyzer, Separation Instructor, and Result Interpreter, which are pre-prompted to accomplish the designated tasks. A web application with LLM-RDF as the backend was built to allow chemist users to interact with automated experimental platforms and analyze results via natural language, thus, eliminating the need for coding skills and ensuring accessibility for all chemists. We demonstrated the capabilities of LLM-RDF in guiding the end-to-end synthesis development process for the copper/TEMPO catalyzed aerobic alcohol oxidation to aldehyde reaction, including literature search and information extraction, substrate scope and condition screening, reaction kinetics study, reaction condition optimization, reaction scale-up and product purification. Furthermore, LLM-RDF's broader applicability and versability was validated on various synthesis tasks of three distinct reactions (SNAr reaction, photoredox C-C cross-coupling reaction, and heterogeneous photoelectrochemical reaction).
Collapse
Affiliation(s)
- Yixiang Ruan
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Chenyin Lu
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Ning Xu
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Yuchen He
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Yixin Chen
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Jian Zhang
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Jun Xuan
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
| | - Jianzhang Pan
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
- Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, 310058, China
| | - Qun Fang
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
- Institute of Microanalytical Systems, Department of Chemistry, Zhejiang University, Hangzhou, 310058, China
| | - Hanyu Gao
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Hong Kong, 999077, China
| | - Xiaodong Shen
- Chemical & Analytical Development, Suzhou Novartis Technical Development Co. Ltd., Changshu, 215537, China
| | - Ning Ye
- Rezubio Pharmaceuticals Co. Ltd., Zhuhai, 519070, China
| | - Qiang Zhang
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China
- College of Computer Science and Technology, Zhejiang University, Hangzhou, 310027, China
| | - Yiming Mo
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou, 310027, China.
- Zhejiang-Hong Kong Joint Laboratory for Intelligent Molecule and Material Design and Synthesis, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, 311215, China.
| |
Collapse
|
19
|
Gesmundo NJ, Rago AJ, Young JM, Keess S, Wang Y. At the Speed of Light: The Systematic Implementation of Photoredox Cross-Coupling Reactions for Medicinal Chemistry Research. J Org Chem 2024; 89:16070-16092. [PMID: 38442262 DOI: 10.1021/acs.joc.3c02351] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
The adoption of new and emerging techniques in organic synthesis is essential to promote innovation in drug discovery. In this Perspective, we detail the strategy we used for the systematic deployment of photoredox-mediated, metal-catalyzed cross-coupling reactions in AbbVie's medicinal chemistry organization, focusing on topics such as assessment, evaluation, implementation, and accessibility. The comprehensive evaluation of photoredox reaction setups and published methods will be discussed, along with internal efforts to build expertise and photoredox high-throughput experimentation capabilities. We also highlight AbbVie's academic-industry collaborations in this field that have been leveraged to develop new synthetic strategies, along with discussing the internal adoption of photoredox cross-coupling reactions. The work described herein has culminated in robust photocatalysis and cross-coupling capabilities which are viewed as key platforms for medicinal chemistry research at AbbVie.
Collapse
Affiliation(s)
- Nathan J Gesmundo
- Advanced Chemistry Technologies Group, Small Molecule Therapeutics & Platform Technologies, AbbVie, Inc., 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Alexander J Rago
- Advanced Chemistry Technologies Group, Small Molecule Therapeutics & Platform Technologies, AbbVie, Inc., 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Jonathon M Young
- Advanced Chemistry Technologies Group, Small Molecule Therapeutics & Platform Technologies, AbbVie, Inc., 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| | - Sebastian Keess
- Global Medicinal Chemistry, Small Molecule Therapeutics & Platform Technologies, AbbVie Deutschland GmbH & Co. KG, 67061 Ludwigshafen, Germany
| | - Ying Wang
- Advanced Chemistry Technologies Group, Small Molecule Therapeutics & Platform Technologies, AbbVie, Inc., 1 North Waukegan Road, North Chicago, Illinois 60064, United States
| |
Collapse
|
20
|
Husbands DR, Tanner T, Whitwood AC, Hodnett NS, Wheelhouse KMP, Fairlamb IJS. The ubiquitous P( o-tol) 3 ligand promotes formation of catalytically-active higher order palladacyclic clusters. Chem Sci 2024:d4sc05346j. [PMID: 39464606 PMCID: PMC11497114 DOI: 10.1039/d4sc05346j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Accepted: 10/14/2024] [Indexed: 10/29/2024] Open
Abstract
The Herrmann-Beller catalyst, Pd[(C^P)(μ2-OAc)]2, is readily formed by reaction of the cyclic trimer of 'Pd(OAc)2' with P(o-tol)3. In the presence of hydroxide, Pd(C^P)(μ2-OAc)]2 converts to [Pd(C^P)(μ 2 -OH)]2. Here, we report how this activated Pd precatalyst species, and related species, serve as a conduit for formation of higher order Pd n clusters containing multiple cyclopalladated P(o-tol)3 ligands. The catalytic competency of a Pd4-palladacyclic cluster is demonstrated in an arylated Heck cross-coupling, which is comparable to the base-activated form of Herrmann's catalyst, namely [Pd(C^P)(μ2-OH)]2. The findings show that 'simple' ubiquitous phosphine ligands can promote higher order Pd speciation, moving beyond well-known phosphine-ligated Pd1 and Pd2 complexes. The findings challenge the status quo in the field, in that phosphine ligands can ligate higher order Pd n species which are catalytically competent species in cross-coupling reactions.
Collapse
Affiliation(s)
- David R Husbands
- Department of Chemistry, University of York Heslington York YO10 5DD UK
| | - Theo Tanner
- Department of Chemistry, University of York Heslington York YO10 5DD UK
| | - Adrian C Whitwood
- Department of Chemistry, University of York Heslington York YO10 5DD UK
| | - Neil S Hodnett
- Medicine Development & Supply, GSK Medicines Research Centre Gunnels Wood Road, Stevenage Hertfordshire SG1 2NY UK
| | - Katherine M P Wheelhouse
- Medicine Development & Supply, GSK Medicines Research Centre Gunnels Wood Road, Stevenage Hertfordshire SG1 2NY UK
| | - Ian J S Fairlamb
- Department of Chemistry, University of York Heslington York YO10 5DD UK
| |
Collapse
|
21
|
Lu JM, Wang HF, Guo QH, Wang JW, Li TT, Chen KX, Zhang MT, Chen JB, Shi QN, Huang Y, Shi SW, Chen GY, Pan JZ, Lu Z, Fang Q. Roboticized AI-assisted microfluidic photocatalytic synthesis and screening up to 10,000 reactions per day. Nat Commun 2024; 15:8826. [PMID: 39396057 PMCID: PMC11470948 DOI: 10.1038/s41467-024-53204-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Accepted: 10/04/2024] [Indexed: 10/14/2024] Open
Abstract
The current throughput of conventional organic chemical synthesis is usually a few experiments for each operator per day. We develop a robotic system for ultra-high-throughput chemical synthesis, online characterization, and large-scale condition screening of photocatalytic reactions, based on the liquid-core waveguide, microfluidic liquid-handling, and artificial intelligence techniques. The system is capable of performing automated reactant mixture preparation, changing, introduction, ultra-fast photocatalytic reactions in seconds, online spectroscopic detection of the reaction product, and screening of different reaction conditions. We apply the system in large-scale screening of 12,000 reaction conditions of a photocatalytic [2 + 2] cycloaddition reaction including multiple continuous and discrete variables, reaching an ultra-high throughput up to 10,000 reaction conditions per day. Based on the data, AI-assisted cross-substrate/photocatalyst prediction is conducted.
Collapse
Affiliation(s)
- Jia-Min Lu
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Hui-Feng Wang
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Qi-Hang Guo
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Jian-Wei Wang
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Tong-Tong Li
- Department of Chemistry, Zhejiang University, Hangzhou, China
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Ke-Xin Chen
- The Research Center for Life Sciences Computing, Zhejiang Lab, Hangzhou, China
- Department of Computer Science and Engineering, The Chinese University of Hong Kong, New Territories, Hong Kong, China
| | - Meng-Ting Zhang
- Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Jian-Bo Chen
- Department of Chemistry, Zhejiang University, Hangzhou, China
| | - Qian-Nuan Shi
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Yi Huang
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Shao-Wen Shi
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China
| | - Guang-Yong Chen
- The Research Center for Life Sciences Computing, Zhejiang Lab, Hangzhou, China.
| | - Jian-Zhang Pan
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.
| | - Zhan Lu
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Center of Chemistry for Frontier Technologies, Department of Chemistry, Zhejiang University, Hangzhou, China.
| | - Qun Fang
- Department of Chemistry, Zhejiang University, Hangzhou, China.
- Institute of Intelligent Chemical Manufacturing and iChemFoundry Platform, Engineering Research Center of Functional Materials Intelligent Manufacturing of Zhejiang Province, ZJU-Hangzhou Global Scientific and Technological Innovation Center, Hangzhou, China.
- Key Laboratory of Excited-State Materials of Zhejiang Province, Zhejiang University, Hangzhou, China.
| |
Collapse
|
22
|
Chen LY, Li YP. Machine learning-guided strategies for reaction conditions design and optimization. Beilstein J Org Chem 2024; 20:2476-2492. [PMID: 39376489 PMCID: PMC11457048 DOI: 10.3762/bjoc.20.212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 09/19/2024] [Indexed: 10/09/2024] Open
Abstract
This review surveys the recent advances and challenges in predicting and optimizing reaction conditions using machine learning techniques. The paper emphasizes the importance of acquiring and processing large and diverse datasets of chemical reactions, and the use of both global and local models to guide the design of synthetic processes. Global models exploit the information from comprehensive databases to suggest general reaction conditions for new reactions, while local models fine-tune the specific parameters for a given reaction family to improve yield and selectivity. The paper also identifies the current limitations and opportunities in this field, such as the data quality and availability, and the integration of high-throughput experimentation. The paper demonstrates how the combination of chemical engineering, data science, and ML algorithms can enhance the efficiency and effectiveness of reaction conditions design, and enable novel discoveries in synthetic chemistry.
Collapse
Affiliation(s)
- Lung-Yi Chen
- Department of Chemical Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan
| | - Yi-Pei Li
- Department of Chemical Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan
- Taiwan International Graduate Program on Sustainable Chemical Science and Technology (TIGP-SCST), No. 128, Sec. 2, Academia Road, Taipei 11529, Taiwan
| |
Collapse
|
23
|
Han Y, Deng M, Liu K, Chen J, Wang Y, Xu YN, Dian L. Computer-Aided Synthesis Planning (CASP) and Machine Learning: Optimizing Chemical Reaction Conditions. Chemistry 2024; 30:e202401626. [PMID: 39083362 DOI: 10.1002/chem.202401626] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 07/27/2024] [Accepted: 07/28/2024] [Indexed: 08/02/2024]
Abstract
Computer-aided synthesis planning (CASP) has garnered increasing attention in light of recent advancements in machine learning models. While the focus is on reverse synthesis or forward outcome prediction, optimizing reaction conditions remains a significant challenge. For datasets with multiple variables, the choice of descriptors and models is pivotal. This selection dictates the effective extraction of conditional features and the achievement of higher prediction accuracy. This review delineates the origins of data in conditional optimization, the criteria for descriptor selection, the response models, and the metrics for outcome evaluation, aiming to acquaint readers with the latest research trends and facilitate more informed research in this domain.
Collapse
Affiliation(s)
- Yu Han
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Mingjing Deng
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Ke Liu
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Jia Chen
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Yuting Wang
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Yu-Ning Xu
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
| | - Longyang Dian
- State Key Laboratory of Microbial Technology, Institute of Microbial Technology, Shandong University, No. 72 Binhai Avenue, Qingdao, 266237, P. R. China
- Suzhou Institute of Shandong University, No. 388 Ruoshui Road, Suzhou Industrial Park, Suzhou, 215123, P. R. China
| |
Collapse
|
24
|
Sato A, Asahara R, Miyao T. Chemical Graph-Based Transformer Models for Yield Prediction of High-Throughput Cross-Coupling Reaction Datasets. ACS OMEGA 2024; 9:40907-40919. [PMID: 39372005 PMCID: PMC11447720 DOI: 10.1021/acsomega.4c06113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/02/2024] [Revised: 08/28/2024] [Accepted: 09/03/2024] [Indexed: 10/08/2024]
Abstract
The chemical reaction yield is an important factor to determine the reaction conditions. Recently, many data-driven models for yield prediction using high-throughput experimentation datasets have been reported. In this study, we propose a neural network architecture based on the chemical graphs of the reaction components to predict the reaction yield. The proposed model is the sequential combination of a message-passing neural network and a transformer encoder (MPNN-Transformer). The reaction components are converted to molecular matrices by the first network, followed by the interplay of the reaction components in the second network after adding the embeddings of the compound roles in the chemical reaction. The predictive ability of the proposed models was compared with state-of-the-art yield prediction models using two high-throughput experimental datasets: the Buchwald-Hartwig cross-coupling (BHC) and Suzuki-Miyaura cross-coupling (SMC) reaction datasets. Overall, the MPNN-Transformer models showed high prediction accuracy for the BHC reaction datasets and some of the extrapolation-oriented SMC reaction datasets. These models also performed well when the training dataset size was relatively large. Furthermore, analyzing the poorly predicted reactions for the BHC reaction dataset revealed a limitation of the data-driven yield prediction approach based on the chemical structural similarity.
Collapse
Affiliation(s)
- Akinori Sato
- Data
Science Center, Nara Institute of Science
and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
- Graduate
School of Science and Technology, Nara Institute
of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| | - Ryosuke Asahara
- Graduate
School of Science and Technology, Nara Institute
of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| | - Tomoyuki Miyao
- Data
Science Center, Nara Institute of Science
and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
- Graduate
School of Science and Technology, Nara Institute
of Science and Technology, 8916-5 Takayama-cho, Ikoma, Nara 630-0192, Japan
| |
Collapse
|
25
|
Schmid SP, Schlosser L, Glorius F, Jorner K. Catalysing (organo-)catalysis: Trends in the application of machine learning to enantioselective organocatalysis. Beilstein J Org Chem 2024; 20:2280-2304. [PMID: 39290209 PMCID: PMC11406055 DOI: 10.3762/bjoc.20.196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Accepted: 08/09/2024] [Indexed: 09/19/2024] Open
Abstract
Organocatalysis has established itself as a third pillar of homogeneous catalysis, besides transition metal catalysis and biocatalysis, as its use for enantioselective reactions has gathered significant interest over the last decades. Concurrent to this development, machine learning (ML) has been increasingly applied in the chemical domain to efficiently uncover hidden patterns in data and accelerate scientific discovery. While the uptake of ML in organocatalysis has been comparably slow, the last two decades have showed an increased interest from the community. This review gives an overview of the work in the field of ML in organocatalysis. The review starts by giving a short primer on ML for experimental chemists, before discussing its application for predicting the selectivity of organocatalytic transformations. Subsequently, we review ML employed for privileged catalysts, before focusing on its application for catalyst and reaction design. Concluding, we give our view on current challenges and future directions for this field, drawing inspiration from the application of ML to other scientific domains.
Collapse
Affiliation(s)
- Stefan P Schmid
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
| | - Leon Schlosser
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Frank Glorius
- Organisch-Chemisches Institut, Universität Münster, 48149 Münster, Germany
| | - Kjell Jorner
- Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences, ETH Zurich, Zurich CH-8093, Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, ETH Zurich, Zurich CH-8093, Switzerland
| |
Collapse
|
26
|
Husbands DR, Tanner T, Whitwood AC, Hodnett NS, Wheelhouse KMP, Fairlamb IJS. Revealing the Hidden Complexity and Reactivity of Palladacyclic Precatalysts: The P( o-tolyl) 3 Ligand Enables a Cocktail of Active Species Utilizing the Pd(II)/Pd(IV) and Pd(0)/Pd(II) Pathways for Efficient Catalysis. ACS Catal 2024; 14:12769-12782. [PMID: 39263545 PMCID: PMC11385352 DOI: 10.1021/acscatal.4c02585] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 07/22/2024] [Accepted: 07/22/2024] [Indexed: 09/13/2024]
Abstract
The ligand, P(o-tolyl)3, is ubiquitous in applied synthetic chemistry and catalysis, particularly in Pd-catalyzed processes, which typically include Pd(OAc)2 (most commonly used as Pd3(OAc)6) as a precatalyst. The Herrmann-Beller palladacycle [Pd(C^P)(μ2-OAc)]2 (where C^P = monocyclopalladated P(o-tolyl)3) is easily formed from reaction of Pd(OAc)2 with P(o-tolyl)3. The mechanisms by which this precatalyst system operates are inherently complex, with studies previously implicating Pd nanoparticles (PdNPs) as reservoirs for active Pd(0) species in arylative cross-coupling reactions. In this study, we reveal the fascinating, complex, and nontrivial behavior of the palladacyclic group. First, in the presence of hydroxide base, [Pd(C^P)(μ2-OAc)]2 is readily converted into an activated form, [Pd(C^P)(μ2-OH)]2, which serves as a conduit for activation to catalytically relevant species. Second, palladacyclization imparts unique stability for catalytic species under reaction conditions, bringing into play a Pd(II)/Pd(IV) cross-coupling mechanism. For a benchmark Suzuki-Miyaura cross-coupling (SMCC) reaction, there is a shift from a mononuclear Pd catalytic pathway to a PdNP-controlled catalytic pathway during the reaction. The activation pathway of [Pd(C^P)(μ2-OH)]2 has been studied using an arylphosphine-stabilized boronic acid and low-temperature NMR spectroscopic analysis, which sheds light on the preactivation step, with water and/or acid being critical for the formation of active Pd(0) and Pd(II) species. In situ reaction monitoring has demonstrated that there is a sensitivity to the structure of the arylboron species in the presence of pinacol. This work, taken together, highlights the mechanistic complexity accompanying the use of palladacyclic precatalyst systems. It builds on recent findings involving related Pd(OAc)2/PPh3 precatalyst systems which readily form higher order Pdn clusters and PdNPs under cross-coupling reaction conditions. Thus, generally, one needs to be cautious with the assumption that Pd(OAc)2/tertiary phosphine mixtures cleanly deliver mononuclear "Pd(0)Ln" species and that any assessment of individual phosphine ligands may need to be taken on a case-by-case basis.
Collapse
Affiliation(s)
- David R Husbands
- Department of Chemistry, University of York, York, Heslington YO10 5DD, United Kingdom
| | - Theo Tanner
- Department of Chemistry, University of York, York, Heslington YO10 5DD, United Kingdom
| | - Adrian C Whitwood
- Department of Chemistry, University of York, York, Heslington YO10 5DD, United Kingdom
| | - Neil S Hodnett
- Medicine Development & Supply, GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, U.K
| | - Katherine M P Wheelhouse
- Medicine Development & Supply, GSK Medicines Research Centre, Gunnels Wood Road, Stevenage, Hertfordshire SG1 2NY, U.K
| | - Ian J S Fairlamb
- Department of Chemistry, University of York, York, Heslington YO10 5DD, United Kingdom
| |
Collapse
|
27
|
Liang W, Zheng S, Shu Y, Huang J. Machine Learning Optimizing Enzyme/ZIF Biocomposites for Enhanced Encapsulation Efficiency and Bioactivity. JACS AU 2024; 4:3170-3182. [PMID: 39211601 PMCID: PMC11350574 DOI: 10.1021/jacsau.4c00485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/06/2024] [Revised: 07/31/2024] [Accepted: 08/01/2024] [Indexed: 09/04/2024]
Abstract
In this study, we present the first example of using a machine learning (ML)-assisted design strategy to optimize the synthesis formulation of enzyme/ZIFs (zeolitic imidazolate framework) for enhanced performance. Glucose oxidase (GOx) and horseradish peroxidase (HRP) were chosen as model enzymes, while Zn(eIM)2 (eIM = 2-ethylimidazolate) was selected as the model ZIF to test our ML-assisted workflow paradigm. Through an iterative ML-driven training-design-synthesis-measurement workflow, we efficiently discovered GOx/ZIF (G151) and HRP/ZIF (H150) with their overall performance index (OPI) values (OPI represents the product of encapsulation efficiency (E in %), retained enzymatic activity (A in %), and thermal stability (T in %)) at least 1.3 times higher than those in systematic seed data studies. Furthermore, advanced statistical methods derived from the trained random forest model qualitatively and quantitatively reveal the relationship among synthesis, structure, and performance in the enzyme/ZIF system, offering valuable guidance for future studies on enzyme/ZIFs. Overall, our proposed ML-assisted design strategy holds promise for accelerating the development of enzyme/ZIFs and other enzyme immobilization systems for biocatalysis applications and beyond, including drug delivery and sensing, among others.
Collapse
Affiliation(s)
- Weibin Liang
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| | | | - Ying Shu
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| | - Jun Huang
- School of Chemical and Biomolecular
Engineering, The University of Sydney, Darlington, NSW 2008, Australia
| |
Collapse
|
28
|
Sato T, Masuda K, Sano C, Matsumoto K, Numata H, Munetoh S, Kasama T, Miyake R. Democratizing Microreactor Technology for Accelerated Discoveries in Chemistry and Materials Research. MICROMACHINES 2024; 15:1064. [PMID: 39337724 PMCID: PMC11434323 DOI: 10.3390/mi15091064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 08/11/2024] [Accepted: 08/14/2024] [Indexed: 09/30/2024]
Abstract
Microreactor technologies have emerged as versatile platforms with the potential to revolutionize chemistry and materials research, offering sustainable solutions to global challenges in environmental and health domains. This survey paper provides an in-depth review of recent advancements in microreactor technologies, focusing on their role in facilitating accelerated discoveries in chemistry and materials. Specifically, we examine the convergence of microfluidics with machine intelligence and automation, enabling the exploitation of the cyber-physical environment as a highly integrated experimentation platform for rapid scientific discovery and process development. We investigate the applicability and limitations of microreactor-enabled discovery accelerators in various chemistry and materials contexts. Despite their tremendous potential, the integration of machine intelligence and automation into microreactor-based experiments presents challenges in establishing fully integrated, automated, and intelligent systems. These challenges can hinder the broader adoption of microreactor technologies within the research community. To address this, we review emerging technologies that can help lower barriers and facilitate the implementation of microreactor-enabled discovery accelerators. Lastly, we provide our perspective on future research directions for democratizing microreactor technologies, with the aim of accelerating scientific discoveries and promoting widespread adoption of these transformative platforms.
Collapse
Affiliation(s)
- Tomomi Sato
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Koji Masuda
- Department of Physics and Astronomy, University of Exeter, Exeter EX4 4QL, UK;
| | - Chikako Sano
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Keiji Matsumoto
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Hidetoshi Numata
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Seiji Munetoh
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Toshihiro Kasama
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Ryo Miyake
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| |
Collapse
|
29
|
Sun Y, Zhao Y, Xie X, Li H, Feng W. Printed polymer platform empowering machine-assisted chemical synthesis in stacked droplets. Nat Commun 2024; 15:6759. [PMID: 39117641 PMCID: PMC11310347 DOI: 10.1038/s41467-024-50768-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2023] [Accepted: 07/19/2024] [Indexed: 08/10/2024] Open
Abstract
Efficiently exploring organic molecules through multi-step processes demands a transition from conventional laboratory synthesis to automated systems. Existing platforms for machine-assistant synthetic workflows compatible with multiple liquid-phases require substantial engineering investments for setup, thereby hindering quick customization and throughput increasement. Here we present a droplet-based chip that facilitates the self-organization of various liquid phases into stacked layers for conducting chemical transformations. The chip's precision polymer printing capability, enabled by digital micromirror device (DMD)-maskless photolithography and dual post-chemical modifications, allows it to create customized, sub-10 µm featured patterns to confine diverse liquids, regardless of density, within each droplet. The robustness and open design of surface-templated liquid layers actualize machine-assistant droplet manipulation, synchronous reaction triggering, local oscillation, and real-time monitoring of individual layers into a reality. We propose that, with further integration of machine operation line and self-learning, this droplet-based platform holds the potential to become a valuable addition to the toolkit of chemistry process, operating autonomously and with high-throughput.
Collapse
Affiliation(s)
- Yingxue Sun
- College of Polymer Science and Engineering, Sichuan University, Chengdu, China
| | - Yuanyi Zhao
- College of Polymer Science and Engineering, Sichuan University, Chengdu, China
| | - Xinjian Xie
- College of Polymer Science and Engineering, Sichuan University, Chengdu, China
| | - Hongjiao Li
- College of Chemical Engineering, Sichuan University, Chengdu, China
| | - Wenqian Feng
- College of Polymer Science and Engineering, Sichuan University, Chengdu, China.
- Department State Key Laboratory of Polymer Materials Engineering, Sichuan University, Chengdu, China.
| |
Collapse
|
30
|
Qiu L, Li X, Holden DT, Cooks RG. Reaction acceleration at the surface of a levitated droplet by vapor dosing from a partner droplet. Chem Sci 2024; 15:12277-12283. [PMID: 39118618 PMCID: PMC11304536 DOI: 10.1039/d4sc03528c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Accepted: 06/30/2024] [Indexed: 08/10/2024] Open
Abstract
Chemical reactions in micrometer-sized droplets can be accelerated by up to six orders of magnitude. However, this acceleration factor (ratio of rate constants relative to bulk) drops to less than 10 for millimeter-sized droplets due to the reduction in surface/volume ratio. To enhance the acceleration in millimeter-sized droplets, we use a new synthesis platform that directly doses reagent vapor onto the reaction droplet surface from a second levitated droplet. Using Katritzky transamination as a model reaction, we made quantitative measurements on size-controlled vapor-dosed droplets, revealing a 31-fold increase in reaction rate constants when examining the entire droplet contents. This enhancement is attributed to a greater reaction rate constant in the droplet surface region (estimated as 105 times greater than that for the bulk). The capability for substantial reaction acceleration in large droplets highlights the potential for rapid synthesis of important chemicals at useful scales. For example, we successfully prepared 23 pyridinium salts within minutes. This efficiency positions droplets as an exceptional platform for rapid, in situ catalyst synthesis. This is illustrated by the preparation of pyridinium salts as photocatalysts and their subsequent use in mediation of amine oxidation both within the same droplet.
Collapse
Affiliation(s)
- Lingqi Qiu
- Department of Chemistry, Purdue University 560 Oval Drive West Lafayette Indiana 47907 USA
| | - Xilai Li
- Department of Chemistry, Purdue University 560 Oval Drive West Lafayette Indiana 47907 USA
| | - Dylan T Holden
- Department of Chemistry, Purdue University 560 Oval Drive West Lafayette Indiana 47907 USA
| | - R Graham Cooks
- Department of Chemistry, Purdue University 560 Oval Drive West Lafayette Indiana 47907 USA
| |
Collapse
|
31
|
Das M, Ghosh A, Sunoj RB. Advances in machine learning with chemical language models in molecular property and reaction outcome predictions. J Comput Chem 2024; 45:1160-1176. [PMID: 38299229 DOI: 10.1002/jcc.27315] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Revised: 01/06/2024] [Accepted: 01/09/2024] [Indexed: 02/02/2024]
Abstract
Molecular properties and reactions form the foundation of chemical space. Over the years, innumerable molecules have been synthesized, a smaller fraction of them found immediate applications, while a larger proportion served as a testimony to creative and empirical nature of the domain of chemical science. With increasing emphasis on sustainable practices, it is desirable that a target set of molecules are synthesized preferably through a fewer empirical attempts instead of a larger library, to realize an active candidate. In this front, predictive endeavors using machine learning (ML) models built on available data acquire high timely significance. Prediction of molecular property and reaction outcome remain one of the burgeoning applications of ML in chemical science. Among several methods of encoding molecular samples for ML models, the ones that employ language like representations are gaining steady popularity. Such representations would additionally help adopt well-developed natural language processing (NLP) models for chemical applications. Given this advantageous background, herein we describe several successful chemical applications of NLP focusing on molecular property and reaction outcome predictions. From relatively simpler recurrent neural networks (RNNs) to complex models like transformers, different network architecture have been leveraged for tasks such as de novo drug design, catalyst generation, forward and retro-synthesis predictions. The chemical language model (CLM) provides promising avenues toward a broad range of applications in a time and cost-effective manner. While we showcase an optimistic outlook of CLMs, attention is also placed on the persisting challenges in reaction domain, which would optimistically be addressed by advanced algorithms tailored to chemical language and with increased availability of high-quality datasets.
Collapse
Affiliation(s)
- Manajit Das
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, India
| | - Ankit Ghosh
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, India
| | - Raghavan B Sunoj
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, India
- Centre for Machine Intelligence and Data Science, Indian Institute of Technology Bombay, Mumbai, India
| |
Collapse
|
32
|
van Gerwen P, Briling KR, Calvino Alonso Y, Franke M, Corminboeuf C. Benchmarking machine-readable vectors of chemical reactions on computed activation barriers. DIGITAL DISCOVERY 2024; 3:932-943. [PMID: 38756222 PMCID: PMC11094696 DOI: 10.1039/d3dd00175j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Accepted: 02/28/2024] [Indexed: 05/18/2024]
Abstract
In recent years, there has been a surge of interest in predicting computed activation barriers, to enable the acceleration of the automated exploration of reaction networks. Consequently, various predictive approaches have emerged, ranging from graph-based models to methods based on the three-dimensional structure of reactants and products. In tandem, many representations have been developed to predict experimental targets, which may hold promise for barrier prediction as well. Here, we bring together all of these efforts and benchmark various methods (Morgan fingerprints, the DRFP, the CGR representation-based Chemprop, SLATMd, B2Rl2, EquiReact and language model BERT + RXNFP) for the prediction of computed activation barriers on three diverse datasets.
Collapse
Affiliation(s)
- Puck van Gerwen
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Ksenia R Briling
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Yannick Calvino Alonso
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Malte Franke
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| |
Collapse
|
33
|
Chen L, Gao Z, Zhang Y, Dai X, Meng F, Guo Y. A green, facile, and practical preparation of capsaicin derivatives with thiourea structure. Sci Rep 2024; 14:10576. [PMID: 38719947 PMCID: PMC11078945 DOI: 10.1038/s41598-024-61014-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Accepted: 04/30/2024] [Indexed: 05/12/2024] Open
Abstract
Capsaicin derivatives with thiourea structure (CDTS) is highly noteworthy owing to its higher analgesic potency in rodent models and higher agonism in vitro. However, the direct synthesis of CDTS remains t one or more shortcomings. In this study, we present reported a green, facile, and practical synthetic method of capsaicin derivatives with thiourea structure is developed by using an automated synthetic system, leading to a series of capsaicin derivatives with various electronic properties and functionalities in good to excellent yields.
Collapse
Affiliation(s)
- Lina Chen
- State Key Laboratory of NBC Protection for Civilian, Beijing, People's Republic of China
| | - Zhenhua Gao
- State Key Laboratory of NBC Protection for Civilian, Beijing, People's Republic of China
| | - Ye Zhang
- Sichuan University of Science and Engineering, Zigong, People's Republic of China
| | - Xiandong Dai
- State Key Laboratory of NBC Protection for Civilian, Beijing, People's Republic of China
| | - Fanhua Meng
- State Key Laboratory of NBC Protection for Civilian, Beijing, People's Republic of China
| | - Yongbiao Guo
- State Key Laboratory of NBC Protection for Civilian, Beijing, People's Republic of China.
| |
Collapse
|
34
|
Coin G, Jiang T, Bordi S, Nichols PL, Bode JW, Wanner BM. Automated, Capsule-Based Suzuki-Miyaura Cross Couplings. Org Lett 2024; 26:2708-2712. [PMID: 37126221 DOI: 10.1021/acs.orglett.3c01057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
The development of an automated process for Suzuki-Miyaura cross couplings is described, in which the complete reaction, workup, and product isolation are effected automatically with no user involvement, aside from loading of the starting materials and reaction capsule. This practical and simple method was successfully demonstrated to provide the desired biaryl products using a range of aryl bromides and boronic acids and is also effective for the late-stage functionalization of aryl halides in bioactive molecules.
Collapse
Affiliation(s)
- Guillaume Coin
- Synple Chem AG, Kemptpark 18, 8310 Kemptthal, Switzerland
- Laboratory of Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, 8093 Zürich, Switzerland
| | - Tuo Jiang
- Synple Chem AG, Kemptpark 18, 8310 Kemptthal, Switzerland
| | - Samuele Bordi
- Synple Chem AG, Kemptpark 18, 8310 Kemptthal, Switzerland
| | - Paula L Nichols
- Synple Chem AG, Kemptpark 18, 8310 Kemptthal, Switzerland
- Laboratory of Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, 8093 Zürich, Switzerland
| | - Jeffrey W Bode
- Laboratory of Organic Chemistry, Department of Chemistry and Applied Biosciences, ETH Zürich, 8093 Zürich, Switzerland
| | | |
Collapse
|
35
|
Wagner F, Sagmeister P, Jusner CE, Tampone TG, Manee V, Buono FG, Williams JD, Kappe CO. A Slug Flow Platform with Multiple Process Analytics Facilitates Flexible Reaction Optimization. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308034. [PMID: 38273711 PMCID: PMC10987115 DOI: 10.1002/advs.202308034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/21/2023] [Indexed: 01/27/2024]
Abstract
Flow processing offers many opportunities to optimize reactions in a rapid and automated manner, yet often requires relatively large quantities of input materials. To combat this, the use of a flexible slug flow reactor, equipped with two analytical instruments, for low-volume optimization experiments are reported. A Buchwald-Hartwig amination toward the drug olanzapine, with 6 independent optimizable variables, is optimized using three different automated approaches: self-optimization, design of experiments, and kinetic modeling. These approaches are complementary and provide differing information on the reaction: pareto optimal operating points, response surface models, and mechanistic models, respectively. The results are achieved using <10% of the material that would be required for standard flow operation. Finally, a chemometric model is built utilizing automated data handling and three subsequent validation experiments demonstrate good agreement between the slug flow reactor and a standard (larger scale) flow reactor.
Collapse
Affiliation(s)
- Florian Wagner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Peter Sagmeister
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Clemens E. Jusner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Thomas G. Tampone
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Vidhyadhar Manee
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Frederic G. Buono
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Jason D. Williams
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - C. Oliver Kappe
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| |
Collapse
|
36
|
King-Smith E, Berritt S, Bernier L, Hou X, Klug-McLeod JL, Mustakis J, Sach NW, Tucker JW, Yang Q, Howard RM, Lee AA. Probing the chemical 'reactome' with high-throughput experimentation data. Nat Chem 2024; 16:633-643. [PMID: 38168924 PMCID: PMC10997498 DOI: 10.1038/s41557-023-01393-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2022] [Accepted: 11/06/2023] [Indexed: 01/05/2024]
Abstract
High-throughput experimentation (HTE) has the potential to improve our understanding of organic chemistry by systematically interrogating reactivity across diverse chemical spaces. Notable bottlenecks include few publicly available large-scale datasets and the need for facile interpretation of these data's hidden chemical insights. Here we report the development of a high-throughput experimentation analyser, a robust and statistically rigorous framework, which is applicable to any HTE dataset regardless of size, scope or target reaction outcome, which yields interpretable correlations between starting material(s), reagents and outcomes. We improve the HTE data landscape with the disclosure of 39,000+ previously proprietary HTE reactions that cover a breadth of chemistry, including cross-coupling reactions and chiral salt resolutions. The high-throughput experimentation analyser was validated on cross-coupling and hydrogenation datasets, showcasing the elucidation of statistically significant hidden relationships between reaction components and outcomes, as well as highlighting areas of dataset bias and the specific reaction spaces that necessitate further investigation.
Collapse
Affiliation(s)
- Emma King-Smith
- Cavendish Laboratory, University of Cambridge, Cambridge, UK
| | | | | | - Xinjun Hou
- Pfizer Research and Development, Cambridge, MA, USA
| | | | | | - Neal W Sach
- Pfizer Research and Development, La Jolla, CA, USA
| | | | - Qingyi Yang
- Pfizer Research and Development, Cambridge, MA, USA
| | | | - Alpha A Lee
- Cavendish Laboratory, University of Cambridge, Cambridge, UK.
| |
Collapse
|
37
|
Sheng H, Sun J, Rodríguez O, Hoar BB, Zhang W, Xiang D, Tang T, Hazra A, Min DS, Doyle AG, Sigman MS, Costentin C, Gu Q, Rodríguez-López J, Liu C. Autonomous closed-loop mechanistic investigation of molecular electrochemistry via automation. Nat Commun 2024; 15:2781. [PMID: 38555303 PMCID: PMC10981680 DOI: 10.1038/s41467-024-47210-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 03/18/2024] [Indexed: 04/02/2024] Open
Abstract
Electrochemical research often requires stringent combinations of experimental parameters that are demanding to manually locate. Recent advances in automated instrumentation and machine-learning algorithms unlock the possibility for accelerated studies of electrochemical fundamentals via high-throughput, online decision-making. Here we report an autonomous electrochemical platform that implements an adaptive, closed-loop workflow for mechanistic investigation of molecular electrochemistry. As a proof-of-concept, this platform autonomously identifies and investigates an EC mechanism, an interfacial electron transfer (E step) followed by a solution reaction (C step), for cobalt tetraphenylporphyrin exposed to a library of organohalide electrophiles. The generally applicable workflow accurately discerns the EC mechanism's presence amid negative controls and outliers, adaptively designs desired experimental conditions, and quantitatively extracts kinetic information of the C step spanning over 7 orders of magnitude, from which mechanistic insights into oxidative addition pathways are gained. This work opens opportunities for autonomous mechanistic discoveries in self-driving electrochemistry laboratories without manual intervention.
Collapse
Affiliation(s)
- Hongyuan Sheng
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| | - Jingwen Sun
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Oliver Rodríguez
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, IL, 60439, USA
| | - Benjamin B Hoar
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Weitong Zhang
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Danlei Xiang
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Tianhua Tang
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | - Avijit Hazra
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | - Daniel S Min
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Abigail G Doyle
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | | | - Quanquan Gu
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Joaquín Rodríguez-López
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, IL, 60439, USA
| | - Chong Liu
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
38
|
Yu J, Liu J, Li C, Huang J, Zhu Y, You H. Recent advances and applications in high-throughput continuous flow. Chem Commun (Camb) 2024; 60:3217-3225. [PMID: 38436212 DOI: 10.1039/d3cc06180a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/05/2024]
Abstract
High-throughput continuous flow technology has emerged as a revolutionary approach in chemical synthesis, offering accelerated experimentation and improved efficiency. With the aid of process analytical technology and automation, this system not only enables rapid optimisation of reaction conditions at the millimole to the picomole scale, but also facilitates automated scale-up synthesis. It can even achieve the self-planning and self-synthesis of small drug molecules with artificial intelligence incorporated in the system. The versatility of the system is highlighted by its compatibility with both electrochemistry and photochemistry, and its significant applications in organic synthesis and drug discovery. This highlight summarises its recent developments and applications, emphasising its significant impact on advancing research across multiple disciplines.
Collapse
Affiliation(s)
- Jiaping Yu
- School of Science, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China.
| | - Jiaying Liu
- Institute of Advanced Technology of Heilongjiang Academy of Sciences, Harbin, 150000, China
| | - Chaoyi Li
- School of Science, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China.
| | - Junrong Huang
- School of Science, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China.
| | - Yuxiang Zhu
- School of Pharmaceutical Sciences (Shenzhen), Shenzhen Campus of Sun Yat-sen University, Shenzhen, 518107, China.
| | - Hengzhi You
- School of Science, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China.
- Green Pharmaceutical Engineering Research Centre, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China
| |
Collapse
|
39
|
Xin Y, Foster SW, Makey DM, Parker D, Bradow J, Wang X, Berritt S, Mongillo R, Grinias JP, Kennedy RT. High-Throughput Capillary Liquid Chromatography Using a Droplet Injection and Application to Reaction Screening. Anal Chem 2024; 96:4693-4701. [PMID: 38442211 PMCID: PMC11001260 DOI: 10.1021/acs.analchem.4c00150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/07/2024]
Abstract
The cycle time of a standard liquid chromatography (LC) system is the sum of the time for the chromatographic run and the autosampler injection sequence. Although LC separation times in the 1-10 s range have been demonstrated, injection sequences are commonly >15 s, limiting throughput possible with LC separations. Further, such separations are performed on relatively large bore columns requiring flow rates of ≥5 mL/min, thus generating large volumes of mobile phase waste when used for large scale screening and increasing the difficulty in interfacing to mass spectrometry. Here, a droplet injector system was established that replaces the autosampler with a four-port, two-position valve equipped with a 20 nL internal loop interfaced to a syringe pump and a three-axis positioner to withdraw sample droplets from a well plate. In the system, sample and immiscible fluid are pulled alternately from a well plate into a capillary and then through the injection valve. The valve is actuated when sample fills the loop to allow sequential injection of samples at high throughput. Capillary LC columns with 300 μm inner diameter were used to reduce the consumption of mobile phase and sample. The system achieved 96 separations of 20 nL droplet samples containing 3 components in as little as 8.1 min with 5-s cycle time. This system was coupled to a mass spectrometer through an electrospray ionization source for high-throughput chemical reaction screening.
Collapse
Affiliation(s)
- Yue Xin
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Samuel W Foster
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, New Jersey 08028, United States
| | - Devin M Makey
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Deklin Parker
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, New Jersey 08028, United States
| | - James Bradow
- Pfizer Global Research and Development, Eastern Point Road, Groton, Connecticut 06415, United States
| | - Xiaochun Wang
- Pfizer Global Research and Development, Eastern Point Road, Groton, Connecticut 06415, United States
| | - Simon Berritt
- Pfizer Global Research and Development, Eastern Point Road, Groton, Connecticut 06415, United States
| | - Robert Mongillo
- Pfizer Global Research and Development, Eastern Point Road, Groton, Connecticut 06415, United States
| | - James P Grinias
- Department of Chemistry and Biochemistry, Rowan University, Glassboro, New Jersey 08028, United States
| | - Robert T Kennedy
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Pharmacology, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
40
|
Han J, Kwon Y, Choi YS, Kang S. Improving chemical reaction yield prediction using pre-trained graph neural networks. J Cheminform 2024; 16:25. [PMID: 38429787 PMCID: PMC10905905 DOI: 10.1186/s13321-024-00818-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2023] [Accepted: 02/19/2024] [Indexed: 03/03/2024] Open
Abstract
Graph neural networks (GNNs) have proven to be effective in the prediction of chemical reaction yields. However, their performance tends to deteriorate when they are trained using an insufficient training dataset in terms of quantity or diversity. A promising solution to alleviate this issue is to pre-train a GNN on a large-scale molecular database. In this study, we investigate the effectiveness of GNN pre-training in chemical reaction yield prediction. We present a novel GNN pre-training method for performance improvement.Given a molecular database consisting of a large number of molecules, we calculate molecular descriptors for each molecule and reduce the dimensionality of these descriptors by applying principal component analysis. We define a pre-text task by assigning a vector of principal component scores as the pseudo-label to each molecule in the database. A GNN is then pre-trained to perform the pre-text task of predicting the pseudo-label for the input molecule. For chemical reaction yield prediction, a prediction model is initialized using the pre-trained GNN and then fine-tuned with the training dataset containing chemical reactions and their yields. We demonstrate the effectiveness of the proposed method through experimental evaluation on benchmark datasets.
Collapse
Affiliation(s)
- Jongmin Han
- Department of Industrial Engineering, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon, Republic of Korea
| | - Youngchun Kwon
- Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., 130 Samsung-ro, Yeongtong-gu, Suwon, Republic of Korea
| | - Youn-Suk Choi
- Samsung Advanced Institute of Technology, Samsung Electronics Co. Ltd., 130 Samsung-ro, Yeongtong-gu, Suwon, Republic of Korea.
| | - Seokho Kang
- Department of Industrial Engineering, Sungkyunkwan University, 2066 Seobu-ro, Jangan-gu, Suwon, Republic of Korea.
| |
Collapse
|
41
|
Shi R, Yu G, Huo X, Yang Y. Prediction of chemical reaction yields with large-scale multi-view pre-training. J Cheminform 2024; 16:22. [PMID: 38403627 PMCID: PMC10895839 DOI: 10.1186/s13321-024-00815-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2023] [Accepted: 02/14/2024] [Indexed: 02/27/2024] Open
Abstract
Developing machine learning models with high generalization capability for predicting chemical reaction yields is of significant interest and importance. The efficacy of such models depends heavily on the representation of chemical reactions, which has commonly been learned from SMILES or graphs of molecules using deep neural networks. However, the progression of chemical reactions is inherently determined by the molecular 3D geometric properties, which have been recently highlighted as crucial features in accurately predicting molecular properties and chemical reactions. Additionally, large-scale pre-training has been shown to be essential in enhancing the generalization capability of complex deep learning models. Based on these considerations, we propose the Reaction Multi-View Pre-training (ReaMVP) framework, which leverages self-supervised learning techniques and a two-stage pre-training strategy to predict chemical reaction yields. By incorporating multi-view learning with 3D geometric information, ReaMVP achieves state-of-the-art performance on two benchmark datasets. Notably, the experimental results indicate that ReaMVP has a significant advantage in predicting out-of-sample data, suggesting an enhanced generalization ability to predict new reactions. Scientific Contribution: This study presents the ReaMVP framework, which improves the generalization capability of machine learning models for predicting chemical reaction yields. By integrating sequential and geometric views and leveraging self-supervised learning techniques with a two-stage pre-training strategy, ReaMVP achieves state-of-the-art performance on benchmark datasets. The framework demonstrates superior predictive ability for out-of-sample data and enhances the prediction of new reactions.
Collapse
Affiliation(s)
- Runhan Shi
- Department of Computer Science and Engineering, and Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Gufeng Yu
- Department of Computer Science and Engineering, and Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Xiaohong Huo
- Shanghai Key Laboratory for Molecular Engineering of Chiral Drugs, Frontiers Science Center for Transformative Molecules, School of Chemistry and Chemical Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yang Yang
- Department of Computer Science and Engineering, and Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering, Shanghai Jiao Tong University, Shanghai, 200240, China.
| |
Collapse
|
42
|
Niu KK, Cui J, Dong RZ, Yu S, Liu H, Xing LB. Visible-light-mediated direct C3 alkylation of quinoxalin-2(1 H)-ones using alkanes. Chem Commun (Camb) 2024; 60:2409-2412. [PMID: 38323602 DOI: 10.1039/d3cc06285f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2024]
Abstract
Due to the high C-H bond dissociation energy of alkanes, the utilization of alkanes as alkyl radical precursors for C-H functionalization of heteroarenes is synthetically captivating but practically challenging, especially under metal- and photocatalyst-free conditions. We report herein a mild and practical visible-light-mediated method for C-H alkylation of quinoxalin-2(1H)-ones using trifluoroacetic acid as a hydrogen atom transfer reagent and air as an oxidant. This mild protocol was performed under metal- and photocatalyst-free circumstances and presented good functional-group tolerance as well as a broad substrate scope.
Collapse
Affiliation(s)
- Kai-Kai Niu
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| | - Jing Cui
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| | - Rui-Zhi Dong
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| | - Shengsheng Yu
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| | - Hui Liu
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| | - Ling-Bao Xing
- School of Chemistry and Chemical Engineering, Shandong University of Technology, Zibo 255000, P. R. China.
| |
Collapse
|
43
|
Brocklehurst CE, Altmann E, Bon C, Davis H, Dunstan D, Ertl P, Ginsburg-Moraff C, Grob J, Gosling DJ, Lapointe G, Marziale AN, Mues H, Palmieri M, Racine S, Robinson RI, Springer C, Tan K, Ulmer W, Wyler R. MicroCycle: An Integrated and Automated Platform to Accelerate Drug Discovery. J Med Chem 2024; 67:2118-2128. [PMID: 38270627 DOI: 10.1021/acs.jmedchem.3c02029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2024]
Abstract
We herein describe the development and application of a modular technology platform which incorporates recent advances in plate-based microscale chemistry, automated purification, in situ quantification, and robotic liquid handling to enable rapid access to high-quality chemical matter already formatted for assays. In using microscale chemistry and thus consuming minimal chemical matter, the platform is not only efficient but also follows green chemistry principles. By reorienting existing high-throughput assay technology, the platform can generate a full package of relevant data on each set of compounds in every learning cycle. The multiparameter exploration of chemical and property space is hereby driven by active learning models. The enhanced compound optimization process is generating knowledge for drug discovery projects in a time frame never before possible.
Collapse
Affiliation(s)
- Cara E Brocklehurst
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Eva Altmann
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Corentin Bon
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Holly Davis
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - David Dunstan
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Peter Ertl
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Carol Ginsburg-Moraff
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Jonathan Grob
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Daniel J Gosling
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Guillaume Lapointe
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Alexander N Marziale
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Heinrich Mues
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Marco Palmieri
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Sophie Racine
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| | - Richard I Robinson
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Clayton Springer
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - Kian Tan
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - William Ulmer
- Global Discovery Chemistry, Novartis Biomedical Research, Cambridge, Massachusetts 02139, United States
| | - René Wyler
- Global Discovery Chemistry, Novartis Biomedical Research, Novartis Pharma AG, Basel 4033, Switzerland
| |
Collapse
|
44
|
Slattery A, Wen Z, Tenblad P, Sanjosé-Orduna J, Pintossi D, den Hartog T, Noël T. Automated self-optimization, intensification, and scale-up of photocatalysis in flow. Science 2024; 383:eadj1817. [PMID: 38271529 DOI: 10.1126/science.adj1817] [Citation(s) in RCA: 31] [Impact Index Per Article: 31.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2023] [Accepted: 12/13/2023] [Indexed: 01/27/2024]
Abstract
The optimization, intensification, and scale-up of photochemical processes constitute a particular challenge in a manufacturing environment geared primarily toward thermal chemistry. In this work, we present a versatile flow-based robotic platform to address these challenges through the integration of readily available hardware and custom software. Our open-source platform combines a liquid handler, syringe pumps, a tunable continuous-flow photoreactor, inexpensive Internet of Things devices, and an in-line benchtop nuclear magnetic resonance spectrometer to enable automated, data-rich optimization with a closed-loop Bayesian optimization strategy. A user-friendly graphical interface allows chemists without programming or machine learning expertise to easily monitor, analyze, and improve photocatalytic reactions with respect to both continuous and discrete variables. The system's effectiveness was demonstrated by increasing overall reaction yields and improving space-time yields compared with those of previously reported processes.
Collapse
Affiliation(s)
- Aidan Slattery
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| | - Zhenghui Wen
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| | - Pauline Tenblad
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| | - Jesús Sanjosé-Orduna
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| | - Diego Pintossi
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| | - Tim den Hartog
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
- Zuyd University of Applied Sciences, Nieuw Eyckholt 300, 6419 DJ Heerlen, Netherlands
- Netherlands Organisation for Applied Scientific Research (TNO), High Tech Campus 25, 5656 AE Eindhoven, Netherlands
| | - Timothy Noël
- Flow Chemistry Group, van 't Hoff Institute for Molecular Sciences (HIMS), University of Amsterdam, Science Park 904, 1098 XH Amsterdam, Netherlands
| |
Collapse
|
45
|
Lennon G, Dingwall P. Enabling High Throughput Kinetic Experimentation by Using Flow as a Differential Kinetic Technique. Angew Chem Int Ed Engl 2024; 63:e202318146. [PMID: 38078481 PMCID: PMC10952970 DOI: 10.1002/anie.202318146] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Indexed: 12/23/2023]
Abstract
Kinetic data is most commonly collected through the generation of time-series data under either batch or flow conditions. Existing methods to generate kinetic data in flow collect integral data (concentration over time) only. Here, we report a method for the rapid and direct collection of differential kinetic data (direct measurement of rate) in flow by performing a series of instantaneous rate measurements on sequential small-scale reactions. This technique decouples the time required to generate a full kinetic profile from the time required for a reaction to reach completion, enabling high throughput kinetic experimentation. In addition, comparison of kinetic profiles constructed at different residence times allows the robustness, or stability, of homogeneously catalysed reactions to be interrogated. This approach makes use of a segmented flow platform which was shown to quantitatively reproduce batch kinetic data. The proline mediated aldol reaction was chosen as a model reaction to perform a high throughput kinetic screen of 216 kinetic profiles in 90 hours, one every 25 minutes, which would have taken an estimated continuous 3500 hours in batch, an almost 40-fold increase in experimental throughput matched by a corresponding reduction in material consumption.
Collapse
Affiliation(s)
- Gavin Lennon
- School of Chemistry and Chemical EngineeringQueen's University BelfastDavid Keir Building, Stranmillis RoadBelfastBT9 5AGUK
| | - Paul Dingwall
- School of Chemistry and Chemical EngineeringQueen's University BelfastDavid Keir Building, Stranmillis RoadBelfastBT9 5AGUK
| |
Collapse
|
46
|
Yin X, Hsieh CY, Wang X, Wu Z, Ye Q, Bao H, Deng Y, Chen H, Luo P, Liu H, Hou T, Yao X. Enhancing Generic Reaction Yield Prediction through Reaction Condition-Based Contrastive Learning. RESEARCH (WASHINGTON, D.C.) 2024; 7:0292. [PMID: 38213662 PMCID: PMC10777739 DOI: 10.34133/research.0292] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2023] [Accepted: 12/06/2023] [Indexed: 01/13/2024]
Abstract
Deep learning (DL)-driven efficient synthesis planning may profoundly transform the paradigm for designing novel pharmaceuticals and materials. However, the progress of many DL-assisted synthesis planning (DASP) algorithms has suffered from the lack of reliable automated pathway evaluation tools. As a critical metric for evaluating chemical reactions, accurate prediction of reaction yields helps improve the practicality of DASP algorithms in the real-world scenarios. Currently, accurately predicting yields of interesting reactions still faces numerous challenges, mainly including the absence of high-quality generic reaction yield datasets and robust generic yield predictors. To compensate for the limitations of high-throughput yield datasets, we curated a generic reaction yield dataset containing 12 reaction categories and rich reaction condition information. Subsequently, by utilizing 2 pretraining tasks based on chemical reaction masked language modeling and contrastive learning, we proposed a powerful bidirectional encoder representations from transformers (BERT)-based reaction yield predictor named Egret. It achieved comparable or even superior performance to the best previous models on 4 benchmark datasets and established state-of-the-art performance on the newly curated dataset. We found that reaction-condition-based contrastive learning enhances the model's sensitivity to reaction conditions, and Egret is capable of capturing subtle differences between reactions involving identical reactants and products but different reaction conditions. Furthermore, we proposed a new scoring function that incorporated Egret into the evaluation of multistep synthesis routes. Test results showed that yield-incorporated scoring facilitated the prioritization of literature-supported high-yield reaction pathways for target molecules. In addition, through meta-learning strategy, we further improved the reliability of the model's prediction for reaction types with limited data and lower data quality. Our results suggest that Egret holds the potential to become an essential component of the next-generation DASP tools.
Collapse
Affiliation(s)
- Xiaodan Yin
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Chang-Yu Hsieh
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Xiaorui Wang
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Zhenxing Wu
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Qing Ye
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Honglei Bao
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
| | - Yafeng Deng
- CarbonSilicon AI Technology Co. Ltd, Hangzhou, Zhejiang 310018, China
| | - Hongming Chen
- Center of Chemistry and Chemical Biology,
Guangzhou Regenerative Medicine and Health Guangdong Laboratory, Guangzhou 510530, China
| | - Pei Luo
- Dr. Neher’s Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine,
Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao 999078, China
| | - Huanxiang Liu
- Faculty of Applied Sciences,
Macao Polytechnic University, Macao 999078, China
| | - Tingjun Hou
- Innovation Institute for Artificial Intelligence in Medicine of Zhejiang University, College of Pharmaceutical Sciences, Zhejiang University, Hangzhou 310058, China
| | - Xiaojun Yao
- Faculty of Applied Sciences,
Macao Polytechnic University, Macao 999078, China
| |
Collapse
|
47
|
Voinarovska V, Kabeshov M, Dudenko D, Genheden S, Tetko IV. When Yield Prediction Does Not Yield Prediction: An Overview of the Current Challenges. J Chem Inf Model 2024; 64:42-56. [PMID: 38116926 PMCID: PMC10778086 DOI: 10.1021/acs.jcim.3c01524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 11/29/2023] [Accepted: 11/30/2023] [Indexed: 12/21/2023]
Abstract
Machine Learning (ML) techniques face significant challenges when predicting advanced chemical properties, such as yield, feasibility of chemical synthesis, and optimal reaction conditions. These challenges stem from the high-dimensional nature of the prediction task and the myriad essential variables involved, ranging from reactants and reagents to catalysts, temperature, and purification processes. Successfully developing a reliable predictive model not only holds the potential for optimizing high-throughput experiments but can also elevate existing retrosynthetic predictive approaches and bolster a plethora of applications within the field. In this review, we systematically evaluate the efficacy of current ML methodologies in chemoinformatics, shedding light on their milestones and inherent limitations. Additionally, a detailed examination of a representative case study provides insights into the prevailing issues related to data availability and transferability in the discipline.
Collapse
Affiliation(s)
- Varvara Voinarovska
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
- TUM
Graduate School, Faculty of Chemistry, Technical
University of Munich, 85748 Garching, Germany
| | - Mikhail Kabeshov
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
| | - Dmytro Dudenko
- Enamine
Ltd., 78 Chervonotkatska str., 02094 Kyiv, Ukraine
| | - Samuel Genheden
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
| | - Igor V. Tetko
- Molecular
Targets and Therapeutics Center, Helmholtz Munich − Deutsches
Forschungszentrum für Gesundheit und Umwelt (GmbH), Institute of Structural Biology, 85764 Neuherberg, Germany
| |
Collapse
|
48
|
Raghavan P, Haas BC, Ruos ME, Schleinitz J, Doyle AG, Reisman SE, Sigman MS, Coley CW. Dataset Design for Building Models of Chemical Reactivity. ACS CENTRAL SCIENCE 2023; 9:2196-2204. [PMID: 38161380 PMCID: PMC10755851 DOI: 10.1021/acscentsci.3c01163] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/20/2023] [Revised: 11/06/2023] [Accepted: 11/15/2023] [Indexed: 01/03/2024]
Abstract
Models can codify our understanding of chemical reactivity and serve a useful purpose in the development of new synthetic processes via, for example, evaluating hypothetical reaction conditions or in silico substrate tolerance. Perhaps the most determining factor is the composition of the training data and whether it is sufficient to train a model that can make accurate predictions over the full domain of interest. Here, we discuss the design of reaction datasets in ways that are conducive to data-driven modeling, emphasizing the idea that training set diversity and model generalizability rely on the choice of molecular or reaction representation. We additionally discuss the experimental constraints associated with generating common types of chemistry datasets and how these considerations should influence dataset design and model building.
Collapse
Affiliation(s)
- Priyanka Raghavan
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
| | - Brittany C. Haas
- Department
of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Madeline E. Ruos
- Department
of Chemistry & Biochemistry, University
of California, Los Angeles, Los Angeles, California 90095, United States
| | - Jules Schleinitz
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Abigail G. Doyle
- Department
of Chemistry & Biochemistry, University
of California, Los Angeles, Los Angeles, California 90095, United States
| | - Sarah E. Reisman
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Matthew S. Sigman
- Department
of Chemistry, University of Utah, Salt Lake City, Utah 84112, United States
| | - Connor W. Coley
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, Cambridge, Massachusetts 02139, United States
- Department
of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
49
|
Boiko DA, MacKnight R, Kline B, Gomes G. Autonomous chemical research with large language models. Nature 2023; 624:570-578. [PMID: 38123806 PMCID: PMC10733136 DOI: 10.1038/s41586-023-06792-0] [Citation(s) in RCA: 92] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 10/27/2023] [Indexed: 12/23/2023]
Abstract
Transformer-based large language models are making significant strides in various fields, such as natural language processing1-5, biology6,7, chemistry8-10 and computer programming11,12. Here, we show the development and capabilities of Coscientist, an artificial intelligence system driven by GPT-4 that autonomously designs, plans and performs complex experiments by incorporating large language models empowered by tools such as internet and documentation search, code execution and experimental automation. Coscientist showcases its potential for accelerating research across six diverse tasks, including the successful reaction optimization of palladium-catalysed cross-couplings, while exhibiting advanced capabilities for (semi-)autonomous experimental design and execution. Our findings demonstrate the versatility, efficacy and explainability of artificial intelligence systems like Coscientist in advancing research.
Collapse
Affiliation(s)
- Daniil A Boiko
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Robert MacKnight
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Ben Kline
- Emerald Cloud Lab, South San Francisco, CA, USA
| | - Gabe Gomes
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA.
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, PA, USA.
- Wilton E. Scott Institute for Energy Innovation, Carnegie Mellon University, Pittsburgh, PA, USA.
| |
Collapse
|
50
|
Boby ML, Fearon D, Ferla M, Filep M, Koekemoer L, Robinson MC, Chodera JD, Lee AA, London N, von Delft A, von Delft F, Achdout H, Aimon A, Alonzi DS, Arbon R, Aschenbrenner JC, Balcomb BH, Bar-David E, Barr H, Ben-Shmuel A, Bennett J, Bilenko VA, Borden B, Boulet P, Bowman GR, Brewitz L, Brun J, Bvnbs S, Calmiano M, Carbery A, Carney DW, Cattermole E, Chang E, Chernyshenko E, Clyde A, Coffland JE, Cohen G, Cole JC, Contini A, Cox L, Croll TI, Cvitkovic M, De Jonghe S, Dias A, Donckers K, Dotson DL, Douangamath A, Duberstein S, Dudgeon T, Dunnett LE, Eastman P, Erez N, Eyermann CJ, Fairhead M, Fate G, Fedorov O, Fernandes RS, Ferrins L, Foster R, Foster H, Fraisse L, Gabizon R, García-Sastre A, Gawriljuk VO, Gehrtz P, Gileadi C, Giroud C, Glass WG, Glen RC, Glinert I, Godoy AS, Gorichko M, Gorrie-Stone T, Griffen EJ, Haneef A, Hassell Hart S, Heer J, Henry M, Hill M, Horrell S, Huang QYJ, Huliak VD, Hurley MFD, Israely T, Jajack A, Jansen J, Jnoff E, Jochmans D, John T, Kaminow B, Kang L, Kantsadi AL, Kenny PW, Kiappes JL, Kinakh SO, Kovar B, Krojer T, La VNT, Laghnimi-Hahn S, Lefker BA, et alBoby ML, Fearon D, Ferla M, Filep M, Koekemoer L, Robinson MC, Chodera JD, Lee AA, London N, von Delft A, von Delft F, Achdout H, Aimon A, Alonzi DS, Arbon R, Aschenbrenner JC, Balcomb BH, Bar-David E, Barr H, Ben-Shmuel A, Bennett J, Bilenko VA, Borden B, Boulet P, Bowman GR, Brewitz L, Brun J, Bvnbs S, Calmiano M, Carbery A, Carney DW, Cattermole E, Chang E, Chernyshenko E, Clyde A, Coffland JE, Cohen G, Cole JC, Contini A, Cox L, Croll TI, Cvitkovic M, De Jonghe S, Dias A, Donckers K, Dotson DL, Douangamath A, Duberstein S, Dudgeon T, Dunnett LE, Eastman P, Erez N, Eyermann CJ, Fairhead M, Fate G, Fedorov O, Fernandes RS, Ferrins L, Foster R, Foster H, Fraisse L, Gabizon R, García-Sastre A, Gawriljuk VO, Gehrtz P, Gileadi C, Giroud C, Glass WG, Glen RC, Glinert I, Godoy AS, Gorichko M, Gorrie-Stone T, Griffen EJ, Haneef A, Hassell Hart S, Heer J, Henry M, Hill M, Horrell S, Huang QYJ, Huliak VD, Hurley MFD, Israely T, Jajack A, Jansen J, Jnoff E, Jochmans D, John T, Kaminow B, Kang L, Kantsadi AL, Kenny PW, Kiappes JL, Kinakh SO, Kovar B, Krojer T, La VNT, Laghnimi-Hahn S, Lefker BA, Levy H, Lithgo RM, Logvinenko IG, Lukacik P, Macdonald HB, MacLean EM, Makower LL, Malla TR, Marples PG, Matviiuk T, McCorkindale W, McGovern BL, Melamed S, Melnykov KP, Michurin O, Miesen P, Mikolajek H, Milne BF, Minh D, Morris A, Morris GM, Morwitzer MJ, Moustakas D, Mowbray CE, Nakamura AM, Neto JB, Neyts J, Nguyen L, Noske GD, Oleinikovas V, Oliva G, Overheul GJ, Owen CD, Pai R, Pan J, Paran N, Payne AM, Perry B, Pingle M, Pinjari J, Politi B, Powell A, Pšenák V, Pulido I, Puni R, Rangel VL, Reddi RN, Rees P, Reid SP, Reid L, Resnick E, Ripka EG, Robinson RP, Rodriguez-Guerra J, Rosales R, Rufa DA, Saar K, Saikatendu KS, Salah E, Schaller D, Scheen J, Schiffer CA, Schofield CJ, Shafeev M, Shaikh A, Shaqra AM, Shi J, Shurrush K, Singh S, Sittner A, Sjö P, Skyner R, Smalley A, Smeets B, Smilova MD, Solmesky LJ, Spencer J, Strain-Damerell C, Swamy V, Tamir H, Taylor JC, Tennant RE, Thompson W, Thompson A, Tomásio S, Tomlinson CWE, Tsurupa IS, Tumber A, Vakonakis I, van Rij RP, Vangeel L, Varghese FS, Vaschetto M, Vitner EB, Voelz V, Volkamer A, Walsh MA, Ward W, Weatherall C, Weiss S, White KM, Wild CF, Witt KD, Wittmann M, Wright N, Yahalom-Ronen Y, Yilmaz NK, Zaidmann D, Zhang I, Zidane H, Zitzmann N, Zvornicanin SN. Open science discovery of potent noncovalent SARS-CoV-2 main protease inhibitors. Science 2023; 382:eabo7201. [PMID: 37943932 PMCID: PMC7615835 DOI: 10.1126/science.abo7201] [Show More Authors] [Citation(s) in RCA: 68] [Impact Index Per Article: 34.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2022] [Accepted: 10/09/2023] [Indexed: 11/12/2023]
Abstract
We report the results of the COVID Moonshot, a fully open-science, crowdsourced, and structure-enabled drug discovery campaign targeting the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) main protease. We discovered a noncovalent, nonpeptidic inhibitor scaffold with lead-like properties that is differentiated from current main protease inhibitors. Our approach leveraged crowdsourcing, machine learning, exascale molecular simulations, and high-throughput structural biology and chemistry. We generated a detailed map of the structural plasticity of the SARS-CoV-2 main protease, extensive structure-activity relationships for multiple chemotypes, and a wealth of biochemical activity data. All compound designs (>18,000 designs), crystallographic data (>490 ligand-bound x-ray structures), assay data (>10,000 measurements), and synthesized molecules (>2400 compounds) for this campaign were shared rapidly and openly, creating a rich, open, and intellectual property-free knowledge base for future anticoronavirus drug discovery.
Collapse
Affiliation(s)
- Melissa L Boby
- Pharmacology Graduate Program, Weill Cornell Graduate School of Medical Sciences, New York, NY 10065, USA
- Program in Chemical Biology, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
- Program in Computational and Systems Biology, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | - Daren Fearon
- Diamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot, UK
| | - Matteo Ferla
- Oxford Biomedical Research Centre, National Institute for Health Research, University of Oxford, Oxford, UK
| | - Mihajlo Filep
- Department of Chemical and Structural Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Lizbé Koekemoer
- Centre for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Structural Genomics Consortium, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | | | - John D Chodera
- Program in Computational and Systems Biology, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY 10065, USA
| | | | - Nir London
- Department of Chemical and Structural Biology, The Weizmann Institute of Science, Rehovot, Israel
| | - Annette von Delft
- Oxford Biomedical Research Centre, National Institute for Health Research, University of Oxford, Oxford, UK
- Centre for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Structural Genomics Consortium, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Frank von Delft
- Diamond Light Source Ltd., Harwell Science and Innovation Campus, Didcot, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot, UK
- Centre for Medicines Discovery, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Structural Genomics Consortium, Nuffield Department of Medicine, University of Oxford, Oxford, UK
- Department of Biochemistry, University of Johannesburg, Auckland Park, Johannesburg 2006, South Africa
| | - Hagit Achdout
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Anthony Aimon
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Dominic S Alonzi
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
| | - Robert Arbon
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Jasmin C Aschenbrenner
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Blake H Balcomb
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Elad Bar-David
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Haim Barr
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - Amir Ben-Shmuel
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - James Bennett
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
- University of Oxford, Nuffield Department of Medicine, Target Discovery Institute, Oxford, OX3 7FZ, UK
| | - Vitaliy A Bilenko
- Enamine Ltd, Kyiv, 02094, Ukraine
- Taras Shevchenko National University of Kyiv, Kyiv, 01601, Ukraine
| | | | - Pascale Boulet
- Drugs for Neglected Diseases Initiative (DNDi), Geneva, 1202, Switzerland
| | - Gregory R Bowman
- University of Pennsylvania, Departments of Biochemistry and Biophysics and Bioengineering, Philadelphia, PA 19083, USA
| | - Lennart Brewitz
- University of Oxford, Department of Chemistry, Chemistry Research Laboratory, Oxford, OX1 3TA, UK
| | - Juliane Brun
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
| | - Sarma Bvnbs
- Sai Life Sciences Limited, ICICI Knowledge Park, Shameerpet, Hyderabad 500 078, Telangana, India
| | | | - Anna Carbery
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- University of Oxford, Department of Statistics, Oxford OX1 3LB, UK
| | - Daniel W Carney
- Takeda Development Center Americas, Inc., San Diego, CA 92121, USA
| | - Emma Cattermole
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
| | - Edcon Chang
- Takeda Development Center Americas, Inc., San Diego, CA 92121, USA
| | | | | | | | - Galit Cohen
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - Jason C Cole
- Cambridge Crystallographic Data Centre, Cambridge, CB2 1EZ, UK
| | - Alessandro Contini
- University of Milan, Department of General and Organic Chemistry, Milan, 20133, Italy
| | - Lisa Cox
- Life Compass Consulting Ltd, Macclesfield, SK10 5UE, UK
| | - Tristan Ian Croll
- The University of Cambridge, Cambridge Institute for Medical Research, Department of Haematology, Cambridge CB2 0XY, UK
- Present address: Altos Labs, BioML group, Great Abington, CB21 6GP
| | | | - Steven De Jonghe
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Leuven, Belgium
| | - Alex Dias
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Kim Donckers
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Leuven, Belgium
| | | | - Alice Douangamath
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Shirly Duberstein
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - Tim Dudgeon
- Informatics Matters Ltd, Bicester, OX26 6JU, UK
| | - Louise E Dunnett
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Peter Eastman
- Stanford University, Department of Chemistry, Stanford, CA 94305, USA
| | - Noam Erez
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Charles J Eyermann
- Northeastern University, Department of Chemistry and Chemical Biology, Boston MA 02115, USA
| | - Michael Fairhead
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Gwen Fate
- Thames Pharma Partners LLC, Mystic, CT 06355, USA
| | - Oleg Fedorov
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
- University of Oxford, Nuffield Department of Medicine, Target Discovery Institute, Oxford, OX3 7FZ, UK
| | - Rafaela S Fernandes
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
| | - Lori Ferrins
- Northeastern University, Department of Chemistry and Chemical Biology, Boston MA 02115, USA
| | - Richard Foster
- University of Leeds, School of Chemistry, Leeds, LS2 9JT, UK
| | - Holly Foster
- University of Leeds, School of Chemistry, Leeds, LS2 9JT, UK
- Present address: Exscientia, Oxford Science Park, Oxford, OX4 4GE, UK
| | - Laurent Fraisse
- Drugs for Neglected Diseases Initiative (DNDi), Geneva, 1202, Switzerland
| | - Ronen Gabizon
- The Weizmann Institute of Science, Department of Chemical and Structural Biology, Rehovot, 7610001, Israel
| | - Adolfo García-Sastre
- Icahn School of Medicine at Mount Sinai, Department of Microbiology, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Global Health and Emerging Pathogens Institute, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Department of Medicine, Division of Infectious Diseases, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, The Tisch Cancer Institute, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Department of Pathology, Molecular and Cell-Based Medicine, New York, NY 10029, USA
| | - Victor O Gawriljuk
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
- Present address: University of Groningen, Groningen Research Institute of Pharmacy, Department of Drug Design, Groningen, 9700 AV, Netherlands
| | - Paul Gehrtz
- The Weizmann Institute of Science, Department of Chemical and Structural Biology, Rehovot, 7610001, Israel
- Present address: Merck Healthcare KGaA, Darmstadt, 64293, Germany
| | - Carina Gileadi
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Charline Giroud
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
- University of Oxford, Nuffield Department of Medicine, Target Discovery Institute, Oxford, OX3 7FZ, UK
| | - William G Glass
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
- Present address: Exscientia, Oxford Science Park, Oxford, OX4 4GE, UK
| | - Robert C Glen
- University of Cambridge, Department of Chemistry, Cambridge, CB2 1EW, UK
| | - Itai Glinert
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Andre S Godoy
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
| | - Marian Gorichko
- Taras Shevchenko National University of Kyiv, Kyiv, 01601, Ukraine
| | - Tyler Gorrie-Stone
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Ed J Griffen
- MedChemica Ltd, Macclesfield, Cheshire. SK11 6PU UK
| | - Amna Haneef
- Illinois Institute of Technology, Department of Biology, Chicago IL 60616 USA
| | - Storm Hassell Hart
- University of Sussex, Department of Chemistry, School of Life Sciences, Brighton, East Sussex, BN1 9QJ, UK
| | - Jag Heer
- Syngene International Limited, Headington, Oxford, OX3 7BZ, UK
| | - Michael Henry
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Michelle Hill
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
- Present address: Sir William Dunn School of Pathology, Oxford. OX1 3RE, UK
| | - Sam Horrell
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Qiu Yu Judy Huang
- University of Massachusetts, Chan Medical School, Department of Biochemistry and Molecular Biotechnology, Worcester MA 01655, USA
| | | | | | - Tomer Israely
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | | | - Jitske Jansen
- RWTH Aachen University, Institute of Experimental Medicine and Systems Biology, Aachen, 52074, Germany
| | - Eric Jnoff
- UCB, Chemin du Foriest, 1420 Braine-l'Alleud, Belgium
| | - Dirk Jochmans
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Leuven, Belgium
| | - Tobias John
- University of Oxford, Department of Chemistry, Chemistry Research Laboratory, Oxford, OX1 3TA, UK
- Present address: AMSilk, Neuried, 82061, Germany
| | - Benjamin Kaminow
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Tri-Institutional Program in Computational Biology and Medicine, New York, NY 10065, USA
| | - Lulu Kang
- Illinois Institute of Technology, Department of Applied Mathematics, Chicago IL 60616 USA
| | - Anastassia L Kantsadi
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
- University of Thessaly, Department of Biochemistry and Biotechnology, Larissa, 415 00, Greece
| | - Peter W Kenny
- Berwick-on-Sea, North Coast Road, Blanchisseuse, Saint George, Trinidad and Tobago
| | - J L Kiappes
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
- Present address: University College of London, Department of Chemistry, London WC1H 0AJ, UK
| | | | - Boris Kovar
- M2M solutions s.r.o. Žilina, 010 01, Slovakia
| | - Tobias Krojer
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
- MAX IV Laboratory, Fotongatan 2, 224 84 Lund, Sweden
| | - Van Ngoc Thuy La
- Illinois Institute of Technology, Department of Biology, Chicago IL 60616 USA
| | | | | | - Haim Levy
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Ryan M Lithgo
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | | | - Petra Lukacik
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Hannah Bruce Macdonald
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
- Present address: Charm Therapeutics, London, N1C 4AG, UK
| | - Elizabeth M MacLean
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Laetitia L Makower
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
| | - Tika R Malla
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Peter G Marples
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | | | - Willam McCorkindale
- Present address: Charm Therapeutics, London, N1C 4AG, UK
- University of Cambridge, Cavendish Laboratory, Cambridge, CB3 0HE UK
| | - Briana L McGovern
- Icahn School of Medicine at Mount Sinai, Department of Microbiology, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Global Health and Emerging Pathogens Institute, New York, NY 10029, USA
| | - Sharon Melamed
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Kostiantyn P Melnykov
- Enamine Ltd, Kyiv, 02094, Ukraine
- Taras Shevchenko National University of Kyiv, Kyiv, 01601, Ukraine
| | | | - Pascal Miesen
- Radboud University Medical Center, Department of Medical Microbiology, Radboud Institute for Molecular Life Sciences, Nijmegen, 6525 GA, Netherlands
| | - Halina Mikolajek
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Bruce F Milne
- University of Aberdeen, Department of Chemistry, Old Aberdeen, AB24 3UE Scotland, UK
- University of Coimbra, CFisUC, Department of Physics, Coimbra, 3004-516, Portugal
| | - David Minh
- Illinois Institute of Technology, Department of Chemistry, Chicago IL 60616 USA
| | | | - Garrett M Morris
- University of Oxford, Department of Statistics, Oxford OX1 3LB, UK
| | - Melody Jane Morwitzer
- University of Nebraska Medical Centre, Dept of Pathology and Microbiology, Omaha, NE 68198-5900, USA
| | | | - Charles E Mowbray
- Drugs for Neglected Diseases Initiative (DNDi), Geneva, 1202, Switzerland
| | - Aline M Nakamura
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
- Present address: Instituto Butantan, Sao Paulo, 05503-900, Brazil
| | - Jose Brandao Neto
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Johan Neyts
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Leuven, Belgium
| | | | - Gabriela D Noske
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
| | - Vladas Oleinikovas
- UCB, Slough, SL1 3WE, UK
- Present address: Monte Rosa Therapeutics, Basel, CH 4057, Switzerland
| | - Glaucius Oliva
- University of Sao Paulo, Sao Carlos Institute of Physics, Sao Carlos, 13563-120, Brazil
| | - Gijs J Overheul
- Radboud University Medical Center, Department of Medical Microbiology, Radboud Institute for Molecular Life Sciences, Nijmegen, 6525 GA, Netherlands
| | - C David Owen
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Ruby Pai
- PostEra Inc., Cambridge, MA, 02142, USA
| | - Jin Pan
- PostEra Inc., Cambridge, MA, 02142, USA
| | - Nir Paran
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Alexander Matthew Payne
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Tri-Institutional Program in Computational Biology and Medicine, New York, NY 10065, USA
| | - Benjamin Perry
- Drugs for Neglected Diseases Initiative (DNDi), Geneva, 1202, Switzerland
- Present address: Medicxi, Geneva, 1204, Switzerland
| | - Maneesh Pingle
- Sai Life Sciences Limited, ICICI Knowledge Park, Shameerpet, Hyderabad 500 078, Telangana, India
| | - Jakir Pinjari
- Sai Life Sciences Limited, ICICI Knowledge Park, Shameerpet, Hyderabad 500 078, Telangana, India
- Present address: Sun Pharma Advanced Research Company (SPARC), Baroda, India
| | - Boaz Politi
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Ailsa Powell
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | | | - Iván Pulido
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Reut Puni
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Victor L Rangel
- University of São Paulo, Ribeirão Preto School of Pharmaceutical Sciences, Ribeirão Preto - SP/CEP 14040-903, Brazil
- Present address: Evotec (UK) Ltd, Milton Park, Abingdon, Oxfordshire, OX14 4RZ, UK
| | - Rambabu N Reddi
- The Weizmann Institute of Science, Department of Chemical and Structural Biology, Rehovot, 7610001, Israel
| | - Paul Rees
- Compass Bussiness Partners Ltd, Southcliffe, Bucks, SL9 0PD, UK
| | - St Patrick Reid
- University of Nebraska Medical Centre, Dept of Pathology and Microbiology, Omaha, NE 68198-5900, USA
| | - Lauren Reid
- MedChemica Ltd, Macclesfield, Cheshire. SK11 6PU UK
| | - Efrat Resnick
- The Weizmann Institute of Science, Department of Chemical and Structural Biology, Rehovot, 7610001, Israel
| | | | | | - Jaime Rodriguez-Guerra
- Charité - Universitätsmedizin Berlin, In silico Toxicology and Structural Bioinformatics, Berlin, 10117, Germany
| | - Romel Rosales
- Icahn School of Medicine at Mount Sinai, Department of Microbiology, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Global Health and Emerging Pathogens Institute, New York, NY 10029, USA
| | - Dominic A Rufa
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Tri-Institutional Program in Computational Biology and Medicine, New York, NY 10065, USA
| | - Kadi Saar
- University of Cambridge, Cavendish Laboratory, Cambridge, CB3 0HE UK
| | | | - Eidarus Salah
- University of Oxford, Department of Chemistry, Chemistry Research Laboratory, Oxford, OX1 3TA, UK
| | - David Schaller
- Charité - Universitätsmedizin Berlin, In silico Toxicology and Structural Bioinformatics, Berlin, 10117, Germany
| | - Jenke Scheen
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Celia A Schiffer
- University of Massachusetts, Chan Medical School, Department of Biochemistry and Molecular Biotechnology, Worcester MA 01655, USA
| | - Christopher J Schofield
- University of Oxford, Department of Chemistry, Chemistry Research Laboratory, Oxford, OX1 3TA, UK
| | | | - Aarif Shaikh
- Sai Life Sciences Limited, ICICI Knowledge Park, Shameerpet, Hyderabad 500 078, Telangana, India
| | - Ala M Shaqra
- University of Massachusetts, Chan Medical School, Department of Biochemistry and Molecular Biotechnology, Worcester MA 01655, USA
| | - Jiye Shi
- UCB, Chemin du Foriest, 1420 Braine-l'Alleud, Belgium
- Present address: Eli Lilly and Company, San Diego, CA 92121, USA
| | - Khriesto Shurrush
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - Sukrit Singh
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Assa Sittner
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Peter Sjö
- Drugs for Neglected Diseases Initiative (DNDi), Geneva, 1202, Switzerland
| | - Rachael Skyner
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | | | - Bart Smeets
- Radboud University Medical Center, Department of pathology, Radboud Institute for Molecular Life Sciences, Nijmegen, 6525 GA, Netherlands
| | - Mihaela D Smilova
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Leonardo J Solmesky
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - John Spencer
- University of Sussex, Department of Chemistry, School of Life Sciences, Brighton, East Sussex, BN1 9QJ, UK
| | - Claire Strain-Damerell
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Vishwanath Swamy
- Sai Life Sciences Limited, ICICI Knowledge Park, Shameerpet, Hyderabad 500 078, Telangana, India
- Present address: TCG Life Sciences, Pune, India
| | - Hadas Tamir
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Jenny C Taylor
- University of Oxford, Nuffield Department of Medicine, Wellcome Centre for Human Genetics, Oxford OX3 7BN, UK
| | | | - Warren Thompson
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Andrew Thompson
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
- Present address: Walter and Eliza Hall Institute, Parkville 3052, Victoria, Australia
| | | | - Charles W E Tomlinson
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | | | - Anthony Tumber
- University of Oxford, Department of Chemistry, Chemistry Research Laboratory, Oxford, OX1 3TA, UK
| | - Ioannis Vakonakis
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
- Present address: Lonza Biologics, Lonza Ltd, Lonzastrasse, CH-3930 Visp, Switzerland
| | - Ronald P van Rij
- Radboud University Medical Center, Department of Medical Microbiology, Radboud Institute for Molecular Life Sciences, Nijmegen, 6525 GA, Netherlands
| | - Laura Vangeel
- KU Leuven, Department of Microbiology, Immunology and Transplantation, Rega Institute for Medical Research, Laboratory of Virology and Chemotherapy, Leuven, Belgium
| | - Finny S Varghese
- Radboud University Medical Center, Department of Medical Microbiology, Radboud Institute for Molecular Life Sciences, Nijmegen, 6525 GA, Netherlands
- Present address: uniQure Biopharma, Amsterdam, 1105 BP, Netherlands
| | | | - Einat B Vitner
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Vincent Voelz
- Temple University, Department of Chemistry, Philadelphia, PA 19122, USA
| | - Andrea Volkamer
- Charité - Universitätsmedizin Berlin, In silico Toxicology and Structural Bioinformatics, Berlin, 10117, Germany
- Present address: Saarland University, Data Driven Drug Design, Campus - E2.1, 66123 Saarbrücken, Germany
| | - Martin A Walsh
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Walter Ward
- Walter Ward Consultancy and Training, Derbyshire, SK22 4AA, UK
| | | | - Shay Weiss
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Kris M White
- Icahn School of Medicine at Mount Sinai, Department of Microbiology, New York, NY 10029, USA
- Icahn School of Medicine at Mount Sinai, Global Health and Emerging Pathogens Institute, New York, NY 10029, USA
| | - Conor Francis Wild
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot, OX11 0DE, UK
- Research Complex at Harwell, Harwell Science and Innovation Campus, Didcot OX11 0FA, UK
| | - Karolina D Witt
- University of Oxford, Nuffield Department of Medicine, Pandemic Sciences Institute, Oxford, Oxon, OX3 7DQ, UK
| | - Matthew Wittmann
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Nathan Wright
- University of Oxford, Nuffield Department of Medicine, Centre for Medicines Discovery, Oxford, OX3 7DQ, UK
| | - Yfat Yahalom-Ronen
- Israel Institute for Biological Research, Department of Infectious Diseases, Ness-Ziona, Israel
| | - Nese Kurt Yilmaz
- University of Massachusetts, Chan Medical School, Department of Biochemistry and Molecular Biotechnology, Worcester MA 01655, USA
| | - Daniel Zaidmann
- The Weizmann Institute of Science, Department of Chemical and Structural Biology, Rehovot, 7610001, Israel
| | - Ivy Zhang
- Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, Computational and Systems Biology Program, New York, NY 10065, USA
| | - Hadeer Zidane
- The Weizmann Institute of Science, Wohl Institute for Drug Discovery of the Nancy and Stephen Grand Israel National Center for Personalized Medicine, Rehovot, 7610001, Israel
| | - Nicole Zitzmann
- University of Oxford, Department of Biochemistry, Oxford Glycobiology Institute, South Parks Road, Oxford OX1 3QU, UK
| | - Sarah N Zvornicanin
- University of Massachusetts, Chan Medical School, Department of Biochemistry and Molecular Biotechnology, Worcester MA 01655, USA
| |
Collapse
|