1
|
Sletten ET, Wolf JB, Danglad‐Flores J, Seeberger PH. Carbohydrate Synthesis is Entering the Data-Driven Digital Era. Chemistry 2025; 31:e202500289. [PMID: 40178205 PMCID: PMC12080308 DOI: 10.1002/chem.202500289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2025] [Revised: 03/27/2025] [Accepted: 03/28/2025] [Indexed: 04/05/2025]
Abstract
Glycans are vital in biological processes, but their nontemplated, heterogeneous structures complicate structure-function analyses. Glycosylation, the key reaction in synthetic glycochemistry, remains not entirely predictable due to its complex mechanism and the need for protecting groups that impact reaction outcomes. This concept highlights recent advancements in glycochemistry and emphasizes the integration of digital tools, including automation, computational modelling, and data management, to improve carbohydrate synthesis and support further progress in the field.
Collapse
Affiliation(s)
- Eric T. Sletten
- Max Planck Institute of Colloids and InterfacesPotsdam Science ParkAm Mühlenberg 114476PotsdamGermany
| | - Jakob B. Wolf
- Max Planck Institute of Colloids and InterfacesPotsdam Science ParkAm Mühlenberg 114476PotsdamGermany
- Institut für Chemie, Biochemie und PharmazieFreie Universität BerlinTakusstraße 314195BerlinGermany
| | - José Danglad‐Flores
- Max Planck Institute of Colloids and InterfacesPotsdam Science ParkAm Mühlenberg 114476PotsdamGermany
| | - Peter H. Seeberger
- Max Planck Institute of Colloids and InterfacesPotsdam Science ParkAm Mühlenberg 114476PotsdamGermany
- Institut für Chemie, Biochemie und PharmazieFreie Universität BerlinTakusstraße 314195BerlinGermany
| |
Collapse
|
2
|
Tanabe Y, Sugisawa H, Miyazawa T, Hotta K, Shiratori K, Fujitani T. High-Throughput Optimization of a High-Pressure Catalytic Reaction. J Chem Inf Model 2025; 65:2245-2250. [PMID: 39977660 DOI: 10.1021/acs.jcim.4c02273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2025]
Abstract
High-throughput optimization of a hydroformylation reaction using CO2 instead of CO was performed through Bayesian optimization in combination with a high-throughput screening system. CO2 and H2 pressure as well as catalyst composition were efficiently optimized by transferring a surrogate model, constructed through catalyst composition optimization, for the comprehensive optimization of the entire search space. This method successfully increased the aldehyde yield by 1.5 times compared to that reported in the literature with a combination of small amounts of Rh and Ru catalysts combined with ionic liquid with chloride ions. The optimization was completed within 1-2 months through the combination of AI, robotics, and human expertise, demonstrating the feasibility of rapid catalyst development, even for high-pressure reactions.
Collapse
Affiliation(s)
- Yusuke Tanabe
- Science & Innovation Center, Mitsubishi Chemical Corporation (MCC), Yokohama, Kanagawa 227-8502, Japan
| | - Hiroki Sugisawa
- Science & Innovation Center, Mitsubishi Chemical Corporation (MCC), Yokohama, Kanagawa 227-8502, Japan
| | - Tomohisa Miyazawa
- Interdisciplinary Research Center for Catalytic Chemistry, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8565, Japan
| | - Kazuhiro Hotta
- Science & Innovation Center, Mitsubishi Chemical Corporation (MCC), Yokohama, Kanagawa 227-8502, Japan
| | - Kazuya Shiratori
- Science & Innovation Center, Mitsubishi Chemical Corporation (MCC), Yokohama, Kanagawa 227-8502, Japan
| | - Tadahiro Fujitani
- Interdisciplinary Research Center for Catalytic Chemistry, National Institute of Advanced Industrial Science and Technology (AIST), 1-1-1 Higashi, Tsukuba, Ibaraki 305-8565, Japan
| |
Collapse
|
3
|
Felten S, He CQ, Emmert MH. C-H Aminoalkylation of 5-Membered Heterocycles: Influence of Descriptors, Data Set Size, and Data Quality on the Predictiveness of Machine Learning Models and Expansion of the Substrate Space Beyond 1,3-Azoles. J Org Chem 2025; 90:2613-2625. [PMID: 39933045 DOI: 10.1021/acs.joc.4c02574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2025]
Abstract
We report a general C-H aminoalkylation of 5-membered heterocycles through a combined machine learning/experimental workflow. Our work describes previously unknown C-H functionalization reactivity and creates a predictive machine learning (ML) model through iterative refinement over 6 rounds of active learning. The initial model established with 1,3-azoles predicts the reactivities of N-aryl indazoles, 1,2,4-triazolopyrazines, 1,2,3-thiadiazoles, and 1,3,4-oxadiazoles, while other substrate classes (e.g., pyrazoles and 1,2,4-triazoles) are not predicted well. The final model includes the reactivities of additional heterocyclic scaffolds in the training data, which results in high predictive accuracy across all of the tested cores. The high prediction performance is shown both within the training set via cross-validation (CV R2 = 0.81) and when predicting unseen substrates of diverse molecular weight and structure (Test R2 = 0.95). The concept of feature engineering is discussed, and we benchmark mechanistically related DFT-based features that are more time-intensive and laborious in comparison with molecular descriptors and fingerprints. Importantly, this work establishes novel reactivity for heterocycles for which C-H functionalization methods are underdeveloped. Since such heterocycles are key motifs in drug discovery and development, we expect this work to be of significant use to the synthetic and synthesis-oriented ML communities.
Collapse
Affiliation(s)
- Stephanie Felten
- Process Research and Development, MRL, Merck & Co., Inc., 126 E Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Cyndi Qixin He
- Computational and Structural Chemistry, MRL, Merck & Co., Inc., 126 E Lincoln Ave, Rahway, New Jersey 07065, United States
| | - Marion H Emmert
- Process Research and Development, MRL, Merck & Co., Inc., 126 E Lincoln Ave, Rahway, New Jersey 07065, United States
| |
Collapse
|
4
|
Muroga S, Honda T, Miki Y, Nakajima H, Futaba DN, Hata K. Real-time autonomous control of a continuous macroscopic process as demonstrated by plastic forming. MATERIALS HORIZONS 2025; 12:623-629. [PMID: 39508391 DOI: 10.1039/d4mh00051j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2024]
Abstract
To meet the need for more adaptable and expedient approaches in research and manufacturing, we present a continuous autonomous system that leverages real-time, in situ characterization and an active-learning-based decision-making processor. This system was applied to a plastic film forming process to demonstrate its capability in autonomously determining process conditions for specified film dimensions without human intervention. Application of the system to nine film dimensions (width and thickness) highlighted its ability to explore the search space and identify appropriate and stable process conditions, with an average of 11 characterization-adjustment iterations and a processing time of 19 minutes per width, thickness combination. The system successfully avoided common pitfalls, such as repetitive over-correction, and demonstrated high accuracy, with R2 values of 0.87 and 0.90 for film width and thickness, respectively. Moreover, the active learning algorithm enabled the system to begin exploration with zero training data, effectively addressing the complex and interdependent relationships between control factors (material supply rate, applied force, material viscosity) in the continuous plastic forming process. Given that the core concept of this autonomous process can, in principle, be transferred to other continuous material processing systems, these results have implications for accelerating progress in both research and industry.
Collapse
Affiliation(s)
- Shun Muroga
- Nano Carbon Device Research Center, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 5, 1-1-1, Higashi, Tsukuba, Ibaraki, 305-8565, Japan.
| | - Takashi Honda
- Research Association of High-Throughput Design and Development for Advanced Functional Materials (ADMAT), Tsukuba, Ibaraki, 305-8568, Japan
| | - Yasuaki Miki
- Nano Carbon Device Research Center, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 5, 1-1-1, Higashi, Tsukuba, Ibaraki, 305-8565, Japan.
| | - Hideaki Nakajima
- Nano Carbon Device Research Center, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 5, 1-1-1, Higashi, Tsukuba, Ibaraki, 305-8565, Japan.
| | - Don N Futaba
- Nano Carbon Device Research Center, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 5, 1-1-1, Higashi, Tsukuba, Ibaraki, 305-8565, Japan.
| | - Kenji Hata
- Nano Carbon Device Research Center, National Institute of Advanced Industrial Science and Technology, Tsukuba Central 5, 1-1-1, Higashi, Tsukuba, Ibaraki, 305-8565, Japan.
| |
Collapse
|
5
|
Cheng AH, Ser CT, Skreta M, Guzmán-Cordero A, Thiede L, Burger A, Aldossary A, Leong SX, Pablo-García S, Strieth-Kalthoff F, Aspuru-Guzik A. Spiers Memorial Lecture: How to do impactful research in artificial intelligence for chemistry and materials science. Faraday Discuss 2025; 256:10-60. [PMID: 39400305 DOI: 10.1039/d4fd00153b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline current applications across a diversity of problems in chemistry. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.
Collapse
Affiliation(s)
- Austin H Cheng
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andrés Guzmán-Cordero
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Tinbergen Institute, University of Amsterdam, Amsterdam, Netherlands
| | - Luca Thiede
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andreas Burger
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | | | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore 63737, Singapore
| | | | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Acceleration Consortium, Toronto, Ontario M5G 1X6, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Canada
- Department of Materials Science and Engineering, University of Toronto, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Canada
| |
Collapse
|
6
|
Velasco PQ, Hippalgaonkar K, Ramalingam B. Emerging trends in the optimization of organic synthesis through high-throughput tools and machine learning. Beilstein J Org Chem 2025; 21:10-38. [PMID: 39811684 PMCID: PMC11730176 DOI: 10.3762/bjoc.21.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 11/26/2024] [Indexed: 01/16/2025] Open
Abstract
The discovery of the optimal conditions for chemical reactions is a labor-intensive, time-consuming task that requires exploring a high-dimensional parametric space. Historically, the optimization of chemical reactions has been performed by manual experimentation guided by human intuition and through the design of experiments where reaction variables are modified one at a time to find the optimal conditions for a specific reaction outcome. Recently, a paradigm change in chemical reaction optimization has been enabled by advances in lab automation and the introduction of machine learning algorithms. Therein, multiple reaction variables can be synchronously optimized to obtain the optimal reaction conditions, requiring a shorter experimentation time and minimal human intervention. Herein, we review the currently used state-of-the-art high-throughput automated chemical reaction platforms and machine learning algorithms that drive the optimization of chemical reactions, highlighting the limitations and future opportunities of this new field of research.
Collapse
Affiliation(s)
- Pablo Quijano Velasco
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
| | - Kedar Hippalgaonkar
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Department of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Republic of Singapore
- Institute for Functional Intelligent Materials, National University of Singapore, 4 Science Drive 2, Singapore 117544, Republic of Singapore
| | - Balamurugan Ramalingam
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science Technology and Research (A*STAR), 1 Pesek Road, Jurong Island, Singapore 627833, Republic of Singapore
| |
Collapse
|
7
|
Haas BC, Kalyani D, Sigman MS. Applying statistical modeling strategies to sparse datasets in synthetic chemistry. SCIENCE ADVANCES 2025; 11:eadt3013. [PMID: 39742471 DOI: 10.1126/sciadv.adt3013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/20/2024] [Accepted: 11/20/2024] [Indexed: 01/03/2025]
Abstract
The application of statistical modeling in organic chemistry is emerging as a standard practice for probing structure-activity relationships and as a predictive tool for many optimization objectives. This review is aimed as a tutorial for those entering the area of statistical modeling in chemistry. We provide case studies to highlight the considerations and approaches that can be used to successfully analyze datasets in low data regimes, a common situation encountered given the experimental demands of organic chemistry. Statistical modeling hinges on the data (what is being modeled), descriptors (how data are represented), and algorithms (how data are modeled). Herein, we focus on how various reaction outputs (e.g., yield, rate, selectivity, solubility, stability, and turnover number) and data structures (e.g., binned, heavily skewed, and distributed) influence the choice of algorithm used for constructing predictive and chemically insightful statistical models.
Collapse
Affiliation(s)
- Brittany C Haas
- Department of Chemistry, University of Utah, Salt Lake City, UT 84112, USA
| | | | - Matthew S Sigman
- Department of Chemistry, University of Utah, Salt Lake City, UT 84112, USA
| |
Collapse
|
8
|
Kaspersetz L, Englert B, Krah F, Martinez EC, Neubauer P, Cruz Bournazou MN. Management of experimental workflows in robotic cultivation platforms. SLAS Technol 2024; 29:100214. [PMID: 39486480 DOI: 10.1016/j.slast.2024.100214] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2024] [Revised: 09/13/2024] [Accepted: 10/22/2024] [Indexed: 11/04/2024]
Abstract
In the last decades, robotic cultivation facilities combined with automated execution of workflows have drastically increased the speed of research in biotechnology. In this work, we present the design and deployment of a digital infrastructure for robotic cultivation platforms. We implement a Workflow Management System, using Directed Acyclic Graphs, based on the open-source platform Apache Airflow to increase traceability and the automated execution of experiments. We demonstrate the integration and automation of experimental workflows in a laboratory environment with a heterogeneous device landscape including liquid handling stations, parallel cultivation systems, and mobile robots. The feasibility of our approach is assessed in parallel E. coli fed-batch cultivations with glucose oscillations in which different elastin-like proteins are produced. We show that the use of workflow management systems in robotic cultivation platforms increases automation, robustness and traceability of experimental data.
Collapse
Affiliation(s)
- Lucas Kaspersetz
- Technische Universität Berlin, Institute of Biotechnology, Chair of Bioprocess Engineering, Ackerstr. 76, 13355, Berlin, Germany.
| | - Britta Englert
- Technische Universität Berlin, Institute of Biotechnology, Chair of Bioprocess Engineering, Ackerstr. 76, 13355, Berlin, Germany
| | - Fabian Krah
- BTC Business Technology Consulting AG, Escherweg 5, 26121, Oldenburg, Germany
| | - Ernesto C Martinez
- Technische Universität Berlin, Institute of Biotechnology, Chair of Bioprocess Engineering, Ackerstr. 76, 13355, Berlin, Germany; INGAR, (CONICET - UTN), Avellaneda 3657, Santa Fe, Argentina
| | - Peter Neubauer
- Technische Universität Berlin, Institute of Biotechnology, Chair of Bioprocess Engineering, Ackerstr. 76, 13355, Berlin, Germany
| | - M Nicolas Cruz Bournazou
- Technische Universität Berlin, Institute of Biotechnology, Chair of Bioprocess Engineering, Ackerstr. 76, 13355, Berlin, Germany.
| |
Collapse
|
9
|
Augustine L, Wang Y, Adelman SL, Batista ER, Kozimor SA, Perez D, Schrier J, Yang P. Advancing Rare-Earth (4 f) and Actinide (5 f) Separation through Machine Learning and Automated High-Throughput Experiments. ACS SUSTAINABLE CHEMISTRY & ENGINEERING 2024; 12:16692-16699. [PMID: 39545104 PMCID: PMC11558677 DOI: 10.1021/acssuschemeng.4c06166] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 10/14/2024] [Accepted: 10/14/2024] [Indexed: 11/17/2024]
Abstract
Identifying improved and sustainable alternatives to "classic" separation techniques is an active research field due to its potential widespread impact in fundamental and applied chemistry. As basic purification methodologies, like liquid-liquid extraction, undergo continuous refinement by chemists and engineers, identifying new conditions that outperform existing techniques can be difficult. A major contributor to this challenging problem is the need to explore a vast experimental space to identify the precise conditions that optimize the separation procedure. The advent of artificial intelligence and the advancement of robotic technologies offer the potential to shift the traditional design paradigm. Toward that end, we applied a combination of Bayesian Optimization and high-throughput robotic experiments on the liquid-liquid extraction of thorium (Th4+) and demonstrated that this approach speeds up discovery and significantly accelerates the optimization process. By using Bayesian Optimization as a guide, our automated instrument carried out a total of 339 distribution ratio measurements, corresponding to 113 unique conditions, identifying the optimal experimental conditions with reduced experimental efforts by an estimated 74% compared to a traditional full screening approach. This time and cost saving is particularly significant for radioactive materials, as it not only is more economical and sustainable but also minimizes human exposure to radioactivity.
Collapse
Affiliation(s)
- Logan
J. Augustine
- Theoretical
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Yufei Wang
- Chemistry
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Sara L. Adelman
- Chemistry
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Enrique R. Batista
- Theoretical
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Stosh A. Kozimor
- Chemistry
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Danny Perez
- Theoretical
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| | - Joshua Schrier
- Department
of Chemistry & Biochemistry, Fordham
University, The Bronx, New York 10458, United States
| | - Ping Yang
- Theoretical
Division, Los Alamos National Lab, Los Alamos, New Mexico 87545, United States
| |
Collapse
|
10
|
McMinn SE, Miller DV, Yur D, Stone K, Xu Y, Vikram A, Murali S, Raffaele J, Holland D, Wang SC, Smith JP. High-Throughput Algorithmic Optimization of In Vitro Transcription for SARS-CoV-2 mRNA Vaccine Production. Biochemistry 2024; 63:2793-2802. [PMID: 39428617 DOI: 10.1021/acs.biochem.4c00188] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2024]
Abstract
The in vitro transcription (IVT) of messenger ribonucleic acid (mRNA) from the linearized deoxyribonucleic acid (DNA) template of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) Delta variant (B.1.617.2) was optimized for total mRNA yield and purity (by percent intact mRNA) utilizing machine learning in conjunction with automated, high-throughput liquid handling technology. An iterative Bayesian optimization approach successfully optimized 11 critical process parameters in 42 reactions across 5 experimental rounds. Once the optimized conditions were achieved, an automated, high-throughput screen was conducted to evaluate commercially available T7 RNA polymerases for rate and quality of mRNA production. Final conditions showed a 12% yield improvement and a 50% reduction in reaction time, while simultaneously significantly decreasing (up to 44% reduction) the use of expensive reagents. This novel platform offers a powerful new approach for optimizing IVT reactions for mRNA production.
Collapse
Affiliation(s)
- Spencer E McMinn
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Danielle V Miller
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Daniel Yur
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Kevin Stone
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Yuting Xu
- Biometrics Research, Merck & Co., Inc., Rahway, New Jersey 07065, United States
| | - Ajit Vikram
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Shashank Murali
- Process Development, Eurofins PSS, West Point, Pennsylvania 19486, United States
| | - Jessica Raffaele
- Analytical Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - David Holland
- Analytical Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Sheng-Ching Wang
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| | - Joseph P Smith
- Process Research and Development, Merck & Co., Inc., West Point, Pennsylvania 19486, United States
| |
Collapse
|
11
|
Dai T, Vijayakrishnan S, Szczypiński FT, Ayme JF, Simaei E, Fellowes T, Clowes R, Kotopanov L, Shields CE, Zhou Z, Ward JW, Cooper AI. Autonomous mobile robots for exploratory synthetic chemistry. Nature 2024; 635:890-897. [PMID: 39506122 PMCID: PMC11602721 DOI: 10.1038/s41586-024-08173-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Accepted: 10/08/2024] [Indexed: 11/08/2024]
Abstract
Autonomous laboratories can accelerate discoveries in chemical synthesis, but this requires automated measurements coupled with reliable decision-making1,2. Most autonomous laboratories involve bespoke automated equipment3-6, and reaction outcomes are often assessed using a single, hard-wired characterization technique7. Any decision-making algorithms8 must then operate using this narrow range of characterization data9,10. By contrast, manual experiments tend to draw on a wider range of instruments to characterize reaction products, and decisions are rarely taken based on one measurement alone. Here we show that a synthesis laboratory can be integrated into an autonomous laboratory by using mobile robots11-13 that operate equipment and make decisions in a human-like way. Our modular workflow combines mobile robots, an automated synthesis platform, a liquid chromatography-mass spectrometer and a benchtop nuclear magnetic resonance spectrometer. This allows robots to share existing laboratory equipment with human researchers without monopolizing it or requiring extensive redesign. A heuristic decision-maker processes the orthogonal measurement data, selecting successful reactions to take forward and automatically checking the reproducibility of any screening hits. We exemplify this approach in the three areas of structural diversification chemistry, supramolecular host-guest chemistry and photochemical synthesis. This strategy is particularly suited to exploratory chemistry that can yield multiple potential products, as for supramolecular assemblies, where we also extend the method to an autonomous function assay by evaluating host-guest binding properties.
Collapse
Affiliation(s)
- Tianwei Dai
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Sriram Vijayakrishnan
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Filip T Szczypiński
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Jean-François Ayme
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Ehsan Simaei
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Thomas Fellowes
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Rob Clowes
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Lyubomir Kotopanov
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Caitlin E Shields
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Zhengxue Zhou
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - John W Ward
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Andrew I Cooper
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK.
| |
Collapse
|
12
|
Angelopoulos A, Cahoon JF, Alterovitz R. Transforming science labs into automated factories of discovery. Sci Robot 2024; 9:eadm6991. [PMID: 39441898 DOI: 10.1126/scirobotics.adm6991] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2024] [Accepted: 09/24/2024] [Indexed: 10/25/2024]
Abstract
Laboratories in chemistry, biochemistry, and materials science are at the leading edge of technology, discovering molecules and materials to unlock capabilities in energy, catalysis, biotechnology, sustainability, electronics, and more. Yet, most modern laboratories resemble factories from generations past, with a large reliance on humans manually performing synthesis and characterization tasks. Robotics and automation can enable scientific experiments to be conducted faster, more safely, more accurately, and with greater reproducibility, allowing scientists to tackle large societal problems in domains such as health and energy on a shorter timescale. We define five levels of laboratory automation, from laboratory assistance to full automation. We also introduce robotics research challenges that arise when increasing levels of automation and when increasing the generality of tasks within the laboratory. Robots are poised to transform science labs into automated factories of discovery that accelerate scientific progress.
Collapse
Affiliation(s)
- Angelos Angelopoulos
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - James F Cahoon
- Department of Chemistry, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Ron Alterovitz
- Department of Computer Science, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| |
Collapse
|
13
|
Tom G, Schmid SP, Baird SG, Cao Y, Darvish K, Hao H, Lo S, Pablo-García S, Rajaonson EM, Skreta M, Yoshikawa N, Corapi S, Akkoc GD, Strieth-Kalthoff F, Seifrid M, Aspuru-Guzik A. Self-Driving Laboratories for Chemistry and Materials Science. Chem Rev 2024; 124:9633-9732. [PMID: 39137296 PMCID: PMC11363023 DOI: 10.1021/acs.chemrev.4c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
Self-driving laboratories (SDLs) promise an accelerated application of the scientific method. Through the automation of experimental workflows, along with autonomous experimental planning, SDLs hold the potential to greatly accelerate research in chemistry and materials discovery. This review provides an in-depth analysis of the state-of-the-art in SDL technology, its applications across various scientific disciplines, and the potential implications for research and industry. This review additionally provides an overview of the enabling technologies for SDLs, including their hardware, software, and integration with laboratory infrastructure. Most importantly, this review explores the diverse range of scientific domains where SDLs have made significant contributions, from drug discovery and materials science to genomics and chemistry. We provide a comprehensive review of existing real-world examples of SDLs, their different levels of automation, and the challenges and limitations associated with each domain.
Collapse
Affiliation(s)
- Gary Tom
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Stefan P. Schmid
- Department
of Chemistry and Applied Biosciences, ETH
Zurich, Vladimir-Prelog-Weg 1, CH-8093 Zurich, Switzerland
| | - Sterling G. Baird
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Yang Cao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Kourosh Darvish
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Han Hao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Stanley Lo
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Sergio Pablo-García
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
| | - Ella M. Rajaonson
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Naruki Yoshikawa
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Samantha Corapi
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Gun Deniz Akkoc
- Forschungszentrum
Jülich GmbH, Helmholtz Institute
for Renewable Energy Erlangen-Nürnberg, Cauerstr. 1, 91058 Erlangen, Germany
- Department
of Chemical and Biological Engineering, Friedrich-Alexander Universität Erlangen-Nürnberg, Egerlandstr. 3, 91058 Erlangen, Germany
| | - Felix Strieth-Kalthoff
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- School of
Mathematics and Natural Sciences, University
of Wuppertal, Gaußstraße
20, 42119 Wuppertal, Germany
| | - Martin Seifrid
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Department
of Materials Science and Engineering, North
Carolina State University, Raleigh, North Carolina 27695, United States of America
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science & Engineering, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research (CIFAR), 661
University Ave, Toronto, Ontario M5G 1M1, Canada
| |
Collapse
|
14
|
Sato T, Masuda K, Sano C, Matsumoto K, Numata H, Munetoh S, Kasama T, Miyake R. Democratizing Microreactor Technology for Accelerated Discoveries in Chemistry and Materials Research. MICROMACHINES 2024; 15:1064. [PMID: 39337724 PMCID: PMC11434323 DOI: 10.3390/mi15091064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 08/11/2024] [Accepted: 08/14/2024] [Indexed: 09/30/2024]
Abstract
Microreactor technologies have emerged as versatile platforms with the potential to revolutionize chemistry and materials research, offering sustainable solutions to global challenges in environmental and health domains. This survey paper provides an in-depth review of recent advancements in microreactor technologies, focusing on their role in facilitating accelerated discoveries in chemistry and materials. Specifically, we examine the convergence of microfluidics with machine intelligence and automation, enabling the exploitation of the cyber-physical environment as a highly integrated experimentation platform for rapid scientific discovery and process development. We investigate the applicability and limitations of microreactor-enabled discovery accelerators in various chemistry and materials contexts. Despite their tremendous potential, the integration of machine intelligence and automation into microreactor-based experiments presents challenges in establishing fully integrated, automated, and intelligent systems. These challenges can hinder the broader adoption of microreactor technologies within the research community. To address this, we review emerging technologies that can help lower barriers and facilitate the implementation of microreactor-enabled discovery accelerators. Lastly, we provide our perspective on future research directions for democratizing microreactor technologies, with the aim of accelerating scientific discoveries and promoting widespread adoption of these transformative platforms.
Collapse
Affiliation(s)
- Tomomi Sato
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Koji Masuda
- Department of Physics and Astronomy, University of Exeter, Exeter EX4 4QL, UK;
| | - Chikako Sano
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Keiji Matsumoto
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Hidetoshi Numata
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Seiji Munetoh
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Toshihiro Kasama
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Ryo Miyake
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| |
Collapse
|
15
|
Kalikadien AV, Mirza A, Hossaini AN, Sreenithya A, Pidko EA. Paving the road towards automated homogeneous catalyst design. Chempluschem 2024; 89:e202300702. [PMID: 38279609 DOI: 10.1002/cplu.202300702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2023] [Revised: 12/20/2023] [Indexed: 01/28/2024]
Abstract
In the past decade, computational tools have become integral to catalyst design. They continue to offer significant support to experimental organic synthesis and catalysis researchers aiming for optimal reaction outcomes. More recently, data-driven approaches utilizing machine learning have garnered considerable attention for their expansive capabilities. This Perspective provides an overview of diverse initiatives in the realm of computational catalyst design and introduces our automated tools tailored for high-throughput in silico exploration of the chemical space. While valuable insights are gained through methods for high-throughput in silico exploration and analysis of chemical space, their degree of automation and modularity are key. We argue that the integration of data-driven, automated and modular workflows is key to enhancing homogeneous catalyst design on an unprecedented scale, contributing to the advancement of catalysis research.
Collapse
Affiliation(s)
- Adarsh V Kalikadien
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Adrian Mirza
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Aydin Najl Hossaini
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Avadakkam Sreenithya
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| | - Evgeny A Pidko
- Inorganic Systems Engineering, Department of Chemical Engineering, Faculty of Applied Sciences, Delft University of Technology, Van der Maasweg 9, 2629 HZ, Delft, The Netherlands
| |
Collapse
|
16
|
Raghavan P, Rago AJ, Verma P, Hassan MM, Goshu GM, Dombrowski AW, Pandey A, Coley CW, Wang Y. Incorporating Synthetic Accessibility in Drug Design: Predicting Reaction Yields of Suzuki Cross-Couplings by Leveraging AbbVie's 15-Year Parallel Library Data Set. J Am Chem Soc 2024; 146:15070-15084. [PMID: 38768950 PMCID: PMC11157529 DOI: 10.1021/jacs.4c00098] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Revised: 04/24/2024] [Accepted: 04/25/2024] [Indexed: 05/22/2024]
Abstract
Despite the increased use of computational tools to supplement medicinal chemists' expertise and intuition in drug design, predicting synthetic yields in medicinal chemistry endeavors remains an unsolved challenge. Existing design workflows could profoundly benefit from reaction yield prediction, as precious material waste could be reduced, and a greater number of relevant compounds could be delivered to advance the design, make, test, analyze (DMTA) cycle. In this work, we detail the evaluation of AbbVie's medicinal chemistry library data set to build machine learning models for the prediction of Suzuki coupling reaction yields. The combination of density functional theory (DFT)-derived features and Morgan fingerprints was identified to perform better than one-hot encoded baseline modeling, furnishing encouraging results. Overall, we observe modest generalization to unseen reactant structures within the 15-year retrospective library data set. Additionally, we compare predictions made by the model to those made by expert medicinal chemists, finding that the model can often predict both reaction success and reaction yields with greater accuracy. Finally, we demonstrate the application of this approach to suggest structurally and electronically similar building blocks to replace those predicted or observed to be unsuccessful prior to or after synthesis, respectively. The yield prediction model was used to select similar monomers predicted to have higher yields, resulting in greater synthesis efficiency of relevant drug-like molecules.
Collapse
Affiliation(s)
- Priyanka Raghavan
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| | - Alexander J. Rago
- Advanced
Chemistry Technologies Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Pritha Verma
- Advanced
Chemistry Technologies Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Majdi M. Hassan
- RAIDERS
Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Gashaw M. Goshu
- Advanced
Chemistry Technologies Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Amanda W. Dombrowski
- Advanced
Chemistry Technologies Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Abhishek Pandey
- RAIDERS
Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| | - Connor W. Coley
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Ave, Cambridge, Massachusetts 02139, United States
| | - Ying Wang
- Advanced
Chemistry Technologies Group, AbbVie, Inc., 1 N Waukegan Rd, North Chicago, Illinois 60064, United States
| |
Collapse
|
17
|
Schilter O, Gutierrez DP, Folkmann LM, Castrogiovanni A, García-Durán A, Zipoli F, Roch LM, Laino T. Combining Bayesian optimization and automation to simultaneously optimize reaction conditions and routes. Chem Sci 2024; 15:7732-7741. [PMID: 38784737 PMCID: PMC11110165 DOI: 10.1039/d3sc05607d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/05/2024] [Indexed: 05/25/2024] Open
Abstract
Reaching optimal reaction conditions is crucial to achieve high yields, minimal by-products, and environmentally sustainable chemical reactions. With the recent rise of artificial intelligence, there has been a shift from traditional Edisonian trial-and-error optimization to data-driven and automated approaches, which offer significant advantages. Here, we showcase the capabilities of an integrated platform; we conducted simultaneous optimizations of four different terminal alkynes and two reaction routes using an automation platform combined with a Bayesian optimization platform. Remarkably, we achieved a conversion rate of over 80% for all four substrates in 23 experiments, covering ca. 0.2% of the combinatorial space. Further analysis allowed us to identify the influence of different reaction parameters on the reaction outcomes, demonstrating the potential for expedited reaction condition optimization and the prospect of more efficient chemical processes in the future.
Collapse
Affiliation(s)
- Oliver Schilter
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| | | | - Linnea M Folkmann
- Atinary Technologies Route de la Corniche 4 1066 Epalinges Switzerland
| | | | | | - Federico Zipoli
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| | - Loïc M Roch
- Atinary Technologies Route de la Corniche 4 1066 Epalinges Switzerland
| | - Teodoro Laino
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| |
Collapse
|
18
|
Strieth-Kalthoff F, Hao H, Rathore V, Derasp J, Gaudin T, Angello NH, Seifrid M, Trushina E, Guy M, Liu J, Tang X, Mamada M, Wang W, Tsagaantsooj T, Lavigne C, Pollice R, Wu TC, Hotta K, Bodo L, Li S, Haddadnia M, Wołos A, Roszak R, Ser CT, Bozal-Ginesta C, Hickman RJ, Vestfrid J, Aguilar-Granda A, Klimareva EL, Sigerson RC, Hou W, Gahler D, Lach S, Warzybok A, Borodin O, Rohrbach S, Sanchez-Lengeling B, Adachi C, Grzybowski BA, Cronin L, Hein JE, Burke MD, Aspuru-Guzik A. Delocalized, asynchronous, closed-loop discovery of organic laser emitters. Science 2024; 384:eadk9227. [PMID: 38753786 DOI: 10.1126/science.adk9227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 04/05/2024] [Indexed: 05/18/2024]
Abstract
Contemporary materials discovery requires intricate sequences of synthesis, formulation, and characterization that often span multiple locations with specialized expertise or instrumentation. To accelerate these workflows, we present a cloud-based strategy that enabled delocalized and asynchronous design-make-test-analyze cycles. We showcased this approach through the exploration of molecular gain materials for organic solid-state lasers as a frontier application in molecular optoelectronics. Distributed robotic synthesis and in-line property characterization, orchestrated by a cloud-based artificial intelligence experiment planner, resulted in the discovery of 21 new state-of-the-art materials. Gram-scale synthesis ultimately allowed for the verification of best-in-class stimulated emission in a thin-film device. Demonstrating the asynchronous integration of five laboratories across the globe, this workflow provides a blueprint for delocalizing-and democratizing-scientific discovery.
Collapse
Affiliation(s)
- Felix Strieth-Kalthoff
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Han Hao
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
| | - Vandana Rathore
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Joshua Derasp
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Théophile Gaudin
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Nicholas H Angello
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Martin Seifrid
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Department of Materials Science and Engineering, North Carolina State University, Raleigh, NC, USA
| | | | - Mason Guy
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Junliang Liu
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Xun Tang
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Masashi Mamada
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Wesley Wang
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tuul Tsagaantsooj
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Cyrille Lavigne
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Robert Pollice
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Tony C Wu
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Kazuhiro Hotta
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Mitsubishi Chemical Corporation Science & Innovation Center, Kanagawa, Japan
| | - Leticia Bodo
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Shangyu Li
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Mohammad Haddadnia
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Agnieszka Wołos
- Allchemy Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Rafał Roszak
- Allchemy Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Carlota Bozal-Ginesta
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Catalonia Institute for Energy Research, Barcelona, Spain
| | - Riley J Hickman
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Jenya Vestfrid
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Andrés Aguilar-Granda
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | | | | | - Wenduan Hou
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Daniel Gahler
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Slawomir Lach
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Adrian Warzybok
- School of Chemistry, University of Glasgow, Glasgow, UK
- Department of Chemical Physics, Jagiellonian University, Krakow, Poland
| | - Oleg Borodin
- School of Chemistry, University of Glasgow, Glasgow, UK
| | | | | | - Chihaya Adachi
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
- Center for Algorithmic and Robotized Synthesis, Institute for Basic Science, Ulsan, Republic of Korea
- Department of Chemistry, Ulsan Institute of Science and Technology, Ulsan, Republic of Korea
| | - Leroy Cronin
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Jason E Hein
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
- Department of Chemistry, University of Bergen, Bergen, Norway
| | - Martin D Burke
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Carle Illinois College of Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Materials Science and Engineering, University of Toronto, Toronto, ON, Canada
- Canadian Institute for Advanced Research (CIFAR), Toronto, ON, Canada
| |
Collapse
|
19
|
Christensen M, Xu Y, Kwan EE, Di Maso MJ, Ji Y, Reibarkh M, Sun AC, Liaw A, Fier PS, Grosser S, Hein JE. Dynamic sampling in autonomous process optimization. Chem Sci 2024; 15:7160-7169. [PMID: 38756794 PMCID: PMC11095507 DOI: 10.1039/d3sc06884f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2023] [Accepted: 04/10/2024] [Indexed: 05/18/2024] Open
Abstract
Autonomous process optimization (APO) is a technology that has recently found utility in a multitude of process optimization challenges. In contrast to most APO examples in microflow reactor systems, we recently presented a system capable of optimization in high-throughput batch reactor systems. The drawback of APO in a high-throughput batch reactor system is the reliance on reaction sampling at a predetermined static timepoint rather than a dynamic endpoint. Static timepoint sampling can lead to the inconsistent capture of the process performance under each process parameter permutation. This is important because critical process behaviors such as rate acceleration accompanied by decomposition could be missed entirely. To address this drawback, we implemented a dynamic reaction endpoint determination strategy to capture the product purity once the process stream stabilized. We accomplished this through the incorporation of a real-time plateau detection algorithm into the APO workflow to measure and report the product purity at the dynamically determined reaction endpoint. We then applied this strategy to the autonomous optimization of a photobromination reaction towards the synthesis of a pharmaceutically relevant intermediate. In doing so, we not only uncovered process conditions to access the desired monohalogenation product in 85 UPLC area % purity with minimal decomposition risk, but also measured the effect of each parameter on the process performance. Our results highlight the advantage of incorporating dynamic sampling in APO workflows to drive optimization toward a stable and high-performing process.
Collapse
Affiliation(s)
- Melodie Christensen
- Department of Chemistry, University of British Columbia Vancouver British Columbia V6T 1Z1 Canada
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Yuting Xu
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Eugene E Kwan
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Michael J Di Maso
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Yining Ji
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Mikhail Reibarkh
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Alexandra C Sun
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Andy Liaw
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Patrick S Fier
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Shane Grosser
- Department of Process Research and Development, Merck & Co., Inc Rahway NJ 07065 USA
| | - Jason E Hein
- Department of Chemistry, University of British Columbia Vancouver British Columbia V6T 1Z1 Canada
- Acceleration Consortium, University of Toronto Toronto ON Canada
- Department of Chemistry, University of Bergen Bergen Norway
| |
Collapse
|
20
|
Clarke GE, Firth JD, Ledingham LA, Horbaczewskyj CS, Bourne RA, Bray JTW, Martin PL, Eastwood JB, Campbell R, Pagett A, MacQuarrie DJ, Slattery JM, Lynam JM, Whitwood AC, Milani J, Hart S, Wilson J, Fairlamb IJS. Deciphering complexity in Pd-catalyzed cross-couplings. Nat Commun 2024; 15:3968. [PMID: 38729925 PMCID: PMC11087562 DOI: 10.1038/s41467-024-47939-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2023] [Accepted: 04/12/2024] [Indexed: 05/12/2024] Open
Abstract
Understanding complex reaction systems is critical in chemistry. While synthetic methods for selective formation of products are sought after, oftentimes it is the full reaction signature, i.e., complete profile of products/side-products, that informs mechanistic rationale and accelerates discovery chemistry. Here, we report a methodology using high-throughput experimentation and multivariate data analysis to examine the full signature of one of the most complicated chemical reactions catalyzed by palladium known in the chemical literature. A model Pd-catalyzed reaction was selected involving functionalization of 2-bromo-N-phenylbenzamide and multiple bond activation pathways. Principal component analysis, correspondence analysis and heatmaps with hierarchical clustering reveal the factors contributing to the variance in product distributions and show associations between solvents and reaction products. Using robust data from experiments performed with eight solvents, for four different reaction times at five different temperatures, we correlate side-products to a major dominant N-phenyl phenanthridinone product, and many other side products.
Collapse
Affiliation(s)
- George E Clarke
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - James D Firth
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | | | | | - Richard A Bourne
- Institute of Process Research and Development, School of Chemistry & School of Chemical and Process Engineering, University of Leeds, Leeds, LS2 9JT, UK
| | - Joshua T W Bray
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Poppy L Martin
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | | | - Rebecca Campbell
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Alex Pagett
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | | | - John M Slattery
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Jason M Lynam
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Adrian C Whitwood
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Jessica Milani
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Sam Hart
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK
| | - Julie Wilson
- Department of Mathematics, University of York, Heslington, York, YO10 5DD, UK.
| | - Ian J S Fairlamb
- Department of Chemistry, University of York, Heslington, York, YO10 5DD, UK.
| |
Collapse
|
21
|
Wagner F, Sagmeister P, Jusner CE, Tampone TG, Manee V, Buono FG, Williams JD, Kappe CO. A Slug Flow Platform with Multiple Process Analytics Facilitates Flexible Reaction Optimization. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2308034. [PMID: 38273711 PMCID: PMC10987115 DOI: 10.1002/advs.202308034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/30/2023] [Revised: 12/21/2023] [Indexed: 01/27/2024]
Abstract
Flow processing offers many opportunities to optimize reactions in a rapid and automated manner, yet often requires relatively large quantities of input materials. To combat this, the use of a flexible slug flow reactor, equipped with two analytical instruments, for low-volume optimization experiments are reported. A Buchwald-Hartwig amination toward the drug olanzapine, with 6 independent optimizable variables, is optimized using three different automated approaches: self-optimization, design of experiments, and kinetic modeling. These approaches are complementary and provide differing information on the reaction: pareto optimal operating points, response surface models, and mechanistic models, respectively. The results are achieved using <10% of the material that would be required for standard flow operation. Finally, a chemometric model is built utilizing automated data handling and three subsequent validation experiments demonstrate good agreement between the slug flow reactor and a standard (larger scale) flow reactor.
Collapse
Affiliation(s)
- Florian Wagner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Peter Sagmeister
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Clemens E. Jusner
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - Thomas G. Tampone
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Vidhyadhar Manee
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Frederic G. Buono
- Boehringer Ingelheim Pharmaceuticals, Inc900 Ridgebury RoadRidgefieldCT06877USA
| | - Jason D. Williams
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| | - C. Oliver Kappe
- Center for Continuous Flow Synthesis and Processing (CC FLOW)Research Center Pharmaceutical Engineering GmbH (RCPE)Inffeldgasse 13Graz8010Austria
- Institute of ChemistryUniversity of GrazNAWI Graz, Heinrichstrasse 28Graz8010Austria
| |
Collapse
|
22
|
Okada H, Maeda S. On Accelerating Substrate Optimization Using Computational Gibbs Energy Barriers: A Numerical Consideration Utilizing a Computational Data Set. ACS OMEGA 2024; 9:7123-7131. [PMID: 38371820 PMCID: PMC10870292 DOI: 10.1021/acsomega.3c09066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/05/2024] [Accepted: 01/16/2024] [Indexed: 02/20/2024]
Abstract
Substrate optimization is a time- and resource-consuming step in organic synthesis. Recent advances in chemo- and materials-informatics provide systematic and efficient procedures utilizing tools such as Bayesian optimization (BO). This study explores the possibility of reducing the required experiments further by utilizing computational Gibbs energy barriers. To thoroughly validate the impact of using computational Gibbs energy barriers in BO-assisted substrate optimization, this study employs a computational Gibbs energy barrier data set in the literature and performs an extensive numerical investigation virtually regarding the Gibbs energy barriers as virtual experimental results and those with systematic and random noises as virtual computational results. The present numerical investigation shows that even the computational reactivity affected by noises of as much as 20 kJ/mol helps reduce the number of required experiments.
Collapse
Affiliation(s)
- Hiroaki Okada
- Graduate
School of Chemical Sciences and Engineering, Hokkaido University, Sapporo, Hokkaido 060-8628, Japan
| | - Satoshi Maeda
- Department
of Chemistry, Graduate School of Science, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Institute
for Chemical Reaction Design and Discovery (WPI-ICReDD), Hokkaido University, Sapporo, Hokkaido 001-0021, Japan
- ERATO
Maeda Artificial Intelligence for Chemical Reaction Design and Discovery
Project, Hokkaido University, Sapporo, Hokkaido 060-0810, Japan
- Research
and Services Division of Materials Data and Integrated System (MaDIS), National Institute for Materials Science (NIMS), Tsukuba, Ibaraki 305-0044, Japan
| |
Collapse
|
23
|
Leonov AI, Hammer AJS, Lach S, Mehr SHM, Caramelli D, Angelone D, Khan A, O'Sullivan S, Craven M, Wilbraham L, Cronin L. An integrated self-optimizing programmable chemical synthesis and reaction engine. Nat Commun 2024; 15:1240. [PMID: 38336880 PMCID: PMC10858227 DOI: 10.1038/s41467-024-45444-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 01/22/2024] [Indexed: 02/12/2024] Open
Abstract
Robotic platforms for chemistry are developing rapidly but most systems are not currently able to adapt to changing circumstances in real-time. We present a dynamically programmable system capable of making, optimizing, and discovering new molecules which utilizes seven sensors that continuously monitor the reaction. By developing a dynamic programming language, we demonstrate the 10-fold scale-up of a highly exothermic oxidation reaction, end point detection, as well as detecting critical hardware failures. We also show how the use of in-line spectroscopy such as HPLC, Raman, and NMR can be used for closed-loop optimization of reactions, exemplified using Van Leusen oxazole synthesis, a four-component Ugi condensation and manganese-catalysed epoxidation reactions, as well as two previously unreported reactions, discovered from a selected chemical space, providing up to 50% yield improvement over 25-50 iterations. Finally, we demonstrate an experimental pipeline to explore a trifluoromethylations reaction space, that discovers new molecules.
Collapse
Affiliation(s)
- Artem I Leonov
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Alexander J S Hammer
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Slawomir Lach
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - S Hessam M Mehr
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Dario Caramelli
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Davide Angelone
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Aamir Khan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Steven O'Sullivan
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Matthew Craven
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Liam Wilbraham
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK
| | - Leroy Cronin
- School of Chemistry, The University of Glasgow, University Avenue, Glasgow, G12 8QQ, UK.
| |
Collapse
|
24
|
El-Khawaldeh R, Guy M, Bork F, Taherimakhsousi N, Jones KN, Hawkins JM, Han L, Pritchard RP, Cole BA, Monfette S, Hein JE. Keeping an "eye" on the experiment: computer vision for real-time monitoring and control. Chem Sci 2024; 15:1271-1282. [PMID: 38274057 PMCID: PMC10806693 DOI: 10.1039/d3sc05491h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2023] [Accepted: 11/24/2023] [Indexed: 01/27/2024] Open
Abstract
This work presents a generalizable computer vision (CV) and machine learning model that is used for automated real-time monitoring and control of a diverse array of workup processes. Our system simultaneously monitors multiple physical outputs (e.g., liquid level, homogeneity, turbidity, solid, residue, and color), offering a method for rapid data acquisition and deeper analysis from multiple visual cues. We demonstrate a single platform (consisting of CV, machine learning, real-time monitoring techniques, and flexible hardware) to monitor and control vision-based experimental techniques, including solvent exchange distillation, antisolvent crystallization, evaporative crystallization, cooling crystallization, solid-liquid mixing, and liquid-liquid extraction. Both qualitative (video capturing) and quantitative data (visual outputs measurement) were obtained which provided a method for data cross-validation. Our CV model's ease of use, generalizability, and non-invasiveness make it an appealing complementary option to in situ and real-time analytical monitoring tools and mathematical modeling. Additionally, our platform is integrated with Mettler-Toledo's iControl software, which acts as a centralized system for real-time data collection, visualization, and storage. With consistent data representation and infrastructure, we were able to efficiently transfer the technology and reproduce results between different labs. This ability to easily monitor and respond to the dynamic situational changes of the experiments is pivotal to enabling future flexible automation workflows.
Collapse
Affiliation(s)
- Rama El-Khawaldeh
- Department of Chemistry, University of British Columba Vancouver BC Canada
| | - Mason Guy
- Department of Chemistry, University of British Columba Vancouver BC Canada
| | - Finn Bork
- Department of Chemistry, University of British Columba Vancouver BC Canada
| | | | - Kris N Jones
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Joel M Hawkins
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Lu Han
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Robert P Pritchard
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Blaine A Cole
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Sebastien Monfette
- Pfizer Worldwide Chemical Research and Development, Pfizer Inc. Groton Connecticut 06340 USA
| | - Jason E Hein
- Department of Chemistry, University of British Columba Vancouver BC Canada
- Acceleration Consortium, University of Toronto Toronto ON Canada
- Department of Chemistry, University of Bergen Bergen Norway
| |
Collapse
|
25
|
Voinarovska V, Kabeshov M, Dudenko D, Genheden S, Tetko IV. When Yield Prediction Does Not Yield Prediction: An Overview of the Current Challenges. J Chem Inf Model 2024; 64:42-56. [PMID: 38116926 PMCID: PMC10778086 DOI: 10.1021/acs.jcim.3c01524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Revised: 11/29/2023] [Accepted: 11/30/2023] [Indexed: 12/21/2023]
Abstract
Machine Learning (ML) techniques face significant challenges when predicting advanced chemical properties, such as yield, feasibility of chemical synthesis, and optimal reaction conditions. These challenges stem from the high-dimensional nature of the prediction task and the myriad essential variables involved, ranging from reactants and reagents to catalysts, temperature, and purification processes. Successfully developing a reliable predictive model not only holds the potential for optimizing high-throughput experiments but can also elevate existing retrosynthetic predictive approaches and bolster a plethora of applications within the field. In this review, we systematically evaluate the efficacy of current ML methodologies in chemoinformatics, shedding light on their milestones and inherent limitations. Additionally, a detailed examination of a representative case study provides insights into the prevailing issues related to data availability and transferability in the discipline.
Collapse
Affiliation(s)
- Varvara Voinarovska
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
- TUM
Graduate School, Faculty of Chemistry, Technical
University of Munich, 85748 Garching, Germany
| | - Mikhail Kabeshov
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
| | - Dmytro Dudenko
- Enamine
Ltd., 78 Chervonotkatska str., 02094 Kyiv, Ukraine
| | - Samuel Genheden
- Molecular
AI, Discovery Sciences R&D, AstraZeneca, 431 83 Gothenburg, Sweden
| | - Igor V. Tetko
- Molecular
Targets and Therapeutics Center, Helmholtz Munich − Deutsches
Forschungszentrum für Gesundheit und Umwelt (GmbH), Institute of Structural Biology, 85764 Neuherberg, Germany
| |
Collapse
|
26
|
Schmid SY, Lachowski K, Chiang HT, Pozzo L, De Yoreo J, Zhang S. Mechanisms of Biomolecular Self-Assembly Investigated Through In Situ Observations of Structures and Dynamics. Angew Chem Int Ed Engl 2023; 62:e202309725. [PMID: 37702227 DOI: 10.1002/anie.202309725] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2023] [Indexed: 09/14/2023]
Abstract
Biomolecular self-assembly of hierarchical materials is a precise and adaptable bottom-up approach to synthesizing across scales with considerable energy, health, environment, sustainability, and information technology applications. To achieve desired functions in biomaterials, it is essential to directly observe assembly dynamics and structural evolutions that reflect the underlying energy landscape and the assembly mechanism. This review will summarize the current understanding of biomolecular assembly mechanisms based on in situ characterization and discuss the broader significance and achievements of newly gained insights. In addition, we will also introduce how emerging deep learning/machine learning-based approaches, multiparametric characterization, and high-throughput methods can boost the development of biomolecular self-assembly. The objective of this review is to accelerate the development of in situ characterization approaches for biomolecular self-assembly and to inspire the next generation of biomimetic materials.
Collapse
Affiliation(s)
- Sakshi Yadav Schmid
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
| | - Kacper Lachowski
- Chemical Engineering, University of Washington, Seattle, WA 98105, USA
- Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA 98105, USA
| | - Huat Thart Chiang
- Chemical Engineering, University of Washington, Seattle, WA 98105, USA
| | - Lilo Pozzo
- Chemical Engineering, University of Washington, Seattle, WA 98105, USA
- Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA 98105, USA
- Materials Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Jim De Yoreo
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
- Materials Science and Engineering, University of Washington, Seattle, WA 98105, USA
| | - Shuai Zhang
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, WA 99354, USA
- Molecular Engineering and Sciences Institute, University of Washington, Seattle, WA 98105, USA
- Materials Science and Engineering, University of Washington, Seattle, WA 98105, USA
| |
Collapse
|
27
|
Walker M, Pizzuto G, Fakhruldeen H, Cooper AI. Go with the flow: deep learning methods for autonomous viscosity estimations. DIGITAL DISCOVERY 2023; 2:1540-1547. [PMID: 38013903 PMCID: PMC10561544 DOI: 10.1039/d3dd00109a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Accepted: 09/04/2023] [Indexed: 11/29/2023]
Abstract
Closed-loop experiments can accelerate material discovery by automating both experimental manipulations and decisions that have traditionally been made by researchers. Fast and non-invasive measurements are particularly attractive for closed-loop strategies. Viscosity is a physical property for fluids that is important in many applications. It is fundamental in application areas such as coatings; also, even if viscosity is not the key property of interest, it can impact our ability to do closed-loop experimentation. For example, unexpected increases in viscosity can cause liquid-handling robots to fail. Traditional viscosity measurements are manual, invasive, and slow. Here we use convolutional neural networks (CNNs) as an alternative to traditional viscometry by non-invasively extracting the spatiotemporal features of fluid motion under flow. To do this, we built a workflow using a dual-armed collaborative robot that collects video data of fluid motion autonomously. This dataset was then used to train a 3-dimensional convolutional neural network (3D-CNN) for viscosity estimation, either by classification or by regression. We also used these models to identify unknown laboratory solvents, again based on differences in fluid motion. The 3D-CNN model performance was compared with the performance of a panel of human participants for the same classification tasks. Our models strongly outperformed human classification in both cases. For example, even with training on fewer than 50 videos for each liquid, the 3D-CNN model gave an average accuracy of 88% for predicting the identity of five different laboratory solvents, compared to an average accuracy of 32% for human observation. For comparison, random category selection would give an average accuracy of 20%. Our method offers an alternative to traditional viscosity measurements for autonomous chemistry workflows that might be used both for process control (e.g., choosing not to pipette liquids that are too viscous) or for materials discovery (e.g., identifying new polymerization catalysts on the basis of viscosification).
Collapse
Affiliation(s)
- Michael Walker
- Department of Chemistry, University of Liverpool L69 3BX UK
| | | | | | | |
Collapse
|
28
|
Tachibana R, Zhang K, Zou Z, Burgener S, Ward TR. A Customized Bayesian Algorithm to Optimize Enzyme-Catalyzed Reactions. ACS SUSTAINABLE CHEMISTRY & ENGINEERING 2023; 11:12336-12344. [PMID: 37621696 PMCID: PMC10445256 DOI: 10.1021/acssuschemeng.3c02402] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Revised: 07/21/2023] [Indexed: 08/26/2023]
Abstract
Design of experiments (DoE) plays an important role in optimizing the catalytic performance of chemical reactions. The most commonly used DoE relies on the response surface methodology (RSM) to model the variable space of experimental conditions with the fewest number of experiments. However, the RSM leads to an exponential increase in the number of required experiments as the number of variables increases. Herein we describe a Bayesian optimization algorithm (BOA) to optimize the continuous parameters (e.g., temperature, reaction time, reactant and enzyme concentrations, etc.) of enzyme-catalyzed reactions with the aim of maximizing performance. Compared to existing Bayesian optimization methods, we propose an improved algorithm that leads to better results under limited resources and time for experiments. To validate the versatility of the BOA, we benchmarked its performance with biocatalytic C-C bond formation and amination for the optimization of the turnover number. Gratifyingly, up to 80% improvement compared to RSM and up to 360% improvement vs previous Bayesian optimization algorithms were obtained. Importantly, this strategy enabled simultaneous optimization of both the enzyme's activity and selectivity for cross-benzoin condensation.
Collapse
Affiliation(s)
- Ryo Tachibana
- Department
of Chemistry, University of Basel, Mattenstrasse 24a, BPR 1096, CH-4058, Basel, Switzerland
- National
Center of Competence in Research (NCCR) “Catalysis”,
ETHZ, 8093 Zurich, Switzerland
| | - Kailin Zhang
- Department
of Chemistry, University of Basel, Mattenstrasse 24a, BPR 1096, CH-4058, Basel, Switzerland
| | - Zhi Zou
- Department
of Chemistry, University of Basel, Mattenstrasse 24a, BPR 1096, CH-4058, Basel, Switzerland
| | - Simon Burgener
- Department
of Chemistry, University of Basel, Mattenstrasse 24a, BPR 1096, CH-4058, Basel, Switzerland
| | - Thomas R. Ward
- Department
of Chemistry, University of Basel, Mattenstrasse 24a, BPR 1096, CH-4058, Basel, Switzerland
- National
Center of Competence in Research (NCCR) “Molecular Systems
Engineering”, 4058 Basel, Switzerland
- National
Center of Competence in Research (NCCR) “Catalysis”,
ETHZ, 8093 Zurich, Switzerland
| |
Collapse
|
29
|
Dunlap JH, Ethier JG, Putnam-Neeb AA, Iyer S, Luo SXL, Feng H, Garrido Torres JA, Doyle AG, Swager TM, Vaia RA, Mirau P, Crouse CA, Baldwin LA. Continuous flow synthesis of pyridinium salts accelerated by multi-objective Bayesian optimization with active learning. Chem Sci 2023; 14:8061-8069. [PMID: 37538827 PMCID: PMC10395269 DOI: 10.1039/d3sc01303k] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2023] [Accepted: 06/19/2023] [Indexed: 08/05/2023] Open
Abstract
We report a human-in-the-loop implementation of the multi-objective experimental design via a Bayesian optimization platform (EDBO+) towards the optimization of butylpyridinium bromide synthesis under continuous flow conditions. The algorithm simultaneously optimized reaction yield and production rate (or space-time yield) and generated a well defined Pareto front. The versatility of EDBO+ was demonstrated by expanding the reaction space mid-campaign by increasing the upper temperature limit. Incorporation of continuous flow techniques enabled improved control over reaction parameters compared to common batch chemistry processes, while providing a route towards future automated syntheses and improved scalability. To that end, we applied the open-source Python module, nmrglue, for semi-automated nuclear magnetic resonance (NMR) spectroscopy analysis, and compared the acquired outputs against those obtained through manual processing methods from spectra collected on both low-field (60 MHz) and high-field (400 MHz) NMR spectrometers. The EDBO+ based model was retrained with these four different datasets and the resulting Pareto front predictions provided insight into the effect of data analysis on model predictions. Finally, quaternization of poly(4-vinylpyridine) with bromobutane illustrated the extension of continuous flow chemistry to synthesize functional materials.
Collapse
Affiliation(s)
- John H Dunlap
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
- UES, Inc. Dayton OH 45431 USA
| | - Jeffrey G Ethier
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
- UES, Inc. Dayton OH 45431 USA
| | - Amelia A Putnam-Neeb
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
- National Research Council Research Associate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
| | - Sanjay Iyer
- Department of Chemistry, Purdue University West Lafayette IN 47907 USA
| | - Shao-Xiong Lennon Luo
- Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Haosheng Feng
- Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | | | - Abigail G Doyle
- Department of Chemistry and Biochemistry, University of California Los Angeles CA 90095 USA
| | - Timothy M Swager
- Department of Chemistry, Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Richard A Vaia
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
| | - Peter Mirau
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
| | - Christopher A Crouse
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
| | - Luke A Baldwin
- Materials and Manufacturing Directorate, Air Force Research Laboratory Wright-Patterson AFB OH 45433 USA
| |
Collapse
|
30
|
Wang T, You J, Gong X, Yang S, Wang L, Chang Z. Probabilistic Bayesian Deep Learning Approach for Online Forecasting of Fed-Batch Fermentation. ACS OMEGA 2023; 8:25272-25278. [PMID: 37483241 PMCID: PMC10357427 DOI: 10.1021/acsomega.3c02387] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/09/2023] [Accepted: 06/22/2023] [Indexed: 07/25/2023]
Abstract
The microbial fermentation process often involves various biological metabolic reactions and chemical processes. The mixed bacterial culture process of 2-keto-l-gulonic acid has strong nonlinear and time-varying characteristics. In this study, a probabilistic Bayesian deep learning approach is proposed to obtain a highly accurate and robust prediction of product formation. The Bayesian optimized deep neural network (BODNN) is utilized as basic model for prediction, the structural parameters of which are optimized. Then, the training datasets are classified into different categories according to the prior evaluation of prediction error. The final forecasting is a weighted combination of BODNN models based on the Bayesian hybrid method. The weights can be interpreted as Bayesian posterior probabilities and are computed recursively. The validation of 95 industrial batches is carried out, and the average root mean square errors are 1.51 and 2.01% for 4 and 8 h ahead prediction, respectively. The results illustrate that the proposed approach can capture the dynamics of fermentation batches and is suitable for online process monitoring.
Collapse
Affiliation(s)
- Tao Wang
- School
of Computer Science and Technology, Shandong
University of Technology, Zibo 255000, China
| | - Jiebing You
- Department
of Neurology, Zibo Central Hospital, Zibo, Shandong 255036, China
| | - Xiugang Gong
- School
of Computer Science and Technology, Shandong
University of Technology, Zibo 255000, China
| | - Shanliang Yang
- School
of Computer Science and Technology, Shandong
University of Technology, Zibo 255000, China
| | - Lei Wang
- School
of Computer Science and Technology, Shandong
University of Technology, Zibo 255000, China
| | - Zheng Chang
- School
of Computer Science and Technology, Shandong
University of Technology, Zibo 255000, China
| |
Collapse
|
31
|
Mahjour B, Zhang R, Shen Y, McGrath A, Zhao R, Mohamed OG, Lin Y, Zhang Z, Douthwaite JL, Tripathi A, Cernak T. Rapid planning and analysis of high-throughput experiment arrays for reaction discovery. Nat Commun 2023; 14:3924. [PMID: 37400469 DOI: 10.1038/s41467-023-39531-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Accepted: 06/13/2023] [Indexed: 07/05/2023] Open
Abstract
High-throughput experimentation (HTE) is an increasingly important tool in reaction discovery. While the hardware for running HTE in the chemical laboratory has evolved significantly in recent years, there remains a need for software solutions to navigate data-rich experiments. Here we have developed phactor™, a software that facilitates the performance and analysis of HTE in a chemical laboratory. phactor™ allows experimentalists to rapidly design arrays of chemical reactions or direct-to-biology experiments in 24, 96, 384, or 1,536 wellplates. Users can access online reagent data, such as a chemical inventory, to virtually populate wells with experiments and produce instructions to perform the reaction array manually, or with the assistance of a liquid handling robot. After completion of the reaction array, analytical results can be uploaded for facile evaluation, and to guide the next series of experiments. All chemical data, metadata, and results are stored in machine-readable formats that are readily translatable to various software. We also demonstrate the use of phactor™ in the discovery of several chemistries, including the identification of a low micromolar inhibitor of the SARS-CoV-2 main protease. Furthermore, phactor™ has been made available for free academic use in 24- and 96-well formats via an online interface.
Collapse
Affiliation(s)
- Babak Mahjour
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Rui Zhang
- Department of Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Yuning Shen
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Andrew McGrath
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Ruheng Zhao
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Osama G Mohamed
- Natural Products Discovery Core, Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Yingfu Lin
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Zirong Zhang
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - James L Douthwaite
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Ashootosh Tripathi
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA
- Natural Products Discovery Core, Life Sciences Institute, University of Michigan, Ann Arbor, MI, USA
| | - Tim Cernak
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA.
- Department of Chemistry, University of Michigan, Ann Arbor, MI, USA.
| |
Collapse
|
32
|
Shim E, Tewari A, Cernak T, Zimmerman PM. Machine Learning Strategies for Reaction Development: Toward the Low-Data Limit. J Chem Inf Model 2023; 63:3659-3668. [PMID: 37312524 PMCID: PMC11163943 DOI: 10.1021/acs.jcim.3c00577] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Machine learning models are increasingly being utilized to predict outcomes of organic chemical reactions. A large amount of reaction data is used to train these models, which is in stark contrast to how expert chemists discover and develop new reactions by leveraging information from a small number of relevant transformations. Transfer learning and active learning are two strategies that can operate in low-data situations, which may help fill this gap and promote the use of machine learning for tackling real-world challenges in organic synthesis. This Perspective introduces active and transfer learning and connects these to potential opportunities and directions for further research, especially in the area of prospective development of chemical transformations.
Collapse
Affiliation(s)
- Eunjae Shim
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Ambuj Tewari
- Department of Statistics, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Electrical Engineering and Computer Science, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Tim Cernak
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| | - Paul M Zimmerman
- Department of Chemistry, University of Michigan, Ann Arbor, Michigan 48109, United States
| |
Collapse
|
33
|
Faurschou NV, Taaning RH, Pedersen CM. Substrate specific closed-loop optimization of carbohydrate protective group chemistry using Bayesian optimization and transfer learning. Chem Sci 2023; 14:6319-6329. [PMID: 37325141 PMCID: PMC10266441 DOI: 10.1039/d3sc01261a] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2023] [Accepted: 05/12/2023] [Indexed: 06/17/2023] Open
Abstract
A new way of performing reaction optimization within carbohydrate chemistry is presented. This is done by performing closed-loop optimization of regioselective benzoylation of unprotected glycosides using Bayesian optimization. Both 6-O-monobenzoylations and 3,6-O-dibenzoylations of three different monosaccharides are optimized. A novel transfer learning approach, where data from previous optimizations of different substrates is used to speed up the optimizations, has also been developed. The optimal conditions found by the Bayesian optimization algorithm provide new insight into substrate specificity, as the conditions found are significantly different. In most cases, the optimal conditions include Et3N and benzoic anhydride, a new reagent combination for these reactions, discovered by the algorithm, demonstrating the power of this concept to widen the chemical space. Further, the developed procedures include ambient conditions and short reaction times.
Collapse
|
34
|
Sanders LM, Scott RT, Yang JH, Qutub AA, Garcia Martin H, Berrios DC, Hastings JJA, Rask J, Mackintosh G, Hoarfrost AL, Chalk S, Kalantari J, Khezeli K, Antonsen EL, Babdor J, Barker R, Baranzini SE, Beheshti A, Delgado-Aparicio GM, Glicksberg BS, Greene CS, Haendel M, Hamid AA, Heller P, Jamieson D, Jarvis KJ, Komarova SV, Komorowski M, Kothiyal P, Mahabal A, Manor U, Mason CE, Matar M, Mias GI, Miller J, Myers JG, Nelson C, Oribello J, Park SM, Parsons-Wingerter P, Prabhu RK, Reynolds RJ, Saravia-Butler A, Saria S, Sawyer A, Singh NK, Snyder M, Soboczenski F, Soman K, Theriot CA, Van Valen D, Venkateswaran K, Warren L, Worthey L, Zitnik M, Costes SV. Biological research and self-driving labs in deep space supported by artificial intelligence. NAT MACH INTELL 2023. [DOI: 10.1038/s42256-023-00618-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/28/2023]
|
35
|
Taylor CJ, Pomberger A, Felton KC, Grainger R, Barecka M, Chamberlain TW, Bourne RA, Johnson CN, Lapkin AA. A Brief Introduction to Chemical Reaction Optimization. Chem Rev 2023; 123:3089-3126. [PMID: 36820880 PMCID: PMC10037254 DOI: 10.1021/acs.chemrev.2c00798] [Citation(s) in RCA: 76] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2022] [Indexed: 02/24/2023]
Abstract
From the start of a synthetic chemist's training, experiments are conducted based on recipes from textbooks and manuscripts that achieve clean reaction outcomes, allowing the scientist to develop practical skills and some chemical intuition. This procedure is often kept long into a researcher's career, as new recipes are developed based on similar reaction protocols, and intuition-guided deviations are conducted through learning from failed experiments. However, when attempting to understand chemical systems of interest, it has been shown that model-based, algorithm-based, and miniaturized high-throughput techniques outperform human chemical intuition and achieve reaction optimization in a much more time- and material-efficient manner; this is covered in detail in this paper. As many synthetic chemists are not exposed to these techniques in undergraduate teaching, this leads to a disproportionate number of scientists that wish to optimize their reactions but are unable to use these methodologies or are simply unaware of their existence. This review highlights the basics, and the cutting-edge, of modern chemical reaction optimization as well as its relation to process scale-up and can thereby serve as a reference for inspired scientists for each of these techniques, detailing several of their respective applications.
Collapse
Affiliation(s)
- Connor J. Taylor
- Astex
Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, U.K.
- Innovation
Centre in Digital Molecular Technologies, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Alexander Pomberger
- Innovation
Centre in Digital Molecular Technologies, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| | - Kobi C. Felton
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Rachel Grainger
- Astex
Pharmaceuticals, 436 Cambridge Science Park, Milton Road, Cambridge CB4 0QA, U.K.
| | - Magda Barecka
- Chemical
Engineering Department, Northeastern University, 360 Huntington Avenue, Boston, Massachusetts 02115, United States
- Chemistry
and Chemical Biology Department, Northeastern
University, 360 Huntington Avenue, Boston, Massachusetts 02115, United States
- Cambridge
Centre for Advanced Research and Education in Singapore, 1 Create Way, 138602 Singapore
| | - Thomas W. Chamberlain
- Institute
of Process Research and Development, School of Chemistry and School
of Chemical and Process Engineering, University
of Leeds, Leeds LS2 9JT, U.K.
| | - Richard A. Bourne
- Institute
of Process Research and Development, School of Chemistry and School
of Chemical and Process Engineering, University
of Leeds, Leeds LS2 9JT, U.K.
| | | | - Alexei A. Lapkin
- Innovation
Centre in Digital Molecular Technologies, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K.
| |
Collapse
|
36
|
Wang N, Zhang Y, Wang W, Ye Z, Chen H, Hu G, Ouyang D. How can machine learning and multiscale modeling benefit ocular drug development? Adv Drug Deliv Rev 2023; 196:114772. [PMID: 36906232 DOI: 10.1016/j.addr.2023.114772] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 02/06/2023] [Accepted: 03/05/2023] [Indexed: 03/12/2023]
Abstract
The eyes possess sophisticated physiological structures, diverse disease targets, limited drug delivery space, distinctive barriers, and complicated biomechanical processes, requiring a more in-depth understanding of the interactions between drug delivery systems and biological systems for ocular formulation development. However, the tiny size of the eyes makes sampling difficult and invasive studies costly and ethically constrained. Developing ocular formulations following conventional trial-and-error formulation and manufacturing process screening procedures is inefficient. Along with the popularity of computational pharmaceutics, non-invasive in silico modeling & simulation offer new opportunities for the paradigm shift of ocular formulation development. The current work first systematically reviews the theoretical underpinnings, advanced applications, and unique advantages of data-driven machine learning and multiscale simulation approaches represented by molecular simulation, mathematical modeling, and pharmacokinetic (PK)/pharmacodynamic (PD) modeling for ocular drug development. Following this, a new computer-driven framework for rational pharmaceutical formulation design is proposed, inspired by the potential of in silico explorations in understanding drug delivery details and facilitating drug formulation design. Lastly, to promote the paradigm shift, integrated in silico methodologies were highlighted, and discussions on data challenges, model practicality, personalized modeling, regulatory science, interdisciplinary collaboration, and talent training were conducted in detail with a view to achieving more efficient objective-oriented pharmaceutical formulation design.
Collapse
Affiliation(s)
- Nannan Wang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China
| | - Yunsen Zhang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China
| | - Wei Wang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China
| | - Zhuyifan Ye
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China
| | - Hongyu Chen
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China; Faculty of Science and Technology (FST), University of Macau, Macau, China
| | - Guanghui Hu
- Faculty of Science and Technology (FST), University of Macau, Macau, China
| | - Defang Ouyang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, China; Department of Public Health and Medicinal Administration, Faculty of Health Sciences (FHS), University of Macau, Macau, China.
| |
Collapse
|
37
|
Haas C, Lübbesmeyer M, Jin EH, McDonald MA, Koscher BA, Guimond N, Di Rocco L, Kayser H, Leweke S, Niedenführ S, Nicholls R, Greeves E, Barber DM, Hillenbrand J, Volpin G, Jensen KF. Open-Source Chromatographic Data Analysis for Reaction Optimization and Screening. ACS CENTRAL SCIENCE 2023; 9:307-317. [PMID: 36844498 PMCID: PMC9951288 DOI: 10.1021/acscentsci.2c01042] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/05/2022] [Indexed: 06/18/2023]
Abstract
Automation and digitalization solutions in the field of small molecule synthesis face new challenges for chemical reaction analysis, especially in the field of high-performance liquid chromatography (HPLC). Chromatographic data remains locked in vendors' hardware and software components, limiting their potential in automated workflows and data science applications. In this work, we present an open-source Python project called MOCCA for the analysis of HPLC-DAD (photodiode array detector) raw data. MOCCA provides a comprehensive set of data analysis features, including an automated peak deconvolution routine of known signals, even if overlapped with signals of unexpected impurities or side products. We highlight the broad applicability of MOCCA in four studies: (i) a simulation study to validate MOCCA's data analysis features; (ii) a reaction kinetics study on a Knoevenagel condensation reaction demonstrating MOCCA's peak deconvolution feature; (iii) a closed-loop optimization study for the alkylation of 2-pyridone without human control during data analysis; (iv) a well plate screening of categorical reaction parameters for a novel palladium-catalyzed cyanation of aryl halides employing O-protected cyanohydrins. By publishing MOCCA as a Python package with this work, we envision an open-source community project for chromatographic data analysis with the potential of further advancing its scope and capabilities.
Collapse
Affiliation(s)
- Christian
P. Haas
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Maximilian Lübbesmeyer
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Edward H. Jin
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Matthew A. McDonald
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Brent A. Koscher
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| | - Nicolas Guimond
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Laura Di Rocco
- Chemical
& Pharmaceutical Development, Bayer
AG, Pharmaceuticals Division, Müllerstraße 178, 13353 Berlin, Germany
| | - Henning Kayser
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Samuel Leweke
- Applied
Mathematics, Bayer AG, Enabling Functions
Division, Kaiser-Wilhelm-Allee
1, 51368 Leverkusen, Germany
| | - Sebastian Niedenführ
- Research
and Development, Computational Life Science, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Rachel Nicholls
- Research
and Development, Computational Life Science, Bayer AG, Crop Science Division, Alfred-Nobel-Straße 50, 40789 Monheim am Rhein, Germany
| | - Emily Greeves
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - David M. Barber
- Research
and Development, Weed Control Chemistry, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Julius Hillenbrand
- Chemical
& Pharmaceutical Development, Bayer
AG, Pharmaceuticals Division, Friedrich-Ebert-Straße 475, 42117 Wuppertal, Germany
| | - Giulio Volpin
- Research
and Development, Small Molecules Technologies, Bayer AG, Crop Science Division, Industriepark Höchst, 65926 Frankfurt am Main, Germany
| | - Klavs F. Jensen
- Department
of Chemical Engineering, Massachusetts Institute
of Technology, 77 Massachusetts Avenue, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
38
|
Ruan Y, Lin S, Mo Y. AROPS: A Framework of Automated Reaction Optimization with Parallelized Scheduling. J Chem Inf Model 2023; 63:770-781. [PMID: 36653913 DOI: 10.1021/acs.jcim.2c01168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Abstract
With the development of automated experimental platforms and optimization algorithms, chemists can easily optimize chemical reactions in an automated and high-throughput fashion. However, the modules in existing automated experimental platforms are operated in a linear fashion without orchestrating with the optimization algorithm, thus leaving room for further efficiency improvement. Here, we introduced a framework of automated reaction optimization with parallelized scheduling (AROPS) to realize the integration of the optimization algorithm and module scheduling. AROPS relies on a customized Bayesian optimizer to solve multi-reactor/analyzer reaction optimization problems with three different scheduling modes to arrange tasks for various experimental modules. In addition, a mechanism based on probability of improvement (PI) for discarding unpromising ongoing experiments was developed to facilitate freeing up valuable experimental resources in parallelized optimization. We tested the performance of AROPS using a hardware emulator on three representative benchmark reactions encountered in organic synthesis, illustrating that AROPS can trade off optimization time and cost according to the chemists' preference.
Collapse
Affiliation(s)
- Yixiang Ruan
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou310027, China.,ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou311215, China
| | - Sen Lin
- Shanghai ChemLex Technology Co., Ltd., Shanghai201210, China
| | - Yiming Mo
- College of Chemical and Biological Engineering, Zhejiang University, Hangzhou310027, China.,ZJU-Hangzhou Global Scientific and Technological Innovation Center, Zhejiang University, Hangzhou311215, China
| |
Collapse
|
39
|
Martin HG, Radivojevic T, Zucker J, Bouchard K, Sustarich J, Peisert S, Arnold D, Hillson N, Babnigg G, Marti JM, Mungall CJ, Beckham GT, Waldburger L, Carothers J, Sundaram S, Agarwal D, Simmons BA, Backman T, Banerjee D, Tanjore D, Ramakrishnan L, Singh A. Perspectives for self-driving labs in synthetic biology. Curr Opin Biotechnol 2023; 79:102881. [PMID: 36603501 DOI: 10.1016/j.copbio.2022.102881] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 11/23/2022] [Accepted: 12/07/2022] [Indexed: 01/04/2023]
Abstract
Self-driving labs (SDLs) combine fully automated experiments with artificial intelligence (AI) that decides the next set of experiments. Taken to their ultimate expression, SDLs could usher a new paradigm of scientific research, where the world is probed, interpreted, and explained by machines for human benefit. While there are functioning SDLs in the fields of chemistry and materials science, we contend that synthetic biology provides a unique opportunity since the genome provides a single target for affecting the incredibly wide repertoire of biological cell behavior. However, the level of investment required for the creation of biological SDLs is only warranted if directed toward solving difficult and enabling biological questions. Here, we discuss challenges and opportunities in creating SDLs for synthetic biology.
Collapse
Affiliation(s)
- Hector G Martin
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States; BCAM, Basque Center for Applied Mathematics, Bilbao, Spain.
| | - Tijana Radivojevic
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States
| | - Jeremy Zucker
- Earth and Biological Sciences Division, Pacific Northwest National Laboratories, Richland, WA, United States
| | - Kristofer Bouchard
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA, United States; Helen Wills Neuroscience Institute and Redwood Center for Theoretical Neuroscience, Berkeley, CA, United States
| | - Jess Sustarich
- Joint BioEnergy Institute, Emeryville, CA, United States; Biomaterials and Biomanufacturing Division, Sandia National Laboratories, Livermore, CA, United States
| | - Sean Peisert
- Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA, United States; University of California, Davis, Department of Computer Science, Davis, CA, United States
| | - Dan Arnold
- Lawrence Berkeley National Laboratory, Energy Storage and Distributed Resources Division, Berkeley, CA, United States
| | - Nathan Hillson
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States
| | - Gyorgy Babnigg
- Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Biosciences Division, Argonne National Laboratory, Argonne, IL, United States
| | - Jose M Marti
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States; Global Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Christopher J Mungall
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States
| | - Gregg T Beckham
- Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Resources and Enabling Sciences Center, National Renewable Energy Laboratory, Golden, CO 80401, United States
| | - Lucas Waldburger
- Department of Bioengineering, University of California, Berkeley, CA, United States
| | - James Carothers
- Department of Chemical Engineering, Molecular Engineering & Sciences Institute and Center for Synthetic Biology, University of Washington, Seattle, WA, United States
| | - ShivShankar Sundaram
- Engineering Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States; Center for Bioengineering, Lawrence Livermore National Laboratory, Livermore, CA, United States
| | - Deb Agarwal
- Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA, United States
| | - Blake A Simmons
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States
| | - Tyler Backman
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States
| | - Deepanwita Banerjee
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Joint BioEnergy Institute, Emeryville, CA, United States
| | - Deepti Tanjore
- Lawrence Berkeley National Laboratory, Biological Systems and Engineering Division, Berkeley, CA, United States; Department of Energy, Agile BioFoundry, Emeryville, CA, United States; Advanced Biofuels and Bioproducts Process Development Unit, Lawrence Berkeley National Laboratory, Berkeley, CA, United States
| | - Lavanya Ramakrishnan
- Lawrence Berkeley National Laboratory, Scientific Data Division, Berkeley, CA, United States
| | - Anup Singh
- Joint BioEnergy Institute, Emeryville, CA, United States; Engineering Directorate, Lawrence Livermore National Laboratory, Livermore, CA, United States
| |
Collapse
|
40
|
Wu TC, Aguilar-Granda A, Hotta K, Yazdani SA, Pollice R, Vestfrid J, Hao H, Lavigne C, Seifrid M, Angello N, Bencheikh F, Hein JE, Burke M, Adachi C, Aspuru-Guzik A. A Materials Acceleration Platform for Organic Laser Discovery. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2023; 35:e2207070. [PMID: 36373553 DOI: 10.1002/adma.202207070] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 10/16/2022] [Indexed: 06/16/2023]
Abstract
Conventional materials discovery is a laborious and time-consuming process that can take decades from initial conception of the material to commercialization. Recent developments in materials acceleration platforms promise to accelerate materials discovery using automation of experiments coupled with machine learning. However, most of the automation efforts in chemistry focus on synthesis and compound identification, with integrated target property characterization receiving less attention. In this work, an automated platform is introduced for the discovery of molecules as gain mediums for organic semiconductor lasers, a problem that has been challenging for conventional approaches. This platform encompasses automated lego-like synthesis, product identification, and optical characterization that can be executed in a fully integrated end-to-end fashion. Using this workflow to screen organic laser candidates, discovered eight potential candidates for organic lasers is discovered. The lasing threshold of four molecules in thin-film devices and find two molecules with state-of-the-art performance is tested. These promising results show the potential of automated synthesis and screening for accelerated materials development.
Collapse
Affiliation(s)
- Tony C Wu
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Andrés Aguilar-Granda
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Kazuhiro Hotta
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Sahar Alasvand Yazdani
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan
| | - Robert Pollice
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Jenya Vestfrid
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Han Hao
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Cyrille Lavigne
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Martin Seifrid
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Nicholas Angello
- Department of Chemistry, University of Illinois at Urbana-Champaign, 505 S Mathews Ave, Urbana, IL, 61801, USA
| | - Fatima Bencheikh
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan
| | - Jason E Hein
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, BC, V6T 1Z1, Canada
| | - Martin Burke
- Department of Chemistry, University of Illinois at Urbana-Champaign, 505 S Mathews Ave, Urbana, IL, 61801, USA
| | - Chihaya Adachi
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, 744 Motooka, Nishi-ku, Fukuoka, 819-0395, Japan
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| |
Collapse
|
41
|
A Review on Artificial Intelligence Enabled Design, Synthesis, and Process Optimization of Chemical Products for Industry 4.0. Processes (Basel) 2023. [DOI: 10.3390/pr11020330] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023] Open
Abstract
With the development of Industry 4.0, artificial intelligence (AI) is gaining increasing attention for its performance in solving particularly complex problems in industrial chemistry and chemical engineering. Therefore, this review provides an overview of the application of AI techniques, in particular machine learning, in chemical design, synthesis, and process optimization over the past years. In this review, the focus is on the application of AI for structure-function relationship analysis, synthetic route planning, and automated synthesis. Finally, we discuss the challenges and future of AI in making chemical products.
Collapse
|
42
|
Firsan S, Sivakumar V, Colacot TJ. Emerging Trends in Cross-Coupling: Twelve-Electron-Based L 1Pd(0) Catalysts, Their Mechanism of Action, and Selected Applications. Chem Rev 2022; 122:16983-17027. [PMID: 36190916 PMCID: PMC9756297 DOI: 10.1021/acs.chemrev.2c00204] [Citation(s) in RCA: 44] [Impact Index Per Article: 14.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2022] [Indexed: 01/25/2023]
Abstract
Monoligated palladium(0) species, L1Pd(0), have emerged as the most active catalytic species in the cross-coupling cycle. Today, there are methods available to generate the highly active but unstable L1Pd(0) catalysts from stable precatalysts. While the size of the ligand plays an important role in the formation of L1Pd(0) during in situ catalysis, the latter can be precisely generated from the precatalyst by various technologies. Computational, kinetic, and experimental studies indicate that all three steps in the catalytic cycle─oxidative addition, transmetalation, and reductive elimination─contain monoligated Pd. The synthesis of precatalysts, their mode of activation, application studies in model systems, as well as in industry are discussed. Ligand parametrization and AI based data science can potentially help predict the facile formation of L1Pd(0) species.
Collapse
Affiliation(s)
- Sharbil
J. Firsan
- Science
and Lab Solutions−Chemistry, MilliporeSigma, 6000 North Teutonia Avenue, Milwaukee, Wisconsin53209, United States
| | - Vilvanathan Sivakumar
- Merck
Life Science Pvt Ltd, No-12, Bommasandra-Jigani Link Road, Industrial Area, Bangalore560100, India
| | - Thomas J. Colacot
- Science
and Lab Solutions−Chemistry, MilliporeSigma, 6000 North Teutonia Avenue, Milwaukee, Wisconsin53209, United States
| |
Collapse
|
43
|
Domingues NP, Moosavi SM, Talirz L, Jablonka KM, Ireland CP, Ebrahim FM, Smit B. Using genetic algorithms to systematically improve the synthesis conditions of Al-PMOF. Commun Chem 2022; 5:170. [PMID: 36697847 PMCID: PMC9814730 DOI: 10.1038/s42004-022-00785-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2022] [Accepted: 11/22/2022] [Indexed: 12/14/2022] Open
Abstract
The synthesis of metal-organic frameworks (MOFs) is often complex and the desired structure is not always obtained. In this work, we report a methodology that uses a joint machine learning and experimental approach to optimize the synthesis conditions of Al-PMOF (Al2(OH)2TCPP) [H2TCPP = meso-tetra(4-carboxyphenyl)porphine], a promising material for carbon capture applications. Al-PMOF was previously synthesized using a hydrothermal reaction, which gave a low throughput yield due to its relatively long reaction time (16 hours). Here, we use a genetic algorithm to carry out a systematic search for the optimal synthesis conditions and a microwave-based high-throughput robotic platform for the syntheses. We show that, in just two generations, we could obtain excellent crystallinity and yield close to 80% in a much shorter reaction time (50 minutes). Moreover, by analyzing the failed and partially successful experiments, we could identify the most important experimental variables that determine the crystallinity and yield.
Collapse
Affiliation(s)
- Nency P Domingues
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
| | - Seyed Mohamad Moosavi
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
- Department of Mathematics and Computer Science, Freie Universität Berlin, Berlin, Germany
| | - Leopold Talirz
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
- Theory and Simulation of Materials (THEOS), School of Engineering (STI), École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Vaud, Switzerland
| | - Kevin Maik Jablonka
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
| | - Christopher P Ireland
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
| | - Fatmah Mish Ebrahim
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland
- Cavendish Laboratory, School of Physical Sciences, University of Cambridge, Cambridge, UK
| | - Berend Smit
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, École Polytechnique Fédérale de Lausanne (EPFL), Sion, Valais, Switzerland.
| |
Collapse
|
44
|
Shirasawa R, Takemura I, Hattori S, Nagata Y. A semi-automated material exploration scheme to predict the solubilities of tetraphenylporphyrin derivatives. Commun Chem 2022; 5:158. [PMID: 36697881 PMCID: PMC9814751 DOI: 10.1038/s42004-022-00770-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2022] [Accepted: 11/04/2022] [Indexed: 11/24/2022] Open
Abstract
Acceleration of material discovery has been tackled by informatics and laboratory automation. Here we show a semi-automated material exploration scheme to modelize the solubility of tetraphenylporphyrin derivatives. The scheme involved the following steps: definition of a practical chemical search space, prioritization of molecules in the space using an extended algorithm for submodular function maximization without requiring biased variable selection or pre-existing data, synthesis & automated measurement, and machine-learning model estimation. The optimal evaluation order selected using the algorithm covered several similar molecules (32% of all targeted molecules, whereas that obtained by random sampling and uncertainty sampling was ~7% and ~4%, respectively) with a small number of evaluations (10 molecules: 0.13% of all targeted molecules). The derived binary classification models predicted 'good solvents' with an accuracy >0.8. Overall, we confirmed the effectivity of the proposed semi-automated scheme in early-stage material search projects for accelerating a wider range of material research.
Collapse
Affiliation(s)
- Raku Shirasawa
- Advanced Research Laboratory, R&D Center, Sony Group Corporation, Atsugi Tec. 4-14-1 Asahi-cho, Atsugi-shi, Kanagawa, 243-0014, Japan.
| | - Ichiro Takemura
- Tokyo Laboratory 26, R&D Center, Sony Group Corporation, Atsugi Tec. 4-14-1 Asahi-cho, Atsugi-shi, Kanagawa, 243-0014, Japan
| | - Shinnosuke Hattori
- Advanced Research Laboratory, R&D Center, Sony Group Corporation, Atsugi Tec. 4-14-1 Asahi-cho, Atsugi-shi, Kanagawa, 243-0014, Japan
| | - Yuuya Nagata
- Institute for Chemical Reaction Design and Discovery, Hokkaido University, Kita 21 Nishi 10, Kita-ku, Sapporo, Hokkaido, 001-0021, Japan.
| |
Collapse
|
45
|
Angello NH, Rathore V, Beker W, Wołos A, Jira ER, Roszak R, Wu TC, Schroeder CM, Aspuru-Guzik A, Grzybowski BA, Burke MD. Closed-loop optimization of general reaction conditions for heteroaryl Suzuki-Miyaura coupling. Science 2022; 378:399-405. [DOI: 10.1126/science.adc8743] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/02/2022]
Abstract
General conditions for organic reactions are important but rare, and efforts to identify them usually consider only narrow regions of chemical space. Discovering more general reaction conditions requires considering vast regions of chemical space derived from a large matrix of substrates crossed with a high-dimensional matrix of reaction conditions, rendering exhaustive experimentation impractical. Here, we report a simple closed-loop workflow that leverages data-guided matrix down-selection, uncertainty-minimizing machine learning, and robotic experimentation to discover general reaction conditions. Application to the challenging and consequential problem of heteroaryl Suzuki-Miyaura cross-coupling identified conditions that double the average yield relative to a widely used benchmark that was previously developed using traditional approaches. This study provides a practical road map for solving multidimensional chemical optimization problems with large search spaces.
Collapse
Affiliation(s)
- Nicholas H. Angello
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Vandana Rathore
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | | | - Agnieszka Wołos
- Allchemy, Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Edward R. Jira
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Rafał Roszak
- Allchemy, Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Tony C. Wu
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Charles M. Schroeder
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Department of Materials Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Canadian Institute for Advanced Research, Toronto, ON, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
| | - Bartosz A. Grzybowski
- Allchemy, Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
- Center for Soft and Living Matter, Institute for Basic Science, Ulsan, Republic of Korea
- Department of Chemistry, Ulsan Institute of Science and Technology, Ulsan, Republic of Korea
| | - Martin D. Burke
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Carle Illinois College of Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| |
Collapse
|
46
|
Torres JAG, Lau SH, Anchuri P, Stevens JM, Tabora JE, Li J, Borovika A, Adams RP, Doyle AG. A Multi-Objective Active Learning Platform and Web App for Reaction Optimization. J Am Chem Soc 2022; 144:19999-20007. [PMID: 36260788 DOI: 10.1021/jacs.2c08592] [Citation(s) in RCA: 38] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We report the development of an open-source experimental design via Bayesian optimization platform for multi-objective reaction optimization. Using high-throughput experimentation (HTE) and virtual screening data sets containing high-dimensional continuous and discrete variables, we optimized the performance of the platform by fine-tuning the algorithm components such as reaction encodings, surrogate model parameters, and initialization techniques. Having established the framework, we applied the optimizer to real-world test scenarios for the simultaneous optimization of the reaction yield and enantioselectivity in a Ni/photoredox-catalyzed enantioselective cross-electrophile coupling of styrene oxide with two different aryl iodide substrates. Starting with no previous experimental data, the Bayesian optimizer identified reaction conditions that surpassed the previously human-driven optimization campaigns within 15 and 24 experiments, for each substrate, among 1728 possible configurations available in each optimization. To make the platform more accessible to nonexperts, we developed a graphical user interface (GUI) that can be accessed online through a web-based application and incorporated features such as condition modification on the fly and data visualization. This web application does not require software installation, removing any programming barrier to use the platform, which enables chemists to integrate Bayesian optimization routines into their everyday laboratory practices.
Collapse
Affiliation(s)
| | - Sii Hong Lau
- Department of Chemistry, Princeton University, Princeton, New Jersey 08544, United States.,Department of Chemistry & Biochemistry, University of California, Los Angeles, California 90095, United States
| | - Pranay Anchuri
- Center of Information Technology Policy, Princeton University, Princeton, New Jersey 08544, United States
| | - Jason M Stevens
- Chemical Process Development, Bristol Myers Squibb, New Brunswick, New Jersey 08901, United States
| | - Jose E Tabora
- Chemical Process Development, Bristol Myers Squibb, New Brunswick, New Jersey 08901, United States
| | - Jun Li
- Chemical Process Development, Bristol Myers Squibb, New Brunswick, New Jersey 08901, United States
| | - Alina Borovika
- Chemical Process Development, Bristol Myers Squibb, New Brunswick, New Jersey 08901, United States
| | - Ryan P Adams
- Department of Computer Science, Princeton University, Princeton, New Jersey 08544, United States
| | - Abigail G Doyle
- Department of Chemistry & Biochemistry, University of California, Los Angeles, California 90095, United States
| |
Collapse
|
47
|
Deng H, Bengsch M, Tchorz N, Neumann CN. Sterically Controlled Late-Stage Functionalization of Bulky Phosphines. Chemistry 2022; 28:e202202074. [PMID: 35789048 PMCID: PMC9544633 DOI: 10.1002/chem.202202074] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2022] [Indexed: 11/07/2022]
Abstract
The fine-tuning of metal-phosphine-catalyzed reactions relies largely on accessing ever more precisely tuned phosphine ligands by de-novo synthesis. Late-stage C-H functionalization and diversification of commercial phosphines offers rapid access to entire libraries of derivatives based on privileged scaffolds. But existing routes, relying on phosphorus-directed transformations, only yield functionalization of Csp 2 -H bonds in a specific position relative to phosphorus. In contrast to phosphorus-directed strategies, herein we disclose an orthogonal functionalization strategy capable of introducing a range of substituents into previously inaccessible positions on arylphosphines. The strongly coordinating phosphine group acts solely as a bystander in the sterically controlled borylation of bulky phosphines, and the resulting borylated phosphines serve as the supporting ligands for palladium during diversification through phosphine self-assisted Suzuki-Miyaura reactions.
Collapse
Affiliation(s)
- Hao Deng
- Department of Heterogeneous CatalysisMax-Planck-Institut für KohlenforschungKaiser-Wilhelm-Platz 145470Mülheim an der RuhrGermany
| | - Marco Bengsch
- Department of Heterogeneous CatalysisMax-Planck-Institut für KohlenforschungKaiser-Wilhelm-Platz 145470Mülheim an der RuhrGermany
| | - Nico Tchorz
- Department of Heterogeneous CatalysisMax-Planck-Institut für KohlenforschungKaiser-Wilhelm-Platz 145470Mülheim an der RuhrGermany
| | - Constanze N. Neumann
- Department of Heterogeneous CatalysisMax-Planck-Institut für KohlenforschungKaiser-Wilhelm-Platz 145470Mülheim an der RuhrGermany
| |
Collapse
|
48
|
Seifrid M, Pollice R, Aguilar-Granda A, Morgan Chan Z, Hotta K, Ser CT, Vestfrid J, Wu TC, Aspuru-Guzik A. Autonomous Chemical Experiments: Challenges and Perspectives on Establishing a Self-Driving Lab. Acc Chem Res 2022; 55:2454-2466. [PMID: 35948428 PMCID: PMC9454899 DOI: 10.1021/acs.accounts.2c00220] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2022] [Indexed: 01/19/2023]
Abstract
We must accelerate the pace at which we make technological advancements to address climate change and disease risks worldwide. This swifter pace of discovery requires faster research and development cycles enabled by better integration between hypothesis generation, design, experimentation, and data analysis. Typical research cycles take months to years. However, data-driven automated laboratories, or self-driving laboratories, can significantly accelerate molecular and materials discovery. Recently, substantial advancements have been made in the areas of machine learning and optimization algorithms that have allowed researchers to extract valuable knowledge from multidimensional data sets. Machine learning models can be trained on large data sets from the literature or databases, but their performance can often be hampered by a lack of negative results or metadata. In contrast, data generated by self-driving laboratories can be information-rich, containing precise details of the experimental conditions and metadata. Consequently, much larger amounts of high-quality data are gathered in self-driving laboratories. When placed in open repositories, this data can be used by the research community to reproduce experiments, for more in-depth analysis, or as the basis for further investigation. Accordingly, high-quality open data sets will increase the accessibility and reproducibility of science, which is sorely needed.In this Account, we describe our efforts to build a self-driving lab for the development of a new class of materials: organic semiconductor lasers (OSLs). Since they have only recently been demonstrated, little is known about the molecular and material design rules for thin-film, electrically-pumped OSL devices as compared to other technologies such as organic light-emitting diodes or organic photovoltaics. To realize high-performing OSL materials, we are developing a flexible system for automated synthesis via iterative Suzuki-Miyaura cross-coupling reactions. This automated synthesis platform is directly coupled to the analysis and purification capabilities. Subsequently, the molecules of interest can be transferred to an optical characterization setup. We are currently limited to optical measurements of the OSL molecules in solution. However, material properties are ultimately most important in the solid state (e.g., as a thin-film device). To that end and for a different scientific goal, we are developing a self-driving lab for inorganic thin-film materials focused on the oxygen evolution reaction.While the future of self-driving laboratories is very promising, numerous challenges still need to be overcome. These challenges can be split into cognition and motor function. Generally, the cognitive challenges are related to optimization with constraints or unexpected outcomes for which general algorithmic solutions have yet to be developed. A more practical challenge that could be resolved in the near future is that of software control and integration because few instrument manufacturers design their products with self-driving laboratories in mind. Challenges in motor function are largely related to handling heterogeneous systems, such as dispensing solids or performing extractions. As a result, it is critical to understand that adapting experimental procedures that were designed for human experimenters is not as simple as transferring those same actions to an automated system, and there may be more efficient ways to achieve the same goal in an automated fashion. Accordingly, for self-driving laboratories, we need to carefully rethink the translation of manual experimental protocols.
Collapse
Affiliation(s)
- Martin Seifrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Robert Pollice
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | | | - Zamyla Morgan Chan
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Acceleration
Consortium, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Kazuhiro Hotta
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Science
& Innovation Center, Mitsubishi Chemical
Corporation, 1000 Kamoshidacho, Aoba, Yokohama, Kanagawa 227-8502, Japan
| | - Cher Tian Ser
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Jenya Vestfrid
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Tony C. Wu
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Vector
Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research, Toronto, Ontario M5S 1M1, Canada
| |
Collapse
|
49
|
Dong Q. Optimization and Application of Information Visualization Design Based on Image Symbol under the Guidance of Feature Integration Theory. CONTRAST MEDIA & MOLECULAR IMAGING 2022; 2022:5257187. [PMID: 36262994 PMCID: PMC9552687 DOI: 10.1155/2022/5257187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/16/2022] [Revised: 06/21/2022] [Accepted: 07/19/2022] [Indexed: 01/26/2023]
Abstract
Increasingly, today's businesses rely on data visualization to aid in the outcome that is directly linked to the bulk of their earnings. Due to the enormous volume, speed, and accuracy requirements of data management, database professionals are becoming increasingly necessary to aid in the effective visualization of data. Assuming the information to be depicted is free of ambiguity, most visualization approaches were developed. However, this is a rare occurrence. There has been a recent upsurge in visualizations that attempt to convey a sense of unpredictability. When it comes to visual optimization, we present a novel cognitive fuzzy logic-based particle swarm optimization (CFLPSO) to optimize the data visualizations. Initially, the datasets are gathered as images as well as are denoised and enhanced by employing the bilateral three-dimensional fairing median filter (B-3D-FMF) and contrast illuminate histogram equalization (CIHE), correspondingly. Principal component analysis (PCA) is utilized in the feature extraction stage to extract the features from the enhanced data. Then, the feature integration theory is applied to the extracted features, and also a fast rectangle-packing algorithm is applied to the data visualization. And the proposed approach is employed for visual optimization. The performance of the proposed technique is examined and compared with other existing techniques to obtain the proposed technique with the greatest effectiveness of visual optimization. The findings are depicted by utilizing the Origin tool.
Collapse
Affiliation(s)
- Qi Dong
- Department of Art, School of Humanities and Social Science, Xi'an Jiaotong University, Xi'an 710049, Shaanxi, China
| |
Collapse
|
50
|
Duan C, Ladera AJ, Liu JCL, Taylor MG, Ariyarathna IR, Kulik HJ. Exploiting Ligand Additivity for Transferable Machine Learning of Multireference Character across Known Transition Metal Complex Ligands. J Chem Theory Comput 2022; 18:4836-4845. [PMID: 35834742 DOI: 10.1021/acs.jctc.2c00468] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Accurate virtual high-throughput screening (VHTS) of transition metal complexes (TMCs) remains challenging due to the possibility of high multireference (MR) character that complicates property evaluation. We compute MR diagnostics for over 5,000 ligands present in previously synthesized octahedral mononuclear transition metal complexes in the Cambridge Structural Database (CSD). To accomplish this task, we introduce an iterative approach for consistent ligand charge assignment for ligands in the CSD. Across this set, we observe that the MR character correlates linearly with the inverse value of the averaged bond order over all bonds in the molecule. We then demonstrate that ligand additivity of the MR character holds in TMCs, which suggests that the TMC MR character can be inferred from the sum of the MR character of the ligands. Encouraged by this observation, we leverage ligand additivity and develop a ligand-derived machine learning representation to train neural networks to predict the MR character of TMCs from properties of the constituent ligands. This approach yields models with excellent performance and superior transferability to unseen ligand chemistry and compositions.
Collapse
Affiliation(s)
- Chenru Duan
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States.,Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Adriana J Ladera
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Julian C-L Liu
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Michael G Taylor
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Isuru R Ariyarathna
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Heather J Kulik
- Department of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|