1
|
Duke R, Yang CH, Ganapathysubramanian B, Risko C. Evaluating Molecular Similarity Measures: Do Similarity Measures Reflect Electronic Structure Properties? J Chem Inf Model 2025; 65:4311-4319. [PMID: 40299458 DOI: 10.1021/acs.jcim.5c00175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2025]
Abstract
The rapid adoption of big data, machine learning (ML), and generative artificial intelligence (AI) in chemical discovery has heightened the importance of quantifying molecular similarity. Molecular similarity, commonly assessed as the distance between molecular fingerprints, is integral to applications such as database curation, diversity analysis, and property prediction. AI tools frequently rely on these similarity measures to cluster molecules under the assumption that structurally similar molecules exhibit similar properties. However, this assumption is not universally valid, particularly for continuous properties like electronic structure properties. Despite the prevalence of fingerprint-based similarity measures, their evaluation has largely depended on biological activity data sets and qualitative metrics, limiting their relevance for nonbiological domains. To address this gap, we propose a framework to evaluate the correlation between molecular similarity measures and molecular properties. Our approach builds on the concept of neighborhood behavior and incorporates kernel density estimation (KDE) analysis to quantify how well similarity measures capture property relationships. Using a data set of over 350 million molecule pairs with electronic structure, redox, and optical properties, we systematically evaluate the correlation between several molecular fingerprint generators, distance functions, and these properties. Both the curated data set and the evaluation framework are publicly available.
Collapse
Affiliation(s)
- Rebekah Duke
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky, Lexington, Kentucky 40506, United States
| | - Chih-Hsuan Yang
- Department of Mechanical Engineering and Translational AI Research and Education Center, Iowa State University, Ames, Iowa 50011, United States
| | - Baskar Ganapathysubramanian
- Department of Mechanical Engineering and Translational AI Research and Education Center, Iowa State University, Ames, Iowa 50011, United States
| | - Chad Risko
- Department of Chemistry and Center for Applied Energy Research, University of Kentucky, Lexington, Kentucky 40506, United States
| |
Collapse
|
2
|
Mroz AM, Basford AR, Hastedt F, Jayasekera IS, Mosquera-Lois I, Sedgwick R, Ballester PJ, Bocarsly JD, Antonio Del Río Chanona E, Evans ML, Frost JM, Ganose AM, Greenaway RL, Kuok Mimi Hii K, Li Y, Misener R, Walsh A, Zhang D, Jelfs KE. Cross-disciplinary perspectives on the potential for artificial intelligence across chemistry. Chem Soc Rev 2025. [PMID: 40278836 PMCID: PMC12024683 DOI: 10.1039/d5cs00146c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2025] [Indexed: 04/26/2025]
Abstract
From accelerating simulations and exploring chemical space, to experimental planning and integrating automation within experimental labs, artificial intelligence (AI) is changing the landscape of chemistry. We are seeing a significant increase in the number of publications leveraging these powerful data-driven insights and models to accelerate all aspects of chemical research. For example, how we represent molecules and materials to computer algorithms for predictive and generative models, as well as the physical mechanisms by which we perform experiments in the lab for automation. Here, we present ten diverse perspectives on the impact of AI coming from those with a range of backgrounds from experimental chemistry, computational chemistry, computer science, engineering and across different areas of chemistry, including drug discovery, catalysis, chemical automation, chemical physics, materials chemistry. The ten perspectives presented here cover a range of themes, including AI for computation, facilitating discovery, supporting experiments, and enabling technologies for transformation. We highlight and discuss imminent challenges and ways in which we are redefining problems to accelerate the impact of chemical research via AI.
Collapse
Affiliation(s)
- Austin M Mroz
- Department of Chemistry, Imperial College London, London W12 0BZ, UK.
- I-X Centre for AI in Science, Imperial College London, London W12 0BZ, UK
| | - Annabel R Basford
- Department of Chemistry, Imperial College London, London W12 0BZ, UK.
| | - Friedrich Hastedt
- Department of Chemical Engineering, Imperial College London, London SW7 2AZ, UK
| | | | | | - Ruby Sedgwick
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| | - Pedro J Ballester
- Department of Bioengineering, Imperial College London, London SW7 2AZ, UK
| | - Joshua D Bocarsly
- Department of Chemistry and Texas Center for Superconductivity, University of Houston, Houston, USA
| | | | - Matthew L Evans
- UCLouvain, Institute of Condensed Matter and Nanosciences (IMCN), Chemin des Étoiles 8, Louvain-la-Neuve 1348, Belgium
- Matgenix SRL, A6K Advanced Engineering Center, Charleroi, Belgium
- Datalab Industries Ltd, King's Lynn, Norfolk, UK
| | - Jarvist M Frost
- Department of Chemistry, Imperial College London, London W12 0BZ, UK.
| | - Alex M Ganose
- Department of Chemistry, Imperial College London, London W12 0BZ, UK.
| | | | | | - Yingzhen Li
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| | - Ruth Misener
- Department of Computing, Imperial College London, London SW7 2AZ, UK
| | - Aron Walsh
- Department of Materials, Imperial College London, London SW7 2AZ, UK
| | - Dandan Zhang
- I-X Centre for AI in Science, Imperial College London, London W12 0BZ, UK
- Department of Bioengineering, Imperial College London, London SW7 2AZ, UK
| | - Kim E Jelfs
- Department of Chemistry, Imperial College London, London W12 0BZ, UK.
| |
Collapse
|
3
|
Edaugal J, Zhang D, Liu D, Glezakou VA, Sun N. Solvent Screening for Separation Processes Using Machine Learning and High-Throughput Technologies. CHEM & BIO ENGINEERING 2025; 2:210-228. [PMID: 40302870 PMCID: PMC12035567 DOI: 10.1021/cbe.4c00170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Revised: 02/13/2025] [Accepted: 02/16/2025] [Indexed: 05/02/2025]
Abstract
As the chemical industry shifts toward sustainable practices, there is a growing initiative to replace conventional fossil-derived solvents with environmentally friendly alternatives such as ionic liquids (ILs) and deep eutectic solvents (DESs). Artificial intelligence (AI) plays a key role in the discovery and design of novel solvents and the development of green processes. This review explores the latest advancements in AI-assisted solvent screening with a specific focus on machine learning (ML) models for physicochemical property prediction and separation process design. Additionally, this paper highlights recent progress in the development of automated high-throughput (HT) platforms for solvent screening. Finally, this paper discusses the challenges and prospects of ML-driven HT strategies for green solvent design and optimization. To this end, this review provides key insights to advance solvent screening strategies for future chemical and separation processes.
Collapse
Affiliation(s)
- Justin
P. Edaugal
- Advanced
Biofuels and Bioproducts Process Development Unit, Biological Systems
and Engineering Division, Lawrence Berkeley
National Laboratory, Emeryville, California 94608, United States
| | - Difan Zhang
- Physical
and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington 99354, United States
| | - Dupeng Liu
- Advanced
Biofuels and Bioproducts Process Development Unit, Biological Systems
and Engineering Division, Lawrence Berkeley
National Laboratory, Emeryville, California 94608, United States
| | | | - Ning Sun
- Advanced
Biofuels and Bioproducts Process Development Unit, Biological Systems
and Engineering Division, Lawrence Berkeley
National Laboratory, Emeryville, California 94608, United States
| |
Collapse
|
4
|
Canty RB, Bennett JA, Brown KA, Buonassisi T, Kalinin SV, Kitchin JR, Maruyama B, Moore RG, Schrier J, Seifrid M, Sun S, Vegge T, Abolhasani M. Science acceleration and accessibility with self-driving labs. Nat Commun 2025; 16:3856. [PMID: 40274856 PMCID: PMC12022019 DOI: 10.1038/s41467-025-59231-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Accepted: 04/08/2025] [Indexed: 04/26/2025] Open
Abstract
In the evolving landscape of scientific research, the complexity of global challenges demands innovative approaches to experimental planning and execution. Self-Driving Laboratories (SDLs) automate experimental tasks in chemical and materials sciences and the design and selection of experiments to optimize research processes and reduce material usage. This perspective explores improving access to SDLs via centralized facilities and distributed networks. We discuss the technical and collaborative challenges in realizing SDLs' potential to enhance human-machine and human-human collaboration, ultimately fostering a more inclusive research community and facilitating previously untenable research projects.
Collapse
Affiliation(s)
- Richard B Canty
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Jeffrey A Bennett
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA
| | - Keith A Brown
- Department of Mechanical Engineering, Boston University, Boston, MA, USA
| | - Tonio Buonassisi
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Sergei V Kalinin
- Materials Science and Engineering, The University of Tennessee, Knoxville, TN, USA
| | - John R Kitchin
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, PA, USA
| | - Benji Maruyama
- Air Force Research Laboratory, Materials and Manufacturing Directorate, Wright-Patterson AFB, OH, USA
| | - Robert G Moore
- Materials Science and Technology Division, Oak Ridge National Laboratory, Oak Ridge, TN, USA
| | - Joshua Schrier
- Department of Chemistry and Biochemistry, Fordham University, New York, NY, USA
| | - Martin Seifrid
- Department of Materials Science and Engineering, North Carolina State University, Raleigh, NC, USA
| | - Shijing Sun
- Department of Mechanical Engineering, University of Washington, Seattle, WA, USA
| | - Tejs Vegge
- Department of Energy Conversion and Storage, Technical University of Denmark, Lyngby, Denmark
| | - Milad Abolhasani
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC, USA.
| |
Collapse
|
5
|
Lin DZ, Pan KJ, Li Y, Charles B Musgrave Iii, Zhang L, Jayarapu KN, Li T, Tran JV, Goddard WA, Luo Z, Liu Y. A high-throughput experimentation platform for data-driven discovery in electrochemistry. SCIENCE ADVANCES 2025; 11:eadu4391. [PMID: 40184458 DOI: 10.1126/sciadv.adu4391] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2024] [Accepted: 02/28/2025] [Indexed: 04/06/2025]
Abstract
Automating electrochemical analyses combined with artificial intelligence is poised to accelerate discoveries in renewable energy sciences and technologies. This study presents an automated high-throughput electrochemical characterization (AHTech) platform as a cost-effective and versatile tool for rapidly assessing liquid analytes. The Python-controlled platform combines a liquid handling robot, potentiostat, and customizable microelectrode bundles for diverse, reproducible electrochemical measurements in microtiter plates, minimizing chemical consumption and manual effort. To showcase the capability of AHTech, we screened a library of 180 small molecules as electrolyte additives for aqueous zinc metal batteries, generating data for training machine learning models to predict Coulombic efficiencies. Key molecular features governing additive performance were elucidated using Shapley Additive exPlanations and Spearman's correlation, pinpointing high-performance candidates like cis-4-hydroxy-d-proline, which achieved an average Coulombic efficiency of 99.52% over 200 cycles. The workflow established herein is highly adaptable, offering a powerful framework for accelerating the exploration and optimization of extensive chemical spaces across diverse energy storage and conversion fields.
Collapse
Affiliation(s)
- Dian-Zhao Lin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Kai-Jui Pan
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Yuyin Li
- Department of Chemical and Biological Engineering, William Mong Institute of Nano Science and Technology and Hong Kong Branch of Chinese National Engineering Research Center for Tissue Restoration and Reconstruction, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong 999077, P. R. China
- Materials and Process Simulation Center, Department of Chemistry, California Institute of Technology, Pasadena, CA 91125, USA
| | - Charles B Musgrave Iii
- Materials and Process Simulation Center, Department of Chemistry, California Institute of Technology, Pasadena, CA 91125, USA
| | - Lingyu Zhang
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Krish N Jayarapu
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Tianchen Li
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jasmine Vy Tran
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - William A Goddard
- Materials and Process Simulation Center, Department of Chemistry, California Institute of Technology, Pasadena, CA 91125, USA
| | - Zhengtang Luo
- Department of Chemical and Biological Engineering, William Mong Institute of Nano Science and Technology and Hong Kong Branch of Chinese National Engineering Research Center for Tissue Restoration and Reconstruction, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong 999077, P. R. China
| | - Yayuan Liu
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
6
|
Zsoldos P, Wolf Á, Széll K, Galambos P. Towards robotic Laboratory Automation Plug & Play: LAPP reference implementation with the TIAGo mobile manipulator. SLAS Technol 2025; 31:100251. [PMID: 39922496 DOI: 10.1016/j.slast.2025.100251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 12/24/2024] [Accepted: 01/29/2025] [Indexed: 02/10/2025]
Abstract
Laboratory automation with mobile manipulators presents a promising avenue for enhancing efficiency and reducing the dependency on human activity in laboratory workflows. To achieve seamless integration, the Laboratory Automation Plag & Play concept has been introduced. This study presents an experimental implementation of the LAPP concept through the development of labware transfer functionality for a mobile manipulator employing the SiLA and ROS frameworks as widely accepted control layers. While validating the feasibility of the LAPP concept, this reference implementation accentuates the need for advancements in both hardware and software environments to fully capitalize on its benefits in contemporary laboratory settings.
Collapse
Affiliation(s)
- Panna Zsoldos
- Antal Bejczy Center for Intelligent Robotics, Obuda University, Budapest, Hungary.
| | - Ádám Wolf
- Baxalta Innovations GmbH, a Takeda company, Wien, Austria; Doctoral School of Applied Informatics and Applied Mathematics, Obuda University, Budapest, Hungary.
| | - Károly Széll
- Alba Regia Technical Faculty, Obuda University, Székesfehérvár, Hungary
| | - Péter Galambos
- Antal Bejczy Center for Intelligent Robotics, Obuda University, Budapest, Hungary
| |
Collapse
|
7
|
Baronas P, Lekavičius J, Majdecki M, Elholm JL, Kazlauskas K, Gaweł P, Moth-Poulsen K. Automated Research Platform for Development of Triplet-Triplet Annihilation Photon Upconversion Systems. ACS CENTRAL SCIENCE 2025; 11:413-421. [PMID: 40161950 PMCID: PMC11950846 DOI: 10.1021/acscentsci.4c02059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/01/2024] [Revised: 01/24/2025] [Accepted: 02/04/2025] [Indexed: 04/02/2025]
Abstract
Triplet-triplet annihilation photon upconversion (TTA-UC) systems hold great promise for applications in energy, 3D printing, and photopharmacology. However, their optimization remains challenging due to the need for precise tuning of sensitizer and annihilator concentrations under oxygen-free conditions. This study presents an automated, high-throughput platform for the discovery and optimization of TTA-UC systems. Capable of performing 100 concentration scans in just two hours, the platform generates comprehensive concentration maps of critical parameters, including quantum yield, triplet energy transfer efficiency, and threshold intensity. Using this approach, we identify key loss mechanisms in both the established and novel TTA-UC systems. At high porphyrin-based sensitizer concentrations, upconversion quantum yield losses are attributed to sensitizer triplet self-quenching via aggregation and sensitizer triplet-triplet annihilation (sensitizer-TTA). Additionally, reverse triplet energy transfer (RTET) at elevated sensitizer levels increases the upconversion losses and excitation thresholds. Testing novel sensitizer-annihilator pairs confirms these loss mechanisms, highlighting opportunities for molecular design improvements. This automated platform offers a powerful tool for advancing TTA-UC research and other photochemical studies requiring low oxygen levels, intense laser excitation, and minimal material use.
Collapse
Affiliation(s)
- Paulius Baronas
- Department
of Chemical Engineering, Universitat Politècnica
de Catalunya, EEBE, Eduard
Maristany 10-14, 08019 Barcelona, Spain
- Institute
of Photonics and Nanotechnology, Vilnius
University, Saulėtekis av. 3, 10257 Vilnius, Lithuania
- The Institute
of Materials Science of Barcelona, ICMAB-CSIC, Bellaterra, 08193 Barcelona, Spain
| | - Justas Lekavičius
- Institute
of Photonics and Nanotechnology, Vilnius
University, Saulėtekis av. 3, 10257 Vilnius, Lithuania
| | - Maciej Majdecki
- Institute
of Organic Chemistry, Polish Academy of
Sciences, Kasprzaka 44/52, 01-224 Warsaw, Poland
| | - Jacob Lynge Elholm
- Department
of Chemical Engineering, Universitat Politècnica
de Catalunya, EEBE, Eduard
Maristany 10-14, 08019 Barcelona, Spain
| | - Karolis Kazlauskas
- Institute
of Photonics and Nanotechnology, Vilnius
University, Saulėtekis av. 3, 10257 Vilnius, Lithuania
| | - Przemysław Gaweł
- Institute
of Organic Chemistry, Polish Academy of
Sciences, Kasprzaka 44/52, 01-224 Warsaw, Poland
| | - Kasper Moth-Poulsen
- Department
of Chemical Engineering, Universitat Politècnica
de Catalunya, EEBE, Eduard
Maristany 10-14, 08019 Barcelona, Spain
- Catalan
Institution for Research & Advanced Studies, ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain
- Department
of Chemistry and Chemical Engineering, Chalmers
University of Technology, SE-41296 Gothenburg, Sweden
- The Institute
of Materials Science of Barcelona, ICMAB-CSIC, Bellaterra, 08193 Barcelona, Spain
| |
Collapse
|
8
|
Fushimi K, Nakai Y, Nishi A, Suzuki R, Ikegami M, Nimura R, Tomono T, Hidese R, Yasueda H, Tagawa Y, Hasunuma T. Development of the autonomous lab system to support biotechnology research. Sci Rep 2025; 15:6648. [PMID: 39994271 PMCID: PMC11850614 DOI: 10.1038/s41598-025-89069-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2024] [Accepted: 02/03/2025] [Indexed: 02/26/2025] Open
Abstract
In this study, we developed the autonomous lab (ANL), which is a system based on robotics and artificial intelligence (AI) to conduct biotechnology experiments and formulate scientific hypotheses. This system was designed with modular devices and Bayesian optimization algorithms, allowing it to effectively run a closed loop from culturing to preprocessing, measurement, analysis, and hypothesis formulation. As a case study, we used the ANL to optimize medium conditions for a recombinant Escherichia coli strain, which overproduces glutamic acid. The results demonstrated that our autonomous system successfully replicated the experimental techniques, such as sample preparation and data measurement, and improved both the cell growth rate and the maximum cell growth. The ANL offers a versatile and scalable solution for various applications in the field of bioproduction, with the potential to improve efficiency and reliability of experimental processes in the future.
Collapse
Affiliation(s)
- Keiji Fushimi
- Graduate School of Science, Innovation and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
| | - Yusuke Nakai
- Technology Research Laboratory, Shimadzu Corporation, 3-9-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, Japan
| | - Akiko Nishi
- Graduate School of Science, Innovation and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
| | - Ryo Suzuki
- Graduate School of Science, Innovation and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
| | - Masahiro Ikegami
- Technology Research Laboratory, Shimadzu Corporation, 3-9-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, Japan
| | - Risa Nimura
- Technology Research Laboratory, Shimadzu Corporation, 3-9-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, Japan
| | - Taichi Tomono
- Technology Research Laboratory, Shimadzu Corporation, 3-9-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, Japan
| | - Ryota Hidese
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657- 8501, Japan
| | - Hisashi Yasueda
- Graduate School of Science, Innovation and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657- 8501, Japan
- Research and Development Center for Precision Medicine, University of Tsukuba, 1-2 Kasuga, Tsukuba, Ibaraki, 305-8550, Japan
| | - Yusuke Tagawa
- Technology Research Laboratory, Shimadzu Corporation, 3-9-4 Hikaridai, Seika-cho, Soraku-gun, Kyoto, 619-0237, Japan.
| | - Tomohisa Hasunuma
- Graduate School of Science, Innovation and Technology, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657-8501, Japan.
- Engineering Biology Research Center, Kobe University, 1-1 Rokkodai, Nada, Kobe, 657- 8501, Japan.
| |
Collapse
|
9
|
Cheng AH, Ser CT, Skreta M, Guzmán-Cordero A, Thiede L, Burger A, Aldossary A, Leong SX, Pablo-García S, Strieth-Kalthoff F, Aspuru-Guzik A. Spiers Memorial Lecture: How to do impactful research in artificial intelligence for chemistry and materials science. Faraday Discuss 2025; 256:10-60. [PMID: 39400305 DOI: 10.1039/d4fd00153b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/15/2024]
Abstract
Machine learning has been pervasively touching many fields of science. Chemistry and materials science are no exception. While machine learning has been making a great impact, it is still not reaching its full potential or maturity. In this perspective, we first outline current applications across a diversity of problems in chemistry. Then, we discuss how machine learning researchers view and approach problems in the field. Finally, we provide our considerations for maximizing impact when researching machine learning for chemistry.
Collapse
Affiliation(s)
- Austin H Cheng
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andrés Guzmán-Cordero
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Tinbergen Institute, University of Amsterdam, Amsterdam, Netherlands
| | - Luca Thiede
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | - Andreas Burger
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
| | | | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- School of Chemistry, Chemical Engineering and Biotechnology, Nanyang Technological University, Singapore 63737, Singapore
| | | | | | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada.
- Department of Computer Science, University of Toronto, Toronto, Ontario M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5G 1M1, Canada
- Acceleration Consortium, Toronto, Ontario M5G 1X6, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Canada
- Department of Materials Science and Engineering, University of Toronto, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Canada
| |
Collapse
|
10
|
Foadian E, Sanchez S, Kalinin SV, Ahmadi M. From Sunlight to Solutions: Closing the Loop on Halide Perovskites. ACS MATERIALS AU 2025; 5:11-23. [PMID: 39802149 PMCID: PMC11718539 DOI: 10.1021/acsmaterialsau.4c00096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/27/2024] [Revised: 10/20/2024] [Accepted: 10/21/2024] [Indexed: 01/16/2025]
Abstract
Halide perovskites (HPs) are emerging as key materials in the fight against global warming with well recognized applications, such as photovoltaics, and emergent opportunities, such as photocatalysis for methane removal and environmental remediation. These current and emergent applications are enabled by a unique combination of high absorption coefficients, tunable band gaps, and long carrier diffusion lengths, making them highly efficient for solar energy conversion. To address the challenge of discovery and optimization of HPs in huge chemical and compositional spaces of possible candidates, this perspective discusses a comprehensive strategy for screening HPs through automated high-throughput and combinatorial synthesis techniques. A critical aspect of this approach is closing the characterization loop, where machine learning (ML) and human collaboration play pivotal roles. By leveraging human creativity and domain knowledge for hypothesis generation and employing ML to test and refine these hypotheses efficiently, we aim to accelerate the discovery and optimization of HPs under specific environmental conditions. This synergy enables rapid identification of the most promising materials, advancing from fundamental discovery to scalable manufacturability. Our ultimate goal of this work is to transition from laboratory-scale innovations to real-world applications, ensuring that HPs can be deployed effectively in technologies that mitigate global warming, such as in solar energy harvesting and methane removal systems.
Collapse
Affiliation(s)
- Elham Foadian
- Institute
for Advanced Materials and Manufacturing, Department of Materials Science and Engineering, Knoxville, Tennessee 37996, United States
| | - Sheryl Sanchez
- Institute
for Advanced Materials and Manufacturing, Department of Materials Science and Engineering, Knoxville, Tennessee 37996, United States
| | - Sergei V. Kalinin
- Institute
for Advanced Materials and Manufacturing, Department of Materials Science and Engineering, Knoxville, Tennessee 37996, United States
- Physical
Science Directorate, Pacific Northwest National
Laboratory, Richland, Washington 99354, United States
| | - Mahshid Ahmadi
- Institute
for Advanced Materials and Manufacturing, Department of Materials Science and Engineering, Knoxville, Tennessee 37996, United States
| |
Collapse
|
11
|
Velasco PQ, Hippalgaonkar K, Ramalingam B. Emerging trends in the optimization of organic synthesis through high-throughput tools and machine learning. Beilstein J Org Chem 2025; 21:10-38. [PMID: 39811684 PMCID: PMC11730176 DOI: 10.3762/bjoc.21.3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Accepted: 11/26/2024] [Indexed: 01/16/2025] Open
Abstract
The discovery of the optimal conditions for chemical reactions is a labor-intensive, time-consuming task that requires exploring a high-dimensional parametric space. Historically, the optimization of chemical reactions has been performed by manual experimentation guided by human intuition and through the design of experiments where reaction variables are modified one at a time to find the optimal conditions for a specific reaction outcome. Recently, a paradigm change in chemical reaction optimization has been enabled by advances in lab automation and the introduction of machine learning algorithms. Therein, multiple reaction variables can be synchronously optimized to obtain the optimal reaction conditions, requiring a shorter experimentation time and minimal human intervention. Herein, we review the currently used state-of-the-art high-throughput automated chemical reaction platforms and machine learning algorithms that drive the optimization of chemical reactions, highlighting the limitations and future opportunities of this new field of research.
Collapse
Affiliation(s)
- Pablo Quijano Velasco
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
| | - Kedar Hippalgaonkar
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Department of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Republic of Singapore
- Institute for Functional Intelligent Materials, National University of Singapore, 4 Science Drive 2, Singapore 117544, Republic of Singapore
| | - Balamurugan Ramalingam
- Institute of Materials Research and Engineering (IMRE), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Singapore 138634, Republic of Singapore
- Institute of Sustainability for Chemicals, Energy and Environment (ISCE2), Agency for Science Technology and Research (A*STAR), 1 Pesek Road, Jurong Island, Singapore 627833, Republic of Singapore
| |
Collapse
|
12
|
Yoo HJ, Lee KY, Kim D, Han SS. OCTOPUS: operation control system for task optimization and job parallelization via a user-optimal scheduler. Nat Commun 2024; 15:9669. [PMID: 39516207 PMCID: PMC11549402 DOI: 10.1038/s41467-024-54067-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 10/29/2024] [Indexed: 11/16/2024] Open
Abstract
The material acceleration platform, empowered by robotics and artificial intelligence, is a transformative approach for expediting material discovery processes across diverse domains. However, the development of an operating system for material acceleration platform faces challenges in simultaneously managing diverse experiments from multiple users. Specifically, when it is utilized by multiple users, the overlapping challenges of experimental modules or devices can lead to inefficiencies in both resource utilization and safety hazards. To overcome these challenges, we present an operation control system for material acceleration platform, namely, OCTOPUS, which is an acronym for operation control system for task optimization and job parallelization via a user-optimal scheduler. OCTOPUS streamlines experiment scheduling and optimizes resource utilization through integrating its interface node, master node and module nodes. Leveraging process modularization and a network protocol, OCTOPUS ensures the homogeneity, scalability, safety and versatility of the platform. In addition, OCTOPUS embodies a user-optimal scheduler. Job parallelization and task optimization techniques mitigate delays and safety hazards within realistic operational environments, while the closed-packing schedule algorithm efficiently executes multiple jobs with minimal resource waste. Copilot of OCTOPUS is developed to promote the reusability of OCTOPUS for potential users with their own sets of lab resources, which substantially simplifies the process of code generation and customization through GPT recommendations and client feedback. This work offers a solution to the challenges encountered within the platform accessed by multiple users, and thereby will facilitate its widespread adoption in material development processes.
Collapse
Affiliation(s)
- Hyuk Jun Yoo
- Computational Science Research Center, Korea Institute of Science and Technology, Seoul, Republic of Korea
- Department of Chemical and Biological Engineering, Korea University, Seoul, Republic of Korea
| | - Kwan-Young Lee
- Department of Chemical and Biological Engineering, Korea University, Seoul, Republic of Korea.
| | - Donghun Kim
- Computational Science Research Center, Korea Institute of Science and Technology, Seoul, Republic of Korea.
| | - Sang Soo Han
- Computational Science Research Center, Korea Institute of Science and Technology, Seoul, Republic of Korea.
| |
Collapse
|
13
|
Schoepfer AA, Weinreich J, Laplaza R, Waser J, Corminboeuf C. Cost-informed Bayesian reaction optimization. DIGITAL DISCOVERY 2024; 3:2289-2297. [PMID: 39398973 PMCID: PMC11465108 DOI: 10.1039/d4dd00225c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 09/30/2024] [Indexed: 10/15/2024]
Abstract
Bayesian optimization (BO) is an efficient method for solving complex optimization problems, including those in chemical research, where it is gaining significant popularity. Although effective in guiding experimental design, BO does not account for experimentation costs: testing readily available reagents under different conditions could be more cost and time-effective than synthesizing or buying additional ones. To address this issue, we present cost-informed BO (CIBO), an approach tailored for the rational planning of chemical experimentation that prioritizes the most cost-effective experiments. Reagents are used only when their anticipated improvement in reaction performance sufficiently outweighs their costs. Our algorithm tracks available reagents, including those recently acquired, and dynamically updates their cost during the optimization. Using literature data of Pd-catalyzed reactions, we show that CIBO reduces the cost of reaction optimization by up to 90% compared to standard BO. Our approach is compatible with any type of cost, e.g., of buying equipment or compounds, waiting time, as well as environmental or security concerns. We believe CIBO extends the possibilities of BO in chemistry and envision applications for both traditional and self-driving laboratories for experiment planning.
Collapse
Affiliation(s)
- Alexandre A Schoepfer
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- Laboratory of Catalysis and Organic Synthesis, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Jan Weinreich
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Ruben Laplaza
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Jerome Waser
- Laboratory of Catalysis and Organic Synthesis, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| | - Clemence Corminboeuf
- Laboratory for Computational Molecular Design, Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL) 1015 Lausanne Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis), École Polytechnique Fédérale de Lausanne 1015 Lausanne Switzerland
| |
Collapse
|
14
|
Dallaire N, Boileau NT, Myers I, Brixi S, Ourabi M, Raluchukwu E, Cranston R, Lamontagne HR, King B, Ronnasi B, Melville OA, Manion JG, Lessard BH. High Throughput Characterization of Organic Thin Film Transistors. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2406105. [PMID: 39149766 DOI: 10.1002/adma.202406105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 07/29/2024] [Indexed: 08/17/2024]
Abstract
Automation is vital to accelerating research. In recent years, the application of self-driving labs to materials discovery and device optimization has highlighted many benefits and challenges inherent to these new technologies. Successful automated workflows offer tangible benefits to fundamental science and industrial scale-up by significantly increasing productivity and reproducibility all while enabling entirely new types of experiments. However, it's implemtation is often time-consuming and cost-prohibitive and necessitates establishing multidisciplinary teams that bring together domain-specific knowledge with specific skillsets in computer science and engineering. This perspective article provides a comprehensive overview of how the research group has adopted "hybrid automation" over the last 8 years by using simple automatic electrical testers (autotesters) as a tool to increase productivity and enhance reproducibility in organic thin film transistor (OTFT) research. From wearable and stretchable electronics to next-generation sensors and displays, OTFTs have the potential to be a key technology that will enable new applications from health to aerospace. The combination of materials chemistry, device manufacturing, thin film characterization and electrical engineering makes OTFT research challenging due to the large parameter space created by both diverse material roles and device architectures. Consequently, this research stands to benefit enormously from automation. By leveraging the multidisciplinary team and taking a user-centered design approach in the design and continued improvement of the autotesters, the group has meaningfully increased productivity, explored research avenues impossible with traditional workflows, and developed as scientists and engineers capable of effectively designing and leveraging automation to build the future of their fields to encourage this approach, the files for replicating the infrastructure are included, and questions and potential collaborations are welcomed.
Collapse
Affiliation(s)
- Nicholas Dallaire
- School of Electrical Engineering and Computer Science, University of Ottawa, 800 King Edward Ave., Ottawa, ON, K1N 6N5, Canada
| | - Nicholas T Boileau
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Ian Myers
- University of Ottawa Electronics shop, University of Ottawa, 150 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Samantha Brixi
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - May Ourabi
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Ewenike Raluchukwu
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Rosemary Cranston
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Halynne R Lamontagne
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
- Department of Chemistry and Biomolecular Sciences, University of Ottawa, 150 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Benjamin King
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Bahar Ronnasi
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Owen A Melville
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
- Acceleration Consortium, University of Toronto, 80 St George St, Toronto, ON, M5S 3H6, Canada
| | - Joseph G Manion
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| | - Benoît H Lessard
- School of Electrical Engineering and Computer Science, University of Ottawa, 800 King Edward Ave., Ottawa, ON, K1N 6N5, Canada
- Department of Chemical and Biological Engineering, University of Ottawa, 161 Louis Pasteur, Ottawa, ON, K1N 6N5, Canada
| |
Collapse
|
15
|
Dai T, Vijayakrishnan S, Szczypiński FT, Ayme JF, Simaei E, Fellowes T, Clowes R, Kotopanov L, Shields CE, Zhou Z, Ward JW, Cooper AI. Autonomous mobile robots for exploratory synthetic chemistry. Nature 2024; 635:890-897. [PMID: 39506122 PMCID: PMC11602721 DOI: 10.1038/s41586-024-08173-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Accepted: 10/08/2024] [Indexed: 11/08/2024]
Abstract
Autonomous laboratories can accelerate discoveries in chemical synthesis, but this requires automated measurements coupled with reliable decision-making1,2. Most autonomous laboratories involve bespoke automated equipment3-6, and reaction outcomes are often assessed using a single, hard-wired characterization technique7. Any decision-making algorithms8 must then operate using this narrow range of characterization data9,10. By contrast, manual experiments tend to draw on a wider range of instruments to characterize reaction products, and decisions are rarely taken based on one measurement alone. Here we show that a synthesis laboratory can be integrated into an autonomous laboratory by using mobile robots11-13 that operate equipment and make decisions in a human-like way. Our modular workflow combines mobile robots, an automated synthesis platform, a liquid chromatography-mass spectrometer and a benchtop nuclear magnetic resonance spectrometer. This allows robots to share existing laboratory equipment with human researchers without monopolizing it or requiring extensive redesign. A heuristic decision-maker processes the orthogonal measurement data, selecting successful reactions to take forward and automatically checking the reproducibility of any screening hits. We exemplify this approach in the three areas of structural diversification chemistry, supramolecular host-guest chemistry and photochemical synthesis. This strategy is particularly suited to exploratory chemistry that can yield multiple potential products, as for supramolecular assemblies, where we also extend the method to an autonomous function assay by evaluating host-guest binding properties.
Collapse
Affiliation(s)
- Tianwei Dai
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Sriram Vijayakrishnan
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Filip T Szczypiński
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Jean-François Ayme
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Ehsan Simaei
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Thomas Fellowes
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Rob Clowes
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Lyubomir Kotopanov
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Caitlin E Shields
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Zhengxue Zhou
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - John W Ward
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK
| | - Andrew I Cooper
- Leverhulme Research Centre for Functional Materials Design and Materials Innovation Factory, University of Liverpool, Liverpool, UK.
| |
Collapse
|
16
|
Park J, Kim H, Kang Y, Lim Y, Kim J. From Data to Discovery: Recent Trends of Machine Learning in Metal-Organic Frameworks. JACS AU 2024; 4:3727-3743. [PMID: 39483241 PMCID: PMC11522899 DOI: 10.1021/jacsau.4c00618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 08/28/2024] [Accepted: 08/29/2024] [Indexed: 11/03/2024]
Abstract
Renowned for their high porosity and structural diversity, metal-organic frameworks (MOFs) are a promising class of materials for a wide range of applications. In recent decades, with the development of large-scale databases, the MOF community has witnessed innovations brought by data-driven machine learning methods, which have enabled a deeper understanding of the chemical nature of MOFs and led to the development of novel structures. Notably, machine learning is continuously and rapidly advancing as new methodologies, architectures, and data representations are actively being investigated, and their implementation in materials discovery is vigorously pursued. Under these circumstances, it is important to closely monitor recent research trends and identify the technologies that are being introduced. In this Perspective, we focus on emerging trends of machine learning within the field of MOFs, the challenges they face, and the future directions of their development.
Collapse
Affiliation(s)
- Junkil Park
- Department
of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Honghui Kim
- Department
of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Yeonghun Kang
- Department
of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Yunsung Lim
- Department
of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| | - Jihan Kim
- Department
of Chemical and Biomolecular Engineering, Korea Advanced Institute of Science and Technology (KAIST), 291 Daehak-ro, Yuseong-gu, Daejeon 34141, Republic of Korea
| |
Collapse
|
17
|
Chen LY, Li YP. Machine learning-guided strategies for reaction conditions design and optimization. Beilstein J Org Chem 2024; 20:2476-2492. [PMID: 39376489 PMCID: PMC11457048 DOI: 10.3762/bjoc.20.212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Accepted: 09/19/2024] [Indexed: 10/09/2024] Open
Abstract
This review surveys the recent advances and challenges in predicting and optimizing reaction conditions using machine learning techniques. The paper emphasizes the importance of acquiring and processing large and diverse datasets of chemical reactions, and the use of both global and local models to guide the design of synthetic processes. Global models exploit the information from comprehensive databases to suggest general reaction conditions for new reactions, while local models fine-tune the specific parameters for a given reaction family to improve yield and selectivity. The paper also identifies the current limitations and opportunities in this field, such as the data quality and availability, and the integration of high-throughput experimentation. The paper demonstrates how the combination of chemical engineering, data science, and ML algorithms can enhance the efficiency and effectiveness of reaction conditions design, and enable novel discoveries in synthetic chemistry.
Collapse
Affiliation(s)
- Lung-Yi Chen
- Department of Chemical Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan
| | - Yi-Pei Li
- Department of Chemical Engineering, National Taiwan University, No. 1, Sec. 4, Roosevelt Road, Taipei 10617, Taiwan
- Taiwan International Graduate Program on Sustainable Chemical Science and Technology (TIGP-SCST), No. 128, Sec. 2, Academia Road, Taipei 11529, Taiwan
| |
Collapse
|
18
|
Liu Y, Roccapriore K, Checa M, Valleti SM, Yang JC, Jesse S, Vasudevan RK. AEcroscopy: A Software-Hardware Framework Empowering Microscopy Toward Automated and Autonomous Experimentation. SMALL METHODS 2024; 8:e2301740. [PMID: 38639016 DOI: 10.1002/smtd.202301740] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Revised: 03/31/2024] [Indexed: 04/20/2024]
Abstract
Microscopy has been pivotal in improving the understanding of structure-function relationships at the nanoscale and is by now ubiquitous in most characterization labs. However, traditional microscopy operations are still limited largely by a human-centric click-and-go paradigm utilizing vendor-provided software, which limits the scope, utility, efficiency, effectiveness, and at times reproducibility of microscopy experiments. Here, a coupled software-hardware platform is developed that consists of a software package termed AEcroscopy (short for Automated Experiments in Microscopy), along with a field-programmable-gate-array device with LabView-built customized acquisition scripts, which overcome these limitations and provide the necessary abstractions toward full automation of microscopy platforms. The platform works across multiple vendor devices on scanning probe microscopes and electron microscopes. It enables customized scan trajectories, processing functions that can be triggered locally or remotely on processing servers, user-defined excitation waveforms, standardization of data models, and completely seamless operation through simple Python commands to enable a plethora of microscopy experiments to be performed in a reproducible, automated manner. This platform can be readily coupled with existing machine-learning libraries and simulations, to provide automated decision-making and active theory-experiment optimization to turn microscopes from characterization tools to instruments capable of autonomous model refinement and physics discovery.
Collapse
Affiliation(s)
- Yongtao Liu
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Kevin Roccapriore
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Marti Checa
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Sai Mani Valleti
- Bredesen Center for Interdisciplinary Research, University of Tennessee, Knoxville, TN, 37996, USA
| | - Jan-Chi Yang
- Department of Physics, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Stephen Jesse
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Rama K Vasudevan
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| |
Collapse
|
19
|
van Tilborg D, Grisoni F. Traversing chemical space with active deep learning for low-data drug discovery. NATURE COMPUTATIONAL SCIENCE 2024; 4:786-796. [PMID: 39333789 DOI: 10.1038/s43588-024-00697-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/23/2024] [Accepted: 08/22/2024] [Indexed: 09/30/2024]
Abstract
Deep learning is accelerating drug discovery. However, current approaches are often affected by limitations in the available data, in terms of either size or molecular diversity. Active deep learning has high potential for low-data drug discovery, as it allows iterative model improvement during the screening process. However, there are several 'known unknowns' that limit the wider adoption of active deep learning in drug discovery: (1) what the best computational strategies are for chemical space exploration, (2) how active learning holds up to traditional, non-iterative, approaches and (3) how it should be used in the low-data scenarios typical of drug discovery. To provide answers, this study simulates a low-data drug discovery scenario, and systematically analyzes six active learning strategies combined with two deep learning architectures, on three large-scale molecular libraries. We identify the most important determinants of success in low-data regimes and show that active learning can achieve up to a sixfold improvement in hit discovery when compared with traditional screening methods.
Collapse
Affiliation(s)
- Derek van Tilborg
- Institute for Complex Molecular Systems (ICMS), Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
- Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands
| | - Francesca Grisoni
- Institute for Complex Molecular Systems (ICMS), Department of Biomedical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands.
- Centre for Living Technologies, Alliance TU/e, WUR, UU, UMC Utrecht, Utrecht, The Netherlands.
| |
Collapse
|
20
|
Slautin BN, Liu Y, Funakubo H, Vasudevan RK, Ziatdinov M, Kalinin SV. Bayesian Conavigation: Dynamic Designing of the Material Digital Twins via Active Learning. ACS NANO 2024; 18:24898-24908. [PMID: 39183496 DOI: 10.1021/acsnano.4c05368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/27/2024]
Abstract
Scientific advancement is universally based on the dynamic interplay between theoretical insights, modeling, and experimental discoveries. However, this feedback loop is often slow, including delayed community interactions and the gradual integration of experimental data into theoretical frameworks. This challenge is particularly exacerbated in domains dealing with high-dimensional object spaces, such as molecules and complex microstructures. Hence, the integration of theory within automated and autonomous experimental setups, or theory in the loop-automated experiment, is emerging as a crucial objective for accelerating scientific research. The critical aspect is to use not only theory but also on-the-fly theory updates during the experiment. Here, we introduce a method for integrating theory into the loop through Bayesian conavigation of theoretical model space and experimentation. Our approach leverages the concurrent development of surrogate models for both simulation and experimental domains at the rates determined by latencies and costs of experiments and computation, alongside the adjustment of control parameters within theoretical models to minimize epistemic uncertainty over the experimental object spaces. This methodology facilitates the creation of digital twins of material structures, encompassing both the surrogate model of behavior that includes the correlative part and the theoretical model itself. While being demonstrated here within the context of functional responses in ferroelectric materials, our approach holds promise for broader applications, such as the exploration of optical properties in nanoclusters, microstructure-dependent properties in complex materials, and properties of molecular systems.
Collapse
Affiliation(s)
- Boris N Slautin
- Institute for Materials Science and Center for Nanointegration Duisburg-Essen (CENIDE), University of Duisburg-Essen, Essen 45141, Germany
| | - Yongtao Liu
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Hiroshi Funakubo
- Department of Materials Science and Engineering, Tokyo Institute of Technology, Yokohama 226-8502, Japan
| | - Rama K Vasudevan
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37831, United States
| | - Maxim Ziatdinov
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
| | - Sergei V Kalinin
- Physical Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, United States
- Department of Materials Science and Engineering, University of Tennessee, Knoxville, Tennessee 37996, United States
| |
Collapse
|
21
|
Gormley AJ. Machine learning in drug delivery. J Control Release 2024; 373:23-30. [PMID: 38909704 PMCID: PMC11384327 DOI: 10.1016/j.jconrel.2024.06.045] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 06/17/2024] [Accepted: 06/19/2024] [Indexed: 06/25/2024]
Abstract
For decades, drug delivery scientists have been performing trial-and-error experimentation to manually sample parameter spaces and optimize release profiles through rational design. To enable this approach, scientists spend much of their career learning nuanced drug-material interactions that drive system behavior. In relatively simple systems, rational design criteria allow us to fine tune release profiles and enable efficacious therapies. However, as materials and drugs become increasingly sophisticated and their interactions have non-linear and compounding effects, the field is suffering the Curse of Dimensionality which prevents us from comprehending complex structure-function relationships. In the past, we have embraced this complexity by implementing high-throughput screens to increase the probability of finding ideal compositions. However, this brute force method was inefficient and led many to abandon these fishing expeditions. Fortunately, methods in data science including artificial intelligence / machine learning (AI/ML) are providing ideal analytical tools to model this complex data and ascertain quantitative structure-function relationships. In this Oration, I speak to the potential value of data science in drug delivery with particular focus on polymeric delivery systems. Here, I do not suggest that AI/ML will simply replace mechanistic understanding of complex systems. Rather, I propose that AI/ML should be yet another useful tool in the lab to navigate complex parameter spaces. The recent hype around AI/ML is breathtaking and potentially over inflated, but the value of these methods is poised to revolutionize how we perform science. Therefore, I encourage readers to consider adopting these skills and applying data science methods to their own problems. If done successfully, I believe we will all realize a paradigm shift in our approach to drug delivery.
Collapse
Affiliation(s)
- Adam J Gormley
- Associate Professor, Department of Biomedical Engineering, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, United States.
| |
Collapse
|
22
|
Tom G, Schmid SP, Baird SG, Cao Y, Darvish K, Hao H, Lo S, Pablo-García S, Rajaonson EM, Skreta M, Yoshikawa N, Corapi S, Akkoc GD, Strieth-Kalthoff F, Seifrid M, Aspuru-Guzik A. Self-Driving Laboratories for Chemistry and Materials Science. Chem Rev 2024; 124:9633-9732. [PMID: 39137296 PMCID: PMC11363023 DOI: 10.1021/acs.chemrev.4c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
Self-driving laboratories (SDLs) promise an accelerated application of the scientific method. Through the automation of experimental workflows, along with autonomous experimental planning, SDLs hold the potential to greatly accelerate research in chemistry and materials discovery. This review provides an in-depth analysis of the state-of-the-art in SDL technology, its applications across various scientific disciplines, and the potential implications for research and industry. This review additionally provides an overview of the enabling technologies for SDLs, including their hardware, software, and integration with laboratory infrastructure. Most importantly, this review explores the diverse range of scientific domains where SDLs have made significant contributions, from drug discovery and materials science to genomics and chemistry. We provide a comprehensive review of existing real-world examples of SDLs, their different levels of automation, and the challenges and limitations associated with each domain.
Collapse
Affiliation(s)
- Gary Tom
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Stefan P. Schmid
- Department
of Chemistry and Applied Biosciences, ETH
Zurich, Vladimir-Prelog-Weg 1, CH-8093 Zurich, Switzerland
| | - Sterling G. Baird
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Yang Cao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Kourosh Darvish
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Han Hao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Stanley Lo
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Sergio Pablo-García
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
| | - Ella M. Rajaonson
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Naruki Yoshikawa
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Samantha Corapi
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Gun Deniz Akkoc
- Forschungszentrum
Jülich GmbH, Helmholtz Institute
for Renewable Energy Erlangen-Nürnberg, Cauerstr. 1, 91058 Erlangen, Germany
- Department
of Chemical and Biological Engineering, Friedrich-Alexander Universität Erlangen-Nürnberg, Egerlandstr. 3, 91058 Erlangen, Germany
| | - Felix Strieth-Kalthoff
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- School of
Mathematics and Natural Sciences, University
of Wuppertal, Gaußstraße
20, 42119 Wuppertal, Germany
| | - Martin Seifrid
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Department
of Materials Science and Engineering, North
Carolina State University, Raleigh, North Carolina 27695, United States of America
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science & Engineering, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research (CIFAR), 661
University Ave, Toronto, Ontario M5G 1M1, Canada
| |
Collapse
|
23
|
Sato T, Masuda K, Sano C, Matsumoto K, Numata H, Munetoh S, Kasama T, Miyake R. Democratizing Microreactor Technology for Accelerated Discoveries in Chemistry and Materials Research. MICROMACHINES 2024; 15:1064. [PMID: 39337724 PMCID: PMC11434323 DOI: 10.3390/mi15091064] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2024] [Revised: 08/11/2024] [Accepted: 08/14/2024] [Indexed: 09/30/2024]
Abstract
Microreactor technologies have emerged as versatile platforms with the potential to revolutionize chemistry and materials research, offering sustainable solutions to global challenges in environmental and health domains. This survey paper provides an in-depth review of recent advancements in microreactor technologies, focusing on their role in facilitating accelerated discoveries in chemistry and materials. Specifically, we examine the convergence of microfluidics with machine intelligence and automation, enabling the exploitation of the cyber-physical environment as a highly integrated experimentation platform for rapid scientific discovery and process development. We investigate the applicability and limitations of microreactor-enabled discovery accelerators in various chemistry and materials contexts. Despite their tremendous potential, the integration of machine intelligence and automation into microreactor-based experiments presents challenges in establishing fully integrated, automated, and intelligent systems. These challenges can hinder the broader adoption of microreactor technologies within the research community. To address this, we review emerging technologies that can help lower barriers and facilitate the implementation of microreactor-enabled discovery accelerators. Lastly, we provide our perspective on future research directions for democratizing microreactor technologies, with the aim of accelerating scientific discoveries and promoting widespread adoption of these transformative platforms.
Collapse
Affiliation(s)
- Tomomi Sato
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Koji Masuda
- Department of Physics and Astronomy, University of Exeter, Exeter EX4 4QL, UK;
| | - Chikako Sano
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Keiji Matsumoto
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Hidetoshi Numata
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Seiji Munetoh
- IBM Semiconductors, IBM Research–Tokyo, Kawasaki 212-0032, Japan; (C.S.); (K.M.); (H.N.); (S.M.)
| | - Toshihiro Kasama
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| | - Ryo Miyake
- Graduate School of Engineering, The University of Tokyo, Kawasaki 212-0032, Japan; (T.K.); (R.M.)
| |
Collapse
|
24
|
Yang X, Gong B, Chen W, Chen J, Qian C, Lu R, Min Y, Jiang T, Li L, Yu H. In Situ Quantitative Monitoring of Adsorption from Aqueous Phase by UV-vis Spectroscopy: Implication for Understanding of Heterogeneous Processes. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2402732. [PMID: 38923364 PMCID: PMC11348127 DOI: 10.1002/advs.202402732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 06/05/2024] [Indexed: 06/28/2024]
Abstract
The development of in situ techniques to quantitatively characterize the heterogeneous reactions is essential for understanding physicochemical processes in aqueous phase. In this work, a new approach coupling in situ UV-vis spectroscopy with a two-step algorithm strategy is developed to quantitatively monitor heterogeneous reactions in a compact closed-loop incorporation. The algorithm involves the inverse adding-doubling method for light scattering correction and the multivariate curve resolution-alternating least squares (MCR-ALS) method for spectral deconvolution. Innovatively, theoretical spectral simulations are employed to connect MCR-ALS solutions with chemical molecular structural evolution without prior information for reference spectra. As a model case study, the aqueous adsorption kinetics of bisphenol A onto polyamide microparticles are successfully quantified in a one-step UV-vis spectroscopic measurement. The practical applicability of this approach is confirmed by rapidly screening a superior adsorbent from commercial materials for antibiotic wastewater adsorption treatment. The demonstrated capabilities are expected to extend beyond monitoring adsorption systems to other heterogeneous reactions, significantly advancing UV-vis spectroscopic techniques toward practical integration into automated experimental platforms for probing aqueous chemical processes and beyond.
Collapse
Affiliation(s)
- Xu‐Dan Yang
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Bo Gong
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Wei Chen
- School of Metallurgy and EnvironmentCentral South UniversityChangsha410083China
| | - Jie‐Jie Chen
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Chen Qian
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Rui Lu
- School of Environmental and Biological EngineeringNanjing University of Science and TechnologyNanjing210094China
| | - Yuan Min
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Ting Jiang
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Liang Li
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| | - Han‐Qing Yu
- CAS Key Laboratory of Urban Pollutant ConversionDepartment of Environmental Science and EngineeringUniversity of Science and Technology of ChinaHefei230026China
| |
Collapse
|
25
|
Carrer M, Cezar HM, Bore SL, Ledum M, Cascella M. Learning Force Field Parameters from Differentiable Particle-Field Molecular Dynamics. J Chem Inf Model 2024; 64:5510-5520. [PMID: 38963184 PMCID: PMC11267579 DOI: 10.1021/acs.jcim.4c00564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2024] [Revised: 05/15/2024] [Accepted: 06/12/2024] [Indexed: 07/05/2024]
Abstract
We develop ∂-HylleraasMD (∂-HyMD), a fully end-to-end differentiable molecular dynamics software based on the Hamiltonian hybrid particle-field formalism, and use it to establish a protocol for automated optimization of force field parameters. ∂-HyMD is templated on the recently released HylleraaasMD software, while using the JAX autodiff framework as the main engine for the differentiable dynamics. ∂-HyMD exploits an embarrassingly parallel optimization algorithm by spawning independent simulations, whose trajectories are simultaneously processed by reverse mode automatic differentiation to calculate the gradient of the loss function, which is in turn used for iterative optimization of the force-field parameters. We show that parallel organization facilitates the convergence of the minimization procedure, avoiding the known memory and numerical stability issues of differentiable molecular dynamics approaches. We showcase the effectiveness of our implementation by producing a library of force field parameters for standard phospholipids, with either zwitterionic or anionic heads and with saturated or unsaturated tails. Compared to the all-atom reference, the force field obtained by ∂-HyMD yields better density profiles than the parameters derived from previously utilized gradient-free optimization procedures. Moreover, ∂-HyMD models can predict with good accuracy properties not included in the learning objective, such as lateral pressure profiles, and are transferable to other systems, including triglycerides.
Collapse
Affiliation(s)
- Manuel Carrer
- Hylleraas Centre for Quantum Molecular
Sciences and Department of Chemistry, University
of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Henrique Musseli Cezar
- Hylleraas Centre for Quantum Molecular
Sciences and Department of Chemistry, University
of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Sigbjørn Løland Bore
- Hylleraas Centre for Quantum Molecular
Sciences and Department of Chemistry, University
of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Morten Ledum
- Hylleraas Centre for Quantum Molecular
Sciences and Department of Chemistry, University
of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| | - Michele Cascella
- Hylleraas Centre for Quantum Molecular
Sciences and Department of Chemistry, University
of Oslo, PO Box 1033, Blindern, 0315 Oslo, Norway
| |
Collapse
|
26
|
Kissman EN, Sosa MB, Millar DC, Koleski EJ, Thevasundaram K, Chang MCY. Expanding chemistry through in vitro and in vivo biocatalysis. Nature 2024; 631:37-48. [PMID: 38961155 DOI: 10.1038/s41586-024-07506-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 05/01/2024] [Indexed: 07/05/2024]
Abstract
Living systems contain a vast network of metabolic reactions, providing a wealth of enzymes and cells as potential biocatalysts for chemical processes. The properties of protein and cell biocatalysts-high selectivity, the ability to control reaction sequence and operation in environmentally benign conditions-offer approaches to produce molecules at high efficiency while lowering the cost and environmental impact of industrial chemistry. Furthermore, biocatalysis offers the opportunity to generate chemical structures and functions that may be inaccessible to chemical synthesis. Here we consider developments in enzymes, biosynthetic pathways and cellular engineering that enable their use in catalysis for new chemistry and beyond.
Collapse
Affiliation(s)
- Elijah N Kissman
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | - Max B Sosa
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | - Douglas C Millar
- Department of Chemical and Biomolecular Engineering, University of California Berkeley, Berkeley, CA, USA
| | - Edward J Koleski
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | | | - Michelle C Y Chang
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA.
- Department of Chemical and Biomolecular Engineering, University of California Berkeley, Berkeley, CA, USA.
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA.
- Department of Chemistry, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
27
|
Yao W, Ye XS. Donor Preactivation-Based Glycan Assembly: from Manual to Automated Synthesis. Acc Chem Res 2024; 57:1577-1594. [PMID: 38623919 DOI: 10.1021/acs.accounts.4c00072] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/17/2024]
Abstract
Carbohydrates are called the third chain of life. Carbohydrates participate in many important biochemical functions in living species, and the biological information carried by them is several orders of magnitude larger than that of nucleic acids and proteins. However, due to the intrinsic complexity and heterogeneity of carbohydrate structures, furnishing pure and structurally well-defined glycans for functional studies is a formidable task, especially for homogeneous large-size glycans. To address this issue, we have developed a donor preactivation-based one-pot glycosylation strategy enabling multiple sequential glycosylations in a single reaction vessel.The donor preactivation-based one-pot glycosylation refers to the strategy in which the glycosyl donor is activated in the absence of a glycosyl acceptor to generate a reactive intermediate. Subsequently, the glycosyl acceptor with the same anomeric leaving group is added, leading to a glycosyl coupling reaction, which is then iterated to rapidly achieve the desired glycan in the same reactor. The advantages of this strategy include the following: (1) unique chemoselectivity is obtained after preactivation; (2) it is independent of the reactivity of glycosyl donors; (3) multiple-step glycosylations are enabled without the need for intermediate purification; (4) only stoichiometric building blocks are required without complex protecting group manipulations. Using this protocol, a range of glycans including tumor-associated carbohydrate antigens, various glycosaminoglycans, complex N-glycans, and diverse bacterial glycans have been synthesized manually. Gratifyingly, the synthesis of mycobacterial arabinogalactan containing 92 monosaccharide units has been achieved, which created a precedent in the field of polysaccharide synthesis. Recently, the synthesis of a highly branched arabinogalactan from traditional Chinese medicine featuring 140 monosaccharide units has been also accomplished to evaluate its anti-pancreatic-cancer activity. In the spirit of green and sustainable chemistry, this strategy can also be applied to light-driven glycosylation reactions, where either UV or visible light can be used for the activation of glycosyl donors.Automated synthesis is an advanced approach to the construction of complex glycans. Based on the two preactivation modes (general promoter activation mode and light-induced activation mode), a universal and highly efficient automated solution-phase synthesizer was further developed to drive glycan assembly from manual to automated synthesis. Using this synthesizer, a library of oligosaccharides covering various glycoforms and glycosidic linkages was assembled rapidly, either in a general promoter-activation mode or in a light-induced-activation mode. The automated synthesis of a fully protected fondaparinux pentasaccharide was realized on a gram scale. Furthermore, the automated synthesis of large-size polysaccharides was performed, allowing the assembly of arabinans up to an astonishing 1080-mer using the automated multiplicative synthesis strategy, taking glycan synthesis to a new height far beyond the synthesis of nucleic acids (up to 200-mer) and proteins (up to 472-mer).
Collapse
Affiliation(s)
- Wenlong Yao
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, and Chemical Biology Center, Peking University, Xue Yuan Road No. 38, Beijing 100191, China
| | - Xin-Shan Ye
- State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, and Chemical Biology Center, Peking University, Xue Yuan Road No. 38, Beijing 100191, China
- National Research Center for Carbohydrate Synthesis, Jiangxi Normal University, Nanchang 330022, China
| |
Collapse
|
28
|
da Silva RGL. The advancement of artificial intelligence in biomedical research and health innovation: challenges and opportunities in emerging economies. Global Health 2024; 20:44. [PMID: 38773458 PMCID: PMC11107016 DOI: 10.1186/s12992-024-01049-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2023] [Accepted: 04/30/2024] [Indexed: 05/23/2024] Open
Abstract
The advancement of artificial intelligence (AI), algorithm optimization and high-throughput experiments has enabled scientists to accelerate the discovery of new chemicals and materials with unprecedented efficiency, resilience and precision. Over the recent years, the so-called autonomous experimentation (AE) systems are featured as key AI innovation to enhance and accelerate research and development (R&D). Also known as self-driving laboratories or materials acceleration platforms, AE systems are digital platforms capable of running a large number of experiments autonomously. Those systems are rapidly impacting biomedical research and clinical innovation, in areas such as drug discovery, nanomedicine, precision oncology, and others. As it is expected that AE will impact healthcare innovation from local to global levels, its implications for science and technology in emerging economies should be examined. By examining the increasing relevance of AE in contemporary R&D activities, this article aims to explore the advancement of artificial intelligence in biomedical research and health innovation, highlighting its implications, challenges and opportunities in emerging economies. AE presents an opportunity for stakeholders from emerging economies to co-produce the global knowledge landscape of AI in health. However, asymmetries in R&D capabilities should be acknowledged since emerging economies suffers from inadequacies and discontinuities in resources and funding. The establishment of decentralized AE infrastructures could support stakeholders to overcome local restrictions and opens venues for more culturally diverse, equitable, and trustworthy development of AI in health-related R&D through meaningful partnerships and engagement. Collaborations with innovators from emerging economies could facilitate anticipation of fiscal pressures in science and technology policies, obsolescence of knowledge infrastructures, ethical and regulatory policy lag, and other issues present in the Global South. Also, improving cultural and geographical representativeness of AE contributes to foster the diffusion and acceptance of AI in health-related R&D worldwide. Institutional preparedness is critical and could enable stakeholders to navigate opportunities of AI in biomedical research and health innovation in the coming years.
Collapse
Affiliation(s)
- Renan Gonçalves Leonel da Silva
- Health Ethics and Policy Lab, Department of Health Sciences and Technology, ETH Zurich, Hottingerstrasse 10, HOA 17, Zurich, 8092, Switzerland.
| |
Collapse
|
29
|
Strieth-Kalthoff F, Hao H, Rathore V, Derasp J, Gaudin T, Angello NH, Seifrid M, Trushina E, Guy M, Liu J, Tang X, Mamada M, Wang W, Tsagaantsooj T, Lavigne C, Pollice R, Wu TC, Hotta K, Bodo L, Li S, Haddadnia M, Wołos A, Roszak R, Ser CT, Bozal-Ginesta C, Hickman RJ, Vestfrid J, Aguilar-Granda A, Klimareva EL, Sigerson RC, Hou W, Gahler D, Lach S, Warzybok A, Borodin O, Rohrbach S, Sanchez-Lengeling B, Adachi C, Grzybowski BA, Cronin L, Hein JE, Burke MD, Aspuru-Guzik A. Delocalized, asynchronous, closed-loop discovery of organic laser emitters. Science 2024; 384:eadk9227. [PMID: 38753786 DOI: 10.1126/science.adk9227] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 04/05/2024] [Indexed: 05/18/2024]
Abstract
Contemporary materials discovery requires intricate sequences of synthesis, formulation, and characterization that often span multiple locations with specialized expertise or instrumentation. To accelerate these workflows, we present a cloud-based strategy that enabled delocalized and asynchronous design-make-test-analyze cycles. We showcased this approach through the exploration of molecular gain materials for organic solid-state lasers as a frontier application in molecular optoelectronics. Distributed robotic synthesis and in-line property characterization, orchestrated by a cloud-based artificial intelligence experiment planner, resulted in the discovery of 21 new state-of-the-art materials. Gram-scale synthesis ultimately allowed for the verification of best-in-class stimulated emission in a thin-film device. Demonstrating the asynchronous integration of five laboratories across the globe, this workflow provides a blueprint for delocalizing-and democratizing-scientific discovery.
Collapse
Affiliation(s)
- Felix Strieth-Kalthoff
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Han Hao
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
| | - Vandana Rathore
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Joshua Derasp
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Théophile Gaudin
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Nicholas H Angello
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Martin Seifrid
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Department of Materials Science and Engineering, North Carolina State University, Raleigh, NC, USA
| | | | - Mason Guy
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Junliang Liu
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
| | - Xun Tang
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Masashi Mamada
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Wesley Wang
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Tuul Tsagaantsooj
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Cyrille Lavigne
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Robert Pollice
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Tony C Wu
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Kazuhiro Hotta
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Mitsubishi Chemical Corporation Science & Innovation Center, Kanagawa, Japan
| | - Leticia Bodo
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Shangyu Li
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Mohammad Haddadnia
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
| | - Agnieszka Wołos
- Allchemy Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Rafał Roszak
- Allchemy Inc., Highland, IN, USA
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
| | - Cher Tian Ser
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Carlota Bozal-Ginesta
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Catalonia Institute for Energy Research, Barcelona, Spain
| | - Riley J Hickman
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Jenya Vestfrid
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Andrés Aguilar-Granda
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | | | | | - Wenduan Hou
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Daniel Gahler
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Slawomir Lach
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Adrian Warzybok
- School of Chemistry, University of Glasgow, Glasgow, UK
- Department of Chemical Physics, Jagiellonian University, Krakow, Poland
| | - Oleg Borodin
- School of Chemistry, University of Glasgow, Glasgow, UK
| | | | | | - Chihaya Adachi
- Center for Organic Photonics and Electronics Research (OPERA), Kyushu University, Fukuoka, Japan
| | - Bartosz A Grzybowski
- Institute of Organic Chemistry, Polish Academy of Sciences, Warsaw, Poland
- Center for Algorithmic and Robotized Synthesis, Institute for Basic Science, Ulsan, Republic of Korea
- Department of Chemistry, Ulsan Institute of Science and Technology, Ulsan, Republic of Korea
| | - Leroy Cronin
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- School of Chemistry, University of Glasgow, Glasgow, UK
| | - Jason E Hein
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Department of Chemistry, University of British Columbia, Vancouver, BC, Canada
- Department of Chemistry, University of Bergen, Bergen, Norway
| | - Martin D Burke
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Department of Chemistry, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Molecule Maker Lab Institute, Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Cancer Center at Illinois, University of Illinois at Urbana-Champaign, Urbana, IL, USA
- Carle Illinois College of Medicine, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
- Acceleration Consortium, University of Toronto, Toronto, ON, Canada
- Vector Institute for Artificial Intelligence, Toronto, ON, Canada
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
- Department of Materials Science and Engineering, University of Toronto, Toronto, ON, Canada
- Canadian Institute for Advanced Research (CIFAR), Toronto, ON, Canada
| |
Collapse
|
30
|
Zhang J, Hauch JA, Brabec CJ. Toward Self-Driven Autonomous Material and Device Acceleration Platforms (AMADAP) for Emerging Photovoltaics Technologies. Acc Chem Res 2024; 57:1434-1445. [PMID: 38652511 PMCID: PMC11079961 DOI: 10.1021/acs.accounts.4c00095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2024] [Revised: 04/11/2024] [Accepted: 04/16/2024] [Indexed: 04/25/2024]
Abstract
ConspectusIn the ever-increasing renewable-energy demand scenario, developing new photovoltaic technologies is important, even in the presence of established terawatt-scale silicon technology. Emerging photovoltaic technologies play a crucial role in diversifying material flows while expanding the photovoltaic product portfolio, thus enhancing security and competitiveness within the solar industry. They also serve as a valuable backup for silicon photovoltaic, providing resilience to the overall energy infrastructure. However, the development of functional solar materials poses intricate multiobjective optimization challenges in a large multidimensional composition and parameter space, in some cases with millions of potential candidates to be explored. Solving it necessitates reproducible, user-independent laboratory work and intelligent preselection of innovative experimental methods.Materials acceleration platforms (MAPs) seamlessly integrate robotic materials synthesis and characterization with AI-driven data analysis and experimental design, positioning them as enabling technologies for the discovery and exploration of new materials. They are proposed to revolutionize materials development away from the Edisonian trial-and-error approaches to ultrashort cycles of experiments with exceptional precision, generating a reliable and highly qualitative data situation that allows training machine learning algorithms with predictive power. MAPs are designed to assist the researcher in multidimensional aspects of materials discovery, such as material synthesis, precursor preparation, sample processing and characterization, and data analysis, and are drawing escalating attention in the field of energy materials. Device acceleration platforms (DAPs), however, are designed to optimize functional films and layer stacks. Unlike MAPs, which focus on material discovery, a central aspect of DAPs is the identification and refinement of ideal processing conditions for a predetermined set of materials. Such platforms prove especially invaluable when dealing with "disordered semiconductors," which depend heavily on the processing parameters that ultimately define the functional properties and functionality of thin film layers. By facilitating the fine-tuning of processing conditions, DAPs contribute significantly to the advancement and optimization of disordered semiconductor devices, such as emerging photovoltaics.In this Account, we review the recent advancements made by our group in automated and autonomous laboratories for advanced material discovery and device optimization with a strong focus on emerging photovoltaics, such as solution-processing perovskite solar cells and organic photovoltaics. We first introduce two MAPs and two DAPs developed in-house: a microwave-assisted high-throughput synthesis platform for the discovery of organic interface materials, a multipurpose robot-based pipetting platform for the synthesis of new semiconductors and the characterization of thin film semiconductor composites, the SPINBOT system, which is a spin-coating DAP with the potential to optimize complex device architectures, and finally, AMANDA, a fully integrated and autonomously operating DAP. Notably, we underscore the utilization of a robot-based high-throughput experimentation technique to address the common optimization challenges encountered in extensive multidimensional composition and parameter spaces pertaining to organic and perovskite photovoltaics materials. Finally, we briefly propose a holistic concept and technology, a self-driven autonomous material and device acceleration platform (AMADAP) laboratory, for autonomous functional solar materials discovery and development. We hope to discover how AMADAP can be further strengthened and universalized with advancing development of hardware and software infrastructures in the future.
Collapse
Affiliation(s)
- Jiyun Zhang
- Forschungszentrum
Juelich GmbH, Helmholtz-Institute Erlangen-Nürnberg (HI ERN), Department of High Throughput Methods in Photovoltaics, Immerwahrstraße 2, 91058 Erlangen, Germany
- Friedrich-Alexander-University
Erlangen-Nuremberg, Faculty of Engineering, Department of Material
Science, Institute of Materials for Electronics
and Energy Technology (i-MEET), Martensstrasse 7, 91058 Erlangen, Germany
| | - Jens A. Hauch
- Forschungszentrum
Juelich GmbH, Helmholtz-Institute Erlangen-Nürnberg (HI ERN), Department of High Throughput Methods in Photovoltaics, Immerwahrstraße 2, 91058 Erlangen, Germany
| | - Christoph J. Brabec
- Forschungszentrum
Juelich GmbH, Helmholtz-Institute Erlangen-Nürnberg (HI ERN), Department of High Throughput Methods in Photovoltaics, Immerwahrstraße 2, 91058 Erlangen, Germany
- Friedrich-Alexander-University
Erlangen-Nuremberg, Faculty of Engineering, Department of Material
Science, Institute of Materials for Electronics
and Energy Technology (i-MEET), Martensstrasse 7, 91058 Erlangen, Germany
| |
Collapse
|
31
|
Padula D. A Computational Perspective on the Reactivity of π-spacers in Self-Immolative Elimination Reactions. Chem Asian J 2024; 19:e202400010. [PMID: 38407472 DOI: 10.1002/asia.202400010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Revised: 02/16/2024] [Accepted: 02/26/2024] [Indexed: 02/27/2024]
Abstract
The controlled release of chemicals, especially in drug delivery, is crucial, often employing "self-immolative" spacers to enhance reliability. These spacers separate the payload from the protecting group, ensuring a more controlled release. Over the years, design rules have been proposed to improve the elimination process's reaction rate by modifying spacers with electron-donating groups or reducing their aromaticity. The spacer design is critical for determining the range of functional groups released during this process. This study explores various strategies from the literature aimed at improving release rates, focusing on the electronic nature of the spacer, its aromaticity, the electronic nature of its substituents, and the leaving groups involved in the elimination reaction. Through computational analysis, I investigate activation free energies by identifying transition states for model reactions. My calculations align qualitatively with experimental results, demonstrating the feasibility and reliability of computationally pre-screening model self-immolative eliminations. This approach allows proposing optimal combinations of spacer and leaving group for achieving the highest possible release rate.
Collapse
Affiliation(s)
- Daniele Padula
- Dipartimento di Biotecnologie, Chimica e Farmacia, Università di Siena, Via A. Moro 2, 53100, Siena, Italy
| |
Collapse
|
32
|
Sheng H, Sun J, Rodríguez O, Hoar BB, Zhang W, Xiang D, Tang T, Hazra A, Min DS, Doyle AG, Sigman MS, Costentin C, Gu Q, Rodríguez-López J, Liu C. Autonomous closed-loop mechanistic investigation of molecular electrochemistry via automation. Nat Commun 2024; 15:2781. [PMID: 38555303 PMCID: PMC10981680 DOI: 10.1038/s41467-024-47210-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Accepted: 03/18/2024] [Indexed: 04/02/2024] Open
Abstract
Electrochemical research often requires stringent combinations of experimental parameters that are demanding to manually locate. Recent advances in automated instrumentation and machine-learning algorithms unlock the possibility for accelerated studies of electrochemical fundamentals via high-throughput, online decision-making. Here we report an autonomous electrochemical platform that implements an adaptive, closed-loop workflow for mechanistic investigation of molecular electrochemistry. As a proof-of-concept, this platform autonomously identifies and investigates an EC mechanism, an interfacial electron transfer (E step) followed by a solution reaction (C step), for cobalt tetraphenylporphyrin exposed to a library of organohalide electrophiles. The generally applicable workflow accurately discerns the EC mechanism's presence amid negative controls and outliers, adaptively designs desired experimental conditions, and quantitatively extracts kinetic information of the C step spanning over 7 orders of magnitude, from which mechanistic insights into oxidative addition pathways are gained. This work opens opportunities for autonomous mechanistic discoveries in self-driving electrochemistry laboratories without manual intervention.
Collapse
Affiliation(s)
- Hongyuan Sheng
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| | - Jingwen Sun
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Oliver Rodríguez
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, IL, 60439, USA
| | - Benjamin B Hoar
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Weitong Zhang
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Danlei Xiang
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Tianhua Tang
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | - Avijit Hazra
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | - Daniel S Min
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Abigail G Doyle
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, Salt Lake City, UT, 84112, USA
| | | | - Quanquan Gu
- Department of Computer Science, University of California, Los Angeles, Los Angeles, CA, 90095, USA
| | - Joaquín Rodríguez-López
- Department of Chemistry, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Beckman Institute for Advanced Science and Technology, University of Illinois Urbana-Champaign, Urbana, IL, 61801, USA
- Joint Center for Energy Storage Research (JCESR), Argonne National Laboratory, Lemont, IL, 60439, USA
| | - Chong Liu
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
- California NanoSystems Institute, University of California, Los Angeles, Los Angeles, CA, 90095, USA.
| |
Collapse
|
33
|
Rao A, Grzelczak M. Revisiting El-Sayed Synthesis: Bayesian Optimization for Revealing New Insights during the Growth of Gold Nanorods. CHEMISTRY OF MATERIALS : A PUBLICATION OF THE AMERICAN CHEMICAL SOCIETY 2024; 36:2577-2587. [PMID: 38680830 PMCID: PMC11049742 DOI: 10.1021/acs.chemmater.4c00271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 02/16/2024] [Accepted: 02/16/2024] [Indexed: 05/01/2024]
Abstract
In diverse fields, machine learning (ML) has sparked transformative changes, primarily driven by the wealth of big data. However, an alternative approach seeks to mine insights from "precious data", offering the possibility to reveal missed knowledge and escape potential knowledge traps. In this context, Bayesian optimization (BO) protocols have emerged as crucial tools for optimizing the synthesis and discovery of a broad spectrum of compounds including nanoparticles. In our work, we aimed to go beyond the commonly explored experimental conditions and showcase a workflow capable of unearthing fresh insights, even in well-studied research domains. The growth of AuNRs is a nonequilibrium process that remains poorly understood despite the presence of well-established seeded growth protocols. Traditional research aimed at understanding the mechanism of AuNR growth has primarily relied on altering one reaction condition at a time. While these studies are undeniably valuable, they often fail to capture the synergies between different reaction conditions, thus constraining the depth of insights they can offer. In the present study, we exploit BO, to identify diverse experimental conditions yielding AuNRs with similar spectroscopic characteristics. Notably, we identify viable and accelerated synthesis conditions involving elevated temperatures (36-40 °C) as well as high ascorbic acid concentrations. More importantly, we note that ascorbic acid and temperature can modulate each other's undesirable influences on the growth of AuNRs. Finally, by harnessing the power of interpretable ML algorithms, complemented by our deep chemical understanding, we revisited the established hierarchical relationships among reaction conditions that impact the El-Sayed-based growth of AuNRs.
Collapse
Affiliation(s)
- Anish Rao
- Centro
de Física de Materiales CSIC-UPV/EHU, Paseo Manuel de Lardizabal 5, 20018 Donostia San-Sebastián, Spain
| | - Marek Grzelczak
- Centro
de Física de Materiales CSIC-UPV/EHU, Paseo Manuel de Lardizabal 5, 20018 Donostia San-Sebastián, Spain
- Donostia
International Physics Center (DIPC), Paseo Manuel de Lardizabal 4, 20018 Donostia-San Sebastián, Spain
| |
Collapse
|
34
|
Back S, Aspuru-Guzik A, Ceriotti M, Gryn'ova G, Grzybowski B, Gu GH, Hein J, Hippalgaonkar K, Hormázabal R, Jung Y, Kim S, Kim WY, Moosavi SM, Noh J, Park C, Schrier J, Schwaller P, Tsuda K, Vegge T, von Lilienfeld OA, Walsh A. Accelerated chemical science with AI. DIGITAL DISCOVERY 2024; 3:23-33. [PMID: 38239898 PMCID: PMC10793638 DOI: 10.1039/d3dd00213f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 12/06/2023] [Indexed: 01/22/2024]
Abstract
In light of the pressing need for practical materials and molecular solutions to renewable energy and health problems, to name just two examples, one wonders how to accelerate research and development in the chemical sciences, so as to address the time it takes to bring materials from initial discovery to commercialization. Artificial intelligence (AI)-based techniques, in particular, are having a transformative and accelerating impact on many if not most, technological domains. To shed light on these questions, the authors and participants gathered in person for the ASLLA Symposium on the theme of 'Accelerated Chemical Science with AI' at Gangneung, Republic of Korea. We present the findings, ideas, comments, and often contentious opinions expressed during four panel discussions related to the respective general topics: 'Data', 'New applications', 'Machine learning algorithms', and 'Education'. All discussions were recorded, transcribed into text using Open AI's Whisper, and summarized using LG AI Research's EXAONE LLM, followed by revision by all authors. For the broader benefit of current researchers, educators in higher education, and academic bodies such as associations, publishers, librarians, and companies, we provide chemistry-specific recommendations and summarize the resulting conclusions.
Collapse
Affiliation(s)
- Seoin Back
- Department of Chemical and Biomolecular Engineering, Institute of Emergent Materials, Sogang University Seoul Republic of Korea
| | - Alán Aspuru-Guzik
- Departments of Chemistry, Computer Science, University of Toronto St. George Campus Toronto ON Canada
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling (COSMO), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Ganna Gryn'ova
- Heidelberg Institute for Theoretical Studies (HITS gGmbH) 69118 Heidelberg Germany
- Interdisciplinary Center for Scientific Computing, Heidelberg University 69120 Heidelberg Germany
| | - Bartosz Grzybowski
- Center for Algorithmic and Robotized Synthesis (CARS), Institute for Basic Science (IBS) Ulsan Republic of Korea
- Institute of Organic Chemistry, Polish Academy of Sciences Warsaw Poland
- Department of Chemistry, Ulsan National Institute of Science and Technology Ulsan Republic of Korea
| | - Geun Ho Gu
- Department of Energy Engineering, Korea Institute of Energy Technology (KENTECH) Naju 58330 Republic of Korea
| | - Jason Hein
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Kedar Hippalgaonkar
- School of Materials Science and Engineering, Nanyang Technological University 50 Nanyang Avenue Singapore 639798 Singapore
- Institute of Materials Research and Engineering, Agency for Science Technology and Research 2 Fusionopolis Way, 08-03 Singapore 138634 Singapore
| | | | - Yousung Jung
- Department of Chemical and Biomolecular Engineering, KAIST Daejeon Republic of Korea
- School of Chemical and Biological Engineering, Interdisciplinary Program in Artificial Intelligence, Seoul National University 1 Gwanak-ro, Gwanak-gu Seoul 08826 Republic of Korea
| | - Seonah Kim
- Department of Chemistry, Colorado State University 1301 Center Avenue Fort Collins CO 80523 USA
| | - Woo Youn Kim
- Department of Chemistry, KAIST Daejeon Republic of Korea
| | - Seyed Mohamad Moosavi
- Chemical Engineering & Applied Chemistry, University of Toronto Toronto Ontario M5S 3E5 Canada
| | - Juhwan Noh
- Chemical Data-Driven Research Center, Korea Research Institute of Chemical Technology Daejeon 34114 Republic of Korea
| | | | - Joshua Schrier
- Department of Chemistry, Fordham University The Bronx NY 10458 USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence (LIAC) & National Centre of Competence in Research (NCCR) Catalysis, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Koji Tsuda
- Graduate School of Frontier Sciences, The University of Tokyo Kashiwa Chiba 277-8561 Japan
- Center for Basic Research on Materials, National Institute for Materials Science Tsukuba Ibaraki 305-0044 Japan
- RIKEN Center for Advanced Intelligence Project Tokyo 103-0027 Japan
| | - Tejs Vegge
- Department of Energy Conversion and Storage, Technical University of Denmark 301 Anker Engelunds vej, Kongens Lyngby Copenhagen 2800 Denmark
| | - O Anatole von Lilienfeld
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
- Departments of Chemistry, Materials Science and Engineering, and Physics, University of Toronto, St George Campus Toronto ON Canada
- Machine Learning Group, Technische Universität Berlin and Berlin Institute for the Foundations of Learning and Data 10587 Berlin Germany
| | - Aron Walsh
- Department of Materials, Imperial College London London SW7 2AZ UK
- Department of Physics, Ewha Women's University Seoul Republic of Korea
| |
Collapse
|
35
|
Qiu H, Liu L, Qiu X, Dai X, Ji X, Sun ZY. PolyNC: a natural and chemical language model for the prediction of unified polymer properties. Chem Sci 2024; 15:534-544. [PMID: 38179518 PMCID: PMC10763023 DOI: 10.1039/d3sc05079c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2023] [Accepted: 12/04/2023] [Indexed: 01/06/2024] Open
Abstract
Language models exhibit a profound aptitude for addressing multimodal and multidomain challenges, a competency that eludes the majority of off-the-shelf machine learning models. Consequently, language models hold great potential for comprehending the intricate interplay between material compositions and diverse properties, thereby accelerating material design, particularly in the realm of polymers. While past limitations in polymer data hindered the use of data-intensive language models, the growing availability of standardized polymer data and effective data augmentation techniques now opens doors to previously uncharted territories. Here, we present a revolutionary model to enable rapid and precise prediction of Polymer properties via the power of Natural language and Chemical language (PolyNC). To showcase the efficacy of PolyNC, we have meticulously curated a labeled prompt-structure-property corpus encompassing 22 970 polymer data points on a series of essential polymer properties. Through the use of natural language prompts, PolyNC gains a comprehensive understanding of polymer properties, while employing chemical language (SMILES) to describe polymer structures. In a unified text-to-text manner, PolyNC consistently demonstrates exceptional performance on both regression tasks (such as property prediction) and the classification task (polymer classification). Simultaneous and interactive multitask learning enables PolyNC to holistically grasp the structure-property relationships of polymers. Through a combination of experiments and characterizations, the generalization ability of PolyNC has been demonstrated, with attention analysis further indicating that PolyNC effectively learns structural information about polymers from multimodal inputs. This work provides compelling evidence of the potential for deploying end-to-end language models in polymer research, representing a significant advancement in the AI community's dedicated pursuit of advancing polymer science.
Collapse
Affiliation(s)
- Haoke Qiu
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| | - Lunyang Liu
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xuepeng Qiu
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
- CAS Key Laboratory of High-Performance Synthetic Rubber and its Composite Materials, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xuemin Dai
- CAS Key Laboratory of High-Performance Synthetic Rubber and its Composite Materials, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
| | - Xiangling Ji
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| | - Zhao-Yan Sun
- State Key Laboratory of Polymer Physics and Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences Changchun 130022 China
- School of Applied Chemistry and Engineering, University of Science and Technology of China Hefei 230026 China
| |
Collapse
|
36
|
Dias AL, Rodrigues T. Large language models direct automated chemistry laboratory. Nature 2023; 624:530-531. [PMID: 38123802 DOI: 10.1038/d41586-023-03790-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
|
37
|
Ting JM, Tamayo-Mendoza T, Petersen SR, Van Reet J, Ahmed UA, Snell NJ, Fisher JD, Stern M, Oviedo F. Frontiers in nonviral delivery of small molecule and genetic drugs, driven by polymer chemistry and machine learning for materials informatics. Chem Commun (Camb) 2023; 59:14197-14209. [PMID: 37955165 DOI: 10.1039/d3cc04705a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2023]
Abstract
Materials informatics (MI) has immense potential to accelerate the pace of innovation and new product development in biotechnology. Close collaborations between skilled physical and life scientists with data scientists are being established in pursuit of leveraging MI tools in automation and artificial intelligence (AI) to predict material properties in vitro and in vivo. However, the scarcity of large, standardized, and labeled materials data for connecting structure-function relationships represents one of the largest hurdles to overcome. In this Highlight, focus is brought to emerging developments in polymer-based therapeutic delivery platforms, where teams generate large experimental datasets around specific therapeutics and successfully establish a design-to-deployment cycle of specialized nanocarriers. Three select collaborations demonstrate how custom-built polymers protect and deliver small molecules, nucleic acids, and proteins, representing ideal use-cases for machine learning to understand how molecular-level interactions impact drug stabilization and release. We conclude with our perspectives on how MI innovations in automation efficiencies and digitalization of data-coupled with fundamental insight and creativity from the polymer science community-can accelerate translation of more gene therapies into lifesaving medicines.
Collapse
|
38
|
Kawashima K, Márquez RA, Smith LA, Vaidyula RR, Carrasco-Jaim OA, Wang Z, Son YJ, Cao CL, Mullins CB. A Review of Transition Metal Boride, Carbide, Pnictide, and Chalcogenide Water Oxidation Electrocatalysts. Chem Rev 2023. [PMID: 37967475 DOI: 10.1021/acs.chemrev.3c00005] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2023]
Abstract
Transition metal borides, carbides, pnictides, and chalcogenides (X-ides) have emerged as a class of materials for the oxygen evolution reaction (OER). Because of their high earth abundance, electrical conductivity, and OER performance, these electrocatalysts have the potential to enable the practical application of green energy conversion and storage. Under OER potentials, X-ide electrocatalysts demonstrate various degrees of oxidation resistance due to their differences in chemical composition, crystal structure, and morphology. Depending on their resistance to oxidation, these catalysts will fall into one of three post-OER electrocatalyst categories: fully oxidized oxide/(oxy)hydroxide material, partially oxidized core@shell structure, and unoxidized material. In the past ten years (from 2013 to 2022), over 890 peer-reviewed research papers have focused on X-ide OER electrocatalysts. Previous review papers have provided limited conclusions and have omitted the significance of "catalytically active sites/species/phases" in X-ide OER electrocatalysts. In this review, a comprehensive summary of (i) experimental parameters (e.g., substrates, electrocatalyst loading amounts, geometric overpotentials, Tafel slopes, etc.) and (ii) electrochemical stability tests and post-analyses in X-ide OER electrocatalyst publications from 2013 to 2022 is provided. Both mono and polyanion X-ides are discussed and classified with respect to their material transformation during the OER. Special analytical techniques employed to study X-ide reconstruction are also evaluated. Additionally, future challenges and questions yet to be answered are provided in each section. This review aims to provide researchers with a toolkit to approach X-ide OER electrocatalyst research and to showcase necessary avenues for future investigation.
Collapse
Affiliation(s)
- Kenta Kawashima
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Raúl A Márquez
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Lettie A Smith
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Rinish Reddy Vaidyula
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Omar A Carrasco-Jaim
- McKetta Department of Chemical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Ziqing Wang
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Yoon Jun Son
- McKetta Department of Chemical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - Chi L Cao
- McKetta Department of Chemical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
| | - C Buddie Mullins
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
- McKetta Department of Chemical Engineering, The University of Texas at Austin, Austin, Texas 78712, United States
- Center for Electrochemistry, The University of Texas at Austin, Austin, Texas 78712, United States
- H2@UT, The University of Texas at Austin, Austin, Texas 78712, United States
| |
Collapse
|
39
|
Pascazio L, Rihm S, Naseri A, Mosbach S, Akroyd J, Kraft M. Chemical Species Ontology for Data Integration and Knowledge Discovery. J Chem Inf Model 2023; 63:6569-6586. [PMID: 37883649 PMCID: PMC10647085 DOI: 10.1021/acs.jcim.3c00820] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 10/13/2023] [Accepted: 10/13/2023] [Indexed: 10/28/2023]
Abstract
Web ontologies are important tools in modern scientific research because they provide a standardized way to represent and manage web-scale amounts of complex data. In chemistry, a semantic database for chemical species is indispensable for its ability to interrelate and infer relationships, enabling a more precise analysis and prediction of chemical behavior. This paper presents OntoSpecies, a web ontology designed to represent chemical species and their properties. The ontology serves as a core component of The World Avatar knowledge graph chemistry domain and includes a wide range of identifiers, chemical and physical properties, chemical classifications and applications, and spectral information associated with each species. The ontology includes provenance and attribution metadata, ensuring the reliability and traceability of data. Most of the information about chemical species are sourced from PubChem and ChEBI data on the respective compound Web pages using a software agent, making OntoSpecies a comprehensive semantic database of chemical species able to solve novel types of problems in the field. Access to this reliable source of chemical data is provided through a SPARQL end point. The paper presents example use cases to demonstrate the contribution of OntoSpecies in solving complex tasks that require integrated semantically searchable chemical data. The approach presented in this paper represents a significant advancement in the field of chemical data management, offering a powerful tool for representing, navigating, and analyzing chemical information to support scientific research.
Collapse
Affiliation(s)
- Laura Pascazio
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore
| | - Simon Rihm
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Ali Naseri
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
| | - Sebastian Mosbach
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
- CMCL
Innovations, Sheraton
House, Castle Park, Cambridge CB3 0AX, U.K.
| | - Jethro Akroyd
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
- CMCL
Innovations, Sheraton
House, Castle Park, Cambridge CB3 0AX, U.K.
| | - Markus Kraft
- CARES,
Cambridge Centre for Advanced Research and Education in Singapore, 1 Create Way, CREATE Tower, #05-05, Singapore 138602, Singapore
- Department
of Chemical Engineering and Biotechnology, University of Cambridge, Philippa Fawcett Drive, Cambridge CB3 0AS, U.K.
- CMCL
Innovations, Sheraton
House, Castle Park, Cambridge CB3 0AX, U.K.
- School
of Chemical and Biomedical Engineering, Nanyang Technological University, 62 Nanyang Drive, Singapore 637459, Singapore
- The
Alan Turing Institute, 96 Euston Rd., London NW1 2DB, U.K.
| |
Collapse
|
40
|
Liu Y, Ziatdinov MA, Vasudevan RK, Kalinin SV. Explainability and human intervention in autonomous scanning probe microscopy. PATTERNS (NEW YORK, N.Y.) 2023; 4:100858. [PMID: 38035198 PMCID: PMC10682748 DOI: 10.1016/j.patter.2023.100858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Revised: 07/26/2023] [Accepted: 09/15/2023] [Indexed: 12/02/2023]
Abstract
The broad adoption of machine learning (ML)-based autonomous experiments (AEs) in material characterization and synthesis requires strategies development for understanding and intervention in the experimental workflow. Here, we introduce and realize a post-experimental analysis strategy for deep kernel learning-based autonomous scanning probe microscopy. This approach yields real-time and post-experimental indicators for the progression of an active learning process interacting with an experimental system. We further illustrate how this approach can be applied to human-in-the-loop AEs, where human operators make high-level decisions at high latencies setting the policies for AEs, and the ML algorithm performs low-level, fast decisions. The proposed approach is universal and can be extended to other techniques and applications such as combinatorial library analysis.
Collapse
Affiliation(s)
- Yongtao Liu
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| | - Maxim A. Ziatdinov
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
- Computational Sciences and Engineering Division, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| | - Rama K. Vasudevan
- Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| | - Sergei V. Kalinin
- Department of Materials Science and Engineering, University of Tennessee, Knoxville, TN 37996, USA
| |
Collapse
|
41
|
Bao Z, Bufton J, Hickman RJ, Aspuru-Guzik A, Bannigan P, Allen C. Revolutionizing drug formulation development: The increasing impact of machine learning. Adv Drug Deliv Rev 2023; 202:115108. [PMID: 37774977 DOI: 10.1016/j.addr.2023.115108] [Citation(s) in RCA: 27] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2023] [Revised: 09/24/2023] [Accepted: 09/25/2023] [Indexed: 10/01/2023]
Abstract
Over the past few years, the adoption of machine learning (ML) techniques has rapidly expanded across many fields of research including formulation science. At the same time, the use of lipid nanoparticles to enable the successful delivery of mRNA vaccines in the recent COVID-19 pandemic demonstrated the impact of formulation science. Yet, the design of advanced pharmaceutical formulations is non-trivial and primarily relies on costly and time-consuming wet-lab experimentation. In 2021, our group published a review article focused on the use of ML as a means to accelerate drug formulation development. Since then, the field has witnessed significant growth and progress, reflected by an increasing number of studies published in this area. This updated review summarizes the current state of ML directed drug formulation development, introduces advanced ML techniques that have been implemented in formulation design and shares the progress on making self-driving laboratories a reality. Furthermore, this review highlights several future applications of ML yet to be fully exploited to advance drug formulation research and development.
Collapse
Affiliation(s)
- Zeqing Bao
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada
| | - Jack Bufton
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada
| | - Riley J Hickman
- Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 2E4, Canada; Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, Toronto, ON M5S 3H6, Canada; Department of Computer Science, University of Toronto, Toronto, ON M5S 2E4, Canada; Vector Institute for Artificial Intelligence, Toronto, ON M5S 1M1, Canada; Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), Toronto, ON M5S 1M1, Canada; Department of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, ON M5S 3E5, Canada; Department of Materials Science & Engineering, University of Toronto, Toronto, ON M5S 3E4, Canada; CIFAR Artificial Intelligence Research Chair, Vector Institute, Toronto, ON M5S 1M1, Canada; Acceleration Consortium, Toronto, ON M5S 3H6, Canada
| | - Pauric Bannigan
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada.
| | - Christine Allen
- Leslie Dan Faculty of Pharmacy, University of Toronto, Toronto, ON M5S 3M2, Canada; Department of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, ON M5S 3E5, Canada; Acceleration Consortium, Toronto, ON M5S 3H6, Canada.
| |
Collapse
|
42
|
Tetsassi Feugmo CG. Accurately predicting molecular spectra with deep learning. NATURE COMPUTATIONAL SCIENCE 2023; 3:918-919. [PMID: 38177595 DOI: 10.1038/s43588-023-00553-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
|
43
|
Schrier J, Norquist AJ, Buonassisi T, Brgoch J. In Pursuit of the Exceptional: Research Directions for Machine Learning in Chemical and Materials Science. J Am Chem Soc 2023; 145:21699-21716. [PMID: 37754929 DOI: 10.1021/jacs.3c04783] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/28/2023]
Abstract
Exceptional molecules and materials with one or more extraordinary properties are both technologically valuable and fundamentally interesting, because they often involve new physical phenomena or new compositions that defy expectations. Historically, exceptionality has been achieved through serendipity, but recently, machine learning (ML) and automated experimentation have been widely proposed to accelerate target identification and synthesis planning. In this Perspective, we argue that the data-driven methods commonly used today are well-suited for optimization but not for the realization of new exceptional materials or molecules. Finding such outliers should be possible using ML, but only by shifting away from using traditional ML approaches that tweak the composition, crystal structure, or reaction pathway. We highlight case studies of high-Tc oxide superconductors and superhard materials to demonstrate the challenges of ML-guided discovery and discuss the limitations of automation for this task. We then provide six recommendations for the development of ML methods capable of exceptional materials discovery: (i) Avoid the tyranny of the middle and focus on extrema; (ii) When data are limited, qualitative predictions that provide direction are more valuable than interpolative accuracy; (iii) Sample what can be made and how to make it and defer optimization; (iv) Create room (and look) for the unexpected while pursuing your goal; (v) Try to fill-in-the-blanks of input and output space; (vi) Do not confuse human understanding with model interpretability. We conclude with a description of how these recommendations can be integrated into automated discovery workflows, which should enable the discovery of exceptional molecules and materials.
Collapse
Affiliation(s)
- Joshua Schrier
- Department of Chemistry, Fordham University, The Bronx, New York 10458, United States
| | - Alexander J Norquist
- Department of Chemistry, Haverford College, Haverford, Pennsylvania 19041, United States
| | - Tonio Buonassisi
- Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Jakoah Brgoch
- Department of Chemistry and Texas Center for Superconductivity, University of Houston, Houston, Texas 77204, United States
| |
Collapse
|
44
|
Baronas P, Elholm JL, Moth-Poulsen K. Efficient degassing and ppm-level oxygen monitoring flow chemistry system. REACT CHEM ENG 2023; 8:2052-2059. [PMID: 37496729 PMCID: PMC10366651 DOI: 10.1039/d3re00109a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2023] [Accepted: 04/27/2023] [Indexed: 07/28/2023]
Abstract
Low oxygen levels are critical for a long range of chemical transformations carried out in both flow and batch chemistry. Here, we present an inline continuous flow degassing system based on a gas-permeable membrane inside a vacuum chamber for achieving and monitoring ppm-level oxygen concentrations in solutions. The oxygen presence was monitored with a molecular oxygen probe and a continuously running UV-vis spectrometer. An automated setup for discovering optimal reaction conditions for minimal oxygen presence was devised. The parameters tested were: flow rate, vacuum pressure, solvent back-pressure, tube material, tube length and solvent oxygen solubility. The inline degassing system was proven to be effective in removing up to 99.9% of ambient oxygen from solvents at a flow rate of 300 μl min-1 and 4 mbar vacuum pressure inside the degassing chamber. Reaching lower oxygen concentrations was limited by gas permeation in the tubing following the degassing unit, which could be addressed by purging large volume flow reactors with an inert gas after degassing or by using tubing with lower gas permeability, such as stainless steel tubing. Among all factors, oxygen solubility in solvents was found to play a significant role in achieving efficient degassing of solvents. The data presented here can be used to choose optimal experimental parameters for oxygen-sensitive reactions in flow chemistry reaction setups. The data were also fitted to an analytically derived model from simple differential equations in physical context of the experiment.
Collapse
Affiliation(s)
- Paulius Baronas
- The Institute of Materials Science of Barcelona, ICMAB-CSIC Bellaterra 08193 Barcelona Spain
| | - Jacob Lynge Elholm
- The Institute of Materials Science of Barcelona, ICMAB-CSIC Bellaterra 08193 Barcelona Spain
| | - Kasper Moth-Poulsen
- The Institute of Materials Science of Barcelona, ICMAB-CSIC Bellaterra 08193 Barcelona Spain
- Catalan Institution for Research & Advanced Studies, ICREA Pg. Lluís Companys 23 08010 Barcelona Spain
- Chalmers University of Technology, Department of Chemistry and Chemical Engineering SE-412 96 Gothenburg Sweden
- Department of Chemical Engineering, Universitat Politècnica de Catalunya, EEBE Eduard Maristany 10-14 08019 Barcelona Spain https://www.moth-poulsen.com
| |
Collapse
|
45
|
Anstine D, Isayev O. Generative Models as an Emerging Paradigm in the Chemical Sciences. J Am Chem Soc 2023; 145:8736-8750. [PMID: 37052978 PMCID: PMC10141264 DOI: 10.1021/jacs.2c13467] [Citation(s) in RCA: 88] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Indexed: 04/14/2023]
Abstract
Traditional computational approaches to design chemical species are limited by the need to compute properties for a vast number of candidates, e.g., by discriminative modeling. Therefore, inverse design methods aim to start from the desired property and optimize a corresponding chemical structure. From a machine learning viewpoint, the inverse design problem can be addressed through so-called generative modeling. Mathematically, discriminative models are defined by learning the probability distribution function of properties given the molecular or material structure. In contrast, a generative model seeks to exploit the joint probability of a chemical species with target characteristics. The overarching idea of generative modeling is to implement a system that produces novel compounds that are expected to have a desired set of chemical features, effectively sidestepping issues found in the forward design process. In this contribution, we overview and critically analyze popular generative algorithms like generative adversarial networks, variational autoencoders, flow, and diffusion models. We highlight key differences between each of the models, provide insights into recent success stories, and discuss outstanding challenges for realizing generative modeling discovered solutions in chemical applications.
Collapse
Affiliation(s)
- Dylan
M. Anstine
- Department
of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Olexandr Isayev
- Department
of Chemistry, Mellon College of Science, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
46
|
Volk AA, Epps RW, Yonemoto DT, Masters BS, Castellano FN, Reyes KG, Abolhasani M. AlphaFlow: autonomous discovery and optimization of multi-step chemistry using a self-driven fluidic lab guided by reinforcement learning. Nat Commun 2023; 14:1403. [PMID: 36918561 PMCID: PMC10015005 DOI: 10.1038/s41467-023-37139-y] [Citation(s) in RCA: 31] [Impact Index Per Article: 15.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Accepted: 03/02/2023] [Indexed: 03/16/2023] Open
Abstract
Closed-loop, autonomous experimentation enables accelerated and material-efficient exploration of large reaction spaces without the need for user intervention. However, autonomous exploration of advanced materials with complex, multi-step processes and data sparse environments remains a challenge. In this work, we present AlphaFlow, a self-driven fluidic lab capable of autonomous discovery of complex multi-step chemistries. AlphaFlow uses reinforcement learning integrated with a modular microdroplet reactor capable of performing reaction steps with variable sequence, phase separation, washing, and continuous in-situ spectral monitoring. To demonstrate the power of reinforcement learning toward high dimensionality multi-step chemistries, we use AlphaFlow to discover and optimize synthetic routes for shell-growth of core-shell semiconductor nanoparticles, inspired by colloidal atomic layer deposition (cALD). Without prior knowledge of conventional cALD parameters, AlphaFlow successfully identified and optimized a novel multi-step reaction route, with up to 40 parameters, that outperformed conventional sequences. Through this work, we demonstrate the capabilities of closed-loop, reinforcement learning-guided systems in exploring and solving challenges in multi-step nanoparticle syntheses, while relying solely on in-house generated data from a miniaturized microfluidic platform. Further application of AlphaFlow in multi-step chemistries beyond cALD can lead to accelerated fundamental knowledge generation as well as synthetic route discoveries and optimization.
Collapse
Affiliation(s)
- Amanda A Volk
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA
| | - Robert W Epps
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA
| | - Daniel T Yonemoto
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Benjamin S Masters
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Felix N Castellano
- Department of Chemistry, North Carolina State University, Raleigh, NC, 27695-8204, USA
| | - Kristofer G Reyes
- Department of Materials Design and Innovation, University at Buffalo, Buffalo, NY, 14260, USA
| | - Milad Abolhasani
- Department of Chemical and Biomolecular Engineering, North Carolina State University, 911 Partners Way, Raleigh, NC, 27695-7905, USA.
| |
Collapse
|
47
|
Mottafegh A, Ahn GN, Kim DP. Meta optimization based on real-time benchmarking of multiple surrogate models for autonomous flow synthesis. LAB ON A CHIP 2023; 23:1613-1621. [PMID: 36722393 DOI: 10.1039/d2lc00938b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Optimizing a wide range of reaction parameters, steps, and pathways is currently considered one of the most complex and challenging problems in microflow-based organic synthesis. As a novel solution, Bayesian optimization (BO) has been utilized to efficiently guide the optimized conditions of flow reactors; however, the benchmarking process for selecting the optimal model among various surrogate models remains inefficient. In this work, we report meta optimization (MO) by benchmarking multiple surrogate models in real-time without any pre-work, which is realized by evaluating the expected values obtained by the regressor used to build each surrogate model, enabling efficient optimization of reaction conditions. By the comparison of the performance of MO with that of various BOs on four datasets of different flow syntheses, it was verified that MO consistently performs the best-in-class for all emulators developed through machine learning, while the conventional BOs based on surrogate models such as the Gaussian process, random forest, neural network ensemble, and gradient boosting demonstrated varying performances from each emulator, which implies that benchmarking is required.
Collapse
Affiliation(s)
- Amirreza Mottafegh
- Center for Intelligent Microprocess of Pharmaceutical Synthesis, Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea.
| | - Gwang-Noh Ahn
- Center for Intelligent Microprocess of Pharmaceutical Synthesis, Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea.
| | - Dong-Pyo Kim
- Center for Intelligent Microprocess of Pharmaceutical Synthesis, Department of Chemical Engineering, Pohang University of Science and Technology (POSTECH), Pohang 37673, Republic of Korea.
| |
Collapse
|
48
|
Lin MH, Wolf JB, Sletten ET, Cambié D, Danglad-Flores J, Seeberger PH. Enabling Technologies in Carbohydrate Chemistry: Automated Glycan Assembly, Flow Chemistry and Data Science. Chembiochem 2023; 24:e202200607. [PMID: 36382494 DOI: 10.1002/cbic.202200607] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Revised: 11/15/2022] [Indexed: 11/17/2022]
Abstract
The synthesis of defined oligosaccharides is a complex task. Several enabling technologies have been introduced in the last two decades to facilitate synthetic access to these valuable biomolecules. In this concept, we describe the technological solutions that have advanced glycochemistry using automated glycan assembly, flow chemistry and data science as examples. We highlight how the synergies between these different technologies can further advance the field, with progress toward the realization of a self-driving lab for glycan synthesis.
Collapse
Affiliation(s)
- Mei-Huei Lin
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| | - Jakob B Wolf
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| | - Eric T Sletten
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Dario Cambié
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - José Danglad-Flores
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany
| | - Peter H Seeberger
- Department of Biomolecular Systems, Max-Planck Institute of Colloids and Interfaces, Am Mühlenberg 1, 14476, Potsdam, Germany.,Department of Chemistry and Biochemistry, Freie Universität Berlin, Arnimallee 22, 14195, Berlin, Germany
| |
Collapse
|
49
|
Wen M, Spotte-Smith EWC, Blau SM, McDermott MJ, Krishnapriyan AS, Persson KA. Chemical reaction networks and opportunities for machine learning. NATURE COMPUTATIONAL SCIENCE 2023; 3:12-24. [PMID: 38177958 DOI: 10.1038/s43588-022-00369-z] [Citation(s) in RCA: 33] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/24/2022] [Accepted: 11/08/2022] [Indexed: 01/06/2024]
Abstract
Chemical reaction networks (CRNs), defined by sets of species and possible reactions between them, are widely used to interrogate chemical systems. To capture increasingly complex phenomena, CRNs can be leveraged alongside data-driven methods and machine learning (ML). In this Perspective, we assess the diverse strategies available for CRN construction and analysis in pursuit of a wide range of scientific goals, discuss ML techniques currently being applied to CRNs and outline future CRN-ML approaches, presenting scientific and technical challenges to overcome.
Collapse
Affiliation(s)
- Mingjian Wen
- Chemical and Biomolecular Engineering, University of Houston, Houston, TX, USA
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Evan Walter Clark Spotte-Smith
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA
| | - Samuel M Blau
- Energy Technologies Area, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
| | - Matthew J McDermott
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA
| | - Aditi S Krishnapriyan
- Computational Research Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Chemical and Biomolecular Engineering, University of California, Berkeley, Berkeley, CA, USA
- Electrical Engineering and Computer Science, University of California, Berkeley, Berkeley, CA, USA
| | - Kristin A Persson
- Materials Science and Engineering, University of California, Berkeley, Berkeley, CA, USA.
- Molecular Foundry, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
| |
Collapse
|