1
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
2
|
Wang Z, Chen A, Tao K, Han Y, Li J. MatGPT: A Vane of Materials Informatics from Past, Present, to Future. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2306733. [PMID: 37813548 DOI: 10.1002/adma.202306733] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Revised: 09/05/2023] [Indexed: 10/17/2023]
Abstract
Combining materials science, artificial intelligence (AI), physical chemistry, and other disciplines, materials informatics is continuously accelerating the vigorous development of new materials. The emergence of "GPT (Generative Pre-trained Transformer) AI" shows that the scientific research field has entered the era of intelligent civilization with "data" as the basic factor and "algorithm + computing power" as the core productivity. The continuous innovation of AI will impact the cognitive laws and scientific methods, and reconstruct the knowledge and wisdom system. This leads to think more about materials informatics. Here, a comprehensive discussion of AI models and materials infrastructures is provided, and the advances in the discovery and design of new materials are reviewed. With the rise of new research paradigms triggered by "AI for Science", the vane of materials informatics: "MatGPT", is proposed and the technical path planning from the aspects of data, descriptors, generative models, pretraining models, directed design models, collaborative training, experimental robots, as well as the efforts and preparations needed to develop a new generation of materials informatics, is carried out. Finally, the challenges and constraints faced by materials informatics are discussed, in order to achieve a more digital, intelligent, and automated construction of materials informatics with the joint efforts of more interdisciplinary scientists.
Collapse
Affiliation(s)
- Zhilong Wang
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - An Chen
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Kehao Tao
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Yanqiang Han
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| | - Jinjin Li
- National Key Laboratory of Science and Technology on Micro/Nano Fabrication, Shanghai Jiao Tong University, Shanghai, 200240, China
- Key Laboratory of Thin Film and Microfabrication of Ministry of Education, Department of Micro/Nano Electronics, Shanghai Jiao Tong University, Shanghai, 200240, China
| |
Collapse
|
3
|
Friedrich R, Curtarolo S. AFLOW-CCE for the thermodynamics of ionic materials. J Chem Phys 2024; 160:042501. [PMID: 38276957 DOI: 10.1063/5.0184917] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Accepted: 12/25/2023] [Indexed: 01/27/2024] Open
Abstract
Accurate thermodynamic stability predictions enable data-driven computational materials design. Standard density functional theory (DFT) approximations have limited accuracy with average errors of a few hundred meV/atom for ionic materials, such as oxides and nitrides. Thus, insightful correction schemes as given by the coordination corrected enthalpies (CCE) method, based on an intuitive parametrization of DFT errors with respect to coordination numbers and cation oxidation states, present a simple, yet accurate solution to enable materials stability assessments. Here, we illustrate the computational capabilities of our AFLOW-CCE software by utilizing our previous results for oxides and introducing new results for nitrides. The implementation reduces the deviations between theory and experiment to the order of the room temperature thermal energy scale, i.e., ∼25 meV/atom. The automated corrections for both materials classes are freely available within the AFLOW ecosystem via the AFLOW-CCE module, requiring only structural inputs.
Collapse
Affiliation(s)
- Rico Friedrich
- Theoretical Chemistry, Technische Universität Dresden, 01062 Dresden, Germany
- Institute of Ion Beam Physics and Materials Research, Helmholtz-Zentrum Dresden-Rossendorf, 01328 Dresden, Germany
- Center for Autonomous Materials Design, Duke University, Durham, North Carolina 27708, USA
| | - Stefano Curtarolo
- Center for Autonomous Materials Design, Duke University, Durham, North Carolina 27708, USA
- Materials Science, Electrical Engineering, and Physics, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
4
|
Gong S, Yan K, Xie T, Shao-Horn Y, Gomez-Bombarelli R, Ji S, Grossman JC. Examining graph neural networks for crystal structures: Limitations and opportunities for capturing periodicity. SCIENCE ADVANCES 2023; 9:eadi3245. [PMID: 37948518 PMCID: PMC10637739 DOI: 10.1126/sciadv.adi3245] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 10/13/2023] [Indexed: 11/12/2023]
Abstract
Graph neural networks (GNNs) have recently been used to learn the representations of crystal structures through an end-to-end data-driven approach. However, a systematic top-down approach to evaluate and understand the limitations of GNNs in accurately capturing crystal structures has yet to be established. In this study, we introduce an approach using human-designed descriptors as a compendium of human knowledge to investigate the extent to which GNNs can comprehend crystal structures. Our findings reveal that current state-of-the-art GNNs fall short in accurately capturing the periodicity of crystal structures. We analyze this failure by exploring three aspects: local expressive power, long-range information processing, and readout function. To address these identified limitations, we propose a straightforward and general solution: the hybridization of descriptors with GNNs, which directly supplements the missing information to GNNs. The hybridization enhances the predictive accuracy of GNNs for specific material properties, most notably phonon internal energy and heat capacity, which heavily rely on the periodicity of materials.
Collapse
Affiliation(s)
- Sheng Gong
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Keqiang Yan
- Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Tian Xie
- Microsoft Research, Cambridge CB1 2FB, UK
| | - Yang Shao-Horn
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Department of Mechanical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Rafael Gomez-Bombarelli
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Shuiwang Ji
- Computer Science and Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Jeffrey C. Grossman
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| |
Collapse
|
5
|
Zhang Z, Valente DS, Shi Y, Limbu DK, Momeni MR, Shakib FA. In Silico High-Throughput Design and Prediction of Structural and Electronic Properties of Low-Dimensional Metal-Organic Frameworks. ACS APPLIED MATERIALS & INTERFACES 2023; 15:9494-9507. [PMID: 36749899 DOI: 10.1021/acsami.2c22665] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The advent of π-stacked layered metal-organic frameworks (MOFs), which offer electrical conductivity on top of permanent porosity and high surface area, opened up new horizons for designing compact MOF-based devices such as battery electrodes, supercapacitors, and spintronics. Permutation of structural building blocks, including metal nodes and organic linkers, in these electrically conductive (EC) materials, results in new systems with unprecedented and unexplored physical and chemical properties. With the ultimate goal of providing a platform for accelerated material design and discovery, here we lay the foundations for the creation of the first comprehensive database of EC-MOFs with an experimentally guided approach. The first phase of this database, coined EC-MOF/Phase-I, is composed of 1,057 bulk and monolayer structures built by all possible combinations of experimentally reported organic linkers, functional groups, and metal nodes. A high-throughput screening (HTS) workflow is constructed to implement density functional theory calculations with periodic boundary conditions to optimize the structures and calculate some of their most relevant properties. Because research and development in the area of EC-MOFs has long been suffering from the lack of appropriate initial crystal structures, all of the geometries and property data have been made available for the use of the community through an online platform that was developed during the course of this work. This database provides comprehensive physical and chemical data of EC-MOFs as well as the convenience of selecting appropriate materials for specific applications, thus accelerating the design and discovery of EC-MOF-based compact devices.
Collapse
Affiliation(s)
- Zeyu Zhang
- Department of Chemistry and Environmental Science, New Jersey Institute of Technology, Newark, New Jersey 07102, United States
| | - Dylan S Valente
- Department of Chemistry and Environmental Science, New Jersey Institute of Technology, Newark, New Jersey 07102, United States
| | - Yuliang Shi
- Department of Chemistry and Environmental Science, New Jersey Institute of Technology, Newark, New Jersey 07102, United States
| | - Dil K Limbu
- Department of Chemistry and Environmental Science, New Jersey Institute of Technology, Newark, New Jersey 07102, United States
| | - Mohammad R Momeni
- Division of Energy, Matter and Systems, School of Science and Engineering, University of Missouri─Kansas City, Kansas City, Missouri 64110, United States
| | - Farnaz A Shakib
- Department of Chemistry and Environmental Science, New Jersey Institute of Technology, Newark, New Jersey 07102, United States
| |
Collapse
|
6
|
Chew PY, Reinhardt A. Phase diagrams-Why they matter and how to predict them. J Chem Phys 2023; 158:030902. [PMID: 36681642 DOI: 10.1063/5.0131028] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Understanding the thermodynamic stability and metastability of materials can help us to, for example, gauge whether crystalline polymorphs in pharmaceutical formulations are likely to be durable. It can also help us to design experimental routes to novel phases with potentially interesting properties. In this Perspective, we provide an overview of how thermodynamic phase behavior can be quantified both in computer simulations and machine-learning approaches to determine phase diagrams, as well as combinations of the two. We review the basic workflow of free-energy computations for condensed phases, including some practical implementation advice, ranging from the Frenkel-Ladd approach to thermodynamic integration and to direct-coexistence simulations. We illustrate the applications of such methods on a range of systems from materials chemistry to biological phase separation. Finally, we outline some challenges, questions, and practical applications of phase-diagram determination which we believe are likely to be possible to address in the near future using such state-of-the-art free-energy calculations, which may provide fundamental insight into separation processes using multicomponent solvents.
Collapse
Affiliation(s)
- Pin Yu Chew
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Aleks Reinhardt
- Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|