1
|
Jiao Z, Mao Y, Lu R, Liu Y, Guo L, Wang Z. Fine-Tuning Graph Neural Networks via Active Learning: Unlocking the Potential of Graph Neural Networks Trained on Nonaqueous Systems for Aqueous CO 2 Reduction. J Chem Theory Comput 2025; 21:3176-3186. [PMID: 40084714 DOI: 10.1021/acs.jctc.5c00089] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/16/2025]
Abstract
Graph neural networks (GNNs) have revolutionized catalysis research with their efficiency and accuracy in modeling complex chemical interactions. However, adapting GNNs trained on nonaqueous data sets to aqueous systems poses notable challenges due to intricate water interactions. In this study, we proposed an active learning-based fine-tuning approach to extend the applicability of GNNs to aqueous environments. The geometry optimization and transition state search workflows are designed to reduce computational costs while maintaining DFT-level accuracy. Applied to the CO2 reduction reaction, the workflow delivers a 2-3-fold acceleration in geometry optimization through a relaxed force threshold combined with DFT refinement. The versatility of the transition state search algorithm was demonstrated on key C-C coupling pathways, pinpointing *CO-*COH as the most energetically favorable pathway in aqueous systems of Cu and Cu-based Ag, Au, and Zn alloys. The Brønsted-Evans-Polanyi relationship remains robust under water-induced fluctuations, with alloyed metals such as Al, Ga, and Pd, along with Ag, Au, and Zn, exhibiting coupling efficiency comparable to that of Cu. Additionally, perturbation-based training on forces and energies extends the application of GNNs to aqueous ab initio molecular dynamics simulations, enabling efficient modeling of dynamical trajectories. This work presents novel approaches to adapting nonaqueous models for application in aqueous systems, highlighting GNNs' potential in solvated environments and laying a foundation for accelerating predictions of catalytic mechanisms under realistic conditions.
Collapse
Affiliation(s)
- Zihao Jiao
- International Research Center for Renewable Energy, State Key Laboratory of Multiphase Flow in Power Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| | - Yu Mao
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| | - Ruihu Lu
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| | - Ya Liu
- International Research Center for Renewable Energy, State Key Laboratory of Multiphase Flow in Power Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Liejin Guo
- International Research Center for Renewable Energy, State Key Laboratory of Multiphase Flow in Power Engineering, Xi'an Jiaotong University, Xi'an, Shaanxi 710049, China
| | - Ziyun Wang
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| |
Collapse
|
2
|
Jiao Z, Liu Y, Wang Z. Application of graph neural network in computational heterogeneous catalysis. J Chem Phys 2024; 161:171001. [PMID: 39484893 DOI: 10.1063/5.0227821] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 10/11/2024] [Indexed: 11/03/2024] Open
Abstract
Heterogeneous catalysis, as a key technology in modern chemical industries, plays a vital role in social progress and economic development. However, its complex reaction process poses challenges to theoretical research. Graph neural networks (GNNs) are gradually becoming a key tool in this field as they can intrinsically learn atomic representation and consider connection relationship, making them naturally applicable to atomic and molecular systems. This article introduces the basic principles, current network architectures, and datasets of GNNs and reviews the application of GNN in heterogeneous catalysis from accelerating the materials screening and exploring the potential energy surface. In the end, we summarize the main challenges and potential application prospects of GNNs in future research endeavors.
Collapse
Affiliation(s)
- Zihao Jiao
- International Research Center for Renewable Energy, State Key Laboratory of Multiphase Flow in Power Engineering, Xi'an Jiaotong University, Shaanxi 710049, China
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| | - Ya Liu
- International Research Center for Renewable Energy, State Key Laboratory of Multiphase Flow in Power Engineering, Xi'an Jiaotong University, Shaanxi 710049, China
| | - Ziyun Wang
- School of Chemical Sciences, University of Auckland, Auckland 1010, New Zealand
| |
Collapse
|
3
|
Cao D, Chan MK. Enhancing chemical synthesis research with NLP: Word embeddings for chemical reagent identification-A case study on nano-FeCu. iScience 2024; 27:110780. [PMID: 39319268 PMCID: PMC11417335 DOI: 10.1016/j.isci.2024.110780] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/11/2024] [Accepted: 08/16/2024] [Indexed: 09/26/2024] Open
Abstract
Nanoparticle synthesis is complex, influenced by multiple variables including reagent selection. This study introduces a specialized corpus focused on "Fe, Cu, synthesis" to train a domain-specific word embedding model using natural language processing (NLP) in an unsupervised environment. Evaluation metrics included average cosine similarity, visual analysis via t-distributed stochastic neighbor embedding (t-SNE), synonym analysis, and analogy reasoning analysis. Results indicate a strong correlation between learning rate and cosine similarity, with enhanced chemical specificity in the tailored model compared to general models. The framework facilitates rapid identification of potential reagents for nano-FeCu synthesis, enhancing precision in nanomaterial research. This innovative approach offers a data-driven pathway for chemical material synthesis, demonstrating significant interdisciplinary applications.
Collapse
Affiliation(s)
- Dingding Cao
- Centre for Water Research, Faculty of Engineering, Built Environment and Information Technology, SEGi University. Jalan Teknologi, Kota Damansara, Petaling Jaya 47810, Selangor Darul Ehsan, Malaysia
- Department of Electrical and Electronic Engineering, Guangdong Technology College, Zhaoqing 526100, China
| | - Mieow Kee Chan
- Centre for Water Research, Faculty of Engineering, Built Environment and Information Technology, SEGi University. Jalan Teknologi, Kota Damansara, Petaling Jaya 47810, Selangor Darul Ehsan, Malaysia
| |
Collapse
|
4
|
Yang Z, Zhao YM, Wang X, Liu X, Zhang X, Li Y, Lv Q, Chen CYC, Shen L. Scalable crystal structure relaxation using an iteration-free deep generative model with uncertainty quantification. Nat Commun 2024; 15:8148. [PMID: 39289379 PMCID: PMC11408520 DOI: 10.1038/s41467-024-52378-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2024] [Accepted: 09/02/2024] [Indexed: 09/19/2024] Open
Abstract
In computational molecular and materials science, determining equilibrium structures is the crucial first step for accurate subsequent property calculations. However, the recent discovery of millions of new crystals and super large twisted structures has challenged traditional computational methods, both ab initio and machine-learning-based, due to their computationally intensive iterative processes. To address these scalability issues, here we introduce DeepRelax, a deep generative model capable of performing geometric crystal structure relaxation rapidly and without iterations. DeepRelax learns the equilibrium structural distribution, enabling it to predict relaxed structures directly from their unrelaxed ones. The ability to perform structural relaxation at the millisecond level per structure, combined with the scalability of parallel processing, makes DeepRelax particularly useful for large-scale virtual screening. We demonstrate DeepRelax's reliability and robustness by applying it to five diverse databases, including oxides, Materials Project, two-dimensional materials, van der Waals crystals, and crystals with point defects. DeepRelax consistently shows high accuracy and efficiency, validated by density functional theory calculations. Finally, we enhance its trustworthiness by integrating uncertainty quantification. This work significantly accelerates computational workflows, offering a robust and trustworthy machine-learning method for material discovery and advancing the application of AI for science.
Collapse
Affiliation(s)
- Ziduo Yang
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China
| | - Yi-Ming Zhao
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Xian Wang
- Department of Physics, National University of Singapore, Singapore, Singapore
| | - Xiaoqing Liu
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Xiuying Zhang
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Yifan Li
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
| | - Qiujie Lv
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore
- Artificial Intelligence Medical Research Center, School of Intelligent Systems Engineering, Shenzhen Campus of Sun Yat-sen University, Shenzhen, China
| | - Calvin Yu-Chian Chen
- AI for Science (AI4S)-Preferred Program, School of Electronic and Computer Engineering, Peking University Shenzhen Graduate School, Shenzhen, China.
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, Shenzhen, China.
- Department of Medical Research, China Medical University Hospital, Taichung, Taiwan.
- Department of Bioinformatics and Medical Engineering, Asia University, Taichung, Taiwan.
- Guangdong L-Med Biotechnology Co., Ltd., Meizhou, Guangdong, China.
| | - Lei Shen
- Department of Mechanical Engineering, National University of Singapore, Singapore, Singapore.
- National University of Singapore (Chongqing) Research Institute, Chongqing, China.
| |
Collapse
|
5
|
Guo ZX, Song GL, Liu ZP. Artificial intelligence driven molecule adsorption prediction (AIMAP) applied to chirality recognition of amino acid adsorption on metals. Chem Sci 2024; 15:13369-13380. [PMID: 39183905 PMCID: PMC11339975 DOI: 10.1039/d4sc02304h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Accepted: 07/15/2024] [Indexed: 08/27/2024] Open
Abstract
Predicting the adsorption structure of molecules has long been a challenging topic given the coupled complexity of surface binding sites and molecule flexibility. Here, we develop AIMAP, an Artificial Intelligence Driven Molecule Adsorption Prediction tool, to achieve the general-purpose end-to-end prediction of molecule adsorption structures. AIMAP features efficient exploration of the global potential energy surface of the adsorption system based on global neural network (G-NN) potential, by rapidly screening qualified adsorption patterns and fine searching using stochastic surface walking (SSW) global optimization. We demonstrate the AIMAP efficiency in constructing the Cu-HCNO6 adsorption database, encompassing 1 182 351 distinct adsorption configurations of 9592 molecules on three copper surfaces. AIMAP is then utilized to identify the best adsorption structure for 18 amino acids (AAs) on achiral Cu surfaces and the chiral Cu(3,1,17) S surface. We find that AAs chemisorb on copper surfaces in their highest deprotonated state, through both the carboxylate-amino skeleton and side groups. The chiral recognition is identified for the d-preference of Asp, Glu, and Tyr, and l-preference for His. The physical origin for the enantiospecific adsorption is thus rationalized, pointing to the critical role of the competitive adsorption between functional side groups and the carboxylate-amino skeleton at surface low-coordination sites.
Collapse
Affiliation(s)
- Zi-Xing Guo
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University Shanghai 200433 China
| | - Guo-Liang Song
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University Shanghai 200433 China
| | - Zhi-Pan Liu
- Collaborative Innovation Center of Chemistry for Energy Material, Shanghai Key Laboratory of Molecular Catalysis and Innovative Materials, Key Laboratory of Computational Physical Science, Department of Chemistry, Fudan University Shanghai 200433 China
- Key Laboratory of Synthetic and Self-Assembly Chemistry for Organic Functional Molecules, Shanghai Institute of Organic Chemistry, Chinese Academy of Sciences Shanghai 200032 China
- Shanghai Qi Zhi Institution Shanghai 200030 China
| |
Collapse
|
6
|
Xue Z, Tan R, Tian J, Hou H, Zhang X, Zhao Y. Designing asymmetrical TMN 4 sites via phosphorus or sulfur dual coordination as high-performance electrocatalysts for oxygen evolution reaction. J Colloid Interface Sci 2024; 667:679-687. [PMID: 38670011 DOI: 10.1016/j.jcis.2024.04.095] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2024] [Revised: 04/09/2024] [Accepted: 04/14/2024] [Indexed: 04/28/2024]
Abstract
The development ofhighly efficient oxygen evolution reaction (OER) catalysts based on more cost-effective and earth-abundant elements is of great significance and still faces a huge challenge. In this work, a series of transition metal (TM)embedding a newly-defined monolayer carbon nitride phase is theoretically profiled and constructed as a catalytic platform for OER studies. Typically, a four-step screening strategy was proposed to rapidly identified high performance candidates and the coordination structure and catalytic performance relationship was thoroughly analyzed. Moreover, the eliminating criterion was established to condenses valid range based on the Gibbs free energy of OH*. Our results reveal that the as-constructed 2FeCN/P exhibits superior activity toward OER with an ultralow overpotential of 0.25 V, at the same time, the established 3FeCN/S configuration performed well as abifunctional OER/ORR electrocatalysis with extremely low overpotential ηOER/ηORR of 0.26/0.48 V. Overall, this work provides an effective framework for screening advanced OER catalysts, which can also be extended to other complex multistep catalytic reactions.
Collapse
Affiliation(s)
- Zhe Xue
- School of Materials Science and Engineering, Collaborative Innovation Center of Ministry of Education and Shanxi Province for High-performance Al/Mg Alloy Materials, North University of China, Taiyuan 030051, China; State Key Laboratory of Metastable Materials Science and Technology, Yanshan University, Qinhuangdao 066004, Hebei, China
| | - Rui Tan
- College of Physics and Electronics Engineering, Hengyang Normal University, Hengyang 421002, China
| | - Jinzhong Tian
- School of Materials Science and Engineering, Collaborative Innovation Center of Ministry of Education and Shanxi Province for High-performance Al/Mg Alloy Materials, North University of China, Taiyuan 030051, China
| | - Hua Hou
- School of Materials Science and Engineering, Collaborative Innovation Center of Ministry of Education and Shanxi Province for High-performance Al/Mg Alloy Materials, North University of China, Taiyuan 030051, China; School of Materials Science and Engineering, Taiyuan University of Science and Technology, Taiyuan 030024, China
| | - Xinyu Zhang
- State Key Laboratory of Metastable Materials Science and Technology, Yanshan University, Qinhuangdao 066004, Hebei, China.
| | - Yuhong Zhao
- School of Materials Science and Engineering, Collaborative Innovation Center of Ministry of Education and Shanxi Province for High-performance Al/Mg Alloy Materials, North University of China, Taiyuan 030051, China; Beijing Advanced Innovation Center for Materials Genome Engineering, University of Science and Technology Beijing, Beijing 100083, China.
| |
Collapse
|
7
|
Wang Y, Sun J, Sun N, Zhang M, Liu X, Zhang A, Wang L. The spin polarization strategy regulates heterogeneous catalytic activity performance: from fundamentals to applications. Chem Commun (Camb) 2024; 60:7397-7413. [PMID: 38946499 DOI: 10.1039/d4cc02012j] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
In recent years, there has been significant attention towards the development of catalysts that exhibit superior performance and environmentally friendly attributes. This surge in interest is driven by the growing demands for energy utilization and storage as well as environmental preservation. Spin polarization plays a crucial role in catalyst design, comprehension of catalytic mechanisms, and reaction control, offering novel insights for the design of highly efficient catalysts. However, there are still some significant research gaps in the current study of spin catalysis. Therefore, it is urgent to understand how spin polarization impacts catalytic reactions to develop superior performance catalysts. Herein, we present a comprehensive summary of the application of spin polarization in catalysis. Firstly, we summarize the fundamental mechanism of spin polarization in catalytic reactions from two aspects of kinetics and thermodynamics. Additionally, we review the regulation mechanism of spin polarization in various catalytic applications and several approaches to modulate spin polarization. Moreover, we discuss the future development of spin polarization in catalysis and propose several potential avenues for further progress. We aim to improve current catalytic systems through implementing a novel and distinctive spin engineering strategy.
Collapse
Affiliation(s)
- Yan Wang
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Junkang Sun
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Ning Sun
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Mengyang Zhang
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Xianya Liu
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Anlei Zhang
- College of Science, Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| | - Longlu Wang
- College of Electronic and Optical Engineering, Institute of Flexible Electronics (Future Technology), Nanjing University of Posts & Telecommunications (NJUPT), Nanjing 210023, Jiangsu, P. R. China.
| |
Collapse
|
8
|
Duignan TT. The Potential of Neural Network Potentials. ACS PHYSICAL CHEMISTRY AU 2024; 4:232-241. [PMID: 38800721 PMCID: PMC11117678 DOI: 10.1021/acsphyschemau.4c00004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 03/04/2024] [Accepted: 03/05/2024] [Indexed: 05/29/2024]
Abstract
In the next half-century, physical chemistry will likely undergo a profound transformation, driven predominantly by the combination of recent advances in quantum chemistry and machine learning (ML). Specifically, equivariant neural network potentials (NNPs) are a breakthrough new tool that are already enabling us to simulate systems at the molecular scale with unprecedented accuracy and speed, relying on nothing but fundamental physical laws. The continued development of this approach will realize Paul Dirac's 80-year-old vision of using quantum mechanics to unify physics with chemistry and providing invaluable tools for understanding materials science, biology, earth sciences, and beyond. The era of highly accurate and efficient first-principles molecular simulations will provide a wealth of training data that can be used to build automated computational methodologies, using tools such as diffusion models, for the design and optimization of systems at the molecular scale. Large language models (LLMs) will also evolve into increasingly indispensable tools for literature review, coding, idea generation, and scientific writing.
Collapse
|
9
|
Chowdhury J, Fricke C, Bamidele O, Bello M, Yang W, Heyden A, Terejanu G. Invariant Molecular Representations for Heterogeneous Catalysis. J Chem Inf Model 2024; 64:327-339. [PMID: 38197612 PMCID: PMC10806804 DOI: 10.1021/acs.jcim.3c00594] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2023] [Revised: 12/25/2023] [Accepted: 12/28/2023] [Indexed: 01/11/2024]
Abstract
Catalyst screening is a critical step in the discovery and development of heterogeneous catalysts, which are vital for a wide range of chemical processes. In recent years, computational catalyst screening, primarily through density functional theory (DFT), has gained significant attention as a method for identifying promising catalysts. However, the computation of adsorption energies for all likely chemical intermediates present in complex surface chemistries is computationally intensive and costly due to the expensive nature of these calculations and the intrinsic idiosyncrasies of the methods or data sets used. This study introduces a novel machine learning (ML) method to learn adsorption energies from multiple DFT functionals by using invariant molecular representations (IMRs). To do this, we first extract molecular fingerprints for the reaction intermediates and later use a Siamese-neural-network-based training strategy to learn invariant molecular representations or the IMR across all available functionals. Our Siamese network-based representations demonstrate superior performance in predicting adsorption energies compared with other molecular representations. Notably, when considering mean absolute values of adsorption energies as 0.43 eV (PBE-D3), 0.46 eV (BEEF-vdW), 0.81 eV (RPBE), and 0.37 eV (scan+rVV10), our IMR method has achieved the lowest mean absolute errors (MAEs) of 0.18 0.10, 0.16, and 0.18 eV, respectively. These results emphasize the superior predictive capacity of our Siamese network-based representations. The empirical findings in this study illuminate the efficacy, robustness, and dependability of our proposed ML paradigm in predicting adsorption energies, specifically for propane dehydrogenation on a platinum catalyst surface.
Collapse
Affiliation(s)
- Jawad Chowdhury
- Department
of Computer Science, University of North
Carolina at Charlotte, Charlotte, North Carolina 28223, United States
| | - Charles Fricke
- Department
of Chemical Engineering, University of South
Carolina, Columbia, South Carolina 29208, United States
| | - Olajide Bamidele
- Department
of Chemical Engineering, University of South
Carolina, Columbia, South Carolina 29208, United States
| | - Mubarak Bello
- Department
of Chemical Engineering, University of South
Carolina, Columbia, South Carolina 29208, United States
| | - Wenqiang Yang
- Department
of Chemical Engineering, University of South
Carolina, Columbia, South Carolina 29208, United States
| | - Andreas Heyden
- Department
of Chemical Engineering, University of South
Carolina, Columbia, South Carolina 29208, United States
| | - Gabriel Terejanu
- Department
of Computer Science, University of North
Carolina at Charlotte, Charlotte, North Carolina 28223, United States
| |
Collapse
|
10
|
Buterez D, Janet JP, Kiddle SJ, Oglic D, Liò P. Modelling local and general quantum mechanical properties with attention-based pooling. Commun Chem 2023; 6:262. [PMID: 38030692 PMCID: PMC10686994 DOI: 10.1038/s42004-023-01045-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/08/2023] [Accepted: 10/27/2023] [Indexed: 12/01/2023] Open
Abstract
Atom-centred neural networks represent the state-of-the-art for approximating the quantum chemical properties of molecules, such as internal energies. While the design of machine learning architectures that respect chemical principles has continued to advance, the final atom pooling operation that is necessary to convert from atomic to molecular representations in most models remains relatively undeveloped. The most common choices, sum and average pooling, compute molecular representations that are naturally a good fit for many physical properties, while satisfying properties such as permutation invariance which are desirable from a geometric deep learning perspective. However, there are growing concerns that such simplistic functions might have limited representational power, while also being suboptimal for physical properties that are highly localised or intensive. Based on recent advances in graph representation learning, we investigate the use of a learnable pooling function that leverages an attention mechanism to model interactions between atom representations. The proposed pooling operation is a drop-in replacement requiring no changes to any of the other architectural components. Using SchNet and DimeNet++ as starting models, we demonstrate consistent uplifts in performance compared to sum and mean pooling and a recent physics-aware pooling operation designed specifically for orbital energies, on several datasets, properties, and levels of theory, with up to 85% improvements depending on the specific task.
Collapse
Affiliation(s)
- David Buterez
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK.
| | - Jon Paul Janet
- Molecular AI, Discovery Sciences, R&D, AstraZeneca, Gothenburg, 431 50, Sweden
| | - Steven J Kiddle
- Data Science & Advanced Analytics, Data Science & AI, R&D, AstraZeneca, Cambridge, CB2 8PA, UK
| | - Dino Oglic
- Center for AI, Data Science & AI, R&D, AstraZeneca, Cambridge, CB2 8PA, UK
| | - Pietro Liò
- Department of Computer Science and Technology, University of Cambridge, Cambridge, CB3 0FD, UK
| |
Collapse
|
11
|
Wang H, Pei Y, Wang K, Zuo Y, Wei M, Xiong J, Zhang P, Chen Z, Shang N, Zhong D, Pei P. First-Row Transition Metals for Catalyzing Oxygen Redox. SMALL (WEINHEIM AN DER BERGSTRASSE, GERMANY) 2023; 19:e2304863. [PMID: 37469215 DOI: 10.1002/smll.202304863] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/09/2023] [Revised: 07/09/2023] [Indexed: 07/21/2023]
Abstract
Rechargeable zinc-air batteries are widely recognized as a highly promising technology for energy conversion and storage, offering a cost-effective and viable alternative to commercial lithium-ion batteries due to their unique advantages. However, the practical application and commercialization of zinc-air batteries are hindered by the sluggish kinetics of the oxygen reduction reaction (ORR) and oxygen evolution reaction (OER). Recently, extensive research has focused on the potential of first-row transition metals (Mn, Fe, Co, Ni, and Cu) as promising alternatives to noble metals in bifunctional ORR/OER electrocatalysts, leveraging their high-efficiency electrocatalytic activity and excellent durability. This review provides a comprehensive summary of the recent advancements in the mechanisms of ORR/OER, the performance of bifunctional electrocatalysts, and the preparation strategies employed for electrocatalysts based on first-row transition metals in alkaline media for zinc-air batteries. The paper concludes by proposing several challenges and highlighting emerging research trends for the future development of bifunctional electrocatalysts based on first-row transition metals.
Collapse
Affiliation(s)
- Hengwei Wang
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Yu Pei
- Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC, V6T 1Z3, Canada
| | - Keliang Wang
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
- State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing, 100084, China
| | - Yayu Zuo
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Manhui Wei
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Jianyin Xiong
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Pengfei Zhang
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Zhuo Chen
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Nuo Shang
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Daiyuan Zhong
- School of Mechanical Engineering, Beijing Institute of Technology, Beijing, 100081, China
| | - Pucheng Pei
- State Key Laboratory of Automotive Safety and Energy, Tsinghua University, Beijing, 100084, China
| |
Collapse
|
12
|
Dou B, Zhu Z, Merkurjev E, Ke L, Chen L, Jiang J, Zhu Y, Liu J, Zhang B, Wei GW. Machine Learning Methods for Small Data Challenges in Molecular Science. Chem Rev 2023; 123:8736-8780. [PMID: 37384816 PMCID: PMC10999174 DOI: 10.1021/acs.chemrev.3c00189] [Citation(s) in RCA: 73] [Impact Index Per Article: 36.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
Small data are often used in scientific and engineering research due to the presence of various constraints, such as time, cost, ethics, privacy, security, and technical limitations in data acquisition. However, big data have been the focus for the past decade, small data and their challenges have received little attention, even though they are technically more severe in machine learning (ML) and deep learning (DL) studies. Overall, the small data challenge is often compounded by issues, such as data diversity, imputation, noise, imbalance, and high-dimensionality. Fortunately, the current big data era is characterized by technological breakthroughs in ML, DL, and artificial intelligence (AI), which enable data-driven scientific discovery, and many advanced ML and DL technologies developed for big data have inadvertently provided solutions for small data problems. As a result, significant progress has been made in ML and DL for small data challenges in the past decade. In this review, we summarize and analyze several emerging potential solutions to small data challenges in molecular science, including chemical and biological sciences. We review both basic machine learning algorithms, such as linear regression, logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), kernel learning (KL), random forest (RF), and gradient boosting trees (GBT), and more advanced techniques, including artificial neural network (ANN), convolutional neural network (CNN), U-Net, graph neural network (GNN), Generative Adversarial Network (GAN), long short-term memory (LSTM), autoencoder, transformer, transfer learning, active learning, graph-based semi-supervised learning, combining deep learning with traditional machine learning, and physical model-based data augmentation. We also briefly discuss the latest advances in these methods. Finally, we conclude the survey with a discussion of promising trends in small data challenges in molecular science.
Collapse
Affiliation(s)
- Bozheng Dou
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Zailiang Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Ekaterina Merkurjev
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Lu Ke
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Long Chen
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jian Jiang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Yueying Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Bengong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
13
|
Wang X, Huang Y, Xie X, Liu Y, Huo Z, Lin M, Xin H, Tong R. Bayesian-optimization-assisted discovery of stereoselective aluminum complexes for ring-opening polymerization of racemic lactide. Nat Commun 2023; 14:3647. [PMID: 37339991 DOI: 10.1038/s41467-023-39405-5] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Accepted: 06/12/2023] [Indexed: 06/22/2023] Open
Abstract
Stereoselective ring-opening polymerization catalysts are used to produce degradable stereoregular poly(lactic acids) with thermal and mechanical properties that are superior to those of atactic polymers. However, the process of discovering highly stereoselective catalysts is still largely empirical. We aim to develop an integrated computational and experimental framework for efficient, predictive catalyst selection and optimization. As a proof of principle, we have developed a Bayesian optimization workflow on a subset of literature results for stereoselective lactide ring-opening polymerization, and using the algorithm, we identify multiple new Al complexes that catalyze either isoselective or heteroselective polymerization. In addition, feature attribution analysis uncovers mechanistically meaningful ligand descriptors, such as percent buried volume (%Vbur) and the highest occupied molecular orbital energy (EHOMO), that can access quantitative and predictive models for catalyst development.
Collapse
Affiliation(s)
- Xiaoqian Wang
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Yang Huang
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Xiaoyu Xie
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Yan Liu
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Ziyu Huo
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Maverick Lin
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA
| | - Hongliang Xin
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA.
| | - Rong Tong
- Department of Chemical Engineering, Virginia Polytechnic Institute and State University, 635 Prices Fork Road, Blacksburg, VA, 24061, USA.
| |
Collapse
|
14
|
Tran R, Lan J, Shuaibi M, Wood BM, Goyal S, Das A, Heras-Domingo J, Kolluru A, Rizvi A, Shoghi N, Sriram A, Therrien F, Abed J, Voznyy O, Sargent EH, Ulissi Z, Zitnick CL. The Open Catalyst 2022 (OC22) Dataset and Challenges for Oxide Electrocatalysts. ACS Catal 2023. [DOI: 10.1021/acscatal.2c05426] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023]
Affiliation(s)
- Richard Tran
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15217, United States
| | - Janice Lan
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Muhammed Shuaibi
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15217, United States
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Brandon M. Wood
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Siddharth Goyal
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Abhishek Das
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Javier Heras-Domingo
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15217, United States
| | - Adeesh Kolluru
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15217, United States
| | - Ammar Rizvi
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Nima Shoghi
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Anuroop Sriram
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| | - Félix Therrien
- Department of Electrical and Computer Engineering, University of Toronto, 10 King’s College Road, Toronto, Ontario M5S 3G4, Canada
- Department of Physical and Environmental Sciences, University of Toronto Scarborough, Scarborough, Ontario M1C 1A4, Canada
| | - Jehad Abed
- Department of Electrical and Computer Engineering, University of Toronto, 10 King’s College Road, Toronto, Ontario M5S 3G4, Canada
- Department of Materials Science and Engineering, University of Toronto, 10 King’s College Road, Toronto, Ontario M5S 3G4, Canada
| | - Oleksandr Voznyy
- Department of Physical and Environmental Sciences, University of Toronto Scarborough, Scarborough, Ontario M1C 1A4, Canada
| | - Edward H. Sargent
- Department of Electrical and Computer Engineering, University of Toronto, 10 King’s College Road, Toronto, Ontario M5S 3G4, Canada
| | - Zachary Ulissi
- Department of Chemical Engineering, Carnegie Mellon University, Pittsburgh, Pennsylvania 15217, United States
- Scott Institute for Energy Innovation, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - C. Lawrence Zitnick
- Fundamental AI Research, Meta AI, Menlo Park, California 94025, United States
| |
Collapse
|
15
|
Xu G, Cai C, Zhao W, Liu Y, Wang T. Rational design of catalysts with earth‐abundant elements. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1654] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Gaomou Xu
- Center of Artificial Photosynthesis for Solar Fuels and Department of Chemistry, School of Science Westlake University Hangzhou Zhejiang Province China
- Institute of Natural Sciences, Westlake Institute for Advanced Study Hangzhou Zhejiang Province China
| | - Cheng Cai
- Center of Artificial Photosynthesis for Solar Fuels and Department of Chemistry, School of Science Westlake University Hangzhou Zhejiang Province China
- Institute of Natural Sciences, Westlake Institute for Advanced Study Hangzhou Zhejiang Province China
| | - Wanghui Zhao
- Center of Artificial Photosynthesis for Solar Fuels and Department of Chemistry, School of Science Westlake University Hangzhou Zhejiang Province China
- Institute of Natural Sciences, Westlake Institute for Advanced Study Hangzhou Zhejiang Province China
| | - Yonghua Liu
- Center of Artificial Photosynthesis for Solar Fuels and Department of Chemistry, School of Science Westlake University Hangzhou Zhejiang Province China
- Institute of Natural Sciences, Westlake Institute for Advanced Study Hangzhou Zhejiang Province China
| | - Tao Wang
- Center of Artificial Photosynthesis for Solar Fuels and Department of Chemistry, School of Science Westlake University Hangzhou Zhejiang Province China
- Institute of Natural Sciences, Westlake Institute for Advanced Study Hangzhou Zhejiang Province China
| |
Collapse
|