1
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
2
|
Kaiser S, Yue Z, Peng Y, Nguyen TD, Chen S, Teng D, Voth GA. Molecular Dynamics Simulation of Complex Reactivity with the Rapid Approach for Proton Transport and Other Reactions (RAPTOR) Software Package. J Phys Chem B 2024; 128:4959-4974. [PMID: 38742764 DOI: 10.1021/acs.jpcb.4c01987] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Simulating chemically reactive phenomena such as proton transport on nanosecond to microsecond and beyond time scales is a challenging task. Ab initio methods are unable to currently access these time scales routinely, and traditional molecular dynamics methods feature fixed bonding arrangements that cannot account for changes in the system's bonding topology. The Multiscale Reactive Molecular Dynamics (MS-RMD) method, as implemented in the Rapid Approach for Proton Transport and Other Reactions (RAPTOR) software package for the LAMMPS molecular dynamics code, offers a method to routinely sample longer time scale reactive simulation data with statistical precision. RAPTOR may also be interfaced with enhanced sampling methods to drive simulations toward the analysis of reactive rare events, and a number of collective variables (CVs) have been developed to facilitate this. Key advances to this methodology, including GPU acceleration efforts and novel CVs to model water wire formation are reviewed, along with recent applications of the method which demonstrate its versatility and robustness.
Collapse
Affiliation(s)
- Scott Kaiser
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Zhi Yue
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Yuxing Peng
- NVIDIA Corporation, Santa Clara, California 95051, United States
| | - Trung Dac Nguyen
- Research Computing Center, The University of Chicago, Chicago, Illinois 60637, United States
| | - Sijia Chen
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Da Teng
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A Voth
- Department of Chemistry, Chicago Center for Theoretical Chemistry, James Franck Institute, and Institute for Biophysical Dynamics, The University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
3
|
Wang G, Wang C, Zhang X, Li Z, Zhou J, Sun Z. Machine learning interatomic potential: Bridge the gap between small-scale models and realistic device-scale simulations. iScience 2024; 27:109673. [PMID: 38646181 PMCID: PMC11033164 DOI: 10.1016/j.isci.2024.109673] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/23/2024] Open
Abstract
Machine learning interatomic potential (MLIP) overcomes the challenges of high computational costs in density-functional theory and the relatively low accuracy in classical large-scale molecular dynamics, facilitating more efficient and precise simulations in materials research and design. In this review, the current state of the four essential stages of MLIP is discussed, including data generation methods, material structure descriptors, six unique machine learning algorithms, and available software. Furthermore, the applications of MLIP in various fields are investigated, notably in phase-change memory materials, structure searching, material properties predicting, and the pre-trained universal models. Eventually, the future perspectives, consisting of standard datasets, transferability, generalization, and trade-off between accuracy and complexity in MLIPs, are reported.
Collapse
Affiliation(s)
- Guanjie Wang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
- School of Integrated Circuit Science and Engineering, Beihang University, Beijing 100191, China
| | - Changrui Wang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Xuanguang Zhang
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Zefeng Li
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Jian Zhou
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| | - Zhimei Sun
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
| |
Collapse
|
4
|
Back S, Aspuru-Guzik A, Ceriotti M, Gryn'ova G, Grzybowski B, Gu GH, Hein J, Hippalgaonkar K, Hormázabal R, Jung Y, Kim S, Kim WY, Moosavi SM, Noh J, Park C, Schrier J, Schwaller P, Tsuda K, Vegge T, von Lilienfeld OA, Walsh A. Accelerated chemical science with AI. DIGITAL DISCOVERY 2024; 3:23-33. [PMID: 38239898 PMCID: PMC10793638 DOI: 10.1039/d3dd00213f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 12/06/2023] [Indexed: 01/22/2024]
Abstract
In light of the pressing need for practical materials and molecular solutions to renewable energy and health problems, to name just two examples, one wonders how to accelerate research and development in the chemical sciences, so as to address the time it takes to bring materials from initial discovery to commercialization. Artificial intelligence (AI)-based techniques, in particular, are having a transformative and accelerating impact on many if not most, technological domains. To shed light on these questions, the authors and participants gathered in person for the ASLLA Symposium on the theme of 'Accelerated Chemical Science with AI' at Gangneung, Republic of Korea. We present the findings, ideas, comments, and often contentious opinions expressed during four panel discussions related to the respective general topics: 'Data', 'New applications', 'Machine learning algorithms', and 'Education'. All discussions were recorded, transcribed into text using Open AI's Whisper, and summarized using LG AI Research's EXAONE LLM, followed by revision by all authors. For the broader benefit of current researchers, educators in higher education, and academic bodies such as associations, publishers, librarians, and companies, we provide chemistry-specific recommendations and summarize the resulting conclusions.
Collapse
Affiliation(s)
- Seoin Back
- Department of Chemical and Biomolecular Engineering, Institute of Emergent Materials, Sogang University Seoul Republic of Korea
| | - Alán Aspuru-Guzik
- Departments of Chemistry, Computer Science, University of Toronto St. George Campus Toronto ON Canada
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling (COSMO), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Ganna Gryn'ova
- Heidelberg Institute for Theoretical Studies (HITS gGmbH) 69118 Heidelberg Germany
- Interdisciplinary Center for Scientific Computing, Heidelberg University 69120 Heidelberg Germany
| | - Bartosz Grzybowski
- Center for Algorithmic and Robotized Synthesis (CARS), Institute for Basic Science (IBS) Ulsan Republic of Korea
- Institute of Organic Chemistry, Polish Academy of Sciences Warsaw Poland
- Department of Chemistry, Ulsan National Institute of Science and Technology Ulsan Republic of Korea
| | - Geun Ho Gu
- Department of Energy Engineering, Korea Institute of Energy Technology (KENTECH) Naju 58330 Republic of Korea
| | - Jason Hein
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Kedar Hippalgaonkar
- School of Materials Science and Engineering, Nanyang Technological University 50 Nanyang Avenue Singapore 639798 Singapore
- Institute of Materials Research and Engineering, Agency for Science Technology and Research 2 Fusionopolis Way, 08-03 Singapore 138634 Singapore
| | | | - Yousung Jung
- Department of Chemical and Biomolecular Engineering, KAIST Daejeon Republic of Korea
- School of Chemical and Biological Engineering, Interdisciplinary Program in Artificial Intelligence, Seoul National University 1 Gwanak-ro, Gwanak-gu Seoul 08826 Republic of Korea
| | - Seonah Kim
- Department of Chemistry, Colorado State University 1301 Center Avenue Fort Collins CO 80523 USA
| | - Woo Youn Kim
- Department of Chemistry, KAIST Daejeon Republic of Korea
| | - Seyed Mohamad Moosavi
- Chemical Engineering & Applied Chemistry, University of Toronto Toronto Ontario M5S 3E5 Canada
| | - Juhwan Noh
- Chemical Data-Driven Research Center, Korea Research Institute of Chemical Technology Daejeon 34114 Republic of Korea
| | | | - Joshua Schrier
- Department of Chemistry, Fordham University The Bronx NY 10458 USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence (LIAC) & National Centre of Competence in Research (NCCR) Catalysis, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Koji Tsuda
- Graduate School of Frontier Sciences, The University of Tokyo Kashiwa Chiba 277-8561 Japan
- Center for Basic Research on Materials, National Institute for Materials Science Tsukuba Ibaraki 305-0044 Japan
- RIKEN Center for Advanced Intelligence Project Tokyo 103-0027 Japan
| | - Tejs Vegge
- Department of Energy Conversion and Storage, Technical University of Denmark 301 Anker Engelunds vej, Kongens Lyngby Copenhagen 2800 Denmark
| | - O Anatole von Lilienfeld
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
- Departments of Chemistry, Materials Science and Engineering, and Physics, University of Toronto, St George Campus Toronto ON Canada
- Machine Learning Group, Technische Universität Berlin and Berlin Institute for the Foundations of Learning and Data 10587 Berlin Germany
| | - Aron Walsh
- Department of Materials, Imperial College London London SW7 2AZ UK
- Department of Physics, Ewha Women's University Seoul Republic of Korea
| |
Collapse
|
5
|
de Raffele D, Ilie IM. Unlocking novel therapies: cyclic peptide design for amyloidogenic targets through synergies of experiments, simulations, and machine learning. Chem Commun (Camb) 2024; 60:632-645. [PMID: 38131333 DOI: 10.1039/d3cc04630c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
Existing therapies for neurodegenerative diseases like Parkinson's and Alzheimer's address only their symptoms and do not prevent disease onset. Common therapeutic agents, such as small molecules and antibodies struggle with insufficient selectivity, stability and bioavailability, leading to poor performance in clinical trials. Peptide-based therapeutics are emerging as promising candidates, with successful applications for cardiovascular diseases and cancers due to their high bioavailability, good efficacy and specificity. In particular, cyclic peptides have a long in vivo stability, while maintaining a robust antibody-like binding affinity. However, the de novo design of cyclic peptides is challenging due to the lack of long-lived druggable pockets of the target polypeptide, absence of exhaustive conformational distributions of the target and/or the binder, unknown binding site, methodological limitations, associated constraints (failed trials, time, money) and the vast combinatorial sequence space. Hence, efficient alignment and cooperation between disciplines, and synergies between experiments and simulations complemented by popular techniques like machine-learning can significantly speed up the therapeutic cyclic-peptide development for neurodegenerative diseases. We review the latest advancements in cyclic peptide design against amyloidogenic targets from a computational perspective in light of recent advancements and potential of machine learning to optimize the design process. We discuss the difficulties encountered when designing novel peptide-based inhibitors and we propose new strategies incorporating experiments, simulations and machine learning to design cyclic peptides to inhibit the toxic propagation of amyloidogenic polypeptides. Importantly, these strategies extend beyond the mere design of cyclic peptides and serve as template for the de novo generation of (bio)materials with programmable properties.
Collapse
Affiliation(s)
- Daria de Raffele
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| | - Ioana M Ilie
- University of Amsterdam, van 't Hoff Institute for Molecular Sciences, Science Park 904, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands.
- Amsterdam Center for Multiscale Modeling (ACMM), University of Amsterdam, P.O. Box 94157, 1090 GD Amsterdam, The Netherlands
| |
Collapse
|
6
|
Chen Z, Yang W. Development of a machine learning finite-range nonlocal density functional. J Chem Phys 2024; 160:014105. [PMID: 38180254 DOI: 10.1063/5.0179149] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Accepted: 12/12/2023] [Indexed: 01/06/2024] Open
Abstract
Kohn-Sham density functional theory has been the most popular method in electronic structure calculations. To fulfill the increasing accuracy requirements, new approximate functionals are needed to address key issues in existing approximations. It is well known that nonlocal components are crucial. Current nonlocal functionals mostly require orbital dependence such as in Hartree-Fock exchange and many-body perturbation correlation energy, which, however, leads to higher computational costs. Deviating from this pathway, we describe functional nonlocality in a new approach. By partitioning the total density to atom-centered local densities, a many-body expansion is proposed. This many-body expansion can be truncated at one-body contributions, if a base functional is used and an energy correction is approximated. The contribution from each atom-centered local density is a single finite-range nonlocal functional that is universal for all atoms. We then use machine learning to develop this universal atom-centered functional. Parameters in this functional are determined by fitting to data that are produced by high-level theories. Extensive tests on several different test sets, which include reaction energies, reaction barrier heights, and non-covalent interaction energies, show that the new functional, with only the density as the basic variable, can produce results comparable to the best-performing double-hybrid functionals, (for example, for the thermochemistry test set selected from the GMTKN55 database, BLYP based machine learning functional gives a weighted total mean absolute deviations of 3.33 kcal/mol, while DSD-BLYP-D3(BJ) gives 3.28 kcal/mol) with a lower computational cost. This opens a new pathway to nonlocal functional development and applications.
Collapse
Affiliation(s)
- Zehua Chen
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry and Department of Physics, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
7
|
Demir Gİ, Tekin A. NICE-FF: A non-empirical, intermolecular, consistent, and extensible force field for nucleic acids and beyond. J Chem Phys 2023; 159:244117. [PMID: 38153156 DOI: 10.1063/5.0176641] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Accepted: 12/04/2023] [Indexed: 12/29/2023] Open
Abstract
A new non-empirical ab initio intermolecular force field (NICE-FF in buffered 14-7 potential form) has been developed for nucleic acids and beyond based on the dimer interaction energies (IEs) calculated at the spin component scaled-MI-second order Møller-Plesset perturbation theory. A fully automatic framework has been implemented for this purpose, capable of generating well-polished computational grids, performing the necessary ab initio calculations, conducting machine learning (ML) assisted force field (FF) parametrization, and extending existing FF parameters by incorporating new atom types. For the ML-assisted parametrization of NICE-FF, interaction energies of ∼18 000 dimer geometries (with IE < 0) were used, and the best fit gave a mean square deviation of about 0.46 kcal/mol. During this parametrization, atom types apparent in four deoxyribonucleic acid (DNA) bases have been first trained using the generated DNA base datasets. Both uracil and hypoxanthine, which contain the same atom types found in DNA bases, have been considered as test molecules. Three new atom types have been added to the DNA atom types by using IE datasets of both pyrazinamide and 9-methylhypoxanthine. Finally, the last test molecule, theophylline, has been selected, which contains already-fitted atom-type parameters. The performance of NICE-FF has been investigated on the S22 dataset, and it has been found that NICE-FF outperforms the well-known FFs by generating the most consistent IEs with the high-level ab initio ones. Moreover, NICE-FF has been integrated into our in-house developed crystal structure prediction (CSP) tool [called FFCASP (Fast and Flexible CrystAl Structure Predictor)], aiming to find the experimental crystal structures of all considered molecules. CSPs, which were performed up to 4 formula units (Z), resulted in NICE-FF being able to locate almost all the known experimental crystal structures with sufficiently low RMSD20 values to provide good starting points for density functional theory optimizations.
Collapse
Affiliation(s)
- Gözde İniş Demir
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
| | - Adem Tekin
- Informatics Institute, Istanbul Technical University, 34469 Maslak, Istanbul, Türkiye
- Research Institute for Fundamental Sciences (TÜBİTAK-TBAE), Kocaeli, Türkiye
| |
Collapse
|
8
|
Kumar S, Jing X, Pask JE, Medford AJ, Suryanarayana P. Kohn-Sham accuracy from orbital-free density functional theory via Δ-machine learning. J Chem Phys 2023; 159:244106. [PMID: 38147461 DOI: 10.1063/5.0180541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Accepted: 11/30/2023] [Indexed: 12/28/2023] Open
Abstract
We present a Δ-machine learning model for obtaining Kohn-Sham accuracy from orbital-free density functional theory (DFT) calculations. In particular, we employ a machine-learned force field (MLFF) scheme based on the kernel method to capture the difference between Kohn-Sham and orbital-free DFT energies/forces. We implement this model in the context of on-the-fly molecular dynamics simulations and study its accuracy, performance, and sensitivity to parameters for representative systems. We find that the formalism not only improves the accuracy of Thomas-Fermi-von Weizsäcker orbital-free energies and forces by more than two orders of magnitude but is also more accurate than MLFFs based solely on Kohn-Sham DFT while being more efficient and less sensitive to model parameters. We apply the framework to study the structure of molten Al0.88Si0.12, the results suggesting no aggregation of Si atoms, in agreement with a previous Kohn-Sham study performed at an order of magnitude smaller length and time scales.
Collapse
Affiliation(s)
- Shashikant Kumar
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Xin Jing
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
- College of Computing, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - John E Pask
- Physics Division, Lawrence Livermore National Laboratory, Livermore, California 94550, USA
| | - Andrew J Medford
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Phanish Suryanarayana
- College of Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
- College of Computing, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| |
Collapse
|
9
|
Thürlemann M, Riniker S. Hybrid classical/machine-learning force fields for the accurate description of molecular condensed-phase systems. Chem Sci 2023; 14:12661-12675. [PMID: 38020395 PMCID: PMC10646964 DOI: 10.1039/d3sc04317g] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 10/24/2023] [Indexed: 12/01/2023] Open
Abstract
Electronic structure methods offer in principle accurate predictions of molecular properties, however, their applicability is limited by computational costs. Empirical methods are cheaper, but come with inherent approximations and are dependent on the quality and quantity of training data. The rise of machine learning (ML) force fields (FFs) exacerbates limitations related to training data even further, especially for condensed-phase systems for which the generation of large and high-quality training datasets is difficult. Here, we propose a hybrid ML/classical FF model that is parametrized exclusively on high-quality ab initio data of dimers and monomers in vacuum but is transferable to condensed-phase systems. The proposed hybrid model combines our previous ML-parametrized classical model with ML corrections for situations where classical approximations break down, thus combining the robustness and efficiency of classical FFs with the flexibility of ML. Extensive validation on benchmarking datasets and experimental condensed-phase data, including organic liquids and small-molecule crystal structures, showcases how the proposed approach may promote FF development and unlock the full potential of classical FFs.
Collapse
Affiliation(s)
- Moritz Thürlemann
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| | - Sereina Riniker
- Department of Chemistry and Applied Biosciences, ETH Zürich Vladimir-Prelog-Weg 2 Zürich 8093 Switzerland
| |
Collapse
|
10
|
Abedi M, Behler J, Goldsmith CF. High-Dimensional Neural Network Potentials for Accurate Prediction of Equation of State: A Case Study of Methane. J Chem Theory Comput 2023; 19:7825-7832. [PMID: 37902963 DOI: 10.1021/acs.jctc.3c00469] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/01/2023]
Abstract
Machine learning-based interatomic potentials, such as those provided by neural networks, are increasingly important in molecular dynamics simulations. In the present work, we consider the applicability and robustness of machine learning molecular dynamics to predict the equation of state properties of methane by using high-dimensional neural network potentials (HDNNPs). We investigate two different strategies for generating training data: one strategy based upon bulk representations using periodic cells and another strategy based upon clusters of molecules. We assess the accuracy of the trained potentials by predicting the equilibrium mass density for a wide range of thermodynamic conditions to characterize the liquid phase, supercritical fluid, and gas phase, as well as the liquid-vapor coexistence curve. Our results show an excellent agreement with reference phase diagrams, with an average error below ∼2% for all studied phases. Moreover, we confirm the applicability of models trained on cluster data sets for producing accurate and reliable results.
Collapse
Affiliation(s)
- Mostafa Abedi
- School of Engineering, Brown University, Providence, Rhode Island 02906, United States
| | - Jörg Behler
- Lehrstuhl für Theoretische Chemie II, Ruhr-Universität Bochum, 44780 Bochum, Germany
- Research Center Chemical Sciences and Sustainability, Research Alliance Ruhr, 44780 Bochum, Germany
| | - C Franklin Goldsmith
- School of Engineering, Brown University, Providence, Rhode Island 02906, United States
| |
Collapse
|
11
|
Langer MF, Frank JT, Knoop F. Stress and heat flux via automatic differentiation. J Chem Phys 2023; 159:174105. [PMID: 37921248 DOI: 10.1063/5.0155760] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/23/2023] [Accepted: 09/25/2023] [Indexed: 11/04/2023] Open
Abstract
Machine-learning potentials provide computationally efficient and accurate approximations of the Born-Oppenheimer potential energy surface. This potential determines many materials properties and simulation techniques usually require its gradients, in particular forces and stress for molecular dynamics, and heat flux for thermal transport properties. Recently developed potentials feature high body order and can include equivariant semi-local interactions through message-passing mechanisms. Due to their complex functional forms, they rely on automatic differentiation (AD), overcoming the need for manual implementations or finite-difference schemes to evaluate gradients. This study discusses how to use AD to efficiently obtain forces, stress, and heat flux for such potentials, and provides a model-independent implementation. The method is tested on the Lennard-Jones potential, and then applied to predict cohesive properties and thermal conductivity of tin selenide using an equivariant message-passing neural network potential.
Collapse
Affiliation(s)
- Marcel F Langer
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- BIFOLD-Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
- The NOMAD Laboratory at the Fritz Haber Institute of the Max Planck Society and Humboldt University, Berlin, Germany
| | - J Thorben Frank
- Machine Learning Group, Technische Universität Berlin, 10587 Berlin, Germany
- BIFOLD-Berlin Institute for the Foundations of Learning and Data, Berlin, Germany
| | - Florian Knoop
- Theoretical Physics Division, Department of Physics, Chemistry and Biology (IFM), Linköping University, SE-581 83 Linköping, Sweden
| |
Collapse
|
12
|
Firaha D, Liu YM, van de Streek J, Sasikumar K, Dietrich H, Helfferich J, Aerts L, Braun DE, Broo A, DiPasquale AG, Lee AY, Le Meur S, Nilsson Lill SO, Lunsmann WJ, Mattei A, Muglia P, Putra OD, Raoui M, Reutzel-Edens SM, Rome S, Sheikh AY, Tkatchenko A, Woollam GR, Neumann MA. Predicting crystal form stability under real-world conditions. Nature 2023; 623:324-328. [PMID: 37938708 PMCID: PMC10632141 DOI: 10.1038/s41586-023-06587-3] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 08/30/2023] [Indexed: 11/09/2023]
Abstract
The physicochemical properties of molecular crystals, such as solubility, stability, compactability, melting behaviour and bioavailability, depend on their crystal form1. In silico crystal form selection has recently come much closer to realization because of the development of accurate and affordable free-energy calculations2-4. Here we redefine the state of the art, primarily by improving the accuracy of free-energy calculations, constructing a reliable experimental benchmark for solid-solid free-energy differences, quantifying statistical errors for the computed free energies and placing both hydrate crystal structures of different stoichiometries and anhydrate crystal structures on the same energy landscape, with defined error bars, as a function of temperature and relative humidity. The calculated free energies have standard errors of 1-2 kJ mol-1 for industrially relevant compounds, and the method to place crystal structures with different hydrate stoichiometries on the same energy landscape can be extended to other multi-component systems, including solvates. These contributions reduce the gap between the needs of the experimentalist and the capabilities of modern computational tools, transforming crystal structure prediction into a more reliable and actionable procedure that can be used in combination with experimental evidence to direct crystal form selection and establish control5.
Collapse
Affiliation(s)
| | | | | | | | | | - Julian Helfferich
- Avant-garde Materials Simulation, Merzhausen, Germany
- JobRad, Freiburg, Germany
| | - Luc Aerts
- UCB Pharma SA, Chemin du Foriest, Braine-l'Alleud, Belgium
| | - Doris E Braun
- Institute of Pharmacy, University of Innsbruck, Innsbruck, Austria
| | - Anders Broo
- Data Science and Modelling, Pharmaceutical Sciences, R&D, AstraZeneca Gothenburg, Mölndal, Sweden
| | | | - Alfred Y Lee
- Merck, Analytical Research & Development, Rahway, NJ, USA
| | - Sarah Le Meur
- UCB Pharma SA, Chemin du Foriest, Braine-l'Alleud, Belgium
| | - Sten O Nilsson Lill
- Data Science and Modelling, Pharmaceutical Sciences, R&D, AstraZeneca Gothenburg, Mölndal, Sweden
| | | | - Alessandra Mattei
- Solid State Chemistry, Research & Development, AbbVie, North Chicago, IL, USA
| | | | - Okky Dwichandra Putra
- Early Product Development and Manufacturing, Pharmaceutical Sciences R&D, AstraZeneca Gothenburg, Mölndal, Sweden
| | | | - Susan M Reutzel-Edens
- Cambridge Crystallographic Data Centre, Cambridge, UK
- SuRE Pharma Consulting, Zionsville, IN, USA
| | - Sandrine Rome
- UCB Pharma SA, Chemin du Foriest, Braine-l'Alleud, Belgium
| | - Ahmad Y Sheikh
- Solid State Chemistry, Research & Development, AbbVie, North Chicago, IL, USA
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, Luxembourg City, Luxembourg
| | | | | |
Collapse
|
13
|
Illarionov A, Sakipov S, Pereyaslavets L, Kurnikov IV, Kamath G, Butin O, Voronina E, Ivahnenko I, Leontyev I, Nawrocki G, Darkhovskiy M, Olevanov M, Cherniavskyi YK, Lock C, Greenslade S, Sankaranarayanan SKRS, Kurnikova MG, Potoff J, Kornberg RD, Levitt M, Fain B. Combining Force Fields and Neural Networks for an Accurate Representation of Chemically Diverse Molecular Interactions. J Am Chem Soc 2023; 145:23620-23629. [PMID: 37856313 PMCID: PMC10623557 DOI: 10.1021/jacs.3c07628] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Indexed: 10/21/2023]
Abstract
A key goal of molecular modeling is the accurate reproduction of the true quantum mechanical potential energy of arbitrary molecular ensembles with a tractable classical approximation. The challenges are that analytical expressions found in general purpose force fields struggle to faithfully represent the intermolecular quantum potential energy surface at close distances and in strong interaction regimes; that the more accurate neural network approximations do not capture crucial physics concepts, e.g., nonadditive inductive contributions and application of electric fields; and that the ultra-accurate narrowly targeted models have difficulty generalizing to the entire chemical space. We therefore designed a hybrid wide-coverage intermolecular interaction model consisting of an analytically polarizable force field combined with a short-range neural network correction for the total intermolecular interaction energy. Here, we describe the methodology and apply the model to accurately determine the properties of water, the free energy of solvation of neutral and charged molecules, and the binding free energy of ligands to proteins. The correction is subtyped for distinct chemical species to match the underlying force field, to segment and reduce the amount of quantum training data, and to increase accuracy and computational speed. For the systems considered, the hybrid ab initio parametrized Hamiltonian reproduces the two-body dimer quantum mechanics (QM) energies to within 0.03 kcal/mol and the nonadditive many-molecule contributions to within 2%. Simulations of molecular systems using this interaction model run at speeds of several nanoseconds per day.
Collapse
Affiliation(s)
- Alexey Illarionov
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Serzhan Sakipov
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Leonid Pereyaslavets
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Igor V. Kurnikov
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Ganesh Kamath
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Oleg Butin
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Ekaterina Voronina
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
- Lomonosov
MSU, Skobeltsyn Institute of Nuclear Physics, Moscow, 119991, Russia
| | - Ilya Ivahnenko
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Igor Leontyev
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Grzegorz Nawrocki
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Mikhail Darkhovskiy
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Michael Olevanov
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
- Lomonosov
MSU, Dept. of Physics, Moscow, 119991, Russia
| | - Yevhen K. Cherniavskyi
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Christopher Lock
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
- Department
of Neurology and Neurological Sciences, Stanford University School of Medicine, Palo Alto, California 94304, United States
| | - Sean Greenslade
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| | - Subramanian KRS Sankaranarayanan
- Center
for Nanoscale Materials, Argonne National
Lab, Argonne, Illinois 604391, United States
- Department
of Mechanical and Industrial Engineering, University of Illinois, Chicago, Illinois 60607, United States
| | - Maria G. Kurnikova
- Department
of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Jeffrey Potoff
- Department
of Chemical Engineering and Materials Science, Wayne State University, Detroit, Michigan 48202, United States
| | - Roger D. Kornberg
- Department
of Structural Biology, Stanford University
School of Medicine, Stanford, California 94304, United States
| | - Michael Levitt
- Department
of Structural Biology, Stanford University
School of Medicine, Stanford, California 94304, United States
| | - Boris Fain
- InterX
Inc. (a Subsidiary of NeoTX Therapeutics Ltd.), 805 Allston Way, Berkeley, California 94710, United States
| |
Collapse
|
14
|
Hu F, He F, Yaron DJ. Treating Semiempirical Hamiltonians as Flexible Machine Learning Models Yields Accurate and Interpretable Results. J Chem Theory Comput 2023; 19:6185-6196. [PMID: 37705220 PMCID: PMC10536991 DOI: 10.1021/acs.jctc.3c00491] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Indexed: 09/15/2023]
Abstract
Quantum chemistry provides chemists with invaluable information, but the high computational cost limits the size and type of systems that can be studied. Machine learning (ML) has emerged as a means to dramatically lower the cost while maintaining high accuracy. However, ML models often sacrifice interpretability by using components such as the artificial neural networks of deep learning that function as black boxes. These components impart the flexibility needed to learn from large volumes of data but make it difficult to gain insight into the physical or chemical basis for the predictions. Here, we demonstrate that semiempirical quantum chemical (SEQC) models can learn from large volumes of data without sacrificing interpretability. The SEQC model is that of density-functional-based tight binding (DFTB) with fixed atomic orbital energies and interactions that are one-dimensional functions of the interatomic distance. This model is trained to ab initio data in a manner that is analogous to that used to train deep learning models. Using benchmarks that reflect the accuracy of the training data, we show that the resulting model maintains a physically reasonable functional form while achieving an accuracy, relative to coupled cluster energies with a complete basis set extrapolation (CCSD(T)*/CBS), that is comparable to that of density functional theory (DFT). This suggests that trained SEQC models can achieve a low computational cost and high accuracy without sacrificing interpretability. Use of a physically motivated model form also substantially reduces the amount of ab initio data needed to train the model compared to that required for deep learning models.
Collapse
Affiliation(s)
- Frank Hu
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - Francis He
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| | - David J. Yaron
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, United States
| |
Collapse
|
15
|
Wang B, Wu Y, Liu D, Vasenko AS, Casanova D, Prezhdo OV. Efficient Modeling of Quantum Dynamics of Charge Carriers in Materials Using Short Nonequilibrium Molecular Dynamics. J Phys Chem Lett 2023; 14:8289-8295. [PMID: 37681642 PMCID: PMC10518862 DOI: 10.1021/acs.jpclett.3c02187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2023] [Accepted: 09/05/2023] [Indexed: 09/09/2023]
Abstract
Nonadiabatic molecular dynamics provides essential insights into excited-state processes, but it is computationally intense and simplifications are needed. The classical path approximation provides critical savings. Still, long heating and equilibration steps are required. We demonstrate that practical results can be obtained with short, partially equilibrated ab initio trajectories. Once the system's structure is adequate and essential fluctuations are sampled, the nonadiabatic Hamiltonian can be constructed. Local structures require only 1-2 ps trajectories, as demonstrated with point defects in metal halide perovskites. Short trajectories represent anharmonic motions common in defective structures, an essential improvement over the harmonic approximation around the optimized geometry. Glassy systems, such as grain boundaries, require different simulation protocols, e.g., involving machine learning force fields. 10-fold shorter trajectories generate 10-20% time scale errors, which are acceptable, given experimental uncertainties and other approximations. The practical NAMD protocol enables fast screening of excited-state dynamics for rapid exploration of new materials.
Collapse
Affiliation(s)
- Bipeng Wang
- Department
of Chemical Engineering, University of Southern
California, Los Angeles, California 90089, United States
| | - Yifan Wu
- Department
of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | | | - Andrey S. Vasenko
- HSE
University, 101000 Moscow, Russia
- Donostia
International Physics Center (DIPC), 20018 San Sebastián-Donostia, Euskadi, Spain
| | - David Casanova
- Donostia
International Physics Center (DIPC), 20018 San Sebastián-Donostia, Euskadi, Spain
- IKERBASQUE, Basque Foundation for Science, 48009 Bilbao, Euskadi, Spain
| | - Oleg V. Prezhdo
- Department
of Chemical Engineering, University of Southern
California, Los Angeles, California 90089, United States
- Department
of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
16
|
M V, Singh S, Bononi F, Andreussi O, Karmodak N. Thermodynamic and kinetic modeling of electrocatalytic reactions using a first-principles approach. J Chem Phys 2023; 159:111001. [PMID: 37728202 DOI: 10.1063/5.0165835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Accepted: 08/28/2023] [Indexed: 09/21/2023] Open
Abstract
The computational modeling of electrochemical interfaces and their applications in electrocatalysis has attracted great attention in recent years. While tremendous progress has been made in this area, however, the accurate atomistic descriptions at the electrode/electrolyte interfaces remain a great challenge. The Computational Hydrogen Electrode (CHE) method and continuum modeling of the solvent and electrolyte interactions form the basis for most of these methodological developments. Several posterior corrections have been added to the CHE method to improve its accuracy and widen its applications. The most recently developed grand canonical potential approaches with the embedded diffuse layer models have shown considerable improvement in defining interfacial interactions at electrode/electrolyte interfaces over the state-of-the-art computational models for electrocatalysis. In this Review, we present an overview of these different computational models developed over the years to quantitatively probe the thermodynamics and kinetics of electrochemical reactions in the presence of an electrified catalyst surface under various electrochemical environments. We begin our discussion by giving a brief picture of the different continuum solvation approaches, implemented within the ab initio method to effectively model the solvent and electrolyte interactions. Next, we present the thermodynamic and kinetic modeling approaches to determine the activity and stability of the electrocatalysts. A few applications to these approaches are also discussed. We conclude by giving an outlook on the different machine learning models that have been integrated with the thermodynamic approaches to improve their efficiency and widen their applicability.
Collapse
Affiliation(s)
- Vasanthapandiyan M
- Department of Chemistry, Shiv Nadar Institution of Eminence, Dadri, Gautam Buddha Nagar, Uttar Pradesh 201314, India
| | - Shagun Singh
- Department of Chemistry, Shiv Nadar Institution of Eminence, Dadri, Gautam Buddha Nagar, Uttar Pradesh 201314, India
| | - Fernanda Bononi
- Department of Physics, University of North Texas, Denton, Texas 76203, USA
| | - Oliviero Andreussi
- Department of Chemistry and Biochemistry, Boise State University, Boise, Idaho 83725, USA
| | - Naiwrit Karmodak
- Department of Chemistry, Shiv Nadar Institution of Eminence, Dadri, Gautam Buddha Nagar, Uttar Pradesh 201314, India
| |
Collapse
|
17
|
Sowa JK, Roberts ST, Rossky PJ. Exploring Configurations of Nanocrystal Ligands Using Machine-Learned Force Fields. J Phys Chem Lett 2023; 14:7215-7222. [PMID: 37552568 DOI: 10.1021/acs.jpclett.3c01618] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/10/2023]
Abstract
Semiconducting nanocrystals passivated with organic ligands have emerged as a powerful platform for light harvesting, light-driven chemical reactions, and sensing. Due to their complexity and size, little structural information is available from experiments, making these systems challenging to model computationally. Here, we develop a machine-learned force field trained on DFT data and use it to investigate the surface chemistry of a PbS nanocrystal interfaced with acetate ligands. In doing so, we go beyond considering individual local minimum energy geometries and, importantly, circumvent a precarious issue associated with the assumption of a single assigned atomic partial charge for each element in a nanocrystal, independent of its structural position. We demonstrate that the carboxylate ligands passivate the metal-rich surfaces by adopting a very wide range of "tilted-bridge" and "bridge" geometries and investigate the corresponding ligand IR spectrum. This work illustrates the potential of machine-learned force fields to transform computational modeling of these materials.
Collapse
Affiliation(s)
- Jakub K Sowa
- Department of Chemistry, Rice University, Houston, Texas 77005, United States
- Center for Adapting Flaws into Features, Rice University, Houston, Texas 77005, United States
| | - Sean T Roberts
- Department of Chemistry, The University of Texas at Austin, Austin, Texas 78712, United States
- Center for Adapting Flaws into Features, Rice University, Houston, Texas 77005, United States
| | - Peter J Rossky
- Department of Chemistry, Rice University, Houston, Texas 77005, United States
- Center for Adapting Flaws into Features, Rice University, Houston, Texas 77005, United States
| |
Collapse
|
18
|
Lanjan A, Moradi Z, Srinivasan S. Computational Framework Combining Quantum Mechanics, Molecular Dynamics, and Deep Neural Networks to Evaluate the Intrinsic Properties of Materials. J Phys Chem A 2023; 127:6603-6613. [PMID: 37497552 DOI: 10.1021/acs.jpca.3c02887] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/28/2023]
Abstract
The design and evaluation of future nanomaterials with specific properties is a challenging task as the current traditional methods rely on trial and error approaches that are time-consuming and expensive. On the computational front, design tools such as molecular dynamics (MD) simulations help us reduce the costs and times. However, nonbonded potential parameters, the key input parameters for an MD simulation, are usually not available for designing and studying new materials. Resolving this, quantum mechanics (QM) calculations could be used to evaluate the system's energy as a function of the nonbonded distances, and the resulting data set could be fit to a generic potential equation to obtain the fitting constants (potential parameters). However, fitting this massive data set containing thousands of unknown parameters using traditional mathematical formulations is not feasible. Hence, most computational frameworks in the literature utilize several simplifications, leading to a severe loss of accuracy. Addressing this deficiency, in this work, we propose a multi-scale framework that couples QM calculations and MD with advanced deep neural networks to determine the potential parameters. This advanced framework has been extensively validated by employing it to predict properties such as the density, boiling point, and melting point of five different types of molecules that are well-understood, namely, the polar molecule H2O, ionic compound LiPF6, ethanol (C2H5OH), long-chain molecule C8H18, and the complex molecular system ethylene carbonate (EC).
Collapse
Affiliation(s)
- Amirmasoud Lanjan
- Department of Mechanical Engineering, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | - Zahra Moradi
- W Booth School of Engineering Practice and Technology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| | - Seshasai Srinivasan
- Department of Mechanical Engineering, McMaster University, Hamilton, Ontario L8S 4K1, Canada
- W Booth School of Engineering Practice and Technology, McMaster University, Hamilton, Ontario L8S 4K1, Canada
| |
Collapse
|
19
|
Liu D, Wu J, Lu D. Transferability evaluation of the deep potential model for simulating water-graphene confined system. J Chem Phys 2023; 159:044712. [PMID: 37522409 DOI: 10.1063/5.0153196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2023] [Accepted: 07/11/2023] [Indexed: 08/01/2023] Open
Abstract
Machine learning potentials (MLPs) are poised to combine the accuracy of ab initio predictions with the computational efficiency of classical molecular dynamics (MD) simulation. While great progress has been made over the last two decades in developing MLPs, there is still much to be done to evaluate their model transferability and facilitate their development. In this work, we construct two deep potential (DP) models for liquid water near graphene surfaces, Model S and Model F, with the latter having more training data. A concurrent learning algorithm (DP-GEN) is adopted to explore the configurational space beyond the scope of conventional ab initio MD simulation. By examining the performance of Model S, we find that an accurate prediction of atomic force does not imply an accurate prediction of system energy. The deviation from the relative atomic force alone is insufficient to assess the accuracy of the DP models. Based on the performance of Model F, we propose that the relative magnitude of the model deviation and the corresponding root-mean-square error of the original test dataset, including energy and atomic force, can serve as an indicator for evaluating the accuracy of the model prediction for a given structure, which is particularly applicable for large systems where density functional theory calculations are infeasible. In addition to the prediction accuracy of the model described above, we also briefly discuss simulation stability and its relationship to the former. Both are important aspects in assessing the transferability of the MLP model.
Collapse
Affiliation(s)
- Dongfei Liu
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, People's Republic of China
| | - Jianzhong Wu
- Department of Chemical and Environmental Engineering, University of California, Riverside, California 92521, USA
| | - Diannan Lu
- Department of Chemical Engineering, Tsinghua University, Beijing 100084, People's Republic of China
| |
Collapse
|
20
|
Zhang P, Yang W. Toward a general neural network force field for protein simulations: Refining the intramolecular interaction in protein. J Chem Phys 2023; 159:024118. [PMID: 37431910 PMCID: PMC10481389 DOI: 10.1063/5.0142280] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 06/22/2023] [Indexed: 07/12/2023] Open
Abstract
Molecular dynamics (MD) is an extremely powerful, highly effective, and widely used approach to understanding the nature of chemical processes in atomic details for proteins. The accuracy of results from MD simulations is highly dependent on force fields. Currently, molecular mechanical (MM) force fields are mainly utilized in MD simulations because of their low computational cost. Quantum mechanical (QM) calculation has high accuracy, but it is exceedingly time consuming for protein simulations. Machine learning (ML) provides the capability for generating accurate potential at the QM level without increasing much computational effort for specific systems that can be studied at the QM level. However, the construction of general machine learned force fields, needed for broad applications and large and complex systems, is still challenging. Here, general and transferable neural network (NN) force fields based on CHARMM force fields, named CHARMM-NN, are constructed for proteins by training NN models on 27 fragments partitioned from the residue-based systematic molecular fragmentation (rSMF) method. The NN for each fragment is based on atom types and uses new input features that are similar to MM inputs, including bonds, angles, dihedrals, and non-bonded terms, which enhance the compatibility of CHARMM-NN to MM MD and enable the implementation of CHARMM-NN force fields in different MD programs. While the main part of the energy of the protein is based on rSMF and NN, the nonbonded interactions between the fragments and with water are taken from the CHARMM force field through mechanical embedding. The validations of the method for dipeptides on geometric data, relative potential energies, and structural reorganization energies demonstrate that the CHARMM-NN local minima on the potential energy surface are very accurate approximations to QM, showing the success of CHARMM-NN for bonded interactions. However, the MD simulations on peptides and proteins indicate that more accurate methods to represent protein-water interactions in fragments and non-bonded interactions between fragments should be considered in the future improvement of CHARMM-NN, which can increase the accuracy of approximation beyond the current mechanical embedding QM/MM level.
Collapse
Affiliation(s)
- Pan Zhang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| | - Weitao Yang
- Department of Chemistry, Duke University, Durham, North Carolina 27708, USA
| |
Collapse
|
21
|
Dou B, Zhu Z, Merkurjev E, Ke L, Chen L, Jiang J, Zhu Y, Liu J, Zhang B, Wei GW. Machine Learning Methods for Small Data Challenges in Molecular Science. Chem Rev 2023; 123:8736-8780. [PMID: 37384816 PMCID: PMC10999174 DOI: 10.1021/acs.chemrev.3c00189] [Citation(s) in RCA: 15] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/01/2023]
Abstract
Small data are often used in scientific and engineering research due to the presence of various constraints, such as time, cost, ethics, privacy, security, and technical limitations in data acquisition. However, big data have been the focus for the past decade, small data and their challenges have received little attention, even though they are technically more severe in machine learning (ML) and deep learning (DL) studies. Overall, the small data challenge is often compounded by issues, such as data diversity, imputation, noise, imbalance, and high-dimensionality. Fortunately, the current big data era is characterized by technological breakthroughs in ML, DL, and artificial intelligence (AI), which enable data-driven scientific discovery, and many advanced ML and DL technologies developed for big data have inadvertently provided solutions for small data problems. As a result, significant progress has been made in ML and DL for small data challenges in the past decade. In this review, we summarize and analyze several emerging potential solutions to small data challenges in molecular science, including chemical and biological sciences. We review both basic machine learning algorithms, such as linear regression, logistic regression (LR), k-nearest neighbor (KNN), support vector machine (SVM), kernel learning (KL), random forest (RF), and gradient boosting trees (GBT), and more advanced techniques, including artificial neural network (ANN), convolutional neural network (CNN), U-Net, graph neural network (GNN), Generative Adversarial Network (GAN), long short-term memory (LSTM), autoencoder, transformer, transfer learning, active learning, graph-based semi-supervised learning, combining deep learning with traditional machine learning, and physical model-based data augmentation. We also briefly discuss the latest advances in these methods. Finally, we conclude the survey with a discussion of promising trends in small data challenges in molecular science.
Collapse
Affiliation(s)
- Bozheng Dou
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Zailiang Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Ekaterina Merkurjev
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Lu Ke
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Long Chen
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jian Jiang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
| | - Yueying Zhu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Jie Liu
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Bengong Zhang
- Research Center of Nonlinear Science, School of Mathematical and Physical Sciences,Wuhan Textile University, Wuhan 430200, P, R. China
| | - Guo-Wei Wei
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Electrical and Computer Engineering, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
22
|
Kabylda A, Vassilev-Galindo V, Chmiela S, Poltavsky I, Tkatchenko A. Efficient interatomic descriptors for accurate machine learning force fields of extended molecules. Nat Commun 2023; 14:3562. [PMID: 37322039 DOI: 10.1038/s41467-023-39214-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2022] [Accepted: 05/17/2023] [Indexed: 06/17/2023] Open
Abstract
Machine learning force fields (MLFFs) are gradually evolving towards enabling molecular dynamics simulations of molecules and materials with ab initio accuracy but at a small fraction of the computational cost. However, several challenges remain to be addressed to enable predictive MLFF simulations of realistic molecules, including: (1) developing efficient descriptors for non-local interatomic interactions, which are essential to capture long-range molecular fluctuations, and (2) reducing the dimensionality of the descriptors to enhance the applicability and interpretability of MLFFs. Here we propose an automatized approach to substantially reduce the number of interatomic descriptor features while preserving the accuracy and increasing the efficiency of MLFFs. To simultaneously address the two stated challenges, we illustrate our approach on the example of the global GDML MLFF. We found that non-local features (atoms separated by as far as 15 Å in studied systems) are crucial to retain the overall accuracy of the MLFF for peptides, DNA base pairs, fatty acids, and supramolecular complexes. Interestingly, the number of required non-local features in the reduced descriptors becomes comparable to the number of local interatomic features (those below 5 Å). These results pave the way to constructing global molecular MLFFs whose cost increases linearly, instead of quadratically, with system size.
Collapse
Affiliation(s)
- Adil Kabylda
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Valentin Vassilev-Galindo
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Stefan Chmiela
- Machine Learning Group, Technische Universität Berlin, 10587, Berlin, Germany
- BIFOLD - Berlin Institute for the Foundations of Learning and Data, 10587, Berlin, Germany
| | - Igor Poltavsky
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511, Luxembourg City, Luxembourg.
| |
Collapse
|
23
|
Kümmerer F, Orioli S, Lindorff-Larsen K. Fitting Force Field Parameters to NMR Relaxation Data. J Chem Theory Comput 2023. [PMID: 37276045 DOI: 10.1021/acs.jctc.3c00174] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
We present an approach to optimize force field parameters using time-dependent data from NMR relaxation experiments. To do so, we scan parameters in the dihedral angle potential energy terms describing the rotation of the methyl groups in proteins and compare NMR relaxation rates calculated from molecular dynamics simulations with the modified force fields to deuterium relaxation measurements of T4 lysozyme. We find that a small modification of Cγ methyl groups improves the agreement with experiments both for the protein used to optimize the force field and when validating using simulations of CI2 and ubiquitin. We also show that these improvements enable a more effective a posteriori reweighting of the MD trajectories. The resulting force field thus enables more direct comparison between simulations and side-chain NMR relaxation data and makes it possible to construct ensembles that better represent the dynamics of proteins in solution.
Collapse
Affiliation(s)
- Felix Kümmerer
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| | - Simone Orioli
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
- Structural Biophysics, Niels Bohr Institute, Faculty of Science, University of Copenhagen, DK-2100 Copenhagen Ø, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, DK-2200 Copenhagen N, Denmark
| |
Collapse
|
24
|
Jablonka K, Rosen AS, Krishnapriyan AS, Smit B. An Ecosystem for Digital Reticular Chemistry. ACS CENTRAL SCIENCE 2023; 9:563-581. [PMID: 37122448 PMCID: PMC10141625 DOI: 10.1021/acscentsci.2c01177] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/03/2023]
Abstract
The vastness of the materials design space makes it impractical to explore using traditional brute-force methods, particularly in reticular chemistry. However, machine learning has shown promise in expediting and guiding materials design. Despite numerous successful applications of machine learning to reticular materials, progress in the field has stagnated, possibly because digital chemistry is more an art than a science and its limited accessibility to inexperienced researchers. To address this issue, we present mofdscribe, a software ecosystem tailored to novice and seasoned digital chemists that streamlines the ideation, modeling, and publication process. Though optimized for reticular chemistry, our tools are versatile and can be used in nonreticular materials research. We believe that mofdscribe will enable a more reliable, efficient, and comparable field of digital chemistry.
Collapse
Affiliation(s)
- Kevin
Maik Jablonka
- Laboratory of molecular simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Rue de l’Industrie 17, CH-1951 Sion, Switzerland
| | - Andrew S. Rosen
- Department of Materials
Science and Engineering, University of California, Berkeley, California 94720, United States
- Miller Institute for Basic Research in Science, University of California, Berkeley, California 94720, United States
- Materials Science Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, United States
| | - Aditi S. Krishnapriyan
- Department of Chemical and Biomolecular Engineering, University of California, Berkeley, California 94720, United States
- Department of Electrical Engineering and
Computer Science, University of California, Berkeley, California 94720, United States
- Computational
Research Division, Lawrence Berkeley National
Laboratory, Berkeley, California 94720, United States
| | - Berend Smit
- Laboratory of molecular simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL), Rue de l’Industrie 17, CH-1951 Sion, Switzerland
- E-mail:
| |
Collapse
|
25
|
Wu J, Gao T, Guo H, Zhao L, Lv S, Lv J, Yao R, Yu Y, Ma F. Application of molecular dynamics simulation for exploring the roles of plant biomolecules in promoting environmental health. THE SCIENCE OF THE TOTAL ENVIRONMENT 2023; 869:161871. [PMID: 36708839 DOI: 10.1016/j.scitotenv.2023.161871] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/19/2022] [Revised: 01/23/2023] [Accepted: 01/24/2023] [Indexed: 06/18/2023]
Abstract
Understanding the dynamic changes of plant biomolecules is vital for exploring their mechanisms in the environment. Molecular dynamics (MD) simulation has been widely used to study structural evolution and corresponding properties of plant biomolecules at the microscopic scale. Here, this review (i) outlines structural properties of plant biomolecules, and the crucial role of MD simulation in advancing studies of the biomolecules; (ii) describes the development of MD simulation in plant biomolecules, determinants of simulation, and analysis parameters; (iii) introduces the applications of MD simulation in plant biomolecules, including the response of the biomolecules to multiple stresses, their roles in corrosive environments, and their contributions in improving environmental health; (iv) reviews techniques integrated with MD simulation, such as molecular biology, quantum mechanics, molecular docking, and machine learning modeling, which bridge gaps in MD simulation. Finally, we make suggestions on determination of force field types, investigation of plant biomolecule mechanisms, and use of MD simulation in combination with other techniques. This review provides comprehensive summaries of the mechanisms of plant biomolecules in the environment revealed by MD simulation and validates it as an applicable tool for bridging gaps between macroscopic and microscopic behavior, providing insights into the wide application of MD simulation in plant biomolecules.
Collapse
Affiliation(s)
- Jieting Wu
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China.
| | - Tian Gao
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Haijuan Guo
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Lei Zhao
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150090, People's Republic of China
| | - Sidi Lv
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Jin Lv
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Ruyi Yao
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Yanyi Yu
- School of Environmental Science, Liaoning University, Shenyang 110036, People's Republic of China
| | - Fang Ma
- State Key Laboratory of Urban Water Resource and Environment, Harbin Institute of Technology, Harbin 150090, People's Republic of China
| |
Collapse
|
26
|
Zhou Y, Ouyang Y, Zhang Y, Li Q, Wang J. Machine Learning Assisted Simulations of Electrochemical Interfaces: Recent Progress and Challenges. J Phys Chem Lett 2023; 14:2308-2316. [PMID: 36847421 DOI: 10.1021/acs.jpclett.2c03288] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
The electrochemical interface, where the adsorption of reactants and electrocatalytic reactions take place, has long been a focus of attention. Some of the important processes on it tend to possess relatively slow kinetic characteristics, which are usually beyond the scope of ab initio molecular dynamics. The newly emerging technique, machine learning methods, provides an alternative approach to achieve thousands of atoms and nanosecond time scale while ensuring precision and efficiency. In this Perspective, we summarize in detail the recent progress and achievements made by the introduction of machine learning to simulate electrochemical interfaces, and focus on the limitations of current machine learning models, such as accurate descriptions of long-range electrostatic interactions and the kinetics of the electrochemical reactions occurring at the interface. Finally, we further point out the future directions for machine learning to expand in the field of electrochemical interfaces.
Collapse
Affiliation(s)
- Yipeng Zhou
- School of Physics, Southeast University, Nanjing 211189, China
| | - Yixin Ouyang
- School of Physics, Southeast University, Nanjing 211189, China
| | - Yehui Zhang
- School of Physics, Southeast University, Nanjing 211189, China
| | - Qiang Li
- School of Physics, Southeast University, Nanjing 211189, China
| | - Jinlan Wang
- School of Physics, Southeast University, Nanjing 211189, China
| |
Collapse
|
27
|
Li W, Xue T, Mora-Perez C, Prezhdo OV. Ab initio quantum dynamics of plasmonic charge carriers. TRENDS IN CHEMISTRY 2023. [DOI: 10.1016/j.trechm.2023.02.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/19/2023]
|
28
|
Manzhos S, Tsuda S, Ihara M. Machine learning in computational chemistry: interplay between (non)linearity, basis sets, and dimensionality. Phys Chem Chem Phys 2023; 25:1546-1555. [PMID: 36562317 DOI: 10.1039/d2cp04155c] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Machine learning (ML) based methods and tools have now firmly established themselves in physical chemistry and in particular in theoretical and computational chemistry and in materials chemistry. The generality of popular ML techniques such as neural networks or kernel methods (Gaussian process and kernel ridge regression and their flavors) permitted their application to diverse problems from prediction of properties of functional materials (catalysts, solid state ionic conductors, etc.) from descriptors to the building of interatomic potentials (where ML is currently routinely used in applications) and electron density functionals. These ML techniques are assumed to have superior expressive power of nonlinear methods, and are often used "as is", with concepts such as "non-parametric" or "deep learning" used without a clear justification for their need or advantage over simpler and more robust alternatives. In this Perspective, we highlight some interrelations between popular ML techniques and traditional linear regressions and basis expansions and demonstrate that in certain regimes (such as a very high dimensionality) these approximations might collapse. We also discuss ways to recover the expressive power of a nonlinear approach and to help select hyperparameters with the help of high-dimensional model representation and to obtain elements of insight while preserving the generality of the method.
Collapse
Affiliation(s)
- Sergei Manzhos
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan.
| | - Shunsaku Tsuda
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan.
| | - Manabu Ihara
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan.
| |
Collapse
|
29
|
Yu W, Weber DJ, MacKerell AD. Computer-Aided Drug Design: An Update. Methods Mol Biol 2023; 2601:123-152. [PMID: 36445582 PMCID: PMC9838881 DOI: 10.1007/978-1-0716-2855-3_7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Computer-aided drug design (CADD) approaches are playing an increasingly important role in understanding the fundamentals of ligand-receptor interactions and helping medicinal chemists design therapeutics. About 5 years ago, we presented a chapter devoted to an overview of CADD methods and covered typical CADD protocols including structure-based drug design (SBDD) and ligand-based drug design (LBDD) approaches that were frequently used in the antibiotic drug design process. Advances in computational hardware and algorithms and emerging CADD methods are enhancing the accuracy and ability of CADD in drug design and development. In this chapter, an update to our previous chapter is provided with a focus on new CADD approaches from our laboratory and other peers that can be employed to facilitate the development of antibiotic therapeutics.
Collapse
Affiliation(s)
- Wenbo Yu
- Department of Pharmaceutical Sciences, Computer-Aided Drug Design Center, School of Pharmacy, University of Maryland, Baltimore, MD, USA.
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA.
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA.
| | - David J Weber
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA
| | - Alexander D MacKerell
- Department of Pharmaceutical Sciences, Computer-Aided Drug Design Center, School of Pharmacy, University of Maryland, Baltimore, MD, USA.
- Institute for Bioscience and Biotechnology Research (IBBR), Rockville, MD, USA.
- Center for Biomolecular Therapeutics (CBT), School of Medicine, University of Maryland, Baltimore, MD, USA.
| |
Collapse
|
30
|
Wu Y, Liu D, Chu W, Wang B, Vasenko AS, Prezhdo OV. Fluctuations at Metal Halide Perovskite Grain Boundaries Create Transient Trap States: Machine Learning Assisted Ab Initio Analysis. ACS APPLIED MATERIALS & INTERFACES 2022; 14:55753-55761. [PMID: 36475599 DOI: 10.1021/acsami.2c16203] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
All-inorganic perovskites are promising candidates for solar energy and optoelectronic applications, despite their polycrystalline nature with a large density of grain boundaries (GBs) due to facile solution-processed fabrication. GBs exhibit complex atomistic structures undergoing slow rearrangements. By studying evolution of the Σ5(210) CsPbBr3 GB on a nanosecond time scale, comparable to charge carrier lifetimes, we demonstrate that GB deformations appear every ∼100 ps and increase significantly the probability of deep charge traps. However, the deep traps form only transiently for a few hundred femtoseconds. In contrast, shallow traps appear continuously at the GB. Shallow traps are localized in the GB layer, while deep traps are in a sublayer, which is still distorted from the pristine structure and can be jammed in unfavorable conformations. The GB electronic properties correlate with bond angles, with notable exception of the Br-Br distance, which provides a signature of halide migration along GBs. The transient nature of trap states and localization of electrons and holes at different parts of GBs indicate that charge carrier lifetimes should be long. At the same time, charge mobility can be reduced. The complex, multiscale evolution of geometric and electronic structures of GBs rationalize the contradictory statements made in the literature regarding both benign and detrimental roles of GBs in perovskite performance and provide new atomistic insights into perovskite properties.
Collapse
Affiliation(s)
- Yifan Wu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | | | - Weibin Chu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
- Key Laboratory of Computational Physical Sciences (Ministry of Education), Institute of Computational Physical Sciences, Fudan University, Shanghai 200433, China
- Shanghai Qi Zhi Institute, Shanghai 200030, China
| | - Bipeng Wang
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Andrey S Vasenko
- HSE University, 101000 Moscow, Russia
- I.E. Tamm Department of Theoretical Physics, P.N. Lebedev Physical Institute, Russian Academy of Sciences, 119991 Moscow, Russia
| | - Oleg V Prezhdo
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
- Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
31
|
Dutta P, Roy P, Sengupta N. Effects of External Perturbations on Protein Systems: A Microscopic View. ACS OMEGA 2022; 7:44556-44572. [PMID: 36530249 PMCID: PMC9753117 DOI: 10.1021/acsomega.2c06199] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/26/2022] [Accepted: 11/14/2022] [Indexed: 06/17/2023]
Abstract
Protein folding can be viewed as the origami engineering of biology resulting from the long process of evolution. Even decades after its recognition, research efforts worldwide focus on demystifying molecular factors that underlie protein structure-function relationships; this is particularly relevant in the era of proteopathic disease. A complex co-occurrence of different physicochemical factors such as temperature, pressure, solvent, cosolvent, macromolecular crowding, confinement, and mutations that represent realistic biological environments are known to modulate the folding process and protein stability in unique ways. In the current review, we have contextually summarized the substantial efforts in unveiling individual effects of these perturbative factors, with major attention toward bottom-up approaches. Moreover, we briefly present some of the biotechnological applications of the insights derived from these studies over various applications including pharmaceuticals, biofuels, cryopreservation, and novel materials. Finally, we conclude by summarizing the challenges in studying the combined effects of multifactorial perturbations in protein folding and refer to complementary advances in experiment and computational techniques that lend insights to the emergent challenges.
Collapse
Affiliation(s)
- Pallab Dutta
- Department
of Biological Sciences, Indian Institute
of Science Education and Research (IISER) Kolkata, Mohanpur741246, India
| | - Priti Roy
- Department
of Biological Sciences, Indian Institute
of Science Education and Research (IISER) Kolkata, Mohanpur741246, India
- Department
of Chemistry, Oklahoma State University, Stillwater, Oklahoma74078, United States
| | - Neelanjana Sengupta
- Department
of Biological Sciences, Indian Institute
of Science Education and Research (IISER) Kolkata, Mohanpur741246, India
| |
Collapse
|
32
|
Lunghi A, Sanvito S. Computational design of magnetic molecules and their environment using quantum chemistry, machine learning and multiscale simulations. Nat Rev Chem 2022; 6:761-781. [PMID: 37118096 DOI: 10.1038/s41570-022-00424-3] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/15/2022] [Indexed: 11/09/2022]
Abstract
Having served as a playground for fundamental studies on the physics of d and f electrons for almost a century, magnetic molecules are now becoming increasingly important for technological applications, such as magnetic resonance, data storage, spintronics and quantum information. All of these applications require the preservation and control of spins in time, an ability hampered by the interaction with the environment, namely with other spins, conduction electrons, molecular vibrations and electromagnetic fields. Thus, the design of a novel magnetic molecule with tailored properties is a formidable task, which does not only concern its electronic structures but also calls for a deep understanding of the interaction among all the degrees of freedom at play. This Review describes how state-of-the-art ab initio computational methods, combined with data-driven approaches to materials modelling, can be integrated into a fully multiscale strategy capable of defining design rules for magnetic molecules.
Collapse
|
33
|
Wang B, Chu W, Wu Y, Casanova D, Saidi WA, Prezhdo OV. Electron-Volt Fluctuation of Defect Levels in Metal Halide Perovskites on a 100 ps Time Scale. J Phys Chem Lett 2022; 13:5946-5952. [PMID: 35732502 DOI: 10.1021/acs.jpclett.2c01452] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Metal halide perovskites (MHPs) have gained considerable attention due to their excellent optoelectronic performance, which is often attributed to unusual defect properties. We demonstrate that midgap defect levels can exhibit very large and slow energy fluctuations associated with anharmonic acoustic motions. Therefore, care should be taken classifying MHP defects as deep or shallow, since shallow defects may become deep and vice versa. As a consequence, charges from deep levels can escape into bands, and light absorption can be extended to longer wavelengths, improving material performance. The phenomenon, demonstrated with iodine vacancy in CH3NH3PbI3 using a machine learning force field, can be expected for a variety of defects and dopants in many MHPs and other soft inorganic semiconductors. Since large-scale anharmonic motions can be precursors to chemical decomposition, a known problem with MHPs, we propose that materials that are stiffer than MHPs but softer than traditional inorganic semiconductors, such as Si and TiO2, may simultaneously exhibit excellent performance and stability.
Collapse
Affiliation(s)
- Bipeng Wang
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Weibin Chu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Yifan Wu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - David Casanova
- Donostia International Physics Center (DIPC), Donostia, 20018 Euskadi, Spain
- Basque Foundation for Science, IKERBASQUE, Bilbao, 48009 Euskadi, Spain
| | - Wissam A Saidi
- Department of Mechanical Engineering & Materials Science, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Oleg V Prezhdo
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
34
|
Abstract
Machine learning (ML) is believed to have enabled a paradigm shift in materials research, and in practice, ML has demonstrated its power in speeding up the cost-efficient discovery of new materials and autonomizing materials laboratories. In this Perspective, current research progress in materials data which are the backbones of ML are reviewed, focusing on high-throughput data generation, standardized data storage, and data representation. More importantly, the challenging issues in materials data that should be overcome to unlock the full potential of ML in materials research and development, including classic 5V (volume, velocity, variety, veracity, and value) issues, 3M (multicomponent, multiscale, and multistage) challenges, co-mining of experimental and computational data, and materials data toward transferable/explainable ML or causal ML, are discussed.
Collapse
Affiliation(s)
- Linggang Zhu
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
- Center for Integrated Computational Materials Engineering, International Research Institute for Multidisciplinary Science, Beihang University, Beijing 100191, China
| | - Jian Zhou
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
- Center for Integrated Computational Materials Engineering, International Research Institute for Multidisciplinary Science, Beihang University, Beijing 100191, China
| | - Zhimei Sun
- School of Materials Science and Engineering, Beihang University, Beijing 100191, China
- Center for Integrated Computational Materials Engineering, International Research Institute for Multidisciplinary Science, Beihang University, Beijing 100191, China
| |
Collapse
|
35
|
Rankine CD, Penfold TJ. Accurate, affordable, and generalizable machine learning simulations of transition metal x-ray absorption spectra using the XANESNET deep neural network. J Chem Phys 2022; 156:164102. [PMID: 35490005 DOI: 10.1063/5.0087255] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The affordable, accurate, and generalizable prediction of spectroscopic observables plays a key role in the analysis of increasingly complex experiments. In this article, we develop and deploy a deep neural network-XANESNET-for predicting the lineshape of first-row transition metal K-edge x-ray absorption near-edge structure (XANES) spectra. XANESNET predicts the spectral intensities using only information about the local coordination geometry of the transition metal complexes encoded in a feature vector of weighted atom-centered symmetry functions. We address in detail the calibration of the feature vector for the particularities of the problem at hand, and we explore the individual feature importance to reveal the physical insight that XANESNET obtains at the Fe K-edge. XANESNET relies on only a few judiciously selected features-radial information on the first and second coordination shells suffices along with angular information sufficient to separate satisfactorily key coordination geometries. The feature importance is found to reflect the XANES spectral window under consideration and is consistent with the expected underlying physics. We subsequently apply XANESNET at nine first-row transition metal (Ti-Zn) K-edges. It can be optimized in as little as a minute, predicts instantaneously, and provides K-edge XANES spectra with an average accuracy of ∼±2%-4% in which the positions of prominent peaks are matched with a >90% hit rate to sub-eV (∼0.8 eV) error.
Collapse
Affiliation(s)
- C D Rankine
- Chemistry-School of Natural and Environmental Sciences, Newcastle University, Newcastle Upon Tyne NE1 7RU, United Kingdom
| | - T J Penfold
- Chemistry-School of Natural and Environmental Sciences, Newcastle University, Newcastle Upon Tyne NE1 7RU, United Kingdom
| |
Collapse
|
36
|
Heiranian M, DuChanois RM, Ritt CL, Violet C, Elimelech M. Molecular Simulations to Elucidate Transport Phenomena in Polymeric Membranes. ENVIRONMENTAL SCIENCE & TECHNOLOGY 2022; 56:3313-3323. [PMID: 35235312 DOI: 10.1021/acs.est.2c00440] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Despite decades of dominance in separation technology, progress in the design and development of high-performance polymer-based membranes has been incremental. Recent advances in materials science and chemical synthesis provide opportunities for molecular-level design of next-generation membrane materials. Such designs necessitate a fundamental understanding of transport and separation mechanisms at the molecular scale. Molecular simulations are important tools that could lead to the development of fundamental structure-property-performance relationships for advancing membrane design. In this Perspective, we assess the application and capability of molecular simulations to understand the mechanisms of ion and water transport across polymeric membranes. Additionally, we discuss the reliability of molecular models in mimicking the structure and chemistry of nanochannels and transport pathways in polymeric membranes. We conclude by providing research directions for resolving key knowledge gaps related to transport phenomena in polymeric membranes and for the construction of structure-property-performance relationships for the design of next-generation membranes.
Collapse
Affiliation(s)
- Mohammad Heiranian
- Department of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520-8286, United States
| | - Ryan M DuChanois
- Department of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520-8286, United States
| | - Cody L Ritt
- Department of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520-8286, United States
| | - Camille Violet
- Department of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520-8286, United States
| | - Menachem Elimelech
- Department of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06520-8286, United States
| |
Collapse
|
37
|
Houston PL, Qu C, Nandi A, Conte R, Yu Q, Bowman JM. Permutationally invariant polynomial regression for energies and gradients, using reverse differentiation, achieves orders of magnitude speed-up with high precision compared to other machine learning methods. J Chem Phys 2022; 156:044120. [DOI: 10.1063/5.0080506] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Affiliation(s)
- Paul L. Houston
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853, USA and Department of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332, USA
| | - Chen Qu
- Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland 20742, USA
| | - Apurba Nandi
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| | - Riccardo Conte
- Dipartimento di Chimica, Università Degli Studi di Milano, via Golgi 19, 20133 Milano, Italy
| | - Qi Yu
- Department of Chemistry, Yale University, New Haven, Connecticut 06511, USA
| | - Joel M. Bowman
- Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA
| |
Collapse
|
38
|
Wang B, Chu W, Prezhdo OV. Interpolating Nonadiabatic Molecular Dynamics Hamiltonian with Inverse Fast Fourier Transform. J Phys Chem Lett 2022; 13:331-338. [PMID: 34978830 DOI: 10.1021/acs.jpclett.1c03884] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Nonadiabatic (NA) molecular dynamics (MD) allows one to investigate far-from-equilibrium processes in nanoscale and molecular materials at the atomistic level and in the time domain, mimicking time-resolved spectroscopic experiments. Ab initio NAMD is limited to about 100 atoms and a few picoseconds, due to computational cost of excitation energies and NA couplings. We develop a straightforward methodology that can extend ab initio quality NAMD to nanoseconds and thousands of atoms. The ab initio NAMD Hamiltonian is sampled and interpolated along a trajectory using a Fourier transform, and then, it is used to perform NAMD with known algorithms. The methodology relies on the classical path approximation, which holds for many materials and processes. To achieve a complete ab initio quality description, the trajectory can be obtained using an ab initio trained machine learning force field. The method is demonstrated with charge carrier trapping and relaxation in hybrid organic-inorganic and all-inorganic metal halide perovskites that exhibit complex dynamics and are actively studied for optoelectronic applications.
Collapse
Affiliation(s)
- Bipeng Wang
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Weibin Chu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Oleg V Prezhdo
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
39
|
Challenges and frontiers of computational modelling of biomolecular recognition. QRB DISCOVERY 2022. [DOI: 10.1017/qrd.2022.11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
Abstract
Biomolecular recognition including binding of small molecules, peptides and proteins to their target receptors plays a key role in cellular function and has been targeted for therapeutic drug design. However, the high flexibility of biomolecules and slow binding and dissociation processes have presented challenges for computational modelling. Here, we review the challenges and computational approaches developed to characterise biomolecular binding, including molecular docking, molecular dynamics simulations (especially enhanced sampling) and machine learning. Further improvements are still needed in order to accurately and efficiently characterise binding structures, mechanisms, thermodynamics and kinetics of biomolecules in the future.
Collapse
|
40
|
Can docking scoring functions guarantee success in virtual screening? VIRTUAL SCREENING AND DRUG DOCKING 2022. [DOI: 10.1016/bs.armc.2022.08.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
41
|
Gerrits N. Accurate Simulations of the Reaction of H 2 on a Curved Pt Crystal through Machine Learning. J Phys Chem Lett 2021; 12:12157-12164. [PMID: 34918518 PMCID: PMC8724818 DOI: 10.1021/acs.jpclett.1c03395] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Accepted: 12/14/2021] [Indexed: 06/14/2023]
Abstract
Theoretical studies on molecule-metal surface reactions have so far been limited to small surface unit cells due to computational costs. Here, for the first time molecular dynamics simulations on very large surface unit cells at the level of density functional theory are performed, allowing a direct comparison to experiments performed on a curved crystal. Specifically, the reaction of D2 on a curved Pt crystal is investigated with a neural network potential (NNP). The developed NNP is also accurate for surface unit cells considerably larger than those that have been included in the training data, allowing dynamical simulations on very large surface unit cells that otherwise would have been intractable. Important and complex aspects of the reaction mechanism are discovered such as diffusion and a shadow effect of the step. Furthermore, conclusions from simulations on smaller surface unit cells cannot always be transfered to larger surface unit cells, limiting the applicability of theoretical studies of smaller surface unit cells to heterogeneous catalysts with small defect densities.
Collapse
Affiliation(s)
- Nick Gerrits
- Leiden
Institute of Chemistry, Leiden University, Gorlaeus Laboratories, P.O. Box 9502, 2300 RA Leiden, The Netherlands
- Research
Group PLASMANT, Department of Chemistry, University of Antwerp, Universiteitsplein 1, BE-2610 Wilrijk, Antwerp, Belgium
| |
Collapse
|
42
|
How WB, Wang B, Chu W, Tkatchenko A, Prezhdo OV. Significance of the Chemical Environment of an Element in Nonadiabatic Molecular Dynamics: Feature Selection and Dimensionality Reduction with Machine Learning. J Phys Chem Lett 2021; 12:12026-12032. [PMID: 34902248 DOI: 10.1021/acs.jpclett.1c03469] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
Using supervised and unsupervised machine learning (ML) on features generated from nonadiabatic (NA) molecular dynamics (MD) trajectories under the classical path approximation, we demonstrate that mutual information with the NA Hamiltonian can be used for feature selection and model simplification. Focusing on CsPbI3, a popular metal halide perovskite, we observe that the chemical environment of a single element is sufficient for predicting the NA Hamiltonian. The conclusion applies even to Cs, although Cs does not contribute to the relevant wave functions. Interatomic distances between Cs and I or Pb and the octahedral tilt angle are the most important features. We reduce a typical 360-parameter ML force-field model to just a 12-parameter NA Hamiltonian model, while maintaining a high NA-MD simulation quality. Because NA-MD is a valuable tool for studying excited state processes, overcoming its high computational cost through simple ML models will streamline NA-MD simulations and expand the ranges of accessible system size and simulation time.
Collapse
Affiliation(s)
- Wei Bin How
- Division of Chemistry and Biological Chemistry, School of Physical and Mathematical Sciences, Nanyang Technological University, 637371 Singapore
| | - Bipeng Wang
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
| | - Weibin Chu
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
| | - Alexandre Tkatchenko
- Department of Physics and Materials Science, University of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Oleg V Prezhdo
- Department of Chemical Engineering, University of Southern California, Los Angeles, California 90089, United States
- Department of Chemistry, University of Southern California, Los Angeles, California 90089, United States
- Department of Physics and Astronomy, University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
43
|
Abstract
Rapid, far-from-equilibrium processes involving excitation of electronic, vibrational, spin, photon, topological, and other degrees of freedom form the basis of modern technologies, including electronics and optoelectronics, solar energy harvesting and conversion to electrical and chemical energy, quantum information processing, spin- and valleytronics, chemical detection, and medical therapies. Such processes are studied experimentally with various time-resolved spectroscopies that allow scientists to track system's evolution on ultrafast time scales and at close to atomistic level of detail. The availability of various forms of lasing has made such measurements easily accessible to many experimental groups worldwide, to study atoms and small molecules, nanoscale and condensed matter systems, proteins, cells, and mesoscopic materials. The experimental work necessitates parallel theoretical efforts needed to interpret the experiments and to provide insights that cannot be gained through measurements due to experimental limitations.Non-adiabatic (NA) molecular dynamics (MD) allows one to study processes at the atomistic level and in the time domain most directly mimicking the time-resolved experiments. Atomistic modeling takes full advantage of chemical intuition and principles that guide design and fabrication of molecules and materials. It provides atomistic origins of quasi-particles, such as holes, excitons, trions, plasmons, phonons, polarons, polaritons, spin-waves, momentum-resolved and topological states, electrically and magnetically polarized structures, and other abstract concepts. An atomistic description enables one to study realistic aspects of materials, which necessarily contain defects, dopants, surfaces, interfaces, passivating ligands, and solvent layers. Often, such realistic features govern material properties and are hard to account for phenomenologically. NA-MD requires few approximations and assumptions. It does not need to assume that atomic motions are harmonic, that electrons are Drude oscillators, that coupling between different degrees of freedom is weak, that dynamics is Markovian or has short memory, or that evolution occurs by exponential kinetics of transitions between few states. The classical or semiclassical treatment of atomic motions constitutes the main approximation of NA-MD and is used because atoms are 3-5 orders of magnitude heavier than electrons. NA-MD is limited by system size, typically hundreds or thousands of atoms, and time scale, picoseconds to nanoseconds. The quality of NA-MD simulations depends on the electronic structure method used to obtain excited state energies and NA couplings.NA-MD has been largely popularized and advanced in the chemistry community that focuses on molecules. Modeling far-from-equilibrium dynamics in nanoscale and condensed matter systems often has to account for other types of physics. At the same time, condensed phase NA-MD allows for approximations that may not work in molecules. Focusing on the recent NA-MD developments aimed at studying excited state processes in nanoscale and condensed phases, this Account considers how the phenomena important on the nanoscale can be incorporated into NA-MD and what approximations can be made to increase its efficiency with complex systems and processes.
Collapse
Affiliation(s)
- Oleg V. Prezhdo
- Departments of Chemistry, Physics and Astronomy, and Chemical Engineering University of Southern California, Los Angeles, California 90089, United States
| |
Collapse
|
44
|
Dong J, Zhang L, Wu B, Ding F, Liu Y. Theoretical Study of Chemical Vapor Deposition Synthesis of Graphene and Beyond: Challenges and Perspectives. J Phys Chem Lett 2021; 12:7942-7963. [PMID: 34387496 DOI: 10.1021/acs.jpclett.1c02316] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Two-dimensional (2D) materials have attracted great attention in recent years because of their unique dimensionality and related properties. Chemical vapor deposition (CVD), a crucial technique for thin-film epitaxial growth, has become the most promising method of synthesizing 2D materials. Different from traditional thin-film growth, where strong chemical bonds are involved in both thin films and substrates, the interaction in 2D materials and substrates involves the van der Waals force and is highly anisotropic, and therefore, traditional thin-film growth theories cannot be applied to 2D material CVD synthesis. During the last 15 years, extensive theoretical studies were devoted to the CVD synthesis of 2D materials. This Perspective attempts to present a theoretical framework for 2D material CVD synthesis as well as the challenges and opportunities in exploring CVD mechanisms. We hope that this Perspective can provide an in-depth understanding of 2D material CVD synthesis and can further stimulate 2D material synthesis.
Collapse
Affiliation(s)
- Jichen Dong
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Organic Solids, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
| | - Leining Zhang
- Centre for Multidimensional Carbon Materials, Institute for Basic Science, Ulsan 44919, South Korea
| | - Bin Wu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Organic Solids, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
| | - Feng Ding
- Centre for Multidimensional Carbon Materials, Institute for Basic Science, Ulsan 44919, South Korea
- School of Materials Science and Engineering, Ulsan National Institute of Science and Technology, Ulsan 44919, South Korea
| | - Yunqi Liu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Organic Solids, Institute of Chemistry, Chinese Academy of Sciences, Beijing 100190, P.R. China
- University of Chinese Academy of Sciences, Beijing 100049, P.R. China
| |
Collapse
|