1
|
Zubieta Rico PF, Pérez-Lemus GR, de Pablo JJ. Efficient sampling of free energy landscapes with functions in Sobolev spaces. J Chem Phys 2025; 162:084109. [PMID: 40008942 DOI: 10.1063/5.0221263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Accepted: 01/30/2025] [Indexed: 02/27/2025] Open
Abstract
Molecular simulations of biological and physical phenomena generally involve sampling complicated, rough energy landscapes characterized by multiple local minima. In this work, we introduce a new family of methods for advanced sampling that draw inspiration from functional representations used in machine learning and approximation theory. As shown here, such representations are particularly well suited for learning free energies using artificial neural networks. As a system evolves through phase space, the proposed methods gradually build a model for the free energy as a function of one or more collective variables, from both the frequency of visits to distinct states and generalized force estimates corresponding to such states. Implementation of the methods is relatively simple and, more importantly, for the representative examples considered in this work, they provide computational efficiency gains of up to several orders of magnitude over other widely used simulation techniques.
Collapse
Affiliation(s)
- Pablo F Zubieta Rico
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
| | - Gustavo R Pérez-Lemus
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
| | - Juan J de Pablo
- Department of Chemical and Biomolecular Engineering, Department of Physics, Department of Computer Science, Tandon School of Engineering, Courant Institute, New York University, New York, New York 10012, USA
- Materials Science Division, Argonne National Laboratory, Argonne, Illinois 60439, USA
| |
Collapse
|
2
|
Xie P, Car R, E W. Ab initio generalized Langevin equation. Proc Natl Acad Sci U S A 2024; 121:e2308668121. [PMID: 38551836 PMCID: PMC10998567 DOI: 10.1073/pnas.2308668121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2023] [Accepted: 02/22/2024] [Indexed: 04/08/2024] Open
Abstract
We introduce a machine learning-based approach called ab initio generalized Langevin equation (AIGLE) to model the dynamics of slow collective variables (CVs) in materials and molecules. In this scheme, the parameters are learned from atomistic simulations based on ab initio quantum mechanical models. Force field, memory kernel, and noise generator are constructed in the context of the Mori-Zwanzig formalism, under the constraint of the fluctuation-dissipation theorem. Combined with deep potential molecular dynamics and electronic density functional theory, this approach opens the way to multiscale modeling in a variety of situations. Here, we demonstrate this capability with a study of two mesoscale processes in crystalline lead titanate, namely the field-driven dynamics of a planar ferroelectric domain wall, and the dynamics of an extensive lattice of coarse-grained electric dipoles. In the first case, AIGLE extends the reach of ab initio simulations to a regime of noise-driven motions not accessible to molecular dynamics. In the second case, AIGLE deals with an extensive set of CVs by adopting a local approximation for the memory kernel and retaining only short-range noise correlations. The scheme is computationally more efficient than molecular dynamics by several orders of magnitude and mimics the microscopic dynamics at low frequencies where it reproduces accurately the dominant far-infrared absorption frequency.
Collapse
Affiliation(s)
- Pinchen Xie
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
| | - Roberto Car
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ08544
- Department of Chemistry and Princeton Materials Institute, Princeton University, Princeton, NJ08544
- Department of Physics, Princeton University, Princeton, NJ08544
| | - Weinan E
- AI for Science Institute, Beijing100080, China
- Center for Machine Learning Research and School of Mathematical Sciences, Peking University, Beijing100084, China
| |
Collapse
|
3
|
Giese TJ, Ekesan Ş, McCarthy E, Tao Y, York DM. Surface-Accelerated String Method for Locating Minimum Free Energy Paths. J Chem Theory Comput 2024; 20:2058-2073. [PMID: 38367218 PMCID: PMC11059188 DOI: 10.1021/acs.jctc.3c01401] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/19/2024]
Abstract
We present a surface-accelerated string method (SASM) to efficiently optimize low-dimensional reaction pathways from the sampling performed with expensive quantum mechanical/molecular mechanical (QM/MM) Hamiltonians. The SASM accelerates the convergence of the path using the aggregate sampling obtained from the current and previous string iterations, whereas approaches like the string method in collective variables (SMCV) or the modified string method in collective variables (MSMCV) update the path only from the sampling obtained from the current iteration. Furthermore, the SASM decouples the number of images used to perform sampling from the number of synthetic images used to represent the path. The path is optimized on the current best estimate of the free energy surface obtained from all available sampling, and the proposed set of new simulations is not restricted to being located along the optimized path. Instead, the umbrella potential placement is chosen to extend the range of the free energy surface and improve the quality of the free energy estimates near the path. In this manner, the SASM is shown to improve the exploration for a minimum free energy pathway in regions where the free energy surface is relatively flat. Furthermore, it improves the quality of the free energy profile when the string is discretized with too few images. We compare the SASM, SMCV, and MSMCV using 3 QM/MM applications: a ribozyme methyltransferase reaction using 2 reaction coordinates, the 2'-O-transphosphorylation reaction of Hammerhead ribozyme using 3 reaction coordinates, and a tautomeric reaction in B-DNA using 5 reaction coordinates. We show that SASM converges the paths using roughly 3 times less sampling than the SMCV and MSMCV methods. All three algorithms have been implemented in the FE-ToolKit package made freely available.
Collapse
Affiliation(s)
- Timothy J. Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Erika McCarthy
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| | - Darrin M. York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ 08854, USA
| |
Collapse
|
4
|
Wang Z, Gong Y, Evans ML, Yan Y, Wang S, Miao N, Zheng R, Rignanese GM, Wang J. Machine Learning-Accelerated Discovery of A2BC2 Ternary Electrides with Diverse Anionic Electron Densities. J Am Chem Soc 2023; 145:26412-26424. [PMID: 37988742 DOI: 10.1021/jacs.3c10538] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2023]
Abstract
This study combines machine learning (ML) and high-throughput calculations to uncover new ternary electrides in the A2BC2 family of compounds with the P4/mbm space group. Starting from a library of 214 known A2BC2 phases, density functional theory calculations were used to compute the maximum value of the electron localization function, indicating that 42 are potential electrides. A model was then trained on this data set and used to predict the electride behavior of 14,437 hypothetical compounds generated by structural prototyping. Then, the stability and electride features of the 1254 electride candidates predicted by the model were carefully checked by high-throughput calculations. Through this tiered approach, 41 stable and 104 metastable new A2BC2 electrides were predicted. Interestingly, all three kinds of electrides, i.e., electron-deficient, electron-neutral, and electron-rich electrides, are present in the set of predicted compounds. Three of the most promising new electrides (two electron-rich, Nd2ScSi2 and La2YbGe2, and one electron-deficient Y2LiSi2) were then successfully synthesized and characterized experimentally. Furthermore, the synthesized electrides were found to exhibit high catalytic activities for NH3 synthesis under mild conditions when Ru-loaded. The electron-deficient Y2LiSi2, in particular, was seen to exhibit a good balance of catalytic activity and chemical stability, suggesting its future application in catalysis.
Collapse
Affiliation(s)
- Zhiqi Wang
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Yutong Gong
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Matthew L Evans
- IMCN-MODL, Université Catholique de Louvain, Chemin des Étoiles, 8, Louvain-la-Neuve B-1348, Belgium
| | - Yujing Yan
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Shiyao Wang
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Nanxi Miao
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Ruiheng Zheng
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| | - Gian-Marco Rignanese
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
- IMCN-MODL, Université Catholique de Louvain, Chemin des Étoiles, 8, Louvain-la-Neuve B-1348, Belgium
| | - Junjie Wang
- State Key Laboratory of Solidification Processing, School of Materials Science and Engineering, Northwestern Polytechnical University, Xi'an, Shaanxi 710072, People's Republic of China
| |
Collapse
|
5
|
Bajpai S, Petkov BK, Tong M, Abreu CRA, Nair NN, Tuckerman ME. An interoperable implementation of collective-variable based enhanced sampling methods in extended phase space within the OpenMM package. J Comput Chem 2023; 44:2166-2183. [PMID: 37464902 DOI: 10.1002/jcc.27182] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2023] [Revised: 05/30/2023] [Accepted: 06/06/2023] [Indexed: 07/20/2023]
Abstract
Collective variable (CV)-based enhanced sampling techniques are widely used today for accelerating barrier-crossing events in molecular simulations. A class of these methods, which includes temperature accelerated molecular dynamics (TAMD)/driven-adiabatic free energy dynamics (d-AFED), unified free energy dynamics (UFED), and temperature accelerated sliced sampling (TASS), uses an extended variable formalism to achieve quick exploration of conformational space. These techniques are powerful, as they enhance the sampling of a large number of CVs simultaneously compared to other techniques. Extended variables are kept at a much higher temperature than the physical temperature by ensuring adiabatic separation between the extended and physical subsystems and employing rigorous thermostatting. In this work, we present a computational platform to perform extended phase space enhanced sampling simulations using the open-source molecular dynamics engine OpenMM. The implementation allows users to have interoperability of sampling techniques, as well as employ state-of-the-art thermostats and multiple time-stepping. This work also presents protocols for determining the critical parameters and procedures for reconstructing high-dimensional free energy surfaces. As a demonstration, we present simulation results on the high dimensional conformational landscapes of the alanine tripeptide in vacuo, tetra-N-methylglycine (tetra-sarcosine) peptoid in implicit solvent, and the Trp-cage mini protein in explicit water.
Collapse
Affiliation(s)
- Shitanshu Bajpai
- Department of Chemistry, Indian Institute of Technology Kanpur (IITK), Kanpur, India
| | - Brian K Petkov
- Department of Chemistry, New York University (NYU), New York, New York, USA
| | - Muchen Tong
- Department of Chemistry, New York University (NYU), New York, New York, USA
| | - Charlles R A Abreu
- Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, Rio de Janeiro, Brazil
| | - Nisanth N Nair
- Department of Chemistry, Indian Institute of Technology Kanpur (IITK), Kanpur, India
| | - Mark E Tuckerman
- Department of Chemistry, New York University (NYU), New York, New York, USA
- Courant Institute of Mathematical Sciences, New York University (NYU), New York, New York, USA
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, China
- Simons Center for Computational Physical Chemistry, New York University, New York, New York, USA
| |
Collapse
|
6
|
Naleem N, Abreu CRA, Warmuz K, Tong M, Kirmizialtin S, Tuckerman ME. An exploration of machine learning models for the determination of reaction coordinates associated with conformational transitions. J Chem Phys 2023; 159:034102. [PMID: 37458344 DOI: 10.1063/5.0147597] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2023] [Accepted: 06/23/2023] [Indexed: 07/20/2023] Open
Abstract
Determining collective variables (CVs) for conformational transitions is crucial to understanding their dynamics and targeting them in enhanced sampling simulations. Often, CVs are proposed based on intuition or prior knowledge of a system. However, the problem of systematically determining a proper reaction coordinate (RC) for a specific process in terms of a set of putative CVs can be achieved using committor analysis (CA). Identifying essential degrees of freedom that govern such transitions using CA remains elusive because of the high dimensionality of the conformational space. Various schemes exist to leverage the power of machine learning (ML) to extract an RC from CA. Here, we extend these studies and compare the ability of 17 different ML schemes to identify accurate RCs associated with conformational transitions. We tested these methods on an alanine dipeptide in vacuum and on a sarcosine dipeptoid in an implicit solvent. Our comparison revealed that the light gradient boosting machine method outperforms other methods. In order to extract key features from the models, we employed Shapley Additive exPlanations analysis and compared its interpretation with the "feature importance" approach. For the alanine dipeptide, our methodology identifies ϕ and θ dihedrals as essential degrees of freedom in the C7ax to C7eq transition. For the sarcosine dipeptoid system, the dihedrals ψ and ω are the most important for the cisαD to transαD transition. We further argue that analysis of the full dynamical pathway, and not just endpoint states, is essential for identifying key degrees of freedom governing transitions.
Collapse
Affiliation(s)
- Nawavi Naleem
- Chemistry Program, Science Division, New York University, Abu Dhabi, UAE
| | - Charlles R A Abreu
- Chemical Engineering Department, Escola de Química, Universidade Federal do Rio de Janeiro, 21941-909 Rio de Janeiro, RJ, Brazil
| | - Krzysztof Warmuz
- Computer Science Program, Science Division, New York University, Abu Dhabi, UAE
| | - Muchen Tong
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
| | - Serdal Kirmizialtin
- Chemistry Program, Science Division, New York University, Abu Dhabi, UAE
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
- Center for Smart Engineering Materials, New York University, Abu Dhabi, UAE
| | - Mark E Tuckerman
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Rd. North, Shanghai 200062, China
- Simons Center for Computational Physical Chemistry at New York University, New York, New York 10003, USA
| |
Collapse
|
7
|
Alberini G, Alexis Paz S, Corradi B, Abrams CF, Benfenati F, Maragliano L. Molecular Dynamics Simulations of Ion Permeation in Human Voltage-Gated Sodium Channels. J Chem Theory Comput 2023; 19:2953-2972. [PMID: 37116214 DOI: 10.1021/acs.jctc.2c00990] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/30/2023]
Abstract
The recent determination of cryo-EM structures of voltage-gated sodium (Nav) channels has revealed many details of these proteins. However, knowledge of ionic permeation through the Nav pore remains limited. In this work, we performed atomistic molecular dynamics (MD) simulations to study the structural features of various neuronal Nav channels based on homology modeling of the cryo-EM structure of the human Nav1.4 channel and, in addition, on the recently resolved configuration for Nav1.2. In particular, single Na+ permeation events during standard MD runs suggest that the ion resides in the inner part of the Nav selectivity filter (SF). On-the-fly free energy parametrization (OTFP) temperature-accelerated molecular dynamics (TAMD) was also used to calculate two-dimensional free energy surfaces (FESs) related to single/double Na+ translocation through the SF of the homology-based Nav1.2 model and the cryo-EM Nav1.2 structure, with different realizations of the DEKA filter domain. These additional simulations revealed distinct mechanisms for single and double Na+ permeation through the wild-type SF, which has a charged lysine in the DEKA ring. Moreover, the configurations of the ions in the SF corresponding to the metastable states of the FESs are specific for each SF motif. Overall, the description of these mechanisms gives us new insights into ion conduction in human Nav cryo-EM-based and cryo-EM configurations that could advance understanding of these systems and how they differ from potassium and bacterial Nav channels.
Collapse
Affiliation(s)
- Giulio Alberini
- Center for Synaptic Neuroscience and Technology (NSYN@UniGe), Istituto Italiano di Tecnologia, Largo Rosanna Benzi 10, 16132 Genova, Italy
- IRCCS Ospedale Policlinico San Martino, Largo Rosanna Benzi 10, 16132 Genova, Italy
| | - Sergio Alexis Paz
- Departamento de Química Teórica y Computacional, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, X5000HUA Córdoba, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Instituto de Fisicoquímica de Córdoba (INFIQC), X5000HUA Córdoba, Argentina
| | - Beatrice Corradi
- Center for Synaptic Neuroscience and Technology (NSYN@UniGe), Istituto Italiano di Tecnologia, Largo Rosanna Benzi 10, 16132 Genova, Italy
- Department of Experimental Medicine, Università degli Studi di Genova, Viale Benedetto XV 3, 16132 Genova, Italy
| | - Cameron F Abrams
- Department of Chemical and Biological Engineering, Drexel University, Philadelphia, Pennsylvania 19104, United States
| | - Fabio Benfenati
- Center for Synaptic Neuroscience and Technology (NSYN@UniGe), Istituto Italiano di Tecnologia, Largo Rosanna Benzi 10, 16132 Genova, Italy
- IRCCS Ospedale Policlinico San Martino, Largo Rosanna Benzi 10, 16132 Genova, Italy
| | - Luca Maragliano
- Center for Synaptic Neuroscience and Technology (NSYN@UniGe), Istituto Italiano di Tecnologia, Largo Rosanna Benzi 10, 16132 Genova, Italy
- Department of Life and Environmental Sciences, Polytechnic University of Marche, Via Brecce Bianche, 60131 Ancona, Italy
| |
Collapse
|
8
|
Ge P, Zhang L, Lei H. Machine learning assisted coarse-grained molecular dynamics modeling of meso-scale interfacial fluids. J Chem Phys 2023; 158:064104. [PMID: 36792498 DOI: 10.1063/5.0131567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023] Open
Abstract
A hallmark of meso-scale interfacial fluids is the multi-faceted, scale-dependent interfacial energy, which often manifests different characteristics across the molecular and continuum scale. The multi-scale nature imposes a challenge to construct reliable coarse-grained (CG) models, where the CG potential function needs to faithfully encode the many-body interactions arising from the unresolved atomistic interactions and account for the heterogeneous density distributions across the interface. We construct the CG models of both single- and two-component polymeric fluid systems based on the recently developed deep coarse-grained potential [Zhang et al., J. Chem. Phys. 149, 034101 (2018)] scheme, where each polymer molecule is modeled as a CG particle. By only using the training samples of the instantaneous force under the thermal equilibrium state, the constructed CG models can accurately reproduce both the probability density function of the void formation in bulk and the spectrum of the capillary wave across the fluid interface. More importantly, the CG models accurately predict the volume-to-area scaling transition for the apolar solvation energy, illustrating the effectiveness to probe the meso-scale collective behaviors encoded with molecular-level fidelity.
Collapse
Affiliation(s)
- Pei Ge
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| | | | - Huan Lei
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, USA
| |
Collapse
|
9
|
Kawada R, Endo K, Yuhara D, Yasuoka K. MD-GAN with multi-particle input: the machine learning of long-time molecular behavior from short-time MD data. SOFT MATTER 2022; 18:8446-8455. [PMID: 36314893 DOI: 10.1039/d2sm00852a] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/16/2023]
Abstract
Molecular dynamics simulation is a method of investigating the behavior of molecules, which is useful for analyzing a variety of structural and dynamic properties and mechanisms of phenomena. However, the huge computational cost of large-scale and long-time simulations is an enduring problem that must be addressed. MD-GAN is a machine learning-based method that can evolve part of the system at any time step, accelerating the generation of molecular dynamics data [Endo et al., Proceedings of the AAAI Conference on Artificial Intelligence, 2018, 32]. For the accurate prediction of MD-GAN, sufficient information on the dynamics of a part of the system should be included with the training data. Therefore, the selection of the part of the system is important for efficient learning. In a previous study, only one particle (or vector) of each molecule was extracted as part of the system. The effectiveness of adding information from other particles to the learning process is investigated in this study. When the dynamics of three particles of each molecule were used in the polyethylene experiment, the diffusion was successfully predicted using the training data with a time length of approximately 40%, compared to the single-particle input. Surprisingly, the unobserved transition of diffusion in the training data was also predicted using this method. The reduced cost for the generation of training MD data achieved in this study is useful for accelerating MD-GAN.
Collapse
Affiliation(s)
- Ryo Kawada
- Department of Mechanical Engineering, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 223-8522, Japan.
| | - Katsuhiro Endo
- Department of Mechanical Engineering, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 223-8522, Japan.
| | - Daisuke Yuhara
- Materials Design Laboratory, Science & Innovation Center, R&D Transformation Div., Mitsubishi Chemical Holdings Group, 1000 Kamoshida-cho, Aoba-ku, Yokohama, Kanagawa, 227-8502, Japan
| | - Kenji Yasuoka
- Department of Mechanical Engineering, Keio University, 3-14-1 Hiyoshi, Kohoku-ku, Yokohama, Kanagawa, 223-8522, Japan.
| |
Collapse
|
10
|
Jin J, Pak AJ, Durumeric AEP, Loose TD, Voth GA. Bottom-up Coarse-Graining: Principles and Perspectives. J Chem Theory Comput 2022; 18:5759-5791. [PMID: 36070494 PMCID: PMC9558379 DOI: 10.1021/acs.jctc.2c00643] [Citation(s) in RCA: 111] [Impact Index Per Article: 37.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Indexed: 01/14/2023]
Abstract
Large-scale computational molecular models provide scientists a means to investigate the effect of microscopic details on emergent mesoscopic behavior. Elucidating the relationship between variations on the molecular scale and macroscopic observable properties facilitates an understanding of the molecular interactions driving the properties of real world materials and complex systems (e.g., those found in biology, chemistry, and materials science). As a result, discovering an explicit, systematic connection between microscopic nature and emergent mesoscopic behavior is a fundamental goal for this type of investigation. The molecular forces critical to driving the behavior of complex heterogeneous systems are often unclear. More problematically, simulations of representative model systems are often prohibitively expensive from both spatial and temporal perspectives, impeding straightforward investigations over possible hypotheses characterizing molecular behavior. While the reduction in resolution of a study, such as moving from an atomistic simulation to that of the resolution of large coarse-grained (CG) groups of atoms, can partially ameliorate the cost of individual simulations, the relationship between the proposed microscopic details and this intermediate resolution is nontrivial and presents new obstacles to study. Small portions of these complex systems can be realistically simulated. Alone, these smaller simulations likely do not provide insight into collectively emergent behavior. However, by proposing that the driving forces in both smaller and larger systems (containing many related copies of the smaller system) have an explicit connection, systematic bottom-up CG techniques can be used to transfer CG hypotheses discovered using a smaller scale system to a larger system of primary interest. The proposed connection between different CG systems is prescribed by (i) the CG representation (mapping) and (ii) the functional form and parameters used to represent the CG energetics, which approximate potentials of mean force (PMFs). As a result, the design of CG methods that facilitate a variety of physically relevant representations, approximations, and force fields is critical to moving the frontier of systematic CG forward. Crucially, the proposed connection between the system used for parametrization and the system of interest is orthogonal to the optimization used to approximate the potential of mean force present in all systematic CG methods. The empirical efficacy of machine learning techniques on a variety of tasks provides strong motivation to consider these approaches for approximating the PMF and analyzing these approximations.
Collapse
Affiliation(s)
- Jaehyeok Jin
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Alexander J. Pak
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Aleksander E. P. Durumeric
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Timothy D. Loose
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| | - Gregory A. Voth
- Department of Chemistry,
Chicago Center for Theoretical Chemistry, Institute for Biophysical
Dynamics, and James Franck Institute, The
University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
11
|
Ghamari D, Hauke P, Covino R, Faccioli P. Sampling rare conformational transitions with a quantum computer. Sci Rep 2022; 12:16336. [PMID: 36175529 PMCID: PMC9522734 DOI: 10.1038/s41598-022-20032-x] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2022] [Accepted: 09/07/2022] [Indexed: 11/09/2022] Open
Abstract
Structural rearrangements play a central role in the organization and function of complex biomolecular systems. In principle, Molecular Dynamics (MD) simulations enable us to investigate these thermally activated processes with an atomic level of resolution. In practice, an exponentially large fraction of computational resources must be invested to simulate thermal fluctuations in metastable states. Path sampling methods focus the computational power on sampling the rare transitions between states. One of their outstanding limitations is to efficiently generate paths that visit significantly different regions of the conformational space. To overcome this issue, we introduce a new algorithm for MD simulations that integrates machine learning and quantum computing. First, using functional integral methods, we derive a rigorous low-resolution spatially coarse-grained representation of the system's dynamics, based on a small set of molecular configurations explored with machine learning. Then, we use a quantum annealer to sample the transition paths of this low-resolution theory. We provide a proof-of-concept application by simulating a benchmark conformational transition with all-atom resolution on the D-Wave quantum computer. By exploiting the unique features of quantum annealing, we generate uncorrelated trajectories at every iteration, thus addressing one of the challenges of path sampling. Once larger quantum machines will be available, the interplay between quantum and classical resources may emerge as a new paradigm of high-performance scientific computing. In this work, we provide a platform to implement this integrated scheme in the field of molecular simulations.
Collapse
Affiliation(s)
- Danial Ghamari
- Department of Physics, University of Trento, Via Sommarive 14, Trento, 38123, Italy.,INFN-TIFPA, Via Sommarive 14, Trento, 38123, Italy
| | - Philipp Hauke
- INO-CNR BEC Center & Department of Physics, University of Trento, Via Sommarive 14, Trento, 38123, Italy
| | - Roberto Covino
- Frankfurt Institute for Advanced Studies, Ruth-Moufang-Straße 1, Frankfurt am Main, 60438, Germany.
| | - Pietro Faccioli
- Department of Physics, University of Trento, Via Sommarive 14, Trento, 38123, Italy. .,INFN-TIFPA, Via Sommarive 14, Trento, 38123, Italy.
| |
Collapse
|
12
|
Sharma A, Dey P. Novel insights into the structural changes induced by disease-associated mutations in TDP-43: a computational approach. J Biomol Struct Dyn 2022:1-11. [PMID: 35751132 DOI: 10.1080/07391102.2022.2092551] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/17/2022]
Abstract
Over the last two decades, the pathogenic aggregation of TAR DNA-binding protein 43 (TDP-43) is found to be strongly associated with several fatal neurodegenerative diseases such as amyotrophic lateral sclerosis (ALS) and frontotemporal lobar degeneration (FTD), etc. While the mutations and truncation in TDP-43 protein have been suggested to be responsible for TDP-43 pathogenesis by accelerating the aggregation process, the effects of these mutations on the bio-mechanism of pathological TDP-43 protein remained poorly understood. Investigating this at the molecular level, we formulized an integrated workflow of molecular dynamic simulation and machine learning models (MD-ML). By performing an extensive structural analysis of three disease-related mutations (i.e., I168A, D169G, and I168A-D169G) in the conserved RNA recognition motifs (RRM1) of TDP-43, we observed that the I168A-D169G double mutant delineates the highest packing of the protein inner core as compared to the other mutations, which may indicate more stability and higher chances of pathogenesis. Moreover, through our MD-ML workflow, we identified the biological descriptors of TDP-43 which includes the interacting residue pairs and individual protein residues that influence the stability of the protein and could be experimentally evaluated to develop potential therapeutic strategies.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Abhibhav Sharma
- School of Computer and System Sciences, Jawaharlal Nehru University, New Delhi, India
| | - Pinki Dey
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, Australia
| |
Collapse
|
13
|
Gupta A, Verma S, Javed R, Sudhakar S, Srivastava S, Nair NN. Exploration of high dimensional free energy landscapes by a combination of temperature-accelerated sliced sampling and parallel biasing. J Comput Chem 2022; 43:1186-1200. [PMID: 35510789 DOI: 10.1002/jcc.26882] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2021] [Revised: 03/27/2022] [Accepted: 04/11/2022] [Indexed: 12/22/2022]
Abstract
Temperature-accelerated sliced sampling (TASS) is an enhanced sampling method for achieving accelerated and controlled exploration of high-dimensional free energy landscapes in molecular dynamics simulations. With the aid of umbrella bias potentials, the TASS method realizes a controlled exploration and divide-and-conquer strategy for computing high-dimensional free energy surfaces. In TASS, diffusion of the system in the collective variable (CV) space is enhanced with the help of metadynamics bias and elevated-temperature of the auxiliary degrees of freedom (DOF) that are coupled to the CVs. Usually, a low-dimensional metadynamics bias is applied in TASS. In order to further improve the performance of TASS, we propose here to use a highdimensional metadynamics bias, in the same form as in a parallel bias metadynamics scheme. Here, a modified reweighting scheme, in combination with artificial neural network is used for computing unbiased probability distribution of CVs and projections of high-dimensional free energy surfaces. We first validate the accuracy and efficiency of our method in computing the four-dimensional free energy landscape for alanine tripeptide in vacuo. Subsequently, we employ the approach to calculate the eight-dimensional free energy landscape of alanine pentapeptide in vacuo. Finally, the method is applied to a more realistic problem wherein we compute the broad four-dimensional free energy surface corresponding to the deacylation of a drug molecule which is covalently complexed with a β-lactamase enzyme. We demonstrate that using parallel bias in TASS improves the efficiency of exploration of high-dimensional free energy landscapes.
Collapse
Affiliation(s)
- Abhinav Gupta
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| | - Shivani Verma
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| | - Ramsha Javed
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| | - Suraj Sudhakar
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| | - Saurabh Srivastava
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India.,Department of Chemistry, Manipal University Jaipur, Jaipur, Rajasthan, India
| | - Nisanth N Nair
- Department of Chemistry, Indian Institute of Technology Kanpur, Kanpur, India
| |
Collapse
|
14
|
Latham AP, Zhang B. Unifying coarse-grained force fields for folded and disordered proteins. Curr Opin Struct Biol 2022; 72:63-70. [PMID: 34536913 PMCID: PMC9057422 DOI: 10.1016/j.sbi.2021.08.006] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2021] [Revised: 08/08/2021] [Accepted: 08/17/2021] [Indexed: 12/22/2022]
Abstract
Liquid-liquid phase separation drives the formation of biological condensates that play essential roles in transcriptional regulation and signal sensing. Computational modeling could provide high-resolution structural characterizations of these condensates and help uncover physicochemical interactions that dictate their stability. However, many protein molecules involved in phase separation often contain multiple ordered domains connected with flexible, structureless linkers. Simulating such proteins necessitates force fields with consistent accuracy for both folded and disordered proteins. We provide a critical review of existing coarse-grained force fields for disordered proteins and highlight the challenges in their application to folded proteins. After discussing existing algorithms for force field parameterization, we propose an optimization strategy that should lead to computer models with improved transferability across protein types.
Collapse
Affiliation(s)
- Andrew P Latham
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
15
|
Machine-Learned Free Energy Surfaces for Capillary Condensation and Evaporation in Mesopores. ENTROPY 2022; 24:e24010097. [PMID: 35052123 PMCID: PMC8774451 DOI: 10.3390/e24010097] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 12/29/2021] [Accepted: 01/05/2022] [Indexed: 12/04/2022]
Abstract
Using molecular simulations, we study the processes of capillary condensation and capillary evaporation in model mesopores. To determine the phase transition pathway, as well as the corresponding free energy profile, we carry out enhanced sampling molecular simulations using entropy as a reaction coordinate to map the onset of order during the condensation process and of disorder during the evaporation process. The structural analysis shows the role played by intermediate states, characterized by the onset of capillary liquid bridges and bubbles. We also analyze the dependence of the free energy barrier on the pore width. Furthermore, we propose a method to build a machine learning model for the prediction of the free energy surfaces underlying capillary phase transition processes in mesopores.
Collapse
|
16
|
Tuckerman ME. The curse of dimensionality loses its power. NATURE COMPUTATIONAL SCIENCE 2022; 2:6-7. [PMID: 38177704 DOI: 10.1038/s43588-021-00182-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/06/2024]
Affiliation(s)
- Mark E Tuckerman
- Department of Chemistry, New York University (NYU), New York City, NY, USA.
- Courant Institute of Mathematical Sciences, New York University (NYU), New York City, NY, USA.
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai, China.
| |
Collapse
|
17
|
Wang D, Wang Y, Chang J, Zhang L, Wang H, E W. Efficient sampling of high-dimensional free energy landscapes using adaptive reinforced dynamics. NATURE COMPUTATIONAL SCIENCE 2022; 2:20-29. [PMID: 38177702 DOI: 10.1038/s43588-021-00173-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2021] [Accepted: 11/15/2021] [Indexed: 01/06/2024]
Abstract
Enhanced sampling methods such as metadynamics and umbrella sampling have become essential tools for exploring the configuration space of molecules and materials. At the same time, they have long faced a number of issues such as the inefficiency when dealing with a large number of collective variables (CVs) or systems with high free energy barriers. Here we show that, with clustering and adaptive tuning techniques, the reinforced dynamics (RiD) scheme can be used to efficiently explore the configuration space and free energy landscapes with a large number of CVs or systems with high free energy barriers. We illustrate this by studying various representative and challenging examples. First we demonstrate the efficiency of adaptive RiD compared with other methods and construct the nine-dimensional (9D) free energy landscape of a peptoid trimer, which has energy barriers of more than 8 kcal mol-1. We then study the folding of the protein chignolin using 18 CVs. In this case, both the folding and unfolding rates are observed to be 4.30 μs-1. Finally, we propose a protein structure refinement protocol based on RiD. This protocol allows us to efficiently employ more than 100 CVs for exploring the landscape of protein structures and it gives rise to an overall improvement of 14.6 units over the initial global distance test-high accuracy (GDT-HA) score.
Collapse
Affiliation(s)
- Dongdong Wang
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
- DP Technology, Beijing, People's Republic of China
| | - Yanze Wang
- DP Technology, Beijing, People's Republic of China
- College of Chemistry and Molecular Engineering, Peking University, Beijing, People's Republic of China
| | - Junhan Chang
- DP Technology, Beijing, People's Republic of China
- College of Chemistry and Molecular Engineering, Peking University, Beijing, People's Republic of China
| | - Linfeng Zhang
- Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA.
- DP Technology, Beijing, People's Republic of China.
| | - Han Wang
- Laboratory of Computational Physics, Institute of Applied Physics and Computational Mathematics, Beijing, People's Republic of China.
| | - Weinan E
- School of Mathematical Sciences, Peking University, Beijing, People's Republic of China
- Department of Mathematics and Program in Applied and Computational Mathematics, Princeton University, Princeton, NJ, USA
- Beijing Institute of Big Data Research, Beijing, People's Republic of China
| |
Collapse
|
18
|
Guardiani C, Cecconi F, Chiodo L, Cottone G, Malgaretti P, Maragliano L, Barabash ML, Camisasca G, Ceccarelli M, Corry B, Roth R, Giacomello A, Roux B. Computational methods and theory for ion channel research. ADVANCES IN PHYSICS: X 2022; 7:2080587. [PMID: 35874965 PMCID: PMC9302924 DOI: 10.1080/23746149.2022.2080587] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 05/15/2022] [Indexed: 06/15/2023] Open
Abstract
Ion channels are fundamental biological devices that act as gates in order to ensure selective ion transport across cellular membranes; their operation constitutes the molecular mechanism through which basic biological functions, such as nerve signal transmission and muscle contraction, are carried out. Here, we review recent results in the field of computational research on ion channels, covering theoretical advances, state-of-the-art simulation approaches, and frontline modeling techniques. We also report on few selected applications of continuum and atomistic methods to characterize the mechanisms of permeation, selectivity, and gating in biological and model channels.
Collapse
Affiliation(s)
- C. Guardiani
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
| | - F. Cecconi
- CNR - Istituto dei Sistemi Complessi, Rome, Italy and Istituto Nazionale di Fisica Nucleare, INFN, Roma1 section. 00185, Roma, Italy
| | - L. Chiodo
- Department of Engineering, Campus Bio-Medico University, Rome, Italy
| | - G. Cottone
- Department of Physics and Chemistry-Emilio Segrè, University of Palermo, Palermo, Italy
| | - P. Malgaretti
- Helmholtz Institute Erlangen-Nürnberg for Renewable Energy (IEK-11), Forschungszentrum Jülich, Erlangen, Germany
| | - L. Maragliano
- Department of Life and Environmental Sciences, Polytechnic University of Marche, Ancona, Italy, and Center for Synaptic Neuroscience and Technology, Istituto Italiano di Tecnologia, Genova, Italy
| | - M. L. Barabash
- Department of Materials Science and Nanoengineering, Rice University, Houston, TX 77005, USA
| | - G. Camisasca
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
- Dipartimento di Fisica, Università Roma Tre, Rome, Italy
| | - M. Ceccarelli
- Department of Physics and CNR-IOM, University of Cagliari, Monserrato 09042-IT, Italy
| | - B. Corry
- Research School of Biology, The Australian National University, Canberra, ACT 2600, Australia
| | - R. Roth
- Institut Für Theoretische Physik, Eberhard Karls Universität Tübingen, Tübingen, Germany
| | - A. Giacomello
- Dipartimento di Ingegneria Meccanica e Aerospaziale, Sapienza Università di Roma, Rome, Italy
| | - B. Roux
- Department of Biochemistry & Molecular Biology, University of Chicago, Chicago IL, USA
| |
Collapse
|
19
|
Bal KM. Reweighted Jarzynski Sampling: Acceleration of Rare Events and Free Energy Calculation with a Bias Potential Learned from Nonequilibrium Work. J Chem Theory Comput 2021; 17:6766-6774. [PMID: 34714088 DOI: 10.1021/acs.jctc.1c00574] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Abstract
We introduce a simple enhanced sampling approach for the calculation of free energy differences and barriers along a one-dimensional reaction coordinate. First, a small number of short nonequilibrium simulations are carried out along the reaction coordinate, and the Jarzynski equality is used to learn an approximate free energy surface from the nonequilibrium work distribution. This free energy estimate is represented in a compact form as an artificial neural network and used as an external bias potential to accelerate rare events in a subsequent molecular dynamics simulation. The final free energy estimate is then obtained by reweighting the equilibrium probability distribution of the reaction coordinate sampled under the influence of the external bias. We apply our reweighted Jarzynski sampling recipe to four processes of varying scales and complexities─spanning chemical reaction in the gas phase, pair association in solution, and droplet nucleation in supersaturated vapor. In all cases, we find reweighted Jarzynski sampling to be a very efficient strategy, resulting in rapid convergence of the free energy to high precision.
Collapse
Affiliation(s)
- Kristof M Bal
- Department of Chemistry and NANOlab Center of Excellence, University of Antwerp, Universiteitsplein 1, 2610 Antwerp, Belgium
| |
Collapse
|
20
|
Secor M, Soudackov AV, Hammes-Schiffer S. Artificial Neural Networks as Propagators in Quantum Dynamics. J Phys Chem Lett 2021; 12:10654-10662. [PMID: 34704767 DOI: 10.1021/acs.jpclett.1c03117] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The utilization of artificial neural networks (ANNs) provides strategies for accelerating molecular simulations. Herein, ANNs are implemented as propagators of the time-dependent Schrödinger equation to simulate the quantum dynamics of systems with time-dependent potentials. These ANN propagators are trained to map nonstationary wavepackets from a given time to a future time within the discrete variable representation. Each propagator is trained for a specified time step, and iterative application of the propagator enables the propagation of wavepackets over long time scales. Such ANN propagators are developed and applied to one- and two-dimensional proton transfer systems, which exhibit nuclear quantum effects such as hydrogen tunneling. These ANN propagators are trained for either a specific time-independent potential or general potentials that can be time-dependent. Hierarchical, multiple time step algorithms enable parallelization, and the extension to higher dimensions is straightforward. This strategy is applicable to quantum dynamical simulations of diverse chemical and biological processes.
Collapse
Affiliation(s)
- Maxim Secor
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| | - Alexander V Soudackov
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| | - Sharon Hammes-Schiffer
- Department of Chemistry, Yale University, 225 Prospect Street, New Haven, Connecticut 06520, United States
| |
Collapse
|
21
|
Chen M. Collective variable-based enhanced sampling and machine learning. THE EUROPEAN PHYSICAL JOURNAL. B 2021; 94:211. [PMID: 34697536 PMCID: PMC8527828 DOI: 10.1140/epjb/s10051-021-00220-w] [Citation(s) in RCA: 33] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/02/2021] [Accepted: 10/03/2021] [Indexed: 05/14/2023]
Abstract
ABSTRACT Collective variable-based enhanced sampling methods have been widely used to study thermodynamic properties of complex systems. Efficiency and accuracy of these enhanced sampling methods are affected by two factors: constructing appropriate collective variables for enhanced sampling and generating accurate free energy surfaces. Recently, many machine learning techniques have been developed to improve the quality of collective variables and the accuracy of free energy surfaces. Although machine learning has achieved great successes in improving enhanced sampling methods, there are still many challenges and open questions. In this perspective, we shall review recent developments on integrating machine learning techniques and collective variable-based enhanced sampling approaches. We also discuss challenges and future research directions including generating kinetic information, exploring high-dimensional free energy surfaces, and efficiently sampling all-atom configurations. GRAPHIC ABSTRACT
Collapse
Affiliation(s)
- Ming Chen
- Department of Chemistry, Purdue University, West Lafayette, IN 47907 USA
| |
Collapse
|
22
|
Hoxha M, Kamberaj H. Automation of some macromolecular properties using a machine learning approach. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abe7b6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022] Open
Abstract
Abstract
In this study, we employed a newly developed method to predict macromolecular properties using a swarm artificial neural network (ANN) method as a machine learning approach. In this method, the molecular structures are represented by the feature description vectors used as training input data for a neural network. This study aims to develop an efficient approach for training an ANN using either experimental or quantum mechanics data. We aim to introduce an error model controlling the reliability of the prediction confidence interval using a bootstrapping swarm approach. We created different datasets of selected experimental or quantum mechanics results. Using this optimized ANN, we hope to predict properties and their statistical errors for new molecules. There are four datasets used in this study. That includes the dataset of 642 small organic molecules with known experimental hydration free energies, the dataset of 1475 experimental pKa values of ionizable groups in 192 proteins, the dataset of 2693 mutants in 14 proteins with given experimental values of changes in the Gibbs free energy, and a dataset of 7101 quantum mechanics heat of formation calculations. All the data are prepared and optimized using the AMBER force field in the CHARMM macromolecular computer simulation program. The bootstrapping swarm ANN code for performing the optimization and prediction is written in Python computer programming language. The descriptor vectors of the small molecules are based on the Coulomb matrix and sum over bond properties. For the macromolecular systems, they consider the chemical-physical fingerprints of the region in the vicinity of each amino acid.
Collapse
|
23
|
Glielmo A, Husic BE, Rodriguez A, Clementi C, Noé F, Laio A. Unsupervised Learning Methods for Molecular Simulation Data. Chem Rev 2021; 121:9722-9758. [PMID: 33945269 PMCID: PMC8391792 DOI: 10.1021/acs.chemrev.0c01195] [Citation(s) in RCA: 150] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2020] [Indexed: 12/21/2022]
Abstract
Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of data produced by atomistic and molecular simulations, in material science, solid state physics, biophysics, and biochemistry. In this Review, we provide a comprehensive overview of the methods of unsupervised learning that have been most commonly used to investigate simulation data and indicate likely directions for further developments in the field. In particular, we discuss feature representation of molecular systems and present state-of-the-art algorithms of dimensionality reduction, density estimation, and clustering, and kinetic models. We divide our discussion into self-contained sections, each discussing a specific method. In each section, we briefly touch upon the mathematical and algorithmic foundations of the method, highlight its strengths and limitations, and describe the specific ways in which it has been used-or can be used-to analyze molecular simulation data.
Collapse
Affiliation(s)
- Aldo Glielmo
- International
School for Advanced Studies (SISSA) 34014 Trieste, Italy
| | - Brooke E. Husic
- Freie
Universität Berlin, Department of Mathematics
and Computer Science, 14195 Berlin, Germany
| | - Alex Rodriguez
- International Centre for Theoretical
Physics (ICTP), Condensed Matter and Statistical
Physics Section, 34100 Trieste, Italy
| | - Cecilia Clementi
- Freie
Universität Berlin, Department for
Physics, 14195 Berlin, Germany
- Rice
University Houston, Department of Chemistry, Houston, Texas 77005, United States
| | - Frank Noé
- Freie
Universität Berlin, Department of Mathematics
and Computer Science, 14195 Berlin, Germany
- Freie
Universität Berlin, Department for
Physics, 14195 Berlin, Germany
- Rice
University Houston, Department of Chemistry, Houston, Texas 77005, United States
| | - Alessandro Laio
- International
School for Advanced Studies (SISSA) 34014 Trieste, Italy
- International Centre for Theoretical
Physics (ICTP), Condensed Matter and Statistical
Physics Section, 34100 Trieste, Italy
| |
Collapse
|
24
|
Keith JA, Vassilev-Galindo V, Cheng B, Chmiela S, Gastegger M, Müller KR, Tkatchenko A. Combining Machine Learning and Computational Chemistry for Predictive Insights Into Chemical Systems. Chem Rev 2021; 121:9816-9872. [PMID: 34232033 PMCID: PMC8391798 DOI: 10.1021/acs.chemrev.1c00107] [Citation(s) in RCA: 265] [Impact Index Per Article: 66.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2021] [Indexed: 12/23/2022]
Abstract
Machine learning models are poised to make a transformative impact on chemical sciences by dramatically accelerating computational algorithms and amplifying insights available from computational chemistry methods. However, achieving this requires a confluence and coaction of expertise in computer science and physical sciences. This Review is written for new and experienced researchers working at the intersection of both fields. We first provide concise tutorials of computational chemistry and machine learning methods, showing how insights involving both can be achieved. We follow with a critical review of noteworthy applications that demonstrate how computational chemistry and machine learning can be used together to provide insightful (and useful) predictions in molecular and materials modeling, retrosyntheses, catalysis, and drug design.
Collapse
Affiliation(s)
- John A. Keith
- Department
of Chemical and Petroleum Engineering Swanson School of Engineering, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Valentin Vassilev-Galindo
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| | - Bingqing Cheng
- Accelerate
Programme for Scientific Discovery, Department
of Computer Science and Technology, 15 J. J. Thomson Avenue, Cambridge CB3 0FD, United Kingdom
| | - Stefan Chmiela
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Michael Gastegger
- Department
of Software Engineering and Theoretical Computer Science, Technische Universität Berlin, 10587, Berlin, Germany
| | - Klaus-Robert Müller
- Machine
Learning Group, Technische Universität
Berlin, 10587, Berlin, Germany
- Department
of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea
- Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany
- Google Research, Brain Team, 10117 Berlin, Germany
| | - Alexandre Tkatchenko
- Department
of Physics and Materials Science, University
of Luxembourg, L-1511 Luxembourg City, Luxembourg
| |
Collapse
|
25
|
Pal A, Pal S, Verma S, Shiga M, Nair NN. Mean force based temperature accelerated sliced sampling: Efficient reconstruction of high dimensional free energy landscapes. J Comput Chem 2021; 42:1996-2003. [DOI: 10.1002/jcc.26727] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2021] [Revised: 06/28/2021] [Accepted: 07/21/2021] [Indexed: 12/14/2022]
Affiliation(s)
- Asit Pal
- Department of Chemistry Indian Institute of Technology Kanpur Kanpur India
| | - Subhendu Pal
- Department of Chemistry Indian Institute of Technology Kanpur Kanpur India
| | - Shivani Verma
- Department of Chemistry Indian Institute of Technology Kanpur Kanpur India
| | - Motoyuki Shiga
- Center for Computational Science and E‐Systems Japan Atomic Energy Agency Chiba Japan
| | - Nisanth N. Nair
- Department of Chemistry Indian Institute of Technology Kanpur Kanpur India
| |
Collapse
|
26
|
Lin X, Qi Y, Latham AP, Zhang B. Multiscale modeling of genome organization with maximum entropy optimization. J Chem Phys 2021; 155:010901. [PMID: 34241389 PMCID: PMC8253599 DOI: 10.1063/5.0044150] [Citation(s) in RCA: 39] [Impact Index Per Article: 9.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Accepted: 04/28/2021] [Indexed: 12/15/2022] Open
Abstract
Three-dimensional (3D) organization of the human genome plays an essential role in all DNA-templated processes, including gene transcription, gene regulation, and DNA replication. Computational modeling can be an effective way of building high-resolution genome structures and improving our understanding of these molecular processes. However, it faces significant challenges as the human genome consists of over 6 × 109 base pairs, a system size that exceeds the capacity of traditional modeling approaches. In this perspective, we review the progress that has been made in modeling the human genome. Coarse-grained models parameterized to reproduce experimental data via the maximum entropy optimization algorithm serve as effective means to study genome organization at various length scales. They have provided insight into the principles of whole-genome organization and enabled de novo predictions of chromosome structures from epigenetic modifications. Applications of these models at a near-atomistic resolution further revealed physicochemical interactions that drive the phase separation of disordered proteins and dictate chromatin stability in situ. We conclude with an outlook on the opportunities and challenges in studying chromosome dynamics.
Collapse
Affiliation(s)
- Xingcheng Lin
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Yifeng Qi
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Andrew P. Latham
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, USA
| |
Collapse
|
27
|
Moradzadeh A, Aluru NR. Understanding simple liquids through statistical and deep learning approaches. J Chem Phys 2021; 154:204503. [PMID: 34241171 DOI: 10.1063/5.0046226] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022] Open
Abstract
Statistical and deep learning-based methods are employed to obtain insights into the quasi-universal properties of simple liquids. In the first part, a statistical model is employed to provide a probabilistic explanation for the similarity in the structure of simple liquids interacting with different pair potential forms, collectively known as simple liquids. The methodology works by sampling the radial distribution function and the number of interacting particles within the cutoff distance, and it produces the probability density function of the net force. We show that matching the probability distribution of the net force can be a direct route to parameterize simple liquid pair potentials with a similar structure, as the net force is the main component of the Newtonian equations of motion. The statistical model is assessed and validated against various cases. In the second part, we exploit DeepILST [A. Moradzadeh and N. R. Aluru, J. Phys. Chem. Lett. 10, 1242-1250 (2019)], a data-driven and deep-learning assisted framework to parameterize the standard 12-6 Lennard-Jones (LJ) pair potential, to find structurally equivalent/isomorphic LJ liquids that identify constant order parameter [τ=∫0 ξcf gξ-1ξ2dξ, where gξ and ξ(=rρ13) are the reduced radial distribution function and radial distance, respectively] systems in the space of non-dimensional temperature and density of the LJ liquids. We also investigate the consistency of DeepILST in reproducibility of radial distribution functions of various quasi-universal potentials, e.g., exponential, inverse-power-law, and Yukawa pair potentials, quantified based on the radial distribution functions and Kullback-Leibler errors. Our results provide insights into the quasi-universality of simple liquids using the statistical and deep learning methods.
Collapse
Affiliation(s)
- A Moradzadeh
- Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| | - N R Aluru
- Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA
| |
Collapse
|
28
|
Banerjee A, Jasrasaria D, Niblett SP, Wales DJ. Crystal Structure Prediction for Benzene Using Basin-Hopping Global Optimization. J Phys Chem A 2021; 125:3776-3784. [PMID: 33881850 PMCID: PMC8279651 DOI: 10.1021/acs.jpca.1c00903] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2021] [Revised: 04/07/2021] [Indexed: 11/29/2022]
Abstract
Organic molecules can be stable in distinct crystalline forms, known as polymorphs, which have significant consequences for industrial applications. Here, we predict the polymorphs of crystalline benzene computationally for an accurate anisotropic model parametrized to reproduce electronic structure calculations. We adapt the basin-hopping global optimization procedure to the case of crystalline unit cells, simultaneously optimizing the molecular coordinates and unit cell parameters to locate multiple low-energy structures from a variety of crystal space groups. We rapidly locate all the well-established experimental polymorphs of benzene, each of which corresponds to a single local energy minimum of the model. Our results show that basin-hopping can be both an efficient and effective tool for polymorphic crystal structure prediction, requiring no a priori experimental knowledge of cell parameters or symmetry.
Collapse
Affiliation(s)
- Atreyee Banerjee
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield
Road, Cambridge CB2 1EW, United Kingdom
- Max
Planck Institute for Polymer Research, 55128 Mainz, Germany
| | - Dipti Jasrasaria
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield
Road, Cambridge CB2 1EW, United Kingdom
- Department
of Chemistry, University of California, Berkeley, California 94609, United States
| | - Samuel P. Niblett
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield
Road, Cambridge CB2 1EW, United Kingdom
- Department
of Chemistry, University of California, Berkeley, California 94609, United States
- Materials
Science Division, Lawrence Berkeley National
Laboratory, Berkeley, California 94609, United States
| | - David J. Wales
- Yusuf
Hamied Department of Chemistry, University
of Cambridge, Lensfield
Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
29
|
Ding X, Lin X, Zhang B. Stability and folding pathways of tetra-nucleosome from six-dimensional free energy surface. Nat Commun 2021; 12:1091. [PMID: 33597548 PMCID: PMC7889939 DOI: 10.1038/s41467-021-21377-z] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2020] [Accepted: 01/22/2021] [Indexed: 01/01/2023] Open
Abstract
The three-dimensional organization of chromatin is expected to play critical roles in regulating genome functions. High-resolution characterization of its structure and dynamics could improve our understanding of gene regulation mechanisms but has remained challenging. Using a near-atomistic model that preserves the chemical specificity of protein-DNA interactions at residue and base-pair resolution, we studied the stability and folding pathways of a tetra-nucleosome. Dynamical simulations performed with an advanced sampling technique uncovered multiple pathways that connect open chromatin configurations with the zigzag crystal structure. Intermediate states along the simulated folding pathways resemble chromatin configurations reported from in situ experiments. We further determined a six-dimensional free energy surface as a function of the inter-nucleosome distances via a deep learning approach. The zigzag structure can indeed be seen as the global minimum of the surface. However, it is not favored by a significant amount relative to the partially unfolded, in situ configurations. Chemical perturbations such as histone H4 tail acetylation and thermal fluctuations can further tilt the energetic balance to stabilize intermediate states. Our study provides insight into the connection between various reported chromatin configurations and has implications on the in situ relevance of the 30 nm fiber. The three-dimensional organization of chromatin plays critical roles in regulating genome function. Here the authors apply a near atomistic model to study the structure and dynamics of the chromatin folding unit - the tetra-nucleosome - to provide insight into how chromatin folds.
Collapse
Affiliation(s)
- Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Xingcheng Lin
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA.
| |
Collapse
|
30
|
Cendagorta JR, Shen H, Bačić Z, Tuckerman ME. Enhanced Sampling Path Integral Methods Using Neural Network Potential Energy Surfaces with Application to Diffusion in Hydrogen Hydrates. ADVANCED THEORY AND SIMULATIONS 2020. [DOI: 10.1002/adts.202000258] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Affiliation(s)
| | - Hengyuan Shen
- Department of Chemistry New York University Shanghai 1555 Century Avenue Pudong Shanghai 200122 China
| | - Zlatko Bačić
- Department of Chemistry New York University New York NY 10003 USA
- NYU‐ECNU Center for Computational Chemistry at NYU Shanghai 3663 Zhongshan Road, North Shanghai 200062 China
| | - Mark E. Tuckerman
- Department of Chemistry New York University New York NY 10003 USA
- NYU‐ECNU Center for Computational Chemistry at NYU Shanghai 3663 Zhongshan Road, North Shanghai 200062 China
- Courant Institute of Mathematical Sciences New York University New York NY 10012 USA
| |
Collapse
|
31
|
Abstract
Fast and accurate evaluation of free energy has broad applications from drug design to material engineering. Computing the absolute free energy is of particular interest since it allows the assessment of the relative stability between states without intermediates. Here, we introduce a general framework for calculating the absolute free energy of a state. A key step of the calculation is the definition of a reference state with tractable deep generative models using locally sampled configurations. The absolute free energy of this reference state is zero by design. The free energy for the state of interest can then be determined as the difference from the reference. We applied this approach to both discrete and continuous systems and demonstrated its effectiveness. It was found that the Bennett acceptance ratio method provides more accurate and efficient free energy estimations than approximate expressions based on work. We anticipate the method presented here to be a valuable strategy for computing free energy differences.
Collapse
Affiliation(s)
- Xinqiang Ding
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| | - Bin Zhang
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
32
|
Zhang J, Lei YK, Yang YI, Gao YQ. Deep learning for variational multiscale molecular modeling. J Chem Phys 2020; 153:174115. [DOI: 10.1063/5.0026836] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Affiliation(s)
- Jun Zhang
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Yao-Kun Lei
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
| | - Yi Isaac Yang
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
| | - Yi Qin Gao
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
- Beijing Advanced Innovation Center for Genomics, Peking University, 100871 Beijing, China
- Biomedical Pioneering Innovation Center, Peking University, 100871 Beijing, China
| |
Collapse
|
33
|
Bogojeski M, Vogt-Maranto L, Tuckerman ME, Müller KR, Burke K. Quantum chemical accuracy from density functional approximations via machine learning. Nat Commun 2020; 11:5223. [PMID: 33067479 PMCID: PMC7567867 DOI: 10.1038/s41467-020-19093-1] [Citation(s) in RCA: 153] [Impact Index Per Article: 30.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Accepted: 09/24/2020] [Indexed: 12/21/2022] Open
Abstract
Kohn-Sham density functional theory (DFT) is a standard tool in most branches of chemistry, but accuracies for many molecules are limited to 2-3 kcal ⋅ mol-1 with presently-available functionals. Ab initio methods, such as coupled-cluster, routinely produce much higher accuracy, but computational costs limit their application to small molecules. In this paper, we leverage machine learning to calculate coupled-cluster energies from DFT densities, reaching quantum chemical accuracy (errors below 1 kcal ⋅ mol-1) on test data. Moreover, density-based Δ-learning (learning only the correction to a standard DFT calculation, termed Δ-DFT ) significantly reduces the amount of training data required, particularly when molecular symmetries are included. The robustness of Δ-DFT is highlighted by correcting "on the fly" DFT-based molecular dynamics (MD) simulations of resorcinol (C6H4(OH)2) to obtain MD trajectories with coupled-cluster accuracy. We conclude, therefore, that Δ-DFT facilitates running gas-phase MD simulations with quantum chemical accuracy, even for strained geometries and conformer changes where standard DFT fails.
Collapse
Affiliation(s)
- Mihail Bogojeski
- Machine Learning Group, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany
| | | | - Mark E Tuckerman
- Department of Chemistry, New York University, New York, NY, 10003, USA.
- Courant Institute of Mathematical Science, New York University, New York, NY, 10012, USA.
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Road North, Shanghai, 200062, China.
| | - Klaus-Robert Müller
- Machine Learning Group, Technische Universität Berlin, Marchstr. 23, 10587, Berlin, Germany.
- Department of Artificial Intelligence, Korea University, Anam-dong, Seongbuk-gu, Seoul, 02841, Korea.
- Max-Planck-Institut für Informatik, Stuhlsatzenhausweg, 66123, Saarbrücken, Germany.
| | - Kieron Burke
- Department of Physics and Astronomy, University of California, Irvine, CA, 92697, USA.
- Department of Chemistry, University of California, Irvine, CA, 92697, USA.
| |
Collapse
|
34
|
Lazim R, Suh D, Choi S. Advances in Molecular Dynamics Simulations and Enhanced Sampling Methods for the Study of Protein Systems. Int J Mol Sci 2020; 21:E6339. [PMID: 32882859 PMCID: PMC7504087 DOI: 10.3390/ijms21176339] [Citation(s) in RCA: 102] [Impact Index Per Article: 20.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2020] [Revised: 08/10/2020] [Accepted: 08/11/2020] [Indexed: 12/12/2022] Open
Abstract
Molecular dynamics (MD) simulation is a rigorous theoretical tool that when used efficiently could provide reliable answers to questions pertaining to the structure-function relationship of proteins. Data collated from protein dynamics can be translated into useful statistics that can be exploited to sieve thermodynamics and kinetics crucial for the elucidation of mechanisms responsible for the modulation of biological processes such as protein-ligand binding and protein-protein association. Continuous modernization of simulation tools enables accurate prediction and characterization of the aforementioned mechanisms and these qualities are highly beneficial for the expedition of drug development when effectively applied to structure-based drug design (SBDD). In this review, current all-atom MD simulation methods, with focus on enhanced sampling techniques, utilized to examine protein structure, dynamics, and functions are discussed. This review will pivot around computer calculations of protein-ligand and protein-protein systems with applications to SBDD. In addition, we will also be highlighting limitations faced by current simulation tools as well as the improvements that have been made to ameliorate their efficiency.
Collapse
Affiliation(s)
- Raudah Lazim
- College of Pharmacy and Graduate School of Pharmaceutical Sciences, Ewha Womans University, Seoul 03760, Korea
| | - Donghyuk Suh
- College of Pharmacy and Graduate School of Pharmaceutical Sciences, Ewha Womans University, Seoul 03760, Korea
| | - Sun Choi
- College of Pharmacy and Graduate School of Pharmaceutical Sciences, Ewha Womans University, Seoul 03760, Korea
| |
Collapse
|
35
|
Wasielewski MR, Forbes MDE, Frank NL, Kowalski K, Scholes GD, Yuen-Zhou J, Baldo MA, Freedman DE, Goldsmith RH, Goodson T, Kirk ML, McCusker JK, Ogilvie JP, Shultz DA, Stoll S, Whaley KB. Exploiting chemistry and molecular systems for quantum information science. Nat Rev Chem 2020; 4:490-504. [PMID: 37127960 DOI: 10.1038/s41570-020-0200-5] [Citation(s) in RCA: 255] [Impact Index Per Article: 51.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/27/2020] [Indexed: 12/21/2022]
Abstract
The power of chemistry to prepare new molecules and materials has driven the quest for new approaches to solve problems having global societal impact, such as in renewable energy, healthcare and information science. In the latter case, the intrinsic quantum nature of the electronic, nuclear and spin degrees of freedom in molecules offers intriguing new possibilities to advance the emerging field of quantum information science. In this Perspective, which resulted from discussions by the co-authors at a US Department of Energy workshop held in November 2018, we discuss how chemical systems and reactions can impact quantum computing, communication and sensing. Hierarchical molecular design and synthesis, from small molecules to supramolecular assemblies, combined with new spectroscopic probes of quantum coherence and theoretical modelling of complex systems, offer a broad range of possibilities to realize practical quantum information science applications.
Collapse
Affiliation(s)
| | - Malcolm D E Forbes
- Department of Chemistry, Bowling Green State University, Bowling Green, OH, USA
| | - Natia L Frank
- Department of Chemistry, University of Nevada-Reno, Reno, Nevada, USA
| | - Karol Kowalski
- Pacific Northwest National Laboratory, Richland, WA, USA
| | | | - Joel Yuen-Zhou
- Department of Chemistry & Biochemistry, University of California, San Diego, La Jolla, CA, USA
| | - Marc A Baldo
- Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Danna E Freedman
- Department of Chemistry, Northwestern University, Evanston, IL, USA
| | | | - Theodore Goodson
- Department of Chemistry, University of Michigan, Ann Arbor, MI, USA
| | - Martin L Kirk
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, NM, USA
| | - James K McCusker
- Department of Chemistry, Michigan State University, East Lansing, MI, USA
| | | | - David A Shultz
- Department of Chemistry, North Carolina State University, Raleigh, NC, USA
| | - Stefan Stoll
- Department of Chemistry, University of Washington, Seattle, WA, USA
| | - K Birgitta Whaley
- Department of Chemistry, University of California, Berkeley, CA, USA
| |
Collapse
|
36
|
Sinz P, Swift MW, Brumwell X, Liu J, Kim KJ, Qi Y, Hirn M. Wavelet scattering networks for atomistic systems with extrapolation of material properties. J Chem Phys 2020; 153:084109. [DOI: 10.1063/5.0016020] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022] Open
Affiliation(s)
- Paul Sinz
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Michael W. Swift
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Xavier Brumwell
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Jialin Liu
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Kwang Jin Kim
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Yue Qi
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, Michigan 48824-1226, USA
| | - Matthew Hirn
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824-1226, USA
- Department of Mathematics, Michigan State University, East Lansing, Michigan 48824-1226, USA
- Center for Quantum Computing, Science and Engineering, Michigan State University, East Lansing, Michigan 48824-1226, USA
| |
Collapse
|
37
|
Zhang J, Lei YK, Zhang Z, Chang J, Li M, Han X, Yang L, Yang YI, Gao YQ. A Perspective on Deep Learning for Molecular Modeling and Simulations. J Phys Chem B 2020. [DOI: 10.1021/acs.jpcb.0c04473] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
38
|
Zhang J, Lei YK, Zhang Z, Chang J, Li M, Han X, Yang L, Yang YI, Gao YQ. A Perspective on Deep Learning for Molecular Modeling and Simulations. J Phys Chem A 2020; 124:6745-6763. [PMID: 32786668 DOI: 10.1021/acs.jpca.0c04473] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Deep learning is transforming many areas in science, and it has great potential in modeling molecular systems. However, unlike the mature deployment of deep learning in computer vision and natural language processing, its development in molecular modeling and simulations is still at an early stage, largely because the inductive biases of molecules are completely different from those of images or texts. Footed on these differences, we first reviewed the limitations of traditional deep learning models from the perspective of molecular physics and wrapped up some relevant technical advancement at the interface between molecular modeling and deep learning. We do not focus merely on the ever more complex neural network models; instead, we introduce various useful concepts and ideas brought by modern deep learning. We hope that transacting these ideas into molecular modeling will create new opportunities. For this purpose, we summarized several representative applications, ranging from supervised to unsupervised and reinforcement learning, and discussed their connections with the emerging trends in deep learning. Finally, we give an outlook for promising directions which may help address the existing issues in the current framework of deep molecular modeling.
Collapse
Affiliation(s)
- Jun Zhang
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
- Department of Mathematics and Computer Science, Freie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Yao-Kun Lei
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
| | - Zhen Zhang
- Department of Physics, Tangshan Normal University, 063000 Tangshan, China
| | - Junhan Chang
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
| | - Maodong Li
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
| | - Xu Han
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
| | - Lijiang Yang
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
| | - Yi Isaac Yang
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
| | - Yi Qin Gao
- Institute of Systems and Physical Biology, Shenzhen Bay Laboratory, 518055 Shenzhen, China
- Beijing National Laboratory for Molecular Sciences, College of Chemistry and Molecular Engineering, Peking University, 100871 Beijing, China
- Beijing Advanced Innovation Center for Genomics, Peking University, 100871 Beijing, China
- Biomedical Pioneering Innovation Center, Peking University, 100871 Beijing, China
| |
Collapse
|
39
|
Shirts MR, Ferguson AL. Statistically Optimal Continuous Free Energy Surfaces from Biased Simulations and Multistate Reweighting. J Chem Theory Comput 2020; 16:4107-4125. [DOI: 10.1021/acs.jctc.0c00077] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Michael R. Shirts
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, Colorado 80309, United States
| | - Andrew L. Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, Illinois 60637, United States
| |
Collapse
|
40
|
Mitsuta Y, Shigeta Y. Analytical Method Using a Scaled Hypersphere Search for High-Dimensional Metadynamics Simulations. J Chem Theory Comput 2020; 16:3869-3878. [PMID: 32384233 DOI: 10.1021/acs.jctc.0c00010] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2022]
Abstract
Metadynamics (MTD) is one of the most effective methods for calculating the free energy surface and finding rare events. Nevertheless, numerous studies using MTD have been carried out using 3D or lower dimensional collective variables (CVs), as higher dimensional CVs require costly computational resources and the obtained results are too complex to understand the important events. The latter issue can be conveniently solved by utilizing the free energy reaction network (FERN), which is a graph structure consisting of edges of minimum free energy paths (MFEPs), nodes of equation (EQ) points, and transition state (TS) points. In the present article, a new method for exploring FERNs on high-dimensional CVs using MTD and the scaled hypersphere search (SHS) method is described. A test calculation based on the MTD-SHS simulation of met-enkephalin in explicit water with 7 CVs was conducted. As a result, 889 EQ points and 1805 TS points were found. The MTD-SHS approach can find MFEPs exhaustively; therefore, the FERNs can be estimated without any a priori knowledge of the EQ and TS points.
Collapse
Affiliation(s)
- Yuki Mitsuta
- Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8577, Japan
| | - Yasuteru Shigeta
- Center for Computational Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki 305-8577, Japan
| |
Collapse
|
41
|
Sawada R, Iwasaki Y, Ishida M. Model-Free Cluster Analysis of Physical Property Data using Information Maximizing Self-Argument Training. Sci Rep 2020; 10:7903. [PMID: 32404915 PMCID: PMC7221089 DOI: 10.1038/s41598-020-64281-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2019] [Accepted: 04/09/2020] [Indexed: 12/04/2022] Open
Abstract
We present semi-supervised information maximizing self-argument training (IMSAT), a neural network-based classification method that works without the preparation of labeled data. Semi-supervised IMSAT can amplify specific differences and avoid undesirable misclassification in accordance with the purpose. We demonstrate that semi-supervised IMSAT has a comparable performance with existing methods for semi-supervised learning of image classification and can also classify real experimental data (X-ray diffraction patterns and thermoelectric hysteresis curves) in the same way even though their shape and dimensions are different. Our algorithm will contribute to the automation of big data processing and artificial intelligence-driven material development.
Collapse
Affiliation(s)
- Ryohto Sawada
- System Platform Research Laboratories, NEC Corporation, Tsukuba, 305-8501, Japan.
| | - Yuma Iwasaki
- System Platform Research Laboratories, NEC Corporation, Tsukuba, 305-8501, Japan
- PRESTO, JST, Saitama, 322-0012, Japan
| | - Masahiko Ishida
- System Platform Research Laboratories, NEC Corporation, Tsukuba, 305-8501, Japan
| |
Collapse
|
42
|
Cendagorta JR, Tolpin J, Schneider E, Topper RQ, Tuckerman ME. Comparison of the Performance of Machine Learning Models in Representing High-Dimensional Free Energy Surfaces and Generating Observables. J Phys Chem B 2020; 124:3647-3660. [PMID: 32275148 DOI: 10.1021/acs.jpcb.0c01218] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023]
Abstract
Free energy surfaces of chemical and physical systems are often generated using a popular class of enhanced sampling methods that target a set of collective variables (CVs) chosen to distinguish the characteristic features of these surfaces. While some of these approaches are typically limited to low (∼1-3)-dimensional CV subspaces, methods such as driven adiabatic free-energy dynamics/temperature-accelerated molecular dynamics have been shown to be capable of generating free energy surfaces of quite high dimension by sampling the associated marginal probability distribution via full sweeps over the CV landscape. These approaches repeatedly visit conformational basins, producing a scattering of points within the basins on each visit. Consequently, they are particularly amenable to synergistic combination with regression machine learning methods for filling in the surfaces between the sampled points and for providing a compact and continuous (or semicontinuous) representation of the surfaces that can be easily stored and used for further computation of observable properties. Given the central role of machine learning techniques in this combined approach, it is timely to provide a detailed comparison of the performance of different machine learning strategies and models, including neural networks, kernel ridge regression, support vector machines, and weighted neighbor schemes, for their ability to learn these high-dimensional surfaces as a function of the amount of sampled training data and, once trained, to subsequently generate accurate ensemble averages corresponding to observable properties of the systems. In this article, we perform such a comparison on a set of oligopeptides, in both gas and aqueous phases, corresponding to CV spaces of 2-10 dimensions and assess their ability to provide a global representation of the free energy surfaces and to generate accurate ensemble averages.
Collapse
Affiliation(s)
- Joseph R Cendagorta
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Jocelyn Tolpin
- Department of Statistics, Harvard University, Cambridge, Massachusetts 02138, United States
| | - Elia Schneider
- Department of Chemistry, New York University, New York, New York 10003, United States
| | - Robert Q Topper
- Department of Chemistry, The Cooper Union for the Advancement of Science and Art, New York, New York 10003, United States
| | - Mark E Tuckerman
- Department of Chemistry, New York University, New York, New York 10003, United States.,Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, United States.,NYU-ECNU Center for Computational Chemistry at NYU Shanghai, Shanghai 200062, China
| |
Collapse
|
43
|
Prakashchand DD, Ahalawat N, Bandyopadhyay S, Sengupta S, Mondal J. Nonaffine Displacements Encode Collective Conformational Fluctuations in Proteins. J Chem Theory Comput 2020; 16:2508-2516. [DOI: 10.1021/acs.jctc.9b01100] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Dube Dheeraj Prakashchand
- Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500107, India
| | - Navjeet Ahalawat
- Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500107, India
- Department of Molecular Biology, Biotechnology and Bioinformatics, Chaudhary Charan Singh Haryana Agricultural University, Hisar 125004, India
| | - Satyabrata Bandyopadhyay
- Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500107, India
| | - Surajit Sengupta
- Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500107, India
| | - Jagannath Mondal
- Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500107, India
| |
Collapse
|
44
|
Sidky H, Chen W, Ferguson AL. Machine learning for collective variable discovery and enhanced sampling in biomolecular simulation. Mol Phys 2020. [DOI: 10.1080/00268976.2020.1737742] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Affiliation(s)
- Hythem Sidky
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
| | - Wei Chen
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Andrew L. Ferguson
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
| |
Collapse
|
45
|
Abstract
Machine learning (ML) is transforming all areas of science. The complex and time-consuming calculations in molecular simulations are particularly suitable for an ML revolution and have already been profoundly affected by the application of existing ML methods. Here we review recent ML methods for molecular simulation, with particular focus on (deep) neural networks for the prediction of quantum-mechanical energies and forces, on coarse-grained molecular dynamics, on the extraction of free energy surfaces and kinetics, and on generative network approaches to sample molecular equilibrium structures and compute thermodynamics. To explain these methods and illustrate open methodological problems, we review some important principles of molecular physics and describe how they can be incorporated into ML structures. Finally, we identify and describe a list of open challenges for the interface between ML and molecular simulation.
Collapse
Affiliation(s)
- Frank Noé
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany; .,Department of Physics, Freie Universität Berlin, 14195 Berlin, Germany.,Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA;
| | - Alexandre Tkatchenko
- Physics and Materials Science Research Unit, University of Luxembourg, 1511 Luxembourg, Luxembourg;
| | - Klaus-Robert Müller
- Department of Computer Science, Technical University Berlin, 10587 Berlin, Germany; .,Max-Planck-Institut für Informatik, 66123 Saarbrücken, Germany.,Department of Brain and Cognitive Engineering, Korea University, Seoul 136-713, South Korea
| | - Cecilia Clementi
- Department of Mathematics and Computer Science, Freie Universität Berlin, 14195 Berlin, Germany; .,Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, USA; .,Department of Physics, Rice University, Houston, Texas 77005, USA
| |
Collapse
|
46
|
Whitfield TW, Ragland DA, Zeldovich KB, Schiffer CA. Characterizing Protein-Ligand Binding Using Atomistic Simulation and Machine Learning: Application to Drug Resistance in HIV-1 Protease. J Chem Theory Comput 2020; 16:1284-1299. [PMID: 31877249 DOI: 10.1021/acs.jctc.9b00781] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Over the past several decades, atomistic simulations of biomolecules, whether carried out using molecular dynamics or Monte Carlo techniques, have provided detailed insights into their function. Comparing the results of such simulations for a few closely related systems has guided our understanding of the mechanisms by which changes such as ligand binding or mutation can alter the function. The general problem of detecting and interpreting such mechanisms from simulations of many related systems, however, remains a challenge. This problem is addressed here by applying supervised and unsupervised machine learning techniques to a variety of thermodynamic observables extracted from molecular dynamics simulations of different systems. As an important test case, these methods are applied to understand the evasion by human immunodeficiency virus type-1 (HIV-1) protease of darunavir, a potent inhibitor to which resistance can develop via the simultaneous mutation of multiple amino acids. Complex mutational patterns have been observed among resistant strains, presenting a challenge to developing a mechanistic picture of resistance in the protease. In order to dissect these patterns and gain mechanistic insight into the role of specific mutations, molecular dynamics simulations were carried out on a collection of HIV-1 protease variants, chosen to include highly resistant strains and susceptible controls, in complex with darunavir. Using a machine learning approach that takes advantage of the hierarchical nature in the relationships among the sequence, structure, and function, an integrative analysis of these trajectories reveals key details of the resistance mechanism, including changes in the protein structure, hydrogen bonding, and protein-ligand contacts.
Collapse
Affiliation(s)
- Troy W Whitfield
- Department of Medicine , University of Massachusetts Medical School , Worcester , Massachusetts 01605 , United States.,Program in Bioinformatics and Integrative Biology , University of Massachusetts Medical School , Worcester , Massachusetts 01605 , United States
| | - Debra A Ragland
- Department of Biochemistry and Molecular Pharmacology , University of Massachusetts Medical School , Worcester , Massachusetts 01605 , United States
| | - Konstantin B Zeldovich
- Program in Bioinformatics and Integrative Biology , University of Massachusetts Medical School , Worcester , Massachusetts 01605 , United States
| | - Celia A Schiffer
- Department of Biochemistry and Molecular Pharmacology , University of Massachusetts Medical School , Worcester , Massachusetts 01605 , United States
| |
Collapse
|
47
|
Rogal J, Schneider E, Tuckerman ME. Neural-Network-Based Path Collective Variables for Enhanced Sampling of Phase Transformations. PHYSICAL REVIEW LETTERS 2019; 123:245701. [PMID: 31922858 DOI: 10.1103/physrevlett.123.245701] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/04/2019] [Indexed: 05/27/2023]
Abstract
The investigation of the microscopic processes underlying structural phase transformations in solids is extremely challenging for both simulation and experiment. Atomistic simulations of solid-solid phase transitions require extensive sampling of the corresponding high-dimensional and often rugged energy landscape. Here, we propose a rigorous construction of a 1D path collective variable that is used in combination with enhanced sampling techniques for efficient exploration of the transformation mechanisms. The path collective variable is defined in a space spanned by global classifiers that are derived from local structural units. A reliable identification of the local structural environments is achieved by employing a neural-network-based classification scheme. The proposed path collective variable is generally applicable and enables the investigation of both transformation mechanisms and kinetics.
Collapse
Affiliation(s)
- Jutta Rogal
- Interdisciplinary Centre for Advanced Materials Simulation, Ruhr-Universität Bochum, 44780 Bochum, Germany
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
| | - Elia Schneider
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
| | - Mark E Tuckerman
- Department of Chemistry, New York University (NYU), New York, New York 10003, USA
- Courant Institute of Mathematical Sciences, New York University (NYU), New York, New York 10012, USA
- NYU-ECNU Center for Computational Chemistry at NYU Shanghai, 3663 Zhongshan Road North, Shanghai 200062, China
| |
Collapse
|
48
|
Shen L, Tang D, Xie B, Fang WH. Quantum Trajectory Mean-Field Method for Nonadiabatic Dynamics in Photochemistry. J Phys Chem A 2019; 123:7337-7350. [PMID: 31373814 DOI: 10.1021/acs.jpca.9b03480] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The mixed quantum-classical dynamical approaches have been widely used to study nonadiabatic phenomena in photochemistry and photobiology, in which the time evolutions of the electronic and nuclear subsystems are treated based on quantum and classical mechanics, respectively. The key issue is how to deal with coherence and decoherence during the propagation of the two subsystems, which has been the subject of numerous investigations for a few decades. A brief description on Ehrenfest mean-field and surface-hopping (SH) methods is first provided, and then different algorithms for treatment of quantum decoherence are reviewed in the present paper. More attentions were paid to quantum trajectory mean-field (QTMF) method under the picture of quantum measurements, which is able to overcome the overcoherence problem. Furthermore, the combined QTMF and SH algorithm is proposed in the present work, which takes advantages of the QTMF and SH methods. The potential to extend the applicability of the QTMF method was briefly discussed, such as the generalization to other type of nonadiabatic transitions, the combination with multiscale computational models, and possible improvements on its accuracy and efficiency by using machine-learning techniques.
Collapse
Affiliation(s)
- Lin Shen
- Key Laboratory of Theoretical and Computational Photochemistry of Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , P. R. China
| | - Diandong Tang
- Key Laboratory of Theoretical and Computational Photochemistry of Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , P. R. China
| | - Binbin Xie
- Hangzhou Institute of Advanced Studies , Zhejiang Normal University , 1108 Gengwen Road , Hangzhou 311231 , Zhejiang P. R. China
| | - Wei-Hai Fang
- Key Laboratory of Theoretical and Computational Photochemistry of Ministry of Education, College of Chemistry , Beijing Normal University , Beijing 100875 , P. R. China
| |
Collapse
|
49
|
Rydzewski J, Valsson O. Finding multiple reaction pathways of ligand unbinding. J Chem Phys 2019; 150:221101. [DOI: 10.1063/1.5108638] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022] Open
Affiliation(s)
- Jakub Rydzewski
- Institute of Physics, Faculty of Physics, Astronomy and Informatics, Nicolaus Copernicus University, Grudziadzka 5, 87–100 Torun, Poland
| | - Omar Valsson
- Max Planck Institute for Polymer Research, Ackermannweg 10, D-55128 Mainz, Germany
| |
Collapse
|
50
|
Wang J, Olsson S, Wehmeyer C, Pérez A, Charron NE, de Fabritiis G, Noé F, Clementi C. Machine Learning of Coarse-Grained Molecular Dynamics Force Fields. ACS CENTRAL SCIENCE 2019; 5:755-767. [PMID: 31139712 PMCID: PMC6535777 DOI: 10.1021/acscentsci.8b00913] [Citation(s) in RCA: 228] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/09/2018] [Indexed: 05/17/2023]
Abstract
Atomistic or ab initio molecular dynamics simulations are widely used to predict thermodynamics and kinetics and relate them to molecular structure. A common approach to go beyond the time- and length-scales accessible with such computationally expensive simulations is the definition of coarse-grained molecular models. Existing coarse-graining approaches define an effective interaction potential to match defined properties of high-resolution models or experimental data. In this paper, we reformulate coarse-graining as a supervised machine learning problem. We use statistical learning theory to decompose the coarse-graining error and cross-validation to select and compare the performance of different models. We introduce CGnets, a deep learning approach, that learns coarse-grained free energy functions and can be trained by a force-matching scheme. CGnets maintain all physically relevant invariances and allow one to incorporate prior physics knowledge to avoid sampling of unphysical structures. We show that CGnets can capture all-atom explicit-solvent free energy surfaces with models using only a few coarse-grained beads and no solvent, while classical coarse-graining methods fail to capture crucial features of the free energy surface. Thus, CGnets are able to capture multibody terms that emerge from the dimensionality reduction.
Collapse
Affiliation(s)
- Jiang Wang
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
| | - Simon Olsson
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Christoph Wehmeyer
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Adrià Pérez
- Computational
Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr Aiguader 88, 08003 Barcelona, Spain
| | - Nicholas E. Charron
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Physics, Rice University, Houston, Texas 77005, United States
| | - Gianni de Fabritiis
- Computational
Science Laboratory, Universitat Pompeu Fabra, PRBB, C/Dr Aiguader 88, 08003 Barcelona, Spain
- Institucio
Catalana de Recerca i Estudis Avanats (ICREA), Passeig Lluis Companys 23, 08010 Barcelona, Spain
| | - Frank Noé
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
| | - Cecilia Clementi
- Center
for Theoretical Biological Physics, Rice
University, Houston, Texas 77005, United States
- Department
of Chemistry, Rice University, Houston, Texas 77005, United States
- Department
of Mathematics and Computer Science, Freie
Universität Berlin, Arnimallee 6, 14195 Berlin, Germany
- Department
of Physics, Rice University, Houston, Texas 77005, United States
| |
Collapse
|