1
|
Leung JG, Frazee NC, Brace A, Bogetti AT, Ramanathan A, Chong LT. Unsupervised Learning of Progress Coordinates during Weighted Ensemble Simulations: Application to NTL9 Protein Folding. J Chem Theory Comput 2025; 21:3691-3699. [PMID: 40105797 PMCID: PMC11983707 DOI: 10.1021/acs.jctc.4c01136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 02/21/2025] [Accepted: 02/24/2025] [Indexed: 03/20/2025]
Abstract
A major challenge for many rare-event sampling strategies is the identification of progress coordinates that capture the slowest relevant motions. Machine-learning methods that can identify progress coordinates in an unsupervised manner have therefore been of great interest to the simulation community. Here, we developed a general method for identifying progress coordinates "on-the-fly" during weighted ensemble (WE) rare-event sampling via deep learning (DL) of outliers among sampled conformations. Our method identifies outliers in a latent space model of the system's sampled conformations that is periodically trained using a convolutional variational autoencoder. As a proof of principle, we applied our DL-enhanced WE method to simulate the NTL9 protein folding process. To enable rapid tests, our simulations propagated discrete-state synthetic molecular dynamics trajectories using a generative, fine-grained Markov state model. Results revealed that our on-the-fly DL of outliers enhanced the efficiency of WE by >3-fold in estimating the folding rate constant. Our efforts are a significant step forward in the unsupervised learning of slow coordinates during rare event sampling.
Collapse
Affiliation(s)
- Jeremy
M. G. Leung
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Nicolas C. Frazee
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Alexander Brace
- Data
Science and Learning Division, Argonne National
Laboratory, Lemont, Illinois 60439, United States
- Department
of Computer Science, University of Chicago, Chicago, Illinois 60637, United States
| | - Anthony T. Bogetti
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Arvind Ramanathan
- Data
Science and Learning Division, Argonne National
Laboratory, Lemont, Illinois 60439, United States
- Department
of Computer Science, University of Chicago, Chicago, Illinois 60637, United States
| | - Lillian T. Chong
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
2
|
Paloncýová M, Valério M, Dos Santos RN, Kührová P, Šrejber M, Čechová P, Dobchev DA, Balsubramani A, Banáš P, Agarwal V, Souza PCT, Otyepka M. Computational Methods for Modeling Lipid-Mediated Active Pharmaceutical Ingredient Delivery. Mol Pharm 2025; 22:1110-1141. [PMID: 39879096 PMCID: PMC11881150 DOI: 10.1021/acs.molpharmaceut.4c00744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/06/2024] [Revised: 01/06/2025] [Accepted: 01/06/2025] [Indexed: 01/31/2025]
Abstract
Lipid-mediated delivery of active pharmaceutical ingredients (API) opened new possibilities in advanced therapies. By encapsulating an API into a lipid nanocarrier (LNC), one can safely deliver APIs not soluble in water, those with otherwise strong adverse effects, or very fragile ones such as nucleic acids. However, for the rational design of LNCs, a detailed understanding of the composition-structure-function relationships is missing. This review presents currently available computational methods for LNC investigation, screening, and design. The state-of-the-art physics-based approaches are described, with the focus on molecular dynamics simulations in all-atom and coarse-grained resolution. Their strengths and weaknesses are discussed, highlighting the aspects necessary for obtaining reliable results in the simulations. Furthermore, a machine learning, i.e., data-based learning, approach to the design of lipid-mediated API delivery is introduced. The data produced by the experimental and theoretical approaches provide valuable insights. Processing these data can help optimize the design of LNCs for better performance. In the final section of this Review, state-of-the-art of computer simulations of LNCs are reviewed, specifically addressing the compatibility of experimental and computational insights.
Collapse
Affiliation(s)
- Markéta Paloncýová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Mariana Valério
- Laboratoire
de Biologie et Modélisation de la Cellule, CNRS, UMR 5239,
Inserm, U1293, Université Claude Bernard Lyon 1, Ecole Normale
Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
- Centre Blaise
Pascal de Simulation et de Modélisation Numérique, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
| | | | - Petra Kührová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Martin Šrejber
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Petra Čechová
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | | | - Akshay Balsubramani
- mRNA Center
of Excellence, Sanofi, Waltham, Massachusetts 02451, United States
| | - Pavel Banáš
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
| | - Vikram Agarwal
- mRNA Center
of Excellence, Sanofi, Waltham, Massachusetts 02451, United States
| | - Paulo C. T. Souza
- Laboratoire
de Biologie et Modélisation de la Cellule, CNRS, UMR 5239,
Inserm, U1293, Université Claude Bernard Lyon 1, Ecole Normale
Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
- Centre Blaise
Pascal de Simulation et de Modélisation Numérique, Ecole Normale Supérieure de Lyon, 46 Allée d’Italie, 69364 Lyon, France
| | - Michal Otyepka
- Regional
Center of Advanced Technologies and Materials, Czech Advanced Technology and Research Institute (CATRIN), Palacký
University Olomouc, Šlechtitelů 27, 779 00 Olomouc, Czech Republic
- IT4Innovations,
VŠB − Technical University of Ostrava, 17. listopadu 2172/15, 708 00 Ostrava-Poruba, Czech Republic
| |
Collapse
|
3
|
Barker-Rothschild D, Chen J, Wan Z, Renneckar S, Burgert I, Ding Y, Lu Y, Rojas OJ. Lignin-based porous carbon adsorbents for CO 2 capture. Chem Soc Rev 2025; 54:623-652. [PMID: 39526409 DOI: 10.1039/d4cs00923a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2024]
Abstract
A major driver of global climate change is the rising concentration of atmospheric CO2, the mitigation of which requires the development of efficient and sustainable carbon capture technologies. Solid porous adsorbents have emerged as promising alternatives to liquid amine counterparts due to their potential to reduce regeneration costs. Among them, porous carbons stand out for their high surface area, tailorable pore structure, and exceptional thermal and mechanical properties, making them highly robust and efficient in cycling operations. Moreover, porous carbons can be synthesized from readily available organic (waste) streams, reducing costs and promoting circularity. Lignin, a renewable and abundant by-product of the forest products industry and emerging biorefineries, is a complex organic polymer with a high carbon content, making it a suitable precursor for carbon-based adsorbents. This review explores lignin's sources, structure, and thermal properties, as well as traditional and emerging methods for producing lignin-based porous adsorbents. We examine the physicochemical properties, CO2 adsorption mechanisms, and performance of lignin-derived materials. Additionally, the review highlights recent advances in lignin valorization and provides critical insights into optimizing the design of lignin-based adsorbents to enhance CO2 capture efficiency. Finally, it addresses the prospects and challenges in the field, emphasizing the significant role that lignin-derived materials could play in advancing sustainable carbon capture technologies and mitigating climate change.
Collapse
Affiliation(s)
- Daniel Barker-Rothschild
- Bioproducts Institute, Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC V6T 1Z3, Canada.
| | - Jingqian Chen
- Bioproducts Institute, Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC V6T 1Z3, Canada.
| | - Zhangmin Wan
- Bioproducts Institute, Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC V6T 1Z3, Canada.
| | - Scott Renneckar
- Department of Wood Science, University of British Columbia, 2424 Main Mall, Vancouver, BC V6T 1Z4, Canada
| | - Ingo Burgert
- Wood Materials Science, Institute for Building Materials, ETH Zürich, 8093 Zürich, Switzerland.
- WoodTec Group, Cellulose & Wood Materials, Empa, 8600 Dübendorf, Switzerland
| | - Yong Ding
- Wood Materials Science, Institute for Building Materials, ETH Zürich, 8093 Zürich, Switzerland.
- WoodTec Group, Cellulose & Wood Materials, Empa, 8600 Dübendorf, Switzerland
| | - Yi Lu
- Bioproducts Institute, Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC V6T 1Z3, Canada.
| | - Orlando J Rojas
- Bioproducts Institute, Department of Chemical and Biological Engineering, The University of British Columbia, 2360 East Mall, Vancouver, BC V6T 1Z3, Canada.
- Department of Wood Science, University of British Columbia, 2424 Main Mall, Vancouver, BC V6T 1Z4, Canada
- Department of Chemistry, University of British Columbia, 2036 Main Mall, Vancouver, BC V6T 1Z1, Canada
| |
Collapse
|
4
|
Shao D, Zhang Z, Liu X, Fu H, Shao X, Cai W. Screening Fast-Mode Motion in Collective Variable Discovery for Biochemical Processes. J Chem Theory Comput 2024; 20:10393-10405. [PMID: 39601677 DOI: 10.1021/acs.jctc.4c01282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2024]
Abstract
Collective variables (CVs) describing slow degrees of freedom (DOFs) in biomolecular assemblies are crucial for analyzing molecular dynamics trajectories, creating Markov models and performing CV-based enhanced sampling simulations. While time-lagged independent component analysis (tICA) and its nonlinear successor, time-lagged autoencoder (tAE), are widely used, they often struggle to capture protein dynamics due to interference from random fluctuations along fast DOFs. To address this issue, we propose a novel approach integrating discrete wavelet transform (DWT) with dimensionality reduction techniques. DWT effectively separates fast and slow motion in protein simulation trajectories by decoupling high- and low-frequency signals. Based on the trajectory after filtering out high-frequency signals, which corresponds to fast motion, tICA and tAE can accurately extract CVs representing slow DOFs, providing reliable insights into protein dynamics. Our method demonstrates superior performance in identifying CVs that distinguish metastable states compared to standard tICA and tAE, as validated through analyses of conformational changes of alanine dipeptide and tripeptide and folding of CLN025. Moreover, we show that DWT can be used to improve the performance of a variety of CV-finding algorithms by combining it with Deep-tICA, a cutting-edge CV-finding algorithm, to extract CVs for enhanced-sampling calculations. Given its negligible computational cost and remarkable ability to screen fast motion, we propose DWT as a "free lunch" for CV extraction, applicable to a wide range of CV-finding algorithms.
Collapse
Affiliation(s)
- Donghui Shao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Zhiteng Zhang
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Xuyang Liu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Haohao Fu
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Xueguang Shao
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Wensheng Cai
- Research Center for Analytical Sciences, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, College of Chemistry, Nankai University, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| |
Collapse
|
5
|
Harding-Larsen D, Funk J, Madsen NG, Gharabli H, Acevedo-Rocha CG, Mazurenko S, Welner DH. Protein representations: Encoding biological information for machine learning in biocatalysis. Biotechnol Adv 2024; 77:108459. [PMID: 39366493 DOI: 10.1016/j.biotechadv.2024.108459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 09/19/2024] [Accepted: 09/29/2024] [Indexed: 10/06/2024]
Abstract
Enzymes offer a more environmentally friendly and low-impact solution to conventional chemistry, but they often require additional engineering for their application in industrial settings, an endeavour that is challenging and laborious. To address this issue, the power of machine learning can be harnessed to produce predictive models that enable the in silico study and engineering of improved enzymatic properties. Such machine learning models, however, require the conversion of the complex biological information to a numerical input, also called protein representations. These inputs demand special attention to ensure the training of accurate and precise models, and, in this review, we therefore examine the critical step of encoding protein information to numeric representations for use in machine learning. We selected the most important approaches for encoding the three distinct biological protein representations - primary sequence, 3D structure, and dynamics - to explore their requirements for employment and inductive biases. Combined representations of proteins and substrates are also introduced as emergent tools in biocatalysis. We propose the division of fixed representations, a collection of rule-based encoding strategies, and learned representations extracted from the latent spaces of large neural networks. To select the most suitable protein representation, we propose two main factors to consider. The first one is the model setup, which is influenced by the size of the training dataset and the choice of architecture. The second factor is the model objectives such as consideration about the assayed property, the difference between wild-type models and mutant predictors, and requirements for explainability. This review is aimed at serving as a source of information and guidance for properly representing enzymes in future machine learning models for biocatalysis.
Collapse
Affiliation(s)
- David Harding-Larsen
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark
| | - Jonathan Funk
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark
| | - Niklas Gesmar Madsen
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark
| | - Hani Gharabli
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark
| | - Carlos G Acevedo-Rocha
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark
| | - Stanislav Mazurenko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech Republic; International Clinical Research Center, St. Anne's University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Ditte Hededam Welner
- The Novo Nordisk Center for Biosustainability, Technical University of Denmark, Søltofts Plads, Bygning 220, 2800 Kgs. Lyngby, Denmark.
| |
Collapse
|
6
|
Riccabona JR, Spoendlin FC, Fischer ALM, Loeffler JR, Quoika PK, Jenkins TP, Ferguson JA, Smorodina E, Laustsen AH, Greiff V, Forli S, Ward AB, Deane CM, Fernández-Quintero ML. Assessing AF2's ability to predict structural ensembles of proteins. Structure 2024; 32:2147-2159.e2. [PMID: 39332396 DOI: 10.1016/j.str.2024.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 08/07/2024] [Accepted: 09/02/2024] [Indexed: 09/29/2024]
Abstract
Recent breakthroughs in protein structure prediction have enhanced the precision and speed at which protein configurations can be determined. Additionally, molecular dynamics (MD) simulations serve as a crucial tool for capturing the conformational space of proteins, providing valuable insights into their structural fluctuations. However, the scope of MD simulations is often limited by the accessible timescales and the computational resources available, posing challenges to comprehensively exploring protein behaviors. Recently emerging approaches have focused on expanding the capability of AlphaFold2 (AF2) to predict conformational substates of protein. Here, we benchmark the performance of various workflows that have adapted AF2 for ensemble prediction and compare the obtained structures with ensembles obtained from MD simulations and NMR. We provide an overview of the levels of performance and accessible timescales that can currently be achieved with machine learning (ML) based ensemble generation. Significant minima of the free energy surfaces remain undetected.
Collapse
Affiliation(s)
- Jakob R Riccabona
- Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Fabian C Spoendlin
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Anna-Lena M Fischer
- Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Johannes R Loeffler
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Patrick K Quoika
- Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Str. 8, 85748 Garching, Germany
| | - Timothy P Jenkins
- Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
| | - James A Ferguson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Eva Smorodina
- Department of Immunology, University of Oslo, Oslo, Norway
| | - Andreas H Laustsen
- Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
| | - Victor Greiff
- Department of Immunology, University of Oslo, Oslo, Norway
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Andrew B Ward
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.
| | - Charlotte M Deane
- Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
| | - Monica L Fernández-Quintero
- Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria; Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA; Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark.
| |
Collapse
|
7
|
Votapka LW, Ojha AA, Asada N, Amaro RE. Prediction of Threonine-Tyrosine Kinase Receptor-Ligand Unbinding Kinetics with Multiscale Milestoning and Metadynamics. J Phys Chem Lett 2024; 15:10473-10478. [PMID: 39392497 PMCID: PMC11514002 DOI: 10.1021/acs.jpclett.4c02332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 09/27/2024] [Accepted: 10/01/2024] [Indexed: 10/12/2024]
Abstract
Accurately describing protein-ligand binding and unbinding kinetics remains challenging. Computational calculations are difficult and costly, while experimental measurements often lack molecular detail and can be unobtainable. Here, we extend our multiscale milestoning method, Simulation-Enabled Estimation of Kinetics Rates (SEEKR), with metadynamics molecular dynamics simulations to yield accurate small molecule drug residence times. Using the pharmaceutically relevant threonine-tyrosine kinase (TTK) and eight long-residence-time (tens of seconds to hours) inhibitors, we demonstrate accurate prediction of absolute and rank-ordered ligand residence times and free energies of binding.
Collapse
Affiliation(s)
- Lane W. Votapka
- Department
of Chemistry and Biochemistry, University
of California San Diego, La Jolla, California 92093, United States
| | - Anupam Anand Ojha
- Department
of Chemistry and Biochemistry, University
of California San Diego, La Jolla, California 92093, United States
- Center for
Computational Biology and Center for Computational Mathematics, Flatiron
Institute, New York 10010, United States
| | - Naoya Asada
- Laboratory
for Medicinal Chemistry Research, Shionogi
& CO. Ltd, Osaka 541-0045, Japan
| | - Rommie E. Amaro
- Department
of Molecular Biology, University of California
San Diego, La Jolla, California 92093, United States
| |
Collapse
|
8
|
Son A, Kim W, Park J, Lee W, Lee Y, Choi S, Kim H. Utilizing Molecular Dynamics Simulations, Machine Learning, Cryo-EM, and NMR Spectroscopy to Predict and Validate Protein Dynamics. Int J Mol Sci 2024; 25:9725. [PMID: 39273672 PMCID: PMC11395565 DOI: 10.3390/ijms25179725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2024] [Revised: 09/06/2024] [Accepted: 09/07/2024] [Indexed: 09/15/2024] Open
Abstract
Protein dynamics play a crucial role in biological function, encompassing motions ranging from atomic vibrations to large-scale conformational changes. Recent advancements in experimental techniques, computational methods, and artificial intelligence have revolutionized our understanding of protein dynamics. Nuclear magnetic resonance spectroscopy provides atomic-resolution insights, while molecular dynamics simulations offer detailed trajectories of protein motions. Computational methods applied to X-ray crystallography and cryo-electron microscopy (cryo-EM) have enabled the exploration of protein dynamics, capturing conformational ensembles that were previously unattainable. The integration of machine learning, exemplified by AlphaFold2, has accelerated structure prediction and dynamics analysis. These approaches have revealed the importance of protein dynamics in allosteric regulation, enzyme catalysis, and intrinsically disordered proteins. The shift towards ensemble representations of protein structures and the application of single-molecule techniques have further enhanced our ability to capture the dynamic nature of proteins. Understanding protein dynamics is essential for elucidating biological mechanisms, designing drugs, and developing novel biocatalysts, marking a significant paradigm shift in structural biology and drug discovery.
Collapse
Affiliation(s)
- Ahrum Son
- Department of Molecular Medicine, Scripps Research, San Diego, CA 92037, USA
| | - Woojin Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| | - Jongham Park
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| | - Wonseok Lee
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| | - Yerim Lee
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| | - Seongyun Choi
- Department of Convergent Bioscience and Informatics, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| | - Hyunsoo Kim
- Department of Bio-AI Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- Department of Convergent Bioscience and Informatics, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- Protein AI Design Institute, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
- SCICS, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Republic of Korea
| |
Collapse
|
9
|
Oh M, Rosa M, Xie H, Khelashvili G. Automated collective variable discovery for MFSD2A transporter from molecular dynamics simulations. Biophys J 2024; 123:2934-2955. [PMID: 38932456 PMCID: PMC11393714 DOI: 10.1016/j.bpj.2024.06.024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2024] [Revised: 06/03/2024] [Accepted: 06/24/2024] [Indexed: 06/28/2024] Open
Abstract
Biomolecules often exhibit complex free energy landscapes in which long-lived metastable states are separated by large energy barriers. Overcoming these barriers to robustly sample transitions between the metastable states with classical molecular dynamics (MD) simulations presents a challenge. To circumvent this issue, collective variable (CV)-based enhanced sampling MD approaches are often employed. Traditional CV selection relies on intuition and prior knowledge of the system. This approach introduces bias, which can lead to incomplete mechanistic insights. Thus, automated CV detection is desired to gain a deeper understanding of the system/process. Analysis of MD data with various machine-learning algorithms, such as principal component analysis (PCA), support vector machine, and linear discriminant analysis (LDA) based approaches have been implemented for automated CV detection. However, their performance has not been systematically evaluated on structurally and mechanistically complex biological systems. Here, we applied these methods to MD simulations of the MFSD2A (Major Facilitator Superfamily Domain 2A) lysolipid transporter in multiple functionally relevant metastable states with the goal of identifying optimal CVs that would structurally discriminate these states. Specific emphasis was on the automated detection and interpretive power of LDA-based CVs. We found that LDA methods, which included a novel gradient descent-based multiclass harmonic variant, termed GDHLDA, we developed here, outperform PCA in class separation, exhibiting remarkable consistency in extracting CVs critical for distinguishing metastable states. Furthermore, the identified CVs included features previously associated with conformational transitions in MFSD2A. Specifically, conformational shifts in transmembrane helix 7 and in residue Y294 on this helix emerged as critical features discriminating the metastable states in MFSD2A. This highlights the effectiveness of LDA-based approaches in automatically extracting from MD trajectories CVs of functional relevance that can be used to drive biased MD simulations to efficiently sample conformational transitions in the molecular system.
Collapse
Affiliation(s)
- Myongin Oh
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York
| | - Margarida Rosa
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York
| | - Hengyi Xie
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York
| | - George Khelashvili
- Department of Physiology and Biophysics, Weill Cornell Medicine, New York, New York; Institute for Computational Biomedicine, Weill Cornell Medicine, New York, New York.
| |
Collapse
|
10
|
Vats S, Bobrovs R, Söderhjelm P, Bhakat S. AlphaFold-SFA: Accelerated sampling of cryptic pocket opening, protein-ligand binding and allostery by AlphaFold, slow feature analysis and metadynamics. PLoS One 2024; 19:e0307226. [PMID: 39190764 DOI: 10.1371/journal.pone.0307226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 07/02/2024] [Indexed: 08/29/2024] Open
Abstract
Sampling rare events in proteins is crucial for comprehending complex phenomena like cryptic pocket opening, where transient structural changes expose new binding sites. Understanding these rare events also sheds light on protein-ligand binding and allosteric communications, where distant site interactions influence protein function. Traditional unbiased molecular dynamics simulations often fail to sample such rare events, as the free energy barrier between metastable states is large relative to the thermal energy. This renders these events inaccessible on the timescales typically simulated by unbiased molecular dynamics, limiting our understanding of these critical processes. In this paper, we proposed a novel unsupervised learning approach termed as slow feature analysis (SFA) which aims to extract slowly varying features from high-dimensional temporal data. SFA trained on small unbiased molecular dynamics simulations launched from AlphaFold generated conformational ensembles manages to capture rare events governing cryptic pocket opening, protein-ligand binding, and allosteric communications in a kinase. Metadynamics simulations using SFA as collective variables manage to sample 'deep' cryptic pocket opening within a few hundreds of nanoseconds which was beyond the reach of microsecond long unbiased molecular dynamics simulations. SFA augmented metadynamics also managed to capture conformational plasticity of protein upon ligand binding/unbinding and provided novel insights into allosteric communication in receptor-interacting protein kinase 2 (RIPK2) which dictates protein-protein interaction. Taken together, our results show how SFA acts as a dimensionality reduction tool which bridges the gap between AlphaFold, molecular dynamics simulation and metadynamics in context of capturing rare events in biomolecules, extending the scope of structure-based drug discovery in the era of AlphaFold.
Collapse
Affiliation(s)
- Shray Vats
- Department of Computer Science, University of Texas at Austin, Austin, TX, United States of America
| | | | - Pär Söderhjelm
- Division of Biophysical Chemistry, Chemical Center, Lund University, Lund, Sweden
| | | |
Collapse
|
11
|
Miles CM, Cullen S, Kenaan H, Gu W, Andrews GP, Sosso GC, Tian Y. Unravelling the interactions between small molecules and liposomal bilayers via molecular dynamics and thermodynamic modelling. Int J Pharm 2024; 660:124367. [PMID: 38901537 DOI: 10.1016/j.ijpharm.2024.124367] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2024] [Revised: 06/06/2024] [Accepted: 06/17/2024] [Indexed: 06/22/2024]
Abstract
Lipid-based drug delivery systems hold immense promise in addressing critical medical needs, from cancer and neurodegenerative diseases to infectious diseases. By encapsulating active pharmaceutical ingredients - ranging from small molecule drugs to proteins and nucleic acids - these nanocarriers enhance treatment efficacy and safety. However, their commercial success faces hurdles, such as the lack of a systematic design approach and the issues related to scalability and reproducibility. This work aims to provide insights into the drug-phospholipid interaction by combining molecular dynamic simulations and thermodynamic modelling techniques. In particular, we have made a connection between the structural properties of the drug-phospholipid system and the physicochemical performance of the drug-loaded liposomal nanoformulations. We have considered two prototypical drugs, felodipine (FEL) and naproxen (NPX), and one model hydrogenated soy phosphatidylcholine (HSPC) bilayer membrane. Molecular dynamic simulations revealed which regions within the phospholipid bilayers are most and least favoured by the drug molecules. NPX tends to reside at the water-phospholipid interface and is characterized by a lower free energy barrier for bilayer membrane permeation. Meanwhile, FEL prefers to sit within the hydrophobic tails of the phospholipids and is characterized by a higher free energy barrier for membrane permeation. Flory-Huggins thermodynamic modelling, small angle X-ray scattering, dynamic light scattering, TEM, and drug release studies of these liposomal nanoformulations confirmed this drug-phospholipid structural difference. The naproxen-phospholipid system has a lower free energy barrier for permeation, higher drug miscibility with the bilayer, larger liposomal nanoparticle size, and faster drug release in the aqueous medium than felodipine. We suggest that this combination of molecular dynamics and thermodynamics approach may offer a new tool for designing and developing lipid-based nanocarriers for unmet medical applications.
Collapse
Affiliation(s)
- Christopher M Miles
- Department of Chemistry, University of Warwick, Coventry, CV4 7AL, United Kingdom
| | - Shane Cullen
- School of Pharmacy, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom
| | - Hussein Kenaan
- School of Pharmacy, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom
| | - Wenjie Gu
- School of Pharmacy, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom
| | - Gavin P Andrews
- School of Pharmacy, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom
| | - Gabriele C Sosso
- Department of Chemistry, University of Warwick, Coventry, CV4 7AL, United Kingdom.
| | - Yiwei Tian
- School of Pharmacy, Queen's University Belfast, 97 Lisburn Road, Belfast BT9 7BL, United Kingdom.
| |
Collapse
|
12
|
Faran M, Ray D, Nag S, Raucci U, Parrinello M, Bisker G. A Stochastic Landscape Approach for Protein Folding State Classification. J Chem Theory Comput 2024; 20:5428-5438. [PMID: 38924770 PMCID: PMC11238538 DOI: 10.1021/acs.jctc.4c00464] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2024] [Revised: 06/12/2024] [Accepted: 06/12/2024] [Indexed: 06/28/2024]
Abstract
Protein folding is a critical process that determines the functional state of proteins. Proper folding is essential for proteins to acquire their functional three-dimensional structures and execute their biological role, whereas misfolded proteins can lead to various diseases, including neurodegenerative disorders like Alzheimer's and Parkinson's. Therefore, a deeper understanding of protein folding is vital for understanding disease mechanisms and developing therapeutic strategies. This study introduces the Stochastic Landscape Classification (SLC), an innovative, automated, nonlearning algorithm that quantitatively analyzes protein folding dynamics. Focusing on collective variables (CVs) - low-dimensional representations of complex dynamical systems like molecular dynamics (MD) of macromolecules - the SLC approach segments the CVs into distinct macrostates, revealing the protein folding pathway explored by MD simulations. The segmentation is achieved by analyzing changes in CV trends and clustering these segments using a standard density-based spatial clustering of applications with noise (DBSCAN) scheme. Applied to the MD-based CV trajectories of Chignolin and Trp-Cage proteins, the SLC demonstrates apposite accuracy, validated by comparing standard classification metrics against ground-truth data. These metrics affirm the efficacy of the SLC in capturing intricate protein dynamics and offer a method to evaluate and select the most informative CVs. The practical application of this technique lies in its ability to provide a detailed, quantitative description of protein folding processes, with significant implications for understanding and manipulating protein behavior in industrial and pharmaceutical contexts.
Collapse
Affiliation(s)
- Michael Faran
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Dhiman Ray
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Shubhadeep Nag
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
| | - Umberto Raucci
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Michele Parrinello
- Atomistic
Simulations, Italian Institute of Technology, Via Enrico Melen 83, 16152 Genova, Italy
| | - Gili Bisker
- Department
of Biomedical Engineering, Faculty of Engineering, Tel Aviv University, Tel Aviv 69978, Israel
- The
Center for Physics and Chemistry of Living Systems, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Nanoscience and Nanotechnology, Tel Aviv University, Tel Aviv 6997801, Israel
- The
Center for Light-Matter Interaction, Tel
Aviv University, Tel Aviv 6997801, Israel
| |
Collapse
|
13
|
Ishraaq R, Das S. All-atom molecular dynamics simulations of polymer and polyelectrolyte brushes. Chem Commun (Camb) 2024; 60:6093-6129. [PMID: 38819435 DOI: 10.1039/d4cc01557f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/01/2024]
Abstract
Densely grafted polymer and polyelectrolyte (PE) brushes, owing to their significant abilities to functionalize surfaces for a plethora of applications in sensing, diagnostics, current rectification, surface wettability modification, drug delivery, and oil recovery, have attracted significant attention over the past several decades. Unfortunately, most of the attention has primarily focused on understanding the properties of the grafted polymer and the PE chains with little attention devoted to studying the behavior of the brush-supported ions (counterions needed to screen the PE chains) and water molecules. Over the past few years, our group has been at the forefront of addressing this gap: we have employed all-atom molecular dynamics (MD) simulations for studying a wide variety of polymer and PE brush systems with specific attention to unraveling the properties and behavior of the brush-supported water molecules and ions. Our findings have revealed some of the most fascinating properties of such brush-supported ions and water molecules, including the most remarkable control of nanofluidic transport afforded by the specific ion and water responses induced by the PE brushes grafted on the inner walls of the nanochannel. This feature article aims to summarize some of our key contributions associated with such atomistic simulations of polymer and PE brushes and brush-supported water molecules and counterions.
Collapse
Affiliation(s)
- Raashiq Ishraaq
- Department of Mechanical Engineering, University of Maryland, College Park, MD 20742, USA.
| | - Siddhartha Das
- Department of Mechanical Engineering, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
14
|
Stracke K, Evans JD. The use of collective variables and enhanced sampling in the simulations of existing and emerging microporous materials. NANOSCALE 2024; 16:9186-9196. [PMID: 38647659 DOI: 10.1039/d4nr01024h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/25/2024]
Abstract
Microporous materials, including zeolites, metal-organic frameworks, and cage compounds, offer diverse functionalities due to their unique dynamics and guest confinement properties. These materials play a significant role in separation, catalysis, and sensing, but their complexity hinders exploration using traditional atomistic simulations. This review explores collective variables (CVs) paired with enhanced sampling as a powerful approach to enable efficient investigation of key features in microporous materials. We highlight successful applications of CVs in studying adsorption, diffusion, phase transitions, and mechanical properties, demonstrating their crucial role in guiding material design and optimisation. The future of CVs lies in integration with techniques like machine learning, allowing for enhanced efficiency and accuracy. By tailoring CVs to specific materials and developing multi-scale approaches we can further unlock the intricacies of these fascinating materials. Simulations are a cornerstone in unravelling the complexities of microporous materials and are crucial for our future understanding.
Collapse
Affiliation(s)
- Konstantin Stracke
- School of Physics, Chemistry and Earth Science, The University of Adelaide, 5005 Australia.
| | - Jack D Evans
- School of Physics, Chemistry and Earth Science, The University of Adelaide, 5005 Australia.
| |
Collapse
|
15
|
Fu H, Bian H, Shao X, Cai W. Collective Variable-Based Enhanced Sampling: From Human Learning to Machine Learning. J Phys Chem Lett 2024; 15:1774-1783. [PMID: 38329095 DOI: 10.1021/acs.jpclett.3c03542] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Enhanced-sampling algorithms relying on collective variables (CVs) are extensively employed to study complex (bio)chemical processes that are not amenable to brute-force molecular simulations. The selection of appropriate CVs characterizing the slow movement modes is of paramount importance for reliable and efficient enhanced-sampling simulations. In this Perspective, we first review the application and limitations of CVs obtained from chemical and geometrical intuition. We also introduce path-sampling algorithms, which can identify path-like CVs in a high-dimensional free-energy space. Machine-learning algorithms offer a viable approach to finding suitable CVs by analyzing trajectories from preliminary simulations. We discuss both the performance of machine-learning-derived CVs in enhanced-sampling simulations of experimental models and the challenges involved in applying these CVs to realistic, complex molecular assemblies. Moreover, we provide a prospective view of the potential advancements of machine-learning algorithms for the development of CVs in the field of enhanced-sampling simulations.
Collapse
Affiliation(s)
- Haohao Fu
- Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Hengwei Bian
- Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Xueguang Shao
- Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| | - Wensheng Cai
- Research Center for Analytical Sciences, Frontiers Science Center for New Organic Matter, College of Chemistry, Nankai University, Tianjin Key Laboratory of Biosensing and Molecular Recognition, State Key Laboratory of Medicinal Chemical Biology, Tianjin 300071, China
- Haihe Laboratory of Sustainable Chemical Transformations, Tianjin 300192, China
| |
Collapse
|
16
|
Brown BP, Stein RA, Meiler J, Mchaourab HS. Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations. J Chem Theory Comput 2024; 20:1434-1447. [PMID: 38215214 PMCID: PMC10867840 DOI: 10.1021/acs.jctc.3c01081] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024]
Abstract
Protein thermodynamics is intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, computational and experimental methods capable of quantifying the energetic determinants of these equilibria are limited to systems of modest size. Recently, it has been demonstrated that the artificial intelligence system AlphaFold2 can be manipulated to produce structurally valid protein conformational ensembles. Here, we extend these studies and explore the extent to which AlphaFold2 contact distance distributions can approximate projections of the conformational Boltzmann distributions. For this purpose, we examine the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Our studies suggest that AlphaFold2 normalized contact distance distributions can correlate with conformation probabilities obtained with other methods but that they suffer from peak broadening. We also find that the AlphaFold2 contact distance distributions can be sensitive to point mutations. Overall, we anticipate that our findings will be valuable as the community seeks to model the thermodynamics of conformational changes in large biomolecular systems.
Collapse
Affiliation(s)
- Benjamin P. Brown
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
| | - Richard A. Stein
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Department
of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| | - Jens Meiler
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Institute
for Drug Discovery, Leipzig University Medical
School, Leipzig, SAC 04103, Germany
| | - Hassane S. Mchaourab
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37232, United States
- Center
for Applied AI in Protein Dynamics, Vanderbilt
University, Nashville, Tennessee 37232, United States
- Department
of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, Tennessee 37232, United States
| |
Collapse
|
17
|
Romero ME, McElhenney SJ, Yu J. Trapping a non-cognate nucleotide upon initial binding for replication fidelity control in SARS-CoV-2 RNA dependent RNA polymerase. Phys Chem Chem Phys 2024; 26:1792-1808. [PMID: 38168789 DOI: 10.1039/d3cp04410f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
The RNA dependent RNA polymerase (RdRp) in SARS-CoV-2 is a highly conserved enzyme responsible for viral genome replication/transcription. To understand how the viral RdRp achieves fidelity control during such processes, here we computationally investigate the natural non-cognate vs. cognate nucleotide addition and selectivity during viral RdRp elongation. We focus on the nucleotide substrate initial binding (RdRp active site open) to the prechemical insertion (active site closed) of the RdRp. The current studies were first carried out using microsecond ensemble equilibrium all-atom molecular dynamics (MD) simulations. Due to the slow conformational changes (from open to closed) during nucleotide insertion and selection, enhanced or umbrella sampling methods have been further employed to calculate the free energy profiles of the nucleotide insertion. Our studies find notable stability of noncognate dATP and GTP upon initial binding in the active-site open state. The results indicate that while natural cognate ATP and Remdesivir drug analogue (RDV-TP) are biased toward stabilization in the closed state to facilitate insertion, the natural non-cognate dATP and GTP can be well trapped in off-path initial binding configurations and prevented from insertion so that to be further rejected. The current work thus presents the intrinsic nucleotide selectivity of SARS-CoV-2 RdRp for natural substrate fidelity control, which should be considered in antiviral drug design.
Collapse
Affiliation(s)
- Moises E Romero
- Department of Chemistry, University of California, Irvine, CA 92697, USA
| | | | - Jin Yu
- Department of Physics and Astronomy, Department of Chemistry, NSF-Simmons Center for Multi-scale Cell Fate Research, University of California, Irvine, CA 92697, USA.
| |
Collapse
|
18
|
Ahmed M, Maldonado AM, Durrant JD. From Byte to Bench to Bedside: Molecular Dynamics Simulations and Drug Discovery. ARXIV 2023:arXiv:2311.16946v1. [PMID: 38076508 PMCID: PMC10705576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Molecular dynamics (MD) simulations and computer-aided drug design (CADD) have advanced substantially over the past two decades, thanks to continuous computer hardware and software improvements. Given these advancements, MD simulations are poised to become even more powerful tools for investigating the dynamic interactions between potential small-molecule drugs and their target proteins, with significant implications for pharmacological research.
Collapse
Affiliation(s)
- Mayar Ahmed
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Alex M. Maldonado
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| | - Jacob D. Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, Pennsylvania, United States of America
| |
Collapse
|
19
|
Bose S, Lotz SD, Deb I, Shuck M, Lee KSS, Dickson A. How Robust Is the Ligand Binding Transition State? J Am Chem Soc 2023; 145:25318-25331. [PMID: 37943667 PMCID: PMC11059145 DOI: 10.1021/jacs.3c08940] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2023]
Abstract
For many drug targets, it has been shown that the kinetics of drug binding (e.g., on rate and off rate) is more predictive of drug efficacy than thermodynamic quantities alone. This motivates the development of predictive computational models that can be used to optimize compounds on the basis of their kinetics. The structural details underpinning these computational models are found not only in the bound state but also in the short-lived ligand binding transition states. Although transition states cannot be directly observed experimentally due to their extremely short lifetimes, recent successes have demonstrated that modeling the ligand binding transition state is possible with the help of enhanced sampling molecular dynamics methods. Previously, we generated unbinding paths for an inhibitor of soluble epoxide hydrolase (sEH) with a residence time of 11 min. Here, we computationally modeled unbinding events with the weighted ensemble method REVO (resampling of ensembles by variation optimization) for five additional inhibitors of sEH with residence times ranging from 14.25 to 31.75 min, with average prediction accuracy within an order of magnitude. The unbinding ensembles are analyzed in detail, focusing on features of the ligand binding transition state ensembles (TSEs). We find that ligands with similar bound poses can show significant differences in their ligand binding TSEs, in terms of their spatial distribution and protein-ligand interactions. However, we also find similarities across the TSEs when examining more general features such as ligand degrees of freedom. Together these findings show significant challenges for rational, kinetics-based drug design.
Collapse
Affiliation(s)
- Samik Bose
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Samuel D Lotz
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Indrajit Deb
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Megan Shuck
- Department of Pharmacology and Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Kin Sing Stephen Lee
- Department of Pharmacology and Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, United States
- Institute of Integrative Toxicology, Michigan State University, East Lansing, Michigan 48824, United States
| | - Alex Dickson
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
- Department of Computational Mathematics, Science and Engineering, Michigan State University, East Lansing, Michigan 48824, United States
| |
Collapse
|
20
|
Siddiqui GA, Stebani JA, Wragg D, Koutsourelakis PS, Casini A, Gagliardi A. Application of Machine Learning Algorithms to Metadynamics for the Elucidation of the Binding Modes and Free Energy Landscape of Drug/Target Interactions: a Case Study. Chemistry 2023; 29:e202302375. [PMID: 37555841 DOI: 10.1002/chem.202302375] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2023] [Accepted: 08/09/2023] [Indexed: 08/10/2023]
Abstract
In the context of drug discovery, computational methods were able to accelerate the challenging process of designing and optimizing a new drug candidate. Amongst the possible atomistic simulation approaches, metadynamics (metaD) has proven very powerful. However, the choice of collective variables (CVs) is not trivial for complex systems. To automate the process of CVs identification, two different machine learning algorithms were applied in this study, namely DeepLDA and Autoencoder, to the metaD simulation of a well-researched drug/target complex, consisting in a pharmacologically relevant non-canonical DNA secondary structure (G-quadruplex) and a metallodrug acting as its stabilizer, as well as solvent molecules.
Collapse
Affiliation(s)
- Gohar Ali Siddiqui
- Professorship of Simulation of Nanosystems for Energy Conversion Department of Electrical and Computer Engineering School of Computation, Information and Technology, Technical University of Munich (TUM), Hans-Piloty-Str. 1, 85748, Garching b. München, Germany
| | - Julia A Stebani
- Chair of Medicinal and Bioinorganic Chemistry Department of Chemistry, School of Natural Sciences, Technical University of Munich (TUM), Lichtenbergstr. 4, 85748, Garching b. München, Germany
| | - Darren Wragg
- Chair of Medicinal and Bioinorganic Chemistry Department of Chemistry, School of Natural Sciences, Technical University of Munich (TUM), Lichtenbergstr. 4, 85748, Garching b. München, Germany
| | - Phaedon-Stelios Koutsourelakis
- Professorship for Data-driven Materials Modeling School of Engineering and Design, Technical University of Munich (TUM), Boltzmannstr. 15, 85748, Garching b. München, Germany
| | - Angela Casini
- Chair of Medicinal and Bioinorganic Chemistry Department of Chemistry, School of Natural Sciences, Technical University of Munich (TUM), Lichtenbergstr. 4, 85748, Garching b. München, Germany
| | - Alessio Gagliardi
- Professorship of Simulation of Nanosystems for Energy Conversion Department of Electrical and Computer Engineering School of Computation, Information and Technology, Technical University of Munich (TUM), Hans-Piloty-Str. 1, 85748, Garching b. München, Germany
| |
Collapse
|
21
|
Conflitti P, Raniolo S, Limongelli V. Perspectives on Ligand/Protein Binding Kinetics Simulations: Force Fields, Machine Learning, Sampling, and User-Friendliness. J Chem Theory Comput 2023; 19:6047-6061. [PMID: 37656199 PMCID: PMC10536999 DOI: 10.1021/acs.jctc.3c00641] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2023] [Indexed: 09/02/2023]
Abstract
Computational techniques applied to drug discovery have gained considerable popularity for their ability to filter potentially active drugs from inactive ones, reducing the time scale and costs of preclinical investigations. The main focus of these studies has historically been the search for compounds endowed with high affinity for a specific molecular target to ensure the formation of stable and long-lasting complexes. Recent evidence has also correlated the in vivo drug efficacy with its binding kinetics, thus opening new fascinating scenarios for ligand/protein binding kinetic simulations in drug discovery. The present article examines the state of the art in the field, providing a brief summary of the most popular and advanced ligand/protein binding kinetics techniques and evaluating their current limitations and the potential solutions to reach more accurate kinetic models. Particular emphasis is put on the need for a paradigm change in the present methodologies toward ligand and protein parametrization, the force field problem, characterization of the transition states, the sampling issue, and algorithms' performance, user-friendliness, and data openness.
Collapse
Affiliation(s)
- Paolo Conflitti
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Stefano Raniolo
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
| | - Vittorio Limongelli
- Faculty
of Biomedical Sciences, Euler Institute, Universitá della Svizzera italiana (USI), 6900 Lugano, Switzerland
- Department
of Pharmacy, University of Naples “Federico
II”, 80131 Naples, Italy
| |
Collapse
|
22
|
Van Speybroeck V, Bocus M, Cnudde P, Vanduyfhuys L. Operando Modeling of Zeolite-Catalyzed Reactions Using First-Principles Molecular Dynamics Simulations. ACS Catal 2023; 13:11455-11493. [PMID: 37671178 PMCID: PMC10476167 DOI: 10.1021/acscatal.3c01945] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Revised: 07/27/2023] [Indexed: 09/07/2023]
Abstract
Within this Perspective, we critically reflect on the role of first-principles molecular dynamics (MD) simulations in unraveling the catalytic function within zeolites under operating conditions. First-principles MD simulations refer to methods where the dynamics of the nuclei is followed in time by integrating the Newtonian equations of motion on a potential energy surface that is determined by solving the quantum-mechanical many-body problem for the electrons. Catalytic solids used in industrial applications show an intriguing high degree of complexity, with phenomena taking place at a broad range of length and time scales. Additionally, the state and function of a catalyst critically depend on the operating conditions, such as temperature, moisture, presence of water, etc. Herein we show by means of a series of exemplary cases how first-principles MD simulations are instrumental to unravel the catalyst complexity at the molecular scale. Examples show how the nature of reactive species at higher catalytic temperatures may drastically change compared to species at lower temperatures and how the nature of active sites may dynamically change upon exposure to water. To simulate rare events, first-principles MD simulations need to be used in combination with enhanced sampling techniques to efficiently sample low-probability regions of phase space. Using these techniques, it is shown how competitive pathways at operating conditions can be discovered and how broad transition state regions can be explored. Interestingly, such simulations can also be used to study hindered diffusion under operating conditions. The cases shown clearly illustrate how first-principles MD simulations reveal insights into the catalytic function at operating conditions, which could not be discovered using static or local approaches where only a few points are considered on the potential energy surface (PES). Despite these advantages, some major hurdles still exist to fully integrate first-principles MD methods in a standard computational catalytic workflow or to use the output of MD simulations as input for multiple length/time scale methods that aim to bridge to the reactor scale. First of all, methods are needed that allow us to evaluate the interatomic forces with quantum-mechanical accuracy, albeit at a much lower computational cost compared to currently used density functional theory (DFT) methods. The use of DFT limits the currently attainable length/time scales to hundreds of picoseconds and a few nanometers, which are much smaller than realistic catalyst particle dimensions and time scales encountered in the catalysis process. One solution could be to construct machine learning potentials (MLPs), where a numerical potential is derived from underlying quantum-mechanical data, which could be used in subsequent MD simulations. As such, much longer length and time scales could be reached; however, quite some research is still necessary to construct MLPs for the complex systems encountered in industrially used catalysts. Second, most currently used enhanced sampling techniques in catalysis make use of collective variables (CVs), which are mostly determined based on chemical intuition. To explore complex reactive networks with MD simulations, methods are needed that allow the automatic discovery of CVs or methods that do not rely on a priori definition of CVs. Recently, various data-driven methods have been proposed, which could be explored for complex catalytic systems. Lastly, first-principles MD methods are currently mostly used to investigate local reactive events. We hope that with the rise of data-driven methods and more efficient methods to describe the PES, first-principles MD methods will in the future also be able to describe longer length/time scale processes in catalysis. This might lead to a consistent dynamic description of all steps-diffusion, adsorption, and reaction-as they take place at the catalyst particle level.
Collapse
Affiliation(s)
| | - Massimo Bocus
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052 Zwijnaarde, Belgium
| | - Pieter Cnudde
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052 Zwijnaarde, Belgium
| | - Louis Vanduyfhuys
- Center for Molecular Modeling, Ghent University, Technologiepark 46, 9052 Zwijnaarde, Belgium
| |
Collapse
|
23
|
Brown BP, Stein RA, Meiler J, Mchaourab H. Approximating conformational Boltzmann distributions with AlphaFold2 predictions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.06.552168. [PMID: 37609301 PMCID: PMC10441281 DOI: 10.1101/2023.08.06.552168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/24/2023]
Abstract
Protein dynamics are intimately tied to biological function and can enable processes such as signal transduction, enzyme catalysis, and molecular recognition. The relative free energies of conformations that contribute to these functional equilibria are evolved for the physiology of the organism. Despite the importance of these equilibria for understanding biological function and developing treatments for disease, the computational and experimental methods capable of quantifying them are limited to systems of modest size. Here, we demonstrate that AlphaFold2 contact distance distributions can approximate conformational Boltzmann distributions, which we evaluate through examination of the joint probability distributions of inter-residue contact distances along functionally relevant collective variables of several protein systems. Further, we show that contact distance probability distributions generated by AlphaFold2 are sensitive to points mutations thus AF2 can predict the structural effects of mutations in some systems. We anticipate that our approach will be a valuable tool to model the thermodynamics of conformational changes in large biomolecular systems.
Collapse
Affiliation(s)
- Benjamin P. Brown
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Richard A. Stein
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Institute for Drug Discovery, Leipzig University Medical School, Leipzig, SAC 04103, Germany
| | - Hassane Mchaourab
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Center for Applied AI in Protein Dynamics, Vanderbilt University, Nashville, TN, USA. Nashville, TN 37232, USA
- Department of Molecular Physiology and Biophysics, Vanderbilt University School of Medicine, Nashville, TN, USA. Nashville, TN 37232, USA
| |
Collapse
|
24
|
Pliushcheuskaya P, Künze G. Recent Advances in Computer-Aided Structure-Based Drug Design on Ion Channels. Int J Mol Sci 2023; 24:ijms24119226. [PMID: 37298178 DOI: 10.3390/ijms24119226] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Revised: 05/16/2023] [Accepted: 05/22/2023] [Indexed: 06/12/2023] Open
Abstract
Ion channels play important roles in fundamental biological processes, such as electric signaling in cells, muscle contraction, hormone secretion, and regulation of the immune response. Targeting ion channels with drugs represents a treatment option for neurological and cardiovascular diseases, muscular degradation disorders, and pathologies related to disturbed pain sensation. While there are more than 300 different ion channels in the human organism, drugs have been developed only for some of them and currently available drugs lack selectivity. Computational approaches are an indispensable tool for drug discovery and can speed up, especially, the early development stages of lead identification and optimization. The number of molecular structures of ion channels has considerably increased over the last ten years, providing new opportunities for structure-based drug development. This review summarizes important knowledge about ion channel classification, structure, mechanisms, and pathology with the main focus on recent developments in the field of computer-aided, structure-based drug design on ion channels. We highlight studies that link structural data with modeling and chemoinformatic approaches for the identification and characterization of new molecules targeting ion channels. These approaches hold great potential to advance research on ion channel drugs in the future.
Collapse
Affiliation(s)
- Palina Pliushcheuskaya
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
| | - Georg Künze
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstr. 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
25
|
Franke L, Peter C. Visualizing the Residue Interaction Landscape of Proteins by Temporal Network Embedding. J Chem Theory Comput 2023; 19:2985-2995. [PMID: 37122117 DOI: 10.1021/acs.jctc.2c01228] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/02/2023]
Abstract
Characterizing the structural dynamics of proteins with heterogeneous conformational landscapes is crucial to understanding complex biomolecular processes. To this end, dimensionality reduction algorithms are used to produce low-dimensional embeddings of the high-dimensional conformational phase space. However, identifying a compact and informative set of input features for the embedding remains an ongoing challenge. Here, we propose to harness the power of Residue Interaction Networks (RINs) and their centrality measures, established tools to provide a graph theoretical view on molecular structure. Specifically, we combine the closeness centrality, which captures global features of the protein conformation at residue-wise resolution, with EncoderMap, a hybrid neural-network autoencoder/multidimensional-scaling like dimensionality reduction algorithm. We find that the resulting low-dimensional embedding is a meaningful visualization of the residue interaction landscape that resolves structural details of the protein behavior while retaining global interpretability. This feature-based graph embedding of temporal protein graphs makes it possible to apply the general descriptive power of RIN formalisms to the analysis of protein simulations of complex processes such as protein folding and multidomain interactions requiring no protein-specific input. We demonstrate this on simulations of the fast folding protein Trp-Cage and the multidomain signaling protein FAT10. Due to its generality and modularity, the presented approach can easily be transferred to other protein systems.
Collapse
Affiliation(s)
- Leon Franke
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, Konstanz 78457, Germany
- Konstanz Research School Chemical Biology, University of Konstanz, Universitätsstraße 10, Konstanz 78457, Germany
| | - Christine Peter
- Department of Chemistry, University of Konstanz, Universitätsstraße 10, Konstanz 78457, Germany
| |
Collapse
|