1
|
Chen L, Leung JMG, Zsigmond K, Chong LT, Miranda-Quintana RA. SHINE: Deterministic Many-to-Many Clustering of Molecular Pathways. J Chem Inf Model 2025; 65:4775-4782. [PMID: 40326720 PMCID: PMC12107702 DOI: 10.1021/acs.jcim.5c00240] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/07/2025]
Abstract
State-of-the-art molecular dynamics (MD) simulation methods can generate diverse ensembles of pathways for complex biological processes. Analyzing these pathways using statistical mechanics tools demands identifying key states that contribute to both the dynamic and equilibrium properties of the system. This task becomes especially challenging when analyzing multiple MD simulations simultaneously, a common scenario in enhanced sampling techniques like the weighted ensemble strategy. Here, we present a new module of the MDANCE package designed to streamline the analysis of pathway ensembles. This module integrates n-ary similarity, cheminformatics-inspired tools, and hierarchical clustering to improve analysis efficiency. We present the theoretical foundation behind this approach, termed Sampling Hierarchical Intrinsic N-ary Ensembles (SHINE), and demonstrate its application to simulations of alanine dipeptide and adenylate kinase.
Collapse
Affiliation(s)
- Lexin Chen
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32603, United States
| | - Jeremy M G Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Krisztina Zsigmond
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32603, United States
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Ramón Alain Miranda-Quintana
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32603, United States
| |
Collapse
|
2
|
Latorraca NR, Sabaat S, Habrian CH, Bleier J, Stanley C, Kinz-Thompson CD, Marqusee S, Isacoff EY. Domain coupling in activation of a family C GPCR. Nat Chem Biol 2025:10.1038/s41589-025-01895-3. [PMID: 40281344 DOI: 10.1038/s41589-025-01895-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2024] [Accepted: 03/31/2025] [Indexed: 04/29/2025]
Abstract
The G protein-coupled metabotropic glutamate receptors form homodimers and heterodimers with highly diverse responses to glutamate and varying physiological functions. We employ molecular dynamics, single-molecule spectroscopy and hydrogen-deuterium exchange to dissect the activation pathway triggered by glutamate. We find that activation entails multiple loosely coupled steps, including formation of an agonist-bound, pre-active intermediate whose transition to active conformations forms dimerization interface contacts that set efficacy. The agonist-bound receptor populates at least two additional intermediates en route to G protein-coupling conformations. Sequential transitions into these states act as 'gates', which attenuate the effects of glutamate. Thus, the agonist-bound receptor is remarkably dynamic, with low occupancy of G protein-coupling conformations, providing considerable headroom for modulation by allosteric ligands. Sequence variation within the dimerization interface, as well as altered conformational coupling in receptor heterodimers, may contribute to precise decoding of glutamate signals over broad spatial and temporal scales.
Collapse
Affiliation(s)
- Naomi R Latorraca
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Department of Biochemistry and Molecular Biophysics, Columbia University Irving Medical Center, New York City, NY, USA
| | - Sam Sabaat
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Chris H Habrian
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Department of Molecular and Cellular Physiology, Stanford University, Stanford, CA, USA
| | - Julia Bleier
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | - Cherise Stanley
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
| | | | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA
- Department of Chemistry, University of California, Berkeley, Berkeley, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA
| | - Ehud Y Isacoff
- Department of Molecular and Cell Biology, University of California, Berkeley, Berkeley, CA, USA.
- California Institute for Quantitative Biosciences, University of California, Berkeley, Berkeley, CA, USA.
- Department of Neuroscience, University of California, Berkeley, Berkeley, CA, USA.
- Molecular Biology and Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA.
- Weill Neurohub, University of California, Berkeley, Berkeley, CA, USA.
| |
Collapse
|
3
|
Braza MK, Demir Ö, Ahn SH, Morris CK, Calvó-Tusell C, McGuire KL, de la Peña Avalos B, Carpenter MA, Chen Y, Casalino L, Aihara H, Herzik MA, Harris RS, Amaro RE. Regulatory Interactions between APOBEC3B N- and C-Terminal Domains. J Chem Inf Model 2025; 65:3593-3604. [PMID: 40105360 PMCID: PMC12004528 DOI: 10.1021/acs.jcim.4c02272] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2024] [Revised: 01/29/2025] [Accepted: 02/11/2025] [Indexed: 03/20/2025]
Abstract
APOBEC3B (A3B) is implicated in DNA mutations that facilitate tumor evolution. Although structures of its individual N- and C-terminal domains (NTD and CTD) have been resolved through X-ray crystallography, the full-length A3B (fl-A3B) structure remains elusive, limiting our understanding of its dynamics and mechanisms. In particular, the APOBEC3B C-terminal domain (A3Bctd) is frequently closed in models and structures. In this study, we built several new models of fl-A3B using integrative structural biology methods and selected a top model for further dynamical investigation. We compared the dynamics of the truncated (A3Bctd) to that of the fl-A3B via conventional and Gaussian accelerated molecular dynamics (MD) simulations. Subsequently, we employed weighted ensemble methods to explore the fl-A3B active site opening mechanism, finding that interactions at the NTD-CTD interface enhance the opening frequency of the fl-A3B active site. Our findings shed light on the structural dynamics and potential druggability of fl-A3B, including observations regarding both the active and allosteric sites, which may offer new avenues for therapeutic intervention in cancer.
Collapse
Affiliation(s)
- Mac Kevin
E. Braza
- Department
of Chemistry and Biochemistry, University
of California, San Diego, La Jolla, California 92093, United States
| | - Özlem Demir
- Department
of Chemistry and Biochemistry, University
of California, San Diego, La Jolla, California 92093, United States
| | - Surl-Hee Ahn
- Department
of Chemical Engineering, University of California,
Davis, Davis, California 95616, United States
| | - Clare K. Morris
- Department
of Chemistry and Biochemistry, University
of California, San Diego, La Jolla, California 92093, United States
| | - Carla Calvó-Tusell
- Department
of Molecular Biology, University of California,
San Diego, La Jolla, California 92093, United States
| | - Kelly L. McGuire
- Department
of Chemistry and Biochemistry, University
of California, San Diego, La Jolla, California 92093, United States
| | - Bárbara de la Peña Avalos
- Department
of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas 78229, United States
| | - Michael A. Carpenter
- Department
of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas 78229, United States
- Howard
Hughes Medical Institute, University of
Texas Health San Antonio, San Antonio, Texas 78229, United States
| | - Yanjun Chen
- Department
of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas 78229, United States
| | - Lorenzo Casalino
- Department
of Molecular Biology, University of California,
San Diego, La Jolla, California 92093, United States
| | - Hideki Aihara
- Department
of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, Minnesota 55455, United States
| | - Mark A. Herzik
- Department
of Chemistry and Biochemistry, University
of California, San Diego, La Jolla, California 92093, United States
| | - Reuben S. Harris
- Department
of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, Texas 78229, United States
- Howard
Hughes Medical Institute, University of
Texas Health San Antonio, San Antonio, Texas 78229, United States
| | - Rommie E. Amaro
- Department
of Molecular Biology, University of California,
San Diego, La Jolla, California 92093, United States
| |
Collapse
|
4
|
Leung JG, Frazee NC, Brace A, Bogetti AT, Ramanathan A, Chong LT. Unsupervised Learning of Progress Coordinates during Weighted Ensemble Simulations: Application to NTL9 Protein Folding. J Chem Theory Comput 2025; 21:3691-3699. [PMID: 40105797 PMCID: PMC11983707 DOI: 10.1021/acs.jctc.4c01136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2024] [Revised: 02/21/2025] [Accepted: 02/24/2025] [Indexed: 03/20/2025]
Abstract
A major challenge for many rare-event sampling strategies is the identification of progress coordinates that capture the slowest relevant motions. Machine-learning methods that can identify progress coordinates in an unsupervised manner have therefore been of great interest to the simulation community. Here, we developed a general method for identifying progress coordinates "on-the-fly" during weighted ensemble (WE) rare-event sampling via deep learning (DL) of outliers among sampled conformations. Our method identifies outliers in a latent space model of the system's sampled conformations that is periodically trained using a convolutional variational autoencoder. As a proof of principle, we applied our DL-enhanced WE method to simulate the NTL9 protein folding process. To enable rapid tests, our simulations propagated discrete-state synthetic molecular dynamics trajectories using a generative, fine-grained Markov state model. Results revealed that our on-the-fly DL of outliers enhanced the efficiency of WE by >3-fold in estimating the folding rate constant. Our efforts are a significant step forward in the unsupervised learning of slow coordinates during rare event sampling.
Collapse
Affiliation(s)
- Jeremy
M. G. Leung
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Nicolas C. Frazee
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Alexander Brace
- Data
Science and Learning Division, Argonne National
Laboratory, Lemont, Illinois 60439, United States
- Department
of Computer Science, University of Chicago, Chicago, Illinois 60637, United States
| | - Anthony T. Bogetti
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Arvind Ramanathan
- Data
Science and Learning Division, Argonne National
Laboratory, Lemont, Illinois 60439, United States
- Department
of Computer Science, University of Chicago, Chicago, Illinois 60637, United States
| | - Lillian T. Chong
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
5
|
Lynch DL, Fan Z, Pavlova A, Gumbart JC. Weighted Ensemble Simulations Reveal Novel Conformations and Modulator Effects in Hepatitis B Virus Capsid Assembly. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.15.643452. [PMID: 40166272 PMCID: PMC11957032 DOI: 10.1101/2025.03.15.643452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]
Abstract
Molecular dynamics (MD) simulations provide a detailed description of biophysical processes allowing mechanistic questions to be addressed at the atomic level. The promise of such approaches is partly hampered by well known sampling issues of typical simulations, where time scales available are significantly shorter than the process of interest. For the system of interest here, the binding of modulators of Hepatitis B virus capsid self-assembly, the binding site is at a flexible protein-protein interface. Characterization of the conformational landscape and how it is altered upon ligand binding is thus a prerequisite for a complete mechanistic description of capsid assembly modulation. However, such a description can be difficult due to the aforementioned sampling issues of standard MD, and enhanced sampling strategies are required. Here we employ the Weighted Ensemble methodology to characterize the free-energy landscape of our earlier determined functionally relevant progress coordinates. It is shown that this approach provides conformations outside those sampled by standard MD, as well as an increased number of structures with correspondingly enlarged binding pockets conducive to ligand binding, illustrating the utility of Weighted Ensemble for computational drug development.
Collapse
Affiliation(s)
- Diane L Lynch
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
| | - Zixing Fan
- Interdisciplinary Bioengineering Graduate Program, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
| | - Anna Pavlova
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
| | - James C Gumbart
- School of Physics, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, Georgia, 30332, USA
| |
Collapse
|
6
|
Han Y, Dawson JR, DeMarco KR, Rouen KC, Ngo K, Bekker S, Yarov-Yarovoy V, Clancy CE, Xiang YK, Ahn SH, Vorobyov I. Molecular simulations reveal intricate coupling between agonist-bound β-adrenergic receptors and G protein. iScience 2025; 28:111741. [PMID: 39898043 PMCID: PMC11787599 DOI: 10.1016/j.isci.2024.111741] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 10/24/2024] [Accepted: 12/18/2024] [Indexed: 02/04/2025] Open
Abstract
G protein-coupled receptors (GPCRs) and G proteins transmit signals from hormones and neurotransmitters across cell membranes, initiating downstream signaling and modulating cellular behavior. Using advanced computer modeling and simulation, we identified atomistic-level structural, dynamic, and energetic mechanisms of norepinephrine (NE) and stimulatory G protein (Gs) interactions with β-adrenergic receptors (βARs), crucial GPCRs for heart function regulation and major drug targets. Our analysis revealed distinct binding behaviors of NE within β1AR and β2AR despite identical orthosteric binding pockets. β2AR had an additional binding site, explaining variations in NE binding affinities. Simulations showed significant differences in NE dissociation pathways and receptor interactions with the Gs. β1AR binds Gs more strongly, while β2AR induces greater conformational changes in the α subunit of Gs. Furthermore, GTP and GDP binding to Gs may disrupt coupling between NE and βAR, as well as between βAR and Gs. These findings may aid in designing precise βAR-targeted drugs.
Collapse
Affiliation(s)
- Yanxiao Han
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
| | - John R.D. Dawson
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Biophysics Graduate Group, University of California, Davis, Davis, CA 95616, USA
| | - Kevin R. DeMarco
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
| | - Kyle C. Rouen
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Biophysics Graduate Group, University of California, Davis, Davis, CA 95616, USA
| | - Khoa Ngo
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Biophysics Graduate Group, University of California, Davis, Davis, CA 95616, USA
| | - Slava Bekker
- American River College, Sacramento, CA 95841, USA
| | - Vladimir Yarov-Yarovoy
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Department of Anesthesiology and Pain Medicine, University of California, Davis, Davis, CA 95616, USA
- Center for Precision Medicine and Data Science, University of California, Davis, Davis, CA 95616, USA
| | - Colleen E. Clancy
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Center for Precision Medicine and Data Science, University of California, Davis, Davis, CA 95616, USA
- Department of Pharmacology, University of California, Davis, Davis, CA 95616, USA
| | - Yang K. Xiang
- Department of Pharmacology, University of California, Davis, Davis, CA 95616, USA
- VA Northern California Health Care System, Mather, CA 95655, USA
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California, Davis, Davis, CA 95616, USA
| | - Igor Vorobyov
- Department of Physiology and Membrane Biology, University of California, Davis, Davis, CA 95616, USA
- Department of Pharmacology, University of California, Davis, Davis, CA 95616, USA
| |
Collapse
|
7
|
Chen L, Leung JMG, Zsigmond K, Chong LT, Miranda-Quintana RA. SHINE: Deterministic Many-to-Many clustering of Molecular Pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.07.636541. [PMID: 39975301 PMCID: PMC11839051 DOI: 10.1101/2025.02.07.636541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/21/2025]
Abstract
State-of-the-art molecular dynamics (MD) simulation methods can generate diverse ensembles of pathways for complex biological processes. Analyzing these pathways using statistical mechanics tools demands identifying key states that contribute to both the dynamic and equilibrium properties of the system. This task becomes especially challenging when analyzing multiple MD simulations simultaneously, a common scenario in enhanced sampling techniques like the weighted ensemble strategy. Here, we present a new module of the MDANCE package designed to streamline the analysis of pathway ensembles. This module integrates n-ary similarity, cheminformatics-inspired tools, and hierarchical clustering to improve analysis efficiency. We present the theoretical foundation behind this approach, termed Sampling Hierarchical Intrinsic N-ary Ensembles (SHINE), and demonstrate its application to simulations of alanine dipeptide and adenylate kinase.
Collapse
Affiliation(s)
- Lexin Chen
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL 32603, USA
| | - Jeremy M G Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Krisztina Zsigmond
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL 32603, USA
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | | |
Collapse
|
8
|
Mitra S, Biswas R, Chakrabarty S. WeTICA: A directed search weighted ensemble based enhanced sampling method to estimate rare event kinetics in a reduced dimensional space. J Chem Phys 2025; 162:034106. [PMID: 39812249 DOI: 10.1063/5.0239713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2024] [Accepted: 12/30/2024] [Indexed: 01/16/2025] Open
Abstract
Estimating rare event kinetics from molecular dynamics simulations is a non-trivial task despite the great advances in enhanced sampling methods. Weighted Ensemble (WE) simulation, a special class of enhanced sampling techniques, offers a way to directly calculate kinetic rate constants from biased trajectories without the need to modify the underlying energy landscape using bias potentials. Conventional WE algorithms use different binning schemes to partition the collective variable (CV) space separating the two metastable states of interest. In this work, we have developed a new "binless" WE simulation algorithm to bypass the hurdles of optimizing binning procedures. Our proposed protocol (WeTICA) uses a low-dimensional CV space to drive the WE simulation toward the specified target state. We have applied this new algorithm to recover the unfolding kinetics of three proteins: (A) TC5b Trp-cage mutant, (B) TC10b Trp-cage mutant, and (C) Protein G, with unfolding times spanning the range between 3 and 40 μs using projections along predefined fixed Time-lagged Independent Component Analysis (TICA) eigenvectors as CVs. Calculated unfolding times converge to the reported values with good accuracy with more than one order of magnitude less cumulative WE simulation time than the unfolding time scales with or without a priori knowledge of the CVs that can capture unfolding. Our algorithm can be used with other linear CVs, not limited to TICA. Moreover, the new walker selection criteria for resampling employed in this algorithm can be used on more sophisticated nonlinear CV space for further improvements of binless WE methods.
Collapse
Affiliation(s)
- Sudipta Mitra
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Ranjit Biswas
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| | - Suman Chakrabarty
- Department of Chemical and Biological Sciences, S. N. Bose National Centre for Basic Sciences, Block-JD, Sector-III, Salt Lake, Kolkata 700106, India
| |
Collapse
|
9
|
Zhang BW, Fajer M, Chen W, Moraca F, Wang L. Leveraging the Thermodynamics of Protein Conformations in Drug Discovery. J Chem Inf Model 2025; 65:252-264. [PMID: 39681511 DOI: 10.1021/acs.jcim.4c01612] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2024]
Abstract
As the name implies, structure-based drug design requires confidence in the holo complex structure. The ability to clarify which protein conformation to use when ambiguity arises would be incredibly useful. We present a large scale validation of the computational method Protein Reorganization Free Energy Perturbation (PReorg-FEP) and demonstrate its quantitative accuracy in selecting the correct protein conformation among candidate models in apo or ligand induced states for 14 different systems. These candidate conformations are pulled from various drug discovery related campaigns: cryptic conformations induced by novel hits in lead identification, binding site rearrangement during lead optimization, and conflicting structural biology models. We also show an example of a pH-dependent conformational change, relevant to protein design.
Collapse
Affiliation(s)
- Bin W Zhang
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Mikolai Fajer
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Wei Chen
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Francesca Moraca
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| | - Lingle Wang
- Schrödinger Inc., 1540 Broadway, 24th Floor, New York, New York 10036-4041, United States
| |
Collapse
|
10
|
Braza MKE, Demir Ö, Ahn SH, Morris CK, Calvó-Tusell C, McGuire KL, de la Peña Avalos B, Carpenter MA, Chen Y, Casalino L, Aihara H, Herzik MA, Harris RS, Amaro RE. Regulatory interactions between APOBEC3B N- and C-terminal domains. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.12.11.628032. [PMID: 39713448 PMCID: PMC11661193 DOI: 10.1101/2024.12.11.628032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/24/2024]
Abstract
APOBEC3B (A3B) is implicated in DNA mutations that facilitate tumor evolution. Although structures of its individual N- and C-terminal domains (NTD and CTD) have been resolved through X-ray crystallography, the full-length A3B (fl-A3B) structure remains elusive, limiting understanding of its dynamics and mechanisms. In particular, the APOBEC3B C-terminal domain (A3Bctd) active site is frequently closed in models and structures. In this study, we built several new models of fl-A3B using integrative structural biology methods and selected a top model for further dynamical investigation. We compared dynamics of the truncated (A3Bctd) to the fl-A3B via conventional and Gaussian accelerated molecular dynamics (MD) simulations. Subsequently, we employed weighted ensemble methods to explore the fl-A3B active site opening mechanism, finding that interactions at the NTD-CTD interface enhance the opening frequency of the fl-A3B active site. Our findings shed light on the structural dynamics of fl-A3B, which may offer new avenues for therapeutic intervention in cancer.
Collapse
Affiliation(s)
- Mac Kevin E Braza
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA
| | - Özlem Demir
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California, Davis, Davis, CA
| | - Clare K Morris
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA
| | - Carla Calvó-Tusell
- Department of Molecular Biology, University of California, San Diego, La Jolla, CA
| | - Kelly L McGuire
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA
| | - Bárbara de la Peña Avalos
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, TX
| | - Michael A Carpenter
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, TX
- Howard Hughes Medical Institute, University of Texas Health San Antonio, San Antonio, TX
| | - Yanjun Chen
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, TX
| | - Lorenzo Casalino
- Department of Molecular Biology, University of California, San Diego, La Jolla, CA
| | - Hideki Aihara
- Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN
| | - Mark A Herzik
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA
| | - Reuben S Harris
- Department of Biochemistry and Structural Biology, University of Texas Health San Antonio, San Antonio, TX
- Howard Hughes Medical Institute, University of Texas Health San Antonio, San Antonio, TX
| | - Rommie E Amaro
- Department of Molecular Biology, University of California, San Diego, La Jolla, CA
| |
Collapse
|
11
|
Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning-Based Information Bottleneck. J Chem Theory Comput 2024; 20:10371-10383. [PMID: 39589127 DOI: 10.1021/acs.jctc.4c00919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2024]
Abstract
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed state predictive information bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignolin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways and offering direct visualization of dynamics.
Collapse
Affiliation(s)
- Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
- University of Maryland Institute for Health Computing, Bethesda, Maryland 20852, United States
| |
Collapse
|
12
|
He X, Man VH, Gao J, Wang J. Effects of All-Atom and Coarse-Grained Molecular Mechanics Force Fields on Amyloid Peptide Assembly: The Case of a Tau K18 Monomer. J Chem Inf Model 2024; 64:8880-8891. [PMID: 39579121 DOI: 10.1021/acs.jcim.4c01448] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2024]
Abstract
To propose new mechanism-based therapeutics for Alzheimer's disease (AD), it is crucial to study the kinetics and oligomerization/aggregation mechanisms of the hallmark tau proteins, which have various isoforms and are intrinsically disordered. In this study, multiple all-atom (AA) and coarse-grained (CG) force fields (FFs) have been benchmarked on molecular dynamics (MD) simulations of K18 tau (M243-E372), which is a truncated form (130 residues) of full-length tau (441 residues). FF19SB is first excluded because the dynamics are too slow, and the conformations are too stable. All other benchmarked AAFFs (Charmm36m, FF14SB, Gromos54A7, and OPLS-AA) and CGFFs (Martini3 and Sirah2.0) exhibit a trend of shrinking K18 tau into compact structures with the radius of gyration (ROG) around 2.0 nm, which is much smaller than the experimental value of 3.8 nm, within 200 ns of AA-MD or 2000 ns of CG-MD. Gromos54A7, OPLS-AA, and Martini3 shrink much faster than the other FFs. To perform meaningful postanalysis of various properties, we propose a strategy of selecting snapshots with 2.5 < ROG < 4.5 nm, instead of using all sampled snapshots. The calculated chemical shifts of all C, CA, and CB atoms have very good and close root-mean-square error (RMSE) values, while Charmm36m and Sirah2.0 exhibit better chemical shifts of N than other FFs. Comparing the calculated distributions of the distance between the CA atoms of CYS291 and CYS322 with the results of the FRET experiment demonstrates that Charmm36m is a perfect match with the experiment while other FFs exhibit limitations. In summary, Charmm36m is recommended as the best AAFF, and Sirah2.0 is recommended as an excellent CGFF for simulating tau K18.
Collapse
Affiliation(s)
- Xibing He
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Viet Hoang Man
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| | - Jie Gao
- Department of Neuroscience, The Ohio State University Wexner Medical Center, Columbus, Ohio 43210, United States
| | - Junmei Wang
- Department of Pharmaceutical Sciences and Computational Chemical Genomics Screening Center, School of Pharmacy, University of Pittsburgh, Pittsburgh, Pennsylvania 15261, United States
| |
Collapse
|
13
|
Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning based Information Bottleneck. ARXIV 2024:arXiv:2406.14839v2. [PMID: 38947925 PMCID: PMC11213147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
The weighted ensemble (WE) method stands out as a widely used segment-based sampling technique renowned for its rigorous treatment of kinetics. The WE framework typically involves initially mapping the configuration space onto a low-dimensional collective variable (CV) space and then partitioning it into bins. The efficacy of WE simulations heavily depends on the selection of CVs and binning schemes. The recently proposed State Predictive Information Bottleneck (SPIB) method has emerged as a promising tool for automatically constructing CVs from data and guiding enhanced sampling through an iterative manner. In this work, we advance this data-driven pipeline by incorporating prior expert knowledge. Our hybrid approach combines SPIB-learned CVs to enhance sampling in explored regions with expert-based CVs to guide exploration in regions of interest, synergizing the strengths of both methods. Through benchmarking on alanine dipeptide and chignoin systems, we demonstrate that our hybrid approach effectively guides WE simulations to sample states of interest, and reduces run-to-run variances. Moreover, our integration of the SPIB model also enhances the analysis and interpretation of WE simulation data by effectively identifying metastable states and pathways, and offering direct visualization of dynamics.
Collapse
Affiliation(s)
- Dedi Wang
- Biophysics Program and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
| | - Pratyush Tiwary
- Department of Chemistry and Biochemistry and Institute for Physical Science and Technology, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Bethesda 20852, USA
| |
Collapse
|
14
|
Vithani N, Zhang S, Thompson JP, Patel LA, Demidov A, Xia J, Balaeff A, Mentes A, Arnautova YA, Kohlmann A, Lawson JD, Nicholls A, Skillman AG, LeBard DN. Exploration of Cryptic Pockets Using Enhanced Sampling Along Normal Modes: A Case Study of KRAS G12D. J Chem Inf Model 2024; 64:8258-8273. [PMID: 39419500 PMCID: PMC11558672 DOI: 10.1021/acs.jcim.4c01435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2024] [Revised: 10/03/2024] [Accepted: 10/03/2024] [Indexed: 10/19/2024]
Abstract
Identification of cryptic pockets has the potential to open new therapeutic opportunities by discovering ligand binding sites that remain hidden in static apo structures of a target protein. Moreover, allosteric cryptic pockets can become valuable for designing target-selective ligands when the natural ligand binding sites are conserved in variants of a protein. For example, before an allosteric cryptic pocket was discovered, KRAS was considered undruggable due to its smooth surface and conservation of the GDP/GTP binding pocket across the wild type and oncogenic isoforms. Recent identification of the Switch-II cryptic pocket in the KRASG12C mutant and FDA approval of anticancer drugs targeting this site underscores the importance of cryptic pockets in solving pharmaceutical challenges. Here, we present a newly developed approach for the exploration of cryptic pockets using weighted ensemble molecular dynamics simulations with inherent normal modes as progress coordinates applied to the wild type KRAS and the G12D mutant. We performed extensive all-atomic simulations (>400 μs) with and without several cosolvents (xenon, ethanol, benzene), and analyzed trajectories using three distinct methods to search for potential binding pockets. These methods have been applied as a proof-of-concept to KRAS and have shown they can predict known cryptic binding sites. Furthermore, we performed ligand-binding simulations of a known inhibitor (MRTX1133) to shed light on the nature of cryptic pockets in KRASG12D and the role of conformational selection vs induced-fit mechanism in the formation of these cryptic pockets.
Collapse
Affiliation(s)
- Neha Vithani
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - She Zhang
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - Jeffrey P. Thompson
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - Lara A. Patel
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - Alex Demidov
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - Junchao Xia
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | - Alexander Balaeff
- Black
Diamond Therapeutics, Cambridge, Massachusetts 02142, United States
| | - Ahmet Mentes
- Black
Diamond Therapeutics, Cambridge, Massachusetts 02142, United States
| | | | - Anna Kohlmann
- Black
Diamond Therapeutics, Cambridge, Massachusetts 02142, United States
| | - J. David Lawson
- Mirati
Therapeutics, Inc., San Diego, California 92121, United States
| | - Anthony Nicholls
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| | | | - David N. LeBard
- OpenEye,
Cadence Molecular Sciences, Santa Fe, New Mexico 87508, United States
| |
Collapse
|
15
|
Choe S. Insights into Translocation of Arginine-Rich Cell-Penetrating Peptides across a Model Membrane. J Phys Chem B 2024; 128:10894-10903. [PMID: 39445646 DOI: 10.1021/acs.jpcb.4c04266] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2024]
Abstract
It is well-known that membrane deformation and water pores contribute to the spontaneous translocation of arginine-rich cell-penetrating peptides (CPPs). We confirm this through the observation of the spontaneous translocation of single R9 (nona-arginine) and Tat (48-60) peptides across a model membrane using the weighted ensemble (WE) method within all-atom molecular dynamics (MD) simulations. Furthermore, we demonstrate that membrane deformation and the presence of a water pore reduce the effective charge of the CPP and the bending rigidity of the model membrane during translocation. We find that R9 disturbs the model membrane more than Tat (48-60), leading to more efficient translocation of R9 across the model membrane.
Collapse
Affiliation(s)
- Seungho Choe
- Department of Energy Science & Engineering, Daegu Gyeongbuk Institute of Science & Technology (DGIST), Daegu 42988, Korea
- Energy Science & Engineering Research Center, Daegu Gyeongbuk Institute of Science & Technology (DGIST), Daegu 42988, Korea
| |
Collapse
|
16
|
Silvestrini ML, Solazzo R, Boral S, Cocco MJ, Closson JD, Masetti M, Gardner KH, Chong LT. Gating residues govern ligand unbinding kinetics from the buried cavity in HIF-2α PAS-B. Protein Sci 2024; 33:e5198. [PMID: 39467204 PMCID: PMC11516114 DOI: 10.1002/pro.5198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 09/24/2024] [Accepted: 09/27/2024] [Indexed: 10/30/2024]
Abstract
While transcription factors have been generally perceived as "undruggable," an exception is the HIF-2 hypoxia-inducible transcription factor, which contains an internal cavity that is sufficiently large to accommodate a range of small-molecules, including the therapeutically used inhibitor belzutifan. Given the relatively long ligand residence times of these small molecules and the lack of any experimentally observed pathway connecting the cavity to solvent, there has been great interest in understanding how these drug ligands exit the buried receptor cavity. Here, we focus on the relevant PAS-B domain of hypoxia-inducible factor 2α (HIF-2α) and examine how one such small molecule (THS-017) exits from the buried cavity within this domain on the seconds-timescale using atomistic simulations and ZZ-exchange NMR. To enable the simulations, we applied the weighted ensemble path sampling strategy, which generates continuous pathways for a rare-event process [e.g., ligand (un)binding] with rigorous kinetics in orders of magnitude less computing time compared to conventional simulations. Results reveal the formation of an encounter complex intermediate and two distinct classes of pathways for ligand exit. Based on these pathways, we identified two pairs of conformational gating residues in the receptor: one for the major class (N288 and S304) and another for the minor class (L272 and M309). ZZ-exchange NMR validated the kinetic importance of N288 for ligand unbinding. Our results provide an ideal simulation dataset for rational manipulation of ligand unbinding kinetics.
Collapse
Affiliation(s)
| | - Riccardo Solazzo
- Department of Pharmacy and BiotechnologyAlma Mater Studiorum‐Università di BolognaBolognaItaly
| | - Soumendu Boral
- Structural Biology InitiativeCUNY Advanced Science Research CenterNew YorkNew YorkUSA
| | - Melanie J. Cocco
- Department of Pharmaceutical SciencesUniversity of California, IrvineIrvineCaliforniaUSA
- Department of Molecular Biology and BiochemistryUniversity of California, IrvineIrvineCaliforniaUSA
| | - Joseph D. Closson
- Structural Biology InitiativeCUNY Advanced Science Research CenterNew YorkNew YorkUSA
- PhD Program in BiochemistryCUNY Graduate CenterNew YorkNew YorkUSA
| | - Matteo Masetti
- Department of Pharmacy and BiotechnologyAlma Mater Studiorum‐Università di BolognaBolognaItaly
| | - Kevin H. Gardner
- Structural Biology InitiativeCUNY Advanced Science Research CenterNew YorkNew YorkUSA
- Department of Chemistry and BiochemistryCity College of New YorkNew YorkNew YorkUSA
- PhD Programs in Biochemistry, Biology, and ChemistryCUNY Graduate CenterNew YorkNew YorkUSA
| | - Lillian T. Chong
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| |
Collapse
|
17
|
Poruthoor AJ, Stallone JJ, Miaro M, Sharma A, Grossfield A. System size effects on the free energy landscapes from molecular dynamics of phase-separating bilayers. J Chem Phys 2024; 161:145101. [PMID: 39382132 PMCID: PMC11829248 DOI: 10.1063/5.0225753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Accepted: 09/24/2024] [Indexed: 10/10/2024] Open
Abstract
The "lipid raft" hypothesis proposes that cell membranes contain distinct domains of varying lipid compositions, where "rafts" of ordered lipids and cholesterol coexist with disordered lipid regions. Experimental and theoretical phase diagrams of model membranes have revealed multiple coexisting phases. Molecular dynamics (MD) simulations can also capture spontaneous phase separation of bilayers. However, these methods merely determine the sign of the free energy change upon phase separation-whether or not it is favorable-but not the amplitude. Recently, we developed a workflow to compute the free energy of phase separation from MD simulations using the weighted ensemble method. However, while theoretical treatments generally focus on infinite systems and experimental measurements on mesoscopic to macroscopic systems, MD simulations are comparatively small. Therefore, if we are to put the results of these calculations into the appropriate context, we need to understand the effects the finite size of the simulation has on the computed free energy landscapes. In this study, we investigate this phenomenon by computing free energy profiles for a model phase-separating system as a function of system size, ranging from 324 to 10 110 lipids. The results suggest that, within the limits of statistical uncertainty, bulk-like behavior emerges once the systems contain roughly 4000 lipids.
Collapse
Affiliation(s)
- Ashlin J. Poruthoor
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Jack J. Stallone
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Megan Miaro
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Akshara Sharma
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| | - Alan Grossfield
- Department of Biochemistry and Biophysics, University of Rochester Medical Center, Rochester, New York 14642, USA
| |
Collapse
|
18
|
Yang DT, Goldberg AM, Chong LT. Rare-Event Sampling using a Reinforcement Learning-Based Weighted Ensemble Method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.09.617475. [PMID: 39416089 PMCID: PMC11482931 DOI: 10.1101/2024.10.09.617475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Despite the power of path sampling strategies in enabling simulations of rare events, such strategies have not reached their full potential. A common challenge that remains is the identification of a progress coordinate that captures the slow relevant motions of a rare event. Here we have developed a weighted ensemble (WE) path sampling strategy that exploits reinforcement learning to automatically identify an effective progress coordinate among a set of potential coordinates during a simulation. We apply our WE strategy with reinforcement learning to three benchmark systems: (i) an egg carton-shaped toy potential, (ii) an S-shaped toy potential, and (iii) a dimer of the HIV-1 capsid protein (C-terminal domain). To enable rapid testing of the latter system at the atomic level, we employed discrete-state synthetic molecular dynamics trajectories using a generative, fine-grained Markov state model that was based on extensive conventional simulations. Our results demonstrate that using concepts from reinforcement learning with a weighted ensemble of trajectories automatically identifies relevant progress co-ordinates among multiple candidates at a given time during a simulation. Due to the rigorous weighting of trajectories, the simulations maintain rigorous kinetics.
Collapse
Affiliation(s)
- Darian T. Yang
- Molecular Biophysics and Structural Biology Graduate Program, University of Pittsburgh and Carnegie Mellon University, Pittsburgh, Pennsylvania 15260
- Department of Structural Biology, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania 15260
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Alex M. Goldberg
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| |
Collapse
|
19
|
Xu X, Closson JD, Marcelino LP, Favaro DC, Silvestrini ML, Solazzo R, Chong LT, Gardner KH. Identification of small-molecule ligand-binding sites on and in the ARNT PAS-B domain. J Biol Chem 2024; 300:107606. [PMID: 39059491 PMCID: PMC11381877 DOI: 10.1016/j.jbc.2024.107606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2024] [Revised: 07/16/2024] [Accepted: 07/18/2024] [Indexed: 07/28/2024] Open
Abstract
Transcription factors are challenging to target with small-molecule inhibitors due to their structural plasticity and lack of catalytic sites. Notable exceptions include naturally ligand-regulated transcription factors, including our prior work with the hypoxia-inducible factor (HIF)-2 transcription factor, showing that small-molecule binding within an internal pocket of the HIF-2α Per-Aryl hydrocarbon Receptor Nuclear Translocator (ARNT)-Sim (PAS)-B domain can disrupt its interactions with its dimerization partner, ARNT. Here, we explore the feasibility of targeting small molecules to the analogous ARNT PAS-B domain itself, potentially opening a promising route to modulate several ARNT-mediated signaling pathways. Using solution NMR fragment screening, we previously identified several compounds that bind ARNT PAS-B and, in certain cases, antagonize ARNT association with the transforming acidic coiled-coil containing protein 3 transcriptional coactivator. However, these ligands have only modest binding affinities, complicating characterization of their binding sites. We address this challenge by combining NMR, molecular dynamics simulations, and ensemble docking to identify ligand-binding "hotspots" on and within the ARNT PAS-B domain. Our data indicate that the two ARNT/transforming acidic coiled-coil containing protein 3 inhibitors, KG-548 and KG-655, bind to a β-sheet surface implicated in both HIF-2 dimerization and coactivator recruitment. Furthermore, while KG-548 binds exclusively to the β-sheet surface, KG-655 can additionally bind within a water-accessible internal cavity in ARNT PAS-B. Finally, KG-279, while not a coactivator inhibitor, exemplifies ligands that preferentially bind only to the internal cavity. All three ligands promoted ARNT PAS-B homodimerization, albeit to varying degrees. Taken together, our findings provide a comprehensive overview of ARNT PAS-B ligand-binding sites and may guide the development of more potent coactivator inhibitors for cellular and functional studies.
Collapse
Affiliation(s)
- Xingjian Xu
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, New York, USA; PhD Program in Biochemistry, The Graduate Center, CUNY, New York, New York, USA
| | - Joseph D Closson
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, New York, USA; PhD Program in Biochemistry, The Graduate Center, CUNY, New York, New York, USA
| | | | - Denize C Favaro
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, New York, USA
| | - Marion L Silvestrini
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Riccardo Solazzo
- Department of Pharmacy and Biotechnology, Alma Mater Studiorum-University of Bologna, Bologna, Bologna, Italy
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania, USA
| | - Kevin H Gardner
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, New York, USA; Department of Chemistry and Biochemistry, City College of New York, New York, New York, USA; PhD. Programs in Biochemistry, Chemistry and Biology, The Graduate Center, CUNY, New York, New York, USA.
| |
Collapse
|
20
|
Strahan J, Lorpaiboon C, Weare J, Dinner AR. BAD-NEUS: Rapidly converging trajectory stratification. J Chem Phys 2024; 161:084109. [PMID: 39185846 PMCID: PMC11349377 DOI: 10.1063/5.0215975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Accepted: 07/25/2024] [Indexed: 08/27/2024] Open
Abstract
An issue for molecular dynamics simulations is that events of interest often involve timescales that are much longer than the simulation time step, which is set by the fastest timescales of the model. Because of this timescale separation, direct simulation of many events is prohibitively computationally costly. This issue can be overcome by aggregating information from many relatively short simulations that sample segments of trajectories involving events of interest. This is the strategy of Markov state models (MSMs) and related approaches, but such methods suffer from approximation error because the variables defining the states generally do not capture the dynamics fully. By contrast, once converged, the weighted ensemble (WE) method aggregates information from trajectory segments so as to yield unbiased estimates of both thermodynamic and kinetic statistics. Unfortunately, errors decay no faster than unbiased simulation in WE as originally formulated and commonly deployed. Here, we introduce a theoretical framework for describing WE that shows that the introduction of an approximate stationary distribution on top of the stratification, as in nonequilibrium umbrella sampling (NEUS), accelerates convergence. Building on ideas from MSMs and related methods, we generalize the NEUS approach in such a way that the approximation error can be reduced systematically. We show that the improved algorithm can decrease the simulation time required to achieve the desired precision by orders of magnitude.
Collapse
Affiliation(s)
- John Strahan
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Chatipat Lorpaiboon
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| | - Jonathan Weare
- Courant Institute of Mathematical Sciences, New York University, New York, New York 10012, USA
| | - Aaron R. Dinner
- Department of Chemistry and James Franck Institute, University of Chicago, Chicago, Illinois 60637, USA
| |
Collapse
|
21
|
Nuqui X, Casalino L, Zhou L, Shehata M, Wang A, Tse AL, Ojha AA, Kearns FL, Rosenfeld MA, Miller EH, Acreman CM, Ahn SH, Chandran K, McLellan JS, Amaro RE. Simulation-driven design of stabilized SARS-CoV-2 spike S2 immunogens. Nat Commun 2024; 15:7370. [PMID: 39191724 PMCID: PMC11350062 DOI: 10.1038/s41467-024-50976-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Accepted: 07/25/2024] [Indexed: 08/29/2024] Open
Abstract
The full-length prefusion-stabilized SARS-CoV-2 spike (S) is the principal antigen of COVID-19 vaccines. Vaccine efficacy has been impacted by emerging variants of concern that accumulate most of the sequence modifications in the immunodominant S1 subunit. S2, in contrast, is the most evolutionarily conserved region of the spike and can elicit broadly neutralizing and protective antibodies. Yet, S2's usage as an alternative vaccine strategy is hampered by its general instability. Here, we use a simulation-driven approach to design S2-only immunogens stabilized in a closed prefusion conformation. Molecular simulations provide a mechanistic characterization of the S2 trimer's opening, informing the design of tryptophan substitutions that impart kinetic and thermodynamic stabilization. Structural characterization via cryo-EM shows the molecular basis of S2 stabilization in the closed prefusion conformation. Informed by molecular simulations and corroborated by experiments, we report an engineered S2 immunogen that exhibits increased protein expression, superior thermostability, and preserved immunogenicity against sarbecoviruses.
Collapse
Affiliation(s)
- Xandra Nuqui
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
| | - Lorenzo Casalino
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA
| | - Ling Zhou
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Mohamed Shehata
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA
| | - Albert Wang
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Alexandra L Tse
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Anupam A Ojha
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
| | - Fiona L Kearns
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA
| | - Mia A Rosenfeld
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA
- Laboratory of Computational Biology, National Heart, Lung and Blood Institute, National Institutes of Health, Bethesda, MD, USA
| | - Emily Happy Miller
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA
- Department of Medicine, Division of Infectious Diseases, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Cory M Acreman
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California Davis, Davis, CA, USA
| | - Kartik Chandran
- Department of Microbiology and Immunology, Albert Einstein College of Medicine, Bronx, NY, USA
| | - Jason S McLellan
- Department of Molecular Biosciences, The University of Texas at Austin, Austin, TX, USA
| | - Rommie E Amaro
- Department of Chemistry and Biochemistry, University of California San Diego, La Jolla, CA, USA.
- Department of Molecular Biology, University of California San Diego, La Jolla, CA, USA.
| |
Collapse
|
22
|
Yang D, Chong LT. WEDAP: A Python Package for Streamlined Plotting of Molecular Simulation Data. J Chem Inf Model 2024; 64:5749-5755. [PMID: 39013164 PMCID: PMC11323263 DOI: 10.1021/acs.jcim.4c00867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2024] [Revised: 07/01/2024] [Accepted: 07/06/2024] [Indexed: 07/18/2024]
Abstract
Given the growing interest in path sampling methods for extending the time scales of molecular dynamics (MD) simulations, there has been great interest in software tools that streamline the generation of plots for monitoring the progress of large-scale simulations. Here, we present the WEDAP Python package for simplifying the analysis of data generated from either conventional MD simulations or the weighted ensemble (WE) path sampling method, as implemented in the widely used WESTPA software package. WEDAP facilitates (i) the parsing of WE simulation data stored in highly compressed, hierarchical HDF5 files and (ii) incorporates trajectory weights from WE simulations into all generated plots. Our Python package consists of multiple user-friendly interfaces: a command-line interface, a graphical user interface, and a Python application programming interface. We demonstrate the plotting features of WEDAP through a series of examples using data from WE and conventional MD simulations that focus on the HIV-1 capsid protein's C-terminal domain dimer as a showcase system. The source code for WEDAP is freely available on GitHub at https://github.com/chonglab-pitt/wedap.
Collapse
Affiliation(s)
- Darian
T. Yang
- Molecular
Biophysics and Structural Biology Graduate Program, University of Pittsburgh and Carnegie Mellon University, Pittsburgh, Pennsylvania 15260, United States
- Department
of Structural Biology, University of Pittsburgh
School of Medicine, Pittsburgh, Pennsylvania 15260, United States
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Lillian T. Chong
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
23
|
Plotnikov D, Ahn SH. Optimization of the resampling method in the weighted ensemble simulation toolkit with parallelization and analysis (WESTPA). J Chem Phys 2024; 161:046101. [PMID: 39037142 DOI: 10.1063/5.0197141] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2024] [Accepted: 07/09/2024] [Indexed: 07/23/2024] Open
Affiliation(s)
- Dennis Plotnikov
- Department of Chemical Engineering, University of California, Davis, Davis, California 95616, USA
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California, Davis, Davis, California 95616, USA
| |
Collapse
|
24
|
Tao Y, Giese TJ, Ekesan Ş, Zeng J, Aradi B, Hourahine B, Aktulga HM, Götz AW, Merz KM, York DM. Amber free energy tools: Interoperable software for free energy simulations using generalized quantum mechanical/molecular mechanical and machine learning potentials. J Chem Phys 2024; 160:224104. [PMID: 38856060 DOI: 10.1063/5.0211276] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Accepted: 05/15/2024] [Indexed: 06/11/2024] Open
Abstract
We report the development and testing of new integrated cyberinfrastructure for performing free energy simulations with generalized hybrid quantum mechanical/molecular mechanical (QM/MM) and machine learning potentials (MLPs) in Amber. The Sander molecular dynamics program has been extended to leverage fast, density-functional tight-binding models implemented in the DFTB+ and xTB packages, and an interface to the DeePMD-kit software enables the use of MLPs. The software is integrated through application program interfaces that circumvent the need to perform "system calls" and enable the incorporation of long-range Ewald electrostatics into the external software's self-consistent field procedure. The infrastructure provides access to QM/MM models that may serve as the foundation for QM/MM-ΔMLP potentials, which supplement the semiempirical QM/MM model with a MLP correction trained to reproduce ab initio QM/MM energies and forces. Efficient optimization of minimum free energy pathways is enabled through a new surface-accelerated finite-temperature string method implemented in the FE-ToolKit package. Furthermore, we interfaced Sander with the i-PI software by implementing the socket communication protocol used in the i-PI client-server model. The new interface with i-PI allows for the treatment of nuclear quantum effects with semiempirical QM/MM-ΔMLP models. The modular interoperable software is demonstrated on proton transfer reactions in guanine-thymine mispairs in a B-form deoxyribonucleic acid helix. The current work represents a considerable advance in the development of modular software for performing free energy simulations of chemical reactions that are important in a wide range of applications.
Collapse
Affiliation(s)
- Yujun Tao
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Timothy J Giese
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Şölen Ekesan
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Jinzhe Zeng
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| | - Bálint Aradi
- Bremen Center for Computational Materials Science, University of Bremen, D-28334 Bremen, Germany
| | - Ben Hourahine
- SUPA, Department of Physics, University of Strathclyde, Glasgow G4 0NG, United Kingdom
| | - Hasan Metin Aktulga
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Andreas W Götz
- San Diego Supercomputer Center, University of California San Diego, La Jolla, California 92093, USA
| | - Kenneth M Merz
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA
| | - Darrin M York
- Laboratory for Biomolecular Simulation Research, Institute for Quantitative Biomedicine and Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854, USA
| |
Collapse
|
25
|
Xu X, Closson J, Marcelino LP, Favaro DC, Silvestrini ML, Solazzo R, Chong LT, Gardner KH. Identification of Small Molecule Ligand Binding Sites On and In the ARNT PAS-B Domain. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.11.03.565595. [PMID: 37961463 PMCID: PMC10635134 DOI: 10.1101/2023.11.03.565595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2023]
Abstract
Transcription factors are generally challenging to target with small molecule inhibitors due to their structural plasticity and lack of catalytic sites. Notable exceptions include several naturally ligand-regulated transcription factors, including our prior work with the heterodimeric HIF-2 transcription factor which showed that small molecule binding within an internal pocket of the HIF-2α PAS-B domain can disrupt its interactions with its dimerization partner, ARNT. Here, we explore the feasibility of similarly targeting small molecules to the analogous ARNT PAS-B domain itself, potentially opening a promising route to simultaneously modulate several ARNT-mediated signaling pathways. Using solution NMR screening of an in-house fragment library, we previously identified several compounds that bind ARNT PAS-B and, in certain cases, antagonize ARNT association with the TACC3 transcriptional coactivator. However, these ligands have only modest binding affinities, complicating characterization of their binding sites. We address this challenge by combining NMR, MD simulations, and ensemble docking to identify ligand-binding 'hotspots' on and within the ARNT PAS-B domain. Our data indicate that the two ARNT/TACC3 inhibitors, KG-548 and KG-655, bind to a β-sheet surface implicated in both HIF-2 dimerization and coactivator recruitment. Furthermore, while KG-548 binds exclusively to the β-sheet surface, KG-655 can additionally bind within a water-accessible internal cavity in ARNT PAS-B. Finally, KG-279, while not a coactivator inhibitor, exemplifies ligands that preferentially bind only to the internal cavity. All three ligands promoted ARNT PAS-B homodimerization, albeit to varying degrees. Taken together, our findings provide a comprehensive overview of ARNT PAS-B ligand-binding sites and may guide the development of more potent coactivator inhibitors for cellular and functional studies.
Collapse
|
26
|
Li Z, Yue C, Xie S, Shi S, Ye S. Computational insights into the conformational transition of STING: Mechanistic, energetic considerations, and the influence of crucial mutations. J Mol Graph Model 2024; 129:108764. [PMID: 38581901 DOI: 10.1016/j.jmgm.2024.108764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2023] [Revised: 03/13/2024] [Accepted: 04/01/2024] [Indexed: 04/08/2024]
Abstract
STING (stimulator of interferon genes) is a crucial protein in the innate immune system's response to viral and bacterial infections. In this study, we investigated the mechanistic and energetic mechanism of the conformational transition process of STING activated by cGAMP binding. We found that the STING connector region undergoes an energetically unfavorable rotation during this process, which is compensated by the favorable interaction between cGAMP and the STING ligand binding domain. We further studied several disease-causing mutations and found that the V155 M mutation facilitates a smoother transition in the STING connector region. However, the V147L mutation exhibits unfavorable conformational transition energy, suggesting it may hinder STING activation pathway that relies on connector region rotation. Despite being labeled as hyperactive, the widespread prevalence of V147L/V147I mutations across species implies a neutral character, indicating complexity in its role. Overall, our analysis deepens the understanding of STING activation within the connector region, and targeting this region with compounds may provide an alternative approach to interfering with STING's function.
Collapse
Affiliation(s)
- Zhenlu Li
- School of Life Science, Tianjin University, 92 Weijin Road, Tianjin, 300072, China; Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin Key Laboratory of Function and Application of Biological Macromolecular Structures, Tianjin University, 92 Weijin Road, Tianjin, 300072, China.
| | - Congran Yue
- School of Life Science, Tianjin University, 92 Weijin Road, Tianjin, 300072, China
| | - Shangqiang Xie
- School of Life Science, Tianjin University, 92 Weijin Road, Tianjin, 300072, China
| | - Sai Shi
- School of Life Science, Tianjin University, 92 Weijin Road, Tianjin, 300072, China
| | - Sheng Ye
- School of Life Science, Tianjin University, 92 Weijin Road, Tianjin, 300072, China; Frontiers Science Center for Synthetic Biology (Ministry of Education), Tianjin Key Laboratory of Function and Application of Biological Macromolecular Structures, Tianjin University, 92 Weijin Road, Tianjin, 300072, China.
| |
Collapse
|
27
|
Yang DT, Chong LT. WEDAP: A Python Package for Streamlined Plotting of Molecular Simulation Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.18.594829. [PMID: 38826259 PMCID: PMC11142070 DOI: 10.1101/2024.05.18.594829] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/04/2024]
Abstract
Given the growing interest in path sampling methods for extending the timescales of molecular dynamics (MD) simulations, there has been great interest in software tools that streamline the generation of plots for monitoring the progress of large-scale simulations. Here, we present the WEDAP Python package for simplifying the analysis of data generated from either conventional MD simulations or the weighted ensemble (WE) path sampling method, as implemented in the widely used WESTPA software package. WEDAP facilitates (i) the parsing of WE simulation data stored in highly compressed, hierarchical HDF5 files, and (ii) incorporates trajectory weights from WE simulations into all generated plots. Our Python package consists of multiple user-friendly interfaces: a command-line interface, a graphical user interface, and a Python application programming interface. We demonstrate the plotting features of WEDAP through a series of examples using data from WE and conventional MD simulations that focus on the HIV-1 capsid protein C-terminal domain dimer as a showcase system. The source code for WEDAP is freely available on GitHub at https://github.com/chonglab-pitt/wedap .
Collapse
|
28
|
Bogetti A, Zwier MC, Chong LT. Revisiting Textbook Azide-Clock Reactions: A "Propeller-Crawling" Mechanism Explains Differences in Rates. J Am Chem Soc 2024; 146:12828-12835. [PMID: 38687173 PMCID: PMC11078601 DOI: 10.1021/jacs.4c03360] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2024] [Revised: 04/23/2024] [Accepted: 04/24/2024] [Indexed: 05/02/2024]
Abstract
An ongoing challenge to chemists is the analysis of pathways and kinetics for chemical reactions in solution, including transient structures between the reactants and products that are difficult to resolve using laboratory experiments. Here, we enabled direct molecular dynamics simulations of a textbook series of chemical reactions on the hundreds of ns to μs time scale using the weighted ensemble (WE) path sampling strategy with hybrid quantum mechanical/molecular mechanical (QM/MM) models. We focused on azide-clock reactions involving addition of an azide anion to each of three long-lived trityl cations in an acetonitrile-water solvent mixture. Results reveal a two-step mechanism: (1) diffusional collision of reactants to form an ion-pair intermediate; (2) "activation" or rearrangement of the intermediate to the product. Our simulations yield not only reaction rates that are within error of experiment but also rates for individual steps, indicating the activation step as rate-limiting for all three cations. Further, the trend in reaction rates is due to dynamical effects, i.e., differing extents of the azide anion "crawling" along the cation's phenyl-ring "propellers" during the activation step. Our study demonstrates the power of analyzing pathways and kinetics to gain insights on reaction mechanisms, underscoring the value of including WE and other related path sampling strategies in the modern toolbox for chemists.
Collapse
Affiliation(s)
- Anthony
T. Bogetti
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Matthew C. Zwier
- Department
of Chemistry, Drake University, Des Moines, Iowa 50311, United States
| | - Lillian T. Chong
- Department
of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
29
|
Ikizawa S, Hori T, Wijaya TN, Kono H, Bai Z, Kimizono T, Lu W, Tran DP, Kitao A. PaCS-Toolkit: Optimized Software Utilities for Parallel Cascade Selection Molecular Dynamics (PaCS-MD) Simulations and Subsequent Analyses. J Phys Chem B 2024; 128:3631-3642. [PMID: 38578072 PMCID: PMC11033871 DOI: 10.1021/acs.jpcb.4c01271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2024] [Revised: 03/26/2024] [Accepted: 03/26/2024] [Indexed: 04/06/2024]
Abstract
Parallel cascade selection molecular dynamics (PaCS-MD) is an enhanced conformational sampling method conducted as a "repetition of time leaps in parallel worlds", comprising cycles of multiple molecular dynamics (MD) simulations performed in parallel and selection of the initial structures of MDs for the next cycle. We developed PaCS-Toolkit, an optimized software utility enabling the use of different MD software and trajectory analysis tools to facilitate the execution of the PaCS-MD simulation and analyze the obtained trajectories, including the preparation for the subsequent construction of the Markov state model. PaCS-Toolkit is coded with Python, is compatible with various computing environments, and allows for easy customization by editing the configuration file and specifying the MD software and analysis tools to be used. We present the software design of PaCS-Toolkit and demonstrate applications of PaCS-MD variations: original targeted PaCS-MD to peptide folding; rmsdPaCS-MD to protein domain motion; and dissociation PaCS-MD to ligand dissociation from adenosine A2A receptor.
Collapse
Affiliation(s)
- Shinji Ikizawa
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tatsuki Hori
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tegar Nurwahyu Wijaya
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
- Department
of Chemistry, Universitas Pertamina, Jl. Teuku Nyak Arief, Simprug, Jakarta 12220, Indonesia
| | - Hiroshi Kono
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Zhen Bai
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Tatsuhiro Kimizono
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Wenbo Lu
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Duy Phuoc Tran
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| | - Akio Kitao
- School
of Life Science and Technology, Tokyo Institute
of Technology, 2-12-2 Ookayama, Meguro, Tokyo 152-8550, Japan
| |
Collapse
|
30
|
Latorraca NR, Sabaat S, Habrian C, Bleier J, Stanley C, Marqusee S, Isacoff EY. Domain coupling in activation of a family C GPCR. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.28.582567. [PMID: 38464305 PMCID: PMC10925283 DOI: 10.1101/2024.02.28.582567] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2024]
Abstract
The G protein-coupled metabotropic glutamate receptors form homodimers and heterodimers with highly diverse responses to glutamate and varying physiological function. The molecular basis for this diversity remains poorly delineated. We employ molecular dynamics, single-molecule spectroscopy, and hydrogen-deuterium exchange to dissect the pathway of activation triggered by glutamate. We find that activation entails multiple loosely coupled steps and identify a novel pre-active intermediate whose transition to the active state forms dimer interactions that set signaling efficacy. Such subunit interactions generate functional diversity that differs across homodimers and heterodimers. The agonist-bound receptor is remarkably dynamic, with low occupancy of G protein-coupling conformations, providing considerable headroom for modulation of the landscape by allosteric ligands. Sites of sequence diversity within the dimerization interface and diverse coupling between activation rearrangements may contribute to precise decoding of glutamate signals and transients over broad spatial and temporal scales.
Collapse
Affiliation(s)
- Naomi R. Latorraca
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
| | - Sam Sabaat
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
| | - Chris Habrian
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
| | - Julia Bleier
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
| | - Cherise Stanley
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
- Department of Chemistry, University of California, Berkeley, California, 94720 USA
- Chan Zuckerberg Biohub, San Francisco, CA 94158
- California Institute for Quantitative Biosciences, University of California, Berkeley, California, 94720 USA
| | - Ehud Y. Isacoff
- Department of Molecular and Cell Biology, University of California, Berkeley, California, 94720, USA
- California Institute for Quantitative Biosciences, University of California, Berkeley, California, 94720 USA
- Molecular Biology & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California, 94720, USA
- Weill Neurohub, University of California, Berkeley, California, 94720 USA
- Molecular Biology & Integrated Bioimaging Division, Lawrence Berkeley National Laboratory, Berkeley, California, 94720, USA
| |
Collapse
|
31
|
Santhouse JR, Leung JMG, Chong LT, Horne WS. Effects of altered backbone composition on the folding kinetics and mechanism of an ultrafast-folding protein. Chem Sci 2024; 15:675-682. [PMID: 38179541 PMCID: PMC10763558 DOI: 10.1039/d3sc03976e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 12/02/2023] [Indexed: 01/06/2024] Open
Abstract
Sequence-encoded protein folding is a ubiquitous biological process that has been successfully engineered in a range of oligomeric molecules with artificial backbone chemical connectivity. A remarkable aspect of protein folding is the contrast between the rapid rates at which most sequences in nature fold and the vast number of conformational states possible in an unfolded chain with hundreds of rotatable bonds. Research efforts spanning several decades have sought to elucidate the fundamental chemical principles that dictate the speed and mechanism of natural protein folding. In contrast, little is known about how protein mimetic entities transition between an unfolded and folded state. Here, we report effects of altered backbone connectivity on the folding kinetics and mechanism of the B domain of Staphylococcal protein A (BdpA), an ultrafast-folding sequence. A combination of experimental biophysical analysis and atomistic molecular dynamics simulations performed on the prototype protein and several heterogeneous-backbone variants reveal the interplay among backbone flexibility, folding rates, and structural details of the transition state ensemble. Collectively, these findings suggest a significant degree of plasticity in the mechanisms that can give rise to ultrafast folding in the BdpA sequence and provide atomic level insights into how protein mimetic chains adopt an ordered folded state.
Collapse
Affiliation(s)
| | - Jeremy M G Leung
- Department of Chemistry, University of Pittsburgh Pittsburgh PA 15260 USA
| | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh Pittsburgh PA 15260 USA
| | - W Seth Horne
- Department of Chemistry, University of Pittsburgh Pittsburgh PA 15260 USA
| |
Collapse
|
32
|
Chang L, Mondal A, Singh B, Martínez-Noa Y, Perez A. Revolutionizing Peptide-Based Drug Discovery: Advances in the Post-AlphaFold Era. WILEY INTERDISCIPLINARY REVIEWS. COMPUTATIONAL MOLECULAR SCIENCE 2024; 14:e1693. [PMID: 38680429 PMCID: PMC11052547 DOI: 10.1002/wcms.1693] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2023] [Accepted: 09/18/2023] [Indexed: 05/01/2024]
Abstract
Peptide-based drugs offer high specificity, potency, and selectivity. However, their inherent flexibility and differences in conformational preferences between their free and bound states create unique challenges that have hindered progress in effective drug discovery pipelines. The emergence of AlphaFold (AF) and Artificial Intelligence (AI) presents new opportunities for enhancing peptide-based drug discovery. We explore recent advancements that facilitate a successful peptide drug discovery pipeline, considering peptides' attractive therapeutic properties and strategies to enhance their stability and bioavailability. AF enables efficient and accurate prediction of peptide-protein structures, addressing a critical requirement in computational drug discovery pipelines. In the post-AF era, we are witnessing rapid progress with the potential to revolutionize peptide-based drug discovery such as the ability to rank peptide binders or classify them as binders/non-binders and the ability to design novel peptide sequences. However, AI-based methods are struggling due to the lack of well-curated datasets, for example to accommodate modified amino acids or unconventional cyclization. Thus, physics-based methods, such as docking or molecular dynamics simulations, continue to hold a complementary role in peptide drug discovery pipelines. Moreover, MD-based tools offer valuable insights into binding mechanisms, as well as the thermodynamic and kinetic properties of complexes. As we navigate this evolving landscape, a synergistic integration of AI and physics-based methods holds the promise of reshaping the landscape of peptide-based drug discovery.
Collapse
Affiliation(s)
- Liwei Chang
- Department of Chemistry, University of Florida, Gainesville, FL 32611
| | - Arup Mondal
- Department of Chemistry, University of Florida, Gainesville, FL 32611
| | - Bhumika Singh
- Department of Chemistry, University of Florida, Gainesville, FL 32611
| | | | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, FL 32611
| |
Collapse
|
33
|
Bogetti A, Leung JMG, Chong LT. LPATH: A Semiautomated Python Tool for Clustering Molecular Pathways. J Chem Inf Model 2023; 63:7610-7616. [PMID: 38048485 PMCID: PMC10751797 DOI: 10.1021/acs.jcim.3c01318] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 10/14/2023] [Accepted: 11/09/2023] [Indexed: 12/06/2023]
Abstract
The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here, we present the LPATH Python tool, which implements a semiautomated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7eq to C7ax conformational transition of the alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities.
Collapse
Affiliation(s)
- Anthony
T. Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Jeremy M. G. Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| |
Collapse
|
34
|
Bogetti AT, Leung JMG, Chong LT. LPATH: A semi-automated Python tool for clustering molecular pathways. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.08.17.553774. [PMID: 37645995 PMCID: PMC10462149 DOI: 10.1101/2023.08.17.553774] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/01/2023]
Abstract
The pathways by which a molecular process transitions to a target state are highly sought-after as direct views of a transition mechanism. While great strides have been made in the physics-based simulation of such pathways, the analysis of these pathways can be a major challenge due to their diversity and variable lengths. Here we present the LPATH Python tool, which implements a semi-automated method for linguistics-assisted clustering of pathways into distinct classes (or routes). This method involves three steps: 1) discretizing the configurational space into key states, 2) extracting a text-string sequence of key visited states for each pathway, and 3) pairwise matching of pathways based on a text-string similarity score. To circumvent the prohibitive memory requirements of the first step, we have implemented a general two-stage method for clustering conformational states that exploits machine learning. LPATH is primarily designed for use with the WESTPA software for weighted ensemble simulations; however, the tool can also be applied to conventional simulations. As demonstrated for the C7eq to C7ax conformational transition of alanine dipeptide, LPATH provides physically reasonable classes of pathways and corresponding probabilities.
Collapse
Affiliation(s)
- Anthony T. Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Jeremy M. G. Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260
| |
Collapse
|
35
|
Bogetti X, Bogetti A, Casto J, Rule G, Chong L, Saxena S. Direct observation of negative cooperativity in a detoxification enzyme at the atomic level by Electron Paramagnetic Resonance spectroscopy and simulation. Protein Sci 2023; 32:e4770. [PMID: 37632831 PMCID: PMC10503414 DOI: 10.1002/pro.4770] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2023] [Revised: 07/14/2023] [Accepted: 08/23/2023] [Indexed: 08/28/2023]
Abstract
The catalytic activity of human glutathione S-transferase A1-1 (hGSTA1-1), a homodimeric detoxification enzyme, is dependent on the conformational dynamics of a key C-terminal helix α9 in each monomer. However, the structural details of how the two monomers interact upon binding of substrates is not well understood and the structure of the ligand-free state of the hGSTA1-1 homodimer has not been resolved. Here, we used a combination of electron paramagnetic resonance (EPR) distance measurements and weighted ensemble (WE) simulations to characterize the conformational ensemble of the ligand-free state at the atomic level. EPR measurements reveal a broad distance distribution between a pair of Cu(II) labels in the ligand-free state that gradually shifts and narrows as a function of increasing ligand concentration. These shifts suggest changes in the relative positioning of the two α9 helices upon ligand binding. WE simulations generated unbiased pathways for the seconds-timescale transition between alternate states of the enzyme, leading to the generation of atomically detailed structures of the ligand-free state. Notably, the simulations provide direct observations of negative cooperativity between the monomers of hGSTA1-1, which involve the mutually exclusive docking of α9 in each monomer as a lid over the active site. We identify key interactions between residues that lead to this negative cooperativity. Negative cooperativity may be essential for interaction of hGSTA1-1 with a wide variety of toxic substrates and their subsequent neutralization. More broadly, this work demonstrates the power of integrating EPR distances with WE rare-events sampling strategy to gain mechanistic information on protein function at the atomic level.
Collapse
Affiliation(s)
- Xiaowei Bogetti
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Anthony Bogetti
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Joshua Casto
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Gordon Rule
- Department of Biological SciencesCarnegie Mellon UniversityPittsburghPennsylvaniaUSA
| | - Lillian Chong
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| | - Sunil Saxena
- Department of ChemistryUniversity of PittsburghPittsburghPennsylvaniaUSA
| |
Collapse
|
36
|
Tang R, Wang Z, Xiang S, Wang L, Yu Y, Wang Q, Deng Q, Hou T, Sun H. Uncovering the Kinetic Characteristics and Degradation Preference of PROTAC Systems with Advanced Theoretical Analyses. JACS AU 2023; 3:1775-1789. [PMID: 37388700 PMCID: PMC10301679 DOI: 10.1021/jacsau.3c00195] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Revised: 05/16/2023] [Accepted: 05/16/2023] [Indexed: 07/01/2023]
Abstract
Proteolysis-targeting chimeras (PROTACs), which can selectively induce the degradation of target proteins, represent an attractive technology in drug discovery. A large number of PROTACs have been reported, but due to the complicated structural and kinetic characteristics of the target-PROTAC-E3 ligase ternary interaction process, the rational design of PROTACs is still quite challenging. Here, we characterized and analyzed the kinetic mechanism of MZ1, a PROTAC that targets the bromodomain (BD) of the bromodomain and extra terminal (BET) protein (Brd2, Brd3, or Brd4) and von Hippel-Lindau E3 ligase (VHL), from the kinetic and thermodynamic perspectives of view by using enhanced sampling simulations and free energy calculations. The simulations yielded satisfactory predictions on the relative residence time and standard binding free energy (rp > 0.9) for MZ1 in different BrdBD-MZ1-VHL ternary complexes. Interestingly, the simulation of the PROTAC ternary complex disintegration illustrates that MZ1 tends to remain on the surface of VHL with the BD proteins dissociating alone without a specific dissociation direction, indicating that the PROTAC prefers more to bind with E3 ligase at the first step in the formation of the target-PROTAC-E3 ligase ternary complex. Further exploration of the degradation difference of MZ1 in different Brd systems shows that the PROTAC with higher degradation efficiency tends to leave more lysine exposed on the target protein, which is guaranteed by the stability (binding affinity) and durability (residence time) of the target-PROTAC-E3 ligase ternary complex. It is quite possible that the underlying binding characteristics of the BrdBD-MZ1-VHL systems revealed by this study may be shared by different PROTAC systems as a general rule, which may accelerate rational PROTAC design with higher degradation efficiency.
Collapse
Affiliation(s)
- Rongfan Tang
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Zhe Wang
- Innovation
Institute for Artificial Intelligence in Medicine of Zhejiang University,
College of Pharmaceutical Sciences, Zhejiang
University, Hangzhou 310058, Zhejiang, P. R. China
| | - Sutong Xiang
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Lingling Wang
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Yang Yu
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Qinghua Wang
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Qirui Deng
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| | - Tingjun Hou
- Innovation
Institute for Artificial Intelligence in Medicine of Zhejiang University,
College of Pharmaceutical Sciences, Zhejiang
University, Hangzhou 310058, Zhejiang, P. R. China
| | - Huiyong Sun
- Department
of Medicinal Chemistry, China Pharmaceutical
University, Nanjing 210009, Jiangsu, P. R. China
| |
Collapse
|
37
|
Ojha AA, Thakur S, Ahn SH, Amaro RE. DeepWEST: Deep Learning of Kinetic Models with the Weighted Ensemble Simulation Toolkit for Enhanced Sampling. J Chem Theory Comput 2023; 19:1342-1359. [PMID: 36719802 DOI: 10.1021/acs.jctc.2c00282] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Recent advances in computational power and algorithms have enabled molecular dynamics (MD) simulations to reach greater time scales. However, for observing conformational transitions associated with biomolecular processes, MD simulations still have limitations. Several enhanced sampling techniques seek to address this challenge, including the weighted ensemble (WE) method, which samples transitions between metastable states using many weighted trajectories to estimate kinetic rate constants. However, initial sampling of the potential energy surface has a significant impact on the performance of WE, i.e., convergence and efficiency. We therefore introduce deep-learned kinetic modeling approaches that extract statistically relevant information from short MD trajectories to provide a well-sampled initial state distribution for WE simulations. This hybrid approach overcomes any statistical bias to the system, as it runs short unbiased MD trajectories and identifies meaningful metastable states of the system. It is shown to provide a more refined free energy landscape closer to the steady state that could efficiently sample kinetic properties such as rate constants.
Collapse
Affiliation(s)
- Anupam Anand Ojha
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| | - Saumya Thakur
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai, Maharashtra400076, India
| | - Surl-Hee Ahn
- Department of Chemical Engineering, University of California Davis, Davis, California95616, United States
| | - Rommie E Amaro
- Department of Chemistry, University of California San Diego, La Jolla, California92093, United States
| |
Collapse
|
38
|
Choe S. Translocation of a single Arg[Formula: see text] peptide across a DOPC/DOPG(4:1) model membrane using the weighted ensemble method. Sci Rep 2023; 13:1168. [PMID: 36670187 PMCID: PMC9860060 DOI: 10.1038/s41598-023-28493-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Accepted: 01/19/2023] [Indexed: 01/21/2023] Open
Abstract
It is difficult to observe a spontaneous translocation of cell-penetrating peptides(CPPs) within a short time scale (e.g., a few hundred ns) in all-atom molecular dynamics(MD) simulations because the time required for the translocation of usual CPPs is on the order of minutes or so. In this work, we report a spontaneous translocation of a single Arg[Formula: see text](R9) across a DOPC/DOPG(4:1) model membrane within an order of a few tens ns scale by using the weighted ensemble(WE) method. We identify how water molecules and the orientation of Arg[Formula: see text] play a role in translocation. We also show how lipid molecules are transported along with Arg[Formula: see text]. In addition, we present free energy profiles of the translocation across the membrane using umbrella sampling and show that a single Arg[Formula: see text] translocation is energetically unfavorable. We expect that the WE method can help study interactions of CPPs with various model membranes within MD simulation approaches.
Collapse
Affiliation(s)
- Seungho Choe
- Department of Energy Science & Engineering, Daegu Gyeongbuk Institute of Science & Technology (DGIST), Daegu, 42988 South Korea
- Energy Science & Engineering Research Center, Daegu Gyeongbuk Institute of Science & Technology (DGIST), Daegu, 42988 South Korea
| |
Collapse
|
39
|
Aristoff D, Copperman J, Simpson G, Webber RJ, Zuckerman DM. Weighted ensemble: Recent mathematical developments. J Chem Phys 2023; 158:014108. [PMID: 36610976 PMCID: PMC9822651 DOI: 10.1063/5.0110873] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 12/04/2022] [Indexed: 12/12/2022] Open
Abstract
Weighted ensemble (WE) is an enhanced sampling method based on periodically replicating and pruning trajectories generated in parallel. WE has grown increasingly popular for computational biochemistry problems due, in part, to improved hardware and accessible software implementations. Algorithmic and analytical improvements have played an important role, and progress has accelerated in recent years. Here, we discuss and elaborate on the WE method from a mathematical perspective, highlighting recent results that enhance the computational efficiency. The mathematical theory reveals a new strategy for optimizing trajectory management that approaches the best possible variance while generalizing to systems of arbitrary dimension.
Collapse
Affiliation(s)
- D. Aristoff
- Mathematics, Colorado State University, Fort Collins, CO 80521 USA
| | - J. Copperman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| | - G. Simpson
- Mathematics, Drexel University, Philadelphia, Pennsylvania 19104 USA
| | - R. J. Webber
- Computing and Mathematical Sciences, California Institute of Technology, Pasadena, California 91125 USA
| | - D. M. Zuckerman
- Biomedical Engineering, Oregon Health and Science University, Portland, OR 97239 USA
| |
Collapse
|
40
|
Bogetti AT, Leung JMG, Russo JD, Zhang S, Thompson JP, Saglam AS, Ray D, Mostofian B, Pratt AJ, Abraham RC, Harrison PO, Dudek M, Torrillo PA, DeGrave AJ, Adhikari U, Faeder JR, Andricioaei I, Adelman JL, Zwier MC, LeBard DN, Zuckerman DM, Chong LT. A Suite of Tutorials for the WESTPA 2.0 Rare-Events Sampling Software [Article v2.0]. LIVING JOURNAL OF COMPUTATIONAL MOLECULAR SCIENCE 2023; 5:1655. [PMID: 37200895 PMCID: PMC10191340 DOI: 10.33011/livecoms.5.1.1655] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/20/2023]
Abstract
The weighted ensemble (WE) strategy has been demonstrated to be highly efficient in generating pathways and rate constants for rare events such as protein folding and protein binding using atomistic molecular dynamics simulations. Here we present two sets of tutorials instructing users in the best practices for preparing, carrying out, and analyzing WE simulations for various applications using the WESTPA software. The first set of more basic tutorials describes a range of simulation types, from a molecular association process in explicit solvent to more complex processes such as host-guest association, peptide conformational sampling, and protein folding. The second set ecompasses six advanced tutorials instructing users in the best practices of using key new features and plugins/extensions of the WESTPA 2.0 software package, which consists of major upgrades for larger systems and/or slower processes. The advanced tutorials demonstrate the use of the following key features: (i) a generalized resampler module for the creation of "binless" schemes, (ii) a minimal adaptive binning scheme for more efficient surmounting of free energy barriers, (iii) streamlined handling of large simulation datasets using an HDF5 framework, (iv) two different schemes for more efficient rate-constant estimation, (v) a Python API for simplified analysis of WE simulations, and (vi) plugins/extensions for Markovian Weighted Ensemble Milestoning and WE rule-based modeling for systems biology models. Applications of the advanced tutorials include atomistic and non-spatial models, and consist of complex processes such as protein folding and the membrane permeability of a drug-like molecule. Users are expected to already have significant experience with running conventional molecular dynamics or systems biology simulations.
Collapse
Affiliation(s)
| | | | - John D. Russo
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | | | | | - Ali S. Saglam
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Dhiman Ray
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Barmak Mostofian
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - AJ Pratt
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Rhea C. Abraham
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Page O. Harrison
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Max Dudek
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Paul A. Torrillo
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Alex J. DeGrave
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| | - Upendra Adhikari
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - James R. Faeder
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | - Ioan Andricioaei
- Department of Chemistry, University of California Irvine, Irvine, CA
| | - Joshua L. Adelman
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA
| | | | | | - Daniel M. Zuckerman
- Department of Biomedical Engineering, Oregon Health and Science University, Portland, OR
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA
| |
Collapse
|
41
|
Dommer A, Casalino L, Kearns F, Rosenfeld M, Wauer N, Ahn SH, Russo J, Oliveira S, Morris C, Bogetti A, Trifan A, Brace A, Sztain T, Clyde A, Ma H, Chennubhotla C, Lee H, Turilli M, Khalid S, Tamayo-Mendoza T, Welborn M, Christensen A, Smith DG, Qiao Z, Sirumalla SK, O'Connor M, Manby F, Anandkumar A, Hardy D, Phillips J, Stern A, Romero J, Clark D, Dorrell M, Maiden T, Huang L, McCalpin J, Woods C, Gray A, Williams M, Barker B, Rajapaksha H, Pitts R, Gibbs T, Stone J, Zuckerman DM, Mulholland AJ, Miller T, Jha S, Ramanathan A, Chong L, Amaro RE. #COVIDisAirborne: AI-enabled multiscale computational microscopy of delta SARS-CoV-2 in a respiratory aerosol. THE INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS 2023; 37:28-44. [PMID: 36647365 PMCID: PMC9527558 DOI: 10.1177/10943420221128233] [Citation(s) in RCA: 36] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 05/29/2023]
Abstract
We seek to completely revise current models of airborne transmission of respiratory viruses by providing never-before-seen atomic-level views of the SARS-CoV-2 virus within a respiratory aerosol. Our work dramatically extends the capabilities of multiscale computational microscopy to address the significant gaps that exist in current experimental methods, which are limited in their ability to interrogate aerosols at the atomic/molecular level and thus obscure our understanding of airborne transmission. We demonstrate how our integrated data-driven platform provides a new way of exploring the composition, structure, and dynamics of aerosols and aerosolized viruses, while driving simulation method development along several important axes. We present a series of initial scientific discoveries for the SARS-CoV-2 Delta variant, noting that the full scientific impact of this work has yet to be realized.
Collapse
Affiliation(s)
| | | | | | | | | | | | - John Russo
- Oregon Health & Science University, Portland, OR, USA
| | | | | | | | - Anda Trifan
- Argonne National Laboratory, Lemont, IL, USA
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - Alexander Brace
- Argonne National Laboratory, Lemont, IL, USA
- University of Chicago, Chicago, IL, USA
| | - Terra Sztain
- UC San Diego, La Jolla, CA, USA
- Freie Universitat Berlin
| | - Austin Clyde
- Argonne National Laboratory, Lemont, IL, USA
- University of Chicago, Chicago, IL, USA
| | - Heng Ma
- Argonne National Laboratory, Lemont, IL, USA
| | | | - Hyungro Lee
- Brookhaven National Lab and Rutgers University
| | | | | | | | | | | | | | - Zhuoran Qiao
- California Institute of Technology, Pasadena, CA, USA
| | | | | | | | - Anima Anandkumar
- California Institute of Technology, Pasadena, CA, USA
- NVIDIA Corp, Santa Clara, CA, USA
| | - David Hardy
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | - James Phillips
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
| | | | | | | | | | - Tom Maiden
- Pittsburgh Supercomputing Center, Pittsburgh, PA, USA
| | - Lei Huang
- Texas Advanced Computing Center, Austin, TX, USA
| | | | | | | | | | | | | | | | | | - John Stone
- University of Illinois at Urbana-Champaign, Urbana, IL, USA
- NVIDIA Corp, Santa Clara, CA, USA
| | | | | | - Thomas Miller
- Entos, Inc., San Diego, CA, USA
- California Institute of Technology, Pasadena, CA, USA
| | | | | | | | | |
Collapse
|
42
|
Vymětal J, Vondrášek J. Iterative Landmark-Based Umbrella Sampling (ILBUS) Protocol for Sampling of Conformational Space of Biomolecules. J Chem Inf Model 2022; 62:4783-4798. [PMID: 36122323 DOI: 10.1021/acs.jcim.2c00370] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Computer simulations of biomolecules such as molecular dynamics often suffer from insufficient sampling. Due to limited computational resources, insufficient sampling prevents obtaining proper equilibrium distributions of observed properties. To deal with this problem, we proposed a simulation protocol for efficient resampling of collected off-equilibrium trajectories. These trajectories are utilized for the initial mapping of the conformational space, which is later properly resampled by the introduced Iterative Landmark-Based Umbrella Sampling (ILBUS) method. Reconstruction of static equilibrium properties is achieved by the multistate Bennett acceptance ratio (MBAR) method, which enables efficient use of simulated data. The ILBUS protocol is geometry-based and does not demand any additional collective variable or a dimensional-reduction technique. The only requirement is a set of suitably spaced reference conformations, which serve as landmarks in the mapped conformational space. Additionally, the ILBUS protocol encompasses an iterative process that optimizes the force constant used in the umbrella sampling simulation. Such tuning is an inherent feature of the protocol and does not need to be performed by the user in advance. Furthermore, even the simulations with suboptimal force constants can be used in estimates by MBAR. We demonstrate the feasibility and the performance of this approach in the study of the conformational landscape of the alanine dipeptide, met-enkephalin, and adenylate kinase.
Collapse
Affiliation(s)
- Jiří Vymětal
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 542/2, 160 00 Praha 6, Czech Republic
| | - Jiří Vondrášek
- Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo náměstí 542/2, 160 00 Praha 6, Czech Republic
| |
Collapse
|
43
|
Kleiman DE, Shukla D. Multiagent Reinforcement Learning-Based Adaptive Sampling for Conformational Dynamics of Proteins. J Chem Theory Comput 2022; 18:5422-5434. [PMID: 36044642 DOI: 10.1021/acs.jctc.2c00683] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Machine learning is increasingly applied to improve the efficiency and accuracy of molecular dynamics (MD) simulations. Although the growth of distributed computer clusters has allowed researchers to obtain higher amounts of data, unbiased MD simulations have difficulty sampling rare states, even under massively parallel adaptive sampling schemes. To address this issue, several algorithms inspired by reinforcement learning (RL) have arisen to promote exploration of the slow collective variables (CVs) of complex systems. Nonetheless, most of these algorithms are not well-suited to leverage the information gained by simultaneously sampling a system from different initial states (e.g., a protein in different conformations associated with distinct functional states). To fill this gap, we propose two algorithms inspired by multiagent RL that extend the functionality of closely related techniques (REAP and TSLC) to situations where the sampling can be accelerated by learning from different regions of the energy landscape through coordinated agents. Essentially, the algorithms work by remembering which agent discovered each conformation and sharing this information with others at the action-space discretization step. A stakes function is introduced to modulate how different agents sense rewards from discovered states of the system. The consequences are three-fold: (i) agents learn to prioritize CVs using only relevant data, (ii) redundant exploration is reduced, and (iii) agents that obtain higher stakes are assigned more actions. We compare our algorithm with other adaptive sampling techniques (least counts, REAP, TSLC, and AdaptiveBandit) to show and rationalize the gain in performance.
Collapse
Affiliation(s)
- Diego E Kleiman
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| | - Diwakar Shukla
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Chemical and Biomolecular Engineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Bioengineering, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States.,Department of Plant Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, United States
| |
Collapse
|
44
|
Zhang S, Thompson JP, Xia J, Bogetti AT, York F, Skillman AG, Chong LT, LeBard DN. Mechanistic Insights into Passive Membrane Permeability of Drug-like Molecules from a Weighted Ensemble of Trajectories. J Chem Inf Model 2022; 62:1891-1904. [PMID: 35421313 PMCID: PMC9044451 DOI: 10.1021/acs.jcim.1c01540] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
![]()
Passive permeability
of a drug-like molecule is a critical property
assayed early in a drug discovery campaign that informs a medicinal
chemist how well a compound can traverse biological membranes, such
as gastrointestinal epithelial or restrictive organ barriers, so it
can perform a specific therapeutic function. However, the challenge
that remains is the development of a method, experimental or computational,
which can both determine the permeation rate and provide mechanistic
insights into the transport process to help with the rational design
of any given molecule. Typically, one of the following three methods
are used to measure the membrane permeability: (1) experimental permeation
assays acting on either artificial or natural membranes; (2) quantitative
structure–permeability relationship models that rely on experimental
values of permeability or related pharmacokinetic properties of a
range of molecules to infer those for new molecules; and (3) estimation
of permeability from the Smoluchowski equation, where free energy
and diffusion profiles along the membrane normal are taken as input
from large-scale molecular dynamics simulations. While all these methods
provide estimates of permeation coefficients, they provide very little
information for guiding rational drug design. In this study, we employ
a highly parallelizable weighted ensemble (WE) path sampling strategy,
empowered by cloud computing techniques, to generate unbiased permeation
pathways and permeability coefficients for a set of drug-like molecules
across a neat 1-palmitoyl-2-oleoyl-sn-glycero-3-phosphatidylcholine
membrane bilayer. Our WE method predicts permeability coefficients
that compare well to experimental values from an MDCK-LE cell line
and PAMPA assays for a set of drug-like amines of varying size, shape,
and flexibility. Our method also yields a series of continuous permeation
pathways weighted and ranked by their associated probabilities. Taken
together, the ensemble of reactive permeation pathways, along with
the estimate of the permeability coefficient, provides a clearer picture
of the microscopic underpinnings of small-molecule membrane permeation.
Collapse
Affiliation(s)
- She Zhang
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Jeff P Thompson
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Junchao Xia
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | - Anthony T Bogetti
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - Forrest York
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| | | | - Lillian T Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, Pennsylvania 15260, United States
| | - David N LeBard
- OpenEye Scientific, Santa Fe, New Mexico 87508, United States
| |
Collapse
|
45
|
Santhouse JR, Leung JMG, Chong LT, Horne WS. Implications of the unfolded state in the folding energetics of heterogeneous-backbone protein mimetics. Chem Sci 2022; 13:11798-11806. [PMID: 36320921 PMCID: PMC9580521 DOI: 10.1039/d2sc04427g] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2022] [Accepted: 09/19/2022] [Indexed: 12/28/2022] Open
Abstract
Sequence-encoded folding is the foundation of protein structure and is also possible in synthetic chains of artificial chemical composition. In natural proteins, the characteristics of the unfolded state are as important as those of the folded state in determining folding energetics. While much is known about folded structures adopted by artificial protein-like chains, corresponding information about the unfolded states of these molecules is lacking. Here, we report the consequences of altered backbone composition on the structure, stability, and dynamics of the folded and unfolded states of a compact helix-rich protein. Characterization through a combination of biophysical experiments and atomistic simulation reveals effects of backbone modification that depend on both the type of artificial monomers employed and where they are applied in sequence. In general, introducing artificial connectivity in a way that reinforces characteristics of the unfolded state ensemble of the prototype natural protein minimizes the impact of chemical changes on folded stability. These findings have implications in the design of protein mimetics and provide an atomically detailed picture of the unfolded state of a natural protein and artificial analogues under non-denaturing conditions. Biophysical experiments and atomistic simulation reveal impacts of protein backbone alteration on the ensemble that defines the unfolded state. These effects have implications on folded stability of protein mimetics.![]()
Collapse
Affiliation(s)
| | - Jeremy M. G. Leung
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| | - W. Seth Horne
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, 15211, USA
| |
Collapse
|