1
|
Correy GJ, Rachman MM, Togo T, Gahbauer S, Doruk YU, Stevens MGV, Jaishankar P, Kelley B, Goldman B, Schmidt M, Kramer T, Radchenko DS, Moroz YS, Ashworth A, Riley P, Shoichet BK, Renslo AR, Walters WP, Fraser JS. Exploration of structure-activity relationships for the SARS-CoV-2 macrodomain from shape-based fragment linking and active learning. SCIENCE ADVANCES 2025; 11:eads7187. [PMID: 40435250 PMCID: PMC12118597 DOI: 10.1126/sciadv.ads7187] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Accepted: 04/22/2025] [Indexed: 06/01/2025]
Abstract
The macrodomain of severe acute respiratory syndrome coronavirus 2 nonstructural protein 3 is required for viral pathogenesis and is an emerging antiviral target. We previously performed an x-ray crystallography-based fragment screen and found submicromolar inhibitors by fragment linking. However, these compounds had poor membrane permeability and liabilities that complicated optimization. Here, we developed a shape-based virtual screening pipeline-FrankenROCS. We screened the Enamine high-throughput collection of 2.1 million compounds, selecting 39 compounds for testing, with the most potent binding with a 130 μM median inhibitory concentration (IC50). We then paired FrankenROCS with an active learning algorithm (Thompson sampling) to efficiently search the Enamine REAL database of 22 billion molecules, testing 32 compounds with the most potent binding with a 220 μM IC50. Further optimization led to analogs with IC50 values better than 10 μM. This lead series has improved membrane permeability and is poised for optimization. FrankenROCS is a scalable method for fragment linking to exploit synthesis-on-demand libraries.
Collapse
Affiliation(s)
- Galen J. Correy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| | - Moira M. Rachman
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | - Takaya Togo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | - Stefan Gahbauer
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | - Yagmur U. Doruk
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158, USA
| | - Maisie G. V. Stevens
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158, USA
| | - Priyadarshini Jaishankar
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | | | | | | | | | | | - Yurii S. Moroz
- Enamine Ltd., Kyiv, Ukraine
- Chemspace LLC, Kyiv, Ukraine
- Department of Chemistry, Taras Shevchenko National University of Kyiv, Kyiv, Ukraine
| | - Alan Ashworth
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158, USA
| | | | - Brian K. Shoichet
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | - Adam R. Renslo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158, USA
| | | | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158, USA
| |
Collapse
|
2
|
Raju A, Sharma S, Riley BT, Djuraev S, Tan Y, Kim M, Mahmud T, Keedy DA. Mapping allosteric rewiring in related protein structures from collections of crystallographic multiconformer models. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.05.23.655529. [PMID: 40501656 PMCID: PMC12154631 DOI: 10.1101/2025.05.23.655529] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 06/22/2025]
Abstract
How do related proteins with a common fold perform diverse biological functions? Although the average structure may be similar, structural excursions from this average may differ, giving rise to allosteric rewiring that enables differential activity and regulation. However, this idea has been difficult to test in detail. Here we used the qFit algorithm to model "hidden" alternate conformations from electron density maps for an entire protein family, the Protein Tyrosine Phosphatases (PTPs), spanning 26 enzymes and 221 structures. To interrogate these multiconformer models, we developed a new algorithm, Residue Interaction Networks From Alternate conformations In RElated structures (RINFAIRE), that calculates networks of interactions between flexible residues and quantitatively compares them. We show that PTPs share a common allosteric network which rewires dynamically in response to catalytic loop motions or active-site vs. allosteric ligand binding, but also that individual PTPs have unique allosteric signatures. As experimental validation, we show that targeted mutations at residues with varying sequence conservation but high network connectivity modulate enzyme catalysis, including a surprising enhancement of activity. Overall, our work provides new tools for understanding how evolution has recycled modular macromolecular building blocks to diversify biological function. RINFAIRE is available at https://github.com/keedylab/rinfaire .
Collapse
Affiliation(s)
- Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- PhD Program in Biology, CUNY Graduate Center, New York, NY 10016
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Shakhriyor Djuraev
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Yingxian Tan
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
| | - Minyoung Kim
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Toufique Mahmud
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
- PhD Programs in Biochemistry, Biology, & Chemistry, CUNY Graduate Center, New York, NY 10016
| |
Collapse
|
3
|
Wankowicz SA, Fraser JS. Advances in uncovering the mechanisms of macromolecular conformational entropy. Nat Chem Biol 2025; 21:623-634. [PMID: 40275100 PMCID: PMC12103944 DOI: 10.1038/s41589-025-01879-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 03/10/2025] [Indexed: 04/26/2025]
Abstract
During protein folding, proteins transition from a disordered polymer into a globular structure, markedly decreasing their conformational degrees of freedom, leading to a substantial reduction in entropy. Nonetheless, folded proteins retain substantial entropy as they fluctuate between the conformations that make up their native state. This residual entropy contributes to crucial functions like binding and catalysis, supported by growing evidence primarily from NMR and simulation studies. Here, we propose three major ways that macromolecules use conformational entropy to perform their functions; first, prepaying entropic cost through ordering of the ground state; second, spatially redistributing entropy, in which a decrease in entropy in one area is reciprocated by an increase in entropy elsewhere; third, populating catalytically competent ensembles, in which conformational entropy within the enzymatic scaffold aids in lowering transition state barriers. We also provide our perspective on how solving the current challenge of structurally defining the ensembles encoding conformational entropy will lead to new possibilities for controlling binding, catalysis and allostery.
Collapse
Affiliation(s)
- Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA, USA.
| |
Collapse
|
4
|
Cavender CE, Case DA, Chen JCH, Chong LT, Keedy DA, Lindorff-Larsen K, Mobley DL, Ollila OHS, Oostenbrink C, Robustelli P, Voelz VA, Wall ME, Wych DC, Gilson MK. Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields [Article v0.1]. ARXIV 2025:arXiv:2303.11056v2. [PMID: 40196146 PMCID: PMC11975311] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
This review article provides an overview of structurally oriented experimental datasets that can be used to benchmark protein force fields, focusing on data generated by nuclear magnetic resonance (NMR) spectroscopy and room temperature (RT) protein crystallography. We discuss what the observables are, what they tell us about structure and dynamics, what makes them useful for assessing force field accuracy, and how they can be connected to molecular dynamics simulations carried out using the force field one wishes to benchmark. We also touch on statistical issues that arise when comparing simulations with experiment. We hope this article will be particularly useful to computational researchers and trainees who develop, benchmark, or use protein force fields for molecular simulations.
Collapse
Affiliation(s)
- Chapin E. Cavender
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| | - David A. Case
- Department of Chemistry & Chemical Biology, Rutgers University, Piscataway, NJ, USA
| | - Julian C.-H. Chen
- Bioscience Division, Los Alamos National Laboratory, Los Alamos, NM, USA; Department of Chemistry and Biochemistry, The University of Toledo, Toledo, OH, USA
| | - Lillian T. Chong
- Department of Chemistry, University of Pittsburgh, Pittsburgh, PA, USA
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY, USA; Department of Chemistry and Biochemistry, City College of New York, New York, NY, USA; PhD Programs in Biochemistry, Biology, and Chemistry, CUNY Graduate Center, New York, NY, USA
| | - Kresten Lindorff-Larsen
- Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - David L. Mobley
- Department of Pharmaceutical Sciences, University of California Irvine, Irvine, CA, USA
| | - O. H. Samuli Ollila
- Institute of Biotechnology, University of Helsinki, Helsinki, Finland; VTT Technical Research Centre of Finland, Espoo, Finland
| | - Chris Oostenbrink
- Institute for Molecular Modeling and Simulation, University of Natural Resources and Life Sciences, Vienna, Austria
| | - Paul Robustelli
- Department of Chemistry, Dartmouth College, Hanover, NH, USA
| | - Vincent A. Voelz
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Michael E. Wall
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA; The Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - David C. Wych
- Computer, Computational, and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM, USA; The Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, USA
| | - Michael K. Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA, USA
| |
Collapse
|
5
|
Stachowski TR, Fischer M. FLEXR-MSA: electron-density map comparisons of sequence-diverse structures. IUCRJ 2025; 12:245-254. [PMID: 40014007 PMCID: PMC11878447 DOI: 10.1107/s2052252525001332] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/19/2024] [Accepted: 02/13/2025] [Indexed: 02/28/2025]
Abstract
Proteins with near-identical sequences often share similar static structures. Yet, comparing crystal structures is limited or even biased by what has been included or omitted in the deposited model. Information about unique dynamics is often hidden in electron-density maps. Currently, automatic map comparisons are limited to sequence-identical structures. To overcome this limitation, we developed FLEXR-MSA, which enables unbiased electron-density map comparisons of sequence-diverse structures by coupling multiple sequence alignment (MSA) with electron-density sampling. FLEXR-MSA generates visualizations that pinpoint low-occupancy features on the residue level and chart them across the protein surface to reveal global changes. To exemplify the utility of this tool, we probed electron densities for protein-wide alternative conformations of HSP90 across four human isoforms and other homologs. Our analysis demonstrates that FLEXR-MSA can reveal hidden differences among HSP90 variants bound to clinically important ligands. Integrating this new functionality into the FLEXR suite of tools links the comparison of conformational landscapes hidden in electron-density maps to the building of multi-conformer models that reveal structural/functional differences that might be of interest when designing selective ligands.
Collapse
Affiliation(s)
- Timothy R. Stachowski
- Department of Chemical Biology and Therapeutics, MS 1000St Jude Children’s Research Hospital262 Danny Thomas PlaceMemphisTN38105USA
| | - Marcus Fischer
- Department of Chemical Biology and Therapeutics, MS 1000St Jude Children’s Research Hospital262 Danny Thomas PlaceMemphisTN38105USA
| |
Collapse
|
6
|
Yabukarski F. Ensemble-function relationships: From qualitative to quantitative relationships between protein structure and function. J Struct Biol 2025; 217:108152. [PMID: 39577782 DOI: 10.1016/j.jsb.2024.108152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2024] [Revised: 11/03/2024] [Accepted: 11/19/2024] [Indexed: 11/24/2024]
Abstract
Structure-function relationships are deeply rooted in modern biochemistry and structural biology and have provided the basis for our understanding of how protein structure defines function. While structure-function relationships continue to provide invaluable qualitative information, they cannot, in principle, provide the quantitative information ultimately needed to fully understand how proteins function and to make quantitative predictions about changes in activity from changes in sequence and structure. These limitations appear to arise from fundamental principles of physics, which dictate that proteins exist as interchanging ensembles of conformations, rather than as static structures that underly conventional structure-function relationships. This perspective discusses the concept of ensemble-function relationships as quantitative relationships that build on and extend traditional structure-function relationships. The concepts of free energy landscapes and conformational ensembles and their application to proteins are reviewed. The perspective summarizes a range of approaches that can provide conformational ensemble information and focuses on X-ray crystallography methods for obtaining experimental protein conformational ensembles. Focusing on enzymes as archetypes of protein function, recent literature examples are reviewed that used ensemble-function relationships to understand how protein residues contribute to function and how changes in protein sequence and structure impact activity, leading to new models and providing previously inaccessible mechanistic insights. Potential applications of conformational ensembles and ensemble-function relationships to protein design are examined. The perspective concludes with current limitations of the ensemble-function relationships and potential paths forward toward the next generation of quantitative ensemble-function models.
Collapse
Affiliation(s)
- Filip Yabukarski
- Protein Homeostasis Structural Biology Group, Bristol Myers Squibb, San Diego, CA 92121, United States.
| |
Collapse
|
7
|
Deck SL, Xu M, Milano SK, Cerione RA. Revealing Functional Hotspots: Temperature-Dependent Crystallography of K-RAS Highlights Allosteric and Druggable Sites. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.27.639303. [PMID: 40060414 PMCID: PMC11888411 DOI: 10.1101/2025.02.27.639303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 03/15/2025]
Abstract
K-RAS mutations drive oncogenesis in multiple cancers, yet the lack of druggable sites has long hindered therapeutic development. Here, we use multi-temperature X-ray crystallography (MT-XRC) to capture functionally relevant K-RAS conformations across a temperature gradient, spanning cryogenic to physiological and even "fever" conditions, and show how cryogenic conditions may obscure key dynamic states as targets for new drug development. This approach revealed a temperature-dependent conformational landscape of K-RAS, shedding light on the dynamic nature of key regions. We identified significant conformational changes occurring at critical sites, including known allosteric and drug-binding pockets, which were hidden under cryogenic conditions but later discovered to be critically important for drug-protein interactions and inhibitor design. These structural changes align with regions previously highlighted by large-scale mutational studies as functionally significant. However, our MT-XRC analysis provides precise structural snapshots, capturing the exact conformations of these potentially important allosteric sites in unprecedented detail. Our findings underscore the necessity of advancing tools like MT-XRC to visualize conformational transitions that may be important in signal propagation which are missed by standard cryogenic XRC and to address hard-to-drug targets through rational drug design. This approach not only provides unique structural insights into K-RAS signaling events and identifies new potential sites to target with drug candidates but also establishes a powerful framework for discovering therapeutic opportunities against other challenging drug targets.
Collapse
Affiliation(s)
- Samuel L Deck
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853
- Department of Molecular Medicine, Cornell University, Ithaca, NY 14853
| | - Megan Xu
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853
- Department of Molecular Medicine, Cornell University, Ithaca, NY 14853
| | - Shawn K Milano
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853
- Department of Molecular Medicine, Cornell University, Ithaca, NY 14853
| | - Richard A Cerione
- Department of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853
- Department of Molecular Medicine, Cornell University, Ithaca, NY 14853
| |
Collapse
|
8
|
Colizzi F. Leveraging Cryptic Ligand Envelopes through Enhanced Molecular Simulations. J Phys Chem Lett 2025; 16:443-453. [PMID: 39740196 DOI: 10.1021/acs.jpclett.4c03215] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2025]
Abstract
Protein-bound ligands can adopt a range of different conformations, collectively defining a ligand envelope that has proven to be crucial for the design of potent and selective drugs. Yet, the cryptic nature of this ligand envelope makes it difficult to visualize, characterize, and ultimately exploit for drug design. Using enhanced molecular dynamics simulations, here, we provide a general framework to reconstruct the cryptic ligand envelope that is dynamically accessible by protein-bound small molecules in solution. We apply this approach to quantify hidden conformational heterogeneity in structurally complex ligands including the marine natural product plitidepsin. The computed conformational heterogeneity expands the small-molecule footprint beyond that typically observed in experiments, also revealing key thermodynamic and kinetic properties of single ligand-target interactions. The model agrees quantitatively with solution NMR, X-ray crystallography, and biochemical measurements, showcasing a versatile strategy to integrate receptor-bound ligand conformational ensembles in molecular design.
Collapse
Affiliation(s)
- Francesco Colizzi
- Molecular Ocean Lab, Institute for Advanced Chemistry of Catalonia, IQAC-CSIC, Carrer de Jordi Girona 18-26, 08034 Barcelona, Spain
- Institute of Marine Sciences, ICM-CSIC, Passeig Marítim de la Barceloneta 37-49, 08003 Barcelona, Spain
| |
Collapse
|
9
|
Zawistowski RK, Crane BR. Differential Responses in the Core, Active Site and Peripheral Regions of Cytochrome c Peroxidase to Extreme Pressure and Temperature. J Mol Biol 2024; 436:168799. [PMID: 39332669 PMCID: PMC11563881 DOI: 10.1016/j.jmb.2024.168799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2024] [Revised: 09/18/2024] [Accepted: 09/19/2024] [Indexed: 09/29/2024]
Abstract
In consideration of life in extreme environments, the effects of hydrostatic pressure on proteins at the atomic level have drawn substantial interest. Large deviations of temperature and pressure from ambient conditions can shift the free energy landscape of proteins to reveal otherwise lowly populated structural states and even promote unfolding. We report the crystal structure of the heme-containing peroxidase, cytochrome c peroxidase (CcP) at 1.5 and 3.0 kbar and make comparisons to structures determined at 1.0 bar and cryo-temperatures (100 K). Pressure produces anisotropic changes in CcP, but compressibility plateaus after 1.5 kbar. CcP responds to pressure with volume declines at the periphery of the protein where B-factors are relatively high but maintains nearly intransient core structure, hydrogen bonding interactions and active site channels. Changes in active-site solvation and heme ligation reveal pressure sensitivity to protein-ligand interactions and a potential docking site for the substrate peroxide. Compression at the surface affects neither alternate side-chain conformers nor B-factors. Thus, packing in the core, which resembles a crystalline solid, limits motion and protects the active site, whereas looser packing at the surface preserves side-chain dynamics. These data demonstrate that conformational dynamics and packing densities are not fully correlated in proteins and that encapsulation of cofactors by the polypeptide can provide a precisely structured environment resistant to change across a wide range of physical conditions.
Collapse
Affiliation(s)
- Rebecca K Zawistowski
- Department of Chemistry and Chemical Biology, Cornell University, 122 Baker Laboratory, Ithaca, NY 14853, USA.
| | - Brian R Crane
- Department of Chemistry and Chemical Biology, Cornell University, 122 Baker Laboratory, Ithaca, NY 14853, USA.
| |
Collapse
|
10
|
Flowers J, Echols N, Correy G, Jaishankar P, Togo T, Renslo AR, van den Bedem H, Fraser JS, Wankowicz SA. Expanding Automated Multiconformer Ligand Modeling to Macrocycles and Fragments. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.20.613996. [PMID: 39386683 PMCID: PMC11463535 DOI: 10.1101/2024.09.20.613996] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 10/12/2024]
Abstract
Small molecule ligands exhibit a diverse range of conformations in solution. Upon binding to a target protein, this conformational diversity is generally reduced. However, ligands can retain some degree of conformational flexibility even when bound to a receptor. In the Protein Data Bank (PDB), a small number of ligands have been modeled with distinct alternative conformations that are supported by X-ray crystallography density maps. However, the vast majority of structural models are fit to a single ligand conformation, potentially ignoring the underlying conformational heterogeneity present in the sample. We previously developed qFit-ligand to sample diverse ligand conformations and to select a parsimonious ensemble consistent with the density. While this approach indicated that many ligands populate alternative conformations, limitations in our sampling procedures often resulted in non-physical conformations and could not model complex ligands like macrocycles. Here, we introduce several improvements to qFit-ligand, including the use of routines within RDKit for stochastic conformational sampling. This new sampling method greatly enriches low energy conformations of small molecules and macrocycles. We further extended qFit-ligand to identify alternative conformations in PanDDA-modified density maps from high throughput X-ray fragment screening experiments. The new version of qFit-ligand improves fit to electron density and reduces torsional strain relative to deposited single conformer models and our previous version of qFit-ligand. These advances enhance the analysis of residual conformational heterogeneity present in ligand-bound structures, which can provide important insights for the rational design of therapeutic agents.
Collapse
Affiliation(s)
- Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
| | - Nathaniel Echols
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
| | - Galen Correy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
| | - Priya Jaishankar
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA
| | - Takaya Togo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA
| | - Adam R. Renslo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
- Atomwise Inc, San Francisco, CA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
| | - Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA
- Current Address: Department of Molecular Physiology and Biophysics, Vanderbilt University, Nashville, TN
| |
Collapse
|
11
|
Correy GJ, Rachman M, Togo T, Gahbauer S, Doruk YU, Stevens M, Jaishankar P, Kelley B, Goldman B, Schmidt M, Kramer T, Ashworth A, Riley P, Shoichet BK, Renslo AR, Walters WP, Fraser JS. Extensive exploration of structure activity relationships for the SARS-CoV-2 macrodomain from shape-based fragment merging and active learning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.25.609621. [PMID: 39253507 PMCID: PMC11383323 DOI: 10.1101/2024.08.25.609621] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/11/2024]
Abstract
The macrodomain contained in the SARS-CoV-2 non-structural protein 3 (NSP3) is required for viral pathogenesis and lethality. Inhibitors that block the macrodomain could be a new therapeutic strategy for viral suppression. We previously performed a large-scale X-ray crystallography-based fragment screen and discovered a sub-micromolar inhibitor by fragment linking. However, this carboxylic acid-containing lead had poor membrane permeability and other liabilities that made optimization difficult. Here, we developed a shape-based virtual screening pipeline - FrankenROCS - to identify new macrodomain inhibitors using fragment X-ray crystal structures. We used FrankenROCS to exhaustively screen the Enamine high-throughput screening (HTS) collection of 2.1 million compounds and selected 39 compounds for testing, with the most potent compound having an IC50 value equal to 130 μM. We then paired FrankenROCS with an active learning algorithm (Thompson sampling) to efficiently search the Enamine REAL database of 22 billion molecules, testing 32 compounds with the most potent having an IC50 equal to 220 μM. Further optimization led to analogs with IC50 values better than 10 μM, with X-ray crystal structures revealing diverse binding modes despite conserved chemical features. These analogs represent a new lead series with improved membrane permeability that is poised for optimization. In addition, the collection of 137 X-ray crystal structures with associated binding data will serve as a resource for the development of structure-based drug discovery methods. FrankenROCS may be a scalable method for fragment linking to exploit ever-growing synthesis-on-demand libraries.
Collapse
Affiliation(s)
- Galen J. Correy
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158
| | - Moira Rachman
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | - Takaya Togo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | - Stefan Gahbauer
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | - Yagmur U. Doruk
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158
| | - Maisie Stevens
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158
| | - Priyadarshini Jaishankar
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | | | | | | | | | - Alan Ashworth
- Helen Diller Family Comprehensive Cancer Center, University of California San Francisco, San Francisco, CA 94158
| | | | - Brian K. Shoichet
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | - Adam R. Renslo
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA 94158
| | | | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA 94158
| |
Collapse
|
12
|
Rosenberg AA, Marx A, Bronstein AM. A dataset of alternately located segments in protein crystal structures. Sci Data 2024; 11:783. [PMID: 39019896 PMCID: PMC11255211 DOI: 10.1038/s41597-024-03595-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Accepted: 07/01/2024] [Indexed: 07/19/2024] Open
Abstract
Protein Data Bank (PDB) files list the relative spatial location of atoms in a protein structure as the final output of the process of fitting and refining to experimentally determined electron density measurements. Where experimental evidence exists for multiple conformations, atoms are modelled in alternate locations. Programs reading PDB files commonly ignore these alternate conformations by default leaving users oblivious to the presence of alternate conformations in the structures they analyze. This has led to underappreciation of their prevalence, under characterisation of their features and limited the accessibility to this high-resolution data representing structural ensembles. We have trawled PDB files to extract structural features of residues with alternately located atoms. The output includes the distance between alternate conformations and identifies the location of these segments within the protein chain and in proximity of all other atoms within a defined radius. This dataset should be of use in efforts to predict multiple structures from a single sequence and support studies investigating protein flexibility and the association with protein function.
Collapse
Affiliation(s)
- Aviv A Rosenberg
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel
| | - Ailie Marx
- Department of Molecular and Computational Biosciences and Biotechnology, Migal - Galilee Research Institute, Qiryat, Israel.
| | - Alexander M Bronstein
- Department of Computer Science, Technion - Israel Institute of Technology, Haifa, Israel.
| |
Collapse
|
13
|
Liu F, Mailhot O, Glenn IS, Vigneron SF, Bassim V, Xu X, Fonseca-Valencia K, Smith MS, Radchenko DS, Fraser JS, Moroz YS, Irwin JJ, Shoichet BK. The impact of Library Size and Scale of Testing on Virtual Screening. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.08.602536. [PMID: 39026784 PMCID: PMC11257449 DOI: 10.1101/2024.07.08.602536] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/20/2024]
Abstract
Virtual libraries for ligand discovery have recently increased 10,000-fold, and this is thought to have improved hit rates and potencies from library docking. This idea has not, however, been experimentally tested in direct comparisons of larger-vs-smaller libraries. Meanwhile, though libraries have exploded, the scale of experimental testing has little changed, with often only dozens of high-ranked molecules investigated, making interpretation of hit rates and affinities uncertain. Accordingly, we docked a 1.7 billion molecule virtual library against the model enzyme AmpC β-lactamase, testing 1,521 new molecules and comparing the results to the same screen with a library of 99 million molecules, where only 44 molecules were tested. Encouragingly, the larger screen outperformed the smaller one: hit rates improved by two-fold, more new scaffolds were discovered, and potency improved. Overall, 50-fold more inhibitors were found, supporting the idea that there are many more compounds to be discovered than are being tested. With so many compounds evaluated, we could ask how the results vary with number tested, sampling smaller sets at random from the 1521. Hit rates and affinities were highly variable when we only sampled dozens of molecules, and it was only when we included several hundred molecules that results converged. As docking scores improved, so too did the likelihood of a molecule binding; hit rates improved steadily with docking score, as did affinities. This also appeared true on reanalysis of large-scale results against the σ2 and dopamine D4 receptors. It may be that as the scale of both the virtual libraries and their testing grows, not only are better ligands found but so too does our ability to rank them.
Collapse
Affiliation(s)
- Fangyu Liu
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Olivier Mailhot
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Isabella S Glenn
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Seth F Vigneron
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Violla Bassim
- Dept. of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco CA 94143, USA
| | - Xinyu Xu
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Karla Fonseca-Valencia
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Matthew S Smith
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | | | - James S Fraser
- Dept. of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco CA 94143, USA
| | - Yurii S Moroz
- Enamine Ltd., Kyiv, 02094, Ukraine
- Chemspace (www.chem-space.com), Chervonotkatska Street 85, Kyїv 02094, Ukraine
- Taras Shevchenko National University of Kyїv, Volodymyrska Street 60, Kyїv 01601, Ukraine
| | - John J Irwin
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| | - Brian K Shoichet
- Dept. of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco CA 94143, USA
| |
Collapse
|
14
|
Wankowicz SA, Fraser JS. Comprehensive encoding of conformational and compositional protein structural ensembles through the mmCIF data structure. IUCRJ 2024; 11:494-501. [PMID: 38958015 PMCID: PMC11220883 DOI: 10.1107/s2052252524005098] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/17/2023] [Accepted: 05/29/2024] [Indexed: 07/04/2024]
Abstract
In the folded state, biomolecules exchange between multiple conformational states crucial for their function. However, most structural models derived from experiments and computational predictions only encode a single state. To represent biomolecules accurately, we must move towards modeling and predicting structural ensembles. Information about structural ensembles exists within experimental data from X-ray crystallography and cryo-electron microscopy. Although new tools are available to detect conformational and compositional heterogeneity within these ensembles, the legacy PDB data structure does not robustly encapsulate this complexity. We propose modifications to the macromolecular crystallographic information file (mmCIF) to improve the representation and interrelation of conformational and compositional heterogeneity. These modifications will enable the capture of macromolecular ensembles in a human and machine-interpretable way, potentially catalyzing breakthroughs for ensemble-function predictions, analogous to the achievements of AlphaFold with single-structure prediction.
Collapse
Affiliation(s)
- Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic ScienceUniversity of CaliforniaSan FranciscoCA94117USA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic ScienceUniversity of CaliforniaSan FranciscoCA94117USA
| |
Collapse
|
15
|
Hoff SE, Thomasen FE, Lindorff-Larsen K, Bonomi M. Accurate model and ensemble refinement using cryo-electron microscopy maps and Bayesian inference. PLoS Comput Biol 2024; 20:e1012180. [PMID: 39008528 PMCID: PMC11271924 DOI: 10.1371/journal.pcbi.1012180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 07/25/2024] [Accepted: 05/20/2024] [Indexed: 07/17/2024] Open
Abstract
Converting cryo-electron microscopy (cryo-EM) data into high-quality structural models is a challenging problem of outstanding importance. Current refinement methods often generate unbalanced models in which physico-chemical quality is sacrificed for excellent fit to the data. Furthermore, these techniques struggle to represent the conformational heterogeneity averaged out in low-resolution regions of density maps. Here we introduce EMMIVox, a Bayesian inference approach to determine single-structure models as well as structural ensembles from cryo-EM maps. EMMIVox automatically balances experimental information with accurate physico-chemical models of the system and the surrounding environment, including waters, lipids, and ions. Explicit treatment of data correlation and noise as well as inference of accurate B-factors enable determination of structural models and ensembles with both excellent fit to the data and high stereochemical quality, thus outperforming state-of-the-art refinement techniques. EMMIVox represents a flexible approach to determine high-quality structural models that will contribute to advancing our understanding of the molecular mechanisms underlying biological functions.
Collapse
Affiliation(s)
- Samuel E. Hoff
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Computational Structural Biology Unit, Paris, France
| | - F. Emil Thomasen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory, Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Massimiliano Bonomi
- Institut Pasteur, Université Paris Cité, CNRS UMR 3528, Computational Structural Biology Unit, Paris, France
| |
Collapse
|
16
|
Wankowicz SA, Ravikumar A, Sharma S, Riley B, Raju A, Hogan DW, Flowers J, van den Bedem H, Keedy DA, Fraser JS. Automated multiconformer model building for X-ray crystallography and cryo-EM. eLife 2024; 12:RP90606. [PMID: 38904665 PMCID: PMC11192534 DOI: 10.7554/elife.90606] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/22/2024] Open
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift toward modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior Rfree and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g., Coot) and fit can be further improved by refinement using standard pipelines (e.g., Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Ph.D. Program in Biology, The Graduate Center, City University of New YorkNew YorkUnited States
| | - Blake Riley
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
| | - Daniel W Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Atomwise IncSan FranciscoUnited States
| | - Daniel A Keedy
- Structural Biology Initiative, CUNY Advanced Science Research CenterNew YorkUnited States
- Department of Chemistry and Biochemistry, City College of New YorkNew YorkUnited States
- Ph.D. Programs in Biochemistry, Biology and Chemistry, The Graduate Center, City University of New YorkNew YorkUnited States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
17
|
Wankowicz SA, Ravikumar A, Sharma S, Riley BT, Raju A, Flowers J, Hogan D, van den Bedem H, Keedy DA, Fraser JS. Uncovering Protein Ensembles: Automated Multiconformer Model Building for X-ray Crystallography and Cryo-EM. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.06.28.546963. [PMID: 37425870 PMCID: PMC10327213 DOI: 10.1101/2023.06.28.546963] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/11/2023]
Abstract
In their folded state, biomolecules exchange between multiple conformational states that are crucial for their function. Traditional structural biology methods, such as X-ray crystallography and cryogenic electron microscopy (cryo-EM), produce density maps that are ensemble averages, reflecting molecules in various conformations. Yet, most models derived from these maps explicitly represent only a single conformation, overlooking the complexity of biomolecular structures. To accurately reflect the diversity of biomolecular forms, there is a pressing need to shift towards modeling structural ensembles that mirror the experimental data. However, the challenge of distinguishing signal from noise complicates manual efforts to create these models. In response, we introduce the latest enhancements to qFit, an automated computational strategy designed to incorporate protein conformational heterogeneity into models built into density maps. These algorithmic improvements in qFit are substantiated by superior R f r e e and geometry metrics across a wide range of proteins. Importantly, unlike more complex multicopy ensemble models, the multiconformer models produced by qFit can be manually modified in most major model building software (e.g. Coot) and fit can be further improved by refinement using standard pipelines (e.g. Phenix, Refmac, Buster). By reducing the barrier of creating multiconformer models, qFit can foster the development of new hypotheses about the relationship between macromolecular conformational dynamics and function.
Collapse
Affiliation(s)
- Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ashraya Ravikumar
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Ph.D. Program in Biology, The Graduate Center – City University of New York, New York, NY 10016
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Akshay Raju
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Jessica Flowers
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Daniel Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
- Atomwise, Inc., San Francisco, CA, United States
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
- Ph.D. Programs in Biochemistry, Biology, and Chemistry, The Graduate Center – City University of New York, New York, NY 10016
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
18
|
Stachowski TR, Fischer M. FLEXR GUI: a graphical user interface for multi-conformer modeling of proteins. J Appl Crystallogr 2024; 57:580-586. [PMID: 38596743 PMCID: PMC11001397 DOI: 10.1107/s1600576724001523] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2024] [Accepted: 02/14/2024] [Indexed: 04/11/2024] Open
Abstract
Proteins are well known 'shapeshifters' which change conformation to function. In crystallography, multiple conformational states are often present within the crystal and the resulting electron-density map. Yet, explicitly incorporating alternative states into models to disentangle multi-conformer ensembles is challenging. We previously reported the tool FLEXR, which, within a few minutes, automatically separates conformational signal from noise and builds the corresponding, often missing, structural features into a multi-conformer model. To make the method widely accessible for routine multi-conformer building as part of the computational toolkit for macromolecular crystallography, we present a graphical user interface (GUI) for FLEXR, designed as a plugin for Coot 1. The GUI implementation seamlessly connects FLEXR models with the existing suite of validation and modeling tools available in Coot. We envision that FLEXR will aid crystallographers by increasing access to a multi-conformer modeling method that will ultimately lead to a better representation of protein conformational heterogeneity in the Protein Data Bank. In turn, deeper insights into the protein conformational landscape may inform biology or provide new opportunities for ligand design. The code is open source and freely available on GitHub at https://github.com/TheFischerLab/FLEXR-GUI.
Collapse
Affiliation(s)
- Timothy R. Stachowski
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Marcus Fischer
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
19
|
Saccuzzo EG, Mebrat MD, Scelsi HF, Kim M, Ma MT, Su X, Hill SE, Rheaume E, Li R, Torres MP, Gumbart JC, Van Horn WD, Lieberman RL. Competition between inside-out unfolding and pathogenic aggregation in an amyloid-forming β-propeller. Nat Commun 2024; 15:155. [PMID: 38168102 PMCID: PMC10762032 DOI: 10.1038/s41467-023-44479-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 12/14/2023] [Indexed: 01/05/2024] Open
Abstract
Studies of folded-to-misfolded transitions using model protein systems reveal a range of unfolding needed for exposure of amyloid-prone regions for subsequent fibrillization. Here, we probe the relationship between unfolding and aggregation for glaucoma-associated myocilin. Mutations within the olfactomedin domain of myocilin (OLF) cause a gain-of-function, namely cytotoxic intracellular aggregation, which hastens disease progression. Aggregation by wild-type OLF (OLFWT) competes with its chemical unfolding, but only below the threshold where OLF loses tertiary structure. Representative moderate (OLFD380A) and severe (OLFI499F) disease variants aggregate differently, with rates comparable to OLFWT in initial stages of unfolding, and variants adopt distinct partially folded structures seen along the OLFWT urea-unfolding pathway. Whether initiated with mutation or chemical perturbation, unfolding propagates outward to the propeller surface. In sum, for this large protein prone to amyloid formation, the requirement for a conformational change to promote amyloid fibrillization leads to direct competition between unfolding and aggregation.
Collapse
Affiliation(s)
- Emily G Saccuzzo
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA
| | - Mubark D Mebrat
- Biodesign Center for Personalized Diagnostics, Arizona State University, Tempe, USA
- School of Molecular Sciences, Arizona State University, Tempe, USA
| | - Hailee F Scelsi
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA
| | - Minjoo Kim
- Biodesign Center for Personalized Diagnostics, Arizona State University, Tempe, USA
- School of Molecular Sciences, Arizona State University, Tempe, USA
| | - Minh Thu Ma
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA
| | - Xinya Su
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, USA
| | - Shannon E Hill
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA
| | - Elisa Rheaume
- Interdisciplinary Graduate Program in Quantitative Biosciences, Georgia Institute of Technology, Atlanta, USA
| | - Renhao Li
- Aflac Cancer and Blood Disorders Center, Children's Healthcare of Atlanta and Department of Pediatrics, Emory University School of Medicine, Atlanta, USA
| | - Matthew P Torres
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, USA
| | - James C Gumbart
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, USA
- School of Physics, Georgia Institute of Technology, Atlanta, USA
| | - Wade D Van Horn
- Biodesign Center for Personalized Diagnostics, Arizona State University, Tempe, USA.
- School of Molecular Sciences, Arizona State University, Tempe, USA.
| | - Raquel L Lieberman
- School of Chemistry & Biochemistry, Georgia Institute of Technology, Atlanta, USA.
| |
Collapse
|
20
|
Sharma S, Skaist Mehlman T, Sagabala RS, Boivin B, Keedy DA. High-resolution double vision of the allosteric phosphatase PTP1B. Acta Crystallogr F Struct Biol Commun 2024; 80:1-12. [PMID: 38133579 PMCID: PMC10833341 DOI: 10.1107/s2053230x23010749] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Accepted: 12/14/2023] [Indexed: 12/23/2023] Open
Abstract
Protein tyrosine phosphatase 1B (PTP1B) plays important roles in cellular homeostasis and is a highly validated therapeutic target for multiple human ailments, including diabetes, obesity and breast cancer. However, much remains to be learned about how conformational changes may convey information through the structure of PTP1B to enable allosteric regulation by ligands or functional responses to mutations. High-resolution X-ray crystallography can offer unique windows into protein conformational ensembles, but comparison of even high-resolution structures is often complicated by differences between data sets, including non-isomorphism. Here, the highest resolution crystal structure of apo wild-type (WT) PTP1B to date is presented out of a total of ∼350 PTP1B structures in the PDB. This structure is in a crystal form that is rare for PTP1B, with two unique copies of the protein that exhibit distinct patterns of conformational heterogeneity, allowing a controlled comparison of local disorder across the two chains within the same asymmetric unit. The conformational differences between these chains are interrogated in the apo structure and between several recently reported high-resolution ligand-bound structures. Electron-density maps in a high-resolution structure of a recently reported activating double mutant are also examined, and unmodeled alternate conformations in the mutant structure are discovered that coincide with regions of enhanced conformational heterogeneity in the new WT structure. These results validate the notion that these mutations operate by enhancing local dynamics, and suggest a latent susceptibility to such changes in the WT enzyme. Together, these new data and analysis provide a detailed view of the conformational ensemble of PTP1B and highlight the utility of high-resolution crystallography for elucidating conformational heterogeneity with potential relevance for function.
Collapse
Affiliation(s)
- Shivani Sharma
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
- PhD Program in Biology, CUNY Graduate Center, New York, NY 10016, USA
| | - Tamar Skaist Mehlman
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
| | - Reddy Sudheer Sagabala
- Department of Nanobioscience, College of Nanoscale Science and Engineering, SUNY Polytechnic Institute, Albany, NY 12203, USA
| | - Benoit Boivin
- Department of Nanobioscience, College of Nanoscale Science and Engineering, SUNY Polytechnic Institute, Albany, NY 12203, USA
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031, USA
- PhD Programs in Biochemistry, Biology and Chemistry, CUNY Graduate Center, New York, NY 10016, USA
| |
Collapse
|
21
|
Ramelot TA, Tejero R, Montelione GT. Representing structures of the multiple conformational states of proteins. Curr Opin Struct Biol 2023; 83:102703. [PMID: 37776602 PMCID: PMC10841472 DOI: 10.1016/j.sbi.2023.102703] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/18/2023] [Accepted: 08/23/2023] [Indexed: 10/02/2023]
Abstract
Biomolecules exhibit dynamic behavior that single-state models of their structures cannot fully capture. We review some recent advances for investigating multiple conformations of biomolecules, including experimental methods, molecular dynamics simulations, and machine learning. We also address the challenges associated with representing single- and multiple-state models in data archives, with a particular focus on NMR structures. Establishing standardized representations and annotations will facilitate effective communication and understanding of these complex models to the broader scientific community.
Collapse
Affiliation(s)
- Theresa A Ramelot
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| | - Roberto Tejero
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Gaetano T Montelione
- Dept of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA.
| |
Collapse
|
22
|
Boulos I, Jabbour J, Khoury S, Mikhael N, Tishkova V, Candoni N, Ghadieh HE, Veesler S, Bassim Y, Azar S, Harb F. Exploring the World of Membrane Proteins: Techniques and Methods for Understanding Structure, Function, and Dynamics. Molecules 2023; 28:7176. [PMID: 37894653 PMCID: PMC10608922 DOI: 10.3390/molecules28207176] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Revised: 09/13/2023] [Accepted: 10/04/2023] [Indexed: 10/29/2023] Open
Abstract
In eukaryotic cells, membrane proteins play a crucial role. They fall into three categories: intrinsic proteins, extrinsic proteins, and proteins that are essential to the human genome (30% of which is devoted to encoding them). Hydrophobic interactions inside the membrane serve to stabilize integral proteins, which span the lipid bilayer. This review investigates a number of computational and experimental methods used to study membrane proteins. It encompasses a variety of technologies, including electrophoresis, X-ray crystallography, cryogenic electron microscopy (cryo-EM), nuclear magnetic resonance spectroscopy (NMR), biophysical methods, computational methods, and artificial intelligence. The link between structure and function of membrane proteins has been better understood thanks to these approaches, which also hold great promise for future study in the field. The significance of fusing artificial intelligence with experimental data to improve our comprehension of membrane protein biology is also covered in this paper. This effort aims to shed light on the complexity of membrane protein biology by investigating a variety of experimental and computational methods. Overall, the goal of this review is to emphasize how crucial it is to understand the functions of membrane proteins in eukaryotic cells. It gives a general review of the numerous methods used to look into these crucial elements and highlights the demand for multidisciplinary approaches to advance our understanding.
Collapse
Affiliation(s)
- Imad Boulos
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Joy Jabbour
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Serena Khoury
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Nehme Mikhael
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Victoria Tishkova
- CNRS, CINaM (Centre Interdisciplinaire de Nanosciences de Marseille), Campus de Luminy, Case 913, Aix-Marseille University, CEDEX 09, F-13288 Marseille, France; (V.T.); (N.C.); (S.V.)
| | - Nadine Candoni
- CNRS, CINaM (Centre Interdisciplinaire de Nanosciences de Marseille), Campus de Luminy, Case 913, Aix-Marseille University, CEDEX 09, F-13288 Marseille, France; (V.T.); (N.C.); (S.V.)
| | - Hilda E. Ghadieh
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Stéphane Veesler
- CNRS, CINaM (Centre Interdisciplinaire de Nanosciences de Marseille), Campus de Luminy, Case 913, Aix-Marseille University, CEDEX 09, F-13288 Marseille, France; (V.T.); (N.C.); (S.V.)
| | - Youssef Bassim
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Sami Azar
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| | - Frédéric Harb
- Faculty of Medicine and Medical Sciences, University of Balamand, Tripoli P.O. Box 100, Lebanon; (I.B.); (J.J.); (S.K.); (N.M.); (H.E.G.); (Y.B.); (S.A.)
| |
Collapse
|
23
|
Ma W, Zhang W, Le Y, Shi X, Xu Q, Xiao Y, Dou Y, Wang X, Zhou W, Peng W, Zhang H, Huang B. Using macromolecular electron densities to improve the enrichment of active compounds in virtual screening. Commun Chem 2023; 6:173. [PMID: 37608192 PMCID: PMC10444862 DOI: 10.1038/s42004-023-00984-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2023] [Accepted: 08/15/2023] [Indexed: 08/24/2023] Open
Abstract
The quest for effective virtual screening algorithms is hindered by the scarcity of training data, calling for innovative approaches. This study presents the use of experimental electron density (ED) data for improving active compound enrichment in virtual screening, supported by ED's ability to reflect the time-averaged behavior of ligands and solvents in the binding pocket. Experimental ED-based grid matching score (ExptGMS) was developed to score compounds by measuring the degree of matching between their binding conformations and a series of multi-resolution experimental ED grids. The efficiency of ExptGMS was validated using both in silico tests with the Directory of Useful Decoys-Enhanced dataset and wet-lab tests on Covid-19 3CLpro-inhibitors. ExptGMS improved the active compound enrichment in top-ranked molecules by approximately 20%. Furthermore, ExptGMS identified four active inhibitors of 3CLpro, with the most effective showing an IC50 value of 1.9 µM. We also developed an online database containing experimental ED grids for over 17,000 proteins to facilitate the use of ExptGMS for academic users.
Collapse
Affiliation(s)
- Wenzhi Ma
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Wei Zhang
- State Key Laboratory of Respiratory Disease, First Affiliated Hospital of Guangzhou Medical University, 510182, Guangzhou, China
- Innovation Center for Pathogen Research, Guangzhou Laboratory, 510320, Guangzhou, China
| | - Yuan Le
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Xiaoxuan Shi
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Qingbo Xu
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Yang Xiao
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Yueying Dou
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Xiaoman Wang
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Wenbiao Zhou
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China
| | - Wei Peng
- State Key Laboratory of Respiratory Disease, First Affiliated Hospital of Guangzhou Medical University, 510182, Guangzhou, China
- Innovation Center for Pathogen Research, Guangzhou Laboratory, 510320, Guangzhou, China
| | - Hongbo Zhang
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China.
| | - Bo Huang
- Beijing StoneWise Technology Co Ltd., Haidian Street #15, Haidian District, 100080, Beijing, China.
| |
Collapse
|
24
|
Wych DC, Wall ME. Molecular-dynamics simulations of macromolecular diffraction, part II: Analysis of protein crystal simulations. Methods Enzymol 2023; 688:115-143. [PMID: 37748824 DOI: 10.1016/bs.mie.2023.06.012] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Molecular-dynamics (MD) simulations have contributed substantially to our understanding of protein structure and dynamics, yielding insights into many biological processes including protein folding, drug binding, and mechanisms of protein-protein interactions. Much of what is known about protein structure comes from macromolecular crystallography (MX) experiments. MD simulations of protein crystals are useful in the study of MX because the simulations can be analyzed to calculate almost any crystallographic observable of interest, from atomic coordinates to structure factors and densities, B-factors, multiple conformations and their populations/occupancies, and diffuse scattering intensities. Computing resources and software to support crystalline MD simulations are now readily available to many researchers studying protein structure and dynamics and who may be interested in advanced interpretation of MX data, including diffuse scattering. In this work, we outline methods of analyzing MD simulations of protein crystals and provide accompanying Jupyter notebooks as practical resources for researchers wishing to perform similar analyses on their own systems of interest.
Collapse
Affiliation(s)
- David C Wych
- Computer, Computational and Statistical Sciences Division, Los Alamos, NM, United States; Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, United States
| | - Michael E Wall
- Computer, Computational and Statistical Sciences Division, Los Alamos, NM, United States.
| |
Collapse
|
25
|
Thompson MC. Combining temperature perturbations with X-ray crystallography to study dynamic macromolecules: A thorough discussion of experimental methods. Methods Enzymol 2023; 688:255-305. [PMID: 37748829 DOI: 10.1016/bs.mie.2023.07.008] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Temperature is an important state variable that governs the behavior of microscopic systems, yet crystallographers rarely exploit temperature changes to study the structure and dynamics of biological macromolecules. In fact, approximately 90% of crystal structures in the Protein Data Bank were determined under cryogenic conditions, because sample cryocooling makes crystals robust to X-ray radiation damage and facilitates data collection. On the other hand, cryocooling can introduce artifacts into macromolecular structures, and can suppress conformational dynamics that are critical for function. Fortunately, recent advances in X-ray detector technology, X-ray sources, and computational data processing algorithms make non-cryogenic X-ray crystallography easier and more broadly applicable than ever before. Without the reliance on cryocooling, high-resolution crystallography can be combined with various temperature perturbations to gain deep insight into the conformational landscapes of macromolecules. This Chapter reviews the historical reasons for the prevalence of cryocooling in macromolecular crystallography, and discusses its potential drawbacks. Next, the Chapter summarizes technological developments and methodologies that facilitate non-cryogenic crystallography experiments. Finally, the chapter discusses the theoretical underpinnings and practical aspects of multi-temperature and temperature-jump crystallography experiments, which are powerful tools for understanding the relationship between the structure, dynamics, and function of proteins and other biological macromolecules.
Collapse
Affiliation(s)
- Michael C Thompson
- Department of Chemistry and Biochemistry, University of California, Merced, Merced, CA, United States.
| |
Collapse
|
26
|
Peck A, Lane TJ, Poitevin F. Modeling diffuse scattering with simple, physically interpretable models. Methods Enzymol 2023; 688:169-194. [PMID: 37748826 DOI: 10.1016/bs.mie.2023.06.022] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Diffuse scattering has long been proposed to probe protein dynamics relevant for biological function, and more recently, as a tool to aid structure determination. Despite recent advances in measuring and modeling this signal, the field has not been able to routinely use experimental diffuse scattering for either application. A persistent challenge has been to devise models that are sophisticated enough to robustly reproduce experimental diffuse features but remain readily interpretable from the standpoint of structural biology. This chapter presents eryx, a suite of computational tools to evaluate the primary models of disorder that have been used to analyze protein diffuse scattering. By facilitating comparative modeling, eryx aims to provide insights into the physical origins of this signal and help identify the sources of disorder that are critical for reproducing experimental features. This framework also lays the groundwork for the development of more advanced models that integrate different types of disorder without loss of interpretability.
Collapse
Affiliation(s)
- Ariana Peck
- Linac Coherent Light Source, SLAC National Accelerator Laboratory, Menlo Park, CA, United States.
| | | | - Frédéric Poitevin
- Linac Coherent Light Source, SLAC National Accelerator Laboratory, Menlo Park, CA, United States
| |
Collapse
|
27
|
Du S, Wankowicz SA, Yabukarski F, Doukov T, Herschlag D, Fraser JS. Refinement of multiconformer ensemble models from multi-temperature X-ray diffraction data. Methods Enzymol 2023; 688:223-254. [PMID: 37748828 PMCID: PMC10637719 DOI: 10.1016/bs.mie.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/27/2023]
Abstract
Conformational ensembles underlie all protein functions. Thus, acquiring atomic-level ensemble models that accurately represent conformational heterogeneity is vital to deepen our understanding of how proteins work. Modeling ensemble information from X-ray diffraction data has been challenging, as traditional cryo-crystallography restricts conformational variability while minimizing radiation damage. Recent advances have enabled the collection of high quality diffraction data at ambient temperatures, revealing innate conformational heterogeneity and temperature-driven changes. Here, we used diffraction datasets for Proteinase K collected at temperatures ranging from 313 to 363 K to provide a tutorial for the refinement of multiconformer ensemble models. Integrating automated sampling and refinement tools with manual adjustments, we obtained multiconformer models that describe alternative backbone and sidechain conformations, their relative occupancies, and interconnections between conformers. Our models revealed extensive and diverse conformational changes across temperature, including increased bound peptide ligand occupancies, different Ca2+ binding site configurations and altered rotameric distributions. These insights emphasize the value and need for multiconformer model refinement to extract ensemble information from diffraction data and to understand ensemble-function relationships.
Collapse
Affiliation(s)
- Siyuan Du
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Department of Chemistry, Stanford University, Stanford, CA, United States
| | - Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States
| | - Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Bristol-Myers Squibb, San Diego, CA, United States
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, CA, United States
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA, United States; Department of Chemical Engineering, Stanford University, Stanford, CA, United States; Stanford ChEM-H, Stanford University, Stanford, CA, United States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, United States; Quantitative Biosciences Institute, University of California, San Francisco, CA, United States.
| |
Collapse
|
28
|
Grybauskas A, Gražulis S. Building protein structure-specific rotamer libraries. Bioinformatics 2023; 39:btad429. [PMID: 37439702 PMCID: PMC10359632 DOI: 10.1093/bioinformatics/btad429] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2022] [Revised: 06/19/2023] [Indexed: 07/14/2023] Open
Abstract
MOTIVATION Identifying the probable positions of the protein side-chains is one of the protein modelling steps that can improve the prediction of protein-ligand and protein-protein interactions. Most of the strategies predicting the side-chain conformations use predetermined dihedral angle lists, also called rotamer libraries, that are usually generated from a subset of high-quality protein structures. Although these methods are fast to apply, they tend to average out geometries instead of taking into account the surrounding atoms and molecules and ignore structures not included in the selected subset. Such simplifications can result in inaccuracies when predicting possible side-chain atom positions. RESULTS We propose an approach that takes into account both of these circumstances by scanning through sterically accessible side-chain conformations and generating dihedral angle libraries specific to the target proteins. The method avoids the drawbacks of lacking conformations due to unusual or rare protein structures and successfully suggests potential rotamers with average RMSD closer to the experimentally determined side-chain atom positions than other widely used rotamer libraries. AVAILABILITY AND IMPLEMENTATION The technique is implemented in open-source software package rotag and available at GitHub: https://www.github.com/agrybauskas/rotag, under GNU Lesser General Public License.
Collapse
Affiliation(s)
- Algirdas Grybauskas
- Sector of Crystallography and Cheminformatics, Institute of Biotechnology, Life Sciences Center, Vilnius University, 7 Saulėtekio Ave, Vilnius, LT- 10257, Lithuania
| | - Saulius Gražulis
- Sector of Crystallography and Cheminformatics, Institute of Biotechnology, Life Sciences Center, Vilnius University, 7 Saulėtekio Ave, Vilnius, LT- 10257, Lithuania
| |
Collapse
|
29
|
Du S, Wankowicz SA, Yabukarski F, Doukov T, Herschlag D, Fraser JS. Refinement of Multiconformer Ensemble Models from Multi-temperature X-ray Diffraction Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.05.539620. [PMID: 37205593 PMCID: PMC10187334 DOI: 10.1101/2023.05.05.539620] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/21/2023]
Abstract
Conformational ensembles underlie all protein functions. Thus, acquiring atomic-level ensemble models that accurately represent conformational heterogeneity is vital to deepen our understanding of how proteins work. Modeling ensemble information from X-ray diffraction data has been challenging, as traditional cryo-crystallography restricts conformational variability while minimizing radiation damage. Recent advances have enabled the collection of high quality diffraction data at ambient temperatures, revealing innate conformational heterogeneity and temperature-driven changes. Here, we used diffraction datasets for Proteinase K collected at temperatures ranging from 313 to 363K to provide a tutorial for the refinement of multiconformer ensemble models. Integrating automated sampling and refinement tools with manual adjustments, we obtained multiconformer models that describe alternative backbone and sidechain conformations, their relative occupancies, and interconnections between conformers. Our models revealed extensive and diverse conformational changes across temperature, including increased bound peptide ligand occupancies, different Ca2+ binding site configurations and altered rotameric distributions. These insights emphasize the value and need for multiconformer model refinement to extract ensemble information from diffraction data and to understand ensemble-function relationships.
Collapse
Affiliation(s)
- Siyuan Du
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
| | - Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, United States
| | - Filip Yabukarski
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Bristol-Myers Squibb, San Diego, California 92121, United States
| | - Tzanko Doukov
- Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, Menlo Park, California 94025, United States
| | - Daniel Herschlag
- Department of Biochemistry, Stanford University, Stanford, California 94305, United States
- Department of Chemical Engineering, Stanford University, Stanford, California 94305, United States
- Stanford ChEM-H, Stanford University, Stanford, California 94305, United States
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, California 94143, United States
- Quantitative Biosciences Institute, University of California, San Francisco, California 94143, United States
| |
Collapse
|
30
|
Stachowski TR, Fischer M. FLEXR: automated multi-conformer model building using electron-density map sampling. Acta Crystallogr D Struct Biol 2023; 79:354-367. [PMID: 37071395 PMCID: PMC10167668 DOI: 10.1107/s2059798323002498] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 03/13/2023] [Indexed: 04/19/2023] Open
Abstract
Protein conformational dynamics that may inform biology often lie dormant in high-resolution electron-density maps. While an estimated ∼18% of side chains in high-resolution models contain alternative conformations, these are underrepresented in current PDB models due to difficulties in manually detecting, building and inspecting alternative conformers. To overcome this challenge, we developed an automated multi-conformer modeling program, FLEXR. Using Ringer-based electron-density sampling, FLEXR builds explicit multi-conformer models for refinement. Thereby, it bridges the gap of detecting hidden alternate states in electron-density maps and including them in structural models for refinement, inspection and deposition. Using a series of high-quality crystal structures (0.8-1.85 Å resolution), we show that the multi-conformer models produced by FLEXR uncover new insights that are missing in models built either manually or using current tools. Specifically, FLEXR models revealed hidden side chains and backbone conformations in ligand-binding sites that may redefine protein-ligand binding mechanisms. Ultimately, the tool facilitates crystallographers with opportunities to include explicit multi-conformer states in their high-resolution crystallographic models. One key advantage is that such models may better reflect interesting higher energy features in electron-density maps that are rarely consulted by the community at large, which can then be productively used for ligand discovery downstream. FLEXR is open source and publicly available on GitHub at https://github.com/TheFischerLab/FLEXR.
Collapse
Affiliation(s)
- Timothy R. Stachowski
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| | - Marcus Fischer
- Department of Chemical Biology and Therapeutics, St Jude Children’s Research Hospital, Memphis, TN 38105, USA
| |
Collapse
|
31
|
Doukov T, Herschlag D, Yabukarski F. Obtaining anomalous and ensemble information from protein crystals from 220 K up to physiological temperatures. Acta Crystallogr D Struct Biol 2023; 79:212-223. [PMID: 36876431 PMCID: PMC9986799 DOI: 10.1107/s205979832300089x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2022] [Accepted: 01/31/2023] [Indexed: 03/01/2023] Open
Abstract
X-ray crystallography has been invaluable in delivering structural information about proteins. Previously, an approach has been developed that allows high-quality X-ray diffraction data to be obtained from protein crystals at and above room temperature. Here, this previous work is built on and extended by showing that high-quality anomalous signal can be obtained from single protein crystals using diffraction data collected at 220 K up to physiological temperatures. The anomalous signal can be used to directly determine the structure of a protein, i.e. to phase the data, as is routinely performed under cryoconditions. This ability is demonstrated by obtaining diffraction data from model lysozyme, thaumatin and proteinase K crystals, the anomalous signal from which allowed their structures to be solved experimentally at 7.1 keV X-ray energy and at room temperature with relatively low data redundancy. It is also demonstrated that the anomalous signal from diffraction data obtained at 310 K (37°C) can be used to solve the structure of proteinase K and to identify ordered ions. The method provides useful anomalous signal at temperatures down to 220 K, resulting in an extended crystal lifetime and increased data redundancy. Finally, we show that useful anomalous signal can be obtained at room temperature using X-rays of 12 keV energy as typically used for routine data collection, allowing this type of experiment to be carried out at widely accessible synchrotron beamline energies and enabling the simultaneous extraction of high-resolution data and anomalous signal. With the recent emphasis on obtaining conformational ensemble information for proteins, the high resolution of the data allows such ensembles to be built, while the anomalous signal allows the structure to be experimentally solved, ions to be identified, and water molecules and ions to be differentiated. Because bound metal-, phosphorus- and sulfur-containing ions all have anomalous signal, obtaining anomalous signal across temperatures and up to physiological temperatures will provide a more complete description of protein conformational ensembles, function and energetics.
Collapse
Affiliation(s)
- Tzanko Doukov
- SMB, Stanford Synchrotron Radiation Lightsource, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, Menlo Park, CA 94025, USA
| | - Daniel Herschlag
- Deparment of Biochemistry, Stanford University, Stanford, CA 94305, USA
- Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA
- Stanford ChEM-H, Stanford University, Stanford, CA 94305, USA
| | - Filip Yabukarski
- Deparment of Biochemistry, Stanford University, Stanford, CA 94305, USA
- Bristol-Myers Squibb, San Diego, CA 92121, USA
| |
Collapse
|
32
|
Ginn HM. Torsion angles to map and visualize the conformational space of a protein. Protein Sci 2023; 32:e4608. [PMID: 36840926 PMCID: PMC10022581 DOI: 10.1002/pro.4608] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2022] [Revised: 02/06/2023] [Accepted: 02/22/2023] [Indexed: 02/26/2023]
Abstract
Present understanding of protein structure dynamics trails behind that of static structures. A torsion-angle based approach, called representation of protein entities (RoPE), derives an interpretable conformational space which correlates with data collection temperature, resolution and reaction coordinate. For more complex systems, atomic coordinates fail to separate functional conformational states, which are still preserved by torsion angle-derived space. This indicates that torsion angles are often a more sensitive and biologically relevant descriptor for protein conformational dynamics than atomic coordinates. This article is protected by copyright. All rights reserved.
Collapse
Affiliation(s)
- Helen Mary Ginn
- Diamond Light Source Ltd, Harwell Science and Innovation Campus, Didcot OX11 0DE, United Kingdom; Institute for Nanostructure and Solid State Physics, Hamburg, Germany
| |
Collapse
|
33
|
Jain AN, Brueckner AC, Cleves AE, Reibarkh M, Sherer EC. A Distributional Model of Bound Ligand Conformational Strain: From Small Molecules up to Large Peptidic Macrocycles. J Med Chem 2023; 66:1955-1971. [PMID: 36701387 PMCID: PMC9923749 DOI: 10.1021/acs.jmedchem.2c01744] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Abstract
The internal conformational strain incurred by ligands upon binding a target site has a critical impact on binding affinity, and expectations about the magnitude of ligand strain guide conformational search protocols. Estimates for bound ligand strain begin with modeled ligand atomic coordinates from X-ray co-crystal structures. By deriving low-energy conformational ensembles to fit X-ray diffraction data, calculated strain energies are substantially reduced compared with prior approaches. We show that the distribution of expected global strain energy values is dependent on molecular size in a superlinear manner. The distribution of strain energy follows a rectified normal distribution whose mean and variance are related to conformational complexity. The modeled strain distribution closely matches calculated strain values from experimental data comprising over 3000 protein-ligand complexes. The distributional model has direct implications for conformational search protocols as well as for directions in molecular design.
Collapse
Affiliation(s)
- Ajay N. Jain
- Research
& Development, BioPharmics LLC, Sonoma County, California95404, United States,
| | - Alexander C. Brueckner
- Molecular
Structure & Design, Bristol Myers Squibb, Princeton, New Jersey08543, United States
| | - Ann E. Cleves
- Research
& Development, BioPharmics LLC, Sonoma County, California95404, United States
| | - Mikhail Reibarkh
- Analytical
Research and Development, Merck & Co.
Inc., Rahway, New Jersey07065, United States
| | - Edward C. Sherer
- Analytical
Research and Development, Merck & Co.
Inc., Rahway, New Jersey07065, United States,
| |
Collapse
|
34
|
Wych DC, Aoto PC, Vu L, Wolff AM, Mobley DL, Fraser JS, Taylor SS, Wall ME. Molecular-dynamics simulation methods for macromolecular crystallography. Acta Crystallogr D Struct Biol 2023; 79:50-65. [PMID: 36601807 PMCID: PMC9815100 DOI: 10.1107/s2059798322011871] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2022] [Accepted: 12/13/2022] [Indexed: 12/31/2022] Open
Abstract
It is investigated whether molecular-dynamics (MD) simulations can be used to enhance macromolecular crystallography (MX) studies. Historically, protein crystal structures have been described using a single set of atomic coordinates. Because conformational variation is important for protein function, researchers now often build models that contain multiple structures. Methods for building such models can fail, however, in regions where the crystallographic density is difficult to interpret, for example at the protein-solvent interface. To address this limitation, a set of MD-MX methods that combine MD simulations of protein crystals with conventional modeling and refinement tools have been developed. In an application to a cyclic adenosine monophosphate-dependent protein kinase at room temperature, the procedure improved the interpretation of ambiguous density, yielding an alternative water model and a revised protein model including multiple conformations. The revised model provides mechanistic insights into the catalytic and regulatory interactions of the enzyme. The same methods may be used in other MX studies to seek mechanistic insights.
Collapse
Affiliation(s)
- David C. Wych
- Computer, Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
- Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA 92697, USA
| | - Phillip C. Aoto
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Lily Vu
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
| | - Alexander M. Wolff
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - David L. Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, Irvine, CA 92697, USA
- Department of Chemistry, University of California, Irvine, Irvine, CA 92697, USA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Susan S. Taylor
- Department of Pharmacology, University of California, San Diego, La Jolla, CA 92093, USA
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, CA 92093, USA
| | - Michael E. Wall
- Computer, Computational and Statistical Sciences Division, Los Alamos National Laboratory, Los Alamos, NM 87545, USA
| |
Collapse
|
35
|
A pocket-based 3D molecule generative model fueled by experimental electron density. Sci Rep 2022; 12:15100. [PMID: 36068257 PMCID: PMC9448726 DOI: 10.1038/s41598-022-19363-6] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2022] [Accepted: 08/29/2022] [Indexed: 11/08/2022] Open
Abstract
We report for the first time the use of experimental electron density (ED) as training data for the generation of drug-like three-dimensional molecules based on the structure of a target protein pocket. Similar to a structural biologist building molecules based on their ED, our model functions with two main components: a generative adversarial network (GAN) to generate the ligand ED in the input pocket and an ED interpretation module for molecule generation. The model was tested on three targets: a kinase (hematopoietic progenitor kinase 1), protease (SARS-CoV-2 main protease), and nuclear receptor (vitamin D receptor), and evaluated with a reference dataset composed of over 8000 compounds that have their activities reported in the literature. The evaluation considered the chemical validity, chemical space distribution-based diversity, and similarity with reference active compounds concerning the molecular structure and pocket-binding mode. Our model can generate molecules with similar structures to classical active compounds and novel compounds sharing similar binding modes with active compounds, making it a promising tool for library generation supporting high-throughput virtual screening. The ligand ED generated can also be used to support fragment-based drug design. Our model is available as an online service to academic users via https://edmg.stonewise.cn/#/create .
Collapse
|
36
|
Ebrahim A, Riley BT, Kumaran D, Andi B, Fuchs MR, McSweeney S, Keedy DA. The temperature-dependent conformational ensemble of SARS-CoV-2 main protease (M pro). IUCRJ 2022; 9:682-694. [PMID: 36071812 PMCID: PMC9438506 DOI: 10.1107/s2052252522007497] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Accepted: 07/21/2022] [Indexed: 05/12/2023]
Abstract
The COVID-19 pandemic, instigated by the SARS-CoV-2 coronavirus, continues to plague the globe. The SARS-CoV-2 main protease, or Mpro, is a promising target for the development of novel antiviral therapeutics. Previous X-ray crystal structures of Mpro were obtained at cryogenic tem-per-ature or room tem-per-ature only. Here we report a series of high-resolution crystal structures of unliganded Mpro across multiple tem-per-atures from cryogenic to physiological, and another at high humidity. We inter-rogate these data sets with parsimonious multiconformer models, multi-copy ensemble models, and isomorphous difference density maps. Our analysis reveals a perturbation-dependent conformational landscape for Mpro, including a mobile zinc ion inter-leaved between the catalytic dyad, mercurial conformational heterogeneity at various sites including a key substrate-binding loop, and a far-reaching intra-molecular network bridging the active site and dimer inter-face. Our results may inspire new strategies for antiviral drug development to aid preparation for future coronavirus pandemics.
Collapse
Affiliation(s)
- Ali Ebrahim
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot, OX11 0DE, England, United Kingdom
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
| | - Desigan Kumaran
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Babak Andi
- Center for BioMolecular Structure, NSLS-II, Brookhaven National Laboratory, Upton, NY 11973, USA
- National Virtual Biotechnology Laboratory (NVBL), US Department of Energy, Washington, DC, USA
| | - Martin R. Fuchs
- Center for BioMolecular Structure, NSLS-II, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Sean McSweeney
- Center for BioMolecular Structure, NSLS-II, Brookhaven National Laboratory, Upton, NY 11973, USA
- National Virtual Biotechnology Laboratory (NVBL), US Department of Energy, Washington, DC, USA
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031, USA
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031, USA
- PhD Programs in Biochemistry, Biology, and Chemistry, The Graduate Center–City University of New York, New York, NY 10016, USA
| |
Collapse
|
37
|
Wankowicz SA, de Oliveira SH, Hogan DW, van den Bedem H, Fraser JS. Ligand binding remodels protein side-chain conformational heterogeneity. eLife 2022; 11:e74114. [PMID: 35312477 PMCID: PMC9084896 DOI: 10.7554/elife.74114] [Citation(s) in RCA: 49] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2021] [Accepted: 03/18/2022] [Indexed: 12/15/2022] Open
Abstract
While protein conformational heterogeneity plays an important role in many aspects of biological function, including ligand binding, its impact has been difficult to quantify. Macromolecular X-ray diffraction is commonly interpreted with a static structure, but it can provide information on both the anharmonic and harmonic contributions to conformational heterogeneity. Here, through multiconformer modeling of time- and space-averaged electron density, we measure conformational heterogeneity of 743 stringently matched pairs of crystallographic datasets that reflect unbound/apo and ligand-bound/holo states. When comparing the conformational heterogeneity of side chains, we observe that when binding site residues become more rigid upon ligand binding, distant residues tend to become more flexible, especially in non-solvent-exposed regions. Among ligand properties, we observe increased protein flexibility as the number of hydrogen bonds decreases and relative hydrophobicity increases. Across a series of 13 inhibitor-bound structures of CDK2, we find that conformational heterogeneity is correlated with inhibitor features and identify how conformational changes propagate differences in conformational heterogeneity away from the binding site. Collectively, our findings agree with models emerging from nuclear magnetic resonance studies suggesting that residual side-chain entropy can modulate affinity and point to the need to integrate both static conformational changes and conformational heterogeneity in models of ligand binding.
Collapse
Affiliation(s)
- Stephanie A Wankowicz
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Biophysics Graduate Program, University of California San FranciscoSan FranciscoUnited States
| | | | - Daniel W Hogan
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
- Atomwise Inc.San FranciscoUnited States
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San FranciscoSan FranciscoUnited States
| |
Collapse
|
38
|
Schulz EC, Yorke BA, Pearson AR, Mehrabi P. Best practices for time-resolved serial synchrotron crystallography. Acta Crystallogr D Struct Biol 2022; 78:14-29. [PMID: 34981758 PMCID: PMC8725164 DOI: 10.1107/s2059798321011621] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2021] [Accepted: 11/03/2021] [Indexed: 11/10/2022] Open
Abstract
With recent developments in X-ray sources, instrumentation and data-analysis tools, time-resolved crystallographic experiments, which were originally the preserve of a few expert groups, are becoming simpler and can be carried out at more radiation sources, and are thus increasingly accessible to a growing user base. However, these experiments are just that: discrete experiments, not just `data collections'. As such, careful planning and consideration of potential pitfalls is required to enable a successful experiment. Here, some of the key factors that should be considered during the planning and execution of a time-resolved structural study are outlined, with a particular focus on synchrotron-based experiments.
Collapse
Affiliation(s)
- Eike C. Schulz
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, HARBOR, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Briony A. Yorke
- School of Chemistry and Bioscience, University of Bradford, Bradford BD7 1DP, United Kingdom
| | - Arwen R. Pearson
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, HARBOR, Luruper Chaussee 149, 22761 Hamburg, Germany
- Hamburg Centre for Ultrafast Imaging, Universität Hamburg, HARBOR, Luruper Chaussee 149, 22761 Hamburg, Germany
| | - Pedram Mehrabi
- Institute for Nanostructure and Solid State Physics, Universität Hamburg, HARBOR, Luruper Chaussee 149, 22761 Hamburg, Germany
| |
Collapse
|
39
|
Shi H, Kimsey IJ, Gu S, Liu HF, Pham U, Schumacher MA, Al-Hashimi HM. Revealing A-T and G-C Hoogsteen base pairs in stressed protein-bound duplex DNA. Nucleic Acids Res 2021; 49:12540-12555. [PMID: 34792150 PMCID: PMC8643651 DOI: 10.1093/nar/gkab936] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 09/25/2021] [Accepted: 11/14/2021] [Indexed: 11/17/2022] Open
Abstract
Watson–Crick base pairs (bps) are the fundamental unit of genetic information and the building blocks of the DNA double helix. However, A-T and G-C can also form alternative ‘Hoogsteen’ bps, expanding the functional complexity of DNA. We developed ‘Hoog-finder’, which uses structural fingerprints to rapidly screen Hoogsteen bps, which may have been mismodeled as Watson–Crick in crystal structures of protein–DNA complexes. We uncovered 17 Hoogsteen bps, 7 of which were in complex with 6 proteins never before shown to bind Hoogsteen bps. The Hoogsteen bps occur near mismatches, nicks and lesions and some appear to participate in recognition and damage repair. Our results suggest a potentially broad role for Hoogsteen bps in stressed regions of the genome and call for a community-wide effort to identify these bps in current and future crystal structures of DNA and its complexes.
Collapse
Affiliation(s)
- Honglue Shi
- Department of Chemistry, Duke University, Durham, NC 27710, USA
| | - Isaac J Kimsey
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Stephanie Gu
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Hsuan-Fu Liu
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Uyen Pham
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Maria A Schumacher
- Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| | - Hashim M Al-Hashimi
- Department of Chemistry, Duke University, Durham, NC 27710, USA.,Department of Biochemistry, Duke University School of Medicine, Durham, NC 27710, USA
| |
Collapse
|
40
|
Ebrahim A, Riley BT, Kumaran D, Andi B, Fuchs MR, McSweeney S, Keedy DA. The temperature-dependent conformational ensemble of SARS-CoV-2 main protease (M pro). BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2021:2021.05.03.437411. [PMID: 33972941 PMCID: PMC8109201 DOI: 10.1101/2021.05.03.437411] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The COVID-19 pandemic, instigated by the SARS-CoV-2 coronavirus, continues to plague the globe. The SARS-CoV-2 main protease, or Mpro, is a promising target for development of novel antiviral therapeutics. Previous X-ray crystal structures of Mpro were obtained at cryogenic temperature or room temperature only. Here we report a series of high-resolution crystal structures of unliganded Mpro across multiple temperatures from cryogenic to physiological, and another at high humidity. We interrogate these datasets with parsimonious multiconformer models, multi-copy ensemble models, and isomorphous difference density maps. Our analysis reveals a temperature-dependent conformational landscape for Mpro, including mobile solvent interleaved between the catalytic dyad, mercurial conformational heterogeneity in a key substrate-binding loop, and a far-reaching intramolecular network bridging the active site and dimer interface. Our results may inspire new strategies for antiviral drug development to counter-punch COVID-19 and combat future coronavirus pandemics.
Collapse
Affiliation(s)
- Ali Ebrahim
- Diamond Light Source, Harwell Science and Innovation Campus, Didcot OX11 0DE, England
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Blake T. Riley
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
| | - Desigan Kumaran
- Biology Department, Brookhaven National Laboratory, Upton, NY 11973
| | - Babak Andi
- National Synchrotron Light Source II, Brookhaven National Laboratory, Upton, NY 11973
- National Virtual Biotechnology Laboratory (NVBL), US Department of Energy, Washington, DC, United States
| | - Martin R. Fuchs
- National Synchrotron Light Source II, Brookhaven National Laboratory, Upton, NY 11973
| | - Sean McSweeney
- National Synchrotron Light Source II, Brookhaven National Laboratory, Upton, NY 11973
- National Virtual Biotechnology Laboratory (NVBL), US Department of Energy, Washington, DC, United States
| | - Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, NY 10031
- Department of Chemistry and Biochemistry, City College of New York, New York, NY 10031
- Ph.D. Programs in Biochemistry, Biology, and Chemistry, The Graduate Center – City University of New York, New York, NY 10016
| |
Collapse
|
41
|
Riley BT, Wankowicz SA, de Oliveira SHP, van Zundert GCP, Hogan DW, Fraser JS, Keedy DA, van den Bedem H. qFit 3: Protein and ligand multiconformer modeling for X-ray crystallographic and single-particle cryo-EM density maps. Protein Sci 2021; 30:270-285. [PMID: 33210433 PMCID: PMC7737783 DOI: 10.1002/pro.4001] [Citation(s) in RCA: 40] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2020] [Revised: 11/10/2020] [Accepted: 11/17/2020] [Indexed: 01/04/2023]
Abstract
New X-ray crystallography and cryo-electron microscopy (cryo-EM) approaches yield vast amounts of structural data from dynamic proteins and their complexes. Modeling the full conformational ensemble can provide important biological insights, but identifying and modeling an internally consistent set of alternate conformations remains a formidable challenge. qFit efficiently automates this process by generating a parsimonious multiconformer model. We refactored qFit from a distributed application into software that runs efficiently on a small server, desktop, or laptop. We describe the new qFit 3 software and provide some examples. qFit 3 is open-source under the MIT license, and is available at https://github.com/ExcitedStates/qfit-3.0.
Collapse
Affiliation(s)
- Blake T. Riley
- Structural Biology InitiativeCUNY Advanced Science Research CenterNew YorkNew YorkUSA
| | - Stephanie A. Wankowicz
- Department of Bioengineering and Therapeutic SciencesUniversity of California San FranciscoSan FranciscoCaliforniaUSA
- Biophysics Graduate ProgramUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | | | | | - Daniel W. Hogan
- Department of Bioengineering and Therapeutic SciencesUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - James S. Fraser
- Department of Bioengineering and Therapeutic SciencesUniversity of California San FranciscoSan FranciscoCaliforniaUSA
| | - Daniel A. Keedy
- Structural Biology InitiativeCUNY Advanced Science Research CenterNew YorkNew YorkUSA
- Department of Chemistry and BiochemistryCity College of New YorkNew YorkNew YorkUSA
- Ph.D. Programs in Biochemistry, Biology, and ChemistryThe Graduate Center, City University of New YorkNew YorkUSA
| | - Henry van den Bedem
- Department of Bioengineering and Therapeutic SciencesUniversity of California San FranciscoSan FranciscoCaliforniaUSA
- Atomwise, Inc.San FranciscoCaliforniaUSA
| |
Collapse
|