1
Williams RV, Rogals MJ, Eletsky A, Huang C, Morris LC, Moremen KW, Prestegard JH. AssignSLP_GUI, a software tool exploiting AI for NMR resonance assignment of sparsely labeled proteins. Journal of Magnetic Resonance (San Diego, Calif. : 1997) 2022; 345:107336. [PMID: 36442299; PMCID: PMC9742323; DOI: 10.1016/j.jmr.2022.107336] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 08/04/2022] [Revised: 11/12/2022] [Accepted: 11/15/2022] [Indexed: 05/06/2023]
Abstract
Not all proteins are amenable to uniform isotopic labeling with 13C and 15N, something needed for the widely used, and largely deductive, triple resonance assignment process. Among them are proteins expressed in mammalian cell culture where native glycosylation can be maintained, and proper formation of disulfide bonds facilitated. Uniform labeling in mammalian cells is prohibitively expensive, but sparse labeling with one or a few isotopically enriched amino acid types is an option for these proteins. However, assignment then relies on assessing the best match between a variety of measured NMR parameters and predictions based on 3D structure, often from X-ray crystallography. Finding this match is a challenging process that has benefitted from many computational tools, including trained neural nets for chemical shift prediction, genetic algorithms for searches through a myriad of assignment possibilities, and now AI-based prediction of high-quality structures for protein targets. AssignSLP_GUI, a new version of a software package for assignment of resonances from sparsely-labeled proteins, uses many of these tools. These tools and new additions to the package are highlighted in an application to a sparsely-labeled domain from a glycoprotein, CEACAM1.
Affiliation(s)
- Robert V Williams
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- Monique J Rogals
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- Alexander Eletsky
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- Chin Huang
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- Laura C Morris
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- Kelley W Moremen
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
- James H Prestegard
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
2
Pritišanac I, Alderson TR, Güntert P. Automated assignment of methyl NMR spectra from large proteins. Progress in Nuclear Magnetic Resonance Spectroscopy 2020; 118-119:54-73. [PMID: 32883449; DOI: 10.1016/j.pnmrs.2020.04.001] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 02/25/2020] [Revised: 04/15/2020] [Accepted: 04/17/2020] [Indexed: 05/05/2023]
Abstract
As structural biology trends towards larger and more complex biomolecular targets, a detailed understanding of their interactions and underlying structures and dynamics is required. The development of methyl-TROSY has enabled NMR spectroscopy to provide atomic-resolution insight into the mechanisms of large molecular assemblies in solution. However, the applicability of methyl-TROSY has been hindered by the laborious and time-consuming resonance assignment process, typically performed with domain fragmentation, site-directed mutagenesis, and analysis of NOE data in the context of a crystal structure. In response, several structure-based automatic methyl assignment strategies have been developed over the past decade. Here, we present a comprehensive analysis of all available methods and compare their input data requirements, algorithmic strategies, and reported performance. In general, the methods fall into two categories: those that primarily rely on inter-methyl NOEs, and those that utilize methyl PRE- and PCS-based restraints. We discuss their advantages and limitations, and highlight the potential benefits from standardizing and combining different methods.
Affiliation(s)
- Iva Pritišanac
  - Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt am Main, 60438 Frankfurt am Main, Germany
- T Reid Alderson
  - Laboratory of Chemical Physics, NIDDK, National Institutes of Health, Bethesda, MD 20892, USA
- Peter Güntert
  - Institute of Biophysical Chemistry, Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt am Main, 60438 Frankfurt am Main, Germany
  - Laboratory of Physical Chemistry, ETH Zürich, 8093 Zürich, Switzerland
  - Department of Chemistry, Tokyo Metropolitan University, Hachioji, Tokyo 192-0397, Japan
3
Kooijman L, Ansorge P, Schuster M, Baumann C, Löhr F, Jurt S, Güntert P, Zerbe O. Backbone and methyl assignment of bacteriorhodopsin incorporated into nanodiscs. Journal of Biomolecular NMR 2020; 74:45-60. [PMID: 31754899; PMCID: PMC7015963; DOI: 10.1007/s10858-019-00289-7] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Received: 09/10/2019] [Accepted: 11/11/2019] [Indexed: 05/21/2023]
Abstract
Resonance assignments are challenging for membrane proteins due to the size of the lipid/detergent-protein complex and the presence of line-broadening from conformational exchange. As a consequence, many correlations are missing in the triple-resonance NMR experiments typically used for assignments. Herein, we present an approach in which correlations from these solution-state NMR experiments are supplemented by data from 13C unlabeling, single-amino acid type labeling, 4D NOESY data and proximity of moieties to lipids or water in combination with a structure of the protein. These additional data are used to edit the expected peaklists for the automated assignment protocol FLYA, a module of the program package CYANA. We demonstrate application of the protocol to the 262-residue proton pump from archaeal bacteriorhodopsin (bR) in lipid nanodiscs. The lipid-protein assembly is characterized by an overall correlation time of 44 ns. The protocol yielded assignments for 62% of all backbone (H, N, Cα, Cβ, C') resonances of bR, corresponding to 74% of all observed backbone spin systems, and 60% of the Ala, Met, Ile (δ1), Leu and Val methyl groups, thus enabling assignment of a large fraction of the protein without mutagenesis data. Most missing resonances stem from the extracellular half, likely due to intermediate-exchange line-broadening. Further analysis revealed that missing information on the amino acid type of the preceding residue is the largest problem, and that 4D NOESY experiments are particularly helpful in compensating for that information loss.
Affiliation(s)
- Laurens Kooijman
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
- Philipp Ansorge
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
- Matthias Schuster
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
- Christian Baumann
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
- Frank Löhr
  - Institute of Biophysical Chemistry and Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt, Max-von-Laue-Straße 9, 60438, Frankfurt am Main, Germany
- Simon Jurt
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
- Peter Güntert
  - Institute of Biophysical Chemistry and Center for Biomolecular Magnetic Resonance, Goethe University Frankfurt, Max-von-Laue-Straße 9, 60438, Frankfurt am Main, Germany
  - Laboratory of Physical Chemistry, ETH Zürich, Vladimir-Prelog-Weg 1-5/10, 8093, Zurich, Switzerland
  - Department of Chemistry, Tokyo Metropolitan University, 1-1 Minami-Osawa, Hachioji, Tokyo, 192-0397, Japan
- Oliver Zerbe
  - Department of Chemistry, University of Zurich, Winterthurerstrasse 190, 8057, Zurich, Switzerland
4
NMR Resonance Assignment Methodology: Characterizing Large Sparsely Labeled Glycoproteins. J Mol Biol 2019; 431:2369-2382. [PMID: 31034888; DOI: 10.1016/j.jmb.2019.04.029] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Received: 01/10/2019] [Revised: 04/17/2019] [Accepted: 04/18/2019] [Indexed: 01/02/2023]
Abstract
Characterization of proteins using NMR methods begins with assignment of resonances to specific residues. This is usually accomplished using sequential connectivities between nuclear pairs in proteins uniformly labeled with NMR active isotopes. This becomes impractical for larger proteins, and especially for proteins that are best expressed in mammalian cells, including glycoproteins. Here an alternate protocol for the assignment of NMR resonances of sparsely labeled proteins, namely, the ones labeled with a single amino acid type, or a limited subset of types, isotopically enriched with 15N or 13C, is described. The protocol is based on comparison of data collected using extensions of simple two-dimensional NMR experiments (correlated chemical shifts, nuclear Overhauser effects, residual dipolar couplings) to predictions from molecular dynamics trajectories that begin with known protein structures. Optimal pairing of predicted and experimental values is facilitated by a software package that employs a genetic algorithm, ASSIGN_SLP_MD. The approach is applied to the 36-kDa luminal domain of the sialyltransferase, rST6Gal1, in which all phenylalanines are labeled with 15N, and the results are validated by elimination of resonances via single-point mutations of selected phenylalanines to tyrosines. Assignment allows the use of previously published paramagnetic relaxation enhancements to evaluate placement of a substrate analog in the active site of this protein. The protocol will open the way to structural characterization of the many glycosylated and other proteins that are best expressed in mammalian cells.
5
Williams RV, Yang JY, Moremen KW, Amster IJ, Prestegard JH. Measurement of residual dipolar couplings in methyl groups via carbon detection. Journal of Biomolecular NMR 2019; 73:191-198. [PMID: 31041649; PMCID: PMC7020099; DOI: 10.1007/s10858-019-00245-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Received: 01/31/2019] [Accepted: 03/30/2019] [Indexed: 06/09/2023]
Abstract
Residual dipolar couplings (RDCs) provide both structural and dynamical information useful in the characterization of biological macromolecules. While most data come from the interaction of simple pairs of directly bonded spin-1/2 nuclei (1H-15N, 1H-13C, 1H-1H), it is possible to acquire data from interactions among the multiple spins of 13C-labeled methyl groups (1H3-13C). This is especially important because of the advantages that observation of 13C-labeled methyl groups offers in working with very large molecules. Here we consider some of the options for measurement of methyl RDCs in large and often fully protonated proteins and arrive at a pulse sequence that exploits both J-modulation and direct detection of 13C. Its utility is illustrated by application to a fully protonated two domain fragment from the mammalian glycoprotein, Robo1, 13C-methyl-labeled in all valines.
Affiliation(s)
- Robert V Williams
  - Department of Chemistry, University of Georgia, Athens, GA, USA
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
- Jeong-Yeh Yang
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
- Kelley W Moremen
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
- James H Prestegard
  - Department of Chemistry, University of Georgia, Athens, GA, USA
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, GA, USA
  - Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
6
Moure MJ, Eletsky A, Gao Q, Morris LC, Yang JY, Chapla D, Zhao Y, Zong C, Amster IJ, Moremen KW, Boons GJ, Prestegard JH. Paramagnetic Tag for Glycosylation Sites in Glycoproteins: Structural Constraints on Heparan Sulfate Binding to Robo1. ACS Chem Biol 2018; 13:2560-2567. [PMID: 30063822; PMCID: PMC6161356; DOI: 10.1021/acschembio.8b00511] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Indexed: 11/30/2022]
Abstract
An enzyme- and click chemistry-mediated methodology for the site-specific nitroxide spin labeling of glycoproteins has been developed and applied. The procedure relies on the presence of single N-glycosylation sites that are present natively in proteins or that can be engineered into glycoproteins by mutational elimination of all but one glycosylation site. Recombinantly expressing glycoproteins in HEK293S (GnT1-) cells results in N-glycans with high-mannose structures that can be processed to leave a single GlcNAc residue. This can in turn be modified by enzymatic addition of a GalNAz residue that is subject to reaction with an alkyne-carrying TEMPO moiety using copper(I)-catalyzed click chemistry. To illustrate the procedure, we have made an application to a two-domain construct of Robo1, a protein that carries a single N-glycosylation site in its N-terminal domains. The construct has also been labeled with 15N at amide nitrogens of lysine residues to provide a set of sites that are used to derive an effective location of the paramagnetic nitroxide moiety of the TEMPO group. This, in turn, allowed measurements of paramagnetic perturbations to the spectra of a new high affinity heparan sulfate ligand. Calculation of distance constraints from these data facilitated determination of an atomic level model for the docked complex.
Affiliation(s)
- Maria J. Moure
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Alexander Eletsky
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Qi Gao
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
  - Department of Chemistry, University of Georgia, Athens, Georgia 30602, United States
- Laura C. Morris
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Jeong-Yeh Yang
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Digantkumar Chapla
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Yuejie Zhao
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
  - Department of Chemistry, University of Georgia, Athens, Georgia 30602, United States
- Chengli Zong
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- I. Jonathan Amster
  - Department of Chemistry, University of Georgia, Athens, Georgia 30602, United States
- Kelley W. Moremen
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, United States
- Geert-Jan Boons
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
  - Department of Chemistry, University of Georgia, Athens, Georgia 30602, United States
  - Department of Chemical Biology and Drug Discovery, Utrecht Institute for Pharmaceutical Sciences, and Bijvoet Center for Biomolecular Research, Utrecht University, Utrecht, The Netherlands
- James H. Prestegard
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
  - Department of Chemistry, University of Georgia, Athens, Georgia 30602, United States
  - Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, United States
7
Gao Q, Yang JY, Moremen KW, Flanagan JG, Prestegard JH. Structural Characterization of a Heparan Sulfate Pentamer Interacting with LAR-Ig1-2. Biochemistry 2018; 57:2189-2199. [PMID: 29570275; DOI: 10.1021/acs.biochem.8b00241] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Indexed: 01/08/2023]
Abstract
Leukocyte common antigen-related (LAR) protein is one of the type IIa receptor protein tyrosine phosphatases (RPTPs) that are important for signal transduction in biological processes, including axon growth and regeneration. Glycosaminoglycan chains, including heparan sulfate (HS) and chondroitin sulfate (CS), act as ligands that regulate LAR signaling. Here, we report the structural characterization of the first two immunoglobulin domains (Ig1-2) of LAR interacting with an HS pentasaccharide (GlcNS6S-GlcA-GlcNS3,6S-IdoA2S-GlcNS6S-OME, fondaparinux) using multiple solution-based NMR methods. In the course of the study, we extended an assignment strategy useful for sparsely labeled proteins expressed in mammalian cell culture supplemented with a single type of isotopically enriched amino acid ([15N]-Lys in this case) by including paramagnetic perturbations to NMR resonances. The folded two-domain structure for LAR-Ig1-2 seen in previous crystal structures has been validated in solution using residual dipolar coupling data, and a combination of chemical shift perturbation on titration of LAR-Ig1-2 with fondaparinux, saturation transfer difference (STD) spectra, and transferred nuclear Overhauser effects (trNOEs) has been employed in the docking program HADDOCK to generate models for the LAR-fondaparinux complex. These models are further analyzed by postprocessing energetic analysis to identify key binding interactions. In addition to providing insight into the ligand interaction mechanisms of type IIa RPTPs and the origin of opposing effects of CS and HS ligands, these results may assist in future design of therapeutic compounds for nervous system repair.
Affiliation(s)
- Qi Gao
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Jeong-Yeh Yang
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- Kelley W Moremen
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
- John G Flanagan
  - Department of Cell Biology and Program in Neuroscience, Harvard Medical School, Boston, Massachusetts 02115, United States
- James H Prestegard
  - Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, United States
8
Pederson K, Chalmers GR, Gao Q, Elnatan D, Ramelot TA, Ma LC, Montelione GT, Kennedy MA, Agard DA, Prestegard JH. NMR characterization of HtpG, the E. coli Hsp90, using sparse labeling with 13C-methyl alanine. Journal of Biomolecular NMR 2017; 68:225-236. [PMID: 28653216; PMCID: PMC5546222; DOI: 10.1007/s10858-017-0123-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Received: 03/03/2017] [Accepted: 06/22/2017] [Indexed: 05/03/2023]
Abstract
A strategy for acquiring structural information from sparsely isotopically labeled large proteins is illustrated with an application to the E. coli heat-shock protein, HtpG (high temperature protein G), a 145 kDa dimer. It uses 13C-alanine methyl labeling in a perdeuterated background to take advantage of the sensitivity and resolution of Methyl-TROSY spectra, as well as the backbone-centered structural information from 1H-13C residual dipolar couplings (RDCs) of alanine methyl groups. In all, 40 of the 47 expected crosspeaks were resolved and 36 gave RDC data. Assignments of crosspeaks were partially achieved by transferring assignments from those made on individual domains using triple resonance methods. However, these were incomplete and in many cases the transfer was ambiguous. A genetic algorithm search for consistency between predictions based on domain structures and measurements for chemical shifts and RDCs allowed 60% of the 40 resolved crosspeaks to be assigned with confidence. Chemical shift changes of these crosspeaks on adding an ATP analog to the apo-protein are shown to be consistent with structural changes expected on comparing previous crystal structures for apo- and complex- structures. RDCs collected on the assigned alanine methyl peaks are used to generate a new solution model for the apo-protein structure.
Affiliation(s)
- Kari Pederson
  - Complex Carbohydrate Research Center, University of Georgia, Athens, USA
- Gordon R Chalmers
  - Complex Carbohydrate Research Center, University of Georgia, Athens, USA
  - Department of Computer Science, University of Georgia, Athens, USA
- Qi Gao
  - Complex Carbohydrate Research Center, University of Georgia, Athens, USA
- Daniel Elnatan
  - Department of Biochemistry and Biophysics, Howard Hughes Medical Institute, University of California, San Francisco, USA
- Theresa A Ramelot
  - Department of Chemistry and Biochemistry, Miami University, Oxford, USA
- Li-Chung Ma
  - Department of Molecular Biology and Biochemistry, Center for Advanced Biotechnology and Medicine, The State University of New Jersey, Piscataway, USA
  - Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, USA
- Gaetano T Montelione
  - Department of Molecular Biology and Biochemistry, Center for Advanced Biotechnology and Medicine, The State University of New Jersey, Piscataway, USA
  - Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, USA
- Michael A Kennedy
  - Department of Chemistry and Biochemistry, Miami University, Oxford, USA
- David A Agard
  - Department of Biochemistry and Biophysics, Howard Hughes Medical Institute, University of California, San Francisco, USA
- James H Prestegard
  - Complex Carbohydrate Research Center, University of Georgia, Athens, USA