1
|
The NS1 protein of influenza B virus binds 5'-triphosphorylated dsRNA to suppress RIG-I activation and the host antiviral response. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.25.559316. [PMID: 38328244 PMCID: PMC10849492 DOI: 10.1101/2023.09.25.559316] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Influenza A and B viruses overcome the host antiviral response to cause a contagious and often severe human respiratory disease. Here, integrative structural biology and biochemistry studies on non-structural protein 1 of influenza B virus (NS1B) reveal a previously unrecognized viral mechanism for innate immune evasion. Conserved basic groups of its C-terminal domain (NS1B-CTD) bind 5'triphosphorylated double-stranded RNA (5'-ppp-dsRNA), the primary pathogen-associated feature that activates the host retinoic acid-inducible gene I protein (RIG-I) to initiate interferon synthesis and the cellular antiviral response. Like RIG-I, NS1B-CTD preferentially binds blunt-end 5'ppp-dsRNA. NS1B-CTD also competes with RIG-I for binding 5'ppp-dsRNA, and thus suppresses activation of RIG-I's ATPase activity. Although the NS1B N-terminal domain also binds dsRNA, it utilizes a different binding mode and lacks 5'ppp-dsRNA end preferences. In cells infected with wild-type influenza B virus, RIG-I activation is inhibited. In contrast, RIG-I activation and the resulting phosphorylation of transcription factor IRF-3 are not inhibited in cells infected with a mutant virus encoding NS1B with a R208A substitution it its CTD that eliminates its 5'ppp-dsRNA binding activity. These results reveal a novel mechanism in which NS1B binds 5'ppp-dsRNA to inhibit the RIG-I antiviral response during influenza B virus infection, and open the door to new avenues for antiviral drug discovery.
Collapse
|
2
|
Sifting Through the Noise: A Computational Pipeline for Accurate Prioritization of Protein-Protein Binding Candidates in High-Throughput Protein Libraries. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.01.20.576374. [PMID: 38328039 PMCID: PMC10849530 DOI: 10.1101/2024.01.20.576374] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/09/2024]
Abstract
Identifying the interactome for a protein of interest is challenging due to the large number of possible binders. High-throughput experimental approaches narrow down possible binding partners, but often include false positives. Furthermore, they provide no information about what the binding region is (e.g. the binding epitope). We introduce a novel computational pipeline based on an AlphaFold2 (AF) Competition Assay (AF-CBA) to identify proteins that bind a target of interest from a pull-down experiment, along with the binding epitope. Our focus is on proteins that bind the Extraterminal (ET) domain of Bromo and Extraterminal domain (BET) proteins, but we also introduce nine additional systems to show transferability to other peptide-protein systems. We describe a series of limitations to the methodology based on intrinsic deficiencies to AF and AF-CBA, to help users identify scenarios where the approach will be most useful. Given the speed and accuracy of the methodology, we expect it to be generally applicable to facilitate target selection for experimental verification starting from high-throughput protein libraries.
Collapse
|
3
|
Structure Determination of Challenging Protein-Peptide Complexes Combining NMR Chemical Shift Data and Molecular Dynamics Simulations. J Chem Inf Model 2023; 63:2058-2072. [PMID: 36988562 PMCID: PMC10150588 DOI: 10.1021/acs.jcim.2c01595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/30/2023]
Abstract
Intrinsically disordered regions of proteins often mediate important protein-protein interactions. However, the folding-upon-binding nature of many polypeptide-protein interactions limits the ability of modeling tools to predict the three-dimensional structures of such complexes. To address this problem, we have taken a tandem approach combining NMR chemical shift data and molecular simulations to determine the structures of peptide-protein complexes. Here, we use the MELD (Modeling Employing Limited Data) technique applied to polypeptide complexes formed with the extraterminal domain (ET) of bromo and extraterminal domain (BET) proteins, which exhibit a high degree of binding plasticity. This system is particularly challenging as the binding process includes allosteric changes across the ET receptor upon binding, and the polypeptide binding partners can adopt different conformations (e.g., helices and hairpins) in the complex. In a blind study, the new approach successfully modeled bound-state conformations and binding poses, using only protein receptor backbone chemical shift data, in excellent agreement with experimentally determined structures for moderately tight (Kd ∼100 nM) binders. The hybrid MELD + NMR approach required additional peptide ligand chemical shift data for weaker (Kd ∼250 μM) peptide binding partners. AlphaFold also successfully predicts the structures of some of these peptide-protein complexes. However, whereas AlphaFold can provide qualitative peptide rankings, MELD can directly estimate relative binding affinities. The hybrid MELD + NMR approach offers a powerful new tool for structural analysis of protein-polypeptide complexes involving disorder-to-order transitions upon complex formation, which are not successfully modeled with most other complex prediction methods, providing both the 3D structures of peptide-protein complexes and their relative binding affinities.
Collapse
|
4
|
SpecDB: A relational database for archiving biomolecular NMR spectral data. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2022; 342:107268. [PMID: 35930941 PMCID: PMC9922030 DOI: 10.1016/j.jmr.2022.107268] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/25/2022] [Revised: 06/16/2022] [Accepted: 07/06/2022] [Indexed: 05/11/2023]
Abstract
NMR is a valuable experimental tool in the structural biologist's toolkit to elucidate the structures, functions, and motions of biomolecules. The progress of machine learning, particularly in structural biology, reveals the critical importance of large, diverse, and reliable datasets in developing new methods and understanding in structural biology and science more broadly. Biomolecular NMR research groups produce large amounts of data, and there is renewed interest in organizing these data to train new, sophisticated machine learning architectures and to improve biomolecular NMR analysis pipelines. The foundational data type in NMR is the free-induction decay (FID). There are opportunities to build sophisticated machine learning methods to tackle long-standing problems in NMR data processing, resonance assignment, dynamics analysis, and structure determination using NMR FIDs. Our goal in this study is to provide a lightweight, broadly available tool for archiving FID data as it is generated at the spectrometer, and grow a new resource of FID data and associated metadata. This study presents a relational schema for storing and organizing the metadata items that describe an NMR sample and FID data, which we call Spectral Database (SpecDB). SpecDB is implemented in SQLite and includes a Python software library providing a command-line application to create, organize, query, backup, share, and maintain the database. This set of software tools and database schema allow users to store, organize, share, and learn from NMR time domain data. SpecDB is freely available under an open source license at https://github.rpi.edu/RPIBioinformatics/SpecDB.
Collapse
|
5
|
Assessment of prediction methods for protein structures determined by NMR in CASP14: Impact of AlphaFold2. Proteins 2021; 89:1959-1976. [PMID: 34559429 PMCID: PMC8616817 DOI: 10.1002/prot.26246] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2021] [Revised: 09/09/2021] [Accepted: 09/14/2021] [Indexed: 12/26/2022]
Abstract
NMR studies can provide unique information about protein conformations in solution. In CASP14, three reference structures provided by solution NMR methods were available (T1027, T1029, and T1055), as well as a fourth data set of NMR‐derived contacts for an integral membrane protein (T1088). For the three targets with NMR‐based structures, the best prediction results ranged from very good (GDT_TS = 0.90, for T1055) to poor (GDT_TS = 0.47, for T1029). We explored the basis of these results by comparing all CASP14 prediction models against experimental NMR data. For T1027, NMR data reveal extensive internal dynamics, presenting a unique challenge for protein structure prediction methods. The analysis of T1029 motivated exploration of a novel method of “inverse structure determination,” in which an AlphaFold2 model was used to guide NMR data analysis. NMR data provided to CASP predictor groups for target T1088, a 238‐residue integral membrane porin, was also used to assess several NMR‐assisted prediction methods. Most groups involved in this exercise generated similar beta‐barrel models, with good agreement with the experimental data. However, as was also observed in CASP13, some pure prediction groups that did not use any NMR data generated models for T1088 that better fit the NMR data than the models generated using these experimental data. These results demonstrate the remarkable power of modern methods to predict structures of proteins with accuracies rivaling solution NMR structures, and that it is now possible to reliably use prediction models to guide and complement experimental NMR data analysis.
Collapse
|
6
|
A common binding motif in the ET domain of BRD3 forms polymorphic structural interfaces with host and viral proteins. Structure 2021; 29:886-898.e6. [PMID: 33592170 DOI: 10.1016/j.str.2021.01.010] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2020] [Revised: 12/22/2020] [Accepted: 01/21/2021] [Indexed: 12/23/2022]
Abstract
The extraterminal (ET) domain of BRD3 is conserved among BET proteins (BRD2, BRD3, BRD4), interacting with multiple host and viral protein-protein networks. Solution NMR structures of complexes formed between the BRD3 ET domain and either the 79-residue murine leukemia virus integrase (IN) C-terminal domain (IN329-408) or its 22-residue IN tail peptide (IN386-407) alone reveal similar intermolecular three-stranded β-sheet formations. 15N relaxation studies reveal a 10-residue linker region (IN379-388) tethering the SH3 domain (IN329-378) to the ET-binding motif (IN389-405):ET complex. This linker has restricted flexibility, affecting its potential range of orientations in the IN:nucleosome complex. The complex of the ET-binding peptide of the host NSD3 protein (NSD3148-184) and the BRD3 ET domain includes a similar three-stranded β-sheet interaction, but the orientation of the β hairpin is flipped compared with the two IN:ET complexes. These studies expand our understanding of molecular recognition polymorphism in complexes of ET-binding motifs with viral and host proteins.
Collapse
|
7
|
Protein structure prediction assisted with sparse NMR data in CASP13. Proteins 2019; 87:1315-1332. [PMID: 31603581 DOI: 10.1002/prot.25837] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2019] [Revised: 09/26/2019] [Accepted: 09/27/2019] [Indexed: 01/05/2023]
Abstract
CASP13 has investigated the impact of sparse NMR data on the accuracy of protein structure prediction. NOESY and 15 N-1 H residual dipolar coupling data, typical of that obtained for 15 N,13 C-enriched, perdeuterated proteins up to about 40 kDa, were simulated for 11 CASP13 targets ranging in size from 80 to 326 residues. For several targets, two prediction groups generated models that are more accurate than those produced using baseline methods. Real NMR data collected for a de novo designed protein were also provided to predictors, including one data set in which only backbone resonance assignments were available. Some NMR-assisted prediction groups also did very well with these data. CASP13 also assessed whether incorporation of sparse NMR data improves the accuracy of protein structure prediction relative to nonassisted regular methods. In most cases, incorporation of sparse, noisy NMR data results in models with higher accuracy. The best NMR-assisted models were also compared with the best regular predictions of any CASP13 group for the same target. For six of 13 targets, the most accurate model provided by any NMR-assisted prediction group was more accurate than the most accurate model provided by any regular prediction group; however, for the remaining seven targets, one or more regular prediction method provided a more accurate model than even the best NMR-assisted model. These results suggest a novel approach for protein structure determination, in which advanced prediction methods are first used to generate structural models, and sparse NMR data is then used to validate and/or refine these models.
Collapse
|
8
|
Structural Basis by Which the N-Terminal Polypeptide Segment of Rhizopus chinensis Lipase Regulates Its Substrate Binding Affinity. Biochemistry 2019; 58:3943-3954. [PMID: 31436959 PMCID: PMC7195698 DOI: 10.1021/acs.biochem.9b00462] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Members of an important group of industrial enzymes, Rhizopus lipases, exhibit valuable hydrolytic features that underlie their biological functions. Particularly important is their N-terminal polypeptide segment (NTPS), which is required for secretion and proper folding but is removed in the process of enzyme maturation. A second common feature of this class of lipases is the α-helical "lid", which regulates the accessibility of the substrate to the enzyme active site. Some Rhizopus lipases also exhibit "interfacial activation" by micelle and/or aggregate surfaces. While it has long been recognized that the NTPS is critical for function, its dynamic features have frustrated efforts to characterize its structure by X-ray crystallography. Here, we combine nuclear magnetic resonance spectroscopy and X-ray crystallography to determine the structure and dynamics of Rhizopus chinensis lipase (RCL) with its 27-residue NTPS prosequence (r27RCL). Both r27RCL and the truncated mature form of RCL (mRCL) exhibit biphasic interfacial activation kinetics with p-nitrophenyl butyrate (pNPB). r27RCL exhibits a substrate binding affinity significantly lower than that of mRCL due to stabilization of the closed lid conformation by the NTPS. In contrast to previous predictions, the NTPS does not enhance lipase activity by increasing surface hydrophobicity but rather inhibits activity by forming conserved interactions with both the closed lid and the core protein structure. Single-site mutations and kinetic studies were used to confirm that the NTPS serves as internal competitive inhibitor and to develop a model of the associated process of interfacial activation. These structure-function studies provide the basis for engineering RCL lipases with enhanced catalytic activities.
Collapse
|
9
|
The copBL operon protects Staphylococcus aureus from copper toxicity: CopL is an extracellular membrane-associated copper-binding protein. J Biol Chem 2019; 294:4027-4044. [PMID: 30655293 PMCID: PMC6422080 DOI: 10.1074/jbc.ra118.004723] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2018] [Revised: 01/08/2019] [Indexed: 12/22/2022] Open
Abstract
As complications associated with antibiotic resistance have intensified, copper (Cu) is attracting attention as an antimicrobial agent. Recent studies have shown that copper surfaces decrease microbial burden, and host macrophages use Cu to increase bacterial killing. Not surprisingly, microbes have evolved mechanisms to tightly control intracellular Cu pools and protect against Cu toxicity. Here, we identified two genes (copB and copL) encoded within the Staphylococcus aureus arginine-catabolic mobile element (ACME) that we hypothesized function in Cu homeostasis. Supporting this hypothesis, mutational inactivation of copB or copL increased copper sensitivity. We found that copBL are co-transcribed and that their transcription is increased during copper stress and in a strain in which csoR, encoding a Cu-responsive transcriptional repressor, was mutated. Moreover, copB displayed genetic synergy with copA, suggesting that CopB functions in Cu export. We further observed that CopL functions independently of CopB or CopA in Cu toxicity protection and that CopL from the S. aureus clone USA300 is a membrane-bound and surface-exposed lipoprotein that binds up to four Cu+ ions. Solution NMR structures of the homologous Bacillus subtilis CopL, together with phylogenetic analysis and chemical-shift perturbation experiments, identified conserved residues potentially involved in Cu+ coordination. The solution NMR structure also revealed a novel Cu-binding architecture. Of note, a CopL variant with defective Cu+ binding did not protect against Cu toxicity in vivo Taken together, these findings indicate that the ACME-encoded CopB and CopL proteins are additional factors utilized by the highly successful S. aureus USA300 clone to suppress copper toxicity.
Collapse
|
10
|
Antiparallel Coiled-Coil Interactions Mediate the Homodimerization of the DNA Damage-Repair Protein PALB2. Biochemistry 2018; 57:6581-6591. [PMID: 30289697 DOI: 10.1021/acs.biochem.8b00789] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
Deficits in DNA damage-repair pathways are the root cause of several human cancers. In mammalian cells, DNA double-strand break repair is carried out by multiple mechanisms, including homologous recombination (HR). The partner and localizer of BRCA2 (PALB2), which is an essential factor for HR, binds to the breast cancer susceptibility 1 (BRCA1) protein at DNA double-strand breaks. At the break site, PALB2 also associates with the breast cancer susceptibility 2 (BRCA2) protein to form a multiprotein complex that facilitates HR. The BRCA1-PALB2 interaction is mediated by association of predicted helical coiled-coil regions in both proteins. PALB2 can also homodimerize through the formation of a coiled coil by the self-association of helical elements at the N-terminus of the PALB2 protein, and this homodimerization has been proposed to regulate the efficiency of HR. We have produced a segment of PALB2, designated PALB2cc (PALB2 coiled coil segment) that forms α-helical structures, which assemble into stable homodimers. PALB2cc also forms heterodimers with a helical segment of BRCA1, called BRCA1cc (BRCA1 coiled coil segment). The three-dimensional structure of the homodimer formed by PALB2cc was determined by solution NMR spectroscopy. This PALB2cc homodimer is a classical antiparallel coiled-coil leucine zipper. NMR chemical-shift perturbation studies were used to study dimer formation for both the PALB2cc homodimer and the PALB2cc/BRCA1cc heterodimer. The mutation of residue Leu24 of PALB2cc significantly reduces its homodimer stability, but has a more modest effect on the stability of the heterodimer formed between PALB2cc and BRCA1cc. We show that mutation of Leu24 leads to genomic instability and reduced cell viability after treatment with agents that induce DNA double-strand breaks. These studies may allow the identification of distinct mutations of PALB2cc that selectively disrupt homodimeric versus heterodimeric interactions, and reveal the specific role of PALB2cc homodimerization in HR.
Collapse
|
11
|
Minimal Heterochiral de Novo Designed 4Fe-4S Binding Peptide Capable of Robust Electron Transfer. J Am Chem Soc 2018; 140:11210-11213. [PMID: 30141918 DOI: 10.1021/jacs.8b07553] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Ambidoxin is a designed, minimal dodecapeptide consisting of alternating L and D amino acids that binds a 4Fe-4S cluster through ligand-metal interactions and an extensive network of second-shell hydrogen bonds. The peptide can withstand hundreds of oxidation-reduction cycles at room temperature. Ambidoxin suggests how simple, prebiotic peptides may have achieved robust redox catalysis on the early Earth.
Collapse
|
12
|
Backbone and Ile-δ1, Leu, Val methyl 1H, 15N, and 13C, chemical shift assignments for Rhizopus chinensis lipase. BIOMOLECULAR NMR ASSIGNMENTS 2018; 12:63-68. [PMID: 28929427 DOI: 10.1007/s12104-017-9781-4] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2017] [Accepted: 09/14/2017] [Indexed: 06/07/2023]
Abstract
Lipase r27RCL is a 296-residue, 33 kDa monomeric enzyme with high ester hydrolysis activity, which has significant applications in the baking, paper and leather industries. The lipase gene proRCL from Rhizopus microsporus var. chinensis (also Rhizopus chinensis) CCTCC M201021 was cloned as a fusion construct C-terminal to a maltose-binding protein (MBP) tag, and expressed as MBP-proRCL in an Escherichia coli BL21 trxB (DE3) expression system with uniform 2H,13C,15N-enrichment and Ile-δ1, Leu, and Val 13CH3 methyl labeling. The fusion protein was hydrolyzed by Kex2 protease at the recognition site Lys-Arg between residues -29 and -28 of the prosequence, producing the enzyme form called r27RCL. Here we report extensive backbone 1H, 15N, and 13C, as well as Ile-δ1, Leu, and Val side chain methyl, NMR resonance assignments for r27RCL.
Collapse
|
13
|
Effect of mitochondrial uncouplers niclosamide ethanolamine (NEN) and oxyclozanide on hepatic metastasis of colon cancer. Cell Death Dis 2018; 9:215. [PMID: 29440715 PMCID: PMC5833462 DOI: 10.1038/s41419-017-0092-6] [Citation(s) in RCA: 52] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2017] [Revised: 08/25/2017] [Accepted: 09/20/2017] [Indexed: 02/06/2023]
Abstract
Metabolism of cancer cells is characterized by aerobic glycolysis, or the Warburg effect. Aerobic glycolysis reduces pyruvate flux into mitochondria, preventing a complete oxidation of glucose and shunting glucose to anabolic pathways essential for cell proliferation. Here we tested a new strategy, mitochondrial uncoupling, for its potential of antagonizing the anabolic effect of aerobic glycolysis and for its potential anticancer activities. Mitochondrial uncoupling is a process that facilitates proton influx across the mitochondrial inner membrane without generating ATP, stimulating a futile cycle of acetyl- CoA oxidation. We tested two safe mitochondrial uncouplers, NEN (niclosamide ethanolamine) and oxyclozanide, on their metabolic effects and anti-cancer activities. We used metabolomic NMR to examine the effect of mitochondrial uncoupling on glucose metabolism in colon cancer MC38 cells. We further tested the anti-cancer effect of NEN and oxyclozanide in cultured cell models, APCmin/+ mouse model, and a metastatic colon cancer mouse model. Using a metabolomic NMR approach, we demonstrated that mitochondrial uncoupling promotes pyruvate influx to mitochondria and reduces various anabolic pathway activities. Moreover, mitochondrial uncoupling inhibits cell proliferation and reduces clonogenicity of cultured colon cancer cells. Furthermore, oral treatment with mitochondrial uncouplers reduces intestinal polyp formation in APCmin/+ mice, and diminishes hepatic metastasis of colon cancer cells transplanted intrasplenically. Our data highlight a unique approach for targeting cancer cell metabolism for cancer prevention and treatment, identified two prototype compounds, and shed light on the anti-cancer mechanism of niclosamide.
Collapse
|
14
|
Multiple helical conformations of the helix-turn-helix region revealed by NOE-restrained MD simulations of tryptophan aporepressor, TrpR. Proteins 2017; 85:731-740. [PMID: 28120439 DOI: 10.1002/prot.25252] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2016] [Revised: 12/20/2016] [Accepted: 01/04/2017] [Indexed: 11/11/2022]
Abstract
The nature of flexibility in the helix-turn-helix region of E. coli trp aporepressor has been unexplained for many years. The original ensemble of nuclear magnetic resonance (NMR structures showed apparent disorder, but chemical shift and relaxation measurements indicated a helical region. Nuclear Overhauser effect (NOE) data for a temperature-sensitive mutant showed more helical character in its helix-turn-helix region, but nevertheless also led to an apparently disordered ensemble. However, conventional NMR structure determination methods require all structures in the ensemble to be consistent with every NOE simultaneously. This work uses an alternative approach in which some structures of the ensemble are allowed to violate some NOEs to permit modeling of multiple conformational states that are in dynamic equilibrium. Newly measured NOE data for wild-type aporepressor are used as time-averaged distance restraints in molecular dynamics simulations to generate an ensemble of helical conformations that is more consistent with the observed NMR data than the apparent disorder in the previously reported NMR structures. The results indicate the presence of alternating helical conformations that provide a better explanation for the flexibility of the helix-turn-helix region of trp aporepressor. Structures representing these conformations have been deposited with PDB ID: 5TM0. Proteins 2017; 85:731-740. © 2016 Wiley Periodicals, Inc.
Collapse
|
15
|
Principles for designing proteins with cavities formed by curved β sheets. Science 2017; 355:201-206. [PMID: 28082595 PMCID: PMC5588894 DOI: 10.1126/science.aah7389] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 12/02/2016] [Indexed: 12/23/2022]
Abstract
Active sites and ligand-binding cavities in native proteins are often formed by curved β sheets, and the ability to control β-sheet curvature would allow design of binding proteins with cavities customized to specific ligands. Toward this end, we investigated the mechanisms controlling β-sheet curvature by studying the geometry of β sheets in naturally occurring protein structures and folding simulations. The principles emerging from this analysis were used to design, de novo, a series of proteins with curved β sheets topped with α helices. Nuclear magnetic resonance and crystal structures of the designs closely match the computational models, showing that β-sheet curvature can be controlled with atomic-level accuracy. Our approach enables the design of proteins with cavities and provides a route to custom design ligand-binding and catalytic sites.
Collapse
|
16
|
Efficient production of (2)H, (13)C, (15)N-enriched industrial enzyme Rhizopus chinensis lipase with native disulfide bonds. Microb Cell Fact 2016; 15:123. [PMID: 27411547 PMCID: PMC4944435 DOI: 10.1186/s12934-016-0522-7] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2016] [Accepted: 07/03/2016] [Indexed: 01/29/2023] Open
Abstract
BACKGROUND In order to use most modern methods of NMR spectroscopy to study protein structure and dynamics, isotope-enriched protein samples are essential. Especially for larger proteins (>20 kDa), perdeuterated and Ile (δ1), Leu, and Val methyl-protonated protein samples are required for suppressing nuclear relaxation to provide improved spectral quality, allowing key backbone and side chain resonance assignments needed for protein structure and dynamics studies. Escherichia coli and Pichia pastoris are two of the most popular expression systems for producing isotope-enriched, recombinant protein samples for NMR investigations. The P. pastoris system can be used to produce (13)C, (15)N-enriched and even (2)H,(13)C, (15)N-enriched protein samples, but efficient methods for producing perdeuterated proteins with Ile (δ1), Leu and Val methyl-protonated groups in P. pastoris are still unavailable. Glycosylation heterogeneity also provides challenges to NMR studies. E. coli expression systems are efficient for overexpressing perdeuterated and Ile (δ1), Leu, Val methyl-protonated protein samples, but are generally not successful for producing secreted eukaryotic proteins with native disulfide bonds. RESULTS The 33 kDa protein-Rhizopus chinensis lipase (RCL), an important industrial enzyme, was produced using both P. pastoris and E. coli BL21 trxB (DE3) systems. Samples produced from both systems exhibit identical native disulfide bond formation and similar 2D NMR spectra, indicating similar native protein folding. The yield of (13)C, (15)N-enriched r27RCL produced using P. pastoris was 1.7 times higher that obtained using E. coli, while the isotope-labeling efficiency was ~15 % lower. Protein samples produced in P. pastoris exhibit O-glycosylation, while the protein samples produced in E. coli were not glycosylated. The specific activity of r27RCL from P. pastoris was ~1.4 times higher than that produced in E. coli. CONCLUSIONS These data demonstrate efficient production of (2)H, (13)C, (15)N-enriched, Ile (δ1), Leu, Val methyl-protonated eukaryotic protein r27RCL with native disulfides using the E. coli BL21 trxB (DE3) system. For certain NMR studies, particularly efforts for resonance assignments, structural studies, and dynamic studies, E. coli provides a cost-effective system for producing isotope-enriched RCL. It should also be potential for producing other (2)H, (13)C, (15)N-enriched, Ile (δ1), Leu, Val methyl-protonated eukaryotic proteins with native disulfide bonds.
Collapse
|
17
|
A community resource of experimental data for NMR / X-ray crystal structure pairs. Protein Sci 2015; 25:30-45. [PMID: 26293815 DOI: 10.1002/pro.2774] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Accepted: 08/17/2015] [Indexed: 12/11/2022]
Abstract
We have developed an online NMR / X-ray Structure Pair Data Repository. The NIGMS Protein Structure Initiative (PSI) has provided many valuable reagents, 3D structures, and technologies for structural biology. The Northeast Structural Genomics Consortium was one of several PSI centers. NESG used both X-ray crystallography and NMR spectroscopy for protein structure determination. A key goal of the PSI was to provide experimental structures for at least one representative of each of hundreds of targeted protein domain families. In some cases, structures for identical (or nearly identical) constructs were determined by both NMR and X-ray crystallography. NMR spectroscopy and X-ray diffraction data for 41 of these "NMR / X-ray" structure pairs determined using conventional triple-resonance NMR methods with extensive sidechain resonance assignments have been organized in an online NMR / X-ray Structure Pair Data Repository. In addition, several NMR data sets for perdeuterated, methyl-protonated protein samples are included in this repository. As an example of the utility of this repository, these data were used to revisit questions about the precision and accuracy of protein NMR structures first outlined by Levy and coworkers several years ago (Andrec et al., Proteins 2007;69:449-465). These results demonstrate that the agreement between NMR and X-ray crystal structures is improved using modern methods of protein NMR spectroscopy. The NMR / X-ray Structure Pair Data Repository will provide a valuable resource for new computational NMR methods development.
Collapse
|
18
|
Aspirin's Active Metabolite Salicylic Acid Targets High Mobility Group Box 1 to Modulate Inflammatory Responses. Mol Med 2015; 21:526-35. [PMID: 26101955 DOI: 10.2119/molmed.2015.00148] [Citation(s) in RCA: 72] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2015] [Accepted: 06/18/2015] [Indexed: 02/02/2023] Open
Abstract
Salicylic acid (SA) and its derivatives have been used for millennia to reduce pain, fever and inflammation. In addition, prophylactic use of acetylsalicylic acid, commonly known as aspirin, reduces the risk of heart attack, stroke and certain cancers. Because aspirin is rapidly de-acetylated by esterases in human plasma, much of aspirin's bioactivity can be attributed to its primary metabolite, SA. Here we demonstrate that human high mobility group box 1 (HMGB1) is a novel SA-binding protein. SA-binding sites on HMGB1 were identified in the HMG-box domains by nuclear magnetic resonance (NMR) spectroscopic studies and confirmed by mutational analysis. Extracellular HMGB1 is a damage-associated molecular pattern molecule (DAMP), with multiple redox states. SA suppresses both the chemoattractant activity of fully reduced HMGB1 and the increased expression of proinflammatory cytokine genes and cyclooxygenase 2 (COX-2) induced by disulfide HMGB1. Natural and synthetic SA derivatives with greater potency for inhibition of HMGB1 were identified, providing proof-of-concept that new molecules with high efficacy against sterile inflammation are attainable. An HMGB1 protein mutated in one of the SA-binding sites identified by NMR chemical shift perturbation studies retained chemoattractant activity, but lost binding of and inhibition by SA and its derivatives, thereby firmly establishing that SA binding to HMGB1 directly suppresses its proinflammatory activities. Identification of HMGB1 as a pharmacological target of SA/aspirin provides new insights into the mechanisms of action of one of the world's longest and most used natural and synthetic drugs. It may also provide an explanation for the protective effects of low-dose aspirin usage.
Collapse
|
19
|
Altering murine leukemia virus integration through disruption of the integrase and BET protein family interaction. Nucleic Acids Res 2014; 42:5917-28. [PMID: 24623816 PMCID: PMC4027182 DOI: 10.1093/nar/gku175] [Citation(s) in RCA: 53] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2013] [Revised: 02/12/2014] [Accepted: 02/12/2014] [Indexed: 12/17/2022] Open
Abstract
We report alterations to the murine leukemia virus (MLV) integrase (IN) protein that successfully result in decreasing its integration frequency at transcription start sites and CpG islands, thereby reducing the potential for insertional activation. The host bromo and extraterminal (BET) proteins Brd2, 3 and 4 interact with the MLV IN protein primarily through the BET protein ET domain. Using solution NMR, protein interaction studies, and next generation sequencing, we show that the C-terminal tail peptide region of MLV IN is important for the interaction with BET proteins and that disruption of this interaction through truncation mutations affects the global targeting profile of MLV vectors. The use of the unstructured tails of gammaretroviral INs to direct association with complexes at active promoters parallels that used by histones and RNA polymerase II. Viruses bearing MLV IN C-terminal truncations can provide new avenues to improve the safety profile of gammaretroviral vectors for human gene therapy.
Collapse
|
20
|
(19)F NMR reveals multiple conformations at the dimer interface of the nonstructural protein 1 effector domain from influenza A virus. Structure 2014; 22:515-525. [PMID: 24582435 DOI: 10.1016/j.str.2014.01.010] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/04/2013] [Revised: 01/17/2014] [Accepted: 01/28/2014] [Indexed: 11/19/2022]
Abstract
Nonstructural protein 1 of influenza A virus (NS1A) is a conserved virulence factor comprised of an N-terminal double-stranded RNA (dsRNA)-binding domain and a multifunctional C-terminal effector domain (ED), each of which can independently form symmetric homodimers. Here we apply (19)F NMR to NS1A from influenza A/Udorn/307/1972 virus (H3N2) labeled with 5-fluorotryptophan, and we demonstrate that the (19)F signal of Trp187 is a sensitive, direct monitor of the ED helix:helix dimer interface. (19)F relaxation dispersion data reveal the presence of conformational dynamics within this functionally important protein:protein interface, whose rate is more than three orders of magnitude faster than the kinetics of ED dimerization. (19)F NMR also affords direct spectroscopic evidence that Trp187, which mediates intermolecular ED:ED interactions required for cooperative dsRNA binding, is solvent exposed in full-length NS1A at concentrations below aggregation. These results have important implications for the diverse roles of this NS1A epitope during influenza virus infection.
Collapse
|
21
|
Three structural representatives of the PF06855 protein domain family from Staphyloccocus aureus and Bacillus subtilis have SAM domain-like folds and different functions. ACTA ACUST UNITED AC 2012; 13:163-70. [PMID: 22843344 DOI: 10.1007/s10969-012-9134-6] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2012] [Accepted: 02/02/2012] [Indexed: 10/28/2022]
Abstract
Protein domain family PF06855 (DUF1250) is a family of small domains of unknown function found only in bacteria, and mostly in the order Bacillales and Lactobacillales. Here we describe the solution NMR or X-ray crystal structures of three representatives of this domain family, MW0776 and MW1311 from Staphyloccocus aureus and yozE from Bacillus subtilis. All three proteins adopt a four-helix motif similar to sterile alpha motif (SAM) domains. Phylogenetic analysis classifies MW1311 and yozE as functionally equivalent proteins of the UPF0346 family of unknown function, but excludes MW0776, which likely has a different biological function. Our structural characterization of the three domains supports this separation of function. The structures of MW0776, MW1311, and yozE constitute the first structural representatives from this protein domain family.
Collapse
|
22
|
Segmental isotope labeling of proteins for NMR structural study using a protein S tag for higher expression and solubility. JOURNAL OF BIOMOLECULAR NMR 2012; 52:303-313. [PMID: 22389115 PMCID: PMC4117381 DOI: 10.1007/s10858-012-9610-0] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/10/2011] [Accepted: 01/11/2012] [Indexed: 05/31/2023]
Abstract
A common obstacle to NMR studies of proteins is sample preparation. In many cases, proteins targeted for NMR studies are poorly expressed and/or expressed in insoluble forms. Here, we describe a novel approach to overcome these problems. In the protein S tag-intein (PSTI) technology, two tandem 92-residue N-terminal domains of protein S (PrS(2)) from Myxococcus xanthus is fused at the N-terminal end of a protein to enhance its expression and solubility. Using intein technology, the isotope-labeled PrS(2)-tag is replaced with non-isotope labeled PrS(2)-tag, silencing the NMR signals from PrS(2)-tag in isotope-filtered (1)H-detected NMR experiments. This method was applied to the E. coli ribosome binding factor A (RbfA), which aggregates and precipitates in the absence of a solubilization tag unless the C-terminal 25-residue segment is deleted (RbfAΔ25). Using the PrS(2)-tag, full-length well-behaved RbfA samples could be successfully prepared for NMR studies. PrS(2) (non-labeled)-tagged RbfA (isotope-labeled) was produced with the use of the intein approach. The well-resolved TROSY-HSQC spectrum of full-length PrS(2)-tagged RbfA superimposes with the TROSY-HSQC spectrum of RbfAΔ25, indicating that PrS(2)-tag does not affect the structure of the protein to which it is fused. Using a smaller PrS-tag, consisting of a single N-terminal domain of protein S, triple resonance experiments were performed, and most of the backbone (1)H, (15)N and (13)C resonance assignments for full-length E. coli RbfA were determined. Analysis of these chemical shift data with the Chemical Shift Index and heteronuclear (1)H-(15)N NOE measurements reveal the dynamic nature of the C-terminal segment of the full-length RbfA protein, which could not be inferred using the truncated RbfAΔ25 construct. CS-Rosetta calculations also demonstrate that the core structure of full-length RbfA is similar to that of the RbfAΔ25 construct.
Collapse
|
23
|
Backbone and Ile-δ1, Leu, Val methyl 1H, 13C and 15N NMR chemical shift assignments for human interferon-stimulated gene 15 protein. BIOMOLECULAR NMR ASSIGNMENTS 2011; 5:215-219. [PMID: 21544738 PMCID: PMC3167004 DOI: 10.1007/s12104-011-9303-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2010] [Accepted: 04/14/2011] [Indexed: 05/30/2023]
Abstract
Human interferon-stimulated gene 15 protein (ISG15), also called ubiquitin cross-reactive protein (UCRP), is the first identified ubiquitin-like protein containing two ubiquitin-like domains fused in tandem. The active form of ISG15 is conjugated to target proteins via the C-terminal glycine residue through an isopeptide bond in a manner similar to ubiquitin. The biological role of ISG15 is strongly associated with the modulation of cell immune function, and there is mounting evidence suggesting that many viral pathogens evade the host innate immune response by interfering with ISG15 conjugation to both host and viral proteins in a variety of ways. Here we report nearly complete backbone (1)H(N), (15)N, (13)C', and (13)C(α), as well as side chain (13)C(β), methyl (Ile-δ1, Leu, Val), amide (Asn, Gln), and indole N-H (Trp) NMR resonance assignments for the 157-residue human ISG15 protein. These resonance assignments provide the basis for future structural and functional solution NMR studies of the biologically important human ISG15 protein.
Collapse
|
24
|
Preparation of protein samples for NMR structure, function, and small-molecule screening studies. Methods Enzymol 2011; 493:21-60. [PMID: 21371586 DOI: 10.1016/b978-0-12-381274-2.00002-9] [Citation(s) in RCA: 76] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/03/2022]
Abstract
In this chapter, we concentrate on the production of high-quality protein samples for nuclear magnetic resonance (NMR) studies. In particular, we provide an in-depth description of recent advances in the production of NMR samples and their synergistic use with recent advancements in NMR hardware. We describe the protein production platform of the Northeast Structural Genomics Consortium and outline our high-throughput strategies for producing high-quality protein samples for NMR studies. Our strategy is based on the cloning, expression, and purification of 6×-His-tagged proteins using T7-based Escherichia coli systems and isotope enrichment in minimal media. We describe 96-well ligation-independent cloning and analytical expression systems, parallel preparative scale fermentation, and high-throughput purification protocols. The 6×-His affinity tag allows for a similar two-step purification procedure implemented in a parallel high-throughput fashion that routinely results in purity levels sufficient for NMR studies (>97% homogeneity). Using this platform, the protein open reading frames of over 17,500 different targeted proteins (or domains) have been cloned as over 28,000 constructs. Nearly 5000 of these proteins have been purified to homogeneity in tens of milligram quantities (see Summary Statistics, http://nesg.org/statistics.html), resulting in more than 950 new protein structures, including more than 400 NMR structures, deposited in the Protein Data Bank. The Northeast Structural Genomics Consortium pipeline has been effective in producing protein samples of both prokaryotic and eukaryotic origin. Although this chapter describes our entire pipeline for producing isotope-enriched protein samples, it focuses on the major updates introduced during the last 5 years (Phase 2 of the National Institute of General Medical Sciences Protein Structure Initiative). Our advanced automated and/or parallel cloning, expression, purification, and biophysical screening technologies are suitable for implementation in a large individual laboratory or by a small group of collaborating investigators for structural biology, functional proteomics, ligand screening, and structural genomics research.
Collapse
|
25
|
Solution NMR structure of Lin0431 protein from Listeria innocua reveals high structural similarity with domain II of bacterial transcription antitermination protein NusG. Proteins 2010; 78:2563-8. [PMID: 20602357 DOI: 10.1002/prot.22760] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
26
|
The high-throughput protein sample production platform of the Northeast Structural Genomics Consortium. J Struct Biol 2010; 172:21-33. [PMID: 20688167 DOI: 10.1016/j.jsb.2010.07.011] [Citation(s) in RCA: 108] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2010] [Revised: 07/24/2010] [Accepted: 07/28/2010] [Indexed: 11/15/2022]
Abstract
We describe the core Protein Production Platform of the Northeast Structural Genomics Consortium (NESG) and outline the strategies used for producing high-quality protein samples. The platform is centered on the cloning, expression and purification of 6X-His-tagged proteins using T7-based Escherichia coli systems. The 6X-His tag allows for similar purification procedures for most targets and implementation of high-throughput (HTP) parallel methods. In most cases, the 6X-His-tagged proteins are sufficiently purified (>97% homogeneity) using a HTP two-step purification protocol for most structural studies. Using this platform, the open reading frames of over 16,000 different targeted proteins (or domains) have been cloned as>26,000 constructs. Over the past 10 years, more than 16,000 of these expressed protein, and more than 4400 proteins (or domains) have been purified to homogeneity in tens of milligram quantities (see Summary Statistics, http://nesg.org/statistics.html). Using these samples, the NESG has deposited more than 900 new protein structures to the Protein Data Bank (PDB). The methods described here are effective in producing eukaryotic and prokaryotic protein samples in E. coli. This paper summarizes some of the updates made to the protein production pipeline in the last 5 years, corresponding to phase 2 of the NIGMS Protein Structure Initiative (PSI-2) project. The NESG Protein Production Platform is suitable for implementation in a large individual laboratory or by a small group of collaborating investigators. These advanced automated and/or parallel cloning, expression, purification, and biophysical screening technologies are of broad value to the structural biology, functional proteomics, and structural genomics communities.
Collapse
|
27
|
Engineering of a wheat germ expression system to provide compatibility with a high throughput pET-based cloning platform. ACTA ACUST UNITED AC 2010; 11:201-9. [PMID: 20574660 PMCID: PMC2921493 DOI: 10.1007/s10969-010-9093-8] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Accepted: 06/11/2010] [Indexed: 11/01/2022]
Abstract
Wheat germ cell-free methods provide an important approach for the production of eukaryotic proteins. We have developed a protein expression vector for the TNT((R)) SP6 High-Yield Wheat Germ Cell-Free (TNT WGCF) expression system (Promega) that is also compatible with our T7-based Escherichia coli intracellular expression vector pET15_NESG. This allows cloning of the same PCR product into either one of several pET_NESG vectors and this modified WGCF vector (pWGHisAmp) by In-Fusion LIC cloning (Zhu et al. in Biotechniques 43:354-359, 2007). Integration of these two vector systems allowed us to explore the efficacy of the TNT WGCF system by comparing the expression and solubility characteristics of 59 human protein constructs in both WGCF and pET15_NESG E. coli intracellular expression. While only 30% of these human proteins could be produced in soluble form using the pET15_NESG based system, some 70% could be produced in soluble form using the TNT WGCF system. This high success rate underscores the importance of eukaryotic expression host systems like the TNT WGCF system for eukaryotic protein production in a structural genomics sample production pipeline. To further demonstrate the value of this WGCF system in producing protein suitable for structural studies, we scaled up, purified, and analyzed by 2D NMR two (15)N-, (13)C-enriched human proteins. The results of this study indicate that the TNT WGCF system is a successful salvage pathway for producing samples of difficult-to-express small human proteins for NMR studies, providing an important complementary pathway for eukaryotic sample production in the NESG NMR structure production pipeline.
Collapse
|
28
|
A microscale protein NMR sample screening pipeline. JOURNAL OF BIOMOLECULAR NMR 2010; 46:11-22. [PMID: 19915800 PMCID: PMC2797623 DOI: 10.1007/s10858-009-9386-z] [Citation(s) in RCA: 75] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2009] [Accepted: 10/14/2009] [Indexed: 05/14/2023]
Abstract
As part of efforts to develop improved methods for NMR protein sample preparation and structure determination, the Northeast Structural Genomics Consortium (NESG) has implemented an NMR screening pipeline for protein target selection, construct optimization, and buffer optimization, incorporating efficient microscale NMR screening of proteins using a micro-cryoprobe. The process is feasible because the newest generation probe requires only small amounts of protein, typically 30-200 microg in 8-35 microl volume. Extensive automation has been made possible by the combination of database tools, mechanization of key process steps, and the use of a micro-cryoprobe that gives excellent data while requiring little optimization and manual setup. In this perspective, we describe the overall process used by the NESG for screening NMR samples as part of a sample optimization process, assessing optimal construct design and solution conditions, as well as for determining protein rotational correlation times in order to assess protein oligomerization states. Database infrastructure has been developed to allow for flexible implementation of new screening protocols and harvesting of the resulting output. The NESG micro NMR screening pipeline has also been used for detergent screening of membrane proteins. Descriptions of the individual steps in the NESG NMR sample design, production, and screening pipeline are presented in the format of a standard operating procedure.
Collapse
|
29
|
Construct optimization for protein NMR structure analysis using amide hydrogen/deuterium exchange mass spectrometry. Proteins 2009; 76:882-94. [PMID: 19306341 PMCID: PMC2739808 DOI: 10.1002/prot.22394] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Disordered or unstructured regions of proteins, while often very important biologically, can pose significant challenges for resonance assignment and three-dimensional structure determination of the ordered regions of proteins by NMR methods. In this article, we demonstrate the application of (1)H/(2)H exchange mass spectrometry (DXMS) for the rapid identification of disordered segments of proteins and design of protein constructs that are more suitable for structural analysis by NMR. In this benchmark study, DXMS is applied to five NMR protein targets chosen from the Northeast Structural Genomics project. These data were then used to design optimized constructs for three partially disordered proteins. Truncated proteins obtained by deletion of disordered N- and C-terminal tails were evaluated using (1)H-(15)N HSQC and (1)H-(15)N heteronuclear NOE NMR experiments to assess their structural integrity. These constructs provide significantly improved NMR spectra, with minimal structural perturbations to the ordered regions of the protein structure. As a representative example, we compare the solution structures of the full length and DXMS-based truncated construct for a 77-residue partially disordered DUF896 family protein YnzC from Bacillus subtilis, where deletion of the disordered residues (ca. 40% of the protein) does not affect the native structure. In addition, we demonstrate that throughput of the DXMS process can be increased by analyzing mixtures of up to four proteins without reducing the sequence coverage for each protein. Our results demonstrate that DXMS can serve as a central component of a process for optimizing protein constructs for NMR structure determination.
Collapse
|
30
|
|
31
|
Solution NMR structure of Escherichia coli ytfP expands the structural coverage of the UPF0131 protein domain family. Proteins 2007; 68:789-95. [PMID: 17523190 DOI: 10.1002/prot.21450] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
32
|
Conserved surface features form the double-stranded RNA binding site of non-structural protein 1 (NS1) from influenza A and B viruses. J Biol Chem 2007; 282:20584-92. [PMID: 17475623 DOI: 10.1074/jbc.m611619200] [Citation(s) in RCA: 75] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Influenza A viruses cause a highly contagious respiratory disease in humans and are responsible for periodic widespread epidemics with high mortality rates. The influenza A virus NS1 protein (NS1A) plays a key role in countering host antiviral defense and in virulence. The 73-residue N-terminal domain of NS1A (NS1A-(1-73)) forms a symmetric homodimer with a unique six-helical chain fold. It binds canonical A-form double-stranded RNA (dsRNA). Mutational inactivation of this dsRNA binding activity of NS1A highly attenuates virus replication. Here, we have characterized the unique structural features of the dsRNA binding surface of NS1A-(1-73) using NMR methods and describe the 2.1-A x-ray crystal structure of the corresponding dsRNA binding domain from human influenza B virus NS1B-(15-93). These results identify conserved dsRNA binding surfaces on both NS1A-(1-73) and NS1B-(15-93) that are very different from those indicated in earlier "working models" of the complex between dsRNA and NS1A-(1-73). The combined NMR and crystallographic data reveal highly conserved surface tracks of basic and hydrophilic residues that interact with dsRNA. These tracks are structurally complementary to the polyphosphate backbone conformation of A-form dsRNA and run at an approximately 45 degrees angle relative to the axes of helices alpha2/alpha2'. At the center of this dsRNA binding epitope, and common to NS1 proteins from influenza A and B viruses, is a deep pocket that includes both hydrophilic and hydrophobic amino acids. This pocket provides a target on the surface of the NS1 protein that is potentially suitable for the development of antiviral drugs targeting both influenza A and B viruses.
Collapse
MESH Headings
- Crystallography, X-Ray
- Dimerization
- Humans
- Influenza A virus/chemistry
- Influenza A virus/metabolism
- Influenza A virus/pathogenicity
- Influenza B virus/chemistry
- Influenza B virus/metabolism
- Influenza B virus/pathogenicity
- Influenza, Human/metabolism
- Influenza, Human/mortality
- Nuclear Magnetic Resonance, Biomolecular
- Nucleic Acid Conformation
- Protein Binding
- Protein Folding
- Protein Structure, Quaternary
- Protein Structure, Secondary
- RNA, Double-Stranded/chemistry
- RNA, Double-Stranded/metabolism
- RNA, Viral/chemistry
- RNA, Viral/metabolism
- Viral Nonstructural Proteins/chemistry
- Viral Nonstructural Proteins/metabolism
Collapse
|
33
|
Solution NMR Structure of the Junction between Tropomyosin Molecules: Implications for Actin Binding and Regulation. J Mol Biol 2006; 364:80-96. [PMID: 16999976 DOI: 10.1016/j.jmb.2006.08.033] [Citation(s) in RCA: 104] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2006] [Accepted: 08/07/2006] [Indexed: 10/24/2022]
Abstract
Tropomyosin is a coiled-coil protein that binds head-to-tail along the length of actin filaments in eukaryotic cells, stabilizing them and providing protection from severing proteins. Tropomyosin cooperatively regulates actin's interaction with myosin and mediates the Ca2+ -dependent regulation of contraction by troponin in striated muscles. The N-terminal and C-terminal ends are critical functional determinants that form an "overlap complex". Here we report the solution NMR structure of an overlap complex formed of model peptides. In the complex, the chains of the C-terminal coiled coil spread apart to allow insertion of 11 residues of the N-terminal coiled coil into the resulting cleft. The plane of the N-terminal coiled coil is rotated 90 degrees relative to the plane of the C terminus. A consequence of the geometry is that the orientation of postulated periodic actin binding sites on the coiled-coil surface is retained from one molecule to the next along the actin filament when the overlap complex is modeled into the X-ray structure of tropomyosin determined at 7 Angstroms. Nuclear relaxation NMR data reveal flexibility of the junction, which may function to optimize binding along the helical actin filament and to allow mobility of tropomyosin on the filament surface as it switches between regulatory states.
Collapse
|
34
|
Mechanism of activation for transcription factor PhoB suggested by different modes of dimerization in the inactive and active states. Structure 2005; 13:1353-63. [PMID: 16154092 PMCID: PMC3685586 DOI: 10.1016/j.str.2005.06.006] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/05/2005] [Revised: 06/14/2005] [Accepted: 06/15/2005] [Indexed: 10/25/2022]
Abstract
Response regulators (RRs), which undergo phosphorylation/dephosphorylation at aspartate residues, are highly prevalent in bacterial signal transduction. RRs typically contain an N-terminal receiver domain that regulates the activities of a C-terminal DNA binding domain in a phosphorylation-dependent manner. We present crystallography and solution NMR data for the receiver domain of Escherichia coli PhoB which show distinct 2-fold symmetric dimers in the inactive and active states. These structures, together with the previously determined structure of the C-terminal domain of PhoB bound to DNA, define the conformation of the active transcription factor and provide a model for the mechanism of activation in the OmpR/PhoB subfamily, the largest group of RRs. In the active state, the receiver domains dimerize with 2-fold rotational symmetry using their alpha4-beta5-alpha5 faces, while the effector domains bind to DNA direct repeats with tandem symmetry, implying a loss of intramolecular interactions.
Collapse
|
35
|
Comparisons of NMR Spectral Quality and Success in Crystallization Demonstrate that NMR and X-ray Crystallography Are Complementary Methods for Small Protein Structure Determination. J Am Chem Soc 2005; 127:16505-11. [PMID: 16305237 DOI: 10.1021/ja053564h] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
X-ray crystallography and NMR spectroscopy provide the only sources of experimental data from which protein structures can be analyzed at high or even atomic resolution. The degree to which these methods complement each other as sources of structural knowledge is a matter of debate; it is often proposed that small proteins yielding high quality, readily analyzed NMR spectra are a subset of those that readily yield strongly diffracting crystals. We have examined the correlation between NMR spectral quality and success in structure determination by X-ray crystallography for 159 prokaryotic and eukaryotic proteins, prescreened to avoid proteins providing polydisperse and/or aggregated samples. This study demonstrates that, across this protein sample set, the quality of a protein's [15N-1H]-heteronuclear correlation (HSQC) spectrum recorded under conditions generally suitable for 3D structure determination by NMR, a key predictor of the ability to determine a structure by NMR, is not correlated with successful crystallization and structure determination by X-ray crystallography. These results, together with similar results of an independent study presented in the accompanying paper (Yee, et al., J. Am. Chem. Soc., accompanying paper), demonstrate that X-ray crystallography and NMR often provide complementary sources of structural data and that both methods are required in order to optimize success for as many targets as possible in large-scale structural proteomics efforts.
Collapse
|
36
|
1H, 13C, and 15N resonance assignments for Escherichia coli ytfP, a member of the broadly conserved UPF0131 protein domain family. JOURNAL OF BIOMOLECULAR NMR 2005; 33:197. [PMID: 16331424 DOI: 10.1007/s10858-005-2597-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/05/2023]
|
37
|
Abstract
In this chapter we describe the core Protein Production Platform of the Northeast Structural Genomics Consortium (NESG) and outline the strategies used for producing high-quality protein samples using Escherichia coli host vectors. The platform is centered on 6X-His affinity-tagged protein constructs, allowing for a similar purification procedure for most targets, and the implementation of high-throughput parallel methods. In most cases, these affinity-purified proteins are sufficiently homogeneous that a single subsequent gel filtration chromatography step is adequate to produce protein preparations that are greater than 98% pure. Using this platform, over 1000 different proteins have been cloned, expressed, and purified in tens of milligram quantities over the last 36-month period (see Summary Statistics for All Targets, ). Our experience using a hierarchical multiplex expression and purification strategy, also described in this chapter, has allowed us to achieve success in producing not only protein samples but also many three-dimensional structures. As of December 2004, the NESG Consortium has deposited over 145 new protein structures to the Protein Data Bank (PDB); about two-thirds of these protein samples were produced by the NESG Protein Production Facility described here. The methods described here have proven effective in producing quality samples of both eukaryotic and prokaryotic proteins. These improved robotic and?or parallel cloning, expression, protein production, and biophysical screening technologies will be of broad value to the structural biology, functional proteomics, and structural genomics communities.
Collapse
|
38
|
Cold-shock induced high-yield protein production in Escherichia coli. Nat Biotechnol 2004; 22:877-82. [PMID: 15195104 DOI: 10.1038/nbt984] [Citation(s) in RCA: 257] [Impact Index Per Article: 12.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2004] [Accepted: 04/27/2004] [Indexed: 11/09/2022]
Abstract
Overexpression of proteins in Escherichia coli at low temperature improves their solubility and stability. Here, we apply the unique features of the cspA gene to develop a series of expression vectors, termed pCold vectors, that drive the high expression of cloned genes upon induction by cold-shock. Several proteins were produced with very high yields, including E. coli EnvZ ATP-binding domain (EnvZ-B) and Xenopus laevis calmodulin (CaM). The pCold vector system can also be used to selectively enrich target proteins with isotopes to study their properties in cell lysates using NMR spectroscopy. We have cloned 38 genes from a range of prokaryotic and eukaryotic organisms into both pCold and pET14 (ref. 3) systems, and found that pCold vectors are highly complementary to the widely used pET vectors.
Collapse
|
39
|
Backbone 1H, 15N and 13C assignments for the 21 kDa Caenorhabditis elegans homologue of "brain-specific" protein. JOURNAL OF BIOMOLECULAR NMR 2004; 28:91-92. [PMID: 14739645 DOI: 10.1023/b:jnmr.0000012832.71049.bf] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/24/2023]
|
40
|
Abstract
The antibacterial peptide microcin J25 (MccJ25) inhibits bacterial transcription by binding within, and obstructing, the nucleotide-uptake channel of bacterial RNA polymerase. Published covalent and three-dimensional structures indicate that MccJ25 is a 21-residue cycle. Here, we show that the published covalent and three-dimensional structures are incorrect, and that MccJ25 in fact is a 21-residue "lariat protoknot", consisting of an 8-residue cyclic segment followed by a 13-residue linear segment that loops back and threads through the cyclic segment. MccJ25 is the first example of a lariat protoknot involving a backbone-side chain amide linkage.
Collapse
|
41
|
Automated protein fold determination using a minimal NMR constraint strategy. Protein Sci 2003; 12:1232-46. [PMID: 12761394 PMCID: PMC2323888 DOI: 10.1110/ps.0300203] [Citation(s) in RCA: 45] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2003] [Revised: 03/11/2003] [Accepted: 03/12/2003] [Indexed: 10/27/2022]
Abstract
Determination of precise and accurate protein structures by NMR generally requires weeks or even months to acquire and interpret all the necessary NMR data. However, even medium-accuracy fold information can often provide key clues about protein evolution and biochemical function(s). In this article we describe a largely automatic strategy for rapid determination of medium-accuracy protein backbone structures. Our strategy derives from ideas originally introduced by other groups for determining medium-accuracy NMR structures of large proteins using deuterated, (13)C-, (15)N-enriched protein samples with selective protonation of side-chain methyl groups ((13)CH(3)). Data collection includes acquiring NMR spectra for automatically determining assignments of backbone and side-chain (15)N, H(N) resonances, and side-chain (13)CH(3) methyl resonances. These assignments are determined automatically by the program AutoAssign using backbone triple resonance NMR data, together with Spin System Type Assignment Constraints (STACs) derived from side-chain triple-resonance experiments. The program AutoStructure then derives conformational constraints using these chemical shifts, amide (1)H/(2)H exchange, nuclear Overhauser effect spectroscopy (NOESY), and residual dipolar coupling data. The total time required for collecting such NMR data can potentially be as short as a few days. Here we demonstrate an integrated set of NMR software which can process these NMR spectra, carry out resonance assignments, interpret NOESY data, and generate medium-accuracy structures within a few days. The feasibility of this combined data collection and analysis strategy starting from raw NMR time domain data was illustrated by automatic analysis of a medium accuracy structure of the Z domain of Staphylococcal protein A.
Collapse
|
42
|
Solution NMR structure of ribosome-binding factor A (RbfA), a cold-shock adaptation protein from Escherichia coli. J Mol Biol 2003; 327:521-36. [PMID: 12628255 DOI: 10.1016/s0022-2836(03)00061-5] [Citation(s) in RCA: 70] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
Ribosome-binding factor A (RbfA) from Escherichia coli is a cold-shock adaptation protein. It is essential for efficient processing of 16S rRNA and is suspected to interact with the 5'-terminal helix (helix I) of 16S rRNA. RbfA is a member of a large family of small proteins found in most bacterial organisms, making it an important target for structural proteomics. Here, we describe the three-dimensional structure of RbfADelta25, a 108 residue construct with 25 residues removed from the carboxyl terminus of full-length RbfA, determined in solution at pH 5.0 by heteronuclear NMR methods. The structure determination was carried out using largely automated methods for determining resonance assignments, interpreting nuclear Overhauser effect (NOE) spectroscopy (NOESY) spectra, and structure generation. RbfADelta25 has an alpha+beta fold containing three helices and three beta-strands, alpha1-beta1-beta2-alpha2-alpha3-beta3. The structure has type-II KH-domain fold topology, related to conserved KH sequence family proteins whose betaalphaalphabeta subunits are characterized by a helix-turn-helix motif with sequence signature GxxG at the turn. In RbfA, this betaalphaalphabeta subunit is characterized by a helix-kink-helix motif in which the GxxG sequence is replaced by a conserved AxG sequence, including a strongly conserved Ala residue at position 75 forming an interhelical kink. The electrostatic field distribution about RbfADelta25 is bipolar; one side of the molecule is strongly negative and the opposite face has a strong positive electrostatic field. A "dynamic hot spot" of RbfADelta25 has been identified in the vicinity of a beta-bulge at strongly conserved residue Ser39 by 15N R(1), R(2) relaxation rate and heteronuclear 15N-1H NOE measurements. Analyses of these distributions of electrostatic field and internal dynamics, together with evolutionary implications of fold and sequence conservation, suggest that RbfA is indeed a nucleic acid-binding protein, and identify a potential RNA-binding site in or around the conserved polypeptide segment Ser76-Asp100 corresponding to the alpha3-loop-beta3 helix-loop-strand structure. While the structure of RbfADelta25 is most similar to that of the KH domain of the E.coli Era GTPase, its electrostatic field distribution is most similar to the KH1 domain of the NusA protein from Thermotoga maritima, another cold-shock associated RNA-binding protein. Both RbfA and NusA are regulated in the same E.coli operon. Structural and functional similarities between RbfA, NusA, and other bacterial type II KH domains suggest previously unsuspected evolutionary relationships between these cold-shock associated proteins.
Collapse
MESH Headings
- Adaptation, Physiological
- Amino Acid Sequence
- Cold Temperature
- Escherichia coli
- Escherichia coli Proteins/chemistry
- Escherichia coli Proteins/genetics
- Escherichia coli Proteins/metabolism
- Gene Deletion
- Molecular Sequence Data
- Mutagenesis, Site-Directed
- Nuclear Magnetic Resonance, Biomolecular
- Peptide Elongation Factors/chemistry
- Protein Conformation
- RNA, Bacterial/chemistry
- RNA, Bacterial/genetics
- RNA, Ribosomal, 16S/chemistry
- RNA, Ribosomal, 16S/genetics
- Ribosomal Proteins/chemistry
- Ribosomal Proteins/genetics
- Ribosomal Proteins/metabolism
- Ribosomes/chemistry
- Sequence Homology, Amino Acid
- Shock
- Transcription Factors/chemistry
- Transcriptional Elongation Factors
Collapse
|
43
|
The structure of the carboxyl terminus of striated alpha-tropomyosin in solution reveals an unusual parallel arrangement of interacting alpha-helices. Biochemistry 2003; 42:614-9. [PMID: 12534273 DOI: 10.1021/bi026989e] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Coiled coils are well-known as oligomerization domains, but they are also important sites of protein-protein interactions. We determined the NMR solution structure and backbone (15)N relaxation rates of a disulfide cross-linked, two-chain, 37-residue polypeptide containing the 34 C-terminal residues of striated muscle alpha-tropomyosin, TM9a(251-284). The peptide binds to the N-terminal region of TM and to the tropomyosin-binding domain of the regulatory protein, troponin T. Comparison of the NMR solution structure of TM9a(251-284) with the X-ray structure of a related peptide [Li, Y., Mui, S., Brown, J. H., Strand, J., Reshetnikova, L., Tobacman, L. S., and Cohen, C. (2002) Proc. Natl. Acad. Sci. U.S.A. 99, 7378-7383] reveals significant differences. In solution, residues 253-269 (like most of the tropomyosin molecule) form a canonical coiled coil. Residues 270-279, however, are parallel, linear helices, novel for tropomyosin. The packing between the parallel helices results from unusual interface residues that are atypical for coiled coils. Y267 has poor packing at the coiled-coil interface and a lower R(2) relaxation rate than neighboring residues, suggesting there is conformational flexibility around this residue. The last five residues are nonhelical and flexible. The exposed surface presented by the parallel helices, and the flexibility around Y267 and the ends, may facilitate binding to troponin T and formation of complexes with the N-terminus of tropomyosin and actin. We propose that unusual packing and flexibility are general features of coiled-coil domains in proteins that are involved in intermolecular interactions.
Collapse
|
44
|
Semi-automated backbone resonance assignments of the extracellular ligand-binding domain of an ionotropic glutamate receptor. JOURNAL OF BIOMOLECULAR NMR 2002; 22:297-298. [PMID: 11991359 DOI: 10.1023/a:1014954931635] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
|
45
|
Sensitivity Enhancement of Triple-Resonance Protein NMR Spectra by Proton Evolution of Multiple-Quantum Coherences Using a Simultaneous 1H and 13C Constant-Time Evolution Period. J Am Chem Soc 1997. [DOI: 10.1021/ja970734k] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
46
|
Chiral polyhydroxylated tetrahydrothiophene derivatives: novel synthesis and structural elucidation by X-ray crystallography, NMR spectroscopy and molecular mechanics calculations. ACTA ACUST UNITED AC 1993. [DOI: 10.1039/p19930001255] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
|