1
|
Manthey I, Tonelli M, II LC, Rahimi M, Markley JL, Lee W. POKY software tools encapsulating assignment strategies for solution and solid-state protein NMR data. J Struct Biol X 2022; 6:100073. [PMID: 36081577 PMCID: PMC9445392 DOI: 10.1016/j.yjsbx.2022.100073] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 08/04/2022] [Accepted: 08/23/2022] [Indexed: 11/23/2022] Open
Abstract
New tools support efficient analysis of solution and solid-state NMR spectra of proteins. POKY integrates a powerful suite of software packages for automated assignments. The Versatile Assigner module validates assignments through probabilistic analysis. The operation of these tools is supported by on-line guidance. The performance of these tools is evaluated in reference to competing software.
NMR spectroscopy provides structural and functional information about biomolecules and their complexes. The complexity of these systems can make the NMR data difficult to interpret, particularly for newer users of NMR technology, who may have limited understanding of the tools available and how they are used. To alleviate this problem, we have created software based on standardized workflows for both solution and solid-state NMR spectroscopy of proteins. These tools assist with manual and automated peak picking and with chemical shift assignment and validation. They provide users with an optimized path through spectral analysis that can help them perform the necessary tasks more efficiently.
Collapse
Affiliation(s)
- Ira Manthey
- Department of Chemistry, and URS Scholars Program, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison, University of Wisconsin-Madison, Madison, WI 53706, USA
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | | | - Mehdi Rahimi
- Department of Chemistry, University of Colorado Denver, Denver, CO 80204, USA
| | - John L. Markley
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Woonghee Lee
- Department of Chemistry, University of Colorado Denver, Denver, CO 80204, USA
- Corresponding author.
| |
Collapse
|
2
|
Romero PR, Kobayashi N, Wedell JR, Baskaran K, Iwata T, Yokochi M, Maziuk D, Yao H, Fujiwara T, Kurusu G, Ulrich EL, Hoch JC, Markley JL. BioMagResBank (BMRB) as a Resource for Structural Biology. Methods Mol Biol 2020; 2112:187-218. [PMID: 32006287 DOI: 10.1007/978-1-0716-0270-6_14] [Citation(s) in RCA: 40] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Abstract
The Biological Magnetic Resonance Data Bank (BioMagResBank or BMRB), founded in 1988, serves as the archive for data generated by nuclear magnetic resonance (NMR) spectroscopy of biological systems. NMR spectroscopy is unique among biophysical approaches in its ability to provide a broad range of atomic and higher-level information relevant to the structural, dynamic, and chemical properties of biological macromolecules, as well as report on metabolite and natural product concentrations in complex mixtures and their chemical structures. BMRB became a core member of the Worldwide Protein Data Bank (wwPDB) in 2007, and the BMRB archive is now a core archive of the wwPDB. Currently, about 10% of the structures deposited into the PDB archive are based on NMR spectroscopy. BMRB stores experimental and derived data from biomolecular NMR studies. Newer BMRB biopolymer depositions are divided about evenly between those associated with structure determinations (atomic coordinates and supporting information archived in the PDB) and those reporting experimental information on molecular dynamics, conformational transitions, ligand binding, assigned chemical shifts, or other results from NMR spectroscopy. BMRB also provides resources for NMR studies of metabolites and other small molecules that are often macromolecular ligands and/or nonstandard residues. This chapter is directed to the structural biology community rather than the metabolomics and natural products community. Our goal is to describe various BMRB services offered to structural biology researchers and how they can be accessed and utilized. These services can be classified into four main groups: (1) data deposition, (2) data retrieval, (3) data analysis, and (4) services for NMR spectroscopists and software developers. The chapter also describes the NMR-STAR data format used by BMRB and the tools provided to facilitate its use. For programmers, BMRB offers an application programming interface (API) and libraries in the Python and R languages that enable users to develop their own BMRB-based tools for data analysis, visualization, and manipulation of NMR-STAR formatted files. BMRB also provides users with direct access tools through the NMRbox platform.
Collapse
Affiliation(s)
- Pedro R Romero
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Naohiro Kobayashi
- PDBj-BMRB, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
| | - Jonathan R Wedell
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Kumaran Baskaran
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Takeshi Iwata
- PDBj-BMRB, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
| | - Masashi Yokochi
- PDBj-BMRB, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
| | - Dimitri Maziuk
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Hongyang Yao
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Toshimichi Fujiwara
- PDBj-BMRB, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
| | - Genji Kurusu
- PDBj-BMRB, Institute for Protein Research, Osaka University, Suita, Osaka, Japan
| | - Eldon L Ulrich
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA
| | - Jeffrey C Hoch
- BMRB, Department of Molecular Biology and Biophysics, UConn Health, Farmington, CT, USA
| | - John L Markley
- BMRB, Biochemistry Department, University of Wisconsin-Madison, Madison, WI, USA.
| |
Collapse
|
3
|
Bortnov V, Tonelli M, Lee W, Lin Z, Annis DS, Demerdash ON, Bateman A, Mitchell JC, Ge Y, Markley JL, Mosher DF. Solution structure of human myeloid-derived growth factor suggests a conserved function in the endoplasmic reticulum. Nat Commun 2019; 10:5612. [PMID: 31819058 PMCID: PMC6901522 DOI: 10.1038/s41467-019-13577-5] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2019] [Accepted: 11/13/2019] [Indexed: 12/12/2022] Open
Abstract
Human myeloid-derived growth factor (hMYDGF) is a 142-residue protein with a C-terminal endoplasmic reticulum (ER) retention sequence (ERS). Extracellular MYDGF mediates cardiac repair in mice after anoxic injury. Although homologs of hMYDGF are found in eukaryotes as distant as protozoans, its structure and function are unknown. Here we present the NMR solution structure of hMYDGF, which consists of a short α-helix and ten β-strands distributed in three β-sheets. Conserved residues map to the unstructured ERS, loops on the face opposite the ERS, and the surface of a cavity underneath the conserved loops. The only protein or portion of a protein known to have a similar fold is the base domain of VNN1. We suggest, in analogy to the tethering of the VNN1 nitrilase domain to the plasma membrane via its base domain, that MYDGF complexed to the KDEL receptor binds cargo via its conserved residues for transport to the ER.
Collapse
Affiliation(s)
- Valeriu Bortnov
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Marco Tonelli
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- National Magnetic Resonance Facility at Madison, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Woonghee Lee
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- National Magnetic Resonance Facility at Madison, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Ziqing Lin
- Departments of Cell and Regenerative Biology and Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Human Proteomics Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Douglas S Annis
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Omar N Demerdash
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Alex Bateman
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| | - Julie C Mitchell
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN, 37831, USA
| | - Ying Ge
- Departments of Cell and Regenerative Biology and Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- Human Proteomics Program, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - John L Markley
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA
- National Magnetic Resonance Facility at Madison, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Deane F Mosher
- Department of Biomolecular Chemistry, University of Wisconsin-Madison, Madison, WI, 53706, USA.
- Department of Medicine, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
4
|
Lee W, Markley JL. PINE-SPARKY.2 for automated NMR-based protein structure research. Bioinformatics 2019; 34:1586-1588. [PMID: 29281006 PMCID: PMC5925765 DOI: 10.1093/bioinformatics/btx785] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2017] [Accepted: 12/20/2017] [Indexed: 12/30/2022] Open
Abstract
Summary Nuclear magnetic resonance (NMR) spectroscopy, along with X-ray crystallography and cryoelectron microscopy, is one of the three major tools that enable the determination of atomic-level structural models of biological macromolecules. Of these, NMR has the unique ability to follow important processes in solution, including conformational changes, internal dynamics and protein-ligand interactions. As a means for facilitating the handling and analysis of spectra involved in these types of NMR studies, we have developed PINE-SPARKY.2, a software package that integrates and automates discrete tasks that previously required interaction with separate software packages. The graphical user interface of PINE-SPARKY.2 simplifies chemical shift assignment and verification, automated detection of secondary structural elements, predictions of flexibility and hydrophobic cores, and calculation of three-dimensional structural models. Availability and implementation PINE-SPARKY.2 is available in the latest version of NMRFAM-SPARKY from the National Magnetic Resonance Facility at Madison (http://pine.nmrfam.wisc.edu/download_packages.html), the NMRbox Project (https://nmrbox.org) and to subscribers to the SBGrid (https://sbgrid.org). For a detailed description of the program, see http://www.nmrfam.wisc.edu/pine-sparky2.htm. Contact whlee@nmrfam.wisc.edu or markley@nmrfam.wisc.edu. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Woonghee Lee
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - John L Markley
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, Madison, WI 53706, USA
| |
Collapse
|
5
|
Ulrich EL, Baskaran K, Dashti H, Ioannidis YE, Livny M, Romero PR, Maziuk D, Wedell JR, Yao H, Eghbalnia HR, Hoch JC, Markley JL. NMR-STAR: comprehensive ontology for representing, archiving and exchanging data from nuclear magnetic resonance spectroscopic experiments. JOURNAL OF BIOMOLECULAR NMR 2019; 73:5-9. [PMID: 30580387 PMCID: PMC6441402 DOI: 10.1007/s10858-018-0220-3] [Citation(s) in RCA: 30] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/30/2018] [Accepted: 12/14/2018] [Indexed: 05/16/2023]
Abstract
The growth of the biological nuclear magnetic resonance (NMR) field and the development of new experimental technology have mandated the revision and enlargement of the NMR-STAR ontology used to represent experiments, spectral and derived data, and supporting metadata. We present here a brief description of the NMR-STAR ontology and software tools for manipulating NMR-STAR data files, editing the files, extracting selected data, and creating data visualizations. Detailed information on these is accessible from the links provided.
Collapse
Affiliation(s)
- Eldon L Ulrich
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Kumaran Baskaran
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hesam Dashti
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | | | - Miron Livny
- Department of Computer Sciences, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Pedro R Romero
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Dimitri Maziuk
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Jonathan R Wedell
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hongyang Yao
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hamid R Eghbalnia
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Jeffrey C Hoch
- Department of Molecular Biology and Biophysics, UConn Health, 263 Farmington Avenue, Farmington, CT, 06030, USA
| | - John L Markley
- Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
6
|
Current Solution NMR Techniques for Structure-Function Studies of Proteins and RNA Molecules. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1105:43-58. [DOI: 10.1007/978-981-13-2200-6_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
7
|
Gao Q, Chalmers GR, Moremen KW, Prestegard JH. NMR assignments of sparsely labeled proteins using a genetic algorithm. JOURNAL OF BIOMOLECULAR NMR 2017; 67:283-294. [PMID: 28289927 PMCID: PMC5434516 DOI: 10.1007/s10858-017-0101-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/30/2016] [Accepted: 02/22/2017] [Indexed: 05/16/2023]
Abstract
Sparse isotopic labeling of proteins for NMR studies using single types of amino acid (15N or 13C enriched) has several advantages. Resolution is enhanced by reducing numbers of resonances for large proteins, and isotopic labeling becomes economically feasible for glycoproteins that must be expressed in mammalian cells. However, without access to the traditional triple resonance strategies that require uniform isotopic labeling, NMR assignment of crosspeaks in heteronuclear single quantum coherence (HSQC) spectra is challenging. We present an alternative strategy which combines readily accessible NMR data with known protein domain structures. Based on the structures, chemical shifts are predicted, NOE cross-peak lists are generated, and residual dipolar couplings (RDCs) are calculated for each labeled site. Simulated data are then compared to measured values for a trial set of assignments and scored. A genetic algorithm uses the scores to search for an optimal pairing of HSQC crosspeaks with labeled sites. While none of the individual data types can give a definitive assignment for a particular site, their combination can in most cases. Four test proteins previously assigned using triple resonance methods and a sparsely labeled glycosylated protein, Robo1, previously assigned by manual analysis, are used to validate the method and develop a criterion for identifying sites assigned with high confidence.
Collapse
Affiliation(s)
- Qi Gao
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, 30602, USA
| | - Gordon R Chalmers
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, 30602, USA
- Department of Computer Science and Complex Carbohydrate Research Center, University of Georgia, Athens, GA, 30602, USA
| | - Kelley W Moremen
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, 30602, USA
| | - James H Prestegard
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
8
|
Lee W, Cornilescu G, Dashti H, Eghbalnia HR, Tonelli M, Westler WM, Butcher SE, Henzler-Wildman KA, Markley JL. Integrative NMR for biomolecular research. JOURNAL OF BIOMOLECULAR NMR 2016; 64:307-32. [PMID: 27023095 PMCID: PMC4861749 DOI: 10.1007/s10858-016-0029-x] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2016] [Accepted: 03/21/2016] [Indexed: 05/05/2023]
Abstract
NMR spectroscopy is a powerful technique for determining structural and functional features of biomolecules in physiological solution as well as for observing their intermolecular interactions in real-time. However, complex steps associated with its practice have made the approach daunting for non-specialists. We introduce an NMR platform that makes biomolecular NMR spectroscopy much more accessible by integrating tools, databases, web services, and video tutorials that can be launched by simple installation of NMRFAM software packages or using a cross-platform virtual machine that can be run on any standard laptop or desktop computer. The software package can be downloaded freely from the NMRFAM software download page ( http://pine.nmrfam.wisc.edu/download_packages.html ), and detailed instructions are available from the Integrative NMR Video Tutorial page ( http://pine.nmrfam.wisc.edu/integrative.html ).
Collapse
Affiliation(s)
- Woonghee Lee
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| | - Gabriel Cornilescu
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hesam Dashti
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Hamid R Eghbalnia
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - William M Westler
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Samuel E Butcher
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - Katherine A Henzler-Wildman
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA
| | - John L Markley
- National Magnetic Resonance Facility at Madison and Biochemistry Department, University of Wisconsin-Madison, Madison, WI, 53706, USA.
| |
Collapse
|
9
|
Dashti H, Lee W, Tonelli M, Cornilescu CC, Cornilescu G, Assadi-Porter FM, Westler WM, Eghbalnia HR, Markley JL. NMRFAM-SDF: a protein structure determination framework. JOURNAL OF BIOMOLECULAR NMR 2015; 62:481-95. [PMID: 25900069 PMCID: PMC4569665 DOI: 10.1007/s10858-015-9933-8] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/16/2015] [Accepted: 04/15/2015] [Indexed: 05/21/2023]
Abstract
The computationally demanding nature of automated NMR structure determination necessitates a delicate balancing of factors that include the time complexity of data collection, the computational complexity of chemical shift assignments, and selection of proper optimization steps. During the past two decades the computational and algorithmic aspects of several discrete steps of the process have been addressed. Although no single comprehensive solution has emerged, the incorporation of a validation protocol has gained recognition as a necessary step for a robust automated approach. The need for validation becomes even more pronounced in cases of proteins with higher structural complexity, where potentially larger errors generated at each step can propagate and accumulate in the process of structure calculation, thereby significantly degrading the efficacy of any software framework. This paper introduces a complete framework for protein structure determination with NMR--from data acquisition to the structure determination. The aim is twofold: to simplify the structure determination process for non-NMR experts whenever feasible, while maintaining flexibility by providing a set of modules that validate each step, and to enable the assessment of error propagations. This framework, called NMRFAM-SDF (NMRFAM-Structure Determination Framework), and its various components are available for download from the NMRFAM website (http://nmrfam.wisc.edu/software.htm).
Collapse
Affiliation(s)
- Hesam Dashti
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Woonghee Lee
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Claudia C Cornilescu
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Gabriel Cornilescu
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Fariba M Assadi-Porter
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - William M Westler
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - Hamid R Eghbalnia
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA
| | - John L Markley
- National Magnetic Resonance Facility at Madison, Biochemistry Department, University of Wisconsin-Madison, 433 Babcock Drive, Madison, WI, USA.
| |
Collapse
|