1
|
Alazmi M, Abbas A, Guo X, Fan M, Li L, Gao X. A Slice-based 13C-detected NMR Spin System Forming and Resonance Assignment Method. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2018; 15:1999-2008. [PMID: 29994483 DOI: 10.1109/tcbb.2018.2849728] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is attracting more attention in the field of computational structural biology. Till recently, 1H-detected experiments are the dominant NMR technique used due to the high sensitivity of 1H nuclei. However, the current availability of high magnetic fields and cryogenically cooled probe heads allow researchers to overcome the low sensitivity of 13C nuclei. Consequently, 13C-detected experiments have become a popular technique in different NMR applications especially resonance assignment and structure determination of large proteins. In this paper, we propose the first spin system forming method for 13C-detected NMR spectra. Our method is able to accurately form spin systems based on as few as two 13C-detected spectra, CBCACON, and CBCANCO. Our method picks slices from the more trusted spectrum and uses them as feedback to direct the slice picking in the less trusted one. This feedback leads to picking the accurate slices that consequently helps to form better spin systems. We tested our method on a real dataset of 'Ubiquitin' and a benchmark simulated dataset consisting of 12 proteins. We fed our spin systems as inputs to a genetic algorithm to generate the chemical shift assignment, and obtained 92 percent correct chemical shift assignment for Ubiquitin. For the simulated dataset, we obtained an average recall of 86 percent and an average precision of 88 percent. Finally, our chemical shift assignment of Ubiquitin was given as an input to CS-ROSETTA server that generated structures close to the experimentally determined structure.
Collapse
|
2
|
Unraveling the meaning of chemical shifts in protein NMR. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2017; 1865:1564-1576. [PMID: 28716441 DOI: 10.1016/j.bbapap.2017.07.005] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/22/2017] [Revised: 06/29/2017] [Accepted: 07/07/2017] [Indexed: 12/14/2022]
Abstract
Chemical shifts are among the most informative parameters in protein NMR. They provide wealth of information about protein secondary and tertiary structure, protein flexibility, and protein-ligand binding. In this report, we review the progress in interpreting and utilizing protein chemical shifts that has occurred over the past 25years, with a particular focus on the large body of work arising from our group and other Canadian NMR laboratories. More specifically, this review focuses on describing, assessing, and providing some historical context for various chemical shift-based methods to: (1) determine protein secondary and super-secondary structure; (2) derive protein torsion angles; (3) assess protein flexibility; (4) predict residue accessible surface area; (5) refine 3D protein structures; (6) determine 3D protein structures and (7) characterize intrinsically disordered proteins. This review also briefly covers some of the methods that we previously developed to predict chemical shifts from 3D protein structures and/or protein sequence data. It is hoped that this review will help to increase awareness of the considerable utility of NMR chemical shifts in structural biology and facilitate more widespread adoption of chemical-shift based methods by the NMR spectroscopists, structural biologists, protein biophysicists, and biochemists worldwide. This article is part of a Special Issue entitled: Biophysics in Canada, edited by Lewis Kay, John Baenziger, Albert Berghuis and Peter Tieleman.
Collapse
|
3
|
Maximova T, Moffatt R, Ma B, Nussinov R, Shehu A. Principles and Overview of Sampling Methods for Modeling Macromolecular Structure and Dynamics. PLoS Comput Biol 2016; 12:e1004619. [PMID: 27124275 PMCID: PMC4849799 DOI: 10.1371/journal.pcbi.1004619] [Citation(s) in RCA: 138] [Impact Index Per Article: 15.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Investigation of macromolecular structure and dynamics is fundamental to understanding how macromolecules carry out their functions in the cell. Significant advances have been made toward this end in silico, with a growing number of computational methods proposed yearly to study and simulate various aspects of macromolecular structure and dynamics. This review aims to provide an overview of recent advances, focusing primarily on methods proposed for exploring the structure space of macromolecules in isolation and in assemblies for the purpose of characterizing equilibrium structure and dynamics. In addition to surveying recent applications that showcase current capabilities of computational methods, this review highlights state-of-the-art algorithmic techniques proposed to overcome challenges posed in silico by the disparate spatial and time scales accessed by dynamic macromolecules. This review is not meant to be exhaustive, as such an endeavor is impossible, but rather aims to balance breadth and depth of strategies for modeling macromolecular structure and dynamics for a broad audience of novices and experts.
Collapse
Affiliation(s)
- Tatiana Maximova
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
| | - Ryan Moffatt
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
| | - Buyong Ma
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland, United States of America
| | - Ruth Nussinov
- Basic Science Program, Leidos Biomedical Research, Inc. Cancer and Inflammation Program, National Cancer Institute, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Amarda Shehu
- Department of Computer Science, George Mason University, Fairfax, Virginia, United States of America
- Department of Biongineering, George Mason University, Fairfax, Virginia, United States of America
- School of Systems Biology, George Mason University, Manassas, Virginia, United States of America
| |
Collapse
|
4
|
Carr JM, Whittleston CS, Wade DC, Wales DJ. Energy landscapes of a hairpin peptide including NMR chemical shift restraints. Phys Chem Chem Phys 2015; 17:20250-8. [PMID: 26186565 DOI: 10.1039/c5cp01259g] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
Methods recently introduced to improve the efficiency of protein structure prediction simulations by adding a restraint potential to a molecular mechanics force field introduce additional input parameters that can affect the performance. Here we investigate the changes in the energy landscape as the relative weight of the two contributions, force field and restraint potential, is systematically altered, for restraint functions constructed from calculated nuclear magnetic resonance chemical shifts. Benchmarking calculations were performed on a 12-residue peptide, tryptophan zipper 1, which features both secondary structure (a β-hairpin) and specific packing of tryptophan sidechains. Basin-hopping global optimization was performed to assess the efficiency with which lowest-energy structures are located, and the discrete path sampling approach was employed to survey the energy landscapes between unfolded and folded structures. We find that inclusion of the chemical shift restraints improves the efficiency of structure prediction because the energy landscape becomes more funnelled and the proportion of local minima classified as native increases. However, the funnelling nature of the landscape is reduced as the relative contribution of the chemical shift restraint potential is increased past an optimal value.
Collapse
Affiliation(s)
- Joanne M Carr
- University Chemical Laboratories, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| | | | | | | |
Collapse
|
5
|
Equilibrium simulations of proteins using molecular fragment replacement and NMR chemical shifts. Proc Natl Acad Sci U S A 2014; 111:13852-7. [PMID: 25192938 DOI: 10.1073/pnas.1404948111] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/03/2023] Open
Abstract
Methods of protein structure determination based on NMR chemical shifts are becoming increasingly common. The most widely used approaches adopt the molecular fragment replacement strategy, in which structural fragments are repeatedly reassembled into different complete conformations in molecular simulations. Although these approaches are effective in generating individual structures consistent with the chemical shift data, they do not enable the sampling of the conformational space of proteins with correct statistical weights. Here, we present a method of molecular fragment replacement that makes it possible to perform equilibrium simulations of proteins, and hence to determine their free energy landscapes. This strategy is based on the encoding of the chemical shift information in a probabilistic model in Markov chain Monte Carlo simulations. First, we demonstrate that with this approach it is possible to fold proteins to their native states starting from extended structures. Second, we show that the method satisfies the detailed balance condition and hence it can be used to carry out an equilibrium sampling from the Boltzmann distribution corresponding to the force field used in the simulations. Third, by comparing the results of simulations carried out with and without chemical shift restraints we describe quantitatively the effects that these restraints have on the free energy landscapes of proteins. Taken together, these results demonstrate that the molecular fragment replacement strategy can be used in combination with chemical shift information to characterize not only the native structures of proteins but also their conformational fluctuations.
Collapse
|
6
|
Mechelke M, Habeck M. Bayesian weighting of statistical potentials in NMR structure calculation. PLoS One 2014; 9:e100197. [PMID: 24956116 PMCID: PMC4067304 DOI: 10.1371/journal.pone.0100197] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2014] [Accepted: 05/23/2014] [Indexed: 11/24/2022] Open
Abstract
The use of statistical potentials in NMR structure calculation improves the accuracy of the final structure but also raises issues of double counting and possible bias. Because statistical potentials are averaged over a large set of structures, they may not reflect the preferences of a particular structure or data set. We propose a Bayesian method to incorporate a knowledge-based backbone dihedral angle potential into an NMR structure calculation. To avoid bias exerted through the backbone potential, we adjust its weight by inferring it from the experimental data. We demonstrate that an optimally weighted potential leads to an improvement in the accuracy and quality of the final structure, especially with sparse and noisy data. Our findings suggest that no universally optimal weight exists, and that the weight should be determined based on the experimental data. Other knowledge-based potentials can be incorporated using the same approach.
Collapse
Affiliation(s)
- Martin Mechelke
- Institute for Mathematical Stochastics, Georg August University Göttingen, Göttingen, Germany
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, Tübingen, Germany
| | - Michael Habeck
- Institute for Mathematical Stochastics, Georg August University Göttingen, Göttingen, Germany
- * E-mail:
| |
Collapse
|
7
|
|
8
|
Menon V, Vallat BK, Dybas JM, Fiser A. Modeling proteins using a super-secondary structure library and NMR chemical shift information. Structure 2013; 21:891-9. [PMID: 23685209 DOI: 10.1016/j.str.2013.04.012] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2012] [Revised: 04/02/2013] [Accepted: 04/13/2013] [Indexed: 11/29/2022]
Abstract
A remaining challenge in protein modeling is to predict structures for sequences with no sequence similarity to any experimentally solved structure. Based on earlier observations, the library of protein backbone supersecondary structure motifs (Smotifs) saturated about a decade ago. Therefore, it should be possible to build any structure from a combination of existing Smotifs with the help of limited experimental data that are sufficient to relate the backbone conformations of Smotifs between target proteins and known structures. Here, we present a hybrid modeling algorithm that relies on an exhaustive Smotif library and on nuclear magnetic resonance chemical shift patterns without any input of primary sequence information. In a test of 102 proteins, the algorithm delivered 90 homology-model-quality models, among them 24 high-quality ones, and a topologically correct solution for almost all cases. The current approach opens a venue to address the modeling of larger protein structures for which chemical shifts are available.
Collapse
Affiliation(s)
- Vilas Menon
- Department of Systems and Computational Biology, Department of Biochemistry, Albert Einstein College of Medicine, 1300 Morris Park Avenue, Bronx, NY 10461, USA
| | | | | | | |
Collapse
|
9
|
Abstract
Solution nuclear magnetic resonance (NMR) spectroscopy has come a long way in characterizing the structure and function of biological molecules since the first one-dimensional spectrum of protein was recorded about 30 years ago. To date (September 1, 2012), there are 9,521 solution NMR structures in the Protein Data Bank, compared to 74,009 determined by crystallographic methods. Unlike X-ray and electron microscopy (EM) methods, which are based on the concepts of Fourier optics and image reconstruction, structure determination by NMR involves measuring structural restraints and finding structural solutions that satisfy the restraints. Although the NMR approach is much less direct in a physical sense, it has proven itself over the years to be capable of de novo structure determination at high precision. Moreover, the method is highly versatile and can be used in a variety of ways for addressing mechanistic questions. NMR measurements of protein internal dynamics and protein-protein or protein-ligand interaction are directly relevant to function in vivo because the molecules are often in physiological buffer conditions. The method can also be applied to investigate protein-folding intermediates, conformational changes, as well as intrinsically unfolded proteins. Recently, along with X-ray and EM, solution NMR has entered a state of rapid growth for structural studies of membrane proteins, already demonstrating its feasibility in de novo structure determination of membrane-embedded ion channels and receptors. As the hardware advances rapidly, especially in cryogenic probes that have much higher sensitivity, the sample concentration required for solution NMR investigation is decreasing, hopefully soon to a concentration level at which nonspecific protein aggregation is no longer an issue. After three decades of improvement in spectrometer technology, NMR pulse experiments, isotope labeling schemes, and structure determination software, we believe that solution NMR will truly enter the production phase in the next decade to answer biological questions of high impact, and to become more versatile than ever in complementing X-ray and EM in investigating protein structure and function.
Collapse
Affiliation(s)
- James J Chou
- Jack and Eileen Connors Structural Biology Laboratory, Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA, USA
| | | |
Collapse
|
10
|
Hoffmann F, Strodel B. Protein structure prediction using global optimization by basin-hopping with NMR shift restraints. J Chem Phys 2013; 138:025102. [PMID: 23320726 DOI: 10.1063/1.4773406] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Computational methods that utilize chemical shifts to produce protein structures at atomic resolution have recently been introduced. In the current work, we exploit chemical shifts by combining the basin-hopping approach to global optimization with chemical shift restraints using a penalty function. For three peptides, we demonstrate that this approach allows us to find near-native structures from fully extended structures within 10,000 basin-hopping steps. The effect of adding chemical shift restraints is that the α and β secondary structure elements form within 1000 basin-hopping steps, after which the orientation of the secondary structure elements, which produces the tertiary contacts, is driven by the underlying protein force field. We further show that our chemical shift-restraint BH approach also works for incomplete chemical shift assignments, where the information from only one chemical shift type is considered. For the proper implementation of chemical shift restraints in the basin-hopping approach, we determined the optimal weight of the chemical shift penalty energy with respect to the CHARMM force field in conjunction with the FACTS solvation model employed in this study. In order to speed up the local energy minimization procedure, we developed a function, which continuously decreases the width of the chemical shift penalty function as the minimization progresses. We conclude that the basin-hopping approach with chemical shift restraints is a promising method for protein structure prediction.
Collapse
Affiliation(s)
- Falk Hoffmann
- Institute of Complex Systems: Structural Biochemistry, Research Centre Jülich, 52425 Jülich, Germany
| | | |
Collapse
|
11
|
Kim TR, Ji S, Lee S, Chu IS, Shin S, Lee J. A hybrid modeling strategy using Nuclear Overhauser Effect data with contact information. Chem Phys Lett 2012. [DOI: 10.1016/j.cplett.2012.09.074] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
|
12
|
Yu W, Lee W, Lee W, Kim S, Chang I. Uncovering symmetry-breaking vector and reliability order for assigning secondary structures of proteins from atomic NMR chemical shifts in amino acids. JOURNAL OF BIOMOLECULAR NMR 2011; 51:411-424. [PMID: 22038647 DOI: 10.1007/s10858-011-9579-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2011] [Accepted: 06/10/2011] [Indexed: 05/31/2023]
Abstract
Unravelling the complex correlation between chemical shifts of (13) C (α), (13) C (β), (13) C', (1) H (α), (15) N, (1) H ( N ) atoms in amino acids of proteins from NMR experiment and local structural environments of amino acids facilitates the assignment of secondary structures of proteins. This is an important impetus for both determining the three-dimensional structure and understanding the biological function of proteins. The previous empirical correlation scores which relate chemical shifts of (13) C (α), (13) C (β), (13) C', (1) H (α), (15) N, (1) H ( N ) atoms to secondary structures resulted in progresses toward assigning secondary structures of proteins. However, the physical-mathematical framework for these was elusive partly due to both the limited and orthogonal exploration of higher-dimensional chemical shifts of hetero-nucleus and the lack of physical-mathematical understanding underlying those correlation scores. Here we present a simple multi-dimensional hetero-nuclear chemical shift score function (MDHN-CSSF) which captures systematically the salient feature of such complex correlations without any references to a random coil state of proteins. We uncover the symmetry-breaking vector and its reliability order not only for distinguishing different secondary structures of proteins but also for capturing the delicate sensitivity interplayed among chemical shifts of (13) C (α), (13) C (β), (13) C', (1) H (α), (15) N, (1) H ( N ) atoms simultaneously, which then provides a straightforward framework toward assigning secondary structures of proteins. MDHN-CSSF could correctly assign secondary structures of training (validating) proteins with the favourable (comparable) Q3 scores in comparison with those from the previous correlation scores. MDHN-CSSF provides a simple and robust strategy for the systematic assignment of secondary structures of proteins and would facilitate the de novo determination of three-dimensional structures of proteins.
Collapse
Affiliation(s)
- Wookyung Yu
- Department of Physics, Center for Proteome Biophysics, Pusan National University, Busan, Korea
| | | | | | | | | |
Collapse
|
13
|
Masica DL, Gray JJ, Shaw WJ. Partial high-resolution structure of phosphorylated and non-phosphorylated leucine-rich amelogenin protein adsorbed to hydroxyapatite. THE JOURNAL OF PHYSICAL CHEMISTRY. C, NANOMATERIALS AND INTERFACES 2011; 115:13775-13785. [PMID: 21845207 PMCID: PMC3155182 DOI: 10.1021/jp202965h] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/31/2023]
Abstract
The formation of biogenic materials requires the interaction of organic molecules with the mineral phase. In forming enamel, the amelogenin proteins contribute to the mineralization of hydroxyapatite (HAp). Leucine-rich amelogenin protein (LRAP) is a naturally occurring splice variant of amelogenin that comprises amelogenin's predicted HAp binding domains. We determined the partial structure of phosphorylated and non-phosphorylated LRAP variants bound to HAp using combined solid-state NMR (ssNMR) and ssNMR-biased computational structure prediction. New ssNMR measurements in the N-terminus indicate a largely extended structure for both variants, though some measurements are consistent with a partially helical N-terminal segment. The N-terminus of the phosphorylated variant is found to be consistently closer to the HAp surface than the non-phosphorylated variant. Structure prediction was biased using 21 ssNMR measurements in the N- and C-terminus at five HAp crystal faces. The predicted fold of LRAP is similar at all HAp faces studied, regardless of phosphorylation. Largely consistent with experimental observations, LRAP's predicted structure is relatively extended with a helix-turn-helix motif in the N-terminal domain and some helix in the C-terminal domain, and the N-terminal domain of the phosphorylated variant binds HAp more closely than the N-terminal domain of the non-phosphorylated variant. Predictions for both variants show some potential binding specificity for the {010} HAp crystal face, providing further support that amelogenins block crystal growth on the a and b faces to allow elongated crystals in the c-axis.
Collapse
Affiliation(s)
- David L. Masica
- Program in Molecular Biophysics The Johns Hopkins University, Baltimore MD
| | - Jeffrey J. Gray
- Program in Molecular Biophysics The Johns Hopkins University, Baltimore MD
- Dept. of Chemical and Biomolecular Engineering The Johns Hopkins University, Baltimore MD
| | | |
Collapse
|
14
|
Abstract
Around half of all protein structures solved nowadays using solution-state nuclear magnetic resonance (NMR) spectroscopy have been because of automated data analysis. The pervasiveness of computational approaches in general hides, however, a more nuanced view in which the full variety and richness of the field appears. This review is structured around a comparison of methods associated with three NMR observables: classical nuclear Overhauser effect (NOE) constraint gathering in contrast with more recent chemical shift and residual dipole coupling (RDC) based protocols. In each case, the emphasis is placed on the latest research, covering mainly the past 5 years. By describing both general concepts and representative programs, the objective is to map out a field in which--through the very profusion of approaches--it is all too easy to lose one's bearings.
Collapse
|
15
|
Wishart DS. Interpreting protein chemical shift data. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2011; 58:62-87. [PMID: 21241884 DOI: 10.1016/j.pnmrs.2010.07.004] [Citation(s) in RCA: 191] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/14/2010] [Accepted: 07/29/2010] [Indexed: 05/12/2023]
Affiliation(s)
- David S Wishart
- Department of Biological Sciences, National Institute for Nanotechnology (NINT), Edmonton, AB, Canada T6G 2E8.
| |
Collapse
|
16
|
Zhou Y, Duan Y, Yang Y, Faraggi E, Lei H. Trends in template/fragment-free protein structure prediction. Theor Chem Acc 2011; 128:3-16. [PMID: 21423322 PMCID: PMC3030773 DOI: 10.1007/s00214-010-0799-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2010] [Accepted: 08/15/2010] [Indexed: 12/13/2022]
Abstract
Predicting the structure of a protein from its amino acid sequence is a long-standing unsolved problem in computational biology. Its solution would be of both fundamental and practical importance as the gap between the number of known sequences and the number of experimentally solved structures widens rapidly. Currently, the most successful approaches are based on fragment/template reassembly. Lacking progress in template-free structure prediction calls for novel ideas and approaches. This article reviews trends in the development of physical and specific knowledge-based energy functions as well as sampling techniques for fragment-free structure prediction. Recent physical- and knowledge-based studies demonstrated that it is possible to sample and predict highly accurate protein structures without borrowing native fragments from known protein structures. These emerging approaches with fully flexible sampling have the potential to move the field forward.
Collapse
Affiliation(s)
- Yaoqi Zhou
- School of Informatics, Indiana Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indiana University Purdue University, 719 Indiana Ave #319, Walker Plaza Building, Indianapolis, IN 46202 USA
| | - Yong Duan
- UC Davis Genome Center and Department of Applied Science, University of California, One Shields Avenue, Davis, CA USA
- College of Physics, Huazhong University of Science and Technology, 1037 Luoyu Road, 430074 Wuhan, China
| | - Yuedong Yang
- School of Informatics, Indiana Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indiana University Purdue University, 719 Indiana Ave #319, Walker Plaza Building, Indianapolis, IN 46202 USA
| | - Eshel Faraggi
- School of Informatics, Indiana Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indiana University Purdue University, 719 Indiana Ave #319, Walker Plaza Building, Indianapolis, IN 46202 USA
| | - Hongxing Lei
- UC Davis Genome Center and Department of Applied Science, University of California, One Shields Avenue, Davis, CA USA
- Beijing Institute of Genomics, Chinese Academy of Sciences, 100029 Beijing, China
| |
Collapse
|
17
|
Masica DL, Ash JT, Ndao M, Drobny GP, Gray JJ. Toward a structure determination method for biomineral-associated protein using combined solid- state NMR and computational structure prediction. Structure 2010; 18:1678-87. [PMID: 21134646 PMCID: PMC3031250 DOI: 10.1016/j.str.2010.09.013] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2010] [Revised: 09/10/2010] [Accepted: 09/17/2010] [Indexed: 11/23/2022]
Abstract
Protein-biomineral interactions are paramount to materials production in biology, including the mineral phase of hard tissue. Unfortunately, the structure of biomineral-associated proteins cannot be determined by X-ray crystallography or solution nuclear magnetic resonance (NMR). Here we report a method for determining the structure of biomineral-associated proteins. The method combines solid-state NMR (ssNMR) and ssNMR-biased computational structure prediction. In addition, the algorithm is able to identify lattice geometries most compatible with ssNMR constraints, representing a quantitative, novel method for investigating crystal-face binding specificity. We use this method to determine most of the structure of human salivary statherin interacting with the mineral phase of tooth enamel. Computation and experiment converge on an ensemble of related structures and identify preferential binding at three crystal surfaces. The work represents a significant advance toward determining structure of biomineral-adsorbed protein using experimentally biased structure prediction. This method is generally applicable to proteins that can be chemically synthesized.
Collapse
Affiliation(s)
- David L. Masica
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Jason T. Ash
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Moise Ndao
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Gary P. Drobny
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Jeffrey J Gray
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD 21218, USA
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| |
Collapse
|
18
|
Shen Y, Bryan PN, He Y, Orban J, Baker D, Bax A. De novo structure generation using chemical shifts for proteins with high-sequence identity but different folds. Protein Sci 2010; 19:349-56. [PMID: 19998407 DOI: 10.1002/pro.303] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
Proteins with high-sequence identity but very different folds present a special challenge to sequence-based protein structure prediction methods. In particular, a 56-residue three-helical bundle protein (GA(95)) and an alpha/beta-fold protein (GB(95)), which share 95% sequence identity, were targets in the CASP-8 structure prediction contest. With only 12 out of 300 submitted server-CASP8 models for GA(95) exhibiting the correct fold, this protein proved particularly challenging despite its small size. Here, we demonstrate that the information contained in NMR chemical shifts can readily be exploited by the CS-Rosetta structure prediction program and yields adequate convergence, even when input chemical shifts are limited to just amide (1)H(N) and (15)N or (1)H(N) and (1)H(alpha) values.
Collapse
Affiliation(s)
- Yang Shen
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892-0520, USA
| | | | | | | | | | | |
Collapse
|
19
|
Faraggi E, Yang Y, Zhang S, Zhou Y. Predicting continuous local structure and the effect of its substitution for secondary structure in fragment-free protein structure prediction. Structure 2010; 17:1515-27. [PMID: 19913486 DOI: 10.1016/j.str.2009.09.006] [Citation(s) in RCA: 84] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2009] [Revised: 09/01/2009] [Accepted: 09/03/2009] [Indexed: 11/30/2022]
Abstract
Local structures predicted from protein sequences are used extensively in every aspect of modeling and prediction of protein structure and function. For more than 50 years, they have been predicted at a low-resolution coarse-grained level (e.g., three-state secondary structure). Here, we combine a two-state classifier with real-value predictor to predict local structure in continuous representation by backbone torsion angles. The accuracy of the angles predicted by this approach is close to that derived from NMR chemical shifts. Their substitution for predicted secondary structure as restraints for ab initio structure prediction doubles the success rate. This result demonstrates the potential of predicted local structure for fragment-free tertiary-structure prediction. It further implies potentially significant benefits from using predicted real-valued torsion angles as a replacement for or supplement to the secondary-structure prediction tools used almost exclusively in many computational methods ranging from sequence alignment to function prediction.
Collapse
Affiliation(s)
- Eshel Faraggi
- Indiana University School of Informatics, Indiana University-Purdue University and Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN 46202, USA
| | | | | | | |
Collapse
|
20
|
Abi-Ghanem J, Heddi B, Foloppe N, Hartmann B. DNA structures from phosphate chemical shifts. Nucleic Acids Res 2010; 38:e18. [PMID: 19942687 PMCID: PMC2817473 DOI: 10.1093/nar/gkp1061] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2009] [Revised: 10/14/2009] [Accepted: 11/01/2009] [Indexed: 01/04/2023] Open
Abstract
For B-DNA, the strong linear correlation observed by nuclear magnetic resonance (NMR) between the (31)P chemical shifts (deltaP) and three recurrent internucleotide distances demonstrates the tight coupling between phosphate motions and helicoidal parameters. It allows to translate deltaP into distance restraints directly exploitable in structural refinement. It even provides a new method for refining DNA oligomers with restraints exclusively inferred from deltaP. Combined with molecular dynamics in explicit solvent, these restraints lead to a structural and dynamical view of the DNA as detailed as that obtained with conventional and more extensive restraints. Tests with the Jun-Fos oligomer show that this deltaP-based strategy can provide a simple and straightforward method to capture DNA properties in solution, from routine NMR experiments on unlabeled samples.
Collapse
Affiliation(s)
- Joséphine Abi-Ghanem
- INTS, INSERM S-665, 6 rue Alexandre Cabanel, Paris 75015, IBPC, CNRS UPR 9080, 13 rue Pierre et Marie Curie, Paris 75005, France and 51 Natal Road, Cambridge CB1 3NY, UK
| | - Brahim Heddi
- INTS, INSERM S-665, 6 rue Alexandre Cabanel, Paris 75015, IBPC, CNRS UPR 9080, 13 rue Pierre et Marie Curie, Paris 75005, France and 51 Natal Road, Cambridge CB1 3NY, UK
| | - Nicolas Foloppe
- INTS, INSERM S-665, 6 rue Alexandre Cabanel, Paris 75015, IBPC, CNRS UPR 9080, 13 rue Pierre et Marie Curie, Paris 75005, France and 51 Natal Road, Cambridge CB1 3NY, UK
| | - Brigitte Hartmann
- INTS, INSERM S-665, 6 rue Alexandre Cabanel, Paris 75015, IBPC, CNRS UPR 9080, 13 rue Pierre et Marie Curie, Paris 75005, France and 51 Natal Road, Cambridge CB1 3NY, UK
| |
Collapse
|
21
|
Robustelli P, Cavalli A, Dobson CM, Vendruscolo M, Salvatella X. Folding of Small Proteins by Monte Carlo Simulations with Chemical Shift Restraints without the Use of Molecular Fragment Replacement or Structural Homology. J Phys Chem B 2009; 113:7890-6. [DOI: 10.1021/jp900780b] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Paul Robustelli
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K., and ICREA and Institute for Research in Biomedicine Barcelona, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| | - Andrea Cavalli
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K., and ICREA and Institute for Research in Biomedicine Barcelona, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| | - Christopher M. Dobson
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K., and ICREA and Institute for Research in Biomedicine Barcelona, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| | - Michele Vendruscolo
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K., and ICREA and Institute for Research in Biomedicine Barcelona, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| | - Xavier Salvatella
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K., and ICREA and Institute for Research in Biomedicine Barcelona, Baldiri Reixac 10-12, 08028 Barcelona, Spain
| |
Collapse
|
22
|
Bader R. Utilizing the charge field effect on amide (15)N chemical shifts for protein structure validation. J Phys Chem B 2009; 113:347-58. [PMID: 19118488 DOI: 10.1021/jp807362v] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Of all the nuclei in proteins, the nuclear magnetic resonance (NMR) chemical shifts of nitrogen are the theoretically least well understood. In this study, quantum chemical methods are used in combination with polarizable-continuum models in order to show that consideration of the effective electric field, including charge screening due to solvation, improves considerably the consistencies of statistical relationships between experimental and computed amide (15)N shifts between various sets of charged and uncharged oligopeptides and small organic molecules. A single conversion scheme between shielding parameters from first principles using density functional theory (DFT) and experimental shifts is derived that holds for all classes of compounds examined here. This relationship is then used to test the accuracy of such (15)N chemical shift predictions in the cyclic decapeptide antibiotic gramicidin S (GS). GS has previously been studied in great detail, both by NMR and X-ray crystallography. It adopts a well-defined backbone conformation, and hence, only a few discrete side chain conformational states need to be considered. Moreover, a charge-relay effect of the two cationic ornithine side chains to the protein backbone has been described earlier by NMR spectroscopy. Here, DFT-derived backbone amide nitrogen chemical shifts were calculated for multiple conformations of GS. Overall, the structural dynamics of GS is revisited in view of chemical shift behavior along with energetic considerations. Together, the study demonstrates proof of concept that (15)N chemical shift information is particularly useful in the analysis and validation of protein conformational states in a charged environment.
Collapse
Affiliation(s)
- Reto Bader
- Department of Physics, Stockholm University, Arrhenius Laboratories, 106 91 Stockholm, Sweden.
| |
Collapse
|
23
|
Shen Y, Vernon R, Baker D, Bax A. De novo protein structure generation from incomplete chemical shift assignments. JOURNAL OF BIOMOLECULAR NMR 2009; 43:63-78. [PMID: 19034676 PMCID: PMC2683404 DOI: 10.1007/s10858-008-9288-5] [Citation(s) in RCA: 195] [Impact Index Per Article: 12.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Received: 08/20/2008] [Accepted: 10/28/2008] [Indexed: 05/19/2023]
Abstract
NMR chemical shifts provide important local structural information for proteins. Consistent structure generation from NMR chemical shift data has recently become feasible for proteins with sizes of up to 130 residues, and such structures are of a quality comparable to those obtained with the standard NMR protocol. This study investigates the influence of the completeness of chemical shift assignments on structures generated from chemical shifts. The Chemical-Shift-Rosetta (CS-Rosetta) protocol was used for de novo protein structure generation with various degrees of completeness of the chemical shift assignment, simulated by omission of entries in the experimental chemical shift data previously used for the initial demonstration of the CS-Rosetta approach. In addition, a new CS-Rosetta protocol is described that improves robustness of the method for proteins with missing or erroneous NMR chemical shift input data. This strategy, which uses traditional Rosetta for pre-filtering of the fragment selection process, is demonstrated for two paramagnetic proteins and also for two proteins with solid-state NMR chemical shift assignments.
Collapse
Affiliation(s)
- Yang Shen
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, MD 20892-0520, USA
| | | | | | | |
Collapse
|
24
|
Huang A, Stultz CM. The effect of a DeltaK280 mutation on the unfolded state of a microtubule-binding repeat in Tau. PLoS Comput Biol 2008; 4:e1000155. [PMID: 18725924 PMCID: PMC2494868 DOI: 10.1371/journal.pcbi.1000155] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2008] [Accepted: 07/10/2008] [Indexed: 12/14/2022] Open
Abstract
Tau is a natively unfolded protein that forms intracellular aggregates in the brains of patients with Alzheimer's disease. To decipher the mechanism underlying the formation of tau aggregates, we developed a novel approach for constructing models of natively unfolded proteins. The method, energy-minima mapping and weighting (EMW), samples local energy minima of subsequences within a natively unfolded protein and then constructs ensembles from these energetically favorable conformations that are consistent with a given set of experimental data. A unique feature of the method is that it does not strive to generate a single ensemble that represents the unfolded state. Instead we construct a number of candidate ensembles, each of which agrees with a given set of experimental constraints, and focus our analysis on local structural features that are present in all of the independently generated ensembles. Using EMW we generated ensembles that are consistent with chemical shift measurements obtained on tau constructs. Thirty models were constructed for the second microtubule binding repeat (MTBR2) in wild-type (WT) tau and a DeltaK280 mutant, which is found in some forms of frontotemporal dementia. By focusing on structural features that are preserved across all ensembles, we find that the aggregation-initiating sequence, PHF6*, prefers an extended conformation in both the WT and DeltaK280 sequences. In addition, we find that residue K280 can adopt a loop/turn conformation in WT MTBR2 and that deletion of this residue, which can adopt nonextended states, leads to an increase in locally extended conformations near the C-terminus of PHF6*. As an increased preference for extended states near the C-terminus of PHF6* may facilitate the propagation of beta-structure downstream from PHF6*, these results explain how a deletion at position 280 can promote the formation of tau aggregates.
Collapse
Affiliation(s)
- Austin Huang
- Harvard–MIT Division of Health Sciences and Technology, Department of Electrical Engineering and Computer Science, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Collin M. Stultz
- Harvard–MIT Division of Health Sciences and Technology, Department of Electrical Engineering and Computer Science, Research Laboratory of Electronics, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
- * E-mail:
| |
Collapse
|
25
|
Wishart DS, Arndt D, Berjanskii M, Tang P, Zhou J, Lin G. CS23D: a web server for rapid protein structure generation using NMR chemical shifts and sequence data. Nucleic Acids Res 2008; 36:W496-502. [PMID: 18515350 PMCID: PMC2447725 DOI: 10.1093/nar/gkn305] [Citation(s) in RCA: 169] [Impact Index Per Article: 9.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
CS23D (chemical shift to 3D structure) is a web server for rapidly generating accurate 3D protein structures using only assigned nuclear magnetic resonance (NMR) chemical shifts and sequence data as input. Unlike conventional NMR methods, CS23D requires no NOE and/or J-coupling data to perform its calculations. CS23D accepts chemical shift files in either SHIFTY or BMRB formats, and produces a set of PDB coordinates for the protein in about 10-15 min. CS23D uses a pipeline of several preexisting programs or servers to calculate the actual protein structure. Depending on the sequence similarity (or lack thereof) CS23D uses either (i) maximal subfragment assembly (a form of homology modeling), (ii) chemical shift threading or (iii) shift-aided de novo structure prediction (via Rosetta) followed by chemical shift refinement to generate and/or refine protein coordinates. Tests conducted on more than 100 proteins from the BioMagResBank indicate that CS23D converges (i.e. finds a solution) for >95% of protein queries. These chemical shift generated structures were found to be within 0.2-2.8 A RMSD of the NMR structure generated using conventional NOE-base NMR methods or conventional X-ray methods. The performance of CS23D is dependent on the completeness of the chemical shift assignments and the similarity of the query protein to known 3D folds. CS23D is accessible at http://www.cs23d.ca.
Collapse
Affiliation(s)
- David S Wishart
- Department of Computing Science, Department of Biological Sciences, University of Alberta and National Research Council, National Institute for Nanotechnology (NINT), Edmonton, AB, Canada T6G 2E8
| | | | | | | | | | | |
Collapse
|
26
|
Consistent blind protein structure generation from NMR chemical shift data. Proc Natl Acad Sci U S A 2008; 105:4685-90. [PMID: 18326625 DOI: 10.1073/pnas.0800256105] [Citation(s) in RCA: 669] [Impact Index Per Article: 39.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Protein NMR chemical shifts are highly sensitive to local structure. A robust protocol is described that exploits this relation for de novo protein structure generation, using as input experimental parameters the (13)C(alpha), (13)C(beta), (13)C', (15)N, (1)H(alpha) and (1)H(N) NMR chemical shifts. These shifts are generally available at the early stage of the traditional NMR structure determination process, before the collection and analysis of structural restraints. The chemical shift based structure determination protocol uses an empirically optimized procedure to select protein fragments from the Protein Data Bank, in conjunction with the standard ROSETTA Monte Carlo assembly and relaxation methods. Evaluation of 16 proteins, varying in size from 56 to 129 residues, yielded full-atom models that have 0.7-1.8 A root mean square deviations for the backbone atoms relative to the experimentally determined x-ray or NMR structures. The strategy also has been successfully applied in a blind manner to nine protein targets with molecular masses up to 15.4 kDa, whose conventional NMR structure determination was conducted in parallel by the Northeast Structural Genomics Consortium. This protocol potentially provides a new direction for high-throughput NMR structure determination.
Collapse
|
27
|
Assessing the solvent-dependent surface area of unfolded proteins using an ensemble model. Proc Natl Acad Sci U S A 2008; 105:3321-6. [PMID: 18305164 DOI: 10.1073/pnas.0712240105] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We present a physically rigorous method to calculate solvent-dependent accessible surface areas (ASAs) of amino acid residues in unfolded proteins. ASA values will be larger in a good solvent, where solute-solvent interactions dominate and promote chain extension. Conversely, they will be smaller in a poor solvent, where solute-solute interactions dominate and promote chain collapse. In the method described here, these solvent-dependent effects are modeled by Boltzmann-weighting a simulated ensemble for solvent quality-good or poor. Solvent quality is parameterized as intramolecular hydrogen bond strength, using a "hydrogen bond dial" that can be varied from "off" to "high" (i.e., from 0 to -6 kcal/mol per hydrogen bond). When plotted as a function of hydrogen bond strength, the Boltzmann-weighted distribution of conformers describes a sigmoidal curve, with a transition midpoint near 1.5 kcal/mol per hydrogen bond. ASA tables for the 20 residues are provided under good solvent conditions and at this transition midpoint. For the backbone, these midpoint ASA values are found to be in good agreement with the earlier estimate of unfolded state ASA given by the mean of Creamer's upper and lower bounds [Creamer TP, et al. (1997) Biochemistry 36:2832-2835], a gratifying result in that cosolvents of experimental interest, such as urea (good solvent) and trimethylamine N-oxide (poor solvent), are known to affect the backbone predominantly. Unanticipated results from our simulations predict that a significant population of three-residue, hydrogen-bonded turns (inverse gamma-turns) will be detectable in blocked polyalanyl heptamers in poor solvent-an experimentally verifiable conjecture.
Collapse
|