1
|
Hassan M, Coutsias EA. Kinematic Reconstruction of Cyclic Peptides and Protein Backbones from Partial Data. J Chem Inf Model 2021; 61:4975-5000. [PMID: 34570494 PMCID: PMC10129052 DOI: 10.1021/acs.jcim.1c00453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We present an algorithm, QBKR (Quaternary Backbone Kinematic Reconstruction), a fast analytical method for an all-atom backbone reconstruction of proteins and linear or cyclic peptide chains from Cα coordinate traces. Unlike previous analytical methods for deriving all-atom representations from coarse-grained models that rely on canonical geometry with planar peptides in the trans conformation, our de novo kinematic model incorporates noncanonical, cis-trans, geometry naturally. Perturbations to this geometry can be effected with ease in our formulation, for example, to account for a continuous change from cis to trans geometry. A simple optimization of a spring-based objective function is employed for Cα-Cα distance variations that extend beyond the cis-trans limit. The kinematic construction produces a linked chain of peptide units, Cα-C-N-Cα, hinged at the Cα atoms spanning all possible planar and nonplanar peptide conformations. We have combined our method with a ring closure algorithm for the case of ring peptides and missing loops in a protein structure. Here, the reconstruction proceeding from both the N and C termini of the protein backbone (or in both directions from a starting position for rings) requires freedom in the position of one Cα atom (a capstone) to achieve a successful loop or ring closure. A salient feature of our reconstruction method is the ability to enrich conformational ensembles to produce alternative feasible conformations in which H-bond forming C-O or N-H pairs in the backbone can reverse orientations, thus addressing a well-known shortcoming in Cα-based RMSD structure comparison, wherein very close structures may lead to significantly different overall H-bond behavior. We apply the fixed Cα-based design to the reverse reconstruction from noisy Cryo-EM data, a posteriori to the optimization. Our method can be applied to speed up the process of an all-atom description from voluminous experimental data or subpar electron density maps.
Collapse
Affiliation(s)
- Mosavverul Hassan
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States
| | - Evangelos A Coutsias
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, New York 11794, United States.,Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, New York 11794-5252, United States
| |
Collapse
|
2
|
Hatti KS, McCoy AJ, Oeffner RD, Sammito MD, Read RJ. Factors influencing estimates of coordinate error for molecular replacement. Acta Crystallogr D Struct Biol 2020; 76:19-27. [PMID: 31909740 PMCID: PMC6939440 DOI: 10.1107/s2059798319015730] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2019] [Accepted: 11/21/2019] [Indexed: 11/24/2022] Open
Abstract
Good prior estimates of the effective root-mean-square deviation (r.m.s.d.) between the atomic coordinates of the model and the target optimize the signal in molecular replacement, thereby increasing the success rate in difficult cases. Previous studies using protein structures solved by X-ray crystallography as models showed that optimal error estimates (refined after structure solution) were correlated with the sequence identity between the model and target, and with the number of residues in the model. Here, this work has been extended to find additional correlations between parameters of the model and the target and hence improved prior estimates of the coordinate error. Using a graph database, a curated set of 6030 molecular-replacement calculations using models that had been solved by X-ray crystallography was analysed to consider about 120 model and target parameters. Improved estimates were achieved by replacing the sequence identity with the Gonnet score for sequence similarity, as well as by considering the resolution of the target structure and the MolProbity score of the model. This approach was extended by analysing 12 610 additional molecular-replacement calculations where the model was determined by NMR. The median r.m.s.d. between pairs of models in an ensemble was found to be correlated with the estimated r.m.s.d. to the target. For models solved by NMR, the overall coordinate error estimates were larger than for structures determined by X-ray crystallography, and were more highly correlated with the number of residues.
Collapse
Affiliation(s)
- Kaushik S. Hatti
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, England
| | - Airlie J. McCoy
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, England
| | - Robert D. Oeffner
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, England
| | - Massimo D. Sammito
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, England
| | - Randy J. Read
- Department of Haematology, Cambridge Institute for Medical Research, University of Cambridge, Cambridge, England
| |
Collapse
|
3
|
Huang YJ, Brock KP, Ishida Y, Swapna GVT, Inouye M, Marks DS, Sander C, Montelione GT. Combining Evolutionary Covariance and NMR Data for Protein Structure Determination. Methods Enzymol 2018; 614:363-392. [PMID: 30611430 PMCID: PMC6640129 DOI: 10.1016/bs.mie.2018.11.004] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Accurate protein structure determination by solution-state NMR is challenging for proteins greater than about 20kDa, for which extensive perdeuteration is generally required, providing experimental data that are incomplete (sparse) and ambiguous. However, the massive increase in evolutionary sequence information coupled with advances in methods for sequence covariance analysis can provide reliable residue-residue contact information for a protein from sequence data alone. These "evolutionary couplings (ECs)" can be combined with sparse NMR data to determine accurate 3D protein structures. This hybrid "EC-NMR" method has been developed using NMR data for several soluble proteins and validated by comparison with corresponding reference structures determined by X-ray crystallography and/or conventional NMR methods. For small proteins, only backbone resonance assignments are utilized, while for larger proteins both backbone and some sidechain methyl resonance assignments are generally required. ECs can be combined with sparse NMR data obtained on deuterated, selectively protonated protein samples to provide structures that are more accurate and complete than those obtained using such sparse NMR data alone. EC-NMR also has significant potential for analysis of protein structures from solid-state NMR data and for studies of integral membrane proteins. The requirement that ECs are consistent with NMR data recorded on a specific member of a protein family, under specific conditions, also allows identification of ECs that reflect alternative allosteric or excited states of the protein structure.
Collapse
Affiliation(s)
- Yuanpeng Janet Huang
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, United States
| | - Kelly P Brock
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States
| | - Yojiro Ishida
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ, United States
| | - Gurla V T Swapna
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, United States
| | - Masayori Inouye
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ, United States
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, United States
| | - Chris Sander
- Department of Cell Biology, Harvard Medical School and cBio Center, Dana-Farber Cancer Institute, Boston, MA, United States
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, United States; Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ, United States.
| |
Collapse
|
4
|
Chen X, Smelter A, Moseley HNB. Automatic 13C chemical shift reference correction for unassigned protein NMR spectra. JOURNAL OF BIOMOLECULAR NMR 2018; 72:11-28. [PMID: 30097912 PMCID: PMC6209040 DOI: 10.1007/s10858-018-0202-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 08/01/2018] [Indexed: 05/09/2023]
Abstract
Poor chemical shift referencing, especially for 13C in protein Nuclear Magnetic Resonance (NMR) experiments, fundamentally limits and even prevents effective study of biomacromolecules via NMR, including protein structure determination and analysis of protein dynamics. To solve this problem, we constructed a Bayesian probabilistic framework that circumvents the limitations of previous reference correction methods that required protein resonance assignment and/or three-dimensional protein structure. Our algorithm named Bayesian Model Optimized Reference Correction (BaMORC) can detect and correct 13C chemical shift referencing errors before the protein resonance assignment step of analysis and without three-dimensional structure. By combining the BaMORC methodology with a new intra-peaklist grouping algorithm, we created a combined method called Unassigned BaMORC that utilizes only unassigned experimental peak lists and the amino acid sequence. Unassigned BaMORC kept all experimental three-dimensional HN(CO)CACB-type peak lists tested within ± 0.4 ppm of the correct 13C reference value. On a much larger unassigned chemical shift test set, the base method kept 13C chemical shift referencing errors to within ± 0.45 ppm at a 90% confidence interval. With chemical shift assignments, Assigned BaMORC can detect and correct 13C chemical shift referencing errors to within ± 0.22 at a 90% confidence interval. Therefore, Unassigned BaMORC can correct 13C chemical shift referencing errors when it will have the most impact, right before protein resonance assignment and other downstream analyses are started. After assignment, chemical shift reference correction can be further refined with Assigned BaMORC. These new methods will allow non-NMR experts to detect and correct 13C referencing error at critical early data analysis steps, lowering the bar of NMR expertise required for effective protein NMR analysis.
Collapse
Affiliation(s)
- Xi Chen
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY, 40356, USA
- Department of Statistics, University of Kentucky, Lexington, KY, 40356, USA
| | - Andrey Smelter
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY, 40356, USA
- Markey Cancer Center, University of Kentucky, Lexington, KY, 40356, USA
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, KY, 40356, USA
| | - Hunter N B Moseley
- Department of Molecular and Cellular Biochemistry, University of Kentucky, Lexington, KY, 40356, USA.
- Department of Statistics, University of Kentucky, Lexington, KY, 40356, USA.
- Markey Cancer Center, University of Kentucky, Lexington, KY, 40356, USA.
- Center for Environmental and Systems Biochemistry, University of Kentucky, Lexington, KY, 40356, USA.
- Institute for Biomedical Informatics, University of Kentucky, Lexington, KY, 40356, USA.
| |
Collapse
|
5
|
Huang YJ, Brock KP, Sander C, Marks DS, Montelione GT. A Hybrid Approach for Protein Structure Determination Combining Sparse NMR with Evolutionary Coupling Sequence Data. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1105:153-169. [PMID: 30617828 DOI: 10.1007/978-981-13-2200-6_10] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/04/2022]
Abstract
While 3D structure determination of small (<15 kDa) proteins by solution NMR is largely automated and routine, structural analysis of larger proteins is more challenging. An emerging hybrid strategy for modeling protein structures combines sparse NMR data that can be obtained for larger proteins with sequence co-variation data, called evolutionary couplings (ECs), obtained from multiple sequence alignments of protein families. This hybrid "EC-NMR" method can be used to accurately model larger (15-60 kDa) proteins, and more rapidly determine structures of smaller (5-15 kDa) proteins using only backbone NMR data. The resulting structures have accuracies relative to reference structures comparable to those obtained with full backbone and sidechain NMR resonance assignments. The requirement that evolutionary couplings (ECs) are consistent with NMR data recorded on a specific member of a protein family, under specific conditions, potentially also allows identification of ECs that reflect alternative allosteric or excited states of the protein structure.
Collapse
Affiliation(s)
- Yuanpeng Janet Huang
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Kelly P Brock
- cBio Center, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Chris Sander
- Department of Cell Biology, Harvard Medical School, Boston, MA, USA
- cBio Center, Dana-Farber Cancer Institute, Boston, MA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ, USA.
| |
Collapse
|
6
|
Moult J, Fidelis K, Kryshtafovych A, Schwede T, Tramontano A. Critical assessment of methods of protein structure prediction: Progress and new directions in round XI. Proteins 2016; 84 Suppl 1:4-14. [PMID: 27171127 DOI: 10.1002/prot.25064] [Citation(s) in RCA: 149] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2016] [Revised: 04/29/2016] [Accepted: 05/08/2016] [Indexed: 12/15/2022]
Abstract
Modeling of protein structure from amino acid sequence now plays a major role in structural biology. Here we report new developments and progress from the CASP11 community experiment, assessing the state of the art in structure modeling. Notable points include the following: (1) New methods for predicting three dimensional contacts resulted in a few spectacular template free models in this CASP, whereas models based on sequence homology to proteins with experimental structure continue to be the most accurate. (2) Refinement of initial protein models, primarily using molecular dynamics related approaches, has now advanced to the point where the best methods can consistently (though slightly) improve nearly all models. (3) The use of relatively sparse NMR constraints dramatically improves the accuracy of models, and another type of sparse data, chemical crosslinking, introduced in this CASP, also shows promise for producing better models. (4) A new emphasis on modeling protein complexes, in collaboration with CAPRI, has produced interesting results, but also shows the need for more focus on this area. (5) Methods for estimating the accuracy of models have advanced to the point where they are of considerable practical use. (6) A first assessment demonstrates that models can sometimes successfully address biological questions that motivate experimental structure determination. (7) There is continuing progress in accuracy of modeling regions of structure not directly available by comparative modeling, while there is marginal or no progress in some other areas. Proteins 2016; 84(Suppl 1):4-14. © 2016 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- John Moult
- Institute for Bioscience and Biotechnology Research and Department of Cell Biology and Molecular Genetics, University of Maryland, Rockville, Maryland, 20850.
| | - Krzysztof Fidelis
- Genome Center, University of California, Davis, Davis, California, 95616
| | | | - Torsten Schwede
- Biozentrum & SIB Swiss Institute of Bioinformatics, University of Basel, Basel, Switzerland
| | - Anna Tramontano
- Department of Physics and Istituto Pasteur - Fondazione Cenci Bolognetti, Sapienza University of Rome, Rome, Italy
| |
Collapse
|
7
|
Everett JK, Tejero R, Murthy SBK, Acton TB, Aramini JM, Baran MC, Benach J, Cort JR, Eletsky A, Forouhar F, Guan R, Kuzin AP, Lee HW, Liu G, Mani R, Mao B, Mills JL, Montelione AF, Pederson K, Powers R, Ramelot T, Rossi P, Seetharaman J, Snyder D, Swapna GVT, Vorobiev SM, Wu Y, Xiao R, Yang Y, Arrowsmith CH, Hunt JF, Kennedy MA, Prestegard JH, Szyperski T, Tong L, Montelione GT. A community resource of experimental data for NMR / X-ray crystal structure pairs. Protein Sci 2015; 25:30-45. [PMID: 26293815 DOI: 10.1002/pro.2774] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2015] [Accepted: 08/17/2015] [Indexed: 12/11/2022]
Abstract
We have developed an online NMR / X-ray Structure Pair Data Repository. The NIGMS Protein Structure Initiative (PSI) has provided many valuable reagents, 3D structures, and technologies for structural biology. The Northeast Structural Genomics Consortium was one of several PSI centers. NESG used both X-ray crystallography and NMR spectroscopy for protein structure determination. A key goal of the PSI was to provide experimental structures for at least one representative of each of hundreds of targeted protein domain families. In some cases, structures for identical (or nearly identical) constructs were determined by both NMR and X-ray crystallography. NMR spectroscopy and X-ray diffraction data for 41 of these "NMR / X-ray" structure pairs determined using conventional triple-resonance NMR methods with extensive sidechain resonance assignments have been organized in an online NMR / X-ray Structure Pair Data Repository. In addition, several NMR data sets for perdeuterated, methyl-protonated protein samples are included in this repository. As an example of the utility of this repository, these data were used to revisit questions about the precision and accuracy of protein NMR structures first outlined by Levy and coworkers several years ago (Andrec et al., Proteins 2007;69:449-465). These results demonstrate that the agreement between NMR and X-ray crystal structures is improved using modern methods of protein NMR spectroscopy. The NMR / X-ray Structure Pair Data Repository will provide a valuable resource for new computational NMR methods development.
Collapse
Affiliation(s)
- John K Everett
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Roberto Tejero
- Departamento De Química Física, Universidad De Valencia, Valencia, Spain
| | - Sarath B K Murthy
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Thomas B Acton
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - James M Aramini
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Michael C Baran
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Jordi Benach
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - John R Cort
- Fundamental and Computational Sciences Directorate, Pacific Northwest National Laboratory, Richland, Washington, 99354, USA
| | - Alexander Eletsky
- Department of Chemistry, The State University of New York at Buffalo, and Northeast Structural Genomics Consortium, Buffalo, New York, 14260, USA
| | - Farhad Forouhar
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - Rongjin Guan
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Alexandre P Kuzin
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - Hsiau-Wei Lee
- Complex Carbohydrate Research Center and Northeast Structural Genomics Consortium, University of Georgia, Athens, Georgia, 30602, USA
| | - Gaohua Liu
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Rajeswari Mani
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Binchen Mao
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Jeffrey L Mills
- Department of Chemistry, The State University of New York at Buffalo, and Northeast Structural Genomics Consortium, Buffalo, New York, 14260, USA
| | - Alexander F Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Kari Pederson
- Complex Carbohydrate Research Center and Northeast Structural Genomics Consortium, University of Georgia, Athens, Georgia, 30602, USA
| | - Robert Powers
- Department of Chemistry, University of Nebraska-Lincoln, Lincoln, Nebraska, 68588, USA
| | - Theresa Ramelot
- Department of Chemistry and Biochemistry, Northeast Structural Genomics Consortium, Miami University, Oxford, Ohio, 45056, USA
| | - Paolo Rossi
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Jayaraman Seetharaman
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - David Snyder
- Department of Chemistry, College of Science and Health, William Paterson University of NJ, Wayne, New Jersey, 07470, USA
| | - G V T Swapna
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Sergey M Vorobiev
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - Yibing Wu
- Department of Chemistry, The State University of New York at Buffalo, and Northeast Structural Genomics Consortium, Buffalo, New York, 14260, USA
| | - Rong Xiao
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| | - Yunhuang Yang
- Department of Chemistry and Biochemistry, Northeast Structural Genomics Consortium, Miami University, Oxford, Ohio, 45056, USA
| | - Cheryl H Arrowsmith
- Cancer Genomics & Proteomics, Department of Medical Biophysics, Ontario Cancer Institute, and Northeast Structural Genomics Consortium, University of Toronto, Toronto, Ontario, M5G 1L7, Canada
| | - John F Hunt
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - Michael A Kennedy
- Department of Chemistry and Biochemistry, Northeast Structural Genomics Consortium, Miami University, Oxford, Ohio, 45056, USA
| | - James H Prestegard
- Complex Carbohydrate Research Center and Northeast Structural Genomics Consortium, University of Georgia, Athens, Georgia, 30602, USA
| | - Thomas Szyperski
- Department of Chemistry, The State University of New York at Buffalo, and Northeast Structural Genomics Consortium, Buffalo, New York, 14260, USA
| | - Liang Tong
- Department of Biological Sciences and Northeast Structural Genomics Consortium, Columbia University, New York, NY, 10027, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, and Northeast Structural Genomics Consortium, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA.,Department of Biochemistry, Robert Wood Johnson Medical School, Rutgers, the State University of New Jersey, Piscataway, New Jersey, 08854, USA
| |
Collapse
|
8
|
Tang Y, Huang YJ, Hopf TA, Sander C, Marks DS, Montelione GT. Protein structure determination by combining sparse NMR data with evolutionary couplings. Nat Methods 2015; 12:751-4. [PMID: 26121406 PMCID: PMC4521990 DOI: 10.1038/nmeth.3455] [Citation(s) in RCA: 54] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/06/2015] [Accepted: 05/26/2015] [Indexed: 11/13/2022]
Abstract
Accurate protein structure determination by NMR is challenging for larger proteins, for which experimental data is often incomplete and ambiguous. Fortunately, the upsurge in evolutionary sequence information and advances in maximum entropy statistical methods now provide a rich complementary source of structural constraints. We have developed a hybrid approach (EC-NMR) combining sparse NMR data with evolutionary residue-residue couplings, and demonstrate accurate structure determination for several 6 to 41 kDa proteins.
Collapse
Affiliation(s)
- Yuefeng Tang
- 1] Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA. [2] Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Yuanpeng Janet Huang
- 1] Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA. [2] Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| | - Thomas A Hopf
- 1] Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA. [2] Department of Informatics, Technische Universität München, Garching, Germany
| | - Chris Sander
- Computational Biology Center, Memorial Sloan Kettering Cancer Center, New York, New York, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, Massachusetts, USA
| | - Gaetano T Montelione
- 1] Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA. [2] Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA. [3] Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, New Jersey, USA
| |
Collapse
|
9
|
Shrestha R, Zhang KYJ. A fragmentation and reassembly method for ab initio phasing. ACTA ACUST UNITED AC 2015; 71:304-12. [PMID: 25664740 DOI: 10.1107/s1399004714025449] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2014] [Accepted: 11/20/2014] [Indexed: 11/10/2022]
Abstract
Ab initio phasing with de novo models has become a viable approach for structural solution from protein crystallographic diffraction data. This approach takes advantage of the known protein sequence information, predicts de novo models and uses them for structure determination by molecular replacement. However, even the current state-of-the-art de novo modelling method has a limit as to the accuracy of the model predicted, which is sometimes insufficient to be used as a template for successful molecular replacement. A fragment-assembly phasing method has been developed that starts from an ensemble of low-accuracy de novo models, disassembles them into fragments, places them independently in the crystallographic unit cell by molecular replacement and then reassembles them into a whole structure that can provide sufficient phase information to enable complete structure determination by automated model building. Tests on ten protein targets showed that the method could solve structures for eight of these targets, although the predicted de novo models cannot be used as templates for successful molecular replacement since the best model for each target is on average more than 4.0 Å away from the native structure. The method has extended the applicability of the ab initio phasing by de novo models approach. The method can be used to solve structures when the best de novo models are still of low accuracy.
Collapse
Affiliation(s)
- Rojan Shrestha
- Structural Bioinformatics Team, Division of Structural and Synthetic Biology, Center for Life Science Technologies, RIKEN, Yokohama, Kanagawa 230-0045, Japan
| | - Kam Y J Zhang
- Structural Bioinformatics Team, Division of Structural and Synthetic Biology, Center for Life Science Technologies, RIKEN, Yokohama, Kanagawa 230-0045, Japan
| |
Collapse
|
10
|
Vranken WF, Vuister GW, Bonvin AMJJ. NMR-based modeling and refinement of protein 3D structures. Methods Mol Biol 2015; 1215:351-380. [PMID: 25330971 DOI: 10.1007/978-1-4939-1465-4_16] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
NMR is a well-established method to characterize the structure and dynamics of biomolecules in solution. High-quality structures can now be produced thanks to both experimental advances and computational developments that incorporate new NMR parameters and improved protocols and force fields in the structure calculation and refinement process. In this chapter, we give a short overview of the various types of NMR data that can provide structural information, and then focus on the structure calculation methodology itself. We discuss and illustrate with tutorial examples "classical" structure calculation, refinement, and structure validation approaches.
Collapse
Affiliation(s)
- Wim F Vranken
- Department of Structural Biology, VIB Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, B-1050, Brussels, Belgium
| | | | | |
Collapse
|
11
|
Vranken WF. NMR structure validation in relation to dynamics and structure determination. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2014; 82:27-38. [PMID: 25444697 DOI: 10.1016/j.pnmrs.2014.08.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/01/2014] [Revised: 08/14/2014] [Accepted: 08/14/2014] [Indexed: 06/04/2023]
Abstract
NMR spectroscopy is a key technique for understanding the behaviour of proteins, especially highly dynamic proteins that adopt multiple conformations in solution. Overall, protein structures determined from NMR spectroscopy data constitute just over 10% of the Protein Data Bank archive. This review covers the validation of these NMR protein structures, but rather than describing currently available methodology, it focuses on concepts that are important for understanding where and how validation is most relevant. First, the inherent characteristics of the protein under study have an influence on quality and quantity of the distinct types of data that can be acquired from NMR experiments. Second, these NMR data are necessarily transformed into a model for use in a structure calculation protocol, and the protein structures that result from this reflect the types of NMR data used as well as the protein characteristics. The validation of NMR protein structures should therefore take account, wherever possible, of the inherent behavioural characteristics of the protein, the types of available NMR data, and the calculation protocol. These concepts are discussed in the context of 'knowledge based' and 'model versus data' validation, with suggestions for questions to ask and different validation categories to consider. The principal aim of this review is to stimulate discussion and to help the reader understand the relationships between the above elements in order to make informed decisions on which validation approaches are the most relevant in particular cases.
Collapse
Affiliation(s)
- Wim F Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel, Pleinlaan 2, 1050 Brussels, Belgium; Department of Structural Biology, VIB, 1050 Brussels, Belgium; Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, La Plaine Campus, Triomflaan, BC Building, 6th Floor, CP 263, 1050 Brussels, Belgium.
| |
Collapse
|
12
|
Snyder DA, Grullon J, Huang YJ, Tejero R, Montelione GT. The expanded FindCore method for identification of a core atom set for assessment of protein structure prediction. Proteins 2014; 82 Suppl 2:219-30. [PMID: 24327305 DOI: 10.1002/prot.24490] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2013] [Revised: 11/14/2013] [Accepted: 11/19/2013] [Indexed: 11/09/2022]
Abstract
Maximizing the scientific impact of NMR-based structure determination requires robust and statistically sound methods for assessing the precision of NMR-derived structures. In particular, a method to define a core atom set for calculating superimpositions and validating structure predictions is critical to the use of NMR-derived structures as targets in the CASP competition. FindCore (Snyder and Montelione, Proteins 2005;59:673-686) is a superimposition independent method for identifying a core atom set and partitioning that set into domains. However, as FindCore optimizes superimposition by sensitively excluding not-well-defined atoms, the FindCore core may not comprise all atoms suitable for use in certain applications of NMR structures, including the CASP assessment process. Adapting the FindCore approach to assess predicted models against experimental NMR structures in CASP10 required modification of the FindCore method. This paper describes conventions and a standard protocol to calculate an "Expanded FindCore" atom set suitable for validation and application in biological and biophysical contexts. A key application of the Expanded FindCore method is to identify a core set of atoms in the experimental NMR structure for which it makes sense to validate predicted protein structure models. We demonstrate the application of this Expanded FindCore method in characterizing well-defined regions of 18 NMR-derived CASP10 target structures. The Expanded FindCore protocol defines "expanded core atom sets" that match an expert's intuition of which parts of the structure are sufficiently well defined to use in assessing CASP model predictions. We also illustrate the impact of this analysis on the CASP GDT assessment scores.
Collapse
Affiliation(s)
- David A Snyder
- Department of Chemistry, William Paterson University, Wayne, New Jersey, 07470
| | | | | | | | | |
Collapse
|
13
|
Lange OF. Automatic NOESY assignment in CS-RASREC-Rosetta. JOURNAL OF BIOMOLECULAR NMR 2014; 59:147-159. [PMID: 24831340 DOI: 10.1007/s10858-014-9833-3] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/08/2014] [Accepted: 04/19/2014] [Indexed: 06/03/2023]
Abstract
We have developed an approach for simultaneous structure calculation and automatic Nuclear Overhauser Effect (NOE) assignment to solve nuclear magnetic resonance (NMR) structures from unassigned NOESY data. The approach, autoNOE-Rosetta, integrates Resolution Adapted Structural RECombination (RASREC) Rosetta NMR calculations with algorithms for automatic NOE assignment. The method was applied to two proteins in the 15-20 kDa size range for which both, NMR and X-ray data, is available. The autoNOE-Rosetta calculations converge for both proteins and yield accurate structures with an RMSD of 1.9 Å to the X-ray reference structures. The method greatly expands the radius of convergence for automatic NOE assignment, and should be broadly useful for NMR structure determination.
Collapse
Affiliation(s)
- Oliver F Lange
- Biomolecular NMR and Munich Center for Integrated Protein Science, Department Chemie, Technische Universität München, Lichtenbergstrasse 4, 85747, Garching, Germany,
| |
Collapse
|
14
|
Zhang Z, Porter J, Tripsianes K, Lange OF. Robust and highly accurate automatic NOESY assignment and structure determination with Rosetta. JOURNAL OF BIOMOLECULAR NMR 2014; 59:135-45. [PMID: 24845473 DOI: 10.1007/s10858-014-9832-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2014] [Accepted: 04/19/2014] [Indexed: 05/16/2023]
Abstract
We have developed a novel and robust approach for automatic and unsupervised simultaneous nuclear Overhauser effect (NOE) assignment and structure determination within the CS-Rosetta framework. Starting from unassigned peak lists and chemical shift assignments, autoNOE-Rosetta determines NOE cross-peak assignments and generates structural models. The approach tolerates incomplete and raw NOE peak lists as well as incomplete or partially incorrect chemical shift assignments, and its performance has been tested on 50 protein targets ranging from 50 to 200 residues in size. We find a significantly improved performance compared to established programs, particularly for larger proteins and for NOE data obtained on perdeuterated protein samples. X-ray crystallographic structures allowed comparison of Rosetta and conventional, PDB-deposited, NMR models in 20 of 50 test cases. The unsupervised autoNOE-Rosetta models were often of significantly higher accuracy than the corresponding expert-supervised NMR models deposited in the PDB. We also tested the method with unrefined peak lists and found that performance was nearly as good as for refined peak lists. Finally, demonstrating our method's remarkable robustness against problematic input data, we provided correct models for an incorrect PDB-deposited NMR solution structure.
Collapse
Affiliation(s)
- Zaiyong Zhang
- Department Chemie, Biomolecular NMR and Munich Center for Integrated Protein Science, Technische Universität München, Lichtenbergstrasse 4, 85747, Garching, Germany
| | | | | | | |
Collapse
|
15
|
Zhang W, Zhang T, Zhang H, Hao Q. Crystallographic phasing with NMR models: an envelope approach. ACTA ACUST UNITED AC 2014; 70:1977-82. [DOI: 10.1107/s1399004714009754] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2013] [Accepted: 04/30/2014] [Indexed: 11/10/2022]
Abstract
X-ray crystallography and NMR are complementary tools in structural biology. However, it is often difficult to use NMR structures as search models in molecular replacement (MR) to phase crystallographic data. In this study, a new approach is reported utilizing a molecular envelope of NMR structures for MR phasing with the programFSEARCHat low resolution (about 6 Å). Several targets with both crystallographic and NMR structures available have been tested.FSEARCHwas able to find the correct translation and orientation of the search model in the crystallographic unit cell, while conventional MR procedures were unsuccessful.
Collapse
|
16
|
Vuister GW, Fogh RH, Hendrickx PMS, Doreleijers JF, Gutmanas A. An overview of tools for the validation of protein NMR structures. JOURNAL OF BIOMOLECULAR NMR 2014; 58:259-285. [PMID: 23877928 DOI: 10.1007/s10858-013-9750-x] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/27/2013] [Accepted: 06/04/2013] [Indexed: 06/02/2023]
Abstract
Biomolecular structures at atomic resolution present a valuable resource for the understanding of biology. NMR spectroscopy accounts for 11% of all structures in the PDB repository. In response to serious problems with the accuracy of some of the NMR-derived structures and in order to facilitate proper analysis of the experimental models, a number of program suites are available. We discuss nine of these tools in this review: PROCHECK-NMR, PSVS, GLM-RMSD, CING, Molprobity, Vivaldi, ResProx, NMR constraints analyzer and QMEAN. We evaluate these programs for their ability to assess the structural quality, restraints and their violations, chemical shifts, peaks and the handling of multi-model NMR ensembles. We document both the input required by the programs and output they generate. To discuss their relative merits we have applied the tools to two representative examples from the PDB: a small, globular monomeric protein (Staphylococcal nuclease from S. aureus, PDB entry 2kq3) and a small, symmetric homodimeric protein (a region of human myosin-X, PDB entry 2lw9).
Collapse
Affiliation(s)
- Geerten W Vuister
- Department of Biochemistry, School of Biological Sciences, University of Leicester, Henry Wellcome Building, Lancaster Road, Leicester, LE1 9HN, UK,
| | | | | | | | | |
Collapse
|
17
|
Mao B, Tejero R, Baker D, Montelione GT. Protein NMR structures refined with Rosetta have higher accuracy relative to corresponding X-ray crystal structures. J Am Chem Soc 2014; 136:1893-906. [PMID: 24392845 PMCID: PMC4129517 DOI: 10.1021/ja409845w] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We have found that refinement of protein NMR structures using Rosetta with experimental NMR restraints yields more accurate protein NMR structures than those that have been deposited in the PDB using standard refinement protocols. Using 40 pairs of NMR and X-ray crystal structures determined by the Northeast Structural Genomics Consortium, for proteins ranging in size from 5-22 kDa, restrained Rosetta refined structures fit better to the raw experimental data, are in better agreement with their X-ray counterparts, and have better phasing power compared to conventionally determined NMR structures. For 37 proteins for which NMR ensembles were available and which had similar structures in solution and in the crystal, all of the restrained Rosetta refined NMR structures were sufficiently accurate to be used for solving the corresponding X-ray crystal structures by molecular replacement. The protocol for restrained refinement of protein NMR structures was also compared with restrained CS-Rosetta calculations. For proteins smaller than 10 kDa, restrained CS-Rosetta, starting from extended conformations, provides slightly more accurate structures, while for proteins in the size range of 10-25 kDa the less CPU intensive restrained Rosetta refinement protocols provided equally or more accurate structures. The restrained Rosetta protocols described here can improve the accuracy of protein NMR structures and should find broad and general for studies of protein structure and function.
Collapse
Affiliation(s)
- Binchen Mao
- Center for Advanced Biotechnology and Medicine, and Department of Molecular Biology and Biochemistry, and Department of Biochemistry and Molecular Biology of Robert Wood Johnson Medical School, and Northeast Structural Genomics Consortium, Rutgers, The State University of New Jersey , Piscataway, New Jersey 08854, United States
| | | | | | | |
Collapse
|
18
|
Scapin G. Molecular replacement then and now. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:2266-75. [PMID: 24189239 PMCID: PMC3817701 DOI: 10.1107/s0907444913011426] [Citation(s) in RCA: 44] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2013] [Accepted: 04/26/2013] [Indexed: 01/22/2023]
Abstract
The `phase problem' in crystallography results from the inability to directly measure the phases of individual diffracted X-ray waves. While intensities are directly measured during data collection, phases must be obtained by other means. Several phasing methods are available (MIR, SAR, MAD, SAD and MR) and they all rely on the premise that phase information can be obtained if the positions of marker atoms in the unknown crystal structure are known. This paper is dedicated to the most popular phasing method, molecular replacement (MR), and represents a personal overview of the development, use and requirements of the methodology. The first description of noncrystallographic symmetry as a tool for structure determination was explained by Rossmann and Blow [Rossmann & Blow (1962), Acta Cryst. 15, 24-31]. The term `molecular replacement' was introduced as the name of a book in which the early papers were collected and briefly reviewed [Rossmann (1972), The Molecular Replacement Method. New York: Gordon & Breach]. Several programs have evolved from the original concept to allow faster and more sophisticated searches, including six-dimensional searches and brute-force approaches. While careful selection of the resolution range for the search and the quality of the data will greatly influence the outcome, the correct choice of the search model is probably still the main criterion to guarantee success in solving a structure using MR. Two of the main parameters used to define the `best' search model are sequence identity (25% or more) and structural similarity. Another parameter that may often be undervalued is the quality of the probe: there is clearly a relationship between the quality and the correctness of the chosen probe and its usefulness as a search model. Efforts should be made by all structural biologists to ensure that their deposited structures, which are potential search probes for future systems, are of the best possible quality.
Collapse
Affiliation(s)
- Giovanna Scapin
- Global Structural Chemistry, Merck and Co. Inc, 2000 Galloping Hill Road, Kenilworth, NJ 07033, USA
| |
Collapse
|
19
|
Stuart DI, Abrescia NGA. From lows to highs: using low-resolution models to phase X-ray data. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:2257-65. [PMID: 24189238 PMCID: PMC3817700 DOI: 10.1107/s0907444913022336] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/01/2013] [Accepted: 08/08/2013] [Indexed: 11/11/2022]
Abstract
The study of virus structures has contributed to methodological advances in structural biology that are generally applicable (molecular replacement and noncrystallographic symmetry are just two of the best known examples). Moreover, structural virology has been instrumental in forging the more general concept of exploiting phase information derived from multiple structural techniques. This hybridization of structural methods, primarily electron microscopy (EM) and X-ray crystallography, but also small-angle X-ray scattering (SAXS) and nuclear magnetic resonance (NMR) spectroscopy, is central to integrative structural biology. Here, the interplay of X-ray crystallography and EM is illustrated through the example of the structural determination of the marine lipid-containing bacteriophage PM2. Molecular replacement starting from an ~13 Å cryo-EM reconstruction, followed by cycling density averaging, phase extension and solvent flattening, gave the X-ray structure of the intact virus at 7 Å resolution This in turn served as a bridge to phase, to 2.5 Å resolution, data from twinned crystals of the major coat protein (P2), ultimately yielding a quasi-atomic model of the particle, which provided significant insights into virus evolution and viral membrane biogenesis.
Collapse
Affiliation(s)
- David I. Stuart
- Division of Structural Biology, The Wellcome Trust Centre for Human Genetics, University of Oxford, Roosevelt Drive, Headington, Oxford OX3 7BN, England
- Diamond Light Source Ltd, Diamond House, Harwell Science and Innovation Campus, Didcot, England
| | - Nicola G. A. Abrescia
- Structural Biology Unit, CIC bioGUNE, CIBERehd, Bizkaia Technology Park, Bld 800, 48160 Derio, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain
| |
Collapse
|
20
|
Bibby J, Keegan RM, Mayans O, Winn MD, Rigden DJ. Application of the AMPLE cluster-and-truncate approach to NMR structures for molecular replacement. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:2194-201. [PMID: 24189230 PMCID: PMC3817692 DOI: 10.1107/s0907444913018453] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/11/2013] [Accepted: 07/03/2013] [Indexed: 12/27/2022]
Abstract
AMPLE is a program developed for clustering and truncating ab initio protein structure predictions into search models for molecular replacement. Here, it is shown that its core cluster-and-truncate methods also work well for processing NMR ensembles into search models. Rosetta remodelling helps to extend success to NMR structures bearing low sequence identity or high structural divergence from the target protein. Potential future routes to improved performance are considered and practical, general guidelines on using AMPLE are provided.
Collapse
Affiliation(s)
- Jaclyn Bibby
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Ronan M. Keegan
- Research Complex at Harwell, STFC Rutherford Appleton Laboratory, Didcot OX11 0FA, England
| | - Olga Mayans
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| | - Martyn D. Winn
- Science and Technology Facilities Council Daresbury Laboratory, Warrington WA4 4AD, England
| | - Daniel J. Rigden
- Institute of Integrative Biology, University of Liverpool, Liverpool L69 7ZB, England
| |
Collapse
|
21
|
Marcia M, Humphris-Narayanan E, Keating KS, Somarowthu S, Rajashankar K, Pyle AM. Solving nucleic acid structures by molecular replacement: examples from group II intron studies. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:2174-85. [PMID: 24189228 PMCID: PMC3817690 DOI: 10.1107/s0907444913013218] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/23/2013] [Accepted: 05/14/2013] [Indexed: 12/17/2022]
Abstract
Structured RNA molecules are key players in ensuring cellular viability. It is now emerging that, like proteins, the functions of many nucleic acids are dictated by their tertiary folds. At the same time, the number of known crystal structures of nucleic acids is also increasing rapidly. In this context, molecular replacement will become an increasingly useful technique for phasing nucleic acid crystallographic data in the near future. Here, strategies to select, create and refine molecular-replacement search models for nucleic acids are discussed. Using examples taken primarily from research on group II introns, it is shown that nucleic acids are amenable to different and potentially more flexible and sophisticated molecular-replacement searches than proteins. These observations specifically aim to encourage future crystallographic studies on the newly discovered repertoire of noncoding transcripts.
Collapse
Affiliation(s)
- Marco Marcia
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | | | - Kevin S. Keating
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Srinivas Somarowthu
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
| | - Kanagalaghatta Rajashankar
- The Northeastern Collaborative Access Team (NE-CAT), Advanced Photon Source, Argonne National Laboratory, Argonne, IL 60439, USA
| | - Anna Marie Pyle
- Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, CT 06511, USA
- Department of Chemistry, Yale University, New Haven, CT 06511, USA
- Howard Hughes Medical Institute, Chevy Chase, MD 20815, USA
| |
Collapse
|
22
|
Rosato A, Tejero R, Montelione GT. Quality assessment of protein NMR structures. Curr Opin Struct Biol 2013; 23:715-24. [PMID: 24060334 DOI: 10.1016/j.sbi.2013.08.005] [Citation(s) in RCA: 26] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2013] [Accepted: 08/14/2013] [Indexed: 10/26/2022]
Abstract
Biomolecular NMR structures are now routinely used in biology, chemistry, and bioinformatics. Methods and metrics for assessing the accuracy and precision of protein NMR structures are beginning to be standardized across the biological NMR community. These include both knowledge-based assessment metrics, parameterized from the database of protein structures, and model versus data assessment metrics. On line servers are available that provide comprehensive protein structure quality assessment reports, and efforts are in progress by the world-wide Protein Data Bank (wwPDB) to develop a biomolecular NMR structure quality assessment pipeline as part of the structure deposition process. These quality assessment metrics and standards will aid NMR spectroscopists in determining more accurate structures, and increase the value and utility of these structures for the broad scientific community.
Collapse
Affiliation(s)
- Antonio Rosato
- Magnetic Resonance Center and Department of Chemistry, University of Florence, 50019 Sesto Fiorentino, Italy
| | | | | |
Collapse
|
23
|
Montelione GT, Nilges M, Bax A, Güntert P, Herrmann T, Richardson JS, Schwieters CD, Vranken WF, Vuister GW, Wishart DS, Berman HM, Kleywegt GJ, Markley JL. Recommendations of the wwPDB NMR Validation Task Force. Structure 2013; 21:1563-70. [PMID: 24010715 PMCID: PMC3884077 DOI: 10.1016/j.str.2013.07.021] [Citation(s) in RCA: 134] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2013] [Revised: 07/19/2013] [Accepted: 07/29/2013] [Indexed: 11/25/2022]
Abstract
As methods for analysis of biomolecular structure and dynamics using nuclear magnetic resonance spectroscopy (NMR) continue to advance, the resulting 3D structures, chemical shifts, and other NMR data are broadly impacting biology, chemistry, and medicine. Structure model assessment is a critical area of NMR methods development, and is an essential component of the process of making these structures accessible and useful to the wider scientific community. For these reasons, the Worldwide Protein Data Bank (wwPDB) has convened an NMR Validation Task Force (NMR-VTF) to work with wwPDB partners in developing metrics and policies for biomolecular NMR data harvesting, structure representation, and structure quality assessment. This paper summarizes the recommendations of the NMR-VTF, and lays the groundwork for future work in developing standards and metrics for biomolecular NMR structure quality assessment.
Collapse
Affiliation(s)
- Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA.
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
24
|
Mariani V, Biasini M, Barbato A, Schwede T. lDDT: a local superposition-free score for comparing protein structures and models using distance difference tests. Bioinformatics 2013; 29:2722-8. [PMID: 23986568 PMCID: PMC3799472 DOI: 10.1093/bioinformatics/btt473] [Citation(s) in RCA: 627] [Impact Index Per Article: 52.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023] Open
Abstract
Motivation: The assessment of protein structure prediction techniques requires objective criteria to measure the similarity between a computational model and the experimentally determined reference structure. Conventional similarity measures based on a global superposition of carbon α atoms are strongly influenced by domain motions and do not assess the accuracy of local atomic details in the model. Results: The Local Distance Difference Test (lDDT) is a superposition-free score that evaluates local distance differences of all atoms in a model, including validation of stereochemical plausibility. The reference can be a single structure, or an ensemble of equivalent structures. We demonstrate that lDDT is well suited to assess local model quality, even in the presence of domain movements, while maintaining good correlation with global measures. These properties make lDDT a robust tool for the automated assessment of structure prediction servers without manual intervention. Availability and implementation: Source code, binaries for Linux and MacOSX, and an interactive web server are available at http://swissmodel.expasy.org/lddt Contact:torsten.schwede@unibas.ch Supplementary information: Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Valerio Mariani
- Biozentrum, Universität Basel, Klingelbergstrasse 50-70 and Computational Structural Biology, SIB Swiss Institute of Bioinformatics, 4056 Basel, Switzerland
| | | | | | | |
Collapse
|
25
|
Tejero R, Snyder D, Mao B, Aramini JM, Montelione GT. PDBStat: a universal restraint converter and restraint analysis software package for protein NMR. JOURNAL OF BIOMOLECULAR NMR 2013; 56:337-51. [PMID: 23897031 PMCID: PMC3932191 DOI: 10.1007/s10858-013-9753-7] [Citation(s) in RCA: 55] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Accepted: 06/11/2013] [Indexed: 05/20/2023]
Abstract
The heterogeneous array of software tools used in the process of protein NMR structure determination presents organizational challenges in the structure determination and validation processes, and creates a learning curve that limits the broader use of protein NMR in biology. These challenges, including accurate use of data in different data formats required by software carrying out similar tasks, continue to confound the efforts of novices and experts alike. These important issues need to be addressed robustly in order to standardize protein NMR structure determination and validation. PDBStat is a C/C++ computer program originally developed as a universal coordinate and protein NMR restraint converter. Its primary function is to provide a user-friendly tool for interconverting between protein coordinate and protein NMR restraint data formats. It also provides an integrated set of computational methods for protein NMR restraint analysis and structure quality assessment, relabeling of prochiral atoms with correct IUPAC names, as well as multiple methods for analysis of the consistency of atomic positions indicated by their convergence across a protein NMR ensemble. In this paper we provide a detailed description of the PDBStat software, and highlight some of its valuable computational capabilities. As an example, we demonstrate the use of the PDBStat restraint converter for restrained CS-Rosetta structure generation calculations, and compare the resulting protein NMR structure models with those generated from the same NMR restraint data using more traditional structure determination methods. These results demonstrate the value of a universal restraint converter in allowing the use of multiple structure generation methods with the same restraint data for consensus analysis of protein NMR structures and the underlying restraint data.
Collapse
Affiliation(s)
- Roberto Tejero
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
- Departamento de Quίmica Fίsica, Universidad de Valencia, Avenida Dr. Moliner 50 46100 Burjassot, Valencia, SPAIN
| | - David Snyder
- Department of Chemistry, William Paterson University, 300 Pompton Road Wayne, New Jersey 07470, USA
| | - Binchen Mao
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
| | - James M. Aramini
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
| | - Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Rutgers, The State University of New Jersey and Robert Wood Johnson Medical School, University of Medicine and Dentistry of New Jersey, and Northeast Structural Genomics Consortium, 679 Hoes Lane, Piscataway, New Jersey, 08854, USA
- To whom correspondence should be addressed: Prof. Gaetano T. Montelione CABM, Rutgers University 679 Hoes Lane Piscataway, NJ 08854-5638 Phone: 732-235-5321
| |
Collapse
|
26
|
Schieborr U, Sreeramulu S, Elshorst B, Maurer M, Saxena K, Stehle T, Kudlinzki D, Gande SL, Schwalbe H. MOTOR: model assisted software for NMR structure determination. Proteins 2013; 81:2007-22. [PMID: 23852655 DOI: 10.1002/prot.24361] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/26/2012] [Revised: 06/26/2013] [Accepted: 06/28/2013] [Indexed: 11/06/2022]
Abstract
Eukaryotic proteins with important biological function can be partially unstructured, conformational flexible, or heterogenic. Crystallization trials often fail for such proteins. In NMR spectroscopy, parts of the polypeptide chain undergoing dynamics in unfavorable time regimes cannot be observed. De novo NMR structure determination is seriously hampered when missing signals lead to an incomplete chemical shift assignment resulting in an information content of the NOE data insufficient to determine the structure ab initio. We developed a new protein structure determination strategy for such cases based on a novel NOE assignment strategy utilizing a number of model structures but no explicit reference structure as it is used for bootstrapping like algorithms. The software distinguishes in detail between consistent and mutually exclusive pairs of possible NOE assignments on the basis of different precision levels of measured chemical shifts searching for a set of maximum number of consistent NOE assignments in agreement with 3D space. Validation of the method using the structure of the low molecular-weight-protein tyrosine phosphatase A (MptpA) showed robust results utilizing protein structures with 30-45% sequence identity and 70% of the chemical shift assignments. About 60% of the resonance assignments are sufficient to identify those structural models with highest conformational similarity to the real structure. The software was benchmarked by de novo solution structures of fibroblast growth factor 21 (FGF21) and the extracellular fibroblast growth factor receptor domain FGFR4 D2, which both failed in crystallization trials and in classical NMR structure determination.
Collapse
Affiliation(s)
- Ulrich Schieborr
- Johann Wolfgang Goethe-University Frankfurt, Institute for Organic Chemistry and Chemical Biology, Center for Biomolecular Magnetic Resonance, Max-von-Laue-Str. 7, 60438, Frankfurt am Main, Germany
| | | | | | | | | | | | | | | | | |
Collapse
|
27
|
Padilla A, Amiable C, Pochet S, Kaminski PA, Labesse G. Structure of the oncoprotein Rcl bound to three nucleotide analogues. ACTA CRYSTALLOGRAPHICA SECTION D: BIOLOGICAL CRYSTALLOGRAPHY 2013; 69:247-55. [PMID: 23385460 DOI: 10.1107/s0907444912045039] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Accepted: 10/30/2012] [Indexed: 08/26/2023]
Abstract
Rcl is a novel N-glycoside hydrolase found in mammals that shows specificity for the hydrolysis of 5'-monophosphate nucleotides. Its role in nucleotide catabolism and the resulting production of 2-deoxyribose 5-phosphate has suggested that it might fuel cancer growth. Its expression is regulated by c-Myc, but its role as an oncoprotein remains to be clarified. In parallel, various nucleosides have been shown to acquire pro-apoptotic properties upon 5'-monophosphorylation in cells. These include triciribine, a tricyclic nucleoside analogue that is currently in clinical trials in combination with a farnesyltransferase inhibitor. Similarly, an N(6)-alkyl-AMP has been shown to be cytotoxic. Interestingly, Rcl has been shown to be inhibited by such compounds in vitro. In order to gain better insight into the precise ligand-recognition determinants, the crystallization of Rcl with these nucleotide analogues was attempted. The first crystal structure of Rcl was solved by molecular replacement using its NMR structure in combination with distantly related crystal structures. The structures of Rcl bound to two other nucleotides were then solved by molecular replacement using the previous crystal structure as a template. The resulting structures, solved at high resolution, led to a clear characterization of the protein-ligand interactions that will guide further rational drug design.
Collapse
Affiliation(s)
- André Padilla
- CNRS, UMR5048, Université Montpellier 1 et 2, Centre de Biochimie Structurale, F-34090 Montpellier, France
| | | | | | | | | |
Collapse
|
28
|
Terwilliger TC, Read RJ, Adams PD, Brunger AT, Afonine PV, Grosse-Kunstleve RW, Hung LW. Improved crystallographic models through iterated local density-guided model deformation and reciprocal-space refinement. ACTA CRYSTALLOGRAPHICA. SECTION D, BIOLOGICAL CRYSTALLOGRAPHY 2012; 68:861-70. [PMID: 22751672 PMCID: PMC3388814 DOI: 10.1107/s0907444912015636] [Citation(s) in RCA: 33] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2011] [Accepted: 04/10/2012] [Indexed: 11/14/2022]
Abstract
An approach is presented for addressing the challenge of model rebuilding after molecular replacement in cases where the placed template is very different from the structure to be determined. The approach takes advantage of the observation that a template and target structure may have local structures that can be superimposed much more closely than can their complete structures. A density-guided procedure for deformation of a properly placed template is introduced. A shift in the coordinates of each residue in the structure is calculated based on optimizing the match of model density within a 6 Å radius of the center of that residue with a prime-and-switch electron-density map. The shifts are smoothed and applied to the atoms in each residue, leading to local deformation of the template that improves the match of map and model. The model is then refined to improve the geometry and the fit of model to the structure-factor data. A new map is then calculated and the process is repeated until convergence. The procedure can extend the routine applicability of automated molecular replacement, model building and refinement to search models with over 2 Å r.m.s.d. representing 65-100% of the structure.
Collapse
Affiliation(s)
- Thomas C Terwilliger
- Bioscience Division and Los Alamos Institutes, Los Alamos National Laboratory, Los Alamos, NM 87545, USA.
| | | | | | | | | | | | | |
Collapse
|
29
|
Himmel S, Grosse C, Wolff S, Schwiegk C, Becker S. Structure of the RBD-PRDI fragment of the antiterminator protein GlcT. Acta Crystallogr Sect F Struct Biol Cryst Commun 2012; 68:751-6. [PMID: 22750856 DOI: 10.1107/s1744309112020635] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2012] [Accepted: 05/07/2012] [Indexed: 11/10/2022]
Abstract
GlcT is a transcriptional antiterminator protein that is involved in regulation of glucose metabolism in Bacillus subtilis. Antiterminator proteins bind specific RNA sequences, thus preventing the formation of overlapping terminator stem-loops. The structure of a fragment (residues 3-170) comprising the RNA-binding domain (RBD) and the first regulatory domain (PRDI) of GlcT was solved at 2.0 Å resolution with one molecule in the asymmetric unit. The two domains are connected by a helical linker. Their interface is mostly constituted by hydrophobic interactions.
Collapse
Affiliation(s)
- Sebastian Himmel
- Department of NMR-based Structural Biology, Max Planck Institute for Biophysical Chemistry, Göttingen, Germany
| | | | | | | | | |
Collapse
|
30
|
Montelione GT. The Protein Structure Initiative: achievements and visions for the future. F1000 BIOLOGY REPORTS 2012; 4:7. [PMID: 22500193 PMCID: PMC3318194 DOI: 10.3410/b4-7] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
The Protein Structure Initiative (PSI) was established in 2000 by the National Institutes of General Medical Sciences with the long-term goal of providing 3D (three-dimensional) structural information for most proteins in nature. As advances in genomic sequencing, bioinformatics, homology modelling, and methods for rapid determination of 3D structures of proteins by X-ray crystallography and nuclear magnetic resonance (NMR) converged, it was proposed that our understanding of the biology of protein structure and evolution could be greatly enabled by ‘genomic-scale’ protein structure determination. Over the past 12 years, the PSI has evolved from a testing bed for new methods of sample and structure production to a core component of a wide range of biology programs.
Collapse
Affiliation(s)
- Gaetano T Montelione
- Center for Advanced Biotechnology and Medicine, Department of Molecular Biology and Biochemistry, Rutgers University Department of Biochemistry and Molecular Biology, Robert Wood Johnson Medical School, Northeast Structural Genomics Consortium, Piscataway, NJ 08854, USA
| |
Collapse
|
31
|
phenix.mr_rosetta: molecular replacement and model rebuilding with Phenix and Rosetta. ACTA ACUST UNITED AC 2012; 13:81-90. [PMID: 22418934 PMCID: PMC3375004 DOI: 10.1007/s10969-012-9129-3] [Citation(s) in RCA: 118] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2011] [Accepted: 02/07/2012] [Indexed: 11/03/2022]
Abstract
The combination of algorithms from the structure-modeling field with those of crystallographic structure determination can broaden the range of templates that are useful for structure determination by the method of molecular replacement. Automated tools in phenix.mr_rosetta simplify the application of these combined approaches by integrating Phenix crystallographic algorithms and Rosetta structure-modeling algorithms and by systematically generating and evaluating models with a combination of these methods. The phenix.mr_rosetta algorithms can be used to automatically determine challenging structures. The approaches used in phenix.mr_rosetta are described along with examples that show roles that structure-modeling can play in molecular replacement.
Collapse
|