1
|
Sahoo A, Khare S, Devanarayanan S, Jain PC, Varadarajan R. Residue proximity information and protein model discrimination using saturation-suppressor mutagenesis. eLife 2015; 4. [PMID: 26716404 PMCID: PMC4758949 DOI: 10.7554/elife.09532] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2015] [Accepted: 12/29/2015] [Indexed: 12/16/2022] Open
Abstract
Identification of residue-residue contacts from primary sequence can be used to guide protein structure prediction. Using Escherichia coli CcdB as the test case, we describe an experimental method termed saturation-suppressor mutagenesis to acquire residue contact information. In this methodology, for each of five inactive CcdB mutants, exhaustive screens for suppressors were performed. Proximal suppressors were accurately discriminated from distal suppressors based on their phenotypes when present as single mutants. Experimentally identified putative proximal pairs formed spatial constraints to recover >98% of native-like models of CcdB from a decoy dataset. Suppressor methodology was also applied to the integral membrane protein, diacylglycerol kinase A where the structures determined by X-ray crystallography and NMR were significantly different. Suppressor as well as sequence co-variation data clearly point to the X-ray structure being the functional one adopted in vivo. The methodology is applicable to any macromolecular system for which a convenient phenotypic assay exists. DOI:http://dx.doi.org/10.7554/eLife.09532.001 Common techniques to determine the three-dimensional structures of proteins can help researchers to understand these molecules’ activities, but are often time-consuming and do not work for all proteins. Proteins are made of chains of amino acids. When a protein chain folds, some of these amino acids interact with other amino acids and these contacts dictate the overall shape of the protein. This means that identifying the pairs of contacting amino acids could make it possible to predict the protein’s structure. Interactions between pairs of contacting amino acids tend to remain conserved throughout evolution, and if a mutation alters one of the amino acids in a pair then a 'compensatory' change often occurs to alter the second amino acid as well. Compensatory mutations can suggest that two amino acids are close to each other in the three-dimensional shape of a protein, but the computational methods used to identify such amino acid pairs can sometimes be inaccurate. In 2012, researchers generated mutants of a bacterial protein called CcdB with changes to single amino acids that caused the protein to fail to fold correctly. Now, Sahoo et al. – who include two of the researchers involved in the 2012 work – have developed an experimental method to identify contacting amino acids and use the CcdB protein as a test case. The approach involved searching for additional mutations that could restore the activity of five of the original mutant proteins when the proteins were produced in yeast cells. The rationale was that any secondary mutations that restored the activity must have corrected the folding defect caused by the original mutation. Sahoo et al. then predicted how close the amino acids affected by the secondary mutations were to the amino acids altered by the original mutations. This information was used to select reliable three-dimensional models of CcdB from a large set of possible structures that had been generated previously using computer models. Next, the technique was applied to a protein called diacylglycerol kinase A. The structure of this protein had previously been inferred using techniques such as X-ray crystallography and nuclear magnetic resonance, but there was a mismatch between the two methods. Sahoo et al. found that the amino acid contacts derived from their experimental method matched those found in the crystal structure, suggesting that the functional protein structure in living cells is similar to the crystal structure. In the future, the experimental approach developed in this work could be combined with existing methods to reliably guide protein structure prediction. DOI:http://dx.doi.org/10.7554/eLife.09532.002
Collapse
Affiliation(s)
- Anusmita Sahoo
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Shruti Khare
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | | | - Pankaj C Jain
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Raghavan Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India.,Jawaharlal Nehru Center for Advanced Scientific Research, Bangalore, India
| |
Collapse
|
2
|
Kim DE, Dimaio F, Yu-Ruei Wang R, Song Y, Baker D. One contact for every twelve residues allows robust and accurate topology-level protein structure modeling. Proteins 2013; 82 Suppl 2:208-18. [PMID: 23900763 DOI: 10.1002/prot.24374] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2013] [Revised: 06/12/2013] [Accepted: 06/21/2013] [Indexed: 12/19/2022]
Abstract
A number of methods have been described for identifying pairs of contacting residues in protein three-dimensional structures, but it is unclear how many contacts are required for accurate structure modeling. The CASP10 assisted contact experiment provided a blind test of contact guided protein structure modeling. We describe the models generated for these contact guided prediction challenges using the Rosetta structure modeling methodology. For nearly all cases, the submitted models had the correct overall topology, and in some cases, they had near atomic-level accuracy; for example the model of the 384 residue homo-oligomeric tetramer (Tc680o) had only 2.9 Å root-mean-square deviation (RMSD) from the crystal structure. Our results suggest that experimental and bioinformatic methods for obtaining contact information may need to generate only one correct contact for every 12 residues in the protein to allow accurate topology level modeling.
Collapse
Affiliation(s)
- David E Kim
- Department of Biochemistry, University of Washington, Seattle, 98195, Washington
| | | | | | | | | |
Collapse
|
3
|
Eggimann BL, Vostrikov VV, Veglia G, Siepmann JI. Modeling helical proteins using residual dipolar couplings, sparse long-range distance constraints and a simple residue-based force field. Theor Chem Acc 2013; 132:1388. [PMID: 24639619 DOI: 10.1007/s00214-013-1388-y] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
We present a fast and simple protocol to obtain moderate-resolution backbone structures of helical proteins. This approach utilizes a combination of sparse backbone NMR data (residual dipolar couplings and paramagnetic relaxation enhancements) or EPR data with a residue-based force field and Monte Carlo/simulated annealing protocol to explore the folding energy landscape of helical proteins. By using only backbone NMR data, which are relatively easy to collect and analyze, and strategically placed spin relaxation probes, we show that it is possible to obtain protein structures with correct helical topology and backbone RMS deviations well below 4 Å. This approach offers promising alternatives for the structural determination of proteins in which nuclear Overha-user effect data are difficult or impossible to assign and produces initial models that will speed up the high-resolution structure determination by NMR spectroscopy.
Collapse
Affiliation(s)
- Becky L Eggimann
- Department of Chemistry, Chemical Theory Center, University of Minnesota, 207 Pleasant St. SE, Minneapolis, MN 55455, USA
| | - Vitaly V Vostrikov
- Molecular Biology and Biophysics, University of Minnesota, 321 Church St. SE, Minneapolis, MN 55455, USA
| | - Gianluigi Veglia
- Department of Chemistry, Chemical Theory Center, University of Minnesota, 207 Pleasant St. SE, Minneapolis, MN 55455, USA
| | - J Ilja Siepmann
- Department of Chemistry, Chemical Theory Center, University of Minnesota, 207 Pleasant St. SE, Minneapolis, MN 55455, USA
| |
Collapse
|
4
|
Latek D, Kolinski A. CABS-NMR-De novo tool for rapid global fold determination from chemical shifts, residual dipolar couplings and sparse methyl-methyl noes. J Comput Chem 2010; 32:536-44. [DOI: 10.1002/jcc.21640] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2009] [Revised: 06/27/2010] [Accepted: 06/27/2010] [Indexed: 01/20/2023]
|
5
|
Cornilescu G, Bahrami A, Tonelli M, Markley JL, Eghbalnia HR. HIFI-C: a robust and fast method for determining NMR couplings from adaptive 3D to 2D projections. JOURNAL OF BIOMOLECULAR NMR 2007; 38:341-51. [PMID: 17610130 DOI: 10.1007/s10858-007-9173-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/09/2007] [Revised: 06/11/2007] [Accepted: 06/13/2007] [Indexed: 05/16/2023]
Abstract
We describe a novel method for the robust, rapid, and reliable determination of J couplings in multi-dimensional NMR coupling data, including small couplings from larger proteins. The method, "High-resolution Iterative Frequency Identification of Couplings" (HIFI-C) is an extension of the adaptive and intelligent data collection approach introduced earlier in HIFI-NMR. HIFI-C collects one or more optimally tilted two-dimensional (2D) planes of a 3D experiment, identifies peaks, and determines couplings with high resolution and precision. The HIFI-C approach, demonstrated here for the 3D quantitative J method, offers vital features that advance the goal of rapid and robust collection of NMR coupling data. (1) Tilted plane residual dipolar couplings (RDC) data are collected adaptively in order to offer an intelligent trade off between data collection time and accuracy. (2) Data from independent planes can provide a statistical measure of reliability for each measured coupling. (3) Fast data collection enables measurements in cases where sample stability is a limiting factor (for example in the presence of an orienting medium required for residual dipolar coupling measurements). (4) For samples that are stable, or in experiments involving relatively stronger couplings, robust data collection enables more reliable determinations of couplings in shorter time, particularly for larger biomolecules. As a proof of principle, we have applied the HIFI-C approach to the 3D quantitative J experiment to determine N-C' RDC values for three proteins ranging from 56 to 159 residues (including a homodimer with 111 residues in each subunit). A number of factors influence the robustness and speed of data collection. These factors include the size of the protein, the experimental set up, and the coupling being measured, among others. To exhibit a lower bound on robustness and the potential for time saving, the measurement of dipolar couplings for the N-C' vector represents a realistic "worst case analysis". These couplings are among the smallest currently measured, and their determination in both isotropic and anisotropic media demands the highest measurement precision. The new approach yielded excellent quantitative agreement with values determined independently by the conventional 3D quantitative J NMR method (in cases where sample stability in oriented media permitted these measurements) but with a factor of 2-5 in time savings. The statistical measure of reliability, measuring the quality of each RDC value, offers valuable adjunct information even in cases where modest time savings may be realized.
Collapse
Affiliation(s)
- Gabriel Cornilescu
- National Magnetic Resonance Facility at Madison, University of Wisconsin-Madison, Madison, WI 53706, USA.
| | | | | | | | | |
Collapse
|
6
|
Liu S, Venot A, Meng L, Tian F, Moremen KW, Boons GJ, Prestegard JH. Spin-labeled analogs of CMP-NeuAc as NMR probes of the alpha-2,6-sialyltransferase ST6Gal I. CHEMISTRY & BIOLOGY 2007; 14:409-18. [PMID: 17462576 PMCID: PMC3968682 DOI: 10.1016/j.chembiol.2007.02.010] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/01/2006] [Revised: 02/14/2007] [Accepted: 02/27/2007] [Indexed: 12/15/2022]
Abstract
Structural data on mammalian proteins are often difficult to obtain by conventional NMR approaches because of an inability to produce samples with uniform isotope labeling in bacterial expression hosts. Proteins with sparse isotope labels can be produced in eukaryotic hosts by using isotope-labeled forms of specific amino acids, but structural analysis then requires information from experiments other than nuclear Overhauser effects. One source of alternate structural information is distance-dependent perturbation of spin relaxation times by nitroxide spin-labeled analogs of natural protein ligands. Here, we introduce spin-labeled analogs of sugar nucleotide donors for sialyltransferases, specifically, CMP-TEMPO (CMP-4-O-[2,2,6,6-tetramethylpiperidine-1-oxyl]) and CMP-4carboxyTEMPO (CMP-4-O-[4-carboxy-2,2,6,6-tetramethylpiperidinine-1-oxyl]). An ability to identify resonances from active site residues and produce distance constraints is illustrated on a (15)N phenylalanine-labeled version of the structurally uncharacterized, alpha-2,6-linked sialyltransferase, ST6Gal I.
Collapse
Affiliation(s)
- Shan Liu
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA
| | | | | | | | | | | | | |
Collapse
|
7
|
Mayer KL, Qu Y, Bansal S, LeBlond PD, Jenney FE, Brereton PS, Adams MWW, Xu Y, Prestegard JH. Structure determination of a new protein from backbone-centered NMR data and NMR-assisted structure prediction. Proteins 2006; 65:480-9. [PMID: 16927360 DOI: 10.1002/prot.21119] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Targeting of proteins for structure determination in structural genomic programs often includes the use of threading and fold recognition methods to exclude proteins belonging to well-populated fold families, but such methods can still fail to recognize preexisting folds. The authors illustrate here a method in which limited amounts of structural data are used to improve an initial homology search and the data are subsequently used to produce a structure by data-constrained refinement of an identified structural template. The data used are primarily NMR-based residual dipolar couplings, but they also include additional chemical shift and backbone-nuclear Overhauser effect data. Using this methodology, a backbone structure was efficiently produced for a 10 kDa protein (PF1455) from Pyrococcus furiosus. Its relationship to existing structures and its probable function are discussed.
Collapse
Affiliation(s)
- K L Mayer
- Complex Carbohydrate Research Center, University of Georgia, Athens, Georgia 30602, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Residual Dipolar Couplings Report on the Active Conformation of Rhodopsin-Bound Protein Fragments. Top Curr Chem (Cham) 2006. [DOI: 10.1007/128_2006_088] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
9
|
Walsh JD, Kuszweski J, Wang YX. Determining a helical protein structure using peptide pixels. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2005; 177:155-9. [PMID: 16084744 DOI: 10.1016/j.jmr.2005.06.014] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2005] [Revised: 05/27/2005] [Accepted: 06/11/2005] [Indexed: 05/03/2023]
Abstract
The residual dipolar coupling-periodicity planarity correlation makes it possible to determine peptide plane orientations in regular periodic protein secondary structure elements. Each peptide plane orientation represents a "pixel" of protein structure, and is expressed in terms of three angles referred to as tilt, phase, and pitch angles. In this report, we present the novel "3P" (periodicity, planarity, and pixels) method that allows one to determine secondary and tertiary structure of alpha-helical proteins. We demonstrate the 3P method by determining the structure of domain 1 of the receptor-associated protein (RAP) to a backbone accuracy of 1.0 Angstrom using RDCs measured in a single alignment medium, together with a minimal number of NOE distance restraints, using a new Xplor-NIH module.
Collapse
Affiliation(s)
- Joseph D Walsh
- Protein Nucleic Acid Interaction Section, Structural Biophysics Laboratory, NCI-Frederick, NIH, Frederick, MD 21702, USA
| | | | | |
Collapse
|
10
|
Chen ZZ, Lin G, Rizzi R, Wen J, Xu D, Xu Y, Jiang T. More reliable protein NMR peak assignment via improved 2-interval scheduling. J Comput Biol 2005; 12:129-46. [PMID: 15767773 DOI: 10.1089/cmb.2005.12.129] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Protein NMR peak assignment refers to the process of assigning a group of "spin systems" obtained experimentally to a protein sequence of amino acids. The automation of this process is still an unsolved and challenging problem in NMR protein structure determination. Recently, protein NMR peak assignment has been formulated as an interval scheduling problem (ISP), where a protein sequence P of amino acids is viewed as a discrete time interval I (the amino acids on P one-to-one correspond to the time units of I), each subset S of spin systems that are known to originate from consecutive amino acids from P is viewed as a "job" j(s), the preference of assigning S to a subsequence P of consecutive amino acids on P is viewed as the profit of executing job j(s) in the subinterval of I corresponding to P, and the goal is to maximize the total profit of executing the jobs (on a single machine) during I. The interval scheduling problem is max SNP-hard in general; but in the real practice of protein NMR peak assignment, each job j(s) usually requires at most 10 consecutive time units, and typically the jobs that require one or two consecutive time units are the most difficult to assign/schedule. In order to solve these most difficult assignments, we present an efficient 13/7-approximation algorithm for the special case of the interval scheduling problem where each job takes one or two consecutive time units. Combining this algorithm with a greedy filtering strategy for handling long jobs (i.e., jobs that need more than two consecutive time units), we obtain a new efficient heuristic for protein NMR peak assignment. Our experimental study shows that the new heuristic produces the best peak assignment in most of the cases, compared with the NMR peak assignment algorithms in the recent literature. The above algorithm is also the first approximation algorithm for a nontrivial case of the well-known interval scheduling problem that breaks the ratio 2 barrier.
Collapse
Affiliation(s)
- Zhi-Zhong Chen
- Department of Mathematical Sciences, Tokyo Denki University, Hatoyama, Saitama 350-0394, Japan
| | | | | | | | | | | | | |
Collapse
|