1
|
Li J, Chen G, Guo Y, Wang H, Li H. Single molecule force spectroscopy reveals the context dependent folding pathway of the C-terminal fragment of Top7. Chem Sci 2020; 12:2876-2884. [PMID: 34164053 PMCID: PMC8179357 DOI: 10.1039/d0sc06344d] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022] Open
Abstract
Top7 is a de novo designed protein with atomic level accuracy and shows a folded structure not found in nature. Previous studies showed that the folding of Top7 is not cooperative and involves various folding intermediate states. In addition, various fragments of Top7 were found to fold on their own in isolation. These features displayed by Top7 are distinct from those of naturally occurring proteins of a similar size and suggest a rough folding energy landscape. However, it remains unknown if and how the intra-polypeptide chain interactions among the neighboring sequences of Top7 affect the folding of these Top7 fragments. Here we used single-molecule optical tweezers to investigate the folding–unfolding pathways of full length Top7 as well as its C-terminal fragment (CFr) in different sequence environments. Our results showed that the mechanical folding of Top7 involves an intermediate state that likely involves non-native interactions/structure. More importantly, we found that the folding of CFr is entirely dependent upon its sequence context in which it is located. When in isolation, CFr indeed folds into a cooperative structure showing near-equilibrium unfolding–folding transitions at ∼6.5 pN in OT experiments. However, CFr loses its autonomous cooperative folding ability and displays a folding pathway that is dependent on its interactions with its neighboring sequence/structure. This context-dependent folding dynamics and pathway of CFr are distinct from those of naturally occurring proteins and highlight the critical importance of intra-chain interactions in shaping the overall energy landscape and the folding pathway of Top7. These new insights may have important implications on the de novo design of proteins. Optical tweezers experiments reveal that the folding of the C-terminal fragment of Top7 (cFr) is context-dependent. Depending on its neighboring sequence, cFr shows very different folding pathways and folding kinetics. ![]()
Collapse
Affiliation(s)
- Jiayu Li
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Guojun Chen
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Yabin Guo
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Han Wang
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Hongbin Li
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| |
Collapse
|
2
|
Zamora-Carreras H, Maestro B, Sanz JM, Jiménez MA. Turncoat Polypeptides: We Adapt to Our Environment. Chembiochem 2019; 21:432-441. [PMID: 31456307 DOI: 10.1002/cbic.201900446] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Indexed: 01/25/2023]
Abstract
A common interpretation of Anfinsen's hypothesis states that one amino acid sequence should fold into a single, native, ordered state, or a highly similar set thereof, coinciding with the global minimum in the folding-energy landscape, which, in turn, is responsible for the function of the protein. However, this classical view is challenged by many proteins and peptide sequences, which can adopt exchangeable, significantly dissimilar conformations that even fulfill different biological roles. The similarities and differences of concepts related to these proteins, mainly chameleon sequences, metamorphic proteins, and switch peptides, which are all denoted herein "turncoat" polypeptides, are reviewed. As well as adding a twist to the conventional view of protein folding, the lack of structural definition adds clear versatility to the activity of proteins and can be used as a tool for protein design and further application in biotechnology and biomedicine.
Collapse
Affiliation(s)
- Héctor Zamora-Carreras
- Instituto de Química-Física Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Serrano 119, 28006, Madrid, Spain
| | - Beatriz Maestro
- Centro de Investigaciones Biológicas (CIB), Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain
| | - Jesús M Sanz
- Centro de Investigaciones Biológicas (CIB), Consejo Superior de Investigaciones Científicas (CSIC), Ramiro de Maeztu 9, 28040, Madrid, Spain.,Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), Av. Monforte de Lemos, 3-5. Pabellón, 28029, Madrid, Spain
| | - M Angeles Jiménez
- Instituto de Química-Física Rocasolano (IQFR), Consejo Superior de Investigaciones Científicas (CSIC), Serrano 119, 28006, Madrid, Spain
| |
Collapse
|
3
|
Bahramali G, Goliaei B, Minuchehr Z, Marashi SA. A network biology approach to understanding the importance of chameleon proteins in human physiology and pathology. Amino Acids 2016; 49:303-315. [DOI: 10.1007/s00726-016-2361-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2016] [Accepted: 11/05/2016] [Indexed: 12/20/2022]
|
4
|
Kim B, Do TD, Hayden EY, Teplow DB, Bowers MT, Shea JE. Aggregation of Chameleon Peptides: Implications of α-Helicity in Fibril Formation. J Phys Chem B 2016; 120:5874-83. [PMID: 27001160 DOI: 10.1021/acs.jpcb.6b00830] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We investigate the relationship between the inherent secondary structure and aggregation propensity of peptides containing chameleon sequences (i.e., sequences that can adopt either α or β structure depending on context) using a combination of replica exchange molecular dynamics simulations, ion-mobility mass spectrometry, circular dichroism, and transmission electron microscopy. We focus on an eight-residue long chameleon sequence that can adopt an α-helical structure in the context of the iron-binding protein from Bacillus anthracis (PDB id 1JIG ) and a β-strand in the context of the baculovirus P35 protein (PDB id 1P35 ). We show that the isolated chameleon sequence is intrinsically disordered, interconverting between α-helical and β-rich conformations. The inherent conformational plasticity of the sequence can be constrained by addition of flanking residues with a given secondary structure propensity. Intriguingly, we show that the chameleon sequence with helical flanking residues aggregates rapidly into fibrils, whereas the chameleon sequence with flanking residues that favor β-conformations has weak aggregation propensity. This work sheds new insights into the possible role of α-helical intermediates in fibril formation.
Collapse
Affiliation(s)
| | | | - Eric Y Hayden
- Department of Neurology, David Geffen School of Medicine at UCLA, Mary S. Easton Center for Alzheimer's Disease Research at UCLA, and Brain Research Institute and Molecular Biology Institute, University of California , 635 Charles Young Drive South, Los Angeles, California 90095, United States
| | - David B Teplow
- Department of Neurology, David Geffen School of Medicine at UCLA, Mary S. Easton Center for Alzheimer's Disease Research at UCLA, and Brain Research Institute and Molecular Biology Institute, University of California , 635 Charles Young Drive South, Los Angeles, California 90095, United States
| | | | | |
Collapse
|
5
|
Bahramali G, Goliaei B, Minuchehr Z, Salari A. Chameleon sequences in neurodegenerative diseases. Biochem Biophys Res Commun 2016; 472:209-16. [PMID: 26920059 PMCID: PMC7124260 DOI: 10.1016/j.bbrc.2016.01.187] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2016] [Accepted: 01/30/2016] [Indexed: 01/03/2023]
Abstract
Chameleon sequences can adopt either alpha helix sheet or a coil conformation. Defining chameleon sequences in PDB (Protein Data Bank) may yield to an insight on defining peptides and proteins responsible in neurodegeneration. In this research, we benefitted from the large PDB and performed a sequence analysis on Chameleons, where we developed an algorithm to extract peptide segments with identical sequences, but different structures. In order to find new chameleon sequences, we extracted a set of 8315 non-redundant protein sequences from the PDB with an identity less than 25%. Our data was classified to "helix to strand (HE)", "helix to coil (HC)" and "strand to coil (CE)" alterations. We also analyzed the occurrence of singlet and doublet amino acids and the solvent accessibility in the chameleon sequences; we then sorted out the proteins with the most number of chameleon sequences and named them Chameleon Flexible Proteins (CFPs) in our dataset. Our data revealed that Gly, Val, Ile, Tyr and Phe, are the major amino acids in Chameleons. We also found that there are proteins such as Insulin Degrading Enzyme IDE and GTP-binding nuclear protein Ran (RAN) with the most number of chameleons (640 and 405 respectively). These proteins have known roles in neurodegenerative diseases. Therefore it can be inferred that other CFP's can serve as key proteins in neurodegeneration, and a study on them can shed light on curing and preventing neurodegenerative diseases.
Collapse
Affiliation(s)
- Golnaz Bahramali
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran
| | - Bahram Goliaei
- Institute of Biochemistry and Biophysics, University of Tehran, Tehran, Iran.
| | - Zarrin Minuchehr
- Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran, Iran.
| | - Ali Salari
- Department of Systems Biotechnology, National Institute of Genetic Engineering and Biotechnology, (NIGEB), Tehran, Iran
| |
Collapse
|
6
|
Eaton KV, Anderson WJ, Dubrava MS, Kumirov VK, Dykstra EM, Cordes MHJ. Studying protein fold evolution with hybrids of differently folded homologs. Protein Eng Des Sel 2015; 28:241-50. [PMID: 25991865 DOI: 10.1093/protein/gzv027] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2015] [Accepted: 04/20/2015] [Indexed: 11/13/2022] Open
Abstract
To study the sequence determinants governing protein fold evolution, we generated hybrid sequences from two homologous proteins with 40% identity but different folds: Pfl 6 Cro, which has a mixed α + β structure, and Xfaso 1 Cro, which has an all α-helical structure. First, we first examined eight chimeric hybrids in which the more structurally conserved N-terminal half of one protein was fused to the more structurally divergent C-terminal half of the other. None of these chimeras folded, as judged by circular dichroism spectra and thermal melts, suggesting that both halves have strong intrinsic preferences for the native global fold pattern, and/or that the interfaces between the halves are not readily interchangeable. Second, we examined 10 hybrids in which blocks of the structurally divergent C-terminal region were exchanged. These hybrids showed varying levels of thermal stability and suggested that the key residues in the Xfaso 1 C terminus specifying the all-α fold were concentrated near the end of helix 4 in Xfaso 1, which aligns to the end of strand 2 in Pfl 6. Finally, we generated hybrid substitutions for each individual residue in this critical region and measured thermal stabilities. The results suggested that R47 and V48 were the strongest factors that excluded formation of the α + β fold in the C-terminal region of Xfaso 1. In support of this idea, we found that the folding stability of one of the original eight chimeras could be rescued by back-substituting these two residues. Overall, the results show not only that the key factors for Cro fold specificity and evolution are global and multifarious, but also that some all-α Cro proteins have a C-terminal subdomain sequence within a few substitutions of switching to the α + β fold.
Collapse
Affiliation(s)
- Karen V Eaton
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew S Dubrava
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Vlad K Kumirov
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Emily M Dykstra
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | - Matthew H J Cordes
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| |
Collapse
|
7
|
Uversky VN, Davé V, Iakoucheva LM, Malaney P, Metallo SJ, Pathak RR, Joerger AC. Pathological unfoldomics of uncontrolled chaos: intrinsically disordered proteins and human diseases. Chem Rev 2014; 114:6844-79. [PMID: 24830552 PMCID: PMC4100540 DOI: 10.1021/cr400713r] [Citation(s) in RCA: 192] [Impact Index Per Article: 19.2] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
Affiliation(s)
- Vladimir N. Uversky
- Department of Molecular Medicine and USF Health Byrd Alzheimer’s Research Institute University of South Florida, Tampa, Florida 33612, United States
- Institute for Biological Instrumentation, Russian Academy of Sciences, 142290 Pushchino, Moscow Region, Russia
- Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah 22254, Saudi Arabia
| | - Vrushank Davé
- Department of Pathology and Cell Biology , Morsani College of Medicine, University of South Florida, Tampa, Florida 33612, United States
- Department of Molecular Oncology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida 33612, United States
| | - Lilia M. Iakoucheva
- Department of Psychiatry, University of California San Diego, La Jolla, California 92093, United States
| | - Prerna Malaney
- Department of Pathology and Cell Biology , Morsani College of Medicine, University of South Florida, Tampa, Florida 33612, United States
| | - Steven J. Metallo
- Department of Chemistry, Georgetown University, Washington, District of Columbia 20057, United States
| | - Ravi Ramesh Pathak
- Department of Pathology and Cell Biology , Morsani College of Medicine, University of South Florida, Tampa, Florida 33612, United States
| | - Andreas C. Joerger
- Medical Research Council Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge CB2 0QH, United Kingdom
| |
Collapse
|
8
|
Bhattacharjee N, Biswas P. Helical ambivalency induced by point mutations. BMC STRUCTURAL BIOLOGY 2013; 13:9. [PMID: 23675772 PMCID: PMC3683331 DOI: 10.1186/1472-6807-13-9] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/29/2012] [Accepted: 05/02/2013] [Indexed: 01/15/2023]
Abstract
Background Mutation of amino acid sequences in a protein may have diverse effects on its structure and function. Point mutations of even a single amino acid residue in the helices of the non-redundant database may lead to sequentially identical peptides which adopt different secondary structures in different proteins. However, various physico-chemical factors which govern the formation of these ambivalent helices generated by point mutations of a sequence are not clearly known. Results Sequences generated by point mutations of helices are mapped on to their non-helical counterparts in the SCOP database. The results show that short helices are prone to transform into non-helical conformations upon point mutations. Mutation of amino acid residues by helix breakers preferentially yield non-helical conformations, while mutation with residues of intermediate helix propensity display least preferences for non-helical conformations. Differences in the solvent accessibility of the mutating/mutated residues are found to be a major criteria for these sequences to conform to non-helical conformations. Even with minimal differences in the amino acid distributions of the sequences flanking the helical and non-helical conformations, helix-flanking sequences are found be more solvent accessible. Conclusions All types of mutations from helical to non-helical conformations are investigated. The primary factors attributing such changes in conformation can be: i) type/propensity of the mutating and mutant residues ii) solvent accessibility of the residue at the mutation site iii) context/environment dependence of the flanking sequences. The results from the present study may be used to design de novo proteins via point mutations.
Collapse
|
9
|
Stewart KL, Dodds ED, Wysocki VH, Cordes MHJ. A polymetamorphic protein. Protein Sci 2013; 22:641-9. [PMID: 23471712 DOI: 10.1002/pro.2248] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2012] [Revised: 02/25/2013] [Accepted: 03/01/2013] [Indexed: 11/10/2022]
Abstract
Arc repressor is a homodimeric protein with a ribbon-helix-helix fold. A single polar-to-hydrophobic substitution (N11L) at a solvent-exposed position leads to population of an alternate dimeric fold in which 3₁₀ helices replace a β-sheet. Here we find that the variant Q9V/N11L/R13V (S-VLV), with two additional polar-to-hydrophobic surface mutations in the same β-sheet, forms a highly stable, reversibly folded octamer with approximately half the α-helical content of wild-type Arc. At low protein concentration and low ionic strength, S-VLV also populates both dimeric topologies previously observed for N11L, as judged by NMR chemical shift comparisons. Thus, accumulation of simple hydrophobic mutations in Arc progressively reduces fold specificity, leading first to a sequence with two folds and then to a manifold bridge sequence with at least three different topologies. Residues 9-14 of S-VLV form a highly hydrophobic stretch that is predicted to be amyloidogenic, but we do not observe aggregates of higher order than octamer. Increases in sequence hydrophobicity can promote amyloid aggregation but also exert broader and more complex effects on fold specificity. Altered native folds, changes in fold coupled to oligomerization, toxic pre-amyloid oligomers, and amyloid fibrils may represent a near continuum of accessible alternatives in protein structure space.
Collapse
Affiliation(s)
- Katie L Stewart
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, Arizona, USA
| | | | | | | |
Collapse
|
10
|
Abstract
The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.
Collapse
|
11
|
Anderson WJ, Van Dorn LO, Ingram WM, Cordes MHJ. Evolutionary bridges to new protein folds: design of C-terminal Cro protein chameleon sequences. Protein Eng Des Sel 2011; 24:765-71. [PMID: 21676898 DOI: 10.1093/protein/gzr027] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Regions of amino-acid sequence that are compatible with multiple folds may facilitate evolutionary transitions in protein structure. In a previous study, we described a heuristically designed chameleon sequence (SASF1, structurally ambivalent sequence fragment 1) that could adopt either of two naturally occurring conformations (α-helical or β-sheet) when incorporated as part of the C-terminal dimerization subdomain of two structurally divergent transcription factors, P22 Cro and λ Cro. Here we describe longer chameleon designs (SASF2 and SASF3) that in the case of SASF3 correspond to the full C-terminal half of the ordered region of a P22 Cro/λ Cro sequence alignment (residues 34-57). P22-SASF2 and λ(WDD)-SASF2 show moderate thermal stability in denaturation curves monitored by circular dichroism (T(m) values of 46 and 55°C, respectively), while P22-SASF3 and λ(WDD)-SASF3 have somewhat reduced stability (T(m) values of 33 and 49°C, respectively). (13)C and (1)H NMR secondary chemical shift analysis confirms two C-terminal α-helices for P22-SASF2 (residues 36-45 and 54-57) and two C-terminal β-strands for λ(WDD)-SASF2 (residues 40-45 and 50-52), corresponding to secondary structure locations in the two parent sequences. Backbone relaxation data show that both chameleon sequences have a relatively well-ordered structure. Comparisons of (15)N-(1)H correlation spectra for SASF2 and SASF3-containing proteins strongly suggest that SASF3 retains the chameleonism of SASF2. Both Cro C-terminal conformations can be encoded in a single sequence, showing the plausibility of linking different Cro folds by smooth evolutionary transitions. The N-terminal subdomain, though largely conserved in structure, also exerts an important contextual influence on the structure of the C-terminal region.
Collapse
Affiliation(s)
- William J Anderson
- Department of Chemistry and Biochemistry, University of Arizona, Tucson, AZ 85721-0088, USA
| | | | | | | |
Collapse
|
12
|
Krishna N, Guruprasad K. Certain heptapeptide and large sequences representing an entire helix, strand or coil conformation in proteins are associated as chameleon sequences. Int J Biol Macromol 2011; 49:218-22. [PMID: 21569793 PMCID: PMC7124434 DOI: 10.1016/j.ijbiomac.2011.04.017] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2011] [Accepted: 04/25/2011] [Indexed: 11/26/2022]
Abstract
Helices, strands and coils in proteins of known three-dimensional structure, corresponding to heptapeptide and large sequences (‘probe’ peptides), were scanned against peptide sequences of variable length, comprising seven or more residues that correspond to a different conformation (‘target’ peptides) in protein crystal structures available from the Protein Data Bank (PDB). Where the ‘probe’ and ‘target’ peptide sequences exactly match, they correspond to ‘chameleon’ sequences in protein structures. We observed ∼548 heptapeptide and large chameleon sequences that included peptides in the coil conformation from 53,794 PDB files that were analyzed. However, after excluding several chameleon peptides based on the quality of protein structure data, redundancy and peptides associated with cloning artifacts, such as, histidine-tags, we observed only ten chameleon peptides in structurally different proteins and the maximum length comprised seven amino acid residues. Our analysis suggests that the quality of protein structure data is important for identifying possibly, the ‘true chameleons’ in PDB. Majority of the chameleon sequences correspond to an entire strand in one protein that is observed as part of helix sequence in another protein. The heptapeptide chameleons are characterized with a high propensity of alanine, leucine and valine amino acid residues. The total hydropathy values range between −11.2 and 22.9, the difference in solvent accessibility between 2.0 Å2 and 373 Å2 units and the difference in total number of residue neighbor contacts between 0 and 7 residues. Our work identifies for the first time heptapeptide and large sequences that correspond to a single complete helix, strand or coil, which adopt entirely different secondary structures in another protein.
Collapse
Affiliation(s)
- Neethu Krishna
- Bioinformatics, Centre for Cellular and Molecular Biology (CCMB), Hyderabad, India
| | | |
Collapse
|
13
|
Ruvinsky AM, Vakser IA. Sequence composition and environment effects on residue fluctuations in protein structures. J Chem Phys 2011; 133:155101. [PMID: 20969427 DOI: 10.1063/1.3498743] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023] Open
Abstract
Structure fluctuations in proteins affect a broad range of cell phenomena, including stability of proteins and their fragments, allosteric transitions, and energy transfer. This study presents a statistical-thermodynamic analysis of relationship between the sequence composition and the distribution of residue fluctuations in protein-protein complexes. A one-node-per-residue elastic network model accounting for the nonhomogeneous protein mass distribution and the interatomic interactions through the renormalized inter-residue potential is developed. Two factors, a protein mass distribution and a residue environment, were found to determine the scale of residue fluctuations. Surface residues undergo larger fluctuations than core residues in agreement with experimental observations. Ranking residues over the normalized scale of fluctuations yields a distinct classification of amino acids into three groups: (i) highly fluctuating-Gly, Ala, Ser, Pro, and Asp, (ii) moderately fluctuating-Thr, Asn, Gln, Lys, Glu, Arg, Val, and Cys, and (iii) weakly fluctuating-Ile, Leu, Met, Phe, Tyr, Trp, and His. The structural instability in proteins possibly relates to the high content of the highly fluctuating residues and a deficiency of the weakly fluctuating residues in irregular secondary structure elements (loops), chameleon sequences, and disordered proteins. Strong correlation between residue fluctuations and the sequence composition of protein loops supports this hypothesis. Comparing fluctuations of binding site residues (interface residues) with other surface residues shows that, on average, the interface is more rigid than the rest of the protein surface and Gly, Ala, Ser, Cys, Leu, and Trp have a propensity to form more stable docking patches on the interface. The findings have broad implications for understanding mechanisms of protein association and stability of protein structures.
Collapse
Affiliation(s)
- Anatoly M Ruvinsky
- Center for Bioinformatics, The University of Kansas, Lawrence, Kansas 66047, USA.
| | | |
Collapse
|
14
|
Bhattacharjee N, Biswas P. Statistical analysis and molecular dynamics simulations of ambivalent α-helices. BMC Bioinformatics 2010; 11:519. [PMID: 20955581 PMCID: PMC2973962 DOI: 10.1186/1471-2105-11-519] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2010] [Accepted: 10/18/2010] [Indexed: 11/17/2022] Open
Abstract
Background Analysis of known protein structures reveals that identical sequence fragments in proteins can adopt different secondary structure conformations. The extent of this conformational diversity is influenced by various factors like the intrinsic sequence propensity, sequence context and other environmental factors such as pH, site directed mutations or alteration of the binding ligands. Understanding the mechanism by which the environment affects the structural ambivalence of these peptides has potential implications for protein design and reliable local structure prediction algorithms. Identification of the structurally ambivalent sequence fragments and determining the rules which dictate their conformational preferences play an important role in understanding the conformational changes observed in misfolding diseases. However, a systematic classification of their intrinsic sequence patterns or a statistical analysis of their properties and sequence context in relation to the origin of their structural diversity have largely remained unexplored. Results In this work, the conformational variability of α-helices is studied by mapping sequences from the non-redundant database to identical sequences across all classes of the SCOP (Structural Classification of Proteins) database. Some helices retain their conformations when mapped in the SCOP database while others exhibit a complete/partial switch to non-helical conformations. The results clearly depict the differences in the propensities of amino acids for the variable and conserved helices. Sequences flanking these ambivalent sequence fragments have anisotropic propensities at the N- and C-termini. This structural variability is depicted by molecular dynamics simulations in explicit solvent, which show that the short conserved helices retain their conformations while their longer counterparts fray into two or more shorter helices. Variable helices in the non-redundant database exhibit a trend of retaining helical conformations while their corresponding non-helical conformations in SCOP database show large deviations from their respective initial structures by adopting partial or full helical conformations. Partially ambivalent helices are also found to retain their respective conformations. Conclusions All sequence fragments which show structural diversity in different proteins of the non-redundant database are investigated. The final conformation of these ambivalent sequences are dictated by a fine tuning of their intrinsic sequence propensity and the anisotropic amino acid propensity of the flanking sequences. This analysis may unravel the connection between diverse secondary structures, which conserve the overall structural fold of the protein thus determining its function.
Collapse
|
15
|
|
16
|
Guo JT, Jaromczyk JW, Xu Y. Analysis of chameleon sequences and their implications in biological processes. Proteins 2007; 67:548-58. [PMID: 17299764 DOI: 10.1002/prot.21285] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
Chameleon sequences have been implicated in amyloid related diseases. Here we report an analysis of two types of chameleon sequences, chameleon-HS (Helix vs. Strand) and chameleon-HE (Helix vs. Sheet), based on known structures in Protein Data Bank. Our survey shows that the longest chameleon-HS is eight residues while the longest chameleon-HE is seven residues. We have done a detailed analysis on the local and global environment that might contribute to the unique conformation of a chameleon sequence. We found that the existence of chameleon sequences does not present a problem for secondary structure prediction programs, including the first generation prediction programs, such as Chou-Fasman algorithm, and the third generation prediction programs that utilize evolution information. We have also investigated the possible implication of chameleon sequences in structural conservation and functional diversity of alternatively spliced protein isoforms.
Collapse
Affiliation(s)
- Jun-Tao Guo
- Computational Systems Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Georgia, Athens, Georgia 30602, USA
| | | | | |
Collapse
|
17
|
Abstract
The question of whether novel, structurally different protein folds might have arisen from existing ones is crucial to understanding protein evolution. Recent work on cysteine-rich domains in Hydra proteins illuminates how evolutionary transitions between dramatically different structures might occur.
Collapse
Affiliation(s)
- Todd O Yeates
- UCLA Department of Chemistry and Biochemistry, 611 Charles Young Drive East, Los Angeles, California 90095-1569, USA.
| |
Collapse
|
18
|
Belogurov GA, Vassylyeva MN, Svetlov V, Klyuyev S, Grishin NV, Vassylyev DG, Artsimovitch I. Structural basis for converting a general transcription factor into an operon-specific virulence regulator. Mol Cell 2007; 26:117-29. [PMID: 17434131 PMCID: PMC3116145 DOI: 10.1016/j.molcel.2007.02.021] [Citation(s) in RCA: 171] [Impact Index Per Article: 10.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2006] [Revised: 01/16/2007] [Accepted: 02/14/2007] [Indexed: 11/24/2022]
Abstract
RfaH, a paralog of the general transcription factor NusG, is recruited to elongating RNA polymerase at specific regulatory sites. The X-ray structure of Escherichia coli RfaH reported here reveals two domains. The N-terminal domain displays high similarity to that of NusG. In contrast, the alpha-helical coiled-coil C domain, while retaining sequence similarity, is strikingly different from the beta barrel of NusG. To our knowledge, such an all-beta to all-alpha transition of the entire domain is the most extreme example of protein fold evolution known to date. Both N domains possess a vast hydrophobic cavity that is buried by the C domain in RfaH but is exposed in NusG. We propose that this cavity constitutes the RNA polymerase-binding site, which becomes unmasked in RfaH only upon sequence-specific binding to the nontemplate DNA strand that triggers domain dissociation. Finally, we argue that RfaH binds to the beta' subunit coiled coil, the major target site for the initiation sigma factors.
Collapse
Affiliation(s)
- Georgiy A. Belogurov
- Department of Microbiology, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
- The RNA Group, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
| | - Marina N. Vassylyeva
- Department of Biochemistry and Molecular Genetics, Schools of Medicine and Dentistry, University of Alabama at Birmingham, 402B Kaul Genetics Building, 720 20th Street South, Birmingham, AL 35294, USA
| | - Vladimir Svetlov
- Department of Microbiology, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
- The RNA Group, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
| | - Sergiy Klyuyev
- Department of Biochemistry and Molecular Genetics, Schools of Medicine and Dentistry, University of Alabama at Birmingham, 402B Kaul Genetics Building, 720 20th Street South, Birmingham, AL 35294, USA
| | - Nick V. Grishin
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
- Department of Biochemistry, University of Texas Southwestern Medical Center at Dallas, 5323 Harry Hines Boulevard, Dallas, TX 75390, USA
| | - Dmitry G. Vassylyev
- Department of Biochemistry and Molecular Genetics, Schools of Medicine and Dentistry, University of Alabama at Birmingham, 402B Kaul Genetics Building, 720 20th Street South, Birmingham, AL 35294, USA
| | - Irina Artsimovitch
- Department of Microbiology, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
- The RNA Group, The Ohio State University, 484 West 12th Avenue, Columbus, OH 43210, USA
- Correspondence:
| |
Collapse
|
19
|
Andreeva A, Murzin AG. Evolution of protein fold in the presence of functional constraints. Curr Opin Struct Biol 2006; 16:399-408. [PMID: 16650981 DOI: 10.1016/j.sbi.2006.04.003] [Citation(s) in RCA: 95] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2006] [Revised: 03/28/2006] [Accepted: 04/21/2006] [Indexed: 11/24/2022]
Abstract
The functional requirement to form and maintain the active site structure probably exerts a strong selective pressure on a protein to adopt just one stable and evolutionarily conserved fold. Nonetheless, new evidence suggests the likelihood of protein fold being neither physically nor biologically invariant. Alternative folds discovered in several proteins are composed of constant and variable parts. The latter display context-dependent conformations and a tendency to form new oligomeric interfaces. In turn, oligomerisation mediates fold evolution without loss of protein function. Gene duplication breaks down homo-oligomeric symmetry and relieves the pressure to maintain the local architecture of redundant active sites; this can lead to further structural changes.
Collapse
Affiliation(s)
- Antonina Andreeva
- MRC Centre for Protein Engineering, Hills Road, Cambridge CB2 2QH, UK
| | | |
Collapse
|
20
|
Schechter NM, Choi EJ, Wang ZM, Hanakawa Y, Stanley JR, Kang Y, Clayman GL, Jayakumar A. Inhibition of human kallikreins 5 and 7 by the serine protease inhibitor lympho-epithelial Kazal-type inhibitor (LEKTI). Biol Chem 2005; 386:1173-84. [PMID: 16307483 DOI: 10.1515/bc.2005.134] [Citation(s) in RCA: 74] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractLEKTI is a 120-kDa protein that plays an important role in skin development, as mutations affecting LEKTI synthesis underlie Netherton syndrome, an inherited skin disorder producing severe scaling. Its primary sequence indicates that the protein consists of 15 domains, all resembling a Kazal-type serine protease inhibitor. LEKTI and two serine proteases belonging to the human tissue kallikrein (hK) family (hK5 and hK7) are expressed in the granular layer of skin. In this study, we characterize the interaction of two recombinant LEKTI fragments containing three or four intact Kazal domains (domains 6–8 and 9–12) with recombinant rhK5, a trypsin-like protease, and recombinant rhK7, a chymotrypsin-like protease. Both fragments inhibited rhK5 similarly in binding and kinetic studies performed at pH 8.0, as well as pH 5.0, the pH of the stratum corneum where both LEKTI and proteases may function. Inhibition equilibrium constants (Ki) measured either directly in concentration-dependent studies or calculated from measured association (kass) and dissociation (kdis) rate constants were 1.2–5.5 nM at pH 8.0 and 10–20 nM at pH 5.0. At pH 8.0,kassandkdisvalues were 4.7×105 M−1s−1and 5.5×10−4 s−1, and at pH 5.0 they were 4.0×104 M−1 s−1and 4.3×10−4 s−1, respectively. The lowKiandkdisvalues (t1/2of 20–25 min) indicate tight and specific association. Only fragment 6–9′ was a good inhibitor of rhK7, demonstrating aKiof 11 nM at pH 8.0 in a reaction that was rapidly reversible. These results show that LEKTI, at least in fragment form, is a potent inhibitor of rhK5 and that this protease may be a target of LEKTI in human skin.
Collapse
Affiliation(s)
- Norman M Schechter
- Department of Dermatology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | | | | | | | | | | | | | | |
Collapse
|