1
|
Zhang H, Zhang H, Chen C. Investigating the folding mechanism of the N-terminal domain of ribosomal protein L9. Proteins 2021; 89:832-844. [PMID: 33576138 DOI: 10.1002/prot.26062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 01/04/2021] [Accepted: 01/31/2021] [Indexed: 11/10/2022]
Abstract
Protein folding is a popular topic in the life science. However, due to the limited sampling ability of experiments and simulations, the general folding mechanism is not yet clear to us. In this work, we study the folding of the N-terminal domain of ribosomal protein L9 (NTL9) in detail by a mixing replica exchange molecular dynamics method. The simulation results are close to previous experimental observations. According to the Markov state model, the folding of the protein follows a nucleation-condensation path. Moreover, after the comparison to its 39-residue β-α-β motif, we find that the helix at the C-terminal has a great influence on the folding process of the intact protein, including the nucleation of the key residues in the transition state ensemble and the packing of the hydrophobic residues in the native state.
Collapse
Affiliation(s)
- Haozhe Zhang
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Haomiao Zhang
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, China
| | - Changjun Chen
- Biomolecular Physics and Modeling Group, School of Physics, Huazhong University of Science and Technology, Wuhan, China
| |
Collapse
|
2
|
Halder R, Jana B. Exploring the role of hydrophilic amino acids in unfolding of protein in aqueous ethanol solution. Proteins 2020; 89:116-125. [PMID: 32860277 DOI: 10.1002/prot.25999] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2019] [Revised: 08/07/2020] [Accepted: 08/25/2020] [Indexed: 12/14/2022]
Abstract
Hydrophobic association is the key contributor behind the formation of well packed core of a protein which is often believed to be an important step for folding from an unfolded chain to its compact functional form. While most of the protein folding/unfolding studies have evaluated the changes in the hydrophobic interactions during chemical denaturation, the role of hydrophilic amino acids in such processes are not discussed in detail. Here we report the role of the hydrophilic amino acids behind ethanol induced unfolding of protein. Using free energy simulations, we show that chicken villin head piece (HP-36) protein unfolds gradually in presence of water-ethanol binary mixture with increasing composition of ethanol. However, upon mutation of hydrophilic amino acids by glycine while keeping the hydrophobic amino acids intact, the compact state of the protein is found to be stable at all compositions with gradual flattening of the free energy landscape upon increasing compositions. The local environment around the protein in terms of ethanol/water number significantly differs in wild type protein compared to the mutated protein. The calculated Wyman-Tanford preferential binding coefficient of ethanol for wild type protein reveals that a greater number of cosolutes (here ethanol) bind to the unfolded state compared to its folded state. However, no significant increase in binding coefficient of ethanol at the unfolded state is found for mutated protein. Local-bulk partition coefficient calculation also suggests similar scenarios. Our results reveal that the weakening of hydrophobic interactions in aqueous ethanol solution along with larger preferential binding of ethanol to the unfolded state mediated by hydrophilic amino acids combinedly helps unfolding of protein in aqueous ethanol solution.
Collapse
Affiliation(s)
- Ritaban Halder
- School of Chemical Sciences, Indian Association for the cultivation of Science, Jadavpur, Kolkata, West Bengal, India
| | - Biman Jana
- School of Chemical Sciences, Indian Association for the cultivation of Science, Jadavpur, Kolkata, West Bengal, India
| |
Collapse
|
3
|
Wilson CJ, Bommarius AS, Champion JA, Chernoff YO, Lynn DG, Paravastu AK, Liang C, Hsieh MC, Heemstra JM. Biomolecular Assemblies: Moving from Observation to Predictive Design. Chem Rev 2018; 118:11519-11574. [PMID: 30281290 PMCID: PMC6650774 DOI: 10.1021/acs.chemrev.8b00038] [Citation(s) in RCA: 55] [Impact Index Per Article: 9.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Biomolecular assembly is a key driving force in nearly all life processes, providing structure, information storage, and communication within cells and at the whole organism level. These assembly processes rely on precise interactions between functional groups on nucleic acids, proteins, carbohydrates, and small molecules, and can be fine-tuned to span a range of time, length, and complexity scales. Recognizing the power of these motifs, researchers have sought to emulate and engineer biomolecular assemblies in the laboratory, with goals ranging from modulating cellular function to the creation of new polymeric materials. In most cases, engineering efforts are inspired or informed by understanding the structure and properties of naturally occurring assemblies, which has in turn fueled the development of predictive models that enable computational design of novel assemblies. This Review will focus on selected examples of protein assemblies, highlighting the story arc from initial discovery of an assembly, through initial engineering attempts, toward the ultimate goal of predictive design. The aim of this Review is to highlight areas where significant progress has been made, as well as to outline remaining challenges, as solving these challenges will be the key that unlocks the full power of biomolecules for advances in technology and medicine.
Collapse
Affiliation(s)
- Corey J. Wilson
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Andreas S. Bommarius
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Julie A. Champion
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Yury O. Chernoff
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
- Laboratory of Amyloid Biology & Institute of Translational Biomedicine, St. Petersburg State University, St. Petersburg 199034, Russia
| | - David G. Lynn
- Department of Chemistry, Emory University, Atlanta, Georgia 30322, United States
| | - Anant K. Paravastu
- School of Chemical & Biomolecular Engineering, Georgia Institute of Technology, Atlanta, Georgia 30332, United States
| | - Chen Liang
- Department of Chemistry, Emory University, Atlanta, Georgia 30322, United States
| | - Ming-Chien Hsieh
- Department of Chemistry, Emory University, Atlanta, Georgia 30322, United States
| | - Jennifer M. Heemstra
- Department of Chemistry, Emory University, Atlanta, Georgia 30322, United States
| |
Collapse
|
4
|
Chen J, Chen J, Pinamonti G, Clementi C. Learning Effective Molecular Models from Experimental Observables. J Chem Theory Comput 2018; 14:3849-3858. [DOI: 10.1021/acs.jctc.8b00187] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022]
Affiliation(s)
- Justin Chen
- Department of Physics and Astronomy, Rice University, Houston, Texas 77005, United States
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, United States
| | - Jiming Chen
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, Texas 77005, United States
| | - Giovanni Pinamonti
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
| | - Cecilia Clementi
- Department of Chemical and Biomolecular Engineering, Rice University, Houston, Texas 77005, United States
- Department of Mathematics and Computer Science, Freie Universität, Berlin, Germany
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, United States
- Department of Chemistry, Rice University, Houston, Texas 77005, United States
| |
Collapse
|
5
|
Fuglestad B, Stetz MA, Belnavis Z, Wand AJ. Solution NMR investigation of the response of the lactose repressor core domain dimer to hydrostatic pressure. Biophys Chem 2017; 231:39-44. [PMID: 28249763 DOI: 10.1016/j.bpc.2017.02.002] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2017] [Revised: 02/21/2017] [Accepted: 02/22/2017] [Indexed: 11/19/2022]
Abstract
Previous investigations of the sensitivity of the lac repressor to high-hydrostatic pressure have led to varying conclusions. Here high-pressure solution NMR spectroscopy is used to provide an atomic level view of the pressure induced structural transition of the lactose repressor regulatory domain (LacI* RD) bound to the ligand IPTG. As the pressure is raised from ambient to 3kbar the native state of the protein is converted to a partially unfolded form. Estimates of rotational correlation times using transverse optimized relaxation indicates that a monomeric state is never reached and that the predominate form of the LacI* RD is dimeric throughout this pressure change. Spectral analysis suggests that the pressure-induced transition is localized and is associated with a volume change of approximately -115mlmol-1 and an average pressure dependent change in compressibility of approximately 30mlmol-1kbar-1. In addition, a subset of resonances emerge at high-pressures indicating the presence of a non-native but folded alternate state.
Collapse
Affiliation(s)
- Brian Fuglestad
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, 905 Stellar-Chance Laboratories, 422 Curie Blvd, Philadelphia, PA 19104-6059, USA
| | - Matthew A Stetz
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, 905 Stellar-Chance Laboratories, 422 Curie Blvd, Philadelphia, PA 19104-6059, USA
| | - Zachary Belnavis
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, 905 Stellar-Chance Laboratories, 422 Curie Blvd, Philadelphia, PA 19104-6059, USA
| | - A Joshua Wand
- Johnson Research Foundation, Department of Biochemistry and Biophysics, University of Pennsylvania Perelman School of Medicine, 905 Stellar-Chance Laboratories, 422 Curie Blvd, Philadelphia, PA 19104-6059, USA.
| |
Collapse
|
6
|
Wolynes PG. Evolution, energy landscapes and the paradoxes of protein folding. Biochimie 2014; 119:218-30. [PMID: 25530262 DOI: 10.1016/j.biochi.2014.12.007] [Citation(s) in RCA: 110] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Accepted: 12/11/2014] [Indexed: 01/25/2023]
Abstract
Protein folding has been viewed as a difficult problem of molecular self-organization. The search problem involved in folding however has been simplified through the evolution of folding energy landscapes that are funneled. The funnel hypothesis can be quantified using energy landscape theory based on the minimal frustration principle. Strong quantitative predictions that follow from energy landscape theory have been widely confirmed both through laboratory folding experiments and from detailed simulations. Energy landscape ideas also have allowed successful protein structure prediction algorithms to be developed. The selection constraint of having funneled folding landscapes has left its imprint on the sequences of existing protein structural families. Quantitative analysis of co-evolution patterns allows us to infer the statistical characteristics of the folding landscape. These turn out to be consistent with what has been obtained from laboratory physicochemical folding experiments signaling a beautiful confluence of genomics and chemical physics.
Collapse
Affiliation(s)
- Peter G Wolynes
- Department of Chemistry and Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, USA.
| |
Collapse
|
7
|
Dasgupta A, Udgaonkar JB, Das P. Multistage Unfolding of an SH3 Domain: An Initial Urea-Filled Dry Molten Globule Precedes a Wet Molten Globule with Non-Native Structure. J Phys Chem B 2014; 118:6380-92. [DOI: 10.1021/jp410019f] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Amrita Dasgupta
- National
Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| | - Jayant B. Udgaonkar
- National
Centre for Biological Sciences, Tata Institute of Fundamental Research, Bangalore 560065, India
| | - Payel Das
- Computational
Biology Center, IBM Thomas J. Watson Research Center, 1101 Kitchawan
Road, Yorktown Heights, New
York 10598, United States
| |
Collapse
|
8
|
Coarse-grained molecular simulation of epidermal growth factor receptor protein tyrosine kinase multi-site self-phosphorylation. PLoS Comput Biol 2014; 10:e1003435. [PMID: 24453959 PMCID: PMC3894164 DOI: 10.1371/journal.pcbi.1003435] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2013] [Accepted: 11/14/2013] [Indexed: 12/22/2022] Open
Abstract
Upon the ligand-dependent dimerization of the epidermal growth factor receptor (EGFR), the intrinsic protein tyrosine kinase (PTK) activity of one receptor monomer is activated, and the dimeric receptor undergoes self-phosphorylation at any of eight candidate phosphorylation sites (P-sites) in either of the two C-terminal (CT) domains. While the structures of the extracellular ligand binding and intracellular PTK domains are known, that of the ∼225-amino acid CT domain is not, presumably because it is disordered. Receptor phosphorylation on CT domain P-sites is critical in signaling because of the binding of specific signaling effector molecules to individual phosphorylated P-sites. To investigate how the combination of conventional substrate recognition and the unique topological factors involved in the CT domain self-phosphorylation reaction lead to selectivity in P-site phosphorylation, we performed coarse-grained molecular simulations of the P-site/catalytic site binding reactions that precede EGFR self-phosphorylation events. Our results indicate that self-phosphorylation of the dimeric EGFR, although generally believed to occur in trans, may well occur with a similar efficiency in cis, with the P-sites of both receptor monomers being phosphorylated to a similar extent. An exception was the case of the most kinase-proximal P-site-992, the catalytic site binding of which occurred exclusively in cis via an intramolecular reaction. We discovered that the in cis interaction of P-site-992 with the catalytic site was facilitated by a cleft between the N-terminal and C-terminal lobes of the PTK domain that allows the short CT domain sequence tethering P-site-992 to the PTK core to reach the catalytic site. Our work provides several new mechanistic insights into the EGFR self-phosphorylation reaction, and demonstrates the potential of coarse-grained molecular simulation approaches for investigating the complexities of self-phosphorylation in molecules such as EGFR (HER/ErbB) family receptors and growth factor receptor PTKs in general. The epidermal growth factor receptor (EGFR) is one of a large group of cell surface receptors that allow cells to respond to growth-stimulating signals in their environment. Upon sensing of growth factor, the EGFR is activated, which triggers a signaling cascade leading to the cell nucleus and ultimately initiating cell division. The first event following receptor activation is an intramolecular kinase reaction that results in the introduction of phosphate groups onto several specific amino acids (phosphorylation sites or P-sites) in the tail of the EGFR protein. Thus, the tail of the receptor undergoes self-phosphorylation, which involves conformational motions enabling the various P-sites to access the catalytic site. The structure of the tail of the receptor is unknown, and hence the mechanism of the self-phosphorylation reaction is not well understood. To investigate this mechanism, we generated a structural model of the EGFR protein and performed computer simulations of EGFR P-site/catalytic site binding reactions. These simulations indicated how the distribution of P-sites along the tail of the receptor and restrictions in molecular movements of the tail lead to selectivity in the phosphorylation of the different P-sites. Our simulations yielded unique insights into the mechanism of EGFR self-phosphorylation that have important biological implications.
Collapse
|
9
|
Dasgupta A, Udgaonkar JB. Transient Non-Native Burial of a Trp Residue Occurs Initially during the Unfolding of a SH3 Domain. Biochemistry 2012; 51:8226-34. [DOI: 10.1021/bi3008627] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Amrita Dasgupta
- National Centre for Biological
Sciences, Tata Institute of Fundamental Research, Bangalore 560065,
India
| | - Jayant B. Udgaonkar
- National Centre for Biological
Sciences, Tata Institute of Fundamental Research, Bangalore 560065,
India
| |
Collapse
|
10
|
Ramot R, Kishore Inampudi K, Wilson CJ. Lactose Repressor Experimental Folding Landscape: Fundamental Functional Unit and Tetramer Folding Mechanisms. Biochemistry 2012; 51:7569-79. [DOI: 10.1021/bi300545f] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Roee Ramot
- Department
of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06511, United States
| | - Krishna Kishore Inampudi
- Department
of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06511, United States
| | - Corey J. Wilson
- Department
of Chemical and Environmental Engineering, Yale University, New Haven, Connecticut 06511, United States
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06511, United
States
| |
Collapse
|
11
|
Wang Y, Chu X, Suo Z, Wang E, Wang J. Multidomain protein solves the folding problem by multifunnel combined landscape: theoretical investigation of a Y-family DNA polymerase. J Am Chem Soc 2012; 134:13755-64. [PMID: 22827444 DOI: 10.1021/ja3045663] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023]
Abstract
Approximately three-fourths of eukaryotic proteins are composed of multiple independently folded domains. However, much of our understanding is based on single domain proteins or isolated domains whose studies directly lead to well-known energy landscape theory in which proteins fold by navigating through a funneled energy landscape toward native structure ensembles. The degrees of freedom for proteins with multiple domains are many orders of magnitude larger than that for single domain proteins. Now, the question arises: How do the multidomain proteins solve the "protein folding problem"? Here, we specifically address this issue by exploring the structure folding relationship of Sulfolobus solfataricus DNA polymerase IV (DPO4), a prototype Y-family DNA polymerase which contains a polymerase core consisting of a palm (P domain), a finger (F domain), and a thumb domain (T domain) in addition to a little finger domain (LF domain). The theoretical results are in good agreement with the experimental data and lead to several theoretical predictions. Finally, we propose that for rapid folding into well-defined conformations which carry out the biological functions, four-domain DPO4 employs a divide-and-conquer strategy, that is, combining multiple individual folding funnels into a single funnel (domains fold independently and then coalesce). In this way, the degrees of freedom for multidomain proteins are polynomial rather than exponential, and the conformational search process can be reduced effectively from a large to a smaller time scale.
Collapse
Affiliation(s)
- Yong Wang
- State Key Laboratory of Electroanalytical Chemistry, Changchun Institute of Applied Chemistry, Chinese Academy of Sciences, Changchun, Jilin, 130022, PR China
| | | | | | | | | |
Collapse
|
12
|
Das P, Matysiak S. Direct Characterization of Hydrophobic Hydration during Cold and Pressure Denaturation. J Phys Chem B 2012; 116:5342-8. [DOI: 10.1021/jp211832c] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Payel Das
- Computational Biology Center, IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598, United States
| | - Silvina Matysiak
- Fischell Department of Bioengineering, University of Maryland, College Park, Maryland 20742,
United States
| |
Collapse
|
13
|
Huang H, Zhao R, Dickson BM, Skeel RD, Post CB. αC helix as a switch in the conformational transition of Src/CDK-like kinase domains. J Phys Chem B 2012; 116:4465-75. [PMID: 22448785 DOI: 10.1021/jp301628r] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
One mechanism of regulating the catalytic activity of protein kinases is through conformational transitions. Despite great diversity in the structural changes involved in the transitions, a certain set of changes within the kinase domain (KD) has been observed for many kinases including Src and CDK2. We investigated this conformational transition computationally to identify the topological features that are energetically critical to the transition. Results from both molecular dynamics sampling and transition path optimization highlight the displacement of the αC helix as the major energy barrier, mediating the switch of the KD between the active and down-regulated states. The critical role of the αC helix is noteworthy by providing a rationale for a number of activation and deactivation mechanisms known to occur in cells. We find that kinases with the αC helix displacement exist throughout the kinome, suggesting that this feature may have emerged early in evolution.
Collapse
Affiliation(s)
- He Huang
- Department of Medicinal Chemistry and Molecular Pharmacology, Markey Center for Structural Biology and Purdue Cancer Center, Purdue University, West Lafayette, Indiana 47907, United States
| | | | | | | | | |
Collapse
|
14
|
Wolff K, Vendruscolo M, Porto M. Coarse-grained model for protein folding based on structural profiles. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2011; 84:041934. [PMID: 22181202 DOI: 10.1103/physreve.84.041934] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/21/2011] [Indexed: 05/31/2023]
Abstract
We study a coarse-grained protein model whose primary characteristics are (i) a tubelike geometry to describe the self-avoidance effects of the polypeptide chain and (ii) an energy function based on a one-dimensional structural representation. The latter specifies the connectivity of a sequence in a given conformation, so that the energy function, rather than favoring the formation of specific native pairwise contacts, promotes the establishment of a specific target connectivity for each amino acid. We show that the resulting dynamics is in good agreement with both experimental observations and the results of all-atoms simulations. In contrast to the latter, our coarse-grained approach provides the possibility to explore longer time scales and thus enables one to access, albeit in less detail, larger regions of the conformational space. We illustrate our approach by its application to the villin headpiece domain, a three-helix protein, by studying its folding behavior and determining heat capacities and free-energy landscapes in various reaction coordinates.
Collapse
Affiliation(s)
- Katrin Wolff
- School of Physics, University of Edinburgh, JCMB Kings Buildings, Edinburgh EH9 3JZ, United Kingdom
| | | | | |
Collapse
|
15
|
Das P, King JA, Zhou R. Aggregation of γ-crystallins associated with human cataracts via domain swapping at the C-terminal β-strands. Proc Natl Acad Sci U S A 2011; 108:10514-9. [PMID: 21670251 PMCID: PMC3127930 DOI: 10.1073/pnas.1019152108] [Citation(s) in RCA: 101] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The prevalent eye disease age-onset cataract is associated with aggregation of human γD-crystallins, one of the longest-lived proteins. Identification of the γ-crystallin precursors to aggregates is crucial for developing strategies to prevent and reverse cataract. Our microseconds of atomistic molecular dynamics simulations uncover the molecular structure of the experimentally detected aggregation-prone folding intermediate species of monomeric native γD-crystallin with a largely folded C-terminal domain and a mostly unfolded N-terminal domain. About 30 residues including a, b, and c strands from the Greek Key motif 4 of the C-terminal domain experience strong solvent exposure of hydrophobic residues as well as partial unstructuring upon N-terminal domain unfolding. Those strands comprise the domain-domain interface crucial for unusually high stability of γD-crystallin. We further simulate the intermolecular linkage of these monomeric aggregation precursors, which reveals domain-swapped dimeric structures. In the simulated dimeric structures, the N-terminal domain of one monomer is frequently found in contact with residues 135-164 encompassing the a, b, and c strands of the Greek Key motif 4 of the second molecule. The present results suggest that γD-crystallin may polymerize through successive domain swapping of those three C-terminal β-strands leading to age-onset cataract, as an evolutionary cost of its very high stability. Alanine substitutions of the hydrophobic residues in those aggregation-prone β-strands, such as L145 and M147, hinder domain swapping as a pathway toward dimerization. These findings thus provide critical molecular insights onto the initial stages of age-onset cataract, which is important for understanding protein aggregation diseases.
Collapse
Affiliation(s)
- Payel Das
- IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598
| | - Jonathan A. King
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139; and
| | - Ruhong Zhou
- IBM Thomas J. Watson Research Center, Yorktown Heights, NY 10598
- Department of Chemistry, Columbia University, New York, NY 10027
| |
Collapse
|
16
|
Stamati H, Clementi C, Kavraki LE. Application of nonlinear dimensionality reduction to characterize the conformational landscape of small peptides. Proteins 2010; 78:223-35. [PMID: 19731366 PMCID: PMC2795065 DOI: 10.1002/prot.22526] [Citation(s) in RCA: 51] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The automatic classification of the wealth of molecular configurations gathered in simulation in the form of a few coordinates that help to explain the main states and transitions of the system is a recurring problem in computational molecular biophysics. We use the recently proposed ScIMAP algorithm to automatically extract motion parameters from simulation data. The procedure uses only molecular shape similarity and topology information inferred directly from the simulated conformations, and is not biased by a priori known information. The automatically recovered coordinates prove as excellent reaction coordinates for the molecules studied and can be used to identify stable states and transitions, and as a basis to build free-energy surfaces. The coordinates provide a better description of the free energy landscape when compared with coordinates computed using principal components analysis, the most popular linear dimensionality reduction technique. The method is first validated on the analysis of the dynamics of an all-atom model of alanine dipeptide, where it successfully recover all previously known metastable states. When applied to characterize the simulated folding of a coarse-grained model of beta-hairpin, in addition to the folded and unfolded states, two symmetric misfolding crossings of the hairpin strands are observed, together with the most likely transitions from one to the other.
Collapse
Affiliation(s)
- Hernán Stamati
- Department of Computer Science, Rice University, Houston, Texas, 77005
| | - Cecilia Clementi
- Department of Chemistry Rice University, Houston, Texas, 77005
- Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas, 77030
| | - Lydia E. Kavraki
- Department of Computer Science, Rice University, Houston, Texas, 77005
- Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, Texas, 77030
- Department of Bioengineering Rice University, Houston, Texas, 77005
| |
Collapse
|
17
|
Universal distribution of protein evolution rates as a consequence of protein folding physics. Proc Natl Acad Sci U S A 2010; 107:2983-8. [PMID: 20133769 DOI: 10.1073/pnas.0910445107] [Citation(s) in RCA: 58] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
The hypothesis that folding robustness is the primary determinant of the evolution rate of proteins is explored using a coarse-grained off-lattice model. The simplicity of the model allows rapid computation of the folding probability of a sequence to any folded conformation. For each robust folder, the network of sequences that share its native structure is identified. The fitness of a sequence is postulated to be a simple function of the number of misfolded molecules that have to be produced to reach a characteristic protein abundance. After fixation probabilities of mutants are computed under a simple population dynamics model, a Markov chain on the fold network is constructed, and the fold-averaged evolution rate is computed. The distribution of the logarithm of the evolution rates across distinct networks exhibits a peak with a long tail on the low rate side and resembles the universal empirical distribution of the evolutionary rates more closely than either distribution resembles the log-normal distribution. The results suggest that the universal distribution of the evolutionary rates of protein-coding genes is a direct consequence of the basic physics of protein folding.
Collapse
|
18
|
Allen LR, Krivov SV, Paci E. Analysis of the free-energy surface of proteins from reversible folding simulations. PLoS Comput Biol 2009; 5:e1000428. [PMID: 19593364 PMCID: PMC2700257 DOI: 10.1371/journal.pcbi.1000428] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2009] [Accepted: 06/01/2009] [Indexed: 11/23/2022] Open
Abstract
Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical λ-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins. The process of protein folding is a complex transition from a disordered to an ordered state. Here, we simulate a specific fast-folding protein at the point at which the native and denatured states are at equilibrium and show that obtaining an accurate description of the mechanisms of folding and unfolding is far from trivial. Using simple quantities which quantify the degree of native order is, in the case of this protein, clearly misleading. We show that an unbiased representation of the free-energy surface can be obtained; using such a representation we are able to redesign the landscape and thus modify, upon site-specific “mutations”, the folding and unfolding rates. This leads us to formulate a hypothesis to explain the very fast folding of many proteins.
Collapse
Affiliation(s)
- Lucy R. Allen
- School of Physics & Astronomy, University of Leeds, Leeds, United Kingdom
| | - Sergei V. Krivov
- Institute of Molecular and Cellular Biology, University of Leeds, Leeds, United Kingdom
- * E-mail: (SVK); (EP)
| | - Emanuele Paci
- School of Physics & Astronomy, University of Leeds, Leeds, United Kingdom
- Institute of Molecular and Cellular Biology, University of Leeds, Leeds, United Kingdom
- * E-mail: (SVK); (EP)
| |
Collapse
|
19
|
Frembgen-Kesner T, Elcock AH. Striking Effects of Hydrodynamic Interactions on the Simulated Diffusion and Folding of Proteins. J Chem Theory Comput 2009; 5:242-56. [DOI: 10.1021/ct800499p] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Affiliation(s)
| | - Adrian H. Elcock
- Department of Biochemistry, University of Iowa, Iowa City, Iowa 52242
| |
Collapse
|
20
|
Santillán M, Mackey MC. Quantitative approaches to the study of bistability in the lac operon of Escherichia coli. J R Soc Interface 2008; 5 Suppl 1:S29-39. [PMID: 18426771 PMCID: PMC2504340 DOI: 10.1098/rsif.2008.0086.focus] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2008] [Accepted: 03/27/2008] [Indexed: 01/25/2023] Open
Abstract
In this paper, the history and importance of the lac operon in the development of molecular and systems biology are briefly reviewed. We start by presenting a description of the regulatory mechanisms in this operon, taking into account the most recent discoveries. Then we offer a survey of the history of the lac operon, including the discovery of its main elements and the subsequent influence on the development of molecular and systems biology. Next the bistable behaviour of the operon is discussed, both with respect to its discovery and its molecular origin. A review of the literature in which this bistable phenomenon has been studied from a mathematical modelling viewpoint is then given. We conclude with some brief remarks.
Collapse
Affiliation(s)
- Moisés Santillán
- Centro de Investigación y Estudios Avanzados del IPN, Unidad Monterrey, Parque de Investigación e Innovación Tecnológica, Apodaca, NL, Mexico.
| | | |
Collapse
|
21
|
Chng CP, Yang LW. Coarse-grained models reveal functional dynamics--II. Molecular dynamics simulation at the coarse-grained level--theories and biological applications. Bioinform Biol Insights 2008; 2:171-85. [PMID: 19812774 PMCID: PMC2735960 DOI: 10.4137/bbi.s459] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Molecular dynamics (MD) simulation has remained the most indispensable tool in studying equilibrium/non-equilibrium conformational dynamics since its advent 30 years ago. With advances in spectroscopy accompanying solved biocomplexes in growing sizes, sampling their dynamics that occur at biologically interesting spatial/temporal scales becomes computationally intractable; this motivated the use of coarse-grained (CG) approaches. CG-MD models are used to study folding and conformational transitions in reduced resolution and can employ enlarged time steps due to the absence of some of the fastest motions in the system. The Boltzmann-Inversion technique, heavily used in parameterizing these models, provides a smoothed-out effective potential on which molecular conformation evolves at a faster pace thus stretching simulations into tens of microseconds. As a result, a complete catalytic cycle of HIV-1 protease or the assembly of lipid-protein mixtures could be investigated by CG-MD to gain biological insights. In this review, we survey the theories developed in recent years, which are categorized into Folding-based and Molecular-Mechanics-based. In addition, physical bases in the selection of CG beads/time-step, the choice of effective potentials, representation of solvent, and restoration of molecular representations back to their atomic details are systematically discussed.
Collapse
Affiliation(s)
- Choon-Peng Chng
- Institute of Molecular and Cellular Biosciences, University of Tokyo, Tokyo 113-0032, Japan.
| | | |
Collapse
|
22
|
Nelson ED, Grishin NV. Folding domain B of protein A on a dynamically partitioned free energy landscape. Proc Natl Acad Sci U S A 2008; 105:1489-93. [PMID: 18230738 PMCID: PMC2234171 DOI: 10.1073/pnas.0705707105] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2007] [Indexed: 11/18/2022] Open
Abstract
The B domain of staphylococcal protein A (BdpA) is a small helical protein that has been studied intensively in kinetics experiments and detailed computer simulations that include explicit water. The simulations indicate that BdpA needs to reorganize in crossing the transition barrier to facilitate folding its C-terminal helix (H3) onto the nucleus formed from helices H1 and H2. This process suggests frustration between two partially ordered forms of the protein, but recent varphi value measurements indicate that the transition structure is relatively constant over a broad range of temperatures. Here we develop a simplistic model to investigate the folding transition in which properties of the free energy landscape can be quantitatively compared with experimental data. The model is a continuation of the Muñoz-Eaton model to include the intermittency of contacts between structured parts of the protein, and the results compare variations in the landscape with denaturant and temperature to varphi value measurements and chevron plots of the kinetic rates. The topography of the model landscape (in particular, the feature of frustration) is consistent with detailed simulations even though variations in the varphi values are close to measured values. The transition barrier is smaller than indicated by the chevron data, but it agrees in order of magnitude with a similar alpha-carbon type of model. Discrepancies with the chevron plots are investigated from the point of view of solvent effects, and an approach is suggested to account for solvent participation in the model.
Collapse
Affiliation(s)
- Erik D. Nelson
- Howard Hughes Medical Institute and University of Texas Southwestern Medical Center, 6001 Forest Park Boulevard, Room ND10.124, Dallas, TX 75235-9050
| | - Nick V. Grishin
- Howard Hughes Medical Institute and University of Texas Southwestern Medical Center, 6001 Forest Park Boulevard, Room ND10.124, Dallas, TX 75235-9050
| |
Collapse
|
23
|
Matysiak S, Clementi C. Mapping folding energy landscapes with theory and experiment. Arch Biochem Biophys 2008; 469:29-33. [PMID: 17910943 DOI: 10.1016/j.abb.2007.08.019] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2007] [Accepted: 08/14/2007] [Indexed: 11/16/2022]
Abstract
The detailed characterization of the overall free energy landscape associated with the folding process of a protein is the ultimate goal in protein folding studies. Modern experimental techniques and all-atom simulations provide a way to obtain accurate thermodynamic and kinetic measurements, but they are oftentimes restricted to probe limited regions of a protein landscape. Although simplified protein models can access larger regions of the landscape, they are built on assumptions and approximations that can affect the accuracy of the results. We review here recent promising approaches that allow to combine the complementary strengths of theory and experiment for a more complete characterization of a protein folding landscape at multiple resolutions. Recent results and possible applications are discussed.
Collapse
Affiliation(s)
- Silvina Matysiak
- Department of Chemistry, Rice University, 6100 Main Street, Houston, TX 77005, USA.
| | | |
Collapse
|
24
|
Kenzaki H, Kikuchi M. Free-energy landscape of kinesin by a realistic lattice model. Proteins 2008; 71:389-95. [DOI: 10.1002/prot.21707] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
25
|
Clementi C. Coarse-grained models of protein folding: toy models or predictive tools? Curr Opin Struct Biol 2007; 18:10-5. [PMID: 18160277 DOI: 10.1016/j.sbi.2007.10.005] [Citation(s) in RCA: 229] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2007] [Accepted: 10/29/2007] [Indexed: 11/16/2022]
Abstract
Coarse-grained models are emerging as a practical alternative to all-atom simulations for the characterization of protein folding mechanisms over long time scales. While a decade ago minimalist toy models were mainly designed to test general hypotheses on the principles regulating protein folding, the latest coarse-grained models are increasingly realistic and can be used to characterize quantitatively the detailed folding mechanism of specific proteins. The ability of such models to reproduce the essential features of folding dynamics suggests that each single atomic degree of freedom is not by itself particularly relevant to folding and supports a statistical mechanical approach to characterize folding transitions. When combined with more refined models and with experimental studies, the systematic investigation of protein systems and complexes using coarse-grained models can advance our theoretical understanding of the actual organizing principles that emerge from the complex network of interactions among protein atomic constituents.
Collapse
Affiliation(s)
- Cecilia Clementi
- Department of Chemistry, Rice University, Houston, TX 77005, USA
| |
Collapse
|
26
|
Kim J, Keyes T. Influence of Go-Like Interactions on Global Shapes of Energy Landscapes in β-Barrel Forming Model Proteins: Inherent Structure Analysis and Statistical Temperature Molecular Dynamics Simulation. J Phys Chem B 2007; 112:954-66. [DOI: 10.1021/jp072872u] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Jaegil Kim
- Department of Chemistry, Boston University, Boston, Massachusetts 02215
| | - Thomas Keyes
- Department of Chemistry, Boston University, Boston, Massachusetts 02215
| |
Collapse
|
27
|
Consequences of localized frustration for the folding mechanism of the IM7 protein. Proc Natl Acad Sci U S A 2007; 104:19825-30. [PMID: 18077415 DOI: 10.1073/pnas.0709922104] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In the laboratory, IM7 has been found to have an unusual folding mechanism in which an "on-pathway" intermediate with nonnative interactions is formed. We show that this intermediate is a consequence of an unusual cluster of highly frustrated interactions in the native structure. This cluster is involved in the binding of IM7 to its target, Colicin E7. Redesign of residues in this cluster to eliminate frustration is predicted by simulations to lead to faster folding without the population of an intermediate ensemble.
Collapse
|
28
|
Tapia L, Tang X, Thomas S, Amato NM. Kinetics analysis methods for approximate folding landscapes. Bioinformatics 2007; 23:i539-48. [PMID: 17646341 DOI: 10.1093/bioinformatics/btm199] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Protein motions play an essential role in many biochemical processes. Lab studies often quantify these motions in terms of their kinetics such as the speed at which a protein folds or the population of certain interesting states like the native state. Kinetic metrics give quantifiable measurements of the folding process that can be compared across a group of proteins such as a wild-type protein and its mutants. RESULTS We present two new techniques, map-based master equation solution and map-based Monte Carlo simulation, to study protein kinetics through folding rates and population kinetics from approximate folding landscapes, models called maps. From these two new techniques, interesting metrics that describe the folding process, such as reaction coordinates, can also be studied. In this article we focus on two metrics, formation of helices and structure formation around tryptophan residues. These two metrics are often studied in the lab through circular dichroism (CD) spectra analysis and tryptophan fluorescence experiments, respectively. The approximated landscape models we use here are the maps of protein conformations and their associated transitions that we have presented and validated previously. In contrast to other methods such as the traditional master equation and Monte Carlo simulation, our techniques are both fast and can easily be computed for full-length detailed protein models. We validate our map-based kinetics techniques by comparing folding rates to known experimental results. We also look in depth at the population kinetics, helix formation and structure near tryptophan residues for a variety of proteins. AVAILABILITY We invite the community to help us enrich our publicly available database of motions and kinetics analysis by submitting to our server: http://parasol.tamu.edu/foldingserver/.
Collapse
Affiliation(s)
- Lydia Tapia
- Parasol Lab, Department of Computer Science, Texas A&M University, College Station, TX 77843, USA
| | | | | | | |
Collapse
|
29
|
Mossa A, Clementi C. Supersymmetric Langevin equation to explore free-energy landscapes. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2007; 75:046707. [PMID: 17501016 DOI: 10.1103/physreve.75.046707] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/24/2006] [Revised: 02/15/2007] [Indexed: 05/15/2023]
Abstract
The recently discovered supersymmetric generalizations of the Langevin dynamics and Kramers equation can be utilized for the exploration of free-energy landscapes of systems whose large time-scale separation hampers the usefulness of standard molecular dynamics techniques. The first realistic application is here presented. The system chosen is a minimalist model for a short alanine peptide exhibiting a helix-coil transition.
Collapse
Affiliation(s)
- Alessandro Mossa
- Department of Chemistry, Rice University, 6100 Main Street, Houston, Texas 77005, USA
| | | |
Collapse
|
30
|
Kim BJ, Mangala SL, Muralidhara B, Hayashi K. Fragment complementation for the co-refolding of Thermotoga maritima β-glucosidase by gene splitting at non-homologous region. Enzyme Microb Technol 2007. [DOI: 10.1016/j.enzmictec.2006.06.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
31
|
Abstract
Since the introduction of the induced-fit theory by D. E. Koshland Jr., it has been established that conformational motion invariably accompanies the execution of protein function. The catalytic activity of kinases, specifically, is associated with large conformational changes ( approximately 1 nm amplitude). In the case of guanylate kinase, upon substrate binding, the LID and nucleotide-monophosphate-binding domains are brought together and toward the CORE with large concerted movements about the alpha3 (helix 3) axis. However, whether the change in conformation mostly affects the catalytic rate or mostly increases binding affinities for one or the other substrate is unclear. We investigate this question using a nanotechnology approach based on mechanical stress. Using an "allosteric spring probe", we bias conformational states in favor of the "open" (substrate-free) conformation of the enzyme; the result is that the binding constant for the substrate guanosine monophosphate (GMP) is reduced by up to a factor of 10, whereas the binding constant for adenosine triphosphate (ATP) and the catalytic rate are essentially unaffected. The results show that the GMP-induced conformational change, which promotes catalysis, does not promote ATP binding, consistent with previous mutagenesis studies. Furthermore, they show that this conformational change is of the induced-fit type with respect to GMP binding (but not ATP binding). We elaborate on this point by proposing a quantitative criterion for the classification of conformational changes with respect to the induced-fit theory. More generally, these results show that the allosteric spring probe can be used to affect enzymatic activity in a continuously controlled manner, and also to affect specific steps of the reaction mechanism while leaving others unaffected. It is presumed that this will enable informative comparisons with the results of future molecular dynamics or statistical mechanics computations.
Collapse
Affiliation(s)
- Brian Choi
- Department of Physics and Astronomy, University of California, Los Angeles, California, USA
| | | |
Collapse
|
32
|
Mossa A, Pettini M, Clementi C. Hamiltonian dynamics of homopolymer chain models. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2006; 74:041805. [PMID: 17155086 DOI: 10.1103/physreve.74.041805] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/01/2006] [Indexed: 05/12/2023]
Abstract
The Hamiltonian dynamics of chains of nonlinearly coupled particles is numerically investigated in two and three dimensions. Simple, off-lattice homopolymer models are used to represent the interparticle potentials. Time averages of observables numerically computed along dynamical trajectories are found to reproduce results given by the statistical mechanics of homopolymer models. The dynamical treatment, however, indicates a nontrivial transition between regimes of slow and fast phase space mixing. Such a transition is inaccessible to a statistical mechanical treatment and reflects a bimodality in the relaxation of time averages to corresponding ensemble averages. It is also found that a change in the energy dependence of the largest Lyapunov exponent indicates the Theta transition between filamentary and globular polymer configurations, clearly detecting the transition even for a finite number of particles.
Collapse
Affiliation(s)
- Alessandro Mossa
- Department of Chemistry, Rice University, 6100 Main Street, Houston, Texas 77005, USA
| | | | | |
Collapse
|
33
|
Matysiak S, Clementi C. Minimalist protein model as a diagnostic tool for misfolding and aggregation. J Mol Biol 2006; 363:297-308. [PMID: 16959265 DOI: 10.1016/j.jmb.2006.07.088] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2006] [Revised: 07/25/2006] [Accepted: 07/28/2006] [Indexed: 11/24/2022]
Abstract
We propose a realistic coarse-grained protein model and a technique to "anchor" the model to available experimental data. We apply this procedure to characterize the effect of multiple mutations on the folding mechanism of protein S6. We show that the mutation of a few "gatekeeper" residues triggers significant changes on the folding landscape of S6. These results suggest that gatekeeper residues control the flexibility of critical regions of S6, that in turn regulates the delicate balance between folding and aggregation. Although obtained with a minimalist protein model, these results are fully consistent with experimental evidence and offer a clue to understand the interplay between folding and aggregation in protein S6.
Collapse
Affiliation(s)
- Silvina Matysiak
- Department of Chemistry, Rice University, 6100 Main Street, Houston, TX 77005, USA
| | | |
Collapse
|
34
|
Elcock AH. Molecular simulations of cotranslational protein folding: fragment stabilities, folding cooperativity, and trapping in the ribosome. PLoS Comput Biol 2006; 2:e98. [PMID: 16789821 PMCID: PMC1523309 DOI: 10.1371/journal.pcbi.0020098.eor] [Citation(s) in RCA: 49] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2006] [Accepted: 06/14/2006] [Indexed: 11/19/2022] Open
Abstract
Although molecular simulation methods have yielded valuable insights into mechanistic aspects of protein refolding in vitro, they have up to now not been used to model the folding of proteins as they are actually synthesized by the ribosome. To address this issue, we report here simulation studies of three model proteins: chymotrypsin inhibitor 2 (CI2), barnase, and Semliki forest virus protein (SFVP), and directly compare their folding during ribosome-mediated synthesis with their refolding from random, denatured conformations. To calibrate the methodology, simulations are first compared with in vitro data on the folding stabilities of N-terminal fragments of CI2 and barnase; the simulations reproduce the fact that both the stability and thermal folding cooperativity increase as fragments increase in length. Coupled simulations of synthesis and folding for the same two proteins are then described, showing that both fold essentially post-translationally, with mechanisms effectively identical to those for refolding. In both cases, confinement of the nascent polypeptide chain within the ribosome tunnel does not appear to promote significant formation of native structure during synthesis; there are however clear indications that the formation of structure within the nascent chain is sensitive to location within the ribosome tunnel, being subject to both gain and loss as the chain lengthens. Interestingly, simulations in which CI2 is artificially stabilized show a pronounced tendency to become trapped within the tunnel in partially folded conformations: non-cooperative folding, therefore, appears in the simulations to exert a detrimental effect on the rate at which fully folded conformations are formed. Finally, simulations of the two-domain protease module of SFVP, which experimentally folds cotranslationally, indicate that for multi-domain proteins, ribosome-mediated folding may follow different pathways from those taken during refolding. Taken together, these studies provide a first step toward developing more realistic methods for simulating protein folding as it occurs in vivo.
Collapse
Affiliation(s)
- Adrian H Elcock
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, USA.
| |
Collapse
|
35
|
Das P, Moll M, Stamati H, Kavraki LE, Clementi C. Low-dimensional, free-energy landscapes of protein-folding reactions by nonlinear dimensionality reduction. Proc Natl Acad Sci U S A 2006; 103:9885-90. [PMID: 16785435 PMCID: PMC1502548 DOI: 10.1073/pnas.0603553103] [Citation(s) in RCA: 217] [Impact Index Per Article: 12.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
The definition of reaction coordinates for the characterization of a protein-folding reaction has long been a controversial issue, even for the "simple" case in which one single free-energy barrier separates the folded and unfolded ensemble. We propose a general approach to this problem to obtain a few collective coordinates by using nonlinear dimensionality reduction. We validate the usefulness of this method by characterizing the folding landscape associated with a coarse-grained protein model of src homology 3 as sampled by molecular dynamics simulations. The folding free-energy landscape projected on the few relevant coordinates emerging from the dimensionality reduction can correctly identify the transition-state ensemble of the reaction. The first embedding dimension efficiently captures the evolution of the folding process along the main folding route. These results clearly show that the proposed method can efficiently find a low-dimensional representation of a complex process such as protein folding.
Collapse
Affiliation(s)
| | | | | | - Lydia E. Kavraki
- Computer Science, and
- Bioengineering, Rice University, Houston, TX 77005; and
- Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston, TX 77030
- To whom correspondence may be addressed. E-mail: or
| | - Cecilia Clementi
- Departments of *Chemistry
- Bioengineering, Rice University, Houston, TX 77005; and
- To whom correspondence may be addressed. E-mail: or
| |
Collapse
|
36
|
Elcock AH. Molecular simulations of cotranslational protein folding: fragment stabilities, folding cooperativity, and trapping in the ribosome. PLoS Comput Biol 2006. [PMID: 16789821 PMCID: PMC1523309 DOI: 10.1371/journal.pcbi.0020098] [Citation(s) in RCA: 99] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Although molecular simulation methods have yielded valuable insights into mechanistic aspects of protein refolding in vitro, they have up to now not been used to model the folding of proteins as they are actually synthesized by the ribosome. To address this issue, we report here simulation studies of three model proteins: chymotrypsin inhibitor 2 (CI2), barnase, and Semliki forest virus protein (SFVP), and directly compare their folding during ribosome-mediated synthesis with their refolding from random, denatured conformations. To calibrate the methodology, simulations are first compared with in vitro data on the folding stabilities of N-terminal fragments of CI2 and barnase; the simulations reproduce the fact that both the stability and thermal folding cooperativity increase as fragments increase in length. Coupled simulations of synthesis and folding for the same two proteins are then described, showing that both fold essentially post-translationally, with mechanisms effectively identical to those for refolding. In both cases, confinement of the nascent polypeptide chain within the ribosome tunnel does not appear to promote significant formation of native structure during synthesis; there are however clear indications that the formation of structure within the nascent chain is sensitive to location within the ribosome tunnel, being subject to both gain and loss as the chain lengthens. Interestingly, simulations in which CI2 is artificially stabilized show a pronounced tendency to become trapped within the tunnel in partially folded conformations: non-cooperative folding, therefore, appears in the simulations to exert a detrimental effect on the rate at which fully folded conformations are formed. Finally, simulations of the two-domain protease module of SFVP, which experimentally folds cotranslationally, indicate that for multi-domain proteins, ribosome-mediated folding may follow different pathways from those taken during refolding. Taken together, these studies provide a first step toward developing more realistic methods for simulating protein folding as it occurs in vivo.
Collapse
Affiliation(s)
- Adrian H Elcock
- Department of Biochemistry, University of Iowa, Iowa City, Iowa, USA.
| |
Collapse
|
37
|
Nelson ED, Grishin NV. Scaling approach to the folding kinetics of large proteins. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2006; 73:011904. [PMID: 16486182 DOI: 10.1103/physreve.73.011904] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/15/2005] [Revised: 11/11/2005] [Indexed: 05/06/2023]
Abstract
We study a nucleation-growth model of protein folding and extend it to describe larger proteins with multiple folding units. The model is of one of an extremely simple type in which amino acids are allowed just two states--either folded (frozen) or unfolded. Its energetics are heterogeneous and Gō-like, the energy being defined in terms of the number of atom-to-atom contacts that would occur between frozen amino acids in the native crystal structure of the protein. Each collective state of the amino acids is intended to represent a small free energy microensemble consisting of the possible configurations of unfolded loops, open segments, and free ends constrained by the cross-links that form between folded parts of the molecule. We approximate protein free energy landscapes by an infinite subset of these microensemble topologies in which loops and open unfolded segments can be viewed roughly as independent objects for the purpose of calculating their entropy, and we develop a means to implement this approximation in Monte Carlo simulations. We show that this approach describes transition state structures (phi values) more accurately and identifies folding intermediates that were unavailable to previous versions of the model that restricted the number of loops and nuclei.
Collapse
Affiliation(s)
- Erik D Nelson
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 6001 Forest Park Blvd., Room ND10.124, Dallas, TX 75235-9050, USA.
| | | |
Collapse
|
38
|
Wilson CJ, Das P, Clementi C, Matthews KS, Wittung-Stafshede P. The experimental folding landscape of monomeric lactose repressor, a large two-domain protein, involves two kinetic intermediates. Proc Natl Acad Sci U S A 2005; 102:14563-8. [PMID: 16203983 PMCID: PMC1253568 DOI: 10.1073/pnas.0505808102] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To probe the experimental folding behavior of a large protein with complex topology, we created a monomeric variant of the lactose repressor protein (MLAc), a well characterized tetrameric protein that regulates transcription of the lac operon. Purified MLAc is folded, fully functional, and binds the inducer isopropyl beta-d-thiogalactoside with the same affinity as wild-type LacI. Equilibrium unfolding of MLAc induced by the chemical denaturant urea is a reversible, apparent two-state process (pH 7.5, 20 degrees C). However, time-resolved experiments demonstrate that unfolding is single-exponential, whereas refolding data indicate two transient intermediates. The data reveal the initial formation of a burst-phase (tau < ms) intermediate that corresponds to approximately 50% of the total secondary-structure content. This step is followed by a rearrangement reaction that is rate-limited by an unfolding process (tau approximately 3 s; pH 7.5, 20 degrees C) and results in a second intermediate. This MLAc intermediate converts to the native structure (tau approximately 30 s; pH 7.5, 20 degrees C). Remarkably, the experimental folding-energy landscape for MLAc is in excellent agreement with theoretical predictions using a simple topology-based C(alpha)-model as presented in a companion article in this issue.
Collapse
Affiliation(s)
- Corey J Wilson
- Department of Biochemistry and Cell Biology, Keck Center for Structural Computational Biology, and Department of Chemistry, Rice University, 6100 Main Street, Houston, TX 77251, USA
| | | | | | | | | |
Collapse
|