51
|
Han W, Wan CK, Jiang F, Wu YD. PACE Force Field for Protein Simulations. 1. Full Parameterization of Version 1 and Verification. J Chem Theory Comput 2010; 6:3373-89. [DOI: 10.1021/ct1003127] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Affiliation(s)
- Wei Han
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Cheuk-Kin Wan
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Fan Jiang
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| | - Yun-Dong Wu
- Department of Chemistry, The Hong Kong University of Science & Technology, Clear Water Bay, Kowloon, Hong Kong, China, School of Chemical Biology and Biotechnology, Laboratory of Chemical Genomics, Peking University Shenzhen Graduate School, Shenzhen, China, and College of Chemistry, Peking University, Beijing, China
| |
Collapse
|
52
|
Kozłowska U, Liwo A, Scheraga HA. Determination of side-chain-rotamer and side-chain and backbone virtual-bond-stretching potentials of mean force from AM1 energy surfaces of terminally-blocked amino-acid residues, for coarse-grained simulations of protein structure and folding. I. The method. J Comput Chem 2010; 31:1143-53. [PMID: 20073062 DOI: 10.1002/jcc.21399] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
In this and the accompanying article, we report the development of new physics-based side-chain-rotamer and virtual-bond-deformation potentials which now replace the respective statistical potentials used so far in our physics-based united-reside UNRES force field for large-scale simulations of protein structure and dynamics. In this article, we describe the methodology for determining the corresponding potentials of mean force (PMF's) from the energy surfaces of terminally-blocked amino-acid residues calculated with the AM1 quantum-mechanical semiempirical method. The approach is based on minimization of the AM1 energy for fixed values of the angles lambda for rotation of the peptide groups about the C(alpha)...C(alpha) virtual bonds, and for fixed values of the side-chain dihedral angles chi, which formed a multidimensional grid. A harmonic-approximation approach was developed to extrapolate from the energy at a given grid point to other points of the conformational space to compute the respective contributions to the PMF. To test the applicability of the harmonic approximation, the rotamer PMF's of alanine and valine obtained with this approach have been compared with those obtained by using a Metropolis Monte Carlo method. The PMF surfaces computed with the harmonic approximation are more rugged and have more pronounced minima than the MC-calculated surfaces but the harmonic-approximation- and MC-calculated PMF values are linearly correlated. The potentials derived with the harmonic approximation are, therefore, appropriate for UNRES for which the weights (scaling factors) of the energy terms are determined by force-field optimization for foldability.
Collapse
Affiliation(s)
- Urszula Kozłowska
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA.
| | | | | |
Collapse
|
53
|
Liwo A, Ołdziej S, Czaplewski C, Kleinerman DS, Blood P, Scheraga HA. Implementation of molecular dynamics and its extensions with the coarse-grained UNRES force field on massively parallel systems; towards millisecond-scale simulations of protein structure, dynamics, and thermodynamics. J Chem Theory Comput 2010; 6:890-909. [PMID: 20305729 PMCID: PMC2839251 DOI: 10.1021/ct9004068] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
We report the implementation of our united-residue UNRES force field for simulations of protein structure and dynamics with massively parallel architectures. In addition to coarse-grained parallelism already implemented in our previous work, in which each conformation was treated by a different task, we introduce a fine-grained level in which energy and gradient evaluation are split between several tasks. The Message Passing Interface (MPI) libraries have been utilized to construct the parallel code. The parallel performance of the code has been tested on a professional Beowulf cluster (Xeon Quad Core), a Cray XT3 supercomputer, and two IBM BlueGene/P supercomputers with canonical and replica-exchange molecular dynamics. With IBM BlueGene/P, about 50 % efficiency and 120-fold speed-up of the fine-grained part was achieved for a single trajectory of a 767-residue protein with use of 256 processors/trajectory. Because of averaging over the fast degrees of freedom, UNRES provides an effective 1000-fold speed-up compared to the experimental time scale and, therefore, enables us to effectively carry out millisecond-scale simulations of proteins with 500 and more amino-acid residues in days of wall-clock time.
Collapse
Affiliation(s)
- Adam Liwo
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
| | - Stanisław Ołdziej
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
- Laboratory of Biopolymer Structure, Intercollegiate Faculty of Biotechology, University of Gdańsk, Medical University of Gdańsk, Kładki 24, 80-822 Gdańsk, Poland
| | - Cezary Czaplewski
- Faculty of Chemistry, University of Gdańsk, Sobieskiego 18, 80-952 Gdańsk, Poland
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
| | - Dana S. Kleinerman
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
| | - Philip Blood
- Pittsburgh Supercomputing Center, Carnegie Mellon University, University of Pittsburgh, 300 S. Craig Street, Pittsburgh, PA 15213, U.S.A
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, U.S.A
| |
Collapse
|
54
|
Kolinski M, Filipek S. Study of a structurally similar kappa opioid receptor agonist and antagonist pair by molecular dynamics simulations. J Mol Model 2010; 16:1567-76. [PMID: 20195661 DOI: 10.1007/s00894-010-0678-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2010] [Accepted: 02/02/2010] [Indexed: 01/12/2023]
Abstract
Among the structurally similar guanidinonaltrindole (GNTI) compounds, 5'-GNTI is an antagonist while 6'-GNTI is an agonist of the kappaOR opioid receptor. To explore how a subtle alteration of the ligand structure influences the receptor activity, we investigated two concurrent processes: the final steps of ligand binding at the receptor binding site and the initial steps of receptor activation. To trace these early activation steps, the membranous part of the receptor was built on an inactive receptor template while the extracellular loops were built using the ab initio CABS method. We used the simulated annealing procedure for ligand docking and all-atom molecular dynamics simulations to determine the immediate changes in the structure of the ligand-receptor complex. The binding of an agonist, in contrast to an antagonist, induced the breakage of the "3-7 lock" between helices TM3 and TM7. We also observed an action of the extended rotamer toggle switch which suggests that those two switches are interdependent.
Collapse
Affiliation(s)
- Michal Kolinski
- International Institute of Molecular and Cell Biology, 4 Trojdena St, 02-109, Warsaw, Poland
| | | |
Collapse
|
55
|
Ha-Duong T. Protein Backbone Dynamics Simulations Using Coarse-Grained Bonded Potentials and Simplified Hydrogen Bonds. J Chem Theory Comput 2010; 6:761-73. [DOI: 10.1021/ct900408s] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/27/2023]
Affiliation(s)
- Tap Ha-Duong
- Laboratoire Analyse et Modélisation pour la Biologie et l’Environnement Université d’Evry-Val-d’Essonne Rue du Pere André Jarlan, 91025 Evry Cedex, France
| |
Collapse
|
56
|
Chen Y, Wang M, Zhang Q, Liu J. Construction of an implicit membrane environment for the lattice Monte Carlo simulation of transmembrane protein. Biophys Chem 2009; 147:35-41. [PMID: 20079964 PMCID: PMC7117040 DOI: 10.1016/j.bpc.2009.12.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2009] [Revised: 12/17/2009] [Accepted: 12/18/2009] [Indexed: 02/01/2023]
Abstract
Due to the complexity of biological membrane, computer simulation of transmembrane protein's folding is challenging. In this paper, an implicit biological membrane environment has been constructed in lattice space, in which the lipid chains and water molecules were represented by the unoccupied lattice sites. The biological membrane was characterized with three features: stronger hydrogen bonding interaction, membrane lateral pressure, and lipophobicity index for the amino acid residues. In addition to the hydrocarbon core spanning region and the water solution, the lipid interface has also been represented in this implicit membrane environment, which was proved to be effective for the transmembrane protein's folding. The associated Monte Carlo simulations have been performed for SARS-CoV E protein and M2 protein segment (residues 18–60) of influenza A virus. It was found that the coil–helix transition of the transmembrane segment occurred earlier than the coil–globule transition of the two terminal domains. The folding process and final orientation of the amphipathic helical block in water solution are obviously influenced by its corresponding hydrophobicity/lipophobicity. Therefore, this implicit membrane environment, though in lattice space, can make an elaborate balance between different driving forces for the membrane protein's folding, thus offering a potential means for the simulation of transmembrane protein oligomers in feasible time.
Collapse
Affiliation(s)
- Yantao Chen
- Shenzhen Key Laboratory of Functional Polymer, College of Chemistry and Chemical Engineering, Shenzhen University, Shenzhen 518060, China.
| | | | | | | |
Collapse
|
57
|
DeVane R, Shinoda W, Moore PB, Klein ML. A Transferable Coarse Grain Non-bonded Interaction Model For Amino Acids. J Chem Theory Comput 2009; 5:2115-2124. [PMID: 20161179 PMCID: PMC2725330 DOI: 10.1021/ct800441u] [Citation(s) in RCA: 104] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
The large quantity of protein sequences being generated from genomic data has greatly outpaced the throughput of experimental protein structure determining methods and consequently brought urgency to the need for accurate protein structure prediction tools. Reduced resolution, or coarse grained (CG) models, have become a mainstay in computational protein structure prediction perfoming among the best tools available. The quest for high quality generalized CG models presents an extremely challenging yet popular endeavor. To this point, a CG based interaction potential is presented here for the naturally occurring amino acids. In the present approach, three to four heavy atoms and associated hydrogens are condensed into a single CG site. The parameterization of the site-site interaction potential relies on experimental data thus providing a novel approach that is neither based on all-atom (AA) simulations nor experimental protein structural data. Specifically, intermolecular potentials, which are based on Lennard-Jones (LJ) style functional forms, are parameterized using thermodynamic data including surface tension and density. Using this approach, an amino acid potential dataset has been developed for use in modeling peptides and proteins. The potential is evaluated here by comparing the solvent accessible surface area (SASA) to AA representations and ranking of protein decoy data sets provided by Decoys 'R' Us. The model is shown to perform very well compared to other existing prediction models for these properties.
Collapse
Affiliation(s)
- Russell DeVane
- Center for Molecular Modeling and Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, PA 19104-6323
| | - Wataru Shinoda
- Research Institute for Computational Sciences (RICS), National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba Central 2, Umezono 1-1-1, Tsukuba, Ibaraki 305-8568, Japan
| | - Preston B. Moore
- Department of Chemistry and Biochemistry, University of the Sciences in Philadelphia, 600 s. 43rd Street, Philadelphia, PA 19104
| | - Michael L. Klein
- Center for Molecular Modeling and Department of Chemistry, University of Pennsylvania, 231 South 34th Street, Philadelphia, PA 19104-6323
| |
Collapse
|
58
|
Shen H, Liwo A, Scheraga HA. An improved functional form for the temperature scaling factors of the components of the mesoscopic UNRES force field for simulations of protein structure and dynamics. J Phys Chem B 2009; 113:8738-44. [PMID: 19480420 PMCID: PMC2766665 DOI: 10.1021/jp901788q] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Coarse-grained or mesoscopic models of proteins and the corresponding force fields are of great importance because they enable us to reduce the folding simulation time by several orders of magnitude compared to the all-atom approach and, consequently, reach the millisecond time scale of simulations. In the coarse-grained UNRES model for simulations of protein structure and dynamics, developed by our group, each amino acid residue is represented by a united side chain and a united peptide group located in the middle between the two neighboring alpha-carbon atoms, which assist only in the definition of the geometry. The prototype of the UNRES force field has been defined as a potential of mean force or restricted free-energy function corresponding to averaging out the degrees of freedom not present in the coarse-grained representation, which has further been approximated by a truncated Kubo cumulant series to enable us to derive analytical expressions for the corresponding terms. This force field should depend on temperature, and in its simplest form, a term corresponding to the cumulant of order n should be multiplied by f(n) = 1/T(n-1). The temperature dependence has been introduced in recent work ( J. Phys. Chem. B , 2007 , 111 , 260 - 285 ), and in order to prevent too steep a variation with temperature, the factors at the nth order cumulant terms were assumed to have a form f(n) = ln[exp(1) + exp(-1)]/ln{exp[(T/T(0))(n-1)] + exp[-(T/T(0))(n-1)]}, where T(0) = 300 K is the reference temperature. In this work, we have introduced a modified scaling factor f(n) = ln[exp(c) + exp(-c)]/ln{exp[c(T/T(0))(n-1)] + exp[-c(T/T(0))(n-1)]}, where c is an adjustable parameter, and determined c by fitting the analytical approximation of the temperature dependence of the virtual bond torsional term corresponding to rotation about the C(alpha)...C(alpha) virtual bond in terminally blocked dialanine to the respective potential of mean force calculated from the MP2/6-31G(d, p) ab initio energy surfaces of terminally blocked alanine (Ac-Ala-NHMe) and, independently, by optimizing it to obtain a sharp heat capacity curve and the lowest ensemble-averaged root-mean-square deviation over the C(alpha) atoms of 1GAB used as a training protein. Both approaches gave consistent results, and c = 1.4 has been selected as the optimal value of this parameter. The force field with the new temperature scaling factors has been optimized using 1GAB as the training protein. The new force field has been tested on a series of medium size alpha-helical proteins and found to perform better than that with the original temperature scaling factors.
Collapse
Affiliation(s)
- Hujun Shen
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca NY, USA 14853
| | - Adam Liwo
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca NY, USA 14853
| | - Harold A. Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca NY, USA 14853
| |
Collapse
|
59
|
Sobolewski E, Makowski M, Oldziej S, Czaplewski C, Liwo A, Scheraga HA. Towards temperature-dependent coarse-grained potentials of side-chain interactions for protein folding simulations. I: molecular dynamics study of a pair of methane molecules in water at various temperatures. Protein Eng Des Sel 2009; 22:547-52. [PMID: 19556395 DOI: 10.1093/protein/gzp028] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
By means of molecular dynamics simulations of a pair of methane molecules in a TIP3P periodic water box with the NVT scheme at six temperatures and, additionally, the NPT scheme at three temperatures ranging from T = 283 to 373 K, we determined the potential of mean force (PMF) of pairs of interacting methane molecules in water as functions of distance between the methane molecules. The PMFs converge to a single baseline only for r> 11 A at all temperatures. The curves of the dimensionless PMF obtained at different temperatures with the NVT scheme overlap almost perfectly in the region of the contact minimum and still very well in the regions of the desolvation maximum and the solvent-separated minimum, which suggests that the temperature-dependent hydrophobic interaction potentials at constant volume in united-residue force fields can be obtained by scaling the respective dimensionless potentials by RT, R being the universal gas constant. For the dimensionless potentials of mean force obtained with the NPT scheme, the depth of the contact minimum increases, whereas the height of the desolvation maximum and the depth of the solvent-separated minimum decrease with temperature, in agreement with results reported in the literature.
Collapse
Affiliation(s)
- Emil Sobolewski
- Laboratory of Biopolymer Structure, Intercollegiate Faculty of Biotechnology, University of Gdańsk, Medical University of Gdańsk, Kładki 24, 80-822 Gdańsk
| | | | | | | | | | | |
Collapse
|
60
|
Wüst T, Landau DP. Versatile approach to access the low temperature thermodynamics of lattice polymers and proteins. PHYSICAL REVIEW LETTERS 2009; 102:178101. [PMID: 19518836 DOI: 10.1103/physrevlett.102.178101] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/18/2008] [Indexed: 05/27/2023]
Abstract
We show that Wang-Landau sampling, combined with suitable Monte Carlo trial moves, provides a powerful method for both the ground state search and the determination of the density of states for the hydrophobic-polar (HP) protein model and the interacting self-avoiding walk (ISAW) model for homopolymers. We obtain accurate estimates of thermodynamic quantities for HP sequences with >100 monomers and for ISAWs up to >500 monomers. Our procedure possesses an intrinsic simplicity and overcomes the limitations inherent in more tailored approaches making it interesting for a broad range of protein and polymer models.
Collapse
Affiliation(s)
- Thomas Wüst
- Center for Simulational Physics, The University of Georgia, Athens, Georgia 30602, USA.
| | | |
Collapse
|
61
|
Taylor MP, Paul W, Binder K. All-or-none proteinlike folding transition of a flexible homopolymer chain. PHYSICAL REVIEW. E, STATISTICAL, NONLINEAR, AND SOFT MATTER PHYSICS 2009; 79:050801. [PMID: 19518407 DOI: 10.1103/physreve.79.050801] [Citation(s) in RCA: 55] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2009] [Indexed: 05/27/2023]
Abstract
Here we report a first-order all-or-none transition from an expanded coil to a compact crystallite for a flexible polymer chain. Wang-Landau sampling is used to construct the complete density of states for square-well chains up to length 256. Analysis within both the microcanonical and canonical ensembles shows a direct freezing transition for finite length chains with sufficiently short-range interactions. This type of transition is a distinctive feature of "one-step" protein folding and our findings demonstrate that a simple homopolymer model can exhibit protein-folding thermodynamics.
Collapse
Affiliation(s)
- Mark P Taylor
- Department of Physics, Hiram College, Hiram, Ohio 44234, USA
| | | | | |
Collapse
|
62
|
Clarke OJ, Parker MJ. Identification of amyloidogenic peptide sequences using a coarse-grained physicochemical model. J Comput Chem 2009; 30:621-30. [PMID: 18711722 DOI: 10.1002/jcc.21085] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Cross-beta amyloid is implicated in over 20 human diseases. Experiments suggest that specific sequence elements within amyloidogenic proteins play a major role in seeding amyloid formation. Identifying these seeding sequences is important for rationalizing the molecular mechanisms of amyloid formation and for elaborating therapeutic strategies that target amyloid. Theoretical techniques play an important role in facilitating the identification and structural characterization of putative seeding sequences; most amyloid species are not amenable to high resolution experimental structure techniques. In this study we have combined a coarse-grained physicochemical protein model with a highly efficient Monte Carlo sampling technique to identify amyloidogenic sequences in four proteins for which respective experimental peptide fragmentation data exist. Peptide sequences were defined as amyloidogenic if the ensemble structure predicted for three interacting peptides described a stable and regular three-stranded beta-sheet. For such peptides, free energies were calculated to provide a measure of amyloid propensity. The overall agreement between the experimental and predicted data is good, and we correctly identify several self-recognition motifs proposed to define the cross-beta amyloid fibril architectures of two of the proteins. Our results compare very favorably with those obtained using atomistic molecular dynamics methods, though our simulations are 30-40 times faster.
Collapse
Affiliation(s)
- Oliver J Clarke
- Institute of Molecular and Cellular Biology & Astbury Centre for Structural Molecular Biology, University of Leeds, Leeds LS2 9JT, United Kingdom
| | | |
Collapse
|
63
|
Podtelezhnikov AA, Wild DL. CRANKITE: A fast polypeptide backbone conformation sampler. SOURCE CODE FOR BIOLOGY AND MEDICINE 2008; 3:12. [PMID: 18577227 PMCID: PMC2443148 DOI: 10.1186/1751-0473-3-12] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/04/2008] [Accepted: 06/24/2008] [Indexed: 11/10/2022]
Abstract
BACKGROUND CRANKITE is a suite of programs for simulating backbone conformations of polypeptides and proteins. The core of the suite is an efficient Metropolis Monte Carlo sampler of backbone conformations in continuous three-dimensional space in atomic details. METHODS In contrast to other programs relying on local Metropolis moves in the space of dihedral angles, our sampler utilizes local crankshaft rotations of rigid peptide bonds in Cartesian space. RESULTS The sampler allows fast simulation and analysis of secondary structure formation and conformational changes for proteins of average length.
Collapse
Affiliation(s)
- Alexei A Podtelezhnikov
- Keck Graduate Institute of Applied Life Sciences, Claremont, CA, USA
- Department of Physics, Michigan Technological University, Houghton, MI, USA
| | - David L Wild
- Keck Graduate Institute of Applied Life Sciences, Claremont, CA, USA
- Systems Biology Centre, University of Warwick, Coventry, UK
| |
Collapse
|
64
|
Zheng W, Andrec M, Gallicchio E, Levy RM. Simple continuous and discrete models for simulating replica exchange simulations of protein folding. J Phys Chem B 2008; 112:6083-93. [PMID: 18251533 PMCID: PMC2978075 DOI: 10.1021/jp076377+] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The efficiency of temperature replica exchange (RE) simulations hinge on their ability to enhance conformational sampling at physiological temperatures by taking advantage of more rapid conformational interconversions at higher temperatures. While temperature RE is a parallel simulation technique that is relatively straightforward to implement, kinetics in the RE ensemble is complicated, and there is much to learn about how best to employ RE simulations in computational biophysics. Protein folding rates often slow down above a certain temperature due to entropic bottlenecks. This "anti-Arrhenius" behavior represents a challenge for RE. However, it is far from straightforward to systematically explore the impact of this on RE by brute force molecular simulations, since RE simulations of protein folding are very difficult to converge. To understand some of the basic mechanisms that determine the efficiency of RE, it is useful to study simplified low dimensionality systems that share some of the key characteristics of molecular systems. Results are presented concerning the efficiency of temperature RE on a continuous two-dimensional potential that contains an entropic bottleneck. Optimal efficiency was obtained when the temperatures of the replicas did not exceed the temperature at which the harmonic mean of the folding and unfolding rates is maximized. This confirms a result we previously obtained using a discrete network model of RE. Comparison of the efficiencies obtained using the continuous and discrete models makes it possible to identify non-Markovian effects, which slow down equilibration of the RE ensemble on the more complex continuous potential. In particular, the rate of temperature diffusion and also the efficiency of RE is limited by the time scale of conformational rearrangements within free energy basins.
Collapse
Affiliation(s)
- Weihua Zheng
- Department of Physics and Astronomy Rutgers, the State University of New Jersey, 136 Frelinghuysen Road, Piscataway NJ 08854, USA
| | - Michael Andrec
- Department of Chemistry and Chemical Biology and BioMaPS Institute for Quantitative Biology Rutgers, the State University of New Jersey, 610 Taylor Road, Piscataway NJ 08854, USA
| | - Emilio Gallicchio
- Department of Chemistry and Chemical Biology and BioMaPS Institute for Quantitative Biology Rutgers, the State University of New Jersey, 610 Taylor Road, Piscataway NJ 08854, USA
| | - Ronald M. Levy
- Department of Chemistry and Chemical Biology and BioMaPS Institute for Quantitative Biology Rutgers, the State University of New Jersey, 610 Taylor Road, Piscataway NJ 08854, USA
| |
Collapse
|
65
|
Pierri CL, De Grassi A, Turi A. Lattices for ab initio protein structure prediction. Proteins 2008; 73:351-61. [DOI: 10.1002/prot.22070] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
|
66
|
Minary P, Levitt M. Probing protein fold space with a simplified model. J Mol Biol 2008; 375:920-33. [PMID: 18054792 PMCID: PMC2254652 DOI: 10.1016/j.jmb.2007.10.087] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2007] [Revised: 10/15/2007] [Accepted: 10/31/2007] [Indexed: 11/24/2022]
Abstract
We probe the stability and near-native energy landscape of protein fold space using powerful conformational sampling methods together with simple reduced models and statistical potentials. Fold space is represented by a set of 280 protein domains spanning all topological classes and having a wide range of lengths (33-300 residues) amino acid composition and number of secondary structural elements. The degrees of freedom are taken as the loop torsion angles. This choice preserves the native secondary structure but allows the tertiary structure to change. The proteins are represented by three-point per residue, three-dimensional models with statistical potentials derived from a knowledge-based study of known protein structures. When this space is sampled by a combination of parallel tempering and equi-energy Monte Carlo, we find that the three-point model captures the known stability of protein native structures with stable energy basins that are near-native (all alpha: 4.77 A, all beta: 2.93 A, alpha/beta: 3.09 A, alpha+beta: 4.89 A on average and within 6 A for 71.41%, 92.85%, 94.29% and 64.28% for all-alpha, all-beta, alpha/beta and alpha+beta, classes, respectively). Denatured structures also occur and these have interesting structural properties that shed light on the different landscape characteristics of alpha and beta folds. We find that alpha/beta proteins with alternating alpha and beta segments (such as the beta-barrel) are more stable than proteins in other fold classes.
Collapse
Affiliation(s)
- Peter Minary
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA 94305, USA.
| | | |
Collapse
|
67
|
Michel J, Orsi M, Essex JW. Prediction of partition coefficients by multiscale hybrid atomic-level/coarse-grain simulations. J Phys Chem B 2008; 112:657-60. [PMID: 18163606 DOI: 10.1021/jp076142y] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Coarse-grain models are becoming an increasingly important tool in computer simulations of a wide variety of molecular processes. In many instances it is, however, desirable to describe key portions of a molecular system at the atomic level. There is therefore a strong interest in the development of simulation methodologies that allow representations of matter with mixed granularities in a multiscale fashion. We report here a strategy to conduct mixed atomic-level and coarse-grain simulations of molecular systems with a recently developed coarse-grain model. The methodology is validated by computing partition coefficients of small molecules described in atomic detail and solvated by water or octane, both of which are represented by coarse-grain models. Because the present coarse-grain force field retains electrostatic interactions, the simplified solvent particles can interact realistically with the all-atom solutes. The partition coefficients computed by this approach rival the accuracy of fully atomistic simulations and are obtained at a fraction of their computational cost. The present methodology is simple, robust and applicable to a wide variety of molecular systems.
Collapse
|
68
|
Abstract
A short review of the results of molecular modeling of prion disease is presented in this chapter. According to the "one-protein theory" proposed by Prusiner, prion proteins are misfolded naturally occurring proteins, which, on interaction with correctly folded proteins may induce misfolding and propagate the disease, resulting in insoluble amyloid aggregates in cells of affected specimens. Because of experimental difficulties in measurements of origin and growth of insoluble amyloid aggregations in cells, theoretical modeling is often the only one source of information regarding the molecular mechanism of the disease. Replica exchange Monte Carlo simulations presented in this chapter indicate that proteins in the native state, N, on interaction with an energetically higher structure, R, can change their conformation into R and form a dimer, R(2). The addition of another protein in the N state to R(2) may lead to spontaneous formation of a trimer, R(3). These results reveal the molecular basis for a model of prion disease propagation or conformational diseases in general.
Collapse
|
69
|
Clarke OJ, Parker MJ. Time-averaged predictions of folded and misfolded peptides using a reduced physicochemical model. J Comput Chem 2008; 29:1177-85. [DOI: 10.1002/jcc.20879] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
70
|
Kenzaki H, Kikuchi M. Free-energy landscape of kinesin by a realistic lattice model. Proteins 2008; 71:389-95. [DOI: 10.1002/prot.21707] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
71
|
Shmygelska A, Hoos HH. An adaptive bin framework search method for a beta-sheet protein homopolymer model. BMC Bioinformatics 2007; 8:136. [PMID: 17451609 PMCID: PMC1894818 DOI: 10.1186/1471-2105-8-136] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2007] [Accepted: 04/24/2007] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The problem of protein structure prediction consists of predicting the functional or native structure of a protein given its linear sequence of amino acids. This problem has played a prominent role in the fields of biomolecular physics and algorithm design for over 50 years. Additionally, its importance increases continually as a result of an exponential growth over time in the number of known protein sequences in contrast to a linear increase in the number of determined structures. Our work focuses on the problem of searching an exponentially large space of possible conformations as efficiently as possible, with the goal of finding a global optimum with respect to a given energy function. This problem plays an important role in the analysis of systems with complex search landscapes, and particularly in the context of ab initio protein structure prediction. RESULTS In this work, we introduce a novel approach for solving this conformation search problem based on the use of a bin framework for adaptively storing and retrieving promising locally optimal solutions. Our approach provides a rich and general framework within which a broad range of adaptive or reactive search strategies can be realized. Here, we introduce adaptive mechanisms for choosing which conformations should be stored, based on the set of conformations already stored in memory, and for biasing choices when retrieving conformations from memory in order to overcome search stagnation. CONCLUSION We show that our bin framework combined with a widely used optimization method, Monte Carlo search, achieves significantly better performance than state-of-the-art generalized ensemble methods for a well-known protein-like homopolymer model on the face-centered cubic lattice.
Collapse
Affiliation(s)
- Alena Shmygelska
- Department of Structural Biology, Stanford University, 299 W. Campus Dr., Stanford, CA 94305, USA
| | - Holger H Hoos
- Department of Computer Science, University of British Columbia, 2366 Main Mall, Vancouver, BC V6T 1Z4, Canada
| |
Collapse
|
72
|
Fitzgerald JE, Jha AK, Colubri A, Sosnick TR, Freed KF. Reduced C(beta) statistical potentials can outperform all-atom potentials in decoy identification. Protein Sci 2007; 16:2123-39. [PMID: 17893359 PMCID: PMC2204143 DOI: 10.1110/ps.072939707] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
Abstract
We developed a series of statistical potentials to recognize the native protein from decoys, particularly when using only a reduced representation in which each side chain is treated as a single C(beta) atom. Beginning with a highly successful all-atom statistical potential, the Discrete Optimized Protein Energy function (DOPE), we considered the implications of including additional information in the all-atom statistical potential and subsequently reducing to the C(beta) representation. One of the potentials includes interaction energies conditional on backbone geometries. A second potential separates sequence local from sequence nonlocal interactions and introduces a novel reference state for the sequence local interactions. The resultant potentials perform better than the original DOPE statistical potential in decoy identification. Moreover, even upon passing to a reduced C(beta) representation, these statistical potentials outscore the original (all-atom) DOPE potential in identifying native states for sets of decoys. Interestingly, the backbone-dependent statistical potential is shown to retain nearly all of the information content of the all-atom representation in the C(beta) representation. In addition, these new statistical potentials are combined with existing potentials to model hydrogen bonding, torsion energies, and solvation energies to produce even better performing potentials. The ability of the C(beta) statistical potentials to accurately represent protein interactions bodes well for computational efficiency in protein folding calculations using reduced backbone representations, while the extensions to DOPE illustrate general principles for improving knowledge-based potentials.
Collapse
Affiliation(s)
- James E Fitzgerald
- Department of Physics, The University of Chicago, Chicago, Illinois 60637, USA
| | | | | | | | | |
Collapse
|
73
|
Murarka RK, Liwo A, Scheraga HA. Separation of time scale and coupling in the motion governed by the coarse-grained and fine degrees of freedom in a polypeptide backbone. J Chem Phys 2007; 127:155103. [DOI: 10.1063/1.2784200] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023] Open
|
74
|
Thachuk C, Shmygelska A, Hoos HH. A replica exchange Monte Carlo algorithm for protein folding in the HP model. BMC Bioinformatics 2007; 8:342. [PMID: 17875212 PMCID: PMC2071922 DOI: 10.1186/1471-2105-8-342] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2007] [Accepted: 09/17/2007] [Indexed: 12/04/2022] Open
Abstract
Background The ab initio protein folding problem consists of predicting protein tertiary structure from a given amino acid sequence by minimizing an energy function; it is one of the most important and challenging problems in biochemistry, molecular biology and biophysics. The ab initio protein folding problem is computationally challenging and has been shown to be NP
MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaacqWFneVtcqqGqbauaaa@3961@-hard even when conformations are restricted to a lattice. In this work, we implement and evaluate the replica exchange Monte Carlo (REMC) method, which has already been applied very successfully to more complex protein models and other optimization problems with complex energy landscapes, in combination with the highly effective pull move neighbourhood in two widely studied Hydrophobic Polar (HP) lattice models. Results We demonstrate that REMC is highly effective for solving instances of the square (2D) and cubic (3D) HP protein folding problem. When using the pull move neighbourhood, REMC outperforms current state-of-the-art algorithms for most benchmark instances. Additionally, we show that this new algorithm provides a larger ensemble of ground-state structures than the existing state-of-the-art methods. Furthermore, it scales well with sequence length, and it finds significantly better conformations on long biological sequences and sequences with a provably unique ground-state structure, which is believed to be a characteristic of real proteins. We also present evidence that our REMC algorithm can fold sequences which exhibit significant interaction between termini in the hydrophobic core relatively easily. Conclusion We demonstrate that REMC utilizing the pull move neighbourhood significantly outperforms current state-of-the-art methods for protein structure prediction in the HP model on 2D and 3D lattices. This is particularly noteworthy, since so far, the state-of-the-art methods for 2D and 3D HP protein folding – in particular, the pruned-enriched Rosenbluth method (PERM) and, to some extent, Ant Colony Optimisation (ACO) – were based on chain growth mechanisms. To the best of our knowledge, this is the first application of REMC to HP protein folding on the cubic lattice, and the first extension of the pull move neighbourhood to a 3D lattice.
Collapse
Affiliation(s)
- Chris Thachuk
- School of Computing Science, Simon Fraser University, Burnaby, B.C., V5A 1S6, Canada
| | - Alena Shmygelska
- Department of Structural Biology, Stanford University, Stanford, CA, 94305, USA
| | - Holger H Hoos
- Department of Computer Science, University of British Columbia, B.C., V6T 1Z4, Canada
| |
Collapse
|
75
|
Abstract
Molecular dynamics (MD) is an invaluable tool with which to study protein folding in silico. Although just a few years ago the dynamic behavior of a protein molecule could be simulated only in the neighborhood of the experimental conformation (or protein unfolding could be simulated at high temperature), the advent of distributed computing, new techniques such as replica-exchange MD, new approaches (based on, e.g., the stochastic difference equation), and physics-based reduced models of proteins now make it possible to study protein-folding pathways from completely unfolded structures. In this review, we present algorithms for MD and their extensions and applications to protein-folding studies, using all-atom models with explicit and implicit solvent as well as reduced models of polypeptide chains.
Collapse
Affiliation(s)
- Harold A Scheraga
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA.
| | | | | |
Collapse
|
76
|
Chen YT, Ding J. Lattice Monte Carlo simulation of coil-helix transition. CONFERENCE PROCEEDINGS : ... ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY. ANNUAL CONFERENCE 2007; 2005:4445-8. [PMID: 17281223 DOI: 10.1109/iembs.2005.1615453] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]
Abstract
An improved self-avoiding lattice model was put forward to study coil-helix transition via dynamic Monte Carlo simulation. Each residue occupies eight simple lattices. By introducing a virtual-imino group and a virtual-carbonyl group, the helical period is not necessarily to be an integer. The Gö-like restriction of hydrogen bonding interaction has also been partially released. The coil-helix transition was well reproduced and consistent with the Zimm-Bragg theory. Non-native hydrogen bonding interaction seems significant around transition temperature. In characterization of a statistically helical structure, a novel correlation function was put forward and verified.
Collapse
Affiliation(s)
- Y T Chen
- Key Laboratory of Molecular Engineering of Polymers, Department of Macromolecular Science, Fudan, University, Shanghai 200433, China
| | | |
Collapse
|
77
|
Pokarowski P, Droste K, Kolinski A. A minimal proteinlike lattice model: an alpha-helix motif. J Chem Phys 2007; 122:214915. [PMID: 15974798 DOI: 10.1063/1.1924601] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A simple protein model of a four-helix bundle motif on a face-centered cubic lattice has been studied. Total energy of a conformation includes attractive interactions between hydrophobic residues, repulsive interactions between hydrophobic and polar residues, and a potential that favors helical turns. Using replica exchange Monte Carlo simulations we have estimated a set of parameters for which the native structure is a global minimum of conformational energy. Then we have shown that all the above types of interactions are necessary to guarantee the cooperativity of folding transition and to satisfy the thermodynamic hypothesis.
Collapse
Affiliation(s)
- Piotr Pokarowski
- Institute of Applied Mathematics and Mechanics, Warsaw University, Banacha 2, 02-097 Warsaw, Poland.
| | | | | |
Collapse
|
78
|
Abstract
Backbone hydrogen bonds contribute very importantly to the stability of proteins and therefore they must be appropriately represented in protein folding simulations. Simple models are frequently used in theoretical approaches to this process, but their simplifications are often confronted with the need to be true to the physics of the interactions. Here we study the effects of different levels of coarse graining in the modeling of backbone hydrogen bonds. We study three different models taken from the bibliography in a twofold fashion. First, we calculate the hydrogen bonds in 2gb1, an (alpha + beta)-protein, and see how different backbone representations and potentials can mimic the effects of real hydrogen bonds both in helices and sheets. Second, we use an evolutionary method for protein fragment assembly to locate the global energy minimum for a set of small beta-proteins with these models. This way, we assess the effects of coarse graining in hydrogen bonding models and show what can be expected from them when used in simulation experiments.
Collapse
Affiliation(s)
- David De Sancho
- Departamento de Química Física I, Universidad Complutense, Madrid, Spain
| | | |
Collapse
|
79
|
Fogolari F, Corazza A, Viglino P, Zuccato P, Pieri L, Faccioli P, Bellotti V, Esposito G. Molecular dynamics simulation suggests possible interaction patterns at early steps of beta2-microglobulin aggregation. Biophys J 2006; 92:1673-81. [PMID: 17158575 PMCID: PMC1796822 DOI: 10.1529/biophysj.106.098483] [Citation(s) in RCA: 36] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Early events in aggregation of proteins are not easily accessible by experiments. In this work, we perform a 5-ns molecular dynamics simulation of an ensemble of 27 copies of beta(2)-microglobulin in explicit solvent. During the simulation, the formation of intermolecular contacts is observed. The simulation highlights the importance of apical residues and, in particular, of those at the N-terminus end of the molecule. The most frequently found pattern of interaction involves a head-to-head contact arrangement of molecules. Hydrophobic contacts appear to be important for the establishment of long-lived (on the simulation timescale) contacts. Although early events on the pathway to aggregation and fibril formation are not directly related to the end-state of the process, which is reached on a much longer timescale, simulation results are consistent with experimental data and in general with a parallel arrangement of intermolecular beta-strand pairs.
Collapse
Affiliation(s)
- Federico Fogolari
- Dipartimento di Scienze e Tecnologie Biomediche, Università di Udine, Udine, Italy.
| | | | | | | | | | | | | | | |
Collapse
|
80
|
|
81
|
Zhang J, Lin M, Chen R, Liang J, Liu JS. Monte Carlo sampling of near-native structures of proteins with applications. Proteins 2006; 66:61-8. [PMID: 17039507 DOI: 10.1002/prot.21203] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Since a protein's dynamic fluctuation inside cells affects the protein's biological properties, we present a novel method to study the ensemble of near-native structures (NNS) of proteins, namely, the conformations that are very similar to the experimentally determined native structure. We show that this method enables us to (i) quantify the difficulty of predicting a protein's structure, (ii) choose appropriate simplified representations of protein structures, and (iii) assess the effectiveness of knowledge-based potential functions. We found that well-designed simple representations of protein structures are likely as accurate as those more complex ones for certain potential functions. We also found that the widely used contact potential functions stabilize NNS poorly, whereas potential functions incorporating local structure information significantly increase the stability of NNS.
Collapse
Affiliation(s)
- Jinfeng Zhang
- Department of Statistics, Harvard University, Cambridge, Massachusetts, USA
| | | | | | | | | |
Collapse
|
82
|
Imamura H, Chen JZY. Dependence of folding dynamics and structural stability on the location of a hydrophobic pair in beta-hairpins. Proteins 2006; 63:555-70. [PMID: 16485280 DOI: 10.1002/prot.20846] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
We study the dependence of folding time, nucleation site, and stability of a model beta-hairpin on the location of a cross-strand hydrophobic pair, using a coarse-grained off-lattice model with the aid of Monte Carlo simulations. Our simulations have produced 6500 independent folding trajectories dynamically, forming the basis for extensive statistical analysis. Four folding pathways, zipping-out, middle-out, zipping-in, and reptation, have been closely monitored and discussed in all seven sequences studied. A hydrophobic pair placed near the beta-turn or in the middle section effectively speed up folding; a hydrophobic pair placed close to the terminal ends or next to the beta-turn encourages stability of the entire chain.
Collapse
Affiliation(s)
- Hideo Imamura
- Department of Physics, University of Waterloo, Waterloo, Ontario, Canada
| | | |
Collapse
|
83
|
Kenzaki H, Kikuchi M. Diversity in free energy landscape and folding pathway of proteins with the same native topology. Chem Phys Lett 2006. [DOI: 10.1016/j.cplett.2006.05.112] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
84
|
Zhang J, Chen R, Liang J. Empirical potential function for simplified protein models: combining contact and local sequence-structure descriptors. Proteins 2006; 63:949-60. [PMID: 16477624 DOI: 10.1002/prot.20809] [Citation(s) in RCA: 31] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Abstract
An effective potential function is critical for protein structure prediction and folding simulation. Simplified protein models such as those requiring only Calpha or backbone atoms are attractive because they enable efficient search of the conformational space. We show residue-specific reduced discrete-state models can represent the backbone conformations of proteins with small RMSD values. However, no potential functions exist that are designed for such simplified protein models. In this study, we develop optimal potential functions by combining contact interaction descriptors and local sequence-structure descriptors. The form of the potential function is a weighted linear sum of all descriptors, and the optimal weight coefficients are obtained through optimization using both native and decoy structures. The performance of the potential function in a test of discriminating native protein structures from decoys is evaluated using several benchmark decoy sets. Our potential function requiring only backbone atoms or Calpha atoms have comparable or better performance than several residue-based potential functions that require additional coordinates of side-chain centers or coordinates of all side-chain atoms. By reducing the residue alphabets down to size 10 for contact descriptors, the performance of the potential function can be further improved. Our results also suggest that local sequence-structure correlation may play important role in reducing the entropic cost of protein folding.
Collapse
Affiliation(s)
- Jinfeng Zhang
- Department of Bioengineering, University of Illinois, Chicago, Illinois, USA
| | | | | |
Collapse
|
85
|
Santana R, Larrañaga P, Lozano JA. Side chain placement using estimation of distribution algorithms. Artif Intell Med 2006; 39:49-63. [PMID: 16854574 DOI: 10.1016/j.artmed.2006.04.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2005] [Revised: 04/26/2006] [Accepted: 04/28/2006] [Indexed: 11/29/2022]
Abstract
OBJECTIVE This paper presents an algorithm for the solution of the side chain placement problem. METHODS AND MATERIALS The algorithm combines the application of the Goldstein elimination criterion with the univariate marginal distribution algorithm (UMDA), which stochastically searches the space of possible solutions. The suitability of the algorithm to address the problem is investigated using a set of 425 proteins. RESULTS For a number of difficult instances where inference algorithms do not converge, it has been shown that UMDA is able to find better structures. CONCLUSIONS The results obtained show that the algorithm can achieve better structures than those obtained with other state-of-the-art methods like inference-based techniques. Additionally, a theoretical and empirical analysis of the computational cost of the algorithm introduced has been presented.
Collapse
Affiliation(s)
- Roberto Santana
- Department of Computer Science and Artificial Intelligence, University of the Basque Country, CP-20080, Donostia-San Sebastián, Spain.
| | | | | |
Collapse
|
86
|
De Mori GMS, Colombo G, Micheletti C. Study of the Villin headpiece folding dynamics by combining coarse-grained Monte Carlo evolution and all-atom molecular dynamics. Proteins 2006; 58:459-71. [PMID: 15521059 DOI: 10.1002/prot.20313] [Citation(s) in RCA: 48] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
The folding mechanism of the Villin headpiece (HP36) is studied by means of a novel approach which entails an initial coarse-grained Monte Carlo (MC) scheme followed by all-atom molecular dynamics (MD) simulations in explicit solvent. The MC evolution occurs in a simplified free-energy landscape and allows an efficient selection of marginally-compact structures which are taken as viable initial conformations for the MD. The coarse-grained MC structural representation is connected to the one with atomic resolution through a "fine-graining" reconstruction algorithm. This two-stage strategy is used to select and follow the dynamics of seven different unrelated conformations of HP36. In a notable case the MD trajectory rapidly evolves towards the folded state, yielding a typical root-mean-square deviation (RMSD) of the core region of only 2.4 A from the closest NMR model (the typical RMSD over the whole structure being 4.0 A). The analysis of the various MC-MD trajectories provides valuable insight into the details of the folding and mis-folding mechanisms and particularly about the delicate influence of local and nonlocal interactions in steering the folding process.
Collapse
|
87
|
Global optimization of proteins using a dynamical lattice model: Ground states and energy landscapes. Chem Phys Lett 2006. [DOI: 10.1016/j.cplett.2006.05.007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
88
|
Chen Y, Zhang Q, Ding J. A coarse-grained model for the formation of α helix with a noninteger period on simple cubic lattices. J Chem Phys 2006; 124:184903. [PMID: 16709135 DOI: 10.1063/1.2196878] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Periodicity is an important parameter in the characterization of a helix in proteins. In this work, a coarse-grained model for a homopolypeptide in simple cubic lattices has been extended to build an alpha helix with a noninteger period. The lattice model is based on the bond fluctuation algorithm in which bond lengths and orientations are altered in a quasicontinuous way, and the simulation is performed via dynamic Monte Carlo simulation. Hydrogen bonds are assumed to be formed between a virtual-carbonyl group in a residue and a downstream virtual-imino group in another residue. Consequently, the period of the formed alpha helix is a noninteger. An improved spatial correlation function has been suggested to quantitatively describe the periodicity of the helical conformation, by which helical period and helical persistent length can be calculated by statistics. The resultant periods are very close to 3.6 residues; the persistent length based upon the improved definition can be larger or smaller than the chain length and reflect the inherent regularity of a chain including one or multiple helical blocks. The simulation outputs agree with the calculation of the Zimm-Bragg theory based upon the associated nucleation parameter and propagation parameter as well.
Collapse
Affiliation(s)
- Yantao Chen
- Key Laboratory of Molecular Engineering of Polymers of Chinese Ministry of Education, Department of Macromolecular Science, Fudan University, Shanghai 200433, China
| | | | | |
Collapse
|
89
|
Kenzaki H, Kikuchi M. Coarse-grained protein model, cooperativity of folding and subdomain structure. Chem Phys Lett 2006. [DOI: 10.1016/j.cplett.2006.03.016] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
90
|
Livesay DR, Jacobs DJ. Conserved quantitative stability/flexibility relationships (QSFR) in an orthologous RNase H pair. Proteins 2006; 62:130-43. [PMID: 16287093 PMCID: PMC4678005 DOI: 10.1002/prot.20745] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Many reports qualitatively describe conserved stability and flexibility profiles across protein families, but biophysical modeling schemes have not been available to robustly quantify both. Here we investigate an orthologous RNase H pair by using a minimal distance constraint model (DCM). The DCM is an all atom microscopic model [Jacobs and Dallakyan, Biophys J 2005;88(2):903-915] that accurately reproduces heat capacity measurements [Livesay et al., FEBS Lett 2004;576(3):468-476], and is unique in its ability to harmoniously calculate thermodynamic stability and flexibility in practical computing times. Consequently, quantified stability/flexibility relationships (QSFR) can be determined using the DCM. For the first time, a comparative QSFR analysis is performed, serving as a paradigm study to illustrate the utility of a QSFR analysis for elucidating evolutionarily conserved stability and flexibility profiles. Despite global conservation of QSFR profiles, distinct enthalpy-entropy compensation mechanisms are identified between the RNase H pair. In both cases, local flexibility metrics parallel H/D exchange experiments by correctly identifying the folding core and several flexible regions. Remarkably, at appropriately shifted temperatures (e.g., melting temperature), these differences lead to a global conservation in Landau free energy landscapes, which directly relate thermodynamic stability to global flexibility. Using ensemble-based sampling within free energy basins, rigidly, and flexibly correlated regions are quantified through cooperativity correlation plots. Five conserved flexible regions are identified within the structures of the orthologous pair. Evolutionary conservation of these flexibly correlated regions is strongly suggestive of their catalytic importance. Conclusions made herein are demonstrated to be robust with respect to the DCM parameterization.
Collapse
Affiliation(s)
- Dennis R. Livesay
- Department of Chemistry and Center for Macromolecular Modeling and Materials Design, California State Polytechnic University, Northridge, California
| | - Donald J. Jacobs
- Department of Physics and Astronomy, California State University, Northridge, California
- Correspondence to: Donald Jacobs, Department of Physics and Optical Science, University of North Carolina, Charlotte, 9201 University City Blvd, Charlotte, NC 28223.
| |
Collapse
|
91
|
Brannigan G, Lin LCL, Brown FLH. Implicit solvent simulation models for biomembranes. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2005; 35:104-24. [PMID: 16187129 DOI: 10.1007/s00249-005-0013-y] [Citation(s) in RCA: 101] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2005] [Revised: 08/03/2005] [Accepted: 08/12/2005] [Indexed: 11/25/2022]
Abstract
Fully atomic simulation strategies are infeasible for the study of many processes of interest to membrane biology, biophysics and biochemistry. We review various coarse-grained simulation methodologies with special emphasis on methods and models that do not require the explicit simulation of water. Examples from our own research demonstrate that such models have potential for simulating a variety of biologically relevant phenomena at the membrane surface.
Collapse
Affiliation(s)
- Grace Brannigan
- Department of Physics and Astronomy, University of California, Santa Barbara, CA 93106-9530, USA
| | | | | |
Collapse
|
92
|
Abreu CRA, Escobedo FA. A Novel Configurational-Bias Monte Carlo Method for Lattice Polymers: Application to Molecules with Multicyclic Architectures. Macromolecules 2005. [DOI: 10.1021/ma050725t] [Citation(s) in RCA: 16] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Charlles R. A. Abreu
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, New York 14853
| | - Fernando A. Escobedo
- School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, New York 14853
| |
Collapse
|
93
|
Plewczynska D, Kolinski A. Protein Folding with a Reduced Model and Inaccurate Short-Range Restraints. MACROMOL THEOR SIMUL 2005. [DOI: 10.1002/mats.200500020] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
94
|
Małolepsza E, Boniecki M, Kolinski A, Piela L. Theoretical model of prion propagation: a misfolded protein induces misfolding. Proc Natl Acad Sci U S A 2005; 102:7835-40. [PMID: 15911770 PMCID: PMC1142357 DOI: 10.1073/pnas.0409389102] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
There is a hypothesis that dangerous diseases such as bovine spongiform encephalopathy, Creutzfeldt-Jakob, Alzheimer's, fatal familial insomnia, and several others are induced by propagation of wrong or misfolded conformations of some vital proteins. If for some reason the misfolded conformations were acquired by many such protein molecules it might lead to a "conformational" disease of the organism. Here, a theoretical model of the molecular mechanism of such a conformational disease is proposed, in which a metastable (or misfolded) form of a protein induces a similar misfolding of another protein molecule (conformational autocatalysis). First, a number of amino acid sequences composed of 32 aa have been designed that fold rapidly into a well defined native-like alpha-helical conformation. From a large number of such sequences a subset of 14 had a specific feature of their energy landscape, a well defined local energy minimum (higher than the global minimum for the alpha-helical fold) corresponding to beta-type structure. Only one of these 14 sequences exhibited a strong autocatalytic tendency to form a beta-sheet dimer capable of further propagation of protofibril-like structure. Simulations were done by using a reduced, although of high resolution, protein model and the replica exchange Monte Carlo sampling procedure.
Collapse
Affiliation(s)
- Edyta Małolepsza
- Faculty of Chemistry, Warsaw University, Pasteura 1, 02-093, Warsaw, Poland.
| | | | | | | |
Collapse
|
95
|
Buchete NV, Straub JE, Thirumalai D. Development of novel statistical potentials for protein fold recognition. Curr Opin Struct Biol 2005; 14:225-32. [PMID: 15093838 DOI: 10.1016/j.sbi.2004.03.002] [Citation(s) in RCA: 94] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
The need to perform large-scale studies of protein fold recognition, structure prediction and protein-protein interactions has led to novel developments of residue-level minimal models of proteins. A minimum requirement for useful protein force-fields is that they be successful in the recognition of native conformations. The balance between the level of detail in describing the specific interactions within proteins and the accuracy obtained using minimal protein models is the focus of many current protein studies. Recent results suggest that the introduction of explicit orientation dependence in a coarse-grained, residue-level model improves the ability of inter-residue potentials to recognize the native state. New statistical and optimization computational algorithms can be used to obtain accurate residue-dependent potentials for use in protein fold recognition and, more importantly, structure prediction.
Collapse
Affiliation(s)
- N-V Buchete
- Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health, Bethesda, Maryland 20892, USA
| | | | | |
Collapse
|
96
|
Andrec M, Felts AK, Gallicchio E, Levy RM. Protein folding pathways from replica exchange simulations and a kinetic network model. Proc Natl Acad Sci U S A 2005; 102:6801-6. [PMID: 15800044 PMCID: PMC1100763 DOI: 10.1073/pnas.0408970102] [Citation(s) in RCA: 113] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2004] [Indexed: 11/18/2022] Open
Abstract
We present an approach to the study of protein folding that uses the combined power of replica exchange simulations and a network model for the kinetics. We carry out replica exchange simulations to generate a large ( approximately 10(6)) set of states with an all-atom effective potential function and construct a kinetic model for folding, using an ansatz that allows kinetic transitions between states based on structural similarity. We use this network to perform random walks in the state space and examine the overall network structure. Results are presented for the C-terminal peptide from the B1 domain of protein G. The kinetics is two-state after small temperature perturbations. However, the coil-to-hairpin folding is dominated by pathways that visit metastable helical conformations. We propose possible mechanisms for the alpha-helix/beta-hairpin interconversion.
Collapse
Affiliation(s)
- Michael Andrec
- Department of Chemistry and Chemical Biology and BIOMAPS Institute for Quantitative Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA
| | | | | | | |
Collapse
|
97
|
Thomas GL, Sessions RB, Parker MJ. Density guided importance sampling: application to a reduced model of protein folding. Bioinformatics 2005; 21:2839-43. [PMID: 15802285 DOI: 10.1093/bioinformatics/bti421] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Monte Carlo methods are the most effective means of exploring the energy landscapes of protein folding. The rugged topography of folding energy landscapes causes sampling inefficiencies however, particularly at low, physiological temperatures. RESULTS A hybrid Monte Carlo method, termed density guided importance sampling (DGIS), is presented that overcomes these sampling inefficiencies. The method is shown to be highly accurate and efficient in determining Boltzmann weighted structural metrics of a discrete off-lattice protein model. In comparison to the Metropolis Monte Carlo method, and the hybrid Monte Carlo methods, jump-walking, smart-walking and replica-exchange, the DGIS method is shown to be more efficient, requiring no parameter optimization. The method guides the simulation towards under-sampled regions of the energy spectrum and recognizes when equilibrium has been reached, avoiding arbitrary and excessively long simulation times. AVAILABILITY Fortran code available from authors upon request. CONTACT m.j.parker@leeds.ac.uk.
Collapse
Affiliation(s)
- Geraint L Thomas
- Astbury Centre for Structural Molecular Biology, Department of Biochemistry and Microbiology, University of Leeds, Leeds LS2 9JT, UK
| | | | | |
Collapse
|
98
|
Gront D, Hansmann UHE, Kolinski A. Exploring protein energy landscapes with hierarchical clustering. INTERNATIONAL JOURNAL OF QUANTUM CHEMISTRY 2005; 105:826-830. [PMID: 16479277 PMCID: PMC1366497 DOI: 10.1002/qua.20741] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 05/06/2023]
Abstract
In this work we present a new method for investigating local energy minima on a protein energy landscape. The CABS (CAlpha, CBeta and the center of mass of the Side chain) method was employed for generating protein models, but any other method could be used instead. Cα traces from an ensemble of models are hierarchical clustered with the HCPM (Hierarchical Clustering of Protein Models) method. The efficiency of this method for sampling and analyzing energy landscapes is shown.
Collapse
|
99
|
Wroe R, Bornberg-Bauer E, Chan HS. Comparing folding codes in simple heteropolymer models of protein evolutionary landscape: robustness of the superfunnel paradigm. Biophys J 2005; 88:118-31. [PMID: 15501948 PMCID: PMC1304991 DOI: 10.1529/biophysj.104.050369] [Citation(s) in RCA: 33] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2004] [Accepted: 10/13/2004] [Indexed: 11/18/2022] Open
Abstract
Understanding the evolution of biopolymers is a key element in rationalizing their structures and functions. Simple exact models (SEMs) are well-positioned to address general principles of evolution as they permit the exhaustive enumeration of both sequence and structure (conformational) spaces. The physics-based models of the complete mapping between genotypes and phenotypes afforded by SEMs have proven valuable for gaining insight into how adaptation and selection operate among large collections of sequences and structures. This study compares the properties of evolutionary landscapes of a variety of SEMs to delineate robust predictions and possible model-specific artifacts. Among the models studied, the ruggedness of evolutionary landscape is significantly model-dependent; those derived from more protein-like models appear to be smoother. We found that a common practice of restricting protein structure space to maximally compact lattice conformations results in (i.e., "designs in") many encodable (designable) structures that are not otherwise encodable in the corresponding unrestrained structure space. This discrepancy is especially severe for model potentials that seek to mimic the major role of hydrophobic interactions in protein folding. In general, restricting conformations to be maximally compact leads to larger changes in the model genotype-phenotype mapping than a moderate shifting of reference state energy of the model potential function to allow for more specific encoding via the "designing out" effects of repulsive interactions. Despite these variations, the superfunnel paradigm applies to all SEMs we have tested: For a majority of neutral nets across different models, there exists a funnel-like organization of native stabilities for the sequences in a neutral net encoding for the same structure, and the thermodynamically most stable sequence is also the most robust against mutation.
Collapse
Affiliation(s)
- Richard Wroe
- Faculty of Life Sciences, University of Manchester, United Kingdom; Bioinformatics Division, School of Biological Sciences, University of Münster, Münster, Germany; and Protein Engineering Network of Centres of Excellence, Department of Biochemistry, and Department of Medical Genetics and Microbiology, Faculty of Medicine, University of Toronto, Ontario, Canada
| | - Erich Bornberg-Bauer
- Faculty of Life Sciences, University of Manchester, United Kingdom; Bioinformatics Division, School of Biological Sciences, University of Münster, Münster, Germany; and Protein Engineering Network of Centres of Excellence, Department of Biochemistry, and Department of Medical Genetics and Microbiology, Faculty of Medicine, University of Toronto, Ontario, Canada
| | - Hue Sun Chan
- Faculty of Life Sciences, University of Manchester, United Kingdom; Bioinformatics Division, School of Biological Sciences, University of Münster, Münster, Germany; and Protein Engineering Network of Centres of Excellence, Department of Biochemistry, and Department of Medical Genetics and Microbiology, Faculty of Medicine, University of Toronto, Ontario, Canada
| |
Collapse
|
100
|
|