1
|
Liu B, Gao X, Zhang H. BioSeq-Analysis2.0: an updated platform for analyzing DNA, RNA and protein sequences at sequence level and residue level based on machine learning approaches. Nucleic Acids Res 2020; 47:e127. [PMID: 31504851 PMCID: PMC6847461 DOI: 10.1093/nar/gkz740] [Citation(s) in RCA: 222] [Impact Index Per Article: 55.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2019] [Revised: 08/07/2019] [Accepted: 08/17/2019] [Indexed: 12/14/2022] Open
Abstract
As the first web server to analyze various biological sequences at sequence level based on machine learning approaches, many powerful predictors in the field of computational biology have been developed with the assistance of the BioSeq-Analysis. However, the BioSeq-Analysis can be only applied to the sequence-level analysis tasks, preventing its applications to the residue-level analysis tasks, and an intelligent tool that is able to automatically generate various predictors for biological sequence analysis at both residue level and sequence level is highly desired. In this regard, we decided to publish an important updated server covering a total of 26 features at the residue level and 90 features at the sequence level called BioSeq-Analysis2.0 (http://bliulab.net/BioSeq-Analysis2.0/), by which the users only need to upload the benchmark dataset, and the BioSeq-Analysis2.0 can generate the predictors for both residue-level analysis and sequence-level analysis tasks. Furthermore, the corresponding stand-alone tool was also provided, which can be downloaded from http://bliulab.net/BioSeq-Analysis2.0/download/. To the best of our knowledge, the BioSeq-Analysis2.0 is the first tool for generating predictors for biological sequence analysis tasks at residue level. Specifically, the experimental results indicated that the predictors developed by BioSeq-Analysis2.0 can achieve comparable or even better performance than the existing state-of-the-art predictors.
Collapse
Affiliation(s)
- Bin Liu
- School of Computer Science and Technology, Beijing Institute of Technology, Beijing, China.,Advanced Research Institute of Multidisciplinary Science, Beijing Institute of Technology, Beijing, China
| | - Xin Gao
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China
| | - Hanyu Zhang
- School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen, Guangdong, China
| |
Collapse
|
2
|
Wang C, Piroozan N, Javidpour L, Sahimi M. Effect of the geometry of confining media on the stability and folding rate of α -helix proteins. J Chem Phys 2018; 148:194305. [PMID: 30307193 DOI: 10.1063/1.5020841] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Protein folding in confined media has attracted wide attention over the past 15 years due to its importance to both in vivo and in vitro applications. It is generally believed that protein stability increases by decreasing the size of the confining medium, if the medium's walls are repulsive, and that the maximum folding temperature in confinement is in a pore whose size D 0 is only slightly larger than the smallest dimension of a protein's folded state. Until recently, the stability of proteins in pores with a size very close to that of the folded state has not received the attention it deserves. In a previous paper [L. Javidpour and M. Sahimi, J. Chem. Phys. 135, 125101 (2011)], we showed that, contrary to the current theoretical predictions, the maximum folding temperature occurs in larger pores for smaller α-helices. Moreover, in very tight pores, the free energy surface becomes rough, giving rise to a new barrier for protein folding close to the unfolded state. In contrast to unbounded domains, in small nanopores proteins with an α-helical native state that contain the β structures are entropically stabilized implying that folding rates decrease notably and that the free energy surface becomes rougher. In view of the potential significance of such results to interpretation of many sets of experimental data that could not be explained by the current theories, particularly the reported anomalously low rates of folding and the importance of entropic effects on proteins' misfolded states in highly confined environments, we address the following question in the present paper: To what extent the geometry of a confined medium affects the stability and folding rates of proteins? Using millisecond-long molecular dynamics simulations, we study the problem in three types of confining media, namely, cylindrical and slit pores and spherical cavities. Most importantly, we find that the prediction of the previous theories that the dependence of the maximum folding temperature T f on the size D of a confined medium occurs in larger media for larger proteins is correct only in spherical geometry, whereas the opposite is true in the two other geometries that we study. Also studied is the effect of the strength of the interaction between the confined media's walls and the proteins. If the walls are only weakly or moderately attractive, a complex behavior emerges that depends on the size of the confining medium.
Collapse
Affiliation(s)
- Congyue Wang
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Nariman Piroozan
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Leili Javidpour
- Departments of Energy Engineering and Physics, Amirkabir University of Technology, Tehran 15875-4413, Iran
| | - Muhammad Sahimi
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| |
Collapse
|
3
|
Zheng S, Shing KS, Sahimi M. Dynamics of proteins aggregation. II. Dynamic scaling in confined media. J Chem Phys 2018; 148:104305. [PMID: 29544316 DOI: 10.1063/1.5008543] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
In this paper, the second in a series devoted to molecular modeling of protein aggregation, a mesoscale model of proteins together with extensive discontinuous molecular dynamics simulation is used to study the phenomenon in a confined medium. The medium, as a model of a crowded cellular environment, is represented by a spherical cavity, as well as cylindrical tubes with two aspect ratios. The aggregation process leads to the formation of β sheets and eventually fibrils, whose deposition on biological tissues is believed to be a major factor contributing to many neuro-degenerative diseases, such as Alzheimer's, Parkinson's, and amyotrophic lateral sclerosis diseases. Several important properties of the aggregation process, including dynamic evolution of the total number of the aggregates, the mean aggregate size, and the number of peptides that contribute to the formation of the β sheets, have been computed. We show, similar to the unconfined media studied in Paper I [S. Zheng et al., J. Chem. Phys. 145, 134306 (2016)], that the computed properties follow dynamic scaling, characterized by power laws. The existence of such dynamic scaling in unconfined media was recently confirmed by experiments. The exponents that characterize the power-law dependence on time of the properties of the aggregation process in spherical cavities are shown to agree with those in unbounded fluids at the same protein density, while the exponents for aggregation in the cylindrical tubes exhibit sensitivity to the geometry of the system. The effects of the number of amino acids in the protein, as well as the size of the confined media, have also been studied. Similarities and differences between aggregation in confined and unconfined media are described, including the possibility of no fibril formation, if confinement is severe.
Collapse
Affiliation(s)
- Size Zheng
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Katherine S Shing
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Muhammad Sahimi
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| |
Collapse
|
4
|
Zheng S, Javidpour L, Shing KS, Sahimi M. Dynamics of proteins aggregation. I. Universal scaling in unbounded media. J Chem Phys 2017; 145:134306. [PMID: 27782447 DOI: 10.1063/1.4962837] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
It is well understood that in some cases proteins do not fold correctly and, depending on their environment, even properly-folded proteins change their conformation spontaneously, taking on a misfolded state that leads to protein aggregation and formation of large aggregates. An important factor that contributes to the aggregation is the interactions between the misfolded proteins. Depending on the aggregation environment, the aggregates may take on various shapes forming larger structures, such as protein plaques that are often toxic. Their deposition in tissues is a major contributing factor to many neuro-degenerative diseases, such as Alzheimer's, Parkinson's, amyotrophic lateral sclerosis, and prion. This paper represents the first part in a series devoted to molecular simulation of protein aggregation. We use the PRIME, a meso-scale model of proteins, together with extensive discontinuous molecular dynamics simulation to study the aggregation process in an unbounded fluid system, as the first step toward MD simulation of the same phenomenon in crowded cellular environments. Various properties of the aggregates have been computed, including dynamic evolution of aggregate-size distribution, mean aggregate size, number of peptides that contribute to the formation of β sheets, number of various types of hydrogen bonds formed in the system, radius of gyration of the aggregates, and the aggregates' diffusivity. We show that many of such quantities follow dynamic scaling, similar to those for aggregation of colloidal clusters. In particular, at long times the mean aggregate size S(t) grows with time as, S(t) ∼ tz, where z is the dynamic exponent. To our knowledge, this is the first time that the qualitative similarity between aggregation of proteins and colloidal aggregates has been pointed out.
Collapse
Affiliation(s)
- Size Zheng
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Leili Javidpour
- School of Physics, Institute for Research in Fundamental Sciences (IPM), Tehran 19395-5531, Iran
| | - Katherine S Shing
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| | - Muhammad Sahimi
- Mork Family Department of Chemical Engineering and Materials Science, University of Southern California, Los Angeles, California 90089-1211, USA
| |
Collapse
|
5
|
|
6
|
Javidpour L, Sahimi M. Confinement in nanopores can destabilize α-helix folding proteins and stabilize the β structures. J Chem Phys 2012; 135:125101. [PMID: 21974560 DOI: 10.1063/1.3641482] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Protein folding in confined media has attracted wide attention over the past decade due to its importance in both in vivo and in vitro applications. Currently, it is generally believed that protein stability increases by decreasing the size of the confining medium, if its interaction with the confining walls is repulsive, and that the maximum folding temperature in confinement occurs for a pore size only slightly larger than the smallest dimension of the folded state of a protein. Protein stability in pore sizes, very close to the size of the folded state, has not however received the attention that it deserves. Using detailed, 0.3-ms-long molecular dynamics simulations, we show that proteins with an α-helix native state can have an optimal folding temperature in pore sizes that do not affect the folded-state structure. In contradiction to the current theoretical explanations, we find that the maximum folding temperature occurs in larger pores for smaller α-helices. In highly confined pores the free energy surface becomes rough, and a new barrier for protein folding may appear close to the unfolded state. In addition, in small nanopores the protein states that contain the β structures are entropically stabilized, in contrast to the bulk. As a consequence, folding rates decrease notably and the free energy surface becomes rougher. The results shed light on many recent experimental observations that cannot be explained by the current theories, and demonstrate the importance of entropic effects on proteins' misfolded states in highly confined environments. They also support the concept of passive effect of chaperonin GroEL on protein folding by preventing it from aggregation in crowded environment of biological cells, and provide deeper clues to the α → β conformational transition, believed to contribute to Alzheimer's and Parkinson's diseases. The strategy of protein and enzyme stabilization in confined media may also have to be revisited in the case of tight confinement. For in silico studies of protein folding in confined media, use of non-Go potentials may be more appropriate.
Collapse
Affiliation(s)
- Leili Javidpour
- School of Physics, Institute for Research in Fundamental Sciences, IPM, Tehran 19395-5531, Iran
| | | |
Collapse
|
7
|
Cheung KC, Demaine ED, Bachrach JR, Griffith S. Programmable Assembly With Universally Foldable Strings (Moteins). IEEE T ROBOT 2011. [DOI: 10.1109/tro.2011.2132951] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
8
|
Javidpour L, Tabar MRR, Sahimi M. Molecular simulation of protein dynamics in nanopores. II. Diffusion. J Chem Phys 2009; 130:085105. [PMID: 19256630 DOI: 10.1063/1.3080770] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
A novel combination of discontinuous molecular dynamics and the Langevin equation, together with an intermediate-resolution model of proteins, is used to carry out long (several microsecond) simulations in order to study transport of proteins in nanopores. We simulated single-domain proteins with the alpha-helical native structure. Both attractive and repulsive interaction potentials between the proteins and the pores' walls are considered. The diffusivity D of the proteins is computed not only under the bulk conditions but also as a function of their "length" (the number of the amino-acid groups), temperature T, pore size, and interaction potentials with the walls. Compared with the experimental data, the computed diffusivities under the bulk conditions are of the correct order of magnitude. The diffusivities both in the bulk and in the pores follow a power law in the length [script-l] of the proteins and are larger in pores with repulsive walls. D(+)/D(-), the ratio of the diffusivities in pores with attractive and repulsive walls, exhibits two local maxima in its dependence on the pore size h, which are attributed to the pore sizes and protein configurations that induce long-lasting simultaneous interactions with both walls of the pores. Far from the folding temperature T(f), D increases about linearly with T, but due to the thermal fluctuations and their effect on the proteins' structure near T(f), the dependence of D on T in this region is nonlinear. We propose a novel and general "phase diagram," consisting of four regions, that describes qualitatively the effect of h, T, and interaction potentials with the walls on the diffusivity D of a protein.
Collapse
Affiliation(s)
- Leili Javidpour
- Department of Physics, Sharif University of Technology, Tehran 11155-9161, IranInstitute of Physics, Carl von Ossietzky University, Oldenburg D-26111, Germany
| | | | | |
Collapse
|
9
|
Javidpour L, Tabar MRR, Sahimi M. Molecular simulation of protein dynamics in nanopores. I. Stability and folding. J Chem Phys 2008; 128:115105. [PMID: 18361620 DOI: 10.1063/1.2894299] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Discontinuous molecular dynamics simulations, together with the protein intermediate resolution model, an intermediate-resolution model of proteins, are used to carry out several microsecond-long simulations and study folding transition and stability of alpha-de novo-designed proteins in slit nanopores. Both attractive and repulsive interaction potentials between the proteins and the pore walls are considered. Near the folding temperature T(f) and in the presence of the attractive potential, the proteins undergo a repeating sequence of folding/partially folding/unfolding transitions, with T(f) decreasing with decreasing pore sizes. The unfolded states may even be completely adsorbed on the pore's walls with a negative potential energy. In such pores the energetic effects dominate the entropic effects. As a result, the unfolded state is stabilized, with a folding temperature T(f) which is lower than its value in the bulk and that, compared with the bulk, the folding rate decreases. The opposite is true in the presence of a repulsive interaction potential between the proteins and the walls. Moreover, for short proteins in very tight pores with attractive walls, there exists an unfolded state with only one alpha-helical hydrogen bond and an energy nearly equal to that of the folded state. The proteins have, however, high entropies, implying that they cannot fold onto their native structure, whereas in the presence of repulsive walls the proteins do attain their native structure. There is a pronounced asymmetry between the two termini of the protein with respect to their interaction with the pore walls. The effect of a variety of factors, including the pore size and the proteins' length, as well as the temperature, is studied in detail.
Collapse
Affiliation(s)
- Leili Javidpour
- Department of Physics, Sharif University of Technology, Tehran, Iran
| | | | | |
Collapse
|
10
|
|
11
|
Roadmap methods for protein folding. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2007. [PMID: 18075168 DOI: 10.1007/978-1-59745-574-9_9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register]
Abstract
Protein folding refers to the process whereby a protein assumes its intricate three-dimensional shape. This chapter reviews a class of methods for studying the folding process called roadmap methods. The goal of these methods is not to predict the folded structure of a protein, but rather to analyze the folding kinetics. It is assumed that the folded state is known. Roadmap methods maintain a graph representation of sampled conformations. By analyzing this graph one can predict structure formation order, the probability of folding, and get a coarse view of the energy landscape.
Collapse
|
12
|
Djurdjevic DP, Biggs MJ. Ab initio protein fold prediction using evolutionary algorithms: influence of design and control parameters on performance. J Comput Chem 2007; 27:1177-95. [PMID: 16752367 DOI: 10.1002/jcc.20440] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
True ab initio prediction of protein 3D structure requires only the protein primary structure, a physicochemical free energy model, and a search method for identifying the free energy global minimum. Various characteristics of evolutionary algorithms (EAs) mean they are in principle well suited to the latter. Studies to date have been less than encouraging, however. This is because of the limited consideration given to EA design and control parameter issues. A comprehensive study of these issues was, therefore, undertaken for ab initio protein fold prediction using a full atomistic protein model. The performance and optimal control parameter settings of twelve EA designs where first established using a 15-residue polyalanine molecule-design aspects varied include the encoding alphabet, crossover operator, and replacement strategy. It can be concluded that real encoding and multipoint crossover are superior, while both generational and steady-state replacement strategies have merits. The scaling between the optimal control parameter settings and polyalanine size was also identified for both generational and steady-state designs based on real encoding and multipoint crossover. Application of the steady-state design to met-enkephalin indicated that these scalings are potentially transferable to real proteins. Comparison of the performance of the steady state design for met-enkephalin with other ab initio methods indicates that EAs can be competitive provided the correct design and control parameter values are used.
Collapse
Affiliation(s)
- Dusan P Djurdjevic
- Institute for Materials and Processes, University of Edinburgh, King's Buildings, Mayfield Road, Edinburgh EH9 3JL, United Kingdom
| | | |
Collapse
|
13
|
Abstract
Protein motions, ranging from molecular flexibility to large-scale conformational change, play an essential role in many biochemical processes. Despite the explosion in our knowledge of structural and functional data, our understanding of protein movement is still very limited. In previous work, we developed and validated a motion planning based method for mapping protein folding pathways from unstructured conformations to the native state. In this paper, we propose a novel method based on rigidity theory to sample conformation space more effectively, and we describe extensions of our framework to automate the process and to map transitions between specified conformations. Our results show that these additions both improve the accuracy of our maps and enable us to study a broader range of motions for larger proteins. For example, we show that rigidity-based sampling results in maps that capture subtle folding differences between protein G and its mutants, NuG1 and NuG2, and we illustrate how our technique can be used to study large-scale conformational changes in calmodulin, a 148 residue signaling protein known to undergo conformational changes when binding to Ca(2+). Finally, we announce our web-based protein folding server which includes a publicly available archive of protein motions: (http://parasol.tamu.edu/foldingserver/).
Collapse
Affiliation(s)
- Shawna Thomas
- Parasol Lab, Department of Computer Science, Texas A&M University, College Station, TX 77843-3112, USA
| | | | | | | |
Collapse
|
14
|
Nguyen HD, Hall CK. Spontaneous fibril formation by polyalanines; discontinuous molecular dynamics simulations. J Am Chem Soc 2006; 128:1890-901. [PMID: 16464090 PMCID: PMC3215763 DOI: 10.1021/ja0539140] [Citation(s) in RCA: 84] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Fibrillary protein aggregates rich in beta-sheet structure have been implicated in the pathology of several neurodegenerative diseases. In this work, we investigate the formation of fibrils by performing discontinuous molecular dynamics simulations on systems containing 12 to 96 model Ac-KA(14)K-NH(2) peptides using our newly developed off-lattice, implicit-solvent, intermediate-resolution model, PRIME. We find that, at a low concentration, random-coil peptides assemble into alpha-helices at low temperatures. At intermediate concentrations, random-coil peptides assemble into alpha-helices at low temperatures and large beta-sheet structures at high temperatures. At high concentrations, the system forms beta-sheets over a wide range of temperatures. These assemble into fibrils above a critical temperature which decreases with concentration and exceeds the isolated peptide's folding temperature. At very high temperatures and all concentrations, the system is in a random-coil state. All of these results are in good qualitative agreement with those by Blondelle and co-workers on Ac-KA(14)K-NH(2) peptides. The fibrils observed in our simulations mimic the structural characteristics observed in experiments in terms of the number of sheets formed, the values of the intra- and intersheet separations, and the parallel peptide arrangement within each beta-sheet. Finally, we find that when the strength of the hydrophobic interaction between nonpolar side chains is high compared to the strength of hydrogen bonding, amorphous aggregates, rather than fibrillar aggregates, are formed.
Collapse
Affiliation(s)
- Hung D Nguyen
- Department of Chemical Engineering, North Carolina State University, Raleigh, NC 27695-7905, USA
| | | |
Collapse
|
15
|
Simulating Protein Motions with Rigidity Analysis. LECTURE NOTES IN COMPUTER SCIENCE 2006. [DOI: 10.1007/11732990_33] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2023]
|
16
|
Li X, Liang J. Geometric cooperativity and anticooperativity of three-body interactions in native proteins. Proteins 2005; 60:46-65. [PMID: 15849756 DOI: 10.1002/prot.20438] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Characterizing multibody interactions of hydrophobic, polar, and ionizable residues in protein is important for understanding the stability of protein structures. We introduce a geometric model for quantifying 3-body interactions in native proteins. With this model, empirical propensity values for many types of 3-body interactions can be reliably estimated from a database of native protein structures, despite the overwhelming presence of pairwise contacts. In addition, we define a nonadditive coefficient that characterizes cooperativity and anticooperativity of residue interactions in native proteins by measuring the deviation of 3-body interactions from 3 independent pairwise interactions. It compares the 3-body propensity value from what would be expected if only pairwise interactions were considered, and highlights the distinction of propensity and cooperativity of 3-body interaction. Based on the geometric model, and what can be inferred from statistical analysis of such a model, we find that hydrophobic interactions and hydrogen-bonding interactions make nonadditive contributions to protein stability, but the nonadditive nature depends on whether such interactions are located in the protein interior or on the protein surface. When located in the interior, many hydrophobic interactions such as those involving alkyl residues are anticooperative. Salt-bridge and regular hydrogen-bonding interactions, such as those involving ionizable residues and polar residues, are cooperative. When located on the protein surface, these salt-bridge and regular hydrogen-bonding interactions are anticooperative, and hydrophobic interactions involving alkyl residues become cooperative. We show with examples that incorporating 3-body interactions improves discrimination of protein native structures against decoy conformations. In addition, analysis of cooperative 3-body interaction may reveal spatial motifs that can suggest specific protein functions.
Collapse
Affiliation(s)
- Xiang Li
- Department of Bioengineering, SEO, MC-063, University of Illinois at Chicago, Chicago, Illinois 60607-7052, USA
| | | |
Collapse
|
17
|
Abstract
Ordered beta-sheet complexes, termed amyloid fibrils, are the underlying structural components of the intra- and extracellular fibrillar protein deposits that are associated with a variety of human diseases, including Alzheimer's, Parkinson's, and the prion diseases. In this work, we investigated the kinetics of fibril formation using our newly developed off-lattice intermediate resolution model, PRIME. The model is simple enough to allow the treatment of large multichain systems while maintaining a fairly realistic description of protein dynamics without built-in bias toward any conformation when used in conjunction with constant temperature discontinuous molecular dynamics, a fast alternative to conventional molecular dynamics. Simulations were performed on systems containing 48-96 model Ac-KA14K-NH2 peptides. We found that fibril formation for polyalanines incorporate features that are characteristic of three models, the templated assembly, nucleated polymerization, and nucleated conformational conversion models, but that none of them gave a completely satisfactory description of the simulation kinetics. Fibril formation was nucleation-dependent, occurring after a lag time that decreased with increasing peptide concentration and increased with increasing temperature. Fibril formation appeared to be a conformational conversion process in which small amorphous aggregates --> beta-sheets --> ordered nucleus --> subsequent rapid growth of a small stable fibril or protofilament. Fibril growth in our simulations involved both beta-sheet elongation, in which the fibril grew by adding individual peptides to the end of each beta-sheet, and lateral addition, in which the fibril grew by adding already formed beta-sheets to its side. The initial rate of fibril formation increased with increasing concentration and decreased with increasing temperature.
Collapse
Affiliation(s)
- Hung D Nguyen
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, North Carolina 27695-7905, USA
| | | |
Collapse
|
18
|
Lotan I, Schwarzer F, Halperin D, Latombe JC. Algorithm and Data Structures for Efficient Energy Maintenance during Monte Carlo Simulation of Proteins. J Comput Biol 2004; 11:902-32. [PMID: 15700409 DOI: 10.1089/cmb.2004.11.902] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Monte Carlo simulation (MCS) is a common methodology to compute pathways and thermodynamic properties of proteins. A simulation run is a series of random steps in conformation space, each perturbing some degrees of freedom of the molecule. A step is accepted with a probability that depends on the change in value of an energy function. Typical energy functions sum many terms. The most costly ones to compute are contributed by atom pairs closer than some cutoff distance. This paper introduces a new method that speeds up MCS by exploiting the facts that proteins are long kinematic chains and that few degrees of freedom are changed at each step. A novel data structure, called the ChainTree, captures both the kinematics and the shape of a protein at successive levels of detail. It is used to efficiently detect self-collision (steric clash between atoms) and/or find all atom pairs contributing to the energy. It also makes it possible to identify partial energy sums left unchanged by a perturbation, thus allowing the energy value to be incrementally updated. Computational tests on four proteins of sizes ranging from 68 to 755 amino acids show that MCS with the ChainTree method is significantly faster (as much as 10 times faster for the largest protein) than with the widely used grid method. They also indicate that speed-up increases with larger proteins.
Collapse
Affiliation(s)
- Itay Lotan
- Department of Computer Science, Stanford University, Stanford, CA 94305, USA
| | | | | | | |
Collapse
|
19
|
Locker CR, Hernandez R. Folding behavior of model proteins with weak energetic frustration. J Chem Phys 2004; 120:11292-303. [PMID: 15268157 DOI: 10.1063/1.1751394] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
The native structure of fast-folding proteins, albeit a deep local free-energy minimum, may involve a relatively small energetic penalty due to nonoptimal, though favorable, contacts between amino acid residues. The weak energetic frustration that such contacts represent varies among different proteins and may account for folding behavior not seen in unfrustrated models. Minimalist model proteins with heterogeneous contacts--as represented by lattice heteropolymers consisting of three types of monomers--also give rise to weak energetic frustration in their corresponding native structures, and the present study of their equilibrium and nonequilibrium properties reveals some of the breadth in their behavior. In order to capture this range within a detailed study of only a few proteins, four candidate protein structures (with their cognate sequences) have been selected according to a figure of merit called the winding index--a characteristic of the number of turns the protein winds about an axis. The temperature-dependent heat capacities reveal a high-temperature collapse transition, and an infrequently observed low-temperature rearrangement transition that arises because of the presence of weak energetic frustration. Simulation results motivate the definition of a new measure of folding affinity as a sequence-dependent free energy--a function of both a reduced stability gap and high accessibility to non-native structures--that correlates strongly with folding rates.
Collapse
Affiliation(s)
- C Rebecca Locker
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, Georgia 30332-0400, USA
| | | |
Collapse
|
20
|
Apaydin MS, Brutlag DL, Guestrin C, Hsu D, Latombe JC, Varma C. Stochastic roadmap simulation: an efficient representation and algorithm for analyzing molecular motion. J Comput Biol 2004; 10:257-81. [PMID: 12935328 DOI: 10.1089/10665270360688011] [Citation(s) in RCA: 62] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Classic molecular motion simulation techniques, such as Monte Carlo (MC) simulation, generate motion pathways one at a time and spend most of their time in the local minima of the energy landscape defined over a molecular conformation space. Their high computational cost prevents them from being used to compute ensemble properties (properties requiring the analysis of many pathways). This paper introduces stochastic roadmap simulation (SRS) as a new computational approach for exploring the kinetics of molecular motion by simultaneously examining multiple pathways. These pathways are compactly encoded in a graph, which is constructed by sampling a molecular conformation space at random. This computation, which does not trace any particular pathway explicitly, circumvents the local-minima problem. Each edge in the graph represents a potential transition of the molecule and is associated with a probability indicating the likelihood of this transition. By viewing the graph as a Markov chain, ensemble properties can be efficiently computed over the entire molecular energy landscape. Furthermore, SRS converges to the same distribution as MC simulation. SRS is applied to two biological problems: computing the probability of folding, an important order parameter that measures the "kinetic distance" of a protein's conformation from its native state; and estimating the expected time to escape from a ligand-protein binding site. Comparison with MC simulations on protein folding shows that SRS produces arguably more accurate results, while reducing computation time by several orders of magnitude. Computational studies on ligand-protein binding also demonstrate SRS as a promising approach to study ligand-protein interactions.
Collapse
|
21
|
Song G, Amato N. A Motion-Planning Approach to Folding: From Paper Craft to Protein Folding. ACTA ACUST UNITED AC 2004. [DOI: 10.1109/tra.2003.820926] [Citation(s) in RCA: 47] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
22
|
Kolodny R, Levitt M. Protein decoy assembly using short fragments under geometric constraints. Biopolymers 2003; 68:278-85. [PMID: 12601789 DOI: 10.1002/bip.10262] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
A small set of protein fragments can represent adequately all known local protein structure. This set of fragments, along with a construction scheme that assembles these fragments into structures, defines a discrete (relatively small) conformation space, which approximates protein structures accurately. We generate protein decoys by sampling geometrically valid structures from this conformation space, biased by the secondary structure prediction for the protein. Unlike other methods, secondary structure prediction is the only protein-specific information used for generating the decoys. Nevertheless, these decoys are qualitatively similar to those found by others. The method works well for all-alpha proteins, and shows promising results for alpha and beta proteins.
Collapse
Affiliation(s)
- R Kolodny
- Department of Computer Science, Stanford University, Stanford, CA 94305-5126, USA
| | | |
Collapse
|
23
|
Pal L, Chakrabarti P, Basu G. Sequence and structure patterns in proteins from an analysis of the shortest helices: implications for helix nucleation. J Mol Biol 2003; 326:273-91. [PMID: 12547209 DOI: 10.1016/s0022-2836(02)01338-4] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
Abstract
The shortest helices (three-length 3(10) and four-length alpha), most abundant among helices of different lengths, have been analyzed from a database of protein structures. A characteristic feature of three-length 3(10)-helices is the shifted backbone conformation for the C-terminal residue (phi,psi angles: -95 degrees,0 degrees ), compared to the rest of the helix (-62 degrees,-24 degrees ). The deviation can be attributed to the release of electrostatic repulsion between the carbonyl oxygen atoms at the two C-terminal residues and further stabilization (due to a more linear geometry) of an intrahelical hydrogen bond. A consequence of this non-canonical C-terminal backbone conformation can be a potential origin of helix kinks when a 3(10)-helix is sequence-contiguous at the alpha-helix N-terminal. An analysis of hydrogen bonding, as well as hydrophobic interactions in the shortest helices shows that capping interactions, some of them not observed for longer helices, dominate at the N termini. Further, consideration of the distribution of amino acid residues indicates that the shortest helices resemble the N-terminal end of alpha-helices rather than the C terminus, implying that the folding of helices may be initiated at the N-terminal end, which does not get propagated in the case of the shortest helices. Finally, pairwise comparison of beta-turns and the shortest helices, based on correlation matrices of site-specific amino acid composition, and the relative abundance of these short secondary structural elements, leads to a helix nucleation scheme that considers the formation of an isolated beta-turn (and not an alpha-turn) as the helix nucleation step, with shortest 3(10)-helices as intermediates between the shortest alpha-helix and the beta-turn. Our results ascribe an important role played by shortest 3(10)-helices in proteins with important structural and folding implications.
Collapse
Affiliation(s)
- Lipika Pal
- Department of Biochemistry, Bose Institute, P-1/12 CIT Scheme VIIM, 700 054, Calcutta, India
| | | | | |
Collapse
|
24
|
Genetic algorithms in molecular modelling: a review. ACTA ACUST UNITED AC 2003. [DOI: 10.1016/s0922-3487(03)23004-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
25
|
Abstract
We present a framework for studying protein folding pathways and potential landscapes which is based on techniques recently developed in the robotics motion planning community. Our focus in this work is to study the protein folding mechanism assuming we know the native fold. That is, instead of performing fold prediction, we aim to study issues related to the folding process, such as the formation of secondary and tertiary structure, and the dependence of the folding pathway on the initial denatured conformation. Our work uses probabilistic roadmap (PRM) motion planning techniques which have proven successful for problems involving high-dimensional configuration spaces. A strength of these methods is their efficiency in rapidly covering the planning space without becoming trapped in local minima. We have applied our PRM technique to several small proteins (~60 residues) and validated the pathways computed by comparing the secondary structure formation order on our paths to known hydrogen exchange experimental results. An advantage of the PRM framework over other simulation methods is that it enables one to easily and efficiently compute folding pathways from any denatured starting state to the (known) native fold. This aspect makes our approach ideal for studying global properties of the protein's potential landscape, most of which are difficult to simulate and study with other methods. For example, in the proteins we study, the folding pathways starting from different denatured states sometimes share common portions when they are close to the native fold, and moreover, the formation order of the secondary structure appears largely independent of the starting denatured conformation. Another feature of our technique is that the distribution of the sampled conformations is correlated with the formation of secondary structure and, in particular, appears to differentiate situations in which secondary structure clearly forms first and those in which the tertiary structure is obtained more directly. Overall, our results applying PRM techniques are very encouraging and indicate the promise of our approach for studying proteins for which experimental results are not available.
Collapse
Affiliation(s)
- Nancy M Amato
- Department of Computer Science, Texas A&M University, College Station, TX 77843-3112, USA.
| | | |
Collapse
|
26
|
Kobayashi Y, Saitĵ N. Secondary structure prediction and folding of globular protein: refolding of ferredoxin. JOURNAL OF PROTEIN CHEMISTRY 2001; 20:647-54. [PMID: 11890206 DOI: 10.1023/a:1013720403558] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
The physicochemical mechanism of protein folding has been elucidated by the island model, describing a growth type of folding. The folding pathway is closely related with nucleation on the polypeptide chain and thus the formation of small local structures or secondary structures at the earliest stage of folding is essential to all following steps. The island model is applicable to any protein, but a high precision of secondary structure prediction is indispensable to folding simulation. The secondary structures formed at the earliest stage of folding are supposed to be of standard form, but they are usually deformed during the folding process, especially at the last stage, although the degree of deformation is different for each protein. Ferredoxin is an example of a protein having this property. According to X-ray investigation (1FDX), ferredoxin is not supposed to have secondary structures. However, if we assumed that in ferredoxin all the residues are in a coil state, we could not attain the correct structure similar to the native one. Further, we found that some parts of the chain are not flexible, suggesting the presence of secondary structures, in agreement with the recent PDB data (1DUR). Assuming standard secondary structures (alpha-helices and beta-strands) at the nonflexible parts at the early stage of folding, and deforming these at the final stage, a structure similar to the native one was obtained. Another peculiarity of ferredoxin is the absence of disulfide bonds, in spite of its having eight cysteines. The reason cysteines do not form disulfide bonds became clear by applying the lampshade criterion, but more importantly, the two groups of cysteines are ready to make iron complexes, respectively, at a rather later stage of folding. The reason for poor prediction accuracy of secondary structure with conventional methods is discussed.
Collapse
Affiliation(s)
- Y Kobayashi
- Department of Information Systems Science, Faculty of Engineering, Soka University, Hachioji-shi, Tokyo, Japan.
| | | |
Collapse
|
27
|
Voegler Smith A, Hall CK. alpha-helix formation: discontinuous molecular dynamics on an intermediate-resolution protein model. Proteins 2001; 44:344-60. [PMID: 11455608 DOI: 10.1002/prot.1100] [Citation(s) in RCA: 133] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
An intermediate-resolution model of small, homogeneous peptides is introduced, and discontinuous molecular dynamics simulation is applied to study secondary structure formation. Physically, each model residue consists of a detailed three-bead backbone and a simplified single-bead side-chain. Excluded volume and hydrogen bond interactions are constructed with discontinuous (i.e., hard-sphere and square-well) potentials. Simulation results show that the backbone motion of the model is limited to realistic regions of Phi-Psi conformational space. Model polyalanine chains undergo a locally cooperative transition to form alpha-helices that are stabilized by backbone hydrogen bonding, while model polyglycine chains tend to adopt nonhelical structures. When side-chain size is increased beyond a critical diameter, steric interactions prevent formation of long alpha-helices. These trends in helicity as a function of residue type have been well documented by experimental, theoretical, and simulation studies and demonstrate the ability of the intermediate-resolution model developed in this work to accurately mimic realistic peptide behavior. The efficient algorithm used permits observation of the complete helix-coil transition within 15 min on a single-processor workstation, suggesting that simulations of very long times are possible with this model.
Collapse
Affiliation(s)
- A Voegler Smith
- Department of Chemical Engineering, North Carolina State University, Raleigh, North Carolina 27695-7905, USA
| | | |
Collapse
|
28
|
Smith AV, Hall CK. Assembly of a tetrameric alpha-helical bundle: computer simulations on an intermediate-resolution protein model. Proteins 2001; 44:376-91. [PMID: 11455611 DOI: 10.1002/prot.1103] [Citation(s) in RCA: 64] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
Abstract
Discontinuous molecular dynamics (DMD) simulation on an intermediate-resolution protein model is used to study the folding of an isolated, small model peptide to an amphipathic alpha-helix and the assembly of four of these model peptides into a four-helix bundle. A total of 129 simulations were performed on the isolated peptide, and 50 simulations were performed on the four-peptide system. Simulations efficiently sample conformational space allowing complete folding trajectories from random initial configurations to be observed within 15 min for the one-peptide system and within 15 h for the four-peptide system on a 500-MHz workstation. The native structures of both the alpha-helix and the four-helix bundle are consistent with experimental characterization studies and with results from previous simulations on these model peptides. In both the one- and four-peptide systems, the native state is achieved during simulations within an optimal temperature range, a phenomenon also observed experimentally. The ease with which our simulations yield reasonable estimates of folded structures demonstrates the power of the intermediate-resolution model developed for this work and the DMD algorithm and suggests that simulations of very long times and of multiprotein systems may be possible with this model.
Collapse
Affiliation(s)
- A V Smith
- Department of Chemical Engineering, North Carolina State University, Raleigh, North Carolina 27695-7905, USA
| | | |
Collapse
|
29
|
Locker CR, Hernandez R. A minimalist model protein with multiple folding funnels. Proc Natl Acad Sci U S A 2001; 98:9074-9. [PMID: 11470921 PMCID: PMC55375 DOI: 10.1073/pnas.161438898] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Kinetic and structural studies of wild-type proteins such as prions and amyloidogenic proteins provide suggestive evidence that proteins may adopt multiple long-lived states in addition to the native state. All of these states differ structurally because they lie far apart in configuration space, but their stability is not necessarily caused by cooperative (nucleation) effects. In this study, a minimalist model protein is designed to exhibit multiple long-lived states to explore the dynamics of the corresponding wild-type proteins. The minimalist protein is modeled as a 27-monomer sequence confined to a cubic lattice with three different monomer types. An order parameter-the winding index-is introduced to characterize the extent of folding. The winding index has several advantages over other commonly used order parameters like the number of native contacts. It can distinguish between enantiomers, its calculation requires less computational time than the number of native contacts, and reduced-dimensional landscapes can be developed when the native state structure is not known a priori. The results for the designed model protein prove by existence that the rugged energy landscape picture of protein folding can be generalized to include protein "misfolding" into long-lived states.
Collapse
Affiliation(s)
- C R Locker
- Center for Computational Molecular Science and Technology, School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA 30332-0400, USA
| | | |
Collapse
|
30
|
Chakrabarti P, Pal D. The interrelationships of side-chain and main-chain conformations in proteins. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2001; 76:1-102. [PMID: 11389934 DOI: 10.1016/s0079-6107(01)00005-0] [Citation(s) in RCA: 177] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
The accurate determination of a large number of protein structures by X-ray crystallography makes it possible to conduct a reliable statistical analysis of the distribution of the main-chain and side-chain conformational angles, how these are dependent on residue type, adjacent residue in the sequence, secondary structure, residue-residue interactions and location at the polypeptide chain termini. The interrelationship between the main-chain (phi, psi) and side-chain (chi 1) torsion angles leads to a classification of amino acid residues that simplify the folding alphabet considerably and can be a guide to the design of new proteins or mutational studies. Analyses of residues occurring with disallowed main-chain conformation or with multiple conformations shed some light on why some residues are less favoured in thermophiles.
Collapse
Affiliation(s)
- P Chakrabarti
- Department of Biochemistry, Bose Institute, P-1/12, CIT Scheme VIIM, 700 054, Calcutta, India. boseinst.ernet.in
| | | |
Collapse
|
31
|
Hassinen T, Peräkylä M. New energy terms for reduced protein models implemented in an off-lattice force field. J Comput Chem 2001. [DOI: 10.1002/jcc.1080] [Citation(s) in RCA: 106] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
|
32
|
Convex Global Underestimation for Molecular Structure Prediction. ACTA ACUST UNITED AC 2001. [DOI: 10.1007/978-1-4757-5284-7_1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register]
|
33
|
Voegler Smith A, Hall CK. Bridging the gap between homopolymer and protein models: A discontinuous molecular dynamics study. J Chem Phys 2000. [DOI: 10.1063/1.1320824] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
|
34
|
|
35
|
|
36
|
Abstract
A coarse-grained dynamic Monte Carlo (MC) simulation method is used to investigate the conformational dynamics of chymotrypsin inhibitor 2 (CI2). Each residue is represented therein by two interaction sites, one at the alpha-carbon and the other on the amino acid side-chain. The energy and geometry parameters extracted from databank structures are used. The calculated rms fluctuations of alpha-carbon atoms are in good agreement with crystallographic temperature factors. The two regions of the protein that pack against each other to form the main hydrophobic core exhibit negatively correlated fluctuations. The conformational dynamics could efficiently be probed by the time-delayed orientational and conformational correlation functions of the virtual bonds: the active site loop, excluding the active site bond, the turn region, and the N-terminal of the alpha-helix are relatively more mobile regions of the structure. A correlation is observed between the hydrogen/deuterium (H/D) exchange behavior and the long-time orientational and conformational autocorrelation function values for CI2. A cooperativity in the rotations of the bonds near in sequence is observed at all time windows, whereas the cooperative rotations of the bonds far along the sequence appear at long time windows; these correlations contribute to the stability of the secondary structures and the tertiary structure, respectively.
Collapse
Affiliation(s)
- N Kurt
- Chemical Engineering Department, Boğaziçi University, Bebek, Istanbul, Turkey
| | | |
Collapse
|
37
|
|
38
|
Huang ES, Samudrala R, Ponder JW. Ab initio fold prediction of small helical proteins using distance geometry and knowledge-based scoring functions. J Mol Biol 1999; 290:267-81. [PMID: 10388572 DOI: 10.1006/jmbi.1999.2861] [Citation(s) in RCA: 67] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
The problem of protein tertiary structure prediction from primary sequence can be separated into two subproblems: generation of a library of possible folds and specification of a best fold given the library. A distance geometry procedure based on random pairwise metrization with good sampling properties was used to generate a library of 500 possible structures for each of 11 small helical proteins. The input to distance geometry consisted of sets of restraints to enforce predicted helical secondary structure and a generic range of 5 to 11 A between predicted contact residues on all pairs of helices. For each of the 11 targets, the resulting library contained structures with low RMSD versus the native structure. Near-native sampling was enhanced by at least three orders of magnitude compared to a random sampling of compact folds. All library members were scored with a combination of an all-atom distance-dependent function, a residue pair-potential, and a hydrophobicity function. In six of the 11 cases, the best-ranking fold was considered to be near native. Each library was also reduced to a final ab initio prediction via consensus distance geometry performed over the 50 best-ranking structures from the full set of 500. The consensus results were of generally higher quality, yielding six predictions within 6.5 A of the native fold. These favorable predictions corresponded to those for which the correlation between the RMSD and the scoring function were highest. The advantage of the reported methodology is its extreme simplicity and potential for including other types of structural restraints.
Collapse
Affiliation(s)
- E S Huang
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, Saint Louis, MO, 63110, USA
| | | | | |
Collapse
|
39
|
Eyrich VA, Standley DM, Friesner RA. Prediction of protein tertiary structure to low resolution: performance for a large and structurally diverse test set. J Mol Biol 1999; 288:725-42. [PMID: 10329175 DOI: 10.1006/jmbi.1999.2702] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
Abstract
We report the tertiary structure predictions for 95 proteins ranging in size from 17 to 160 residues starting from known secondary structure. Predictions are obtained from global minimization of an empirical potential function followed by the application of a refined atomic overlap potential. The minimization strategy employed represents a variant of the Monte Carlo plus minimization scheme of Li and Scheraga applied to a reduced model of the protein chain. For all of the cases except beta-proteins larger than 75 residues, a native-like structure, usually 4-6 A root-mean-square deviation from the native, is located. For beta-proteins larger than 75 residues, the energy gap between native-like structures and the lowest energy structures produced in the simulation is large, so that low RMSD structures are not generated starting from an unfolded state. This is attributed to the lack of an explicit hydrogen bond term in the potential function, which we hypothesize is necessary to stabilize large assemblies of beta-strands.
Collapse
Affiliation(s)
- V A Eyrich
- Department of Chemistry and Center for Biomolecular Simulation, Columbia University, New York, NY, 10027, USA.
| | | | | |
Collapse
|
40
|
Abstract
We describe a method for making natural, physical movements in a chained polymer by sequentially adjusting a few neighboring torsion angles in the polymer backbone. In addition to being very fast and easy to implement, the method is also very general. It applies equally well to proteins and nucleic acids. This method is then used to design a local refinement procedure. We test the refinement procedure on the minimization of a simple energy function for proteins. The energy function has a simplified potential for hydrophobic interaction, a hydrogen-bond term, and a term for van der Waals interaction. There is considerable current interest in such simple energy functions for protein folding. When applied to refine structures found by a global search method, the refinement is able to produce large reduction in the hydrogen-bond term and the van der Waal term of the energy. We conclude that the method is particularly effective in finding good "packing" of residues in an initially compact conformation.
Collapse
Affiliation(s)
- W H Wong
- Department of Statistics, University of California, Los Angeles 90095-1554, USA.
| | | | | |
Collapse
|
41
|
|
42
|
Chaudhury P, Bhattacharyya S. Locating critical points on multi-dimensional surfaces by genetic algorithm: test cases including normal and perturbed argon clusters. Chem Phys 1999. [DOI: 10.1016/s0301-0104(98)00414-5] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/18/2022]
|
43
|
Huang ES, Koehl P, Levitt M, Pappu RV, Ponder JW. Accuracy of side-chain prediction upon near-native protein backbones generated by Ab initio folding methods. Proteins 1998; 33:204-17. [PMID: 9779788 DOI: 10.1002/(sici)1097-0134(19981101)33:2<204::aid-prot5>3.0.co;2-i] [Citation(s) in RCA: 39] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The ab initio folding problem can be divided into two sequential tasks of approximately equal computational complexity: the generation of native-like backbone folds and the positioning of side chains upon these backbones. The prediction of side-chain conformation in this context is challenging, because at best only the near-native global fold of the protein is known. To test the effect of displacements in the protein backbones on side-chain prediction for folds generated ab initio, sets of near-native backbones (< or = 4 A C alpha RMS error) for four small proteins were generated by two methods. The steric environment surrounding each residue was probed by placing the side chains in the native conformation on each of these decoys, followed by torsion-space optimization to remove steric clashes on a rigid backbone. We observe that on average 40% of the chi1 angles were displaced by 40 degrees or more, effectively setting the limits in accuracy for side-chain modeling under these conditions. Three different algorithms were subsequently used for prediction of side-chain conformation. The average prediction accuracy for the three methods was remarkably similar: 49% to 51% of the chi1 angles were predicted correctly overall (33% to 36% of the chi1+2 angles). Interestingly, when the inter-side-chain interactions were disregarded, the mean accuracy increased. A consensus approach is described, in which side-chain conformations are defined based on the most frequently predicted chi angles for a given method upon each set of near-native backbones. We find that consensus modeling, which de facto includes backbone flexibility, improves side-chain prediction: chi1 accuracy improved to 51-54% (36-42% of chi1+2). Implications of a consensus method for ab initio protein structure prediction are discussed.
Collapse
Affiliation(s)
- E S Huang
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, Missouri 63110, USA
| | | | | | | | | |
Collapse
|
44
|
Hutchinson EG, Sessions RB, Thornton JM, Woolfson DN. Determinants of strand register in antiparallel beta-sheets of proteins. Protein Sci 1998; 7:2287-300. [PMID: 9827995 PMCID: PMC2143855 DOI: 10.1002/pro.5560071106] [Citation(s) in RCA: 151] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Antiparallel beta-sheets present two distinct environments to inter-strand residue pairs: beta(A,HB) sites have two backbone hydrogen bonds; whereas at beta(A,NHB) positions backbone hydrogen bonding is precluded. We used statistical methods to compare the frequencies of amino acid pairs at each site. Only approximately 10% of the 210 possible pairs showed occupancies that differed significantly between the two sites. Trends were clear in the preferred pairs, and these could be explained using stereochemical arguments. Cys-Cys, Aromatic-Pro, Thr-Thr, and Val-Val pairs all preferred the beta(A,NHB) site. In each case, the residues usually adopted sterically favored chi1 conformations, which facilitated intra-pair interactions: Cys-Cys pairs formed disulfide bonds; Thr-Thr pairs made hydrogen bonds; Aromatic-Pro and Val-Val pairs formed close van der Waals contacts. In contrast, to make intimate interactions at a beta(A,HB) site, one or both residues had to adopt less favored chi1 geometries. Nonetheless, pairs containing glycine and/or aromatic residues were favored at this site. Where glycine and aromatic side chains combined, the aromatic residue usually adopted the gauche conformation, which promoted novel aromatic ring-peptide interactions. This work provides rules that link protein sequence and tertiary structure, which will be useful in protein modeling, redesign, and de novo design. Our findings are discussed in light of previous analyses and experimental studies.
Collapse
Affiliation(s)
- E G Hutchinson
- Department of Biochemistry and Molecular Biology, University College, London, United Kingdom
| | | | | | | |
Collapse
|
45
|
Huang ES, Samudrala R, Ponder JW. Distance geometry generates native-like folds for small helical proteins using the consensus distances of predicted protein structures. Protein Sci 1998; 7:1998-2003. [PMID: 9761481 PMCID: PMC2144160 DOI: 10.1002/pro.5560070916] [Citation(s) in RCA: 21] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
For successful ab initio protein structure prediction, a method is needed to identify native-like structures from a set containing both native and non-native protein-like conformations. In this regard, the use of distance geometry has shown promise when accurate inter-residue distances are available. We describe a method by which distance geometry restraints are culled from sets of 500 protein-like conformations for four small helical proteins generated by the method of Simons et al. (1997). A consensus-based approach was applied in which every inter-Calpha distance was measured, and the most frequently occurring distances were used as input restraints for distance geometry. For each protein, a structure with lower coordinate root-mean-square (RMS) error than the mean of the original set was constructed; in three cases the topology of the fold resembled that of the native protein. When the fold sets were filtered for the best scoring conformations with respect to an all-atom knowledge-based scoring function, the remaining subset of 50 structures yielded restraints of higher accuracy. A second round of distance geometry using these restraints resulted in an average coordinate RMS error of 4.38 A.
Collapse
Affiliation(s)
- E S Huang
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, Saint Louis, Missouri 63110, USA
| | | | | |
Collapse
|
46
|
|
47
|
Grigoriev IV, Rakhmaninova AB, Mironov AA. Simulated annealing for alpha-helical protein folding: searches in vicinity of the "molten globule" state. J Biomol Struct Dyn 1998; 16:115-22. [PMID: 9745900 DOI: 10.1080/07391102.1998.10508232] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
A new model for simulation of protein folding of alpha-helical proteins with known secondary structure is proposed. We are dealing here with the analysis of alpha-helix packings rather than with a detailed atom structure of a whole protein. Starting from a random compact packing of the helices the search is focused on a vicinity of "molten globule" states of a protein. In contrast to the majority of the known approaches for estimation of a protein free energy we introduce a simplified potential of interactions with solvent and consider conformational energy of the loops in addition to mean-force potential. The model was applied to several globular alpha-helical proteins and demonstrated high prediction accuracy in comparison with other known models.
Collapse
Affiliation(s)
- I V Grigoriev
- Research Institute for Genetics of Industrial Microorganisms, Moscow, Russia.
| | | | | |
Collapse
|
48
|
Benhabilès N, Gallet X, Thomas-Soumarmon A, Brasseur R. A descriptive analysis of populations of three-dimensional structures calculated from primary sequences of proteins by OSIRIS. J Comput Biol 1998; 5:351-66. [PMID: 9672837 DOI: 10.1089/cmb.1998.5.351] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Among different ab initio approaches to calculate 3D-structures of proteins out of primary sequences, a few are using restricted dihedral spaces and empirical equations of energy as is OSIRIS. All those approaches were calibrated on a few proteins or fragments of proteins. To optimize the calculation over a larger diversity of structures, we need first to define for each sequence what are good conditions of calculations in order to choose a consensus procedure fitting most 3D-structures best. This requires objective classification of calculated 3D-structures. In this work, populations of avian and bovine pancreatic polypeptides (APP, BPP) and of calcium-binding protein (CaBP) are obtained by varying the rate of the angular dynamics of the second step of OSIRIS. Then, 3D-structures are clustered using a nonhierarchical method, SICLA, using rmsd as a distance parameter. A good clustering was obtained for four subpopulations of APP, BPP and CaBP. Each subpopulation was characterized by its barycenter, relative frequency and dispersion. For the three alpha-helix proteins, after the step 1 of OSIRIS, most secondary structures were correct but molecules have a few atomic contacts. Step 2, i.e., the angular dynamics, resolves those atomic contacts and clustering demonstrates that it generates subpopulations of topological conformers as the barycenter topologies show.
Collapse
Affiliation(s)
- N Benhabilès
- INSERM U10, Hôp. C. Bernard-X. Bichat, Paris, France.
| | | | | | | |
Collapse
|
49
|
|
50
|
|