1
|
Abstract
Concerted interactions between all the cell components form the basis of biological processes. Protein-protein interactions (PPIs) constitute a tremendous part of this interaction network. Deeper insight into PPIs can help us better understand numerous diseases and lead to the development of new diagnostic and therapeutic strategies. PPI interfaces, until recently, were considered undruggable. However, it is now believed that the interfaces contain "hot spots," which could be targeted by small molecules. Such a strategy would require high-quality structural data of PPIs, which are difficult to obtain experimentally. Therefore, in silico modeling can complement or be an alternative to in vitro approaches. There are several computational methods for analyzing the structural data of the binding partners and modeling of the protein-protein dimer/oligomer structure. The major problem with in silico structure prediction of protein assemblies is obtaining sufficient sampling of protein dynamics. One of the methods that can take protein flexibility and the effects of the environment into account is Molecular Dynamics (MD). While sampling of the whole protein-protein association process with plain MD would be computationally expensive, there are several strategies to harness the method to PPI studies while maintaining reasonable use of resources. This chapter reviews known applications of MD in the PPI investigation workflows.
Collapse
Affiliation(s)
- Dominika Cieślak
- Laboratory of Plant Protein Phosphorylation, Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Warsaw, Poland
| | - Ivo Kabelka
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden
| | - Damian Bartuzi
- Science for Life Laboratory, Department of Cell and Molecular Biology, Uppsala University, Uppsala, Sweden.
- Department of Synthesis and Chemical Technology of Pharmaceutical Substances with Computer Modelling Lab, Medical University of Lublin, Lublin, Poland.
| |
Collapse
|
2
|
Casier R, Duhamel J. Appraisal of blob-Based Approaches in the Prediction of Protein Folding Times. J Phys Chem B 2023; 127:8852-8859. [PMID: 37793094 DOI: 10.1021/acs.jpcb.3c04958] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023]
Abstract
A series of reports published in the last 3 years has illustrated that a blob-based model (BBM) can predict the folding time of proteins from their primary amino acid (aa) sequence based on three simple rules established to characterize the long-range backbone dynamics (LRBD) of racemic polypeptides. The sole use of LRBD to predict protein folding times with the BBM represents a radical departure from all other prediction methods currently applied to determine protein folding times, which rely instead on parameters such as the structure content, folding kinetics, chain length, amino acid properties, or contact topography of proteins. Furthermore, the built-in modularity of the BBM enables the parametrization and inclusion of new phenomena affecting the LRBD of polypeptides, while its conceptual simplicity makes it an interesting new mathematical tool for studying protein folding. However, its novelty implies that its relationship with many other methods used to predict protein folding times has not been well researched. Consequently, the purpose of this report is to uncover the physical phenomena encountered during protein folding that are best described by the BBM through the identification of parameters that have been recognized over the years as being strong predictors for protein folding, such as protein size, topology, structural class, and folding kinetics. This was accomplished by determining the parameters most strongly correlated with the folding times predicted by the BBM. While the BBM in its present form appears to be a good indicator of the folding times of the vast majority of the 195 proteins considered so far, this report finds that it excels for moderately large proteins that are primarily composed of locally formed structural motifs such as α-helices or for proteins that fold in multiple steps. Altogether, these observations based on the use of the BBM support the notion that proteins fold the way they do because the LRBD of polypeptides is mostly driven by the local interactions experienced between aa's within reach of one another.
Collapse
Affiliation(s)
- Remi Casier
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L3G1, Canada
| | - Jean Duhamel
- Institute for Polymer Research, Waterloo Institute for Nanotechnology, Department of Chemistry, University of Waterloo, Waterloo, Ontario N2L3G1, Canada
| |
Collapse
|
3
|
Evans R, Ramisetty S, Kulkarni P, Weninger K. Illuminating Intrinsically Disordered Proteins with Integrative Structural Biology. Biomolecules 2023; 13:124. [PMID: 36671509 PMCID: PMC9856150 DOI: 10.3390/biom13010124] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 01/01/2023] [Accepted: 01/04/2023] [Indexed: 01/11/2023] Open
Abstract
Intense study of intrinsically disordered proteins (IDPs) did not begin in earnest until the late 1990s when a few groups, working independently, convinced the community that these 'weird' proteins could have important functions. Over the past two decades, it has become clear that IDPs play critical roles in a multitude of biological phenomena with prominent examples including coordination in signaling hubs, enabling gene regulation, and regulating ion channels, just to name a few. One contributing factor that delayed appreciation of IDP functional significance is the experimental difficulty in characterizing their dynamic conformations. The combined application of multiple methods, termed integrative structural biology, has emerged as an essential approach to understanding IDP phenomena. Here, we review some of the recent applications of the integrative structural biology philosophy to study IDPs.
Collapse
Affiliation(s)
- Rachel Evans
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| | - Sravani Ramisetty
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010, USA
- Department of Systems Biology, City of Hope National Medical Center, Duarte, CA 91010, USA
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, NC 27695, USA
| |
Collapse
|
4
|
Machine learning/molecular dynamic protein structure prediction approach to investigate the protein conformational ensemble. Sci Rep 2022; 12:10018. [PMID: 35705565 PMCID: PMC9200820 DOI: 10.1038/s41598-022-13714-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 05/11/2022] [Indexed: 11/25/2022] Open
Abstract
Proteins exist in several different conformations. These structural changes are often associated with fluctuations at the residue level. Recent findings show that co-evolutionary analysis coupled with machine-learning techniques improves the precision by providing quantitative distance predictions between pairs of residues. The predicted statistical distance distribution from Multi Sequence Analysis reveals the presence of different local maxima suggesting the flexibility of key residue pairs. Here we investigate the ability of the residue-residue distance prediction to provide insights into the protein conformational ensemble. We combine deep learning approaches with mechanistic modeling to a set of proteins that experimentally showed conformational changes. The predicted protein models were filtered based on energy scores, RMSD clustering, and the centroids selected as the lowest energy structure per cluster. These models were compared to the experimental-Molecular Dynamics (MD) relaxed structure by analyzing the backbone residue torsional distribution and the sidechain orientations. Our pipeline allows to retrieve the experimental structural dynamics experimentally represented by different X-ray conformations for the same sequence as well the conformational space observed with the MD simulations. We show the potential correlation between the experimental structure dynamics and the predicted model ensemble demonstrating the susceptibility of the current state-of-the-art methods in protein folding and dynamics prediction and pointing out the areas of improvement.
Collapse
|
5
|
Giulini M, Rigoli M, Mattiotti G, Menichetti R, Tarenzi T, Fiorentini R, Potestio R. From System Modeling to System Analysis: The Impact of Resolution Level and Resolution Distribution in the Computer-Aided Investigation of Biomolecules. Front Mol Biosci 2021; 8:676976. [PMID: 34164432 PMCID: PMC8215203 DOI: 10.3389/fmolb.2021.676976] [Citation(s) in RCA: 21] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2021] [Accepted: 05/06/2021] [Indexed: 12/18/2022] Open
Abstract
The ever increasing computer power, together with the improved accuracy of atomistic force fields, enables researchers to investigate biological systems at the molecular level with remarkable detail. However, the relevant length and time scales of many processes of interest are still hardly within reach even for state-of-the-art hardware, thus leaving important questions often unanswered. The computer-aided investigation of many biological physics problems thus largely benefits from the usage of coarse-grained models, that is, simplified representations of a molecule at a level of resolution that is lower than atomistic. A plethora of coarse-grained models have been developed, which differ most notably in their granularity; this latter aspect determines one of the crucial open issues in the field, i.e. the identification of an optimal degree of coarsening, which enables the greatest simplification at the expenses of the smallest information loss. In this review, we present the problem of coarse-grained modeling in biophysics from the viewpoint of system representation and information content. In particular, we discuss two distinct yet complementary aspects of protein modeling: on the one hand, the relationship between the resolution of a model and its capacity of accurately reproducing the properties of interest; on the other hand, the possibility of employing a lower resolution description of a detailed model to extract simple, useful, and intelligible information from the latter.
Collapse
Affiliation(s)
- Marco Giulini
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Marta Rigoli
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Giovanni Mattiotti
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Roberto Menichetti
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Thomas Tarenzi
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Raffaele Fiorentini
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, Trento, Italy.,INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, Trento, Italy
| |
Collapse
|
6
|
Oliveira Junior AB, Lin X, Kulkarni P, Onuchic JN, Roy S, Leite VBP. Exploring Energy Landscapes of Intrinsically Disordered Proteins: Insights into Functional Mechanisms. J Chem Theory Comput 2021; 17:3178-3187. [PMID: 33871257 DOI: 10.1021/acs.jctc.1c00027] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Intrinsically disordered proteins (IDPs) lack a rigid three-dimensional structure and populate a polymorphic ensemble of conformations. Because of the lack of a reference conformation, their energy landscape representation in terms of reaction coordinates presents a daunting challenge. Here, our newly developed energy landscape visualization method (ELViM), a reaction coordinate-free approach, shows its prime application to explore frustrated energy landscapes of an intrinsically disordered protein, prostate-associated gene 4 (PAGE4). PAGE4 is a transcriptional coactivator that potentiates the oncogene c-Jun. Two kinases, namely, HIPK1 and CLK2, phosphorylate PAGE4, generating variants phosphorylated at different serine/threonine residues (HIPK1-PAGE4 and CLK2-PAGE4, respectively) with opposing functions. While HIPK1-PAGE4 predominantly phosphorylates Thr51 and potentiates c-Jun, CLK2-PAGE4 hyperphosphorylates PAGE4 and attenuates transactivation. To understand the underlying mechanisms of conformational diversity among different phosphoforms, we have analyzed their atomistic trajectories simulated using AWSEM forcefield, and the energy landscapes were elucidated using ELViM. This method allows us to identify and compare the population distributions of different conformational ensembles of PAGE4 phosphoforms using the same effective phase space. The results reveal a predominant conformational ensemble with an extended C-terminal segment of WT PAGE4, which exposes a functional residue Thr51, implying its potential of undertaking a fly-casting mechanism while binding to its cognate partner. In contrast, for HIPK1-PAGE4, a compact conformational ensemble enhances its population sequestering phosphorylated-Thr51. This clearly explains the experimentally observed weaker affinity of HIPK1-PAGE4 for c-Jun. ELViM appears as a powerful tool, especially to analyze the highly frustrated energy landscape representation of IDPs where appropriate reaction coordinates are hard to apprehend.
Collapse
Affiliation(s)
- Antonio B Oliveira Junior
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, United States
| | - Xingcheng Lin
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts 02139-4307, United States
| | - Prakash Kulkarni
- Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, California 91010, United States
| | - José N Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005-1892, United States
| | - Susmita Roy
- Department of Chemical Sciences, Indian Institute of Science Education and Research Kolkata, Mohanpur, West Bengal 741246, India
| | - Vitor B P Leite
- Departamento de Física, Instituto de Biociências, Letras e Ciências Exatas, Universidade Estadual Paulista (UNESP), São José do Rio Preto, São Paulo 15054-000, Brazil
| |
Collapse
|
7
|
Menichetti R, Giulini M, Potestio R. A journey through mapping space: characterising the statistical and metric properties of reduced representations of macromolecules. THE EUROPEAN PHYSICAL JOURNAL. B 2021; 94:204. [PMID: 34720709 PMCID: PMC8550479 DOI: 10.1140/epjb/s10051-021-00205-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/15/2021] [Accepted: 09/13/2021] [Indexed: 05/04/2023]
Abstract
ABSTRACT A mapping of a macromolecule is a prescription to construct a simplified representation of the system in which only a subset of its constituent atoms is retained. As the specific choice of the mapping affects the analysis of all-atom simulations as well as the construction of coarse-grained models, the characterisation of the mapping space has recently attracted increasing attention. We here introduce a notion of scalar product and distance between reduced representations, which allows the study of the metric and topological properties of their space in a quantitative manner. Making use of a Wang-Landau enhanced sampling algorithm, we exhaustively explore such space, and examine the qualitative features of mappings in terms of their squared norm. A one-to-one correspondence with an interacting lattice gas on a finite volume leads to the emergence of discontinuous phase transitions in mapping space, which mark the boundaries between qualitatively different reduced representations of the same molecule.
Collapse
Affiliation(s)
- Roberto Menichetti
- Physics Department, University of Trento, via Sommarive, 14, 38123 Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, via Sommarive, 14, 38123 Trento, Italy
| | - Marco Giulini
- Physics Department, University of Trento, via Sommarive, 14, 38123 Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, via Sommarive, 14, 38123 Trento, Italy
| | - Raffaello Potestio
- Physics Department, University of Trento, via Sommarive, 14, 38123 Trento, Italy
- INFN-TIFPA, Trento Institute for Fundamental Physics and Applications, via Sommarive, 14, 38123 Trento, Italy
| |
Collapse
|
8
|
Jin S, Miller MD, Chen M, Schafer NP, Lin X, Chen X, Phillips GN, Wolynes PG. Molecular-replacement phasing using predicted protein structures from AWSEM-Suite. IUCRJ 2020; 7:1168-1178. [PMID: 33209327 PMCID: PMC7642774 DOI: 10.1107/s2052252520013494] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/27/2020] [Accepted: 10/07/2020] [Indexed: 06/11/2023]
Abstract
The phase problem in X-ray crystallography arises from the fact that only the intensities, and not the phases, of the diffracting electromagnetic waves are measured directly. Molecular replacement can often estimate the relative phases of reflections starting with those derived from a template structure, which is usually a previously solved structure of a similar protein. The key factor in the success of molecular replacement is finding a good template structure. When no good solved template exists, predicted structures based partially on templates can sometimes be used to generate models for molecular replacement, thereby extending the lower bound of structural and sequence similarity required for successful structure determination. Here, the effectiveness is examined of structures predicted by a state-of-the-art prediction algorithm, the Associative memory, Water-mediated, Structure and Energy Model Suite (AWSEM-Suite), which has been shown to perform well in predicting protein structures in CASP13 when there is no significant sequence similarity to a solved protein or only very low sequence similarity to known templates. The performance of AWSEM-Suite structures in molecular replacement is discussed and the results show that AWSEM-Suite performs well in providing useful phase information, often performing better than I-TASSER-MR and the previous algorithm AWSEM-Template.
Collapse
Affiliation(s)
- Shikai Jin
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA
- Department of Biosciences, Rice University, Houston, Texas, USA
| | | | - Mingchen Chen
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA
| | - Nicholas P. Schafer
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA
- Department of Chemistry, Rice University, Houston, Texas, USA
| | - Xingcheng Lin
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA
| | - Xun Chen
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA
- Department of Chemistry, Rice University, Houston, Texas, USA
| | - George N. Phillips
- Department of Biosciences, Rice University, Houston, Texas, USA
- Department of Chemistry, Rice University, Houston, Texas, USA
| | - Peter G. Wolynes
- Center for Theoretical Biological Physics, Rice University, Houston, Texas, USA
- Department of Biosciences, Rice University, Houston, Texas, USA
- Department of Chemistry, Rice University, Houston, Texas, USA
- Department of Physics, Rice University, Houston, Texas, USA
| |
Collapse
|
9
|
Roche J, Potoyan DA. Disorder Mediated Oligomerization of DISC1 Proteins Revealed by Coarse-Grained Molecular Dynamics Simulations. J Phys Chem B 2019; 123:9567-9575. [PMID: 31614085 DOI: 10.1021/acs.jpcb.9b07467] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
Disrupted-in-schizophrenia-1 (DISC1) is a scaffold protein of significant importance for neuro-development and a prominent candidate protein in the etiology of mental disorders. In this work, we investigate the role of conformational heterogeneity and local structural disorder in the oligomerization pathway of the full-length DISC1 and of two truncation variants. Through extensive coarse-grained molecular dynamics simulations with a predictive energy landscape-based model, we shed light on the interplay of local and global disorder which lead to different oligomerization pathways. We found that both global conformational heterogeneity and local structural disorder play an important role in shaping distinct oligomerization trends of DISC1. This study also sheds light on the differences in oligomerization pathways of the full-length protein compared to the truncated variants produced by a chromosomal translocation associated with schizophrenia. We report that oligomerization of full-length DISC1 sequence works in a nonadditive manner with respect to truncated fragments that do not mirror the conformational landscape or binding affinities of the full-length unit.
Collapse
Affiliation(s)
- Julien Roche
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology , Iowa State University , Ames , Iowa 50011 , United States
| | - Davit A Potoyan
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology , Iowa State University , Ames , Iowa 50011 , United States.,Department of Chemistry , Iowa State University , Ames , Iowa 50011 , United States.,Bioinformatics and Computational Biology Program , Iowa State University , Ames , Iowa 50011 , United States
| |
Collapse
|
10
|
Geng H, Chen F, Ye J, Jiang F. Applications of Molecular Dynamics Simulation in Structure Prediction of Peptides and Proteins. Comput Struct Biotechnol J 2019; 17:1162-1170. [PMID: 31462972 PMCID: PMC6709365 DOI: 10.1016/j.csbj.2019.07.010] [Citation(s) in RCA: 54] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2019] [Revised: 07/07/2019] [Accepted: 07/23/2019] [Indexed: 12/21/2022] Open
Abstract
Compared with rapid accumulation of protein sequences from high-throughput DNA sequencing, obtaining experimental 3D structures of proteins is still much more difficult, making protein structure prediction (PSP) potentially very useful. Currently, a vast majority of PSP efforts are based on data mining of known sequences, structures and their relationships (informatics-based). However, if closely related template is not available, these methods are usually much less reliable than experiments. They may also be problematic in predicting the structures of naturally occurring or designed peptides. On the other hand, physics-based methods including molecular dynamics (MD) can utilize our understanding of detailed atomic interactions determining biomolecular structures. In this mini-review, we show that all-atom MD can predict structures of cyclic peptides and other peptide foldamers with accuracy similar to experiments. Then, some notable successes in reproducing experimental 3D structures of small proteins through MD simulations (some with replica-exchange) of the folding were summarized. We also describe advancements of MD-based refinement of structure models, and the integration of limited experimental or bioinformatics data into MD-based structure modeling.
Collapse
Affiliation(s)
- Hao Geng
- Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
| | - Fangfang Chen
- Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Shenzhen PKU-HKUST Medical Center, Shenzhen 518036, China
| | - Jing Ye
- Guangdong and Shenzhen Key Laboratory of Male Reproductive Medicine and Genetics, Peking University Shenzhen Hospital, Shenzhen PKU-HKUST Medical Center, Shenzhen 518036, China
| | - Fan Jiang
- Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China
- NanoAI Biotech Co.,Ltd., Silicon Valley Compound, Longhua District, Shenzhen 518109, China
- Corresponding author at: Lab of Computational Chemistry and Drug Design, State Key Laboratory of Chemical Oncogenomics, Peking University Shenzhen Graduate School, Shenzhen 518055, China.
| |
Collapse
|
11
|
Structural and Dynamical Order of a Disordered Protein: Molecular Insights into Conformational Switching of PAGE4 at the Systems Level. Biomolecules 2019; 9:biom9020077. [PMID: 30813315 PMCID: PMC6406393 DOI: 10.3390/biom9020077] [Citation(s) in RCA: 20] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 02/10/2019] [Accepted: 02/10/2019] [Indexed: 01/10/2023] Open
Abstract
Folded proteins show a high degree of structural order and undergo (fairly constrained) collective motions related to their functions. On the other hand, intrinsically disordered proteins (IDPs), while lacking a well-defined three-dimensional structure, do exhibit some structural and dynamical ordering, but are less constrained in their motions than folded proteins. The larger structural plasticity of IDPs emphasizes the importance of entropically driven motions. Many IDPs undergo function-related disorder-to-order transitions driven by their interaction with specific binding partners. As experimental techniques become more sensitive and become better integrated with computational simulations, we are beginning to see how the modest structural ordering and large amplitude collective motions of IDPs endow them with an ability to mediate multiple interactions with different partners in the cell. To illustrate these points, here, we use Prostate-associated gene 4 (PAGE4), an IDP implicated in prostate cancer (PCa) as an example. We first review our previous efforts using molecular dynamics simulations based on atomistic AWSEM to study the conformational dynamics of PAGE4 and how its motions change in its different physiologically relevant phosphorylated forms. Our simulations quantitatively reproduced experimental observations and revealed how structural and dynamical ordering are encoded in the sequence of PAGE4 and can be modulated by different extents of phosphorylation by the kinases HIPK1 and CLK2. This ordering is reflected in changing populations of certain secondary structural elements as well as in the regularity of its collective motions. These ordered features are directly correlated with the functional interactions of WT-PAGE4, HIPK1-PAGE4 and CLK2-PAGE4 with the AP-1 signaling axis. These interactions give rise to repeated transitions between (high HIPK1-PAGE4, low CLK2-PAGE4) and (low HIPK1-PAGE4, high CLK2-PAGE4) cell phenotypes, which possess differing sensitivities to the standard PCa therapies, such as androgen deprivation therapy (ADT). We argue that, although the structural plasticity of an IDP is important in promoting promiscuous interactions, the modulation of the structural ordering is important for sculpting its interactions so as to rewire with agility biomolecular interaction networks with significant functional consequences.
Collapse
|
12
|
Wu H, Wolynes PG, Papoian GA. AWSEM-IDP: A Coarse-Grained Force Field for Intrinsically Disordered Proteins. J Phys Chem B 2018; 122:11115-11125. [PMID: 30091924 PMCID: PMC6713210 DOI: 10.1021/acs.jpcb.8b05791] [Citation(s) in RCA: 72] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The associative memory, water-mediated, structure and energy model (AWSEM) has been successfully used to study protein folding, binding, and aggregation problems. In this work, we introduce AWSEM-IDP, a new AWSEM branch for simulating intrinsically disordered proteins (IDPs), where the weights of the potentials determining secondary structure formation have been finely tuned, and a novel potential is introduced that helps to precisely control both the average extent of protein chain collapse and the chain's fluctuations in size. AWSEM-IDP can efficiently sample large conformational spaces, while retaining sufficient molecular accuracy to realistically model proteins. We applied this new model to two IDPs, demonstrating that AWSEM-IDP can reasonably well reproduce higher-resolution reference data, thus providing the foundation for a transferable IDP force field. Finally, we used thermodynamic perturbation theory to show that, in general, the conformational ensembles of IDPs are highly sensitive to fine-tuning of force field parameters.
Collapse
Affiliation(s)
- Hao Wu
- Biophysics Program, Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
| | - Peter G. Wolynes
- Departments of Chemistry and Physics and Center for Theoretical Biological Physics, Rice University, Houston, Texas 77005, United States
| | - Garegin A. Papoian
- Biophysics Program, Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, United States
- Department of Chemistry and Biochemistry, University of Maryland, College Park, Maryland 20742, United States
| |
Collapse
|
13
|
Mallamace D, Fazio E, Mallamace F, Corsaro C. The Role of Hydrogen Bonding in the Folding/Unfolding Process of Hydrated Lysozyme: A Review of Recent NMR and FTIR Results. Int J Mol Sci 2018; 19:ijms19123825. [PMID: 30513664 PMCID: PMC6321052 DOI: 10.3390/ijms19123825] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2018] [Revised: 11/23/2018] [Accepted: 11/24/2018] [Indexed: 02/07/2023] Open
Abstract
The biological activity of proteins depends on their three-dimensional structure, known as the native state. The main force driving the correct folding mechanism is the hydrophobic effect and when this folding kinetics is altered, aggregation phenomena intervene causing the occurrence of illnesses such as Alzheimer and Parkinson’s diseases. The other important effect is performed by water molecules and by their ability to form a complex network of hydrogen bonds whose dynamics influence the mobility of protein amino acids. In this work, we review the recent results obtained by means of spectroscopic techniques, such as Fourier Transform Infrared (FTIR) and Nuclear Magnetic Resonance (NMR) spectroscopies, on hydrated lysozyme. In particular, we explore the Energy Landscape from the thermal region of configurational stability up to that of the irreversible denaturation. The importance of the coupling between the solute and the solvent will be highlighted as well as the different behaviors of hydrophilic and hydrophobic moieties of protein amino acid residues.
Collapse
Affiliation(s)
- Domenico Mallamace
- Dipartimento di Scienze Matematiche e Informatiche, Scienze Fisiche e Scienze della Terra (MIFT), Università di Messina, 98166 Messina, Italy.
| | - Enza Fazio
- Dipartimento di Scienze Matematiche e Informatiche, Scienze Fisiche e Scienze della Terra (MIFT), Università di Messina, 98166 Messina, Italy.
| | - Francesco Mallamace
- Department of Nuclear Science and Engineering, Massachusetts Institute of Technology (MIT), Cambridge, MA 02139, USA.
- Istituto dei Sistemi Complessi (ISC)-CNR, 00185 Rome, Italy.
| | - Carmelo Corsaro
- Dipartimento di Scienze Matematiche e Informatiche, Scienze Fisiche e Scienze della Terra (MIFT), Università di Messina, 98166 Messina, Italy.
| |
Collapse
|
14
|
Chen M, Lin X, Lu W, Schafer NP, Onuchic JN, Wolynes PG. Template-Guided Protein Structure Prediction and Refinement Using Optimized Folding Landscape Force Fields. J Chem Theory Comput 2018; 14:6102-6116. [PMID: 30240202 DOI: 10.1021/acs.jctc.8b00683] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
When good structural templates can be identified, template-based modeling is the most reliable way to predict the tertiary structure of proteins. In this study, we combine template-based modeling with a realistic coarse-grained force field, AWSEM, that has been optimized using the principles of energy landscape theory. The Associative memory, Water mediated, Structure and Energy Model (AWSEM) is a coarse-grained force field having both transferable tertiary interactions and knowledge-based local-in-sequence interaction terms. We incorporate template information into AWSEM by introducing soft collective biases to the template structures, resulting in a model that we call AWSEM-Template. Structure prediction tests on eight targets, four of which are in the low sequence identity "twilight zone" of homology modeling, show that AWSEM-Template can achieve high-resolution structure prediction. Our results also confirm that using a combination of AWSEM and a template-guided potential leads to more accurate prediction of protein structures than simply using a template-guided potential alone. Free energy profile analyses demonstrate that the soft collective biases to the template effectively increase funneling toward native-like structures while still allowing significant flexibility so as to allow for correction of discrepancies between the target structure and the template. A further stage of refinement using all-atom molecular dynamics augmented with soft collective biases to the structures predicted by AWSEM-Template leads to a further improvement of both backbone and side-chain accuracy by maintaining sufficient flexibility but at the same time discouraging unproductive unfolding events often seen in unrestrained all-atom refinement simulations. The all-atom refinement simulations also reduce patches of frustration of the initial predictions. Some of the backbones found among the structures produced during the initial coarse-grained prediction step already have CE-RMSD values of less than 3 Å with 90% or more of the residues aligned to the experimentally solved structure for all targets. All-atom structures generated during the following all-atom refinement simulations, which started from coarse-grained structures that were chosen without reference to any knowledge about the native structure, have CE-RMSD values of less than 2.5 Å with 90% or more of the residues aligned for 6 out of 8 targets. Clustering low energy structures generated during the initial coarse-grained annealing picks out reliably structures that are within 1 Å of the best sampled structures in 5 out of 8 cases. After the all-atom refinement, structures that are within 1 Å of the best sampled structures can be selected using a simple algorithm based on energetic features alone in 7 out of 8 cases.
Collapse
Affiliation(s)
- Mingchen Chen
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Bioengineering , Rice University , Houston , Texas 77005 , United States
| | - Xingcheng Lin
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Physics and Astronomy , Rice University , Houston , Texas 77005 , United States
| | - Wei Lu
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Physics and Astronomy , Rice University , Houston , Texas 77005 , United States
| | - Nicholas P Schafer
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Chemistry , Rice University , Houston , Texas 77005 , United States
| | - José N Onuchic
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Physics and Astronomy , Rice University , Houston , Texas 77005 , United States.,Department of Chemistry , Rice University , Houston , Texas 77005 , United States.,Department of Biosciences , Rice University , Houston , Texas 77005 , United States
| | - Peter G Wolynes
- Center for Theoretical Biological Physics, Rice University , Houston , Texas 77030 , United States.,Department of Chemistry , Rice University , Houston , Texas 77005 , United States.,Department of Biosciences , Rice University , Houston , Texas 77005 , United States
| |
Collapse
|
15
|
Lin X, Roy S, Jolly MK, Bocci F, Schafer NP, Tsai MY, Chen Y, He Y, Grishaev A, Weninger K, Orban J, Kulkarni P, Rangarajan G, Levine H, Onuchic JN. PAGE4 and Conformational Switching: Insights from Molecular Dynamics Simulations and Implications for Prostate Cancer. J Mol Biol 2018; 430:2422-2438. [PMID: 29758263 DOI: 10.1016/j.jmb.2018.05.011] [Citation(s) in RCA: 31] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2018] [Revised: 04/13/2018] [Accepted: 05/07/2018] [Indexed: 11/15/2022]
Abstract
Prostate-associated gene 4 (PAGE4) is an intrinsically disordered protein implicated in prostate cancer. Thestress-response kinase homeodomain-interacting protein kinase 1 (HIPK1) phosphorylates two residues in PAGE4, serine 9 and threonine 51. Phosphorylation of these two residues facilitates the interaction of PAGE4 with activator protein-1 (AP-1) transcription factor complex to potentiate AP-1's activity. In contrast, hyperphosphorylation of PAGE4 by CDC-like kinase 2 (CLK2) attenuates this interaction with AP-1. Small-angleX-ray scattering and single-molecule fluorescence resonance energy transfer measurements have shown that PAGE4 expands upon hyperphosphorylation and that this expansion is localized to its N-terminal half. To understand the interactions underlying this structural transition, we performed molecular dynamics simulations using Atomistic AWSEM, a multi-scale molecular model that combines atomistic and coarse-grained simulation approaches. Our simulations show that electrostatic interactions drive transient formation of an N-terminal loop, the destabilization of which accounts for the dramatic change in size upon hyperphosphorylation. Phosphorylation also changes the preference of secondary structure formation of the PAGE4 ensemble, which leads to a transition between states that display different degrees of disorder. Finally, we construct a mechanism-based mathematical model that allows us to capture the interactions ofdifferent phosphoforms of PAGE4 with AP-1 and its downstream target, the androgen receptor (AR)-a key therapeutic target in prostate cancer. Our model predicts intracellular oscillatory dynamics of HIPK1-PAGE4, CLK2-PAGE4, and AR activity, indicating phenotypic heterogeneity in an isogenic cell population. Thus, conformational switching of PAGE4 may potentially affect the efficiency of therapeutically targeting AR activity.
Collapse
Affiliation(s)
- Xingcheng Lin
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Physics and Astronomy, Rice University, Houston, TX 77005, United States
| | - Susmita Roy
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States
| | - Mohit Kumar Jolly
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States
| | - Federico Bocci
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Chemistry, Rice University, Houston, TX 77005, United States
| | - Nicholas P Schafer
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Chemistry, Rice University, Houston, TX 77005, United States
| | - Min-Yeh Tsai
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Chemistry, Rice University, Houston, TX 77005, United States
| | - Yihong Chen
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, United States
| | - Yanan He
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, United States
| | - Alexander Grishaev
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, United States; National Institute of Standards and Technology, Gaithersburg, MD 20899, United States
| | - Keith Weninger
- Department of Physics, North Carolina State University, Raleigh, NC 27695, United States
| | - John Orban
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, United States; Department of Chemistry and Biochemistry, University of Maryland, College Park, MD 20742, United States
| | - Prakash Kulkarni
- Institute for Bioscience and Biotechnology Research, University of Maryland, Rockville, MD 20850, United States; Department of Medical Oncology and Therapeutics Research, City of Hope National Medical Center, Duarte, CA 91010, United States
| | - Govindan Rangarajan
- Department of Mathematics, Indian Institute of Science, Bangalore 560012, India; Center for Neuroscience, Indian Institute of Science, Bangalore 560012, India
| | - Herbert Levine
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Physics and Astronomy, Rice University, Houston, TX 77005, United States
| | - José N Onuchic
- Center for Theoretical Biological Physics, Rice University, Houston, TX 77005, United States; Department of Physics and Astronomy, Rice University, Houston, TX 77005, United States; Department of Chemistry, Rice University, Houston, TX 77005, United States; Department of BioSciences, Rice University, Houston, TX 77005, United States.
| |
Collapse
|
16
|
Chen M, Schafer NP, Zheng W, Wolynes PG. The Associative Memory, Water Mediated, Structure and Energy Model (AWSEM)-Amylometer: Predicting Amyloid Propensity and Fibril Topology Using an Optimized Folding Landscape Model. ACS Chem Neurosci 2018; 9:1027-1039. [PMID: 29241326 DOI: 10.1021/acschemneuro.7b00436] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
Amyloids are fibrillar protein aggregates with simple repeated structural motifs in their cores, usually β-strands but sometimes α-helices. Identifying the amyloid-prone regions within protein sequences is important both for understanding the mechanisms of amyloid-associated diseases and for understanding functional amyloids. Based on the crystal structures of seven cross-β amyloidogenic peptides with different topologies and one recently solved cross-α fiber structure, we have developed a computational approach for identifying amyloidogenic segments in protein sequences using the Associative memory, Water mediated, Structure and Energy Model (AWSEM). The AWSEM-Amylometer performs favorably in comparison with other predictors in predicting aggregation-prone sequences in multiple data sets. The method also predicts well the specific topologies (the relative arrangement of β-strands in the core) of the amyloid fibrils. An important advantage of the AWSEM-Amylometer over other existing methods is its direct connection with an efficient, optimized protein folding simulation model, AWSEM. This connection allows one to combine efficient and accurate search of protein sequences for amyloidogenic segments with the detailed study of the thermodynamic and kinetic roles that these segments play in folding and aggregation in the context of the entire protein sequence. We present new simulation results that highlight the free energy landscapes of peptides that can take on multiple fibril topologies. We also demonstrate how the Amylometer methodology can be straightforwardly extended to the study of functional amyloids that have the recently discovered cross-α fibril architecture.
Collapse
|
17
|
dos Santos RN, Ferrari AJR, de Jesus HCR, Gozzo FC, Morcos F, Martínez L. Enhancing protein fold determination by exploring the complementary information of chemical cross-linking and coevolutionary signals. Bioinformatics 2018; 34:2201-2208. [DOI: 10.1093/bioinformatics/bty074] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2017] [Accepted: 02/10/2018] [Indexed: 11/13/2022] Open
Affiliation(s)
- Ricardo N dos Santos
- Institute of Chemistry, University of Campinas, Campinas, Brazil
- Center for Computational Engineering and Sciences, University of Campinas, Campinas, Brazil
| | | | | | - Fábio C Gozzo
- Institute of Chemistry, University of Campinas, Campinas, Brazil
| | - Faruck Morcos
- Department of Biological Sciences, University of Texas at Dallas, Richardson, USA
| | - Leandro Martínez
- Institute of Chemistry, University of Campinas, Campinas, Brazil
- Center for Computational Engineering and Sciences, University of Campinas, Campinas, Brazil
| |
Collapse
|
18
|
Sirovetz BJ, Schafer NP, Wolynes PG. Protein structure prediction: making AWSEM AWSEM-ER by adding evolutionary restraints. Proteins 2017; 85:2127-2142. [PMID: 28799172 DOI: 10.1002/prot.25367] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2017] [Revised: 07/29/2017] [Accepted: 08/08/2017] [Indexed: 11/07/2022]
Abstract
Protein sequences have evolved to fold into functional structures, resulting in families of diverse protein sequences that all share the same overall fold. One can harness protein family sequence data to infer likely contacts between pairs of residues. In the current study, we combine this kind of inference from coevolutionary information with a coarse-grained protein force field ordinarily used with single sequence input, the Associative memory, Water mediated, Structure and Energy Model (AWSEM), to achieve improved structure prediction. The resulting Associative memory, Water mediated, Structure and Energy Model with Evolutionary Restraints (AWSEM-ER) yields a significant improvement in the quality of protein structure prediction over the single sequence prediction from AWSEM when a sufficiently large number of homologous sequences are available. Free energy landscape analysis shows that the addition of the evolutionary term shifts the free energy minimum to more native-like structures, which explains the improvement in the quality of structures when performing predictions using simulated annealing. Simulations using AWSEM without coevolutionary information have proved useful in elucidating not only protein folding behavior, but also mechanisms of protein function. The success of AWSEM-ER in de novo structure prediction suggests that the enhanced model opens the door to functional studies of proteins even when no experimentally solved structures are available.
Collapse
Affiliation(s)
- Brian J Sirovetz
- Center for Theoretical Biological Physics, Rice University, Houston, Texas.,Department of Chemistry, Rice University, Houston, Texas
| | - Nicholas P Schafer
- Center for Theoretical Biological Physics, Rice University, Houston, Texas
| | - Peter G Wolynes
- Center for Theoretical Biological Physics, Rice University, Houston, Texas.,Department of Chemistry, Rice University, Houston, Texas.,Department of Physics, Rice University, Houston, Texas.,Department of Biosciences, Rice University, Houston, Texas
| |
Collapse
|
19
|
Aggregation landscapes of Huntingtin exon 1 protein fragments and the critical repeat length for the onset of Huntington's disease. Proc Natl Acad Sci U S A 2017; 114:4406-4411. [PMID: 28400517 DOI: 10.1073/pnas.1702237114] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023] Open
Abstract
Huntington's disease (HD) is a neurodegenerative disease caused by an abnormal expansion in the polyglutamine (polyQ) track of the Huntingtin (HTT) protein. The severity of the disease depends on the polyQ repeat length, arising only in patients with proteins having 36 repeats or more. Previous studies have shown that the aggregation of N-terminal fragments (encoded by HTT exon 1) underlies the disease pathology in mouse models and that the HTT exon 1 gene product can self-assemble into amyloid structures. Here, we provide detailed structural mechanisms for aggregation of several protein fragments encoded by HTT exon 1 by using the associative memory, water-mediated, structure and energy model (AWSEM) to construct their free energy landscapes. We find that the addition of the N-terminal 17-residue sequence ([Formula: see text]) facilitates polyQ aggregation by encouraging the formation of prefibrillar oligomers, whereas adding the C-terminal polyproline sequence ([Formula: see text]) inhibits aggregation. The combination of both terminal additions in HTT exon 1 fragment leads to a complex aggregation mechanism with a basic core that resembles that found for the aggregation of pure polyQ repeats using AWSEM. At the extrapolated physiological concentration, although the grand canonical free energy profiles are uphill for HTT exon 1 fragments having 20 or 30 glutamines, the aggregation landscape for fragments with 40 repeats has become downhill. This computational prediction agrees with the critical length found for the onset of HD and suggests potential therapies based on blocking early binding events involving the terminal additions to the polyQ repeats.
Collapse
|
20
|
Brown DK, Tastan Bishop Ö. Role of Structural Bioinformatics in Drug Discovery by Computational SNP Analysis: Analyzing Variation at the Protein Level. Glob Heart 2017; 12:151-161. [PMID: 28302551 DOI: 10.1016/j.gheart.2017.01.009] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 01/13/2017] [Indexed: 10/20/2022] Open
Abstract
With the completion of the human genome project at the beginning of the 21st century, the biological sciences entered an unprecedented age of data generation, and made its first steps toward an era of personalized medicine. This abundance of sequence data has led to the proliferation of numerous sequence-based techniques for associating variation with disease, such as genome-wide association studies and candidate gene association studies. However, these statistical methods do not provide an understanding of the functional effects of variation. Structure-based drug discovery and design is increasingly incorporating structural bioinformatics techniques to model and analyze protein targets, perform large scale virtual screening to identify hit to lead compounds, and simulate molecular interactions. These techniques are fast, cost-effective, and complement existing experimental techniques such as high throughput sequencing. In this paper, we discuss the contributions of structural bioinformatics to drug discovery, focusing particularly on the analysis of nonsynonymous single nucleotide polymorphisms. We conclude by suggesting a protocol for future analyses of the structural effects of nonsynonymous single nucleotide polymorphisms on proteins and protein complexes.
Collapse
Affiliation(s)
- David K Brown
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa
| | - Özlem Tastan Bishop
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Grahamstown, South Africa.
| |
Collapse
|
21
|
Chen M, Lin X, Lu W, Onuchic JN, Wolynes PG. Protein Folding and Structure Prediction from the Ground Up II: AAWSEM for α/β Proteins. J Phys Chem B 2016; 121:3473-3482. [PMID: 27797194 DOI: 10.1021/acs.jpcb.6b09347] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
The atomistic associative memory, water mediated, structure and energy model (AAWSEM) is an efficient coarse-grained force field with transferable tertiary interactions that incorporates local in sequence energetic biases using structural information derived from all-atom simulations of long segments of the protein. For α helical proteins, the accuracy of structure prediction using AAWSEM has been established previously. In this article, we examine the capability of AAWSEM to predict the structure of α/β proteins. We also elaborate on an iterative approach that uses the structures from a first round of AAWSEM simulation as fragment memories. This iterative scheme improves the quality of the structure prediction and makes the free energy profile more funneled toward native configurations. We explore the use of clustering analyses as a way of evaluating the confidence in various structure prediction models. Clustering using a local relative order parameter (mutual Q) of the predicted structural ensemble turns out to be optimal. The tightest cluster according to mutual Q generally has the most correctly folded structure. Since there is no bioinformatic input, AAWSEM amounts to an ab initio protein structure prediction method that combines the efficiency of coarse-grained simulations with the local structural accuracy that can be achieved from all-atom simulations.
Collapse
Affiliation(s)
- Mingchen Chen
- Center for Theoretical Biological Physics, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Bioengineering, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States
| | - Xingcheng Lin
- Center for Theoretical Biological Physics, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Physics and Astronomy, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States
| | - Wei Lu
- Center for Theoretical Biological Physics, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Physics and Astronomy, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States
| | - José N Onuchic
- Center for Theoretical Biological Physics, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Physics and Astronomy, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Chemistry, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Biosciences, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States
| | - Peter G Wolynes
- Center for Theoretical Biological Physics, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Physics and Astronomy, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Chemistry, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States.,Department of Biosciences, Rice University , 6100 Main St., Houston, Texas 77005-1892, United States
| |
Collapse
|
22
|
Chen M, Tsai M, Zheng W, Wolynes PG. The Aggregation Free Energy Landscapes of Polyglutamine Repeats. J Am Chem Soc 2016; 138:15197-15203. [PMID: 27786478 DOI: 10.1021/jacs.6b08665] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
Aggregates of proteins containing polyglutamine (polyQ) repeats are strongly associated with several neurodegenerative diseases. The length of the repeats correlates with the severity of the disease. Previous studies have shown that pure polyQ peptides aggregate by nucleated growth polymerization and that the size of the critical nucleus (n*) decreases from tetrameric to dimeric and monomeric as length increases from Q18 to Q26. Why the critical nucleus size changes with repeat-length has been unclear. Using the associative memory, water-mediated, structure and energy model, we construct the aggregation free energy landscapes for polyQ peptides of different repeat-lengths. These studies show that the monomer of the shorter repeat-length (Q20) prefers an extended conformation and that its aggregation indeed has a trimeric nucleus (n* ∼ 3), while a longer repeat-length monomer (Q30) prefers a β-hairpin conformation which then aggregates in a downhill fashion at 0.1 mM. For an intermediate length peptide (Q26), there is an equal preference for hairpin and extended forms in the monomer which leads to a mixed inhomogeneous nucleation mechanism for fibrils. The predicted changes of monomeric structure and nucleation mechanism are confirmed by studying the aggregation free energy profile for a polyglutamine repeat with site-specific PG mutations that favor the hairpin form, giving results in harmony with experiments on this system.
Collapse
Affiliation(s)
- Mingchen Chen
- Center for Theoretical Biological Physics, ‡Department of Bioengineering, and §Department of Chemistry, Rice University , Houston, Texas 77005, United States
| | - MinYeh Tsai
- Center for Theoretical Biological Physics, ‡Department of Bioengineering, and §Department of Chemistry, Rice University , Houston, Texas 77005, United States
| | - Weihua Zheng
- Center for Theoretical Biological Physics, ‡Department of Bioengineering, and §Department of Chemistry, Rice University , Houston, Texas 77005, United States
| | - Peter G Wolynes
- Center for Theoretical Biological Physics, ‡Department of Bioengineering, and §Department of Chemistry, Rice University , Houston, Texas 77005, United States
| |
Collapse
|
23
|
Exploring the aggregation free energy landscape of the amyloid-β protein (1-40). Proc Natl Acad Sci U S A 2016; 113:11835-11840. [PMID: 27698130 DOI: 10.1073/pnas.1612362113] [Citation(s) in RCA: 108] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022] Open
Abstract
A predictive coarse-grained protein force field [associative memory, water-mediated, structure, and energy model for molecular dynamics (AWSEM)-MD] is used to study the energy landscapes and relative stabilities of amyloid-β protein (1-40) in the monomer and all of its oligomeric forms up to an octamer. We find that an isolated monomer is mainly disordered with a short α-helix formed at the central hydrophobic core region (L17-D23). A less stable hairpin structure, however, becomes increasingly more stable in oligomers, where hydrogen bonds can form between neighboring monomers. We explore the structure and stability of both prefibrillar oligomers that consist of mainly antiparallel β-sheets and fibrillar oligomers with only parallel β-sheets. Prefibrillar oligomers are polymorphic but typically take on a cylindrin-like shape composed of mostly antiparallel β-strands. At the concentration of the simulation, the aggregation free energy landscape is nearly downhill. We use umbrella sampling along a structural progress coordinate for interconversion between prefibrillar and fibrillar forms to identify a conversion pathway between these forms. The fibrillar oligomer only becomes favored over its prefibrillar counterpart in the pentamer where an interconversion bottleneck appears. The structural characterization of the pathway along with statistical mechanical perturbation theory allow us to evaluate the effects of concentration on the free energy landscape of aggregation as well as the effects of the Dutch and Arctic mutations associated with early onset of Alzheimer's disease.
Collapse
|