1
|
Kryś JD, Gront D. Coarse-grained potential for hydrogen bond interactions. J Mol Graph Model 2023; 124:108507. [PMID: 37295157 DOI: 10.1016/j.jmgm.2023.108507] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 04/12/2023] [Accepted: 04/18/2023] [Indexed: 06/12/2023]
Abstract
Understanding protein structure and dynamics is crucial for investigating numerous biological processes. This however requires proper description of molecular interactions, most notably hydrogen bonds, which are the driving force behind the folding of protein sequences into working molecules. Due to the multi-body character of this interaction, proper mathematical formulation has been a matter of long debate in the literature. This description becomes even more complex in reduced protein models. In this contribution, we propose a novel hydrogen bond energy function definition that is based only on Cα positions and used for coarse-grained simulations. We show that this new method has the capability to recognize hydrogen bonds with over 80% accuracy and can successfully identify β-sheet in β-amyloid peptide simulations.
Collapse
Affiliation(s)
- Justyna D Kryś
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland.
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093, Warsaw, Poland
| |
Collapse
|
2
|
Sikorska C, Liwo A. Origin of Correlations between Local Conformational States of Consecutive Amino Acid Residues and Their Role in Shaping Protein Structures and in Allostery. J Phys Chem B 2022; 126:9493-9505. [PMID: 36367920 PMCID: PMC9706564 DOI: 10.1021/acs.jpcb.2c04610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/27/2022] [Indexed: 11/13/2022]
Abstract
By analyzing the Kubo-cluster-cumulant expansion of the potential of mean force of polypeptide chains corresponding to backbone-local interactions averaged over the rotation of the peptide groups about the Cα···Cα virtual bonds, we identified two important kinds of "along-chain" correlations that pertain to extended chain segments bordered by turns (usually the β-strands) and to the folded spring-like segments (usually α-helices), respectively, and are expressed as multitorsional potentials. These terms affect the positioning of structural elements with respect to each other and, consequently, contribute to determining their packing. Additionally, for extended chain segments, the correlation terms contribute to propagating the conformational change at one end to the other end, which is characteristic of allosteric interactions. We confirmed both findings by statistical analysis of the virtual-bond geometry of 77 950 proteins. Augmenting coarse-grained and, possibly, all-atom force fields with these correlation terms could improve their capacity to model protein structure and dynamics.
Collapse
Affiliation(s)
- Celina Sikorska
- The
MacDiarmid Institute for Advanced Materials and Nanotechnology, Department
of Physics, The University of Auckland,
Private Bag 92019, Auckland1142, New Zealand
| | - Adam Liwo
- Faculty
of Chemistry, University of Gdańsk,
Fahrenheit Union of Universities in Gdańsk, Wita Stwosza 63, 80-308Gdańsk, Poland
| |
Collapse
|
3
|
Automated Protein Secondary Structure Assignment from C α Positions Using Neural Networks. Biomolecules 2022; 12:biom12060841. [PMID: 35740966 PMCID: PMC9220970 DOI: 10.3390/biom12060841] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/25/2022] [Revised: 06/10/2022] [Accepted: 06/14/2022] [Indexed: 11/17/2022] Open
Abstract
The assignment of secondary structure elements in protein conformations is necessary to interpret a protein model that has been established by computational methods. The process essentially involves labeling the amino acid residues with H (Helix), E (Strand), or C (Coil, also known as Loop). When particular atoms are absent from an input protein structure, the procedure becomes more complicated, especially when only the alpha carbon locations are known. Various techniques have been tested and applied to this problem during the last forty years. The application of machine learning techniques is the most recent trend. This contribution presents the HECA classifier, which uses neural networks to assign protein secondary structure types. The technique exclusively employs Cα coordinates. The Keras (TensorFlow) library was used to implement and train the neural network model. The BioShell toolkit was used to calculate the neural network input features from raw coordinates. The study’s findings show that neural network-based methods may be successfully used to take on structure assignment challenges when only Cα trace is available. Thanks to the careful selection of input features, our approach’s accuracy (above 97%) exceeded that of the existing methods.
Collapse
|
4
|
Zhong Q, Li G. Adaptively Iterative Multiscale Switching Simulation Strategy and Applications to Protein Folding and Structure Prediction. J Phys Chem Lett 2021; 12:3151-3162. [PMID: 33755493 DOI: 10.1021/acs.jpclett.1c00618] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2023]
Abstract
Structure prediction is an important means to quickly understand new protein functions. However, the prediction of effects of proteins that have no detectable templates is still to be improved. Molecular dynamics simulation is supposed to be the primary research tool for structure predictions, but it still has limitations of huge computational cost in all-atom (AA) models and rough accuracy in coarse-grained (CG) models. We propose a universal multiscale simulation strategy named AIMS in which simulations can iteratively switch among multiple resolutions in order to adaptively trade off AA accuracy and CG high-efficiency. AIMS follows the idea of CG-guided enhanced sampling so that final results always keep AA accuracy. We successfully achieve four ab initio and four data-assisted protein structure predictions using AIMS. The prediction result is an ensemble rather than a structure and provides special insights on folding metastable states. AIMS is estimated to achieve a computational speed about 40 times faster than that of conventional AA simulations.
Collapse
Affiliation(s)
- Qinglu Zhong
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Guohui Li
- Laboratory of Molecular Modeling and Design, State Key Laboratory of Molecular Reaction Dynamics, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China
| |
Collapse
|
5
|
Badaczewska-Dawid AE, Kolinski A, Kmiecik S. Protocols for Fast Simulations of Protein Structure Flexibility Using CABS-Flex and SURPASS. Methods Mol Biol 2021; 2165:337-353. [PMID: 32621235 DOI: 10.1007/978-1-0716-0708-4_20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Conformational flexibility of protein structures can play an important role in protein function. The flexibility is often studied using computational methods since experimental characterization can be difficult. Depending on protein system size, computational tools may require large computational resources or significant simplifications in the modeled systems to speed up calculations. In this work, we present the protocols for efficient simulations of flexibility of folded protein structures that use coarse-grained simulation tools of different resolutions: medium, represented by CABS-flex, and low, represented by SUPRASS. We test the protocols using a set of 140 globular proteins and compare the results with structure fluctuations observed in MD simulations, ENM modeling, and NMR ensembles. As demonstrated, CABS-flex predictions show high correlation to experimental and MD simulation data, while SURPASS is less accurate but promising in terms of future developments.
Collapse
Affiliation(s)
- Aleksandra E Badaczewska-Dawid
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Warsaw, Poland.,Department of Chemistry, Iowa State University, Ames, IA, USA
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Warsaw, Poland
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Warsaw, Poland.
| |
Collapse
|
6
|
Badaczewska-Dawid AE, Kolinski A, Kmiecik S. Computational reconstruction of atomistic protein structures from coarse-grained models. Comput Struct Biotechnol J 2019; 18:162-176. [PMID: 31969975 PMCID: PMC6961067 DOI: 10.1016/j.csbj.2019.12.007] [Citation(s) in RCA: 34] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2019] [Accepted: 12/10/2019] [Indexed: 01/02/2023] Open
Abstract
Three-dimensional protein structures, whether determined experimentally or theoretically, are often too low resolution. In this mini-review, we outline the computational methods for protein structure reconstruction from incomplete coarse-grained to all atomistic models. Typical reconstruction schemes can be divided into four major steps. Usually, the first step is reconstruction of the protein backbone chain starting from the C-alpha trace. This is followed by side-chains rebuilding based on protein backbone geometry. Subsequently, hydrogen atoms can be reconstructed. Finally, the resulting all-atom models may require structure optimization. Many methods are available to perform each of these tasks. We discuss the available tools and their potential applications in integrative modeling pipelines that can transfer coarse-grained information from computational predictions, or experiment, to all atomistic structures.
Collapse
Affiliation(s)
| | | | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland
| |
Collapse
|
7
|
Ciemny MP, Badaczewska-Dawid AE, Pikuzinska M, Kolinski A, Kmiecik S. Modeling of Disordered Protein Structures Using Monte Carlo Simulations and Knowledge-Based Statistical Force Fields. Int J Mol Sci 2019; 20:E606. [PMID: 30708941 PMCID: PMC6386871 DOI: 10.3390/ijms20030606] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/13/2018] [Revised: 01/23/2019] [Accepted: 01/29/2019] [Indexed: 12/20/2022] Open
Abstract
The description of protein disordered states is important for understanding protein folding mechanisms and their functions. In this short review, we briefly describe a simulation approach to modeling protein interactions, which involve disordered peptide partners or intrinsically disordered protein regions, and unfolded states of globular proteins. It is based on the CABS coarse-grained protein model that uses a Monte Carlo (MC) sampling scheme and a knowledge-based statistical force field. We review several case studies showing that description of protein disordered states resulting from CABS simulations is consistent with experimental data. The case studies comprise investigations of protein⁻peptide binding and protein folding processes. The CABS model has been recently made available as the simulation engine of multiscale modeling tools enabling studies of protein⁻peptide docking and protein flexibility. Those tools offer customization of the modeling process, driving the conformational search using distance restraints, reconstruction of selected models to all-atom resolution, and simulation of large protein systems in a reasonable computational time. Therefore, CABS can be combined in integrative modeling pipelines incorporating experimental data and other modeling tools of various resolution.
Collapse
Affiliation(s)
- Maciej Pawel Ciemny
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
- Faculty of Physics, University of Warsaw, Pasteura 5, 02-093 Warsaw, Poland.
| | | | - Monika Pikuzinska
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Andrzej Kolinski
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| | - Sebastian Kmiecik
- Faculty of Chemistry, Biological and Chemical Research Center, University of Warsaw, Pasteura 1, 02-093 Warsaw, Poland.
| |
Collapse
|
8
|
Modeling of Protein Structural Flexibility and Large-Scale Dynamics: Coarse-Grained Simulations and Elastic Network Models. Int J Mol Sci 2018; 19:ijms19113496. [PMID: 30404229 PMCID: PMC6274762 DOI: 10.3390/ijms19113496] [Citation(s) in RCA: 42] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2018] [Revised: 10/29/2018] [Accepted: 10/31/2018] [Indexed: 12/13/2022] Open
Abstract
Fluctuations of protein three-dimensional structures and large-scale conformational transitions are crucial for the biological function of proteins and their complexes. Experimental studies of such phenomena remain very challenging and therefore molecular modeling can be a good alternative or a valuable supporting tool for the investigation of large molecular systems and long-time events. In this minireview, we present two alternative approaches to the coarse-grained (CG) modeling of dynamic properties of protein systems. We discuss two CG representations of polypeptide chains used for Monte Carlo dynamics simulations of protein local dynamics and conformational transitions, and highly simplified structure-based elastic network models of protein flexibility. In contrast to classical all-atom molecular dynamics, the modeling strategies discussed here allow the quite accurate modeling of much larger systems and longer-time dynamic phenomena. We briefly describe the main features of these models and outline some of their applications, including modeling of near-native structure fluctuations, sampling of large regions of the protein conformational space, or possible support for the structure prediction of large proteins and their complexes.
Collapse
|