1
|
Jilani M, Turcan A, Haspel N, Jagodzinski F. Elucidating the Structural Impacts of Protein InDels. Biomolecules 2022; 12:1435. [PMID: 36291643 PMCID: PMC9599607 DOI: 10.3390/biom12101435] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2022] [Revised: 09/23/2022] [Accepted: 09/27/2022] [Indexed: 09/17/2023] Open
Abstract
The effects of amino acid insertions and deletions (InDels) remain a rather under-explored area of structural biology. These variations oftentimes are the cause of numerous disease phenotypes. In spite of this, research to study InDels and their structural significance remains limited, primarily due to a lack of experimental information and computational methods. In this work, we fill this gap by modeling InDels computationally; we investigate the rigidity differences between the wildtype and a mutant variant with one or more InDels. Further, we compare how structural effects due to InDels differ from the effects of amino acid substitutions, which are another type of amino acid mutation. We finish by performing a correlation analysis between our rigidity-based metrics and wet lab data for their ability to infer the effects of InDels on protein fitness.
Collapse
Affiliation(s)
- Muneeba Jilani
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Alistair Turcan
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA
| | - Nurit Haspel
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA
| | - Filip Jagodzinski
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA
| |
Collapse
|
2
|
PETRA: Drug Engineering via Rigidity Analysis. Molecules 2020; 25:molecules25061304. [PMID: 32178472 PMCID: PMC7144111 DOI: 10.3390/molecules25061304] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2020] [Revised: 02/19/2020] [Accepted: 02/25/2020] [Indexed: 11/23/2022] Open
Abstract
Rational drug design aims to develop pharmaceutical agents that impart maximal therapeutic benefits via their interaction with their intended biological targets. In the past several decades, advances in computational tools that inform wet-lab techniques have aided the development of a wide variety of new medicines with high efficacies. Nonetheless, drug development remains a time and cost intensive process. In this work, we have developed a computational pipeline for assessing how individual atoms contribute to a ligand’s effect on the structural stability of a biological target. Our approach takes as input a protein-ligand resolved PDB structure file and systematically generates all possible ligand variants. We assess how the atomic-level edits to the ligand alter the drug’s effect via a graph theoretic rigidity analysis approach. We demonstrate, via four case studies of common drugs, the utility of our pipeline and corroborate our analyses with known biophysical properties of the medicines, as reported in the literature.
Collapse
|
3
|
Robust Prediction of Single and Multiple Point Protein Mutations Stability Changes. Biomolecules 2019; 10:biom10010067. [PMID: 31906171 PMCID: PMC7023245 DOI: 10.3390/biom10010067] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Revised: 12/19/2019] [Accepted: 12/20/2019] [Indexed: 11/16/2022] Open
Abstract
Accurate prediction of protein stability changes resulting from amino acid substitutions is of utmost importance in medicine to better understand which mutations are deleterious, leading to diseases, and which are neutral. Since conducting wet lab experiments to get a better understanding of protein mutations is costly and time consuming, and because of huge number of possible mutations the need of computational methods that could accurately predict effects of amino acid mutations is of greatest importance. In this research, we present a robust methodology to predict the energy changes of a proteins upon mutations. The proposed prediction scheme is based on two step algorithm that is a Holdout Random Sampler followed by a neural network model for regression. The Holdout Random Sampler is utilized to analysis the energy change, the corresponding uncertainty, and to obtain a set of admissible energy changes, expressed as a cumulative distribution function. These values are further utilized to train a simple neural network model that can predict the energy changes. Results were blindly tested (validated) against experimental energy changes, giving Pearson correlation coefficients of 0.66 for Single Point Mutations and 0.77 for Multiple Point Mutations. These results confirm the successfulness of our method, since it outperforms majority of previous studies in this field.
Collapse
|
4
|
Rodriguez PM, Stratmann D, Duprat E, Papandreou N, Acuna R, Lacroix Z, Chomilier J. Correlating topology and thermodynamics to predict protein structure sensitivity to point mutations. BIO-ALGORITHMS AND MED-SYSTEMS 2018. [DOI: 10.1515/bams-2018-0026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
Abstract
AbstractThe relation between distribution of hydrophobic amino acids along with protein chains and their structure is far from being completely understood. No reliable method allowsab initioprediction of the folded structure from this distribution of physicochemical properties, even when they are highly degenerated by considering only two classes: hydrophobic and polar. Establishment of long-range hydrophobic three dimension (3D) contacts is essential for the formation of the nucleus, a key process in the early steps of protein folding. Thus, a large number of 3D simulation studies were developed to challenge this issue. They are nowadays evaluated in a specific chapter of the molecular modeling competition, Critical Assessment of Protein Structure Prediction. We present here a simulation of the early steps of the folding process for 850 proteins, performed in a discrete 3D space, which results in peaks in the predicted distribution of intra-chain noncovalent contacts. The residues located at these peak positions tend to be buried in the core of the protein and are expected to correspond to critical positions in the sequence, important both for folding and structural (or similarly, energetic in the thermodynamic hypothesis) stability. The degree of stabilization or destabilization due to a point mutation at the critical positions involved in numerous contacts is estimated from the calculated folding free energy difference between mutated and native structures. The results show that these critical positions are not tolerant towards mutation. This simulation of the noncovalent contacts only needs a sequence as input, and this paper proposes a validation of the method by comparison with the prediction of stability by well-established programs.
Collapse
|
5
|
Dehghanpoor R, Ricks E, Hursh K, Gunderson S, Farhoodi R, Haspel N, Hutchinson B, Jagodzinski F. Predicting the Effect of Single and Multiple Mutations on Protein Structural Stability. Molecules 2018; 23:molecules23020251. [PMID: 29382060 PMCID: PMC6017198 DOI: 10.3390/molecules23020251] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/24/2017] [Revised: 01/15/2018] [Accepted: 01/19/2018] [Indexed: 01/06/2023] Open
Abstract
Predicting how a point mutation alters a protein’s stability can guide pharmaceutical drug design initiatives which aim to counter the effects of serious diseases. Conducting mutagenesis studies in physical proteins can give insights about the effects of amino acid substitutions, but such wet-lab work is prohibitive due to the time as well as financial resources needed to assess the effect of even a single amino acid substitution. Computational methods for predicting the effects of a mutation on a protein structure can complement wet-lab work, and varying approaches are available with promising accuracy rates. In this work we compare and assess the utility of several machine learning methods and their ability to predict the effects of single and double mutations. We in silico generate mutant protein structures, and compute several rigidity metrics for each of them. We use these as features for our Support Vector Regression (SVR), Random Forest (RF), and Deep Neural Network (DNN) methods. We validate the predictions of our in silico mutations against experimental ΔΔG stability data, and attain Pearson Correlation values upwards of 0.71 for single mutations, and 0.81 for double mutations. We perform ablation studies to assess which features contribute most to a model’s success, and also introduce a voting scheme to synthesize a single prediction from the individual predictions of the three models.
Collapse
Affiliation(s)
- Ramin Dehghanpoor
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA.
| | - Evan Ricks
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA.
| | - Katie Hursh
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA.
| | - Sarah Gunderson
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA.
| | - Roshanak Farhoodi
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA.
| | - Nurit Haspel
- Department of Computer Science, University of Massachusetts Boston, Boston, MA 02125, USA.
| | - Brian Hutchinson
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA.
- Computing and Analytics Division, Pacific Northwest National Laboratory; Richland, WA 99354, USA.
| | - Filip Jagodzinski
- Department of Computer Science, Western Washington University, Bellingham, WA 98225, USA.
| |
Collapse
|
6
|
Siderius M, Jagodzinski F. Mutation Sensitivity Maps: Identifying Residue Substitutions That Impact Protein Structure Via a Rigidity Analysis In Silico Mutation Approach. J Comput Biol 2017; 25:89-102. [PMID: 29035580 DOI: 10.1089/cmb.2017.0165] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023] Open
Abstract
Understanding how an amino acid substitution affects a protein's structure can aid in the design of pharmaceutical drugs that aim at countering diseases caused by protein mutants. Unfortunately, performing even a few amino acid substitutions in vitro is both time and cost prohibitive, whereas an exhaustive analysis that involves systematically mutating all amino acids in the physical protein is infeasible. Computational methods have been developed to predict the effects of mutations, but even many of them are computationally intensive or are else dependent on homology or experimental data that may not be available for the protein being studied. In this work, we motivate and present a computation pipeline whose only input is a Protein Data Bank file containing the 3D coordinates of the atoms of a biomolecule. Our high-throughput approach uses our ProMuteHT algorithm to exhaustively generate in silico amino acid substitutions at each residue, and it also includes an energy minimization option. This is in contrast to our previous work, where we analyzed the effects of in silico mutations to Alanine, Serine, and Glycine only. We exploit the speed of a fast rigidity analysis approach to analyze our protein variants, and develop a Mutation Sensitivity (MuSe) Map, to permit identifying residues that are most sensitive to mutations. We present a case study to show the degree to which a MuSe Map and whisker plots are able to locate amino acids whose mutations most affect a protein's structure as inferred from a rigidity analysis approach.
Collapse
Affiliation(s)
- Michael Siderius
- Department of Computer Science, Western Washington University , Bellingham, Washington
| | - Filip Jagodzinski
- Department of Computer Science, Western Washington University , Bellingham, Washington
| |
Collapse
|
7
|
Abstract
We describe efficient methods for consistently coloring and visualizing collections of rigid cluster decompositions obtained from variations of a protein structure, and lay the foundation for more complex setups, that may involve different computational and experimental methods. The focus here is on three biological applications: the conceptually simpler problems of visualizing results of dilution and mutation analyses, and the more complex task of matching decompositions of multiple Nucleic Magnetic Resonance (NMR) models of the same protein. Implemented into the KINematics And RIgidity (KINARI) web server application, the improved visualization techniques give useful information about protein folding cores, help examining the effect of mutations on protein flexibility and function, and provide insights into the structural motions of Protein Data Bank proteins solved with solution NMR. These tools have been developed with the goal of improving and validating rigidity analysis as a credible coarse-grained model capturing essential information about a protein's slow motions near the native state.
Collapse
|
8
|
Abstract
In proteins, certain amino acids may play a critical role in determining their structure and function. Examples include flexible regions, which allow domain motions, and highly conserved residues on functional interfaces, which play a role in binding and interaction with other proteins. Detecting these regions facilitates the analysis and simulation of protein rigidity and conformational changes, and aids in characterizing protein-protein binding. We present a protocol that combines graph-theory rigidity analysis and machine-learning-based methods for predicting critical residues in proteins. Our approach combines amino-acid specific information and data obtained by two complementary methods. One method, KINARI, performs graph-based analysis to find rigid clusters of amino acids in a protein, while the other method relies on evolutionary conservation scores to find functional interfaces in proteins. Our machine learning model combines both methods, in addition to amino acid type and solvent-accessible surface area.
Collapse
|
9
|
Streinu I. Large scale rigidity-based flexibility analysis of biomolecules. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2016; 3:012005. [PMID: 26958583 PMCID: PMC4760970 DOI: 10.1063/1.4942414] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/06/2016] [Accepted: 02/08/2016] [Indexed: 06/05/2023]
Abstract
KINematics And RIgidity (KINARI) is an on-going project for in silico flexibility analysis of proteins. The new version of the software, Kinari-2, extends the functionality of our free web server KinariWeb, incorporates advanced web technologies, emphasizes the reproducibility of its experiments, and makes substantially improved tools available to the user. It is designed specifically for large scale experiments, in particular, for (a) very large molecules, including bioassemblies with high degree of symmetry such as viruses and crystals, (b) large collections of related biomolecules, such as those obtained through simulated dilutions, mutations, or conformational changes from various types of dynamics simulations, and (c) is intended to work as seemlessly as possible on the large, idiosyncratic, publicly available repository of biomolecules, the Protein Data Bank. We describe the system design, along with the main data processing, computational, mathematical, and validation challenges underlying this phase of the KINARI project.
Collapse
Affiliation(s)
- Ileana Streinu
- Department of Computer Science, Smith College , Northampton, Massachusetts 01063, USA
| |
Collapse
|
10
|
Akbal-Delibas B, Jagodzinski F, Haspel N. A conservation and rigidity based method for detecting critical protein residues. BMC STRUCTURAL BIOLOGY 2013; 13 Suppl 1:S6. [PMID: 24565061 PMCID: PMC3952096 DOI: 10.1186/1472-6807-13-s1-s6] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
Background Certain amino acids in proteins play a critical role in determining their structural stability and function. Examples include flexible regions such as hinges which allow domain motion, and highly conserved residues on functional interfaces which allow interactions with other proteins. Detecting these regions can aid in the analysis and simulation of protein rigidity and conformational changes, and helps characterizing protein binding and docking. We present an analysis of critical residues in proteins using a combination of two complementary techniques. One method performs in-silico mutations and analyzes the protein's rigidity to infer the role of a point substitution to Glycine or Alanine. The other method uses evolutionary conservation to find functional interfaces in proteins. Results We applied the two methods to a dataset of proteins, including biomolecules with experimentally known critical residues as determined by the free energy of unfolding. Our results show that the combination of the two methods can detect the vast majority of critical residues in tested proteins. Conclusions Our results show that the combination of the two methods has the potential to detect more information than each method separately. Future work will provide a confidence level for the criticalness of a residue to improve the accuracy of our method and eliminate false positives. Once the combined methods are integrated into one scoring function, it can be applied to other domains such as estimating functional interfaces.
Collapse
|
11
|
Jagodzinski F, Clark P, Grant J, Liu T, Monastra S, Streinu I. Rigidity analysis of protein biological assemblies and periodic crystal structures. BMC Bioinformatics 2013; 14 Suppl 18:S2. [PMID: 24564201 PMCID: PMC3817814 DOI: 10.1186/1471-2105-14-s18-s2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022] Open
Abstract
Background We initiate in silico rigidity-theoretical studies of biological assemblies and small crystals for protein structures. The goal is to determine if, and how, the interactions among neighboring cells and subchains affect the flexibility of a molecule in its crystallized state. We use experimental X-ray crystallography data from the Protein Data Bank (PDB). The analysis relies on an effcient graph-based algorithm. Computational experiments were performed using new protein rigidity analysis tools available in the new release of our KINARI-Web server http://kinari.cs.umass.edu. Results We provide two types of results: on biological assemblies and on crystals. We found that when only isolated subchains are considered, structural and functional information may be missed. Indeed, the rigidity of biological assemblies is sometimes dependent on the count and placement of hydrogen bonds and other interactions among the individual subchains of the biological unit. Similarly, the rigidity of small crystals may be affected by the interactions between atoms belonging to different unit cells. We have analyzed a dataset of approximately 300 proteins, from which we generated 982 crystals (some of which are biological assemblies). We identified two types of behaviors. (a) Some crystals and/or biological assemblies will aggregate into rigid bodies that span multiple unit cells/asymmetric units. Some of them create substantially larger rigid cluster in the crystal/biological assembly form, while in other cases, the aggregation has a smaller effect just at the interface between the units. (b) In other cases, the rigidity properties of the asymmetric units are retained, because the rigid bodies did not combine. We also identified two interesting cases where rigidity analysis may be correlated with the functional behavior of the protein. This type of information, identified here for the first time, depends critically on the ability to create crystals and biological assemblies, and would not have been observed only from the asymmetric unit. For the Ribonuclease A protein (PDB file 5RSA), which is functionally active in the crystallized form, we found that the individual protein and its crystal form retain the flexibility parameters between the two states. In contrast, a derivative of Ribonuclease A (PDB file 9RSA), has no functional activity, and the protein in both the asymmetric and crystalline forms, is very rigid. For the vaccinia virus D13 scaffolding protein (PDB file 3SAQ), which has two biological assemblies, we observed a striking asymmetry in the rigidity cluster decomposition of one of them, which seems implausible, given its symmetry. Upon careful investigation, we tracked the cause to a placement decision by the Reduce software concerning the hydrogen atoms, thus affecting the distribution of certain hydrogen bonds. The surprising result is that the presence or lack of a very few, but critical, hydrogen bonds, can drastically affect the rigid cluster decomposition of the biological assembly. Conclusion The rigidity analysis of a single asymmetric unit may not accurately reflect the protein's behavior in the tightly packed crystal environment. Using our KINARI software, we demonstrated that additional functional and rigidity information can be gained by analyzing a protein's biological assembly and/or crystal structure. However, performing a larger scale study would be computationally expensive (due to the size of the molecules involved). Overcoming this limitation will require novel mathematical and computational extensions to our software.
Collapse
|
12
|
Abstract
Protein rigidity and flexibility can be analyzed accurately and efficiently using the program floppy inclusion and rigid substructure topography (FIRST). Previous studies using FIRST were designed to analyze the rigidity and flexibility of proteins using a single static (snapshot) structure. It is however well known that proteins can undergo spontaneous sub-molecular unfolding and refolding, or conformational dynamics, even under conditions that strongly favor a well-defined native structure. These (local) unfolding events result in a large number of conformers that differ from each other very slightly. In this context, proteins are better represented as a thermodynamic ensemble of 'native-like' structures, and not just as a single static low-energy structure. Working with this notion, we introduce a novel FIRST-based approach for predicting rigidity/flexibility of the protein ensemble by (i) averaging the hydrogen bonding strengths from the entire ensemble and (ii) by refining the mathematical model of hydrogen bonds. Furthermore, we combine our FIRST-ensemble rigidity predictions with the ensemble solvent accessibility data of the backbone amides and propose a novel computational method which uses both rigidity and solvent accessibility for predicting hydrogen-deuterium exchange (HDX). To validate our predictions, we report a novel site specific HDX experiment which characterizes the native structural ensemble of Acylphosphatase from hyperthermophile Sulfolobus solfataricus (Sso AcP). The sub-structural conformational dynamics that is observed by HDX data, is closely matched with the FIRST-ensemble rigidity predictions, which could not be attained using the traditional single 'snapshot' rigidity analysis. Moreover, the computational predictions of regions that are protected from HDX and those that undergo exchange are in very good agreement with the experimental HDX profile of Sso AcP.
Collapse
Affiliation(s)
- Adnan Sljoka
- Department of Mathematics and Statistics, York University, 4700 Keele Street, Toronto, M3J 1P3, Canada
| | | |
Collapse
|