1
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
2
|
Saldaño T, Escobedo N, Marchetti J, Zea DJ, Mac Donagh J, Velez Rueda AJ, Gonik E, García Melani A, Novomisky Nechcoff J, Salas MN, Peters T, Demitroff N, Fernandez Alberti S, Palopoli N, Fornasari MS, Parisi G. Impact of protein conformational diversity on AlphaFold predictions. Bioinformatics 2022; 38:2742-2748. [PMID: 35561203 DOI: 10.1093/bioinformatics/btac202] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 02/10/2022] [Accepted: 03/31/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION After the outstanding breakthrough of AlphaFold in predicting protein 3D models, new questions appeared and remain unanswered. The ensemble nature of proteins, for example, challenges the structural prediction methods because the models should represent a set of conformers instead of single structures. The evolutionary and structural features captured by effective deep learning techniques may unveil the information to generate several diverse conformations from a single sequence. Here, we address the performance of AlphaFold2 predictions obtained through ColabFold under this ensemble paradigm. RESULTS Using a curated collection of apo-holo pairs of conformers, we found that AlphaFold2 predicts the holo form of a protein in ∼70% of the cases, being unable to reproduce the observed conformational diversity with the same error for both conformers. More importantly, we found that AlphaFold2's performance worsens with the increasing conformational diversity of the studied protein. This impairment is related to the heterogeneity in the degree of conformational diversity found between different members of the homologous family of the protein under study. Finally, we found that main-chain flexibility associated with apo-holo pairs of conformers negatively correlates with the predicted local model quality score plDDT, indicating that plDDT values in a single 3D model could be used to infer local conformational changes linked to ligand binding transitions. AVAILABILITY AND IMPLEMENTATION Data and code used in this manuscript are publicly available at https://gitlab.com/sbgunq/publications/af2confdiv-oct2021. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tadeo Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Julia Marchetti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | | | - Juan Mac Donagh
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Ana Julia Velez Rueda
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Eduardo Gonik
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- INIFTA (CONICET-UNLP) - Fotoquímica y Nanomateriales para el Ambiente y la Biología (nanoFOT), La Plata, Argentina
| | | | | | - Martín N Salas
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Tomás Peters
- Fundación Instituto Leloir-Instituto de Investigaciones Bioquímicas de Buenos Aires, Buenos Aires, Argentina
| | - Nicolás Demitroff
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- Fundación Instituto Leloir-Instituto de Investigaciones Bioquímicas de Buenos Aires, Buenos Aires, Argentina
| | - Sebastian Fernandez Alberti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| |
Collapse
|
3
|
Palopoli N, Marchetti J, Monzon AM, Zea DJ, Tosatto SCE, Fornasari MS, Parisi G. Intrinsically Disordered Protein Ensembles Shape Evolutionary Rates Revealing Conformational Patterns. J Mol Biol 2020; 433:166751. [PMID: 33310020 DOI: 10.1016/j.jmb.2020.166751] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 12/01/2020] [Accepted: 12/05/2020] [Indexed: 10/22/2022]
Abstract
Intrinsically disordered proteins (IDPs) lack stable tertiary structure under physiological conditions. The unique composition and complex dynamical behaviour of IDPs make them a challenge for structural biology and molecular evolution studies. Using NMR ensembles, we found that IDPs evolve under a strong site-specific evolutionary rate heterogeneity, mainly originated by different constraints derived from their inter-residue contacts. Evolutionary rate profiles correlate with the experimentally observed conformational diversity of the protein, allowing the description of different conformational patterns possibly related to their structure-function relationships. The correlation between evolutionary rates and contact information improves when structural information is taken not from any individual conformer or the whole ensemble, but from combining a limited number of conformers. Our results suggest that residue contacts in disordered regions constrain evolutionary rates to conserve the dynamic behaviour of the ensemble and that evolutionary rates can be used as a proxy for the conformational diversity of IDPs.
Collapse
Affiliation(s)
- Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Julia Marchetti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | | | - Diego J Zea
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| | | | - Maria S Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina.
| |
Collapse
|
4
|
Gabriel JE. Clustering accentuated matches and highly conserved domains between bacterial and human heat shock gene and protein. BRAZ J BIOL 2019; 80:943-945. [PMID: 31800763 DOI: 10.1590/1519-6984.219608] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2019] [Accepted: 08/20/2019] [Indexed: 11/21/2022] Open
Affiliation(s)
- J E Gabriel
- Universidade Federal do Vale do São Francisco - UNIVASF, Campus Centro Universitário, Av. José de Sá Maniçoba, s/n, Centro, CEP 56304-917, Petrolina, PE, Brasil
| |
Collapse
|
5
|
Goyal VD, Sullivan BJ, Magliery TJ. Phylogenetic spread of sequence data affects fitness of consensus enzymes: Insights from triosephosphate isomerase. Proteins 2019; 88:274-283. [PMID: 31407418 DOI: 10.1002/prot.25799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2018] [Revised: 07/26/2019] [Accepted: 08/08/2019] [Indexed: 11/08/2022]
Abstract
The concept of consensus in multiple sequence alignments (MSAs) has been used to design and engineer proteins previously with some success. However, consensus design implicitly assumes that all amino acid positions function independently, whereas in reality, the amino acids in a protein interact with each other and work cooperatively to produce the optimum structure required for its function. Correlation analysis is a tool that can capture the effect of such interactions. In a previously published study, we made consensus variants of the triosephosphate isomerase (TIM) protein using MSAs that included sequences form both prokaryotic and eukaryotic organisms. These variants were not completely native-like and were also surprisingly different from each other in terms of oligomeric state, structural dynamics, and activity. Extensive correlation analysis of the TIM database has revealed some clues about factors leading to the unusual behavior of the previously constructed consensus proteins. Among other things, we have found that the more ill-behaved consensus mutant had more broken correlations than the better-behaved consensus variant. Moreover, we report three correlation and phylogeny-based consensus variants of TIM. These variants were more native-like than the previous consensus mutants and considerably more stable than a wild-type TIM from a mesophilic organism. This study highlights the importance of choosing the appropriate diversity of MSA for consensus analysis and provides information that can be used to engineer stable enzymes.
Collapse
Affiliation(s)
- Venuka Durani Goyal
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| | - Brandon J Sullivan
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio.,Ohio State Biochemistry Program, The Ohio State University, Columbus, Ohio
| | - Thomas J Magliery
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio
| |
Collapse
|
6
|
Saldaño TE, Tosatto SCE, Parisi G, Fernandez-Alberti S. Network analysis of dynamically important residues in protein structures mediating ligand-binding conformational changes. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2019; 48:559-568. [PMID: 31273390 DOI: 10.1007/s00249-019-01384-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 05/31/2019] [Accepted: 07/01/2019] [Indexed: 11/26/2022]
Abstract
According to the generalized conformational selection model, ligand binding involves the co-existence of at least two conformers with different ligand-affinities in a dynamical equilibrium. Conformational transitions between them should be guaranteed by intramolecular vibrational dynamics associated to each conformation. These motions are, therefore, related to the biological function of a protein. Positions whose mutations are found to alter these vibrations the most can be defined as key positions, that is, dynamically important residues that mediate the ligand-binding conformational change. In a previous study, we have shown that these positions are evolutionarily conserved. They correspond to buried aliphatic residues mostly localized in regular structured regions of the protein like β-sheets and α-helices. In the present paper, we perform a network analysis of these key positions for a large dataset of paired protein structures in the ligand-free and ligand-bound form. We observe that networks of interactions between these key positions present larger and more integrated networks with faster transmission of the information. Besides, networks of residues result that are robust to conformational changes. Our results reveal that the conformational diversity of proteins seems to be guaranteed by a network of strongly interconnected key positions rather than individual residues.
Collapse
Affiliation(s)
- Tadeo E Saldaño
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 5131, Padua, Italy
| | - Gustavo Parisi
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | | |
Collapse
|
7
|
Abstract
Classically, phenotype is what is observed, and genotype is the genetic makeup. Statistical studies aim to project phenotypic likelihoods of genotypic patterns. The traditional genotype-to-phenotype theory embraces the view that the encoded protein shape together with gene expression level largely determines the resulting phenotypic trait. Here, we point out that the molecular biology revolution at the turn of the century explained that the gene encodes not one but ensembles of conformations, which in turn spell all possible gene-associated phenotypes. The significance of a dynamic ensemble view is in understanding the linkage between genetic change and the gained observable physical or biochemical characteristics. Thus, despite the transformative shift in our understanding of the basis of protein structure and function, the literature still commonly relates to the classical genotype-phenotype paradigm. This is important because an ensemble view clarifies how even seemingly small genetic alterations can lead to pleiotropic traits in adaptive evolution and in disease, why cellular pathways can be modified in monogenic and polygenic traits, and how the environment may tweak protein function.
Collapse
Affiliation(s)
- Ruth Nussinov
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
- Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv, Israel
| | - Chung-Jung Tsai
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
| | - Hyunbum Jang
- Cancer and Inflammation Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research, National Cancer Institute at Frederick, Frederick, Maryland, United States of America
| |
Collapse
|
8
|
Liang Z, Verkhivker GM, Hu G. Integration of network models and evolutionary analysis into high-throughput modeling of protein dynamics and allosteric regulation: theory, tools and applications. Brief Bioinform 2019; 21:815-835. [DOI: 10.1093/bib/bbz029] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2018] [Revised: 02/04/2019] [Accepted: 02/21/2019] [Indexed: 12/24/2022] Open
Abstract
Abstract
Proteins are dynamical entities that undergo a plethora of conformational changes, accomplishing their biological functions. Molecular dynamics simulation and normal mode analysis methods have become the gold standard for studying protein dynamics, analyzing molecular mechanism and allosteric regulation of biological systems. The enormous amount of the ensemble-based experimental and computational data on protein structure and dynamics has presented a major challenge for the high-throughput modeling of protein regulation and molecular mechanisms. In parallel, bioinformatics and systems biology approaches including genomic analysis, coevolution and network-based modeling have provided an array of powerful tools that complemented and enriched biophysical insights by enabling high-throughput analysis of biological data and dissection of global molecular signatures underlying mechanisms of protein function and interactions in the cellular environment. These developments have provided a powerful interdisciplinary framework for quantifying the relationships between protein dynamics and allosteric regulation, allowing for high-throughput modeling and engineering of molecular mechanisms. Here, we review fundamental advances in protein dynamics, network theory and coevolutionary analysis that have provided foundation for rapidly growing computational tools for modeling of allosteric regulation. We discuss recent developments in these interdisciplinary areas bridging computational biophysics and network biology, focusing on promising applications in allosteric regulations, including the investigation of allosteric communication pathways, protein–DNA/RNA interactions and disease mutations in genomic medicine. We conclude by formulating and discussing future directions and potential challenges facing quantitative computational investigations of allosteric regulatory mechanisms in protein systems.
Collapse
Affiliation(s)
- Zhongjie Liang
- School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| | - Gennady M Verkhivker
- Department of Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, CA, USA
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, CA, USA
| | - Guang Hu
- School of Biology and Basic Medical Sciences, Soochow University, Suzhou, China
| |
Collapse
|
9
|
Ensembles from Ordered and Disordered Proteins Reveal Similar Structural Constraints during Evolution. J Mol Biol 2019; 431:1298-1307. [DOI: 10.1016/j.jmb.2019.01.031] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 01/23/2019] [Accepted: 01/24/2019] [Indexed: 01/08/2023]
|
10
|
da Silva Neto AM, Silva SR, Vendruscolo M, Camilloni C, Montalvão RW. A superposition free method for protein conformational ensemble analyses and local clustering based on a differential geometry representation of backbone. Proteins 2019; 87:302-312. [PMID: 30582223 DOI: 10.1002/prot.25652] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2018] [Revised: 11/30/2018] [Accepted: 12/19/2018] [Indexed: 01/11/2023]
Abstract
Here a differential geometry (DG) representation of protein backbone is explored on the analyses of protein conformational ensembles. The protein backbone is described by curvature, κ, and torsion, τ, values per residue and we propose 1) a new dissimilarity and protein flexibility measurement and 2) a local conformational clustering method. The methods were applied to Ubiquitin and c-Myb-KIX protein conformational ensembles and results show that κ\τ metric space allows to properly judge protein flexibility by avoiding the superposition problem. The dmax measurement presents equally good or superior results when compared to RMSF, especially for the intrinsically unstructured protein. The clustering method is unique as it relates protein global to local dynamics by providing a global clustering solutions per residue. The methods proposed can be especially useful to the analyses of highly flexible proteins. The software written for the analyses presented here is available at https://github.com/AMarinhoSN/FleXgeo for academic usage only.
Collapse
Affiliation(s)
| | - Samuel Reghim Silva
- São Carlos Institute of Physics, University of São Paulo, São Carlos, Brazil
| | | | - Carlo Camilloni
- Department of Biosciences, University of Milano, Milano, Italy
| | | |
Collapse
|
11
|
Abstract
The native state of proteins is composed of conformers in dynamical equilibrium. In this chapter, different issues related to conformational diversity are explored using a curated and experimentally based database called CoDNaS (Conformational Diversity in the Native State). This database is a collection of redundant structures for the same sequence. CoDNaS estimates the degree of conformational diversity using different global and local structural similarity measures. It allows the user to explore how structural differences among conformers change as a function of several structural features providing further biological information. This chapter explores the measurement of conformational diversity and its relationship with sequence divergence. Also, it discusses how proteins with high conformational diversity could affect homology modeling techniques.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina.
| |
Collapse
|
12
|
How is structural divergence related to evolutionary information? Mol Phylogenet Evol 2018; 127:859-866. [DOI: 10.1016/j.ympev.2018.06.033] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 06/01/2018] [Accepted: 06/19/2018] [Indexed: 12/15/2022]
|
13
|
Afanasyeva A, Bockwoldt M, Cooney CR, Heiland I, Gossmann TI. Human long intrinsically disordered protein regions are frequent targets of positive selection. Genome Res 2018; 28:975-982. [PMID: 29858274 PMCID: PMC6028134 DOI: 10.1101/gr.232645.117] [Citation(s) in RCA: 45] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 06/01/2018] [Indexed: 12/20/2022]
Abstract
Intrinsically disordered regions occur frequently in proteins and are characterized by a lack of a well-defined three-dimensional structure. Although these regions do not show a higher order of structural organization, they are known to be functionally important. Disordered regions are rapidly evolving, largely attributed to relaxed purifying selection and an increased role of genetic drift. It has also been suggested that positive selection might contribute to their rapid diversification. However, for our own species, it is currently unknown whether positive selection has played a role during the evolution of these protein regions. Here, we address this question by investigating the evolutionary pattern of more than 6600 human proteins with intrinsically disordered regions and their ordered counterparts. Our comparative approach with data from more than 90 mammalian genomes uses a priori knowledge of disordered protein regions, and we show that this increases the power to detect positive selection by an order of magnitude. We can confirm that human intrinsically disordered regions evolve more rapidly, not only within humans but also across the entire mammalian phylogeny. They have, however, experienced substantial evolutionary constraint, hinting at their fundamental functional importance. We find compelling evidence that disordered protein regions are frequent targets of positive selection and estimate that the relative rate of adaptive substitutions differs fourfold between disordered and ordered protein regions in humans. Our results suggest that disordered protein regions are important targets of genetic innovation and that the contribution of positive selection in these regions is more pronounced than in other protein parts.
Collapse
Affiliation(s)
- Arina Afanasyeva
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom.,Institute of Nanobiotechnologies, Peter the Great St. Petersburg Polytechnic University, Saint-Petersburg 195251, Russia.,Petersburg Nuclear Physics Institute, B.P. Konstantinov NRC Kurchatov Institute, Gatchina, Leningrad District 188300, Russia.,National Institutes of Biomedical Innovation, Health and Nutrition, Ibaraki City, Osaka 567-0085, Japan
| | - Mathias Bockwoldt
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, 9037 Tromsø, Norway
| | - Christopher R Cooney
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom
| | - Ines Heiland
- Department of Arctic and Marine Biology, UiT The Arctic University of Norway, 9037 Tromsø, Norway
| | - Toni I Gossmann
- Department of Animal and Plant Sciences, University of Sheffield, Sheffield S102TN, United Kingdom
| |
Collapse
|
14
|
Goyal VD, Magliery TJ. Phylogenetic spread of sequence data affects fitness of SOD1 consensus enzymes: Insights from sequence statistics and structural analyses. Proteins 2018; 86:609-620. [PMID: 29490429 DOI: 10.1002/prot.25486] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2017] [Revised: 01/25/2018] [Accepted: 02/24/2018] [Indexed: 12/19/2022]
Abstract
Non-natural protein sequences with native-like structures and functions can be constructed successfully using consensus design. This design strategy is relatively well understood in repeat proteins with simple binding function, however detailed studies are lacking in globular enzymes. The SOD1 family is a good model for such studies due to the availability of large amount of sequence and structure data motivated by involvement of human SOD1 in the fatal motor neuron disease amyotrophic lateral sclerosis (ALS). We constructed two consensus SOD1 enzymes from multiple sequence alignments from all organisms and eukaryotic organisms. A significant difference in their catalytic activities shows that the phylogenetic spread of the sequences used affects the fitness of the construct obtained. A mutation in an electrostatic loop and overall design incompatibilities between bacterial and eukaryotic sequences were implicated in this disparity. Based on this analysis, a bioinformatics approach was used to classify mutations thought to cause familial ALS providing a unique high level view of the physical basis of disease-causing aggregation of human SOD1.
Collapse
Affiliation(s)
- Venuka Durani Goyal
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio, 43210.,Department of Chemistry, Indian Institute of Technology Bombay, Powai, Mumbai, 400076, India
| | - Thomas J Magliery
- Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio, 43210
| |
Collapse
|
15
|
Cooperativity and flexibility in enzyme evolution. Curr Opin Struct Biol 2018; 48:83-92. [DOI: 10.1016/j.sbi.2017.10.020] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2017] [Accepted: 10/24/2017] [Indexed: 11/23/2022]
|
16
|
Biswas K, Acharya D, Podder S, Ghosh TC. Evolutionary rate heterogeneity between multi- and single-interface hubs across human housekeeping and tissue-specific protein interaction network: Insights from proteins' and its partners' properties. Genomics 2017; 110:283-290. [PMID: 29198610 DOI: 10.1016/j.ygeno.2017.11.006] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2017] [Revised: 11/10/2017] [Accepted: 11/29/2017] [Indexed: 12/12/2022]
Abstract
Integrating gene expression into protein-protein interaction network (PPIN) leads to the construction of tissue-specific (TS) and housekeeping (HK) sub-networks, with distinctive TS- and HK-hubs. All such hub proteins are divided into multi-interface (MI) hubs and single-interface (SI) hubs, where MI hubs evolve slower than SI hubs. Here we explored the evolutionary rate difference between MI and SI proteins within TS- and HK-PPIN and observed that this difference is present only in TS, but not in HK-class. Next, we explored whether proteins' own properties or its partners' properties are more influential in such evolutionary discrepancy. Statistical analyses revealed that this evolutionary rate correlates negatively with protein's own properties like expression level, miRNA count, conformational diversity and functional properties and with its partners' properties like protein disorder and tissue expression similarity. Moreover, partial correlation and regression analysis revealed that both proteins' and its partners' properties have independent effects on protein evolutionary rate.
Collapse
Affiliation(s)
- Kakali Biswas
- Bioinformatics Centre, Bose Institute, P-1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
| | - Debarun Acharya
- Bioinformatics Centre, Bose Institute, P-1/12, C.I.T. Scheme VII M, Kolkata 700 054, India
| | - Soumita Podder
- Bioinformatics Centre, Bose Institute, P-1/12, C.I.T. Scheme VII M, Kolkata 700 054, India; Department of Microbiology, Raiganj University, Raiganj, Uttar Dinajpur 733134, India
| | - Tapash Chandra Ghosh
- Bioinformatics Centre, Bose Institute, P-1/12, C.I.T. Scheme VII M, Kolkata 700 054, India.
| |
Collapse
|
17
|
Monzon AM, Zea DJ, Marino-Buslje C, Parisi G. Homology modeling in a dynamical world. Protein Sci 2017; 26:2195-2206. [PMID: 28815769 DOI: 10.1002/pro.3274] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 08/09/2017] [Accepted: 08/09/2017] [Indexed: 12/31/2022]
Abstract
A key concept in template-based modeling (TBM) is the high correlation between sequence and structural divergence, with the practical consequence that homologous proteins that are similar at the sequence level will also be similar at the structural level. However, conformational diversity of the native state will reduce the correlation between structural and sequence divergence, because structural variation can appear without sequence diversity. In this work, we explore the impact that conformational diversity has on the relationship between structural and sequence divergence. We find that the extent of conformational diversity can be as high as the maximum structural divergence among families. Also, as expected, conformational diversity impairs the well-established correlation between sequence and structural divergence, which is nosier than previously suggested. However, we found that this noise can be resolved using a priori information coming from the structure-function relationship. We show that protein families with low conformational diversity show a well-correlated relationship between sequence and structural divergence, which is severely reduced in proteins with larger conformational diversity. This lack of correlation could impair TBM results in highly dynamical proteins. Finally, we also find that the presence of order/disorder can provide useful beforehand information for better TBM performance.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Cristina Marino-Buslje
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| |
Collapse
|
18
|
Sharir-Ivry A, Xia Y. The Impact of Native State Switching on Protein Sequence Evolution. Mol Biol Evol 2017; 34:1378-1390. [DOI: 10.1093/molbev/msx071] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
|
19
|
Pabis A, Duarte F, Kamerlin SCL. Promiscuity in the Enzymatic Catalysis of Phosphate and Sulfate Transfer. Biochemistry 2016; 55:3061-81. [PMID: 27187273 PMCID: PMC4899807 DOI: 10.1021/acs.biochem.6b00297] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022]
Abstract
![]()
The
enzymes that facilitate phosphate and sulfate hydrolysis are
among the most proficient natural catalysts known to date. Interestingly,
a large number of these enzymes are promiscuous catalysts that exhibit
both phosphatase and sulfatase activities in the same active site
and, on top of that, have also been demonstrated to efficiently catalyze
the hydrolysis of other additional substrates with varying degrees
of efficiency. Understanding the factors that underlie such multifunctionality
is crucial both for understanding functional evolution in enzyme superfamilies
and for the development of artificial enzymes. In this Current Topic,
we have primarily focused on the structural and mechanistic basis
for catalytic promiscuity among enzymes that facilitate both phosphoryl
and sulfuryl transfer in the same active site, while comparing this
to how catalytic promiscuity manifests in other promiscuous phosphatases.
We have also drawn on the large number of experimental and computational
studies of selected model systems in the literature to explore the
different features driving the catalytic promiscuity of such enzymes.
Finally, on the basis of this comparative analysis, we probe the plausible
origins and determinants of catalytic promiscuity in enzymes that
catalyze phosphoryl and sulfuryl transfer.
Collapse
Affiliation(s)
- Anna Pabis
- Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University , BMC Box 596, S-751 24 Uppsala, Sweden
| | - Fernanda Duarte
- Chemistry Research Laboratory, University of Oxford , 12 Mansfield Road, Oxford OX1 3TA, U.K.,Physical and Theoretical Chemistry Laboratory, University of Oxford , South Parks Road, Oxford OX1 3QZ, U.K
| | - Shina C L Kamerlin
- Department of Cell and Molecular Biology, Science for Life Laboratory, Uppsala University , BMC Box 596, S-751 24 Uppsala, Sweden
| |
Collapse
|
20
|
Addressing the Role of Conformational Diversity in Protein Structure Prediction. PLoS One 2016; 11:e0154923. [PMID: 27159429 PMCID: PMC4861349 DOI: 10.1371/journal.pone.0154923] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 04/21/2016] [Indexed: 11/19/2022] Open
Abstract
Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis.
Collapse
|
21
|
Monzon AM, Rohr CO, Fornasari MS, Parisi G. CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw038. [PMID: 27022160 PMCID: PMC4809262 DOI: 10.1093/database/baw038] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2015] [Accepted: 03/02/2016] [Indexed: 01/01/2023]
Abstract
CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is experimentally driven, thoroughly linked, and annotated. CoDNaS facilitates the extraction of key information on small structural differences based on protein movements. CoDNaS enables users to easily relate the degree of conformational diversity with physical, chemical and biological properties derived from experiments on protein structure and biological characteristics. The new version of CoDNaS includes ∼70% of all available protein structures, and new tools have been added that run sequence searches, display structural flexibility profiles and allow users to browse the database for different structural classes. These tools facilitate the exploration of protein conformational diversity and its role in protein function. Database URL:http://ufq.unq.edu.ar/codnas
Collapse
Affiliation(s)
| | - Cristian Oscar Rohr
- Instituto de Ecología Genética y Evolución de Buenos Aires (IEGEBA)-Laboratorio de Genómica Médica y Evolución, Universidad Nacional de Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
| |
Collapse
|
22
|
Saldaño TE, Monzon AM, Parisi G, Fernandez-Alberti S. Evolutionary Conserved Positions Define Protein Conformational Diversity. PLoS Comput Biol 2016; 12:e1004775. [PMID: 27008419 PMCID: PMC4805271 DOI: 10.1371/journal.pcbi.1004775] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 01/27/2016] [Indexed: 12/18/2022] Open
Abstract
Conformational diversity of the native state plays a central role in modulating protein function. The selection paradigm sustains that different ligands shift the conformational equilibrium through their binding to highest-affinity conformers. Intramolecular vibrational dynamics associated to each conformation should guarantee conformational transitions, which due to its importance, could possibly be associated with evolutionary conserved traits. Normal mode analysis, based on a coarse-grained model of the protein, can provide the required information to explore these features. Herein, we present a novel procedure to identify key positions sustaining the conformational diversity associated to ligand binding. The method is applied to an adequate refined dataset of 188 paired protein structures in their bound and unbound forms. Firstly, normal modes most involved in the conformational change are selected according to their corresponding overlap with structural distortions introduced by ligand binding. The subspace defined by these modes is used to analyze the effect of simulated point mutations on preserving the conformational diversity of the protein. We find a negative correlation between the effects of mutations on these normal mode subspaces associated to ligand-binding and position-specific evolutionary conservations obtained from multiple sequence-structure alignments. Positions whose mutations are found to alter the most these subspaces are defined as key positions, that is, dynamically important residues that mediate the ligand-binding conformational change. These positions are shown to be evolutionary conserved, mostly buried aliphatic residues localized in regular structural regions of the protein like β-sheets and α-helix.
Collapse
|
23
|
Abstract
The rate and mechanism of protein sequence evolution have been central questions in evolutionary biology since the 1960s. Although the rate of protein sequence evolution depends primarily on the level of functional constraint, exactly what determines functional constraint has remained unclear. The increasing availability of genomic data has enabled much needed empirical examinations on the nature of functional constraint. These studies found that the evolutionary rate of a protein is predominantly influenced by its expression level rather than functional importance. A combination of theoretical and empirical analyses has identified multiple mechanisms behind these observations and demonstrated a prominent role in protein evolution of selection against errors in molecular and cellular processes.
Collapse
Affiliation(s)
- Jianzhi Zhang
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
| | - Jian-Rong Yang
- Department of Ecology and Evolutionary Biology, University of Michigan, 830 North University Avenue, Ann Arbor, Michigan 48109, USA
| |
Collapse
|
24
|
Parisi G, Zea DJ, Monzon AM, Marino-Buslje C. Conformational diversity and the emergence of sequence signatures during evolution. Curr Opin Struct Biol 2015; 32:58-65. [DOI: 10.1016/j.sbi.2015.02.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 02/02/2015] [Accepted: 02/09/2015] [Indexed: 02/03/2023]
|
25
|
Sikosek T, Chan HS. Biophysics of protein evolution and evolutionary protein biophysics. J R Soc Interface 2015; 11:20140419. [PMID: 25165599 DOI: 10.1098/rsif.2014.0419] [Citation(s) in RCA: 163] [Impact Index Per Article: 16.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
Affiliation(s)
- Tobias Sikosek
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| | - Hue Sun Chan
- Department of Biochemistry, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Molecular Genetics, University of Toronto, Toronto, Ontario, Canada M5S 1A8 Department of Physics, University of Toronto, Toronto, Ontario, Canada M5S 1A8
| |
Collapse
|
26
|
Marsh JA, Teichmann SA. Parallel dynamics and evolution: Protein conformational fluctuations and assembly reflect evolutionary changes in sequence and structure. Bioessays 2013; 36:209-18. [DOI: 10.1002/bies.201300134] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Affiliation(s)
- Joseph A. Marsh
- European Molecular Biology Laboratory; European Bioinformatics Institute; Wellcome Trust Genome Campus, Hinxton Cambridge UK
| | - Sarah A. Teichmann
- European Molecular Biology Laboratory; European Bioinformatics Institute; Wellcome Trust Genome Campus, Hinxton Cambridge UK
- Wellcome Trust Sanger Institute; Wellcome Trust Genome Campus; Hinxton Cambridge UK
| |
Collapse
|
27
|
Arenas M, Dos Santos HG, Posada D, Bastolla U. Protein evolution along phylogenetic histories under structurally constrained substitution models. ACTA ACUST UNITED AC 2013; 29:3020-8. [PMID: 24037213 DOI: 10.1093/bioinformatics/btt530] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022]
Abstract
MOTIVATION Models of molecular evolution aim at describing the evolutionary processes at the molecular level. However, current models rarely incorporate information from protein structure. Conversely, structure-based models of protein evolution have not been commonly applied to simulate sequence evolution in a phylogenetic framework, and they often ignore relevant evolutionary processes such as recombination. A simulation evolutionary framework that integrates substitution models that account for protein structure stability should be able to generate more realistic in silico evolved proteins for a variety of purposes. RESULTS We developed a method to simulate protein evolution that combines models of protein folding stability, such that the fitness depends on the stability of the native state both with respect to unfolding and misfolding, with phylogenetic histories that can be either specified by the user or simulated with the coalescent under complex evolutionary scenarios, including recombination, demographics and migration. We have implemented this framework in a computer program called ProteinEvolver. Remarkably, comparing these models with empirical amino acid replacement models, we found that the former produce amino acid distributions closer to distributions observed in real protein families, and proteins that are predicted to be more stable. Therefore, we conclude that evolutionary models that consider protein stability and realistic evolutionary histories constitute a better approximation of the real evolutionary process.
Collapse
Affiliation(s)
- Miguel Arenas
- Centre for Molecular Biology 'Severo Ochoa', Consejo Superior de Investigaciones Científicas (CSIC), Madrid, Spain and Department of Biochemistry, Genetics and Immunology, University of Vigo, Vigo, Spain
| | | | | | | |
Collapse
|
28
|
Palopoli N, Lanzarotti E, Parisi G. BeEP Server: Using evolutionary information for quality assessment of protein structure models. Nucleic Acids Res 2013; 41:W398-405. [PMID: 23729471 PMCID: PMC3692104 DOI: 10.1093/nar/gkt453] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
The BeEP Server (http://www.embnet.qb.fcen.uba.ar/embnet/beep.php) is an online resource aimed to help in the endgame of protein structure prediction. It is able to rank submitted structural models of a protein through an explicit use of evolutionary information, a criterion differing from structural or energetic considerations commonly used in other assessment programs. The idea behind BeEP (Best Evolutionary Pattern) is to benefit from the substitution pattern derived from structural constraints present in a set of homologous proteins adopting a given protein conformation. The BeEP method uses a model of protein evolution that takes into account the structure of a protein to build site-specific substitution matrices. The suitability of these substitution matrices is assessed through maximum likelihood calculations from which position-specific and global scores can be derived. These scores estimate how well the structural constraints derived from each structural model are represented in a sequence alignment of homologous proteins. Our assessment on a subset of proteins from the Critical Assessment of techniques for protein Structure Prediction (CASP) experiment has shown that BeEP is capable of discriminating the models and selecting one or more native-like structures. Moreover, BeEP is not explicitly parameterized to find structural similarities between models and given targets, potentially helping to explore the conformational ensemble of the native state.
Collapse
Affiliation(s)
- Nicolas Palopoli
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
| | - Esteban Lanzarotti
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes, B1876BXD, Bernal, Buenos Aires, Argentina, Centre for Biological Sciences, University of Southampton, SO17 1BJ, Southampton, UK and Departamento de Quimica Biologica, Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, C1428EHA, Buenos Aires, Argentina
- *To whom correspondence should be addressed. Tel: +54 011 43657100 (ext. 4135); Fax: +54 011 437657101;
| |
Collapse
|