1
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
2
|
Shi XX, Wang ZZ, Wang YL, Huang GY, Yang JF, Wang F, Hao GF, Yang GF. PTMdyna: exploring the influence of post-translation modifications on protein conformational dynamics. Brief Bioinform 2021; 23:6394992. [PMID: 34643234 DOI: 10.1093/bib/bbab424] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 09/02/2021] [Accepted: 09/14/2021] [Indexed: 11/14/2022] Open
Abstract
Protein post-translational modifications (PTM) play vital roles in cellular regulation, modulating functions by driving changes in protein structure and dynamics. Exploring comprehensively the influence of PTM on conformational dynamics can facilitate the understanding of the related biological function and molecular mechanism. Currently, a series of excellent computation tools have been designed to analyze the time-dependent structural properties of proteins. However, the protocol aimed to explore conformational dynamics of post-translational modified protein is still a blank. To fill this gap, we present PTMdyna to visually predict the conformational dynamics differences between unmodified and modified proteins, thus indicating the influence of specific PTM. PTMdyna exhibits an AUC of 0.884 tested on 220 protein-protein complex structures. The case of heterochromatin protein 1α complexed with lysine 9-methylated histone H3, which is critical for genomic stability and cell differentiation, was used to demonstrate its applicability. PTMdyna provides a reliable platform to predict the influence of PTM on protein dynamics, making it easier to interpret PTM functionality at the structure level. The web server is freely available at http://ccbportal.com/PTMdyna.
Collapse
Affiliation(s)
- Xing-Xing Shi
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Zhi-Zheng Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Yu-Liang Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Guang-Yi Huang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Jing-Fang Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Fan Wang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| | - Ge-Fei Hao
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China.,State Key Laboratory Breeding Base of Green Pesticide and Agricultural Bioengineering, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Research and Development Center for Fine Chemicals, Guizhou University, Guiyang, Guizhou, P. R. China
| | - Guang-Fu Yang
- Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, College of Chemistry, Central China Normal University, Wuhan, Hubei, P. R. China.,International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan, Hubei, P. R. China
| |
Collapse
|
3
|
Masrati G, Landau M, Ben-Tal N, Lupas A, Kosloff M, Kosinski J. Integrative Structural Biology in the Era of Accurate Structure Prediction. J Mol Biol 2021; 433:167127. [PMID: 34224746 DOI: 10.1016/j.jmb.2021.167127] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2021] [Revised: 06/28/2021] [Accepted: 06/28/2021] [Indexed: 11/16/2022]
Abstract
Characterizing the three-dimensional structure of macromolecules is central to understanding their function. Traditionally, structures of proteins and their complexes have been determined using experimental techniques such as X-ray crystallography, NMR, or cryo-electron microscopy-applied individually or in an integrative manner. Meanwhile, however, computational methods for protein structure prediction have been improving their accuracy, gradually, then suddenly, with the breakthrough advance by AlphaFold2, whose models of monomeric proteins are often as accurate as experimental structures. This breakthrough foreshadows a new era of computational methods that can build accurate models for most monomeric proteins. Here, we envision how such accurate modeling methods can combine with experimental structural biology techniques, enhancing integrative structural biology. We highlight the challenges that arise when considering multiple structural conformations, protein complexes, and polymorphic assemblies. These challenges will motivate further developments, both in modeling programs and in methods to solve experimental structures, towards better and quicker investigation of structure-function relationships.
Collapse
Affiliation(s)
- Gal Masrati
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Meytal Landau
- Department of Biology, Technion-Israel Institute of Technology, Haifa 3200003, Israel; European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany
| | - Nir Ben-Tal
- Department of Biochemistry and Molecular Biology, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv 6997801, Israel
| | - Andrei Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.
| | - Mickey Kosloff
- Department of Human Biology, Faculty of Natural Sciences, University of Haifa, 199 Aba Khoushy Ave., Mt. Carmel, 3498838 Haifa, Israel.
| | - Jan Kosinski
- European Molecular Biology Laboratory (EMBL), Hamburg 22607, Germany; Centre for Structural Systems Biology (CSSB), Hamburg 22607, Germany; Structural and Computational Biology Unit, European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg, Germany.
| |
Collapse
|
4
|
Triki D, Kermarrec M, Visseaux B, Descamps D, Flatters D, Camproux AC, Regad L. Exploration of the effects of sequence variations between HIV-1 and HIV-2 proteases on their three-dimensional structures. J Biomol Struct Dyn 2019; 38:5014-5026. [PMID: 31830870 DOI: 10.1080/07391102.2019.1704877] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
HIV protease inhibitors (PIs) approved by the FDA (US Food and Drug Administration) are a major class of antiretroviral. HIV-2 protease (PR2) is naturally resistant to most of them as PIs were designed for HIV-1 protease (PR1). In this study, we explored the impact of amino-acid substitutions between PR1 and PR2 on the structure of protease (PR) by comparing the structural variability of 13 regions using 24 PR1 and PR2 structures complexed with diverse ligands. Our analyses confirmed structural rigidity of the catalytic region and highlighted the important role of three regions in the conservation of the catalytic region conformation. Surprisingly, we showed that the flap region, corresponding to a flexible region, exhibits similar conformations in PR1 and PR2. Furthermore, we identified regions exhibiting different conformations in PR1 and PR2, which could be explained by the intrinsic flexibility of these regions, by crystal packing, or by PR1 and PR2 substitutions. Some substitutions induce structural changes in the R2 and R4 regions that could have an impact on the properties of PI-binding site and could thus modify PI binding mode. Substitutions involved in structural changes in the elbow region could alter the flexibility of the PR2 flap regions relative to PR1, and thus play a role in the transition from the semi-open form to the closed form, and have an impact on ligand binding. These results improve the understanding of the impact of sequence variations between PR1 and PR2 on the natural resistance of HIV-2 to commercially available PIs.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Dhoha Triki
- Department of Chemistry, Bioinformatics Research Center, College of Science, North Carolina State University, Raleigh, North Carolina, USA.,Université de Paris, BFA, UMR 8251, CNRS, ERL U1133, Inserm, Paris, France
| | - Maxime Kermarrec
- Université de Paris, BFA, UMR 8251, CNRS, ERL U1133, Inserm, Paris, France
| | - Benoît Visseaux
- Université de Paris, IAME, UMR 1137, INSERM, AP-HP, Laboratoire de Virologie, Hôpital Bichat-Claude Bernard, Paris, France
| | - Diane Descamps
- Université de Paris, IAME, UMR 1137, INSERM, AP-HP, Laboratoire de Virologie, Hôpital Bichat-Claude Bernard, Paris, France
| | - Delphine Flatters
- Université de Paris, BFA, UMR 8251, CNRS, ERL U1133, Inserm, Paris, France
| | | | - Leslie Regad
- Université de Paris, BFA, UMR 8251, CNRS, ERL U1133, Inserm, Paris, France
| |
Collapse
|
5
|
De Las Rivas J, Bonavides-Martínez C, Campos-Laborie FJ. Bioinformatics in Latin America and SoIBio impact, a tale of spin-off and expansion around genomes and protein structures. Brief Bioinform 2019; 20:390-397. [PMID: 28981567 PMCID: PMC6433739 DOI: 10.1093/bib/bbx064] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2017] [Revised: 04/18/2017] [Indexed: 11/30/2022] Open
Abstract
Owing to the emerging impact of bioinformatics and computational biology, in this article, we present an overview of the history and current state of the research on this field in Latin America (LA). It will be difficult to cover without inequality all the efforts, initiatives and works that have happened for the past two decades in this vast region (that includes >19 million km2 and >600 million people). Despite the difficulty, we have done an analytical search looking for publications in the field made by researchers from 19 LA countries in the past 25 years. In this way, we find that research in bioinformatics in this region should develop twice to approach the average world scientific production in the field. We also found some of the pioneering scientists who initiated and led bioinformatics in the region and were promoters of this new scientific field. Our analysis also reveals that spin-off began around some specific areas within the biomolecular sciences: studies on genomes (anchored in the new generation of deep sequencing technologies, followed by developments in proteomics) and studies on protein structures (supported by three-dimensional structural determination technologies and their computational advancement). Finally, we show that the contribution to this endeavour of the Iberoamerican Society for Bioinformatics, founded in Mexico in 2009, has been significant, as it is a leading forum to join efforts of many scientists from LA interested in promoting research, training and education in bioinformatics.
Collapse
Affiliation(s)
- Javier De Las Rivas
- CSIC and Universidad de Salamanca, Bioinformatics and Functional Genomics Group, Cancer Research Center (IMBCC, CSIC/USAL/IBSAL), Salamanca, Spain
- Corresponding author. Javier De Las Rivas, Bioinformatics and Functional Genomics Group, Cancer Research Center (IMBCC, CSIC/USAL/IBSAL), Consejo Superior de Investigaciones Científicas (CSIC) and Universidad de Salamanca (USAL), Campus Miguel de Unamuno s/n, Salamanca 37007, Spain. Tel.: +34 923294819; Fax: +34923294743; E-mail:
| | - Cesar Bonavides-Martínez
- Universidad Nacional Autonoma de Mexico, Computational Genomics, Centro de Ciencias Genómicas, Cuernavaca, Morelos, Mexico
| | - Francisco Jose Campos-Laborie
- CSIC and Universidad de Salamanca, Bioinformatics and Functional Genomics Group, Cancer Research Center (IMBCC, CSIC/USAL/IBSAL), Salamanca, Spain
| |
Collapse
|
6
|
Saldaño TE, Tosatto SCE, Parisi G, Fernandez-Alberti S. Network analysis of dynamically important residues in protein structures mediating ligand-binding conformational changes. EUROPEAN BIOPHYSICS JOURNAL: EBJ 2019; 48:559-568. [PMID: 31273390 DOI: 10.1007/s00249-019-01384-1] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2019] [Revised: 05/31/2019] [Accepted: 07/01/2019] [Indexed: 11/26/2022]
Abstract
According to the generalized conformational selection model, ligand binding involves the co-existence of at least two conformers with different ligand-affinities in a dynamical equilibrium. Conformational transitions between them should be guaranteed by intramolecular vibrational dynamics associated to each conformation. These motions are, therefore, related to the biological function of a protein. Positions whose mutations are found to alter these vibrations the most can be defined as key positions, that is, dynamically important residues that mediate the ligand-binding conformational change. In a previous study, we have shown that these positions are evolutionarily conserved. They correspond to buried aliphatic residues mostly localized in regular structured regions of the protein like β-sheets and α-helices. In the present paper, we perform a network analysis of these key positions for a large dataset of paired protein structures in the ligand-free and ligand-bound form. We observe that networks of interactions between these key positions present larger and more integrated networks with faster transmission of the information. Besides, networks of residues result that are robust to conformational changes. Our results reveal that the conformational diversity of proteins seems to be guaranteed by a network of strongly interconnected key positions rather than individual residues.
Collapse
Affiliation(s)
- Tadeo E Saldaño
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 5131, Padua, Italy
| | - Gustavo Parisi
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | | |
Collapse
|
7
|
Abstract
The native state of proteins is composed of conformers in dynamical equilibrium. In this chapter, different issues related to conformational diversity are explored using a curated and experimentally based database called CoDNaS (Conformational Diversity in the Native State). This database is a collection of redundant structures for the same sequence. CoDNaS estimates the degree of conformational diversity using different global and local structural similarity measures. It allows the user to explore how structural differences among conformers change as a function of several structural features providing further biological information. This chapter explores the measurement of conformational diversity and its relationship with sequence divergence. Also, it discusses how proteins with high conformational diversity could affect homology modeling techniques.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina.
| |
Collapse
|
8
|
Marks C, Shi J, Deane CM. Predicting loop conformational ensembles. Bioinformatics 2018; 34:949-956. [PMID: 29136084 DOI: 10.1093/bioinformatics/btx718] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Accepted: 11/09/2017] [Indexed: 12/23/2022] Open
Abstract
Motivation Protein function is often facilitated by the existence of multiple stable conformations. Structure prediction algorithms need to be able to model these different conformations accurately and produce an ensemble of structures that represent a target's conformational diversity rather than just a single state. Here, we investigate whether current loop prediction algorithms are capable of this. We use the algorithms to predict the structures of loops with multiple experimentally determined conformations, and the structures of loops with only one conformation, and assess their ability to generate and select decoys that are close to any, or all, of the observed structures. Results We find that while loops with only one known conformation are predicted well, conformationally diverse loops are modelled poorly, and in most cases the predictions returned by the methods do not resemble any of the known conformers. Our results contradict the often-held assumption that multiple native conformations will be present in the decoy set, making the production of accurate conformational ensembles impossible, and hence indicating that current methodologies are not well suited to prediction of conformationally diverse, often functionally important protein regions. Contact marks@stats.ox.ac.uk. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Claire Marks
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Jiye Shi
- Department of Chemistry, UCB Pharma, Slough SL1 3WE, UK
| | | |
Collapse
|
9
|
Barletta GP, Hasenahuer MA, Fornasari MS, Parisi G, Fernandez-Alberti S. Dynamics fingerprints of active conformers of epidermal growth factor receptor kinase. J Comput Chem 2018; 39:2472-2480. [PMID: 30298935 DOI: 10.1002/jcc.25590] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2018] [Revised: 08/06/2018] [Accepted: 08/19/2018] [Indexed: 12/29/2022]
Abstract
Epidermal growth factor receptor (EGFR) is a prototypical cell-surface receptor that plays a key role in the regulation of cellular signaling, proliferation and differentiation. Mutations of its kinase domain have been associated with the development of a variety of cancers and, therefore, it has been the target of drug design. Single amino acid substitutions (SASs) in this domain have been proven to alter the equilibrium of pre-existing conformer populations. Despite the advances in structural descriptions of its so-called active and inactive conformations, the associated dynamics aspects that characterize them have not been thoroughly studied yet. As the dynamic behaviors and molecular motions of proteins are important for a complete understanding of their structure-function relationships we present a novel procedure, using (or based on) normal mode analysis, to identify the collective dynamics shared among different conformers in EGFR kinase. The method allows the comparison of patterns of low-frequency vibrational modes defining representative directions of motions. Our procedure is able to emphasize the main similarities and differences between the collective dynamics of different conformers. In the case of EGFR kinase, two representative directions of motions have been found as dynamics fingerprints of the active conformers. Protein motion along both directions reveals to have a significant impact on the cavity volume of the main pocket of the active site. Otherwise, the inactive conformers exhibit a more heterogeneous distribution of collective motions. © 2018 Wiley Periodicals, Inc.
Collapse
Affiliation(s)
- German P Barletta
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Marcia Anahi Hasenahuer
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| | - Sebastian Fernandez-Alberti
- Departamento de Ciencia y Tecnologia, Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD, Bernal, Argentina
| |
Collapse
|
10
|
Hasenahuer MA, Barletta GP, Fernandez-Alberti S, Parisi G, Fornasari MS. Pockets as structural descriptors of EGFR kinase conformations. PLoS One 2017; 12:e0189147. [PMID: 29228029 PMCID: PMC5724837 DOI: 10.1371/journal.pone.0189147] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Accepted: 11/20/2017] [Indexed: 12/19/2022] Open
Abstract
Epidermal Growth Factor Receptor (EGFR), a tyrosine kinase receptor, is one of the main tumor markers in different types of cancers. The kinase native state is mainly composed of two populations of conformers: active and inactive. Several sequence variations in EGFR kinase region promote the differential enrichment of conformers with higher activity. Some structural characteristics have been proposed to differentiate kinase conformations, but these considerations could lead to ambiguous classifications. We present a structural characterisation of EGFR kinase conformers, focused on active site pocket comparisons, and the mapping of known pathological sequence variations. A structural based clustering of this pocket accurately discriminates active from inactive, well-characterised conformations. Furthermore, this main pocket contains, or is in close contact with, ≈65% of cancer-related variation positions. Although the relevance of protein dynamics to explain biological function has been extensively recognised, the usage of the ensemble of conformations in dynamic equilibrium to represent the functional state of proteins and the importance of pockets, cavities and/or tunnels was often neglected in previous studies. These functional structures and the equilibrium between them could be structurally analysed in wild type as well as in sequence variants. Our results indicate that biologically important pockets, as well as their shape and dynamics, are central to understanding protein function in wild-type, polymorphic or disease-related variations.
Collapse
Affiliation(s)
- Marcia Anahi Hasenahuer
- Departamento de Ciencia Y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
| | - German Patricio Barletta
- Departamento de Ciencia Y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia Y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia Y Tecnología, Universidad Nacional de Quilmes, Bernal, Buenos Aires, Argentina
| |
Collapse
|
11
|
Abriata LA. Structural database resources for biological macromolecules. Brief Bioinform 2017; 18:659-669. [PMID: 27273290 DOI: 10.1093/bib/bbw049] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/11/2016] [Indexed: 12/30/2022] Open
Abstract
This Briefing reviews the widely used, currently active, up-to-date databases derived from the worldwide Protein Data Bank (PDB) to facilitate browsing, finding and exploring its entries. These databases contain visualization and analysis tools tailored to specific kinds of molecules and interactions, often including also complex metrics precomputed by experts or external programs, and connections to sequence and functional annotation databases. Importantly, updates of most of these databases involves steps of curation and error checks based on specific expertise about the subject molecules or interactions, and removal of sequence redundancy, both leading to better data sets for mining studies compared with the full list of raw PDB entries. The article presents the databases in groups such as those aimed to facilitate browsing through PDB entries, their molecules and their general information, those built to link protein structure with sequence and dynamics, those specific for transmembrane proteins, nucleic acids, interactions of biomacromolecules with each other and with small molecules or metal ions, and those concerning specific structural features or specific protein families. A few webservers directly connected to active databases, and a few databases that have been discontinued but would be important to have back, are also briefly commented on. Along the Briefing, sample cases where these databases have been used to aid structural studies or advance our knowledge about biological macromolecules are referenced. A few specific examples are also given where using these databases is easier and more informative than using raw PDB data.
Collapse
|
12
|
Monzon AM, Zea DJ, Marino-Buslje C, Parisi G. Homology modeling in a dynamical world. Protein Sci 2017; 26:2195-2206. [PMID: 28815769 DOI: 10.1002/pro.3274] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 08/09/2017] [Accepted: 08/09/2017] [Indexed: 12/31/2022]
Abstract
A key concept in template-based modeling (TBM) is the high correlation between sequence and structural divergence, with the practical consequence that homologous proteins that are similar at the sequence level will also be similar at the structural level. However, conformational diversity of the native state will reduce the correlation between structural and sequence divergence, because structural variation can appear without sequence diversity. In this work, we explore the impact that conformational diversity has on the relationship between structural and sequence divergence. We find that the extent of conformational diversity can be as high as the maximum structural divergence among families. Also, as expected, conformational diversity impairs the well-established correlation between sequence and structural divergence, which is nosier than previously suggested. However, we found that this noise can be resolved using a priori information coming from the structure-function relationship. We show that protein families with low conformational diversity show a well-correlated relationship between sequence and structural divergence, which is severely reduced in proteins with larger conformational diversity. This lack of correlation could impair TBM results in highly dynamical proteins. Finally, we also find that the presence of order/disorder can provide useful beforehand information for better TBM performance.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Cristina Marino-Buslje
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| |
Collapse
|
13
|
Regad L, Chéron JB, Triki D, Senac C, Flatters D, Camproux AC. Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight. PLoS One 2017; 12:e0182972. [PMID: 28817602 PMCID: PMC5560695 DOI: 10.1371/journal.pone.0182972] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/27/2017] [Indexed: 11/18/2022] Open
Abstract
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.
Collapse
Affiliation(s)
- Leslie Regad
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| | - Jean-Baptiste Chéron
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Institut de Chimie de Nice, UMR-CNRS 7272, Faculté des Sciences, Université de Nice-Sophia Antipolis, Nice, France
| | - Dhoha Triki
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Caroline Senac
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, INSERM, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Delphine Flatters
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Anne-Claude Camproux
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| |
Collapse
|
14
|
Mahajan S, Sanejouand YH. Jumping between protein conformers using normal modes. J Comput Chem 2017; 38:1622-1630. [PMID: 28470912 DOI: 10.1002/jcc.24803] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2016] [Revised: 02/03/2017] [Accepted: 03/19/2017] [Indexed: 12/27/2022]
Abstract
The relationship between the normal modes of a protein and its functional conformational change has been studied for decades. However, using this relationship in a predictive context remains a challenge. In this work, we demonstrate that, starting from a given protein conformer, it is possible to generate in a single step model conformers that are less than 1 Å (Cα -RMSD) from the conformer which is the known endpoint of the conformational change, particularly when the conformational change is collective in nature. Such accurate model conformers can be generated by following either the so-called robust or the 50 lowest-frequency modes obtained with various Elastic Network Models (ENMs). Interestingly, the quality of many of these models compares well with actual crystal structures, as assessed by the ROSETTA scoring function and PROCHECK. The most accurate and best quality conformers obtained in the present study were generated by using the 50 lowest-frequency modes of an all-atom ENM. However, with less than ten robust modes, which are identified without any prior knowledge of the nature of the conformational change, nearly 90% of the motion described by the 50 lowest-frequency modes of a protein can be captured. Such results strongly suggest that exploring the robust modes of ENMs may prove efficient for sampling the functionally relevant conformational repertoire of many proteins. © 2017 Wiley Periodicals, Inc.
Collapse
|
15
|
Monzon AM, Zea DJ, Fornasari MS, Saldaño TE, Fernandez-Alberti S, Tosatto SCE, Parisi G. Conformational diversity analysis reveals three functional mechanisms in proteins. PLoS Comput Biol 2017; 13:e1005398. [PMID: 28192432 PMCID: PMC5330503 DOI: 10.1371/journal.pcbi.1005398] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2016] [Revised: 02/28/2017] [Accepted: 02/02/2017] [Indexed: 02/02/2023] Open
Abstract
Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call "rigid" (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Diego Javier Zea
- Bioinformatics Unit, Fundación Instituto Leloir (CONICET), Buenos Aires, Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Tadeo E. Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Sebastian Fernandez-Alberti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| |
Collapse
|
16
|
Addressing the Role of Conformational Diversity in Protein Structure Prediction. PLoS One 2016; 11:e0154923. [PMID: 27159429 PMCID: PMC4861349 DOI: 10.1371/journal.pone.0154923] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 04/21/2016] [Indexed: 11/19/2022] Open
Abstract
Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis.
Collapse
|
17
|
Zea DJ, Monzon AM, Gonzalez C, Fornasari MS, Tosatto SCE, Parisi G. Disorder transitions and conformational diversity cooperatively modulate biological function in proteins. Protein Sci 2016; 25:1138-46. [PMID: 27038125 DOI: 10.1002/pro.2931] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2016] [Revised: 03/30/2016] [Accepted: 03/31/2016] [Indexed: 12/20/2022]
Abstract
Structural differences between conformers sustain protein biological function. Here, we studied in a large dataset of 745 intrinsically disordered proteins, how ordered-disordered transitions modulate structural differences between conformers as derived from crystallographic data. We found that almost 50% of the proteins studied show no transitions and have low conformational diversity while the rest show transitions and a higher conformational diversity. In this last subset, 60% of the proteins become more ordered after ligand binding, while 40% more disordered. As protein conformational diversity is inherently connected with protein function our analysis suggests differences in structure-function relationships related to order-disorder transitions.
Collapse
Affiliation(s)
- Diego Javier Zea
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Alexander Miguel Monzon
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Claudia Gonzalez
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - María Silvina Fornasari
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| | - Silvio C E Tosatto
- Biocomputing up, Department of Biomedical Sciences, University of Padova, Italy
| | - Gustavo Parisi
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, Argentina
| |
Collapse
|
18
|
Monzon AM, Rohr CO, Fornasari MS, Parisi G. CoDNaS 2.0: a comprehensive database of protein conformational diversity in the native state. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw038. [PMID: 27022160 PMCID: PMC4809262 DOI: 10.1093/database/baw038] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/19/2015] [Accepted: 03/02/2016] [Indexed: 01/01/2023]
Abstract
CoDNaS (conformational diversity of the native state) is a protein conformational diversity database. Conformational diversity describes structural differences between conformers that define the native state of proteins. It is a key concept to understand protein function and biological processes related to protein functions. CoDNaS offers a well curated database that is experimentally driven, thoroughly linked, and annotated. CoDNaS facilitates the extraction of key information on small structural differences based on protein movements. CoDNaS enables users to easily relate the degree of conformational diversity with physical, chemical and biological properties derived from experiments on protein structure and biological characteristics. The new version of CoDNaS includes ∼70% of all available protein structures, and new tools have been added that run sequence searches, display structural flexibility profiles and allow users to browse the database for different structural classes. These tools facilitate the exploration of protein conformational diversity and its role in protein function. Database URL:http://ufq.unq.edu.ar/codnas
Collapse
Affiliation(s)
| | - Cristian Oscar Rohr
- Instituto de Ecología Genética y Evolución de Buenos Aires (IEGEBA)-Laboratorio de Genómica Médica y Evolución, Universidad Nacional de Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
| |
Collapse
|
19
|
Chang CW, Chou CW, Chang DTH. CCProf: exploring conformational change profile of proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2016; 2016:baw029. [PMID: 27016699 PMCID: PMC4808249 DOI: 10.1093/database/baw029] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2015] [Accepted: 02/23/2016] [Indexed: 12/18/2022]
Abstract
In many biological processes, proteins have important interactions with various molecules such as proteins, ions or ligands. Many proteins undergo conformational changes upon these interactions, where regions with large conformational changes are critical to the interactions. This work presents the CCProf platform, which provides conformational changes of entire proteins, named conformational change profile (CCP) in the context. CCProf aims to be a platform where users can study potential causes of novel conformational changes. It provides 10 biological features, including conformational change, potential binding target site, secondary structure, conservation, disorder propensity, hydropathy propensity, sequence domain, structural domain, phosphorylation site and catalytic site. All these information are integrated into a well-aligned view, so that researchers can capture important relevance between different biological features visually. The CCProf contains 986 187 protein structure pairs for 3123 proteins. In addition, CCProf provides a 3D view in which users can see the protein structures before and after conformational changes as well as binding targets that induce conformational changes. All information (e.g. CCP, binding targets and protein structures) shown in CCProf, including intermediate data are available for download to expedite further analyses. Database URL: http://zoro.ee.ncku.edu.tw/ccprof/
Collapse
Affiliation(s)
- Che-Wei Chang
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Chai-Wei Chou
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| | - Darby Tien-Hao Chang
- Department of Electrical Engineering, National Cheng Kung University, Tainan, 70101, Taiwan
| |
Collapse
|
20
|
Saldaño TE, Monzon AM, Parisi G, Fernandez-Alberti S. Evolutionary Conserved Positions Define Protein Conformational Diversity. PLoS Comput Biol 2016; 12:e1004775. [PMID: 27008419 PMCID: PMC4805271 DOI: 10.1371/journal.pcbi.1004775] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2015] [Accepted: 01/27/2016] [Indexed: 12/18/2022] Open
Abstract
Conformational diversity of the native state plays a central role in modulating protein function. The selection paradigm sustains that different ligands shift the conformational equilibrium through their binding to highest-affinity conformers. Intramolecular vibrational dynamics associated to each conformation should guarantee conformational transitions, which due to its importance, could possibly be associated with evolutionary conserved traits. Normal mode analysis, based on a coarse-grained model of the protein, can provide the required information to explore these features. Herein, we present a novel procedure to identify key positions sustaining the conformational diversity associated to ligand binding. The method is applied to an adequate refined dataset of 188 paired protein structures in their bound and unbound forms. Firstly, normal modes most involved in the conformational change are selected according to their corresponding overlap with structural distortions introduced by ligand binding. The subspace defined by these modes is used to analyze the effect of simulated point mutations on preserving the conformational diversity of the protein. We find a negative correlation between the effects of mutations on these normal mode subspaces associated to ligand-binding and position-specific evolutionary conservations obtained from multiple sequence-structure alignments. Positions whose mutations are found to alter the most these subspaces are defined as key positions, that is, dynamically important residues that mediate the ligand-binding conformational change. These positions are shown to be evolutionary conserved, mostly buried aliphatic residues localized in regular structural regions of the protein like β-sheets and α-helix.
Collapse
|
21
|
Bietz S, Rarey M. SIENA: Efficient Compilation of Selective Protein Binding Site Ensembles. J Chem Inf Model 2016; 56:248-59. [PMID: 26759067 DOI: 10.1021/acs.jcim.5b00588] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
Structural flexibility of proteins has an important influence on molecular recognition and enzymatic function. In modeling, structure ensembles are therefore often applied as a valuable source of alternative protein conformations. However, their usage is often complicated by structural artifacts and inconsistent data annotation. Here, we present SIENA, a new computational approach for the automated assembly and preprocessing of protein binding site ensembles. Starting with an arbitrarily defined binding site in a single protein structure, SIENA searches for alternative conformations of the same or sequentially closely related binding sites. The method is based on an indexed database for identifying perfect k-mer matches and a recently published algorithm for the alignment of protein binding site conformations. Furthermore, SIENA provides a new algorithm for the interaction-based selection of binding site conformations which aims at covering all known ligand-binding geometries. Various experiments highlight that SIENA is able to generate comprehensive and well selected binding site ensembles improving the compatibility to both known and unconsidered ligand molecules. Starting with the whole PDB as data source, the computation time of the whole ensemble generation takes only a few seconds. SIENA is available via a Web service at www.zbh.uni-hamburg.de/siena .
Collapse
Affiliation(s)
- Stefan Bietz
- Center for Bioinformatics, University of Hamburg , Bundesstrasse 43, 20146 Hamburg, Germany
| | - Matthias Rarey
- Center for Bioinformatics, University of Hamburg , Bundesstrasse 43, 20146 Hamburg, Germany
| |
Collapse
|
22
|
Hrabe T, Li Z, Sedova M, Rotkiewicz P, Jaroszewski L, Godzik A. PDBFlex: exploring flexibility in protein structures. Nucleic Acids Res 2015; 44:D423-8. [PMID: 26615193 PMCID: PMC4702920 DOI: 10.1093/nar/gkv1316] [Citation(s) in RCA: 55] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2015] [Accepted: 11/10/2015] [Indexed: 12/16/2022] Open
Abstract
The PDBFlex database, available freely and with no login requirements at http://pdbflex.org, provides information on flexibility of protein structures as revealed by the analysis of variations between depositions of different structural models of the same protein in the Protein Data Bank (PDB). PDBFlex collects information on all instances of such depositions, identifying them by a 95% sequence identity threshold, performs analysis of their structural differences and clusters them according to their structural similarities for easy analysis. The PDBFlex contains tools and viewers enabling in-depth examination of structural variability including: 2D-scaling visualization of RMSD distances between structures of the same protein, graphs of average local RMSD in the aligned structures of protein chains, graphical presentation of differences in secondary structure and observed structural disorder (unresolved residues), difference distance maps between all sets of coordinates and 3D views of individual structures and simulated transitions between different conformations, the latter displayed using JSMol visualization software.
Collapse
Affiliation(s)
- Thomas Hrabe
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Zhanwen Li
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Mayya Sedova
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Piotr Rotkiewicz
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Lukasz Jaroszewski
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| | - Adam Godzik
- Bioinformatics and Systems Biology Program, Sanford Burnham Prebys Medical Discovery Institute, 10901 North Torrey Pines Road, La Jolla, CA 92037, USA
| |
Collapse
|
23
|
ConTemplate Suggests Possible Alternative Conformations for a Query Protein of Known Structure. Structure 2015; 23:2162-70. [DOI: 10.1016/j.str.2015.08.018] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2015] [Revised: 07/31/2015] [Accepted: 08/24/2015] [Indexed: 10/22/2022]
|
24
|
Hasenahuer MA, Parisi G, Gautier M, Lazarowski A, Bramuglia GF, Fornasari MS. Twenty-One Novel EGFR Kinase Domain variants in Patients with Nonsmall Cell Lung Cancer. Ann Hum Genet 2015; 79:385-93. [PMID: 26420346 DOI: 10.1111/ahg.12127] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2014] [Revised: 06/01/2015] [Accepted: 06/03/2015] [Indexed: 01/11/2023]
Abstract
Somatic sequence variants in the epidermal growth factor receptor (EGFR) kinase domain are associated with sensitivity to tyrosine kinase inhibitors (TKIs) in patients with nonsmall cell lung cancer (NSCLC). Patients exhibiting sequence variants in this domain that produce kinase activity enhancement, are more likely to benefit from TKIs than patients with EGFR wild-type disease. Although most NSCLC EGFR-related alleles are concentrated in a few positions, established protocols recommend sequencing EGFR exons 18-21. In this study, 21 novel somatic variants belonging to such exons in adult Argentinean patients affected with NSCLC are reported. Of these, 18 were single amino acid substitutions (SASs), occurring alone or in combination with another genetic alteration (complex cases), one was a short deletion, one was a short deletion-short insertion combination, and one was a duplication. New variants and different combinations of previously reported variants were also found. Moreover, two of the reported SASs occurred in previously unreported positions of the EGFR kinase domain. In order to characterize the new sequence variants, physicochemical, sequence and conformational analyses were also performed. A better understanding of sequence variants in NSCLC may facilitate the most appropriate treatment choice for this complex disease.
Collapse
Affiliation(s)
- Marcia A Hasenahuer
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, Bernal, (B1876BXD), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, Bernal, (B1876BXD), Buenos Aires, Argentina
| | - Marien Gautier
- Fundación Investigar. Riobamba 429 Piso 15, Ciudad Autónoma de Buenos Aires, (C1022AAT), Argentina
| | - Alberto Lazarowski
- Fundación Investigar. Riobamba 429 Piso 15, Ciudad Autónoma de Buenos Aires, (C1022AAT), Argentina.,Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 954, Ciudad Autónoma de Buenos Aires, (C1113AAD), Argentina
| | - Guillermo F Bramuglia
- Fundación Investigar. Riobamba 429 Piso 15, Ciudad Autónoma de Buenos Aires, (C1022AAT), Argentina.,Facultad de Farmacia y Bioquímica, Universidad de Buenos Aires, Junín 954, Ciudad Autónoma de Buenos Aires, (C1113AAD), Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Saenz Peña 352, Bernal, (B1876BXD), Buenos Aires, Argentina
| |
Collapse
|
25
|
Li W, Kinch LN, Karplus PA, Grishin NV. ChSeq: A database of chameleon sequences. Protein Sci 2015; 24:1075-86. [PMID: 25970262 DOI: 10.1002/pro.2689] [Citation(s) in RCA: 40] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/20/2015] [Revised: 04/15/2015] [Accepted: 04/24/2015] [Indexed: 11/11/2022]
Abstract
Chameleon sequences (ChSeqs) refer to sequence strings of identical amino acids that can adopt different conformations in protein structures. Researchers have detected and studied ChSeqs to understand the interplay between local and global interactions in protein structure formation. The different secondary structures adopted by one ChSeq challenge sequence-based secondary structure predictors. With increasing numbers of available Protein Data Bank structures, we here identify a large set of ChSeqs ranging from 6 to 10 residues in length. The homologous ChSeqs discovered highlight the structural plasticity involved in biological function. When compared with previous studies, the set of unrelated ChSeqs found represents an about 20-fold increase in the number of detected sequences, as well as an increase in the longest ChSeq length from 8 to 10 residues. We applied secondary structure predictors on our ChSeqs and found that methods based on a sequence profile outperformed methods based on a single sequence. For the unrelated ChSeqs, the evolutionary information provided by the sequence profile typically allows successful prediction of the prevailing secondary structure adopted in each protein family. Our dataset will facilitate future studies of ChSeqs, as well as interpretations of the interplay between local and nonlocal interactions. A user-friendly web interface for this ChSeq database is available at prodata.swmed.edu/chseq.
Collapse
Affiliation(s)
- Wenlin Li
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| | - Lisa N Kinch
- Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| | - P Andrew Karplus
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, 97331
| | - Nick V Grishin
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050.,Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas, 75390-9050
| |
Collapse
|
26
|
Parisi G, Zea DJ, Monzon AM, Marino-Buslje C. Conformational diversity and the emergence of sequence signatures during evolution. Curr Opin Struct Biol 2015; 32:58-65. [DOI: 10.1016/j.sbi.2015.02.005] [Citation(s) in RCA: 27] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2014] [Revised: 02/02/2015] [Accepted: 02/09/2015] [Indexed: 02/03/2023]
|