1
|
Raisinghani N, Parikh V, Foley B, Verkhivker G. AlphaFold2-Based Characterization of Apo and Holo Protein Structures and Conformational Ensembles Using Randomized Alanine Sequence Scanning Adaptation: Capturing Shared Signature Dynamics and Ligand-Induced Conformational Changes. Int J Mol Sci 2024; 25:12968. [PMID: 39684679 DOI: 10.3390/ijms252312968] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2024] [Revised: 11/24/2024] [Accepted: 11/29/2024] [Indexed: 12/18/2024] Open
Abstract
Proteins often exist in multiple conformational states, influenced by the binding of ligands or substrates. The study of these states, particularly the apo (unbound) and holo (ligand-bound) forms, is crucial for understanding protein function, dynamics, and interactions. In the current study, we use AlphaFold2, which combines randomized alanine sequence masking with shallow multiple sequence alignment subsampling to expand the conformational diversity of the predicted structural ensembles and capture conformational changes between apo and holo protein forms. Using several well-established datasets of structurally diverse apo-holo protein pairs, the proposed approach enables robust predictions of apo and holo structures and conformational ensembles, while also displaying notably similar dynamics distributions. These observations are consistent with the view that the intrinsic dynamics of allosteric proteins are defined by the structural topology of the fold and favor conserved conformational motions driven by soft modes. Our findings provide evidence that AlphaFold2 combined with randomized alanine sequence masking can yield accurate and consistent results in predicting moderate conformational adjustments between apo and holo states, especially for proteins with localized changes upon ligand binding. For large hinge-like domain movements, the proposed approach can predict functional conformations characteristic of both apo and ligand-bound holo ensembles in the absence of ligand information. These results are relevant for using this AlphaFold adaptation for probing conformational selection mechanisms according to which proteins can adopt multiple conformations, including those that are competent for ligand binding. The results of this study indicate that robust modeling of functional protein states may require more accurate characterization of flexible regions in functional conformations and the detection of high-energy conformations. By incorporating a wider variety of protein structures in training datasets, including both apo and holo forms, the model can learn to recognize and predict the structural changes that occur upon ligand binding.
Collapse
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA
| | - Vedant Parikh
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA
| | - Brandon Foley
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA
| | - Gennady Verkhivker
- Keck Center for Science and Engineering, Schmid College of Science and Technology, Chapman University, Orange, CA 92866, USA
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, CA 92618, USA
| |
Collapse
|
2
|
Escobedo N, Saldaño T, Mac Donagh J, Sawicki LR, Palopoli N, Alberti SF, Fornasari MS, Parisi G. Revealing Missing Protein-Ligand Interactions Using AlphaFold Predictions. J Mol Biol 2024; 436:168852. [PMID: 39510344 DOI: 10.1016/j.jmb.2024.168852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 10/05/2024] [Accepted: 10/30/2024] [Indexed: 11/15/2024]
Abstract
Protein-ligand interactions represent an essential step to understand the bases of molecular recognition, an intense field of research in many scientific areas. Structural biology has played a central role in unveiling protein-ligand interactions, but current techniques are still not able to reliably describe the interactions of ligands with highly flexible regions. In this work, we explored the capacity of AlphaFold2 (AF2) to estimate the presence of interactions between ligands and residues belonging to disordered regions. As these interactions are missing in the crystallographic-derived structures, we called them "ghost interactions". Using a set of protein structures experimentally obtained after AF2 was trained, we found that the obtained models are good predictors of regions associated with order-disorder transitions. Additionally, we found that AF2 predicts residues making ghost interactions with ligands, which are mostly buried and show differential evolutionary conservation with the rest of the residues located in the flexible region. Our findings could fuel current areas of research that consider, given their biological relevance and their involvement in diseases, intrinsically disordered proteins as potentially valuable targets for drug development.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | - Tadeo Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina; Departamento de Ciencias Básicas, Facultad de Agronomía, Universidad Nacional del Centro de la Provincia de Buenos Aires, Azul, Buenos Aires B7300, Argentina
| | - Juan Mac Donagh
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | | | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | | | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina.
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina.
| |
Collapse
|
3
|
Zhang F, Li Z, Zhao K, Zhao P, Zhang G. Prediction of Inter-Residue Multiple Distances and Exploration of Protein Multiple Conformations by Deep Learning. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; 21:1731-1739. [PMID: 38857126 DOI: 10.1109/tcbb.2024.3411825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]
Abstract
AlphaFold2 has achieved a major breakthrough in end-to-end prediction for static protein structures. However, protein conformational change is considered to be a key factor in protein biological function. Inter-residue multiple distances prediction is of great significance for research on protein multiple conformations exploration. In this study, we proposed an inter-residue multiple distances prediction method, DeepMDisPre, based on an improved network which integrates triangle update, axial attention and ResNet to predict multiple distances of residue pairs. We built a dataset which contains proteins with a single structure and proteins with multiple conformations to train the network. We tested DeepMDisPre on 114 proteins with multiple conformations. The results show that the inter-residue distance distribution predicted by DeepMDisPre tends to have multiple peaks for flexible residue pairs than for rigid residue pairs. On two cases of proteins with multiple conformations, we modeled the multiple conformations relatively accurately by using the predicted inter-residue multiple distances. In addition, we also tested the performance of DeepMDisPre on 279 proteins with a single structure. Experimental results demonstrate that the average contact accuracy of DeepMDisPre is higher than that of the comparative method. In terms of static protein modeling, the average TM-score of the 3D models built by DeepMDisPre is also improved compared with the comparative method.
Collapse
|
4
|
Feidakis CP, Krivak R, Hoksza D, Novotny M. AHoJ-DB: A PDB-wide Assignment of apo & holo Relationships Based on Individual Protein-Ligand Interactions. J Mol Biol 2024; 436:168545. [PMID: 38508305 DOI: 10.1016/j.jmb.2024.168545] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2024] [Revised: 03/12/2024] [Accepted: 03/14/2024] [Indexed: 03/22/2024]
Abstract
A single protein structure is rarely sufficient to capture the conformational variability of a protein. Both bound and unbound (holo and apo) forms of a protein are essential for understanding its geometry and making meaningful comparisons. Nevertheless, docking or drug design studies often still consider only single protein structures in their holo form, which are for the most part rigid. With the recent explosion in the field of structural biology, large, curated datasets are urgently needed. Here, we use a previously developed application (AHoJ) to perform a comprehensive search for apo-holo pairs for 468,293 biologically relevant protein-ligand interactions across 27,983 proteins. In each search, the binding pocket is captured and mapped across existing structures within the same UniProt, and the mapped pockets are annotated as apo or holo, based on the presence or absence of ligands. We assemble the results into a database, AHoJ-DB (www.apoholo.cz/db), that captures the variability of proteins with identical sequences, thereby exposing the agents responsible for the observed differences in geometry. We report several metrics for each annotated pocket, and we also include binding pockets that form at the interface of multiple chains. Analysis of the database shows that about 24% of the binding sites occur at the interface of two or more chains and that less than 50% of the total binding sites processed have an apo form in the PDB. These results can be used to train and evaluate predictors, discover potentially druggable proteins, and reveal protein- and ligand-specific relationships that were previously obscured by intermittent or partial data. Availability: www.apoholo.cz/db.
Collapse
Affiliation(s)
- Christos P Feidakis
- Department of Cell Biology, Faculty of Science, Charles University, Prague 12843, Czech Republic.
| | - Radoslav Krivak
- Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague 12116, Czech Republic; Institute of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Prague 16000, Czech Republic
| | - David Hoksza
- Department of Software Engineering, Faculty of Mathematics and Physics, Charles University, Prague 12116, Czech Republic
| | - Marian Novotny
- Department of Cell Biology, Faculty of Science, Charles University, Prague 12843, Czech Republic.
| |
Collapse
|
5
|
Gavalda-Garcia J, Bickel D, Roca-Martinez J, Raimondi D, Orlando G, Vranken W. Data-driven probabilistic definition of the low energy conformational states of protein residues. NAR Genom Bioinform 2024; 6:lqae082. [PMID: 38984065 PMCID: PMC11231583 DOI: 10.1093/nargab/lqae082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2024] [Revised: 06/14/2024] [Accepted: 06/26/2024] [Indexed: 07/11/2024] Open
Abstract
Protein dynamics and related conformational changes are essential for their function but difficult to characterise and interpret. Amino acids in a protein behave according to their local energy landscape, which is determined by their local structural context and environmental conditions. The lowest energy state for a given residue can correspond to sharply defined conformations, e.g. in a stable helix, or can cover a wide range of conformations, e.g. in intrinsically disordered regions. A good definition of such low energy states is therefore important to describe the behaviour of a residue and how it changes with its environment. We propose a data-driven probabilistic definition of six low energy conformational states typically accessible for amino acid residues in proteins. This definition is based on solution NMR information of 1322 proteins through a combined analysis of structure ensembles with interpreted chemical shifts. We further introduce a conformational state variability parameter that captures, based on an ensemble of protein structures from molecular dynamics or other methods, how often a residue moves between these conformational states. The approach enables a different perspective on the local conformational behaviour of proteins that is complementary to their static interpretation from single structure models.
Collapse
Affiliation(s)
- Jose Gavalda-Garcia
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - David Bickel
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | - Joel Roca-Martinez
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| | | | | | - Wim Vranken
- Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
| |
Collapse
|
6
|
Nithin C, Fornari RP, Pilla SP, Wroblewski K, Zalewski M, Madaj R, Kolinski A, Macnar JM, Kmiecik S. Exploring protein functions from structural flexibility using CABS-flex modeling. Protein Sci 2024; 33:e5090. [PMID: 39194135 PMCID: PMC11350595 DOI: 10.1002/pro.5090] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/29/2024] [Revised: 05/06/2024] [Accepted: 06/10/2024] [Indexed: 08/29/2024]
Abstract
Understanding protein function often necessitates characterizing the flexibility of protein structures. However, simulating protein flexibility poses significant challenges due to the complex dynamics of protein systems, requiring extensive computational resources and accurate modeling techniques. In response to these challenges, the CABS-flex method has been developed as an efficient modeling tool that combines coarse-grained simulations with all-atom detail. Available both as a web server and a standalone package, CABS-flex is dedicated to a wide range of users. The web server version offers an accessible interface for straightforward tasks, while the standalone command-line program is designed for advanced users, providing additional features, analytical tools, and support for handling large systems. This paper examines the application of CABS-flex across various structure-function studies, facilitating investigations into the interplay among protein structure, dynamics, and function in diverse research fields. We present an overview of the current status of the CABS-flex methodology, highlighting its recent advancements, practical applications, and forthcoming challenges.
Collapse
Affiliation(s)
- Chandran Nithin
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Rocco Peter Fornari
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Smita P. Pilla
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Karol Wroblewski
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Mateusz Zalewski
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Rafał Madaj
- Institute of Evolutionary Biology, Biological and Chemical Research Centre, Faculty of BiologyUniversity of WarsawWarsawPoland
| | - Andrzej Kolinski
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| | - Joanna M. Macnar
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
- Present address:
Ryvu TherapeuticsCracowPoland
| | - Sebastian Kmiecik
- Biological and Chemical Research Centre, Faculty of ChemistryUniversity of WarsawWarsawPoland
| |
Collapse
|
7
|
Palaniappan C, Rajendran S, Sekar K. Alternate conformations found in protein structures implies biological functions: A case study using cyclophilin A. Curr Res Struct Biol 2024; 7:100145. [PMID: 38690327 PMCID: PMC11059445 DOI: 10.1016/j.crstbi.2024.100145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2023] [Revised: 03/16/2024] [Accepted: 04/15/2024] [Indexed: 05/02/2024] Open
Abstract
Protein dynamics linked to numerous biomolecular functions, such as ligand binding, allosteric regulation, and catalysis, must be better understood at the atomic level. Reactive atoms of key residues drive a repertoire of biomolecular functions by flipping between alternate conformations or conformational substates, seldom found in protein structures. Probing such sparsely sampled alternate conformations would provide mechanistic insight into many biological functions. We are therefore interested in evaluating the instance of amino acids adopted alternate conformations, either in backbone or side-chain atoms or in both. Accordingly, over 70000 protein structures appear to contain alternate conformations only 'A' and 'B' for any atom, particularly the instance of amino acids that adopted alternate conformations are more for Arg, Cys, Met, and Ser than others. The resulting protein structure analysis depicts that amino acids with alternate conformations are mainly found in the helical and β-regions and are often seen in high-resolution X-ray crystal structures. Furthermore, a case study on human cyclophilin A (CypA) was performed to explain the pre-existing intrinsic dynamics of catalytically critical residues from the CypA and how such intrinsic dynamics perturbed upon Ser99Thr mutation using molecular dynamics simulations on the ns-μs timescale. Simulation results demonstrated that the Ser99Thr mutation had impaired the alternate conformations or the catalytically productive micro-environment of Phe113, mimicking the experimentally observed perturbation captured by X-ray crystallography. In brief, a deeper comprehension of alternate conformations adopted by the amino acids may shed light on the interplay between protein structure, dynamics, and function.
Collapse
Affiliation(s)
- Chandrasekaran Palaniappan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, 560012, India
| | - Santhosh Rajendran
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
- Department of Computational and Data Sciences, Indian Institute of Science, Bangalore, 560012, India
| | - Kanagaraj Sekar
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, 560012, India
| |
Collapse
|
8
|
Singh PK, Stan RC. ThermoPCD: a database of molecular dynamics trajectories of antibody-antigen complexes at physiologic and fever-range temperatures. Database (Oxford) 2024; 2024:baae015. [PMID: 38502609 PMCID: PMC10950042 DOI: 10.1093/database/baae015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 02/12/2024] [Accepted: 02/22/2024] [Indexed: 03/21/2024]
Abstract
Progression of various cancers and autoimmune diseases is associated with changes in systemic or local tissue temperatures, which may impact current therapies. The role of fever and acute inflammation-range temperatures on the stability and activity of antibodies relevant for cancers and autoimmunity is unknown. To produce molecular dynamics (MD) trajectories of immune complexes at relevant temperatures, we used the Research Collaboratory for Structural Bioinformatics (RCSB) database to identify 50 antibody:antigen complexes of interest, in addition to single antibodies and antigens, and deployed Groningen Machine for Chemical Simulations (GROMACS) to prepare and run the structures at different temperatures for 100-500 ns, in single or multiple random seeds. MD trajectories are freely available. Processed data include Protein Data Bank outputs for all files obtained every 50 ns, and free binding energy calculations for some of the immune complexes. Protocols for using the data are also available. Individual datasets contain unique DOIs. We created a web interface, ThermoPCD, as a platform to explore the data. The outputs of ThermoPCD allow the users to relate thermally-dependent changes in epitopes:paratopes interfaces to their free binding energies, or against own experimentally derived binding affinities. ThermoPCD is a free to use database of immune complexes' trajectories at different temperatures that does not require registration and allows for all the data to be available for download. Database URL: https://sites.google.com/view/thermopcd/home.
Collapse
Affiliation(s)
- Puneet K Singh
- Department of Basic Medical Science, Chonnam National University, Hwasun 58128, Republic of Korea
| | - Razvan C Stan
- Department of Basic Medical Science, Chonnam National University, Hwasun 58128, Republic of Korea
| |
Collapse
|
9
|
Doni D, Cavallari E, Noguera ME, Gentili HG, Cavion F, Parisi G, Fornasari MS, Sartori G, Santos J, Bellanda M, Carbonera D, Costantini P, Bortolus M. Searching for Frataxin Function: Exploring the Analogy with Nqo15, the Frataxin-like Protein of Respiratory Complex I from Thermus thermophilus. Int J Mol Sci 2024; 25:1912. [PMID: 38339189 PMCID: PMC10855754 DOI: 10.3390/ijms25031912] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 01/26/2024] [Accepted: 02/02/2024] [Indexed: 02/12/2024] Open
Abstract
Nqo15 is a subunit of respiratory complex I of the bacterium Thermus thermophilus, with strong structural similarity to human frataxin (FXN), a protein involved in the mitochondrial disease Friedreich's ataxia (FRDA). Recently, we showed that the expression of recombinant Nqo15 can ameliorate the respiratory phenotype of FRDA patients' cells, and this prompted us to further characterize both the Nqo15 solution's behavior and its potential functional overlap with FXN, using a combination of in silico and in vitro techniques. We studied the analogy of Nqo15 and FXN by performing extensive database searches based on sequence and structure. Nqo15's folding and flexibility were investigated by combining nuclear magnetic resonance (NMR), circular dichroism, and coarse-grained molecular dynamics simulations. Nqo15's iron-binding properties were studied using NMR, fluorescence, and specific assays and its desulfurase activation by biochemical assays. We found that the recombinant Nqo15 isolated from complex I is monomeric, stable, folded in solution, and highly dynamic. Nqo15 does not share the iron-binding properties of FXN or its desulfurase activation function.
Collapse
Affiliation(s)
- Davide Doni
- Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
| | - Eva Cavallari
- Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
- Grenoble Alpes University, CNRS, CEA, INRAE, IRIG-LPCV, 38000 Grenoble, France
| | - Martin Ezequiel Noguera
- Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.)
- Institute of Biological Chemistry and Physical Chemistry, Dr Alejandro Paladini (UBA-CONICET), University of Buenos Aires, Junín 956, Buenos Aires 1113AAD, Argentina
- Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
| | - Hernan Gustavo Gentili
- Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.)
| | - Federica Cavion
- Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
| | - Gustavo Parisi
- Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
| | - Maria Silvina Fornasari
- Department of Science and Technology, National University of Quilmes, Roque Saenz Peña 352, Bernal B1876BXD, Argentina; (G.P.); (M.S.F.)
| | - Geppo Sartori
- Department of Biomedical Sciences, University of Padova, 35121 Padova, Italy;
| | - Javier Santos
- Department of Physiology and Molecular and Cellular Biology, Institute of Biosciences, Biotechnology and Translational Biology (iB3), Faculty of Exact and Natural Sciences, University of Buenos Aires, Intendente Güiraldes 2160, Buenos Aires C1428EG, Argentina; (M.E.N.); (H.G.G.); (J.S.)
| | - Massimo Bellanda
- Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.)
- Consiglio Nazionale delle Ricerche Institute of Biomolecular Chemistry, 35131 Padova, Italy
| | - Donatella Carbonera
- Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.)
| | - Paola Costantini
- Department of Biology, University of Padova, 35121 Padova, Italy; (D.D.); (F.C.)
| | - Marco Bortolus
- Department of Chemical Sciences, University of Padova, 35131 Padova, Italy; (M.B.); (D.C.)
| |
Collapse
|
10
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
11
|
Piovesan D, Del Conte A, Clementel D, Monzon A, Bevilacqua M, Aspromonte M, Iserte J, Orti FE, Marino-Buslje C, Tosatto SE. MobiDB: 10 years of intrinsically disordered proteins. Nucleic Acids Res 2022; 51:D438-D444. [PMID: 36416266 PMCID: PMC9825420 DOI: 10.1093/nar/gkac1065] [Citation(s) in RCA: 84] [Impact Index Per Article: 28.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2022] [Revised: 10/11/2022] [Accepted: 10/25/2022] [Indexed: 11/24/2022] Open
Abstract
The MobiDB database (URL: https://mobidb.org/) is a knowledge base of intrinsically disordered proteins. MobiDB aggregates disorder annotations derived from the literature and from experimental evidence along with predictions for all known protein sequences. MobiDB generates new knowledge and captures the functional significance of disordered regions by processing and combining complementary sources of information. Since its first release 10 years ago, the MobiDB database has evolved in order to improve the quality and coverage of protein disorder annotations and its accessibility. MobiDB has now reached its maturity in terms of data standardization and visualization. Here, we present a new release which focuses on the optimization of user experience and database content. The major advances compared to the previous version are the integration of AlphaFoldDB predictions and the re-implementation of the homology transfer pipeline, which expands manually curated annotations by two orders of magnitude. Finally, the entry page has been restyled in order to provide an overview of the available annotations along with two separate views that highlight structural disorder evidence and functions associated with different binding modes.
Collapse
Affiliation(s)
- Damiano Piovesan
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | - Damiano Clementel
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | | | | | | - Javier A Iserte
- Bioinformatics Unit, Fundación Instituto Leloir, Buenos Aires, Argentina
| | - Fernando E Orti
- Bioinformatics Unit, Fundación Instituto Leloir, Buenos Aires, Argentina
| | | | | |
Collapse
|
12
|
Escobedo N, Tunque Cahui RR, Caruso G, García Ríos E, Hirsh L, Monzon AM, Parisi G, Palopoli N. CoDNaS-Q: a database of conformational diversity of the native state of proteins with quaternary structure. Bioinformatics 2022; 38:4959-4961. [DOI: 10.1093/bioinformatics/btac627] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2022] [Revised: 07/03/2022] [Accepted: 09/15/2022] [Indexed: 11/15/2022] Open
Abstract
Abstract
Summary
A collection of conformers that exist in a dynamical equilibrium defines the native state of a protein. The structural differences between them describe their conformational diversity, a defining characteristic of the protein with an essential role in multiple cellular processes. Since most proteins carry out their functions by assembling into complexes, we have developed CoDNaS-Q, the first online resource to explore conformational diversity in homooligomeric proteins. It features a curated collection of redundant protein structures with known quaternary structure. CoDNaS-Q integrates relevant annotations that allow researchers to identify and explore the extent and possible reasons of conformational diversity in homooligomeric protein complexes.
Availability and Implementation
CoDNaS-Q is freely accessible at http://ufq.unq.edu.ar/codnasq/ or https://codnas-q.bioinformatica.org/home. The data can be retrieved from the website. The source code of the database can be downloaded from https://github.com/SfrRonaldo/codnas-q.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| | | | - Gastón Caruso
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
| | - Emilio García Ríos
- Pontificia Universidad Católica del Perú Departamento de Ingeniería, , Lima, Perú
| | - Layla Hirsh
- Pontificia Universidad Católica del Perú Departamento de Ingeniería, , Lima, Perú
| | | | - Gustavo Parisi
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| | - Nicolas Palopoli
- Universidad Nacional de Quilmes Departamento de Ciencia y Tecnología, , Buenos Aires, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas , Buenos Aires, Argentina
| |
Collapse
|
13
|
Avery C, Patterson J, Grear T, Frater T, Jacobs DJ. Protein Function Analysis through Machine Learning. Biomolecules 2022; 12:1246. [PMID: 36139085 PMCID: PMC9496392 DOI: 10.3390/biom12091246] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2022] [Revised: 08/22/2022] [Accepted: 08/31/2022] [Indexed: 11/16/2022] Open
Abstract
Machine learning (ML) has been an important arsenal in computational biology used to elucidate protein function for decades. With the recent burgeoning of novel ML methods and applications, new ML approaches have been incorporated into many areas of computational biology dealing with protein function. We examine how ML has been integrated into a wide range of computational models to improve prediction accuracy and gain a better understanding of protein function. The applications discussed are protein structure prediction, protein engineering using sequence modifications to achieve stability and druggability characteristics, molecular docking in terms of protein-ligand binding, including allosteric effects, protein-protein interactions and protein-centric drug discovery. To quantify the mechanisms underlying protein function, a holistic approach that takes structure, flexibility, stability, and dynamics into account is required, as these aspects become inseparable through their interdependence. Another key component of protein function is conformational dynamics, which often manifest as protein kinetics. Computational methods that use ML to generate representative conformational ensembles and quantify differences in conformational ensembles important for function are included in this review. Future opportunities are highlighted for each of these topics.
Collapse
Affiliation(s)
- Chris Avery
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - John Patterson
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Tyler Grear
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Theodore Frater
- Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| | - Donald J. Jacobs
- Department of Physics and Optical Science, University of North Carolina at Charlotte, Charlotte, NC 28223, USA
| |
Collapse
|
14
|
Roca-Martinez J, Lazar T, Gavalda-Garcia J, Bickel D, Pancsa R, Dixit B, Tzavella K, Ramasamy P, Sanchez-Fornaris M, Grau I, Vranken WF. Challenges in describing the conformation and dynamics of proteins with ambiguous behavior. Front Mol Biosci 2022; 9:959956. [PMID: 35992270 PMCID: PMC9382080 DOI: 10.3389/fmolb.2022.959956] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/02/2022] [Accepted: 06/27/2022] [Indexed: 11/13/2022] Open
Abstract
Traditionally, our understanding of how proteins operate and how evolution shapes them is based on two main data sources: the overall protein fold and the protein amino acid sequence. However, a significant part of the proteome shows highly dynamic and/or structurally ambiguous behavior, which cannot be correctly represented by the traditional fixed set of static coordinates. Representing such protein behaviors remains challenging and necessarily involves a complex interpretation of conformational states, including probabilistic descriptions. Relating protein dynamics and multiple conformations to their function as well as their physiological context (e.g., post-translational modifications and subcellular localization), therefore, remains elusive for much of the proteome, with studies to investigate the effect of protein dynamics relying heavily on computational models. We here investigate the possibility of delineating three classes of protein conformational behavior: order, disorder, and ambiguity. These definitions are explored based on three different datasets, using interpretable machine learning from a set of features, from AlphaFold2 to sequence-based predictions, to understand the overlap and differences between these datasets. This forms the basis for a discussion on the current limitations in describing the behavior of dynamic and ambiguous proteins.
Collapse
Affiliation(s)
- Joel Roca-Martinez
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Tamas Lazar
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- VIB-VUB Center for Structural Biology, Brussels, Belgium
| | - Jose Gavalda-Garcia
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - David Bickel
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Rita Pancsa
- Research Centre for Natural Sciences, Institute of Enzymology, Budapest, Hungary
| | - Bhawna Dixit
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- IBiTech-Biommeda, Universiteit Gent, Gent, Belgium
| | - Konstantina Tzavella
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| | - Pathmanaban Ramasamy
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- VIB-UGent Center for Medical Biotechnology, Universiteit Gent, Gent, Belgium
| | - Maite Sanchez-Fornaris
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
- Department of Computer Sciences, University of Camagüey, Camagüey, Cuba
| | - Isel Grau
- Information Systems, Eindhoven University of Technology, Eindhoven, Netherlands
| | - Wim F. Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
- Interuniversity Institute of Bioinformatics in Brussels, VUB/ULB, Brussels, Belgium
| |
Collapse
|
15
|
González Buitrón M, Tunque Cahui RR, García Ríos E, Hirsh L, Parisi G, Fornasari MS, Palopoli N. CoDNaS-RNA: a database of conformational diversity in the native state of RNA. Bioinformatics 2022; 38:1745-1748. [PMID: 34954795 DOI: 10.1093/bioinformatics/btab858] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 09/29/2021] [Accepted: 12/23/2021] [Indexed: 02/03/2023] Open
Abstract
SUMMARY Conformational changes in RNA native ensembles are central to fulfill many of their biological roles. Systematic knowledge of the extent and possible modulators of this conformational diversity is desirable to better understand the relationship between RNA dynamics and function. We have developed CoDNaS-RNA as the first database of conformational diversity in RNA molecules. Known RNA structures are retrieved and clustered to identify alternative conformers of each molecule. Pairwise structural comparisons between all conformers within each cluster allows to measure the variability of the molecule. Additional annotations about structural features, molecular interactions and biological function are provided. All data in CoDNaS-RNA is free to download and available as a public website that can be of interest for researchers in computational biology and other life science disciplines. AVAILABILITY AND IMPLEMENTATION The data underlying this article are available at http://ufq.unq.edu.ar/codnasrna or https://codnas-rna.bioinformatica.org/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Martín González Buitrón
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina.,Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | | | - Emilio García Ríos
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Peru
| | - Layla Hirsh
- Departamento de Ingeniería, Pontificia Universidad Católica del Perú, Lima, Peru
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina.,Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina.,Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Buenos Aires, Argentina.,Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| |
Collapse
|
16
|
Kurgan L. Resources for computational prediction of intrinsic disorder in proteins. Methods 2022; 204:132-141. [DOI: 10.1016/j.ymeth.2022.03.018] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 03/25/2022] [Accepted: 03/29/2022] [Indexed: 12/26/2022] Open
|
17
|
D3PM: a comprehensive database for protein motions ranging from residue to domain. BMC Bioinformatics 2022; 23:70. [PMID: 35164668 PMCID: PMC8845362 DOI: 10.1186/s12859-022-04595-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2020] [Accepted: 02/01/2022] [Indexed: 11/24/2022] Open
Abstract
Background Knowledge of protein motions is significant to understand its functions. While currently available databases for protein motions are mostly focused on overall domain motions, little attention is paid on local residue motions. Albeit with relatively small scale, the local residue motions, especially those residues in binding pockets, may play crucial roles in protein functioning and ligands binding. Results A comprehensive protein motion database, namely D3PM, was constructed in this study to facilitate the analysis of protein motions. The protein motions in the D3PM range from overall structural changes of macromolecule to local flip motions of binding pocket residues. Currently, the D3PM has collected 7679 proteins with overall motions and 3513 proteins with pocket residue motions. The motion patterns are classified into 4 types of overall structural changes and 5 types of pocket residue motions. Impressively, we found that less than 15% of protein pairs have obvious overall conformational adaptations induced by ligand binding, while more than 50% of protein pairs have significant structural changes in ligand binding sites, indicating that ligand-induced conformational changes are drastic and mainly confined around ligand binding sites. Based on the residue preference in binding pocket, we classified amino acids into “pocketphilic” and “pocketphobic” residues, which should be helpful for pocket prediction and drug design. Conclusion D3PM is a comprehensive database about protein motions ranging from residue to domain, which should be useful for exploring diverse protein motions and for understanding protein function and drug design. The D3PM is available on www.d3pharma.com/D3PM/index.php. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-04595-0.
Collapse
|
18
|
Barletta GP, Barletta M, Saldaño TE, Fernandez-Alberti S. Analysis of changes of cavity volumes in predefined directions of protein motions and cavity flexibility. J Comput Chem 2021; 43:391-401. [PMID: 34962296 DOI: 10.1002/jcc.26799] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2021] [Accepted: 12/13/2021] [Indexed: 11/12/2022]
Abstract
Dynamics of protein cavities associated with protein fluctuations and conformational plasticity is essential for their biological function. NMR ensembles, molecular dynamics (MD) simulations, and normal mode analysis (NMA) provide appropriate frameworks to explore functionally relevant protein dynamics and cavity changes relationships. Within this context, we have recently developed analysis of null areas (ANA), an efficient method to calculate cavity volumes. ANA is based on a combination of algorithms that guarantees its robustness against numerical differentiations. This is a unique feature with respect to other methods. Herein, we present an updated and improved version that expands it use to quantify changes in cavity features, like volume and flexibility, due to protein structural distortions performed on predefined biologically relevant directions, for example, directions of largest contribution to protein fluctuations (principal component analysis [PCA modes]) obtained by MD simulations or ensembles of NMR structures, collective NMA modes or any other direction of motion associated with specific conformational changes. A web page has been developed where its facilities are explained in detail. First, we show that ANA can be useful to explore gradual changes of cavity volume and flexibility associated with protein ligand binding. Secondly, we perform a comparison study of the extent of variability between protein backbone structural distortions, and changes in cavity volumes and flexibilities evaluated for an ensemble of NMR active and inactive conformers of the epidermal growth factor receptor structures. Finally, we compare changes in size and flexibility between sets of NMR structures for different homologous chains of dynein.
Collapse
Affiliation(s)
- German P Barletta
- Unidad de Fisicoquímica, Universidad Nacional de Quilmes/CONICET, Bernal, Argentina
| | | | - Tadeo E Saldaño
- Unidad de Fisicoquímica, Universidad Nacional de Quilmes/CONICET, Bernal, Argentina
| | | |
Collapse
|
19
|
Schwarz D, Georges G, Kelm S, Shi J, Vangone A, Deane CM. Co-evolutionary distance predictions contain flexibility information. Bioinformatics 2021; 38:65-72. [PMID: 34383892 DOI: 10.1093/bioinformatics/btab562] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 06/19/2021] [Accepted: 08/10/2021] [Indexed: 02/03/2023] Open
Abstract
MOTIVATION Co-evolution analysis can be used to accurately predict residue-residue contacts from multiple sequence alignments. The introduction of machine-learning techniques has enabled substantial improvements in precision and a shift from predicting binary contacts to predict distances between pairs of residues. These developments have significantly improved the accuracy of de novo prediction of static protein structures. With AlphaFold2 lifting the accuracy of some predicted protein models close to experimental levels, structure prediction research will move on to other challenges. One of those areas is the prediction of more than one conformation of a protein. Here, we examine the potential of residue-residue distance predictions to be informative of protein flexibility rather than simply static structure. RESULTS We used DMPfold to predict distance distributions for every residue pair in a set of proteins that showed both rigid and flexible behaviour. Residue pairs that were in contact in at least one reference structure were classified as rigid, flexible or neither. The predicted distance distribution of each residue pair was analysed for local maxima of probability indicating the most likely distance or distances between a pair of residues. We found that rigid residue pairs tended to have only a single local maximum in their predicted distance distributions while flexible residue pairs more often had multiple local maxima. These results suggest that the shape of predicted distance distributions contains information on the rigidity or flexibility of a protein and its constituent residues. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dominik Schwarz
- Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
| | - Guy Georges
- Department of Computational Engineering and Data Science, Large Molecule Research, Penzberg 82377, Germany
| | - Sebastian Kelm
- Computer-Aided Drug Design, UCB Pharma, Slough SL1 3WE, UK
| | - Jiye Shi
- Computer-Aided Drug Design, UCB Pharma, Slough SL1 3WE, UK
| | - Anna Vangone
- Department of Computational Engineering and Data Science, Large Molecule Research, Penzberg 82377, Germany
| | | |
Collapse
|
20
|
Mokhtari DA, Appel MJ, Fordyce PM, Herschlag D. High throughput and quantitative enzymology in the genomic era. Curr Opin Struct Biol 2021; 71:259-273. [PMID: 34592682 PMCID: PMC8648990 DOI: 10.1016/j.sbi.2021.07.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 07/23/2021] [Indexed: 12/28/2022]
Abstract
Accurate predictions from models based on physical principles are the ultimate metric of our biophysical understanding. Although there has been stunning progress toward structure prediction, quantitative prediction of enzyme function has remained challenging. Realizing this goal will require large numbers of quantitative measurements of rate and binding constants and the use of these ground-truth data sets to guide the development and testing of these quantitative models. Ground truth data more closely linked to the underlying physical forces are also desired. Here, we describe technological advances that enable both types of ground truth measurements. These advances allow classic models to be tested, provide novel mechanistic insights, and place us on the path toward a predictive understanding of enzyme structure and function.
Collapse
Affiliation(s)
- D A Mokhtari
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - M J Appel
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - P M Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA; ChEM-H Institute, Stanford University, Stanford, CA, 94305, USA; Department of Genetics, Stanford University, Stanford, CA, 94305, USA; Chan Zuckerberg Biohub San Francisco, CA, 94110, USA.
| | - D Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA; Department of Chemical Engineering, Stanford University, Stanford, CA, 94305, USA; ChEM-H Institute, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
21
|
Timonina D, Sharapova Y, Švedas V, Suplatov D. Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies. Comput Struct Biotechnol J 2021; 19:1302-1311. [PMID: 33738079 PMCID: PMC7933735 DOI: 10.1016/j.csbj.2021.02.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open
Abstract
Local 3D-structural differences in homologous proteins contribute to functional diversity observed in a superfamily, but so far received little attention as bioinformatic analysis was usually carried out at the level of amino acid sequences. We have developed Zebra3D - the first-of-its-kind bioinformatic software for systematic analysis of 3D-alignments of protein families using machine learning. The new tool identifies subfamily-specific regions (SSRs) - patterns of local 3D-structure (i.e. single residues, loops, or secondary structure fragments) that are spatially equivalent within families/subfamilies, but are different among them, and thus can be associated with functional diversity and function-related conformational plasticity. Bioinformatic analysis of protein superfamilies by Zebra3D can be used to study 3D-determinants of catalytic activity and specific accommodation of ligands, help to prepare focused libraries for directed evolution or assist development of chimeric enzymes with novel properties by exchange of equivalent regions between homologs, and to characterize plasticity in binding sites. A companion Mustguseal web-server is available to automatically construct a 3D-alignment of functionally diverse proteins, thus reducing the minimal input required to operate Zebra3D to a single PDB code. The Zebra3D + Mustguseal combined approach provides the opportunity to systematically explore the value of SSRs in superfamilies and to use this information for protein design and drug discovery. The software is available open-access at https://biokinet.belozersky.msu.ru/Zebra3D.
Collapse
Affiliation(s)
- Daria Timonina
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
| | - Yana Sharapova
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Vytas Švedas
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Dmitry Suplatov
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
- Corresponding author.
| |
Collapse
|
22
|
Sedova M, Jaroszewski L, Iyer M, Li Z, Godzik A. ModFlex: Towards Function Focused Protein Modeling. J Mol Biol 2021; 433:166828. [PMID: 33972023 DOI: 10.1016/j.jmb.2021.166828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 01/07/2021] [Accepted: 01/09/2021] [Indexed: 11/19/2022]
Abstract
There is a wide, and continuously widening, gap between the number of proteins known only by their amino acid sequence versus those structurally characterized by direct experiment. To close this gap, we mostly rely on homology-based inference and modeling to reason about the structures of the uncharacterized proteins by using structures of homologous proteins as templates. With the rapidly growing size of the Protein Data Bank, there are often multiple choices of templates, including multiple sets of coordinates from the same protein. The substantial conformational differences observed between different experimental structures of the same protein often reflect function related structural flexibility. Thus, depending on the questions being asked, using distant homologs, or coordinate sets with lower resolution but solved in the appropriate functional form, as templates may be more informative. The ModFlex server (https://modflex.org/) addresses this seldom mentioned gap in the standard homology modeling approach by providing the user with an interface with multiple options and tools to select the most relevant template and explore the range of structural diversity in the available templates. ModFlex is closely integrated with a range of other programs and servers developed in our group for the analysis and visualization of protein structural flexibility and divergence.
Collapse
Affiliation(s)
- Mayya Sedova
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Lukasz Jaroszewski
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States
| | - Zhanwen Li
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Adam Godzik
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States.
| |
Collapse
|
23
|
Maggio J, Cabrera M, Armando R, Chinestrad P, Pifano M, Menna PL, Gomez DE, Gómez DLM. Rational design of PIN1 inhibitors for cancer treatment based on conformational diversity analysis and docking based virtual screening. J Biomol Struct Dyn 2021; 40:5858-5867. [PMID: 33463409 DOI: 10.1080/07391102.2021.1874531] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The parvulin PIN1 (peptidyl-prolyl cis-trans isomerase NIMA-interacting 1), is the only enzyme capable of isomerizing prolines of phospho-Serine/Threonine-Proline motifs. PIN1 binds to a subset of proteins and plays an essential role in regulating protein function post-phosphorylation control. Furthermore, the activity of PIN1 regulates the outcome of the signalling of proline-directed kinases (e.g. MAPK, CDK, or GSK3) and thus regulates cell proliferation and cell survival. For these reasons, PIN1 inhibitors are interesting since they may have therapeutic implications for cancer. Several authors have already reported that the non-structural point mutation Trp34Ala prevents PIN1 from interacting with its downstream effector proteins. In this work, we characterized PIN1 structurally, intending to explore new inhibition targets for the rational design of pharmacological activity compounds. Through a conformational diversity analysis of PIN1, we identified and characterized a highly specific druggable pocket around the residue Trp34. This pocket was used in a high-throughput docking screening of 450,000 drug-like compounds, and the top 10 were selected for re-docking studies on the previously used conformers. Finally, we evaluated the binding of each compound by thermal shift assay and found four molecules with a high affinity for PIN1 and potential inhibitory activity. Through this strategy, we achieved novel drug candidates with the ability to interfere with the phosphorylation-dependent actions of PIN1 and with potential applications in the treatment of cancer.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Julián Maggio
- Departamento de Ciencia y Tecnología, Laboratorio de Oncología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Maia Cabrera
- Departamento de Ciencia y Tecnología, Laboratorio de Farmacología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Romina Armando
- Departamento de Ciencia y Tecnología, Laboratorio de Oncología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Patricio Chinestrad
- Departamento de Ciencia y Tecnología, Laboratorio de Farmacología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Marina Pifano
- Departamento de Ciencia y Tecnología, Laboratorio de Oncología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Pablo Lorenzano Menna
- Departamento de Ciencia y Tecnología, Laboratorio de Farmacología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Daniel E Gomez
- Departamento de Ciencia y Tecnología, Laboratorio de Oncología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Diego L Mengual Gómez
- Departamento de Ciencia y Tecnología, Laboratorio de Oncología Molecular, Universidad Nacional de Quilmes, Bernal, Argentina
| |
Collapse
|
24
|
Palopoli N, Marchetti J, Monzon AM, Zea DJ, Tosatto SCE, Fornasari MS, Parisi G. Intrinsically Disordered Protein Ensembles Shape Evolutionary Rates Revealing Conformational Patterns. J Mol Biol 2020; 433:166751. [PMID: 33310020 DOI: 10.1016/j.jmb.2020.166751] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2020] [Revised: 12/01/2020] [Accepted: 12/05/2020] [Indexed: 10/22/2022]
Abstract
Intrinsically disordered proteins (IDPs) lack stable tertiary structure under physiological conditions. The unique composition and complex dynamical behaviour of IDPs make them a challenge for structural biology and molecular evolution studies. Using NMR ensembles, we found that IDPs evolve under a strong site-specific evolutionary rate heterogeneity, mainly originated by different constraints derived from their inter-residue contacts. Evolutionary rate profiles correlate with the experimentally observed conformational diversity of the protein, allowing the description of different conformational patterns possibly related to their structure-function relationships. The correlation between evolutionary rates and contact information improves when structural information is taken not from any individual conformer or the whole ensemble, but from combining a limited number of conformers. Our results suggest that residue contacts in disordered regions constrain evolutionary rates to conserve the dynamic behaviour of the ensemble and that evolutionary rates can be used as a proxy for the conformational diversity of IDPs.
Collapse
Affiliation(s)
- Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Julia Marchetti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | | | - Diego J Zea
- Sorbonne Université, CNRS, IBPS, Laboratoire de Biologie Computationnelle et Quantitative (LCQB), Paris, France
| | | | - Maria S Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Buenos Aires, Argentina.
| |
Collapse
|
25
|
Exploring Protein Intrinsic Disorder with MobiDB. METHODS IN MOLECULAR BIOLOGY (CLIFTON, N.J.) 2020; 2141:127-143. [PMID: 32696355 DOI: 10.1007/978-1-0716-0524-0_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Indexed: 10/23/2022]
Abstract
Nowadays, it is well established that many proteins or regions under physiological conditions lack a fixed three-dimensional structure and are intrinsically disordered. MobiDB is the main repository of protein disorder and mobility annotations, combining different data sources to provide an exhaustive overview of intrinsic disorder. MobiDB includes curated annotations from other databases, indirect disorder evidence from structural data, and disorder predictions from protein sequences. It provides an easy-to-use web server to visualize and explore disorder information. This chapter describes the data available in MobiDB, emphasizing how to use and access the intrinsic disorder data. MobiDB is available at URL http://mobidb.bio.unipd.it .
Collapse
|
26
|
Saldaño TE, Freixas VM, Tosatto SCE, Parisi G, Fernandez-Alberti S. Exploring Conformational Space with Thermal Fluctuations Obtained by Normal-Mode Analysis. J Chem Inf Model 2020; 60:3068-3080. [PMID: 32216314 DOI: 10.1021/acs.jcim.9b01136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Proteins in their native states can be represented as ensembles of conformers in dynamical equilibrium. Thermal fluctuations are responsible for transitions between these conformers. Normal-modes analysis (NMA) using elastic network models (ENMs) provides an efficient procedure to explore global dynamics of proteins commonly associated with conformational transitions. In the present work, we present an iterative approach to explore protein conformational spaces by introducing structural distortions according to their equilibrium dynamics at room temperature. The approach can be used either to perform unbiased explorations of conformational space or to explore guided pathways connecting two different conformations, e.g., apo and holo forms. In order to test its performance, four proteins with different magnitudes of structural distortions upon ligand binding have been tested. In all cases, the conformational selection model has been confirmed and the conformational space between apo and holo forms has been encompassed. Different strategies have been tested that impact on the efficiency either to achieve a desired conformational change or to achieve a balanced exploration of the protein conformational multiplicity.
Collapse
Affiliation(s)
- Tadeo E Saldaño
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | - Victor M Freixas
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 5131 Padova, Italy
| | - Gustavo Parisi
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | | |
Collapse
|
27
|
González N, Cardama GA, Chinestrad P, Robles-Valero J, Rodríguez-Fdez S, Lorenzo-Martín LF, Bustelo XR, Lorenzano Menna P, Gomez DE. Computational and in vitro Pharmacodynamics Characterization of 1A-116 Rac1 Inhibitor: Relevance of Trp56 in Its Biological Activity. Front Cell Dev Biol 2020; 8:240. [PMID: 32351958 PMCID: PMC7174510 DOI: 10.3389/fcell.2020.00240] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/20/2019] [Accepted: 03/23/2020] [Indexed: 12/20/2022] Open
Abstract
In the last years, the development of new drugs in oncology has evolved notably. In particular, drug development has shifted from empirical screening of active cytotoxic compounds to molecularly targeted drugs blocking specific biologic pathways that drive cancer progression and metastasis. Using a rational design approach, our group has developed 1A-116 as a promising Rac1 inhibitor, with antitumoral and antimetastatic effects in several types of cancer. Rac1 is over activated in a wide range of tumor types and and it is one of the most studied proteins of the Rho GTPase family. Its role in actin cytoskeleton reorganization has effects on endocytosis, vesicular trafficking, cell cycle progression and cellular migration. In this context, the regulatory activity of Rac1 affects several key processes in the course of the cancer including invasion and metastasis. The purpose of this preclinical study was to focus on the mode of action of 1A-116, conducting an interdisciplinary approach with in silico bioinformatics tools and in vitro assays. Here, we demonstrate that the tryptophan 56 residue is necessary for the inhibitory effects of 1A-116 since this compound interferes with protein-protein interactions (PPI) of Rac1GTPase involving several GEF activators. 1A-116 is also able to inhibit the oncogenic Rac1P29S mutant protein, one of the oncogenic drivers found in sun-exposed melanoma. It also inhibits numerous Rac1-regulated cellular processes such as membrane ruffling and lamellipodia formation. These results deepen our knowledge of 1A-116 inhibition of Rac1 and its biological impact on cancer progression. They also represent a good example of how in silico analyses represent a valuable approach for drug development.
Collapse
Affiliation(s)
- Nazareno González
- Laboratory of Molecular Oncology, National University of Quilmes, Bernal, Argentina
| | - Georgina A Cardama
- Laboratory of Molecular Oncology, National University of Quilmes, Bernal, Argentina.,National Scientific and Technical Research Council (CONICET), Buenos Aires, Argentina
| | - Patricio Chinestrad
- Molecular Pharmacology Laboratory, National University of Quilmes, Bernal, Argentina
| | - Javier Robles-Valero
- Centro de Investigación del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Instituto de Biología Molecular y Celular del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Centro de Investigación Biomédica en Red de Cáncer (CIBERONC), CSIC-University of Salamanca, Salamanca, Spain
| | - Sonia Rodríguez-Fdez
- Centro de Investigación del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Instituto de Biología Molecular y Celular del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Centro de Investigación Biomédica en Red de Cáncer (CIBERONC), CSIC-University of Salamanca, Salamanca, Spain
| | - L Francisco Lorenzo-Martín
- Centro de Investigación del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Instituto de Biología Molecular y Celular del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Centro de Investigación Biomédica en Red de Cáncer (CIBERONC), CSIC-University of Salamanca, Salamanca, Spain
| | - Xosé R Bustelo
- Centro de Investigación del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Instituto de Biología Molecular y Celular del Cáncer, CSIC-University of Salamanca, Salamanca, Spain.,Centro de Investigación Biomédica en Red de Cáncer (CIBERONC), CSIC-University of Salamanca, Salamanca, Spain
| | - Pablo Lorenzano Menna
- National Scientific and Technical Research Council (CONICET), Buenos Aires, Argentina.,Molecular Pharmacology Laboratory, National University of Quilmes, Bernal, Argentina
| | - Daniel E Gomez
- Laboratory of Molecular Oncology, National University of Quilmes, Bernal, Argentina.,National Scientific and Technical Research Council (CONICET), Buenos Aires, Argentina
| |
Collapse
|
28
|
Velez Rueda AJ, Palopoli N, Zacarías M, Sommese LM, Parisi G. ProtMiscuity: a database of promiscuous proteins. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2020; 2019:5606773. [PMID: 31650170 PMCID: PMC6813136 DOI: 10.1093/database/baz103] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/22/2019] [Revised: 07/02/2019] [Accepted: 08/01/2019] [Indexed: 11/30/2022]
Abstract
Promiscuous behaviour in proteins and enzymes remains a challenging feature to understand the structure–function relationship. Here we present ProtMiscuity, a manually curated online database of proteins showing catalytic promiscuity. ProtMiscuity contains information about canonical and promiscuous activities comprising 88 different reactions in 57 proteins from 40 different organisms. It can be searched or browsed by protein names, organisms and descriptions of canonical and promiscuous reactions. Entries provide information on reaction substrates, products and kinetic parameters, mapping of active sites to sequence and structure and links to external resources with biological and functional annotations. ProtMiscuity could assist in studying the underlying mechanisms of promiscuous reactions by offering a unique and curated collection of experimentally derived data that is otherwise hard to find, retrieve and validate from literature.
Collapse
Affiliation(s)
- Ana Julia Velez Rueda
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - CONICET, Roque Sáenz Peña 352, Bernal B1876BXD Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - CONICET, Roque Sáenz Peña 352, Bernal B1876BXD Buenos Aires, Argentina
| | - Matías Zacarías
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - CONICET, Roque Sáenz Peña 352, Bernal B1876BXD Buenos Aires, Argentina
| | - Leandro Matías Sommese
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - CONICET, Roque Sáenz Peña 352, Bernal B1876BXD Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes - CONICET, Roque Sáenz Peña 352, Bernal B1876BXD Buenos Aires, Argentina
| |
Collapse
|
29
|
Iyer M, Li Z, Jaroszewski L, Sedova M, Godzik A. Difference contact maps: From what to why in the analysis of the conformational flexibility of proteins. PLoS One 2020; 15:e0226702. [PMID: 32163442 PMCID: PMC7067477 DOI: 10.1371/journal.pone.0226702] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 12/03/2019] [Indexed: 02/05/2023] Open
Abstract
Protein structures, usually visualized in various highly idealized forms focusing on the three-dimensional arrangements of secondary structure elements, can also be described as lists of interacting residues or atoms and visualized as two-dimensional distance or contact maps. We show that contact maps provide an ideal tool to describe and analyze differences between structures of proteins in different conformations. Expanding functionality of the PDBFlex server and database developed previously in our group, we describe how analysis of difference contact maps (DCMs) can be used to identify critical interactions stabilizing alternative protein conformations, recognize residues and positions controlling protein functions and build hypotheses as to molecular mechanisms of disease mutations.
Collapse
Affiliation(s)
- Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States of America
| | - Zhanwen Li
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Lukasz Jaroszewski
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Mayya Sedova
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
| | - Adam Godzik
- Biosciences Division, University of California Riverside School of Medicine, Riverside, CA, United States of America
- * E-mail:
| |
Collapse
|
30
|
Echeverría E, Velez Rueda AJ, Cabrera M, Juritz E, Burghi V, Fabián L, Davio C, Lorenzano Menna P, Fernández NC. Identification of inhibitors of the RGS homology domain of GRK2 by docking-based virtual screening. Life Sci 2019; 239:116872. [DOI: 10.1016/j.lfs.2019.116872] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 09/10/2019] [Accepted: 09/11/2019] [Indexed: 01/25/2023]
|
31
|
Vetrivel I, de Brevern AG, Cadet F, Srinivasan N, Offmann B. Structural variations within proteins can be as large as variations observed across their homologues. Biochimie 2019; 167:162-170. [PMID: 31560932 DOI: 10.1016/j.biochi.2019.09.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2019] [Accepted: 09/18/2019] [Indexed: 10/26/2022]
Abstract
Understanding the structural plasticity of proteins is key to understanding the intricacies of their functions and mechanistic basis. In the current study, we analyzed the available multiple crystal structures of the same protein for the structural differences. For this purpose we used an abstraction of protein structures referred as Protein Blocks (PBs) that was previously established. We also characterized the nature of the structural variations for a few proteins using molecular dynamics simulations. In both the cases, the structural variations were summarized in the form of substitution matrices of PBs. We show that certain conformational states are preferably replaced by other specific conformational states. Interestingly, these structural variations are highly similar to those previously observed across structures of homologous proteins (r2 = 0.923) or across the ensemble of conformations from NMR data (r2 = 0.919). Thus our study quantitatively shows that overall trends of structural changes in a given protein are nearly identical to the trends of structural differences that occur in the topologically equivalent positions in homologous proteins. Specific case studies are used to illustrate the nature of these structural variations.
Collapse
Affiliation(s)
- Iyanar Vetrivel
- Université de Nantes, UFIP UMR 6286 CNRS, UFR Sciences et Techniques, 2 Chemin de La Houssinière, Nantes, France
| | - Alexandre G de Brevern
- INSERM UMR_S 1134, DSIMB Team, Laboratory of Excellence, GR-Ex, Univ Paris Diderot, Univ Sorbonne Paris Cité, INTS, 6 Rue Alexandre Cabanel, Paris, France
| | - Frédéric Cadet
- University of Paris, UMR_S1134, BIGR, Inserm, F-75015, Paris, France; DSIMB, UMR_S1134, BIGR, Inserm, Laboratory of Excellence GR-Ex, Faculty of Sciences and Technology, University of La Reunion, F-97715, Saint-Denis, France; PEACCEL, Protein Engineering Accelerator, 6 Square Albin Cachot, Box 42, 75013, Paris, France
| | | | - Bernard Offmann
- Université de Nantes, UFIP UMR 6286 CNRS, UFR Sciences et Techniques, 2 Chemin de La Houssinière, Nantes, France.
| |
Collapse
|
32
|
Piovesan D, Tabaro F, Paladin L, Necci M, Micetic I, Camilloni C, Davey N, Dosztányi Z, Mészáros B, Monzon AM, Parisi G, Schad E, Sormanni P, Tompa P, Vendruscolo M, Vranken WF, Tosatto SCE. MobiDB 3.0: more annotations for intrinsic disorder, conformational diversity and interactions in proteins. Nucleic Acids Res 2019; 46:D471-D476. [PMID: 29136219 PMCID: PMC5753340 DOI: 10.1093/nar/gkx1071] [Citation(s) in RCA: 156] [Impact Index Per Article: 26.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/22/2017] [Accepted: 10/19/2017] [Indexed: 01/30/2023] Open
Abstract
The MobiDB (URL: mobidb.bio.unipd.it) database of protein disorder and mobility annotations has been significantly updated and upgraded since its last major renewal in 2014. Several curated datasets for intrinsic disorder and folding upon binding have been integrated from specialized databases. The indirect evidence has also been expanded to better capture information available in the PDB, such as high temperature residues in X-ray structures and overall conformational diversity. Novel nuclear magnetic resonance chemical shift data provides an additional experimental information layer on conformational dynamics. Predictions have been expanded to provide new types of annotation on backbone rigidity, secondary structure preference and disordered binding regions. MobiDB 3.0 contains information for the complete UniProt protein set and synchronization has been improved by covering all UniParc sequences. An advanced search function allows the creation of a wide array of custom-made datasets for download and further analysis. A large amount of information and cross-links to more specialized databases are intended to make MobiDB the central resource for the scientific community working on protein intrinsic disorder and mobility.
Collapse
Affiliation(s)
- Damiano Piovesan
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy
| | - Francesco Tabaro
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy.,Institute of Biosciences and Medical Technology, Arvo Ylpön katu 34, 33520 Tampere, Finland
| | - Lisanna Paladin
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy
| | - Marco Necci
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy.,Department of Agricultural Sciences, University of Udine, via Palladio 8, 33100 Udine, Italy.,Fondazione Edmund Mach, Via E. Mach 1, 38010 S. Michele all'Adige, Italy
| | - Ivan Micetic
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy
| | - Carlo Camilloni
- Department of Biosciences, University of Milan, 20133 Milano, Italy
| | - Norman Davey
- Conway Institute of Biomolecular & Biomedical Research, University College Dublin, Belfield, Dublin 4, Ireland.,UCD School of Medicine & Medical Science, University College Dublin, Belfield, Dublin 4, Ireland
| | - Zsuzsanna Dosztányi
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, 1/c Pázmány Péter sétány, H-1117, Budapest, Hungary
| | - Bálint Mészáros
- MTA-ELTE Lendület Bioinformatics Research Group, Department of Biochemistry, Eötvös Loránd University, 1/c Pázmány Péter sétány, H-1117, Budapest, Hungary.,Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Alexander M Monzon
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, CONICET, Roque Saenz Pena 182, Bernal B1876BXD, Argentina
| | - Gustavo Parisi
- Structural Bioinformatics Group, Department of Science and Technology, National University of Quilmes, CONICET, Roque Saenz Pena 182, Bernal B1876BXD, Argentina
| | - Eva Schad
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary
| | - Pietro Sormanni
- Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK
| | - Peter Tompa
- Institute of Enzymology, Research Centre for Natural Sciences, Hungarian Academy of Sciences, PO Box 7, H-1518 Budapest, Hungary.,Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,VIB-VUB Center for Structural Biology, Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium
| | | | - Wim F Vranken
- Structural Biology Brussels, Vrije Universiteit Brussel (VUB), Brussels 1050, Belgium.,VIB-VUB Center for Structural Biology, Flanders Institute for Biotechnology (VIB), Brussels 1050, Belgium.,Interuniversity Institute of Bioinformatics in Brussels, ULB/VUB, 1050 Brussels, Belgium
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padua, via U. Bassi 58/b, 35131 Padua, Italy.,CNR Institute of Neuroscience, via U. Bassi 58/b, 35131 Padua, Italy
| |
Collapse
|
33
|
Developments in integrative modeling with dynamical interfaces. Curr Opin Struct Biol 2019; 56:11-17. [DOI: 10.1016/j.sbi.2018.10.007] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2018] [Revised: 10/26/2018] [Accepted: 10/27/2018] [Indexed: 11/19/2022]
|
34
|
Ensembles from Ordered and Disordered Proteins Reveal Similar Structural Constraints during Evolution. J Mol Biol 2019; 431:1298-1307. [DOI: 10.1016/j.jmb.2019.01.031] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 01/23/2019] [Accepted: 01/24/2019] [Indexed: 01/08/2023]
|
35
|
Abstract
The native state of proteins is composed of conformers in dynamical equilibrium. In this chapter, different issues related to conformational diversity are explored using a curated and experimentally based database called CoDNaS (Conformational Diversity in the Native State). This database is a collection of redundant structures for the same sequence. CoDNaS estimates the degree of conformational diversity using different global and local structural similarity measures. It allows the user to explore how structural differences among conformers change as a function of several structural features providing further biological information. This chapter explores the measurement of conformational diversity and its relationship with sequence divergence. Also, it discusses how proteins with high conformational diversity could affect homology modeling techniques.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina.
| |
Collapse
|
36
|
Csizmadia G, Farkas B, Spagina Z, Tordai H, Hegedűs T. Quantitative comparison of ABC membrane protein type I exporter structures in a standardized way. Comput Struct Biotechnol J 2018; 16:396-403. [PMID: 30425800 PMCID: PMC6222291 DOI: 10.1016/j.csbj.2018.10.008] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2018] [Revised: 10/12/2018] [Accepted: 10/13/2018] [Indexed: 12/24/2022] Open
Abstract
An increasing number of ABC membrane protein structures are determined by cryo-electron microscopy and X-ray crystallography, consequently identifying differences between their conformations has become an arising issue. Therefore, we propose to define standardized measures for ABC Type I exporter structure characterization. We set conformational vectors, conftors, which describe the relative orientation of domains and can highlight structural differences. In addition, continuum electrostatics calculations were performed to characterize the energetics of membrane insertion illuminating functionally crucial regions. In summary, the proposed metrics contribute to deeper understanding of ABC membrane proteins' structural features, structure validation, and analysis of movements observed in a molecular dynamics trajectory. Moreover, the concept of standardized metrics can be applied not only to ABC membrane protein structures (http://conftors.hegelab.org).
Collapse
Key Words
- ABC proteins
- ABC, ATP binding cassette
- CFTR, cystic fibrosis transmembrane conductance regulator
- CG, coarse grained
- CH, coupling helix
- COG, center of geometry
- ICD, intracellular domain
- Membrane proteins
- NBD, nucleotide binding domain
- Quantitative structural properties
- Structure comparison
- Structure validation
- TH, transmembrane helix
- TM, transmembrane
- TMD, transmembrane domain
Collapse
Affiliation(s)
- Georgina Csizmadia
- MTA-SE Molecular Biophysics Research Group, Hungarian Academy of Sciences, Budapest, Hungary.,Department of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Bianka Farkas
- Department of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary.,Faculty of Information Technology, Pázmány Péter Catholic University, Budapest, Hungary
| | - Zoltán Spagina
- Department of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary.,Faculty of Information Technology, Pázmány Péter Catholic University, Budapest, Hungary
| | - Hedvig Tordai
- Department of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| | - Tamás Hegedűs
- MTA-SE Molecular Biophysics Research Group, Hungarian Academy of Sciences, Budapest, Hungary.,Department of Biophysics and Radiation Biology, Semmelweis University, Budapest, Hungary
| |
Collapse
|
37
|
Scheps KG, Hasenahuer MA, Parisi G, Targovnik HM, García E, Veber ES, Crisp R, Elena G, Varela V, Fornasari MS. Two novel unstable hemoglobin variants due to in-frame deletions of key amino acids in the β-globin chain. Eur J Haematol 2018; 100:529-535. [DOI: 10.1111/ejh.13029] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/02/2018] [Indexed: 11/29/2022]
Affiliation(s)
- Karen G. Scheps
- Facultad de Farmacia y Bioquímica; Departamento de Microbiología, Inmunología, Biotecnología y Genética/Cátedra de Genética; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
- Instituto de Inmunología; Genética y Metabolismo (INIGEM); CONICET-Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
| | - Marcia Anahí Hasenahuer
- Departamento de Ciencia y Tecnología; Universidad Nacional de Quilmes; Bernal, Buenos Aires Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología; Universidad Nacional de Quilmes; Bernal, Buenos Aires Argentina
| | - Héctor M. Targovnik
- Facultad de Farmacia y Bioquímica; Departamento de Microbiología, Inmunología, Biotecnología y Genética/Cátedra de Genética; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
- Instituto de Inmunología; Genética y Metabolismo (INIGEM); CONICET-Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
| | - Eliana García
- Sección Hematología y Oncología; Servicio de Pediatría; Hospital Nacional A. Posadas; El Palomar Buenos Aires Argentina
| | - Ernesto Samuel Veber
- Servicio de Hemato-Oncología Pediátrica; Hospital General de Niños “Dr. Pedro de Elizalde”; Ciudad Autónoma de Buenos Aires Argentina
| | - Renée Crisp
- Sección Hematología y Oncología; Servicio de Pediatría; Hospital Nacional A. Posadas; El Palomar Buenos Aires Argentina
| | - Graciela Elena
- Servicio de Hemato-Oncología Pediátrica; Hospital General de Niños “Dr. Pedro de Elizalde”; Ciudad Autónoma de Buenos Aires Argentina
| | - Viviana Varela
- Facultad de Farmacia y Bioquímica; Departamento de Microbiología, Inmunología, Biotecnología y Genética/Cátedra de Genética; Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
- Instituto de Inmunología; Genética y Metabolismo (INIGEM); CONICET-Universidad de Buenos Aires; Ciudad Autónoma de Buenos Aires Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología; Universidad Nacional de Quilmes; Bernal, Buenos Aires Argentina
| |
Collapse
|
38
|
Rueda AJV, Monzon AM, Ardanaz SM, Iglesias LE, Parisi G. Large scale analysis of protein conformational transitions from aqueous to non-aqueous media. BMC Bioinformatics 2018; 19:27. [PMID: 29382320 PMCID: PMC5791380 DOI: 10.1186/s12859-018-2044-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 01/24/2018] [Indexed: 12/13/2022] Open
Abstract
Background Biocatalysis in organic solvents is nowadays a common practice with a large potential in Biotechnology. Several studies report that proteins which are co-crystallized or soaked in organic solvents preserve their fold integrity showing almost identical arrangements when compared to their aqueous forms. However, it is well established that the catalytic activity of proteins in organic solvents is much lower than in water. In order to explain this diminished activity and to further characterize the behaviour of proteins in non-aqueous environments, we performed a large-scale analysis (1737 proteins) of the conformational diversity of proteins crystallized in aqueous and co-crystallized or soaked in non-aqueous media. Results Using proteins’ experimentally determined conformational diversity taken from CoDNaS database, we found that proteins in non-aqueous media display much lower conformational diversity when compared to the corresponding conformers obtained in water. When conformational diversity is compared between conformers obtained in different non-aqueous media, their structural differences are larger and mostly independent of the presence of cognate ligands. We also found that conformers corresponding to non-aqueous media have larger but less flexible cavities, lower number of disordered regions and lower active-site residue mobility. Conclusions Our results show that non-aqueous media conformers have specific structural features and that they do not adopt extreme conformations found in aqueous media. This makes them clearly different from their corresponding aqueous conformers. Electronic supplementary material The online version of this article (10.1186/s12859-018-2044-2) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ana Julia Velez Rueda
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Sebastián M Ardanaz
- Laboratorio de Biocatálisis y Biotransformaciones, Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Luis E Iglesias
- Laboratorio de Biocatálisis y Biotransformaciones, Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, CONICET, Universidad Nacional de Quilmes, Roque Sáenz Peña 352, B1876BXD, Bernal, Provincia de Buenos Aires, Argentina.
| |
Collapse
|
39
|
Automated NMR resonance assignments and structure determination using a minimal set of 4D spectra. Nat Commun 2018; 9:384. [PMID: 29374165 PMCID: PMC5786013 DOI: 10.1038/s41467-017-02592-z] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2017] [Accepted: 12/12/2017] [Indexed: 12/22/2022] Open
Abstract
Automated methods for NMR structure determination of proteins are continuously becoming more robust. However, current methods addressing larger, more complex targets rely on analyzing 6-10 complementary spectra, suggesting the need for alternative approaches. Here, we describe 4D-CHAINS/autoNOE-Rosetta, a complete pipeline for NOE-driven structure determination of medium- to larger-sized proteins. The 4D-CHAINS algorithm analyzes two 4D spectra recorded using a single, fully protonated protein sample in an iterative ansatz where common NOEs between different spin systems supplement conventional through-bond connectivities to establish assignments of sidechain and backbone resonances at high levels of completeness and with a minimum error rate. The 4D-CHAINS assignments are then used to guide automated assignment of long-range NOEs and structure refinement in autoNOE-Rosetta. Our results on four targets ranging in size from 15.5 to 27.3 kDa illustrate that the structures of proteins can be determined accurately and in an unsupervised manner in a matter of days.
Collapse
|
40
|
Monzon AM, Zea DJ, Marino-Buslje C, Parisi G. Homology modeling in a dynamical world. Protein Sci 2017; 26:2195-2206. [PMID: 28815769 DOI: 10.1002/pro.3274] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2017] [Revised: 08/09/2017] [Accepted: 08/09/2017] [Indexed: 12/31/2022]
Abstract
A key concept in template-based modeling (TBM) is the high correlation between sequence and structural divergence, with the practical consequence that homologous proteins that are similar at the sequence level will also be similar at the structural level. However, conformational diversity of the native state will reduce the correlation between structural and sequence divergence, because structural variation can appear without sequence diversity. In this work, we explore the impact that conformational diversity has on the relationship between structural and sequence divergence. We find that the extent of conformational diversity can be as high as the maximum structural divergence among families. Also, as expected, conformational diversity impairs the well-established correlation between sequence and structural divergence, which is nosier than previously suggested. However, we found that this noise can be resolved using a priori information coming from the structure-function relationship. We show that protein families with low conformational diversity show a well-correlated relationship between sequence and structural divergence, which is severely reduced in proteins with larger conformational diversity. This lack of correlation could impair TBM results in highly dynamical proteins. Finally, we also find that the presence of order/disorder can provide useful beforehand information for better TBM performance.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Cristina Marino-Buslje
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, C1405BWE Ciudad Autónoma de Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, B1876BXD, Bernal, Argentina
| |
Collapse
|
41
|
Putz I, Brock O. Elastic network model of learned maintained contacts to predict protein motion. PLoS One 2017; 12:e0183889. [PMID: 28854238 PMCID: PMC5576689 DOI: 10.1371/journal.pone.0183889] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2016] [Accepted: 08/14/2017] [Indexed: 12/21/2022] Open
Abstract
We present a novel elastic network model, lmcENM, to determine protein motion even for localized functional motions that involve substantial changes in the protein's contact topology. Existing elastic network models assume that the contact topology remains unchanged throughout the motion and are thus most appropriate to simulate highly collective function-related movements. lmcENM uses machine learning to differentiate breaking from maintained contacts. We show that lmcENM accurately captures functional transitions unexplained by the classical ENM and three reference ENM variants, while preserving the simplicity of classical ENM. We demonstrate the effectiveness of our approach on a large set of proteins covering different motion types. Our results suggest that accurately predicting a "deformation-invariant" contact topology offers a promising route to increase the general applicability of ENMs. We also find that to correctly predict this contact topology a combination of several features seems to be relevant which may vary slightly depending on the protein. Additionally, we present case studies of two biologically interesting systems, Ferric Citrate membrane transporter FecA and Arachidonate 15-Lipoxygenase.
Collapse
Affiliation(s)
- Ines Putz
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| | - Oliver Brock
- Robotics and Biology Laboratory, Department of Computer Science and Electrical Engineering, Technische Universität Berlin, Berlin, Berlin, Germany
| |
Collapse
|
42
|
Regad L, Chéron JB, Triki D, Senac C, Flatters D, Camproux AC. Exploring the potential of a structural alphabet-based tool for mining multiple target conformations and target flexibility insight. PLoS One 2017; 12:e0182972. [PMID: 28817602 PMCID: PMC5560695 DOI: 10.1371/journal.pone.0182972] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2017] [Accepted: 07/27/2017] [Indexed: 11/18/2022] Open
Abstract
Protein flexibility is often implied in binding with different partners and is essential for protein function. The growing number of macromolecular structures in the Protein Data Bank entries and their redundancy has become a major source of structural knowledge of the protein universe. The analysis of structural variability through available redundant structures of a target, called multiple target conformations (MTC), obtained using experimental or modeling methods and under different biological conditions or different sources is one way to explore protein flexibility. This analysis is essential to improve the understanding of various mechanisms associated with protein target function and flexibility. In this study, we explored structural variability of three biological targets by analyzing different MTC sets associated with these targets. To facilitate the study of these MTC sets, we have developed an efficient tool, SA-conf, dedicated to capturing and linking the amino acid and local structure variability and analyzing the target structural variability space. The advantage of SA-conf is that it could be applied to divers sets composed of MTCs available in the PDB obtained using NMR and crystallography or homology models. This tool could also be applied to analyze MTC sets obtained by dynamics approaches. Our results showed that SA-conf tool is effective to quantify the structural variability of a MTC set and to localize the structural variable positions and regions of the target. By selecting adapted MTC subsets and comparing their variability detected by SA-conf, we highlighted different sources of target flexibility such as induced by binding partner, by mutation and intrinsic flexibility. Our results support the interest to mine available structures associated with a target using to offer valuable insight into target flexibility and interaction mechanisms. The SA-conf executable script, with a set of pre-compiled binaries are available at http://www.mti.univ-paris-diderot.fr/recherche/plateformes/logiciels.
Collapse
Affiliation(s)
- Leslie Regad
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| | - Jean-Baptiste Chéron
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Institut de Chimie de Nice, UMR-CNRS 7272, Faculté des Sciences, Université de Nice-Sophia Antipolis, Nice, France
| | - Dhoha Triki
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Caroline Senac
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- Sorbonne Universités, UPMC Univ Paris 06, CNRS, INSERM, Laboratoire d'Imagerie Biomédicale (LIB), Paris, France
| | - Delphine Flatters
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Anne-Claude Camproux
- Molécules thérapeutiques in silico (MTi), INSERM UMR-S973, Paris, France
- Université Paris Diderot, Sorbonne Paris Cité, Paris, France
- * E-mail: anne-claude.camproux@univ-paris-diderot (ACC); (LR)
| |
Collapse
|
43
|
Brereton AE, Karplus PA. Ensemblator v3: Robust atom-level comparative analyses and classification of protein structure ensembles. Protein Sci 2017; 27:41-50. [PMID: 28762605 DOI: 10.1002/pro.3249] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/12/2017] [Revised: 07/24/2017] [Accepted: 07/27/2017] [Indexed: 12/19/2022]
Abstract
Ensembles of protein structures are increasingly used to represent the conformational variation of a protein as determined by experiment and/or by molecular simulations, as well as uncertainties that may be associated with structure determinations or predictions. Making the best use of such information requires the ability to quantitatively compare entire ensembles. For this reason, we recently introduced the Ensemblator (Clark et al., Protein Sci 2015; 24:1528), a novel approach to compare user-defined groups of models, in residue level detail. Here we describe Ensemblator v3, an open-source program that employs the same basic ensemble comparison strategy but includes major advances that make it more robust, powerful, and user-friendly. Ensemblator v3 carries out multiple sequence alignments to facilitate the generation of ensembles from non-identical input structures, automatically optimizes the key global overlay parameter, optionally performs "ensemble clustering" to classify the models into subgroups, and calculates a novel "discrimination index" that quantifies similarities and differences, at residue or atom level, between each pair of subgroups. The clustering and automatic options mean that no pre-knowledge about an ensemble is required for its analysis. After describing the novel features of Ensemblator v3, we demonstrate its utility using three case studies that illustrate the ease with which complex analyses are accomplished, and the kinds of insights derived from clustering into subgroups and from the detailed information that locates significant differences. The Ensemblator v3 enhances the structural biology toolbox by greatly expanding the kinds of problems to which this ensemble comparison strategy can be applied.
Collapse
Affiliation(s)
- Andrew E Brereton
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, 97331
| | - P Andrew Karplus
- Department of Biochemistry and Biophysics, Oregon State University, Corvallis, Oregon, 97331
| |
Collapse
|
44
|
Halakou F, Kilic ES, Cukuroglu E, Keskin O, Gursoy A. Enriching Traditional Protein-protein Interaction Networks with Alternative Conformations of Proteins. Sci Rep 2017; 7:7180. [PMID: 28775330 PMCID: PMC5543104 DOI: 10.1038/s41598-017-07351-0] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Accepted: 06/27/2017] [Indexed: 12/19/2022] Open
Abstract
Traditional Protein-Protein Interaction (PPI) networks, which use a node and edge representation, lack some valuable information about the mechanistic details of biological processes. Mapping protein structures to these PPI networks not only provides structural details of each interaction but also helps us to find the mutual exclusive interactions. Yet it is not a comprehensive representation as it neglects the conformational changes of proteins which may lead to different interactions, functions, and downstream signalling. In this study, we proposed a new representation for structural PPI networks inspecting the alternative conformations of proteins. We performed a large-scale study by creating breast cancer metastasis network and equipped it with different conformers of proteins. Our results showed that although 88% of proteins in our network has at least two structures in Protein Data Bank (PDB), only 22% of them have alternative conformations and the remaining proteins have different regions saved in PDB. However, using even this small set of alternative conformations we observed a considerable increase in our protein docking predictions. Our protein-protein interaction predictions increased from 54% to 76% using the alternative conformations. We also showed the benefits of investigating structural data and alternative conformations of proteins through three case studies.
Collapse
Affiliation(s)
- Farideh Halakou
- Department of Computer Engineering, Koc University, Istanbul, 34450, Turkey
| | - Emel Sen Kilic
- Department of Chemical and Biological Engineering, Koc University, Istanbul, 34450, Turkey.,Microbiology, Immunology and Cell Biology Department, West Virginia University, Morgantown, 26505, WV, USA
| | - Engin Cukuroglu
- Computational Sciences and Engineering, Graduate School of Sciences and Engineering, Koc University, Istanbul, 34450, Turkey
| | - Ozlem Keskin
- Department of Chemical and Biological Engineering, Koc University, Istanbul, 34450, Turkey
| | - Attila Gursoy
- Department of Computer Engineering, Koc University, Istanbul, 34450, Turkey.
| |
Collapse
|
45
|
Monzon AM, Zea DJ, Fornasari MS, Saldaño TE, Fernandez-Alberti S, Tosatto SCE, Parisi G. Conformational diversity analysis reveals three functional mechanisms in proteins. PLoS Comput Biol 2017; 13:e1005398. [PMID: 28192432 PMCID: PMC5330503 DOI: 10.1371/journal.pcbi.1005398] [Citation(s) in RCA: 35] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2016] [Revised: 02/28/2017] [Accepted: 02/02/2017] [Indexed: 02/02/2023] Open
Abstract
Protein motions are a key feature to understand biological function. Recently, a large-scale analysis of protein conformational diversity showed a positively skewed distribution with a peak at 0.5 Å C-alpha root-mean-square-deviation (RMSD). To understand this distribution in terms of structure-function relationships, we studied a well curated and large dataset of ~5,000 proteins with experimentally determined conformational diversity. We searched for global behaviour patterns studying how structure-based features change among the available conformer population for each protein. This procedure allowed us to describe the RMSD distribution in terms of three main protein classes sharing given properties. The largest of these protein subsets (~60%), which we call "rigid" (average RMSD = 0.83 Å), has no disordered regions, shows low conformational diversity, the largest tunnels and smaller and buried cavities. The two additional subsets contain disordered regions, but with differential sequence composition and behaviour. Partially disordered proteins have on average 67% of their conformers with disordered regions, average RMSD = 1.1 Å, the highest number of hinges and the longest disordered regions. In contrast, malleable proteins have on average only 25% of disordered conformers and average RMSD = 1.3 Å, flexible cavities affected in size by the presence of disordered regions and show the highest diversity of cognate ligands. Proteins in each set are mostly non-homologous to each other, share no given fold class, nor functional similarity but do share features derived from their conformer population. These shared features could represent conformational mechanisms related with biological functions.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Diego Javier Zea
- Bioinformatics Unit, Fundación Instituto Leloir (CONICET), Buenos Aires, Argentina
| | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Tadeo E. Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | - Sebastian Fernandez-Alberti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| | | | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes (CONICET), Bernal, Buenos Aires, Argentina
| |
Collapse
|
46
|
Addressing the Role of Conformational Diversity in Protein Structure Prediction. PLoS One 2016; 11:e0154923. [PMID: 27159429 PMCID: PMC4861349 DOI: 10.1371/journal.pone.0154923] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2015] [Accepted: 04/21/2016] [Indexed: 11/19/2022] Open
Abstract
Computational modeling of tertiary structures has become of standard use to study proteins that lack experimental characterization. Unfortunately, 3D structure prediction methods and model quality assessment programs often overlook that an ensemble of conformers in equilibrium populates the native state of proteins. In this work we collected sets of publicly available protein models and the corresponding target structures experimentally solved and studied how they describe the conformational diversity of the protein. For each protein, we assessed the quality of the models against known conformers by several standard measures and identified those models ranked best. We found that model rankings are defined by both the selected target conformer and the similarity measure used. 70% of the proteins in our datasets show that different models are structurally closest to different conformers of the same protein target. We observed that model building protocols such as template-based or ab initio approaches describe in similar ways the conformational diversity of the protein, although for template-based methods this description may depend on the sequence similarity between target and template sequences. Taken together, our results support the idea that protein structure modeling could help to identify members of the native ensemble, highlight the importance of considering conformational diversity in protein 3D quality evaluations and endorse the study of the variability of the native structure for a meaningful biological analysis.
Collapse
|