1
|
Agoni C, Fernández-Díaz R, Timmons PB, Adelfio A, Gómez H, Shields DC. Molecular Modelling in Bioactive Peptide Discovery and Characterisation. Biomolecules 2025; 15:524. [PMID: 40305228 PMCID: PMC12025251 DOI: 10.3390/biom15040524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2024] [Revised: 03/12/2025] [Accepted: 04/01/2025] [Indexed: 05/02/2025] Open
Abstract
Molecular modelling is a vital tool in the discovery and characterisation of bioactive peptides, providing insights into their structural properties and interactions with biological targets. Many models predicting bioactive peptide function or structure rely on their intrinsic properties, including the influence of amino acid composition, sequence, and chain length, which impact stability, folding, aggregation, and target interaction. Homology modelling predicts peptide structures based on known templates. Peptide-protein interactions can be explored using molecular docking techniques, but there are challenges related to the inherent flexibility of peptides, which can be addressed by more computationally intensive approaches that consider their movement over time, called molecular dynamics (MD). Virtual screening of many peptides, usually against a single target, enables rapid identification of potential bioactive peptides from large libraries, typically using docking approaches. The integration of artificial intelligence (AI) has transformed peptide discovery by leveraging large amounts of data. AlphaFold is a general protein structure prediction tool based on deep learning that has greatly improved the predictions of peptide conformations and interactions, in addition to providing estimates of model accuracy at each residue which greatly guide interpretation. Peptide function and structure prediction are being further enhanced using Protein Language Models (PLMs), which are large deep-learning-derived statistical models that learn computer representations useful to identify fundamental patterns of proteins. Recent methodological developments are discussed in the context of canonical peptides, as well as those with modifications and cyclisations. In designing potential peptide therapeutics, the main outstanding challenge for these methods is the incorporation of diverse non-canonical amino acids and cyclisations.
Collapse
Affiliation(s)
- Clement Agoni
- School of Medicine, University College Dublin, D04 C1P1 Dublin, Ireland;
- Conway Institute of Biomolecular and Biomedical Science, University College Dublin, D04 C1P Dublin, Ireland
- Discipline of Pharmaceutical Sciences, School of Health Sciences, University of KwaZulu-Natal, Durban 4000, South Africa
| | - Raúl Fernández-Díaz
- School of Medicine, University College Dublin, D04 C1P1 Dublin, Ireland;
- IBM Research, D15 HN66 Dublin, Ireland
| | | | - Alessandro Adelfio
- Nuritas Ltd., Joshua Dawson House, D02 RY95 Dublin, Ireland; (P.B.T.); (A.A.); (H.G.)
| | - Hansel Gómez
- Nuritas Ltd., Joshua Dawson House, D02 RY95 Dublin, Ireland; (P.B.T.); (A.A.); (H.G.)
| | - Denis C. Shields
- School of Medicine, University College Dublin, D04 C1P1 Dublin, Ireland;
- Conway Institute of Biomolecular and Biomedical Science, University College Dublin, D04 C1P Dublin, Ireland
| |
Collapse
|
2
|
Navarro AM, Alonso M, Martínez-Pérez E, Lazar T, Gibson TJ, Iserte JA, Tompa P, Marino-Buslje C. Unveiling the Complexity of cis-Regulation Mechanisms in Kinases: A Comprehensive Analysis. Proteins 2025; 93:575-587. [PMID: 39366918 DOI: 10.1002/prot.26751] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2024] [Revised: 08/29/2024] [Accepted: 09/12/2024] [Indexed: 10/06/2024]
Abstract
Protein cis-regulatory elements (CREs) are regions that modulate the activity of a protein through intramolecular interactions. Kinases, pivotal enzymes in numerous biological processes, often undergo regulatory control via inhibitory interactions in cis. This study delves into the mechanisms of cis regulation in kinases mediated by CREs, employing a combined structural and sequence analysis. To accomplish this, we curated an extensive dataset of kinases featuring annotated CREs, organized into homolog families through multiple sequence alignments. Key molecular attributes, including disorder and secondary structure content, active and ATP-binding sites, post-translational modifications, and disease-associated mutations, were systematically mapped onto all sequences. Additionally, we explored the potential for conformational changes between active and inactive states. Finally, we explored the presence of these kinases within membraneless organelles and elucidated their functional roles therein. CREs display a continuum of structures, ranging from short disordered stretches to fully folded domains. The adaptability demonstrated by CREs in achieving the common goal of kinase inhibition spans from direct autoinhibitory interaction with the active site within the kinase domain, to CREs binding to an alternative site, inducing allosteric regulation revealing distinct types of inhibitory mechanisms, which we exemplify by archetypical representative systems. While this study provides a systematic approach to comprehend kinase CREs, further experimental investigations are imperative to unravel the complexity within distinct kinase families. The insights gleaned from this research lay the foundation for future studies aiming to decipher the molecular basis of kinase dysregulation, and explore potential therapeutic interventions.
Collapse
Affiliation(s)
- Alvaro M Navarro
- Structural Bioinformatics Unit, Fundación Instituto Leloir, Buenos Aires, Argentina
| | - Macarena Alonso
- Structural Bioinformatics Unit, Fundación Instituto Leloir, Buenos Aires, Argentina
| | | | - Tamas Lazar
- VIB-VUB Center for Structural Biology, Flanders Institute for Biotechnology (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel, Brussels, Belgium
| | - Toby J Gibson
- Structural and Computational Biology Unit, European Molecular Biology Laboratory, Heidelberg, Germany
| | - Javier A Iserte
- Structural Bioinformatics Unit, Fundación Instituto Leloir, Buenos Aires, Argentina
| | - Peter Tompa
- VIB-VUB Center for Structural Biology, Flanders Institute for Biotechnology (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel, Brussels, Belgium
- Research Centre for Natural Sciences, Hungarian Research Network, Institute of Enzymology, Budapest, Hungary
| | | |
Collapse
|
3
|
Ritaparna P, Dhal AK, Mahapatra RK. An in-silico study of FIKK9.5 protein of Plasmodium falciparum for identification of therapeutics. J Biomol Struct Dyn 2024:1-14. [PMID: 39727019 DOI: 10.1080/07391102.2024.2446671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 09/04/2024] [Indexed: 12/28/2024]
Abstract
The FIKK protein family, encompassing 21 serine-threonine protein kinases, is a distinctive cluster exclusive to the Apicomplexa phylum. Predominantly located in Plasmodium falciparum which is a malarial parasite, with a solitary gene identified in a distinct apicomplexan species, this family derives its nomenclature from - phenylalanine, isoleucine, lysine, lysine (FIKK), a conserved amino acid motif. Integral to the parasite's life cycle and consequential to malaria pathogenesis, the absence of orthologous proteins in eukaryotic organisms designates it as a promising antimalarial drug target. Among the FIKKs, FIKK9.5 plays a pivotal role in the parasite's development within red blood cells (RBCs). This investigation acquired the three-dimensional structure of FIKK9.5 and its ligands through extensive database searches and literature review. Computational screening of natural phytochemicals derived from plants traditionally used in antimalarial remedies was conducted by employing the Glide docking suite. AutoDock Vina was utilized to discern the inhibitor exhibiting optimal binding affinity. Subsequently, Molecular Dynamics (MD) simulations employing GROMACS validated Rufigallol as the most potent inhibitory compound against FIKK9.5. The robustness of the protein-ligand complex was scrutinized through a 200 nanosecond molecular dynamics (MD) trajectory. Trajectory analysis and determination of binding free energies were accomplished using MM-GBSA and MM-PBSA approaches. The ligand-binding exhibited sustained stability throughout the simulation, manifesting an approximate binding free energy of -25.5986 kcal/mol. This comprehensive computational study lays the groundwork for potential experimental validation in the laboratory, paving the way for the development of novel therapeutics targeting FIKK9.5 in the pursuit of innovative antimalarial.
Collapse
Affiliation(s)
- Prajna Ritaparna
- School of Biotechnology, KIIT Deemed To be University, Bhubaneswar, Odisha, India
- National Innovation Foundation-India, TBI-KIIT, Bhubaneswar, Odisha, India
| | - Ajit Kumar Dhal
- School of Biotechnology, KIIT Deemed To be University, Bhubaneswar, Odisha, India
| | | |
Collapse
|
4
|
Rosignoli S, Lustrino E, Di Silverio I, Paiardini A. Making Use of Averaging Methods in MODELLER for Protein Structure Prediction. Int J Mol Sci 2024; 25:1731. [PMID: 38339009 PMCID: PMC10855553 DOI: 10.3390/ijms25031731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 01/23/2024] [Accepted: 01/29/2024] [Indexed: 02/12/2024] Open
Abstract
Recent advances in protein structure prediction, driven by AlphaFold 2 and machine learning, demonstrate proficiency in static structures but encounter challenges in capturing essential dynamic features crucial for understanding biological function. In this context, homology-based modeling emerges as a cost-effective and computationally efficient alternative. The MODELLER (version 10.5, accessed on 30 November 2023) algorithm can be harnessed for this purpose since it computes intermediate models during simulated annealing, enabling the exploration of attainable configurational states and energies while minimizing its objective function. There have been a few attempts to date to improve the models generated by its algorithm, and in particular, there is no literature regarding the implementation of an averaging procedure involving the intermediate models in the MODELLER algorithm. In this study, we examined MODELLER's output using 225 target-template pairs, extracting the best representatives of intermediate models. Applying an averaging procedure to the selected intermediate structures based on statistical potentials, we aimed to determine: (1) whether averaging improves the quality of structural models during the building phase; (2) if ranking by statistical potentials reliably selects the best models, leading to improved final model quality; (3) whether using a single template versus multiple templates affects the averaging approach; (4) whether the "ensemble" nature of the MODELLER building phase can be harnessed to capture low-energy conformations in holo structures modeling. Our findings indicate that while improvements typically fall short of a few decimal points in the model evaluation metric, a notable fraction of configurations exhibit slightly higher similarity to the native structure than MODELLER's proposed final model. The averaging-building procedure proves particularly beneficial in (1) regions of low sequence identity between the target and template(s), the most challenging aspect of homology modeling; (2) holo protein conformations generation, an area in which MODELLER and related tools usually fall short of the expected performance.
Collapse
Affiliation(s)
| | | | | | - Alessandro Paiardini
- Department of Biochemical Sciences, Sapienza University of Rome, 00185 Rome, Italy; (S.R.); (E.L.); (I.D.S.)
| |
Collapse
|
5
|
Escobedo N, Monzon AM, Fornasari MS, Palopoli N, Parisi G. Combining Protein Conformational Diversity and Phylogenetic Information Using CoDNaS and CoDNaS-Q. Curr Protoc 2023; 3:e764. [PMID: 37184204 DOI: 10.1002/cpz1.764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/16/2023]
Abstract
CoDNaS (http://ufq.unq.edu.ar/codnas/) and CoDNaS-Q (http://ufq.unq.edu.ar/codnasq) are repositories of proteins with different degrees of conformational diversity. Following the ensemble nature of the native state, conformational diversity represents the structural differences between the conformers in the ensemble. Each entry in CoDNaS and CoDNaS-Q contains a redundant collection of experimentally determined conformers obtained under different conditions. These conformers represent snapshots of the protein dynamism. While CoDNaS contains examples of conformational diversity at the tertiary level, a recent development, CoDNaS-Q, contains examples at the quaternary level. In the emerging age of accurate protein structure prediction by machine learning approaches, many questions remain open regarding the characterization of protein dynamism. In this context, most bioinformatics resources take advantage of distinct features derived from protein alignments, however, the complexity and heterogeneity of information makes it difficult to recover reliable biological signatures. Here we present five protocols to explore tertiary and quaternary conformational diversity at the individual protein level as well as for the characterization of the distribution of conformational diversity at the protein family level in a phylogenetic context. These protocols can provide curated protein families with experimentally known conformational diversity, facilitating the exploration of sequence determinants of protein dynamism. © 2023 Wiley Periodicals LLC. Basic Protocol 1: Assessing conformational diversity with CoDNaS Alternate Protocol 1: Assessing conformational diversity at the quaternary level with CoDNaS-Q Basic Protocol 2: Exploring conformational diversity in a protein family Alternate Protocol 2: Exploring quaternary conformational diversity in a protein family Basic Protocol 3: Representing conformational diversity in a phylogenetic context.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | | | - María Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Buenos Aires, Argentina
| |
Collapse
|
6
|
Haji-Allahverdipoor K, Jalali Javaran M, Rashidi Monfared S, Khadem-Erfan MB, Nikkhoo B, Bahrami Rad Z, Eslami H, Nasseri S. Insights Into The Effects of Amino Acid Substitutions on The Stability of Reteplase Structure: A Molecular Dynamics Simulation Study. IRANIAN JOURNAL OF BIOTECHNOLOGY 2023; 21:e3175. [PMID: 36811105 PMCID: PMC9938932 DOI: 10.30498/ijb.2022.308798.3175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/04/2021] [Accepted: 07/06/2022] [Indexed: 02/24/2023]
Abstract
Background Reteplase (recombinant plasminogen activator, r-PA) is a recombinant protein designed to imitate the endogenous tissue plasminogen activator and catalyze the plasmin production. It is known that the application of reteplase is limited by the complex production processes and protein's stability challenges. Computational redesign of proteins has gained momentum in recent years, particularly as a powerful tool for improving protein stability and consequently its production efficiency. Hence, in the current study, we implemented computational approaches to improve r-PA conformational stability, which fairly correlates with protein's resistance to proteolysis. Objectives The current study was developed in order to evaluate the effect of amino acid substitutions on the stability of reteplase structure using molecular dynamic simulations and computational predictions. Materials and Methods Several web servers designed for mutation analysis were utilized to select appropriate mutations. Additionally, the experimentally reported mutation, R103S, converting wild type r-PA into non-cleavable form, was also employed. Firstly, mutant collection, consisting of 15 structures, was constructed based on the combinations of four designated mutations. Then, 3D structures were generated using MODELLER. Finally, 17 independent 20-ns molecular dynamics (MD) simulations were conducted and different analysis were performed like root-mean-square deviation (RMSD), root-mean-square fluctuations (RMSF), secondary structure analysis, number of hydrogen bonds, principal components analysis (PCA), eigenvector projection, and density analysis. Results Predicted mutations successfully compensated the more flexible conformation caused by R103S substitution, so, improved conformational stability was analyzed from MD simulations. In particular, R103S/A286I/G322I indicated the best results and remarkably enhanced the protein stability. Conclusion The conformational stability conferred by these mutations will probably lead to more protection of r-PA in protease-rich environments in various recombinant systems and potentially enhance its production and expression level.
Collapse
Affiliation(s)
- Kaveh Haji-Allahverdipoor
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Mokhtar Jalali Javaran
- Department of Biotechnology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Sajad Rashidi Monfared
- Department of Biotechnology, Faculty of Agriculture, Tarbiat Modares University, Tehran, Iran
| | - Mohamad Bagher Khadem-Erfan
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Bahram Nikkhoo
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Zhila Bahrami Rad
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| | - Habib Eslami
- Department of Pharmacology and Toxicology, School of Pharmacy, Hormozgan University of Medicinal sciences, Bandar Abbas, Iran
| | - Sherko Nasseri
- Cellular and Molecular Research Center, Research Institute for Health Development, Kurdistan University of Medical Sciences, Sanandaj, Iran
| |
Collapse
|
7
|
Zea DJ, Teppa E, Marino-Buslje C. Easy Not Easy: Comparative Modeling with High-Sequence Identity Templates. Methods Mol Biol 2023; 2627:83-100. [PMID: 36959443 DOI: 10.1007/978-1-0716-2974-1_5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2023]
Abstract
Homology modeling is the most common technique to build structural models of a target protein based on the structure of proteins with high-sequence identity and available high-resolution structures. This technique is based on the idea that protein structure shows fewer changes than sequence through evolution. While in this scenario single mutations would minimally perturb the structure, experimental evidence shows otherwise: proteins with high conformational diversity impose a limit of the paradigm of comparative modeling as the same protein sequence can adopt dissimilar three-dimensional structures. These cases present challenges for modeling; at first glance, they may seem to be easy cases, but they have a complexity that is not evident at the sequence level. In this chapter, we address the following questions: Why should we care about conformational diversity? How to consider conformational diversity when doing template-based modeling in a practical way?
Collapse
Affiliation(s)
- Diego Javier Zea
- Laboratory of Computational and Quantitative Biology, LCQB, UMR 7238 CNRS, IBPS, Sorbonne Université, Paris, France
| | - Elin Teppa
- Toulouse Biotechnology Institute, TBI, Université de Toulouse, CNRS, INRA, INSA, Toulouse, France
| | | |
Collapse
|
8
|
Iyer M, Jaroszewski L, Sedova M, Godzik A. What the protein data bank tells us about the evolutionary conservation of protein conformational diversity. Protein Sci 2022; 31:e4325. [PMID: 35762711 PMCID: PMC9207624 DOI: 10.1002/pro.4325] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 03/29/2022] [Accepted: 04/06/2022] [Indexed: 11/09/2022]
Abstract
Proteins sample a multitude of different conformations by undergoing small- and large-scale conformational changes that are often intrinsic to their functions. Information about these changes is often captured in the Protein Data Bank by the apparently redundant deposition of independent structural solutions of identical proteins. Here, we mine these data to examine the conservation of large-scale conformational changes between homologous proteins. This is important for both practical reasons, such as predicting alternative conformations of a protein by comparative modeling, and conceptual reasons, such as understanding the extent of conservation of different features in evolution. To study this question, we introduce a novel approach to compare conformational changes between proteins by the comparison of their difference distance maps (DDMs). We found that proteins undergoing similar conformational changes have similar DDMs and that this similarity could be quantified by the correlation between the DDMs. By comparing the DDMs of homologous protein pairs, we found that large-scale conformational changes show a high level of conservation across a broad range of sequence identities. This shows that conformational space is usually conserved between homologs, even relatively distant ones.
Collapse
Affiliation(s)
- Mallika Iyer
- Graduate School of Biomedical SciencesSanford Burnham Prebys Medical Discovery InstituteLa JollaCaliforniaUSA
| | - Lukasz Jaroszewski
- Biosciences DivisionUniversity of California Riverside School of MedicineRiversideCaliforniaUSA
| | - Mayya Sedova
- Biosciences DivisionUniversity of California Riverside School of MedicineRiversideCaliforniaUSA
| | - Adam Godzik
- Biosciences DivisionUniversity of California Riverside School of MedicineRiversideCaliforniaUSA
| |
Collapse
|
9
|
Saldaño T, Escobedo N, Marchetti J, Zea DJ, Mac Donagh J, Velez Rueda AJ, Gonik E, García Melani A, Novomisky Nechcoff J, Salas MN, Peters T, Demitroff N, Fernandez Alberti S, Palopoli N, Fornasari MS, Parisi G. Impact of protein conformational diversity on AlphaFold predictions. Bioinformatics 2022; 38:2742-2748. [PMID: 35561203 DOI: 10.1093/bioinformatics/btac202] [Citation(s) in RCA: 82] [Impact Index Per Article: 27.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 02/10/2022] [Accepted: 03/31/2022] [Indexed: 11/12/2022] Open
Abstract
MOTIVATION After the outstanding breakthrough of AlphaFold in predicting protein 3D models, new questions appeared and remain unanswered. The ensemble nature of proteins, for example, challenges the structural prediction methods because the models should represent a set of conformers instead of single structures. The evolutionary and structural features captured by effective deep learning techniques may unveil the information to generate several diverse conformations from a single sequence. Here, we address the performance of AlphaFold2 predictions obtained through ColabFold under this ensemble paradigm. RESULTS Using a curated collection of apo-holo pairs of conformers, we found that AlphaFold2 predicts the holo form of a protein in ∼70% of the cases, being unable to reproduce the observed conformational diversity with the same error for both conformers. More importantly, we found that AlphaFold2's performance worsens with the increasing conformational diversity of the studied protein. This impairment is related to the heterogeneity in the degree of conformational diversity found between different members of the homologous family of the protein under study. Finally, we found that main-chain flexibility associated with apo-holo pairs of conformers negatively correlates with the predicted local model quality score plDDT, indicating that plDDT values in a single 3D model could be used to infer local conformational changes linked to ligand binding transitions. AVAILABILITY AND IMPLEMENTATION Data and code used in this manuscript are publicly available at https://gitlab.com/sbgunq/publications/af2confdiv-oct2021. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Tadeo Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Julia Marchetti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | | | - Juan Mac Donagh
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Ana Julia Velez Rueda
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Eduardo Gonik
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- INIFTA (CONICET-UNLP) - Fotoquímica y Nanomateriales para el Ambiente y la Biología (nanoFOT), La Plata, Argentina
| | | | | | - Martín N Salas
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
| | - Tomás Peters
- Fundación Instituto Leloir-Instituto de Investigaciones Bioquímicas de Buenos Aires, Buenos Aires, Argentina
| | - Nicolás Demitroff
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
- Fundación Instituto Leloir-Instituto de Investigaciones Bioquímicas de Buenos Aires, Buenos Aires, Argentina
| | - Sebastian Fernandez Alberti
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal, Argentina
- Consejo Nacional de Investigaciones Científicas y Técnicas, Buenos Aires, Argentina
| |
Collapse
|
10
|
Timonina D, Sharapova Y, Švedas V, Suplatov D. Bioinformatic analysis of subfamily-specific regions in 3D-structures of homologs to study functional diversity and conformational plasticity in protein superfamilies. Comput Struct Biotechnol J 2021; 19:1302-1311. [PMID: 33738079 PMCID: PMC7933735 DOI: 10.1016/j.csbj.2021.02.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2020] [Revised: 02/08/2021] [Accepted: 02/09/2021] [Indexed: 02/07/2023] Open
Abstract
Local 3D-structural differences in homologous proteins contribute to functional diversity observed in a superfamily, but so far received little attention as bioinformatic analysis was usually carried out at the level of amino acid sequences. We have developed Zebra3D - the first-of-its-kind bioinformatic software for systematic analysis of 3D-alignments of protein families using machine learning. The new tool identifies subfamily-specific regions (SSRs) - patterns of local 3D-structure (i.e. single residues, loops, or secondary structure fragments) that are spatially equivalent within families/subfamilies, but are different among them, and thus can be associated with functional diversity and function-related conformational plasticity. Bioinformatic analysis of protein superfamilies by Zebra3D can be used to study 3D-determinants of catalytic activity and specific accommodation of ligands, help to prepare focused libraries for directed evolution or assist development of chimeric enzymes with novel properties by exchange of equivalent regions between homologs, and to characterize plasticity in binding sites. A companion Mustguseal web-server is available to automatically construct a 3D-alignment of functionally diverse proteins, thus reducing the minimal input required to operate Zebra3D to a single PDB code. The Zebra3D + Mustguseal combined approach provides the opportunity to systematically explore the value of SSRs in superfamilies and to use this information for protein design and drug discovery. The software is available open-access at https://biokinet.belozersky.msu.ru/Zebra3D.
Collapse
Affiliation(s)
- Daria Timonina
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
| | - Yana Sharapova
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Vytas Švedas
- Lomonosov Moscow State University, Faculty of Bioengineering and Bioinformatics, Lenin Hills 1-73, Moscow 119234, Russia
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
| | - Dmitry Suplatov
- Lomonosov Moscow State University, Belozersky Institute of Physicochemical Biology, Lenin Hills 1-73, Moscow 119234, Russia
- Corresponding author.
| |
Collapse
|
11
|
Sedova M, Jaroszewski L, Iyer M, Li Z, Godzik A. ModFlex: Towards Function Focused Protein Modeling. J Mol Biol 2021; 433:166828. [PMID: 33972023 DOI: 10.1016/j.jmb.2021.166828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 01/07/2021] [Accepted: 01/09/2021] [Indexed: 11/19/2022]
Abstract
There is a wide, and continuously widening, gap between the number of proteins known only by their amino acid sequence versus those structurally characterized by direct experiment. To close this gap, we mostly rely on homology-based inference and modeling to reason about the structures of the uncharacterized proteins by using structures of homologous proteins as templates. With the rapidly growing size of the Protein Data Bank, there are often multiple choices of templates, including multiple sets of coordinates from the same protein. The substantial conformational differences observed between different experimental structures of the same protein often reflect function related structural flexibility. Thus, depending on the questions being asked, using distant homologs, or coordinate sets with lower resolution but solved in the appropriate functional form, as templates may be more informative. The ModFlex server (https://modflex.org/) addresses this seldom mentioned gap in the standard homology modeling approach by providing the user with an interface with multiple options and tools to select the most relevant template and explore the range of structural diversity in the available templates. ModFlex is closely integrated with a range of other programs and servers developed in our group for the analysis and visualization of protein structural flexibility and divergence.
Collapse
Affiliation(s)
- Mayya Sedova
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Lukasz Jaroszewski
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Mallika Iyer
- Graduate School of Biomedical Sciences, Sanford Burnham Prebys Medical Discovery Institute, La Jolla, CA, United States
| | - Zhanwen Li
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States
| | - Adam Godzik
- University of California Riverside School of Medicine, Biosciences Division, Riverside, CA, United States.
| |
Collapse
|
12
|
Dhingra S, Sowdhamini R, Cadet F, Offmann B. A glance into the evolution of template-free protein structure prediction methodologies. Biochimie 2020; 175:85-92. [DOI: 10.1016/j.biochi.2020.04.026] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 04/24/2020] [Accepted: 04/27/2020] [Indexed: 11/26/2022]
|
13
|
Saldaño TE, Freixas VM, Tosatto SCE, Parisi G, Fernandez-Alberti S. Exploring Conformational Space with Thermal Fluctuations Obtained by Normal-Mode Analysis. J Chem Inf Model 2020; 60:3068-3080. [PMID: 32216314 DOI: 10.1021/acs.jcim.9b01136] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]
Abstract
Proteins in their native states can be represented as ensembles of conformers in dynamical equilibrium. Thermal fluctuations are responsible for transitions between these conformers. Normal-modes analysis (NMA) using elastic network models (ENMs) provides an efficient procedure to explore global dynamics of proteins commonly associated with conformational transitions. In the present work, we present an iterative approach to explore protein conformational spaces by introducing structural distortions according to their equilibrium dynamics at room temperature. The approach can be used either to perform unbiased explorations of conformational space or to explore guided pathways connecting two different conformations, e.g., apo and holo forms. In order to test its performance, four proteins with different magnitudes of structural distortions upon ligand binding have been tested. In all cases, the conformational selection model has been confirmed and the conformational space between apo and holo forms has been encompassed. Different strategies have been tested that impact on the efficiency either to achieve a desired conformational change or to achieve a balanced exploration of the protein conformational multiplicity.
Collapse
Affiliation(s)
- Tadeo E Saldaño
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | - Victor M Freixas
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | - Silvio C E Tosatto
- Department of Biomedical Sciences, University of Padova, Viale G. Colombo 3, 5131 Padova, Italy
| | - Gustavo Parisi
- Universidad Nacional de Quilmes/CONICET, Roque Saenz Peña 352, B1876BXD Bernal, Argentina
| | | |
Collapse
|
14
|
Dellafiora L, Oswald IP, Dorne JL, Galaverna G, Battilani P, Dall'Asta C. An in silico structural approach to characterize human and rainbow trout estrogenicity of mycotoxins: Proof of concept study using zearalenone and alternariol. Food Chem 2019; 312:126088. [PMID: 31911350 DOI: 10.1016/j.foodchem.2019.126088] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2019] [Revised: 11/28/2019] [Accepted: 12/18/2019] [Indexed: 02/06/2023]
Abstract
The mycotoxins zearalenone and alternariol may contaminate food and feed raising toxicological concerns due to their estrogenicity. Inter-species differences in their toxicokinetics and toxicodynamics may occur depending on evolution of taxa-specific traits. As a proof of principle, this manuscript investigates the comparative toxicodynamics of zearalenone, its metabolites (alpha-zearalenol and beta-zearalenol), and alternariol with regards to estrogenicity in humans and rainbow trout. An in silico structural approach based on docking simulations, pharmacophore modeling and molecular dynamics was applied and computational results were analyzed in comparison with available experimental data. The differences of estrogenicity among species of zearalenone and its metabolites have been structurally explained. Also, the low estrogenicity of alternariol in trout has been characterized here for the first time. This approach can provide a powerful tool for the characterization of interspecies differences in mycotoxin toxicity for a range of protein targets and relevant compounds for the food- and feed-safety area.
Collapse
Affiliation(s)
- Luca Dellafiora
- Department of Food and Drug, University of Parma, Area Parco delle Scienze 27/A, 43124 Parma, Italy.
| | - Isabelle P Oswald
- Toxalim (Research Centre in Food Toxicology), Université de Toulouse, INRA, ENVT, INP-Purpan, UPS, 31027 Toulouse, France.
| | | | - Gianni Galaverna
- Department of Food and Drug, University of Parma, Area Parco delle Scienze 27/A, 43124 Parma, Italy.
| | - Paola Battilani
- Department of Sustainable Crop Production, Università Cattolica del Sacro Cuore, Via Emilia Parmense 84, 29122 Piacenza, Italy.
| | - Chiara Dall'Asta
- Department of Food and Drug, University of Parma, Area Parco delle Scienze 27/A, 43124 Parma, Italy.
| |
Collapse
|
15
|
Ensembles from Ordered and Disordered Proteins Reveal Similar Structural Constraints during Evolution. J Mol Biol 2019; 431:1298-1307. [DOI: 10.1016/j.jmb.2019.01.031] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2018] [Revised: 01/23/2019] [Accepted: 01/24/2019] [Indexed: 01/08/2023]
|
16
|
Abstract
The native state of proteins is composed of conformers in dynamical equilibrium. In this chapter, different issues related to conformational diversity are explored using a curated and experimentally based database called CoDNaS (Conformational Diversity in the Native State). This database is a collection of redundant structures for the same sequence. CoDNaS estimates the degree of conformational diversity using different global and local structural similarity measures. It allows the user to explore how structural differences among conformers change as a function of several structural features providing further biological information. This chapter explores the measurement of conformational diversity and its relationship with sequence divergence. Also, it discusses how proteins with high conformational diversity could affect homology modeling techniques.
Collapse
Affiliation(s)
- Alexander Miguel Monzon
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina
| | - Diego Javier Zea
- Structural Bioinformatics Unit, Fundación Instituto Leloir, CONICET, Buenos Aires, Argentina
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, CONICET, Bernal, Argentina.
| |
Collapse
|
17
|
How is structural divergence related to evolutionary information? Mol Phylogenet Evol 2018; 127:859-866. [DOI: 10.1016/j.ympev.2018.06.033] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2017] [Revised: 06/01/2018] [Accepted: 06/19/2018] [Indexed: 12/15/2022]
|