1
|
Torres Juárez JA, Hernández Puga AG, Sánchez Tusie AA. Differential molecular interactions between iberiotoxin and human SLO3 and SLO1 potassium channels. J Mol Model 2025; 31:155. [PMID: 40358624 DOI: 10.1007/s00894-025-06379-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2024] [Accepted: 04/28/2025] [Indexed: 05/15/2025]
Abstract
CONTEXT SLO1and SLO3 are similar voltage-gated K + channels. However, SLO3 expression is sperm specific and plays an important role in the hyperpolarization of the sperm membrane potential that is crucial for sperm fertilization. This makes SLO3 an excellent molecular target for the development of male contraceptives, and computational methods can facilitate structural insights for this drug development. Here, we evaluated the differential molecular interactions between the human SLO3 (hSLO3) and SLO1 (hSLO1) potassium channels and iberiotoxin (IbTX), a toxin that selectively blocks SLO channels. To do this, molecular docking and dynamics were implemented on the channel-toxin complexes to help elucidate atomistic details of their interaction and binding energy. Our analysis found that IbTX has a similar binding energy to both channels but interacts in a distinct manner with them. Particularly, Trp14 and Arg25 residues of IbTX diverges in their interaction with the residues Val283 and Asn260 residues of hSLO3 and the corresponding residues Tyr359 and Ala336 of hSLO1. Knowledge of key residues in the molecular interface of IbTX blockage can help guide and hasten non-hormonal contraceptive development. Our results encourage the use of toxins as scaffolds for specific SLO3 blockers. METHODS Atomistic molecular dynamics were implemented on the channel-toxin complexes. To generate the complexes, IbTX was docked to the channels using HADDOCK. CHARMM-GUI was used to generate simulation systems. GROMACS v2023.1 was used to run the simulations for 500 ns in an NPT ensemble at 297.26 K employing the CHARMM36 force field. Binding energy was evaluated by molecular mechanics generalized born surface area (MM/GBSA) with gmxMMPBGBSA.py.
Collapse
|
2
|
Vigneswaran A, Buschmann TA, Latham MP. Leveraging AlphaFold2 and residual dipolar couplings for side-chain methyl group assignment: A case study with S. cerevisiae Xrs2. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2025; 374:107865. [PMID: 40058108 PMCID: PMC11993329 DOI: 10.1016/j.jmr.2025.107865] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/20/2024] [Revised: 02/13/2025] [Accepted: 02/28/2025] [Indexed: 04/13/2025]
Abstract
Side-chain methyl group NMR spectroscopy provides invaluable insights into macromolecular structure, dynamics, and function, particularly for large biomolecular complexes. Accurate assignment of methyl group resonances in two-dimensional spectra is essential for structural and dynamics studies. Traditional methyl group assignment strategies rely on either transferring assignments from backbone resonance data or NOESY data and high-resolution experimental structures; however, these methods are often limited by molecular size or availability of structural information, respectively. Here, we describe the use of AlphaFold2 structural models as a basis for the manual, distance-based assignment of side-chain methyl group resonances in the folded domains of S. cerevisiae Xrs2. While AlphaFold2 models facilitated initial assignments for the methyl resonances, inaccuracies in the side-chain coordinates highlighted the need for improved structural models. By generating >500 ColabFold-derived models and filtering with methyl residual dipolar couplings (RDCs), we identified structural models with superior agreement to experimental data. These refined models enabled additional methyl group assignments while suggesting an iterative approach to simultaneously improve structure prediction and resonance assignment. Our findings outline a workflow that integrates machine learning-based structural predictions with experimental NMR data, offering a pathway for advancing methyl group assignment in systems lacking high-resolution experimental structures.
Collapse
Affiliation(s)
- Ajeak Vigneswaran
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, United States
| | - Tanner A Buschmann
- Department of Chemistry and Biochemistry, Texas Tech University, Lubbock, TX, United States
| | - Michael P Latham
- Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, Minneapolis, MN, United States.
| |
Collapse
|
3
|
Luo Y, Zheng X, Qiu M, Gou Y, Yang Z, Qu X, Chen Z, Lin Y. Deep learning and its applications in nuclear magnetic resonance spectroscopy. PROGRESS IN NUCLEAR MAGNETIC RESONANCE SPECTROSCOPY 2025; 146-147:101556. [PMID: 40306798 DOI: 10.1016/j.pnmrs.2024.101556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/19/2024] [Revised: 12/26/2024] [Accepted: 12/30/2024] [Indexed: 05/02/2025]
Abstract
Nuclear Magnetic Resonance (NMR), as an advanced technology, has widespread applications in various fields like chemistry, biology, and medicine. However, issues such as long acquisition times for multidimensional spectra and low sensitivity limit the broader application of NMR. Traditional algorithms aim to address these issues but have limitations in speed and accuracy. Deep Learning (DL), a branch of Artificial Intelligence (AI) technology, has shown remarkable success in many fields including NMR. This paper presents an overview of the basics of DL and current applications of DL in NMR, highlights existing challenges, and suggests potential directions for improvement.
Collapse
Affiliation(s)
- Yao Luo
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Xiaoxu Zheng
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Mengjie Qiu
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Yaoping Gou
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Zhengxian Yang
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Xiaobo Qu
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Zhong Chen
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China
| | - Yanqin Lin
- Fujian Provincial Key Laboratory of Plasma and Magnetic Resonance, Department of Electronic Science, State Key Laboratory of Physical Chemistry of Solid Surfaces, Xiamen University, Xiamen 361005, China.
| |
Collapse
|
4
|
Huang YJ, Ramelot TA, Spaman LE, Kobayashi N, Montelione GT. Hidden Structural States of Proteins Revealed by Conformer Selection with AlphaFold-NMR. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2024.06.26.600902. [PMID: 38979209 PMCID: PMC11230435 DOI: 10.1101/2024.06.26.600902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/10/2024]
Abstract
We introduce AlphaFold-NMR, a novel approach to NMR structure determination that reveals previously undetected protein conformational states. Unlike conventional NMR methods that rely on NOE-derived spatial restraints, AlphaFold-NMR combines AI-driven conformational sampling with Bayesian scoring of realistic protein models against NOESY and chemical shift data. This method uncovers alternative conformational states of the enzyme Gaussia luciferase, involving large-scale changes in the lid, binding pockets, and other surface cavities. It also identifies similar yet distinct conformational states of the human tumor suppressor Cyclin-Dependent Kinase 2-Associated Protein 1. These studies demonstrate the potential of AI-based modeling with enhanced sampling to generate diverse structural models followed by conformer selection and validation with experimental data as an alternative to traditional restraint-satisfaction protocols for protein NMR structure determination. The AlphaFold-NMR framework enables discovery of conformational heterogeneity and cryptic pockets that conventional NMR analysis methods do not distinguish, providing new insights into protein structure-function relationships.
Collapse
Affiliation(s)
- Yuanpeng J. Huang
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| | - Theresa A. Ramelot
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| | - Laura E. Spaman
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| | - Naohiro Kobayashi
- NMR Science and Development Division. RSC, RIKEN, 1-7-22 Suehiro-cho, Tsurumi-ku, Yokohama City, Kanagawa 230-0045, JAPAN
| | - Gaetano T. Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, 12180 USA
| |
Collapse
|
5
|
Niazi SK, Mariam Z. Artificial intelligence in drug development: reshaping the therapeutic landscape. Ther Adv Drug Saf 2025; 16:20420986251321704. [PMID: 40008227 PMCID: PMC11851753 DOI: 10.1177/20420986251321704] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Accepted: 02/03/2025] [Indexed: 02/27/2025] Open
Abstract
Artificial intelligence (AI) is transforming medication research and development, giving clinicians new treatment options. Over the past 30 years, machine learning, deep learning, and neural networks have revolutionized drug design, target identification, and clinical trial predictions. AI has boosted pharmaceutical R&D (research and development) by identifying new therapeutic targets, improving chemical designs, and predicting complicated protein structures. Furthermore, generative AI is accelerating the development and re-engineering of medicinal molecules to cater to both common and rare diseases. Although, to date, no AI-generated medicinal drug has been FDA-approved, HLX-0201 for fragile X syndrome and new molecules for idiopathic pulmonary fibrosis have entered clinical trials. However, AI models are generally considered "black boxes," making their conclusions challenging to understand and limiting the potential due to a lack of model transparency and algorithmic bias. Despite these obstacles, AI-driven drug discovery has substantially reduced development times and costs, expediting the process and financial risks of bringing new medicines to market. In the future, AI is expected to continue to impact pharmaceutical innovation positively, making life-saving drug discoveries faster, more efficient, and more widespread.
Collapse
Affiliation(s)
- Sarfaraz K. Niazi
- College of Pharmacy, University of Illinois Chicago, 833 South Wood Street, Chicago, IL 60612, USA
| | - Zamara Mariam
- Centre for Health and Life Sciences, Coventry University, Coventry, UK
| |
Collapse
|
6
|
Gai J, File M, Erdei R, Czajlik A, Marx F, Galgóczy L, Váradi G, Batta G. Small Disulfide Proteins with Antifungal Impact: NMR Experimental Structures as Compared to Models of Alphafold Versions. Int J Mol Sci 2025; 26:1247. [PMID: 39941016 PMCID: PMC11818080 DOI: 10.3390/ijms26031247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2024] [Revised: 01/27/2025] [Accepted: 01/29/2025] [Indexed: 02/16/2025] Open
Abstract
In response to the growth of emerging resistance to conventional antifungal drugs, antifungal proteins (AFPs) of filamentous Ascomycetes origin have been discovered in recent years. Understanding the structure of AFPs is crucial for elucidating their antifungal mechanisms and developing new therapeutic agents. While nuclear magnetic resonance (NMR) has proven effective in determining the structures of small proteins, some AFP structures remain unresolved, necessitating the use of alternative prediction methods. Through bioinformatics analysis and heatmaps of amino acid sequence identity and similarity matrix, we categorized AFPs into three major classes and six subcategories, revealing structural and bioactivity differences. We employed AlphaFold (AF) to predict the 3D structures of six different AFPs, with predictions compared to NMR-derived structures. The results demonstrated a high degree of consistency between AF and NMR structures, with AF excelling in structural quality assessment and accurately capturing complex disulfide bond patterns. Both AF2 and AF3 models outperform the NMR model in overall structural quality and coherence, with AF3 showing the best performance. However, the limitations of AF should be considered, including its reduced accuracy in predicting multi-metal ion complexes, suboptimal performance in highly flexible or disordered regions, and its inability to account for multiple conformers, as it generates only a single dominant structure. Moreover, while AF3 accurately predicts all disulfide bond patterns, AF2 falls short in this regard. This study verifies the reliability of AF in the structural prediction of cysteine-rich AFPs while highlighting these constraints, offering important support for the rational design of new protein-based antifungal drugs.
Collapse
Affiliation(s)
- Jiawei Gai
- Department of Organic Chemistry, Faculty of Science and Technology, University of Debrecen, Egyetem tér 1, H-4032 Debrecen, Hungary; (J.G.); (M.F.); (R.E.); (A.C.)
| | - Márk File
- Department of Organic Chemistry, Faculty of Science and Technology, University of Debrecen, Egyetem tér 1, H-4032 Debrecen, Hungary; (J.G.); (M.F.); (R.E.); (A.C.)
| | - Réka Erdei
- Department of Organic Chemistry, Faculty of Science and Technology, University of Debrecen, Egyetem tér 1, H-4032 Debrecen, Hungary; (J.G.); (M.F.); (R.E.); (A.C.)
| | - András Czajlik
- Department of Organic Chemistry, Faculty of Science and Technology, University of Debrecen, Egyetem tér 1, H-4032 Debrecen, Hungary; (J.G.); (M.F.); (R.E.); (A.C.)
| | - Florentine Marx
- Institute of Molecular Biology, Biocenter, Medical University of Innsbruck, Innrain 80-82, A-6020 Innsbruck, Austria;
| | - László Galgóczy
- Department of Biotechnology and Microbiology, Faculty of Science and Informatics, University of Szeged, Közép fasor 52, H-6726 Szeged, Hungary;
| | - Györgyi Váradi
- Department of Medical Chemistry, Albert Szent-Györgyi Medical School, University of Szeged, Dóm tér 8, H-6720 Szeged, Hungary;
| | - Gyula Batta
- Department of Organic Chemistry, Faculty of Science and Technology, University of Debrecen, Egyetem tér 1, H-4032 Debrecen, Hungary; (J.G.); (M.F.); (R.E.); (A.C.)
| |
Collapse
|
7
|
Escobedo N, Saldaño T, Mac Donagh J, Sawicki LR, Palopoli N, Alberti SF, Fornasari MS, Parisi G. Revealing Missing Protein-Ligand Interactions Using AlphaFold Predictions. J Mol Biol 2024; 436:168852. [PMID: 39510344 DOI: 10.1016/j.jmb.2024.168852] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2024] [Revised: 10/05/2024] [Accepted: 10/30/2024] [Indexed: 11/15/2024]
Abstract
Protein-ligand interactions represent an essential step to understand the bases of molecular recognition, an intense field of research in many scientific areas. Structural biology has played a central role in unveiling protein-ligand interactions, but current techniques are still not able to reliably describe the interactions of ligands with highly flexible regions. In this work, we explored the capacity of AlphaFold2 (AF2) to estimate the presence of interactions between ligands and residues belonging to disordered regions. As these interactions are missing in the crystallographic-derived structures, we called them "ghost interactions". Using a set of protein structures experimentally obtained after AF2 was trained, we found that the obtained models are good predictors of regions associated with order-disorder transitions. Additionally, we found that AF2 predicts residues making ghost interactions with ligands, which are mostly buried and show differential evolutionary conservation with the rest of the residues located in the flexible region. Our findings could fuel current areas of research that consider, given their biological relevance and their involvement in diseases, intrinsically disordered proteins as potentially valuable targets for drug development.
Collapse
Affiliation(s)
- Nahuel Escobedo
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | - Tadeo Saldaño
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina; Departamento de Ciencias Básicas, Facultad de Agronomía, Universidad Nacional del Centro de la Provincia de Buenos Aires, Azul, Buenos Aires B7300, Argentina
| | - Juan Mac Donagh
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | | | - Nicolas Palopoli
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina
| | | | - Maria Silvina Fornasari
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina.
| | - Gustavo Parisi
- Departamento de Ciencia y Tecnología, Universidad Nacional de Quilmes, Bernal B1876BXD, Argentina.
| |
Collapse
|
8
|
Lazou M, Khan O, Nguyen T, Padhorny D, Kozakov D, Joseph-McCarthy D, Vajda S. Predicting multiple conformations of ligand binding sites in proteins suggests that AlphaFold2 may remember too much. Proc Natl Acad Sci U S A 2024; 121:e2412719121. [PMID: 39565312 PMCID: PMC11621821 DOI: 10.1073/pnas.2412719121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2024] [Accepted: 10/21/2024] [Indexed: 11/21/2024] Open
Abstract
The goal of this paper is predicting the conformational distributions of ligand binding sites using the AlphaFold2 (AF2) protein structure prediction program with stochastic subsampling of the multiple sequence alignment (MSA). We explored the opening of cryptic ligand binding sites in 16 proteins, where the closed and open conformations define the expected extreme points of the conformational variation. Due to the many structures of these proteins in the Protein Data Bank (PDB), we were able to study whether the distribution of X-ray structures affects the distribution of AF2 models. We have found that AF2 generates both a cluster of open and a cluster of closed models for proteins that have comparable numbers of open and closed structures in the PDB and not too many other conformations. This was observed even with default MSA parameters, thus without further subsampling. In contrast, with the exception of a single protein, AF2 did not yield multiple clusters of conformations for proteins that had imbalanced numbers of open and closed structures in the PDB, or had substantial numbers of other structures. Subsampling improved the results only for a single protein, but very shallow MSA led to incorrect structures. The ability of generating both open and closed conformations for six out of the 16 proteins agrees with the success rates of similar studies reported in the literature. However, we showed that this partial success is due to AF2 "remembering" the conformational distributions in the PDB and that the approach fails to predict rarely seen conformations.
Collapse
Affiliation(s)
- Maria Lazou
- Department of Biomedical Engineering, Boston University, Boston, MA02215
| | - Omeir Khan
- Department of Chemistry, Boston University, Boston, MA02215
| | - Thu Nguyen
- Department of Computer Science, Stony Brook University, Stony Brook, NY11794
| | - Dzmitry Padhorny
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY11794
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
| | - Dima Kozakov
- Department of Applied Mathematics and Statistics, Stony Brook University, Stony Brook, NY11794
- Laufer Center for Physical and Quantitative Biology, Stony Brook University, Stony Brook, NY11794
| | - Diane Joseph-McCarthy
- Department of Biomedical Engineering, Boston University, Boston, MA02215
- Department of Chemistry, Boston University, Boston, MA02215
| | - Sandor Vajda
- Department of Biomedical Engineering, Boston University, Boston, MA02215
- Department of Chemistry, Boston University, Boston, MA02215
| |
Collapse
|
9
|
Ortiz-Morazán AS, Moncada MM, Escobar D, Cabrera-Moreno LA, Fontecha G. Coevolutionary Analysis of the Pfs47-P47Rec Complex: A Bioinformatics Approach. Bioinform Biol Insights 2024; 18:11779322241284223. [PMID: 39386977 PMCID: PMC11462556 DOI: 10.1177/11779322241284223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 08/28/2024] [Indexed: 10/12/2024] Open
Abstract
Background The ability to predict and comprehend molecular interactions offers significant insights into the biological functions of proteins. The interaction between surface protein 47 of Plasmodium falciparum (Pfs47) and receptor of the protein 47 (P47Rec) has attracted increased attention due to their role in parasite evasion of the mosquito immune system and the concept of geographical coevolution between species. The aims of this study were as follows: to apply a bioinformatics approach to investigate the interaction between Pfs47 and P47Rec proteins and to identify the potential binding sites, protein orientations and receptor specificity sites concerning the geographical origins of the vectors and the parasite. Methods Public sequences of the pfs47 and p47rec genes were downloaded and subsequently filtered to predict functional and structural annotations of the Pfs47-P47Rec complex. Phylogenetic analyses of both proteins were carried out. In addition, the p47Rec gene was subjected to sequencing and subsequent analysis in 2 distinct Anopheles species collected in Honduras. Results The examination of motifs reveals a significant degree of conservation in pfs47, suggesting that Pfs47 might have undergone recent evolutionary development and adaptation. Structural models and docking analyses supported the theory of selectivity of Plasmodium falciparum strains towards their vectors in diverse geographical regions. A detailed description of the putative interaction between the Pfs47-P47Rec complex is shown. Conclusions The study identifies coevolutionary patterns between P47Rec and Pfs47 related to the speciation and geographic dispersion of Anopheles species and Plasmodium falciparum, with Pfs47 evolving more recently than P47Rec. This suggests a link between the parasite's adaptability and existing anopheline species across different regions. P47Rec likely has a cytoplasmic localization due to its lack of membrane attachment elements. However, these findings are based on simulations and require validation through methods like cryo-electron microscopy. A significant limitation is the scarcity of sequences in global databases, which restricts precise interaction modelling. Further research with diverse parasite isolates and anopheline species is recommended to enhance understanding of these proteins' structure and interaction.
Collapse
Affiliation(s)
- Andrés S Ortiz-Morazán
- Instituto de Investigaciones en Microbiología, Facultad de Ciencias, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Marcela María Moncada
- Instituto de Investigaciones en Microbiología, Facultad de Ciencias, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Denis Escobar
- Instituto de Investigaciones en Microbiología, Facultad de Ciencias, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Leonardo A Cabrera-Moreno
- Instituto de Investigaciones en Microbiología, Facultad de Ciencias, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| | - Gustavo Fontecha
- Instituto de Investigaciones en Microbiología, Facultad de Ciencias, Universidad Nacional Autónoma de Honduras, Tegucigalpa, Honduras
| |
Collapse
|
10
|
Mubeen H, Masood A, Zafar A, Khan ZQ, Khan MQ, Nisa AU. Insights into AlphaFold's breakthrough in neurodegenerative diseases. Ir J Med Sci 2024; 193:2577-2588. [PMID: 38833116 DOI: 10.1007/s11845-024-03721-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2024] [Accepted: 05/19/2024] [Indexed: 06/06/2024]
Abstract
Neurodegenerative diseases (ND) are disorders of the central nervous system (CNS) characterized by impairment in neurons' functions, and complete loss, leading to memory loss, and difficulty in learning, language, and movement processes. The most common among these NDs are Alzheimer's disease (AD) and Parkinson's disease (PD), although several other disorders also exist. These are frontotemporal dementia (FTD), amyotrophic lateral syndrome (ALS), Huntington's disease (HD), and others; the major pathological hallmark of NDs is the proteinopathies, either of amyloid-β (Aβ), tauopathies, or synucleinopathies. Aggregation of proteins that do not undergo normal configuration, either due to mutations or through some disturbance in cellular pathway contributes to the diseases. Artificial Intelligence (AI) and deep learning (DL) have proven to be successful in the diagnosis and treatment of various congenital diseases. DL approaches like AlphaFold (AF) are a major leap towards success in CNS disorders. This 3D protein geometry modeling algorithm developed by DeepMind has the potential to revolutionize biology. AF has the potential to predict 3D-protein confirmation at an accuracy level comparable to experimentally predicted one, with the additional advantage of precisely estimating protein interactions. This breakthrough will be beneficial to identify diseases' advancement and the disturbance of signaling pathways stimulating impaired functions of proteins. Though AlphaFold has solved a major problem in structural biology, it cannot predict membrane proteins-a beneficial approach for drug designing.
Collapse
Affiliation(s)
- Hira Mubeen
- Department of Biotechnology, Faculty of Science & Technology, University of Central Punjab, Lahore, Pakistan.
| | - Ammara Masood
- Department of Biotechnology, Faculty of Science & Technology, University of Central Punjab, Lahore, Pakistan
| | - Asma Zafar
- Department of Biotechnology, Faculty of Science & Technology, University of Central Punjab, Lahore, Pakistan
| | - Zohaira Qayyum Khan
- Department of Biotechnology, Faculty of Science & Technology, University of Central Punjab, Lahore, Pakistan
| | - Muneeza Qayyum Khan
- Department of Biotechnology, Faculty of Science & Technology, University of Central Punjab, Lahore, Pakistan
| | - Alim Un Nisa
- Pakistan Council of Scientific and Industrial Research, Lahore, Pakistan
| |
Collapse
|
11
|
Benavides TL, Montelione GT. Integrative Modeling of Protein-Polypeptide Complexes by Bayesian Model Selection using AlphaFold and NMR Chemical Shift Perturbation Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.19.613999. [PMID: 39345459 PMCID: PMC11430059 DOI: 10.1101/2024.09.19.613999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/01/2024]
Abstract
Protein-polypeptide interactions, including those involving intrinsically-disordered peptides and intrinsically-disordered regions of protein binding partners, are crucial for many biological functions. However, experimental structure determination of protein-peptide complexes can be challenging. Computational methods, while promising, generally require experimental data for validation and refinement. Here we present CSP_Rank, an integrated modeling approach to determine the structures of protein-peptide complexes. This method combines AlphaFold2 (AF2) enhanced sampling methods with a Bayesian conformational selection process based on experimental Nuclear Magnetic Resonance (NMR) Chemical Shift Perturbation (CSP) data and AF2 confidence metrics. Using a curated dataset of 108 protein-peptide complexes from the Biological Magnetic Resonance Data Bank (BMRB), we observe that while AF2 typically yields models with excellent consistency with experimental CSP data, applying enhanced sampling followed by data-guided conformational selection routinely results in ensembles of structures with improved agreement with NMR observables. For two systems, we cross-validate the CSP-selected models using independently acquired nuclear Overhauser effect (NOE) NMR data and demonstrate how CSP and NMR can be combined using our Bayesian framework for model selection. CSP_Rank is a novel method for integrative modeling of protein-peptide complexes and has broad implications for studies of protein-peptide interactions and aiding in understanding their biological functions.
Collapse
Affiliation(s)
- Tiburon L. Benavides
- Department of Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Gaetano T. Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| |
Collapse
|
12
|
Agarwal V, McShan AC. The power and pitfalls of AlphaFold2 for structure prediction beyond rigid globular proteins. Nat Chem Biol 2024; 20:950-959. [PMID: 38907110 PMCID: PMC11956457 DOI: 10.1038/s41589-024-01638-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2023] [Accepted: 04/29/2024] [Indexed: 06/23/2024]
Abstract
Artificial intelligence-driven advances in protein structure prediction in recent years have raised the question: has the protein structure-prediction problem been solved? Here, with a focus on nonglobular proteins, we highlight the many strengths and potential weaknesses of DeepMind's AlphaFold2 in the context of its biological and therapeutic applications. We summarize the subtleties associated with evaluation of AlphaFold2 model quality and reliability using the predicted local distance difference test (pLDDT) and predicted aligned error (PAE) values. We highlight various classes of proteins that AlphaFold2 can be applied to and the caveats involved. Concrete examples of how AlphaFold2 models can be integrated with experimental data in the form of small-angle X-ray scattering (SAXS), solution NMR, cryo-electron microscopy (cryo-EM) and X-ray diffraction are discussed. Finally, we highlight the need to move beyond structure prediction of rigid, static structural snapshots toward conformational ensembles and alternate biologically relevant states. The overarching theme is that careful consideration is due when using AlphaFold2-generated models to generate testable hypotheses and structural models, rather than treating predicted models as de facto ground truth structures.
Collapse
Affiliation(s)
- Vinayak Agarwal
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA.
- School of Biological Sciences, Georgia Institute of Technology, Atlanta, GA, USA.
| | - Andrew C McShan
- School of Chemistry and Biochemistry, Georgia Institute of Technology, Atlanta, GA, USA.
| |
Collapse
|
13
|
Bonin JP, Aramini JM, Dong Y, Wu H, Kay LE. AlphaFold2 as a replacement for solution NMR structure determination of small proteins: Not so fast! JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2024; 364:107725. [PMID: 38917639 DOI: 10.1016/j.jmr.2024.107725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/24/2024] [Revised: 06/18/2024] [Accepted: 06/19/2024] [Indexed: 06/27/2024]
Abstract
The determination of a protein's structure is often a first step towards the development of a mechanistic understanding of its function. Considerable advances in computational protein structure prediction have been made in recent years, with AlphaFold2 (AF2) emerging as the primary tool used by researchers for this purpose. While AF2 generally predicts accurate structures of folded proteins, we present here a case where AF2 incorrectly predicts the structure of a small, folded and compact protein with high confidence. This protein, pro-interleukin-18 (pro-IL-18), is the precursor of the cytokine IL-18. Interestingly, the structure of pro-IL-18 predicted by AF2 matches that of the mature cytokine, and not the corresponding experimentally determined structure of the pro-form of the protein. Thus, while computational structure prediction holds immense promise for addressing problems in protein biophysics, there is still a need for experimental structure determination, even in the context of small well-folded, globular proteins.
Collapse
Affiliation(s)
- Jeffrey P Bonin
- Departments of Molecular Genetics and Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada; Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada; Program in Molecular Medicine, The Hospital for Sick Children Research Institute, Toronto, Ontario M5G 0A4, Canada
| | - James M Aramini
- Departments of Molecular Genetics and Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada; Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada; Program in Molecular Medicine, The Hospital for Sick Children Research Institute, Toronto, Ontario M5G 0A4, Canada
| | - Ying Dong
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA, USA; Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Hao Wu
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, Boston, MA, USA; Program in Cellular and Molecular Medicine, Boston Children's Hospital, Boston, MA, USA
| | - Lewis E Kay
- Departments of Molecular Genetics and Biochemistry, University of Toronto, Toronto, Ontario M5S 1A8, Canada; Department of Chemistry, University of Toronto, Toronto, Ontario M5S 3H6, Canada; Program in Molecular Medicine, The Hospital for Sick Children Research Institute, Toronto, Ontario M5G 0A4, Canada.
| |
Collapse
|
14
|
Jiao Z, He Y, Fu X, Zhang X, Geng Z, Ding W. A predicted model-aided reconstruction algorithm for X-ray free-electron laser single-particle imaging. IUCRJ 2024; 11:602-619. [PMID: 38904548 PMCID: PMC11220885 DOI: 10.1107/s2052252524004858] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/09/2024] [Accepted: 05/23/2024] [Indexed: 06/22/2024]
Abstract
Ultra-intense, ultra-fast X-ray free-electron lasers (XFELs) enable the imaging of single protein molecules under ambient temperature and pressure. A crucial aspect of structure reconstruction involves determining the relative orientations of each diffraction pattern and recovering the missing phase information. In this paper, we introduce a predicted model-aided algorithm for orientation determination and phase retrieval, which has been tested on various simulated datasets and has shown significant improvements in the success rate, accuracy and efficiency of XFEL data reconstruction.
Collapse
Affiliation(s)
- Zhichao Jiao
- Laboratory of Soft Matter PhysicsInstitute of Physics, Chinese Academy of SciencesBeijing100190People’s Republic of China
- University of Chinese Academy of SciencesBeijing100049People’s Republic of China
| | - Yao He
- Research Instrument ScientistNew York University Abu DhabiAbu DhabiUnited Arab Emirates
| | - Xingke Fu
- Laboratory of Soft Matter PhysicsInstitute of Physics, Chinese Academy of SciencesBeijing100190People’s Republic of China
- University of Chinese Academy of SciencesBeijing100049People’s Republic of China
| | - Xin Zhang
- The University of Hong KongHong Kong SARPeople’s Republic of China
| | - Zhi Geng
- Beijing Synchrotron Radiation FacilityInstitute of High Energy Physics, Chinese Academy of SciencesBeijing100049People’s Republic of China
- University of Chinese Academy of SciencesBeijing100049People’s Republic of China
| | - Wei Ding
- Laboratory of Soft Matter PhysicsInstitute of Physics, Chinese Academy of SciencesBeijing100190People’s Republic of China
- University of Chinese Academy of SciencesBeijing100049People’s Republic of China
| |
Collapse
|
15
|
Caparotta M, Perez A. Advancing Molecular Dynamics: Toward Standardization, Integration, and Data Accessibility in Structural Biology. J Phys Chem B 2024; 128:2219-2227. [PMID: 38418288 DOI: 10.1021/acs.jpcb.3c04823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/01/2024]
Abstract
Molecular dynamics (MD) simulations have become a valuable tool in structural biology, offering insights into complex biological systems that are difficult to obtain through experimental techniques alone. The lack of available data sets and structures in most published computational work has limited other researchers' use of these models. In recent years, the emergence of online sharing platforms and MD database initiatives favor the deposition of ensembles and structures to accompany publications, favoring reuse of the data sets. However, the lack of uniform metadata collection, formats, and what data are deposited limits the impact and its use by different communities that are not necessarily experts in MD. This Perspective highlights the need for standardization and better resource sharing for processing and interpreting MD simulation results, akin to efforts in other areas of structural biology. As the field moves forward, we will see an increase in popularity and benefits of MD-based integrative approaches combining experimental data and simulations through probabilistic reasoning, but these too are limited by uniformity in experimental data availability and choices on how the data are modeled that are not trivial to decipher from papers. Other fields have addressed similar challenges comprehensively by establishing task forces with different degrees of success. The large scope and number of communities to represent the breadth of types of MD simulations complicates a parallel approach that would fit all. Thus, each group typically decides what data and which format to upload on servers like Zenodo. Uploading data with FAIR (findable, accessible, interoperable, reusable) principles in mind including optimal metadata collection will make the data more accessible and actionable by the community. Such a wealth of simulation data will foster method development and infrastructure advancements, thus propelling the field forward.
Collapse
Affiliation(s)
- Marcelo Caparotta
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| | - Alberto Perez
- Department of Chemistry and Quantum Theory Project, University of Florida, Gainesville, Florida 32611, United States
| |
Collapse
|
16
|
Klukowski P, Damberger FF, Allain FHT, Iwai H, Kadavath H, Ramelot TA, Montelione GT, Riek R, Güntert P. The 100-protein NMR spectra dataset: A resource for biomolecular NMR data analysis. Sci Data 2024; 11:30. [PMID: 38177162 PMCID: PMC10767026 DOI: 10.1038/s41597-023-02879-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Accepted: 12/22/2023] [Indexed: 01/06/2024] Open
Abstract
Multidimensional NMR spectra are the basis for studying proteins by NMR spectroscopy and crucial for the development and evaluation of methods for biomolecular NMR data analysis. Nevertheless, in contrast to derived data such as chemical shift assignments in the BMRB and protein structures in the PDB databases, this primary data is in general not publicly archived. To change this unsatisfactory situation, we present a standardized set of solution NMR data comprising 1329 2-4-dimensional NMR spectra and associated reference (chemical shift assignments, structures) and derived (peak lists, restraints for structure calculation, etc.) annotations. With the 100-protein NMR spectra dataset that was originally compiled for the development of the ARTINA deep learning-based spectra analysis method, 100 protein structures can be reproduced from their original experimental data. The 100-protein NMR spectra dataset is expected to help the development of computational methods for NMR spectroscopy, in particular machine learning approaches, and enable consistent and objective comparisons of these methods.
Collapse
Affiliation(s)
- Piotr Klukowski
- Institute of Molecular Physical Science, ETH Zurich, 8093, Zurich, Switzerland.
| | - Fred F Damberger
- Institute of Biochemistry, ETH Zurich, 8093, Zurich, Switzerland
| | | | - Hideo Iwai
- Institute of Biotechnology, University of Helsinki, 00100, Helsinki, Finland
| | | | - Theresa A Ramelot
- Department of Chemistry and Chemical Biology, and Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Gaetano T Montelione
- Department of Chemistry and Chemical Biology, and Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY, 12180, USA
| | - Roland Riek
- Institute of Molecular Physical Science, ETH Zurich, 8093, Zurich, Switzerland.
| | - Peter Güntert
- Institute of Molecular Physical Science, ETH Zurich, 8093, Zurich, Switzerland.
- Institute of Biophysical Chemistry, Goethe University, 60438, Frankfurt am Main, Germany.
- Department of Chemistry, Tokyo Metropolitan University, Hachioji, 192-0397, Tokyo, Japan.
| |
Collapse
|
17
|
Abstract
Computational prediction of protein structure has been pursued intensely for decades, motivated largely by the goal of using structural models for drug discovery. Recently developed machine-learning methods such as AlphaFold 2 (AF2) have dramatically improved protein structure prediction, with reported accuracy approaching that of experimentally determined structures. To what extent do these advances translate to an ability to predict more accurately how drugs and drug candidates bind to their target proteins? Here, we carefully examine the utility of AF2 protein structure models for predicting binding poses of drug-like molecules at the largest class of drug targets, the G-protein-coupled receptors. We find that AF2 models capture binding pocket structures much more accurately than traditional homology models, with errors nearly as small as differences between structures of the same protein determined experimentally with different ligands bound. Strikingly, however, the accuracy of ligand-binding poses predicted by computational docking to AF2 models is not significantly higher than when docking to traditional homology models and is much lower than when docking to structures determined experimentally without these ligands bound. These results have important implications for all those who might use predicted protein structures for drug discovery.
Collapse
Affiliation(s)
- Masha Karelina
- Biophysics Program, Stanford UniversityStanfordUnited States
- Department of Computer Science, Stanford UniversityStanfordUnited States
- Department of Molecular and Cellular Physiology, Stanford University School of MedicineStanfordUnited States
- Department of Structural Biology, Stanford University School of MedicineStanfordUnited States
- Institute for Computational and Mathematical Engineering, Stanford UniversityStanfordUnited States
| | - Joseph J Noh
- Department of Computer Science, Stanford UniversityStanfordUnited States
- Department of Molecular and Cellular Physiology, Stanford University School of MedicineStanfordUnited States
- Department of Structural Biology, Stanford University School of MedicineStanfordUnited States
- Institute for Computational and Mathematical Engineering, Stanford UniversityStanfordUnited States
| | - Ron O Dror
- Biophysics Program, Stanford UniversityStanfordUnited States
- Department of Computer Science, Stanford UniversityStanfordUnited States
- Department of Molecular and Cellular Physiology, Stanford University School of MedicineStanfordUnited States
- Department of Structural Biology, Stanford University School of MedicineStanfordUnited States
- Institute for Computational and Mathematical Engineering, Stanford UniversityStanfordUnited States
| |
Collapse
|
18
|
Simoens L, Fijalkowski I, Van Damme P. Exposing the small protein load of bacterial life. FEMS Microbiol Rev 2023; 47:fuad063. [PMID: 38012116 PMCID: PMC10723866 DOI: 10.1093/femsre/fuad063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Revised: 11/10/2023] [Accepted: 11/24/2023] [Indexed: 11/29/2023] Open
Abstract
The ever-growing repertoire of genomic techniques continues to expand our understanding of the true diversity and richness of prokaryotic genomes. Riboproteogenomics laid the foundation for dynamic studies of previously overlooked genomic elements. Most strikingly, bacterial genomes were revealed to harbor robust repertoires of small open reading frames (sORFs) encoding a diverse and broadly expressed range of small proteins, or sORF-encoded polypeptides (SEPs). In recent years, continuous efforts led to great improvements in the annotation and characterization of such proteins, yet many challenges remain to fully comprehend the pervasive nature of small proteins and their impact on bacterial biology. In this work, we review the recent developments in the dynamic field of bacterial genome reannotation, catalog the important biological roles carried out by small proteins and identify challenges obstructing the way to full understanding of these elusive proteins.
Collapse
Affiliation(s)
- Laure Simoens
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, 9000 Ghent, Belgium
| | - Igor Fijalkowski
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, 9000 Ghent, Belgium
| | - Petra Van Damme
- iRIP Unit, Laboratory of Microbiology, Department of Biochemistry and Microbiology, Ghent University, K. L. Ledeganckstraat 35, 9000 Ghent, Belgium
| |
Collapse
|
19
|
Li EH, Spaman LE, Tejero R, Janet Huang Y, Ramelot TA, Fraga KJ, Prestegard JH, Kennedy MA, Montelione GT. Blind assessment of monomeric AlphaFold2 protein structure models with experimental NMR data. JOURNAL OF MAGNETIC RESONANCE (SAN DIEGO, CALIF. : 1997) 2023; 352:107481. [PMID: 37257257 PMCID: PMC10659763 DOI: 10.1016/j.jmr.2023.107481] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/22/2023] [Revised: 05/08/2023] [Accepted: 05/15/2023] [Indexed: 06/02/2023]
Abstract
Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open-source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15N-1H residual dipolar coupling data. For these nine small (70-108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research.
Collapse
Affiliation(s)
- Ethan H Li
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA
| | - Laura E Spaman
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| | - Roberto Tejero
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| | - Yuanpeng Janet Huang
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| | - Theresa A Ramelot
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| | - Keith J Fraga
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| | - James H Prestegard
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602, USA.
| | - Michael A Kennedy
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056, USA.
| | - Gaetano T Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA.
| |
Collapse
|
20
|
Sedinkin SL, Burns D, Shukla D, Potoyan DA, Venditti V. Solution Structure Ensembles of the Open and Closed Forms of the ∼130 kDa Enzyme I via AlphaFold Modeling, Coarse Grained Simulations, and NMR. J Am Chem Soc 2023; 145:13347-13356. [PMID: 37278728 PMCID: PMC10772991 DOI: 10.1021/jacs.3c03425] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Large-scale interdomain rearrangements are essential to protein function, governing the activity of large enzymes and molecular machineries. Yet, obtaining an atomic-resolution understanding of how the relative domain positioning is affected by external stimuli is a hard task in modern structural biology. Here, we show that combining structural modeling by AlphaFold2 with coarse-grained molecular dynamics simulations and NMR residual dipolar coupling data is sufficient to characterize the spatial domain organization of bacterial enzyme I (EI), a ∼130 kDa multidomain oligomeric protein that undergoes large-scale conformational changes during its catalytic cycle. In particular, we solve conformational ensembles for EI at two different experimental temperatures and demonstrate that a lower temperature favors sampling of the catalytically competent closed state of the enzyme. These results suggest a role for conformational entropy in the activation of EI and demonstrate the ability of our protocol to detect and characterize the effect of external stimuli (such as mutations, ligand binding, and post-translational modifications) on the interdomain organization of multidomain proteins. We expect the ensemble refinement protocol described here to be easily transferrable to the investigation of the structure and dynamics of other uncharted multidomain systems and have assembled a Google Colab page (https://potoyangroup.github.io/Seq2Ensemble/) to facilitate implementation of the presented methodology elsewhere.
Collapse
Affiliation(s)
| | - Daniel Burns
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, USA
| | - Divyanshu Shukla
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
| | - Davit A. Potoyan
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, USA
| | - Vincenzo Venditti
- Department of Chemistry, Iowa State University, Ames, Iowa 50011, USA
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, USA
| |
Collapse
|
21
|
Nussinov R, Zhang M, Liu Y, Jang H. AlphaFold, allosteric, and orthosteric drug discovery: Ways forward. Drug Discov Today 2023; 28:103551. [PMID: 36907321 PMCID: PMC10238671 DOI: 10.1016/j.drudis.2023.103551] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2022] [Revised: 02/27/2023] [Accepted: 03/07/2023] [Indexed: 03/13/2023]
Abstract
Drug discovery is arguably a highly challenging and significant interdisciplinary aim. The stunning success of the artificial intelligence-powered AlphaFold, whose latest version is buttressed by an innovative machine-learning approach that integrates physical and biological knowledge about protein structures, raised drug discovery hopes that unsurprisingly, have not come to bear. Even though accurate, the models are rigid, including the drug pockets. AlphaFold's mixed performance poses the question of how its power can be harnessed in drug discovery. Here we discuss possible ways of going forward wielding its strengths, while bearing in mind what AlphaFold can and cannot do. For kinases and receptors, an input enriched in active (ON) state models can better AlphaFold's chance of rational drug design success.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel.
| | - Mingzhen Zhang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA
| |
Collapse
|
22
|
Koehler Leman J, Künze G. Recent Advances in NMR Protein Structure Prediction with ROSETTA. Int J Mol Sci 2023; 24:ijms24097835. [PMID: 37175539 PMCID: PMC10178863 DOI: 10.3390/ijms24097835] [Citation(s) in RCA: 19] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2023] [Revised: 04/15/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Nuclear magnetic resonance (NMR) spectroscopy is a powerful method for studying the structure and dynamics of proteins in their native state. For high-resolution NMR structure determination, the collection of a rich restraint dataset is necessary. This can be difficult to achieve for proteins with high molecular weight or a complex architecture. Computational modeling techniques can complement sparse NMR datasets (<1 restraint per residue) with additional structural information to elucidate protein structures in these difficult cases. The Rosetta software for protein structure modeling and design is used by structural biologists for structure determination tasks in which limited experimental data is available. This review gives an overview of the computational protocols available in the Rosetta framework for modeling protein structures from NMR data. We explain the computational algorithms used for the integration of different NMR data types in Rosetta. We also highlight new developments, including modeling tools for data from paramagnetic NMR and hydrogen-deuterium exchange, as well as chemical shifts in CS-Rosetta. Furthermore, strategies are discussed to complement and improve structure predictions made by the current state-of-the-art AlphaFold2 program using NMR-guided Rosetta modeling.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY 10010, USA
| | - Georg Künze
- Institute for Drug Discovery, Medical Faculty, University of Leipzig, Brüderstr. 34, D-04103 Leipzig, Germany
- Interdisciplinary Center for Bioinformatics, University of Leipzig, Härtelstr. 16-18, D-04107 Leipzig, Germany
| |
Collapse
|
23
|
Yang Z, Zeng X, Zhao Y, Chen R. AlphaFold2 and its applications in the fields of biology and medicine. Signal Transduct Target Ther 2023; 8:115. [PMID: 36918529 PMCID: PMC10011802 DOI: 10.1038/s41392-023-01381-z] [Citation(s) in RCA: 182] [Impact Index Per Article: 91.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2022] [Revised: 12/27/2022] [Accepted: 02/16/2023] [Indexed: 03/16/2023] Open
Abstract
AlphaFold2 (AF2) is an artificial intelligence (AI) system developed by DeepMind that can predict three-dimensional (3D) structures of proteins from amino acid sequences with atomic-level accuracy. Protein structure prediction is one of the most challenging problems in computational biology and chemistry, and has puzzled scientists for 50 years. The advent of AF2 presents an unprecedented progress in protein structure prediction and has attracted much attention. Subsequent release of structures of more than 200 million proteins predicted by AF2 further aroused great enthusiasm in the science community, especially in the fields of biology and medicine. AF2 is thought to have a significant impact on structural biology and research areas that need protein structure information, such as drug discovery, protein design, prediction of protein function, et al. Though the time is not long since AF2 was developed, there are already quite a few application studies of AF2 in the fields of biology and medicine, with many of them having preliminarily proved the potential of AF2. To better understand AF2 and promote its applications, we will in this article summarize the principle and system architecture of AF2 as well as the recipe of its success, and particularly focus on reviewing its applications in the fields of biology and medicine. Limitations of current AF2 prediction will also be discussed.
Collapse
Affiliation(s)
- Zhenyu Yang
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China
| | - Xiaoxi Zeng
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
| | - Yi Zhao
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
- Key Laboratory of Intelligent Information Processing, Advanced Computer Research Center, Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Runsheng Chen
- West China Biomedical Big Data Center, West China Hospital, Sichuan University, Chengdu, 610041, China.
- Key Laboratory of RNA Biology, Center for Big Data Research in Health, Institute of Biophysics, Chinese Academy of Sciences, Beijing, 100101, China.
- Pingshan Translational Medicine Center, Shenzhen Bay Laboratory, Shenzhen, 518118, China.
| |
Collapse
|
24
|
Chang L, Perez A. Ranking Peptide Binders by Affinity with AlphaFold. Angew Chem Int Ed Engl 2023; 62:e202213362. [PMID: 36542066 DOI: 10.1002/anie.202213362] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 12/24/2022]
Abstract
AlphaFold has revolutionized structural biology by predicting highly accurate structures of proteins and their complexes with peptides and other proteins. However, for protein-peptide systems, we are also interested in identifying the highest affinity binder among a set of candidate peptides. We present a novel competitive binding assay using AlphaFold to predict structures of the receptor in the presence of two peptides. For systems in which the individual structures of the peptides are well predicted, the assay captures the higher affinity binder in the bound state, and the other peptide in the unbound form with statistical significance. We test the application on six protein receptors for which we have experimental binding affinities to several peptides. We find that the assay is best suited for identifying medium to strong peptide binders that adopt stable secondary structures upon binding.
Collapse
Affiliation(s)
- Liwei Chang
- Department of Chemistry, University of Florida, Gainesville, FL, USA.,Quantum Theory Project, University of Florida, Gainesville, FL, USA
| | - Alberto Perez
- Department of Chemistry, University of Florida, Gainesville, FL, USA.,Quantum Theory Project, University of Florida, Gainesville, FL, USA
| |
Collapse
|
25
|
Zhao H, Zhang H, She Z, Gao Z, Wang Q, Geng Z, Dong Y. Exploring AlphaFold2's Performance on Predicting Amino Acid Side-Chain Conformations and Its Utility in Crystal Structure Determination of B318L Protein. Int J Mol Sci 2023; 24:2740. [PMID: 36769074 PMCID: PMC9916901 DOI: 10.3390/ijms24032740] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Revised: 01/10/2023] [Accepted: 01/12/2023] [Indexed: 02/04/2023] Open
Abstract
Recent technological breakthroughs in machine-learning-based AlphaFold2 (AF2) are pushing the prediction accuracy of protein structures to an unprecedented level that is on par with experimental structural quality. Despite its outstanding structural modeling capability, further experimental validations and performance assessments of AF2 predictions are still required, thus necessitating the development of integrative structural biology in synergy with both computational and experimental methods. Focusing on the B318L protein that plays an essential role in the African swine fever virus (ASFV) for viral replication, we experimentally demonstrate the high quality of the AF2 predicted model and its practical utility in crystal structural determination. Structural alignment implies that the AF2 model shares nearly the same atomic arrangement as the B318L crystal structure except for some flexible and disordered regions. More importantly, side-chain-based analysis at the individual residue level reveals that AF2's performance is likely dependent on the specific amino acid type and that hydrophobic residues tend to be more accurately predicted by AF2 than hydrophilic residues. Quantitative per-residue RMSD comparisons and further molecular replacement trials suggest that AF2 has a large potential to outperform other computational modeling methods in terms of structural determination. Additionally, it is numerically confirmed that the AF2 model is accurate enough so that it may well potentially withstand experimental data quality to a large extent for structural determination. Finally, an overall structural analysis and molecular docking simulation of the B318L protein are performed. Taken together, our study not only provides new insights into AF2's performance in predicting side-chain conformations but also sheds light upon the significance of AF2 in promoting crystal structural determination, especially when the experimental data quality of the protein crystal is poor.
Collapse
Affiliation(s)
- Haifan Zhao
- School of Life Sciences, University of Science and Technology of China, Hefei 230027, China
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
| | - Heng Zhang
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
| | - Zhun She
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
| | - Zengqiang Gao
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
| | - Qi Wang
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhi Geng
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
| | - Yuhui Dong
- Beijing Synchrotron Radiation Facility, Institute of High Energy Physics, Chinese Academy of Sciences, Beijing 100049, China
- University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
26
|
Li EH, Spaman L, Tejero R, Huang YJ, Ramelot TA, Fraga KJ, Prestegard JH, Kennedy MA, Montelione GT. Blind Assessment of Monomeric AlphaFold2 Protein Structure Models with Experimental NMR Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.22.525096. [PMID: 36712039 PMCID: PMC9882346 DOI: 10.1101/2023.01.22.525096] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/24/2023]
Abstract
Recent advances in molecular modeling of protein structures are changing the field of structural biology. AlphaFold-2 (AF2), an AI system developed by DeepMind, Inc., utilizes attention-based deep learning to predict models of protein structures with high accuracy relative to structures determined by X-ray crystallography and cryo-electron microscopy (cryoEM). Comparing AF2 models to structures determined using solution NMR data, both high similarities and distinct differences have been observed. Since AF2 was trained on X-ray crystal and cryoEM structures, we assessed how accurately AF2 can model small, monomeric, solution protein NMR structures which (i) were not used in the AF2 training data set, and (ii) did not have homologous structures in the Protein Data Bank at the time of AF2 training. We identified nine open source protein NMR data sets for such "blind" targets, including chemical shift, raw NMR FID data, NOESY peak lists, and (for 1 case) 15 N- 1 H residual dipolar coupling data. For these nine small (70 - 108 residues) monomeric proteins, we generated AF2 prediction models and assessed how well these models fit to these experimental NMR data, using several well-established NMR structure validation tools. In most of these cases, the AF2 models fit the NMR data nearly as well, or sometimes better than, the corresponding NMR structure models previously deposited in the Protein Data Bank. These results provide benchmark NMR data for assessing new NMR data analysis and protein structure prediction methods. They also document the potential for using AF2 as a guiding tool in protein NMR data analysis, and more generally for hypothesis generation in structural biology research. Highlights AF2 models assessed against NMR data for 9 monomeric proteins not used in training.AF2 models fit NMR data almost as well as the experimentally-determined structures. RPF-DP, PSVS , and PDBStat software provide structure quality and RDC assessment. RPF-DP analysis using AF2 models suggests multiple conformational states.
Collapse
Affiliation(s)
- Ethan H. Li
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Laura Spaman
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Roberto Tejero
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Yuanpeng Janet Huang
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Theresa A. Ramelot
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - Keith J. Fraga
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| | - James H. Prestegard
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602 USA
| | - Michael A. Kennedy
- Department of Chemistry and Biochemistry, Miami University, Oxford, OH 45056 USA
| | - Gaetano T. Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180 USA
| |
Collapse
|