51
|
Park H, Patel P, Haas R, Huerta EA. APACE: AlphaFold2 and advanced computing as a service for accelerated discovery in biophysics. Proc Natl Acad Sci U S A 2024; 121:e2311888121. [PMID: 38913887 PMCID: PMC11228474 DOI: 10.1073/pnas.2311888121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 12/25/2023] [Indexed: 06/26/2024] Open
Abstract
The prediction of protein 3D structure from amino acid sequence is a computational grand challenge in biophysics and plays a key role in robust protein structure prediction algorithms, from drug discovery to genome interpretation. The advent of AI models, such as AlphaFold, is revolutionizing applications that depend on robust protein structure prediction algorithms. To maximize the impact, and ease the usability, of these AI tools we introduce APACE, AlphaFold2 and advanced computing as a service, a computational framework that effectively handles this AI model and its TB-size database to conduct accelerated protein structure prediction analyses in modern supercomputing environments. We deployed APACE in the Delta and Polaris supercomputers and quantified its performance for accurate protein structure predictions using four exemplar proteins: 6AWO, 6OAN, 7MEZ, and 6D6U. Using up to 300 ensembles, distributed across 200 NVIDIA A100 GPUs, we found that APACE is up to two orders of magnitude faster than off-the-self AlphaFold2 implementations, reducing time-to-solution from weeks to minutes. This computational approach may be readily linked with robotics laboratories to automate and accelerate scientific discovery.
Collapse
Affiliation(s)
- Hyun Park
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Theoretical and Computational Biophysics Group, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - Parth Patel
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - Roland Haas
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - E A Huerta
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Department of Computer Science, The University of Chicago, Chicago, IL 60637
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| |
Collapse
|
52
|
Liu T, Gao X, Chen R, Tang K, Liu Z, Wang P, Wang X. A nuclease domain fused to the Snf2 helicase confers antiphage defence in coral-associated Halomonas meridiana. Microb Biotechnol 2024; 17:e14524. [PMID: 38980956 PMCID: PMC11232893 DOI: 10.1111/1751-7915.14524] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/02/2024] [Accepted: 06/26/2024] [Indexed: 07/11/2024] Open
Abstract
The coral reef microbiome plays a vital role in the health and resilience of reefs. Previous studies have examined phage therapy for coral pathogens and for modifying the coral reef microbiome, but defence systems against coral-associated bacteria have received limited attention. Phage defence systems play a crucial role in helping bacteria fight phage infections. In this study, we characterized a new defence system, Hma (HmaA-HmaB-HmaC), in the coral-associated Halomonas meridiana derived from the scleractinian coral Galaxea fascicularis. The Swi2/Snf2 helicase HmaA with a C-terminal nuclease domain exhibits antiviral activity against Escherichia phage T4. Mutation analysis revealed the nickase activity of the nuclease domain (belonging to PDD/EXK superfamily) of HmaA is essential in phage defence. Additionally, HmaA homologues are present in ~1000 bacterial and archaeal genomes. The high frequency of HmaA helicase in Halomonas strains indicates the widespread presence of these phage defence systems, while the insertion of defence genes in the hma region confirms the existence of a defence gene insertion hotspot. These findings offer insights into the diversity of phage defence systems in coral-associated bacteria and these diverse defence systems can be further applied into designing probiotics with high-phage resistance.
Collapse
Affiliation(s)
- Tianlang Liu
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
| | - Xinyu Gao
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
| | - Ran Chen
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
| | - Kaihao Tang
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
- Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou)GuangzhouChina
| | - Ziyao Liu
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
| | - Pengxia Wang
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
- Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou)GuangzhouChina
| | - Xiaoxue Wang
- Key Laboratory of Tropical Marine Bio‐resources and Ecology, Guangdong Key Laboratory of Marine Materia Medica, Innovation Academy of South China Sea Ecology and Environmental EngineeringSouth China Sea Institute of Oceanology, Chinese Academy of SciencesGuangzhouChina
- University of Chinese Academy of SciencesBeijingChina
- Southern Marine Science and Engineering Guangdong Laboratory (Guangzhou)GuangzhouChina
| |
Collapse
|
53
|
Basu S, Subedi U, Tonelli M, Afshinpour M, Tiwari N, Fuentes EJ, Chakravarty S. Assessing the functional roles of coevolving PHD finger residues. Protein Sci 2024; 33:e5065. [PMID: 38923615 PMCID: PMC11201814 DOI: 10.1002/pro.5065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 04/21/2024] [Accepted: 05/16/2024] [Indexed: 06/28/2024]
Abstract
Although in silico folding based on coevolving residue constraints in the deep-learning era has transformed protein structure prediction, the contributions of coevolving residues to protein folding, stability, and other functions in physical contexts remain to be clarified and experimentally validated. Herein, the PHD finger module, a well-known histone reader with distinct subtypes containing subtype-specific coevolving residues, was used as a model to experimentally assess the contributions of coevolving residues and to clarify their specific roles. The results of the assessment, including proteolysis and thermal unfolding of wildtype and mutant proteins, suggested that coevolving residues have varying contributions, despite their large in silico constraints. Residue positions with large constraints were found to contribute to stability in one subtype but not others. Computational sequence design and generative model-based energy estimates of individual structures were also implemented to complement the experimental assessment. Sequence design and energy estimates distinguish coevolving residues that contribute to folding from those that do not. The results of proteolytic analysis of mutations at positions contributing to folding were consistent with those suggested by sequence design and energy estimation. Thus, we report a comprehensive assessment of the contributions of coevolving residues, as well as a strategy based on a combination of approaches that should enable detailed understanding of the residue contributions in other large protein families.
Collapse
Affiliation(s)
- Shraddha Basu
- Department of Chemistry & BiochemistrySouth Dakota State UniversityBrookingsSouth DakotaUSA
| | - Ujwal Subedi
- Department of Chemistry & BiochemistrySouth Dakota State UniversityBrookingsSouth DakotaUSA
| | - Marco Tonelli
- National Magnetic Resonance Facility at Madison (NMRFAM), University of Wisconsin‐MadisonMadisonWisconsinUSA
| | - Maral Afshinpour
- Department of Chemistry & BiochemistrySouth Dakota State UniversityBrookingsSouth DakotaUSA
| | - Nitija Tiwari
- Department of Biochemistry & Molecular BiologyUniversity of IowaIowa CityIowaUSA
| | - Ernesto J. Fuentes
- Department of Biochemistry & Molecular BiologyUniversity of IowaIowa CityIowaUSA
| | - Suvobrata Chakravarty
- Department of Chemistry & BiochemistrySouth Dakota State UniversityBrookingsSouth DakotaUSA
| |
Collapse
|
54
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker GM. Integration of a Randomized Sequence Scanning Approach in AlphaFold2 and Local Frustration Profiling of Conformational States Enable Interpretable Atomistic Characterization of Conformational Ensembles and Detection of Hidden Allosteric States in the ABL1 Protein Kinase. J Chem Theory Comput 2024; 20:5317-5336. [PMID: 38865109 PMCID: PMC12100677 DOI: 10.1021/acs.jctc.4c00222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2024]
Abstract
Despite the success of AlphaFold methods in predicting single protein structures, these methods showed intrinsic limitations in the characterization of multiple functional conformations of allosteric proteins. The recent NMR-based structural determination of the unbound ABL kinase in the active state and discovery of the inactive low-populated functional conformations that are unique for ABL kinase present an ideal challenge for the AlphaFold2 approaches. In the current study, we employ several adaptations of the AlphaFold2 methodology to predict protein conformational ensembles and allosteric states of the ABL kinase including randomized alanine sequence scanning combined with the multiple sequence alignment subsampling proposed in this study. We show that the proposed new AlphaFold2 adaptation combined with local frustration profiling of conformational states enables accurate prediction of the protein kinase structures and conformational ensembles, also offering a robust approach for interpretable characterization of the AlphaFold2 predictions and detection of hidden allosteric states. We found that the large high frustration residue clusters are uniquely characteristic of the low-populated, fully inactive ABL form and can define energetically frustrated cracking sites of conformational transitions, presenting difficult targets for AlphaFold2. The results of this study uncovered previously unappreciated fundamental connections between local frustration profiles of the functional allosteric states and the ability of AlphaFold2 methods to predict protein structural ensembles of the active and inactive states. This study showed that integration of the randomized sequence scanning adaptation of AlphaFold2 with a robust landscape-based analysis allows for interpretable atomistic predictions and characterization of protein conformational ensembles, providing a physical basis for the successes and limitations of current AlphaFold2 methods in detecting functional allosteric states that play a significant role in protein kinase regulation.
Collapse
Affiliation(s)
- Nishank Raisinghani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Mohammed Alshahrani
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Grace Gupta
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
| | - Hao Tian
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Sian Xiao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Peng Tao
- Department of Chemistry, Center for Research Computing, Center for Drug Discovery, Design, and Delivery (CD4), Southern Methodist University, Dallas, Texas 75275, United States
| | - Gennady M Verkhivker
- Keck Center for Science and Engineering, Graduate Program in Computational and Data Sciences, Schmid College of Science and Technology, Chapman University, Orange, California 92866, United States
- Department of Biomedical and Pharmaceutical Sciences, Chapman University School of Pharmacy, Irvine, California 92618, United States
- Department of Pharmacology, Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093, United States
| |
Collapse
|
55
|
Urvas L, Chiesa L, Bret G, Jacquemard C, Kellenberger E. Benchmarking AlphaFold-Generated Structures of Chemokine-Chemokine Receptor Complexes. J Chem Inf Model 2024; 64:4587-4600. [PMID: 38809680 DOI: 10.1021/acs.jcim.3c01835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/31/2024]
Abstract
AlphaFold and AlphaFold-Multimer have become two essential tools for the modeling of unknown structures of proteins and protein complexes. In this work, we extensively benchmarked the quality of chemokine-chemokine receptor structures generated by AlphaFold-Multimer against experimentally determined structures. Our analysis considered both the global quality of the model, as well as key structural features for chemokine recognition. To study the effects of template and multiple sequence alignment parameters on the results, a new prediction pipeline called LIT-AlphaFold (https://github.com/LIT-CCM-lab/LIT-AlphaFold) was developed, allowing extensive input customization. AlphaFold-Multimer correctly predicted differences in chemokine binding orientation and accurately reproduced the unique binding orientation of the CXCL12-ACKR3 complex. Further, the predictions of the full receptor N-terminus provided insights into a putative chemokine recognition site 0.5. The accuracy of chemokine N-terminus binding mode prediction varied between complexes, but the confidence score permitted the distinguishing of residues that were very likely well positioned. Finally, we generated a high-confidence model of the unsolved CXCL12-CXCR4 complex, which agreed with experimental mutagenesis and cross-linking data.
Collapse
Affiliation(s)
- Lauri Urvas
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Luca Chiesa
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Guillaume Bret
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Célien Jacquemard
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| | - Esther Kellenberger
- Laboratoire d'Innovation Thérapeutique, UMR 7200 CNRS, Université de Strasbourg, 67400 Illkirch, France
| |
Collapse
|
56
|
Pflüger T, Gschell M, Zhang L, Shnitsar V, Zabadné AJ, Zierep P, Günther S, Einsle O, Andrade SLA. How sensor Amt-like proteins integrate ammonium signals. SCIENCE ADVANCES 2024; 10:eadm9441. [PMID: 38838143 DOI: 10.1126/sciadv.adm9441] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 04/30/2024] [Indexed: 06/07/2024]
Abstract
Unlike aquaporins or potassium channels, ammonium transporters (Amts) uniquely discriminate ammonium from potassium and water. This feature has certainly contributed to their repurposing as ammonium receptors during evolution. Here, we describe the ammonium receptor Sd-Amt1, where an Amt module connects to a cytoplasmic diguanylate cyclase transducer module via an HAMP domain. Structures of the protein with and without bound ammonium were determined to 1.7- and 1.9-Ångstrom resolution, depicting the ON and OFF states of the receptor and confirming the presence of a binding site for two ammonium cations that is pivotal for signal perception and receptor activation. The transducer domain was disordered in the crystals, and an AlphaFold2 prediction suggests that the helices linking both domains are flexible. While the sensor domain retains the trimeric fold formed by all Amt family members, the HAMP domains interact as pairs and serve to dimerize the transducer domain upon activation.
Collapse
Affiliation(s)
- Tobias Pflüger
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
| | - Mathias Gschell
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
| | - Lin Zhang
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
| | - Volodymyr Shnitsar
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
| | - Annas J Zabadné
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
| | - Paul Zierep
- Faculty of Chemistry and Pharmacy, Institute for Pharmaceutical Sciences, University Freiburg, Hermann-Herder-Str. 9, 79104 Freiburg, Germany
| | - Stefan Günther
- Faculty of Chemistry and Pharmacy, Institute for Pharmaceutical Sciences, University Freiburg, Hermann-Herder-Str. 9, 79104 Freiburg, Germany
| | - Oliver Einsle
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
- BIOSS Centre for Biological Signaling Studies, University Freiburg, Schänzlerstr. 1, 79104 Freiburg, Germany
| | - Susana L A Andrade
- Faculty of Chemistry and Pharmacy, Institute for Biochemistry, University Freiburg, Albertstr. 21, 79104 Freiburg, Germany
- BIOSS Centre for Biological Signaling Studies, University Freiburg, Schänzlerstr. 1, 79104 Freiburg, Germany
| |
Collapse
|
57
|
Jiang T, Wan G, Zhang H, Gyawali YP, Underbakke ES, Feng C. Mapping the Intersubunit Interdomain FMN-Heme Interactions in Neuronal Nitric Oxide Synthase by Targeted Quantitative Cross-Linking Mass Spectrometry. Biochemistry 2024; 63:1395-1411. [PMID: 38747545 PMCID: PMC11893013 DOI: 10.1021/acs.biochem.4c00157] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Nitric oxide synthase (NOS) in mammals is a family of multidomain proteins in which interdomain electron transfer (IET) is controlled by domain-domain interactions. Calmodulin (CaM) binds to the canonical CaM-binding site in the linker region between the FMN and heme domains of NOS and allows tethered FMN domain motions, enabling an intersubunit FMN-heme IET in the output state for NO production. Our previous cross-linking mass spectrometric (XL MS) results demonstrated site-specific protein dynamics in the CaM-responsive regions of rat neuronal NOS (nNOS) reductase construct, a monomeric protein [Jiang et al., Biochemistry, 2023, 62, 2232-2237]. In this work, we have extended our combined approach of XL MS structural mapping and AlphaFold structural prediction to examine the homodimeric nNOS oxygenase/FMN (oxyFMN) construct, an established model of the NOS output state. We employed parallel reaction monitoring (PRM) based quantitative XL MS (qXL MS) to assess the CaM-induced changes in interdomain dynamics and interactions. Intersubunit cross-links were identified by mapping the cross-links onto top AlphaFold structural models, which was complemented by comparing their relative abundances in the cross-linked dimeric and monomeric bands. Furthermore, contrasting the CaM-free and CaM-bound nNOS samples shows that CaM enables the formation of the intersubunit FMN-heme docking complex and that CaM binding induces extensive, allosteric conformational changes across the NOS regions. Moreover, the observed cross-links sites specifically respond to changes in ionic strength. This indicates that interdomain salt bridges are responsible for stabilizing and orienting the output state for efficient FMN-heme IET. Taken together, our targeted qXL MS results have revealed that CaM and ionic strength modulate specific dynamic changes in the CaM/FMN/heme complexes, particularly in the context of intersubunit interdomain FMN-heme interactions.
Collapse
Affiliation(s)
- Ting Jiang
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Guanghua Wan
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Haikun Zhang
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Yadav Prasad Gyawali
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
| | - Eric S Underbakke
- Roy J. Carver Department of Biochemistry, Biophysics and Molecular Biology, Iowa State University, Ames, Iowa 50011, United States
| | - Changjian Feng
- Department of Pharmaceutical Sciences, College of Pharmacy, University of New Mexico, Albuquerque, New Mexico 87131, United States
- Department of Chemistry and Chemical Biology, University of New Mexico, Albuquerque, New Mexico 87131, United States
| |
Collapse
|
58
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker G. Prediction of Conformational Ensembles and Structural Effects of State-Switching Allosteric Mutants in the Protein Kinases Using Comparative Analysis of AlphaFold2 Adaptations with Sequence Masking and Shallow Subsampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.17.594786. [PMID: 38798650 PMCID: PMC11118581 DOI: 10.1101/2024.05.17.594786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/29/2024]
Abstract
Despite the success of AlphaFold2 approaches in predicting single protein structures, these methods showed intrinsic limitations in predicting multiple functional conformations of allosteric proteins and have been challenged to accurately capture of the effects of single point mutations that induced significant structural changes. We systematically examined several implementations of AlphaFold2 methods to predict conformational ensembles for state-switching mutants of the ABL kinase. The results revealed that a combination of randomized alanine sequence masking with shallow multiple sequence alignment subsampling can significantly expand the conformational diversity of the predicted structural ensembles and capture shifts in populations of the active and inactive ABL states. Consistent with the NMR experiments, the predicted conformational ensembles for M309L/L320I and M309L/H415P ABL mutants that perturb the regulatory spine networks featured the increased population of the fully closed inactive state. On the other hand, the predicted conformational ensembles for the G269E/M309L/T334I and M309L/L320I/T334I triple ABL mutants that share activating T334I gate-keeper substitution are dominated by the active ABL form. The proposed adaptation of AlphaFold can reproduce the experimentally observed mutation-induced redistributions in the relative populations of the active and inactive ABL states and capture the effects of regulatory mutations on allosteric structural rearrangements of the kinase domain. The ensemble-based network analysis complemented AlphaFold predictions by revealing allosteric mediating centers that often directly correspond to state-switching mutational sites or reside in their immediate local structural proximity, which may explain the global effect of regulatory mutations on structural changes between the ABL states. This study suggested that attention-based learning of long-range dependencies between sequence positions in homologous folds and deciphering patterns of allosteric interactions may further augment the predictive abilities of AlphaFold methods for modeling of alternative protein sates, conformational ensembles and mutation-induced structural transformations.
Collapse
|
59
|
Renthal R. Arthropod repellent interactions with olfactory receptors and ionotropic receptors analyzed by molecular modeling. CURRENT RESEARCH IN INSECT SCIENCE 2024; 5:100082. [PMID: 38765913 PMCID: PMC11101704 DOI: 10.1016/j.cris.2024.100082] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 04/29/2024] [Accepted: 05/01/2024] [Indexed: 05/22/2024]
Abstract
The main insect chemoreceptors are olfactory receptors (ORs), gustatory receptors (GRs) and ionotropic receptors (IRs). The odorant binding sites of many insect ORs appear to be occluded and inaccessible from the surface of the receptor protein, based on the three-dimensional structure of OR5 from the jumping bristletail Machilis hrabei (MhraOR5) and a survey of a sample of vinegar fly (Drosophila melanogaster) OR structures obtained from artificial intellegence (A.I.) modeling. Molecular dynamics simulations revealed that the occluded site can become accessible through tunnels that transiently open and close. The present study extends this analysis to examine seventeen ORs and one GR docking with ligands that have known valence: nine that signal attraction and nine that signal aversion. All but one of the receptors displayed occluded ligand binding sites analogous to MhraOR5, and docking software predicted the known attractant and repellent ligands will bind to the occluded sites. Docking of the repellent DEET was examined, and more than half of the OR ligand sites were predicted to bind DEET, including receptors that signal aversion as well as those that signal attraction. However, DEET may not actually have access to all the attractant binding sites. The larger size and lower flexibility of repellent molecules may restrict their passage through the tunnel bottlenecks, which could act as filters to select access to the ligand binding sites. In contrast to ORs and GRs, the IR ligand binding site is in an extracellular domain known to undergo a large conformational change from an open to a closed state. A.I. models of two D. melanogaster IRs of known valence and two blacklegged tick (Ixodes scapularis) IRs having unknown ligands were computationally tested for attractant and repellent binding. The ligand-binding sites in the closed state appear inaccessible to the protein surface, so attractants and repellents must bind initially at an accessible site in the open state before triggering the conformational change. In some IRs, repellent binding sites were identified at exterior sites adjacent to the ligand-binding site. These may be allosteric sites that, when occupied by repellents, can stabilize the open state of an attractant IR, or stabilize the closed state of an IR in the absence of its activating ligand. The model of D. melanogaster IR64a suggests a possible molecular mechanism for the activation of this IR by H+. The amino acids involved in this proposed mechanism are conserved in IR64a from several Dipteran pest species and disease vectors, potentially offering a route to discovery of new repellents that act via the allosteric site.
Collapse
Affiliation(s)
- Robert Renthal
- Department of Neuroscience, Developmental and Regenerative Biology, University of Texas at San Antonio, San Antonio, TX, United States
| |
Collapse
|
60
|
Ellaway JIJ, Anyango S, Nair S, Zaki HA, Nadzirin N, Powell HR, Gutmanas A, Varadi M, Velankar S. Identifying protein conformational states in the Protein Data Bank: Toward unlocking the potential of integrative dynamics studies. STRUCTURAL DYNAMICS (MELVILLE, N.Y.) 2024; 11:034701. [PMID: 38774441 PMCID: PMC11106648 DOI: 10.1063/4.0000251] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 05/08/2024] [Indexed: 05/24/2024]
Abstract
Studying protein dynamics and conformational heterogeneity is crucial for understanding biomolecular systems and treating disease. Despite the deposition of over 215 000 macromolecular structures in the Protein Data Bank and the advent of AI-based structure prediction tools such as AlphaFold2, RoseTTAFold, and ESMFold, static representations are typically produced, which fail to fully capture macromolecular motion. Here, we discuss the importance of integrating experimental structures with computational clustering to explore the conformational landscapes that manifest protein function. We describe the method developed by the Protein Data Bank in Europe - Knowledge Base to identify distinct conformational states, demonstrate the resource's primary use cases, through examples, and discuss the need for further efforts to annotate protein conformations with functional information. Such initiatives will be crucial in unlocking the potential of protein dynamics data, expediting drug discovery research, and deepening our understanding of macromolecular mechanisms.
Collapse
Affiliation(s)
- Joseph I. J. Ellaway
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Stephen Anyango
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sreenath Nair
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Hossam A. Zaki
- The Warren Alpert Medical School of Brown University, Providence, Rhode Island 02903, USA
| | - Nurul Nadzirin
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Harold R. Powell
- Imperial College London, Department of Life Sciences, London, United Kingdom
| | - Aleksandras Gutmanas
- WaveBreak Therapeutics Ltd., Clarendon House, Clarendon Road, Cambridge, United Kingdom
| | - Mihaly Varadi
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| | - Sameer Velankar
- Protein Data Bank in Europe, European Bioinformatics Institute, Hinxton, United Kingdom
| |
Collapse
|
61
|
Vallat B, Berman HM. Structural highlights of macromolecular complexes and assemblies. Curr Opin Struct Biol 2024; 85:102773. [PMID: 38271778 DOI: 10.1016/j.sbi.2023.102773] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 12/22/2023] [Accepted: 12/26/2023] [Indexed: 01/27/2024]
Abstract
The structures of macromolecular assemblies have given us deep insights into cellular processes and have profoundly impacted biological research and drug discovery. We highlight the structures of macromolecular assemblies that have been modeled using integrative and computational methods and describe how open access to these structures from structural archives has empowered the research community. The arsenal of experimental and computational methods for structure determination ensures a future where whole organelles and cells can be modeled.
Collapse
Affiliation(s)
- Brinda Vallat
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Cancer Institute of New Jersey, Rutgers, The State University of New Jersey, New Brunswick, NJ 08901, USA.
| | - Helen M Berman
- Research Collaboratory for Structural Bioinformatics Protein Data Bank and the Institute for Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ 08854, USA; Department of Quantitative and Computational Biology, University of Southern California, Los Angeles CA 90089, USA
| |
Collapse
|
62
|
Monté D, Lens Z, Dewitte F, Villeret V, Verger A. Assessment of machine-learning predictions for the Mediator complex subunit MED25 ACID domain interactions with transactivation domains. FEBS Lett 2024; 598:758-773. [PMID: 38436147 DOI: 10.1002/1873-3468.14837] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2023] [Revised: 02/01/2024] [Accepted: 02/10/2024] [Indexed: 03/05/2024]
Abstract
The human Mediator complex subunit MED25 binds transactivation domains (TADs) present in various cellular and viral proteins using two binding interfaces, named H1 and H2, which are found on opposite sides of its ACID domain. Here, we use and compare deep learning methods to characterize human MED25-TAD interfaces and assess the predicted models to published experimental data. For the H1 interface, AlphaFold produces predictions with high-reliability scores that agree well with experimental data, while the H2 interface predictions appear inconsistent, preventing reliable binding modes. Despite these limitations, we experimentally assess the validity of MED25 interface predictions with the viral transcriptional activators Lana-1 and IE62. AlphaFold predictions also suggest the existence of a unique hydrophobic pocket for the Arabidopsis MED25 ACID domain.
Collapse
Affiliation(s)
- Didier Monté
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Zoé Lens
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Frédérique Dewitte
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Vincent Villeret
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| | - Alexis Verger
- CNRS EMR 9002 Integrative Structural Biology, Inserm U 1167 - RID-AGE, Univ. Lille, CHU Lille, Institut Pasteur de Lille, France
| |
Collapse
|
63
|
Marmorale LJ, Jin H, Reidy TG, Palomino-Alonso B, Zysnarski CJ, Jordan-Javed F, Lahiri S, Duncan MC. Fast-evolving cofactors regulate the role of HEATR5 complexes in intra-Golgi trafficking. J Cell Biol 2024; 223:e202309047. [PMID: 38240799 PMCID: PMC10798858 DOI: 10.1083/jcb.202309047] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2023] [Revised: 11/22/2023] [Accepted: 12/18/2023] [Indexed: 01/22/2024] Open
Abstract
The highly conserved HEATR5 proteins are best known for their roles in membrane traffic mediated by the adaptor protein complex-1 (AP1). HEATR5 proteins rely on fast-evolving cofactors to bind to AP1. However, how HEATR5 proteins interact with these cofactors is unknown. Here, we report that the budding yeast HEATR5 protein, Laa1, functions in two biochemically distinct complexes. These complexes are defined by a pair of mutually exclusive Laa1-binding proteins, Laa2 and the previously uncharacterized Lft1/Yml037c. Despite limited sequence similarity, biochemical analysis and structure predictions indicate that Lft1 and Laa2 bind Laa1 via structurally similar mechanisms. Both Laa1 complexes function in intra-Golgi recycling. However, only the Laa2-Laa1 complex binds to AP1 and contributes to its localization. Finally, structure predictions indicate that human HEATR5 proteins bind to a pair of fast-evolving interacting partners via a mechanism similar to that observed in yeast. These results reveal mechanistic insight into how HEATR5 proteins bind their cofactors and indicate that Laa1 performs functions besides recruiting AP1.
Collapse
Affiliation(s)
- Lucas J. Marmorale
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Huan Jin
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Thomas G. Reidy
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Brandon Palomino-Alonso
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Christopher J. Zysnarski
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Fatima Jordan-Javed
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Sagar Lahiri
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| | - Mara C. Duncan
- Department of Cell and Developmental Biology, University of Michigan, Ann Arbor, Ann Arbor, MI, USA
| |
Collapse
|
64
|
Jänes J, Beltrao P. Deep learning for protein structure prediction and design-progress and applications. Mol Syst Biol 2024; 20:162-169. [PMID: 38291232 PMCID: PMC10912668 DOI: 10.1038/s44320-024-00016-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2023] [Revised: 12/21/2023] [Accepted: 01/11/2024] [Indexed: 02/01/2024] Open
Abstract
Proteins are the key molecular machines that orchestrate all biological processes of the cell. Most proteins fold into three-dimensional shapes that are critical for their function. Studying the 3D shape of proteins can inform us of the mechanisms that underlie biological processes in living cells and can have practical applications in the study of disease mutations or the discovery of novel drug treatments. Here, we review the progress made in sequence-based prediction of protein structures with a focus on applications that go beyond the prediction of single monomer structures. This includes the application of deep learning methods for the prediction of structures of protein complexes, different conformations, the evolution of protein structures and the application of these methods to protein design. These developments create new opportunities for research that will have impact across many areas of biomedical research.
Collapse
Affiliation(s)
- Jürgen Jänes
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pedro Beltrao
- Institute of Molecular Systems Biology, ETH Zürich, 8093, Zürich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
| |
Collapse
|
65
|
Smith MD, Darryl Quarles L, Demerdash O, Smith JC. Drugging the entire human proteome: Are we there yet? Drug Discov Today 2024; 29:103891. [PMID: 38246414 DOI: 10.1016/j.drudis.2024.103891] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2023] [Revised: 01/12/2024] [Accepted: 01/16/2024] [Indexed: 01/23/2024]
Abstract
Each of the ∼20,000 proteins in the human proteome is a potential target for compounds that bind to it and modify its function. The 3D structures of most of these proteins are now available. Here, we discuss the prospects for using these structures to perform proteome-wide virtual HTS (VHTS). We compare physics-based (docking) and AI VHTS approaches, some of which are now being applied with large databases of compounds to thousands of targets. Although preliminary proteome-wide screens are now within our grasp, further methodological developments are expected to improve the accuracy of the results.
Collapse
Affiliation(s)
- Micholas Dean Smith
- University of Tennessee/Oak Ridge National Laboratory Center for Molecular Biophysics, Oak Ridge, TN 37830, USA; Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA
| | - L Darryl Quarles
- Departments of Medicine, University of Tennessee Health Science Center, Memphis, TN 38163, USA; ORRxD LLC, 3404 Olney Drive, Durham, NC 27705, USA
| | - Omar Demerdash
- Biosciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37830, USA
| | - Jeremy C Smith
- University of Tennessee/Oak Ridge National Laboratory Center for Molecular Biophysics, Oak Ridge, TN 37830, USA; Department of Biochemistry and Cellular and Molecular Biology, University of Tennessee, Knoxville, TN 37996, USA.
| |
Collapse
|
66
|
Raisinghani N, Alshahrani M, Gupta G, Tian H, Xiao S, Tao P, Verkhivker G. Interpretable Atomistic Prediction and Functional Analysis of Conformational Ensembles and Allosteric States in Protein Kinases Using AlphaFold2 Adaptation with Randomized Sequence Scanning and Local Frustration Profiling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.15.580591. [PMID: 38496487 PMCID: PMC10942451 DOI: 10.1101/2024.02.15.580591] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/19/2024]
Abstract
The groundbreaking achievements of AlphaFold2 (AF2) approaches in protein structure modeling marked a transformative era in structural biology. Despite the success of AF2 tools in predicting single protein structures, these methods showed intrinsic limitations in predicting multiple functional conformations of allosteric proteins and fold-switching systems. The recent NMR-based structural determination of the unbound ABL kinase in the active state and two inactive low-populated functional conformations that are unique for ABL kinase presents an ideal challenge for AF2 approaches. In the current study we employ several implementations of AF2 methods to predict protein conformational ensembles and allosteric states of the ABL kinase including (a) multiple sequence alignments (MSA) subsampling approach; (b) SPEACH_AF approach in which alanine scanning is performed on generated MSAs; and (c) introduced in this study randomized full sequence mutational scanning for manipulation of sequence variations combined with the MSA subsampling. We show that the proposed AF2 adaptation combined with local frustration mapping of conformational states enable accurate prediction of the ABL active and intermediate structures and conformational ensembles, also offering a robust approach for interpretable characterization of the AF2 predictions and limitations in detecting hidden allosteric states. We found that the large high frustration residue clusters are uniquely characteristic of the low-populated, fully inactive ABL form and can define energetically frustrated cracking sites of conformational transitions, presenting difficult targets for AF2 methods. This study uncovered previously unappreciated, fundamental connections between distinct patterns of local frustration in functional kinase states and AF2 successes/limitations in detecting low-populated frustrated conformations, providing a better understanding of benefits and limitations of current AF2-based adaptations in modeling of conformational ensembles.
Collapse
|
67
|
Guo D, De Sciscio ML, Chi-Fung Ng J, Fraternali F. Modelling the assembly and flexibility of antibody structures. Curr Opin Struct Biol 2024; 84:102757. [PMID: 38118364 DOI: 10.1016/j.sbi.2023.102757] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/29/2023] [Accepted: 11/30/2023] [Indexed: 12/22/2023]
Abstract
Antibodies are large protein assemblies capable of both specifically recognising antigens and engaging with other proteins and receptors to coordinate immune action. Traditionally, structural studies have been dedicated to antibody variable regions, but efforts to determine and model full-length antibody structures are emerging. Here we review the current knowledge on modelling the structures of antibody assemblies, focusing on their conformational flexibility and the challenge this poses to obtaining and evaluating structural models. Integrative modelling approaches, combining experiments (cryo-electron microscopy, mass spectrometry, etc.) and computational methods (molecular dynamics simulations, deep-learning based approaches, etc.), hold the promise to map the complex conformational landscape of full-length antibody structures.
Collapse
Affiliation(s)
- Dongjun Guo
- Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower Street, London, WC1E 6BT, United Kingdom; Randall Centre for Cell & Molecular Biophysics, King's College London, New Hunt's House, Guy's Campus, London, SE1 1UL, United Kingdom
| | - Maria Laura De Sciscio
- Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower Street, London, WC1E 6BT, United Kingdom; Department of Chemistry, Sapienza University of Rome, P.le A. Moro 5, Rome, 00185, Italy
| | - Joseph Chi-Fung Ng
- Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower Street, London, WC1E 6BT, United Kingdom
| | - Franca Fraternali
- Institute of Structural and Molecular Biology, University College London, Darwin Building, Gower Street, London, WC1E 6BT, United Kingdom.
| |
Collapse
|
68
|
Choudhury J, Yonezawa K, Anu A, Shimizu N, Chaudhuri B. SAXS/WAXS data of conformationally flexible ribose binding protein. Data Brief 2024; 52:109932. [PMID: 38178847 PMCID: PMC10764985 DOI: 10.1016/j.dib.2023.109932] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/03/2023] [Accepted: 12/05/2023] [Indexed: 01/06/2024] Open
Abstract
Modern artificial intelligence-based protein structure prediction methods, such as Alphafold2, can predict structures of folded proteins with reasonable accuracy. However, Alphafold2 provides a static view of a protein, which does not show the conformational variability of the protein, domain movement in a multi-domain protein, or ligand-induced conformational changes it might undergo in solution. Small-angle X-ay scattering (SAXS) and wide-angle X-ray scattering (WAXS) are solution techniques that can aid in integrative modeling of conformationally flexible proteins, or in validating their predicted ensemble structures. While SAXS is sensitive to global structural features, WAXS can expand the scope of structural modeling by including information about local structural changes. We present SAXS and WAXS datasets obtained from conformationally flexible d-ribose binding protein (RBP) from Escherichia coli in the ribose bound and unbound forms. SAXS/WAXS datasets of RBP provided here may aid in method development efforts for more accurate prediction of structural ensembles of conformationally flexible proteins, and their conformational changes.
Collapse
Affiliation(s)
- Jagrity Choudhury
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
| | - Kento Yonezawa
- Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
| | - Anu Anu
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi 110001, India
| | - Nobutaka Shimizu
- Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
- Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki 305-0801, Japan
| | - Barnali Chaudhuri
- GN Ramachandran Protein Center, CSIR Institute of Microbial Technology, Chandigarh 160036, India
- Academy of Scientific and Innovative Research (AcSIR), Anusandhan Bhawan, 2 Rafi Marg, New Delhi 110001, India
| |
Collapse
|
69
|
Rosignoli S, Lustrino E, Di Silverio I, Paiardini A. Making Use of Averaging Methods in MODELLER for Protein Structure Prediction. Int J Mol Sci 2024; 25:1731. [PMID: 38339009 PMCID: PMC10855553 DOI: 10.3390/ijms25031731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2023] [Revised: 01/23/2024] [Accepted: 01/29/2024] [Indexed: 02/12/2024] Open
Abstract
Recent advances in protein structure prediction, driven by AlphaFold 2 and machine learning, demonstrate proficiency in static structures but encounter challenges in capturing essential dynamic features crucial for understanding biological function. In this context, homology-based modeling emerges as a cost-effective and computationally efficient alternative. The MODELLER (version 10.5, accessed on 30 November 2023) algorithm can be harnessed for this purpose since it computes intermediate models during simulated annealing, enabling the exploration of attainable configurational states and energies while minimizing its objective function. There have been a few attempts to date to improve the models generated by its algorithm, and in particular, there is no literature regarding the implementation of an averaging procedure involving the intermediate models in the MODELLER algorithm. In this study, we examined MODELLER's output using 225 target-template pairs, extracting the best representatives of intermediate models. Applying an averaging procedure to the selected intermediate structures based on statistical potentials, we aimed to determine: (1) whether averaging improves the quality of structural models during the building phase; (2) if ranking by statistical potentials reliably selects the best models, leading to improved final model quality; (3) whether using a single template versus multiple templates affects the averaging approach; (4) whether the "ensemble" nature of the MODELLER building phase can be harnessed to capture low-energy conformations in holo structures modeling. Our findings indicate that while improvements typically fall short of a few decimal points in the model evaluation metric, a notable fraction of configurations exhibit slightly higher similarity to the native structure than MODELLER's proposed final model. The averaging-building procedure proves particularly beneficial in (1) regions of low sequence identity between the target and template(s), the most challenging aspect of homology modeling; (2) holo protein conformations generation, an area in which MODELLER and related tools usually fall short of the expected performance.
Collapse
Affiliation(s)
| | | | | | - Alessandro Paiardini
- Department of Biochemical Sciences, Sapienza University of Rome, 00185 Rome, Italy; (S.R.); (E.L.); (I.D.S.)
| |
Collapse
|
70
|
Ohnuki J, Jaunet-Lahary T, Yamashita A, Okazaki KI. Accelerated Molecular Dynamics and AlphaFold Uncover a Missing Conformational State of Transporter Protein OxlT. J Phys Chem Lett 2024; 15:725-732. [PMID: 38215403 DOI: 10.1021/acs.jpclett.3c03052] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2024]
Abstract
Transporter proteins change their conformations to carry their substrate across the cell membrane. The conformational dynamics is vital to understanding the transport function. We have studied the oxalate transporter (OxlT), an oxalate:formate antiporter from Oxalobacter formigenes, significant in avoiding kidney stone formation. The atomic structure of OxlT has been recently solved in the outward-open and occluded states. However, the inward-open conformation is still missing, hindering a complete understanding of the transporter. Here, we performed a Gaussian accelerated molecular dynamics simulation to sample the extensive conformational space of OxlT and successfully predicted the inward-open conformation where cytoplasmic substrate formate binding was preferred over oxalate binding. We also identified critical interactions for the inward-open conformation. The results were complemented by an AlphaFold2 structure prediction. Although AlphaFold2 solely predicted OxlT in the outward-open conformation, mutation of the identified critical residues made it partly predict the inward-open conformation, identifying possible state-shifting mutations.
Collapse
Affiliation(s)
- Jun Ohnuki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| | - Titouan Jaunet-Lahary
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
| | - Atsuko Yamashita
- Graduate School of Medicine, Dentistry and Pharmaceutical Sciences, Okayama University, Okayama 700-8530, Japan
| | - Kei-Ichi Okazaki
- Research Center for Computational Science, Institute for Molecular Science, National Institutes of Natural Sciences, Okazaki 444-8585, Japan
- Graduate Institute for Advanced Studies, SOKENDAI, Okazaki, Aichi 444-8585, Japan
| |
Collapse
|
71
|
Bret H, Gao J, Zea DJ, Andreani J, Guerois R. From interaction networks to interfaces, scanning intrinsically disordered regions using AlphaFold2. Nat Commun 2024; 15:597. [PMID: 38238291 PMCID: PMC10796318 DOI: 10.1038/s41467-023-44288-7] [Citation(s) in RCA: 36] [Impact Index Per Article: 36.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 12/07/2023] [Indexed: 01/22/2024] Open
Abstract
The revolution brought about by AlphaFold2 opens promising perspectives to unravel the complexity of protein-protein interaction networks. The analysis of interaction networks obtained from proteomics experiments does not systematically provide the delimitations of the interaction regions. This is of particular concern in the case of interactions mediated by intrinsically disordered regions, in which the interaction site is generally small. Using a dataset of protein-peptide complexes involving intrinsically disordered regions that are non-redundant with the structures used in AlphaFold2 training, we show that when using the full sequences of the proteins, AlphaFold2-Multimer only achieves 40% success rate in identifying the correct site and structure of the interface. By delineating the interaction region into fragments of decreasing size and combining different strategies for integrating evolutionary information, we manage to raise this success rate up to 90%. We obtain similar success rates using a much larger dataset of protein complexes taken from the ELM database. Beyond the correct identification of the interaction site, our study also explores specificity issues. We show the advantages and limitations of using the AlphaFold2 confidence score to discriminate between alternative binding partners, a task that can be particularly challenging in the case of small interaction motifs.
Collapse
Affiliation(s)
- Hélène Bret
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jinmei Gao
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Diego Javier Zea
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| | - Raphaël Guerois
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198, Gif-sur-Yvette, France.
| |
Collapse
|
72
|
Versini R, Sritharan S, Aykac Fas B, Tubiana T, Aimeur SZ, Henri J, Erard M, Nüsse O, Andreani J, Baaden M, Fuchs P, Galochkina T, Chatzigoulas A, Cournia Z, Santuz H, Sacquin-Mora S, Taly A. A Perspective on the Prospective Use of AI in Protein Structure Prediction. J Chem Inf Model 2024; 64:26-41. [PMID: 38124369 DOI: 10.1021/acs.jcim.3c01361] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2023]
Abstract
AlphaFold2 (AF2) and RoseTTaFold (RF) have revolutionized structural biology, serving as highly reliable and effective methods for predicting protein structures. This article explores their impact and limitations, focusing on their integration into experimental pipelines and their application in diverse protein classes, including membrane proteins, intrinsically disordered proteins (IDPs), and oligomers. In experimental pipelines, AF2 models help X-ray crystallography in resolving the phase problem, while complementarity with mass spectrometry and NMR data enhances structure determination and protein flexibility prediction. Predicting the structure of membrane proteins remains challenging for both AF2 and RF due to difficulties in capturing conformational ensembles and interactions with the membrane. Improvements in incorporating membrane-specific features and predicting the structural effect of mutations are crucial. For intrinsically disordered proteins, AF2's confidence score (pLDDT) serves as a competitive disorder predictor, but integrative approaches including molecular dynamics (MD) simulations or hydrophobic cluster analyses are advocated for accurate dynamics representation. AF2 and RF show promising results for oligomeric models, outperforming traditional docking methods, with AlphaFold-Multimer showing improved performance. However, some caveats remain in particular for membrane proteins. Real-life examples demonstrate AF2's predictive capabilities in unknown protein structures, but models should be evaluated for their agreement with experimental data. Furthermore, AF2 models can be used complementarily with MD simulations. In this Perspective, we propose a "wish list" for improving deep-learning-based protein folding prediction models, including using experimental data as constraints and modifying models with binding partners or post-translational modifications. Additionally, a meta-tool for ranking and suggesting composite models is suggested, driving future advancements in this rapidly evolving field.
Collapse
Affiliation(s)
- Raphaelle Versini
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Sujith Sritharan
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Burcu Aykac Fas
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Thibault Tubiana
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Sana Zineb Aimeur
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Julien Henri
- Sorbonne Université, CNRS, Laboratoire de Biologie, Computationnelle et Quantitative UMR 7238, Institut de Biologie Paris-Seine, 4 Place Jussieu, F-75005 Paris, France
| | - Marie Erard
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Oliver Nüsse
- Université Paris-Saclay, CNRS, Institut de Chimie Physique, 91405 Orsay, France
| | - Jessica Andreani
- Université Paris-Saclay, CEA, CNRS, Institute for Integrative Biology of the Cell (I2BC), 91198 Gif-sur-Yvette, France
| | - Marc Baaden
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Patrick Fuchs
- Sorbonne Université, École Normale Supérieure, PSL University, CNRS, Laboratoire des Biomolécules, LBM, 75005 Paris, France
- Université de Paris, UFR Sciences du Vivant, 75013 Paris, France
| | - Tatiana Galochkina
- Université Paris Cité and Université des Antilles and Université de la Réunion, INSERM, BIGR, F-75014 Paris, France
| | - Alexios Chatzigoulas
- Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
| | - Zoe Cournia
- Biomedical Research Foundation, Academy of Athens, 11527 Athens, Greece
- Department of Informatics and Telecommunications, National and Kapodistrian University of Athens, 15784 Athens, Greece
| | - Hubert Santuz
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Sophie Sacquin-Mora
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| | - Antoine Taly
- Laboratoire de Biochimie Théorique, CNRS (UPR9080), Université Paris Cité, F-75005 Paris, France
| |
Collapse
|
73
|
Peng CX, Liang F, Xia YH, Zhao KL, Hou MH, Zhang GJ. Recent Advances and Challenges in Protein Structure Prediction. J Chem Inf Model 2024; 64:76-95. [PMID: 38109487 DOI: 10.1021/acs.jcim.3c01324] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2023]
Abstract
Artificial intelligence has made significant advances in the field of protein structure prediction in recent years. In particular, DeepMind's end-to-end model, AlphaFold2, has demonstrated the capability to predict three-dimensional structures of numerous unknown proteins with accuracy levels comparable to those of experimental methods. This breakthrough has opened up new possibilities for understanding protein structure and function as well as accelerating drug discovery and other applications in the field of biology and medicine. Despite the remarkable achievements of artificial intelligence in the field, there are still some challenges and limitations. In this Review, we discuss the recent progress and some of the challenges in protein structure prediction. These challenges include predicting multidomain protein structures, protein complex structures, multiple conformational states of proteins, and protein folding pathways. Furthermore, we highlight directions in which further improvements can be conducted.
Collapse
Affiliation(s)
- Chun-Xiang Peng
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Fang Liang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Yu-Hao Xia
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Kai-Long Zhao
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Ming-Hua Hou
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| | - Gui-Jun Zhang
- College of Information Engineering, Zhejiang University of Technology, Hangzhou 310023, China
| |
Collapse
|
74
|
Ghafouri H, Lazar T, Del Conte A, Tenorio Ku LG, Tompa P, Tosatto SCE, Monzon AM. PED in 2024: improving the community deposition of structural ensembles for intrinsically disordered proteins. Nucleic Acids Res 2024; 52:D536-D544. [PMID: 37904608 PMCID: PMC10767937 DOI: 10.1093/nar/gkad947] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 10/10/2023] [Accepted: 10/13/2023] [Indexed: 11/01/2023] Open
Abstract
The Protein Ensemble Database (PED) (URL: https://proteinensemble.org) is the primary resource for depositing structural ensembles of intrinsically disordered proteins. This updated version of PED reflects advancements in the field, denoting a continual expansion with a total of 461 entries and 538 ensembles, including those generated without explicit experimental data through novel machine learning (ML) techniques. With this significant increment in the number of ensembles, a few yet-unprecedented new entries entered the database, including those also determined or refined by electron paramagnetic resonance or circular dichroism data. In addition, PED was enriched with several new features, including a novel deposition service, improved user interface, new database cross-referencing options and integration with the 3D-Beacons network-all representing efforts to improve the FAIRness of the database. Foreseeably, PED will keep growing in size and expanding with new types of ensembles generated by accurate and fast ML-based generative models and coarse-grained simulations. Therefore, among future efforts, priority will be given to further develop the database to be compatible with ensembles modeled at a coarse-grained level.
Collapse
Affiliation(s)
| | - Tamas Lazar
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel (VUB), Brussels, Belgium
| | - Alessio Del Conte
- Department of Biomedical Sciences, University of Padova, Padova, Italy
| | | | - Peter Tompa
- VIB-VUB Center for Structural Biology, Vlaams Instituut voor Biotechnologie (VIB), Brussels, Belgium
- Structural Biology Brussels, Department of Bioengineering, Vrije Universiteit Brussel (VUB), Brussels, Belgium
- Institute of Enzymology, Research Centre for Natural Sciences (RCNS), Budapest, Hungary
| | | | | |
Collapse
|
75
|
Platzer G, Ptaszek AL, Böttcher J, Fuchs JE, Geist L, Braun D, McConnell DB, Konrat R, Sánchez-Murcia PA, Mayer M. Ligand 1 H NMR Chemical Shifts as Accurate Reporters for Protein-Ligand Binding Interfaces in Solution. Chemphyschem 2024; 25:e202300636. [PMID: 37955910 DOI: 10.1002/cphc.202300636] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2023] [Revised: 10/23/2023] [Indexed: 11/14/2023]
Abstract
The availability of high-resolution 3D structural information is crucial for investigating guest-host systems across a wide range of fields. In the context of drug discovery, the information is routinely used to establish and validate structure-activity relationships, grow initial hits from screening campaigns, and to guide molecular docking. For the generation of protein-ligand complex structural information, X-ray crystallography is the experimental method of choice, however, with limited information on protein flexibility. An experimentally verified structural model of the binding interface in the native solution-state would support medicinal chemists in their molecular design decisions. Here we demonstrate that protein-bound ligand 1 H NMR chemical shifts are highly sensitive and accurate probes for the immediate chemical environment of protein-ligand interfaces. By comparing the experimental ligand 1 H chemical shift values with those computed from the X-ray structure using quantum mechanics methodology, we identify significant disagreements for parts of the ligand between the two experimental techniques. We show that quantum mechanics/molecular mechanics (QM/MM) molecular dynamics (MD) ensembles can be used to refine initial X-ray co-crystal structures resulting in a better agreement with experimental 1 H ligand chemical shift values. Overall, our findings highlight the usefulness of ligand 1 H NMR chemical shift information in combination with a QM/MM MD workflow for generating protein-ligand ensembles that accurately reproduce solution structural data.
Collapse
Affiliation(s)
- Gerald Platzer
- Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna Biocenter 5, 1030-, Vienna, Austria
- MAG-LAB GmbH, Karl-Farkas-Gasse 22, 1030-, Vienna, Austria
| | - Aleksandra L Ptaszek
- Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna Biocenter 5, 1030-, Vienna, Austria
- Laboratory for Computer-Aided Molecular Design, Division of Medicinal Chemistry, Otto Loewi Research Center, Medical University Graz, Neue Stiftingtalstrasse 6/III, 8010-, Graz, Austria
| | - Jark Böttcher
- Boehringer Ingelheim RCV GmbH & Co. KG, Dr. Boehringer Gasse 5-11, 1121-, Vienna, Austria
| | - Julian E Fuchs
- Boehringer Ingelheim RCV GmbH & Co. KG, Dr. Boehringer Gasse 5-11, 1121-, Vienna, Austria
| | - Leonhard Geist
- Boehringer Ingelheim RCV GmbH & Co. KG, Dr. Boehringer Gasse 5-11, 1121-, Vienna, Austria
| | - Daniel Braun
- Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna Biocenter 5, 1030-, Vienna, Austria
| | - Darryl B McConnell
- Boehringer Ingelheim RCV GmbH & Co. KG, Dr. Boehringer Gasse 5-11, 1121-, Vienna, Austria
| | - Robert Konrat
- Christian Doppler Laboratory for High-Content Structural Biology and Biotechnology, Department of Structural and Computational Biology, Max Perutz Labs, University of Vienna, Campus Vienna Biocenter 5, 1030-, Vienna, Austria
| | - Pedro A Sánchez-Murcia
- Laboratory for Computer-Aided Molecular Design, Division of Medicinal Chemistry, Otto Loewi Research Center, Medical University Graz, Neue Stiftingtalstrasse 6/III, 8010-, Graz, Austria
| | - Moriz Mayer
- Boehringer Ingelheim RCV GmbH & Co. KG, Dr. Boehringer Gasse 5-11, 1121-, Vienna, Austria
| |
Collapse
|
76
|
Ahmed M, Maldonado AM, Durrant JD. From byte to bench to bedside: molecular dynamics simulations and drug discovery. BMC Biol 2023; 21:299. [PMID: 38155355 PMCID: PMC10755930 DOI: 10.1186/s12915-023-01791-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2023] [Accepted: 12/01/2023] [Indexed: 12/30/2023] Open
Affiliation(s)
- Mayar Ahmed
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Alex M Maldonado
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA
| | - Jacob D Durrant
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA.
| |
Collapse
|
77
|
Chakravarty D, Schafer JW, Chen EA, Thole JR, Porter LL. AlphaFold2 has more to learn about protein energy landscapes. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.12.571380. [PMID: 38168383 PMCID: PMC10760193 DOI: 10.1101/2023.12.12.571380] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2024]
Abstract
Recent work suggests that AlphaFold2 (AF2)-a deep learning-based model that can accurately infer protein structure from sequence-may discern important features of folded protein energy landscapes, defined by the diversity and frequency of different conformations in the folded state. Here, we test the limits of its predictive power on fold-switching proteins, which assume two structures with regions of distinct secondary and/or tertiary structure. Using several implementations of AF2, including two published enhanced sampling approaches, we generated >280,000 models of 93 fold-switching proteins whose experimentally determined conformations were likely in AF2's training set. Combining all models, AF2 predicted fold switching with a modest success rate of ~25%, indicating that it does not readily sample both experimentally characterized conformations of most fold switchers. Further, AF2's confidence metrics selected against models consistent with experimentally determined fold-switching conformations in favor of inconsistent models. Accordingly, these confidence metrics-though suggested to evaluate protein energetics reliably-did not discriminate between low and high energy states of fold-switching proteins. We then evaluated AF2's performance on seven fold-switching proteins outside of its training set, generating >159,000 models in total. Fold switching was accurately predicted in one of seven targets with moderate confidence. Further, AF2 demonstrated no ability to predict alternative conformations of two newly discovered targets without homologs in the set of 93 fold switchers. These results indicate that AF2 has more to learn about the underlying energetics of protein ensembles and highlight the need for further developments of methods that readily predict multiple protein conformations.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Ethan A. Chen
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
| | - Joseph R. Thole
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, 20892
| |
Collapse
|
78
|
Weissenow K, Rost B. Rendering protein mutation movies with MutAmore. BMC Bioinformatics 2023; 24:469. [PMID: 38087198 PMCID: PMC10714560 DOI: 10.1186/s12859-023-05610-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 12/08/2023] [Indexed: 12/18/2023] Open
Abstract
BACKGROUND The success of AlphaFold2 in reliable protein three-dimensional (3D) structure prediction, assists the move of structural biology toward studies of protein dynamics and mutational impact on structure and function. This transition needs tools that qualitatively assess alternative 3D conformations. RESULTS We introduce MutAmore, a bioinformatics tool that renders individual images of protein 3D structures for, e.g., sequence mutations into a visually intuitive movie format. MutAmore streamlines a pipeline casting single amino-acid variations (SAVs) into a dynamic 3D mutation movie providing a qualitative perspective on the mutational landscape of a protein. By default, the tool first generates all possible variants of the sequence reachable through SAVs (L*19 for proteins with L residues). Next, it predicts the structural conformation for all L*19 variants using state-of-the-art models. Finally, it visualizes the mutation matrix and produces a color-coded 3D animation. Alternatively, users can input other types of variants, e.g., from experimental structures. CONCLUSION MutAmore samples alternative protein configurations to study the dynamical space accessible from SAVs in the post-AlphaFold2 era of structural biology. As the field shifts towards the exploration of alternative conformations of proteins, MutAmore aids in the understanding of the structural impact of mutations by providing a flexible pipeline for the generation of protein mutation movies using current and future structure prediction models.
Collapse
Affiliation(s)
- Konstantin Weissenow
- Department of Informatics, Bioinformatics and Computational Biology i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching, Munich, Germany.
- TUM Graduate School, Center of Doctoral Studies in Informatics and Its Applications (CeDoSIA), Boltzmannstr. 11, 85748, Garching, Germany.
| | - Burkhard Rost
- Department of Informatics, Bioinformatics and Computational Biology i12, TUM (Technical University of Munich), Boltzmannstr. 3, 85748, Garching, Munich, Germany
- Institute for Advanced Study (TUM-IAS), Lichtenbergstr. 2a, 85748, Garching, Munich, Germany
- TUM School of Life Sciences Weihenstephan (WZW), Alte Akademie 8, Freising, Germany
| |
Collapse
|
79
|
Kryshtafovych A, Montelione GT, Rigden DJ, Mesdaghi S, Karaca E, Moult J. Breaking the conformational ensemble barrier: Ensemble structure modeling challenges in CASP15. Proteins 2023; 91:1903-1911. [PMID: 37872703 PMCID: PMC10840738 DOI: 10.1002/prot.26584] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2023] [Accepted: 08/14/2023] [Indexed: 10/25/2023]
Abstract
For the first time, the 2022 CASP (Critical Assessment of Structure Prediction) community experiment included a section on computing multiple conformations for protein and RNA structures. There was full or partial success in reproducing the ensembles for four of the nine targets, an encouraging result. For protein structures, enhanced sampling with variations of the AlphaFold2 deep learning method was by far the most effective approach. One substantial conformational change caused by a single mutation across a complex interface was accurately reproduced. In two other assembly modeling cases, methods succeeded in sampling conformations near to the experimental ones even though environmental factors were not included in the calculations. An experimentally derived flexibility ensemble allowed a single accurate RNA structure model to be identified. Difficulties included how to handle sparse or low-resolution experimental data and the current lack of effective methods for modeling RNA/protein complexes. However, these and other obstacles appear addressable.
Collapse
Affiliation(s)
| | - Gaetano T Montelione
- Department of Chemistry and Chemical Biology, Center for Biotechnology and Interdisciplinary Sciences, Rensselaer Polytechnic Institute, Troy, New York, USA
| | - Daniel J Rigden
- Institute of Systems, Molecular, and Integrative Biology, University of Liverpool, Liverpool, UK
| | - Shahram Mesdaghi
- Institute of Systems, Molecular, and Integrative Biology, University of Liverpool, Liverpool, UK
- Computational Biology Facility, MerseyBio, University of Liverpool, Liverpool, UK
| | - Ezgi Karaca
- Izmir Biomedicine and Genome Center, Izmir, Turkey
- Izmir International Biomedicine and Genome Institute, Dokuz Eylul University, Izmir, Turkey
| | - John Moult
- Institute for Bioscience and Biotechnology Research, Rockville, Maryland, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, Maryland, USA
| |
Collapse
|
80
|
Leemann M, Sagasta A, Eberhardt J, Schwede T, Robin X, Durairaj J. Automated benchmarking of combined protein structure and ligand conformation prediction. Proteins 2023; 91:1912-1924. [PMID: 37885318 DOI: 10.1002/prot.26605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Revised: 09/15/2023] [Accepted: 09/21/2023] [Indexed: 10/28/2023]
Abstract
The prediction of protein-ligand complexes (PLC), using both experimental and predicted structures, is an active and important area of research, underscored by the inclusion of the Protein-Ligand Interaction category in the latest round of the Critical Assessment of Protein Structure Prediction experiment CASP15. The prediction task in CASP15 consisted of predicting both the three-dimensional structure of the receptor protein as well as the position and conformation of the ligand. This paper addresses the challenges and proposed solutions for devising automated benchmarking techniques for PLC prediction. The reliability of experimentally solved PLC as ground truth reference structures is assessed using various validation criteria. Similarity of PLC to previously released complexes are employed to judge PLC diversity and the difficulty of a PLC as a prediction target. We show that the commonly used PDBBind time-split test-set is inappropriate for comprehensive PLC evaluation, with state-of-the-art tools showing conflicting results on a more representative and high quality dataset constructed for benchmarking purposes. We also show that redocking on crystal structures is a much simpler task than docking into predicted protein models, demonstrated by the two PLC-prediction-specific scoring metrics created. Finally, we introduce a fully automated pipeline that predicts PLC and evaluates the accuracy of the protein structure, ligand pose, and protein-ligand interactions.
Collapse
Affiliation(s)
- Michèle Leemann
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Ander Sagasta
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Jerome Eberhardt
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Torsten Schwede
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Xavier Robin
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| | - Janani Durairaj
- Biozentrum, University of Basel, Basel, Switzerland
- SIB Swiss Institute of Bioinformatics, Basel, Switzerland
| |
Collapse
|
81
|
Nussinov R, Liu Y, Zhang W, Jang H. Cell phenotypes can be predicted from propensities of protein conformations. Curr Opin Struct Biol 2023; 83:102722. [PMID: 37871498 PMCID: PMC10841533 DOI: 10.1016/j.sbi.2023.102722] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Revised: 09/26/2023] [Accepted: 09/27/2023] [Indexed: 10/25/2023]
Abstract
Proteins exist as dynamic conformational ensembles. Here we suggest that the propensities of the conformations can be predictors of cell function. The conformational states that the molecules preferentially visit can be viewed as phenotypic determinants, and their mutations work by altering the relative propensities, thus the cell phenotype. Our examples include (i) inactive state variants harboring cancer driver mutations that present active state-like conformational features, as in K-Ras4BG12V compared to other K-Ras4BG12X mutations; (ii) mutants of the same protein presenting vastly different phenotypic and clinical profiles: cancer and neurodevelopmental disorders; (iii) alterations in the occupancies of the conformational (sub)states influencing enzyme reactivity. Thus, protein conformational propensities can determine cell fate. They can also suggest the allosteric drugs efficiency.
Collapse
Affiliation(s)
- Ruth Nussinov
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Department of Human Molecular Genetics and Biochemistry, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel; Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA.
| | - Yonglan Liu
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| | - Wengang Zhang
- Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| | - Hyunbum Jang
- Computational Structural Biology Section, Frederick National Laboratory for Cancer Research, Frederick, MD 21702, USA; Cancer Innovation Laboratory, National Cancer Institute, Frederick, MD 21702, USA
| |
Collapse
|
82
|
Sundaraj Y, Abdullah H, Nezhad NG, Rodrigues KF, Sabri S, Baharum SN. Cloning, Expression and Functional Characterization of a Novel α-Humulene Synthase, Responsible for the Formation of Sesquiterpene in Agarwood Originating from Aquilaria malaccensis. Curr Issues Mol Biol 2023; 45:8989-9002. [PMID: 37998741 PMCID: PMC10670791 DOI: 10.3390/cimb45110564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2023] [Revised: 11/04/2023] [Accepted: 11/06/2023] [Indexed: 11/25/2023] Open
Abstract
This study describes the cloning, expression and functional characterization of α-humulene synthase, responsible for the formation of the key aromatic compound α-humulene in agarwood originating from Aquilaria malaccensis. The partial sesquiterpene synthase gene from the transcriptome data of A. malaccensis was utilized for full-length gene isolation via a 3' RACE PCR. The complete gene, denoted as AmDG2, has an open reading frame (ORF) of 1671 bp and encodes for a polypeptide of 556 amino acids. In silico analysis of the protein highlighted several conserved motifs typically found in terpene synthases such as Asp-rich substrate binding (DDxxD), metal-binding residues (NSE/DTE), and cytoplasmic ER retention (RxR) motifs at their respective sites. The AmDG2 was successfully expressed in the E. coli:pET-28a(+) expression vector whereby an expected band of about 64 kDa in size was detected in the SDS-PAGE gel. In vitro enzyme assay using substrate farnesyl pyrophosphate (FPP) revealed that AmDG2 gave rise to two sesquiterpenes: α-humulene (major) and β-caryophyllene (minor), affirming its identity as α-humulene synthase. On the other hand, protein modeling performed using AlphaFold2 suggested that AmDG2 consists entirely of α-helices with short connecting loops and turns. Meanwhile, molecular docking via AutoDock Vina (Version 1.5.7) predicted that Asp307 and Asp311 act as catalytic residues in the α-humulene synthase. To our knowledge, this is the first comprehensive report on the cloning, expression and functional characterization of α-humulene synthase from agarwood originating from A. malaccensis species. These findings reveal a deeper understanding of the structure and functional properties of the α-humulene synthase and could be utilized for metabolic engineering work in the future.
Collapse
Affiliation(s)
- Yasotha Sundaraj
- Metabolomics Research Laboratory, Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia;
- Faculty of Engineering and Life Sciences, Universiti Selangor (UNISEL), Bestari Jaya 45600, Selangor, Malaysia;
| | - Hasdianty Abdullah
- Faculty of Engineering and Life Sciences, Universiti Selangor (UNISEL), Bestari Jaya 45600, Selangor, Malaysia;
| | - Nima Ghahremani Nezhad
- Department of Cell and Molecular Biology, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia;
| | - Kenneth Francis Rodrigues
- Biotechnology Research Institute, Universiti Malaysia Sabah (UMS), Kota Kinabalu 88400, Sabah, Malaysia;
| | - Suriana Sabri
- Enzyme and Microbial Technology Research Centre, Faculty of Biotechnology and Biomolecular Sciences, Universiti Putra Malaysia (UPM), Serdang 43400, Selangor, Malaysia;
| | - Syarul Nataqain Baharum
- Metabolomics Research Laboratory, Institute of Systems Biology (INBIOSIS), Universiti Kebangsaan Malaysia (UKM), Bangi 43600, Selangor, Malaysia;
| |
Collapse
|
83
|
Wang C, Yao X, Li X, Wang Q, Jiang N, Hu X, Lv H, Mu B, Wang J. Fosthiazate, a soil-applied nematicide, induces oxidative stress, neurotoxicity and transcriptome aberrations in earthworm (Eisenia fetida). JOURNAL OF HAZARDOUS MATERIALS 2023; 463:132865. [PMID: 39491983 DOI: 10.1016/j.jhazmat.2023.132865] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2023] [Revised: 10/14/2023] [Accepted: 10/24/2023] [Indexed: 11/05/2024]
Abstract
Fosthiazate is a widely used organophosphorus nematicide that resides in the soil and controls soil root-knot nematodes. However, whether it has toxic effects on non-target soil organisms such as earthworms is unclear. Therefore, in this study, a 28-day experiment of fosthiazate exposure was conducted using the Eisenia fetida as the model organism. The results showed that fosthiazate stress caused excessive production of reactive oxygen species (ROS), increased the levels of malondialdehyde (MDA) and 8-hydroxy-2-deoxyguanosine (8-OHdG), and decreased the activities of superoxide dismutase (SOD) and catalase (CAT), suggesting that fosthiazate induced oxidative stress and DNA damage in E. fetida. Acetylcholinesterase (AChE) activity was significantly reduced, and the expression of its related functional genes was also altered, demonstrating that fosthiazate damaged the nervous system of E. fetida, which was further confirmed by AlphaFold2 modeling and molecular docking simulations. Furthermore, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis indicated that fosthiazate exposure may induce apoptosis, inflammation, and viral infection in E. fetida, which adversely affect the organism. This study provides reference data for the ecotoxicity of fosthiazate.
Collapse
Affiliation(s)
- Can Wang
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China; College of Natural Resources and Environment, Northwest A&F University, Yangling 712000, PR China
| | - Xiangfeng Yao
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Xianxu Li
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Qian Wang
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Nan Jiang
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China; College of Natural Resources and Environment, Northwest A&F University, Yangling 712000, PR China
| | - Xue Hu
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Huijuan Lv
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Baoyan Mu
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China
| | - Jun Wang
- College of Resources and Environment, Shandong Agricultural University, Tai'an, Shandong 271018, PR China.
| |
Collapse
|