1
|
Nguyen JDM, da Hora GCA, Mifflin MC, Roberts AG, Swanson JMJ. In silico design of foldable lasso peptides. Biophys J 2025; 124:1532-1547. [PMID: 40181537 DOI: 10.1016/j.bpj.2025.03.036] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2025] [Revised: 03/03/2025] [Accepted: 03/31/2025] [Indexed: 04/05/2025] Open
Abstract
Lasso peptides are a unique class of natural products with distinctively threaded structures, conferring exceptional stability against thermal and proteolytic degradation. Despite their promising biotechnological and pharmaceutical applications, reported attempts to prepare them by chemical synthesis result in forming the nonthreaded branched-cyclic isomer, rather than the desired lassoed structure. This is likely due to the entropic challenge of folding a short, threaded motif before chemically mediated cyclization. Accordingly, this study aims to better understand and enhance the relative stability of pre-lasso conformations-the essential precursor to lasso peptide formation-through sequence optimization, chemical modification, and disulfide incorporation. Using Rosetta fixed backbone design, optimal sequences for several class II lasso peptides are identified. Enhanced sampling with well-tempered metadynamics confirmed that designed sequences derived from the lasso structures of rubrivinodin and microcin J25 exhibit a notable improvement in pre-lasso stability relative to the competing nonthreaded conformations. Chemical modifications to the isopeptide bond-forming residues of microcin J25 further increase the probability of pre-lasso formation, highlighting the beneficial role of noncanonical amino acid residues. Counterintuitively, the introduction of a disulfide cross-link decreased pre-lasso stability. Although cross-linking inherently constrains the peptide structure, decreasing the entropic dominance of unfolded phase space, it hinders the requisite wrapping of the N-terminal end around the tail to adopt the pre-lasso conformation. However, combining chemical modifications with the disulfide cross-link results in further pre-lasso stabilization, indicating that the ring modifications counteract the constraints and provide a cooperative benefit with cross-linking. These findings lay the groundwork for further design efforts to enable synthetic access to the lasso peptide scaffold.
Collapse
Affiliation(s)
- John D M Nguyen
- Department of Chemistry, University of Utah, Salt Lake City, Utah
| | | | - Marcus C Mifflin
- Department of Chemistry, University of Utah, Salt Lake City, Utah
| | - Andrew G Roberts
- Department of Chemistry, University of Utah, Salt Lake City, Utah
| | | |
Collapse
|
2
|
Nguyen JDM, da Hora GCA, Mifflin MC, Roberts AG, Swanson JMJ. Tying the Knot: In Silico Design of Foldable Lasso Peptides. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.17.633674. [PMID: 39896618 PMCID: PMC11785075 DOI: 10.1101/2025.01.17.633674] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
Lasso peptides are a unique class of natural products with distinctively threaded structures, conferring exceptional stability against thermal and proteolytic degradation. Despite their promising biotechnological and pharmaceutical applications, reported attempts to prepare them by chemical synthesis result in forming the nonthreaded branched-cyclic isomer, rather than the desired lassoed structure. This is likely due to the entropic challenge of folding a short, threaded motif prior to chemically mediated cyclization. Accordingly, this study aims to better understand and enhance the relative stability of pre-lasso conformations-the essential precursor to lasso peptide formation-through sequence optimization, chemical modification, and disulfide incorporation. Using Rosetta fixed backbone design, optimal sequences for several class II lasso peptides are identified. Enhanced sampling with well-tempered metadynamics confirmed that designed sequences derived from the lasso structures of rubrivinodin and microcin J25 exhibit a notable improvement in pre-lasso stability relative to the competing nonthreaded conformations. Chemical modifications to the isopeptide bond-forming residues of microcin J25 further increase the probability of pre-lasso formation, highlighting the beneficial role of non-canonical amino acid residues. Counterintuitively, the introduction of a disulfide cross-link decreased pre-lasso stability. Although cross-linking inherently constrains the peptide structure, decreasing the entropic dominance of unfolded phase space, it hinders the requisite wrapping of the N-terminal end around the tail to adopt the pre-lasso conformation. However, combining chemical modifications with the disulfide cross-link results in further pre-lasso stabilization, indicating that the ring modifications counteract the constraints and provide a cooperative benefit with cross-linking. These findings lay the groundwork for further design efforts to enable synthetic access to the lasso peptide scaffold. SIGNIFICANCE Lasso peptides are a unique class of ribosomally synthesized and post-translationally modified natural products with diverse biological activities and potential for therapeutic applications. Although direct synthesis would facilitate therapeutic design, it has not yet been possible to fold these short sequences to their threaded architecture without the help of biosynthetic enzyme stabilization. Our work explores strategies to enhance the stability of the pre-lasso structure, the essential precursor to de novo lasso peptide formation. We find that sequence design, incorporating non-canonical amino acid residues, and design-guided cross-linking can augment stability to increase the likelihood of lasso motif accessibility. This work presents several strategies for the continued design of foldable lasso peptides.
Collapse
|
3
|
Imani S, Li X, Chen K, Maghsoudloo M, Jabbarzadeh Kaboli P, Hashemi M, Khoushab S, Li X. Computational biology and artificial intelligence in mRNA vaccine design for cancer immunotherapy. Front Cell Infect Microbiol 2025; 14:1501010. [PMID: 39902185 PMCID: PMC11788159 DOI: 10.3389/fcimb.2024.1501010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2024] [Accepted: 12/16/2024] [Indexed: 02/05/2025] Open
Abstract
Messenger RNA (mRNA) vaccines offer an adaptable and scalable platform for cancer immunotherapy, requiring optimal design to elicit a robust and targeted immune response. Recent advancements in bioinformatics and artificial intelligence (AI) have significantly enhanced the design, prediction, and optimization of mRNA vaccines. This paper reviews technologies that streamline mRNA vaccine development, from genomic sequencing to lipid nanoparticle (LNP) formulation. We discuss how accurate predictions of neoantigen structures guide the design of mRNA sequences that effectively target immune and cancer cells. Furthermore, we examine AI-driven approaches that optimize mRNA-LNP formulations, enhancing delivery and stability. These technological innovations not only improve vaccine design but also enhance pharmacokinetics and pharmacodynamics, offering promising avenues for personalized cancer immunotherapy.
Collapse
Affiliation(s)
- Saber Imani
- Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Xiaoyan Li
- Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Keyi Chen
- Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| | - Mazaher Maghsoudloo
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, Sichuan, China
| | | | - Mehrdad Hashemi
- Department of Genetics, Faculty of Advanced Science and Technology, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran
- Farhikhtegan Medical Convergence sciences Research Center, Farhikhtegan Hospital Tehran Medical sciences, Islamic Azad University, Tehran, Iran
| | - Saloomeh Khoushab
- Department of Genetics, Faculty of Advanced Science and Technology, Tehran Medical Sciences, Islamic Azad University, Tehran, Iran
- Farhikhtegan Medical Convergence sciences Research Center, Farhikhtegan Hospital Tehran Medical sciences, Islamic Azad University, Tehran, Iran
| | - Xiaoping Li
- Key Laboratory of Artificial Organs and Computational Medicine in Zhejiang Province, Shulan International Medical College, Zhejiang Shuren University, Hangzhou, Zhejiang, China
| |
Collapse
|
4
|
Totaro MG, Vide U, Zausinger R, Winkler A, Oberdorfer G. ESM-scan-A tool to guide amino acid substitutions. Protein Sci 2024; 33:e5221. [PMID: 39565080 PMCID: PMC11577456 DOI: 10.1002/pro.5221] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2024] [Revised: 09/27/2024] [Accepted: 10/28/2024] [Indexed: 11/21/2024]
Abstract
Protein structure prediction and (re)design have gone through a revolution in the last 3 years. The tremendous progress in these fields has been almost exclusively driven by readily available machine learning algorithms applied to protein folding and sequence design problems. Despite these advancements, predicting site-specific mutational effects on protein stability and function remains an unsolved problem. This is a persistent challenge, mainly because the free energy of large systems is very difficult to compute with absolute accuracy and subtle changes to protein structures are hard to capture with computational models. Here, we describe the implementation and use of ESM-Scan, which uses the ESM zero-shot predictor to scan entire protein sequences for preferential amino acid changes, thus enabling in silico deep mutational scanning experiments. We benchmark ESM-Scan on its predictive capabilities for stability and functionality of sequence changes using three publicly available datasets and proceed by experimentally testing the tool's performance on a challenging test case of a blue-light-activated diguanylate cyclase from Methylotenera species (MsLadC), where it accurately predicted the importance of a highly conserved residue in a region involved in allosteric product inhibition. Our experimental results show that the ESM-zero shot model is capable of inferring the effects of a set of amino acid substitutions in their correlation between predicted fitness and experimental results. ESM-Scan is publicly available at https://huggingface.co/spaces/thaidaev/zsp.
Collapse
Affiliation(s)
| | - Uršula Vide
- Institute of BiochemistryGraz University of TechnologyGrazAustria
| | - Regina Zausinger
- Institute of BiochemistryGraz University of TechnologyGrazAustria
| | - Andreas Winkler
- Institute of BiochemistryGraz University of TechnologyGrazAustria
- BioTechMedGrazAustria
| | - Gustav Oberdorfer
- Institute of BiochemistryGraz University of TechnologyGrazAustria
- BioTechMedGrazAustria
| |
Collapse
|
5
|
Ertelt M, Schlegel P, Beining M, Kaysser L, Meiler J, Schoeder CT. HyperMPNN-A general strategy to design thermostable proteins learned from hyperthermophiles. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.11.26.625397. [PMID: 39651244 PMCID: PMC11623624 DOI: 10.1101/2024.11.26.625397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Stability is a key factor to enable the use of recombinant proteins in therapeutic or biotechnological applications. Deep learning protein design approaches like ProteinMPNN have shown strong performance both in creating novel proteins or stabilizing existing ones. However, it is unlikely that the stability of the designs will significantly exceed that of the natural proteins in the training set, which are biophysically only marginally stable. Therefore, we collected predicted protein structures from hyperthermophiles, which differ substantially in their amino acid composition from mesophiles. Notably, ProteinMPNN fails to recover their unique amino acid composition. Here we show that a retrained network on predicted proteins from hyperthermophiles, termed HyperMPNN, not only recovers this unique amino acid composition but can also be applied to proteins from non-hyperthermophiles. Using this novel approach on a protein nanoparticle with a melting temperature of 65°C resulted in designs remaining stable at 95°C. In conclusion, we created a new way to design highly thermostable proteins through self-supervised learning on data from hyperthermophiles.
Collapse
Affiliation(s)
- Moritz Ertelt
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| | - Phillip Schlegel
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
| | - Max Beining
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
- School of Embedded Composite Artificial Intelligence (SECAI), Cooperation of University Leipzig and TU Dresden
| | - Leonard Kaysser
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
| | - Jens Meiler
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
- School of Embedded Composite Artificial Intelligence (SECAI), Cooperation of University Leipzig and TU Dresden
- Department of Chemistry, Department of Pharmacology, Center for Structural Biology, Institute of Chemical Biology, Center for Applied Artificial Intelligence in Protein Dynamics, Vanderbilt University, Nashville, Tennessee, United States of America
- Institute for Computer Science, Faculty of Mathematics and Informatics, University Leipzig, Leipzig, Germany
- Wilhelm Ostwald Institute for Physical and Theoretical Chemistry, Faculty of Chemistry and Mineralogy, University Leipzig, Leipzig, Germany
| | - Clara T. Schoeder
- Institute for Drug Discovery, Leipzig University, Faculty of Medicine, Leipzig, Germany
- Center for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, Dresden/Leipzig, Germany
| |
Collapse
|
6
|
Ertelt M, Meiler J, Schoeder CT. Combining Rosetta Sequence Design with Protein Language Model Predictions Using Evolutionary Scale Modeling (ESM) as Restraint. ACS Synth Biol 2024; 13:1085-1092. [PMID: 38568188 PMCID: PMC11036486 DOI: 10.1021/acssynbio.3c00753] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2023] [Revised: 02/16/2024] [Accepted: 03/20/2024] [Indexed: 04/20/2024]
Abstract
Computational protein sequence design has the ambitious goal of modifying existing or creating new proteins; however, designing stable and functional proteins is challenging without predictability of protein dynamics and allostery. Informing protein design methods with evolutionary information limits the mutational space to more native-like sequences and results in increased stability while maintaining functions. Recently, language models, trained on millions of protein sequences, have shown impressive performance in predicting the effects of mutations. Assessing Rosetta-designed sequences with a language model showed scores that were worse than those of their original sequence. To inform Rosetta design protocols with language model predictions, we added a new metric to restrain the energy function during design using the Evolutionary Scale Modeling (ESM) model. The resulting sequences have better language model scores and similar sequence recovery, with only a minor decrease in the fitness as assessed by Rosetta energy. In conclusion, our work combines the strength of recent machine learning approaches with the Rosetta protein design toolbox.
Collapse
Affiliation(s)
- Moritz Ertelt
- Institute
for Drug Discovery, University Leipzig Medicine
Faculty, Liebigstr. 19, D-04103 Leipzig, Germany
- Center
for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, D-04105 Leipzig, Germany
| | - Jens Meiler
- Institute
for Drug Discovery, University Leipzig Medicine
Faculty, Liebigstr. 19, D-04103 Leipzig, Germany
- Center
for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, D-04105 Leipzig, Germany
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37235, United
States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37235, United States
| | - Clara T. Schoeder
- Institute
for Drug Discovery, University Leipzig Medicine
Faculty, Liebigstr. 19, D-04103 Leipzig, Germany
- Center
for Scalable Data Analytics and Artificial Intelligence ScaDS.AI, D-04105 Leipzig, Germany
| |
Collapse
|
7
|
Kim DN, McNaughton AD, Kumar N. Leveraging Artificial Intelligence to Expedite Antibody Design and Enhance Antibody-Antigen Interactions. Bioengineering (Basel) 2024; 11:185. [PMID: 38391671 PMCID: PMC10886287 DOI: 10.3390/bioengineering11020185] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2023] [Revised: 01/30/2024] [Accepted: 02/06/2024] [Indexed: 02/24/2024] Open
Abstract
This perspective sheds light on the transformative impact of recent computational advancements in the field of protein therapeutics, with a particular focus on the design and development of antibodies. Cutting-edge computational methods have revolutionized our understanding of protein-protein interactions (PPIs), enhancing the efficacy of protein therapeutics in preclinical and clinical settings. Central to these advancements is the application of machine learning and deep learning, which offers unprecedented insights into the intricate mechanisms of PPIs and facilitates precise control over protein functions. Despite these advancements, the complex structural nuances of antibodies pose ongoing challenges in their design and optimization. Our review provides a comprehensive exploration of the latest deep learning approaches, including language models and diffusion techniques, and their role in surmounting these challenges. We also present a critical analysis of these methods, offering insights to drive further progress in this rapidly evolving field. The paper includes practical recommendations for the application of these computational techniques, supplemented with independent benchmark studies. These studies focus on key performance metrics such as accuracy and the ease of program execution, providing a valuable resource for researchers engaged in antibody design and development. Through this detailed perspective, we aim to contribute to the advancement of antibody design, equipping researchers with the tools and knowledge to navigate the complexities of this field.
Collapse
Affiliation(s)
| | | | - Neeraj Kumar
- Pacific Northwest National Laboratory, 902 Battelle Blvd., Richland, WA 99352, USA; (D.N.K.); (A.D.M.)
| |
Collapse
|
8
|
Malbranke C, Bikard D, Cocco S, Monasson R, Tubiana J. Machine learning for evolutionary-based and physics-inspired protein design: Current and future synergies. Curr Opin Struct Biol 2023; 80:102571. [PMID: 36947951 DOI: 10.1016/j.sbi.2023.102571] [Citation(s) in RCA: 14] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Revised: 01/29/2023] [Accepted: 02/07/2023] [Indexed: 03/24/2023]
Abstract
Computational protein design facilitates the discovery of novel proteins with prescribed structure and functionality. Exciting designs were recently reported using novel data-driven methodologies that can be roughly divided into two categories: evolutionary-based and physics-inspired approaches. The former infer characteristic sequence features shared by sets of evolutionary-related proteins, such as conserved or coevolving positions, and recombine them to generate candidates with similar structure and function. The latter approaches estimate key biochemical properties, such as structure free energy, conformational entropy, or binding affinities using machine learning surrogates, and optimize them to yield improved designs. Here, we review recent progress along both tracks, discuss their strengths and weaknesses, and highlight opportunities for synergistic approaches.
Collapse
Affiliation(s)
- Cyril Malbranke
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France; Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, 75015 Paris, France.
| | - David Bikard
- Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, 75015 Paris, France
| | - Simona Cocco
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France
| | - Rémi Monasson
- Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France
| | - Jérôme Tubiana
- Blavatnik School of Computer Science, Tel Aviv University, Tel Aviv, Israel.
| |
Collapse
|
9
|
Lin P, Yan Y, Huang SY. DeepHomo2.0: improved protein-protein contact prediction of homodimers by transformer-enhanced deep learning. Brief Bioinform 2023; 24:6849483. [PMID: 36440949 DOI: 10.1093/bib/bbac499] [Citation(s) in RCA: 10] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2022] [Revised: 10/08/2022] [Accepted: 10/21/2022] [Indexed: 11/30/2022] Open
Abstract
Protein-protein interactions play an important role in many biological processes. However, although structure prediction for monomer proteins has achieved great progress with the advent of advanced deep learning algorithms like AlphaFold, the structure prediction for protein-protein complexes remains an open question. Taking advantage of the Transformer model of ESM-MSA, we have developed a deep learning-based model, named DeepHomo2.0, to predict protein-protein interactions of homodimeric complexes by leveraging the direct-coupling analysis (DCA) and Transformer features of sequences and the structure features of monomers. DeepHomo2.0 was extensively evaluated on diverse test sets and compared with eight state-of-the-art methods including protein language model-based, DCA-based and machine learning-based methods. It was shown that DeepHomo2.0 achieved a high precision of >70% with experimental monomer structures and >60% with predicted monomer structures for the top 10 predicted contacts on the test sets and outperformed the other eight methods. Moreover, even the version without using structure information, named DeepHomoSeq, still achieved a good precision of >55% for the top 10 predicted contacts. Integrating the predicted contacts into protein docking significantly improved the structure prediction of realistic Critical Assessment of Protein Structure Prediction homodimeric complexes. DeepHomo2.0 and DeepHomoSeq are available at http://huanglab.phys.hust.edu.cn/DeepHomo2/.
Collapse
Affiliation(s)
- Peicong Lin
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Yumeng Yan
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| | - Sheng-You Huang
- School of Physics, Huazhong University of Science and Technology, Wuhan, Hubei 430074, P. R. China
| |
Collapse
|
10
|
Page BM, Martin TA, Wright CL, Fenton LA, Villar MT, Tang Q, Artigues A, Lamb A, Fenton AW, Swint‐Kruse L. Odd one out? Functional tuning of Zymomonas mobilis pyruvate kinase is narrower than its allosteric, human counterpart. Protein Sci 2022; 31:e4336. [PMID: 35762709 PMCID: PMC9202079 DOI: 10.1002/pro.4336] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Revised: 04/29/2022] [Accepted: 05/03/2022] [Indexed: 11/08/2022]
Abstract
Various protein properties are often illuminated using sequence comparisons of protein homologs. For example, in analyses of the pyruvate kinase multiple sequence alignment, the set of positions that changed during speciation ("phylogenetic" positions) were enriched for "rheostat" positions in human liver pyruvate kinase (hLPYK). (Rheostat positions are those which, when substituted with various amino acids, yield a range of functional outcomes). However, the correlation was moderate, which could result from multiple biophysical constraints acting on the same position during evolution and/or various sources of noise. To further examine this correlation, we here tested Zymomonas mobilis PYK (ZmPYK), which has <65% sequence identity to any other PYK sequence. Twenty-six ZmPYK positions were selected based on their phylogenetic scores, substituted with multiple amino acids, and assessed for changes in Kapp-PEP . Although we expected to identify multiple, strong rheostat positions, only one moderate rheostat position was detected. Instead, nearly half of the 271 ZmPYK variants were inactive and most others showed near wild-type function. Indeed, for the active ZmPYK variants, the total range of Kapp,PEP values ("tunability") was 40-fold less than that observed for hLPYK variants. The combined functional studies and sequence comparisons suggest that ZmPYK has evolved functional and/or structural attributes that differ from the rest of the family. We hypothesize that including such "orphan" sequences in MSA analyses obscures the correlations used to predict rheostat positions. Finally, results raise the intriguing biophysical question as to how the same protein fold can support rheostat positions in one homolog but not another.
Collapse
Affiliation(s)
- Braelyn M. Page
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Tyler A. Martin
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Collette L. Wright
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
- Department of Molecular BiosciencesThe University of KansasLawrenceKansasUSA
| | - Lauren A. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Maite T. Villar
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Qingling Tang
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Antonio Artigues
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Audrey Lamb
- Department of Molecular BiosciencesThe University of KansasLawrenceKansasUSA
- Department of ChemistryUniversity of Texas at San AntonioSan AntonioTexasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansas CityKansasUSA
| |
Collapse
|
11
|
Vishwakarma P, Vattekatte AM, Shinada N, Diharce J, Martins C, Cadet F, Gardebien F, Etchebest C, Nadaradjane AA, de Brevern AG. V HH Structural Modelling Approaches: A Critical Review. Int J Mol Sci 2022; 23:3721. [PMID: 35409081 PMCID: PMC8998791 DOI: 10.3390/ijms23073721] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/25/2022] [Revised: 03/23/2022] [Accepted: 03/23/2022] [Indexed: 12/20/2022] Open
Abstract
VHH, i.e., VH domains of camelid single-chain antibodies, are very promising therapeutic agents due to their significant physicochemical advantages compared to classical mammalian antibodies. The number of experimentally solved VHH structures has significantly improved recently, which is of great help, because it offers the ability to directly work on 3D structures to humanise or improve them. Unfortunately, most VHHs do not have 3D structures. Thus, it is essential to find alternative ways to get structural information. The methods of structure prediction from the primary amino acid sequence appear essential to bypass this limitation. This review presents the most extensive overview of structure prediction methods applied for the 3D modelling of a given VHH sequence (a total of 21). Besides the historical overview, it aims at showing how model software programs have been shaping the structural predictions of VHHs. A brief explanation of each methodology is supplied, and pertinent examples of their usage are provided. Finally, we present a structure prediction case study of a recently solved VHH structure. According to some recent studies and the present analysis, AlphaFold 2 and NanoNet appear to be the best tools to predict a structural model of VHH from its sequence.
Collapse
Affiliation(s)
- Poonam Vishwakarma
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| | - Akhila Melarkode Vattekatte
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| | | | - Julien Diharce
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
| | - Carla Martins
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| | - Frédéric Cadet
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
- PEACCEL, Artificial Intelligence Department, Square Albin Cachot, F-75013 Paris, France
| | - Fabrice Gardebien
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| | - Catherine Etchebest
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
| | - Aravindan Arun Nadaradjane
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| | - Alexandre G. de Brevern
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-75015 Paris, France; (P.V.); (A.M.V.); (J.D.); (C.M.); (C.E.); (A.A.N.)
- INSERM UMR_S 1134, BIGR, DSIMB Team, Université de Paris and Université de la Réunion, F-97715 Saint Denis Messag, France; (F.C.); (F.G.)
| |
Collapse
|
12
|
Liu Z, Yang Y, Li D, Lv X, Chen X, Dai Q. Prediction of the RNA Tertiary Structure Based on a Random Sampling Strategy and Parallel Mechanism. Front Genet 2022; 12:813604. [PMID: 35069706 PMCID: PMC8769045 DOI: 10.3389/fgene.2021.813604] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2021] [Accepted: 11/19/2021] [Indexed: 12/14/2022] Open
Abstract
Background: Macromolecule structure prediction remains a fundamental challenge of bioinformatics. Over the past several decades, the Rosetta framework has provided solutions to diverse challenges in computational biology. However, it is challenging to model RNA tertiary structures effectively when the de novo modeling of RNA involves solving a well-defined small puzzle. Methods: In this study, we introduce a stepwise Monte Carlo parallelization (SMCP) algorithm for RNA tertiary structure prediction. Millions of conformations were randomly searched using the Monte Carlo algorithm and stepwise ansatz hypothesis, and SMCP uses a parallel mechanism for efficient sampling. Moreover, to achieve better prediction accuracy and completeness, we judged and processed the modeling results. Results: A benchmark of nine single-stranded RNA loops drawn from riboswitches establishes the general ability of the algorithm to model RNA with high accuracy and integrity, including six motifs that cannot be solved by knowledge mining-based modeling algorithms. Experimental results show that the modeling accuracy of the SMCP algorithm is up to 0.14 Å, and the modeling integrity on this benchmark is extremely high. Conclusion: SMCP is an ab initio modeling algorithm that substantially outperforms previous algorithms in the Rosetta framework, especially in improving the accuracy and completeness of the model. It is expected that the work will provide new research ideas for macromolecular structure prediction in the future. In addition, this work will provide theoretical basis for the development of the biomedical field.
Collapse
Affiliation(s)
- Zhendong Liu
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Yurong Yang
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Dongyan Li
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Xinrong Lv
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Xi Chen
- School of Computer Science and Technology, Shandong Jianzhu University, Jinan, China
| | - Qionghai Dai
- Department of Automation, Tsinghua University, Beijing, China
| |
Collapse
|
13
|
Modulating Glycoside Hydrolase Activity between Hydrolysis and Transfer Reactions Using an Evolutionary Approach. Molecules 2021; 26:molecules26216586. [PMID: 34770995 PMCID: PMC8587830 DOI: 10.3390/molecules26216586] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2021] [Revised: 10/27/2021] [Accepted: 10/28/2021] [Indexed: 01/02/2023] Open
Abstract
The proteins within the CAZy glycoside hydrolase family GH13 catalyze the hydrolysis of polysaccharides such as glycogen and starch. Many of these enzymes also perform transglycosylation in various degrees, ranging from secondary to predominant reactions. Identifying structural determinants associated with GH13 family reaction specificity is key to modifying and designing enzymes with increased specificity towards individual reactions for further applications in industrial, chemical, or biomedical fields. This work proposes a computational approach for decoding the determinant structural composition defining the reaction specificity. This method is based on the conservation of coevolving residues in spatial contacts associated with reaction specificity. To evaluate the algorithm, mutants of α-amylase (TmAmyA) and glucanotransferase (TmGTase) from Thermotoga maritima were constructed to modify the reaction specificity. The K98P/D99A/H222Q variant from TmAmyA doubled the transglycosydation/hydrolysis (T/H) ratio while the M279N variant from TmGTase increased the hydrolysis/transglycosidation ratio five-fold. Molecular dynamic simulations of the variants indicated changes in flexibility that can account for the modified T/H ratio. An essential contribution of the presented computational approach is its capacity to identify residues outside of the active center that affect the reaction specificity.
Collapse
|