1
|
Guo AB, Akpinaroglu D, Kelly MJ, Kortemme T. Deep learning guided design of dynamic proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.603962. [PMID: 39071443 PMCID: PMC11275770 DOI: 10.1101/2024.07.17.603962] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Deep learning has greatly advanced design of highly stable static protein structures, but the controlled conformational dynamics that are hallmarks of natural switch-like signaling proteins have remained inaccessible to de novo design. Here, we describe a general deep-learning-guided approach for de novo design of dynamic changes between intra-domain geometries of proteins, similar to switch mechanisms prevalent in nature, with atom-level precision. We solve 4 structures validating the designed conformations, show microsecond transitions between them, and demonstrate that the conformational landscape can be modulated by orthosteric ligands and allosteric mutations. Physics-based simulations are in remarkable agreement with deep-learning predictions and experimental data, reveal distinct state-dependent residue interaction networks, and predict mutations that tune the designed conformational landscape. Our approach demonstrates that new modes of motion can now be realized through de novo design and provides a framework for constructing biology-inspired, tunable and controllable protein signaling behavior de novo.
Collapse
Affiliation(s)
- Amy B. Guo
- The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco; San Francisco, CA 94143, USA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA 94143, USA
| | - Deniz Akpinaroglu
- The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco; San Francisco, CA 94143, USA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA 94143, USA
| | - Mark J.S. Kelly
- Department of Pharmaceutical Chemistry, University of California, San Francisco; San Francisco, CA 94143, USA
| | - Tanja Kortemme
- The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco; San Francisco, CA 94143, USA
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco; San Francisco, CA 94143, USA
- Quantitative Biosciences Institute, University of California, San Francisco; San Francisco, CA 94143, USA
- Chan Zuckerberg Biohub; San Francisco, CA 94143, USA
| |
Collapse
|
2
|
Guarra F, Colombo G. Computational Methods in Immunology and Vaccinology: Design and Development of Antibodies and Immunogens. J Chem Theory Comput 2023; 19:5315-5333. [PMID: 37527403 PMCID: PMC10448727 DOI: 10.1021/acs.jctc.3c00513] [Citation(s) in RCA: 17] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2023] [Indexed: 08/03/2023]
Abstract
The design of new biomolecules able to harness immune mechanisms for the treatment of diseases is a prime challenge for computational and simulative approaches. For instance, in recent years, antibodies have emerged as an important class of therapeutics against a spectrum of pathologies. In cancer, immune-inspired approaches are witnessing a surge thanks to a better understanding of tumor-associated antigens and the mechanisms of their engagement or evasion from the human immune system. Here, we provide a summary of the main state-of-the-art computational approaches that are used to design antibodies and antigens, and in parallel, we review key methodologies for epitope identification for both B- and T-cell mediated responses. A special focus is devoted to the description of structure- and physics-based models, privileged over purely sequence-based approaches. We discuss the implications of novel methods in engineering biomolecules with tailored immunological properties for possible therapeutic uses. Finally, we highlight the extraordinary challenges and opportunities presented by the possible integration of structure- and physics-based methods with emerging Artificial Intelligence technologies for the prediction and design of novel antigens, epitopes, and antibodies.
Collapse
Affiliation(s)
- Federica Guarra
- Department of Chemistry, University
of Pavia, Via Taramelli 12, 27100 Pavia, Italy
| | - Giorgio Colombo
- Department of Chemistry, University
of Pavia, Via Taramelli 12, 27100 Pavia, Italy
| |
Collapse
|
3
|
Dănăilă VR, Avram S, Buiu C. The applications of machine learning in HIV neutralizing antibodies research-A systematic review. Artif Intell Med 2022; 134:102429. [PMID: 36462896 DOI: 10.1016/j.artmed.2022.102429] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 09/03/2022] [Accepted: 10/13/2022] [Indexed: 12/14/2022]
Abstract
Machine learning algorithms play an essential role in bioinformatics and allow exploring the vast and noisy biological data in unrivaled ways. This paper is a systematic review of the applications of machine learning in the study of HIV neutralizing antibodies. This significant and vast research domain can pave the way to novel treatments and to a vaccine. We selected the relevant papers by investigating the available literature from the Web of Science and PubMed databases in the last decade. The computational methods are applied in neutralization potency prediction, neutralization span prediction against multiple viral strains, antibody-virus binding sites detection, enhanced antibodies design, and the study of the antibody-induced immune response. These methods are viewed from multiple angles spanning data processing, model description, feature selection, evaluation, and sometimes paper comparisons. The algorithms are diverse and include supervised, unsupervised, and generative types. Both classical machine learning and modern deep learning were taken into account. The review ends with our ideas regarding future research directions and challenges.
Collapse
Affiliation(s)
- Vlad-Rareş Dănăilă
- Department of Automatic Control and Systems Engineering, Politehnica University of Bucharest, 313 Splaiul Independenţei, Bucharest 060042, Romania.
| | - Speranţa Avram
- Department of Anatomy, Animal Physiology and Biophysics, Faculty of Biology, University of Bucharest, 91-95 Splaiul Independentei, Bucharest 050095, Romania.
| | - Cătălin Buiu
- Department of Automatic Control and Systems Engineering, Politehnica University of Bucharest, 313 Splaiul Independenţei, Bucharest 060042, Romania.
| |
Collapse
|
4
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
5
|
Dishman AF, Volkman BF. Design and discovery of metamorphic proteins. Curr Opin Struct Biol 2022; 74:102380. [PMID: 35561475 PMCID: PMC9664977 DOI: 10.1016/j.sbi.2022.102380] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2021] [Revised: 03/14/2022] [Accepted: 03/16/2022] [Indexed: 11/03/2022]
Abstract
Metamorphic proteins are single amino acid sequences that reversibly interconvert between multiple, dramatically different native structures, often with distinct functions. Since the discovery of the first metamorphic proteins in the early 2000s, several additional metamorphic proteins have been identified, and it was suggested that up to 4% of proteins in the PDB may switch folds. Metamorphic proteins have been found to share common features such as marginal thermostability and inconsistencies in predicted secondary structures. Outstanding challenges in the field include the search for more metamorphic proteins and the design of new proteins that switch folds. Identification of novel metamorphic proteins in nature will improve therapeutic targeting of fold-switching proteins involved in human pathology and will enhance the design of protein-based therapies. Designed fold switching proteins have applications as biosensors, molecular switches, molecular machines, and self-assembling systems.
Collapse
Affiliation(s)
- Acacia F Dishman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA; Medical Scientist Training Program, Medical College of Wisconsin, Milwaukee, WI, USA. https://twitter.com/@cacidish
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA.
| |
Collapse
|
6
|
Alberstein RG, Guo AB, Kortemme T. Design principles of protein switches. Curr Opin Struct Biol 2022; 72:71-78. [PMID: 34537489 PMCID: PMC8860883 DOI: 10.1016/j.sbi.2021.08.004] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/16/2021] [Revised: 08/03/2021] [Accepted: 08/06/2021] [Indexed: 01/14/2023]
Abstract
Protein switches perform essential roles in many biological processes and are exciting targets for de novo protein design, which aims to produce proteins of arbitrary shape and functionality. However, the biophysical requirements for switch function - multiple conformational states, fine-tuned energetics, and stimuli-responsiveness - pose a formidable challenge for design by computation (or intuition). A variety of methods have been developed toward tackling this challenge, usually taking inspiration from the wealth of sequence and structural information available for naturally occurring protein switches. More recently, modular switches have been designed computationally, and new methods have emerged for sampling unexplored structure space, providing promising new avenues toward the generation of purpose-built switches and de novo signaling systems for cellular engineering.
Collapse
Affiliation(s)
- Robert G Alberstein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Amy B Guo
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA; UC Berkeley-UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA; UC Berkeley-UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, USA; Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, CA, USA.
| |
Collapse
|
7
|
Schmitz S, Schmitz EA, Crowe JE, Meiler J. The human antibody sequence space and structural design of the V, J regions, and CDRH3 with Rosetta. MAbs 2022; 14:2068212. [PMID: 35544469 PMCID: PMC9103704 DOI: 10.1080/19420862.2022.2068212] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2021] [Revised: 04/05/2022] [Accepted: 04/14/2022] [Indexed: 12/14/2022] Open
Abstract
The human adaptive immune response enables the targeting of epitopes on pathogens with high specificity. Infection with a pathogen induces somatic hyper-mutation and B-cell selection processes that govern the shape and diversity of the antibody sequence landscape. To date, even the largest immunome repertoires of adaptive immune receptors acquired by next-generation sequencing cannot fully capture the vast antibody sequence space of a single individual, which is estimated to be at least 1012 potential sequences. Degeneracy of the genetic code means that the number of possible nucleotide triplets (64) is greater than the number of canonical amino acids (20), resulting in some amino acids being encoded by multiple triplets and different amino acids sharing the same nucleotide in 1 or 2 positions in the triplet. We hypothesize that the degeneracy of the genetic code can be used to statistically model an enlarged space of human antibody amino acid sequences, accommodating for the discrepancy between the observed and the hypothesized antibody sequence space. Facilitated by Bayesian statistics and immunome repertoire clustering, we calculated amino acid probabilities from single nucleotide frequencies to infer a human amino acid sequence space that is used to design human-like antibodies with Rosetta. We show that antibodies designed with our restraints are on average up to 16.6% more human-like in the V and J regions compared to the Rosetta designs produced without constraints. The human-likeness of the heavy-chain CDR3 region (CDRH3) could be increased for 8 of 27 antibodies compared to Rosetta designs with a similar number of mutations and could be successfully applied on Mus musculus antibodies to demonstrate humanization.
Collapse
Affiliation(s)
- Samuel Schmitz
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States
- Center of Structural Biology, Vanderbilt University, Nashville, Tennessee, United States
| | - Emily A. Schmitz
- Center of Structural Biology, Vanderbilt University, Nashville, Tennessee, United States
- Department of Molecular Physiology and Biophysics, Vanderbilt University, School of Medicine, Nashville, Tennessee, United States
| | - James E. Crowe
- Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, Tennessee, United States
- Department of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee, United States
- Departments of Pediatrics, Vanderbilt University Medical Center, Nashville, Tennessee, United States
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States
- Center of Structural Biology, Vanderbilt University, Nashville, Tennessee, United States
- Institute for Drug Discovery, University Leipzig Medical School, Leipzig, Germany
| |
Collapse
|
8
|
Bozhanova NG, Harp JM, Bender BJ, Gavrikov AS, Gorbachev DA, Baranov MS, Mercado CB, Zhang X, Lukyanov KA, Mishin AS, Meiler J. Computational redesign of a fluorogen activating protein with Rosetta. PLoS Comput Biol 2021; 17:e1009555. [PMID: 34748541 PMCID: PMC8601599 DOI: 10.1371/journal.pcbi.1009555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2020] [Revised: 11/18/2021] [Accepted: 10/11/2021] [Indexed: 11/24/2022] Open
Abstract
The use of unnatural fluorogenic molecules widely expands the pallet of available genetically encoded fluorescent imaging tools through the design of fluorogen activating proteins (FAPs). While there is already a handful of such probes available, each of them went through laborious cycles of in vitro screening and selection. Computational modeling approaches are evolving incredibly fast right now and are demonstrating great results in many applications, including de novo protein design. It suggests that the easier task of fine-tuning the fluorogen-binding properties of an already functional protein in silico should be readily achievable. To test this hypothesis, we used Rosetta for computational ligand docking followed by protein binding pocket redesign to further improve the previously described FAP DiB1 that is capable of binding to a BODIPY-like dye M739. Despite an inaccurate initial docking of the chromophore, the incorporated mutations nevertheless improved multiple photophysical parameters as well as the overall performance of the tag. The designed protein, DiB-RM, shows higher brightness, localization precision, and apparent photostability in protein-PAINT super-resolution imaging compared to its parental variant DiB1. Moreover, DiB-RM can be cleaved to obtain an efficient split system with enhanced performance compared to a parental DiB-split system. The possible reasons for the inaccurate ligand binding pose prediction and its consequence on the outcome of the design experiment are further discussed. Computational approaches have recently made significant progress in the protein engineering field evolving from a tool for helping experimentalists to prioritize or short-list mutations for testing to being capable of making fully reliable predictions. However, not all the fields of protein modeling are evolving at a similar pace. That is why evaluating the capabilities of computational tools on different tasks is important to provide other scientists with up-to-date information on the state of the field. Here we tested the performance of Rosetta (one of the leading macromolecule modeling tools) in improving small molecule-binding proteins. We successfully redesigned a fluorogen binding protein DiB1 –a protein that binds a non-fluorescent molecule and enforces its fluorescence in the obtained complex–for improved brightness and better performance in super-resolution imaging. Our results suggest that such tasks can be already achieved without laborious library screenings. However, the flexibility of the proteins might still be underestimated during standard modeling protocols and should be closely evaluated.
Collapse
Affiliation(s)
- Nina G. Bozhanova
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Joel M. Harp
- Department of Biochemistry, School of Medicine, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Brian J. Bender
- Department of Pharmacology and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Alexey S. Gavrikov
- Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Dmitry A. Gorbachev
- Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Mikhail S. Baranov
- Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
- Pirogov Russian National Research Medical University, Moscow, Russia
| | - Christina B. Mercado
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Xuan Zhang
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
| | | | - Alexander S. Mishin
- Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow, Russia
| | - Jens Meiler
- Department of Chemistry and Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America
- Institute for Drug Discovery, Leipzig University, Leipzig, Germany
- * E-mail:
| |
Collapse
|
9
|
Schulz S, Boyer S, Smerlak M, Cocco S, Monasson R, Nizak C, Rivoire O. Parameters and determinants of responses to selection in antibody libraries. PLoS Comput Biol 2021; 17:e1008751. [PMID: 33765014 PMCID: PMC7993935 DOI: 10.1371/journal.pcbi.1008751] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2020] [Accepted: 01/31/2021] [Indexed: 12/01/2022] Open
Abstract
The sequences of antibodies from a given repertoire are highly diverse at few sites located on the surface of a genome-encoded larger scaffold. The scaffold is often considered to play a lesser role than highly diverse, non-genome-encoded sites in controlling binding affinity and specificity. To gauge the impact of the scaffold, we carried out quantitative phage display experiments where we compare the response to selection for binding to four different targets of three different antibody libraries based on distinct scaffolds but harboring the same diversity at randomized sites. We first show that the response to selection of an antibody library may be captured by two measurable parameters. Second, we provide evidence that one of these parameters is determined by the degree of affinity maturation of the scaffold, affinity maturation being the process by which antibodies accumulate somatic mutations to evolve towards higher affinities during the natural immune response. In all cases, we find that libraries of antibodies built around maturated scaffolds have a lower response to selection to other arbitrary targets than libraries built around germline-based scaffolds. We thus propose that germline-encoded scaffolds have a higher selective potential than maturated ones as a consequence of a selection for this potential over the long-term evolution of germline antibody genes. Our results are a first step towards quantifying the evolutionary potential of biomolecules. Antibodies in the immune system consist of a genetically encoded scaffold that exposes a few highly diverse, non-genetically encoded sites. This focused diversity is sufficient to produce antibodies that bind to any target molecule. To understand the role of the scaffold, which acquires hypermutations during the immune response, over the selective response, we analyze quantitative in vitro experiments where large antibody populations based on different scaffolds are selected against different targets. We show that selective responses are described statistically by two parameters, one of which depends on prior evolution of the scaffold as part of a previous response. Our work provides methods to assay whether naïve antibody scaffolds are endowed with a distinctively high selective potential.
Collapse
Affiliation(s)
- Steven Schulz
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS UMR 7241, INSERM U1050, PSL University, Paris, France
| | - Sébastien Boyer
- Département de biochimie, Faculté de Médecine, Université de Montréal, Montréal, Canada
| | - Matteo Smerlak
- Max Planck Institute for Mathematics in the Sciences, Leipzig, Germany
| | - Simona Cocco
- Laboratory of Physics of École Normale Supérieure, UMR 8023, CNRS & PSL University, Paris, France
| | - Rémi Monasson
- Laboratory of Physics of École Normale Supérieure, UMR 8023, CNRS & PSL University, Paris, France
| | - Clément Nizak
- Laboratory of Biochemistry, CBI, UMR 8231, ESPCI Paris, PSL University, CNRS, Paris, France
- * E-mail: (CN); (OR)
| | - Olivier Rivoire
- Center for Interdisciplinary Research in Biology (CIRB), Collège de France, CNRS UMR 7241, INSERM U1050, PSL University, Paris, France
- * E-mail: (CN); (OR)
| |
Collapse
|
10
|
Schoeder C, Schmitz S, Adolf-Bryfogle J, Sevy AM, Finn JA, Sauer MF, Bozhanova NG, Mueller BK, Sangha AK, Bonet J, Sheehan JH, Kuenze G, Marlow B, Smith ST, Woods H, Bender BJ, Martina CE, del Alamo D, Kodali P, Gulsevin A, Schief WR, Correia BE, Crowe JE, Meiler J, Moretti R. Modeling Immunity with Rosetta: Methods for Antibody and Antigen Design. Biochemistry 2021; 60:825-846. [PMID: 33705117 PMCID: PMC7992133 DOI: 10.1021/acs.biochem.0c00912] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 03/02/2021] [Indexed: 01/16/2023]
Abstract
Structure-based antibody and antigen design has advanced greatly in recent years, due not only to the increasing availability of experimentally determined structures but also to improved computational methods for both prediction and design. Constant improvements in performance within the Rosetta software suite for biomolecular modeling have given rise to a greater breadth of structure prediction, including docking and design application cases for antibody and antigen modeling. Here, we present an overview of current protocols for antibody and antigen modeling using Rosetta and exemplify those by detailed tutorials originally developed for a Rosetta workshop at Vanderbilt University. These tutorials cover antibody structure prediction, docking, and design and antigen design strategies, including the addition of glycans in Rosetta. We expect that these materials will allow novice users to apply Rosetta in their own projects for modeling antibodies and antigens.
Collapse
Affiliation(s)
- Clara
T. Schoeder
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Samuel Schmitz
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Jared Adolf-Bryfogle
- Department
of Immunology and Microbiology, The Scripps
Research Institute, La Jolla, California 92037, United States
- IAVI
Neutralizing Antibody Center, The Scripps
Research Institute, La Jolla, California 92037, United States
| | - Alexander M. Sevy
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
| | - Jessica A. Finn
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
- Department
of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee 37232, United States
| | - Marion F. Sauer
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
| | - Nina G. Bozhanova
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Benjamin K. Mueller
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Amandeep K. Sangha
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Jaume Bonet
- Institute
of Bioengineering, École Polytechnique
Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Jonathan H. Sheehan
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Georg Kuenze
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Institute
for Drug Discovery, University Leipzig Medical
School, 04103 Leipzig, Germany
| | - Brennica Marlow
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Shannon T. Smith
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Hope Woods
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Brian J. Bender
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Department
of Pharmacology, Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Cristina E. Martina
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Diego del Alamo
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Pranav Kodali
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Alican Gulsevin
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - William R. Schief
- Department
of Immunology and Microbiology, The Scripps
Research Institute, La Jolla, California 92037, United States
- IAVI
Neutralizing Antibody Center, The Scripps
Research Institute, La Jolla, California 92037, United States
| | - Bruno E. Correia
- Institute
of Bioengineering, École Polytechnique
Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - James E. Crowe
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
- Department
of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee 37232, United States
- Department
of Pediatrics, Vanderbilt University Medical
Center, Nashville, Tennessee 37232, United States
| | - Jens Meiler
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Institute
for Drug Discovery, University Leipzig Medical
School, 04103 Leipzig, Germany
| | - Rocco Moretti
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| |
Collapse
|
11
|
Schmitz S, Ertelt M, Merkl R, Meiler J. Rosetta design with co-evolutionary information retains protein function. PLoS Comput Biol 2021; 17:e1008568. [PMID: 33465067 PMCID: PMC7815116 DOI: 10.1371/journal.pcbi.1008568] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Accepted: 11/28/2020] [Indexed: 12/14/2022] Open
Abstract
Computational protein design has the ambitious goal of crafting novel proteins that address challenges in biology and medicine. To overcome these challenges, the computational protein modeling suite Rosetta has been tailored to address various protein design tasks. Recently, statistical methods have been developed that identify correlated mutations between residues in a multiple sequence alignment of homologous proteins. These subtle inter-dependencies in the occupancy of residue positions throughout evolution are crucial for protein function, but we found that three current Rosetta design approaches fail to recover these co-evolutionary couplings. Thus, we developed the Rosetta method ResCue (residue-coupling enhanced) that leverages co-evolutionary information to favor sequences which recapitulate correlated mutations, as observed in nature. To assess the protocols via recapitulation designs, we compiled a benchmark of ten proteins each represented by two, structurally diverse states. We could demonstrate that ResCue designed sequences with an average sequence recovery rate of 70%, whereas three other protocols reached not more than 50%, on average. Our approach had higher recovery rates also for functionally important residues, which were studied in detail. This improvement has only a minor negative effect on the fitness of the designed sequences as assessed by Rosetta energy. In conclusion, our findings support the idea that informing protocols with co-evolutionary signals helps to design stable and native-like proteins that are compatible with the different conformational states required for a complex function. In homologous proteins, functionally or structurally important residues are strongly conserved. Thus, the consideration of conservation signals during protein design protocols can help to create sequences that are more native-like. However, the number of conserved residues is small in many proteins and not all important residues can be captured by conservation analysis. Residues are forming networks whose composition is dictated by protein structure function and thus is visible through the co-evolutionary analysis. Nowadays, advanced methods allow us to deduce these networks from multiple sequence alignments. Thus, we have implemented the novel Rosetta method termed ‘ResCue’ that informs the design protocol with co-evolutionary signals. Recapitulation designs based on ten difficult benchmarks made clear that this protocol creates sequences that are more native-like than three other, state-of-the-art design protocols.
Collapse
Affiliation(s)
- Samuel Schmitz
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| | - Moritz Ertelt
- Institute for Drug Discovery, Leipzig University, Leipzig, Germany
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Jens Meiler
- Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
- Institute for Drug Discovery, Leipzig University, Leipzig, Germany
- * E-mail:
| |
Collapse
|
12
|
Alexander MR, Schoeder CT, Brown JA, Smart CD, Moth C, Wikswo JP, Capra JA, Meiler J, Chen W, Madhur MS. Predicting susceptibility to SARS-CoV-2 infection based on structural differences in ACE2 across species. FASEB J 2020; 34:15946-15960. [PMID: 33015868 PMCID: PMC7675292 DOI: 10.1096/fj.202001808r] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2020] [Revised: 09/17/2020] [Accepted: 09/18/2020] [Indexed: 12/17/2022]
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) is the cause of the global pandemic of coronavirus disease-2019 (COVID-19). SARS-CoV-2 is a zoonotic disease, but little is known about variations in species susceptibility that could identify potential reservoir species, animal models, and the risk to pets, wildlife, and livestock. Certain species, such as domestic cats and tigers, are susceptible to SARS-CoV-2 infection, while other species such as mice and chickens are not. Most animal species, including those in close contact with humans, have unknown susceptibility. Hence, methods to predict the infection risk of animal species are urgently needed. SARS-CoV-2 spike protein binding to angiotensin-converting enzyme 2 (ACE2) is critical for viral cell entry and infection. Here we integrate species differences in susceptibility with multiple in-depth structural analyses to identify key ACE2 amino acid positions including 30, 83, 90, 322, and 354 that distinguish susceptible from resistant species. Using differences in these residues across species, we developed a susceptibility score that predicts an elevated risk of SARS-CoV-2 infection for multiple species including horses and camels. We also demonstrate that SARS-CoV-2 is nearly optimal for binding ACE2 of humans compared to other animals, which may underlie the highly contagious transmissibility of this virus among humans. Taken together, our findings define potential ACE2 and SARS-CoV-2 residues for therapeutic targeting and identification of animal species on which to focus research and protection measures for environmental and public health.
Collapse
Affiliation(s)
- Matthew R. Alexander
- Department of MedicineDivision of Cardiovascular MedicineVanderbilt University Medical Center (VUMC)NashvilleTNUSA
- Department of MedicineDivision of Clinical PharmacologyVanderbilt University Medical CenterNashvilleTNUSA
| | - Clara T. Schoeder
- Center for Structural BiologyVanderbilt UniversityNashvilleTNUSA
- Department of ChemistryVanderbilt UniversityNashvilleTNUSA
| | - Jacquelyn A. Brown
- Department of Physics and AstronomyVanderbilt UniversityNashvilleTNUSA
- Vanderbilt Institute for Integrative Biosystems Research and Education, Vanderbilt UniversityNashvilleTNUSA
| | - Charles D. Smart
- Department of Molecular Physiology and BiophysicsVanderbilt UniversityNashvilleTNUSA
| | - Chris Moth
- Center for Structural BiologyVanderbilt UniversityNashvilleTNUSA
- Department of Biological SciencesVanderbilt UniversityNashvilleTNUSA
| | - John P. Wikswo
- Department of Physics and AstronomyVanderbilt UniversityNashvilleTNUSA
- Vanderbilt Institute for Integrative Biosystems Research and Education, Vanderbilt UniversityNashvilleTNUSA
- Department of Molecular Physiology and BiophysicsVanderbilt UniversityNashvilleTNUSA
- Department of Biomedical EngineeringVanderbilt UniversityNashvilleTNUSA
| | - John A. Capra
- Center for Structural BiologyVanderbilt UniversityNashvilleTNUSA
- Department of Biological SciencesVanderbilt UniversityNashvilleTNUSA
| | - Jens Meiler
- Center for Structural BiologyVanderbilt UniversityNashvilleTNUSA
- Department of ChemistryVanderbilt UniversityNashvilleTNUSA
- Department of Biomedical EngineeringVanderbilt UniversityNashvilleTNUSA
- Institute for Drug DiscoveryLeipzig University Medical SchoolLeipzigGermany
| | - Wenbiao Chen
- Department of Molecular Physiology and BiophysicsVanderbilt UniversityNashvilleTNUSA
| | - Meena S. Madhur
- Department of MedicineDivision of Cardiovascular MedicineVanderbilt University Medical Center (VUMC)NashvilleTNUSA
- Department of MedicineDivision of Clinical PharmacologyVanderbilt University Medical CenterNashvilleTNUSA
- Department of Molecular Physiology and BiophysicsVanderbilt UniversityNashvilleTNUSA
- Vanderbilt Institute for Infection, Immunology, and InflammationNashvilleTNUSA
| |
Collapse
|
13
|
Alexander MR, Schoeder CT, Brown JA, Smart CD, Moth C, Wikswo JP, Capra JA, Meiler J, Chen W, Madhur MS. Which animals are at risk? Predicting species susceptibility to Covid-19. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020. [PMID: 32676592 DOI: 10.1101/2020.07.09.194563] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
In only a few months, the novel coronavirus severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has caused a global pandemic, leaving physicians, scientists, and public health officials racing to understand, treat, and contain this zoonotic disease. SARS-CoV-2 has made the leap from animals to humans, but little is known about variations in species susceptibility that could identify potential reservoir species, animal models, and the risk to pets, wildlife, and livestock. While there is evidence that certain species, such as cats, are susceptible, the vast majority of animal species, including those in close contact with humans, have unknown susceptibility. Hence, methods to predict their infection risk are urgently needed. SARS-CoV-2 spike protein binding to angiotensin converting enzyme 2 (ACE2) is critical for viral cell entry and infection. Here we identified key ACE2 residues that distinguish susceptible from resistant species using in-depth sequence and structural analyses of ACE2 and its binding to SARS-CoV-2. Our findings have important implications for identification of ACE2 and SARS-CoV-2 residues for therapeutic targeting and identification of animal species with increased susceptibility for infection on which to focus research and protection measures for environmental and public health.
Collapse
|
14
|
Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, et alLeman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RYR, Watkins A, Zimmerman L, Bonneau R. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020; 17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Show More Authors] [Citation(s) in RCA: 484] [Impact Index Per Article: 96.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022]
Abstract
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
| | - Brian D Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Steven M Lewis
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Biochemistry, Duke University, Durham, NC, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Nawsad Alam
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Melanie Aprahamian
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kyle A Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Patrick Barth
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Benjamin Basanta
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Biological Physics Structure and Design PhD Program, University of Washington, Seattle, WA, USA
| | - Brian J Bender
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Kristin Blacklock
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Scott E Boyken
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Phil Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Chris Bystroff
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, USA
| | - Patrick Conway
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lorna Dsilva
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Alexander S Ford
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brandon Frenz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Darwin Y Fu
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Sharon Guffy
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott Horowitz
- Department of Chemistry & Biochemistry, University of Denver, Denver, CO, USA
- The Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Tim M Jacobs
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - David K Johnson
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - John Karanicolas
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Hamed Khakzad
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
| | - Karen R Khar
- Cyrus Biotechnology, Seattle, WA, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Sagar D Khare
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth, MA, USA
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Indigo C King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Robert Kleffner
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Daisuke Kuroda
- Medical Device Development and Regulation Research Center, School of Engineering, University of Tokyo, Tokyo, Japan
- Department of Bioengineering, School of Engineering, University of Tokyo, Tokyo, Japan
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Chemistry, Franklin & Marshall College, Lancaster, PA, USA
| | - Jason K Lai
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Gideon Lapidoth
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - Thomas Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nir London
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Joseph H Lubin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Lars Malmström
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine Barcelona, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Orly Marcu
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nicholas A Marze
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Departments of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Santrupti Nerli
- Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Christoffer Norn
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Shane Ó'Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Michael S Pacella
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ryan E Pavlovicz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Manasi Pethe
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Barak Raveh
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - P Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Aliza Rubenstein
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Marion F Sauer
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yuval Sedan
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alexander M Sevy
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Lei Shi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justin B Siegel
- Department of Chemistry, University of California, Davis, Davis, CA, USA
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, California, USA
- Genome Center, University of California, Davis, Davis, CA, USA
| | | | - Shannon Smith
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Amelie Stein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Maria Szegedy
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Frank D Teets
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ray Yu-Ruei Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Lior Zimmerman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
- Department of Computer Science, New York University, New York, NY, USA.
- Center for Data Science, New York University, New York, NY, USA.
| |
Collapse
|
15
|
Surpeta B, Sequeiros-Borja CE, Brezovsky J. Dynamics, a Powerful Component of Current and Future in Silico Approaches for Protein Design and Engineering. Int J Mol Sci 2020; 21:E2713. [PMID: 32295283 PMCID: PMC7215530 DOI: 10.3390/ijms21082713] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2020] [Revised: 04/10/2020] [Accepted: 04/12/2020] [Indexed: 12/13/2022] Open
Abstract
Computational prediction has become an indispensable aid in the processes of engineering and designing proteins for various biotechnological applications. With the tremendous progress in more powerful computer hardware and more efficient algorithms, some of in silico tools and methods have started to apply the more realistic description of proteins as their conformational ensembles, making protein dynamics an integral part of their prediction workflows. To help protein engineers to harness benefits of considering dynamics in their designs, we surveyed new tools developed for analyses of conformational ensembles in order to select engineering hotspots and design mutations. Next, we discussed the collective evolution towards more flexible protein design methods, including ensemble-based approaches, knowledge-assisted methods, and provable algorithms. Finally, we highlighted apparent challenges that current approaches are facing and provided our perspectives on their further development.
Collapse
Affiliation(s)
- Bartłomiej Surpeta
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Carlos Eduardo Sequeiros-Borja
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| | - Jan Brezovsky
- Laboratory of Biomolecular Interactions and Transport, Department of Gene Expression, Institute of Molecular Biology and Biotechnology, Faculty of Biology, Adam Mickiewicz University, Uniwersytetu Poznanskiego 6, 61-614 Poznan, Poland; (B.S.); (C.E.S.-B.)
- International Institute of Molecular and Cell Biology in Warsaw, Ks Trojdena 4, 02-109 Warsaw, Poland
| |
Collapse
|
16
|
Nerattini F, Tubiana L, Cardelli C, Bianco V, Dellago C, Coluzza I. Protein design under competing conditions for the availability of amino acids. Sci Rep 2020; 10:2684. [PMID: 32060385 PMCID: PMC7021711 DOI: 10.1038/s41598-020-59401-9] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2019] [Accepted: 12/08/2019] [Indexed: 11/09/2022] Open
Abstract
Isolating the properties of proteins that allow them to convert sequence into the structure is a long-lasting biophysical problem. In particular, studies focused extensively on the effect of a reduced alphabet size on the folding properties. However, the natural alphabet is a compromise between versatility and optimisation of the available resources. Here, for the first time, we include the impact of the relative availability of the amino acids to extract from the 20 letters the core necessary for protein stability. We present a computational protein design scheme that involves the competition for resources between a protein and a potential interaction partner that, additionally, gives us the chance to investigate the effect of the reduced alphabet on protein-protein interactions. We devise a scheme that automatically identifies the optimal reduced set of letters for the design of the protein, and we observe that even alphabets reduced down to 4 letters allow for single protein folding. However, it is only with 6 letters that we achieve optimal folding, thus recovering experimental observations. Additionally, we notice that the binding between the protein and a potential interaction partner could not be avoided with the investigated reduced alphabets. Therefore, we suggest that aggregation could have been a driving force in the evolution of the large protein alphabet.
Collapse
Affiliation(s)
- Francesca Nerattini
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Luca Tubiana
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Chiara Cardelli
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Valentino Bianco
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Christoph Dellago
- Faculty of Physics, University of Vienna, Boltzmanngasse 5, 1090, Vienna, Austria
| | - Ivan Coluzza
- Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo Miramon 182, 20014, San Sebastian, Spain. .,IKERBASQUE, Basque Foundation for Science, 48013, Bilbao, Spain.
| |
Collapse
|
17
|
Sauer MF, Sevy AM, Crowe JE, Meiler J. Multi-state design of flexible proteins predicts sequences optimal for conformational change. PLoS Comput Biol 2020; 16:e1007339. [PMID: 32032348 PMCID: PMC7032724 DOI: 10.1371/journal.pcbi.1007339] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2019] [Revised: 02/20/2020] [Accepted: 12/23/2019] [Indexed: 12/11/2022] Open
Abstract
Computational protein design of an ensemble of conformations for one protein–i.e., multi-state design–determines the side chain identity by optimizing the energetic contributions of that side chain in each of the backbone conformations. Sampling the resulting large sequence-structure search space limits the number of conformations and the size of proteins in multi-state design algorithms. Here, we demonstrated that the REstrained CONvergence (RECON) algorithm can simultaneously evaluate the sequence of large proteins that undergo substantial conformational changes. Simultaneous optimization of side chain conformations across all conformations increased sequence conservation when compared to single-state designs in all cases. More importantly, the sequence space sampled by RECON MSD resembled the evolutionary sequence space of flexible proteins, particularly when confined to predicting the mutational preferences of limited common ancestral descent, such as in the case of influenza type A hemagglutinin. Additionally, we found that sequence positions which require substantial changes in their local environment across an ensemble of conformations are more likely to be conserved. These increased conservation rates are better captured by RECON MSD over multiple conformations and thus multiple local residue environments during design. To quantify this rewiring of contacts at a certain position in sequence and structure, we introduced a new metric designated ‘contact proximity deviation’ that enumerates contact map changes. This measure allows mapping of global conformational changes into local side chain proximity adjustments, a property not captured by traditional global similarity metrics such as RMSD or local similarity metrics such as changes in φ and ψ angles. Multi-state design can be used to engineer proteins that need to exist in multiple conformations or that bind to multiple partner molecules. In essence, multi-state design selects a compromise of protein sequences that allow for an ensemble of protein conformations, or states, associated with a particular biological function. In this paper, we used the REstrained CONvergence (RECON) algorithm with Rosetta to show that multi-state design of flexible proteins predicts sequences optimal for conformational change, mimicking mutation preferences sampled in evolution. Modeling optimal local side chain physicochemical environments within an ensemble selected significantly more native-like sequences than selections performed when all conformations states are designed independently. This outcome was particularly true for amino acids whose local side chain environment change between conformations. To quantify such contact map changes, we introduced a novel metric to show that sequence conservation is dependent on protein flexibility, i.e., changes in local side chain environments between stated limit the space of tolerated mutations. Additionally, such positions in sequence and structure are more likely to be energetically frustrated, at least in some states. Importantly, we showed that multi-state design over an ensemble of conformations (space) can explore evolutionary tolerated sequence space (time), thus enabling RECON to not only design proteins that require multiple states for function but also predict mutations that might be tolerated in native proteins but have not yet been explored by evolution. The latter aspect can be important to anticipate escape mutations, for example in pathogens or oncoproteins.
Collapse
Affiliation(s)
- Marion F Sauer
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America.,Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Alexander M Sevy
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America.,Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - James E Crowe
- Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America.,Department of Pediatrics, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America.,Department of Pathology, Microbiology and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee, United States of America
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, Tennessee, United States of America.,Department of Chemistry, Vanderbilt University, Nashville, Tennessee, United States of America
| |
Collapse
|
18
|
Kaushik AC, Mehmood A, Khan MT, Kumar A, Dai X, Wei DQ. RETRACTED ARTICLE: Protein blueprint and their interactions while approachability struggle for amino acids. J Biomol Struct Dyn 2020; 39:i-ix. [PMID: 31914855 DOI: 10.1080/07391102.2020.1713894] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
Affiliation(s)
| | - Aamir Mehmood
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| | - Muhammad Tahir Khan
- Department of Bioinformatics and Biosciences, Capital University of Science and Technology, Islamabad, Pakistan
| | - Ajay Kumar
- Institute of Biomedical Sciences, National Sun Yat-Sen University, Kaohsiung City, Taiwan
| | - Xiaofeng Dai
- Wuxi School of Medicine, Jiangnan University, Wuxi, China
| | - Dong-Qing Wei
- School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai, China
| |
Collapse
|
19
|
Multistate design of influenza antibodies improves affinity and breadth against seasonal viruses. Proc Natl Acad Sci U S A 2019; 116:1597-1602. [PMID: 30642961 PMCID: PMC6358683 DOI: 10.1073/pnas.1806004116] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/02/2022] Open
Abstract
Influenza is an annual threat to global public health, in part because of constant antigenic drift that facilitates evasion of the antibody response. Rapid changes in the influenza HA protein make it difficult for an antibody to achieve broad activity against different virus subtypes. We developed a computational method that can optimize an antibody sequence to be robust against seasonal variation. As a proof of concept, we tested this method by redesigning a known antibody against a set of diverse HA antigens and showed that the variant redesigned antibodies have improved activity against the virus panel, as predicted. This work shows that computational design can improve naturally occurring antibodies for recognition of different virus strains. Influenza is a yearly threat to global public health. Rapid changes in influenza surface proteins resulting from antigenic drift and shift events make it difficult to readily identify antibodies with broadly neutralizing activity against different influenza subtypes with high frequency, specifically antibodies targeting the receptor binding domain (RBD) on influenza HA protein. We developed an optimized computational design method that is able to optimize an antibody for recognition of large panels of antigens. To demonstrate the utility of this multistate design method, we used it to redesign an antiinfluenza antibody against a large panel of more than 500 seasonal HA antigens of the H1 subtype. As a proof of concept, we tested this method on a variety of known antiinfluenza antibodies and identified those that could be improved computationally. We generated redesigned variants of antibody C05 to the HA RBD and experimentally characterized variants that exhibited improved breadth and affinity against our panel. C05 mutants exhibited improved affinity for three of the subtypes used in design by stabilizing the CDRH3 loop and creating favorable electrostatic interactions with the antigen. These mutants possess increased breadth and affinity of binding while maintaining high-affinity binding to existing targets, surpassing a major limitation up to this point.
Collapse
|
20
|
Lechner H, Ferruz N, Höcker B. Strategies for designing non-natural enzymes and binders. Curr Opin Chem Biol 2018; 47:67-76. [PMID: 30248579 DOI: 10.1016/j.cbpa.2018.07.022] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2018] [Revised: 07/16/2018] [Accepted: 07/17/2018] [Indexed: 12/20/2022]
Abstract
The design of tailor-made enzymes is a major goal in biochemical research that can result in wide-range applications and will lead to a better understanding of how proteins fold and function. In this review we highlight recent advances in enzyme and small molecule binder design. A focus is placed on novel strategies for the design of scaffolds, developments in computational methods, and recent applications of these techniques on receptors, sensors, and enzymes. Further, the integration of computational and experimental methodologies is discussed. The outlined examples of designed enzymes and binders for various purposes highlight the importance of this topic and underline the need for tailor-made proteins.
Collapse
Affiliation(s)
- Horst Lechner
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
21
|
Abstract
Motivation Multistate protein design addresses real-world challenges, such as multi-specificity design and backbone flexibility, by considering both positive and negative protein states with an ensemble of substates for each. It also presents an enormous challenge to exact algorithms that guarantee the optimal solutions and enable a direct test of mechanistic hypotheses behind models. However, efficient exact algorithms are lacking for multistate protein design. Results We have developed an efficient exact algorithm called interconnected cost function networks (iCFN) for multistate protein design. Its generic formulation allows for a wide array of applications such as stability, affinity and specificity designs while addressing concerns such as global flexibility of protein backbones. iCFN treats each substate design as a weighted constraint satisfaction problem (WCSP) modeled through a CFN; and it solves the coupled WCSPs using novel bounds and a depth-first branch-and-bound search over a tree structure of sequences, substates, and conformations. When iCFN is applied to specificity design of a T-cell receptor, a problem of unprecedented size to exact methods, it drastically reduces search space and running time to make the problem tractable. Moreover, iCFN generates experimentally-agreeing receptor designs with improved accuracy compared with state-of-the-art methods, highlights the importance of modeling backbone flexibility in protein design, and reveals molecular mechanisms underlying binding specificity. Availability and implementation https://shen-lab.github.io/software/iCFN. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mostafa Karimi
- Department of Electrical and Computer Engineering and TEES-AgriLife Center for Bioinformatics and Genomic Systems Engineering, Texas A&M University, College Station, USA
| | - Yang Shen
- Department of Electrical and Computer Engineering and TEES-AgriLife Center for Bioinformatics and Genomic Systems Engineering, Texas A&M University, College Station, USA
| |
Collapse
|
22
|
Schlee S, Klein T, Schumacher M, Nazet J, Merkl R, Steinhoff HJ, Sterner R. Relationship of Catalysis and Active Site Loop Dynamics in the (βα)8-Barrel Enzyme Indole-3-glycerol Phosphate Synthase. Biochemistry 2018; 57:3265-3277. [DOI: 10.1021/acs.biochem.8b00167] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Sandra Schlee
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany
| | - Thomas Klein
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany
| | - Magdalena Schumacher
- Department of Physics, University of Osnabrück, Barbarastrasse 7, D-49076 Osnabrück, Germany
| | - Julian Nazet
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany
| | - Heinz-Jürgen Steinhoff
- Department of Physics, University of Osnabrück, Barbarastrasse 7, D-49076 Osnabrück, Germany
| | - Reinhard Sterner
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Universitätsstrasse 31, D-93053 Regensburg, Germany
| |
Collapse
|
23
|
Sevy AM, Panda S, Crowe JE, Meiler J, Vorobeychik Y. Integrating linear optimization with structural modeling to increase HIV neutralization breadth. PLoS Comput Biol 2018; 14:e1005999. [PMID: 29451898 PMCID: PMC5833279 DOI: 10.1371/journal.pcbi.1005999] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2017] [Revised: 03/01/2018] [Accepted: 01/24/2018] [Indexed: 11/18/2022] Open
Abstract
Computational protein design has been successful in modeling fixed backbone proteins in a single conformation. However, when modeling large ensembles of flexible proteins, current methods in protein design have been insufficient. Large barriers in the energy landscape are difficult to traverse while redesigning a protein sequence, and as a result current design methods only sample a fraction of available sequence space. We propose a new computational approach that combines traditional structure-based modeling using the Rosetta software suite with machine learning and integer linear programming to overcome limitations in the Rosetta sampling methods. We demonstrate the effectiveness of this method, which we call BROAD, by benchmarking the performance on increasing predicted breadth of anti-HIV antibodies. We use this novel method to increase predicted breadth of naturally-occurring antibody VRC23 against a panel of 180 divergent HIV viral strains and achieve 100% predicted binding against the panel. In addition, we compare the performance of this method to state-of-the-art multistate design in Rosetta and show that we can outperform the existing method significantly. We further demonstrate that sequences recovered by this method recover known binding motifs of broadly neutralizing anti-HIV antibodies. Finally, our approach is general and can be extended easily to other protein systems. Although our modeled antibodies were not tested in vitro, we predict that these variants would have greatly increased breadth compared to the wild-type antibody. In this article, we report a new approach for protein design, which combines traditional structural modeling with machine learning and integer programming. Using this method, we are able to design antibodies that are predicted to bind large panels of antigenically diverse HIV variants. The combination of methods from these fields allows us to surpass protein design limitations that have been seen up to this point. We predict that if we tested these modified antibodies against HIV variants they would have greater neutralization breadth than any antibodies seen to this point.
Collapse
Affiliation(s)
- Alexander M. Sevy
- Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America
| | - Swetasudha Panda
- Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, United States of America
| | - James E. Crowe
- Vanderbilt Vaccine Center, Vanderbilt University Medical Center, Nashville, TN, United States of America
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, United States of America
- Department of Chemistry, Vanderbilt University, Nashville, TN, United States of America
| | - Yevgeniy Vorobeychik
- Electrical Engineering and Computer Science, Vanderbilt University, Nashville, TN, United States of America
- * E-mail:
| |
Collapse
|
24
|
Nanda V, Belure SV, Shir OM. Searching for the Pareto frontier in multi-objective protein design. Biophys Rev 2017; 9:339-344. [PMID: 28799089 DOI: 10.1007/s12551-017-0288-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2017] [Accepted: 07/25/2017] [Indexed: 12/26/2022] Open
Abstract
The goal of protein engineering and design is to identify sequences that adopt three-dimensional structures of desired function. Often, this is treated as a single-objective optimization problem, identifying the sequence-structure solution with the lowest computed free energy of folding. However, many design problems are multi-state, multi-specificity, or otherwise require concurrent optimization of multiple objectives. There may be tradeoffs among objectives, where improving one feature requires compromising another. The challenge lies in determining solutions that are part of the Pareto optimal set-designs where no further improvement can be achieved in any of the objectives without degrading one of the others. Pareto optimality problems are found in all areas of study, from economics to engineering to biology, and computational methods have been developed specifically to identify the Pareto frontier. We review progress in multi-objective protein design, the development of Pareto optimization methods, and present a specific case study using multi-objective optimization methods to model the tradeoff between three parameters, stability, specificity, and complexity, of a set of interacting synthetic collagen peptides.
Collapse
Affiliation(s)
- Vikas Nanda
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, USA.
- Department of Biochemistry and Molecular Biophysics, Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, USA.
| | - Sandeep V Belure
- Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ, USA
- Department of Biochemistry and Molecular Biophysics, Robert Wood Johnson Medical School, Rutgers University, Piscataway, NJ, USA
| | - Ofer M Shir
- Department of Computer Science, Tel-Hai College, Kiryat Shmona, Upper Galilee, Israel
- The Galilee Research Institute-Migal, Kiryat Shmona, Upper Galilee, Israel
| |
Collapse
|
25
|
Löffler P, Schmitz S, Hupfeld E, Sterner R, Merkl R. Rosetta:MSF: a modular framework for multi-state computational protein design. PLoS Comput Biol 2017; 13:e1005600. [PMID: 28604768 PMCID: PMC5484525 DOI: 10.1371/journal.pcbi.1005600] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2017] [Revised: 06/26/2017] [Accepted: 05/27/2017] [Indexed: 12/20/2022] Open
Abstract
Computational protein design (CPD) is a powerful technique to engineer existing proteins or to design novel ones that display desired properties. Rosetta is a software suite including algorithms for computational modeling and analysis of protein structures and offers many elaborate protocols created to solve highly specific tasks of protein engineering. Most of Rosetta’s protocols optimize sequences based on a single conformation (i. e. design state). However, challenging CPD objectives like multi-specificity design or the concurrent consideration of positive and negative design goals demand the simultaneous assessment of multiple states. This is why we have developed the multi-state framework MSF that facilitates the implementation of Rosetta’s single-state protocols in a multi-state environment and made available two frequently used protocols. Utilizing MSF, we demonstrated for one of these protocols that multi-state design yields a 15% higher performance than single-state design on a ligand-binding benchmark consisting of structural conformations. With this protocol, we designed de novo nine retro-aldolases on a conformational ensemble deduced from a (βα)8-barrel protein. All variants displayed measurable catalytic activity, testifying to a high success rate for this concept of multi-state enzyme design. Protein engineering, i. e. the targeted modification or design of proteins has tremendous potential for medical and industrial applications. One generally applicable strategy for protein engineering is rational protein design: based on detailed knowledge of structure and function, computer programs like Rosetta propose the sequence of a protein possessing the desired properties. So far, most computer protocols have used rigid structures for design, which is a simplification because a protein’s structure is more accurately specified by a conformational ensemble. We have now implemented a framework for computational protein design that allows certain design protocols of Rosetta to make use of multiple design states like structural ensembles. An in silico assessment simulating ligand-binding design showed that this new approach generates more reliably native-like sequences than a single-state approach. As a proof-of-concept, we introduced de novo retro-aldolase activity into a scaffold protein and characterized nine variants experimentally, all of which were catalytically active.
Collapse
Affiliation(s)
- Patrick Löffler
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Samuel Schmitz
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Enrico Hupfeld
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Reinhard Sterner
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
| | - Rainer Merkl
- Institute of Biophysics and Physical Biochemistry, University of Regensburg, Regensburg, Germany
- * E-mail:
| |
Collapse
|
26
|
Bender BJ, Cisneros A, Duran AM, Finn JA, Fu D, Lokits AD, Mueller BK, Sangha AK, Sauer MF, Sevy AM, Sliwoski G, Sheehan JH, DiMaio F, Meiler J, Moretti R. Protocols for Molecular Modeling with Rosetta3 and RosettaScripts. Biochemistry 2016; 55:4748-63. [PMID: 27490953 PMCID: PMC5007558 DOI: 10.1021/acs.biochem.6b00444] [Citation(s) in RCA: 150] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
![]()
Previously, we published an article
providing an overview of the
Rosetta suite of biomacromolecular modeling software and a series
of step-by-step tutorials [Kaufmann, K. W., et al. (2010) Biochemistry 49, 2987–2998]. The overwhelming positive
response to this publication we received motivates us to here share
the next iteration of these tutorials that feature de novo folding, comparative modeling, loop construction, protein docking,
small molecule docking, and protein design. This updated and expanded
set of tutorials is needed, as since 2010 Rosetta has been fully redesigned
into an object-oriented protein modeling program Rosetta3. Notable
improvements include a substantially improved energy function, an
XML-like language termed “RosettaScripts” for flexibly
specifying modeling task, new analysis tools, the addition of the
TopologyBroker to control conformational sampling, and support for
multiple templates in comparative modeling. Rosetta’s ability
to model systems with symmetric proteins, membrane proteins, noncanonical
amino acids, and RNA has also been greatly expanded and improved.
Collapse
Affiliation(s)
- Brian J Bender
- Department of Pharmacology, Vanderbilt University , Nashville, Tennessee 37232-6600, United States.,Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States
| | - Alberto Cisneros
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Chemical and Physical Biology Program, Vanderbilt University , Nashville, Tennessee 37232-0301, United States
| | - Amanda M Duran
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Jessica A Finn
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Pathology, Microbiology and Immunology, Vanderbilt University , Nashville, Tennessee 37232-2561, United States
| | - Darwin Fu
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Alyssa D Lokits
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Neuroscience Program, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Benjamin K Mueller
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Amandeep K Sangha
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Marion F Sauer
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Chemical and Physical Biology Program, Vanderbilt University , Nashville, Tennessee 37232-0301, United States
| | - Alexander M Sevy
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Chemical and Physical Biology Program, Vanderbilt University , Nashville, Tennessee 37232-0301, United States
| | - Gregory Sliwoski
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Jonathan H Sheehan
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States
| | - Frank DiMaio
- Department of Biochemistry, University of Washington , Seattle, Washington 98195, United States
| | - Jens Meiler
- Department of Pharmacology, Vanderbilt University , Nashville, Tennessee 37232-6600, United States.,Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Chemical and Physical Biology Program, Vanderbilt University , Nashville, Tennessee 37232-0301, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States.,Department of Pathology, Microbiology and Immunology, Vanderbilt University , Nashville, Tennessee 37232-2561, United States.,Neuroscience Program, Vanderbilt University , Nashville, Tennessee 37235, United States
| | - Rocco Moretti
- Center for Structural Biology, Vanderbilt University , Nashville, Tennessee 37240-7917, United States.,Department of Chemistry, Vanderbilt University , Nashville, Tennessee 37235, United States
| |
Collapse
|
27
|
Gainza P, Nisonoff HM, Donald BR. Algorithms for protein design. Curr Opin Struct Biol 2016; 39:16-26. [PMID: 27086078 PMCID: PMC5065368 DOI: 10.1016/j.sbi.2016.03.006] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2015] [Revised: 03/15/2016] [Accepted: 03/22/2016] [Indexed: 02/05/2023]
Abstract
Computational structure-based protein design programs are becoming an increasingly important tool in molecular biology. These programs compute protein sequences that are predicted to fold to a target structure and perform a desired function. The success of a program's predictions largely relies on two components: first, the input biophysical model, and second, the algorithm that computes the best sequence(s) and structure(s) according to the biophysical model. Improving both the model and the algorithm in tandem is essential to improving the success rate of current programs, and here we review recent developments in algorithms for protein design, emphasizing how novel algorithms enable the use of more accurate biophysical models. We conclude with a list of algorithmic challenges in computational protein design that we believe will be especially important for the design of therapeutic proteins and protein assemblies.
Collapse
Affiliation(s)
- Pablo Gainza
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Hunter M Nisonoff
- Department of Computer Science, Duke University, Durham, NC, United States
| | - Bruce R Donald
- Department of Computer Science, Duke University, Durham, NC, United States; Department of Biochemistry, Duke University Medical Center, Durham, NC, United States; Department of Chemistry, Duke University, Durham, NC, United States.
| |
Collapse
|
28
|
Liu H, Chen Q. Computational protein design for given backbone: recent progresses in general method-related aspects. Curr Opin Struct Biol 2016; 39:89-95. [PMID: 27348345 DOI: 10.1016/j.sbi.2016.06.013] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2016] [Revised: 05/18/2016] [Accepted: 06/15/2016] [Indexed: 10/21/2022]
Abstract
To achieve high success rate in protein design requires a reliable sequence design method to find amino acid sequences that stably fold into a desired backbone structure. This problem is addressed by computational protein design through the approach of energy minimization. Here we review recent method progresses related to improving the accuracy of this approach. First, the quality of the energy model is a key factor. Second, with structure sensitive energy functions, whether and how backbone flexibility is considered can have large effects on design accuracy, although usually only small adjustments of the backbone structure itself are involved. Third, the effective accuracy of design results can be boosted by post-processing a small number of designed sequences with complementary models that may not be efficient enough for full sequence optimization. Finally, computational method development will benefit greatly from increasingly efficient experimental approaches that can be applied to obtain extensive feedbacks.
Collapse
Affiliation(s)
- Haiyan Liu
- School of Life Sciences, University of Science and Technology of China, China; Hefei National Laboratory for Physical Sciences at the Microscales, China; Collaborative Innovation Center of Chemistry for Life Sciences, Hefei, Anhui 230027, China; Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, Anhui 230031, China.
| | - Quan Chen
- School of Life Sciences, University of Science and Technology of China, China
| |
Collapse
|
29
|
Abstract
The Rosetta macromolecular modeling software is a versatile, rapidly developing set of tools that are now being routinely utilized to address state-of-the-art research challenges in academia and industrial research settings. A Rosetta Conference (RosettaCon) describing updates to the Rosetta source code is held annually. Every two years, a Rosetta Conference (RosettaCon) special collection describing the results presented at the annual conference by participating RosettaCommons labs is published by the Public Library of Science (PLOS). This is the introduction to the third RosettaCon 2014 Special Collection published by PLOS.
Collapse
Affiliation(s)
- Sagar D. Khare
- Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, NJ, United States of America
- * E-mail: (SDK); (TAW)
| | - Timothy A. Whitehead
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, United States of America
- Department of Biosystems and Agricultural Engineering, Michigan State University, East Lansing, MI, United States of America
- * E-mail: (SDK); (TAW)
| |
Collapse
|