51
|
Adolf-Bryfogle J, Teets FD, Bahl CD. Toward complete rational control over protein structure and function through computational design. Curr Opin Struct Biol 2020; 66:170-177. [PMID: 33276237 DOI: 10.1016/j.sbi.2020.10.015] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2020] [Revised: 10/08/2020] [Accepted: 10/19/2020] [Indexed: 11/28/2022]
Abstract
The grand challenge of protein design is a general method for producing a polypeptide with arbitrary functionality, conformation, and biochemical properties. To that end, a wide variety of methods have been developed for the improvement of native proteins, the design of ideal proteins de novo, and the redesign of suboptimal proteins with better-performing substructures. These methods employ informatic comparisons of function-structure-sequence relationships as well as knowledge-based evaluation of protein properties to narrow the immense protein sequence search space down to an enumerable and often manually evaluable set of structures that meet specified criteria. While arbitrary manipulation of protein-protein interfaces and molecular catalysis remains an unsolved problem, and no protein shape or behavior manipulation algorithm is universally applicable, the promising results thus far are a strong indicator that a general approach to the arbitrary manipulation of polypeptides is within reach.
Collapse
Affiliation(s)
- Jared Adolf-Bryfogle
- Institute for Protein Innovation, Boston, MA 02115, USA; Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Frank D Teets
- Institute for Protein Innovation, Boston, MA 02115, USA; Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Boston, MA 02115, USA; Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, USA; Department of Pediatrics, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
52
|
Robust folding of a de novo designed ideal protein even with most of the core mutated to valine. Proc Natl Acad Sci U S A 2020; 117:31149-31156. [PMID: 33229587 PMCID: PMC7739874 DOI: 10.1073/pnas.2002120117] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
De novo designed proteins exhibit a remarkable property of extremely high thermal stability compared with naturally occurring proteins. The designed proteins are completely optimized for folding; the backbone structures are created by using a set of rules that relate local backbone structures to preferred tertiary motifs and the side chains are designed to favor both the local backbone structures and the entire tertiary structures. Here, we found that one of the de novo designed proteins, which was mutated to fill the core with mostly valine residues, still has the folding ability and shows high stability (Tm = 106 °C) even with its reduced and loosened core packing. This result supports the importance of local backbone structures to protein folding. Protein design provides a stringent test for our understanding of protein folding. We previously described principles for designing ideal protein structures stabilized by consistent local and nonlocal interactions, based on a set of rules relating local backbone structures to tertiary packing motifs. The principles have made possible the design of protein structures having various topologies with high thermal stability. Whereas nonlocal interactions such as tight hydrophobic core packing have traditionally been considered to be crucial for protein folding and stability, the rules proposed by our previous studies suggest the importance of local backbone structures to protein folding. In this study, we investigated the robustness of folding of de novo designed proteins to the reduction of the hydrophobic core, by extensive mutation of large hydrophobic residues (Leu, Ile) to smaller ones (Val) for one of the designs. Surprisingly, even after 10 Leu and Ile residues were mutated to Val, this mutant with the core mostly filled with Val was found to not be in a molten globule state and fold into the same backbone structure as the original design, with high stability. These results indicate the importance of local backbone structures to the folding ability and high thermal stability of designed proteins and suggest a method for engineering thermally stabilized natural proteins.
Collapse
|
53
|
Lipsh-Sokolik R, Listov D, Fleishman SJ. The AbDesign computational pipeline for modular backbone assembly and design of binders and enzymes. Protein Sci 2020; 30:151-159. [PMID: 33040418 PMCID: PMC7737780 DOI: 10.1002/pro.3970] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 10/07/2020] [Accepted: 10/09/2020] [Indexed: 12/12/2022]
Abstract
The functional sites of many protein families are dominated by diverse backbone regions that lack secondary structure (loops) but fold stably into their functionally competent state. Nevertheless, the design of structured loop regions from scratch, especially in functional sites, has met with great difficulty. We therefore developed an approach, called AbDesign, to exploit the natural modularity of many protein families and computationally assemble a large number of new backbones by combining naturally occurring modular fragments. This strategy yielded large, atomically accurate, and highly efficient proteins, including antibodies and enzymes exhibiting dozens of mutations from any natural protein. The combinatorial backbone‐conformation space that can be accessed by AbDesign even for a modestly sized family of homologs may exceed the diversity in the entire PDB, providing the sub‐Ångstrom level of control over the positioning of active‐site groups that is necessary for obtaining highly active proteins. This manuscript describes how to implement the pipeline using code that is freely available at https://github.com/Fleishman‐Lab/AbDesign_for_enzymes.
Collapse
Affiliation(s)
| | - Dina Listov
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| |
Collapse
|
54
|
Yeh CT, Obendorf L, Parmeggiani F. Elfin UI: A Graphical Interface for Protein Design With Modular Building Blocks. Front Bioeng Biotechnol 2020; 8:568318. [PMID: 33195130 PMCID: PMC7644802 DOI: 10.3389/fbioe.2020.568318] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 10/02/2020] [Indexed: 02/01/2023] Open
Abstract
Molecular models have enabled understanding of biological structures and functions and allowed design of novel macro-molecules. Graphical user interfaces (GUIs) in molecular modeling are generally focused on atomic representations, but, especially for proteins, do not usually address designs of complex and large architectures, from nanometers to microns. Therefore, we have developed Elfin UI as a Blender add-on for the interactive design of large protein architectures with custom shapes. Elfin UI relies on compatible building blocks to design single- and multiple-chain protein structures. The software can be used: (1) as an interactive environment to explore building blocks combinations; and (2) as a computer aided design (CAD) tool to define target shapes that guide automated design. Elfin UI allows users to rapidly build new protein shapes, without the need to focus on amino acid sequence, and aims to make design of proteins and protein-based materials intuitive and accessible to researchers and members of the general public with limited expertise in protein engineering.
Collapse
Affiliation(s)
- Chun-Ting Yeh
- School of Chemistry and School of Biochemistry, University of Bristol, Bristol, United Kingdom
| | - Leon Obendorf
- School of Chemistry and School of Biochemistry, University of Bristol, Bristol, United Kingdom.,Institute of Chemistry and Biochemistry, Freie Universität Berlin, Berlin, Germany
| | - Fabio Parmeggiani
- School of Chemistry and School of Biochemistry, University of Bristol, Bristol, United Kingdom.,Bristol Biodesign Institute and BrisSynBio, a BBSRC/EPSRC Synthetic Biology Research Centre, University of Bristol, Bristol, United Kingdom
| |
Collapse
|
55
|
Lucas JE, Kortemme T. New computational protein design methods for de novo small molecule binding sites. PLoS Comput Biol 2020; 16:e1008178. [PMID: 33017412 PMCID: PMC7575090 DOI: 10.1371/journal.pcbi.1008178] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2020] [Revised: 10/20/2020] [Accepted: 07/22/2020] [Indexed: 11/19/2022] Open
Abstract
Protein binding to small molecules is fundamental to many biological processes, yet it remains challenging to predictively design this functionality de novo. Current state-of-the-art computational design methods typically rely on existing small molecule binding sites or protein scaffolds with existing shape complementarity for a target ligand. Here we introduce new methods that utilize pools of discrete contacts between protein side chains and defined small molecule ligand substructures (ligand fragments) observed in the Protein Data Bank. We use the Rosetta Molecular Modeling Suite to recombine protein side chains in these contact pools to generate hundreds of thousands of energetically favorable binding sites for a target ligand. These composite binding sites are built into existing scaffold proteins matching the intended binding site geometry with high accuracy. In addition, we apply pools of side chain rotamers interacting with the target ligand to augment Rosetta's conventional design machinery and improve key metrics known to be predictive of design success. We demonstrate that our method reliably builds diverse binding sites into different scaffold proteins for a variety of target molecules. Our generalizable de novo ligand binding site design method provides a foundation for versatile design of protein to interface previously unattainable molecules for applications in medical diagnostics and synthetic biology.
Collapse
Affiliation(s)
- James E. Lucas
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States of America
| | - Tanja Kortemme
- UC Berkeley–UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, United States of America
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, United States of America
| |
Collapse
|
56
|
Pan X, Thompson MC, Zhang Y, Liu L, Fraser JS, Kelly MJS, Kortemme T. Expanding the space of protein geometries by computational design of de novo fold families. Science 2020; 369:1132-1136. [PMID: 32855341 DOI: 10.1126/science.abc0881] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2020] [Accepted: 07/14/2020] [Indexed: 01/03/2023]
Abstract
Naturally occurring proteins vary the precise geometries of structural elements to create distinct shapes optimal for function. We present a computational design method, loop-helix-loop unit combinatorial sampling (LUCS), that mimics nature's ability to create families of proteins with the same overall fold but precisely tunable geometries. Through near-exhaustive sampling of loop-helix-loop elements, LUCS generates highly diverse geometries encompassing those found in nature but also surpassing known structure space. Biophysical characterization showed that 17 (38%) of 45 tested LUCS designs encompassing two different structural topologies were well folded, including 16 with designed non-native geometries. Four experimentally solved structures closely matched the designs. LUCS greatly expands the designable structure space and offers a new paradigm for designing proteins with tunable geometries that may be customizable for novel functions.
Collapse
Affiliation(s)
- Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA. .,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, USA
| | - Michael C Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Yang Zhang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - Lin Liu
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California, San Francisco, CA, USA
| | - Mark J S Kelly
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, USA. .,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California, San Francisco, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
57
|
Basanta B, Bick MJ, Bera AK, Norn C, Chow CM, Carter LP, Goreshnik I, Dimaio F, Baker D. An enumerative algorithm for de novo design of proteins with diverse pocket structures. Proc Natl Acad Sci U S A 2020; 117:22135-22145. [PMID: 32839327 PMCID: PMC7486743 DOI: 10.1073/pnas.2005412117] [Citation(s) in RCA: 56] [Impact Index Per Article: 11.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To create new enzymes and biosensors from scratch, precise control over the structure of small-molecule binding sites is of paramount importance, but systematically designing arbitrary protein pocket shapes and sizes remains an outstanding challenge. Using the NTF2-like structural superfamily as a model system, we developed an enumerative algorithm for creating a virtually unlimited number of de novo proteins supporting diverse pocket structures. The enumerative algorithm was tested and refined through feedback from two rounds of large-scale experimental testing, involving in total the assembly of synthetic genes encoding 7,896 designs and assessment of their stability on yeast cell surface, detailed biophysical characterization of 64 designs, and crystal structures of 5 designs. The refined algorithm generates proteins that remain folded at high temperatures and exhibit more pocket diversity than naturally occurring NTF2-like proteins. We expect this approach to transform the design of small-molecule sensors and enzymes by enabling the creation of binding and active site geometries much more optimal for specific design challenges than is accessible by repurposing the limited number of naturally occurring NTF2-like proteins.
Collapse
Affiliation(s)
- Benjamin Basanta
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Matthew J Bick
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Asim K Bera
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Christoffer Norn
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Cameron M Chow
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Lauren P Carter
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Inna Goreshnik
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - Frank Dimaio
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
| | - David Baker
- Institute for Protein Design, University of Washington, Seattle, WA 98195;
- Biochemistry Department, School of Medicine, University of Washington, Seattle, WA 98195
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195
| |
Collapse
|
58
|
Chowdhury R, Grisewood MJ, Boorla VS, Yan Q, Pfleger BF, Maranas CD. IPRO+/-: Computational Protein Design Tool Allowing for Insertions and Deletions. Structure 2020; 28:1344-1357.e4. [PMID: 32857964 DOI: 10.1016/j.str.2020.08.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/09/2020] [Revised: 07/01/2020] [Accepted: 08/07/2020] [Indexed: 12/30/2022]
Abstract
Insertions and deletions (indels) in protein sequences alter the residue spacing along the polypeptide backbone and consequently open up possibilities for tuning protein function in a way that is inaccessible by amino acid substitution alone. We describe an optimization-based computational protein redesign approach centered around predicting beneficial combinations of indels along with substitutions and also obtain putative substrate-docked structures for these protein variants. This modified algorithmic capability would be of interest for enzyme engineering and broadly inform other protein design tasks. We highlight this capability by (1) identifying active variants of a bacterial thioesterase enzyme ('TesA) with experimental corroboration, (2) recapitulating existing active TEM-1 β-Lactamase sequences of different sizes, and (3) identifying shorter 4-Coumarate:CoA ligases with enhanced in vitro activities toward non-native substrates. A separate PyRosetta-based open-source tool, Indel-Maker (http://www.maranasgroup.com/software.htm), has also been created to construct computational models of user-defined protein variants with specific indels and substitutions.
Collapse
Affiliation(s)
- Ratul Chowdhury
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Matthew J Grisewood
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Veda Sheersh Boorla
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA
| | - Qiang Yan
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Brian F Pfleger
- Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Costas D Maranas
- Department of Chemical Engineering, The Pennsylvania State University, University Park, PA 16802, USA.
| |
Collapse
|
59
|
Leman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, et alLeman JK, Weitzner BD, Lewis SM, Adolf-Bryfogle J, Alam N, Alford RF, Aprahamian M, Baker D, Barlow KA, Barth P, Basanta B, Bender BJ, Blacklock K, Bonet J, Boyken SE, Bradley P, Bystroff C, Conway P, Cooper S, Correia BE, Coventry B, Das R, De Jong RM, DiMaio F, Dsilva L, Dunbrack R, Ford AS, Frenz B, Fu DY, Geniesse C, Goldschmidt L, Gowthaman R, Gray JJ, Gront D, Guffy S, Horowitz S, Huang PS, Huber T, Jacobs TM, Jeliazkov JR, Johnson DK, Kappel K, Karanicolas J, Khakzad H, Khar KR, Khare SD, Khatib F, Khramushin A, King IC, Kleffner R, Koepnick B, Kortemme T, Kuenze G, Kuhlman B, Kuroda D, Labonte JW, Lai JK, Lapidoth G, Leaver-Fay A, Lindert S, Linsky T, London N, Lubin JH, Lyskov S, Maguire J, Malmström L, Marcos E, Marcu O, Marze NA, Meiler J, Moretti R, Mulligan VK, Nerli S, Norn C, Ó'Conchúir S, Ollikainen N, Ovchinnikov S, Pacella MS, Pan X, Park H, Pavlovicz RE, Pethe M, Pierce BG, Pilla KB, Raveh B, Renfrew PD, Burman SSR, Rubenstein A, Sauer MF, Scheck A, Schief W, Schueler-Furman O, Sedan Y, Sevy AM, Sgourakis NG, Shi L, Siegel JB, Silva DA, Smith S, Song Y, Stein A, Szegedy M, Teets FD, Thyme SB, Wang RYR, Watkins A, Zimmerman L, Bonneau R. Macromolecular modeling and design in Rosetta: recent methods and frameworks. Nat Methods 2020; 17:665-680. [PMID: 32483333 PMCID: PMC7603796 DOI: 10.1038/s41592-020-0848-2] [Show More Authors] [Citation(s) in RCA: 484] [Impact Index Per Article: 96.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2019] [Accepted: 04/22/2020] [Indexed: 12/12/2022]
Abstract
The Rosetta software for macromolecular modeling, docking and design is extensively used in laboratories worldwide. During two decades of development by a community of laboratories at more than 60 institutions, Rosetta has been continuously refactored and extended. Its advantages are its performance and interoperability between broad modeling capabilities. Here we review tools developed in the last 5 years, including over 80 methods. We discuss improvements to the score function, user interfaces and usability. Rosetta is available at http://www.rosettacommons.org.
Collapse
Affiliation(s)
- Julia Koehler Leman
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
| | - Brian D Weitzner
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Steven M Lewis
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
- Department of Biochemistry, Duke University, Durham, NC, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Nawsad Alam
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Melanie Aprahamian
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Kyle A Barlow
- Graduate Program in Bioinformatics, University of California San Francisco, San Francisco, CA, USA
| | - Patrick Barth
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Benjamin Basanta
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Biological Physics Structure and Design PhD Program, University of Washington, Seattle, WA, USA
| | - Brian J Bender
- Department of Pharmacology, Vanderbilt University, Nashville, TN, USA
| | - Kristin Blacklock
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Scott E Boyken
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Lyell Immunopharma Inc., Seattle, WA, USA
| | - Phil Bradley
- Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Chris Bystroff
- Department of Biological Sciences, Rensselaer Polytechnic Institute, Troy, NY, USA
| | - Patrick Conway
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Seth Cooper
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Bruno E Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Brian Coventry
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Rhiju Das
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lorna Dsilva
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Roland Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Alexander S Ford
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brandon Frenz
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Darwin Y Fu
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Caleb Geniesse
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | | | - Ragul Gowthaman
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, MD, USA
| | - Dominik Gront
- Faculty of Chemistry, Biological and Chemical Research Centre, University of Warsaw, Warsaw, Poland
| | - Sharon Guffy
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Scott Horowitz
- Department of Chemistry & Biochemistry, University of Denver, Denver, CO, USA
- The Knoebel Institute for Healthy Aging, University of Denver, Denver, CO, USA
| | - Po-Ssu Huang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Thomas Huber
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Tim M Jacobs
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | | | - David K Johnson
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Kalli Kappel
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - John Karanicolas
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, USA
| | - Hamed Khakzad
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
| | - Karen R Khar
- Cyrus Biotechnology, Seattle, WA, USA
- Center for Computational Biology, University of Kansas, Lawrence, KS, USA
| | - Sagar D Khare
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Firas Khatib
- Department of Computer and Information Science, University of Massachusetts Dartmouth, Dartmouth, MA, USA
| | - Alisa Khramushin
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Indigo C King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Robert Kleffner
- Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA
| | - Brian Koepnick
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Georg Kuenze
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Daisuke Kuroda
- Medical Device Development and Regulation Research Center, School of Engineering, University of Tokyo, Tokyo, Japan
- Department of Bioengineering, School of Engineering, University of Tokyo, Tokyo, Japan
| | - Jason W Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Department of Chemistry, Franklin & Marshall College, Lancaster, PA, USA
| | - Jason K Lai
- Baylor College of Medicine, Department of Pharmacology, Houston, TX, USA
| | - Gideon Lapidoth
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Andrew Leaver-Fay
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Steffen Lindert
- Department of Chemistry and Biochemistry, Ohio State University, Columbus, OH, USA
| | - Thomas Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Nir London
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Joseph H Lubin
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Lars Malmström
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- Institute for Computational Science, University of Zurich, Zurich, Switzerland
- S3IT, University of Zurich, Zurich, Switzerland
- Division of Infection Medicine, Department of Clinical Sciences Lund, Faculty of Medicine, Lund University, Lund, Sweden
| | - Enrique Marcos
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine Barcelona, The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Orly Marcu
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Nicholas A Marze
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Jens Meiler
- Center for Structural Biology, Vanderbilt University, Nashville, TN, USA
- Departments of Chemistry, Pharmacology and Biomedical Informatics, Vanderbilt University, Nashville, TN, USA
- Institute for Chemical Biology, Vanderbilt University, Nashville, TN, USA
| | - Rocco Moretti
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Vikram Khipple Mulligan
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Santrupti Nerli
- Department of Computer Science, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Christoffer Norn
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - Shane Ó'Conchúir
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Sergey Ovchinnikov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Molecular and Cellular Biology Program, University of Washington, Seattle, WA, USA
| | - Michael S Pacella
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Hahnbeom Park
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ryan E Pavlovicz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Manasi Pethe
- Department of Chemistry and Chemical Biology, The State University of New Jersey, Piscataway, NJ, USA
- Center for Integrative Proteomics Research, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Brian G Pierce
- University of Maryland Institute for Bioscience and Biotechnology Research, Rockville, MD, USA
- Department of Cell Biology and Molecular Genetics, University of Maryland, College Park, MD, USA
| | - Kala Bharath Pilla
- Research School of Chemistry, Australian National University, Canberra, Australian Capital Territory, Australia
| | - Barak Raveh
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - P Douglas Renfrew
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Aliza Rubenstein
- Institute of Quantitative Biomedicine, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
- Computational Biology and Molecular Biophysics Program, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Marion F Sauer
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA, USA
| | - Ora Schueler-Furman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Yuval Sedan
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Alexander M Sevy
- Chemical and Physical Biology Program, Vanderbilt Vaccine Center, Vanderbilt University, Nashville, TN, USA
| | - Nikolaos G Sgourakis
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA, USA
| | - Lei Shi
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Justin B Siegel
- Department of Chemistry, University of California, Davis, Davis, CA, USA
- Department of Biochemistry and Molecular Medicine, University of California, Davis, Davis, California, USA
- Genome Center, University of California, Davis, Davis, CA, USA
| | | | - Shannon Smith
- Department of Chemistry, Vanderbilt University, Nashville, TN, USA
| | - Yifan Song
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Cyrus Biotechnology, Seattle, WA, USA
| | - Amelie Stein
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Maria Szegedy
- Department of Chemistry and Chemical Biology, Rutgers, The State University of New Jersey, Piscataway, NJ, USA
| | - Frank D Teets
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Summer B Thyme
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Ray Yu-Ruei Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Andrew Watkins
- Department of Biochemistry, Stanford University School of Medicine, Stanford, CA, USA
| | - Lior Zimmerman
- Department of Microbiology and Molecular Genetics, IMRIC, Ein Kerem Faculty of Medicine, Hebrew University of Jerusalem, Jerusalem, Israel
| | - Richard Bonneau
- Center for Computational Biology, Flatiron Institute, Simons Foundation, New York, NY, USA.
- Department of Biology, New York University, New York, New York, USA.
- Department of Computer Science, New York University, New York, NY, USA.
- Center for Data Science, New York University, New York, NY, USA.
| |
Collapse
|
60
|
Lapenta F, Jerala R. Design of novel protein building modules and modular architectures. Curr Opin Struct Biol 2020; 63:90-96. [PMID: 32505942 DOI: 10.1016/j.sbi.2020.04.006] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 04/14/2020] [Accepted: 04/15/2020] [Indexed: 12/31/2022]
Abstract
Nature uses only a limited number of protein topologies and while several folds have evolved independently over time, there are clearly many possible topologies that have not been explored by evolution. With recent advances of protein design concepts, computational modeling tools, high resolution and high-throughput experimental methods it is now possible to design new protein architectures. The collection of building blocks and design principles widened both in size and complexity, offering an expanded toolset for building new modular folds and functional protein structures. Here we review and discuss recent achievements of protein design, focusing in particular on the use and prospects of modular approaches for assembling new protein folds.
Collapse
Affiliation(s)
- Fabio Lapenta
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia
| | - Roman Jerala
- Department of Synthetic Biology and Immunology, National Institute of Chemistry, Ljubljana, Slovenia; EN-FIST Centre of Excellence, Ljubljana, Slovenia.
| |
Collapse
|
61
|
Glasgow AA, Huang YM, Mandell DJ, Thompson M, Ritterson R, Loshbaugh AL, Pellegrino J, Krivacic C, Pache RA, Barlow KA, Ollikainen N, Jeon D, Kelly MJS, Fraser JS, Kortemme T. Computational design of a modular protein sense-response system. Science 2020; 366:1024-1028. [PMID: 31754004 DOI: 10.1126/science.aax8780] [Citation(s) in RCA: 83] [Impact Index Per Article: 16.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2019] [Accepted: 10/07/2019] [Indexed: 12/28/2022]
Abstract
Sensing and responding to signals is a fundamental ability of living systems, but despite substantial progress in the computational design of new protein structures, there is no general approach for engineering arbitrary new protein sensors. Here, we describe a generalizable computational strategy for designing sensor-actuator proteins by building binding sites de novo into heterodimeric protein-protein interfaces and coupling ligand sensing to modular actuation through split reporters. Using this approach, we designed protein sensors that respond to farnesyl pyrophosphate, a metabolic intermediate in the production of valuable compounds. The sensors are functional in vitro and in cells, and the crystal structure of the engineered binding site closely matches the design model. Our computational design strategy opens broad avenues to link biological outputs to new signals.
Collapse
Affiliation(s)
- Anum A Glasgow
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Yao-Ming Huang
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Daniel J Mandell
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Bioinformatics Graduate Program, University of California San Francisco, San Francisco, CA, USA
| | - Michael Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Ryan Ritterson
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Amanda L Loshbaugh
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, USA
| | - Jenna Pellegrino
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, USA
| | - Cody Krivacic
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, USA
| | - Roland A Pache
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Kyle A Barlow
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Bioinformatics Graduate Program, University of California San Francisco, San Francisco, CA, USA
| | - Noah Ollikainen
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Bioinformatics Graduate Program, University of California San Francisco, San Francisco, CA, USA
| | - Deborah Jeon
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA
| | - Mark J S Kelly
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, USA
| | - James S Fraser
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA.,Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, CA, USA. .,Bioinformatics Graduate Program, University of California San Francisco, San Francisco, CA, USA.,Biophysics Graduate Program, University of California San Francisco, San Francisco, CA, USA.,UC Berkeley-UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, CA, USA.,Quantitative Biosciences Institute, University of California San Francisco, San Francisco, CA, USA.,Chan Zuckerberg Biohub, San Francisco, CA, USA
| |
Collapse
|
62
|
Brunette TJ, Bick MJ, Hansen JM, Chow CM, Kollman JM, Baker D. Modular repeat protein sculpting using rigid helical junctions. Proc Natl Acad Sci U S A 2020; 117:8870-8875. [PMID: 32245816 PMCID: PMC7183188 DOI: 10.1073/pnas.1908768117] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/18/2023] Open
Abstract
The ability to precisely design large proteins with diverse shapes would enable applications ranging from the design of protein binders that wrap around their target to the positioning of multiple functional sites in specified orientations. We describe a protein backbone design method for generating a wide range of rigid fusions between helix-containing proteins and use it to design 75,000 structurally unique junctions between monomeric and homo-oligomeric de novo designed and ankyrin repeat proteins (RPs). Of the junction designs that were experimentally characterized, 82% have circular dichroism and solution small-angle X-ray scattering profiles consistent with the design models and are stable at 95 °C. Crystal structures of four designed junctions were in close agreement with the design models with rmsds ranging from 0.9 to 1.6 Å. Electron microscopic images of extended tetrameric structures and ∼10-nm-diameter "L" and "V" shapes generated using the junctions are close to the design models, demonstrating the control the rigid junctions provide for protein shape sculpting over multiple nanometer length scales.
Collapse
Affiliation(s)
- T J Brunette
- Department of Biochemistry, University of Washington, Seattle, WA 98195;
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Matthew J Bick
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Jesse M Hansen
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Graduate Program in Biological Physics, Structure, and Design, University of Washington, Seattle, WA 98195
| | - Cameron M Chow
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
| | - Justin M Kollman
- Department of Biochemistry, University of Washington, Seattle, WA 98195
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195
- Institute for Protein Design, University of Washington, Seattle, WA 98195
- Howard Hughes Medical Institute, University of Washington, Seattle, WA 98195
| |
Collapse
|
63
|
Kuhlman B. Designing protein structures and complexes with the molecular modeling program Rosetta. J Biol Chem 2019; 294:19436-19443. [PMID: 31699898 DOI: 10.1074/jbc.aw119.008144] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
Proteins perform an amazingly diverse set of functions in all aspects of life. Critical to the function of many proteins are the highly specific three-dimensional structures they adopt. For this reason, there is strong interest in learning how to rationally design proteins that adopt user-defined structures. Over the last 25 years, there has been significant progress in the field of computational protein design as rotamer-based sequence optimization protocols have enabled accurate design of protein tertiary and quaternary structure. In this award article, I will summarize how the molecular modeling program Rosetta is used to design new protein structures and describe how we have taken advantage of this capability to create proteins that have important applications in research and medicine. I will highlight three protein design stories: the use of protein interface design to create therapeutic bispecific antibodies, the engineering of light-inducible proteins that can be used to recruit proteins to specific locations in the cell, and the de novo design of new protein structures from pieces of naturally occurring proteins.
Collapse
Affiliation(s)
- Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill School of Medicine, Chapel Hill, North Carolina 27599-7260 .,Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina 27599
| |
Collapse
|
64
|
Kuhlman B, Bradley P. Advances in protein structure prediction and design. Nat Rev Mol Cell Biol 2019; 20:681-697. [PMID: 31417196 PMCID: PMC7032036 DOI: 10.1038/s41580-019-0163-x] [Citation(s) in RCA: 437] [Impact Index Per Article: 72.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2019] [Indexed: 12/18/2022]
Abstract
The prediction of protein three-dimensional structure from amino acid sequence has been a grand challenge problem in computational biophysics for decades, owing to its intrinsic scientific interest and also to the many potential applications for robust protein structure prediction algorithms, from genome interpretation to protein function prediction. More recently, the inverse problem - designing an amino acid sequence that will fold into a specified three-dimensional structure - has attracted growing attention as a potential route to the rational engineering of proteins with functions useful in biotechnology and medicine. Methods for the prediction and design of protein structures have advanced dramatically in the past decade. Increases in computing power and the rapid growth in protein sequence and structure databases have fuelled the development of new data-intensive and computationally demanding approaches for structure prediction. New algorithms for designing protein folds and protein-protein interfaces have been used to engineer novel high-order assemblies and to design from scratch fluorescent proteins with novel or enhanced properties, as well as signalling proteins with therapeutic potential. In this Review, we describe current approaches for protein structure prediction and design and highlight a selection of the successful applications they have enabled.
Collapse
Affiliation(s)
- Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, USA.
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, USA.
| | - Philip Bradley
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
| |
Collapse
|
65
|
Towards functional de novo designed proteins. Curr Opin Chem Biol 2019; 52:102-111. [DOI: 10.1016/j.cbpa.2019.06.011] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 04/25/2019] [Accepted: 06/06/2019] [Indexed: 12/31/2022]
|
66
|
Cheung NJ, Yu W. Sibe: a computation tool to apply protein sequence statistics to predict folding and design in silico. BMC Bioinformatics 2019; 20:455. [PMID: 31492097 PMCID: PMC6728967 DOI: 10.1186/s12859-019-2984-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2019] [Accepted: 07/03/2019] [Indexed: 02/01/2023] Open
Abstract
BACKGROUND Evolutionary information contained in the amino acid sequences of proteins specifies the biological function and fold, but exactly what information contained in the protein sequence drives both of these processes? Considerable progress has been made to answer this fundamental question, but it remains challenging to explore the potential space of cooperative interactions between amino acids. Statistical analysis plays a significant role in studying such interactions and its use has expanded in recent years to studies ranging from coevolution-guided rational protein design to protein folding in silico. RESULTS Here we describe a computational tool named Sibe for use in studies of protein sequence, folding, and design using evolutionary coupling between amino acids as a driving factor. In this study, Sibe is used to identify positionally conserved couplings between pairwise amino acids and aid rational protein design. In this process, pairwise couplings are filtered according to the relative entropy computed from the positional conservations and grouped into several 'blocks', which could contribute to driving protein folding and design. A human β2-adrenergic receptor (β2AR) was used to demonstrate that those 'blocks' contribute the rational design for specifying functional residues. Sibe also provides folding modules based on both the positionally conserved couplings and well-established statistical potentials for simulating protein folding in silico and predicting tertiary structure. Our results show that statistically inferences of basic evolutionary principles, such as conservations and coupled-mutations, can be used to rapidly design a diverse set of proteins and study protein folding. CONCLUSIONS The developed software Sibe provides a computational tool for systematical analysis from protein primary to its tertiary structure using the evolutionary couplings as a driving factor. Sibe, written in C++, accounts for compatibility with the 'big data' era in biological science, and it primarily focuses on protein sequence analysis, but it is also applicable to extend to other modeling and predictions of experimental measurements.
Collapse
Affiliation(s)
- Ngaam J. Cheung
- Department of Brain and Cognitive Science, DGIST, Daegu, 42988 South Korea
- Cavendish Laboratory, Department of Physics, University of Cambridge, Cambridge, CB3 0HA UK
| | - Wookyung Yu
- Department of Brain and Cognitive Science, DGIST, Daegu, 42988 South Korea
- Core Protein Resources Center, DGIST, Daegu, 42988 South Korea
| |
Collapse
|
67
|
Berezovsky IN. Towards descriptor of elementary functions for protein design. Curr Opin Struct Biol 2019; 58:159-165. [PMID: 31352188 DOI: 10.1016/j.sbi.2019.06.010] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2019] [Accepted: 06/18/2019] [Indexed: 11/18/2022]
Abstract
We review studies of the protein evolution that help to formulate rules for protein design. Acknowledging the fundamental importance of Dayhoff's provision on the emergence of functional proteins from short peptides, we discuss multiple evidences of the omnipresent partitioning of protein globules into structural/functional units, using which greatly facilitates the engineering and design efforts. Closed loops and elementary functional loops, which are descendants of ancient ring-like peptides that formed fist protein domains in agreement with Dayhoff's hypothesis, can be considered as basic units of protein structure and function. We argue that future developments in protein design approaches should consider descriptors of the elementary functions, which will help to complement designed scaffolds with functional signatures and flexibility necessary for their functions.
Collapse
Affiliation(s)
- Igor N Berezovsky
- Bioinformatics Institute (BII), Agency for Science, Technology and Research (A⁎STAR), 30 Biopolis Street, #07-01, Matrix 138671, Singapore; Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore.
| |
Collapse
|
68
|
Abstract
Online citizen science projects such as GalaxyZoo1, Eyewire2 and Phylo3 have been very successful for data collection, annotation, and processing, but for the most part have harnessed human pattern recognition skills rather than human creativity. An exception is the game EteRNA4, in which game players learn to build new RNA structures by exploring the discrete two-dimensional space of Watson-Crick base pairing possibilities. Building new proteins, however, is a more challenging task to present in a game, as both the representation and evaluation of a protein structure are intrinsically three-dimensional. We posed the challenge of de novo protein design in the online protein folding game Foldit5. Players were presented with a fully extended peptide chain and challenged to craft a folded protein structure with an amino acid sequence encoding that structure. After many iterations of player design, analysis of the top scoring solutions, and subsequent game improvement, Foldit players can now, starting from an extended polypeptide chain, generate a diversity of protein structures and sequences which encode them in silico. 146 Foldit player designs with sequences unrelated to naturally occurring proteins were encoded in synthetic genes; 56 were found to be expressed in E. coli with good solubility and to adopt stable monomeric folded structures in solution. The diversity of these structures is unprecedented in de novo protein design, representing 20 different folds—including a new fold not observed in natural proteins. High resolution structures were determined for four of the designs, and are nearly identical to the player models. This work makes explicit the considerable implicit knowledge contributing to success in de novo protein design, and shows that citizen scientists can discover creative new solutions to outstanding scientific challenges, such as the protein design problem.
Collapse
|
69
|
Frappier V, Jenson JM, Zhou J, Grigoryan G, Keating AE. Tertiary Structural Motif Sequence Statistics Enable Facile Prediction and Design of Peptides that Bind Anti-apoptotic Bfl-1 and Mcl-1. Structure 2019; 27:606-617.e5. [PMID: 30773399 PMCID: PMC6447450 DOI: 10.1016/j.str.2019.01.008] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 12/20/2018] [Accepted: 01/18/2019] [Indexed: 12/25/2022]
Abstract
Understanding the relationship between protein sequence and structure well enough to design new proteins with desired functions is a longstanding goal in protein science. Here, we show that recurring tertiary structural motifs (TERMs) in the PDB provide rich information for protein-peptide interaction prediction and design. TERM statistics can be used to predict peptide binding energies for Bcl-2 family proteins as accurately as widely used structure-based tools. Furthermore, design using TERM energies (dTERMen) rapidly and reliably generates high-affinity peptide binders of anti-apoptotic proteins Bfl-1 and Mcl-1 with just 15%-38% sequence identity to any known native Bcl-2 family protein ligand. High-resolution structures of four designed peptides bound to their targets provide opportunities to analyze the strengths and limitations of the computational design method. Our results support dTERMen as a powerful approach that can complement existing tools for protein engineering.
Collapse
Affiliation(s)
- Vincent Frappier
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Justin M Jenson
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Jianfu Zhou
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA
| | - Gevorg Grigoryan
- Department of Computer Science, Dartmouth College, Hanover, NH 03755, USA; Institute for Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH 03755, USA; Department of Biological Sciences, Dartmouth College, Hanover, NH 03755, USA.
| | - Amy E Keating
- Department of Biology, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA; Koch Center for Integrative Cancer Research, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.
| |
Collapse
|
70
|
Room Temperature X-Ray Crystallography Reveals Conformational Heterogeneity of Engineered Proteins. Structure 2019; 25:691-692. [PMID: 28467914 DOI: 10.1016/j.str.2017.04.004] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022]
Abstract
In this issue of Structure, Biel et al. (2017) present multi-conformer analyses from room temperature X-ray data of two ubiquitin phage display variants binding deubiquitinase USP7. The first contains core mutations. The second matured variant contains additional surface mutations. Alternate conformations detected in the core mutant were removed by maturation.
Collapse
|
71
|
Keedy DA. Journey to the center of the protein: allostery from multitemperature multiconformer X-ray crystallography. Acta Crystallogr D Struct Biol 2019; 75:123-137. [PMID: 30821702 PMCID: PMC6400254 DOI: 10.1107/s2059798318017941] [Citation(s) in RCA: 26] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Accepted: 12/19/2018] [Indexed: 02/08/2023] Open
Abstract
Proteins inherently fluctuate between conformations to perform functions in the cell. For example, they sample product-binding, transition-state-stabilizing and product-release states during catalysis, and they integrate signals from remote regions of the structure for allosteric regulation. However, there is a lack of understanding of how these dynamic processes occur at the basic atomic level. This gap can be at least partially addressed by combining variable-temperature (instead of traditional cryogenic temperature) X-ray crystallography with algorithms for modeling alternative conformations based on electron-density maps, in an approach called multitemperature multiconformer X-ray crystallography (MMX). Here, the use of MMX to reveal alternative conformations at different sites in a protein structure and to estimate the degree of energetic coupling between them is discussed. These insights can suggest testable hypotheses about allosteric mechanisms. Temperature is an easily manipulated experimental parameter, so the MMX approach is widely applicable to any protein that yields well diffracting crystals. Moreover, the general principles of MMX are extensible to other perturbations such as pH, pressure, ligand concentration etc. Future work will explore strategies for leveraging X-ray data across such perturbation series to more quantitatively measure how different parts of a protein structure are coupled to each other, and the consequences thereof for allostery and other aspects of protein function.
Collapse
Affiliation(s)
- Daniel A. Keedy
- Structural Biology Initiative, CUNY Advanced Science Research Center, New York, USA
- Department of Chemistry and Biochemistry, City College of New York, New York, USA
- PhD Programs in Chemistry and Biochemistry, The Graduate Center of the City University of New York, New York, USA
| |
Collapse
|
72
|
Silva DA, Yu S, Ulge UY, Spangler JB, Jude KM, Labão-Almeida C, Ali LR, Quijano-Rubio A, Ruterbusch M, Leung I, Biary T, Crowley SJ, Marcos E, Walkey CD, Weitzner BD, Pardo-Avila F, Castellanos J, Carter L, Stewart L, Riddell SR, Pepper M, Bernardes GJL, Dougan M, Garcia KC, Baker D. De novo design of potent and selective mimics of IL-2 and IL-15. Nature 2019; 565:186-191. [PMID: 30626941 PMCID: PMC6521699 DOI: 10.1038/s41586-018-0830-7] [Citation(s) in RCA: 364] [Impact Index Per Article: 60.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2018] [Accepted: 11/15/2018] [Indexed: 12/28/2022]
Abstract
We describe a de novo computational approach for designing proteins that recapitulate the binding sites of natural cytokines, but are otherwise unrelated in topology or amino acid sequence. We use this strategy to design mimics of the central immune cytokine interleukin-2 (IL-2) that bind to the IL-2 receptor βγc heterodimer (IL-2Rβγc) but have no binding site for IL-2Rα (also called CD25) or IL-15Rα (also known as CD215). The designs are hyper-stable, bind human and mouse IL-2Rβγc with higher affinity than the natural cytokines, and elicit downstream cell signalling independently of IL-2Rα and IL-15Rα. Crystal structures of the optimized design neoleukin-2/15 (Neo-2/15), both alone and in complex with IL-2Rβγc, are very similar to the designed model. Neo-2/15 has superior therapeutic activity to IL-2 in mouse models of melanoma and colon cancer, with reduced toxicity and undetectable immunogenicity. Our strategy for building hyper-stable de novo mimetics could be applied generally to signalling proteins, enabling the creation of superior therapeutic candidates.
Collapse
Affiliation(s)
- Daniel-Adriano Silva
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
| | - Shawn Yu
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Umut Y Ulge
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Jamie B Spangler
- Departments of Biomedical Engineering and Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
- Departments of Molecular and Cellular Physiology and Structural Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Kevin M Jude
- Departments of Molecular and Cellular Physiology and Structural Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Carlos Labão-Almeida
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
| | - Lestat R Ali
- Division of Gastroenterology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Alfredo Quijano-Rubio
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Mikel Ruterbusch
- Department of Immunology, University of Washington School of Medicine, Seattle, WA, USA
| | - Isabel Leung
- Fred Hutchinson Cancer Research Center, Clinical Research Division, Seattle, WA, USA
| | - Tamara Biary
- Division of Gastroenterology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Stephanie J Crowley
- Division of Gastroenterology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - Enrique Marcos
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Spain
| | - Carl D Walkey
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Brian D Weitzner
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Fátima Pardo-Avila
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA, USA
| | - Javier Castellanos
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Biochemistry, University of Washington, Seattle, WA, USA
| | - Lauren Carter
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Lance Stewart
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Stanley R Riddell
- Fred Hutchinson Cancer Research Center, Clinical Research Division, Seattle, WA, USA
| | - Marion Pepper
- Department of Immunology, University of Washington School of Medicine, Seattle, WA, USA
| | - Gonçalo J L Bernardes
- Instituto de Medicina Molecular, Faculdade de Medicina, Universidade de Lisboa, Lisbon, Portugal
- Department of Chemistry, University of Cambridge, Cambridge, UK
| | - Michael Dougan
- Division of Gastroenterology, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
| | - K Christopher Garcia
- Departments of Molecular and Cellular Physiology and Structural Biology, Stanford University School of Medicine, Stanford, CA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| | - David Baker
- Institute for Protein Design, University of Washington, Seattle, WA, USA.
- Department of Biochemistry, University of Washington, Seattle, WA, USA.
- Howard Hughes Medical Institute, Chevy Chase, MD, USA.
| |
Collapse
|
73
|
Netzer R, Listov D, Lipsh R, Dym O, Albeck S, Knop O, Kleanthous C, Fleishman SJ. Ultrahigh specificity in a network of computationally designed protein-interaction pairs. Nat Commun 2018; 9:5286. [PMID: 30538236 PMCID: PMC6290019 DOI: 10.1038/s41467-018-07722-9] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2018] [Accepted: 11/21/2018] [Indexed: 01/21/2023] Open
Abstract
Protein networks in all organisms comprise homologous interacting pairs. In these networks, some proteins are specific, interacting with one or a few binding partners, whereas others are multispecific and bind a range of targets. We describe an algorithm that starts from an interacting pair and designs dozens of new pairs with diverse backbone conformations at the binding site as well as new binding orientations and sequences. Applied to a high-affinity bacterial pair, the algorithm results in 18 new ones, with cognate affinities from pico- to micromolar. Three pairs exhibit 3-5 orders of magnitude switch in specificity relative to the wild type, whereas others are multispecific, collectively forming a protein-interaction network. Crystallographic analysis confirms design accuracy, including in new backbones and polar interactions. Preorganized polar interaction networks are responsible for high specificity, thus defining design principles that can be applied to program synthetic cellular interaction networks of desired affinity and specificity. The molecular basis of ultrahigh specificity in protein-protein interactions remains obscure. The authors present a computational method to design atomically accurate new pairs exhibiting >100,000-fold specificity switches, generating a large and complex interaction network.
Collapse
Affiliation(s)
- Ravit Netzer
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Dina Listov
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Rosalie Lipsh
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Orly Dym
- Structural Proteomics Unit, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Shira Albeck
- Structural Proteomics Unit, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Orli Knop
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel
| | - Colin Kleanthous
- Department of Biochemistry, University of Oxford, South Parks Road, Oxford, OX1 3QU, UK
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001, Rehovot, Israel.
| |
Collapse
|
74
|
ElGamacy M, Coles M, Lupas A. Asymmetric protein design from conserved supersecondary structures. J Struct Biol 2018; 204:380-387. [DOI: 10.1016/j.jsb.2018.10.010] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2018] [Revised: 10/19/2018] [Accepted: 10/25/2018] [Indexed: 10/28/2022]
|
75
|
Simoncini D, Zhang KYJ, Schiex T, Barbe S. A structural homology approach for computational protein design with flexible backbone. Bioinformatics 2018; 35:2418-2426. [DOI: 10.1093/bioinformatics/bty975] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Revised: 11/01/2018] [Accepted: 11/28/2018] [Indexed: 01/09/2023] Open
Abstract
Abstract
Motivation
Structure-based Computational Protein design (CPD) plays a critical role in advancing the field of protein engineering. Using an all-atom energy function, CPD tries to identify amino acid sequences that fold into a target structure and ultimately perform a desired function. Energy functions remain however imperfect and injecting relevant information from known structures in the design process should lead to improved designs.
Results
We introduce Shades, a data-driven CPD method that exploits local structural environments in known protein structures together with energy to guide sequence design, while sampling side-chain and backbone conformations to accommodate mutations. Shades (Structural Homology Algorithm for protein DESign), is based on customized libraries of non-contiguous in-contact amino acid residue motifs. We have tested Shades on a public benchmark of 40 proteins selected from different protein families. When excluding homologous proteins, Shades achieved a protein sequence recovery of 30% and a protein sequence similarity of 46% on average, compared with the PFAM protein family of the target protein. When homologous structures were added, the wild-type sequence recovery rate achieved 93%.
Availability and implementation
Shades source code is available at https://bitbucket.org/satsumaimo/shades as a patch for Rosetta 3.8 with a curated protein structure database and ITEM library creation software.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- David Simoncini
- Laboratoire d'Ingénierie des Systèmes Biologiques et des Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, F Toulouse cedex 04, France
- Institut de recherche en informatique de Toulouse, IRIT, UMR 5505-CNRS, Université de Toulouse, Cedex 9, France
| | - Kam Y J Zhang
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Kanagawa, Japan
| | - Thomas Schiex
- Institut de recherche en informatique de Toulouse, UMR 5505-CNRS, Université de Toulouse, Cedex 9, France
| | - Sophie Barbe
- Laboratoire d'Ingénierie des Systèmes Biologiques et des Procédés, LISBP, Université de Toulouse, CNRS, INRA, INSA, F Toulouse cedex 04, France
| |
Collapse
|
76
|
Rosetta FunFolDes - A general framework for the computational design of functional proteins. PLoS Comput Biol 2018; 14:e1006623. [PMID: 30452434 PMCID: PMC6277116 DOI: 10.1371/journal.pcbi.1006623] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/23/2018] [Revised: 12/03/2018] [Accepted: 11/06/2018] [Indexed: 01/11/2023] Open
Abstract
The robust computational design of functional proteins has the potential to deeply impact translational research and broaden our understanding of the determinants of protein function and stability. The low success rates of computational design protocols and the extensive in vitro optimization often required, highlight the challenge of designing proteins that perform essential biochemical functions, such as binding or catalysis. One of the most simplistic approaches for the design of function is to adopt functional motifs in naturally occurring proteins and transplant them to computationally designed proteins. The structural complexity of the functional motif largely determines how readily one can find host protein structures that are "designable", meaning that are likely to present the functional motif in the desired conformation. One promising route to enhance the "designability" of protein structures is to allow backbone flexibility. Here, we present a computational approach that couples conformational folding with sequence design to embed functional motifs into heterologous proteins-Rosetta Functional Folding and Design (FunFolDes). We performed extensive computational benchmarks, where we observed that the enforcement of functional requirements resulted in designs distant from the global energetic minimum of the protein. An observation consistent with several experimental studies that have revealed function-stability tradeoffs. To test the design capabilities of FunFolDes we transplanted two viral epitopes into distant structural templates including one de novo "functionless" fold, which represent two typical challenges where the designability problem arises. The designed proteins were experimentally characterized showing high binding affinities to monoclonal antibodies, making them valuable candidates for vaccine design endeavors. Overall, we present an accessible strategy to repurpose old protein folds for new functions. This may lead to important improvements on the computational design of proteins, with structurally complex functional sites, that can perform elaborate biochemical functions related to binding and catalysis.
Collapse
|
77
|
Greener JG, Moffat L, Jones DT. Design of metalloproteins and novel protein folds using variational autoencoders. Sci Rep 2018; 8:16189. [PMID: 30385875 PMCID: PMC6212568 DOI: 10.1038/s41598-018-34533-1] [Citation(s) in RCA: 63] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2018] [Accepted: 10/19/2018] [Indexed: 12/26/2022] Open
Abstract
The design of novel proteins has many applications but remains an attritional process with success in isolated cases. Meanwhile, deep learning technologies have exploded in popularity in recent years and are increasingly applicable to biology due to the rise in available data. We attempt to link protein design and deep learning by using variational autoencoders to generate protein sequences conditioned on desired properties. Potential copper and calcium binding sites are added to non-metal binding proteins without human intervention and compared to a hidden Markov model. In another use case, a grammar of protein structures is developed and used to produce sequences for a novel protein topology. One candidate structure is found to be stable by molecular dynamics simulation. The ability of our model to confine the vast search space of protein sequences and to scale easily has the potential to assist in a variety of protein design tasks.
Collapse
Affiliation(s)
- Joe G Greener
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK
- Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
| | - Lewis Moffat
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK
- Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK
| | - David T Jones
- Department of Computer Science, University College London, Gower Street, London, WC1E 6BT, UK.
- Francis Crick Institute, 1 Midland Road, London, NW1 1AT, UK.
| |
Collapse
|
78
|
Castiglione GM, Chang BS. Functional trade-offs and environmental variation shaped ancient trajectories in the evolution of dim-light vision. eLife 2018; 7:35957. [PMID: 30362942 PMCID: PMC6203435 DOI: 10.7554/elife.35957] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2018] [Accepted: 09/09/2018] [Indexed: 12/11/2022] Open
Abstract
Trade-offs between protein stability and activity can restrict access to evolutionary trajectories, but widespread epistasis may facilitate indirect routes to adaptation. This may be enhanced by natural environmental variation, but in multicellular organisms this process is poorly understood. We investigated a paradoxical trajectory taken during the evolution of tetrapod dim-light vision, where in the rod visual pigment rhodopsin, E122 was fixed 350 million years ago, a residue associated with increased active-state (MII) stability but greatly diminished rod photosensitivity. Here, we demonstrate that high MII stability could have likely evolved without E122, but instead, selection appears to have entrenched E122 in tetrapods via epistatic interactions with nearby coevolving sites. In fishes by contrast, selection may have exploited these epistatic effects to explore alternative trajectories, but via indirect routes with low MII stability. Our results suggest that within tetrapods, E122 and high MII stability cannot be sacrificed-not even for improvements to rod photosensitivity.
Collapse
Affiliation(s)
- Gianni M Castiglione
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada
| | - Belinda Sw Chang
- Department of Cell and Systems Biology, University of Toronto, Toronto, Canada.,Department of Ecology and Evolutionary Biology, University of Toronto, Toronto, Canada.,Centre for the Analysis of Genome Evolution and Function, University of Toronto, Toronto, Canada
| |
Collapse
|
79
|
Lechner H, Ferruz N, Höcker B. Strategies for designing non-natural enzymes and binders. Curr Opin Chem Biol 2018; 47:67-76. [PMID: 30248579 DOI: 10.1016/j.cbpa.2018.07.022] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2018] [Revised: 07/16/2018] [Accepted: 07/17/2018] [Indexed: 12/20/2022]
Abstract
The design of tailor-made enzymes is a major goal in biochemical research that can result in wide-range applications and will lead to a better understanding of how proteins fold and function. In this review we highlight recent advances in enzyme and small molecule binder design. A focus is placed on novel strategies for the design of scaffolds, developments in computational methods, and recent applications of these techniques on receptors, sensors, and enzymes. Further, the integration of computational and experimental methodologies is discussed. The outlined examples of designed enzymes and binders for various purposes highlight the importance of this topic and underline the need for tailor-made proteins.
Collapse
Affiliation(s)
- Horst Lechner
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
80
|
ElGamacy M, Coles M, Ernst P, Zhu H, Hartmann MD, Plückthun A, Lupas AN. An Interface-Driven Design Strategy Yields a Novel, Corrugated Protein Architecture. ACS Synth Biol 2018; 7:2226-2235. [PMID: 30148951 DOI: 10.1021/acssynbio.8b00224] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Designing proteins with novel folds remains a major challenge, as the biophysical properties of the target fold are not known a priori and no sequence profile exists to describe its features. Therefore, most computational design efforts so far have been directed toward creating proteins that recapitulate existing folds. Here we present a strategy centered upon the design of novel intramolecular interfaces that enables the construction of a target fold from a set of starting fragments. This strategy effectively reduces the amount of computational sampling necessary to achieve an optimal sequence, without compromising the level of topological control. The solenoid architecture has been a target of extensive protein design efforts, as it provides a highly modular platform of low topological complexity. However, none of the previous efforts have attempted to depart from the natural form, which is characterized by a uniformly handed superhelical architecture. Here we aimed to design a more complex platform, abolishing the superhelicity by introducing internally alternating handedness, resulting in a novel, corrugated architecture. We employed our interface-driven strategy, designing three proteins and confirming the design by solving the structure of two examples.
Collapse
Affiliation(s)
- Mohammad ElGamacy
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Murray Coles
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Patrick Ernst
- Department of Biochemistry, University of Zurich, 8057 Zurich, Switzerland
| | - Hongbo Zhu
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Marcus D. Hartmann
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Andreas Plückthun
- Department of Biochemistry, University of Zurich, 8057 Zurich, Switzerland
| | - Andrei N. Lupas
- Department of Protein Evolution, Max-Planck-Institute for Developmental Biology, 72076 Tübingen, Germany
| |
Collapse
|
81
|
Marcos E, Silva D. Essentials of
de novo
protein design: Methods and applications. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE 2018. [DOI: 10.1002/wcms.1374] [Citation(s) in RCA: 25] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Affiliation(s)
- Enrique Marcos
- Institute for Research in Biomedicine (IRB Barcelona)The Barcelona Institute of Science and TechnologyBarcelonaSpain
| | - Daniel‐Adriano Silva
- Department of BiochemistryUniversity of WashingtonSeattleWashington
- Institute for Protein DesignUniversity of WashingtonSeattleWashington
| |
Collapse
|
82
|
Wood CW, Heal JW, Thomson AR, Bartlett GJ, Ibarra AÁ, Brady RL, Sessions RB, Woolfson DN. ISAMBARD: an open-source computational environment for biomolecular analysis, modelling and design. Bioinformatics 2018; 33:3043-3050. [PMID: 28582565 PMCID: PMC5870769 DOI: 10.1093/bioinformatics/btx352] [Citation(s) in RCA: 36] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2017] [Accepted: 05/31/2017] [Indexed: 12/03/2022] Open
Abstract
Motivation The rational design of biomolecules is becoming a reality. However, further computational tools are needed to facilitate and accelerate this, and to make it accessible to more users. Results Here we introduce ISAMBARD, a tool for structural analysis, model building and rational design of biomolecules. ISAMBARD is open-source, modular, computationally scalable and intuitive to use. These features allow non-experts to explore biomolecular design in silico. ISAMBARD addresses a standing issue in protein design, namely, how to introduce backbone variability in a controlled manner. This is achieved through the generalization of tools for parametric modelling, describing the overall shape of proteins geometrically, and without input from experimentally determined structures. This will allow backbone conformations for entire folds and assemblies not observed in nature to be generated de novo, that is, to access the ‘dark matter of protein-fold space’. We anticipate that ISAMBARD will find broad applications in biomolecular design, biotechnology and synthetic biology. Availability and implementation A current stable build can be downloaded from the python package index (https://pypi.python.org/pypi/isambard/) with development builds available on GitHub (https://github.com/woolfson-group/) along with documentation, tutorial material and all the scripts used to generate the data described in this paper. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Christopher W Wood
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK
| | - Jack W Heal
- School of Chemistry, University of Bristol, Bristol BS8?1TS, UK
| | - Andrew R Thomson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Chemistry, University of Glasgow, Glasgow G12 8QQ, UK
| | - Gail J Bartlett
- School of Chemistry, University of Bristol, Bristol BS8?1TS, UK
| | - Amaurys Á Ibarra
- School of Biochemistry, University of Bristol, Bristol BS8?1TD, UK
| | - R Leo Brady
- School of Biochemistry, University of Bristol, Bristol BS8?1TD, UK
| | - Richard B Sessions
- School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK.,BrisSynBio, University of Bristol, Bristol BS8 1TQ, UK
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Bristol BS8 1TS, UK.,School of Biochemistry, University of Bristol, Bristol BS8 1TD, UK.,BrisSynBio, University of Bristol, Bristol BS8 1TQ, UK
| |
Collapse
|
83
|
Adolf-Bryfogle J, Kalyuzhniy O, Kubitz M, Weitzner BD, Hu X, Adachi Y, Schief WR, Dunbrack RL. RosettaAntibodyDesign (RAbD): A general framework for computational antibody design. PLoS Comput Biol 2018; 14:e1006112. [PMID: 29702641 PMCID: PMC5942852 DOI: 10.1371/journal.pcbi.1006112] [Citation(s) in RCA: 118] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2017] [Revised: 05/09/2018] [Accepted: 04/02/2018] [Indexed: 01/12/2023] Open
Abstract
A structural-bioinformatics-based computational methodology and framework have been developed for the design of antibodies to targets of interest. RosettaAntibodyDesign (RAbD) samples the diverse sequence, structure, and binding space of an antibody to an antigen in highly customizable protocols for the design of antibodies in a broad range of applications. The program samples antibody sequences and structures by grafting structures from a widely accepted set of the canonical clusters of CDRs (North et al., J. Mol. Biol., 406:228-256, 2011). It then performs sequence design according to amino acid sequence profiles of each cluster, and samples CDR backbones using a flexible-backbone design protocol incorporating cluster-based CDR constraints. Starting from an existing experimental or computationally modeled antigen-antibody structure, RAbD can be used to redesign a single CDR or multiple CDRs with loops of different length, conformation, and sequence. We rigorously benchmarked RAbD on a set of 60 diverse antibody-antigen complexes, using two design strategies-optimizing total Rosetta energy and optimizing interface energy alone. We utilized two novel metrics for measuring success in computational protein design. The design risk ratio (DRR) is equal to the frequency of recovery of native CDR lengths and clusters divided by the frequency of sampling of those features during the Monte Carlo design procedure. Ratios greater than 1.0 indicate that the design process is picking out the native more frequently than expected from their sampled rate. We achieved DRRs for the non-H3 CDRs of between 2.4 and 4.0. The antigen risk ratio (ARR) is the ratio of frequencies of the native amino acid types, CDR lengths, and clusters in the output decoys for simulations performed in the presence and absence of the antigen. For CDRs, we achieved cluster ARRs as high as 2.5 for L1 and 1.5 for H2. For sequence design simulations without CDR grafting, the overall recovery for the native amino acid types for residues that contact the antigen in the native structures was 72% in simulations performed in the presence of the antigen and 48% in simulations performed without the antigen, for an ARR of 1.5. For the non-contacting residues, the ARR was 1.08. This shows that the sequence profiles are able to maintain the amino acid types of these conserved, buried sites, while recovery of the exposed, contacting residues requires the presence of the antigen-antibody interface. We tested RAbD experimentally on both a lambda and kappa antibody-antigen complex, successfully improving their affinities 10 to 50 fold by replacing individual CDRs of the native antibody with new CDR lengths and clusters.
Collapse
Affiliation(s)
- Jared Adolf-Bryfogle
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, United States of America
- Program in Molecular and Cell Biology and Genetics, Drexel University College of Medicine, Philadelphia, PA, United States of America
- The Scripps Research Institute, La Jolla, CA, United States of America
| | - Oleks Kalyuzhniy
- The Scripps Research Institute, La Jolla, CA, United States of America
- IAVI Neutralizing Antibody Center at TSRI, La Jolla, CA, United States of America
| | - Michael Kubitz
- The Scripps Research Institute, La Jolla, CA, United States of America
| | - Brian D. Weitzner
- Department of Biochemistry, University of Washington, Seattle, WA, United States of America
- Institute for Protein Design, University of Washington, Seattle, WA, United States of America
| | - Xiaozhen Hu
- The Scripps Research Institute, La Jolla, CA, United States of America
| | - Yumiko Adachi
- IAVI Neutralizing Antibody Center at TSRI, La Jolla, CA, United States of America
| | - William R. Schief
- The Scripps Research Institute, La Jolla, CA, United States of America
- IAVI Neutralizing Antibody Center at TSRI, La Jolla, CA, United States of America
| | - Roland L. Dunbrack
- Institute for Cancer Research, Fox Chase Cancer Center, Philadelphia, PA, United States of America
- * E-mail:
| |
Collapse
|
84
|
Guffy SL, Teets FD, Langlois MI, Kuhlman B. Protocols for Requirement-Driven Protein Design in the Rosetta Modeling Program. J Chem Inf Model 2018; 58:895-901. [PMID: 29659276 DOI: 10.1021/acs.jcim.8b00060] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
We have developed a set of protocols in the molecular modeling program Rosetta for performing requirement-driven protein design. First, the user specifies a set of structural features that need to be present in the designed protein. These requirements can be general (e.g., "create a protein with five helices"), or they can be very specific and require the correct placement of a set of amino acids to bind a ligand. Next, a large set of protein models are generated that satisfy the design requirements. The models are built using a method that we recently introduced into Rosetta, called SEWING, that rapidly assembles novel protein backbones by combining pieces of naturally occurring proteins. In the last step of the process, rotamer-based sequence optimization and backbone refinement are performed with Rosetta, and a variety of quality metrics are used to pick sequences for experimental characterization. Here we describe the input files and user options needed to run SEWING and perform requirement-driven design and provide detailed instructions for two specific applications of the process: the design of new structural elements at a protein-protein interface and the design of ligand binding sites.
Collapse
Affiliation(s)
- Sharon L Guffy
- Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , 120 Mason Farm Road , Chapel Hill , North Carolina 27599 , United States
| | - Frank D Teets
- Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , 120 Mason Farm Road , Chapel Hill , North Carolina 27599 , United States
| | - Minnie I Langlois
- Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , 120 Mason Farm Road , Chapel Hill , North Carolina 27599 , United States
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics , University of North Carolina at Chapel Hill , 120 Mason Farm Road , Chapel Hill , North Carolina 27599 , United States.,Lineberger Comprensive Cancer Center , University of North Carolina at Chapel Hill , Chapel Hill , North Carolina 27599 , United States
| |
Collapse
|
85
|
Abstract
Nuclease dead Cas9 (dCas9) has been widely used for modulating gene expression by fusing with different activation or repression domains. However, delivery of the CRISPR/Cas system fused with various effector domains in a single adeno-associated virus (AAV) remains challenging due to the payload limit. Here, we engineered a set of downsized variants of Cas9 including Staphylococcus aureus Cas9 (SaCas9) that retained DNA binding activity by deleting conserved functional domains. We demonstrated that fusing FokI nuclease domain to the N-terminal of the minimal SaCas9 (mini-SaCas9) or to the middle of the split mini-SaCas9 can trigger efficient DNA cleavage. In addition, we constructed a set of compact transactivation domains based on the tripartite VPR activation domain and self-assembled arrays of split SpyTag:SpyCatch peptides, which are suitable for fusing to the mini-SaCas9. Lastly, we produced a single AAV containing the mini-SaCas9 fused with a downsized transactivation domain along with an optimized gRNA expression cassette, which showed efficient transactivation activity. Our results highlighted a practical approach to generate down-sized CRISPR/Cas9 and gene activation systems for in vivo applications.
Collapse
Affiliation(s)
- Dacheng Ma
- MOE Key Laboratory of Bioinformatics and Bioinformatics Division, Center for Synthetic and System Biology, Department of Automation, Tsinghua National Lab for Information Science and Technology , Tsinghua University , Beijing 100084 , China
| | - Shuguang Peng
- MOE Key Laboratory of Bioinformatics and Bioinformatics Division, Center for Synthetic and System Biology, Department of Automation, Tsinghua National Lab for Information Science and Technology , Tsinghua University , Beijing 100084 , China
| | - Weiren Huang
- State Engineering Laboratory of Medical Key Technologies Application of Synthetic Biology , Shenzhen Second People's Hospital, the First Affiliated Hospital of Shenzhen University , Shenzhen , China
| | - Zhiming Cai
- State Engineering Laboratory of Medical Key Technologies Application of Synthetic Biology , Shenzhen Second People's Hospital, the First Affiliated Hospital of Shenzhen University , Shenzhen , China
| | - Zhen Xie
- MOE Key Laboratory of Bioinformatics and Bioinformatics Division, Center for Synthetic and System Biology, Department of Automation, Tsinghua National Lab for Information Science and Technology , Tsinghua University , Beijing 100084 , China
| |
Collapse
|
86
|
Hallen MA, Donald BR. CATS (Coordinates of Atoms by Taylor Series): protein design with backbone flexibility in all locally feasible directions. Bioinformatics 2018; 33:i5-i12. [PMID: 28882005 PMCID: PMC5870559 DOI: 10.1093/bioinformatics/btx277] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Motivation When proteins mutate or bind to ligands, their backbones often move significantly, especially in loop regions. Computational protein design algorithms must model these motions in order to accurately optimize protein stability and binding affinity. However, methods for backbone conformational search in design have been much more limited than for sidechain conformational search. This is especially true for combinatorial protein design algorithms, which aim to search a large sequence space efficiently and thus cannot rely on temporal simulation of each candidate sequence. Results We alleviate this difficulty with a new parameterization of backbone conformational space, which represents all degrees of freedom of a specified segment of protein chain that maintain valid bonding geometry (by maintaining the original bond lengths and angles and ω dihedrals). In order to search this space, we present an efficient algorithm, CATS, for computing atomic coordinates as a function of our new continuous backbone internal coordinates. CATS generalizes the iMinDEE and EPIC protein design algorithms, which model continuous flexibility in sidechain dihedrals, to model continuous, appropriately localized flexibility in the backbone dihedrals ϕ and ψ as well. We show using 81 test cases based on 29 different protein structures that CATS finds sequences and conformations that are significantly lower in energy than methods with less or no backbone flexibility do. In particular, we show that CATS can model the viability of an antibody mutation known experimentally to increase affinity, but that appears sterically infeasible when modeled with less or no backbone flexibility. Availability and implementation Our code is available as free software at https://github.com/donaldlab/OSPREY_refactor. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Mark A Hallen
- Department of Computer Science, Duke University, Durham, NC, USA.,Toyota Technological Institute at Chicago, Chicago, IL, USA
| | - Bruce R Donald
- Department of Computer Science, Duke University, Durham, NC, USA.,Department of Chemistry, Duke University, Durham, NC, USA.,Department of Biochemistry, Duke University Medical Center, Durham, NC, USA
| |
Collapse
|
87
|
Yagi S, Akanuma S, Yamagishi A. Creation of artificial protein-protein interactions using α-helices as interfaces. Biophys Rev 2018; 10:411-420. [PMID: 29214605 PMCID: PMC5899712 DOI: 10.1007/s12551-017-0352-9] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2017] [Accepted: 11/15/2017] [Indexed: 12/31/2022] Open
Abstract
Designing novel protein-protein interactions (PPIs) with high affinity is a challenging task. Directed evolution, a combination of randomization of the gene for the protein of interest and selection using a display technique, is one of the most powerful tools for producing a protein binder. However, the selected proteins often bind to the target protein at an undesired surface. More problematically, some selected proteins bind to their targets even though they are unfolded. Current state-of-the-art computational design methods have successfully created novel protein binders. These computational methods have optimized the non-covalent interactions at interfaces and thus produced artificial protein complexes. However, to date there are only a limited number of successful examples of computationally designed de novo PPIs. De novo design of coiled-coil proteins has been extensively performed and, therefore, a large amount of knowledge of the sequence-structure relationship of coiled-coil proteins has been accumulated. Taking advantage of this knowledge, de novo design of inter-helical interactions has been used to produce artificial PPIs. Here, we review recent progress in the in silico design and rational design of de novo PPIs and the use of α-helices as interfaces.
Collapse
Affiliation(s)
- Sota Yagi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, 192-0392, Japan
| | - Satoshi Akanuma
- Faculty of Human Sciences, Waseda University, 2-579-15 Mikajima, Tokorozawa, Saitama, 359-1192, Japan
| | - Akihiko Yamagishi
- Department of Applied Life Sciences, Tokyo University of Pharmacy and Life Sciences, 1432-1 Horinouchi, Hachioji, Tokyo, 192-0392, Japan.
| |
Collapse
|
88
|
Chu H, Liu H. TetraBASE: A Side Chain-Independent Statistical Energy for Designing Realistically Packed Protein Backbones. J Chem Inf Model 2018; 58:430-442. [PMID: 29314837 DOI: 10.1021/acs.jcim.7b00677] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
To construct backbone structures of high designability is a primary aspect of computational protein design. We report here a side chain-independent statistical energy that aims at realistic modeling of through-space packing of polypeptide backbones. To mitigate the lack of explicit amino acid side chains, the model treats the interbackbone site packing as being dependent on peptide local conformation. In addition, new variables suitable for statistical analysis, one for relative orientation and another for distance, have been introduced to represent the intersite geometry based on the asymmetrical tetrahedron organization of distinct chemical groups surrounding the Cα-carbon atoms. The resulting tetrahedron-based backbone statistical energy (tetraBASE) model has been used to optimize the tertiary organizations of secondary structure elements (SSEs) of designated types with Monte Caro simulated annealing, starting from artificial initial configurations. The tetraBASE minimum energy structures can reproduce SSE packing frequently observed in native proteins with atomic root-mean-square deviations of 1-2 Å. The model has also been tested by examining the stability of native SSE arrangements under tetraBASE. The results suggest that tetraBASE model can be used to effectively represent interbackbone packing when designing backbone structures without explicitly knowing side chain types.
Collapse
Affiliation(s)
- Huanyu Chu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China
| | - Haiyan Liu
- School of Life Sciences, University of Science and Technology of China , 230027 Hefei, Anhui China.,Hefei National Laboratory for Physical Sciences at the Microscales , 230027 Hefei, Anhui China.,Collaborative Innovation Center of Chemistry for Life Sciences , 230027 Hefei, Anhui China
| |
Collapse
|
89
|
Elfin: An algorithm for the computational design of custom three-dimensional structures from modular repeat protein building blocks. J Struct Biol 2018; 201:100-107. [DOI: 10.1016/j.jsb.2017.09.001] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2017] [Revised: 08/11/2017] [Accepted: 09/02/2017] [Indexed: 11/17/2022]
|
90
|
Alva V, Lupas AN. From ancestral peptides to designed proteins. Curr Opin Struct Biol 2017; 48:103-109. [PMID: 29195087 DOI: 10.1016/j.sbi.2017.11.006] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2017] [Accepted: 11/20/2017] [Indexed: 11/16/2022]
Abstract
The diversity of modern proteins arose through the combinatorial shuffling and differentiation of a limited number of autonomously folding domain prototypes, but the origin of these prototypes themselves has long remained poorly understood. In recent years, the proposal that they originated by repetition, accretion, and recombination from an ancestral set of peptides, which evolved as cofactors of RNA-based replication and catalysis, has gained wide acceptance, supported by the systematic identification of such ancestral peptides and the experimental recapitulation of the mechanisms by which they could have yielded the first folded proteins. Inspired by this evolutionary process, protein engineers have seized on design from pre-optimized peptide components as a powerful approach to generating proteins with novel topology and functionality.
Collapse
Affiliation(s)
- Vikram Alva
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany
| | - Andrei N Lupas
- Department of Protein Evolution, Max Planck Institute for Developmental Biology, 72076 Tübingen, Germany.
| |
Collapse
|
91
|
Garcia-Borràs M, Houk KN, Jiménez-Osés G. Computational Design of Protein Function. COMPUTATIONAL TOOLS FOR CHEMICAL BIOLOGY 2017. [DOI: 10.1039/9781788010139-00087] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
Abstract
The computational design of enzymes is a tremendous challenge for both chemistry and biochemistry. The ability to design stable and functional biocatalysts that could operate under different conditions to perform chemical reactions without precedent in nature, allowing the large-scale production of chemicals à la carte, would revolutionise both synthetic, pharmacologic and materials chemistry. Despite the great advances achieved, this highly multidisciplinary area of research is still in its infancy. This chapter describes the ‘inside-out’ protocol for computational enzyme design and both the achievements and limitations of the current technology are highlighted. Furthermore, molecular dynamics simulations have proved to be invaluable in the enzyme design process, constituting an important tool for discovering elusive catalytically relevant conformations of the engineered or designed enzyme. As a complement to the ‘inside-out’ design protocol, different examples where hybrid QM/MM approaches have been directly applied to discover beneficial mutations in rational computational enzyme design are described.
Collapse
Affiliation(s)
- Marc Garcia-Borràs
- Department of Chemistry and Biochemistry, University of California Los Angeles California CA 90095-1569 USA
| | - Kendall N. Houk
- Department of Chemistry and Biochemistry, University of California Los Angeles California CA 90095-1569 USA
| | - Gonzalo Jiménez-Osés
- Departamento de Química, Centro de Investigación en Síntesis Química Universidad de La Rioja 26006 Logroño La Rioja Spain
| |
Collapse
|
92
|
Bianco V, Pagès-Gelabert N, Coluzza I, Franzese G. How the stability of a folded protein depends on interfacial water properties and residue-residue interactions. J Mol Liq 2017. [DOI: 10.1016/j.molliq.2017.08.026] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
|
93
|
Dang B, Wu H, Mulligan VK, Mravic M, Wu Y, Lemmin T, Ford A, Silva DA, Baker D, DeGrado WF. De novo design of covalently constrained mesosize protein scaffolds with unique tertiary structures. Proc Natl Acad Sci U S A 2017; 114:10852-10857. [PMID: 28973862 PMCID: PMC5642715 DOI: 10.1073/pnas.1710695114] [Citation(s) in RCA: 42] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022] Open
Abstract
The folding of natural proteins typically relies on hydrophobic packing, metal binding, or disulfide bond formation in the protein core. Alternatively, a 3D structure can be defined by incorporating a multivalent cross-linking agent, and this approach has been successfully developed for the selection of bicyclic peptides from large random-sequence libraries. By contrast, there is no general method for the de novo computational design of multicross-linked proteins with predictable and well-defined folds, including ones not found in nature. Here we use Rosetta and Tertiary Motifs (TERMs) to design small proteins that fold around multivalent cross-linkers. The hydrophobic cross-linkers stabilize the fold by macrocyclic restraints, and they also form an integral part of a small apolar core. The designed CovCore proteins were prepared by chemical synthesis, and their structures were determined by solution NMR or X-ray crystallography. These mesosized proteins, lying between conventional proteins and small peptides, are easily accessible either through biosynthetic precursors or chemical synthesis. The unique tertiary structures and ease of synthesis of CovCore proteins indicate that they should provide versatile templates for developing inhibitors of protein-protein interactions.
Collapse
Affiliation(s)
- Bobo Dang
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| | - Haifan Wu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| | | | - Marco Mravic
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| | - Yibing Wu
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| | - Thomas Lemmin
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158
| | - Alexander Ford
- Department of Biochemistry, University of Washington, Seattle, WA 98195
| | | | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA 98195
| | - William F DeGrado
- Department of Pharmaceutical Chemistry, University of California, San Francisco, CA 94158;
| |
Collapse
|
94
|
Abstract
Natural proteins must both fold into a stable conformation and exert their molecular function. To date, computational design has successfully produced stable and atomically accurate proteins by using so-called "ideal" folds rich in regular secondary structures and almost devoid of loops and destabilizing elements, such as cavities. Molecular function, such as binding and catalysis, however, often demands nonideal features, including large and irregular loops and buried polar interaction networks, which have remained challenging for fold design. Through five design/experiment cycles, we learned principles for designing stable and functional antibody variable fragments (Fvs). Specifically, we (i) used sequence-design constraints derived from antibody multiple-sequence alignments, and (ii) during backbone design, maintained stabilizing interactions observed in natural antibodies between the framework and loops of complementarity-determining regions (CDRs) 1 and 2. Designed Fvs bound their ligands with midnanomolar affinities and were as stable as natural antibodies, despite having >30 mutations from mammalian antibody germlines. Furthermore, crystallographic analysis demonstrated atomic accuracy throughout the framework and in four of six CDRs in one design and atomic accuracy in the entire Fv in another. The principles we learned are general, and can be implemented to design other nonideal folds, generating stable, specific, and precise antibodies and enzymes.
Collapse
|
95
|
Wood CW, Woolfson DN. CCBuilder 2.0: Powerful and accessible coiled-coil modeling. Protein Sci 2017; 27:103-111. [PMID: 28836317 PMCID: PMC5734305 DOI: 10.1002/pro.3279] [Citation(s) in RCA: 95] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2017] [Accepted: 08/22/2017] [Indexed: 01/06/2023]
Abstract
The increased availability of user-friendly and accessible computational tools for biomolecular modeling would expand the reach and application of biomolecular engineering and design. For protein modeling, one key challenge is to reduce the complexities of 3D protein folds to sets of parametric equations that nonetheless capture the salient features of these structures accurately. At present, this is possible for a subset of proteins, namely, repeat proteins. The α-helical coiled coil provides one such example, which represents ≈ 3-5% of all known protein-encoding regions of DNA. Coiled coils are bundles of α helices that can be described by a small set of structural parameters. Here we describe how this parametric description can be implemented in an easy-to-use web application, called CCBuilder 2.0, for modeling and optimizing both α-helical coiled coils and polyproline-based collagen triple helices. This has many applications from providing models to aid molecular replacement for X-ray crystallography, in silico model building and engineering of natural and designed protein assemblies, and through to the creation of completely de novo "dark matter" protein structures. CCBuilder 2.0 is available as a web-based application, the code for which is open-source and can be downloaded freely. http://coiledcoils.chm.bris.ac.uk/ccbuilder2. LAY SUMMARY We have created CCBuilder 2.0, an easy to use web-based application that can model structures for a whole class of proteins, the α-helical coiled coil, which is estimated to account for 3-5% of all proteins in nature. CCBuilder 2.0 will be of use to a large number of protein scientists engaged in fundamental studies, such as protein structure determination, through to more-applied research including designing and engineering novel proteins that have potential applications in biotechnology.
Collapse
Affiliation(s)
- Christopher W Wood
- School of Chemistry, University of Bristol, Cantock's Close, Bristol, BS8 1TS, United Kingdom
| | - Derek N Woolfson
- School of Chemistry, University of Bristol, Cantock's Close, Bristol, BS8 1TS, United Kingdom.,School of Biochemistry, University of Bristol, Medical Sciences Building, University Walk, Bristol, BS8 1TD, United Kingdom.,BrisSynBio, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol, BS8 1TQ, United Kingdom
| |
Collapse
|
96
|
Designing repeat proteins: a modular approach to protein design. Curr Opin Struct Biol 2017; 45:116-123. [DOI: 10.1016/j.sbi.2017.02.001] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2017] [Revised: 02/06/2017] [Accepted: 02/16/2017] [Indexed: 01/01/2023]
|
97
|
Salt-bridge networks within globular and disordered proteins: characterizing trends for designable interactions. J Mol Model 2017. [PMID: 28626846 DOI: 10.1007/s00894-017-3376-y] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/19/2022]
Abstract
There has been considerable debate about the contribution of salt bridges to the stabilization of protein folds, in spite of their participation in crucial protein functions. Salt bridges appear to contribute to the activity-stability trade-off within proteins by bringing high-entropy charged amino acids into close contacts during the course of their functions. The current study analyzes the modes of association of salt bridges (in terms of networks) within globular proteins and at protein-protein interfaces. While the most common and trivial type of salt bridge is the isolated salt bridge, bifurcated salt bridge appears to be a distinct salt-bridge motif having a special topology and geometry. Bifurcated salt bridges are found ubiquitously in proteins and interprotein complexes. Interesting and attractive examples presenting different modes of interaction are highlighted. Bifurcated salt bridges appear to function as molecular clips that are used to stitch together large surface contours at interacting protein interfaces. The present work also emphasizes the key role of salt-bridge-mediated interactions in the partial folding of proteins containing long stretches of disordered regions. Salt-bridge-mediated interactions seem to be pivotal to the promotion of "disorder-to-order" transitions in small disordered protein fragments and their stabilization upon binding. The results obtained in this work should help to guide efforts to elucidate the modus operandi of these partially disordered proteins, and to conceptualize how these proteins manage to maintain the required amount of disorder even in their bound forms. This work could also potentially facilitate explorations of geometrically specific designable salt bridges through the characterization of composite salt-bridge networks. Graphical abstract ᅟ.
Collapse
|
98
|
Mackenzie CO, Grigoryan G. Protein structural motifs in prediction and design. Curr Opin Struct Biol 2017; 44:161-167. [PMID: 28460216 PMCID: PMC5513761 DOI: 10.1016/j.sbi.2017.03.012] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Revised: 03/18/2017] [Accepted: 03/28/2017] [Indexed: 01/11/2023]
Abstract
The Protein Data Bank (PDB) has been an integral resource for shaping our fundamental understanding of protein structure and for the advancement of such applications as protein design and structure prediction. Over the years, information from the PDB has been used to generate models ranging from specific structural mechanisms to general statistical potentials. With accumulating structural data, it has become possible to mine for more complete and complex structural observations, deducing more accurate generalizations. Motif libraries, which capture recurring structural features along with their sequence preferences, have exposed modularity in the structural universe and found successful application in various problems of structural biology. Here we summarize recent achievements in this arena, focusing on subdomain level structural patterns and their applications to protein design and structure prediction, and suggest promising future directions as the structural database continues to grow.
Collapse
Affiliation(s)
- Craig O Mackenzie
- Institute for Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH 03755, United States
| | - Gevorg Grigoryan
- Institute for Quantitative Biomedical Sciences, Dartmouth College, Hanover, NH 03755, United States; Department of Computer Science, Dartmouth College, Hanover, NH 03755, United States.
| |
Collapse
|
99
|
D'Souza A, Wu X, Yeow EKL, Bhattacharjya S. Designed Heme-Cage β-Sheet Miniproteins. Angew Chem Int Ed Engl 2017. [DOI: 10.1002/ange.201702472] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Areetha D'Souza
- School of Biological Sciences; Nanyang Technological University; Singapore 637551 Singapore
| | - Xiangyang Wu
- School of Physical and Mathematical Sciences; Nanyang Technological University; Singapore 637371 Singapore
| | - Edwin Kok Lee Yeow
- School of Physical and Mathematical Sciences; Nanyang Technological University; Singapore 637371 Singapore
| | - Surajit Bhattacharjya
- School of Biological Sciences; Nanyang Technological University; Singapore 637551 Singapore
| |
Collapse
|
100
|
D'Souza A, Wu X, Yeow EKL, Bhattacharjya S. Designed Heme-Cage β-Sheet Miniproteins. Angew Chem Int Ed Engl 2017; 56:5904-5908. [PMID: 28440962 DOI: 10.1002/anie.201702472] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2017] [Indexed: 01/21/2023]
Abstract
The structure and function of naturally occurring proteins are governed by a large number of amino acids (≥100). The design of miniature proteins with desired structures and functions not only substantiates our knowledge about proteins but can also contribute to the development of novel applications. Excellent progress has been made towards the design of helical proteins with diverse functions. However, the development of functional β-sheet proteins remains challenging. Herein, we describe the construction and characterization of four-stranded β-sheet miniproteins made up of about 19 amino acids that bind heme inside a hydrophobic binding pocket or "heme cage" by bis-histidine coordination in an aqueous environment. The designed miniproteins bound to heme with high affinity comparable to that of native heme proteins. Atomic-resolution structures confirmed the presence of a four-stranded β-sheet fold. The heme-protein complexes also exhibited high stability against thermal and chaotrope-induced unfolding.
Collapse
Affiliation(s)
- Areetha D'Souza
- School of Biological Sciences, Nanyang Technological University, Singapore, 637551, Singapore
| | - Xiangyang Wu
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, 637371, Singapore
| | - Edwin Kok Lee Yeow
- School of Physical and Mathematical Sciences, Nanyang Technological University, Singapore, 637371, Singapore
| | - Surajit Bhattacharjya
- School of Biological Sciences, Nanyang Technological University, Singapore, 637551, Singapore
| |
Collapse
|