1
|
Chen KYM, Lai JK, Rudden LSP, Wang J, Russell AM, Conners K, Rutter ME, Condon B, Tung F, Kodandapani L, Chau B, Zhao X, Benach J, Baker K, Hembre EJ, Barth P. Computational design of highly signalling-active membrane receptors through solvent-mediated allosteric networks. Nat Chem 2025; 17:429-438. [PMID: 39849110 PMCID: PMC11882447 DOI: 10.1038/s41557-024-01719-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2021] [Accepted: 12/11/2024] [Indexed: 01/25/2025]
Abstract
Protein catalysis and allostery require the atomic-level orchestration and motion of residues and ligand, solvent and protein effector molecules. However, the ability to design protein activity through precise protein-solvent cooperative interactions has not yet been demonstrated. Here we report the design of 14 membrane receptors that catalyse G protein nucleotide exchange through diverse engineered allosteric pathways mediated by cooperative networks of intraprotein, protein-ligand and -solvent molecule interactions. Consistent with predictions, the designed protein activities correlated well with the level of plasticity of the networks at flexible transmembrane helical interfaces. Several designs displayed considerably enhanced thermostability and activity compared with related natural receptors. The most stable and active variant crystallized in an unforeseen signalling-active conformation, in excellent agreement with the design models. The allosteric network topologies of the best designs bear limited similarity to those of natural receptors and reveal an allosteric interaction space larger than previously inferred from natural proteins. The approach should prove useful for engineering proteins with novel complex protein binding, catalytic and signalling activities.
Collapse
Affiliation(s)
- K-Y M Chen
- Institute of Bioengineering, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
- Department of Cell Biology and Human Anatomy, University of California at Davis, Davis, CA, USA
| | - J K Lai
- Department of Pharmacology, Baylor College of Medicine, Houston, TX, USA
- Tessella, Houston, TX, USA
| | - L S P Rudden
- Institute of Bioengineering, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland
| | - J Wang
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - A M Russell
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - K Conners
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - M E Rutter
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - B Condon
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - F Tung
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - L Kodandapani
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - B Chau
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - X Zhao
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - J Benach
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
| | - K Baker
- Lilly Biotechnology Center San Diego, San Diego, CA, USA
- AbbVie, North Chicago, IL, USA
| | - E J Hembre
- Lilly Research Laboratories, Lilly Corporate Center, Indianapolis, IN, USA
| | - P Barth
- Institute of Bioengineering, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.
| |
Collapse
|
2
|
Harteveld Z, Van Hall-Beauvais A, Morozova I, Southern J, Goverde C, Georgeon S, Rosset S, Defferrard M, Loukas A, Vandergheynst P, Bronstein MM, Correia BE. Exploring "dark-matter" protein folds using deep learning. Cell Syst 2024; 15:898-910.e5. [PMID: 39383860 DOI: 10.1016/j.cels.2024.09.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Revised: 06/13/2024] [Accepted: 09/16/2024] [Indexed: 10/11/2024]
Abstract
De novo protein design explores uncharted sequence and structure space to generate novel proteins not sampled by evolution. A main challenge in de novo design involves crafting "designable" structural templates to guide the sequence searches toward adopting target structures. We present a convolutional variational autoencoder that learns patterns of protein structure, dubbed Genesis. We coupled Genesis with trRosetta to design sequences for a set of protein folds and found that Genesis is capable of reconstructing native-like distance and angle distributions for five native folds and three novel, the so-called "dark-matter" folds as a demonstration of generalizability. We used a high-throughput assay to characterize the stability of the designs through protease resistance, obtaining encouraging success rates for folded proteins. Genesis enables exploration of the protein fold space within minutes, unrestricted by protein topologies. Our approach addresses the backbone designability problem, showing that small neural networks can efficiently learn structural patterns in proteins. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Zander Harteveld
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Alexandra Van Hall-Beauvais
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Irina Morozova
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | | | - Casper Goverde
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | | | - Stéphane Rosset
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | | | - Andreas Loukas
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland; Prescient Design, gRED, Roche, Basel, Switzerland
| | | | | | - Bruno E Correia
- École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland; Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland.
| |
Collapse
|
3
|
Kutlu Y, Axel G, Kolodny R, Ben-Tal N, Haliloglu T. Reused Protein Segments Linked to Functional Dynamics. Mol Biol Evol 2024; 41:msae184. [PMID: 39226145 PMCID: PMC11412252 DOI: 10.1093/molbev/msae184] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Revised: 08/10/2024] [Accepted: 08/26/2024] [Indexed: 09/05/2024] Open
Abstract
Protein space is characterized by extensive recurrence, or "reuse," of parts, suggesting that new proteins and domains can evolve by mixing-and-matching of existing segments. From an evolutionary perspective, for a given combination to persist, the protein segments should presumably not only match geometrically but also dynamically communicate with each other to allow concerted motions that are key to function. Evidence from protein space supports the premise that domains indeed combine in this manner; we explore whether a similar phenomenon can be observed at the sub-domain level. To this end, we use Gaussian Network Models (GNMs) to calculate the so-called soft modes, or low-frequency modes of motion for a dataset of 150 protein domains. Modes of motion can be used to decompose a domain into segments of consecutive amino acids that we call "dynamic elements", each of which belongs to one of two parts that move in opposite senses. We find that, in many cases, the dynamic elements, detected based on GNM analysis, correspond to established "themes": Sub-domain-level segments that have been shown to recur in protein space, and which were detected in previous research using sequence similarity alone (i.e. completely independently of the GNM analysis). This statistically significant correlation hints at the importance of dynamics in evolution. Overall, the results are consistent with an evolutionary scenario where proteins have emerged from themes that need to match each other both geometrically and dynamically, e.g. to facilitate allosteric regulation.
Collapse
Affiliation(s)
- Yiğit Kutlu
- Department of Chemical Engineering and Polymer Research Center, Bogazici University, Istanbul, Turkey
| | - Gabriel Axel
- School of Neurobiology, Biochemistry & Biophysics, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Rachel Kolodny
- Department of Computer Science, University of Haifa, Haifa, Israel
| | - Nir Ben-Tal
- School of Neurobiology, Biochemistry & Biophysics, George S. Wise Faculty of Life Sciences, Tel Aviv University, Tel Aviv, Israel
| | - Turkan Haliloglu
- Department of Chemical Engineering and Polymer Research Center, Bogazici University, Istanbul, Turkey
| |
Collapse
|
4
|
Lipsh-Sokolik R, Fleishman SJ. Addressing epistasis in the design of protein function. Proc Natl Acad Sci U S A 2024; 121:e2314999121. [PMID: 39133844 PMCID: PMC11348311 DOI: 10.1073/pnas.2314999121] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/29/2024] Open
Abstract
Mutations in protein active sites can dramatically improve function. The active site, however, is densely packed and extremely sensitive to mutations. Therefore, some mutations may only be tolerated in combination with others in a phenomenon known as epistasis. Epistasis reduces the likelihood of obtaining improved functional variants and dramatically slows natural and lab evolutionary processes. Research has shed light on the molecular origins of epistasis and its role in shaping evolutionary trajectories and outcomes. In addition, sequence- and AI-based strategies that infer epistatic relationships from mutational patterns in natural or experimental evolution data have been used to design functional protein variants. In recent years, combinations of such approaches and atomistic design calculations have successfully predicted highly functional combinatorial mutations in active sites. These were used to design thousands of functional active-site variants, demonstrating that, while our understanding of epistasis remains incomplete, some of the determinants that are critical for accurate design are now sufficiently understood. We conclude that the space of active-site variants that has been explored by evolution may be expanded dramatically to enhance natural activities or discover new ones. Furthermore, design opens the way to systematically exploring sequence and structure space and mutational impacts on function, deepening our understanding and control over protein activity.
Collapse
Affiliation(s)
- Rosalie Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
5
|
Amado D, Chaves OA, Cruz PF, Loureiro RJS, Almeida ZL, Jesus CSH, Serpa C, Brito RMM. Folding Kinetics and Volume Variation of the β-Hairpin Peptide Chignolin upon Ultrafast pH-Jumps. J Phys Chem B 2024; 128:4898-4910. [PMID: 38733339 DOI: 10.1021/acs.jpcb.3c08271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2024]
Abstract
In-depth characterization of fundamental folding steps of small model peptides is crucial for a better understanding of the folding mechanisms of more complex biomacromolecules. We have previously reported on the folding/unfolding kinetics of a model α-helix. Here, we study folding transitions in chignolin (GYDPETGTWG), a short β-hairpin peptide previously used as a model to study conformational changes in β-sheet proteins. Although previously suggested, until now, the role of the Tyr2-Trp9 interaction in the folding mechanism of chignolin was not clear. In the present work, pH-dependent conformational changes of chignolin were characterized by circular dichroism (CD), nuclear magnetic resonance (NMR), ultrafast pH-jump coupled with time-resolved photoacoustic calorimetry (TR-PAC), and molecular dynamics (MD) simulations. Taken together, our results present a comprehensive view of chignolin's folding kinetics upon local pH changes and the role of the Tyr2-Trp9 interaction in the folding process. CD data show that chignolin's β-hairpin formation displays a pH-dependent skew bell-shaped curve, with a maximum close to pH 6, and a large decrease in β-sheet content at alkaline pH. The β-hairpin structure is mainly stabilized by aromatic interactions between Tyr2 and Trp9 and CH-π interactions between Tyr2 and Pro4. Unfolding of chignolin at high pH demonstrates that protonation of Tyr2 is essential for the stability of the β-hairpin. Refolding studies were triggered by laser-induced pH-jumps and detected by TR-PAC. The refolding of chignolin from high pH, mainly due to the protonation of Tyr2, is characterized by a volume expansion (10.4 mL mol-1), independent of peptide concentration, in the microsecond time range (lifetime of 1.15 μs). At high pH, the presence of the deprotonated hydroxyl (tyrosinate) hinders the formation of the aromatic interaction between Tyr2 and Trp9 resulting in a more disorganized and dynamic tridimensional structure of the peptide. This was also confirmed by comparing MD simulations of chignolin under conditions mimicking neutral and high pH.
Collapse
Affiliation(s)
- Daniela Amado
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Otávio A Chaves
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Pedro F Cruz
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Rui J S Loureiro
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Zaida L Almeida
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Catarina S H Jesus
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Carlos Serpa
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| | - Rui M M Brito
- CQC-IMS, Department of Chemistry, University of Coimbra, 3004-535 Coimbra, Portugal
| |
Collapse
|
6
|
Zimmerman L, Alon N, Levin I, Koganitsky A, Shpigel N, Brestel C, Lapidoth GD. Context-dependent design of induced-fit enzymes using deep learning generates well-expressed, thermally stable and active enzymes. Proc Natl Acad Sci U S A 2024; 121:e2313809121. [PMID: 38437538 PMCID: PMC10945820 DOI: 10.1073/pnas.2313809121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 02/09/2024] [Indexed: 03/06/2024] Open
Abstract
The potential of engineered enzymes in industrial applications is often limited by their expression levels, thermal stability, and catalytic diversity. De novo enzyme design faces challenges due to the complexity of enzymatic catalysis. An alternative approach involves expanding natural enzyme capabilities for new substrates and parameters. Here, we introduce CoSaNN (Conformation Sampling using Neural Network), an enzyme design strategy using deep learning for structure prediction and sequence optimization. CoSaNN controls enzyme conformations to expand chemical space beyond simple mutagenesis. It employs a context-dependent approach for generating enzyme designs, considering non-linear relationships in sequence and structure space. We also developed SolvIT, a graph NN predicting protein solubility in Escherichia coli, optimizing enzyme expression selection from larger design sets. Using this method, we engineered enzymes with superior expression levels, with 54% expressed in E. coli, and increased thermal stability, with over 30% having higher Tm than the template, with no high-throughput screening. Our research underscores AI's transformative role in protein design, capturing high-order interactions and preserving allosteric mechanisms in extensively modified enzymes, and notably enhancing expression success rates. This method's ease of use and efficiency streamlines enzyme design, opening broad avenues for biotechnological applications and broadening field accessibility.
Collapse
Affiliation(s)
| | - Noga Alon
- Enzymit Ltd., Ness-Ziona7403626, Israel
| | | | | | | | | | | |
Collapse
|
7
|
Sakuma K, Kobayashi N, Sugiki T, Nagashima T, Fujiwara T, Suzuki K, Kobayashi N, Murata T, Kosugi T, Tatsumi-Koga R, Koga N. Design of complicated all-α protein structures. Nat Struct Mol Biol 2024; 31:275-282. [PMID: 38177681 PMCID: PMC11377298 DOI: 10.1038/s41594-023-01147-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Accepted: 10/04/2023] [Indexed: 01/06/2024]
Abstract
A wide range of de novo protein structure designs have been achieved, but the complexity of naturally occurring protein structures is still far beyond these designs. Here, to expand the diversity and complexity of de novo designed protein structures, we sought to develop a method for designing 'difficult-to-describe' α-helical protein structures composed of irregularly aligned α-helices like globins. Backbone structure libraries consisting of a myriad of α-helical structures with five or six helices were generated by combining 18 helix-loop-helix motifs and canonical α-helices, and five distinct topologies were selected for de novo design. The designs were found to be monomeric with high thermal stability in solution and fold into the target topologies with atomic accuracy. This study demonstrated that complicated α-helical proteins are created using typical building blocks. The method we developed will enable us to explore the universe of protein structures for designing novel functional proteins.
Collapse
Affiliation(s)
- Koya Sakuma
- Department of Structural Molecular Science, School of Physical Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan
| | - Naohiro Kobayashi
- RIKEN Center for Biosystems Dynamics Research, RIKEN, Yokohama, Japan
- Institute for Protein Research, Osaka University, Suita, Japan
| | | | - Toshio Nagashima
- RIKEN Center for Biosystems Dynamics Research, RIKEN, Yokohama, Japan
| | | | - Kano Suzuki
- Department of Chemistry, Graduate School of Science, Chiba University, Chiba, Japan
| | - Naoya Kobayashi
- Protein Design Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of National Sciences, Okazaki, Japan
| | - Takeshi Murata
- Department of Chemistry, Graduate School of Science, Chiba University, Chiba, Japan
- Membrane Protein Research Center, Chiba University, Chiba, Japan
- Structural Biology Research Center, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), Tsukuba, Japan
| | - Takahiro Kosugi
- Department of Structural Molecular Science, School of Physical Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan
- Protein Design Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of National Sciences, Okazaki, Japan
- Research Center of Integrative Molecular Systems, Institute for Molecular Science, National Institutes of National Sciences, Okazaki, Japan
| | - Rie Tatsumi-Koga
- Protein Design Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of National Sciences, Okazaki, Japan
| | - Nobuyasu Koga
- Department of Structural Molecular Science, School of Physical Sciences, SOKENDAI (The Graduate University for Advanced Studies), Hayama, Japan.
- Protein Design Group, Exploratory Research Center on Life and Living Systems (ExCELLS), National Institutes of National Sciences, Okazaki, Japan.
- Research Center of Integrative Molecular Systems, Institute for Molecular Science, National Institutes of National Sciences, Okazaki, Japan.
- Institute for Protein Research, Osaka University, Suita, Japan.
| |
Collapse
|
8
|
Kortemme T. De novo protein design-From new structures to programmable functions. Cell 2024; 187:526-544. [PMID: 38306980 PMCID: PMC10990048 DOI: 10.1016/j.cell.2023.12.028] [Citation(s) in RCA: 46] [Impact Index Per Article: 46.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2023] [Revised: 12/03/2023] [Accepted: 12/19/2023] [Indexed: 02/04/2024]
Abstract
Methods from artificial intelligence (AI) trained on large datasets of sequences and structures can now "write" proteins with new shapes and molecular functions de novo, without starting from proteins found in nature. In this Perspective, I will discuss the state of the field of de novo protein design at the juncture of physics-based modeling approaches and AI. New protein folds and higher-order assemblies can be designed with considerable experimental success rates, and difficult problems requiring tunable control over protein conformations and precise shape complementarity for molecular recognition are coming into reach. Emerging approaches incorporate engineering principles-tunability, controllability, and modularity-into the design process from the beginning. Exciting frontiers lie in deconstructing cellular functions with de novo proteins and, conversely, constructing synthetic cellular signaling from the ground up. As methods improve, many more challenges are unsolved.
Collapse
Affiliation(s)
- Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|
9
|
Cummins MC, Tripathy A, Sondek J, Kuhlman B. De novo design of stable proteins that efficaciously inhibit oncogenic G proteins. Protein Sci 2023; 32:e4713. [PMID: 37368504 PMCID: PMC10360382 DOI: 10.1002/pro.4713] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/26/2023] [Revised: 06/23/2023] [Accepted: 06/24/2023] [Indexed: 06/29/2023]
Abstract
Many protein therapeutics are competitive inhibitors that function by binding to endogenous proteins and preventing them from interacting with native partners. One effective strategy for engineering competitive inhibitors is to graft structural motifs from a native partner into a host protein. Here, we develop and experimentally test a computational protocol for embedding binding motifs in de novo designed proteins. The protocol uses an "inside-out" approach: Starting with a structural model of the binding motif docked against the target protein, the de novo protein is built by growing new structural elements off the termini of the binding motif. During backbone assembly, a score function favors backbones that introduce new tertiary contacts within the designed protein and do not introduce clashes with the target binding partner. Final sequences are designed and optimized using the molecular modeling program Rosetta. To test our protocol, we designed small helical proteins to inhibit the interaction between Gαq and its effector PLC-β isozymes. Several of the designed proteins remain folded above 90°C and bind to Gαq with equilibrium dissociation constants tighter than 80 nM. In cellular assays with oncogenic variants of Gαq , the designed proteins inhibit activation of PLC-β isozymes and Dbl-family RhoGEFs. Our results demonstrate that computational protein design, in combination with motif grafting, can be used to directly generate potent inhibitors without further optimization via high throughput screening or selection.
Collapse
Affiliation(s)
- Matthew C. Cummins
- Department of PharmacologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Ashutosh Tripathy
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - John Sondek
- Department of PharmacologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineberger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| | - Brian Kuhlman
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineberger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| |
Collapse
|
10
|
Cordes MHJ, Sundman AK, Fox HC, Binford GJ. Protein salvage and repurposing in evolution: Phospholipase D toxins are stabilized by a remodeled scrap of a membrane association domain. Protein Sci 2023; 32:e4701. [PMID: 37313620 PMCID: PMC10303701 DOI: 10.1002/pro.4701] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/27/2022] [Revised: 06/03/2023] [Accepted: 06/07/2023] [Indexed: 06/15/2023]
Abstract
The glycerophosphodiester phosphodiesterase (GDPD)-like SMaseD/PLD domain family, which includes phospholipase D (PLD) toxins in recluse spiders and actinobacteria, evolved anciently in bacteria from the GDPD. The PLD enzymes retained the core (β/α)8 barrel fold of GDPD, while gaining a signature C-terminal expansion motif and losing a small insertion domain. Using sequence alignments and phylogenetic analysis, we infer that the C-terminal motif derives from a segment of an ancient bacterial PLAT domain. Formally, part of a protein containing a PLAT domain repeat underwent fusion to the C terminus of a GDPD barrel, leading to attachment of a segment of a PLAT domain, followed by a second complete PLAT domain. The complete domain was retained only in some basal homologs, but the PLAT segment was conserved and repurposed as the expansion motif. The PLAT segment corresponds to strands β7-β8 of a β-sandwich, while the expansion motif as represented in spider PLD toxins has been remodeled as an α-helix, a β-strand, and an ordered loop. The GDPD-PLAT fusion led to two acquisitions in founding the GDPD-like SMaseD/PLD family: (1) a PLAT domain that presumably supported early lipase activity by mediating membrane association, and (2) an expansion motif that putatively stabilized the catalytic domain, possibly compensating for, or permitting, loss of the insertion domain. Of wider significance, messy domain shuffling events can leave behind scraps of domains that can be salvaged, remodeled, and repurposed.
Collapse
Affiliation(s)
| | | | - Holden C. Fox
- Department of Chemistry and BiochemistryUniversity of ArizonaTucsonArizonaUSA
| | | |
Collapse
|
11
|
Lutz ID, Wang S, Norn C, Courbet A, Borst AJ, Zhao YT, Dosey A, Cao L, Xu J, Leaf EM, Treichel C, Litvicov P, Li Z, Goodson AD, Rivera-Sánchez P, Bratovianu AM, Baek M, King NP, Ruohola-Baker H, Baker D. Top-down design of protein architectures with reinforcement learning. Science 2023; 380:266-273. [PMID: 37079676 DOI: 10.1126/science.adf6591] [Citation(s) in RCA: 34] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/05/2022] [Accepted: 03/21/2023] [Indexed: 04/22/2023]
Abstract
As a result of evolutionary selection, the subunits of naturally occurring protein assemblies often fit together with substantial shape complementarity to generate architectures optimal for function in a manner not achievable by current design approaches. We describe a "top-down" reinforcement learning-based design approach that solves this problem using Monte Carlo tree search to sample protein conformers in the context of an overall architecture and specified functional constraints. Cryo-electron microscopy structures of the designed disk-shaped nanopores and ultracompact icosahedra are very close to the computational models. The icosohedra enable very-high-density display of immunogens and signaling molecules, which potentiates vaccine response and angiogenesis induction. Our approach enables the top-down design of complex protein nanomaterials with desired system properties and demonstrates the power of reinforcement learning in protein design.
Collapse
Affiliation(s)
- Isaac D Lutz
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| | - Shunzhi Wang
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Christoffer Norn
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- BioInnovation Institute, DK2200 Copenhagen N, Denmark
| | - Alexis Courbet
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Andrew J Borst
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Yan Ting Zhao
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Oral Health Sciences, University of Washington, Seattle, WA, USA
| | - Annie Dosey
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Longxing Cao
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, China
| | - Jinwei Xu
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Elizabeth M Leaf
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Catherine Treichel
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Patrisia Litvicov
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
| | - Zhe Li
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Alexander D Goodson
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | | | | | - Minkyung Baek
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- School of Biological Sciences, Seoul National University, Seoul, Republic of Korea
| | - Neil P King
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
| | - Hannele Ruohola-Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
- Institute for Stem Cell and Regenerative Medicine, University of Washington, Seattle, WA, USA
- Oral Health Sciences, University of Washington, Seattle, WA, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, USA
- Institute for Protein Design, University of Washington, Seattle, WA, USA
- Department of Bioengineering, University of Washington, Seattle, WA, USA
| |
Collapse
|
12
|
Cummins MC, Tripathy A, Sondek J, Kuhlman B. De novo design of stable proteins that efficaciously inhibit oncogenic G proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.28.534629. [PMID: 37034763 PMCID: PMC10081213 DOI: 10.1101/2023.03.28.534629] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 04/22/2023]
Abstract
Many protein therapeutics are competitive inhibitors that function by binding to endogenous proteins and preventing them from interacting with native partners. One effective strategy for engineering competitive inhibitors is to graft structural motifs from a native partner into a host protein. Here, we develop and experimentally test a computational protocol for embedding binding motifs in de novo designed proteins. The protocol uses an "inside-out" approach: Starting with a structural model of the binding motif docked against the target protein, the de novo protein is built by growing new structural elements off the termini of the binding motif. During backbone assembly, a score function favors backbones that introduce new tertiary contacts within the designed protein and do not introduce clashes with the target binding partner. Final sequences are designed and optimized using the molecular modeling program Rosetta. To test our protocol, we designed small helical proteins to inhibit the interaction between Gα q and its effector PLC-β isozymes. Several of the designed proteins remain folded above 90°C and bind to Gα q with equilibrium dissociation constants tighter than 80 nM. In cellular assays with oncogenic variants of Gα q , the designed proteins inhibit activation of PLC-β isozymes and Dbl-family RhoGEFs. Our results demonstrate that computational protein design, in combination with motif grafting, can be used to directly generate potent inhibitors without further optimization via high throughput screening or selection. statement for broader audience Engineered proteins that bind to specific target proteins are useful as research reagents, diagnostics, and therapeutics. We used computational protein design to engineer de novo proteins that bind and competitively inhibit the G protein, Gα q , which is an oncogene for uveal melanomas. This computational method is a general approach that should be useful for designing competitive inhibitors against other proteins of interest.
Collapse
Affiliation(s)
- Matthew C. Cummins
- Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - Ashutosh Tripathy
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
| | - John Sondek
- Department of Pharmacology, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina School of Medicine, Chapel Hill, North Carolina, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|
13
|
Guffy SL, Pulavarti SVSRK, Harrison J, Fleming D, Szyperski T, Kuhlman B. Inside-Out Design of Zinc-Binding Proteins with Non-Native Backbones. Biochemistry 2023; 62:770-781. [PMID: 36634348 PMCID: PMC9939277 DOI: 10.1021/acs.biochem.2c00595] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/14/2023]
Abstract
The de novo design of functional proteins requires specification of tertiary structure and incorporation of molecular binding sites. Here, we develop an inside-out design strategy in the molecular modeling program Rosetta that begins with amino acid side chains from one or two α-helices making well-defined contacts with a ligand. A full-sized protein is then built around the ligand by adding additional helices that promote the formation of a protein core and allow additional contacts with the ligand. The protocol was tested by designing 12 zinc-binding proteins, each with 4-5 helices. Four of the designs were folded and bound to zinc with equilibrium dissociation constants varying between 95 nM and 1.1 μM. The design with the tightest affinity for zinc, N12, adopts a unique conformation in the folded state as assessed with nuclear magnetic resonance (NMR) and the design model closely matches (backbone root-mean-square deviation (RMSD) < 1 Å) an AlphaFold model of the sequence. Retrospective analysis with AlphaFold suggests that the sequences of many of the failed designs did not encode the desired tertiary packing.
Collapse
Affiliation(s)
- Sharon L. Guffy
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | | | - Joseph Harrison
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Drew Fleming
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
| | - Thomas Szyperski
- Department of Chemistry, State University of New York, Buffalo, NY, 14260, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, NC, 27599, USA
- Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, NC, 27599, USA
| |
Collapse
|
14
|
Pearce R, Huang X, Omenn GS, Zhang Y. De novo protein fold design through sequence-independent fragment assembly simulations. Proc Natl Acad Sci U S A 2023; 120:e2208275120. [PMID: 36656852 PMCID: PMC9942881 DOI: 10.1073/pnas.2208275120] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2022] [Accepted: 12/22/2022] [Indexed: 01/20/2023] Open
Abstract
De novo protein design generally consists of two steps, including structure and sequence design. Many protein design studies have focused on sequence design with scaffolds adapted from native structures in the PDB, which renders novel areas of protein structure and function space unexplored. We developed FoldDesign to create novel protein folds from specific secondary structure (SS) assignments through sequence-independent replica-exchange Monte Carlo (REMC) simulations. The method was tested on 354 non-redundant topologies, where FoldDesign consistently created stable structural folds, while recapitulating on average 87.7% of the SS elements. Meanwhile, the FoldDesign scaffolds had well-formed structures with buried residues and solvent-exposed areas closely matching their native counterparts. Despite the high fidelity to the input SS restraints and local structural characteristics of native proteins, a large portion of the designed scaffolds possessed global folds completely different from natural proteins in the PDB, highlighting the ability of FoldDesign to explore novel areas of protein fold space. Detailed data analyses revealed that the major contributions to the successful structure design lay in the optimal energy force field, which contains a balanced set of SS packing terms, and REMC simulations, which were coupled with multiple auxiliary movements to efficiently search the conformational space. Additionally, the ability to recognize and assemble uncommon super-SS geometries, rather than the unique arrangement of common SS motifs, was the key to generating novel folds. These results demonstrate a strong potential to explore both structural and functional spaces through computational design simulations that natural proteins have not reached through evolution.
Collapse
Affiliation(s)
- Robin Pearce
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Xiaoqiang Huang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
| | - Gilbert S. Omenn
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Internal Medicine, University of Michigan, Ann Arbor, MI48109
- Department of Human Genetics, University of Michigan, Ann Arbor, MI48109
- School of Public Health, University of Michigan, Ann Arbor, MI48109
| | - Yang Zhang
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI48109
- Department of Biological Chemistry, University of Michigan, Ann Arbor, MI48109
- Department of Computer Science, School of Computing, National University of Singapore117417, Singapore
- Cancer Science Institute of Singapore, National University of Singapore117599, Singapore
| |
Collapse
|
15
|
Lu H, Cheng Z, Hu Y, Tang LV. What Can De Novo Protein Design Bring to the Treatment of Hematological Disorders? BIOLOGY 2023; 12:166. [PMID: 36829445 PMCID: PMC9952452 DOI: 10.3390/biology12020166] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Revised: 01/17/2023] [Accepted: 01/18/2023] [Indexed: 01/22/2023]
Abstract
Protein therapeutics have been widely used to treat hematological disorders. With the advent of de novo protein design, protein therapeutics are not limited to ameliorating natural proteins but also produce novel protein sequences, folds, and functions with shapes and functions customized to bind to the therapeutic targets. De novo protein techniques have been widely used biomedically to design novel diagnostic and therapeutic drugs, novel vaccines, and novel biological materials. In addition, de novo protein design has provided new options for treating hematological disorders. Scientists have designed protein switches called Colocalization-dependent Latching Orthogonal Cage-Key pRoteins (Co-LOCKR) that perform computations on the surface of cells. De novo designed molecules exhibit a better capacity than the currently available tyrosine kinase inhibitors in chronic myeloid leukemia therapy. De novo designed protein neoleukin-2/15 enhances chimeric antigen receptor T-cell activity. This new technique has great biomedical potential, especially in exploring new treatment methods for hematological disorders. This review discusses the development of de novo protein design and its biological applications, with emphasis on the treatment of hematological disorders.
Collapse
Affiliation(s)
| | | | | | - Liang V. Tang
- Department of Hematology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan 430022, China
| |
Collapse
|
16
|
Lipsh-Sokolik R, Khersonsky O, Schröder SP, de Boer C, Hoch SY, Davies GJ, Overkleeft HS, Fleishman SJ. Combinatorial assembly and design of enzymes. Science 2023; 379:195-201. [PMID: 36634164 DOI: 10.1126/science.ade9434] [Citation(s) in RCA: 41] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Abstract
The design of structurally diverse enzymes is constrained by long-range interactions that are necessary for accurate folding. We introduce an atomistic and machine learning strategy for the combinatorial assembly and design of enzymes (CADENZ) to design fragments that combine with one another to generate diverse, low-energy structures with stable catalytic constellations. We applied CADENZ to endoxylanases and used activity-based protein profiling to recover thousands of structurally diverse enzymes. Functional designs exhibit high active-site preorganization and more stable and compact packing outside the active site. Implementing these lessons into CADENZ led to a 10-fold improved hit rate and more than 10,000 recovered enzymes. This design-test-learn loop can be applied, in principle, to any modular protein family, yielding huge diversity and general lessons on protein design principles.
Collapse
Affiliation(s)
- R Lipsh-Sokolik
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - O Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - S P Schröder
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - C de Boer
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - S-Y Hoch
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| | - G J Davies
- York Structural Biology Laboratory, Department of Chemistry, The University of York, Heslington, York YO10 5DD, UK
| | - H S Overkleeft
- Leiden Institute of Chemistry, Leiden University, Einsteinweg 55, 2300 RA Leiden, Netherlands
| | - S J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, 7610001 Rehovot, Israel
| |
Collapse
|
17
|
Syrlybaeva R, Strauch EM. Deep learning of protein sequence design of protein-protein interactions. Bioinformatics 2023; 39:btac733. [PMID: 36377772 PMCID: PMC9947925 DOI: 10.1093/bioinformatics/btac733] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2022] [Revised: 09/16/2022] [Accepted: 11/14/2022] [Indexed: 11/16/2022] Open
Abstract
MOTIVATION As more data of experimentally determined protein structures are becoming available, data-driven models to describe protein sequence-structure relationships become more feasible. Within this space, the amino acid sequence design of protein-protein interactions is still a rather challenging subproblem with very low success rates-yet, it is central to most biological processes. RESULTS We developed an attention-based deep learning model inspired by algorithms used for image-caption assignments to design peptides or protein fragment sequences. Our trained model can be applied for the redesign of natural protein interfaces or the designed protein interaction fragments. Here, we validate the potential by recapitulating naturally occurring protein-protein interactions including antibody-antigen complexes. The designed interfaces accurately capture essential native interactions and have comparable native-like binding affinities in silico. Furthermore, our model does not need a precise backbone location, making it an attractive tool for working with de novo design of protein-protein interactions. AVAILABILITY AND IMPLEMENTATION The source code of the method is available at https://github.com/strauchlab/iNNterfaceDesign. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Raulia Syrlybaeva
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA 30602, USA
| | - Eva-Maria Strauch
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA 30602, USA
- Institute of Bioinformatics, University of Georgia, Athens, GA 30602, USA
| |
Collapse
|
18
|
Linsky TW, Noble K, Tobin AR, Crow R, Carter L, Urbauer JL, Baker D, Strauch EM. Sampling of structure and sequence space of small protein folds. Nat Commun 2022; 13:7151. [PMID: 36418330 PMCID: PMC9684540 DOI: 10.1038/s41467-022-34937-8] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 11/10/2022] [Indexed: 11/24/2022] Open
Abstract
Nature only samples a small fraction of the sequence space that can fold into stable proteins. Furthermore, small structural variations in a single fold, sometimes only a few amino acids, can define a protein's molecular function. Hence, to design proteins with novel functionalities, such as molecular recognition, methods to control and sample shape diversity are necessary. To explore this space, we developed and experimentally validated a computational platform that can design a wide variety of small protein folds while sampling shape diversity. We designed and evaluated stability of about 30,000 de novo protein designs of eight different folds. Among these designs, about 6,200 stable proteins were identified, including some predicted to have a first-of-its-kind minimalized thioredoxin fold. Obtained data revealed protein folding rules for structural features such as helix-connecting loops. Beyond serving as a resource for protein engineering, this massive and diverse dataset also provides training data for machine learning. We developed an accurate classifier to predict the stability of our designed proteins. The methods and the wide range of protein shapes provide a basis for designing new protein functions without compromising stability.
Collapse
Affiliation(s)
- Thomas W Linsky
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Kyle Noble
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA, 30602, USA
| | - Autumn R Tobin
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA, 30602, USA
| | - Rachel Crow
- Department of Microbiology, University of Washington, Seattle, WA, 98195, USA
| | - Lauren Carter
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
| | - Jeffrey L Urbauer
- Department of Chemistry, University of Georgia, Athens, GA, 30602, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, WA, 98195, USA
- Institute for Protein Design, University of Washington, Seattle, WA, 98195, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, 98195, USA
| | - Eva-Maria Strauch
- Department of Pharmaceutical and Biomedical Sciences, University of Georgia, Athens, GA, 30602, USA.
- Institute of Bioinformatics, University of Georgia, Athens, GA, 30602, USA.
| |
Collapse
|
19
|
Liu H, Chen Q. Computational protein design with data‐driven approaches: Recent developments and perspectives. WIRES COMPUTATIONAL MOLECULAR SCIENCE 2022. [DOI: 10.1002/wcms.1646] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
Affiliation(s)
- Haiyan Liu
- MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, School of Life Sciences, Division of Life Sciences and Medicine University of Science and Technology of China Hefei Anhui China
- Biomedical Sciences and Health Laboratory of Anhui Province University of Science and Technology of China Hefei Anhui China
- School of Data Science University of Science and Technology of China Hefei Anhui China
| | - Quan Chen
- MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, School of Life Sciences, Division of Life Sciences and Medicine University of Science and Technology of China Hefei Anhui China
- Biomedical Sciences and Health Laboratory of Anhui Province University of Science and Technology of China Hefei Anhui China
| |
Collapse
|
20
|
Abstract
De novo protein design enables the exploration of novel sequences and structures absent from the natural protein universe. De novo design also stands as a stringent test for our understanding of the underlying physical principles of protein folding and may lead to the development of proteins with unmatched functional characteristics. The first fundamental challenge of de novo design is to devise "designable" structural templates leading to sequences that will adopt the predicted fold. Here, we built on the TopoBuilder (TB) de novo design method, to automatically assemble structural templates with native-like features starting from string descriptors that capture the overall topology of proteins. Our framework eliminates the dependency of hand-crafted and fold-specific rules through an iterative, data-driven approach that extracts geometrical parameters from structural tertiary motifs. We evaluated the TopoBuilder framework by designing sequences for a set of five protein folds and experimental characterization revealed that several sequences were folded and stable in solution. The TopoBuilder de novo design framework will be broadly useful to guide the generation of artificial proteins with customized geometries, enabling the exploration of the protein universe.
Collapse
|
21
|
Cano-Garrido O, Serna N, Unzueta U, Parladé E, Mangues R, Villaverde A, Vázquez E. Protein scaffolds in human clinics. Biotechnol Adv 2022; 61:108032. [PMID: 36089254 DOI: 10.1016/j.biotechadv.2022.108032] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2022] [Revised: 07/30/2022] [Accepted: 09/03/2022] [Indexed: 11/02/2022]
Abstract
Fundamental clinical areas such as drug delivery and regenerative medicine require biocompatible materials as mechanically stable scaffolds or as nanoscale drug carriers. Among the wide set of emerging biomaterials, polypeptides offer enticing properties over alternative polymers, including full biocompatibility, biodegradability, precise interactivity, structural stability and conformational and functional versatility, all of them tunable by conventional protein engineering. However, proteins from non-human sources elicit immunotoxicities that might bottleneck further development and narrow their clinical applicability. In this context, selecting human proteins or developing humanized protein versions as building blocks is a strict demand to design non-immunogenic protein materials. We review here the expanding catalogue of human or humanized proteins tailored to execute different levels of scaffolding functions and how they can be engineered as self-assembling materials in form of oligomers, polymers or complex networks. In particular, we emphasize those that are under clinical development, revising their fields of applicability and how they have been adapted to offer, apart from mere mechanical support, highly refined functions and precise molecular interactions.
Collapse
Affiliation(s)
- Olivia Cano-Garrido
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain
| | - Naroa Serna
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain; CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Instituto de Salud Carlos III, 08193 Cerdanyola del Vallès (Barcelona), Spain
| | - Ugutz Unzueta
- CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Instituto de Salud Carlos III, 08193 Cerdanyola del Vallès (Barcelona), Spain; Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain; Biomedical Research Institute Sant Pau (IIB Sant Pau), 08025 Barcelona, Spain; Josep Carreras Leukaemia Research Institute, 08916 Badalona (Barcelona), Spain
| | - Eloi Parladé
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain; CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Instituto de Salud Carlos III, 08193 Cerdanyola del Vallès (Barcelona), Spain
| | - Ramón Mangues
- Biomedical Research Institute Sant Pau (IIB Sant Pau), 08025 Barcelona, Spain; Josep Carreras Leukaemia Research Institute, 08916 Badalona (Barcelona), Spain
| | - Antonio Villaverde
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain; CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Instituto de Salud Carlos III, 08193 Cerdanyola del Vallès (Barcelona), Spain; Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain.
| | - Esther Vázquez
- Institut de Biotecnologia i de Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain; CIBER de Bioingeniería, Biomateriales y Nanomedicina (CIBER-BBN), Instituto de Salud Carlos III, 08193 Cerdanyola del Vallès (Barcelona), Spain; Departament de Genètica i de Microbiologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès (Barcelona), Spain.
| |
Collapse
|
22
|
Listov D, Lipsh‐Sokolik R, Rosset S, Yang C, Correia BE, Fleishman SJ. Assessing and enhancing foldability in designed proteins. Protein Sci 2022; 31:e4400. [PMID: 36040259 PMCID: PMC9375437 DOI: 10.1002/pro.4400] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2022] [Revised: 07/09/2022] [Accepted: 07/11/2022] [Indexed: 11/11/2022]
Abstract
Recent advances in protein-design methodology have led to a dramatic increase in reliability and scale. With these advances, dozens and even thousands of designed proteins are automatically generated and screened. Nevertheless, the success rate, particularly in design of functional proteins, is low and fundamental goals such as reliable de novo design of efficient enzymes remain beyond reach. Experimental analyses have consistently indicated that a major reason for design failure is inaccuracy and misfolding relative to the design conception. To address this challenge, we describe complementary methods to diagnose and ameliorate suboptimal regions in designed proteins: first, we develop a Rosetta atomistic computational mutation scanning approach to detect energetically suboptimal positions in designs (available on a web server https://pSUFER.weizmann.ac.il); second, we demonstrate that AlphaFold2 ab initio structure prediction flags regions that may misfold in designed enzymes and binders; and third, we focus FuncLib design calculations on suboptimal positions in a previously designed low-efficiency enzyme, improving its catalytic efficiency by 330-fold. Furthermore, applied to a de novo designed protein that exhibited limited stability, the same approach markedly improved stability and expressibility. Thus, foldability analysis and enhancement may dramatically increase the success rate in design of functional proteins.
Collapse
Affiliation(s)
- Dina Listov
- Department of Biomolecular SciencesWeizmann Institute of ScienceRehovotIsrael
| | | | - Stéphane Rosset
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne (EPFL)LausanneSwitzerland
| | - Che Yang
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne (EPFL)LausanneSwitzerland
| | - Bruno E. Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne (EPFL)LausanneSwitzerland
| | | |
Collapse
|
23
|
Li L, Chen G. Precise Assembly of Proteins and Carbohydrates for Next-Generation Biomaterials. J Am Chem Soc 2022; 144:16232-16251. [PMID: 36044681 DOI: 10.1021/jacs.2c04418] [Citation(s) in RCA: 27] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The complexity and diversity of biomacromolecules make them a unique class of building blocks for generating precise assemblies. They are particularly available to a new generation of biomaterials integrated with living systems due to their intrinsic properties such as accurate recognition, self-organization, and adaptability. Therefore, many excellent approaches have been developed, leading to a variety of quite practical outcomes. Here, we review recent advances in the fabrication and application of artificially precise assemblies by employing proteins and carbohydrates as building blocks, followed by our perspectives on some of new challenges, goals, and opportunities for the future research directions in this field.
Collapse
Affiliation(s)
- Long Li
- The State Key Laboratory of Molecular Engineering of Polymers and Department of Macromolecular Science, Fudan University, Shanghai 200433, People's Republic of China
| | - Guosong Chen
- The State Key Laboratory of Molecular Engineering of Polymers and Department of Macromolecular Science, Fudan University, Shanghai 200433, People's Republic of China.,Multiscale Research Institute for Complex Systems, Fudan University, Shanghai 200433, People's Republic of China
| |
Collapse
|
24
|
Abstract
The potential of miniproteins in the biological and chemical sciences is constantly increasing. Significant progress in the design methodologies has been achieved over the last 30 years. Early approaches based on propensities of individual amino acid residues to form individual secondary structures were subsequently improved by structural analyses using NMR spectroscopy and crystallography. Consequently, computational algorithms were developed, which are now highly successful in designing structures with accuracy often close to atomic range. Further perspectives include construction of miniproteins incorporating non-native secondary structures derived from sequences with units other than α-amino acids. Noteworthy, miniproteins with extended structures, which are now feasibly accessible, are excellent scaffolds for construction of functional molecules.
Collapse
|
25
|
Koebke KJ, Pinter TBJ, Pitts WC, Pecoraro VL. Catalysis and Electron Transfer in De Novo Designed Metalloproteins. Chem Rev 2022; 122:12046-12109. [PMID: 35763791 PMCID: PMC10735231 DOI: 10.1021/acs.chemrev.1c01025] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
One of the hallmark advances in our understanding of metalloprotein function is showcased in our ability to design new, non-native, catalytically active protein scaffolds. This review highlights progress and milestone achievements in the field of de novo metalloprotein design focused on reports from the past decade with special emphasis on de novo designs couched within common subfields of bioinorganic study: heme binding proteins, monometal- and dimetal-containing catalytic sites, and metal-containing electron transfer sites. Within each subfield, we highlight several of what we have identified as significant and important contributions to either our understanding of that subfield or de novo metalloprotein design as a discipline. These reports are placed in context both historically and scientifically. General suggestions for future directions that we feel will be important to advance our understanding or accelerate discovery are discussed.
Collapse
Affiliation(s)
- Karl J. Koebke
- Department of Chemistry, University of Michigan Ann Arbor, MI 48109 USA
| | | | - Winston C. Pitts
- Department of Chemistry, University of Michigan Ann Arbor, MI 48109 USA
| | | |
Collapse
|
26
|
Magi Meconi G, Sasselli IR, Bianco V, Onuchic JN, Coluzza I. Key aspects of the past 30 years of protein design. REPORTS ON PROGRESS IN PHYSICS. PHYSICAL SOCIETY (GREAT BRITAIN) 2022; 85:086601. [PMID: 35704983 DOI: 10.1088/1361-6633/ac78ef] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Accepted: 06/15/2022] [Indexed: 06/15/2023]
Abstract
Proteins are the workhorse of life. They are the building infrastructure of living systems; they are the most efficient molecular machines known, and their enzymatic activity is still unmatched in versatility by any artificial system. Perhaps proteins' most remarkable feature is their modularity. The large amount of information required to specify each protein's function is analogically encoded with an alphabet of just ∼20 letters. The protein folding problem is how to encode all such information in a sequence of 20 letters. In this review, we go through the last 30 years of research to summarize the state of the art and highlight some applications related to fundamental problems of protein evolution.
Collapse
Affiliation(s)
- Giulia Magi Meconi
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | - Ivan R Sasselli
- Computational Biophysics Lab, Center for Cooperative Research in Biomaterials (CIC biomaGUNE), Basque Research and Technology Alliance (BRTA), Paseo de Miramon 182, 20014, Donostia-San Sebastián, Spain
| | | | - Jose N Onuchic
- Center for Theoretical Biological Physics, Department of Physics & Astronomy, Department of Chemistry, Department of Biosciences, Rice University, Houston, TX 77251, United States of America
| | - Ivan Coluzza
- BCMaterials, Basque Center for Materials, Applications and Nanostructures, Bld. Martina Casiano, UPV/EHU Science Park, Barrio Sarriena s/n, 48940 Leioa, Spain
- Basque Foundation for Science, Ikerbasque, 48009, Bilbao, Spain
| |
Collapse
|
27
|
Cummins MC, Jacobs TM, Teets FD, DiMaio F, Tripathy A, Kuhlman B. AlphaFold accurately predicts distinct conformations based on the oligomeric state of a de novo designed protein. Protein Sci 2022; 31:e4368. [PMID: 35762713 PMCID: PMC9207892 DOI: 10.1002/pro.4368] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2022] [Revised: 05/09/2022] [Accepted: 05/28/2022] [Indexed: 11/12/2022]
Abstract
Using the molecular modeling program Rosetta, we designed a de novo protein, called SEWN0.1, which binds the heterotrimeric G protein Gαq. The design is helical, well-folded, and primarily monomeric in solution at a concentration of 10 μM. However, when we solved the crystal structure of SEWN0.1 at 1.9 Å, we observed a dimer in a conformation incompatible with binding Gαq . Unintentionally, we had designed a protein that adopts alternate conformations depending on its oligomeric state. Recently, there has been tremendous progress in the field of protein structure prediction as new methods in artificial intelligence have been used to predict structures with high accuracy. We were curious if the structure prediction method AlphaFold could predict the structure of SEWN0.1 and if the prediction depended on oligomeric state. When AlphaFold was used to predict the structure of monomeric SEWN0.1, it produced a model that resembles the Rosetta design model and is compatible with binding Gαq , but when used to predict the structure of a dimer, it predicted a conformation that closely resembles the SEWN0.1 crystal structure. AlphaFold's ability to predict multiple conformations for a single protein sequence should be useful for engineering protein switches.
Collapse
Affiliation(s)
- Matthew C. Cummins
- Department of PharmacologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Tim M. Jacobs
- Department of Bioinformatics and Computational BiologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- AbCellera Biologics Inc.VancouverBritish ColumbiaCanada
| | - Frank D. Teets
- Department of Bioinformatics and Computational BiologyUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Department of Computational BiologyAndoverMassachusettsUSA
| | - Frank DiMaio
- Department of BiochemistryUniversity of WashingtonSeattleWashingtonUSA
| | - Ashutosh Tripathy
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
| | - Brian Kuhlman
- Department of Biochemistry and BiophysicsUniversity of North Carolina School of MedicineChapel HillNorth CarolinaUSA
- Lineburger Comprehensive Cancer CenterUniversity of North Carolina at Chapel HillChapel HillNorth CarolinaUSA
| |
Collapse
|
28
|
Swanson S, Sivaraman V, Grigoryan G, Keating AE. Tertiary motifs as building blocks for the design of protein-binding peptides. Protein Sci 2022; 31:e4322. [PMID: 35634780 PMCID: PMC9088223 DOI: 10.1002/pro.4322] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Revised: 04/12/2022] [Accepted: 04/14/2022] [Indexed: 11/07/2022]
Abstract
Despite advances in protein engineering, the de novo design of small proteins or peptides that bind to a desired target remains a difficult task. Most computational methods search for binder structures in a library of candidate scaffolds, which can lead to designs with poor target complementarity and low success rates. Instead of choosing from pre-defined scaffolds, we propose that custom peptide structures can be constructed to complement a target surface. Our method mines tertiary motifs (TERMs) from known structures to identify surface-complementing fragments or "seeds." We combine seeds that satisfy geometric overlap criteria to generate peptide backbones and score the backbones to identify the most likely binding structures. We found that TERM-based seeds can describe known binding structures with high resolution: the vast majority of peptide binders from 486 peptide-protein complexes can be covered by seeds generated from single-chain structures. Furthermore, we demonstrate that known peptide structures can be reconstructed with high accuracy from peptide-covering seeds. As a proof of concept, we used our method to design 100 peptide binders of TRAF6, seven of which were predicted by Rosetta to form higher-quality interfaces than a native binder. The designed peptides interact with distinct sites on TRAF6, including the native peptide-binding site. These results demonstrate that known peptide-binding structures can be constructed from TERMs in single-chain structures and suggest that TERM information can be applied to efficiently design novel target-complementing binders.
Collapse
Affiliation(s)
- Sebastian Swanson
- Department of BiologyMassachusetts Institute of TechnologyCambridgeMassachusettsUSA
| | - Venkatesh Sivaraman
- Department of BiologyMassachusetts Institute of TechnologyCambridgeMassachusettsUSA
| | - Gevorg Grigoryan
- Department of Computer ScienceDartmouth CollegeHanoverNew HampshireUSA
| | - Amy E. Keating
- Department of BiologyMassachusetts Institute of TechnologyCambridgeMassachusettsUSA
- Department of Biological EngineeringMassachusetts Institute of TechnologyCambridgeMassachusettsUSA
- Koch Center for Integrative Cancer ResearchMassachusetts Institute of TechnologyCambridgeMassachusettsUSA
| |
Collapse
|
29
|
Khersonsky O, Fleishman SJ. What Have We Learned from Design of Function in Large Proteins? BIODESIGN RESEARCH 2022; 2022:9787581. [PMID: 37850148 PMCID: PMC10521758 DOI: 10.34133/2022/9787581] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Accepted: 02/21/2022] [Indexed: 10/19/2023] Open
Abstract
The overarching goal of computational protein design is to gain complete control over protein structure and function. The majority of sophisticated binders and enzymes, however, are large and exhibit diverse and complex folds that defy atomistic design calculations. Encouragingly, recent strategies that combine evolutionary constraints from natural homologs with atomistic calculations have significantly improved design accuracy. In these approaches, evolutionary constraints mitigate the risk from misfolding and aggregation, focusing atomistic design calculations on a small but highly enriched sequence subspace. Such methods have dramatically optimized diverse proteins, including vaccine immunogens, enzymes for sustainable chemistry, and proteins with therapeutic potential. The new generation of deep learning-based ab initio structure predictors can be combined with these methods to extend the scope of protein design, in principle, to any natural protein of known sequence. We envision that protein engineering will come to rely on completely computational methods to efficiently discover and optimize biomolecular activities.
Collapse
Affiliation(s)
- Olga Khersonsky
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Sarel J. Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| |
Collapse
|
30
|
A backbone-centred energy function of neural networks for protein design. Nature 2022; 602:523-528. [PMID: 35140398 DOI: 10.1038/s41586-021-04383-5] [Citation(s) in RCA: 50] [Impact Index Per Article: 16.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2021] [Accepted: 12/23/2021] [Indexed: 12/29/2022]
Abstract
A protein backbone structure is designable if a substantial number of amino acid sequences exist that autonomously fold into it1,2. It has been suggested that the designability of backbones is governed mainly by side chain-independent or side chain type-insensitive molecular interactions3-5, indicating an approach for designing new backbones (ready for amino acid selection) based on continuous sampling and optimization of the backbone-centred energy surface. However, a sufficiently comprehensive and precise energy function has yet to be established for this purpose. Here we show that this goal is met by a statistical model named SCUBA (for Side Chain-Unknown Backbone Arrangement) that uses neural network-form energy terms. These terms are learned with a two-step approach that comprises kernel density estimation followed by neural network training and can analytically represent multidimensional, high-order correlations in known protein structures. We report the crystal structures of nine de novo proteins whose backbones were designed to high precision using SCUBA, four of which have novel, non-natural overall architectures. By eschewing use of fragments from existing protein structures, SCUBA-driven structure design facilitates far-reaching exploration of the designable backbone space, thus extending the novelty and diversity of the proteins amenable to de novo design.
Collapse
|
31
|
Boral A, Khamaru M, Mitra D. Designing synthetic transcription factors: A structural perspective. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2022; 130:245-287. [PMID: 35534109 DOI: 10.1016/bs.apcsb.2021.12.003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/14/2023]
Abstract
In this chapter, we discuss different design strategies of synthetic proteins, especially synthetic transcription factors. Design and engineering of synthetic transcription factors is particularly relevant for the need-based manipulation of gene expression. With recent advances in structural biology techniques and with the emergence of other precision biochemical/physical tools, accurate knowledge on structure-function relations is increasingly becoming available. Besides discussing the underlying principles of design, we go through individual cases, especially those involving four major groups of transcription factors-basic leucine zippers, zinc fingers, helix-turn-helix and homeodomains. We further discuss how synthetic biology can come together with structural biology to alter the genetic blueprint of life.
Collapse
Affiliation(s)
- Aparna Boral
- Department of Life Sciences, Presidency University, Kolkata, West Bengal, India
| | - Madhurima Khamaru
- Department of Life Sciences, Presidency University, Kolkata, West Bengal, India
| | - Devrani Mitra
- Department of Life Sciences, Presidency University, Kolkata, West Bengal, India.
| |
Collapse
|
32
|
Srinivasan S, Vanni S. Computational Approaches to Investigate and Design Lipid-binding Domains for Membrane Biosensing. Chimia (Aarau) 2021; 75:1031-1036. [PMID: 34920773 DOI: 10.2533/chimia.2021.1031] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/19/2022] Open
Abstract
Association of proteins with cellular membranes is critical for signaling and membrane trafficking processes. Many peripheral lipid-binding domains have been identified in the last few decades and have been investigated for their specific lipid-sensing properties using traditional in vivo and in vitro studies. However, several knowledge gaps remain owing to intrinsic limitations of these methodologies. Thus, novel approaches are necessary to further our understanding in lipid-protein biology. This review briefly discusses lipid-binding domains that act as specific lipid biosensors and provides a broad perspective on the computational approaches such as molecular dynamics (MD) simulations and machine learning (ML)-based techniques that can be used to study protein-membrane interactions. We also highlight the need for de novo design of proteins that elicit specific lipid-binding properties.
Collapse
Affiliation(s)
| | - Stefano Vanni
- Department of Biology, University of Fribourg, Switzerland;,
| |
Collapse
|
33
|
Hussain M, Cummins MC, Endo-Streeter S, Sondek J, Kuhlman B. Designer proteins that competitively inhibit Gα q by targeting its effector site. J Biol Chem 2021; 297:101348. [PMID: 34715131 PMCID: PMC8633581 DOI: 10.1016/j.jbc.2021.101348] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2021] [Revised: 10/12/2021] [Accepted: 10/22/2021] [Indexed: 11/30/2022] Open
Abstract
During signal transduction, the G protein, Gαq, binds and activates phospholipase C-β isozymes. Several diseases have been shown to manifest upon constitutively activating mutation of Gαq, such as uveal melanoma. Therefore, methods are needed to directly inhibit Gαq. Previously, we demonstrated that a peptide derived from a helix-turn-helix (HTH) region of PLC-β3 (residues 852-878) binds Gαq with low micromolar affinity and inhibits Gαq by competing with full-length PLC-β isozymes for binding. Since the HTH peptide is unstructured in the absence of Gαq, we hypothesized that embedding the HTH in a folded protein might stabilize the binding-competent conformation and further improve the potency of inhibition. Using the molecular modeling software Rosetta, we searched the Protein Data Bank for proteins with similar HTH structures near their surface. The candidate proteins were computationally docked against Gαq, and their surfaces were redesigned to stabilize this interaction. We then used yeast surface display to affinity mature the designs. The most potent design bound Gαq/i with high affinity in vitro (KD = 18 nM) and inhibited activation of PLC-β isozymes in HEK293 cells. We anticipate that our genetically encoded inhibitor will help interrogate the role of Gαq in healthy and disease model systems. Our work demonstrates that grafting interaction motifs into folded proteins is a powerful approach for generating inhibitors of protein-protein interactions.
Collapse
Affiliation(s)
- Mahmud Hussain
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Matthew C Cummins
- Department of Pharmacology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - Stuart Endo-Streeter
- Department of Pharmacology, University of North Carolina, Chapel Hill, North Carolina, USA
| | - John Sondek
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, USA; Department of Pharmacology, University of North Carolina, Chapel Hill, North Carolina, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, USA.
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina, Chapel Hill, North Carolina, USA; Lineberger Comprehensive Cancer Center, University of North Carolina, Chapel Hill, North Carolina, USA.
| |
Collapse
|
34
|
Lindenburg LH, Pantelejevs T, Gielen F, Zuazua-Villar P, Butz M, Rees E, Kaminski CF, Downs JA, Hyvönen M, Hollfelder F. Improved RAD51 binders through motif shuffling based on the modularity of BRC repeats. Proc Natl Acad Sci U S A 2021; 118:e2017708118. [PMID: 34772801 PMCID: PMC8727024 DOI: 10.1073/pnas.2017708118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/10/2021] [Indexed: 01/20/2023] Open
Abstract
Exchanges of protein sequence modules support leaps in function unavailable through point mutations during evolution. Here we study the role of the two RAD51-interacting modules within the eight binding BRC repeats of BRCA2. We created 64 chimeric repeats by shuffling these modules and measured their binding to RAD51. We found that certain shuffled module combinations were stronger binders than any of the module combinations in the natural repeats. Surprisingly, the contribution from the two modules was poorly correlated with affinities of natural repeats, with a weak BRC8 repeat containing the most effective N-terminal module. The binding of the strongest chimera, BRC8-2, to RAD51 was improved by -2.4 kCal/mol compared to the strongest natural repeat, BRC4. A crystal structure of RAD51:BRC8-2 complex shows an improved interface fit and an extended β-hairpin in this repeat. BRC8-2 was shown to function in human cells, preventing the formation of nuclear RAD51 foci after ionizing radiation.
Collapse
Affiliation(s)
- Laurens H Lindenburg
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom
| | - Teodors Pantelejevs
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom
| | - Fabrice Gielen
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom
- Living Systems Institute, University of Exeter, Exeter EX4 4QD, United Kingdom
| | - Pedro Zuazua-Villar
- Division of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, United Kingdom
| | - Maren Butz
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom
| | - Eric Rees
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, United Kingdom
| | - Clemens F Kaminski
- Department of Chemical Engineering and Biotechnology, University of Cambridge, Cambridge CB3 0AS, United Kingdom
| | - Jessica A Downs
- Division of Cancer Biology, The Institute of Cancer Research, London SW3 6JB, United Kingdom
| | - Marko Hyvönen
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom;
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, Cambridge CB2 1GA, United Kingdom;
| |
Collapse
|
35
|
Pinto GP, Corbella M, Demkiv AO, Kamerlin SCL. Exploiting enzyme evolution for computational protein design. Trends Biochem Sci 2021; 47:375-389. [PMID: 34544655 DOI: 10.1016/j.tibs.2021.08.008] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Revised: 08/18/2021] [Accepted: 08/24/2021] [Indexed: 11/15/2022]
Abstract
Recent years have seen an explosion of interest in understanding the physicochemical parameters that shape enzyme evolution, as well as substantial advances in computational enzyme design. This review discusses three areas where evolutionary information can be used as part of the design process: (i) using ancestral sequence reconstruction (ASR) to generate new starting points for enzyme design efforts; (ii) learning from how nature uses conformational dynamics in enzyme evolution to mimic this process in silico; and (iii) modular design of enzymes from smaller fragments, again mimicking the process by which nature appears to create new protein folds. Using showcase examples, we highlight the importance of incorporating evolutionary information to continue to push forward the boundaries of enzyme design studies.
Collapse
Affiliation(s)
- Gaspar P Pinto
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Marina Corbella
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | - Andrey O Demkiv
- Department of Chemistry - BMC, Uppsala University, BMC Box 576, S-751 23 Uppsala, Sweden
| | | |
Collapse
|
36
|
Pereira JM, Vieira M, Santos SM. Step-by-step design of proteins for small molecule interaction: A review on recent milestones. Protein Sci 2021; 30:1502-1520. [PMID: 33934427 PMCID: PMC8284594 DOI: 10.1002/pro.4098] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2021] [Revised: 04/21/2021] [Accepted: 04/23/2021] [Indexed: 01/01/2023]
Abstract
Protein design is the field of synthetic biology that aims at developing de novo custom-made proteins and peptides for specific applications. Despite exploring an ambitious goal, recent computational advances in both hardware and software technologies have paved the way to high-throughput screening and detailed design of novel folds and improved functionalities. Modern advances in the field of protein design for small molecule targeting are described in this review, organized in a step-by-step fashion: from the conception of a new or upgraded active binding site, to scaffold design, sequence optimization, and experimental expression of the custom protein. In each step, contemporary examples are described, and state-of-the-art software is briefly explored.
Collapse
Affiliation(s)
- José M. Pereira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Maria Vieira
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| | - Sérgio M. Santos
- CICECO & Departamento de QuímicaUniversidade de AveiroAveiroPortugal
| |
Collapse
|
37
|
Woolfson DN. A Brief History of De Novo Protein Design: Minimal, Rational, and Computational. J Mol Biol 2021; 433:167160. [PMID: 34298061 DOI: 10.1016/j.jmb.2021.167160] [Citation(s) in RCA: 82] [Impact Index Per Article: 20.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2021] [Revised: 07/07/2021] [Accepted: 07/12/2021] [Indexed: 12/26/2022]
Abstract
Protein design has come of age, but how will it mature? In the 1980s and the 1990s, the primary motivation for de novo protein design was to test our understanding of the informational aspect of the protein-folding problem; i.e., how does protein sequence determine protein structure and function? This necessitated minimal and rational design approaches whereby the placement of each residue in a design was reasoned using chemical principles and/or biochemical knowledge. At that time, though with some notable exceptions, the use of computers to aid design was not widespread. Over the past two decades, the tables have turned and computational protein design is firmly established. Here, I illustrate this progress through a timeline of de novo protein structures that have been solved to atomic resolution and deposited in the Protein Data Bank. From this, it is clear that the impact of rational and computational design has been considerable: More-complex and more-sophisticated designs are being targeted with many being resolved to atomic resolution. Furthermore, our ability to generate and manipulate synthetic proteins has advanced to a point where they are providing realistic alternatives to natural protein functions for applications both in vitro and in cells. Also, and increasingly, computational protein design is becoming accessible to non-specialists. This all begs the questions: Is there still a place for minimal and rational design approaches? And, what challenges lie ahead for the burgeoning field of de novo protein design as a whole?
Collapse
Affiliation(s)
- Derek N Woolfson
- School of Chemistry, University of Bristol, Cantock's Close, Bristol BS8 1TS, UK; School of Biochemistry, University of Bristol, Biomedical Sciences Building, University Walk, Bristol BS8 1TD, UK; Bristol BioDesign Institute, University of Bristol, Life Sciences Building, Tyndall Avenue, Bristol BS8 1TQ, UK.
| |
Collapse
|
38
|
Romero-Romero S, Kordes S, Michel F, Höcker B. Evolution, folding, and design of TIM barrels and related proteins. Curr Opin Struct Biol 2021; 68:94-104. [PMID: 33453500 PMCID: PMC8250049 DOI: 10.1016/j.sbi.2020.12.007] [Citation(s) in RCA: 36] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/13/2020] [Accepted: 12/14/2020] [Indexed: 12/16/2022]
Abstract
Proteins are chief actors in life that perform a myriad of exquisite functions. This diversity has been enabled through the evolution and diversification of protein folds. Analysis of sequences and structures strongly suggest that numerous protein pieces have been reused as building blocks and propagated to many modern folds. This information can be traced to understand how the protein world has diversified. In this review, we discuss the latest advances in the analysis of protein evolutionary units, and we use as a model system one of the most abundant and versatile topologies, the TIM-barrel fold, to highlight the existing common principles that interconnect protein evolution, structure, folding, function, and design.
Collapse
Affiliation(s)
| | - Sina Kordes
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Florian Michel
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, 95447 Bayreuth, Germany.
| |
Collapse
|
39
|
Ferruz N, Noske J, Höcker B. Protlego: A Python package for the analysis and design of chimeric proteins. Bioinformatics 2021; 37:3182-3189. [PMID: 33901273 PMCID: PMC8504633 DOI: 10.1093/bioinformatics/btab253] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2020] [Revised: 03/05/2021] [Accepted: 04/19/2021] [Indexed: 01/03/2023] Open
Abstract
Motivation Duplication and recombination of protein fragments have led to the highly diverse protein space that we observe today. By mimicking this natural process, the design of protein chimeras via fragment recombination has proven experimentally successful and has opened a new era for the design of customizable proteins. The in silico building of structural models for these chimeric proteins, however, remains a manual task that requires a considerable degree of expertise and is not amenable for high-throughput studies. Energetic and structural analysis of the designed proteins often require the use of several tools, each with their unique technical difficulties and available in different programming languages or web servers. Results We implemented a Python package that enables automated, high-throughput design of chimeras and their structural analysis. First, it fetches evolutionarily conserved fragments from a built-in database (also available at fuzzle.uni-bayreuth.de). These relationships can then be represented via networks or further selected for chimera construction via recombination. Designed chimeras or natural proteins are then scored and minimized with the Charmm and Amber forcefields and their diverse structural features can be analyzed at ease. Here, we showcase Protlego’s pipeline by exploring the relationships between the P-loop and Rossmann superfolds, building and characterizing their offspring chimeras. We believe that Protlego provides a powerful new tool for the protein design community. Availability and implementation Protlego runs on the Linux platform and is freely available at (https://hoecker-lab.github.io/protlego/) with tutorials and documentation. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Noelia Ferruz
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Jakob Noske
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| | - Birte Höcker
- Department of Biochemistry, University of Bayreuth, Bayreuth, Germany
| |
Collapse
|
40
|
Frappier V, Keating AE. Data-driven computational protein design. Curr Opin Struct Biol 2021; 69:63-69. [PMID: 33910104 DOI: 10.1016/j.sbi.2021.03.009] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2020] [Revised: 03/18/2021] [Accepted: 03/19/2021] [Indexed: 01/28/2023]
Abstract
Computational protein design can generate proteins not found in nature that adopt desired structures and perform novel functions. Although proteins could, in theory, be designed with ab initio methods, practical success has come from using large amounts of data that describe the sequences, structures, and functions of existing proteins and their variants. We present recent creative uses of multiple-sequence alignments, protein structures, and high-throughput functional assays in computational protein design. Approaches range from enhancing structure-based design with experimental data to building regression models to training deep neural nets that generate novel sequences. Looking ahead, deep learning will be increasingly important for maximizing the value of data for protein design.
Collapse
Affiliation(s)
- Vincent Frappier
- Generate Biomedicines, 26 Landsdowne Street, Cambridge, MA, 02139, USA
| | - Amy E Keating
- MIT Departments of Biology and Biological Engineering, 77 Massachusetts Ave., Cambridge, MA, 02139, USA.
| |
Collapse
|
41
|
Meinen BA, Bahl CD. Breakthroughs in computational design methods open up new frontiers for de novo protein engineering. Protein Eng Des Sel 2021; 34:6243354. [DOI: 10.1093/protein/gzab007] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Revised: 02/16/2021] [Accepted: 03/08/2021] [Indexed: 02/03/2023] Open
Abstract
Abstract
Proteins catalyze the majority of chemical reactions in organisms, and harnessing this power has long been the focus of the protein engineering field. Computational protein design aims to create new proteins and functions in silico, and in doing so, accelerate the process, reduce costs and enable more sophisticated engineering goals to be accomplished. Challenges that very recently seemed impossible are now within reach thanks to several landmark advances in computational protein design methods. Here, we summarize these new methods, with a particular emphasis on de novo protein design advancements occurring within the past 5 years.
Collapse
Affiliation(s)
- Ben A Meinen
- Institute for Protein Innovation, Harvard Institutes of Medicine 4 Blackfan Circle, Room 941 Boston, MA 02115-5701 Boston, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| | - Christopher D Bahl
- Institute for Protein Innovation, Harvard Institutes of Medicine 4 Blackfan Circle, Room 941 Boston, MA 02115-5701 Boston, USA
- Division of Hematology/Oncology, Boston Children's Hospital, Boston, MA, USA
- Department of Pediatrics, Harvard Medical School, Boston, MA, USA
| |
Collapse
|
42
|
Le KH, Adolf-Bryfogle J, Klima JC, Lyskov S, Labonte J, Bertolani S, Burman SSR, Leaver-Fay A, Weitzner B, Maguire J, Rangan R, Adrianowycz MA, Alford RF, Adal A, Nance ML, Wu Y, Willis J, Kulp DW, Das R, Dunbrack RL, Schief W, Kuhlman B, Siegel JB, Gray JJ. PyRosetta Jupyter Notebooks Teach Biomolecular Structure Prediction and Design. BIOPHYSICIST (ROCKVILLE, MD.) 2021; 2:108-122. [PMID: 35128343 DOI: 10.35459/tbp.2019.000147] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/27/2022]
Abstract
Biomolecular structure drives function, and computational capabilities have progressed such that the prediction and computational design of biomolecular structures is increasingly feasible. Because computational biophysics attracts students from many different backgrounds and with different levels of resources, teaching the subject can be challenging. One strategy to teach diverse learners is with interactive multimedia material that promotes self-paced, active learning. We have created a hands-on education strategy with a set of sixteen modules that teach topics in biomolecular structure and design, from fundamentals of conformational sampling and energy evaluation to applications like protein docking, antibody design, and RNA structure prediction. Our modules are based on PyRosetta, a Python library that encapsulates all computational modules and methods in the Rosetta software package. The workshop-style modules are implemented as Jupyter Notebooks that can be executed in the Google Colaboratory, allowing learners access with just a web browser. The digital format of Jupyter Notebooks allows us to embed images, molecular visualization movies, and interactive coding exercises. This multimodal approach may better reach students from different disciplines and experience levels as well as attract more researchers from smaller labs and cognate backgrounds to leverage PyRosetta in their science and engineering research. All materials are freely available at https://github.com/RosettaCommons/PyRosetta.notebooks.
Collapse
Affiliation(s)
- Kathy H Le
- T. C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| | - Jared Adolf-Bryfogle
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, California, United States
| | - Jason C Klima
- Institute for Protein Design, University of Washington, Seattle, Washington, United States.,Department of Biochemistry, University of Washington, Seattle, Washington, United States.,Lyell Immunopharma, Inc., Seattle, Washington, United States
| | - Sergey Lyskov
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Jason Labonte
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States.,Department of Chemistry, Franklin & Marshall College, Lancaster, Pennsylvania, United States
| | - Steven Bertolani
- Department of Chemistry, Department of Biochemistry and Molecular Medicine, Genome Center, University of California, Davis, Davis, California, United States
| | - Shourya S Roy Burman
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Andrew Leaver-Fay
- Department of Biochemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Brian Weitzner
- Institute for Protein Design, University of Washington, Seattle, Washington, United States.,Department of Biochemistry, University of Washington, Seattle, Washington, United States.,Lyell Immunopharma, Inc., Seattle, Washington, United States
| | - Jack Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Ramya Rangan
- Program in Biophysics, Stanford University, Stanford, California, United States
| | - Matt A Adrianowycz
- Program in Biophysics, Stanford University, Stanford, California, United States
| | - Rebecca F Alford
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Aleexsan Adal
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States
| | - Morgan L Nance
- Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| | - Yuanhan Wu
- Vaccine and Immunotherapy Center, Wistar Institute, Philadelphia, Pennsylvania, United States
| | - Jordan Willis
- RubrYc Therapeutics, San Ramon, California, United States
| | - Daniel W Kulp
- Vaccine and Immunotherapy Center, Wistar Institute, Philadelphia, Pennsylvania, United States
| | - Rhiju Das
- Program in Biophysics, Stanford University, Stanford, California, United States
| | | | - William Schief
- Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, California, United States
| | - Brian Kuhlman
- Department of Biochemistry, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States.,Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, United States
| | - Justin B Siegel
- Department of Chemistry, Department of Biochemistry and Molecular Medicine, Genome Center, University of California, Davis, Davis, California, United States
| | - Jeffrey J Gray
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States.,Program in Molecular Biophysics, Johns Hopkins University, Baltimore, Maryland, United States
| |
Collapse
|
43
|
Schoeder C, Schmitz S, Adolf-Bryfogle J, Sevy AM, Finn JA, Sauer MF, Bozhanova NG, Mueller BK, Sangha AK, Bonet J, Sheehan JH, Kuenze G, Marlow B, Smith ST, Woods H, Bender BJ, Martina CE, del Alamo D, Kodali P, Gulsevin A, Schief WR, Correia BE, Crowe JE, Meiler J, Moretti R. Modeling Immunity with Rosetta: Methods for Antibody and Antigen Design. Biochemistry 2021; 60:825-846. [PMID: 33705117 PMCID: PMC7992133 DOI: 10.1021/acs.biochem.0c00912] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Revised: 03/02/2021] [Indexed: 01/16/2023]
Abstract
Structure-based antibody and antigen design has advanced greatly in recent years, due not only to the increasing availability of experimentally determined structures but also to improved computational methods for both prediction and design. Constant improvements in performance within the Rosetta software suite for biomolecular modeling have given rise to a greater breadth of structure prediction, including docking and design application cases for antibody and antigen modeling. Here, we present an overview of current protocols for antibody and antigen modeling using Rosetta and exemplify those by detailed tutorials originally developed for a Rosetta workshop at Vanderbilt University. These tutorials cover antibody structure prediction, docking, and design and antigen design strategies, including the addition of glycans in Rosetta. We expect that these materials will allow novice users to apply Rosetta in their own projects for modeling antibodies and antigens.
Collapse
Affiliation(s)
- Clara
T. Schoeder
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Samuel Schmitz
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Jared Adolf-Bryfogle
- Department
of Immunology and Microbiology, The Scripps
Research Institute, La Jolla, California 92037, United States
- IAVI
Neutralizing Antibody Center, The Scripps
Research Institute, La Jolla, California 92037, United States
| | - Alexander M. Sevy
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
| | - Jessica A. Finn
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
- Department
of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee 37232, United States
| | - Marion F. Sauer
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
| | - Nina G. Bozhanova
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Benjamin K. Mueller
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Amandeep K. Sangha
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Jaume Bonet
- Institute
of Bioengineering, École Polytechnique
Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - Jonathan H. Sheehan
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Georg Kuenze
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Institute
for Drug Discovery, University Leipzig Medical
School, 04103 Leipzig, Germany
| | - Brennica Marlow
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Shannon T. Smith
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Hope Woods
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Brian J. Bender
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Department
of Pharmacology, Vanderbilt University, Nashville, Tennessee 37212, United States
| | - Cristina E. Martina
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Diego del Alamo
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Chemical
and Physical Biology Program, Vanderbilt
University, Nashville, Tennessee 37232-0301, United States
| | - Pranav Kodali
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - Alican Gulsevin
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| | - William R. Schief
- Department
of Immunology and Microbiology, The Scripps
Research Institute, La Jolla, California 92037, United States
- IAVI
Neutralizing Antibody Center, The Scripps
Research Institute, La Jolla, California 92037, United States
| | - Bruno E. Correia
- Institute
of Bioengineering, École Polytechnique
Fédérale de Lausanne, CH-1015 Lausanne, Switzerland
| | - James E. Crowe
- Vanderbilt
Vaccine Center, Vanderbilt University Medical
Center, Nashville, Tennessee 37232-0417, United States
- Department
of Pathology, Microbiology, and Immunology, Vanderbilt University Medical Center, Nashville, Tennessee 37232, United States
- Department
of Pediatrics, Vanderbilt University Medical
Center, Nashville, Tennessee 37232, United States
| | - Jens Meiler
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
- Institute
for Drug Discovery, University Leipzig Medical
School, 04103 Leipzig, Germany
| | - Rocco Moretti
- Department
of Chemistry, Vanderbilt University, Nashville, Tennessee 37212, United States
- Center
for Structural Biology, Vanderbilt University, Nashville, Tennessee 37240-7917, United States
| |
Collapse
|
44
|
Lindorff-Larsen K, Teilum K. Linking thermodynamics and measurements of protein stability. Protein Eng Des Sel 2021; 34:6173616. [PMID: 33724431 DOI: 10.1093/protein/gzab002] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2020] [Revised: 12/21/2020] [Accepted: 01/12/2021] [Indexed: 11/13/2022] Open
Abstract
We review the background, theory and general equations for the analysis of equilibrium protein unfolding experiments, focusing on denaturant and heat-induced unfolding. The primary focus is on the thermodynamics of reversible folding/unfolding transitions and the experimental methods that are available for extracting thermodynamic parameters. We highlight the importance of modelling both how the folding equilibrium depends on a perturbing variable such as temperature or denaturant concentration, and the importance of modelling the baselines in the experimental observables.
Collapse
Affiliation(s)
- Kresten Lindorff-Larsen
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kaare Teilum
- Structural Biology and NMR Laboratory & Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
45
|
Laniado J, Meador K, Yeates TO. A fragment-based protein interface design algorithm for symmetric assemblies. Protein Eng Des Sel 2021; 34:gzab008. [PMID: 33955480 PMCID: PMC8101011 DOI: 10.1093/protein/gzab008] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/16/2021] [Accepted: 03/08/2021] [Indexed: 11/13/2022] Open
Abstract
Theoretical and experimental advances in protein engineering have led to the creation of precisely defined, novel protein assemblies of great size and complexity, with diverse applications. One powerful approach involves designing a new attachment or binding interface between two simpler symmetric oligomeric protein components. The required methods of design, which present both similarities and key differences compared to problems in protein docking, remain challenging and are not yet routine. With the aim of more fully enabling this emerging area of protein material engineering, we developed a computer program, nanohedra, to introduce two key advances. First, we encoded in the program the construction rules (i.e. the search space parameters) that underlie all possible symmetric material constructions. Second, we developed algorithms for rapidly identifying favorable docking/interface arrangements based on tabulations of empirical patterns of known protein fragment-pair associations. As a result, the candidate poses that nanohedra generates for subsequent amino acid interface design appear highly native-like (at the protein backbone level), while simultaneously conforming to the exacting requirements for symmetry-based assembly. A retrospective computational analysis of successful vs failed experimental studies supports the expectation that this should improve the success rate for this challenging area of protein engineering.
Collapse
Affiliation(s)
- Joshua Laniado
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
| | - Kyle Meador
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
| | - Todd O Yeates
- UCLA Molecular Biology Institute, Los Angeles, CA 90095, USA
- UCLA Department of Chemistry and Biochemistry, Los Angeles, CA 90095, USA
- UCLA DOE Institute for Genomics and Proteomics, Los Angeles, CA 90095, USA
| |
Collapse
|
46
|
Abstract
While native proteins cover diverse structural spaces and achieve various biological events, not many of them can directly serve human needs. One reason is that the native proteins usually contain idiosyncrasies evolved for their native functions but disfavoring engineering requirements. To overcome this issue, one strategy is to create de novo proteins which are designed to possess improved stability, high environmental tolerance, and enhanced engineering potential. Compared to other protein engineering strategies, in silico design of de novo proteins has significantly expanded the protein structural and sequence spaces, reduced wet lab workload, and incorporated engineered features in a guided and efficient manner. In the Baker laboratory we have been applying a design pipeline that uses the blueprint builder to design different folds of de novo proteins, and have successfully obtained libraries of de novo proteins with improved stability and engineering potential. In this article, we will use the design of de novo β-barrel proteins as an example to describe the principles and basic procedures of the blueprint builder-based design pipeline. © 2020 Wiley Periodicals LLC. Basic Protocol 1: The construction of blueprints Alternate Protocol: Build blueprints based on existing protein .pdb files Basic Protocol 2: De novo protein design pipeline using the blueprint builder.
Collapse
Affiliation(s)
- Linna An
- Institute for Protein Design, University of Washington, Seattle, Washington
| | - Gyu Rie Lee
- Institute for Protein Design, University of Washington, Seattle, Washington
| |
Collapse
|
47
|
Searching protein space for ancient sub-domain segments. Curr Opin Struct Biol 2021; 68:105-112. [PMID: 33476896 DOI: 10.1016/j.sbi.2020.11.006] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/11/2020] [Accepted: 11/29/2020] [Indexed: 01/08/2023]
Abstract
Evolutionary processes that formed the current protein universe left their traces, among them homologous segments that recur, or are 'reused,' in multiple proteins. These reused segments, called 'themes,' can be found at various scales, the best known of which is the domain. Yet, recent studies have begun to focus on the evolutionary insights that can be derived from sub-domain-scale themes, which are candidates for traces of more ancient events. Characterizing these may provide clues to the emergence of domains. Particularly interesting are themes that are reused across dissimilar contexts, that is, where the rest of the protein domain differs. We survey computational studies identifying reused themes within different contexts at the sub-domain level.
Collapse
|
48
|
Pan X, Kortemme T. Recent advances in de novo protein design: Principles, methods, and applications. J Biol Chem 2021; 296:100558. [PMID: 33744284 PMCID: PMC8065224 DOI: 10.1016/j.jbc.2021.100558] [Citation(s) in RCA: 111] [Impact Index Per Article: 27.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2021] [Revised: 03/12/2021] [Accepted: 03/16/2021] [Indexed: 02/06/2023] Open
Abstract
The computational de novo protein design is increasingly applied to address a number of key challenges in biomedicine and biological engineering. Successes in expanding applications are driven by advances in design principles and methods over several decades. Here, we review recent innovations in major aspects of the de novo protein design and include how these advances were informed by principles of protein architecture and interactions derived from the wealth of structures in the Protein Data Bank. We describe developments in de novo generation of designable backbone structures, optimization of sequences, design scoring functions, and the design of the function. The advances not only highlight design goals reachable now but also point to the challenges and opportunities for the future of the field.
Collapse
Affiliation(s)
- Xingjie Pan
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, USA; UC Berkeley - UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, California, USA.
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, California, USA; UC Berkeley - UCSF Graduate Program in Bioengineering, University of California San Francisco, San Francisco, California, USA; Quantitative Biosciences Institute (QBI), University of California San Francisco, San Francisco, California, USA.
| |
Collapse
|
49
|
Pirro F, Schmidt N, Lincoff J, Widel ZX, Polizzi NF, Liu L, Therien MJ, Grabe M, Chino M, Lombardi A, DeGrado WF. Allosteric cooperation in a de novo-designed two-domain protein. Proc Natl Acad Sci U S A 2020; 117:33246-33253. [PMID: 33318174 PMCID: PMC7776816 DOI: 10.1073/pnas.2017062117] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
We describe the de novo design of an allosterically regulated protein, which comprises two tightly coupled domains. One domain is based on the DF (Due Ferri in Italian or two-iron in English) family of de novo proteins, which have a diiron cofactor that catalyzes a phenol oxidase reaction, while the second domain is based on PS1 (Porphyrin-binding Sequence), which binds a synthetic Zn-porphyrin (ZnP). The binding of ZnP to the original PS1 protein induces changes in structure and dynamics, which we expected to influence the catalytic rate of a fused DF domain when appropriately coupled. Both DF and PS1 are four-helix bundles, but they have distinct bundle architectures. To achieve tight coupling between the domains, they were connected by four helical linkers using a computational method to discover the most designable connections capable of spanning the two architectures. The resulting protein, DFP1 (Due Ferri Porphyrin), bound the two cofactors in the expected manner. The crystal structure of fully reconstituted DFP1 was also in excellent agreement with the design, and it showed the ZnP cofactor bound over 12 Å from the dimetal center. Next, a substrate-binding cleft leading to the diiron center was introduced into DFP1. The resulting protein acts as an allosterically modulated phenol oxidase. Its Michaelis-Menten parameters were strongly affected by the binding of ZnP, resulting in a fourfold tighter Km and a 7-fold decrease in kcat These studies establish the feasibility of designing allosterically regulated catalytic proteins, entirely from scratch.
Collapse
Affiliation(s)
- Fabio Pirro
- Department of Chemical Sciences, University of Napoli Federico II, 80126 Napoli, Italy
| | - Nathan Schmidt
- Department of Pharmaceutical Chemistry and the Cardiovascular Research Institute, University of California, San Francisco, CA 94158-9001
| | - James Lincoff
- Department of Pharmaceutical Chemistry and the Cardiovascular Research Institute, University of California, San Francisco, CA 94158-9001
| | - Zachary X Widel
- Department of Chemistry, Duke University, Durham, NC 27708-0346
| | - Nicholas F Polizzi
- Department of Pharmaceutical Chemistry and the Cardiovascular Research Institute, University of California, San Francisco, CA 94158-9001
| | - Lijun Liu
- State Key Laboratory of Chemical Oncogenomics, School of Chemical Biology and Biotechnology, Peking University Shenzhen Graduate School, 518055 Shenzhen, China
- DLX Scientific, Lawrence, KS 66049
| | | | - Michael Grabe
- Department of Pharmaceutical Chemistry and the Cardiovascular Research Institute, University of California, San Francisco, CA 94158-9001
| | - Marco Chino
- Department of Chemical Sciences, University of Napoli Federico II, 80126 Napoli, Italy
| | - Angela Lombardi
- Department of Chemical Sciences, University of Napoli Federico II, 80126 Napoli, Italy;
| | - William F DeGrado
- Department of Pharmaceutical Chemistry and the Cardiovascular Research Institute, University of California, San Francisco, CA 94158-9001;
| |
Collapse
|
50
|
Maguire JB, Haddox HK, Strickland D, Halabiya SF, Coventry B, Griffin JR, Pulavarti SVSRK, Cummins M, Thieker DF, Klavins E, Szyperski T, DiMaio F, Baker D, Kuhlman B. Perturbing the energy landscape for improved packing during computational protein design. Proteins 2020; 89:436-449. [PMID: 33249652 DOI: 10.1002/prot.26030] [Citation(s) in RCA: 101] [Impact Index Per Article: 20.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 10/04/2020] [Accepted: 11/21/2020] [Indexed: 01/04/2023]
Abstract
The FastDesign protocol in the molecular modeling program Rosetta iterates between sequence optimization and structure refinement to stabilize de novo designed protein structures and complexes. FastDesign has been used previously to design novel protein folds and assemblies with important applications in research and medicine. To promote sampling of alternative conformations and sequences, FastDesign includes stages where the energy landscape is smoothened by reducing repulsive forces. Here, we discover that this process disfavors larger amino acids in the protein core because the protein compresses in the early stages of refinement. By testing alternative ramping strategies for the repulsive weight, we arrive at a scheme that produces lower energy designs with more native-like sequence composition in the protein core. We further validate the protocol by designing and experimentally characterizing over 4000 proteins and show that the new protocol produces higher stability proteins.
Collapse
Affiliation(s)
- Jack B Maguire
- Program in Bioinformatics and Computational Biology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Hugh K Haddox
- Department of Biochemistry, University of Washington, Seattle, Washington, USA.,Institute for Protein Design, University of Washington, Seattle, Washington, USA
| | - Devin Strickland
- Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, USA
| | - Samer F Halabiya
- Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, USA
| | - Brian Coventry
- Institute for Protein Design, University of Washington, Seattle, Washington, USA.,Molecular Engineering PhD Program, University of Washington, Seattle, Washington, USA
| | - Jermel R Griffin
- Department of Chemistry, State University of New York at Buffalo, Buffalo, New York, USA
| | | | - Matthew Cummins
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - David F Thieker
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| | - Eric Klavins
- Department of Electrical and Computer Engineering, University of Washington, Seattle, Washington, USA
| | - Thomas Szyperski
- Department of Chemistry, State University of New York at Buffalo, Buffalo, New York, USA
| | - Frank DiMaio
- Department of Biochemistry, University of Washington, Seattle, Washington, USA.,Institute for Protein Design, University of Washington, Seattle, Washington, USA
| | - David Baker
- Department of Biochemistry, University of Washington, Seattle, Washington, USA.,Institute for Protein Design, University of Washington, Seattle, Washington, USA.,Howard Hughes Medical Institute, University of Washington, Seattle, Washington, USA
| | - Brian Kuhlman
- Department of Biochemistry and Biophysics, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA.,Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina, USA
| |
Collapse
|