1
|
Hu Y, Zhang Q, Bai X, Men L, Ma J, Li D, Xu M, Wei Q, Chen R, Wang D, Yin X, Hu T, Xie T. Screening and modification of (+)-germacrene A synthase for the production of the anti-tumor drug (-)-β-elemene in engineered Saccharomyces cerevisiae. Int J Biol Macromol 2024; 279:135455. [PMID: 39260653 DOI: 10.1016/j.ijbiomac.2024.135455] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2024] [Revised: 09/06/2024] [Accepted: 09/06/2024] [Indexed: 09/13/2024]
Abstract
(-)-β-Elemene is a primary bioactive compound derived from Curcuma wenyujin and has been widely utilized as an anti-tumor agent for various types of cancer. Due to the inefficiency of plant extraction methods for β-elemene, significant efforts have been directed toward the heterogeneous biosynthesis of β-elemene using microbial cell factories. However, there has been less emphasis on the stereochemical configuration of germacrene A and its rearranged product, β-elemene. In this study, we constructed a yeast cell factory to produce (-)-β-elemene by optimizing the mevalonate pathway and screening for germacrene A synthases (GASs) from both plant and microbial sources. Notably, we discovered that the rearranged products of GASs exhibited different conformations, and only (+)-germacrene A produced by plant-derived GASs could rearrange to form (-)-β-elemene. Building on this discovery, we further investigated the catalytic mechanisms of GASs and developed an efficient catalytic gene module for generating (+)-germacrene A. Ultimately, the engineered yeast produced 1152 mg/L of (-)-β-elemene, marking the highest titer reported in yeast to date. Overall, this work highlights the differences in the stereoconformations of catalytic products between plant- and microbial-derived germacrene A synthases and establishes a foundation for the green and efficient production of β-elemene with a specific stereochemical configuration.
Collapse
Affiliation(s)
- Yuhan Hu
- School of Pharmacy, Chengdu University of Traditional Chinese Medicine, Chengdu 611137, China
| | - Qin Zhang
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Xue Bai
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Lianhui Men
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Jing Ma
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Dengyu Li
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Mengdie Xu
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China
| | - Qiuhui Wei
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China; Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou 311121, China
| | - Rong Chen
- School of Public Health, Hangzhou Normal University, Hangzhou 311121, China
| | - Daming Wang
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China; Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou 311121, China
| | - Xiaopu Yin
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China; Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou 311121, China.
| | - Tianyuan Hu
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China; Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou 311121, China.
| | - Tian Xie
- School of Pharmacy, Hangzhou Normal University, Hangzhou 311121, China; Key Laboratory of Elemene Class Anti-Cancer Chinese Medicines, Engineering Laboratory of Development and Application of Traditional Chinese Medicines, Collaborative Innovation Center of Traditional Chinese Medicines of Zhejiang Province, Hangzhou Normal University, Hangzhou 311121, China.
| |
Collapse
|
2
|
Rosenberg AM, Ayres CM, Medina-Cucurella AV, Whitehead TA, Baker BM. Enhanced T cell receptor specificity through framework engineering. Front Immunol 2024; 15:1345368. [PMID: 38545094 PMCID: PMC10967027 DOI: 10.3389/fimmu.2024.1345368] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Accepted: 02/15/2024] [Indexed: 04/12/2024] Open
Abstract
Development of T cell receptors (TCRs) as immunotherapeutics is hindered by inherent TCR cross-reactivity. Engineering more specific TCRs has proven challenging, as unlike antibodies, improving TCR affinity does not usually improve specificity. Although various protein design approaches have been explored to surmount this, mutations in TCR binding interfaces risk broadening specificity or introducing new reactivities. Here we explored if TCR specificity could alternatively be tuned through framework mutations distant from the interface. Studying the 868 TCR specific for the HIV SL9 epitope presented by HLA-A2, we used deep mutational scanning to identify a framework mutation above the mobile CDR3β loop. This glycine to proline mutation had no discernable impact on binding affinity or functional avidity towards the SL9 epitope but weakened recognition of SL9 escape variants and led to fewer responses in a SL9-derived positional scanning library. In contrast, an interfacial mutation near the tip of CDR3α that also did not impact affinity or functional avidity towards SL9 weakened specificity. Simulations indicated that the specificity-enhancing mutation functions by reducing the range of loop motions, limiting the ability of the TCR to adjust to different ligands. Although our results are likely to be TCR dependent, using framework engineering to control TCR loop motions may be a viable strategy for improving the specificity of TCR-based immunotherapies.
Collapse
Affiliation(s)
- Aaron M. Rosenberg
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN, United States
| | - Cory M. Ayres
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN, United States
| | | | - Timothy A. Whitehead
- Department of Chemical and Biological Engineering, University of Colorado Boulder, Boulder, CO, United States
| | - Brian M. Baker
- Department of Chemistry and Biochemistry and the Harper Cancer Research Institute, University of Notre Dame, Notre Dame, IN, United States
| |
Collapse
|
3
|
Park SY, Qiu J, Wei S, Peterson FC, Beltrán J, Medina-Cucurella AV, Vaidya AS, Xing Z, Volkman BF, Nusinow DA, Whitehead TA, Wheeldon I, Cutler SR. An orthogonalized PYR1-based CID module with reprogrammable ligand-binding specificity. Nat Chem Biol 2024; 20:103-110. [PMID: 37872402 PMCID: PMC10746540 DOI: 10.1038/s41589-023-01447-7] [Citation(s) in RCA: 9] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Accepted: 09/13/2023] [Indexed: 10/25/2023]
Abstract
Plants sense abscisic acid (ABA) using chemical-induced dimerization (CID) modules, including the receptor PYR1 and HAB1, a phosphatase inhibited by ligand-activated PYR1. This system is unique because of the relative ease with which ligand recognition can be reprogrammed. To expand the PYR1 system, we designed an orthogonal '*' module, which harbors a dimer interface salt bridge; X-ray crystallographic, biochemical and in vivo analyses confirm its orthogonality. We used this module to create PYR1*MANDI/HAB1* and PYR1*AZIN/HAB1*, which possess nanomolar sensitivities to their activating ligands mandipropamid and azinphos-ethyl. Experiments in Arabidopsis thaliana and Saccharomyces cerevisiae demonstrate the sensitive detection of banned organophosphate contaminants using living biosensors and the construction of multi-input/output genetic circuits. Our new modules enable ligand-programmable multi-channel CID systems for plant and eukaryotic synthetic biology that can empower new plant-based and microbe-based sensing modalities.
Collapse
Affiliation(s)
- Sang-Youl Park
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Jingde Qiu
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Shuang Wei
- Department of Biochemistry, University of California, Riverside, Riverside, CA, USA
| | - Francis C Peterson
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | - Jesús Beltrán
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
- Department of Plant and Soil Sciences, Delaware Biotechnology Institute, University of Delaware, Newark, DE, USA
| | | | - Aditya S Vaidya
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Zenan Xing
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA
| | - Brian F Volkman
- Department of Biochemistry, Medical College of Wisconsin, Milwaukee, WI, USA
| | | | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| | - Ian Wheeldon
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA.
- Department of Chemical and Environmental Engineering, University of California, Riverside, Riverside, CA, USA.
- Center for Industrial Biotechnology, University of California, Riverside, Riverside, CA, USA.
| | - Sean R Cutler
- Department of Botany and Plant Sciences, University of California, Riverside, Riverside, CA, USA.
- Institute for Integrative Genome Biology, University of California, Riverside, Riverside, CA, USA.
| |
Collapse
|
4
|
Leonard AC, Weinstein JJ, Steiner PJ, Erbse AH, Fleishman SJ, Whitehead TA. Stabilization of the SARS-CoV-2 receptor binding domain by protein core redesign and deep mutational scanning. Protein Eng Des Sel 2022; 35:gzac002. [PMID: 35325236 PMCID: PMC9077414 DOI: 10.1093/protein/gzac002] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2021] [Revised: 01/21/2022] [Accepted: 02/16/2022] [Indexed: 11/12/2022] Open
Abstract
Stabilizing antigenic proteins as vaccine immunogens or diagnostic reagents is a stringent case of protein engineering and design as the exterior surface must maintain recognition by receptor(s) and antigen-specific antibodies at multiple distinct epitopes. This is a challenge, as stability enhancing mutations must be focused on the protein core, whereas successful computational stabilization algorithms typically select mutations at solvent-facing positions. In this study, we report the stabilization of SARS-CoV-2 Wuhan Hu-1 Spike receptor binding domain using a combination of deep mutational scanning and computational design, including the FuncLib algorithm. Our most successful design encodes I358F, Y365W, T430I, and I513L receptor binding domain mutations, maintains recognition by the receptor ACE2 and a panel of different anti-receptor binding domain monoclonal antibodies, is between 1 and 2°C more thermally stable than the original receptor binding domain using a thermal shift assay, and is less proteolytically sensitive to chymotrypsin and thermolysin than the original receptor binding domain. Our approach could be applied to the computational stabilization of a wide range of proteins without requiring detailed knowledge of active sites or binding epitopes. We envision that this strategy may be particularly powerful for cases when there are multiple or unknown binding sites.
Collapse
Affiliation(s)
- Alison C Leonard
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80303, USA
| | - Jonathan J Weinstein
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80303, USA
| | - Annette H Erbse
- Department of Biochemistry, University of Colorado, Boulder, CO 80303, USA
| | - Sarel J Fleishman
- Department of Biomolecular Sciences, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80303, USA
| |
Collapse
|
5
|
Kirby MB, Whitehead TA. Facile Assembly of Combinatorial Mutagenesis Libraries Using Nicking Mutagenesis. Methods Mol Biol 2022; 2461:85-109. [PMID: 35727445 PMCID: PMC9730894 DOI: 10.1007/978-1-0716-2152-3_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Combinatorial mutagenesis is a method where multiple user-defined mutations are encoded at defined positions in a sequence. Combinatorial mutagenic libraries can be used in a variety of applications including evaluating fundamental questions about molecular evolution, directed evolution workflows for enzyme engineering, and in better understanding of biological processes like antibody affinity maturation. Here, we show a method of combinatorial mutagenesis utilizing the template-based nicking mutagenesis with several modifications. We show an example for generating a combinatorial library with 14 mutated positions, a total of 16,384 library variants, and a protocol for the generation of large, user-defined combinatorial libraries. The reader can use this protocol to create such libraries in 2 days.
Collapse
Affiliation(s)
- Monica B Kirby
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA.
| |
Collapse
|
6
|
Haas CM, Francino-Urdaniz IM, Steiner PJ, Whitehead TA. Identification of SARS-CoV-2 S RBD escape mutants using yeast screening and deep mutational scanning. STAR Protoc 2021; 2:100869. [PMID: 34568839 PMCID: PMC8455247 DOI: 10.1016/j.xpro.2021.100869] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Here, we describe a protocol to identify escape mutants on the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) spike (S) receptor-binding domain (RBD) using a yeast screen combined with deep mutational scanning. Over 90% of all potential single S RBD escape mutants can be identified for monoclonal antibodies that directly compete with angiotensin-converting enzyme 2 for binding. Six to 10 antibodies can be assessed in parallel. This approach has been shown to determine escape mutants that are consistent with more laborious SARS-CoV-2 pseudoneutralization assays. For complete details on the use and execution of this protocol, please refer to Francino-Urdaniz et al. (2021). A protocol for identifying S escape mutations for anti-S monoclonal antibodies All anti-S antibodies that competitively inhibit ACE2 can be tested Protocol uses yeast display screening of freely available S RBD libraries Open-source software used to analyze sequencing data and identify escape mutants
Collapse
Affiliation(s)
- Cyrus M Haas
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, Boulder, CO 80303, USA
| | - Irene M Francino-Urdaniz
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, Boulder, CO 80303, USA
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, Boulder, CO 80303, USA
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, Boulder, CO 80303, USA
| |
Collapse
|
7
|
Chu HY, Wong ASL. Facilitating Machine Learning-Guided Protein Engineering with Smart Library Design and Massively Parallel Assays. ADVANCED GENETICS (HOBOKEN, N.J.) 2021; 2:2100038. [PMID: 36619853 PMCID: PMC9744531 DOI: 10.1002/ggn2.202100038] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/07/2021] [Revised: 11/08/2021] [Indexed: 01/11/2023]
Abstract
Protein design plays an important role in recent medical advances from antibody therapy to vaccine design. Typically, exhaustive mutational screens or directed evolution experiments are used for the identification of the best design or for improvements to the wild-type variant. Even with a high-throughput screening on pooled libraries and Next-Generation Sequencing to boost the scale of read-outs, surveying all the variants with combinatorial mutations for their empirical fitness scores is still of magnitudes beyond the capacity of existing experimental settings. To tackle this challenge, in-silico approaches using machine learning to predict the fitness of novel variants based on a subset of empirical measurements are now employed. These machine learning models turn out to be useful in many cases, with the premise that the experimentally determined fitness scores and the amino-acid descriptors of the models are informative. The machine learning models can guide the search for the highest fitness variants, resolve complex epistatic relationships, and highlight bio-physical rules for protein folding. Using machine learning-guided approaches, researchers can build more focused libraries, thus relieving themselves from labor-intensive screens and fast-tracking the optimization process. Here, we describe the current advances in massive-scale variant screens, and how machine learning and mutagenesis strategies can be integrated to accelerate protein engineering. More specifically, we examine strategies to make screens more economical, informative, and effective in discovery of useful variants.
Collapse
Affiliation(s)
- Hoi Yee Chu
- Laboratory of Combinatorial Genetics and Synthetic BiologySchool of Biomedical SciencesThe University of Hong KongHong Kong852China
| | - Alan S. L. Wong
- Laboratory of Combinatorial Genetics and Synthetic BiologySchool of Biomedical SciencesThe University of Hong KongHong Kong852China
- Electrical and Electronic EngineeringThe University of Hong KongPokfulamHong Kong852China
| |
Collapse
|
8
|
Leonard AC, Weinstein JJ, Steiner PJ, Erbse AH, Fleishman SJ, Whitehead TA. Stabilization of the SARS-CoV-2 Receptor Binding Domain by Protein Core Redesign and Deep Mutational Scanning.. [PMID: 34845448 PMCID: PMC8629191 DOI: 10.1101/2021.11.22.469552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
Stabilizing antigenic proteins as vaccine immunogens or diagnostic reagents is a stringent case of protein engineering and design as the exterior surface must maintain recognition by receptor(s) and antigen—specific antibodies at multiple distinct epitopes. This is a challenge, as stability-enhancing mutations must be focused on the protein core, whereas successful computational stabilization algorithms typically select mutations at solvent-facing positions. In this study we report the stabilization of SARS-CoV-2 Wuhan Hu-1 Spike receptor binding domain (S RBD) using a combination of deep mutational scanning and computational design, including the FuncLib algorithm. Our most successful design encodes I358F, Y365W, T430I, and I513L RBD mutations, maintains recognition by the receptor ACE2 and a panel of different anti-RBD monoclonal antibodies, is between 1–2°C more thermally stable than the original RBD using a thermal shift assay, and is less proteolytically sensitive to chymotrypsin and thermolysin than the original RBD. Our approach could be applied to the computational stabilization of a wide range of proteins without requiring detailed knowledge of active sites or binding epitopes, particularly powerful for cases when there are multiple or unknown binding sites.
Collapse
|
9
|
Kirby MB, Medina-Cucurella AV, Baumer ZT, Whitehead TA. Optimization of multi-site nicking mutagenesis for generation of large, user-defined combinatorial libraries. Protein Eng Des Sel 2021; 34:gzab017. [PMID: 34341824 PMCID: PMC8502461 DOI: 10.1093/protein/gzab017] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Revised: 06/09/2021] [Accepted: 06/17/2021] [Indexed: 11/13/2022] Open
Abstract
Generating combinatorial libraries of specific sets of mutations are essential for addressing protein engineering questions involving contingency in molecular evolution, epistatic relationships between mutations, as well as functional antibody and enzyme engineering. Here we present optimization of a combinatorial mutagenesis method involving template-based nicking mutagenesis, which allows for the generation of libraries with >99% coverage for tens of thousands of user-defined variants. The non-optimized method resulted in low library coverage, which could be rationalized by a model of oligonucleotide annealing bias resulting from the nucleotide mismatch free-energy difference between mutagenic oligo and template. The optimized method mitigated this thermodynamic bias using longer primer sets and faster annealing conditions. Our updated method, applied to two antibody fragments, delivered between 99.0% (32451/32768 library members) to >99.9% coverage (32757/32768) for our desired libraries in 2 days and at an approximate 140-fold sequencing depth of coverage.
Collapse
Affiliation(s)
- Monica B Kirby
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80305, USA
| | - Angélica V Medina-Cucurella
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI 48824, USA
- GigaGen Inc., South San Francisco, CA 94080, USA
| | - Zachary T Baumer
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80305, USA
| | - Timothy A Whitehead
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO 80305, USA
| |
Collapse
|
10
|
Munro D, Singh M. DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction. Bioinformatics 2020; 36:5322-5329. [PMID: 33325500 PMCID: PMC8016454 DOI: 10.1093/bioinformatics/btaa1030] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 10/16/2020] [Accepted: 11/30/2020] [Indexed: 01/27/2023] Open
Abstract
Motivation Accurately predicting the quantitative impact of a substitution on a protein’s molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness. Results We introduce DeMaSk, an intuitive and interpretable method based only upon DMS datasets and sequence homologs that predicts the impact of missense mutations within any protein. DeMaSk first infers a directional amino acid substitution matrix from DMS datasets and then fits a linear model that combines these substitution scores with measures of per-position evolutionary conservation and variant frequency across homologs. Despite its simplicity, DeMaSk has state-of-the-art performance in predicting the impact of amino acid substitutions, and can easily and rapidly be applied to any protein sequence. Availability and implementation https://demask.princeton.edu generates fitness impact predictions and visualizations for any user-submitted protein sequence. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel Munro
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA
| | - Mona Singh
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA.,Department of Computer Science, Princeton University, Princeton, 08544, USA
| |
Collapse
|
11
|
Song H, Bremer BJ, Hinds EC, Raskutti G, Romero PA. Inferring Protein Sequence-Function Relationships with Large-Scale Positive-Unlabeled Learning. Cell Syst 2020; 12:92-101.e8. [PMID: 33212013 DOI: 10.1016/j.cels.2020.10.007] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2019] [Revised: 08/13/2020] [Accepted: 10/22/2020] [Indexed: 10/22/2022]
Abstract
Machine learning can infer how protein sequence maps to function without requiring a detailed understanding of the underlying physical or biological mechanisms. It is challenging to apply existing supervised learning frameworks to large-scale experimental data generated by deep mutational scanning (DMS) and related methods. DMS data often contain high-dimensional and correlated sequence variables, experimental sampling error and bias, and the presence of missing data. Notably, most DMS data do not contain examples of negative sequences, making it challenging to directly estimate how sequence affects function. Here, we develop a positive-unlabeled (PU) learning framework to infer sequence-function relationships from large-scale DMS data. Our PU learning method displays excellent predictive performance across ten large-scale sequence-function datasets, representing proteins of different folds, functions, and library types. The estimated parameters pinpoint key residues that dictate protein structure and function. Finally, we apply our statistical sequence-function model to design highly stabilized enzymes.
Collapse
Affiliation(s)
- Hyebin Song
- Department of Statistics, The Pennsylvania State University, State College, PA 16802, USA; Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Bennett J Bremer
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Emily C Hinds
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Garvesh Raskutti
- Department of Statistics, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Philip A Romero
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
12
|
Faber MS, Wrenbeck EE, Azouz LR, Steiner PJ, Whitehead TA. Impact of In Vivo Protein Folding Probability on Local Fitness Landscapes. Mol Biol Evol 2020; 36:2764-2777. [PMID: 31400199 DOI: 10.1093/molbev/msz184] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
It is incompletely understood how biophysical properties like protein stability impact molecular evolution and epistasis. Epistasis is defined as specific when a mutation exclusively influences the phenotypic effect of another mutation, often at physically interacting residues. In contrast, nonspecific epistasis results when a mutation is influenced by a large number of nonlocal mutations. As most mutations are pleiotropic, the in vivo folding probability-governed by basal protein stability-is thought to determine activity-enhancing mutational tolerance, implying that nonspecific epistasis is dominant. However, evidence exists for both specific and nonspecific epistasis as the prevalent factor, with limited comprehensive data sets to support either claim. Here, we use deep mutational scanning to probe how in vivo enzyme folding probability impacts local fitness landscapes. We computationally designed two different variants of the amidase AmiE with statistically indistinguishable catalytic efficiencies but lower probabilities of folding in vivo compared with wild-type. Local fitness landscapes show slight alterations among variants, with essentially the same global distribution of fitness effects. However, specific epistasis was predominant for the subset of mutations exhibiting positive sign epistasis. These mutations mapped to spatially distinct locations on AmiE near the initial mutation or proximal to the active site. Intriguingly, the majority of specific epistatic mutations were codon dependent, with different synonymous codons resulting in fitness sign reversals. Together, these results offer a nuanced view of how protein folding probability impacts local fitness landscapes and suggest that transcriptional-translational effects are as important as stability in determining evolutionary outcomes.
Collapse
Affiliation(s)
- Matthew S Faber
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI
| | - Emily E Wrenbeck
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI
| | - Laura R Azouz
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO
| | - Timothy A Whitehead
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI.,Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO.,E.E.W. Ginkgo Bioworks, L.R.A. McKetta Department of Chemical Engineering, University of Texas at Austin, Austin, TX
| |
Collapse
|
13
|
Medina-Cucurella AV, Steiner PJ, Faber MS, Beltrán J, Borelli AN, Kirby MB, Cutler SR, Whitehead TA. User-defined single pot mutagenesis using unamplified oligo pools. Protein Eng Des Sel 2020; 32:41-45. [PMID: 31297523 DOI: 10.1093/protein/gzz013] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2019] [Revised: 05/17/2019] [Accepted: 06/11/2019] [Indexed: 11/13/2022] Open
Abstract
User-defined mutagenic libraries are fundamental for applied protein engineering workflows. Here we show that unamplified oligo pools can be used to prepare site saturation mutagenesis libraries from plasmid DNA with near-complete coverage of desired mutations and few off-target mutations. We find that oligo pools yield higher quality libraries when compared to individually synthesized degenerate oligos. We also show that multiple libraries can be multiplexed into a single oligo pool, making preparation of multiple libraries less expensive and more convenient. We provide software for automatic oligo pool design that can generate mutagenic oligos for saturating or focused libraries.
Collapse
Affiliation(s)
| | - Paul J Steiner
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| | - Matthew S Faber
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, MI, USA
| | - Jesús Beltrán
- Department of Plant Cell Biology, University of California, Riverside, CA, USA
| | - Alexandra N Borelli
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| | - Monica B Kirby
- Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| | - Sean R Cutler
- Department of Plant Cell Biology, University of California, Riverside, CA, USA
| | - Timothy A Whitehead
- Department of Chemical Engineering and Materials Science, Michigan State University, East Lansing, MI, USA.,Department of Chemical and Biological Engineering, University of Colorado, Boulder, CO, USA
| |
Collapse
|
14
|
Faber MS, Van Leuven JT, Ederer MM, Sapozhnikov Y, Wilson ZL, Wichman HA, Whitehead TA, Miller CR. Saturation Mutagenesis Genome Engineering of Infective ΦX174 Bacteriophage via Unamplified Oligo Pools and Golden Gate Assembly. ACS Synth Biol 2020; 9:125-131. [PMID: 31825605 PMCID: PMC7055157 DOI: 10.1021/acssynbio.9b00411] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Here we present a novel protocol for the construction of saturation single-site-and massive multisite-mutant libraries of a bacteriophage. We segmented the ΦX174 genome into 14 nontoxic and nonreplicative fragments compatible with Golden Gate assembly. We next used nicking mutagenesis with oligonucleotides prepared from unamplified oligo pools with individual segments as templates to prepare near-comprehensive single-site mutagenesis libraries of genes encoding the F capsid protein (421 amino acids scanned) and G spike protein (172 amino acids scanned). Libraries possessed greater than 99% of all 11 860 programmed mutations. Golden Gate cloning was then used to assemble the complete ΦX174 mutant genome and generate libraries of infective viruses. This protocol will enable reverse genetics experiments for studying viral evolution and, with some modifications, can be applied for engineering therapeutically relevant bacteriophages with larger genomes.
Collapse
Affiliation(s)
- Matthew S. Faber
- Department of Biochemistry and Molecular Biology, Michigan State University, East Lansing, Michigan 48824, United States
| | - James T. Van Leuven
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho 83844, United States
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Martina M. Ederer
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Yesol Sapozhnikov
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Zoë L. Wilson
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Holly A. Wichman
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho 83844, United States
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| | - Timothy A. Whitehead
- Department of Chemical & Biological Engineering, University of Colorado, Boulder, Colorado 80303, United States
- Department of Chemical Engineering & Materials Science, Michigan State University, East Lansing, Michigan 48824, United States
| | - Craig R. Miller
- Institute for Modeling Collaboration and Innovation, University of Idaho, Moscow, Idaho 83844, United States
- Department of Biological Sciences, University of Idaho, Moscow, Idaho 83844, United States
| |
Collapse
|
15
|
Klesmith JR, Wu L, Lobb RR, Rennert PD, Hackel BJ. Fine Epitope Mapping of the CD19 Extracellular Domain Promotes Design. Biochemistry 2019; 58:4869-4881. [PMID: 31702909 DOI: 10.1021/acs.biochem.9b00808] [Citation(s) in RCA: 24] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The B-cell surface protein CD19 is present throughout the cell life cycle and is uniformly expressed in leukemias, making it a target for chimeric antigen receptor engineered immune cell therapy. Identifying the sequence dependence of the binding of CD19 to antibodies empowers fundamental study and more tailored development of CD19-targeted therapeutics. To identify the antibody-binding epitopes on CD19, we screened a comprehensive single-site saturation mutation library of the human CD19 extracellular domain to identify mutations detrimental to binding FMC63-the dominant CD19 antibody used in chimeric antigen receptor development-as well as 4G7-2E3 and 3B10, which have been used in various types of CD19 research and development. All three antibodies had partially overlapping, yet distinct, epitopes near the published epitope of antibody B43. The FMC63 conformational epitope spans spatially adjacent, but genetically distant, loops in exons 3 and 4. The 3B10 epitope is a linear peptide sequence that binds CD19 with 440 pM affinity. Along with their primary goal of epitope mapping, the mutational tolerance data also empowered additional CD19 variant design and analysis. A designed CD19 variant with all N-linked glycosylation sites removed successfully bound antibody in the yeast display context, which provides a lead for aglycosylated applications. Screening for thermally stable variants identified mutations to guide further CD19 stabilization for fusion protein applications and revealed evolutionary affinity-stability trade-offs. These fundamental insights into CD19 sequence-function relationships enhance our understanding of antibody-mediated CD19-targeted therapeutics.
Collapse
Affiliation(s)
- Justin R Klesmith
- Department of Chemical Engineering and Materials Science , University of Minnesota-Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| | - Lan Wu
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Roy R Lobb
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Paul D Rennert
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Benjamin J Hackel
- Department of Chemical Engineering and Materials Science , University of Minnesota-Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| |
Collapse
|
16
|
Klesmith JR, Su L, Wu L, Schrack IA, Dufort FJ, Birt A, Ambrose C, Hackel BJ, Lobb RR, Rennert PD. Retargeting CD19 Chimeric Antigen Receptor T Cells via Engineered CD19-Fusion Proteins. Mol Pharm 2019; 16:3544-3558. [PMID: 31242389 DOI: 10.1021/acs.molpharmaceut.9b00418] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/08/2023]
Abstract
CD19-targeted chimeric antigen receptor (CAR) T-cells (CAR19s) show remarkable efficacy in the treatment of relapsed/refractory acute lymphocytic leukemia and Non-Hodgkin's lymphoma. However, the use of CAR T-cell therapy against CD19-negative hematological cancers and solid tumors has been challenging. We propose CD19-fusion proteins (CD19-FPs) to leverage the benefits of CAR19s while retargeting this validated cellular therapy to alternative tumor antigens. We demonstrate the ability of a fusion of CD19 extracellular domain (ECD) and a human epidermal growth factor receptor 2 (HER2) single-chain antibody fragment to retarget CAR19s to kill HER2+ CD19- tumor cells. To enhance the modularity of this technology, we engineered a more robust CD19 ECD via deep mutational scanning with yeast display and flow cytometric selections for improved protease resistance and anti-CD19 antibody binding. These enhanced CD19 ECDs significantly increase, and in some cases recover, fusion protein expression while maintaining target antigen affinity. Importantly, CD19-FPs retarget CAR19s to kill tumor cells expressing multiple distinct antigens, including HER2, CD20, EGFR, BCMA, and Clec12A as N- or C-terminal fusions and linked to both antibody fragments and fibronectin ligands. This study provides fundamental insights into CD19 sequence-function relationships and defines a flexible and modular platform to retarget CAR19s to any tumor antigen.
Collapse
Affiliation(s)
- Justin R Klesmith
- Department of Chemical Engineering and Materials Science , University of Minnesota Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| | - Lihe Su
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Lan Wu
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Ian A Schrack
- Department of Chemical Engineering and Materials Science , University of Minnesota Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| | - Fay J Dufort
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Alyssa Birt
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Christine Ambrose
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Benjamin J Hackel
- Department of Chemical Engineering and Materials Science , University of Minnesota Twin Cities , 421 Washington Avenue SE , Minneapolis , Minnesota 55455 , United States
| | - Roy R Lobb
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| | - Paul D Rennert
- Aleta Biotherapeutics , 27 Strathmore Road , Natick , Massachusetts 01760 , United States
| |
Collapse
|