1
|
Zheng Y, Wu L, Zhang Q, Hu L, Tian Y, Wang M, Zheng H, Zhang Z. A constant pH molecular dynamics and experimental study on the effect of different pH on the structure of urease from Sporosarcina pasteurii. J Mol Model 2025; 31:164. [PMID: 40387959 DOI: 10.1007/s00894-025-06369-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2025] [Accepted: 04/07/2025] [Indexed: 05/20/2025]
Abstract
CONTEXT Urease is pivotal in microbial-induced calcium carbonate precipitation (MICP), where its catalytic efficiency directly governs calcium carbonate formation. However, practical MICP applications in extreme environments (e.g., acidic mine drainage, industrial waste sites) are hindered by limited understanding of urease behavior under extreme pH conditions. This study combines laboratory experiments and constant pH molecular dynamics (CpHMD) simulations to investigate how pH variations (3-11) affect the structural stability of Sporosarcina pasteurii urease, focusing on its α-subunit (PDB: 4CEU). Experimental validation identified pH 7-8 as optimal for urease activity, aligning with molecular dynamics results showing minimal structural deviations (RMSD) and stable protonation states under neutral to mildly alkaline conditions. Extreme pH (3, 4, 11) disrupted active-site geometry and induced charge fluctuations, impairing catalytic function. CpHMD simulations revealed that the α-subunit retains structural integrity at pH 7-8, suggesting potential reassembly post-environmental stress. This work bridges gaps in enzymatic stability under harsh conditions, offering insights for optimizing MICP in geotechnical and environmental remediation applications. METHODS The study combined experimental and computational approaches. Sporosarcina pasteurii urease activity was experimentally assessed across pH 3-11 by monitoring urea hydrolysis-induced conductivity changes. Computational analyses employed GROMACS constant pH with the CHARMM36 force field to perform pH-dependent molecular dynamics simulations. The urease structure was solvated, neutralized, energy-minimized, and subjected to constant pH simulations. Structural stability, active site dynamics, and protonation states of titratable residues were analyzed via RMSD, hydrogen bonds, solvent-accessible surface area (SASA), and Epock 1.0.5. Free energy landscapes and residue interactions were evaluated using principal component analysis (PCA) and λ-dynamics. Experimental data were processed with OriginPro 2024b and Python, linking pH-induced conformational shifts to enzymatic function.
Collapse
Affiliation(s)
- Yifei Zheng
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Lingling Wu
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Qiucai Zhang
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Lin Hu
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Yakun Tian
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Min Wang
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China
| | - Huaimiao Zheng
- School of Economics, Management and Law, University of South China, Hengyang, 421001, China.
| | - Zhijun Zhang
- School of Resources, Environment and Safety Engineering, University of South China, Zhengxiang, Hengyang, 421001, China.
- Hunan Provincial Mining Geotechnical Engineering Disaster Prediction and Control Engineering Technology Research Center, Hengyang, 421001, China.
| |
Collapse
|
2
|
Yusoff MHM, Salleh MSH, Shafie MH. Isolation and characterization of antioxidant and anti-tyrosinase activities of Cosmos caudatus leaf polysaccharides using microwave-assisted extraction. Int J Biol Macromol 2025; 311:144154. [PMID: 40368199 DOI: 10.1016/j.ijbiomac.2025.144154] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2024] [Revised: 05/08/2025] [Accepted: 05/10/2025] [Indexed: 05/16/2025]
Abstract
This study investigates the extraction of Cosmos caudatus leaf polysaccharides (CCLP) using citric acid monohydrate (CAM) as the extraction medium. Moreover, the extraction was assisted with microwave-assisted extraction due to its advantages as high extraction efficiency with low energy consumption and short extraction period. Optimal conditions yielded a maximum of 36.06 % which achieved by Box-Behnken design analysis. Characterization studies revealed that CCLP has β-configuration with branching properties, indicated by the presence of methyl, acetyl, and sugar ring structures. CCLP exhibit low methoxyl and considered as glucose-rich polysaccharides due to glucose as its major monosaccharide compositions. Additionally, CCLP demonstrated good gelling properties, moderate viscosity and high-water solubility which further supported by high water and oil-holding capacities, enhancing its formulation potential. Bioactivity evaluation revealed significant antioxidant properties with IC50 values of 2.15 mg/mL and 8.00 mg/mL for DPPH and ABTS radicals, respectively. CCLP also exhibited potent tyrosinase inhibition, with IC50 values of 1.69 mg/mL for monophenolase and 1.31 mg/mL for diphenolase. Furthermore, its photoprotective potential, reflected by a sun protection factor (SPF) of 25.33 %, highlights its potential utility in skincare applications. These findings suggest that CCLP, with its unique structural features and strong bioactivities is a promising bioactive ingredient for managing hyperpigmentation.
Collapse
Affiliation(s)
- Muhammad Hasnun Md Yusoff
- Analytical Biochemistry Research Centre (ABrC), University Innovation Incubator (I(2)U), sains@usm Campus, Universiti Sains Malaysia, Lebuh Bukit Jambul, 11900 Bayan Lepas, Penang, Malaysia
| | - Muhammad Syahmi Hairul Salleh
- Analytical Biochemistry Research Centre (ABrC), University Innovation Incubator (I(2)U), sains@usm Campus, Universiti Sains Malaysia, Lebuh Bukit Jambul, 11900 Bayan Lepas, Penang, Malaysia
| | - Muhammad Hakimin Shafie
- Analytical Biochemistry Research Centre (ABrC), University Innovation Incubator (I(2)U), sains@usm Campus, Universiti Sains Malaysia, Lebuh Bukit Jambul, 11900 Bayan Lepas, Penang, Malaysia.
| |
Collapse
|
3
|
Lee B, Sunden F, Miller M, Pak B, Krebber A, Lutz S, Fordyce PM. Hydrophilic/Omniphobic Droplet Arrays for High-throughput and Quantitative Enzymology. Anal Chem 2025; 97:8957-8967. [PMID: 40238746 PMCID: PMC12044592 DOI: 10.1021/acs.analchem.5c00333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Revised: 03/21/2025] [Accepted: 03/24/2025] [Indexed: 04/18/2025]
Abstract
Engineered enzymes with enhanced or novel functions are specific catalysts with wide-ranging applications in industry and medicine. Here, we introduce droplet array microfluidic enzyme kinetics (DA-MEK), a high-throughput enzyme screening platform that combines water-in-air droplet microarrays formed on patterned superhydrophilic/omniphobic surfaces with cell-free protein synthesis to enable cost-effective expression and quantitative kinetic characterization of enzyme variants. By printing DNA templates encoding enzyme variants onto hydrophilic spots, stamping slides to add cell-free expression mix, and imaging the resulting arrays, we demonstrate reproducible expression of enzyme variants across hundreds of microwells per slide, with line of sight toward replicating this across larger libraries. By specifically patterning slides with antibodies, we further demonstrate parallel immobilization, purification, and iterative characterization of the expressed variants. Subsequent stamping of fluorogenic substrates and time-lapse imaging allows determination of Michaelis-Menten parameters for each variant, with measured catalytic efficiencies spanning 5 orders of magnitude and agreeing well with values obtained via traditional microtiter plate assays. DA-MEK consumes orders of magnitude less reagents than plate-based assays, while providing accurate and detailed kinetic information for both beneficial and deleterious mutations. In future work, we anticipate that DA-MEK will provide a powerful and versatile platform to accelerate enzyme engineering and enable screening of large variant libraries under diverse conditions.
Collapse
Affiliation(s)
- Byungjin Lee
- Department
of Genetics, Stanford University, Stanford, California 94305, United States
| | - Fanny Sunden
- Department
of Biochemistry, Stanford University, Stanford, California 94305, United States
| | - Michael Miller
- Codexis,
Inc., Redwood
City, California 94063, United States
| | - Bumshik Pak
- Codexis,
Inc., Redwood
City, California 94063, United States
| | - Anke Krebber
- Codexis,
Inc., Redwood
City, California 94063, United States
| | - Stefan Lutz
- Codexis,
Inc., Redwood
City, California 94063, United States
| | - Polly Morrell Fordyce
- Department
of Genetics, Stanford University, Stanford, California 94305, United States
- Department
of Biochemistry, Stanford University, Stanford, California 94305, United States
- Sarafan
ChEM-H, Stanford, California 94305, United States
- Chan
Zuckerberg Biohub San Francisco, San Francisco, California 94110, United States
| |
Collapse
|
4
|
Campbell ME, Ohler AR, McGill MJ, Buller AR. Promiscuity Guided Evolution of Decarboxylative Aldolases for Synthesis of Tertiary γ-Hydroxy Amino Acids. Angew Chem Int Ed Engl 2025; 64:e202422109. [PMID: 39874202 PMCID: PMC11976203 DOI: 10.1002/anie.202422109] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2024] [Revised: 01/27/2025] [Accepted: 01/28/2025] [Indexed: 01/30/2025]
Abstract
Many applications of enzymes benefit from activity on structurally diverse substrates. Here, we sought to engineer the decarboxylative aldolase UstD to perform a challenging C-C bond forming reaction with ketone electrophiles. The parent enzyme had only low levels of activity, portending multiple rounds of directed evolution and a possibility that mutations may inadvertently increase the specificity of the enzyme for a single model screening substrate. We show how to intentionally guide UstD towards generality through multi-generational directed evolution using substrate-multiplexed screening (SUMS). Mutations outside of the active site that impact catalytic function were immediately revealed by shifts in promiscuity, even when the overall activity was lower. By re-targeting these distal residues that couple to the active site with saturation mutagenesis, broadly activating mutations were readily identified. When analyzing active site mutants, SUMS identified both specialist enzymes that would have more limited utility as well as generalist enzymes with complementary activity on diverse substrates. These new UstD enzymes catalyze convergent synthesis of non-canonical amino acids bearing tertiary alcohol side chains. This methodology is easy to implement and enables the rapid and effective evolution of enzymes to catalyze desirable new functions.
Collapse
Affiliation(s)
- Meghan E. Campbell
- Department of ChemistryUniversity of Wisconsin-MadisonUSAMadisonWI 53706
| | - Amanda R. Ohler
- Department of ChemistryUniversity of Wisconsin-MadisonUSAMadisonWI 53706
| | - Matthew J. McGill
- Department of ChemistryUniversity of Wisconsin-MadisonUSAMadisonWI 53706
| | - Andrew R. Buller
- Department of ChemistryUniversity of Wisconsin-MadisonUSAMadisonWI 53706
| |
Collapse
|
5
|
Borkakoti N, Ribeiro AJM, Thornton JM. A structural perspective on enzymes and their catalytic mechanisms. Curr Opin Struct Biol 2025; 92:103040. [PMID: 40158299 DOI: 10.1016/j.sbi.2025.103040] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2025] [Revised: 02/14/2025] [Accepted: 03/03/2025] [Indexed: 04/02/2025]
Abstract
In this perspective, we analyse the progress made in our knowledge of enzyme sequences, structures and functions in the last 2 years. We review how much new enzyme data have been garnered and annotated, derived from the study of proteins using structural and computational approaches. Recent advances towards capturing 'Catalysis in silico' are described, including knowledge and predictions of enzyme structures, their interactions and mechanisms. We highlight the flood of enzyme data, driven by metagenomic sequencing, the improved enzyme data resources, the high coverage in Protein Data Bank of E.C. classes and the AI-driven structure prediction techniques that facilitate the accurate prediction of protein structures. We note the focus on disordered regions in the context of enzyme regulation and specificity and comment on emerging bioinformatic approaches that capture reaction mechanisms computationally for comparing and predicting enzyme mechanisms. We also consider the drivers of progress in this field in the next five years.
Collapse
Affiliation(s)
- Neera Borkakoti
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK.
| | - António J M Ribeiro
- LAQV-REQUIMTE, Departamento de Química e Bioquímica, Faculdade de Ciências da, Universidade do Porto, 4169-007, Porto, Portugal
| | - Janet M Thornton
- European Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Genome Campus, Hinxton, CB10 1SD, UK
| |
Collapse
|
6
|
Thomas N, Belanger D, Xu C, Lee H, Hirano K, Iwai K, Polic V, Nyberg KD, Hoff KG, Frenz L, Emrich CA, Kim JW, Chavarha M, Ramanan A, Agresti JJ, Colwell LJ. Engineering highly active nuclease enzymes with machine learning and high-throughput screening. Cell Syst 2025; 16:101236. [PMID: 40081373 DOI: 10.1016/j.cels.2025.101236] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2024] [Revised: 09/17/2024] [Accepted: 02/19/2025] [Indexed: 03/16/2025]
Abstract
Optimizing enzymes to function in novel chemical environments is a central goal of synthetic biology, but optimization is often hindered by a rugged fitness landscape and costly experiments. In this work, we present TeleProt, a machine learning (ML) framework that blends evolutionary and experimental data to design diverse protein libraries, and employ it to improve the catalytic activity of a nuclease enzyme that degrades biofilms that accumulate on chronic wounds. After multiple rounds of high-throughput experiments, TeleProt found a significantly better top-performing enzyme than directed evolution (DE), had a better hit rate at finding diverse, high-activity variants, and was even able to design a high-performance initial library using no prior experimental data. We have released a dataset of 55,000 nuclease variants, one of the most extensive genotype-phenotype enzyme activity landscapes to date, to drive further progress in ML-guided design. A record of this paper's transparent peer review process is included in the supplemental information.
Collapse
Affiliation(s)
- Neil Thomas
- X, the Moonshot Factory, Mountain View, CA 94043, USA.
| | | | | | | | | | | | | | | | | | | | | | - Jun W Kim
- X, the Moonshot Factory, Mountain View, CA 94043, USA
| | | | - Abi Ramanan
- X, the Moonshot Factory, Mountain View, CA 94043, USA
| | | | - Lucy J Colwell
- Google DeepMind, Cambridge, MA 02142, USA; Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, UK.
| |
Collapse
|
7
|
Shepherd RA, Fihn CA, Tabag AJ, McKinnie SMK, Sanchez LM. 'Need for speed: high throughput' - mass spectrometry approaches for high-throughput directed evolution screening of natural product enzymes. Nat Prod Rep 2025. [PMID: 40013466 PMCID: PMC11866321 DOI: 10.1039/d4np00053f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Indexed: 02/28/2025]
Abstract
Covering: 2015 to 2024Mass spectrometry (MS)-based methods have been implemented extensively for enzyme engineering due to their label-free nature, making them suitable for screening a wide range of biochemical systems. Over the past decade, advancements in mass spectrometry, separation science, and the implementation of hyphenated methods have allowed for more streamlined analysis of large volumes of samples while maximizing the richness and dimensionality of the data collected. In this review we highlight recent advancements in mass spectrometry that have allowed for more efficient, robust, and rigorous enzyme engineering for various applications relating to natural products chemistry.
Collapse
Affiliation(s)
- Robert A Shepherd
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, USA, +1 831-459-4676.
| | - Conrad A Fihn
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, USA, +1 831-459-4676.
| | - Alex J Tabag
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, USA, +1 831-459-4676.
- Pharmaceutical Sciences, University of Kentucky, Lexington, KY 40506, USA
| | - Shaun M K McKinnie
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, USA, +1 831-459-4676.
| | - Laura M Sanchez
- Department of Chemistry and Biochemistry, University of California Santa Cruz, Santa Cruz, CA 95064, USA, +1 831-459-4676.
| |
Collapse
|
8
|
Anna Sajeevan K, Osinuga A, B A, Ferdous S, Shahreen N, Noor MS, Koneru S, Santos-Correa LM, Salehi R, Chowdhury NB, Calderon-Lopez B, Mali A, Saha R, Chowdhury R. Robust Prediction of Enzyme Variant Kinetics with RealKcat. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.10.637555. [PMID: 39990461 PMCID: PMC11844551 DOI: 10.1101/2025.02.10.637555] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/25/2025]
Abstract
Accurate prediction of kinetic parameters is crucial for understanding known and tailoring novel enzymes for biocatalysis. Current models fail to capture mutation effects on catalytically essential residues, limiting their utility in enzyme design. We grid-searched through ten model architectures (25,671 hyperparameter combinations) to identify a gradient-based additive framework called RealKcat, trained on 27,176 experimental entries curated manually (KinHub-27k) by screening 2,158 articles. Clustering catalytic turnover (k cat ) and substrate affinity (K M ) by rational orders of magnitude, RealKcat achieves >85% test accuracy, demonstrating highest sensitivity to mutation-induced variability thus far, and is the first-of-its-kind-model to demonstrate complete loss of activity upon deletion of the catalytic apparatus. Finally, state-of-the-art k cat validation accuracy (96%) on alkaline phosphatase (PafA) mutant industrial dataset confirms RealKcat's generalizability in learning per-residue catalytic relevance.
Collapse
Affiliation(s)
- Karuna Anna Sajeevan
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
- The Center for Biorenewable Chemicals, Iowa State University, Ames, Iowa, USA
| | - Abraham Osinuga
- Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Arunraj B
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | - Sakib Ferdous
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | - Nabia Shahreen
- Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Mohammed Sakib Noor
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | - Shashank Koneru
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | | | - Rahil Salehi
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | - Niaz Bahar Chowdhury
- Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Brisa Calderon-Lopez
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
| | - Ankur Mali
- Department of Computer Science and Engineering, University of South Florida, Tampa, Florida, USA
| | - Rajib Saha
- Department of Chemical and Biomolecular Engineering, University of Nebraska-Lincoln, Lincoln, Nebraska, USA
| | - Ratul Chowdhury
- Department of Chemical and Biological Engineering, Iowa State University, Ames, Iowa, USA
- The Center for Biorenewable Chemicals, Iowa State University, Ames, Iowa, USA
| |
Collapse
|
9
|
Prywes N, Phillips NR, Oltrogge LM, Lindner S, Taylor-Kearney LJ, Tsai YCC, de Pins B, Cowan AE, Chang HA, Wang RZ, Hall LN, Bellieny-Rabelo D, Nisonoff HM, Weissman RF, Flamholz AI, Ding D, Bhatt AY, Mueller-Cajar O, Shih PM, Milo R, Savage DF. A map of the rubisco biochemical landscape. Nature 2025; 638:823-828. [PMID: 39843747 PMCID: PMC11839469 DOI: 10.1038/s41586-024-08455-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 11/26/2024] [Indexed: 01/24/2025]
Abstract
Rubisco is the primary CO2-fixing enzyme of the biosphere1, yet it has slow kinetics2. The roles of evolution and chemical mechanism in constraining its biochemical function remain debated3,4. Engineering efforts aimed at adjusting the biochemical parameters of rubisco have largely failed5, although recent results indicate that the functional potential of rubisco has a wider scope than previously known6. Here we developed a massively parallel assay, using an engineered Escherichia coli7 in which enzyme activity is coupled to growth, to systematically map the sequence-function landscape of rubisco. Composite assay of more than 99% of single-amino acid mutants versus CO2 concentration enabled inference of enzyme velocity and apparent CO2 affinity parameters for thousands of substitutions. This approach identified many highly conserved positions that tolerate mutation and rare mutations that improve CO2 affinity. These data indicate that non-trivial biochemical changes are readily accessible and that the functional distance between rubiscos from diverse organisms can be traversed, laying the groundwork for further enzyme engineering efforts.
Collapse
Affiliation(s)
- Noam Prywes
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
| | - Naiya R Phillips
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Luke M Oltrogge
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | | | - Leah J Taylor-Kearney
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Yi-Chin Candace Tsai
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Benoit de Pins
- Department of Biology, University of Naples Federico II, Naples, Italy
| | - Aidan E Cowan
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, CA, USA
| | - Hana A Chang
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Renée Z Wang
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Laina N Hall
- Biophysics, University of California Berkeley, Berkeley, CA, USA
| | - Daniel Bellieny-Rabelo
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- California Institute for Quantitative Biosciences (QB3), University of California Berkeley, Berkeley, CA, USA
| | - Hunter M Nisonoff
- Center for Computational Biology, University of California Berkeley, Berkeley, CA, USA
| | - Rachel F Weissman
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Avi I Flamholz
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - David Ding
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
| | - Abhishek Y Bhatt
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Oliver Mueller-Cajar
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Patrick M Shih
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA, USA
| | - Ron Milo
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - David F Savage
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA.
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA.
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA.
| |
Collapse
|
10
|
Chen Q, Wang M, Gao L, Lou Q, Gan Y, Li X, Li Y, Xin T, Xu W, Song J. A pivotal switch in β-domain determines the substrate selectivity of terpene synthases involved in Gardenia jasminoides floral scent synthesis. Int J Biol Macromol 2025; 288:138333. [PMID: 39653212 DOI: 10.1016/j.ijbiomac.2024.138333] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 11/15/2024] [Accepted: 12/02/2024] [Indexed: 12/19/2024]
Abstract
Plants have evolved a diverse array of secondary metabolites to enhance their adaptability to environmental stresses, with volatile terpenoids being a notable example. Gardenia (Gardenia jasminoides), celebrated for its unique fragrance, is a key natural source of volatile terpenoids. Using our chromosome-level genome and transcriptome data of G. jasminoides, we previously identified six terpene synthases (TPSs) involved in the production of its floral scent. Here, we functionally characterized these six key TPS enzymes, aligning their product profiles with volatile organic compound (VOC) analysis. Notably, we identified two highly similar TPSs, GjTPS1 and GjTPS2, which share high sequence homology but differ in substrate selectivity. Through AI-based predictions and site-directed mutagenesis, we pinpointed a single amino acid in the β-domain that acts as a "switch," modulating substrate selectivity-an unusual finding, as this residue is outside the active site region. Comparative genomic analyses with Coffea canephora and other eudicots revealed that G. jasminoides TPS genes primarily expanded through dispersed duplications (DSD) and tandem duplications (TD). Our study is the first to reveal a "switch" in a previously deemed "non-functional" region, regulating substrate selectivity in TPS proteins. These insights could facilitate breeding elite Gardenia varieties and support the rational engineering of terpene synthases.
Collapse
Affiliation(s)
- Qizhen Chen
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Menglan Wang
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Longlong Gao
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Qian Lou
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Yutong Gan
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Xinyao Li
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Yanfei Li
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Tianyi Xin
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China
| | - Wenjie Xu
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China.
| | - Jingyuan Song
- State Key Laboratory of Bioactive Substance and Function of Natural Medicines, Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences & Peking Union Medical College, Beijing 100193, China; Yunnan Key Laboratory of Southern Medicinal Resources, Yunnan Branch Institute of Medicinal Plant Development, Chinese Academy of Medical Sciences, Jinghong 666100, China; Key Laboratory of Chinese Medicine Resources Conservation, State Administration of Traditional Chinese Medicine of the People's Republic of China, Engineering Research Center of Chinese Medicine Resource, Ministry of Education, Beijing 100193, China.
| |
Collapse
|
11
|
Hu L, Jiao L, Chen C, Jia X, Li X, Yan D, Zhai Y, Lu X. Nanozymes with Modulable Inhibition Transfer Pathways for Thiol and Cell Identification. Anal Chem 2025; 97:1767-1774. [PMID: 39806811 DOI: 10.1021/acs.analchem.4c05355] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2025]
Abstract
The elementary mechanism and site studies of nanozyme-based inhibition reactions are ambiguous and urgently require advanced nanozymes as mediators to elucidate the inhibition effect. To this end, we develop a class of nanozymes featuring single Cu-N catalytic configurations and B-O sites as binding configurations on a porous nitrogen-doped carbon substrate (B6/CuSA) for inducing modulable inhibition transfer at the atomic level. The full redistribution of electrons across the Cu-N sites, induced by B-O sites incorporation, yields B6/CuSA with enhanced peroxidase-like activity versus CuSA. More importantly, CuSA with single Cu-N sites features in cysteine binding and expresses a competitive inhibition through coordination bonds, with an inhibition constant of 0.048 mM. Benefiting from the modulable binding way in nanozymes, B6/CuSA possesses mixed binding approaches for cysteine through noncovalent bonds and delivers a record-mixed inhibition interaction with a competitive inhibition constant of 0.054 mM and a noncompetitive inhibition constant of 0.71 mM. Based on the modulable inhibition of B6/CuSA and CuSA, a multichannel sensor array accomplishes the detection of various cancer cells, normal cells, and thiols. The design principle of this work is endowed with guidelines for the preliminary inhibition mechanism evaluation of massive potential thiols, cell discrimination, and disease prediction.
Collapse
Affiliation(s)
- Lijun Hu
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Lei Jiao
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Chengjie Chen
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Xiangkun Jia
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Xiaotong Li
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Dongbo Yan
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Yanling Zhai
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| | - Xiaoquan Lu
- Institute of Molecular Metrology, College of Chemistry and Chemical Engineering, Qingdao University, Qingdao 266071, P. R. China
| |
Collapse
|
12
|
Jiang K, Yan Z, Di Bernardo M, Sgrizzi SR, Villiger L, Kayabolen A, Kim BJ, Carscadden JK, Hiraizumi M, Nishimasu H, Gootenberg JS, Abudayyeh OO. Rapid in silico directed evolution by a protein language model with EVOLVEpro. Science 2025; 387:eadr6006. [PMID: 39571002 DOI: 10.1126/science.adr6006] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2024] [Accepted: 11/12/2024] [Indexed: 01/25/2025]
Abstract
Directed protein evolution is central to biomedical applications but faces challenges such as experimental complexity, inefficient multiproperty optimization, and local maxima traps. Although in silico methods that use protein language models (PLMs) can provide modeled fitness landscape guidance, they struggle to generalize across diverse protein families and map to protein activity. We present EVOLVEpro, a few-shot active learning framework that combines PLMs and regression models to rapidly improve protein activity. EVOLVEpro surpasses current methods, yielding up to 100-fold improvements in desired properties. We demonstrate its effectiveness across six proteins in RNA production, genome editing, and antibody binding applications. These results highlight the advantages of few-shot active learning with minimal experimental data over zero-shot predictions. EVOLVEpro opens new possibilities for artificial intelligence-guided protein engineering in biology and medicine.
Collapse
Affiliation(s)
- Kaiyi Jiang
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
- Department of Bioengineering Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Zhaoqing Yan
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Matteo Di Bernardo
- Whitehead Institute Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Samantha R Sgrizzi
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Lukas Villiger
- Department of Dermatology and Allergology Kantonspital St. Gallen, St. Gallen, Switzerland
| | - Alisan Kayabolen
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - B J Kim
- Koch Institute for Integrative Cancer Research at MIT Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Josephine K Carscadden
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
| | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo, Japan
- Structural Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo, 4-6-1 Komaba, Meguro-ku, Tokyo, Japan
- Inamori Research Institute for Science, 620 Suiginya-cho, Shimogyo-ku, Kyoto, Japan
| | - Jonathan S Gootenberg
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| | - Omar O Abudayyeh
- Department of Medicine Division of Engineering in Medicine Brigham and Women's Hospital Harvard Medical School, Boston, MA, USA
- Gene and Cell Therapy Institute Mass General Brigham, Cambridge, MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School, Boston, MA, USA
| |
Collapse
|
13
|
Xu Z, Barghout RA, Wu J, Garg D, Song YS, Mahadevan R. CPI-Pred: A deep learning framework for predicting functional parameters of compound-protein interactions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.16.633372. [PMID: 39896624 PMCID: PMC11785036 DOI: 10.1101/2025.01.16.633372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
Recent advancements in deep learning have enabled functional annotation of genome sequences, facilitating the discovery of new enzymes and metabolites. However, accurately predicting compound-protein interactions (CPI) from sequences remains challenging due to the complexity of these interactions and the sparsity and heterogeneity of available data, which constrain the generalization of patterns across their solution space. In this work, we introduce CPI-Pred, a versatile deep learning model designed to predict compound-protein interaction function. CPI-Pred integrates compound representations derived from a novel message-passing neural network and enzyme representations generated by state-of-the-art protein language models, leveraging innovative sequence pooling and cross-attention mechanisms. To train and evaluate CPI-Pred, we compiled the largest dataset of enzyme kinetic parameters to date, encompassing four key metrics: the Michaelis-Menten constant (K M), enzyme turnover number (k cat), catalytic efficiency (k cat /K M), and inhibition constant (K I). These kinetic parameters are critical for elucidating enzyme function in metabolic contexts and understanding their regulation by compounds within biological networks. We demonstrate that CPI-Pred can predict diverse types of CPI using only the amino acid sequence of enzymes and structural representations of compounds, outperforming state-of-the-art models on unseen compounds and structurally dissimilar enzymes. Over workflow provides a valuable tool for tackling a range of metabolic engineering challenges, including the designing of novel enzyme sequences and compounds, such as enzyme inhibitors. Additionally, the datasets curated in this study offer a valuable resource for the scientific community, serving as a benchmark for machine learning models focused on enzyme activity and promiscuity prediction.
Collapse
Affiliation(s)
- Zhiqing Xu
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
| | - Rana Ahmed Barghout
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
| | - Jinghao Wu
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
| | - Dhruv Garg
- Computer Science and Engineering Department, Thapar Institute of Engineering and Technology, Patiala, India
| | - Yun S Song
- Computer Science Division and Department of Statistics, University of California, Berkeley, CA, US
| | - Radhakrishnan Mahadevan
- Department of Chemical Engineering and Applied Chemistry, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
14
|
Atsavapranee B, Sunden F, Herschlag D, Fordyce P. Quantifying protein unfolding kinetics with a high-throughput microfluidic platform. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.15.633299. [PMID: 39868203 PMCID: PMC11761748 DOI: 10.1101/2025.01.15.633299] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Even after folding, proteins transiently sample unfolded or partially unfolded intermediates, and these species are often at risk of irreversible alteration (e.g. via proteolysis, aggregation, or post-translational modification). Kinetic stability, in addition to thermodynamic stability, can directly impact protein lifetime, abundance, and the formation of alternative, sometimes disruptive states. However, we have very few measurements of protein unfolding rates or how mutations alter these rates, largely due to technical challenges associated with their measurement. To address this need, we developed SPARKfold (Simultaneous Proteolysis Assay Revealing Kinetics of Folding), a microfluidic platform to express, purify, and measure unfolding rate constants for >1000 protein variants in parallel via on-chip native proteolysis. To demonstrate the power and potential of SPARKfold, we determined unfolding rate constants for 1,104 protein samples in parallel. We built a library of 31 dihydrofolate reductase (DHFR) orthologs with up to 78 chamber replicates per variant to provide the statistical power required to evaluate the system's ability to resolve subtle effects. SPARKfold rate constants for 5 constructs agreed with those obtained using traditional techniques across a 150-fold range, validating the accuracy of the technique. Comparisons of mutant kinetic effects via SPARKfold with previously published measurements impacts on folding thermodynamics provided information about the folding transition state and pathways via φ analysis. Overall, SPARKfold enables rapid characterization of protein variants to dissect the nature of the unfolding transition state. In future work, SPARKfold can reveal mutations that drive misfolding and aggregation and enable rational design of kinetically hyperstable variants for industrial use in harsh environments.
Collapse
Affiliation(s)
- B. Atsavapranee
- Department of Bioengineering, Stanford University, Stanford, CA 94305
| | - F. Sunden
- Department of Biochemistry, Stanford University, Stanford, CA 94305
| | - D. Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA 94305
| | - P.M. Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA 94305
- Department of Genetics, Stanford University, Stanford, CA 94305
- Sarafan ChEM-H Institute, Stanford University, Stanford, CA 94305
- Chan Zuckerberg Biohub, Stanford University, Stanford, CA 94305
| |
Collapse
|
15
|
Pan T, Bi Y, Wang X, Zhang Y, Webb GI, Gasser RB, Kurgan L, Song J. SCREEN: A Graph-based Contrastive Learning Tool to Infer Catalytic Residues and Assess Enzyme Mutations. GENOMICS, PROTEOMICS & BIOINFORMATICS 2025; 22:qzae094. [PMID: 39724324 PMCID: PMC11961199 DOI: 10.1093/gpbjnl/qzae094] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2024] [Revised: 12/05/2024] [Accepted: 12/06/2024] [Indexed: 12/28/2024]
Abstract
The accurate identification of catalytic residues contributes to our understanding of enzyme functions in biological processes and pathways. The increasing number of protein sequences necessitates computational tools for the automated prediction of catalytic residues in enzymes. Here, we introduce SCREEN, a graph neural network for the high-throughput prediction of catalytic residues via the integration of enzyme functional and structural information. SCREEN constructs residue representations based on spatial arrangements and incorporates enzyme function priors into such representations through contrastive learning. We demonstrate that SCREEN (1) consistently outperforms currently-available predictors; (2) provides accurate results when applied to inferred enzyme structures; and (3) generalizes well to enzymes dissimilar from those in the training set. We also show that the putative catalytic residues predicted by SCREEN mimic key structural and biophysical characteristics of native catalytic residues. Moreover, using experimental datasets, we show that SCREEN's predictions can be used to distinguish residues with a high mutation tolerance from those likely to cause functional loss when mutated, indicating that this tool might be used to infer disease-associated mutations. SCREEN is publicly available at https://github.com/BioColLab/SCREEN and https://ngdc.cncb.ac.cn/biocode/tool/7580.
Collapse
Affiliation(s)
- Tong Pan
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- Monash Biomedicine Discovery Institute-Wenzhou Medical University Alliance in Clinical and Experimental Biomedicine, Monash University, Clayton, VIC 3800, Australia
| | - Yue Bi
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- Monash Biomedicine Discovery Institute-Wenzhou Medical University Alliance in Clinical and Experimental Biomedicine, Monash University, Clayton, VIC 3800, Australia
| | - Xiaoyu Wang
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- Monash Biomedicine Discovery Institute-Wenzhou Medical University Alliance in Clinical and Experimental Biomedicine, Monash University, Clayton, VIC 3800, Australia
| | - Ying Zhang
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing 210094, China
| | - Geoffrey I Webb
- Department of Data Science and Artificial Intelligence, Monash University, Clayton, VIC 3800, Australia
| | - Robin B Gasser
- Department of Veterinary Biosciences, Melbourne Veterinary School, The University of Melbourne, Parkville, VIC 3010, Australia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Clayton, VIC 3800, Australia
- Monash Biomedicine Discovery Institute-Wenzhou Medical University Alliance in Clinical and Experimental Biomedicine, Monash University, Clayton, VIC 3800, Australia
- Key Laboratory of Clinical Laboratory Diagnosis and Translational Research of Zhejiang Province, Department of Clinical Laboratory, The First Affiliated Hospital of Wenzhou Medical University, Wenzhou 325015, China
| |
Collapse
|
16
|
Hastings R, Aditham AK, DelRosso N, Suzuki PH, Fordyce PM. Mutations to transcription factor MAX allosterically increase DNA selectivity by altering folding and binding pathways. Nat Commun 2025; 16:636. [PMID: 39805837 PMCID: PMC11729911 DOI: 10.1038/s41467-024-55672-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2024] [Accepted: 12/19/2024] [Indexed: 01/16/2025] Open
Abstract
Understanding how proteins discriminate between preferred and non-preferred ligands ('selectivity') is essential for predicting biological function and a central goal of protein engineering efforts, yet the biophysical mechanisms underpinning selectivity remain poorly understood. Towards this end, we study how variants of the promiscuous transcription factor (TF) MAX (H. sapiens) alter DNA specificity and selectivity, yielding >1700 Kds and >500 rate constants in complex with multiple DNA sequences. Twenty-two of the 240 assayed MAX point mutations enhance selectivity, yet none of these mutations occur at residues that contact nucleotides in published structures. By applying thermodynamic and kinetic models to these results and previous observations for the highly similar yet far more selective TF Pho4 (S. cerevisiae), we find that these mutations enhance selectivity by altering partitioning between or affinity within conformations with different intrinsic selectivity, providing a mechanistic basis for allosteric modulation of ligand selectivity. These results highlight the importance of conformational heterogeneity in determining sequence selectivity and can guide future efforts to engineer selective proteins.
Collapse
Affiliation(s)
- Renee Hastings
- Biophysics Program, Stanford University, Stanford, CA, USA
| | - Arjun K Aditham
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA
| | | | - Peter H Suzuki
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Polly M Fordyce
- Biophysics Program, Stanford University, Stanford, CA, USA.
- Department of Bioengineering, Stanford University, Stanford, CA, USA.
- Sarafan ChEM-H, Stanford University, Stanford, CA, USA.
- Department of Genetics, Stanford University, Stanford, CA, USA.
- Chan Zuckerberg Biohub, San Francisco, CA, USA.
| |
Collapse
|
17
|
Hunt A, Rasor BJ, Seki K, Ekas HM, Warfel KF, Karim AS, Jewett MC. Cell-Free Gene Expression: Methods and Applications. Chem Rev 2025; 125:91-149. [PMID: 39700225 PMCID: PMC11719329 DOI: 10.1021/acs.chemrev.4c00116] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Revised: 07/29/2024] [Accepted: 10/21/2024] [Indexed: 12/21/2024]
Abstract
Cell-free gene expression (CFE) systems empower synthetic biologists to build biological molecules and processes outside of living intact cells. The foundational principle is that precise, complex biomolecular transformations can be conducted in purified enzyme or crude cell lysate systems. This concept circumvents mechanisms that have evolved to facilitate species survival, bypasses limitations on molecular transport across the cell wall, and provides a significant departure from traditional, cell-based processes that rely on microscopic cellular "reactors." In addition, cell-free systems are inherently distributable through freeze-drying, which allows simple distribution before rehydration at the point-of-use. Furthermore, as cell-free systems are nonliving, they provide built-in safeguards for biocontainment without the constraints attendant on genetically modified organisms. These features have led to a significant increase in the development and use of CFE systems over the past two decades. Here, we discuss recent advances in CFE systems and highlight how they are transforming efforts to build cells, control genetic networks, and manufacture biobased products.
Collapse
Affiliation(s)
- Andrew
C. Hunt
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Blake J. Rasor
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Kosuke Seki
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Holly M. Ekas
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Katherine F. Warfel
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Ashty S. Karim
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
| | - Michael C. Jewett
- Department
of Chemical and Biological Engineering, Northwestern University, Evanston, Illinois 60208, United States
- Center
for Synthetic Biology, Northwestern University, Evanston, Illinois 60208, United States
- Chemistry
of Life Processes Institute, Northwestern
University, Evanston, Illinois 60208, United States
- Robert
H. Lurie Comprehensive Cancer Center, Northwestern
University, Chicago, Illinois 60611, United States
- Department
of Bioengineering, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
18
|
Maciá Valero A, Prins RC, de Vroet T, Billerbeck S. Combining Oligo Pools and Golden Gate Cloning to Create Protein Variant Libraries or Guide RNA Libraries for CRISPR Applications. Methods Mol Biol 2025; 2850:265-295. [PMID: 39363077 DOI: 10.1007/978-1-0716-4220-7_15] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/05/2024]
Abstract
Oligo pools are array-synthesized, user-defined mixtures of single-stranded oligonucleotides that can be used as a source of synthetic DNA for library cloning. While currently offering the most affordable source of synthetic DNA, oligo pools also come with limitations such as a maximum synthesis length (approximately 350 bases), a higher error rate compared to alternative synthesis methods, and the presence of truncated molecules in the pool due to incomplete synthesis. Here, we provide users with a comprehensive protocol that details how oligo pools can be used in combination with Golden Gate cloning to create user-defined protein mutant libraries, as well as single-guide RNA libraries for CRISPR applications. Our methods are optimized to work within the Yeast Toolkit Golden Gate scheme, but are in principle compatible with any other Golden Gate-based modular cloning toolkit and extendable to other restriction enzyme-based cloning methods beyond Golden Gate. Our methods yield high-quality, affordable, in-house variant libraries.
Collapse
Affiliation(s)
- Alicia Maciá Valero
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| | - Rianne C Prins
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| | - Thijs de Vroet
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands
| | - Sonja Billerbeck
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology Institute, University of Groningen, Groningen, The Netherlands.
| |
Collapse
|
19
|
Arutyunyan A, Seuma M, Faure AJ, Bolognesi B, Lehner B. Massively parallel genetic perturbation reveals the energetic architecture of an amyloid beta nucleation reaction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.24.604935. [PMID: 39091732 PMCID: PMC11291115 DOI: 10.1101/2024.07.24.604935] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 08/04/2024]
Abstract
Amyloid protein aggregates are pathological hallmarks of more than fifty human diseases but how soluble proteins nucleate to form amyloids is poorly understood. Here we use combinatorial mutagenesis, a kinetic selection assay, and machine learning to massively perturb the energetics of the nucleation reaction of amyloid beta (Aβ42), the protein that aggregates in Alzheimer's disease. In total, we quantify the nucleation rates of >140,000 variants of Aβ42. This allows us to accurately quantify the changes in reaction activation energy for all possible amino acid substitutions in a protein for the first time and, in addition, to quantify >600 energetic interactions between mutations. The data reveal the simple and interpretable genetic architecture of an amyloid nucleation reaction. Strikingly, strong energetic couplings are rare and identify a subset of structural contacts in mature fibrils. Together with the activation energy changes, this strongly suggests that the Aβ42 nucleation reaction transition state is structured in a short C-terminal region, providing a structural model for the reaction that may initiate Alzheimer's disease. We believe this approach can be widely applied to probe the energetics and transition state structures of protein reactions.
Collapse
Affiliation(s)
| | - Mireia Seuma
- Institute for Bioengineering of Catalonia (IBEC), The Barcelona Institute of Science and Technology (BIST) , Baldiri Reixac 10-12, 08028, Barcelona, Spain
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Barcelona, Spain
| | - Andre J. Faure
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Current address: ALLOX, C/ Dr. Aiguader, 88, PRBB Building, 08003 Barcelona, Spain
| | - Benedetta Bolognesi
- Institute for Bioengineering of Catalonia (IBEC), The Barcelona Institute of Science and Technology (BIST) , Baldiri Reixac 10-12, 08028, Barcelona, Spain
| | - Ben Lehner
- Wellcome Sanger Institute, Cambridge, UK
- Centre for Genomic Regulation (CRG), The Barcelona Institute for Science and Technology (BIST), Barcelona, Spain
- Universitat Pompeu Fabra (UPF), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
| |
Collapse
|
20
|
Notbohm J, Perica T. Biochemistry and genetics are coming together to improve our understanding of genotype to phenotype relationships. Curr Opin Struct Biol 2024; 89:102952. [PMID: 39522438 DOI: 10.1016/j.sbi.2024.102952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 10/15/2024] [Accepted: 10/18/2024] [Indexed: 11/16/2024]
Abstract
Since genome sequencing became accessible, determining how specific differences in genotypes lead to complex phenotypes such as disease has become one of the key goals in biomedicine. Predicting effects of sequence variants on cellular or organismal phenotype faces several challenges. First, variants simultaneously affect multiple protein properties and predicting their combined effect is complex. Second, effects of changes in a single protein propagate through the cellular network, which we only partially understand. In this review, we emphasize the importance of both biochemistry and genetics in addressing these challenges. Moreover, we highlight work that blurs the distinction between biochemistry and genetics fields to provide new insights into the genotype-to-phenotype relationships.
Collapse
Affiliation(s)
- Judith Notbohm
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland; Biomolecular Structure and Mechanism PhD Program, Life Science Graduate School Zurich, Switzerland
| | - Tina Perica
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland.
| |
Collapse
|
21
|
Tanaka M, Szabó Á, Vécsei L. Redefining Roles: A Paradigm Shift in Tryptophan-Kynurenine Metabolism for Innovative Clinical Applications. Int J Mol Sci 2024; 25:12767. [PMID: 39684480 DOI: 10.3390/ijms252312767] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2024] [Revised: 11/16/2024] [Accepted: 11/23/2024] [Indexed: 12/18/2024] Open
Abstract
The tryptophan-kynurenine (KYN) pathway has long been recognized for its essential role in generating metabolites that influence various physiological processes. Traditionally, these metabolites have been categorized into distinct, often opposing groups, such as pro-oxidant versus antioxidant, excitotoxic/neurotoxic versus neuroprotective. This dichotomous framework has shaped much of the research on conditions like neurodegenerative and neuropsychiatric disorders, as well as cancer, where metabolic imbalances are a key feature. The effects are significantly influenced by various factors, including the concentration of metabolites and the particular cellular milieu in which they are generated. A molecule that acts as neuroprotective at low concentrations may exhibit neurotoxic effects at elevated levels. The oxidative equilibrium of the surrounding environment can alter the function of KYN from an antioxidant to a pro-oxidant. This narrative review offers a comprehensive examination and analysis of the contemporary understanding of KYN metabolites, emphasizing their multifaceted biological functions and their relevance in numerous physiological and pathological processes. This underscores the pressing necessity for a paradigm shift in the comprehension of KYN metabolism. Understanding the context-dependent roles of KYN metabolites is vital for novel therapies in conditions like Alzheimer's disease, multiple sclerosis, and cancer. Comprehensive pathway modulation, including balancing inflammatory signals and enzyme regulation, offers promising avenues for targeted, effective treatments.
Collapse
Affiliation(s)
- Masaru Tanaka
- HUN-REN-SZTE Neuroscience Research Group, Hungarian Research Network, University of Szeged (HUN-REN-SZTE), Danube Neuroscience Research Laboratory, Tisza Lajos krt. 113, H-6725 Szeged, Hungary
| | - Ágnes Szabó
- Department of Neurology, Albert Szent-Györgyi Medical School, University of Szeged, Semmelweis u. 6, H-6725 Szeged, Hungary
- Doctoral School of Clinical Medicine, University of Szeged, Korányi fasor 6, H-6720 Szeged, Hungary
| | - László Vécsei
- HUN-REN-SZTE Neuroscience Research Group, Hungarian Research Network, University of Szeged (HUN-REN-SZTE), Danube Neuroscience Research Laboratory, Tisza Lajos krt. 113, H-6725 Szeged, Hungary
- Department of Neurology, Albert Szent-Györgyi Medical School, University of Szeged, Semmelweis u. 6, H-6725 Szeged, Hungary
| |
Collapse
|
22
|
Abdollahramezani S, Omo-Lamai D, Bosman G, Hemmatyar O, Dagli S, Dolia V, Chang K, Güsken NA, Delgado HC, Boons GJ, Brongersma ML, Safir F, Khuri-Yakub BT, Moradifar P, Dionne J. High-throughput antibody screening with high-quality factor nanophotonics and bioprinting. ARXIV 2024:arXiv:2411.18557v1. [PMID: 39650601 PMCID: PMC11623700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 12/11/2024]
Abstract
Empirical investigation of the quintillion-scale, functionally diverse antibody repertoires that can be generated synthetically or naturally is critical for identifying potential biotherapeutic leads, yet remains burdensome. We present high-throughput nanophotonics- and bioprinter-enabled screening (HT-NaBS), a multiplexed assay for large-scale, sample-efficient, and rapid characterization of antibody libraries. Our platform is built upon independently addressable pixelated nanoantennas exhibiting wavelength-scale mode volumes, high-quality factors (high-Q) exceeding 5000, and pattern densities exceeding one million sensors per square centimeter. Our custom-built acoustic bioprinter enables individual sensor functionalization via the deposition of picoliter droplets from a library of capture antigens at rates up to 25,000 droplets per second. We detect subtle differentiation in the target binding signature through spatially-resolved spectral imaging of hundreds of resonators simultaneously, elucidating antigen-antibody binding kinetic rates, affinity constant, and specificity. We demonstrate HT-NaBS on a panel of antibodies targeting SARS-CoV-2, Influenza A, and Influenza B antigens, with a sub-picomolar limit of detection within 30 minutes. Furthermore, through epitope binning analysis, we demonstrate the competence and diversity of a library of native antibodies targeting functional epitopes on a priority pathogen (H5N1 bird flu) and on glycosylated therapeutic Cetuximab antibodies against epidermal growth factor receptor. With a roadmap to image tens of thousands of sensors simultaneously, this high-throughput, resource-efficient, and label-free platform can rapidly screen for high-affinity and broad epitope coverage, accelerating biotherapeutic discovery and de novo protein design.
Collapse
Affiliation(s)
| | - Darrell Omo-Lamai
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Gerlof Bosman
- Department of Chemical Biology and Drug Discovery, Utrecht University, Utrecht, Netherlands
| | - Omid Hemmatyar
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Sahil Dagli
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Varun Dolia
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Kai Chang
- Department of Electrical Engineering, Stanford University, Stanford, CA, USA
| | - Nicholas A. Güsken
- Geballe Laboratory for Advanced Materials, Stanford University, Stanford University, Stanford, CA, USA
| | - Hamish Carr Delgado
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Geert-Jan Boons
- Department of Chemical Biology and Drug Discovery, Utrecht University, Utrecht, Netherlands
| | - Mark L. Brongersma
- Geballe Laboratory for Advanced Materials, Stanford University, Stanford University, Stanford, CA, USA
| | | | | | - Parivash Moradifar
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
| | - Jennifer Dionne
- Department of Materials Science and Engineering, Stanford University, Stanford, CA, USA
- Department of Radiology, Stanford University, Stanford, CA, USA
| |
Collapse
|
23
|
Swint-Kruse L, Martin TA, Wu T, Dougherty LL, Fenton AW. Identification of positions in human aldolase a that are neutral for apparent K M. Arch Biochem Biophys 2024; 761:110183. [PMID: 39461494 PMCID: PMC11908651 DOI: 10.1016/j.abb.2024.110183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2024] [Revised: 10/16/2024] [Accepted: 10/20/2024] [Indexed: 10/29/2024]
Abstract
According to evolutionary theory, many naturally-occurring amino acid substitutions are expected to be neutral or near-neutral, with little effect on protein structure or function. Accordingly, most changes observed in human exomes are also expected to be neutral. As such, accurate algorithms for identifying medically-relevant changes must discriminate rare, non-neutral substitutions against a background of neutral substitutions. However, due to historical biases in biochemical experiments, the data available to train and validate prediction algorithms mostly contains non-neutral substitutions, with few examples of neutral substitutions. Thus, available training sets have the opposite composition of the desired test sets. Towards improving a dataset of these critical negative controls, we have concentrated on identifying neutral positions - those positions for which most of the possible 19 amino acid substitutions have little effect on protein structure or function. Here, we used a strategy based on multiple sequence alignments to identify putative neutral positions in human aldolase A, followed by biochemical assays for 147 aldolase substitutions. Results showed that most variants had little effect on either the apparent Michaelis constant for substrate fructose-1,6-bisphosphate or its apparent cooperativity. Thus, these data are useful for training and validating prediction algorithms. In addition, we created a database of these and other biochemically characterized aldolase variants along with aldolase sequences and characteristics derived from sequence and structure analyses. This database is publicly available at https://github.com/liskinsk/Aldolase-variant-and-sequence-database.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, MSN 3030, Kansas City, KS, 66160, USA.
| | - Tyler A Martin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, MSN 3030, Kansas City, KS, 66160, USA
| | - Tiffany Wu
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, MSN 3030, Kansas City, KS, 66160, USA
| | - Larissa L Dougherty
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, MSN 3030, Kansas City, KS, 66160, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, 3901 Rainbow Blvd, MSN 3030, Kansas City, KS, 66160, USA.
| |
Collapse
|
24
|
Muir DF, Asper GPR, Notin P, Posner JA, Marks DS, Keiser MJ, Pinney MM. Evolutionary-Scale Enzymology Enables Biochemical Constant Prediction Across a Multi-Peaked Catalytic Landscape. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.23.619915. [PMID: 39484523 PMCID: PMC11526920 DOI: 10.1101/2024.10.23.619915] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 11/03/2024]
Abstract
Quantitatively mapping enzyme sequence-catalysis landscapes remains a critical challenge in understanding enzyme function, evolution, and design. Here, we expand an emerging microfluidic platform to measure catalytic constants-k cat and K M-for hundreds of diverse naturally occurring sequences and mutants of the model enzyme Adenylate Kinase (ADK). This enables us to dissect the sequence-catalysis landscape's topology, navigability, and mechanistic underpinnings, revealing distinct catalytic peaks organized by structural motifs. These results challenge long-standing hypotheses in enzyme adaptation, demonstrating that thermophilic enzymes are not slower than their mesophilic counterparts. Combining the rich representations of protein sequences provided by deep-learning models with our custom high-throughput kinetic data yields semi-supervised models that significantly outperform existing models at predicting catalytic parameters of naturally occurring ADK sequences. Our work demonstrates a promising strategy for dissecting sequence-catalysis landscapes across enzymatic evolution and building family-specific models capable of accurately predicting catalytic constants, opening new avenues for enzyme engineering and functional prediction.
Collapse
Affiliation(s)
- Duncan F Muir
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Program in Biophysics, University of California, San Francisco, San Francisco, CA, USA
| | - Garrison P R Asper
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
| | - Pascal Notin
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Department of Computer Science, University of Oxford, Oxford, UK
| | - Jacob A Posner
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Department of Biology, San Francisco State University, San Francisco, CA, USA
| | - Debora S Marks
- Department of Systems Biology, Harvard Medical School, Boston, MA, USA
- Broad Institute of Harvard and MIT, Cambridge, MA, USA
| | - Michael J Keiser
- Department of Pharmaceutical Chemistry, University of California, San Francisco, San Francisco, CA, USA
- Institute for Neurodegenerative Diseases, University of California, San Francisco, San Francisco, CA, USA
- Bakar Computational Health Sciences Institute, University of California, San Francisco, San Francisco, CA, USA
| | - Margaux M Pinney
- Department of Biochemistry and Biophysics, University of California San Francisco, San Francisco, CA, USA
- Valhalla Fellow, University of California San Francisco, San Francisco, CA, USA
| |
Collapse
|
25
|
Barhoosh H, Dixit A, Cochrane WG, Cavett V, Prince RN, Blair BO, Ward FR, McClure KF, Patten PA, Paulick MG, Paegel BM. Activity-Based DNA-Encoded Library Screening for Selective Inhibitors of Eukaryotic Translation. ACS CENTRAL SCIENCE 2024; 10:1960-1968. [PMID: 39463829 PMCID: PMC11503492 DOI: 10.1021/acscentsci.4c01218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 07/26/2024] [Revised: 09/04/2024] [Accepted: 09/24/2024] [Indexed: 10/29/2024]
Abstract
Small molecule probes exist for only ∼2% of human proteins because most lack functional binding pockets or cannot be assayed for high-throughput screening. Selective translation modulation circumvents canonical druggability and assay development constraints by using in vitro transcription-translation (IVTT) as a universal biochemical screening assay. We developed an IVTT activity assay by fusing a GFP reporter to various target gene sequences and screened the target sequences for inhibitors in microfluidic picoliter-scale droplets using a 5,348-member translation inhibitor DNA-encoded library (DEL). Screening a proof-of-concept PCSK9-GFP reporter yielded many hits; 6/7 hits inhibited PCSK9-GFP IVTT (IC50 1-20 μM), and the lead hit reduced PCSK9 levels in HepG2 cells. Preliminary selectivity was informed by counterscreening the DEL against a frameshift mutant PCSK9-GFP reporter. A plug-and-play approach to assay development and screening was demonstrated by scouting the DEL for activity using reporter genes of targets with difficult-to-assay or even unknown function (RPL27, KRASG12D, MST1, USO1). This microfluidic IVTT DEL screening platform could scale probe discovery to the human proteome and perhaps more broadly across the tree of life.
Collapse
Affiliation(s)
- Huda Barhoosh
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Anjali Dixit
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Wesley G. Cochrane
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Valerie Cavett
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
| | - Robin N. Prince
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Brooke O. Blair
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Fred R. Ward
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Kim F. McClure
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Phillip A. Patten
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Margot G. Paulick
- Initial
Therapeutics, South San Francisco, California 94080, United States
| | - Brian M. Paegel
- Department
of Pharmaceutical Sciences, University of
California, Irvine, California 92697, United States
- Departments
of Chemistry & Biomedical Engineering, University of California, Irvine, California 92697, United States
| |
Collapse
|
26
|
Savinov A. Enzoology: understanding enzyme interactions and epistasis in the cell. Trends Biochem Sci 2024; 49:841-842. [PMID: 39048479 DOI: 10.1016/j.tibs.2024.07.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2024] [Accepted: 07/16/2024] [Indexed: 07/27/2024]
Abstract
Recent work from Nguyen et al. unveils massively parallel measurements of epistatic interactions between two enzymes, dihydrofolate reductase and thymidylate synthase, in their natural cellular context. Almost 3000 mutations of DHFR in three TYMS backgrounds reveal a complex interaction network. The authors capture much of this complexity using a simple model.
Collapse
Affiliation(s)
- Andrew Savinov
- Department of Biology, Massachusetts Institute of Technology, MA, USA.
| |
Collapse
|
27
|
Gantz M, Mathis SV, Nintzel FEH, Lio P, Hollfelder F. On synergy between ultrahigh throughput screening and machine learning in biocatalyst engineering. Faraday Discuss 2024; 252:89-114. [PMID: 39133073 PMCID: PMC11318516 DOI: 10.1039/d4fd00065j] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/25/2024] [Accepted: 04/23/2024] [Indexed: 08/13/2024]
Abstract
Protein design and directed evolution have separately contributed enormously to protein engineering. Without being mutually exclusive, the former relies on computation from first principles, while the latter is a combinatorial approach based on chance. Advances in ultrahigh throughput (uHT) screening, next generation sequencing and machine learning may create alternative routes to engineered proteins, where functional information linked to specific sequences is interpreted and extrapolated in silico. In particular, the miniaturisation of functional tests in water-in-oil emulsion droplets with picoliter volumes and their rapid generation and analysis (>1 kHz) allows screening of >107-membered libraries in a day. Subsequently, decoding the selected clones by short or long-read sequencing methods leads to large sequence-function datasets that may allow extrapolation from experimental directed evolution to further improved mutants beyond the observed hits. In this work, we explore experimental strategies for how to draw up 'fitness landscapes' in sequence space with uHT droplet microfluidics, review the current state of AI/ML in enzyme engineering and discuss how uHT datasets may be combined with AI/ML to make meaningful predictions and accelerate biocatalyst engineering.
Collapse
Affiliation(s)
- Maximilian Gantz
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK
| | - Simon V Mathis
- Department of Computer Science, University of Cambridge, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK
| | - Friederike E H Nintzel
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK
| | - Pietro Lio
- Department of Computer Science, University of Cambridge, 15 JJ Thomson Avenue, Cambridge CB3 0FD, UK
| | - Florian Hollfelder
- Department of Biochemistry, University of Cambridge, 80 Tennis Court Road, Cambridge, CB2 1GA, UK
| |
Collapse
|
28
|
Billerbeck S, Walker RSK, Pretorius IS. Killer yeasts: expanding frontiers in the age of synthetic biology. Trends Biotechnol 2024; 42:1081-1096. [PMID: 38575438 DOI: 10.1016/j.tibtech.2024.03.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2024] [Revised: 03/07/2024] [Accepted: 03/07/2024] [Indexed: 04/06/2024]
Abstract
Killer yeasts secrete protein toxins that are selectively lethal to other yeast and filamentous fungi. These exhibit exceptional genetic and functional diversity, and have several biotechnological applications. However, despite decades of research, several limitations hinder their widespread adoption. In this perspective we contend that technical advances in synthetic biology present an unprecedented opportunity to unlock the full potential of yeast killer systems across a spectrum of applications. By leveraging these new technologies, engineered killer toxins may emerge as a pivotal new tool to address antifungal resistance and food security. Finally, we speculate on the biotechnological potential of re-engineering host double-stranded (ds) RNA mycoviruses, from which many toxins derive, as a safe and noninfectious system to produce designer RNA.
Collapse
Affiliation(s)
- Sonja Billerbeck
- Molecular Microbiology, Groningen Biomolecular Sciences and Biotechnology institute, University of Groningen, Groningen 9747, AG, The Netherlands
| | - Roy S K Walker
- Department of Molecular Sciences, Macquarie University, Sydney, New South Wales 2109, Australia; ARC Centre of Excellence in Synthetic Biology, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Isak S Pretorius
- ARC Centre of Excellence in Synthetic Biology, Macquarie University, Sydney, New South Wales 2109, Australia.
| |
Collapse
|
29
|
DelRosso N, Suzuki PH, Griffith D, Lotthammer JM, Novak B, Kocalar S, Sheth MU, Holehouse AS, Bintu L, Fordyce P. High-throughput affinity measurements of direct interactions between activation domains and co-activators. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.08.19.608698. [PMID: 39229005 PMCID: PMC11370418 DOI: 10.1101/2024.08.19.608698] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 09/05/2024]
Abstract
Sequence-specific activation by transcription factors is essential for gene regulation1,2. Key to this are activation domains, which often fall within disordered regions of transcription factors3,4 and recruit co-activators to initiate transcription5. These interactions are difficult to characterize via most experimental techniques because they are typically weak and transient6,7. Consequently, we know very little about whether these interactions are promiscuous or specific, the mechanisms of binding, and how these interactions tune the strength of gene activation. To address these questions, we developed a microfluidic platform for expression and purification of hundreds of activation domains in parallel followed by direct measurement of co-activator binding affinities (STAMMPPING, for Simultaneous Trapping of Affinity Measurements via a Microfluidic Protein-Protein INteraction Generator). By applying STAMMPPING to quantify direct interactions between eight co-activators and 204 human activation domains (>1,500 K ds), we provide the first quantitative map of these interactions and reveal 334 novel binding pairs. We find that the metazoan-specific co-activator P300 directly binds >100 activation domains, potentially explaining its widespread recruitment across the genome to influence transcriptional activation. Despite sharing similar molecular properties (e.g. enrichment of negative and hydrophobic residues), activation domains utilize distinct biophysical properties to recruit certain co-activator domains. Co-activator domain affinity and occupancy are well-predicted by analytical models that account for multivalency, and in vitro affinities quantitatively predict activation in cells with an ultrasensitive response. Not only do our results demonstrate the ability to measure affinities between even weak protein-protein interactions in high throughput, but they also provide a necessary resource of over 1,500 activation domain/co-activator affinities which lays the foundation for understanding the molecular basis of transcriptional activation.
Collapse
Affiliation(s)
| | - Peter H Suzuki
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Daniel Griffith
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Jeffrey M Lotthammer
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Borna Novak
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Selin Kocalar
- Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Maya U Sheth
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St. Louis, MO, USA
- Center for Biomolecular Condensates, Washington University in St. Louis, St. Louis, MO, USA
| | - Lacramioara Bintu
- Biophysics Program, Stanford University, Stanford, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
| | - Polly Fordyce
- Biophysics Program, Stanford University, Stanford, CA, USA
- Department of Bioengineering, Stanford University, Stanford, CA, USA
- Sarafan ChEM-H Institute, Stanford University, Stanford, CA, USA
- Chan Zuckerberg Biohub San Francisco, CA, USA
| |
Collapse
|
30
|
Cui H, Zhang Y, Liu S, Cao Y, Ma Q, Liu Y, Lin H, Li C, Xiao Y, Hassan SU, Shum HC. Thermo-responsive aqueous two-phase system for two-level compartmentalization. Nat Commun 2024; 15:6771. [PMID: 39117632 PMCID: PMC11310206 DOI: 10.1038/s41467-024-51043-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2023] [Accepted: 07/26/2024] [Indexed: 08/10/2024] Open
Abstract
Hierarchical compartmentalization responding to changes in intracellular and extracellular environments is ubiquitous in living eukaryotic cells but remains a formidable task in synthetic systems. Here we report a two-level compartmentalization approach based on a thermo-responsive aqueous two-phase system (TR-ATPS) comprising poly(N-isopropylacrylamide) (PNIPAM) and dextran (DEX). Liquid membraneless compartments enriched in PNIPAM are phase-separated from the continuous DEX solution via liquid-liquid phase separation at 25 °C and shrink dramatically with small second-level compartments generated at the interface, resembling the structure of colloidosome, by increasing the temperature to 35 °C. The TR-ATPS can store biomolecules, program the spatial distribution of enzymes, and accelerate the overall biochemical reaction efficiency by nearly 7-fold. The TR-ATPS inspires on-demand, stimulus-triggered spatiotemporal enrichment of biomolecules via two-level compartmentalization, creating opportunities in synthetic biology and biochemical engineering.
Collapse
Affiliation(s)
- Huanqing Cui
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
| | - Yage Zhang
- Advanced Biomedical Instrumentation Centre, Hong Kong Science Park, Shatin, New Territories, Hong Kong (SAR), China
- School of Biomedical Engineering, Shenzhen University Medical School, Shenzhen University, 518055, Shenzhen, Guangdong, China
| | - Sihan Liu
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
| | - Yang Cao
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
| | - Qingming Ma
- School of Pharmacy, Qingdao University, 266071, Qingdao, China
| | - Yuan Liu
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
- Advanced Biomedical Instrumentation Centre, Hong Kong Science Park, Shatin, New Territories, Hong Kong (SAR), China
| | - Haisong Lin
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
- Advanced Biomedical Instrumentation Centre, Hong Kong Science Park, Shatin, New Territories, Hong Kong (SAR), China
| | - Chang Li
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
| | - Yang Xiao
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
- College of Chemistry and Materials Science, Anhui Normal University, Wuhu, Anhui, China
| | - Sammer Ul Hassan
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China
- Advanced Biomedical Instrumentation Centre, Hong Kong Science Park, Shatin, New Territories, Hong Kong (SAR), China
| | - Ho Cheung Shum
- Department of Mechanical Engineering, The University of Hong Kong, Pokfulam Road, Hong Kong (SAR), China.
- Advanced Biomedical Instrumentation Centre, Hong Kong Science Park, Shatin, New Territories, Hong Kong (SAR), China.
| |
Collapse
|
31
|
Johnston KE, Almhjell PJ, Watkins-Dulaney EJ, Liu G, Porter NJ, Yang J, Arnold FH. A combinatorially complete epistatic fitness landscape in an enzyme active site. Proc Natl Acad Sci U S A 2024; 121:e2400439121. [PMID: 39074291 PMCID: PMC11317637 DOI: 10.1073/pnas.2400439121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2024] [Accepted: 06/17/2024] [Indexed: 07/31/2024] Open
Abstract
Protein engineering often targets binding pockets or active sites which are enriched in epistasis-nonadditive interactions between amino acid substitutions-and where the combined effects of multiple single substitutions are difficult to predict. Few existing sequence-fitness datasets capture epistasis at large scale, especially for enzyme catalysis, limiting the development and assessment of model-guided enzyme engineering approaches. We present here a combinatorially complete, 160,000-variant fitness landscape across four residues in the active site of an enzyme. Assaying the native reaction of a thermostable β-subunit of tryptophan synthase (TrpB) in a nonnative environment yielded a landscape characterized by significant epistasis and many local optima. These effects prevent simulated directed evolution approaches from efficiently reaching the global optimum. There is nonetheless wide variability in the effectiveness of different directed evolution approaches, which together provide experimental benchmarks for computational and machine learning workflows. The most-fit TrpB variants contain a substitution that is nearly absent in natural TrpB sequences-a result that conservation-based predictions would not capture. Thus, although fitness prediction using evolutionary data can enrich in more-active variants, these approaches struggle to identify and differentiate among the most-active variants, even for this near-native function. Overall, this work presents a large-scale testing ground for model-guided enzyme engineering and suggests that efficient navigation of epistatic fitness landscapes can be improved by advances in both machine learning and physical modeling.
Collapse
Affiliation(s)
- Kadina E. Johnston
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Patrick J. Almhjell
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Ella J. Watkins-Dulaney
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Grace Liu
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
| | - Nicholas J. Porter
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Jason Yang
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| | - Frances H. Arnold
- Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA91125
- Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, CA91125
| |
Collapse
|
32
|
Wang T, Xiang G, He S, Su L, Wang Y, Yan X, Lu H. DeepEnzyme: a robust deep learning model for improved enzyme turnover number prediction by utilizing features of protein 3D-structures. Brief Bioinform 2024; 25:bbae409. [PMID: 39162313 PMCID: PMC11880767 DOI: 10.1093/bib/bbae409] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2024] [Revised: 07/13/2024] [Accepted: 08/04/2024] [Indexed: 08/21/2024] Open
Abstract
Turnover numbers (kcat), which indicate an enzyme's catalytic efficiency, have a wide range of applications in fields including protein engineering and synthetic biology. Experimentally measuring the enzymes' kcat is always time-consuming. Recently, the prediction of kcat using deep learning models has mitigated this problem. However, the accuracy and robustness in kcat prediction still needs to be improved significantly, particularly when dealing with enzymes with low sequence similarity compared to those within the training dataset. Herein, we present DeepEnzyme, a cutting-edge deep learning model that combines the most recent Transformer and Graph Convolutional Network (GCN) to capture the information of both the sequence and 3D-structure of a protein. To improve the prediction accuracy, DeepEnzyme was trained by leveraging the integrated features from both sequences and 3D-structures. Consequently, DeepEnzyme exhibits remarkable robustness when processing enzymes with low sequence similarity compared to those in the training dataset by utilizing additional features from high-quality protein 3D-structures. DeepEnzyme also makes it possible to evaluate how point mutations affect the catalytic activity of the enzyme, which helps identify residue sites that are crucial for the catalytic function. In summary, DeepEnzyme represents a pioneering effort in predicting enzymes' kcat values with improved accuracy and robustness compared to previous algorithms. This advancement will significantly contribute to our comprehension of enzyme function and its evolutionary patterns across species.
Collapse
Affiliation(s)
- Tong Wang
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai 200240, China
- College of Science, Chongqing University of Technology, 69 Hongguang Avenue, Banan District, Chongqing 400054, China
| | - Guangming Xiang
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai 200240, China
| | - Siwei He
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai 200240, China
| | - Liyun Su
- College of Science, Chongqing University of Technology, 69 Hongguang Avenue, Banan District, Chongqing 400054, China
| | - Yuguang Wang
- Institute of Natural Sciences, School of Mathematical Sciences, Zhangjiang Institute of Advanced Study, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai 200240, China
- Shanghai Artificial Intelligence Laboratory, 701 Yunjin Road, Xuhui District, Shanghai 200237, China
| | - Xuefeng Yan
- Key Laboratory of Smart Manufacturing in Energy Chemical Process, Ministry of Education, East China University of Science and Technology, 130 Meilong Road, Xuhui District, Shanghai 200237, China
- State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Xuhui District, Shanghai 200237, China
| | - Hongzhong Lu
- State Key Laboratory of Microbial Metabolism, School of Life Science and Biotechnology, Shanghai Jiao Tong University, 800 Dongchuan RD. Minhang District, Shanghai 200240, China
| |
Collapse
|
33
|
Jiang K, Yan Z, Di Bernardo M, Sgrizzi SR, Villiger L, Kayabolen A, Kim B, Carscadden JK, Hiraizumi M, Nishimasu H, Gootenberg JS, Abudayyeh OO. Rapid protein evolution by few-shot learning with a protein language model. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.07.17.604015. [PMID: 39071429 PMCID: PMC11275896 DOI: 10.1101/2024.07.17.604015] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/30/2024]
Abstract
Directed evolution of proteins is critical for applications in basic biological research, therapeutics, diagnostics, and sustainability. However, directed evolution methods are labor intensive, cannot efficiently optimize over multiple protein properties, and are often trapped by local maxima. In silico-directed evolution methods incorporating protein language models (PLMs) have the potential to accelerate this engineering process, but current approaches fail to generalize across diverse protein families. We introduce EVOLVEpro, a few-shot active learning framework to rapidly improve protein activity using a combination of PLMs and protein activity predictors, achieving improved activity with as few as four rounds of evolution. EVOLVEpro substantially enhances the efficiency and effectiveness of in silico protein evolution, surpassing current state-of-the-art methods and yielding proteins with up to 100-fold improvement of desired properties. We showcase EVOLVEpro for five proteins across three applications: T7 RNA polymerase for RNA production, a miniature CRISPR nuclease, a prime editor, and an integrase for genome editing, and a monoclonal antibody for epitope binding. These results demonstrate the advantages of few-shot active learning with small amounts of experimental data over zero-shot predictions. EVOLVEpro paves the way for broader applications of AI-guided protein engineering in biology and medicine.
Collapse
Affiliation(s)
- Kaiyi Jiang
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
- Department of Bioengineering Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Zhaoqing Yan
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Matteo Di Bernardo
- Department of Bioengineering Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Samantha R. Sgrizzi
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Lukas Villiger
- Department of Dermatology and Allergology Kantonspital St. Gallen St. Gallen, 9000, Switzerland
| | - Alisan Kayabolen
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Byungji Kim
- Koch Institute for Integrative Cancer Research At MIT Massachusetts Institute of Technology Cambridge, 02139 MA, USA
| | - Josephine K. Carscadden
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Masahiro Hiraizumi
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Hiroshi Nishimasu
- Department of Chemistry and Biotechnology, Graduate School of Engineering, The University of Tokyo 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
- Structural Biology Division, Research Center for Advanced Science and Technology, The University of Tokyo 4-6-1 Komaba, Meguro-ku, Tokyo 153-8904, Japan
- Inamori Research Institute for Science 620 Suiginya-cho, Shimogyo-ku, Kyoto 600-8411, Japan
| | - Jonathan S. Gootenberg
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| | - Omar O. Abudayyeh
- Department of Medicine Division of Engineering in Medicine Brigham and Women’s Hospital Harvard Medical School Boston, 02115 MA, USA
- Gene and Cell Therapy Institute Mass General Brigham Cambridge, 02139 MA, USA
- Center for Virology and Vaccine Research Beth Israel Deaconess Medical Center Harvard Medical School Boston, 02115 MA, USA
| |
Collapse
|
34
|
Wardman JF, Withers SG. Carbohydrate-active enzyme (CAZyme) discovery and engineering via (Ultra)high-throughput screening. RSC Chem Biol 2024; 5:595-616. [PMID: 38966674 PMCID: PMC11221537 DOI: 10.1039/d4cb00024b] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Accepted: 05/16/2024] [Indexed: 07/06/2024] Open
Abstract
Carbohydrate-active enzymes (CAZymes) constitute a diverse set of enzymes that catalyze the assembly, degradation, and modification of carbohydrates. These enzymes have been fashioned into potent, selective catalysts by millennia of evolution, and yet are also highly adaptable and readily evolved in the laboratory. To identify and engineer CAZymes for different purposes, (ultra)high-throughput screening campaigns have been frequently utilized with great success. This review provides an overview of the different approaches taken in screening for CAZymes and how mechanistic understandings of CAZymes can enable new approaches to screening. Within, we also cover how cutting-edge techniques such as microfluidics, advances in computational approaches and synthetic biology, as well as novel assay designs are leading the field towards more informative and effective screening approaches.
Collapse
Affiliation(s)
- Jacob F Wardman
- Department of Biochemistry and Molecular Biology, University of British Columbia Vancouver BC V6T 1Z3 Canada
- Michael Smith Laboratories, University of British Columbia Vancouver BC V6T 1Z4 Canada
| | - Stephen G Withers
- Department of Biochemistry and Molecular Biology, University of British Columbia Vancouver BC V6T 1Z3 Canada
- Michael Smith Laboratories, University of British Columbia Vancouver BC V6T 1Z4 Canada
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| |
Collapse
|
35
|
Kissman EN, Sosa MB, Millar DC, Koleski EJ, Thevasundaram K, Chang MCY. Expanding chemistry through in vitro and in vivo biocatalysis. Nature 2024; 631:37-48. [PMID: 38961155 DOI: 10.1038/s41586-024-07506-w] [Citation(s) in RCA: 20] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2021] [Accepted: 05/01/2024] [Indexed: 07/05/2024]
Abstract
Living systems contain a vast network of metabolic reactions, providing a wealth of enzymes and cells as potential biocatalysts for chemical processes. The properties of protein and cell biocatalysts-high selectivity, the ability to control reaction sequence and operation in environmentally benign conditions-offer approaches to produce molecules at high efficiency while lowering the cost and environmental impact of industrial chemistry. Furthermore, biocatalysis offers the opportunity to generate chemical structures and functions that may be inaccessible to chemical synthesis. Here we consider developments in enzymes, biosynthetic pathways and cellular engineering that enable their use in catalysis for new chemistry and beyond.
Collapse
Affiliation(s)
- Elijah N Kissman
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | - Max B Sosa
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | - Douglas C Millar
- Department of Chemical and Biomolecular Engineering, University of California Berkeley, Berkeley, CA, USA
| | - Edward J Koleski
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA
| | | | - Michelle C Y Chang
- Department of Chemistry, University of California Berkeley, Berkeley, CA, USA.
- Department of Chemical and Biomolecular Engineering, University of California Berkeley, Berkeley, CA, USA.
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA.
- Department of Chemistry, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
36
|
O'Neil PT, Swint‐Kruse L, Fenton AW. Rheostatic contributions to protein stability can obscure a position's functional role. Protein Sci 2024; 33:e5075. [PMID: 38895978 PMCID: PMC11187868 DOI: 10.1002/pro.5075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/24/2024] [Accepted: 05/27/2024] [Indexed: 06/21/2024]
Abstract
Rheostat positions, which can be substituted with various amino acids to tune protein function across a range of outcomes, are a developing area for advancing personalized medicine and bioengineering. Current methods cannot accurately predict which proteins contain rheostat positions or their substitution outcomes. To compare the prevalence of rheostat positions in homologs, we previously investigated their occurrence in two pyruvate kinase (PYK) isozymes. Human liver PYK contained numerous rheostat positions that tuned the apparent affinity for the substrate phosphoenolpyruvate (Kapp-PEP) across a wide range. In contrast, no functional rheostat positions were identified in Zymomonas mobilis PYK (ZmPYK). Further, the set of ZmPYK substitutions included an unusually large number that lacked measurable activity. We hypothesized that the inactive substitution variants had reduced protein stability, precluding detection of Kapp-PEP tuning. Using modified buffers, robust enzymatic activity was obtained for 19 previously-inactive ZmPYK substitution variants at three positions. Surprisingly, both previously-inactive and previously-active substitution variants all had Kapp-PEP values close to wild-type. Thus, none of the three positions were functional rheostat positions, and, unlike human liver PYK, ZmPYK's Kapp-PEP remained poorly tunable by single substitutions. To directly assess effects on stability, we performed thermal denaturation experiments for all ZmPYK substitution variants. Many diminished stability, two enhanced stability, and the three positions showed different thermal sensitivity to substitution, with one position acting as a "stability rheostat." The differences between the two PYK homologs raises interesting questions about the underlying mechanism(s) that permit functional tuning by single substitutions in some proteins but not in others.
Collapse
Affiliation(s)
- Pierce T. O'Neil
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| | - Liskin Swint‐Kruse
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| | - Aron W. Fenton
- Department of Biochemistry and Molecular BiologyThe University of Kansas Medical CenterKansasUSA
| |
Collapse
|
37
|
McShea H, Weibel C, Wehbi S, Goodman P, James JE, Wheeler AL, Masel J. The effectiveness of selection in a species affects the direction of amino acid frequency evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.02.01.526552. [PMID: 38948853 PMCID: PMC11212923 DOI: 10.1101/2023.02.01.526552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]
Abstract
Nearly neutral theory predicts that species with higher effective population size (N e ) are better able to purge slightly deleterious mutations. We compare evolution in high-N e vs. low-N e vertebrates to reveal which amino acid frequencies are subject to subtle selective preferences. We take three complementary approaches, two measuring flux and one measuring outcomes. First, we fit non-stationary substitution models of amino acid flux using maximum likelihood, comparing the high-N e clade of rodents and lagomorphs to its low-N e sister clade of primates and colugos. Second, we compare evolutionary outcomes across a wider range of vertebrates, via correlations between amino acid frequencies and N e . Third, we dissect the details of flux in human, chimpanzee, mouse, and rat, as scored by parsimony - this also enables comparison to a historical paper. All three methods agree on which amino acids are preferred under more effective selection. Preferred amino acids tend to be smaller, less costly to synthesize, and to promote intrinsic structural disorder. Parsimony-induced bias in the historical study produces an apparent reduction in structural disorder, perhaps driven by slightly deleterious substitutions. Within highly exchangeable pairs of amino acids, arginine is strongly preferred over lysine, and valine over isoleucine, consistent with more effective selection preferring a marginally larger free energy of folding. These two preferences match differences between thermophiles and mesophilic relatives. These results reveal the biophysical consequences of mutation-selection-drift balance, and demonstrate the utility of nearly neutral theory for understanding protein evolution.
Collapse
Affiliation(s)
- Hanon McShea
- Department of Earth System Science, Stanford University
| | - Catherine Weibel
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Applied Physics, Stanford University
| | - Sawsan Wehbi
- Graduate Interdisciplinary Program in Genetics, University of Arizona
| | | | - Jennifer E James
- Department of Ecology & Evolutionary Biology, University of Arizona
- Department of Ecology and Genetics, Uppsala University
| | - Andrew L Wheeler
- Graduate Interdisciplinary Program in Genetics, University of Arizona
| | - Joanna Masel
- Department of Ecology & Evolutionary Biology, University of Arizona
| |
Collapse
|
38
|
Yee SW, Macdonald CB, Mitrovic D, Zhou X, Koleske ML, Yang J, Buitrago Silva D, Rockefeller Grimes P, Trinidad DD, More SS, Kachuri L, Witte JS, Delemotte L, Giacomini KM, Coyote-Maestas W. The full spectrum of SLC22 OCT1 mutations illuminates the bridge between drug transporter biophysics and pharmacogenomics. Mol Cell 2024; 84:1932-1947.e10. [PMID: 38703769 PMCID: PMC11382353 DOI: 10.1016/j.molcel.2024.04.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Revised: 01/04/2024] [Accepted: 04/15/2024] [Indexed: 05/06/2024]
Abstract
Mutations in transporters can impact an individual's response to drugs and cause many diseases. Few variants in transporters have been evaluated for their functional impact. Here, we combine saturation mutagenesis and multi-phenotypic screening to dissect the impact of 11,213 missense single-amino-acid deletions, and synonymous variants across the 554 residues of OCT1, a key liver xenobiotic transporter. By quantifying in parallel expression and substrate uptake, we find that most variants exert their primary effect on protein abundance, a phenotype not commonly measured alongside function. Using our mutagenesis results combined with structure prediction and molecular dynamic simulations, we develop accurate structure-function models of the entire transport cycle, providing biophysical characterization of all known and possible human OCT1 polymorphisms. This work provides a complete functional map of OCT1 variants along with a framework for integrating functional genomics, biophysical modeling, and human genetics to predict variant effects on disease and drug efficacy.
Collapse
Affiliation(s)
- Sook Wah Yee
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Christian B Macdonald
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Darko Mitrovic
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden
| | - Xujia Zhou
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Megan L Koleske
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Jia Yang
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Dina Buitrago Silva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Patrick Rockefeller Grimes
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Donovan D Trinidad
- Department of Medicine, Division of Infectious Disease, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Swati S More
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA
| | - Linda Kachuri
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - John S Witte
- Department of Epidemiology and Population Health, Stanford University, Stanford, CA 94305, USA; Stanford Cancer Institute, Stanford University, Stanford, CA 94305, USA
| | - Lucie Delemotte
- Science for Life Laboratory, Department of Applied Physics, KTH Royal Institute of Technology, 12121 Solna, Stockholm, Stockholm County 114 28, Sweden.
| | - Kathleen M Giacomini
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA.
| | - Willow Coyote-Maestas
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94143, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94143, USA; Chan Zuckerberg Biohub, San Francisco, CA 94148, USA.
| |
Collapse
|
39
|
Syrén PO. Ancestral terpene cyclases: From fundamental science to applications in biosynthesis. Methods Enzymol 2024; 699:311-341. [PMID: 38942509 DOI: 10.1016/bs.mie.2024.04.025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/30/2024]
Abstract
Terpenes constitute one of the largest family of natural products with potent applications as renewable platform chemicals and medicines. The low activity, selectivity and stability displayed by terpene biosynthetic machineries can constitute an obstacle towards achieving expedient biosynthesis of terpenoids in processes that adhere to the 12 principles of green chemistry. Accordingly, engineering of terpene synthase enzymes is a prerequisite for industrial biotechnology applications, but obstructed by their complex catalysis that depend on reactive carbocationic intermediates that are prone to undergo bifurcation mechanisms. Rational redesign of terpene synthases can be tedious and requires high-resolution structural information, which is not always available. Furthermore, it has proven difficult to link sequence space of terpene synthase enzymes to specific product profiles. Herein, the author shows how ancestral sequence reconstruction (ASR) can favorably be used as a protein engineering tool in the redesign of terpene synthases without the need of a structure, and without excessive screening. A detailed workflow of ASR is presented along with associated limitations, with a focus on applying this methodology on terpene synthases. From selected examples of both class I and II enzymes, the author advocates that ancestral terpene cyclases constitute valuable assets to shed light on terpene-synthase catalysis and in enabling accelerated biosynthesis.
Collapse
Affiliation(s)
- Per-Olof Syrén
- School of Chemistry, Biotechnology and Health, Science for Life Laboratory, KTH Royal Institute of Technology, Solna, Sweden; School of Engineering Sciences in Chemistry, Biotechnology and Health, Department of Fibre and Polymer Technology, KTH Royal Institute of Technology, Stockholm, Sweden.
| |
Collapse
|
40
|
Nguyen TN, Ingle C, Thompson S, Reynolds KA. The genetic landscape of a metabolic interaction. Nat Commun 2024; 15:3351. [PMID: 38637543 PMCID: PMC11026382 DOI: 10.1038/s41467-024-47671-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 04/09/2024] [Indexed: 04/20/2024] Open
Abstract
While much prior work has explored the constraints on protein sequence and evolution induced by physical protein-protein interactions, the sequence-level constraints emerging from non-binding functional interactions in metabolism remain unclear. To quantify how variation in the activity of one enzyme constrains the biochemical parameters and sequence of another, we focus on dihydrofolate reductase (DHFR) and thymidylate synthase (TYMS), a pair of enzymes catalyzing consecutive reactions in folate metabolism. We use deep mutational scanning to quantify the growth rate effect of 2696 DHFR single mutations in 3 TYMS backgrounds under conditions selected to emphasize biochemical epistasis. Our data are well-described by a relatively simple enzyme velocity to growth rate model that quantifies how metabolic context tunes enzyme mutational tolerance. Together our results reveal the structural distribution of epistasis in a metabolic enzyme and establish a foundation for the design of multi-enzyme systems.
Collapse
Affiliation(s)
- Thuy N Nguyen
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Form Bio, Dallas, TX, 75226, USA
| | - Christine Ingle
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Samuel Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
| |
Collapse
|
41
|
Meger AT, Spence MA, Sandhu M, Matthews D, Chen J, Jackson CJ, Raman S. Rugged fitness landscapes minimize promiscuity in the evolution of transcriptional repressors. Cell Syst 2024; 15:374-387.e6. [PMID: 38537640 PMCID: PMC11299162 DOI: 10.1016/j.cels.2024.03.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 09/08/2023] [Accepted: 03/05/2024] [Indexed: 04/20/2024]
Abstract
How a protein's function influences the shape of its fitness landscape, smooth or rugged, is a fundamental question in evolutionary biochemistry. Smooth landscapes arise when incremental mutational steps lead to a progressive change in function, as commonly seen in enzymes and binding proteins. On the other hand, rugged landscapes are poorly understood because of the inherent unpredictability of how sequence changes affect function. Here, we experimentally characterize the entire sequence phylogeny, comprising 1,158 extant and ancestral sequences, of the DNA-binding domain (DBD) of the LacI/GalR transcriptional repressor family. Our analysis revealed an extremely rugged landscape with rapid switching of specificity, even between adjacent nodes. Further, the ruggedness arises due to the necessity of the repressor to simultaneously evolve specificity for asymmetric operators and disfavors potentially adverse regulatory crosstalk. Our study provides fundamental insight into evolutionary, molecular, and biophysical rules of genetic regulation through the lens of fitness landscapes.
Collapse
Affiliation(s)
- Anthony T Meger
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Matthew A Spence
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Mahakaran Sandhu
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia
| | - Dana Matthews
- Research School of Biology, Australian National University, Canberra, ACT 2601, Australia
| | - Jackie Chen
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA
| | - Colin J Jackson
- Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Peptide & Protein Science, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia; ARC Centre of Excellence for Innovations in Synthetic Biology, Research School of Chemistry, Australian National University, Canberra, ACT 2601, Australia.
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Bacteriology, University of Wisconsin-Madison, Madison, WI 53706, USA; Department of Chemical and Biological Engineering, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
42
|
Prywes N, Philips NR, Oltrogge LM, Lindner S, Candace Tsai YC, de Pins B, Cowan AE, Taylor-Kearney LJ, Chang HA, Hall LN, Bellieny-Rabelo D, Nisonoff HM, Weissman RF, Flamholz AI, Ding D, Bhatt AY, Shih PM, Mueller-Cajar O, Milo R, Savage DF. A map of the rubisco biochemical landscape. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.27.559826. [PMID: 38645011 PMCID: PMC11030240 DOI: 10.1101/2023.09.27.559826] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Rubisco is the primary CO2 fixing enzyme of the biosphere yet has slow kinetics. The roles of evolution and chemical mechanism in constraining the sequence landscape of rubisco remain debated. In order to map sequence to function, we developed a massively parallel assay for rubisco using an engineered E. coli where enzyme function is coupled to growth. By assaying >99% of single amino acid mutants across CO2 concentrations, we inferred enzyme velocity and CO2 affinity for thousands of substitutions. We identified many highly conserved positions that tolerate mutation and rare mutations that improve CO2 affinity. These data suggest that non-trivial kinetic improvements are readily accessible and provide a comprehensive sequence-to-function mapping for enzyme engineering efforts.
Collapse
Affiliation(s)
- Noam Prywes
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
| | - Naiya R Philips
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | - Luke M Oltrogge
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | | | - Yi-Chin Candace Tsai
- School of Biological Sciences, Nanyang Technological University; Singapore 637551, Singapore
| | - Benoit de Pins
- Department of Plant and Environmental Sciences, Weizmann Institute of Science; Rehovot 76100, Israel
| | - Aidan E Cowan
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory; Emeryville, CA 94608, USA
| | - Leah J Taylor-Kearney
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Hana A Chang
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Laina N Hall
- Biophysics, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Daniel Bellieny-Rabelo
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- California Institute for Quantitative Biosciences (QB3), University of California; Berkeley, CA 94720, USA
| | - Hunter M Nisonoff
- Center for Computational Biology, University of California, Berkeley; Berkeley, CA, USA
| | - Rachel F Weissman
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | - Avi I Flamholz
- Division of Biology and Biological Engineering, California Institute of Technology; Pasadena, CA 91125
| | - David Ding
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
| | - Abhishek Y Bhatt
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
- School of Medicine, University of California, San Diego; La Jolla, CA 92092, USA
| | - Patrick M Shih
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory; Berkeley, CA 94720, USA
- Feedstocks Division, Joint BioEnergy Institute; Emeryville, CA 94608, USA
| | - Oliver Mueller-Cajar
- School of Biological Sciences, Nanyang Technological University; Singapore 637551, Singapore
| | - Ron Milo
- Department of Plant and Environmental Sciences, Weizmann Institute of Science; Rehovot 76100, Israel
| | - David F Savage
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| |
Collapse
|
43
|
Schmutzer M, Dasmeh P, Wagner A. Frustration can Limit the Adaptation of Promiscuous Enzymes Through Gene Duplication and Specialisation. J Mol Evol 2024; 92:104-120. [PMID: 38470504 PMCID: PMC10978624 DOI: 10.1007/s00239-024-10161-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2023] [Accepted: 02/16/2024] [Indexed: 03/14/2024]
Abstract
Virtually all enzymes catalyse more than one reaction, a phenomenon known as enzyme promiscuity. It is unclear whether promiscuous enzymes are more often generalists that catalyse multiple reactions at similar rates or specialists that catalyse one reaction much more efficiently than other reactions. In addition, the factors that shape whether an enzyme evolves to be a generalist or a specialist are poorly understood. To address these questions, we follow a three-pronged approach. First, we examine the distribution of promiscuity in empirical enzymes reported in the BRENDA database. We find that the promiscuity distribution of empirical enzymes is bimodal. In other words, a large fraction of promiscuous enzymes are either generalists or specialists, with few intermediates. Second, we demonstrate that enzyme biophysics is not sufficient to explain this bimodal distribution. Third, we devise a constraint-based model of promiscuous enzymes undergoing duplication and facing selection pressures favouring subfunctionalization. The model posits the existence of constraints between the catalytic efficiencies of an enzyme for different reactions and is inspired by empirical case studies. The promiscuity distribution predicted by our constraint-based model is consistent with the empirical bimodal distribution. Our results suggest that subfunctionalization is possible and beneficial only in certain enzymes. Furthermore, the model predicts that conflicting constraints and selection pressures can cause promiscuous enzymes to enter a 'frustrated' state, in which competing interactions limit the specialisation of enzymes. We find that frustration can be both a driver and an inhibitor of enzyme evolution by duplication and subfunctionalization. In addition, our model predicts that frustration becomes more likely as enzymes catalyse more reactions, implying that natural selection may prefer catalytically simple enzymes. In sum, our results suggest that frustration may play an important role in enzyme evolution.
Collapse
Affiliation(s)
- Michael Schmutzer
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Pouria Dasmeh
- Center for Human Genetics, Philipps University of Marburg, Marburg, Germany
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland.
- Swiss Institute of Bioinformatics, Lausanne, Switzerland.
- Santa Fe Institute, Santa Fe, NM, USA.
| |
Collapse
|
44
|
Yan W, Li X, Zhao D, Xie M, Li T, Qian L, Ye C, Shi T, Wu L, Wang Y. Advanced strategies in high-throughput droplet screening for enzyme engineering. Biosens Bioelectron 2024; 248:115972. [PMID: 38171222 DOI: 10.1016/j.bios.2023.115972] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 11/05/2023] [Accepted: 12/23/2023] [Indexed: 01/05/2024]
Abstract
Enzymes, as biocatalysts, play a cumulatively important role in environmental purification and industrial production of chemicals and pharmaceuticals. However, natural enzymes are limited by their physiological properties in practice, which need to be modified driven by requirements. Screening and isolating certain enzyme variants or ideal industrial strains with high yielding of target product enzymes is one of the main directions of enzyme engineering research. Droplet-based high-throughput screening (DHTS) technology employs massive monodisperse emulsion droplets as microreactors to achieve single strain encapsulation, as well as continuous monitoring for the inside mutant library. It can effectively sort out strains or enzymes with desired characteristics, offering a throughput of 108 events per hour. Much of the early literature focused on screening various engineered strains or designing signalling sorting strategies based on DHTS technology. However, the field of enzyme engineering lacks a comprehensive overview of advanced methods for microfluidic droplets and their cutting-edge developments in generation and manipulation. This review emphasizes the advanced strategies and frontiers of microfluidic droplet generation and manipulation facilitating enzyme engineering development. We also introduce design for various screening signals that cooperate with DHTS and devote to enzyme engineering.
Collapse
Affiliation(s)
- Wenxin Yan
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Xiang Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Danshan Zhao
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Meng Xie
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Ting Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Lu Qian
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China
| | - Chao Ye
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China; Ministry of Education Key Laboratory of NSLSCS, Nanjing Normal University, Nanjing 210046, China.
| | - Tianqiong Shi
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China.
| | - Lina Wu
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China; Food Laboratory of Zhongyuan, Luohe, 462300, Henan, China.
| | - Yuetong Wang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, Nanjing 210046, China.
| |
Collapse
|
45
|
Lidbury IDEA, Hitchcock A, Groenhof SRM, Connolly AN, Moushtaq L. New insights in bacterial organophosphorus cycling: From human pathogens to environmental bacteria. Adv Microb Physiol 2024; 84:1-49. [PMID: 38821631 DOI: 10.1016/bs.ampbs.2023.12.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/02/2024]
Abstract
In terrestrial and aquatic ecosystems, phosphorus (P) availability controls primary production, with consequences for climate regulation and global food security. Understanding the microbial controls on the global P cycle is a prerequisite for minimising our reliance on non-renewable phosphate rock reserves and reducing pollution associated with excessive P fertiliser use. This recognised importance has reinvigorated research into microbial P cycling, which was pioneered over 75 years ago through the study of human pathogenic bacteria-host interactions. Immobilised organic P represents a significant fraction of the total P pool. Hence, microbes have evolved a plethora of mechanisms to transform this fraction into labile inorganic phosphate, the building block for numerous biological molecules. The 'genomics era' has revealed an extraordinary diversity of organic P cycling genes exist in the environment and studies going 'back to the lab' are determining how this diversity relates to function. Through this integrated approach, many hitherto unknown genes and proteins that are involved in microbial P cycling have been discovered. Not only do these fundamental discoveries push the frontier of our knowledge, but several examples also provide exciting opportunities for biotechnology and present possible solutions for improving the sustainability of how we grow our food, both locally and globally. In this review, we provide a comprehensive overview of bacterial organic P cycling, covering studies on human pathogens and how this knowledge is informing new discoveries in environmental microbiology.
Collapse
Affiliation(s)
- Ian D E A Lidbury
- Molecular Microbiology - Biochemistry and Disease, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom.
| | - Andrew Hitchcock
- Molecular Microbiology - Biochemistry and Disease, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom; Plants, Photosynthesis, and Soil, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom
| | - Sophie R M Groenhof
- Molecular Microbiology - Biochemistry and Disease, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom
| | - Alex N Connolly
- Molecular Microbiology - Biochemistry and Disease, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom
| | - Laila Moushtaq
- Molecular Microbiology - Biochemistry and Disease, School of Biosciences, The University of Sheffield, Sheffield, United Kingdom
| |
Collapse
|
46
|
Baghdassarian HM, Lewis NE. Resource allocation in mammalian systems. Biotechnol Adv 2024; 71:108305. [PMID: 38215956 PMCID: PMC11182366 DOI: 10.1016/j.biotechadv.2023.108305] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Revised: 12/17/2023] [Accepted: 12/18/2023] [Indexed: 01/14/2024]
Abstract
Cells execute biological functions to support phenotypes such as growth, migration, and secretion. Complementarily, each function of a cell has resource costs that constrain phenotype. Resource allocation by a cell allows it to manage these costs and optimize their phenotypes. In fact, the management of resource constraints (e.g., nutrient availability, bioenergetic capacity, and macromolecular machinery production) shape activity and ultimately impact phenotype. In mammalian systems, quantification of resource allocation provides important insights into higher-order multicellular functions; it shapes intercellular interactions and relays environmental cues for tissues to coordinate individual cells to overcome resource constraints and achieve population-level behavior. Furthermore, these constraints, objectives, and phenotypes are context-dependent, with cells adapting their behavior according to their microenvironment, resulting in distinct steady-states. This review will highlight the biological insights gained from probing resource allocation in mammalian cells and tissues.
Collapse
Affiliation(s)
- Hratch M Baghdassarian
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA 92093, USA; Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, USA; Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
47
|
Swint-Kruse L, Fenton AW. Rheostats, toggles, and neutrals, Oh my! A new framework for understanding how amino acid changes modulate protein function. J Biol Chem 2024; 300:105736. [PMID: 38336297 PMCID: PMC10914490 DOI: 10.1016/j.jbc.2024.105736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
Advances in personalized medicine and protein engineering require accurately predicting outcomes of amino acid substitutions. Many algorithms correctly predict that evolutionarily-conserved positions show "toggle" substitution phenotypes, which is defined when a few substitutions at that position retain function. In contrast, predictions often fail for substitutions at the less-studied "rheostat" positions, which are defined when different amino acid substitutions at a position sample at least half of the possible functional range. This review describes efforts to understand the impact and significance of rheostat positions: (1) They have been observed in globular soluble, integral membrane, and intrinsically disordered proteins; within single proteins, their prevalence can be up to 40%. (2) Substitutions at rheostat positions can have biological consequences and ∼10% of substitutions gain function. (3) Although both rheostat and "neutral" (defined when all substitutions exhibit wild-type function) positions are nonconserved, the two classes have different evolutionary signatures. (4) Some rheostat positions have pleiotropic effects on function, simultaneously modulating multiple parameters (e.g., altering both affinity and allosteric coupling). (5) In structural studies, substitutions at rheostat positions appear to cause only local perturbations; the overall conformations appear unchanged. (6) Measured functional changes show promising correlations with predicted changes in protein dynamics; the emergent properties of predicted, dynamically coupled amino acid networks might explain some of the complex functional outcomes observed when substituting rheostat positions. Overall, rheostat positions provide unique opportunities for using single substitutions to tune protein function. Future studies of these positions will yield important insights into the protein sequence/function relationship.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
48
|
Holehouse AS, Kragelund BB. The molecular basis for cellular function of intrinsically disordered protein regions. Nat Rev Mol Cell Biol 2024; 25:187-211. [PMID: 37957331 PMCID: PMC11459374 DOI: 10.1038/s41580-023-00673-0] [Citation(s) in RCA: 193] [Impact Index Per Article: 193.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/26/2023] [Indexed: 11/15/2023]
Abstract
Intrinsically disordered protein regions exist in a collection of dynamic interconverting conformations that lack a stable 3D structure. These regions are structurally heterogeneous, ubiquitous and found across all kingdoms of life. Despite the absence of a defined 3D structure, disordered regions are essential for cellular processes ranging from transcriptional control and cell signalling to subcellular organization. Through their conformational malleability and adaptability, disordered regions extend the repertoire of macromolecular interactions and are readily tunable by their structural and chemical context, making them ideal responders to regulatory cues. Recent work has led to major advances in understanding the link between protein sequence and conformational behaviour in disordered regions, yet the link between sequence and molecular function is less well defined. Here we consider the biochemical and biophysical foundations that underlie how and why disordered regions can engage in productive cellular functions, provide examples of emerging concepts and discuss how protein disorder contributes to intracellular information processing and regulation of cellular function.
Collapse
Affiliation(s)
- Alex S Holehouse
- Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, St Louis, MO, USA.
- Center for Biomolecular Condensates, Washington University in St Louis, St Louis, MO, USA.
| | - Birthe B Kragelund
- REPIN, Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
49
|
Vanella R, Küng C, Schoepfer AA, Doffini V, Ren J, Nash MA. Understanding activity-stability tradeoffs in biocatalysts by enzyme proximity sequencing. Nat Commun 2024; 15:1807. [PMID: 38418512 PMCID: PMC10902396 DOI: 10.1038/s41467-024-45630-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2023] [Accepted: 01/26/2024] [Indexed: 03/01/2024] Open
Abstract
Understanding the complex relationships between enzyme sequence, folding stability and catalytic activity is crucial for applications in industry and biomedicine. However, current enzyme assay technologies are limited by an inability to simultaneously resolve both stability and activity phenotypes and to couple these to gene sequences at large scale. Here we present the development of enzyme proximity sequencing, a deep mutational scanning method that leverages peroxidase-mediated radical labeling with single cell fidelity to dissect the effects of thousands of mutations on stability and catalytic activity of oxidoreductase enzymes in a single experiment. We use enzyme proximity sequencing to analyze how 6399 missense mutations influence folding stability and catalytic activity in a D-amino acid oxidase from Rhodotorula gracilis. The resulting datasets demonstrate activity-based constraints that limit folding stability during natural evolution, and identify hotspots distant from the active site as candidates for mutations that improve catalytic activity without sacrificing stability. Enzyme proximity sequencing can be extended to other enzyme classes and provides valuable insights into biophysical principles governing enzyme structure and function.
Collapse
Affiliation(s)
- Rosario Vanella
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland.
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.
| | - Christoph Küng
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Alexandre A Schoepfer
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland
- Institute of Chemical Sciences and Engineering, École Polytechnique Fédérale de Lausanne (EPFL), 1015, Lausanne, Switzerland
- National Center for Competence in Research (NCCR), Catalysis, École Polytechnique Fédérale de Lausanne (EPFL), 1015, Lausanne, Switzerland
| | - Vanni Doffini
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Jin Ren
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland
| | - Michael A Nash
- Institute of Physical Chemistry, Department of Chemistry, University of Basel, 4058, Basel, Switzerland.
- Department of Biosystems Science and Engineering, ETH Zurich, 4058, Basel, Switzerland.
- National Center for Competence in Research (NCCR), Molecular Systems Engineering, 4058, Basel, Switzerland.
- Swiss Nanoscience Institute, 4056, Basel, Switzerland.
| |
Collapse
|
50
|
Yang J, Li FZ, Arnold FH. Opportunities and Challenges for Machine Learning-Assisted Enzyme Engineering. ACS CENTRAL SCIENCE 2024; 10:226-241. [PMID: 38435522 PMCID: PMC10906252 DOI: 10.1021/acscentsci.3c01275] [Citation(s) in RCA: 25] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2023] [Revised: 12/26/2023] [Accepted: 01/16/2024] [Indexed: 03/05/2024]
Abstract
Enzymes can be engineered at the level of their amino acid sequences to optimize key properties such as expression, stability, substrate range, and catalytic efficiency-or even to unlock new catalytic activities not found in nature. Because the search space of possible proteins is vast, enzyme engineering usually involves discovering an enzyme starting point that has some level of the desired activity followed by directed evolution to improve its "fitness" for a desired application. Recently, machine learning (ML) has emerged as a powerful tool to complement this empirical process. ML models can contribute to (1) starting point discovery by functional annotation of known protein sequences or generating novel protein sequences with desired functions and (2) navigating protein fitness landscapes for fitness optimization by learning mappings between protein sequences and their associated fitness values. In this Outlook, we explain how ML complements enzyme engineering and discuss its future potential to unlock improved engineering outcomes.
Collapse
Affiliation(s)
- Jason Yang
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Francesca-Zhoufan Li
- Division
of Biology and Biological Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| | - Frances H. Arnold
- Division
of Chemistry and Chemical Engineering, California
Institute of Technology, Pasadena, California 91125, United States
- Division
of Biology and Biological Engineering, California
Institute of Technology, Pasadena, California 91125, United States
| |
Collapse
|