1
|
Campitelli P, Kazan IC, Hamilton S, Ozkan SB. Dynamic Allostery: Evolution's Double-Edged Sword in Protein Function and Disease. J Mol Biol 2025:169175. [PMID: 40286867 DOI: 10.1016/j.jmb.2025.169175] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/05/2025] [Revised: 04/21/2025] [Accepted: 04/21/2025] [Indexed: 04/29/2025]
Abstract
Allostery is a core mechanism in biology that allows proteins to communicate and regulate activity over long structural distances. While classical models of allostery focus on conformational changes triggered by ligand binding, dynamic allostery-where protein function is modulated through alterations in thermal fluctuations without major conformational shifts-has emerged as a critical evolutionary mechanism. This review explores how evolution leverages dynamic allostery to fine-tune protein function through subtle mutations at distal sites, preserving core structural architecture while dramatically altering functional properties. Using a combination of computational approaches including Dynamic Flexibility Index (DFI), Dynamic Coupling Index (DCI), and vibrational density of states (VDOS) analysis, we demonstrate that functional adaptations in proteins often involve "hinge-shift" mechanisms, where redistribution of rigid and flexible regions modulates collective motions without changing the overall fold. This evolutionary principle is a double-edged sword: the same mechanisms that enable functional innovation also create vulnerabilities that can be exploited in disease states. Disease-associated variants frequently occur at positions highly coupled to functional sites despite being physically distant, forming Dynamic Allosteric Residue Couples (DARC sites). We demonstrate applications of these principles in understanding viral evolution, drug resistance, and capsid assembly dynamics. Understanding dynamic allostery provides critical insights into protein evolution and offers new avenues for therapeutic interventions targeting allosteric regulation.
Collapse
Affiliation(s)
- Paul Campitelli
- Department of Physics, Arizona State University, Tempe, AZ, United States; Center for Biological Physics, Arizona State University, Tempe, AZ, United States
| | - I Can Kazan
- Department of Physics, Arizona State University, Tempe, AZ, United States; Center for Biological Physics, Arizona State University, Tempe, AZ, United States
| | - Sean Hamilton
- Department of Physics, Arizona State University, Tempe, AZ, United States; Center for Biological Physics, Arizona State University, Tempe, AZ, United States
| | - S Banu Ozkan
- Department of Physics, Arizona State University, Tempe, AZ, United States; Center for Biological Physics, Arizona State University, Tempe, AZ, United States.
| |
Collapse
|
2
|
Jena C, Chinnaraj S, Deolankar S, Matange N. Proteostasis modulates gene dosage evolution in antibiotic-resistant bacteria. eLife 2025; 13:RP99785. [PMID: 40073078 PMCID: PMC11903035 DOI: 10.7554/elife.99785] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/14/2025] Open
Abstract
Evolution of gene expression frequently drives antibiotic resistance in bacteria. We had previously (Patel and Matange, eLife, 2021) shown that, in Escherichia coli, mutations at the mgrB locus were beneficial under trimethoprim exposure and led to overexpression of dihydrofolate reductase (DHFR), encoded by the folA gene. Here, we show that DHFR levels are further enhanced by spontaneous duplication of a genomic segment encompassing folA and spanning hundreds of kilobases. This duplication was rare in wild-type E. coli. However, its frequency was elevated in a lon-knockout strain, altering the mutational landscape early during trimethoprim adaptation. We then exploit this system to investigate the relationship between trimethoprim pressure and folA copy number. During long-term evolution, folA duplications were frequently reversed. Reversal was slower under antibiotic pressure, first requiring the acquisition of point mutations in DHFR or its promoter. Unexpectedly, despite resistance-conferring point mutations, some populations under high trimethoprim pressure maintained folA duplication to compensate for low abundance DHFR mutants. We find that evolution of gene dosage depends on expression demand, which is generated by antibiotic and exacerbated by proteolysis of drug-resistant mutants of DHFR. We propose a novel role for proteostasis as a determinant of copy number evolution in antibiotic-resistant bacteria.
Collapse
Affiliation(s)
- Chinmaya Jena
- Department of Biology, Indian Institute of Science Education and ResearchPuneIndia
| | - Saillesh Chinnaraj
- Department of Biology, Indian Institute of Science Education and ResearchPuneIndia
| | - Soham Deolankar
- Department of Biology, Indian Institute of Science Education and ResearchPuneIndia
| | - Nishad Matange
- Department of Biology, Indian Institute of Science Education and ResearchPuneIndia
| |
Collapse
|
3
|
Diaz Arenas C, Alvarez M, Wilson RH, Shakhnovich EI, Ogbunugafor CB. Protein Quality Control is a Master Modulator of Molecular Evolution in Bacteria. Genome Biol Evol 2025; 17:evaf010. [PMID: 39837347 PMCID: PMC11789785 DOI: 10.1093/gbe/evaf010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2024] [Revised: 01/05/2025] [Accepted: 01/15/2025] [Indexed: 01/23/2025] Open
Abstract
The bacterial protein quality control (PQC) network comprises a set of genes that promote proteostasis (proteome homeostasis) through proper protein folding and function via chaperones, proteases, and protein translational machinery. It participates in vital cellular processes and influences organismal development and evolution. In this review, we examine the mechanistic bases for how the bacterial PQC network influences molecular evolution. We discuss the relevance of PQC components to contemporary issues in evolutionary biology including epistasis, evolvability, and the navigability of protein space. We examine other areas where proteostasis affects aspects of evolution and physiology, including host-parasite interactions. More generally, we demonstrate that the study of bacterial systems can aid in broader efforts to understand the relationship between genotype and phenotype across the biosphere.
Collapse
Affiliation(s)
- Carolina Diaz Arenas
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| | - Maristella Alvarez
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
| | - Robert H Wilson
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA 02138, USA
| | - C Brandon Ogbunugafor
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520, USA
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, MA 02139, USA
- Santa Fe Institute, Santa Fe, NM 87501, USA
| |
Collapse
|
4
|
Prywes N, Phillips NR, Oltrogge LM, Lindner S, Taylor-Kearney LJ, Tsai YCC, de Pins B, Cowan AE, Chang HA, Wang RZ, Hall LN, Bellieny-Rabelo D, Nisonoff HM, Weissman RF, Flamholz AI, Ding D, Bhatt AY, Mueller-Cajar O, Shih PM, Milo R, Savage DF. A map of the rubisco biochemical landscape. Nature 2025; 638:823-828. [PMID: 39843747 PMCID: PMC11839469 DOI: 10.1038/s41586-024-08455-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2024] [Accepted: 11/26/2024] [Indexed: 01/24/2025]
Abstract
Rubisco is the primary CO2-fixing enzyme of the biosphere1, yet it has slow kinetics2. The roles of evolution and chemical mechanism in constraining its biochemical function remain debated3,4. Engineering efforts aimed at adjusting the biochemical parameters of rubisco have largely failed5, although recent results indicate that the functional potential of rubisco has a wider scope than previously known6. Here we developed a massively parallel assay, using an engineered Escherichia coli7 in which enzyme activity is coupled to growth, to systematically map the sequence-function landscape of rubisco. Composite assay of more than 99% of single-amino acid mutants versus CO2 concentration enabled inference of enzyme velocity and apparent CO2 affinity parameters for thousands of substitutions. This approach identified many highly conserved positions that tolerate mutation and rare mutations that improve CO2 affinity. These data indicate that non-trivial biochemical changes are readily accessible and that the functional distance between rubiscos from diverse organisms can be traversed, laying the groundwork for further enzyme engineering efforts.
Collapse
Affiliation(s)
- Noam Prywes
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
| | - Naiya R Phillips
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Luke M Oltrogge
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | | | - Leah J Taylor-Kearney
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Yi-Chin Candace Tsai
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Benoit de Pins
- Department of Biology, University of Naples Federico II, Naples, Italy
| | - Aidan E Cowan
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory, Emeryville, CA, USA
| | - Hana A Chang
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Renée Z Wang
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
| | - Laina N Hall
- Biophysics, University of California Berkeley, Berkeley, CA, USA
| | - Daniel Bellieny-Rabelo
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- California Institute for Quantitative Biosciences (QB3), University of California Berkeley, Berkeley, CA, USA
| | - Hunter M Nisonoff
- Center for Computational Biology, University of California Berkeley, Berkeley, CA, USA
| | - Rachel F Weissman
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
| | - Avi I Flamholz
- Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA, USA
| | - David Ding
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA
| | - Abhishek Y Bhatt
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA
- School of Medicine, University of California San Diego, La Jolla, CA, USA
| | - Oliver Mueller-Cajar
- School of Biological Sciences, Nanyang Technological University, Singapore, Singapore
| | - Patrick M Shih
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA
- Department of Plant and Microbial Biology, University of California Berkeley, Berkeley, CA, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory, Berkeley, CA, USA
- Feedstocks Division, Joint BioEnergy Institute, Emeryville, CA, USA
| | - Ron Milo
- Department of Plant and Environmental Sciences, Weizmann Institute of Science, Rehovot, Israel
| | - David F Savage
- Innovative Genomics Institute, University of California Berkeley, Berkeley, CA, USA.
- Howard Hughes Medical Institute, University of California Berkeley, Berkeley, CA, USA.
- Department of Molecular and Cell Biology, University of California Berkeley, Berkeley, CA, USA.
| |
Collapse
|
5
|
Romanowicz KJ, Resnick C, Hinton SR, Plesa C. Exploring Antibiotic Resistance in Diverse Homologs of the Dihydrofolate Reductase Protein Family through Broad Mutational Scanning. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.23.634126. [PMID: 39896582 PMCID: PMC11785229 DOI: 10.1101/2025.01.23.634126] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/04/2025]
Abstract
Current antibiotic resistance studies often focus on individual protein variants, neglecting broader protein family dynamics. Dihydrofolate reductase (DHFR), a key antibiotic target, has been extensively studied using deep mutational scanning, yet resistance mechanisms across this diverse protein family remain poorly understood. Using DropSynth, a scalable gene synthesis platform, we designed a library of 1,536 synthetic DHFR homologs representing 778 species of bacteria, archaea, and viruses, including clinically relevant pathogens. A multiplexed in vivo assay tested their ability to restore metabolic function and confer trimethoprim resistance in an E. coli ∆folA strain. Over half of the synthetic homologs rescued the phenotype without supplementation, and mutants with up to five amino acid substitutions increased the rescue rate to 90%, highlighting DHFR's evolutionary resilience. Broad Mutational Scanning (BMS) of homologs and 100,000 mutants provided critical insights into DHFR's fitness landscape and resistance pathways, representing the most extensive analysis of homolog complementation and inhibitor tolerance to date and advancing our understanding of antibiotic resistance mechanisms.
Collapse
Affiliation(s)
- Karl J. Romanowicz
- Department of Bioengineering, Phil and Penny Knight Campus for Accelerating Scientific Impact, University of Oregon, 1505 Franklin Boulevard, Eugene, OR 97403, USA
| | - Carmen Resnick
- Department of Bioengineering, Phil and Penny Knight Campus for Accelerating Scientific Impact, University of Oregon, 1505 Franklin Boulevard, Eugene, OR 97403, USA
| | - Samuel R. Hinton
- Department of Bioengineering, Phil and Penny Knight Campus for Accelerating Scientific Impact, University of Oregon, 1505 Franklin Boulevard, Eugene, OR 97403, USA
| | - Calin Plesa
- Department of Bioengineering, Phil and Penny Knight Campus for Accelerating Scientific Impact, University of Oregon, 1505 Franklin Boulevard, Eugene, OR 97403, USA
| |
Collapse
|
6
|
Atsavapranee B, Sunden F, Herschlag D, Fordyce P. Quantifying protein unfolding kinetics with a high-throughput microfluidic platform. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.01.15.633299. [PMID: 39868203 PMCID: PMC11761748 DOI: 10.1101/2025.01.15.633299] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 01/28/2025]
Abstract
Even after folding, proteins transiently sample unfolded or partially unfolded intermediates, and these species are often at risk of irreversible alteration (e.g. via proteolysis, aggregation, or post-translational modification). Kinetic stability, in addition to thermodynamic stability, can directly impact protein lifetime, abundance, and the formation of alternative, sometimes disruptive states. However, we have very few measurements of protein unfolding rates or how mutations alter these rates, largely due to technical challenges associated with their measurement. To address this need, we developed SPARKfold (Simultaneous Proteolysis Assay Revealing Kinetics of Folding), a microfluidic platform to express, purify, and measure unfolding rate constants for >1000 protein variants in parallel via on-chip native proteolysis. To demonstrate the power and potential of SPARKfold, we determined unfolding rate constants for 1,104 protein samples in parallel. We built a library of 31 dihydrofolate reductase (DHFR) orthologs with up to 78 chamber replicates per variant to provide the statistical power required to evaluate the system's ability to resolve subtle effects. SPARKfold rate constants for 5 constructs agreed with those obtained using traditional techniques across a 150-fold range, validating the accuracy of the technique. Comparisons of mutant kinetic effects via SPARKfold with previously published measurements impacts on folding thermodynamics provided information about the folding transition state and pathways via φ analysis. Overall, SPARKfold enables rapid characterization of protein variants to dissect the nature of the unfolding transition state. In future work, SPARKfold can reveal mutations that drive misfolding and aggregation and enable rational design of kinetically hyperstable variants for industrial use in harsh environments.
Collapse
Affiliation(s)
- B. Atsavapranee
- Department of Bioengineering, Stanford University, Stanford, CA 94305
| | - F. Sunden
- Department of Biochemistry, Stanford University, Stanford, CA 94305
| | - D. Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA 94305
| | - P.M. Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA 94305
- Department of Genetics, Stanford University, Stanford, CA 94305
- Sarafan ChEM-H Institute, Stanford University, Stanford, CA 94305
- Chan Zuckerberg Biohub, Stanford University, Stanford, CA 94305
| |
Collapse
|
7
|
Notbohm J, Perica T. Biochemistry and genetics are coming together to improve our understanding of genotype to phenotype relationships. Curr Opin Struct Biol 2024; 89:102952. [PMID: 39522438 DOI: 10.1016/j.sbi.2024.102952] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2024] [Revised: 10/15/2024] [Accepted: 10/18/2024] [Indexed: 11/16/2024]
Abstract
Since genome sequencing became accessible, determining how specific differences in genotypes lead to complex phenotypes such as disease has become one of the key goals in biomedicine. Predicting effects of sequence variants on cellular or organismal phenotype faces several challenges. First, variants simultaneously affect multiple protein properties and predicting their combined effect is complex. Second, effects of changes in a single protein propagate through the cellular network, which we only partially understand. In this review, we emphasize the importance of both biochemistry and genetics in addressing these challenges. Moreover, we highlight work that blurs the distinction between biochemistry and genetics fields to provide new insights into the genotype-to-phenotype relationships.
Collapse
Affiliation(s)
- Judith Notbohm
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland; Biomolecular Structure and Mechanism PhD Program, Life Science Graduate School Zurich, Switzerland
| | - Tina Perica
- Department of Biochemistry, University of Zurich, Winterthurerstrasse 190, 8057 Zurich, Switzerland.
| |
Collapse
|
8
|
Chen JZ, Bisardi M, Lee D, Cotogno S, Zamponi F, Weigt M, Tokuriki N. Understanding epistatic networks in the B1 β-lactamases through coevolutionary statistical modeling and deep mutational scanning. Nat Commun 2024; 15:8441. [PMID: 39349467 PMCID: PMC11442494 DOI: 10.1038/s41467-024-52614-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Accepted: 09/16/2024] [Indexed: 10/02/2024] Open
Abstract
Throughout evolution, protein families undergo substantial sequence divergence while preserving structure and function. Although most mutations are deleterious, evolution can explore sequence space via epistatic networks of intramolecular interactions that alleviate the harmful mutations. However, comprehensive analysis of such epistatic networks across protein families remains limited. Thus, we conduct a family wide analysis of the B1 metallo-β-lactamases, combining experiments (deep mutational scanning, DMS) on two distant homologs (NDM-1 and VIM-2) and computational analyses (in silico DMS based on Direct Coupling Analysis, DCA) of 100 homologs. The methods jointly reveal and quantify prevalent epistasis, as ~1/3rd of equivalent mutations are epistatic in DMS. From DCA, half of the positions have a >6.5 fold difference in effective number of tolerated mutations across the entire family. Notably, both methods locate residues with the strongest epistasis in regions of intermediate residue burial, suggesting a balance of residue packing and mutational freedom in forming epistatic networks. We identify entrenched WT residues between NDM-1 and VIM-2 in DMS, which display statistically distinct behaviors in DCA from non-entrenched residues. Entrenched residues are not easily compensated by changes in single nearby interactions, reinforcing existing findings where a complex epistatic network compounds smaller effects from many interacting residues.
Collapse
Affiliation(s)
- J Z Chen
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - M Bisardi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - D Lee
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada
| | - S Cotogno
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - F Zamponi
- Laboratoire de Physique de l'Ecole Normale Supérieure, ENS, Université PSL, CNRS, Sorbonne Université, Université de Paris, F-75005, Paris, France
- Dipartimento di Fisica, Sapienza Università di Roma, I-00185, Rome, Italy
| | - M Weigt
- Sorbonne Université, CNRS, Institut de Biologie Paris Seine, Biologie Computationnelle et Quantitative LCQB, F-75005, Paris, France
| | - N Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, BC, Canada.
| |
Collapse
|
9
|
Dall NR, Mendonça CATF, Torres Vera HL, Marqusee S. The importance of the location of the N-terminus in successful protein folding in vivo and in vitro. Proc Natl Acad Sci U S A 2024; 121:e2321999121. [PMID: 39145938 PMCID: PMC11348275 DOI: 10.1073/pnas.2321999121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2023] [Accepted: 07/16/2024] [Indexed: 08/16/2024] Open
Abstract
Protein folding in the cell often begins during translation. Many proteins fold more efficiently cotranslationally than when refolding from a denatured state. Changing the vectorial synthesis of the polypeptide chain through circular permutation could impact functional, soluble protein expression and interactions with cellular proteostasis factors. Here, we measure the solubility and function of every possible circular permutant (CP) of HaloTag in Escherichia coli cell lysate using a gel-based assay, and in living E. coli cells via FACS-seq. We find that 78% of HaloTag CPs retain protein function, though a subset of these proteins are also highly aggregation-prone. We examine the function of each CP in E. coli cells lacking the cotranslational chaperone trigger factor and the intracellular protease Lon and find no significant changes in function as a result of modifying the cellular proteostasis network. Finally, we biophysically characterize two topologically interesting CPs in vitro via circular dichroism and hydrogen-deuterium exchange coupled with mass spectrometry to reveal changes in global stability and folding kinetics with circular permutation. For CP33, we identify a change in the refolding intermediate as compared to wild-type (WT) HaloTag. Finally, we show that the strongest predictor of aggregation-prone expression in cells is the introduction of termini within the refolding intermediate. These results, in addition to our finding that termini insertion within the conformationally restrained core is most disruptive to protein function, indicate that successful folding of circular permutants may depend more on changes in folding pathway and termini insertion in flexible regions than on the availability of proteostasis factors.
Collapse
Affiliation(s)
- Natalie R. Dall
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, CA94720
| | | | - Héctor L. Torres Vera
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
| | - Susan Marqusee
- Department of Molecular and Cell Biology, University of California, Berkeley, CA94720
- California Institute for Quantitative Biosciences, University of California, Berkeley, CA94720
- Department of Chemistry, University of California, Berkeley, CA94720
| |
Collapse
|
10
|
Parkhill SL, Johnson EO. Integrating bacterial molecular genetics with chemical biology for renewed antibacterial drug discovery. Biochem J 2024; 481:839-864. [PMID: 38958473 PMCID: PMC11346456 DOI: 10.1042/bcj20220062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2024] [Revised: 06/20/2024] [Accepted: 06/24/2024] [Indexed: 07/04/2024]
Abstract
The application of dyes to understanding the aetiology of infection inspired antimicrobial chemotherapy and the first wave of antibacterial drugs. The second wave of antibacterial drug discovery was driven by rapid discovery of natural products, now making up 69% of current antibacterial drugs. But now with the most prevalent natural products already discovered, ∼107 new soil-dwelling bacterial species must be screened to discover one new class of natural product. Therefore, instead of a third wave of antibacterial drug discovery, there is now a discovery bottleneck. Unlike natural products which are curated by billions of years of microbial antagonism, the vast synthetic chemical space still requires artificial curation through the therapeutics science of antibacterial drugs - a systematic understanding of how small molecules interact with bacterial physiology, effect desired phenotypes, and benefit the host. Bacterial molecular genetics can elucidate pathogen biology relevant to therapeutics development, but it can also be applied directly to understanding mechanisms and liabilities of new chemical agents with new mechanisms of action. Therefore, the next phase of antibacterial drug discovery could be enabled by integrating chemical expertise with systematic dissection of bacterial infection biology. Facing the ambitious endeavour to find new molecules from nature or new-to-nature which cure bacterial infections, the capabilities furnished by modern chemical biology and molecular genetics can be applied to prospecting for chemical modulators of new targets which circumvent prevalent resistance mechanisms.
Collapse
Affiliation(s)
- Susannah L. Parkhill
- Systems Chemical Biology of Infection and Resistance Laboratory, The Francis Crick Institute, London, U.K
- Faculty of Life Sciences, University College London, London, U.K
| | - Eachan O. Johnson
- Systems Chemical Biology of Infection and Resistance Laboratory, The Francis Crick Institute, London, U.K
- Faculty of Life Sciences, University College London, London, U.K
- Department of Chemistry, Imperial College, London, U.K
- Department of Chemistry, King's College London, London, U.K
| |
Collapse
|
11
|
Nguyen TN, Ingle C, Thompson S, Reynolds KA. The genetic landscape of a metabolic interaction. Nat Commun 2024; 15:3351. [PMID: 38637543 PMCID: PMC11026382 DOI: 10.1038/s41467-024-47671-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2023] [Accepted: 04/09/2024] [Indexed: 04/20/2024] Open
Abstract
While much prior work has explored the constraints on protein sequence and evolution induced by physical protein-protein interactions, the sequence-level constraints emerging from non-binding functional interactions in metabolism remain unclear. To quantify how variation in the activity of one enzyme constrains the biochemical parameters and sequence of another, we focus on dihydrofolate reductase (DHFR) and thymidylate synthase (TYMS), a pair of enzymes catalyzing consecutive reactions in folate metabolism. We use deep mutational scanning to quantify the growth rate effect of 2696 DHFR single mutations in 3 TYMS backgrounds under conditions selected to emphasize biochemical epistasis. Our data are well-described by a relatively simple enzyme velocity to growth rate model that quantifies how metabolic context tunes enzyme mutational tolerance. Together our results reveal the structural distribution of epistasis in a metabolic enzyme and establish a foundation for the design of multi-enzyme systems.
Collapse
Affiliation(s)
- Thuy N Nguyen
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- Form Bio, Dallas, TX, 75226, USA
| | - Christine Ingle
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA
| | - Samuel Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
- The Lyda Hill Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
- The Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, TX, 75390, USA.
| |
Collapse
|
12
|
Prywes N, Philips NR, Oltrogge LM, Lindner S, Candace Tsai YC, de Pins B, Cowan AE, Taylor-Kearney LJ, Chang HA, Hall LN, Bellieny-Rabelo D, Nisonoff HM, Weissman RF, Flamholz AI, Ding D, Bhatt AY, Shih PM, Mueller-Cajar O, Milo R, Savage DF. A map of the rubisco biochemical landscape. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.09.27.559826. [PMID: 38645011 PMCID: PMC11030240 DOI: 10.1101/2023.09.27.559826] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/23/2024]
Abstract
Rubisco is the primary CO2 fixing enzyme of the biosphere yet has slow kinetics. The roles of evolution and chemical mechanism in constraining the sequence landscape of rubisco remain debated. In order to map sequence to function, we developed a massively parallel assay for rubisco using an engineered E. coli where enzyme function is coupled to growth. By assaying >99% of single amino acid mutants across CO2 concentrations, we inferred enzyme velocity and CO2 affinity for thousands of substitutions. We identified many highly conserved positions that tolerate mutation and rare mutations that improve CO2 affinity. These data suggest that non-trivial kinetic improvements are readily accessible and provide a comprehensive sequence-to-function mapping for enzyme engineering efforts.
Collapse
Affiliation(s)
- Noam Prywes
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
| | - Naiya R Philips
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | - Luke M Oltrogge
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | | | - Yi-Chin Candace Tsai
- School of Biological Sciences, Nanyang Technological University; Singapore 637551, Singapore
| | - Benoit de Pins
- Department of Plant and Environmental Sciences, Weizmann Institute of Science; Rehovot 76100, Israel
| | - Aidan E Cowan
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
- Joint BioEnergy Institute, Lawrence Berkeley National Laboratory; Emeryville, CA 94608, USA
| | - Leah J Taylor-Kearney
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Hana A Chang
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Laina N Hall
- Biophysics, University of California, Berkeley; Berkeley, CA 94720, USA
| | - Daniel Bellieny-Rabelo
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- California Institute for Quantitative Biosciences (QB3), University of California; Berkeley, CA 94720, USA
| | - Hunter M Nisonoff
- Center for Computational Biology, University of California, Berkeley; Berkeley, CA, USA
| | - Rachel F Weissman
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| | - Avi I Flamholz
- Division of Biology and Biological Engineering, California Institute of Technology; Pasadena, CA 91125
| | - David Ding
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
| | - Abhishek Y Bhatt
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
- School of Medicine, University of California, San Diego; La Jolla, CA 92092, USA
| | - Patrick M Shih
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Department of Plant and Microbial Biology, University of California, Berkeley; Berkeley, CA 94720, USA
- Environmental Genomics and Systems Biology Division, Lawrence Berkeley National Laboratory; Berkeley, CA 94720, USA
- Feedstocks Division, Joint BioEnergy Institute; Emeryville, CA 94608, USA
| | - Oliver Mueller-Cajar
- School of Biological Sciences, Nanyang Technological University; Singapore 637551, Singapore
| | - Ron Milo
- Department of Plant and Environmental Sciences, Weizmann Institute of Science; Rehovot 76100, Israel
| | - David F Savage
- Innovative Genomics Institute, University of California; Berkeley, California 94720, USA
- Howard Hughes Medical Institute, University of California; Berkeley, California 94720, USA
- Department of Molecular and Cell Biology, University of California; Berkeley, California 94720, USA
| |
Collapse
|
13
|
Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Ada Shaw
- Applied Mathematics, Harvard University
| | | | | | - Mafalda Dias
- Centre for Genomic Regulation, Universitat Pompeu Fabra
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| | | |
Collapse
|
14
|
Notin P, Marks DS, Weitzman R, Gal Y. ProteinNPT: Improving Protein Property Prediction and Design with Non-Parametric Transformers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.06.570473. [PMID: 38106034 PMCID: PMC10723423 DOI: 10.1101/2023.12.06.570473] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Protein design holds immense potential for optimizing naturally occurring proteins, with broad applications in drug discovery, material design, and sustainability. However, computational methods for protein engineering are confronted with significant challenges, such as an expansive design space, sparse functional regions, and a scarcity of available labels. These issues are further exacerbated in practice by the fact most real-life design scenarios necessitate the simultaneous optimization of multiple properties. In this work, we introduce ProteinNPT, a non-parametric transformer variant tailored to protein sequences and particularly suited to label-scarce and multi-task learning settings. We first focus on the supervised fitness prediction setting and develop several cross-validation schemes which support robust performance assessment. We subsequently reimplement prior top-performing baselines, introduce several extensions of these baselines by integrating diverse branches of the protein engineering literature, and demonstrate that ProteinNPT consistently outperforms all of them across a diverse set of protein property prediction tasks. Finally, we demonstrate the value of our approach for iterative protein design across extensive in silico Bayesian optimization and conditional sampling experiments.
Collapse
Affiliation(s)
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| |
Collapse
|
15
|
Papkou A, Garcia-Pastor L, Escudero JA, Wagner A. A rugged yet easily navigable fitness landscape. Science 2023; 382:eadh3860. [PMID: 37995212 DOI: 10.1126/science.adh3860] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2023] [Accepted: 09/29/2023] [Indexed: 11/25/2023]
Abstract
Fitness landscape theory predicts that rugged landscapes with multiple peaks impair Darwinian evolution, but experimental evidence is limited. In this study, we used genome editing to map the fitness of >260,000 genotypes of the key metabolic enzyme dihydrofolate reductase in the presence of the antibiotic trimethoprim, which targets this enzyme. The resulting landscape is highly rugged and harbors 514 fitness peaks. However, its highest peaks are accessible to evolving populations via abundant fitness-increasing paths. Different peaks share large basins of attraction that render the outcome of adaptive evolution highly contingent on chance events. Our work shows that ruggedness need not be an obstacle to Darwinian evolution but can reduce its predictability. If true in general, the complexity of optimization problems on realistic landscapes may require reappraisal.
Collapse
Affiliation(s)
- Andrei Papkou
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Lucia Garcia-Pastor
- Departamento de Sanidad Animal and VISAVET Health Surveillance Centre, Universidad Complutense de Madrid, Madrid, Spain
| | - José Antonio Escudero
- Departamento de Sanidad Animal and VISAVET Health Surveillance Centre, Universidad Complutense de Madrid, Madrid, Spain
| | - Andreas Wagner
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
- Swiss Institute of Bioinformatics, Lausanne, Switzerland
- The Santa Fe Institute, Santa Fe, NM, USA
| |
Collapse
|
16
|
Kouba P, Kohout P, Haddadi F, Bushuiev A, Samusevich R, Sedlar J, Damborsky J, Pluskal T, Sivic J, Mazurenko S. Machine Learning-Guided Protein Engineering. ACS Catal 2023; 13:13863-13895. [PMID: 37942269 PMCID: PMC10629210 DOI: 10.1021/acscatal.3c02743] [Citation(s) in RCA: 45] [Impact Index Per Article: 22.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 09/20/2023] [Indexed: 11/10/2023]
Abstract
Recent progress in engineering highly promising biocatalysts has increasingly involved machine learning methods. These methods leverage existing experimental and simulation data to aid in the discovery and annotation of promising enzymes, as well as in suggesting beneficial mutations for improving known targets. The field of machine learning for protein engineering is gathering steam, driven by recent success stories and notable progress in other areas. It already encompasses ambitious tasks such as understanding and predicting protein structure and function, catalytic efficiency, enantioselectivity, protein dynamics, stability, solubility, aggregation, and more. Nonetheless, the field is still evolving, with many challenges to overcome and questions to address. In this Perspective, we provide an overview of ongoing trends in this domain, highlight recent case studies, and examine the current limitations of machine learning-based methods. We emphasize the crucial importance of thorough experimental validation of emerging models before their use for rational protein design. We present our opinions on the fundamental problems and outline the potential directions for future research.
Collapse
Affiliation(s)
- Petr Kouba
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Faculty of
Electrical Engineering, Czech Technical
University in Prague, Technicka 2, 166 27 Prague 6, Czech Republic
| | - Pavel Kohout
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Faraneh Haddadi
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Anton Bushuiev
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Raman Samusevich
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Jiri Sedlar
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Jiri Damborsky
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| | - Tomas Pluskal
- Institute
of Organic Chemistry and Biochemistry of the Czech Academy of Sciences, Flemingovo nám. 2, 160 00 Prague 6, Czech Republic
| | - Josef Sivic
- Czech Institute
of Informatics, Robotics and Cybernetics, Czech Technical University in Prague, Jugoslavskych partyzanu 1580/3, 160 00 Prague 6, Czech Republic
| | - Stanislav Mazurenko
- Loschmidt
Laboratories, Department of Experimental Biology and RECETOX, Faculty
of Science, Masaryk University, Kamenice 5, 625 00 Brno, Czech
Republic
- International
Clinical Research Center, St. Anne’s
University Hospital Brno, Pekarska 53, 656 91 Brno, Czech Republic
| |
Collapse
|
17
|
Abstract
Understanding the factors that shape viral evolution is critical for developing effective antiviral strategies, accurately predicting viral evolution, and preventing pandemics. One fundamental determinant of viral evolution is the interplay between viral protein biophysics and the host machineries that regulate protein folding and quality control. Most adaptive mutations in viruses are biophysically deleterious, resulting in a viral protein product with folding defects. In cells, protein folding is assisted by a dynamic system of chaperones and quality control processes known as the proteostasis network. Host proteostasis networks can determine the fates of viral proteins with biophysical defects, either by assisting with folding or by targeting them for degradation. In this review, we discuss and analyze new discoveries revealing that host proteostasis factors can profoundly shape the sequence space accessible to evolving viral proteins. We also discuss the many opportunities for research progress proffered by the proteostasis perspective on viral evolution and adaptation.
Collapse
Affiliation(s)
- Jimin Yoon
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - Jessica E Patrick
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| | - C Brandon Ogbunugafor
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, Connecticut, USA
- Santa Fe Institute, Santa Fe, New Mexico, USA
| | - Matthew D Shoulders
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA;
| |
Collapse
|
18
|
Nguyen TN, Ingle C, Thompson S, Reynolds KA. The Genetic Landscape of a Metabolic Interaction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.28.542639. [PMID: 37645784 PMCID: PMC10461916 DOI: 10.1101/2023.05.28.542639] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 08/31/2023]
Abstract
Enzyme abundance, catalytic activity, and ultimately sequence are all shaped by the need of growing cells to maintain metabolic flux while minimizing accumulation of deleterious intermediates. While much prior work has explored the constraints on protein sequence and evolution induced by physical protein-protein interactions, the sequence-level constraints emerging from non-binding functional interactions in metabolism remain unclear. To quantify how variation in the activity of one enzyme constrains the biochemical parameters and sequence of another, we focused on dihydrofolate reductase (DHFR) and thymidylate synthase (TYMS), a pair of enzymes catalyzing consecutive reactions in folate metabolism. We used deep mutational scanning to quantify the growth rate effect of 2,696 DHFR single mutations in 3 TYMS backgrounds under conditions selected to emphasize biochemical epistasis. Our data are well-described by a relatively simple enzyme velocity to growth rate model that quantifies how metabolic context tunes enzyme mutational tolerance. Together our results reveal the structural distribution of epistasis in a metabolic enzyme and establish a foundation for the design of multi-enzyme systems.
Collapse
Affiliation(s)
- Thuy N. Nguyen
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
| | - Christine Ingle
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
| | - Samuel Thompson
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158
| | - Kimberly A. Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Bioinformatics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
- Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, USA, 75390
| |
Collapse
|
19
|
Cetin E, Guclu TF, Kantarcioglu I, Gaszek IK, Toprak E, Atilgan AR, Dedeoglu B, Atilgan C. Kinetic Barrier to Enzyme Inhibition Is Manipulated by Dynamical Local Interactions in E. coli DHFR. J Chem Inf Model 2023; 63:4839-4849. [PMID: 37491825 PMCID: PMC10428214 DOI: 10.1021/acs.jcim.3c00818] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2023] [Indexed: 07/27/2023]
Abstract
Dihydrofolate reductase (DHFR) is an important drug target and a highly studied model protein for understanding enzyme dynamics. DHFR's crucial role in folate synthesis renders it an ideal candidate to understand protein function and protein evolution mechanisms. In this study, to understand how a newly proposed DHFR inhibitor, 4'-deoxy methyl trimethoprim (4'-DTMP), alters evolutionary trajectories, we studied interactions that lead to its superior performance over that of trimethoprim (TMP). To elucidate the inhibition mechanism of 4'-DTMP, we first confirmed, both computationally and experimentally, that the relative binding free energy cost for the mutation of TMP and 4'-DTMP is the same, pointing the origin of the characteristic differences to be kinetic rather than thermodynamic. We then employed an interaction-based analysis by focusing first on the active site and then on the whole enzyme. We confirmed that the polar modification in 4'-DTMP induces additional local interactions with the enzyme, particularly, the M20 loop. These changes are propagated to the whole enzyme as shifts in the hydrogen bond networks. To shed light on the allosteric interactions, we support our analysis with network-based community analysis and show that segmentation of the loop domain of inhibitor-bound DHFR must be avoided by a successful inhibitor.
Collapse
Affiliation(s)
- Ebru Cetin
- Faculty
of Engineering and Natural Sciences, Sabanci
University, Tuzla 34956, Istanbul, Turkey
| | - Tandac F. Guclu
- Faculty
of Engineering and Natural Sciences, Sabanci
University, Tuzla 34956, Istanbul, Turkey
| | - Isik Kantarcioglu
- Faculty
of Engineering and Natural Sciences, Sabanci
University, Tuzla 34956, Istanbul, Turkey
- Department
of Pharmacology, University of Texas Southwestern
Medical Center, Dallas 75390, Texas, United States
| | - Ilona K. Gaszek
- Department
of Pharmacology, University of Texas Southwestern
Medical Center, Dallas 75390, Texas, United States
| | - Erdal Toprak
- Department
of Pharmacology, University of Texas Southwestern
Medical Center, Dallas 75390, Texas, United States
| | - Ali Rana Atilgan
- Faculty
of Engineering and Natural Sciences, Sabanci
University, Tuzla 34956, Istanbul, Turkey
| | - Burcu Dedeoglu
- Department
of Chemistry, Gebze Technical University, Gebze 41400, Kocaeli, Turkey
| | - Canan Atilgan
- Faculty
of Engineering and Natural Sciences, Sabanci
University, Tuzla 34956, Istanbul, Turkey
| |
Collapse
|
20
|
Kazan IC, Mills JH, Ozkan SB. Allosteric regulatory control in dihydrofolate reductase is revealed by dynamic asymmetry. Protein Sci 2023; 32:e4700. [PMID: 37313628 PMCID: PMC10357497 DOI: 10.1002/pro.4700] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2022] [Revised: 06/05/2023] [Accepted: 06/06/2023] [Indexed: 06/15/2023]
Abstract
We investigated the relationship between mutations and dynamics in Escherichia coli dihydrofolate reductase (DHFR) using computational methods. Our study focused on the M20 and FG loops, which are known to be functionally important and affected by mutations distal to the loops. We used molecular dynamics simulations and developed position-specific metrics, including the dynamic flexibility index (DFI) and dynamic coupling index (DCI), to analyze the dynamics of wild-type DHFR and compared our results with existing deep mutational scanning data. Our analysis showed a statistically significant association between DFI and mutational tolerance of the DHFR positions, indicating that DFI can predict functionally beneficial or detrimental substitutions. We also applied an asymmetric version of our DCI metric (DCIasym ) to DHFR and found that certain distal residues control the dynamics of the M20 and FG loops, whereas others are controlled by them. Residues that are suggested to control the M20 and FG loops by our DCIasym metric are evolutionarily nonconserved; mutations at these sites can enhance enzyme activity. On the other hand, residues controlled by the loops are mostly deleterious to function when mutated and are also evolutionary conserved. Our results suggest that dynamics-based metrics can identify residues that explain the relationship between mutation and protein function or can be targeted to rationally engineer enzymes with enhanced activity.
Collapse
Affiliation(s)
- I. Can Kazan
- Center for Biological Physics and Department of PhysicsArizona State UniversityTempeArizonaUSA
| | - Jeremy H. Mills
- School of Molecular Sciences and The Biodesign Center for Molecular Design and BiomimeticsArizona State UniversityTempeArizonaUSA
| | - S. Banu Ozkan
- Center for Biological Physics and Department of PhysicsArizona State UniversityTempeArizonaUSA
| |
Collapse
|
21
|
Nguyen V, Ahler E, Sitko KA, Stephany JJ, Maly DJ, Fowler DM. Molecular determinants of Hsp90 dependence of Src kinase revealed by deep mutational scanning. Protein Sci 2023; 32:e4656. [PMID: 37167432 PMCID: PMC10273359 DOI: 10.1002/pro.4656] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2023] [Revised: 05/05/2023] [Accepted: 05/10/2023] [Indexed: 05/13/2023]
Abstract
Hsp90 is a molecular chaperone involved in the refolding and activation of numerous protein substrates referred to as clients. While the molecular determinants of Hsp90 client specificity are poorly understood and limited to a handful of client proteins, strong clients are thought to be destabilized and conformationally extended. Here, we measured the phosphotransferase activity of 3929 variants of the tyrosine kinase Src in both the presence and absence of an Hsp90 inhibitor. We identified 84 previously unknown functionally dependent client variants. Unexpectedly, many destabilized or extended variants were not functionally dependent on Hsp90. Instead, functionally dependent client variants were clustered in the αF pocket and β1-β2 strand regions of Src, which have yet to be described in driving Hsp90 dependence. Hsp90 dependence was also strongly correlated with kinase activity. We found that a combination of activation, global extension, and general conformational flexibility, primarily induced by variants at the αF pocket and β1-β2 strands, was necessary to render Src functionally dependent on Hsp90. Moreover, the degree of activation and flexibility required to transform Src into a functionally dependent client varied with variant location, suggesting that a combination of regulatory domain disengagement and catalytic domain flexibility are required for chaperone dependence. Thus, by studying the chaperone dependence of a massive number of variants, we highlight factors driving Hsp90 client specificity and propose a model of chaperone-kinase interactions.
Collapse
Affiliation(s)
- Vanessa Nguyen
- Department of BioengineeringUniversity of WashingtonSeattleWashingtonUSA
| | - Ethan Ahler
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUSA
| | - Katherine A. Sitko
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUSA
| | - Jason J. Stephany
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUSA
| | - Dustin J. Maly
- Department of ChemistryUniversity of WashingtonSeattleWashingtonUSA
| | - Douglas M. Fowler
- Department of BioengineeringUniversity of WashingtonSeattleWashingtonUSA
- Department of Genome SciencesUniversity of WashingtonSeattleWashingtonUSA
| |
Collapse
|
22
|
Abildgaard AB, Nielsen SV, Bernstein I, Stein A, Lindorff-Larsen K, Hartmann-Petersen R. Lynch syndrome, molecular mechanisms and variant classification. Br J Cancer 2023; 128:726-734. [PMID: 36434153 PMCID: PMC9978028 DOI: 10.1038/s41416-022-02059-z] [Citation(s) in RCA: 24] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2022] [Revised: 10/31/2022] [Accepted: 11/02/2022] [Indexed: 11/27/2022] Open
Abstract
Patients with the heritable cancer disease, Lynch syndrome, carry germline variants in the MLH1, MSH2, MSH6 and PMS2 genes, encoding the central components of the DNA mismatch repair system. Loss-of-function variants disrupt the DNA mismatch repair system and give rise to a detrimental increase in the cellular mutational burden and cancer development. The treatment prospects for Lynch syndrome rely heavily on early diagnosis; however, accurate diagnosis is inextricably linked to correct clinical interpretation of individual variants. Protein variant classification traditionally relies on cumulative information from occurrence in patients, as well as experimental testing of the individual variants. The complexity of variant classification is due to (1) that variants of unknown significance are rare in the population and phenotypic information on the specific variants is missing, and (2) that individual variant testing is challenging, costly and slow. Here, we summarise recent developments in high-throughput technologies and computational prediction tools for the assessment of variants of unknown significance in Lynch syndrome. These approaches may vastly increase the number of interpretable variants and could also provide important mechanistic insights into the disease. These insights may in turn pave the road towards developing personalised treatment approaches for Lynch syndrome.
Collapse
Affiliation(s)
- Amanda B Abildgaard
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Sofie V Nielsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Inge Bernstein
- Department of Surgical Gastroenterology, Aalborg University Hospital, Aalborg, Denmark
- Institute of Clinical Medicine, Aalborg University Hospital, Aalborg University, Aalborg, Denmark
| | - Amelie Stein
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Kresten Lindorff-Larsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| | - Rasmus Hartmann-Petersen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark.
| |
Collapse
|
23
|
Michael E, Saint-Jalme R, Mignon D, Simonson T. Computational protein design repurposed to explore enzyme vitality and help predict antibiotic resistance. Front Mol Biosci 2023; 9:905588. [PMID: 36699702 PMCID: PMC9868620 DOI: 10.3389/fmolb.2022.905588] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2022] [Accepted: 12/19/2022] [Indexed: 01/11/2023] Open
Abstract
In response to antibiotics that inhibit a bacterial enzyme, resistance mutations inevitably arise. Predicting them ahead of time would aid target selection and drug design. The simplest resistance mechanism would be to reduce antibiotic binding without sacrificing too much substrate binding. The property that reflects this is the enzyme "vitality", defined here as the difference between the inhibitor and substrate binding free energies. To predict such mutations, we borrow methodology from computational protein design. We use a Monte Carlo exploration of mutation space and vitality changes, allowing us to rank thousands of mutations and identify ones that might provide resistance through the simple mechanism considered. As an illustration, we chose dihydrofolate reductase, an essential enzyme targeted by several antibiotics. We simulated its complexes with the inhibitor trimethoprim and the substrate dihydrofolate. 20 active site positions were mutated, or "redesigned" individually, then in pairs or quartets. We computed the resulting binding free energy and vitality changes. Out of seven known resistance mutations involving active site positions, five were correctly recovered. Ten positions exhibited mutations with significant predicted vitality gains. Direct couplings between designed positions were predicted to be small, which reduces the combinatorial complexity of the mutation space to be explored. It also suggests that over the course of evolution, resistance mutations involving several positions do not need the underlying point mutations to arise all at once: they can appear and become fixed one after the other.
Collapse
|
24
|
Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022; 12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
BACKGROUND Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
Collapse
Affiliation(s)
- Yunfan Fu
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Justin Bedő
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Anthony T Papenfuss
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
- Peter MacCallum Cancer Centre, Melbourne, Victoria 3000, Australia
| | - Alan F Rubin
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| |
Collapse
|
25
|
Cetin E, Atilgan AR, Atilgan C. DHFR Mutants Modulate Their Synchronized Dynamics with the Substrate by Shifting Hydrogen Bond Occupancies. J Chem Inf Model 2022; 62:6715-6726. [PMID: 35984987 PMCID: PMC9795552 DOI: 10.1021/acs.jcim.2c00507] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
Antibiotic resistance is a global health problem in which mutations occurring in functional proteins render drugs ineffective. The working mechanisms of the arising mutants are seldom apparent; a methodology to decipher these mechanisms systematically would render devising therapies to control the arising mutational pathways possible. Here we utilize Cα-Cβ bond vector relaxations obtained from moderate length MD trajectories to determine conduits for functionality of the resistance conferring mutants of Escherichia coli dihydrofolate reductase. We find that the whole enzyme is synchronized to the motions of the substrate, irrespective of the mutation introducing gain-of-function or loss-of function. The total coordination of the motions suggests changes in the hydrogen bond dynamics with respect to the wild type as a possible route to determine and classify the mode-of-action of individual mutants. As a result, nine trimethoprim-resistant point mutations arising frequently in evolution experiments are categorized. One group of mutants that display the largest occurrence (L28R, W30G) work directly by modifying the dihydrofolate binding region. Conversely, W30R works indirectly by the formation of the E139-R30 salt bridge which releases energy resulting from tight binding by distorting the binding cavity. A third group (D27E, F153S, I94L) arising as single, resistance invoking mutants in evolution experiment trajectories allosterically and dynamically affects a hydrogen bonding motif formed at residues 59-69-71 which in turn modifies the binding site dynamics. The final group (I5F, A26T, R98P) consists of those mutants that have properties most similar to the wild type; these only appear after one of the other mutants is fixed on the protein structure and therefore display clear epistasis. Thus, we show that the binding event is governed by the entire enzyme dynamics while the binding site residues play gating roles. The adjustments made in the total enzyme in response to point mutations are what make quantifying and pinpointing their effect a hard problem. Here, we show that hydrogen bond dynamics recorded on sub-μs time scales provide the necessary fingerprints to decipher the various mechanisms at play.
Collapse
|
26
|
Radkov A, Sapiro AL, Flores S, Henderson C, Saunders H, Kim R, Massa S, Thompson S, Mateusiak C, Biboy J, Zhao Z, Starita LM, Hatleberg WL, Vollmer W, Russell AB, Simorre JP, Anthony-Cahill S, Brzovic P, Hayes B, Chou S. Antibacterial potency of Type VI amidase effector toxins is dependent on substrate topology and cellular context. eLife 2022; 11:79796. [PMID: 35762582 PMCID: PMC9270033 DOI: 10.7554/elife.79796] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2022] [Accepted: 06/23/2022] [Indexed: 11/15/2022] Open
Abstract
Members of the bacterial T6SS amidase effector (Tae) superfamily of toxins are delivered between competing bacteria to degrade cell wall peptidoglycan. Although Taes share a common substrate, they exhibit distinct antimicrobial potency across different competitor species. To investigate the molecular basis governing these differences, we quantitatively defined the functional determinants of Tae1 from Pseudomonas aeruginosa PAO1 using a combination of nuclear magnetic resonance and a high-throughput in vivo genetic approach called deep mutational scanning (DMS). As expected, combined analyses confirmed the role of critical residues near the Tae1 catalytic center. Unexpectedly, DMS revealed substantial contributions to enzymatic activity from a much larger, ring-like functional hot spot extending around the entire circumference of the enzyme. Comparative DMS across distinct growth conditions highlighted how functional contribution of different surfaces is highly context-dependent, varying alongside composition of targeted cell walls. These observations suggest that Tae1 engages with the intact cell wall network through a more distributed three-dimensional interaction interface than previously appreciated, providing an explanation for observed differences in antimicrobial potency across divergent Gram-negative competitors. Further binding studies of several Tae1 variants with their cognate immunity protein demonstrate that requirements to maintain protection from Tae activity may be a significant constraint on the mutational landscape of tae1 toxicity in the wild. In total, our work reveals that Tae diversification has likely been shaped by multiple independent pressures to maintain interactions with binding partners that vary across bacterial species and conditions.
Collapse
Affiliation(s)
- Atanas Radkov
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Anne L Sapiro
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | | | | | - Hayden Saunders
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Rachel Kim
- Pacific Northwest University of Health Sciences, Yakima, United States
| | - Steven Massa
- Department of Biology, Stanford University, Stanford, United States
| | - Samuel Thompson
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Chase Mateusiak
- Computer Science Department, Washington University in St. Louis, St. Louis, United States
| | - Jacob Biboy
- Centre for Bacterial Cell Biology, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Ziyi Zhao
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Lea M Starita
- Department of Genome Sciences, University of Washington, Seattle, United States
| | | | - Waldemar Vollmer
- Center for Bacterial Cell Biology, Newcastle University, Newcastle upon Tyne, United Kingdom
| | - Alistair B Russell
- Division of Biological Sciences, University of California, San Diego, La Jolla, United States
| | - Jean-Pierre Simorre
- Institut de Biologie Structurale, Université Grenoble Alpes, Grenoble, France
| | | | - Peter Brzovic
- Department of Biochemistry, University of Washington, Seattle, United States
| | - Beth Hayes
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| | - Seemay Chou
- Department of Biochemistry and Biophysics, University of California, San Francisco, San Francisco, United States
| |
Collapse
|
27
|
Kampmeyer C, Larsen-Ledet S, Wagnkilde MR, Michelsen M, Iversen HKM, Nielsen SV, Lindemose S, Caregnato A, Ravid T, Stein A, Teilum K, Lindorff-Larsen K, Hartmann-Petersen R. Disease-linked mutations cause exposure of a protein quality control degron. Structure 2022; 30:1245-1253.e5. [PMID: 35700725 DOI: 10.1016/j.str.2022.05.016] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2021] [Revised: 04/08/2022] [Accepted: 05/18/2022] [Indexed: 10/18/2022]
Abstract
More than half of disease-causing missense variants are thought to lead to protein degradation, but the molecular mechanism of how these variants are recognized by the cell remains enigmatic. Degrons are stretches of amino acids that help mediate recognition by E3 ligases and thus confer protein degradation via the ubiquitin-proteasome system. While degrons that mediate controlled degradation of, for example, signaling components and cell-cycle regulators are well described, so-called protein-quality-control degrons that mediate the degradation of destabilized proteins are poorly understood. Here, we show that disease-linked dihydrofolate reductase (DHFR) missense variants are structurally destabilized and chaperone-dependent proteasome targets. We find two regions in DHFR that act as degrons, and the proteasomal turnover of one of these was dependent on the molecular chaperone Hsp70. Structural analyses by nuclear magnetic resonance (NMR) and hydrogen/deuterium exchange revealed that this degron is buried in wild-type DHFR but becomes transiently exposed in the disease-linked missense variants.
Collapse
Affiliation(s)
- Caroline Kampmeyer
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Sven Larsen-Ledet
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Morten Rose Wagnkilde
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Mathias Michelsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Henriette K M Iversen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Sofie V Nielsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Søren Lindemose
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Alberto Caregnato
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark
| | - Tommer Ravid
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat-Ram, 91904 Jerusalem, Israel
| | - Amelie Stein
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark.
| | - Kaare Teilum
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark.
| | - Kresten Lindorff-Larsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark.
| | - Rasmus Hartmann-Petersen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200 Copenhagen, Denmark.
| |
Collapse
|
28
|
Environmental selection and epistasis in an empirical phenotype-environment-fitness landscape. Nat Ecol Evol 2022; 6:427-438. [PMID: 35210579 DOI: 10.1038/s41559-022-01675-5] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/19/2021] [Accepted: 12/14/2021] [Indexed: 11/08/2022]
Abstract
Fitness landscapes, mappings of genotype/phenotype to their effects on fitness, are invaluable concepts in evolutionary biochemistry. Although widely discussed, measurements of phenotype-fitness landscapes in proteins remain scarce. Here, we quantify all single mutational effects on fitness and phenotype (EC50) of VIM-2 β-lactamase across a 64-fold range of ampicillin concentrations. We then construct a phenotype-fitness landscape that takes variations in environmental selection pressure into account. We found that a simple, empirical landscape accurately models the ~39,000 mutational data points, suggesting that the evolution of VIM-2 can be predicted on the basis of the selection environment. Our landscape provides new quantitative knowledge on the evolution of the β-lactamases and proteins in general, particularly their evolutionary dynamics under subinhibitory antibiotic concentrations, as well as the mechanisms and environmental dependence of non-specific epistasis.
Collapse
|
29
|
The endoplasmic reticulum proteostasis network profoundly shapes the protein sequence space accessible to HIV envelope. PLoS Biol 2022; 20:e3001569. [PMID: 35180219 PMCID: PMC8906867 DOI: 10.1371/journal.pbio.3001569] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/24/2021] [Revised: 03/09/2022] [Accepted: 02/07/2022] [Indexed: 12/27/2022] Open
Abstract
The sequence space accessible to evolving proteins can be enhanced by cellular chaperones that assist biophysically defective clients in navigating complex folding landscapes. It is also possible, at least in theory, for proteostasis mechanisms that promote strict quality control to greatly constrain accessible protein sequence space. Unfortunately, most efforts to understand how proteostasis mechanisms influence evolution rely on artificial inhibition or genetic knockdown of specific chaperones. The few experiments that perturb quality control pathways also generally modulate the levels of only individual quality control factors. Here, we use chemical genetic strategies to tune proteostasis networks via natural stress response pathways that regulate the levels of entire suites of chaperones and quality control mechanisms. Specifically, we upregulate the unfolded protein response (UPR) to test the hypothesis that the host endoplasmic reticulum (ER) proteostasis network shapes the sequence space accessible to human immunodeficiency virus-1 (HIV-1) envelope (Env) protein. Elucidating factors that enhance or constrain Env sequence space is critical because Env evolves extremely rapidly, yielding HIV strains with antibody- and drug-escape mutations. We find that UPR-mediated upregulation of ER proteostasis factors, particularly those controlled by the IRE1-XBP1s UPR arm, globally reduces Env mutational tolerance. Conserved, functionally important Env regions exhibit the largest decreases in mutational tolerance upon XBP1s induction. Our data indicate that this phenomenon likely reflects strict quality control endowed by XBP1s-mediated remodeling of the ER proteostasis environment. Intriguingly, and in contrast, specific regions of Env, including regions targeted by broadly neutralizing antibodies, display enhanced mutational tolerance when XBP1s is induced, hinting at a role for host proteostasis network hijacking in potentiating antibody escape. These observations reveal a key function for proteostasis networks in decreasing instead of expanding the sequence space accessible to client proteins, while also demonstrating that the host ER proteostasis network profoundly shapes the mutational tolerance of Env in ways that could have important consequences for HIV adaptation. The host cell’s endoplasmic reticulum proteostasis network has a profound, constraining impact on the protein sequence space accessible to HIV’s envelope protein, which is a major target of the host’s adaptive immune system; in particular, upregulation of stringent quality control pathways appears to restrict the viability of destabilizing envelope variants.
Collapse
|
30
|
Mokhtari DA, Appel MJ, Fordyce PM, Herschlag D. High throughput and quantitative enzymology in the genomic era. Curr Opin Struct Biol 2021; 71:259-273. [PMID: 34592682 PMCID: PMC8648990 DOI: 10.1016/j.sbi.2021.07.010] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2021] [Accepted: 07/23/2021] [Indexed: 12/28/2022]
Abstract
Accurate predictions from models based on physical principles are the ultimate metric of our biophysical understanding. Although there has been stunning progress toward structure prediction, quantitative prediction of enzyme function has remained challenging. Realizing this goal will require large numbers of quantitative measurements of rate and binding constants and the use of these ground-truth data sets to guide the development and testing of these quantitative models. Ground truth data more closely linked to the underlying physical forces are also desired. Here, we describe technological advances that enable both types of ground truth measurements. These advances allow classic models to be tested, provide novel mechanistic insights, and place us on the path toward a predictive understanding of enzyme structure and function.
Collapse
Affiliation(s)
- D A Mokhtari
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - M J Appel
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA
| | - P M Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA, 94305, USA; ChEM-H Institute, Stanford University, Stanford, CA, 94305, USA; Department of Genetics, Stanford University, Stanford, CA, 94305, USA; Chan Zuckerberg Biohub San Francisco, CA, 94110, USA.
| | - D Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA, 94305, USA; Department of Chemical Engineering, Stanford University, Stanford, CA, 94305, USA; ChEM-H Institute, Stanford University, Stanford, CA, 94305, USA.
| |
Collapse
|
31
|
Zhao VY, Rodrigues JV, Lozovsky ER, Hartl DL, Shakhnovich EI. Switching an active site helix in dihydrofolate reductase reveals limits to subdomain modularity. Biophys J 2021; 120:4738-4750. [PMID: 34571014 PMCID: PMC8595743 DOI: 10.1016/j.bpj.2021.09.032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2021] [Revised: 09/14/2021] [Accepted: 09/22/2021] [Indexed: 11/23/2022] Open
Abstract
To what degree are individual structural elements within proteins modular such that similar structures from unrelated proteins can be interchanged? We study subdomain modularity by creating 20 chimeras of an enzyme, Escherichia coli dihydrofolate reductase (DHFR), in which a catalytically important, 10-residue α-helical sequence is replaced by α-helical sequences from a diverse set of proteins. The chimeras stably fold but have a range of diminished thermal stabilities and catalytic activities. Evolutionary coupling analysis indicates that the residues of this α-helix are under selection pressure to maintain catalytic activity in DHFR. Reversion to phenylalanine at key position 31 was found to partially restore catalytic activity, which could be explained by evolutionary coupling values. We performed molecular dynamics simulations using replica exchange with solute tempering. Chimeras with low catalytic activity exhibit nonhelical conformations that block the binding site and disrupt the positioning of the catalytically essential residue D27. Simulation observables and in vitro measurements of thermal stability and substrate-binding affinity are strongly correlated. Several E. coli strains with chromosomally integrated chimeric DHFRs can grow, with growth rates that follow predictions from a kinetic flux model that depends on the intracellular abundance and catalytic activity of DHFR. Our findings show that although α-helices are not universally substitutable, the molecular and fitness effects of modular segments can be predicted by the biophysical compatibility of the replacement segment.
Collapse
Affiliation(s)
- Victor Y Zhao
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - João V Rodrigues
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts
| | - Elena R Lozovsky
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Daniel L Hartl
- Department of Organismic and Evolutionary Biology, Harvard University, Cambridge, Massachusetts
| | - Eugene I Shakhnovich
- Department of Chemistry and Chemical Biology, Harvard University, Cambridge, Massachusetts.
| |
Collapse
|
32
|
Atilgan AR, Atilgan C. Computational strategies for protein conformational ensemble detection. Curr Opin Struct Biol 2021; 72:79-87. [PMID: 34563946 DOI: 10.1016/j.sbi.2021.08.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2021] [Revised: 08/13/2021] [Accepted: 08/17/2021] [Indexed: 01/18/2023]
Abstract
Protein function is constrained by the three-dimensional structure but is delineated by its dynamics. This framework must satisfy specificity of function along with adaptability to changing environments and evolvability under external constraints. The accessibility of the available regions of the energy landscape for a set of conditions and shifts in the populations upon their modulation have effects propagating across scales, from biomolecular interactions, to organisms, to populations. Developing the ability to detect and juggle protein conformations supplemented by a physics-based understanding has implications for not only in vivo problems but also for resistance impeding drug discovery and bionano-sensor design.
Collapse
Affiliation(s)
- Ali Rana Atilgan
- Faculty of Engineering and Natural Sciences, Sabanci University, 34956, Istanbul, Turkey
| | - Canan Atilgan
- Faculty of Engineering and Natural Sciences, Sabanci University, 34956, Istanbul, Turkey.
| |
Collapse
|
33
|
Markin CJ, Mokhtari DA, Sunden F, Appel MJ, Akiva E, Longwell SA, Sabatti C, Herschlag D, Fordyce PM. Revealing enzyme functional architecture via high-throughput microfluidic enzyme kinetics. Science 2021; 373:373/6553/eabf8761. [PMID: 34437092 DOI: 10.1126/science.abf8761] [Citation(s) in RCA: 108] [Impact Index Per Article: 27.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2020] [Accepted: 05/24/2021] [Indexed: 12/21/2022]
Abstract
Systematic and extensive investigation of enzymes is needed to understand their extraordinary efficiency and meet current challenges in medicine and engineering. We present HT-MEK (High-Throughput Microfluidic Enzyme Kinetics), a microfluidic platform for high-throughput expression, purification, and characterization of more than 1500 enzyme variants per experiment. For 1036 mutants of the alkaline phosphatase PafA (phosphate-irrepressible alkaline phosphatase of Flavobacterium), we performed more than 670,000 reactions and determined more than 5000 kinetic and physical constants for multiple substrates and inhibitors. We uncovered extensive kinetic partitioning to a misfolded state and isolated catalytic effects, revealing spatially contiguous regions of residues linked to particular aspects of function. Regions included active-site proximal residues but extended to the enzyme surface, providing a map of underlying architecture not possible to derive from existing approaches. HT-MEK has applications that range from understanding molecular mechanisms to medicine, engineering, and design.
Collapse
Affiliation(s)
- C J Markin
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - D A Mokhtari
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - F Sunden
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - M J Appel
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA
| | - E Akiva
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA 94158, USA
| | - S A Longwell
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
| | - C Sabatti
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA.,Department of Statistics, Stanford University, Stanford, CA 94305, USA
| | - D Herschlag
- Department of Biochemistry, Stanford University, Stanford, CA 94305, USA. .,Department of Chemical Engineering, Stanford University, Stanford, CA 94305, USA.,ChEM-H Institute, Stanford University, Stanford, CA 94305, USA
| | - P M Fordyce
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA. .,ChEM-H Institute, Stanford University, Stanford, CA 94305, USA.,Department of Genetics, Stanford University, Stanford, CA 94305, USA.,Chan Zuckerberg Biohub; San Francisco, CA 94110, USA
| |
Collapse
|
34
|
McCormick JW, Russo MA, Thompson S, Blevins A, Reynolds KA. Structurally distributed surface sites tune allosteric regulation. eLife 2021; 10:68346. [PMID: 34132193 PMCID: PMC8324303 DOI: 10.7554/elife.68346] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2021] [Accepted: 06/15/2021] [Indexed: 11/30/2022] Open
Abstract
Our ability to rationally optimize allosteric regulation is limited by incomplete knowledge of the mutations that tune allostery. Are these mutations few or abundant, structurally localized or distributed? To examine this, we conducted saturation mutagenesis of a synthetic allosteric switch in which Dihydrofolate reductase (DHFR) is regulated by a blue-light sensitive LOV2 domain. Using a high-throughput assay wherein DHFR catalytic activity is coupled to E. coli growth, we assessed the impact of 1548 viable DHFR single mutations on allostery. Despite most mutations being deleterious to activity, fewer than 5% of mutations had a statistically significant influence on allostery. Most allostery disrupting mutations were proximal to the LOV2 insertion site. In contrast, allostery enhancing mutations were structurally distributed and enriched on the protein surface. Combining several allostery enhancing mutations yielded near-additive improvements to dynamic range. Our results indicate a path toward optimizing allosteric function through variation at surface sites. Many proteins exhibit a property called ‘allostery’. In allostery, an input signal at a specific site of a protein – such as a molecule binding, or the protein absorbing a photon of light – leads to a change in output at another site far away. For example, the protein might catalyze a chemical reaction faster or bind to another molecule more tightly in the presence of the input signal. This protein ‘remote control’ allows cells to sense and respond to changes in their environment. An ability to rapidly engineer new allosteric mechanisms into proteins is much sought after because this would provide an approach for building biosensors and other useful tools. One common approach to engineering new allosteric regulation is to combine a ‘sensor’ or input region from one protein with an ‘output’ region or domain from another. When researchers engineer allostery using this approach of combining input and output domains from different proteins, the difference in the output when the input is ‘on’ versus ‘off’ is often small, a situation called ‘modest allostery’. McCormick et al. wanted to know how to optimize this domain combination approach to increase the difference in output between the ‘on’ and ‘off’ states. More specifically, McCormick et al. wanted to find out whether swapping out or mutating specific amino acids (each of the individual building blocks that make up a protein) enhances or disrupts allostery. They also wanted to know if there are many possible mutations that change the effectiveness of allostery, or if this property is controlled by just a few amino acids. Finally, McCormick et al. questioned where in a protein most of these allostery-tuning mutations were located. To answer these questions, McCormick et al. engineered a new allosteric protein by inserting a light-sensing domain (input) into a protein involved in metabolism (a metabolic enzyme that produces a biomolecule called a tetrahydrofolate) to yield a light-controlled enzyme. Next, they introduced mutations into both the ‘input’ and ‘output’ domains to see where they had a greater effect on allostery. After filtering out mutations that destroyed the function of the output domain, McCormick et al. found that only about 5% of mutations to the ‘output’ domain altered the allosteric response of their engineered enzyme. In fact, most mutations that disrupted allostery were found near the site where the ‘input’ domain was inserted, while mutations that enhanced allostery were sprinkled throughout the enzyme, often on its protein surface. This was surprising in light of the commonly-held assumption that mutations on protein surfaces have little impact on the activity of the ‘output’ domain. Overall, the effect of individual mutations on allostery was small, but McCormick et al. found that these mutations can sometimes be combined to yield larger effects. McCormick et al.’s results suggest a new approach for optimizing engineered allosteric proteins: by introducing mutations on the protein surface. It also opens up new questions: mechanically, how do surface sites affect allostery? In the future, it will be important to characterize how combinations of mutations can optimize allosteric regulation, and to determine what evolutionary trajectories to high performance allosteric ‘switches’ look like.
Collapse
Affiliation(s)
- James W McCormick
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Marielle Ax Russo
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Samuel Thompson
- Department of Bioengineering, Stanford University, Stanford, United States
| | - Aubrie Blevins
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| |
Collapse
|