1
|
Metzger BPH, Park Y, Starr TN, Thornton JW. Epistasis facilitates functional evolution in an ancient transcription factor. eLife 2024; 12:RP88737. [PMID: 38767330 PMCID: PMC11105156 DOI: 10.7554/elife.88737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/22/2024] Open
Abstract
A protein's genetic architecture - the set of causal rules by which its sequence produces its functions - also determines its possible evolutionary trajectories. Prior research has proposed that the genetic architecture of proteins is very complex, with pervasive epistatic interactions that constrain evolution and make function difficult to predict from sequence. Most of this work has analyzed only the direct paths between two proteins of interest - excluding the vast majority of possible genotypes and evolutionary trajectories - and has considered only a single protein function, leaving unaddressed the genetic architecture of functional specificity and its impact on the evolution of new functions. Here, we develop a new method based on ordinal logistic regression to directly characterize the global genetic determinants of multiple protein functions from 20-state combinatorial deep mutational scanning (DMS) experiments. We use it to dissect the genetic architecture and evolution of a transcription factor's specificity for DNA, using data from a combinatorial DMS of an ancient steroid hormone receptor's capacity to activate transcription from two biologically relevant DNA elements. We show that the genetic architecture of DNA recognition consists of a dense set of main and pairwise effects that involve virtually every possible amino acid state in the protein-DNA interface, but higher-order epistasis plays only a tiny role. Pairwise interactions enlarge the set of functional sequences and are the primary determinants of specificity for different DNA elements. They also massively expand the number of opportunities for single-residue mutations to switch specificity from one DNA target to another. By bringing variants with different functions close together in sequence space, pairwise epistasis therefore facilitates rather than constrains the evolution of new functions.
Collapse
Affiliation(s)
- Brian PH Metzger
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
| | - Yeonwoo Park
- Program in Genetics, Genomics, and Systems Biology, University of ChicagoChicagoUnited States
| | - Tyler N Starr
- Department of Biochemistry and Molecular Biophysics, University of ChicagoChicagoUnited States
| | - Joseph W Thornton
- Department of Ecology and Evolution, University of ChicagoChicagoUnited States
- Department of Human Genetics, University of ChicagoChicagoUnited States
| |
Collapse
|
2
|
Olgenblum GI, Hutcheson BO, Pielak GJ, Harries D. Protecting Proteins from Desiccation Stress Using Molecular Glasses and Gels. Chem Rev 2024; 124:5668-5694. [PMID: 38635951 PMCID: PMC11082905 DOI: 10.1021/acs.chemrev.3c00752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2023] [Revised: 02/18/2024] [Accepted: 02/22/2024] [Indexed: 04/20/2024]
Abstract
Faced with desiccation stress, many organisms deploy strategies to maintain the integrity of their cellular components. Amorphous glassy media composed of small molecular solutes or protein gels present general strategies for protecting against drying. We review these strategies and the proposed molecular mechanisms to explain protein protection in a vitreous matrix under conditions of low hydration. We also describe efforts to exploit similar strategies in technological applications for protecting proteins in dry or highly desiccated states. Finally, we outline open questions and possibilities for future explorations.
Collapse
Affiliation(s)
- Gil I. Olgenblum
- Institute
of Chemistry, Fritz Haber Research Center, and The Harvey M. Krueger
Family Center for Nanoscience & Nanotechnology, The Hebrew University, Jerusalem 9190401, Israel
| | - Brent O. Hutcheson
- Department
of Chemistry, University of North Carolina
at Chapel Hill (UNC-CH), Chapel
Hill, North Carolina 27599, United States
| | - Gary J. Pielak
- Department
of Chemistry, University of North Carolina
at Chapel Hill (UNC-CH), Chapel
Hill, North Carolina 27599, United States
- Department
of Chemistry, Department of Biochemistry & Biophysics, Integrated
Program for Biological & Genome Sciences, Lineberger Comprehensive
Cancer Center, University of North Carolina
at Chapel Hill, Chapel Hill, North Carolina 27599, United States
| | - Daniel Harries
- Institute
of Chemistry, Fritz Haber Research Center, and The Harvey M. Krueger
Family Center for Nanoscience & Nanotechnology, The Hebrew University, Jerusalem 9190401, Israel
| |
Collapse
|
3
|
Swint-Kruse L, Fenton AW. Rheostats, toggles, and neutrals, Oh my! A new framework for understanding how amino acid changes modulate protein function. J Biol Chem 2024; 300:105736. [PMID: 38336297 PMCID: PMC10914490 DOI: 10.1016/j.jbc.2024.105736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
Advances in personalized medicine and protein engineering require accurately predicting outcomes of amino acid substitutions. Many algorithms correctly predict that evolutionarily-conserved positions show "toggle" substitution phenotypes, which is defined when a few substitutions at that position retain function. In contrast, predictions often fail for substitutions at the less-studied "rheostat" positions, which are defined when different amino acid substitutions at a position sample at least half of the possible functional range. This review describes efforts to understand the impact and significance of rheostat positions: (1) They have been observed in globular soluble, integral membrane, and intrinsically disordered proteins; within single proteins, their prevalence can be up to 40%. (2) Substitutions at rheostat positions can have biological consequences and ∼10% of substitutions gain function. (3) Although both rheostat and "neutral" (defined when all substitutions exhibit wild-type function) positions are nonconserved, the two classes have different evolutionary signatures. (4) Some rheostat positions have pleiotropic effects on function, simultaneously modulating multiple parameters (e.g., altering both affinity and allosteric coupling). (5) In structural studies, substitutions at rheostat positions appear to cause only local perturbations; the overall conformations appear unchanged. (6) Measured functional changes show promising correlations with predicted changes in protein dynamics; the emergent properties of predicted, dynamically coupled amino acid networks might explain some of the complex functional outcomes observed when substituting rheostat positions. Overall, rheostat positions provide unique opportunities for using single substitutions to tune protein function. Future studies of these positions will yield important insights into the protein sequence/function relationship.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
4
|
Williams C, Dong KC, Arkinson C, Martin A. Preparation of site-specifically fluorophore-labeled polyubiquitin chains for FRET studies of Cdc48 substrate processing. STAR Protoc 2023; 4:102659. [PMID: 37889757 PMCID: PMC10630674 DOI: 10.1016/j.xpro.2023.102659] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2023] [Revised: 08/24/2023] [Accepted: 10/02/2023] [Indexed: 10/29/2023] Open
Abstract
A critical step in the removal of polyubiquitinated proteins from macromolecular complexes and membranes for subsequent proteasomal degradation is the unfolding of an ubiquitin moiety by the cofactor Ufd1/Npl4 (UN) and its insertion into the Cdc48 ATPase for mechanical translocation. Here, we present a stepwise protocol for the assembly and purification of Lys48-linked ubiquitin chains that are fluorophore labeled at specific ubiquitin moieties and allow monitoring polyubiquitin engagement by the Cdc48-UN complex in a FRET-based assay. For complete details on the use and execution of this protocol, please refer to Williams et al. (2023).1.
Collapse
Affiliation(s)
- Cameron Williams
- Biophysics Graduate Group, University of California, Berkeley, CA 94720, USA; California Institute for Quantitative Biosciences, University of California at Berkeley, Berkeley, CA 94720, USA
| | - Ken C Dong
- California Institute for Quantitative Biosciences, University of California at Berkeley, Berkeley, CA 94720, USA; Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California at Berkeley, Berkeley CA 94720
| | - Connor Arkinson
- California Institute for Quantitative Biosciences, University of California at Berkeley, Berkeley, CA 94720, USA; Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California at Berkeley, Berkeley CA 94720
| | - Andreas Martin
- California Institute for Quantitative Biosciences, University of California at Berkeley, Berkeley, CA 94720, USA; Department of Molecular and Cell Biology, University of California at Berkeley, Berkeley, CA 94720, USA; Howard Hughes Medical Institute, University of California at Berkeley, Berkeley CA 94720.
| |
Collapse
|
5
|
Notin P, Kollasch AW, Ritter D, van Niekerk L, Paul S, Spinner H, Rollins N, Shaw A, Weitzman R, Frazer J, Dias M, Franceschi D, Orenbuch R, Gal Y, Marks DS. ProteinGym: Large-Scale Benchmarks for Protein Design and Fitness Prediction. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.07.570727. [PMID: 38106144 PMCID: PMC10723403 DOI: 10.1101/2023.12.07.570727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Predicting the effects of mutations in proteins is critical to many applications, from understanding genetic disease to designing novel proteins that can address our most pressing challenges in climate, agriculture and healthcare. Despite a surge in machine learning-based protein models to tackle these questions, an assessment of their respective benefits is challenging due to the use of distinct, often contrived, experimental datasets, and the variable performance of models across different protein families. Addressing these challenges requires scale. To that end we introduce ProteinGym, a large-scale and holistic set of benchmarks specifically designed for protein fitness prediction and design. It encompasses both a broad collection of over 250 standardized deep mutational scanning assays, spanning millions of mutated sequences, as well as curated clinical datasets providing high-quality expert annotations about mutation effects. We devise a robust evaluation framework that combines metrics for both fitness prediction and design, factors in known limitations of the underlying experimental methods, and covers both zero-shot and supervised settings. We report the performance of a diverse set of over 70 high-performing models from various subfields (eg., alignment-based, inverse folding) into a unified benchmark suite. We open source the corresponding codebase, datasets, MSAs, structures, model predictions and develop a user-friendly website that facilitates data access and analysis.
Collapse
Affiliation(s)
| | | | | | | | | | | | | | - Ada Shaw
- Applied Mathematics, Harvard University
| | | | | | - Mafalda Dias
- Centre for Genomic Regulation, Universitat Pompeu Fabra
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| | | |
Collapse
|
6
|
Notin P, Marks DS, Weitzman R, Gal Y. ProteinNPT: Improving Protein Property Prediction and Design with Non-Parametric Transformers. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.12.06.570473. [PMID: 38106034 PMCID: PMC10723423 DOI: 10.1101/2023.12.06.570473] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]
Abstract
Protein design holds immense potential for optimizing naturally occurring proteins, with broad applications in drug discovery, material design, and sustainability. However, computational methods for protein engineering are confronted with significant challenges, such as an expansive design space, sparse functional regions, and a scarcity of available labels. These issues are further exacerbated in practice by the fact most real-life design scenarios necessitate the simultaneous optimization of multiple properties. In this work, we introduce ProteinNPT, a non-parametric transformer variant tailored to protein sequences and particularly suited to label-scarce and multi-task learning settings. We first focus on the supervised fitness prediction setting and develop several cross-validation schemes which support robust performance assessment. We subsequently reimplement prior top-performing baselines, introduce several extensions of these baselines by integrating diverse branches of the protein engineering literature, and demonstrate that ProteinNPT consistently outperforms all of them across a diverse set of protein property prediction tasks. Finally, we demonstrate the value of our approach for iterative protein design across extensive in silico Bayesian optimization and conditional sampling experiments.
Collapse
Affiliation(s)
| | | | | | - Yarin Gal
- Computer Science, University of Oxford
| |
Collapse
|
7
|
Qu Y, Niu Z, Ding Q, Zhao T, Kong T, Bai B, Ma J, Zhao Y, Zheng J. Ensemble Learning with Supervised Methods Based on Large-Scale Protein Language Models for Protein Mutation Effects Prediction. Int J Mol Sci 2023; 24:16496. [PMID: 38003686 PMCID: PMC10671426 DOI: 10.3390/ijms242216496] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 11/11/2023] [Accepted: 11/17/2023] [Indexed: 11/26/2023] Open
Abstract
Machine learning has been increasingly utilized in the field of protein engineering, and research directed at predicting the effects of protein mutations has attracted increasing attention. Among them, so far, the best results have been achieved by related methods based on protein language models, which are trained on a large number of unlabeled protein sequences to capture the generally hidden evolutionary rules in protein sequences, and are therefore able to predict their fitness from protein sequences. Although numerous similar models and methods have been successfully employed in practical protein engineering processes, the majority of the studies have been limited to how to construct more complex language models to capture richer protein sequence feature information and utilize this feature information for unsupervised protein fitness prediction. There remains considerable untapped potential in these developed models, such as whether the prediction performance can be further improved by integrating different models to further improve the accuracy of prediction. Furthermore, how to utilize large-scale models for prediction methods of mutational effects on quantifiable properties of proteins due to the nonlinear relationship between protein fitness and the quantification of specific functionalities has yet to be explored thoroughly. In this study, we propose an ensemble learning approach for predicting mutational effects of proteins integrating protein sequence features extracted from multiple large protein language models, as well as evolutionarily coupled features extracted in homologous sequences, while comparing the differences between linear regression and deep learning models in mapping these features to quantifiable functional changes. We tested our approach on a dataset of 17 protein deep mutation scans and indicated that the integrated approach together with linear regression enables the models to have higher prediction accuracy and generalization. Moreover, we further illustrated the reliability of the integrated approach by exploring the differences in the predictive performance of the models across species and protein sequence lengths, as well as by visualizing clustering of ensemble and non-ensemble features.
Collapse
Affiliation(s)
- Yang Qu
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Zitong Niu
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Qiaojiao Ding
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Taowa Zhao
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Tong Kong
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Bing Bai
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Jianwei Ma
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Yitian Zhao
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| | - Jianping Zheng
- Cixi Biomedical Research Institute, Wenzhou Medical University, Ningbo 315300, China; (Y.Q.); (Z.N.); (Q.D.); (T.Z.)
- Cixi Institute of Biomedical Engineering, Ningbo Institute of Materials Technology and Engineering, Chinese Academy of Sciences, Ningbo 315300, China; (T.K.); (B.B.); (J.M.)
| |
Collapse
|
8
|
Derbel H, Zhao Z, Liu Q. Accurate prediction of functional effect of single amino acid variants with deep learning. Comput Struct Biotechnol J 2023; 21:5776-5784. [PMID: 38074467 PMCID: PMC10709104 DOI: 10.1016/j.csbj.2023.11.017] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Revised: 11/08/2023] [Accepted: 11/09/2023] [Indexed: 02/12/2024] Open
Abstract
The assessment of functional effect of amino acid variants is a critical biological problem in proteomics for clinical medicine and protein engineering. Although natively occurring variants offer insights into deleterious variants, high-throughput deep mutational experiments enable comprehensive investigation of amino acid variants for a given protein. However, these mutational experiments are too expensive to dissect millions of variants on thousands of proteins. Thus, computational approaches have been proposed, but they heavily rely on hand-crafted evolutionary conservation, limiting their accuracy. Recent advancement in transformers provides a promising solution to precisely estimate the functional effects of protein variants on high-throughput experimental data. Here, we introduce a novel deep learning model, namely Rep2Mut-V2, which leverages learned representation from transformer models. Rep2Mut-V2 significantly enhances the prediction accuracy for 27 types of measurements of functional effects of protein variants. In the evaluation of 38 protein datasets with 118,933 single amino acid variants, Rep2Mut-V2 achieved an average Spearman's correlation coefficient of 0.7. This surpasses the performance of six state-of-the-art methods, including the recently released methods ESM, DeepSequence and EVE. Even with limited training data, Rep2Mut-V2 outperforms ESM and DeepSequence, showing its potential to extend high-throughput experimental analysis for more protein variants to reduce experimental cost. In conclusion, Rep2Mut-V2 provides accurate predictions of the functional effects of single amino acid variants of protein coding sequences. This tool can significantly aid in the interpretation of variants in human disease studies.
Collapse
Affiliation(s)
- Houssemeddine Derbel
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, Las Vegas, NV 89154, USA
| | - Zhongming Zhao
- Center for Precision Health, McWilliams School of Biomedical Informatics, The University of Texas Health Science Center at Houston, Houston, TX 77030, USA
| | - Qian Liu
- Nevada Institute of Personalized Medicine, University of Nevada, Las Vegas, Las Vegas, NV 89154, USA
- School of Life Sciences, College of Sciences, University of Nevada, Las Vegas, Las Vegas, NV 89154, USA
| |
Collapse
|
9
|
Posani L, Rizzato F, Monasson R, Cocco S. Infer global, predict local: Quantity-relevance trade-off in protein fitness predictions from sequence data. PLoS Comput Biol 2023; 19:e1011521. [PMID: 37883593 PMCID: PMC10645369 DOI: 10.1371/journal.pcbi.1011521] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2023] [Revised: 11/14/2023] [Accepted: 09/15/2023] [Indexed: 10/28/2023] Open
Abstract
Predicting the effects of mutations on protein function is an important issue in evolutionary biology and biomedical applications. Computational approaches, ranging from graphical models to deep-learning architectures, can capture the statistical properties of sequence data and predict the outcome of high-throughput mutagenesis experiments probing the fitness landscape around some wild-type protein. However, how the complexity of the models and the characteristics of the data combine to determine the predictive performance remains unclear. Here, based on a theoretical analysis of the prediction error, we propose descriptors of the sequence data, characterizing their quantity and relevance relative to the model. Our theoretical framework identifies a trade-off between these two quantities, and determines the optimal subset of data for the prediction task, showing that simple models can outperform complex ones when inferred from adequately-selected sequences. We also show how repeated subsampling of the sequence data is informative about how much epistasis in the fitness landscape is not captured by the computational model. Our approach is illustrated on several protein families, as well as on in silico solvable protein models.
Collapse
Affiliation(s)
- Lorenzo Posani
- Laboratory of Physics of the Ecole Normale Supérieure, CNRS UMR8023 & PSL Research, Sorbonne Université, Paris, France
| | - Francesca Rizzato
- Laboratory of Physics of the Ecole Normale Supérieure, CNRS UMR8023 & PSL Research, Sorbonne Université, Paris, France
| | - Rémi Monasson
- Laboratory of Physics of the Ecole Normale Supérieure, CNRS UMR8023 & PSL Research, Sorbonne Université, Paris, France
| | - Simona Cocco
- Laboratory of Physics of the Ecole Normale Supérieure, CNRS UMR8023 & PSL Research, Sorbonne Université, Paris, France
| |
Collapse
|
10
|
Padhy AA, Mavor D, Sahoo S, Bolon DNA, Mishra P. Systematic profiling of dominant ubiquitin variants reveals key functional nodes contributing to evolutionary selection. Cell Rep 2023; 42:113064. [PMID: 37656625 DOI: 10.1016/j.celrep.2023.113064] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/30/2023] [Accepted: 08/21/2023] [Indexed: 09/03/2023] Open
Abstract
Dominant-negative mutations can help to investigate the biological mechanisms and to understand the selective pressures for multifunctional proteins. However, most studies have focused on recessive mutant effects that occur in the absence of a second functional gene copy, which overlooks the fact that most eukaryotic genomes contain more than one copy of many genes. We have identified dominant effects on yeast growth rate among all possible point mutations in ubiquitin expressed alongside a wild-type allele. Our results reveal more than 400 dominant-negative mutations, indicating that dominant-negative effects make a sizable contribution to selection acting on ubiquitin. Cellular and biochemical analyses of individual ubiquitin variants show that dominant-negative effects are explained by varied accumulation of polyubiquitinated cellular proteins and/or defects in conjugation of ubiquitin variants to ubiquitin ligases. Our approach to identify dominant-negative mutations is general and can be applied to other proteins of interest.
Collapse
Affiliation(s)
- Amrita Arpita Padhy
- Department of Animal Biology, School of Life Sciences, University of Hyderabad, Telangana 500046, India
| | - David Mavor
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01655, USA
| | - Subhashree Sahoo
- Department of Animal Biology, School of Life Sciences, University of Hyderabad, Telangana 500046, India
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01655, USA.
| | - Parul Mishra
- Department of Animal Biology, School of Life Sciences, University of Hyderabad, Telangana 500046, India.
| |
Collapse
|
11
|
Christensen S, Wernersson C, André I. Facile Method for High-throughput Identification of Stabilizing Mutations. J Mol Biol 2023; 435:168209. [PMID: 37479080 DOI: 10.1016/j.jmb.2023.168209] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 07/13/2023] [Accepted: 07/13/2023] [Indexed: 07/23/2023]
Abstract
Characterizing the effects of mutations on stability is critical for understanding the function and evolution of proteins and improving their biophysical properties. High throughput folding and abundance assays have been successfully used to characterize missense mutations associated with reduced stability. However, screening for increased thermodynamic stability is more challenging since such mutations are rarer and their impact on assay readout is more subtle. Here, a multiplex assay for high throughput screening of protein folding was developed by combining deep mutational scanning, fluorescence-activated cell sorting, and deep sequencing. By analyzing a library of 2000 variants of Adenylate kinase we demonstrate that the readout of the method correlates with stability and that mutants with up to 13 °C increase in thermal melting temperature could be identified with low false positive rate. The discovery of many stabilizing mutations also enabled the analysis of general substitution patterns associated with increased stability in Adenylate kinase. This high throughput method to identify stabilizing mutations can be combined with functional screens to identify mutations that improve both stability and activity.
Collapse
Affiliation(s)
- Signe Christensen
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Camille Wernersson
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden
| | - Ingemar André
- Department of Biochemistry and Structural Biology, Lund University, Lund, Sweden.
| |
Collapse
|
12
|
Tang JQ, Marchand MM, Veggiani G. Ubiquitin Engineering for Interrogating the Ubiquitin-Proteasome System and Novel Therapeutic Strategies. Cells 2023; 12:2117. [PMID: 37626927 PMCID: PMC10453149 DOI: 10.3390/cells12162117] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 08/10/2023] [Accepted: 08/11/2023] [Indexed: 08/27/2023] Open
Abstract
Protein turnover, a highly regulated process governed by the ubiquitin-proteasome system (UPS), is essential for maintaining cellular homeostasis. Dysregulation of the UPS has been implicated in various diseases, including viral infections and cancer, making the proteins in the UPS attractive targets for therapeutic intervention. However, the functional and structural redundancies of UPS enzymes present challenges in identifying precise drug targets and achieving target selectivity. Consequently, only 26S proteasome inhibitors have successfully advanced to clinical use thus far. To overcome these obstacles, engineered peptides and proteins, particularly engineered ubiquitin, have emerged as promising alternatives. In this review, we examine the impact of engineered ubiquitin on UPS and non-UPS proteins, as well as on viral enzymes. Furthermore, we explore their potential to guide the development of small molecules targeting novel surfaces, thereby expanding the range of druggable targets.
Collapse
Affiliation(s)
- Jason Q. Tang
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, 160 College Street, Toronto, ON M5S3E1, Canada
- Department of Molecular Genetics, University of Toronto, 160 College Street, Toronto, ON M5S3E1, Canada
| | - Mary M. Marchand
- Department of Pathobiological Sciences, School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA 70803, USA
| | - Gianluca Veggiani
- Department of Pathobiological Sciences, School of Veterinary Medicine, Louisiana State University, Baton Rouge, LA 70803, USA
- Division of Biotechnology and Molecular Medicine, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
13
|
McConnell A, Hackel BJ. Protein engineering via sequence-performance mapping. Cell Syst 2023; 14:656-666. [PMID: 37494931 PMCID: PMC10527434 DOI: 10.1016/j.cels.2023.06.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 05/10/2023] [Accepted: 06/21/2023] [Indexed: 07/28/2023]
Abstract
Discovery and evolution of new and improved proteins has empowered molecular therapeutics, diagnostics, and industrial biotechnology. Discovery and evolution both require efficient screens and effective libraries, although they differ in their challenges because of the absence or presence, respectively, of an initial protein variant with the desired function. A host of high-throughput technologies-experimental and computational-enable efficient screens to identify performant protein variants. In partnership, an informed search of sequence space is needed to overcome the immensity, sparsity, and complexity of the sequence-performance landscape. Early in the historical trajectory of protein engineering, these elements aligned with distinct approaches to identify the most performant sequence: selection from large, randomized combinatorial libraries versus rational computational design. Substantial advances have now emerged from the synergy of these perspectives. Rational design of combinatorial libraries aids the experimental search of sequence space, and high-throughput, high-integrity experimental data inform computational design. At the core of the collaborative interface, efficient protein characterization (rather than mere selection of optimal variants) maps sequence-performance landscapes. Such quantitative maps elucidate the complex relationships between protein sequence and performance-e.g., binding, catalytic efficiency, biological activity, and developability-thereby advancing fundamental protein science and facilitating protein discovery and evolution.
Collapse
Affiliation(s)
- Adam McConnell
- Department of Biomedical Engineering, University of Minnesota - Twin Cities, 421 Washington Avenue SE, Minneapolis, MN 55455, USA
| | - Benjamin J Hackel
- Department of Biomedical Engineering, University of Minnesota - Twin Cities, 421 Washington Avenue SE, Minneapolis, MN 55455, USA; Department of Chemical Engineering and Materials Science, University of Minnesota - Twin Cities, 421 Washington Avenue SE, Minneapolis, MN 55455, USA.
| |
Collapse
|
14
|
Hauser BM, Luo Y, Nathan A, Gaiha GD, Vavvas D, Comander J, Pierce EA, Place EM, Bujakowska KM, Rossin EJ. Structure-based network analysis predicts mutations associated with inherited retinal disease. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2023:2023.07.05.23292247. [PMID: 37461650 PMCID: PMC10350150 DOI: 10.1101/2023.07.05.23292247] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
With continued advances in gene sequencing technologies comes the need to develop better tools to understand which mutations cause disease. Here we validate structure-based network analysis (SBNA)1,2 in well-studied human proteins and report results of using SBNA to identify critical amino acids that may cause retinal disease if subject to missense mutation. We computed SBNA scores for genes with high-quality structural data, starting with validating the method using 4 well-studied human disease-associated proteins. We then analyzed 47 inherited retinal disease (IRD) genes. We compared SBNA scores to phenotype data from the ClinVar database and found a significant difference between benign and pathogenic mutations with respect to network score. Finally, we applied this approach to 65 patients at Massachusetts Eye and Ear (MEE) who were diagnosed with IRD but for whom no genetic cause was found. Multivariable logistic regression models built using SBNA scores for IRD-associated genes successfully predicted pathogenicity of novel mutations, allowing us to identify likely causative disease variants in 37 patients with IRD from our clinic. In conclusion, SBNA can be meaningfully applied to human proteins and may help predict mutations causative of IRD.
Collapse
Affiliation(s)
| | - Yuyang Luo
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Anusha Nathan
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA
| | - Gaurav D. Gaiha
- Ragon Institute of Mass General, MIT, and Harvard, Cambridge, MA
| | - Demetrios Vavvas
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Jason Comander
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Eric A. Pierce
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Emily M. Place
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Kinga M. Bujakowska
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| | - Elizabeth J. Rossin
- Department of Ophthalmology, Massachusetts Eye and Ear, Harvard Medical School, Boston, MA
| |
Collapse
|
15
|
Gersing S, Cagiada M, Gebbia M, Gjesing AP, Coté AG, Seesankar G, Li R, Tabet D, Weile J, Stein A, Gloyn AL, Hansen T, Roth FP, Lindorff-Larsen K, Hartmann-Petersen R. A comprehensive map of human glucokinase variant activity. Genome Biol 2023; 24:97. [PMID: 37101203 PMCID: PMC10131484 DOI: 10.1186/s13059-023-02935-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2022] [Accepted: 04/10/2023] [Indexed: 04/28/2023] Open
Abstract
BACKGROUND Glucokinase (GCK) regulates insulin secretion to maintain appropriate blood glucose levels. Sequence variants can alter GCK activity to cause hyperinsulinemic hypoglycemia or hyperglycemia associated with GCK-maturity-onset diabetes of the young (GCK-MODY), collectively affecting up to 10 million people worldwide. Patients with GCK-MODY are frequently misdiagnosed and treated unnecessarily. Genetic testing can prevent this but is hampered by the challenge of interpreting novel missense variants. RESULT Here, we exploit a multiplexed yeast complementation assay to measure both hyper- and hypoactive GCK variation, capturing 97% of all possible missense and nonsense variants. Activity scores correlate with in vitro catalytic efficiency, fasting glucose levels in carriers of GCK variants and with evolutionary conservation. Hypoactive variants are concentrated at buried positions, near the active site, and at a region of known importance for GCK conformational dynamics. Some hyperactive variants shift the conformational equilibrium towards the active state through a relative destabilization of the inactive conformation. CONCLUSION Our comprehensive assessment of GCK variant activity promises to facilitate variant interpretation and diagnosis, expand our mechanistic understanding of hyperactive variants, and inform development of therapeutics targeting GCK.
Collapse
Affiliation(s)
- Sarah Gersing
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Matteo Cagiada
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Marinella Gebbia
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
| | - Anette P Gjesing
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Atina G Coté
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
| | - Gireesh Seesankar
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
| | - Roujia Li
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
| | - Daniel Tabet
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
| | - Jochen Weile
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada
| | - Amelie Stein
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark
| | - Anna L Gloyn
- Division of Endocrinology, Department of Pediatrics, Stanford University School of Medicine, Stanford, CA, USA
- Stanford Diabetes Research Center, Stanford University, Stanford, CA, USA
| | - Torben Hansen
- Novo Nordisk Foundation Center for Basic Metabolic Research, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Frederick P Roth
- Donnelly Centre, University of Toronto, Toronto, ON, M5S 3E1, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, M5S 1A8, Canada.
- Lunenfeld-Tanenbaum Research Institute, Sinai Health, Toronto, ON, M5G 1X5, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, M5T 3A1, Canada.
| | - Kresten Lindorff-Larsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark.
| | - Rasmus Hartmann-Petersen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Ole Maaløes Vej 5, 2200, Copenhagen, Denmark.
| |
Collapse
|
16
|
Mathy CJP, Mishra P, Flynn JM, Perica T, Mavor D, Bolon DNA, Kortemme T. A complete allosteric map of a GTPase switch in its native cellular network. Cell Syst 2023; 14:237-246.e7. [PMID: 36801015 PMCID: PMC10173951 DOI: 10.1016/j.cels.2023.01.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 11/08/2022] [Accepted: 01/06/2023] [Indexed: 02/19/2023]
Abstract
Allosteric regulation is central to protein function in cellular networks. A fundamental open question is whether cellular regulation of allosteric proteins occurs only at a few defined positions or at many sites distributed throughout the structure. Here, we probe the regulation of GTPases-protein switches that control signaling through regulated conformational cycling-at residue-level resolution by deep mutagenesis in the native biological network. For the GTPase Gsp1/Ran, we find that 28% of the 4,315 assayed mutations show pronounced gain-of-function responses. Twenty of the sixty positions enriched for gain-of-function mutations are outside the canonical GTPase active site switch regions. Kinetic analysis shows that these distal sites are allosterically coupled to the active site. We conclude that the GTPase switch mechanism is broadly sensitive to cellular allosteric regulation. Our systematic discovery of new regulatory sites provides a functional map to interrogate and target GTPases controlling many essential biological processes.
Collapse
Affiliation(s)
- Christopher J P Mathy
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Parul Mishra
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA; School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, India
| | - Julia M Flynn
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Tina Perica
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - David Mavor
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, San Francisco, CA 94158, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|
17
|
Chandra S, Manjunath K, Asok A, Varadarajan R. Mutational scan inferred binding energetics and structure in intrinsically disordered protein CcdA. Protein Sci 2023; 32:e4580. [PMID: 36714997 PMCID: PMC9951195 DOI: 10.1002/pro.4580] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 01/02/2023] [Accepted: 01/25/2023] [Indexed: 01/31/2023]
Abstract
Unlike globular proteins, mutational effects on the function of Intrinsically Disordered Proteins (IDPs) are not well-studied. Deep Mutational Scanning of a yeast surface displayed mutant library yields insights into sequence-function relationships in the CcdA IDP. The approach enables facile prediction of interface residues and local structural signatures of the bound conformation. In contrast to previous titration-based approaches which use a number of ligand concentrations, we show that use of a single rationally chosen ligand concentration can provide quantitative estimates of relative binding constants for large numbers of protein variants. This is because the extended interface of IDP ensures that energetic effects of point mutations are spread over a much smaller range than for globular proteins. Our data also provides insights into the much-debated role of helicity and disorder in partner binding of IDPs. Based on this exhaustive mutational sensitivity dataset, a rudimentary model was developed in an attempt to predict mutational effects on binding affinity of IDPs that form alpha-helical structures upon binding.
Collapse
Affiliation(s)
| | | | - Aparna Asok
- Molecular Biophysics Unit, Indian Institute of ScienceBangaloreIndia
| | | |
Collapse
|
18
|
Flynn J, Samant N, Schneider-Nachum G, Tenzin T, Bolon DNA. Mutational fitness landscape and drug resistance. Curr Opin Struct Biol 2023; 78:102525. [PMID: 36621152 PMCID: PMC10243218 DOI: 10.1016/j.sbi.2022.102525] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2022] [Revised: 11/29/2022] [Accepted: 12/06/2022] [Indexed: 01/08/2023]
Abstract
Robust technology has been developed to systematically quantify fitness landscapes that provide valuable opportunities to improve our understanding of drug resistance and define new avenues to develop drugs with reduced resistance susceptibility. We outline the critical importance of drug resistance studies and the potential for fitness landscape approaches to contribute to this effort. We describe the major technical advancements in mutational scanning, which is the primary approach used to quantify protein fitness landscapes. There are many complex steps to consider in planning and executing mutational scanning projects including developing a selection scheme, generating mutant libraries, tracking the frequency of variants using next-generation sequencing, and processing and interpreting the data. Key experimental parameters impacting each of these steps are discussed to aid in planning fitness landscape studies. There is a strong need for improved understanding of drug resistance, and fitness landscapes provide a promising new approach.
Collapse
Affiliation(s)
- Julia Flynn
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Neha Samant
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Gily Schneider-Nachum
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Tsepal Tenzin
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Chan Medical School, Worcester, MA 01605, USA.
| |
Collapse
|
19
|
Warren GD, Kitao T, Franklin TG, Nguyen JV, Geurink PP, Kubori T, Nagai H, Pruneda JN. Mechanism of Lys6 poly-ubiquitin specificity by the L. pneumophila deubiquitinase LotA. Mol Cell 2023; 83:105-120.e5. [PMID: 36538933 PMCID: PMC9825671 DOI: 10.1016/j.molcel.2022.11.022] [Citation(s) in RCA: 12] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2022] [Revised: 10/13/2022] [Accepted: 11/29/2022] [Indexed: 12/23/2022]
Abstract
The versatility of ubiquitination to control vast domains of eukaryotic biology is due, in part, to diversification through differently linked poly-ubiquitin chains. Deciphering signaling roles for some chain types, including those linked via K6, has been stymied by a lack of specificity among the implicated regulatory proteins. Forged through strong evolutionary pressures, pathogenic bacteria have evolved intricate mechanisms to regulate host ubiquitin during infection. Herein, we identify and characterize a deubiquitinase domain of the secreted effector LotA from Legionella pneumophila that specifically regulates K6-linked poly-ubiquitin. We demonstrate the utility of LotA for studying K6 poly-ubiquitin signals. We identify the structural basis of LotA activation and poly-ubiquitin specificity and describe an essential "adaptive" ubiquitin-binding domain. Without LotA activity during infection, the Legionella-containing vacuole becomes decorated with K6 poly-ubiquitin as well as the AAA ATPase VCP/p97/Cdc48. We propose that LotA's deubiquitinase activity guards Legionella-containing vacuole components from ubiquitin-dependent extraction.
Collapse
Affiliation(s)
- Gus D Warren
- Department of Molecular Microbiology & Immunology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Tomoe Kitao
- Department of Microbiology, Graduate School of Medicine, Gifu University, Gifu, Gifu 501-1194, Japan
| | - Tyler G Franklin
- Department of Molecular Microbiology & Immunology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Justine V Nguyen
- Department of Molecular Microbiology & Immunology, Oregon Health & Science University, Portland, OR 97239, USA
| | - Paul P Geurink
- Oncode Institute, Department of Cell and Chemical Biology, Leiden University Medical Centre, Leiden, the Netherlands
| | - Tomoko Kubori
- Department of Microbiology, Graduate School of Medicine, Gifu University, Gifu, Gifu 501-1194, Japan; G-CHAIN, Gifu University, Gifu, Gifu 501-1194, Japan
| | - Hiroki Nagai
- Department of Microbiology, Graduate School of Medicine, Gifu University, Gifu, Gifu 501-1194, Japan; G-CHAIN, Gifu University, Gifu, Gifu 501-1194, Japan
| | - Jonathan N Pruneda
- Department of Molecular Microbiology & Immunology, Oregon Health & Science University, Portland, OR 97239, USA.
| |
Collapse
|
20
|
Evolutionary scaling of maximum growth rate with organism size. Sci Rep 2022; 12:22586. [PMID: 36585440 PMCID: PMC9803686 DOI: 10.1038/s41598-022-23626-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2022] [Accepted: 11/02/2022] [Indexed: 12/31/2022] Open
Abstract
Data from nearly 1000 species reveal the upper bound to rates of biomass production achievable by natural selection across the Tree of Life. For heterotrophs, maximum growth rates scale positively with organism size in bacteria but negatively in eukaryotes, whereas for phototrophs, the scaling is negligible for cyanobacteria and weakly negative for eukaryotes. These results have significant implications for understanding the bioenergetic consequences of the transition from prokaryotes to eukaryotes, and of the expansion of some groups of the latter into multicellularity. The magnitudes of the scaling coefficients for eukaryotes are significantly lower than expected under any proposed physical-constraint model. Supported by genomic, bioenergetic, and population-genetic data and theory, an alternative hypothesis for the observed negative scaling in eukaryotes postulates that growth-diminishing mutations with small effects passively accumulate with increasing organism size as a consequence of associated increases in the power of random genetic drift. In contrast, conditional on the structural and functional features of ribosomes, natural selection has been able to promote bacteria with the fastest possible growth rates, implying minimal conflicts with both bioenergetic constraints and random genetic drift. If this extension of the drift-barrier hypothesis is correct, the interpretations of comparative studies of biological traits that have traditionally ignored differences in population-genetic environments will require revisiting.
Collapse
|
21
|
Fu Y, Bedő J, Papenfuss AT, Rubin AF. Integrating deep mutational scanning and low-throughput mutagenesis data to predict the impact of amino acid variants. Gigascience 2022; 12:giad073. [PMID: 37721410 PMCID: PMC10506130 DOI: 10.1093/gigascience/giad073] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Revised: 07/02/2023] [Accepted: 08/23/2023] [Indexed: 09/19/2023] Open
Abstract
BACKGROUND Evaluating the impact of amino acid variants has been a critical challenge for studying protein function and interpreting genomic data. High-throughput experimental methods like deep mutational scanning (DMS) can measure the effect of large numbers of variants in a target protein, but because DMS studies have not been performed on all proteins, researchers also model DMS data computationally to estimate variant impacts by predictors. RESULTS In this study, we extended a linear regression-based predictor to explore whether incorporating data from alanine scanning (AS), a widely used low-throughput mutagenesis method, would improve prediction results. To evaluate our model, we collected 146 AS datasets, mapping to 54 DMS datasets across 22 distinct proteins. CONCLUSIONS We show that improved model performance depends on the compatibility of the DMS and AS assays, and the scale of improvement is closely related to the correlation between DMS and AS results.
Collapse
Affiliation(s)
- Yunfan Fu
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Justin Bedő
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| | - Anthony T Papenfuss
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
- Peter MacCallum Cancer Centre, Melbourne, Victoria 3000, Australia
| | - Alan F Rubin
- The Walter and Eliza Hall Institute of Medical Research, Bioinformatics Division, 1G Royal Pde, Parkville, Victoria 3052, Australia
- The University of Melbourne, Department of Medical Biology, Parkville, Victoria 3010, Australia
| |
Collapse
|
22
|
Rani R, Parsa KVL, Chatti K, Sevilimedu A. An efficient and cost-effective method for directed mutagenesis at multiple dispersed sites-a case study with Omicron Spike DNA. Biol Methods Protoc 2022; 8:bpac037. [PMID: 36654942 PMCID: PMC9838316 DOI: 10.1093/biomethods/bpac037] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 12/13/2022] [Accepted: 12/15/2022] [Indexed: 12/24/2022] Open
Abstract
Site-directed mutagenesis is an invaluable technique that enables the elucidation of the contribution of specific residues to protein structure and function. The simultaneous introduction of mutations at a large number of sites (>10), singly and in multiple combinations, is often necessary to fully understand the functional contributions. We report a simple, efficient, time and cost-effective method to achieve this using commonly available molecular biology reagents and protocols, as an alternative to gene synthesis. We demonstrate this method using the Omicron Spike DNA construct as an example, and create a construct bearing 37 mutations (as compared to wild-type Spike DNA), as well as 4 other constructs bearing subsets of the full spectrum of mutations. We believe that this method can be an excellent alternative to gene synthesis, especially when three or more variants are required.
Collapse
Affiliation(s)
- Rita Rani
- Correspondence address. (R.R. and A.S.) Center for Innovation in Molecular and Pharmaceutical Sciences, Dr. Reddy’s Institute of Life Sciences, University of Hyderabad Campus, Gachibowli, Hyderabad, Telangana, 500046, India; (R.R) E-mail: . (A.S.)
| | - Kishore V L Parsa
- Center for Innovation in Molecular and Pharmaceutical Sciences, Dr. Reddy’s Institute of Life Sciences, University of Hyderabad Campus, Hyderabad, 500046, India
| | - Kiranam Chatti
- Center for Innovation in Molecular and Pharmaceutical Sciences, Dr. Reddy’s Institute of Life Sciences, University of Hyderabad Campus, Hyderabad, 500046, India
| | - Aarti Sevilimedu
- Correspondence address. (R.R. and A.S.) Center for Innovation in Molecular and Pharmaceutical Sciences, Dr. Reddy’s Institute of Life Sciences, University of Hyderabad Campus, Gachibowli, Hyderabad, Telangana, 500046, India; (R.R) E-mail: . (A.S.)
| |
Collapse
|
23
|
Chandra S, Gupta K, Khare S, Kohli P, Asok A, Mohan SV, Gowda H, Varadarajan R. The High Mutational Sensitivity of ccdA Antitoxin Is Linked to Codon Optimality. Mol Biol Evol 2022; 39:6693774. [PMID: 36069948 PMCID: PMC9555053 DOI: 10.1093/molbev/msac187] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Deep mutational scanning studies suggest that synonymous mutations are typically silent and that most exposed, nonactive-site residues are tolerant to mutations. Here, we show that the ccdA antitoxin component of the Escherichia coli ccdAB toxin-antitoxin system is unusually sensitive to mutations when studied in the operonic context. A large fraction (∼80%) of single-codon mutations, including many synonymous mutations in the ccdA gene shows inactive phenotype, but they retain native-like binding affinity towards cognate toxin, CcdB. Therefore, the observed phenotypic effects are largely not due to alterations in protein structure/stability, consistent with a large region of CcdA being intrinsically disordered. E. coli codon preference and strength of ribosome-binding associated with translation of downstream ccdB gene are found to be major contributors of the observed ccdA mutant phenotypes. In select cases, proteomics studies reveal altered ratios of CcdA:CcdB protein levels in vivo, suggesting that the ccdA mutations likely alter relative translation efficiencies of the two genes in the operon. We extend these results by studying single-site synonymous mutations that lead to loss of function phenotypes in the relBE operon upon introduction of rarer codons. Thus, in their operonic context, genes are likely to be more sensitive to both synonymous and nonsynonymous point mutations than inferred previously.
Collapse
Affiliation(s)
| | | | - Shruti Khare
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Pehu Kohli
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | - Aparna Asok
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore 560012, India
| | | | - Harsha Gowda
- Institute of Bioinformatics, Bangalore 560100, India
| | | |
Collapse
|
24
|
Gabzi T, Pilpel Y, Friedlander T. Fitness landscape analysis of a tRNA gene reveals that the wild type allele is sub-optimal, yet mutationally robust. Mol Biol Evol 2022; 39:6670756. [PMID: 35976926 PMCID: PMC9447856 DOI: 10.1093/molbev/msac178] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Fitness landscape mapping and the prediction of evolutionary trajectories on these landscapes are major tasks in evolutionary biology research. Evolutionary dynamics is tightly linked to the landscape topography, but this relation is not straightforward. Here, we analyze a fitness landscape of a yeast tRNA gene, previously measured under four different conditions. We find that the wild type allele is sub-optimal, and 8–10% of its variants are fitter. We rule out the possibilities that the wild type is fittest on average on these four conditions or located on a local fitness maximum. Notwithstanding, we cannot exclude the possibility that the wild type might be fittest in some of the many conditions in the complex ecology that yeast lives at. Instead, we find that the wild type is mutationally robust (“flat”), while more fit variants are typically mutationally fragile. Similar observations of mutational robustness or flatness have been so far made in very few cases, predominantly in viral genomes.
Collapse
Affiliation(s)
- Tzahi Gabzi
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Yitzhak Pilpel
- Department of Molecular Genetics, Weizmann Institute of Science, Rehovot 7610001, Israel
| | - Tamar Friedlander
- The Robert H. Smith Institute of Plant Sciences and Genetics in Agriculture Faculty of Agriculture, Hebrew University of Jerusalem, 229 Herzl St., Rehovot 7610001, Israel
| |
Collapse
|
25
|
Flynn JM, Samant N, Schneider-Nachum G, Bakan DT, Yilmaz NK, Schiffer CA, Moquin SA, Dovala D, Bolon DNA. Comprehensive fitness landscape of SARS-CoV-2 M pro reveals insights into viral resistance mechanisms. eLife 2022; 11:77433. [PMID: 35723575 PMCID: PMC9323007 DOI: 10.7554/elife.77433] [Citation(s) in RCA: 35] [Impact Index Per Article: 17.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2022] [Accepted: 06/17/2022] [Indexed: 11/13/2022] Open
Abstract
With the continual evolution of new strains of SARS-CoV-2 that are more virulent, transmissible, and able to evade current vaccines, there is an urgent need for effective anti-viral drugs SARS-CoV-2 main protease (Mpro) is a leading target for drug design due to its conserved and indispensable role in the viral life cycle. Drugs targeting Mpro appear promising but will elicit selection pressure for resistance. To understand resistance potential in Mpro, we performed a comprehensive mutational scan of the protease that analyzed the function of all possible single amino acid changes. We developed three separate high-throughput assays of Mpro function in yeast, based on either the ability of Mpro variants to cleave at a defined cut-site or on the toxicity of their expression to yeast. We used deep sequencing to quantify the functional effects of each variant in each screen. The protein fitness landscapes from all three screens were strongly correlated, indicating that they captured the biophysical properties critical to Mpro function. The fitness landscapes revealed a non-active site location on the surface that is extremely sensitive to mutation making it a favorable location to target with inhibitors. In addition, we found a network of critical amino acids that physically bridge the two active sites of the Mpro dimer. The clinical variants of Mpro were predominantly functional in our screens, indicating that Mpro is under strong selection pressure in the human population. Our results provide predictions of mutations that will be readily accessible to Mpro evolution and that are likely to contribute to drug resistance. This complete mutational guide of Mpro can be used in the design of inhibitors with reduced potential of evolving viral resistance.
Collapse
Affiliation(s)
- Julia M Flynn
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Neha Samant
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Gily Schneider-Nachum
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | | | - Nese Kurt Yilmaz
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | - Celia A Schiffer
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| | | | | | - Daniel N A Bolon
- Department of Biochemistry and Molecular Pharmacology, University of Massachusetts Medical School, Worcester, United States
| |
Collapse
|
26
|
On the Study of Deubiquitinases: Using the Right Tools for the Job. Biomolecules 2022; 12:biom12050703. [PMID: 35625630 PMCID: PMC9139131 DOI: 10.3390/biom12050703] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2022] [Revised: 05/05/2022] [Accepted: 05/12/2022] [Indexed: 02/01/2023] Open
Abstract
Deubiquitinases (DUBs) have been the subject of intense scrutiny in recent years. Many of their diverse enzymatic mechanisms are well characterized in vitro; however, our understanding of these enzymes at the cellular level lags due to the lack of quality tool reagents. DUBs play a role in seemingly every biological process and are central to many human pathologies, thus rendering them very desirable and challenging therapeutic targets. This review aims to provide researchers entering the field of ubiquitination with knowledge of the pharmacological modulators and tool molecules available to study DUBs. A focus is placed on small molecule inhibitors, ubiquitin variants (UbVs), and activity-based probes (ABPs). Leveraging these tools to uncover DUB biology at the cellular level is of particular importance and may lead to significant breakthroughs. Despite significant drug discovery efforts, only approximately 15 chemical probe-quality small molecule inhibitors have been reported, hitting just 6 of about 100 DUB targets. UbV technology is a promising approach to rapidly expand the library of known DUB inhibitors and may be used as a combinatorial platform for structure-guided drug design.
Collapse
|
27
|
Shearer RF, Typas D, Coscia F, Schovsbo S, Kruse T, Mund A, Mailand N. K27-linked ubiquitylation promotes p97 substrate processing and is essential for cell proliferation. EMBO J 2022; 41:e110145. [PMID: 35349166 PMCID: PMC9058539 DOI: 10.15252/embj.2021110145] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2021] [Revised: 03/09/2022] [Accepted: 03/11/2022] [Indexed: 02/06/2023] Open
Abstract
Conjugation of ubiquitin (Ub) to numerous substrate proteins regulates virtually all cellular processes. Eight distinct ubiquitin polymer linkages specifying different functional outcomes are generated in cells. However, the roles of some atypical poly-ubiquitin topologies, in particular linkages via lysine 27 (K27), remain poorly understood due to a lack of tools for their specific detection and manipulation. Here, we adapted a cell-based ubiquitin replacement strategy to enable selective and conditional abrogation of K27-linked ubiquitylation, revealing that this ubiquitin linkage type is essential for proliferation of human cells. We demonstrate that K27-linked ubiquitylation is predominantly a nuclear modification whose ablation deregulates nuclear ubiquitylation dynamics and impairs cell cycle progression in an epistatic manner with inactivation of the ATPase p97/VCP. Moreover, we show that a p97-proteasome pathway model substrate (Ub(G76V)-GFP) is directly modified by K27-linked ubiquitylation, and that disabling the formation of K27-linked ubiquitin signals or blocking their decoding via overexpression of the K27 linkage-specific binder UCHL3 impedes Ub(G76V)-GFP turnover at the level of p97 function. Our findings suggest a critical role of K27-linked ubiquitylation in supporting cell fitness by facilitating p97-dependent processing of ubiquitylated nuclear proteins.
Collapse
Affiliation(s)
- Robert F Shearer
- Protein Signaling Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Dimitris Typas
- Protein Signaling Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Fabian Coscia
- Proteomics Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Sofie Schovsbo
- Protein Signaling Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Thomas Kruse
- Protein Signaling Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Andreas Mund
- Proteomics Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark
| | - Niels Mailand
- Protein Signaling Program, Novo Nordisk Foundation Center for Protein Research, University of Copenhagen, Copenhagen, Denmark.,Center for Chromosome Stability, Department of Cellular and Molecular Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
28
|
A Panel of Engineered Ubiquitin Variants Targeting the Family of Domains Found in Ubiquitin Specific Proteases (DUSPs). J Mol Biol 2021; 433:167300. [PMID: 34666042 DOI: 10.1016/j.jmb.2021.167300] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2021] [Revised: 09/16/2021] [Accepted: 10/04/2021] [Indexed: 11/21/2022]
Abstract
Domains found in ubiquitin specific proteases (DUSPs) occur in seven members of the ubiquitin specific protease (USP) family. DUSPs are defined by a distinct structural fold but their functions remain largely unknown, although studies with USP4 suggest that its DUSP enhances deubiquitination activity. We used phage-displayed libraries of ubiquitin variants (UbVs) to derive protein-based tools to target DUSP family members with high affinity and specificity. We designed a UbV library based on insights from the structure of a previously identified UbV bound to the DUSP of USP15. The new library yielded 33 unique UbVs that bound to DUSPs from five different USPs (USP4, USP11, USP15, USP20 and USP33). For each USP, we were able to identify at least one DUSP that bound with high affinity and absolute specificity relative to the other DUSPs. We showed that UbVs targeting the DUSPs of USP15, USP11 and USP20 inhibited the catalytic activity of the enzyme, despite the fact that the DUSP is located outside of the catalytic domain. These findings provide an alternative means of inhibiting USP activity by targeting DUSPs, and this mechanism could be potentially extended other DUSP-containing USPs.
Collapse
|
29
|
Lara Ortiz MT, Martinell García V, Del Rio G. Saturation Mutagenesis of the Transmembrane Region of HokC in Escherichia coli Reveals Its High Tolerance to Mutations. Int J Mol Sci 2021; 22:ijms221910359. [PMID: 34638709 PMCID: PMC8509063 DOI: 10.3390/ijms221910359] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/03/2021] [Revised: 09/20/2021] [Accepted: 09/22/2021] [Indexed: 11/16/2022] Open
Abstract
Cells adapt to different stress conditions, such as the antibiotics presence. This adaptation sometimes is achieved by changing relevant protein positions, of which the mutability is limited by structural constrains. Understanding the basis of these constrains represent an important challenge for both basic science and potential biotechnological applications. To study these constraints, we performed a systematic saturation mutagenesis of the transmembrane region of HokC, a toxin used by Escherichia coli to control its own population, and observed that 92% of single-point mutations are tolerated and that all the non-tolerated mutations have compensatory mutations that reverse their effect. We provide experimental evidence that HokC accumulates multiple compensatory mutations that are found as correlated mutations in the HokC family multiple sequence alignment. In agreement with these observations, transmembrane proteins show higher probability to present correlated mutations and are less densely packed locally than globular proteins; previous mutagenesis results on transmembrane proteins further support our observations on the high tolerability to mutations of transmembrane regions of proteins. Thus, our experimental results reveal the HokC transmembrane region high tolerance to loss-of-function mutations that is associated with low sequence conservation and high rate of correlated mutations in the HokC family sequences alignment, which are features shared with other transmembrane proteins.
Collapse
|
30
|
Dunham AS, Beltrao P. Exploring amino acid functions in a deep mutational landscape. Mol Syst Biol 2021; 17:e10305. [PMID: 34292650 PMCID: PMC8297461 DOI: 10.15252/msb.202110305] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Revised: 06/29/2021] [Accepted: 06/30/2021] [Indexed: 12/21/2022] Open
Abstract
Amino acids fulfil a diverse range of roles in proteins, each utilising its chemical properties in different ways in different contexts to create required functions. For example, cysteines form disulphide or hydrogen bonds in different circumstances and charged amino acids do not always make use of their charge. The repertoire of amino acid functions and the frequency at which they occur in proteins remains understudied. Measuring large numbers of mutational consequences, which can elucidate the role an amino acid plays, was prohibitively time-consuming until recent developments in deep mutational scanning. In this study, we gathered data from 28 deep mutational scanning studies, covering 6,291 positions in 30 proteins, and used the consequences of mutation at each position to define a mutational landscape. We demonstrated rich relationships between this landscape and biophysical or evolutionary properties. Finally, we identified 100 functional amino acid subtypes with a data-driven clustering analysis and studied their features, including their frequencies and chemical properties such as tolerating polarity, hydrophobicity or being intolerant of charge or specific amino acids. The mutational landscape and amino acid subtypes provide a foundational catalogue of amino acid functional diversity, which will be refined as the number of studied protein positions increases.
Collapse
Affiliation(s)
- Alistair S Dunham
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)CambridgeUK
| | - Pedro Beltrao
- European Molecular Biology LaboratoryEuropean Bioinformatics Institute (EMBL‐EBI)CambridgeUK
| |
Collapse
|
31
|
Quantifying the Mutational Robustness of Protein-Coding Genes. J Mol Evol 2021; 89:357-369. [PMID: 33934169 DOI: 10.1007/s00239-021-10009-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2020] [Accepted: 04/05/2021] [Indexed: 10/21/2022]
Abstract
We use large-scale mutagenesis data and computer simulations to quantify the mutational robustness of protein-coding genes by taking into account constraints arising from protein function and the genetic code. Analyses of the distribution of amino acid substitutions from 18 mutagenesis studies revealed an average of 45% of neutral variants; while mutagenesis data of 12 proteins artificially designed under no other constraints but stability, reach an average of 60%. Simulations using a lattice protein model allow us to contrast these estimates to the expected mutational robustness of protein families by generating unbiased samples of foldable sequences, which we find to have 30% of neutral variants. In agreement with mutagenesis data of designed proteins, the model shows that maximally robust protein families might access up to twice the amount of neutral variants observed in the unbiased samples (i.e. 60%). A biophysical model of protein-ligand binding suggests that constraints associated to molecular function have only a moderate impact on robustness of approximately 5 to 10% of neutral variants; and that the direction of this effect depends on the relation between functional performance and thermodynamic stability. Although the genetic code constraints the access of a gene's nucleotide sequence to only 30% of the full distribution of amino acid mutations, it provides an extra 15 to 20% of neutral variants to the estimations above, such that the expected, observed, and maximal robustness of protein-coding genes are approximately 50, 65, and 75%, respectively. We discuss our results in the light of three main hypothesis put forward to explain the existence of mutationally robust genes.
Collapse
|
32
|
Mechanistic basis for ubiquitin modulation of a protein energy landscape. Proc Natl Acad Sci U S A 2021; 118:2025126118. [PMID: 33723075 DOI: 10.1073/pnas.2025126118] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Ubiquitin is a common posttranslational modification canonically associated with targeting proteins to the 26S proteasome for degradation and also plays a role in numerous other nondegradative cellular processes. Ubiquitination at certain sites destabilizes the substrate protein, with consequences for proteasomal processing, while ubiquitination at other sites has little energetic effect. How this site specificity-and, by extension, the myriad effects of ubiquitination on substrate proteins-arises remains unknown. Here, we systematically characterize the atomic-level effects of ubiquitination at various sites on a model protein, barstar, using a combination of NMR, hydrogen-deuterium exchange mass spectrometry, and molecular dynamics simulation. We find that, regardless of the site of modification, ubiquitination does not induce large structural rearrangements in the substrate. Destabilizing modifications, however, increase fluctuations from the native state resulting in exposure of the substrate's C terminus. Both of the sites occur in regions of barstar with relatively high conformational flexibility. Nevertheless, destabilization appears to occur through different thermodynamic mechanisms, involving a reduction in entropy in one case and a loss in enthalpy in another. By contrast, ubiquitination at a nondestabilizing site protects the substrate C terminus through intermittent formation of a structural motif with the last three residues of ubiquitin. Thus, the biophysical effects of ubiquitination at a given site depend greatly on local context. Taken together, our results reveal how a single posttranslational modification can generate a broad array of distinct effects, providing a framework to guide the design of proteins and therapeutics with desired degradation and quality control properties.
Collapse
|
33
|
Bhasin M, Varadarajan R. Prediction of Function Determining and Buried Residues Through Analysis of Saturation Mutagenesis Datasets. Front Mol Biosci 2021; 8:635425. [PMID: 33778004 PMCID: PMC7991590 DOI: 10.3389/fmolb.2021.635425] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 01/25/2021] [Indexed: 11/13/2022] Open
Abstract
Mutational scanning can be used to probe effects of large numbers of point mutations on protein function. Positions affected by mutation are primarily at either buried or at exposed residues directly involved in function, hereafter designated as active-site residues. In the absence of prior structural information, it has not been easy to distinguish between these two categories of residues. We curated and analyzed a set of twelve published deep mutational scanning datasets. The analysis revealed differential patterns of mutational sensitivity and substitution preferences at buried and exposed positions. Prediction of buried-sites solely from the mutational sensitivity data was facilitated by incorporating predicted sequence-based accessibility values. For active-site residues we observed mean sensitivity, specificity and accuracy of 61, 90 and 88% respectively. For buried residues the corresponding figures were 59, 90 and 84% while for exposed non active-site residues these were 98, 44 and 82% respectively. We also identified positions which did not follow these general trends and might require further experimental re-validation. This analysis highlights the ability of deep mutational scans to provide important structural and functional insights, even in the absence of three-dimensional structures determined using conventional structure determination techniques, and also discuss some limitations of the methodology.
Collapse
Affiliation(s)
- Munmun Bhasin
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
| | - Raghavan Varadarajan
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India
- Jawaharlal Nehru Centre for Advanced Scientific Research, Bangalore, India
| |
Collapse
|
34
|
Strokach A, Lu TY, Kim PM. ELASPIC2 (EL2): Combining Contextualized Language Models and Graph Neural Networks to Predict Effects of Mutations. J Mol Biol 2021; 433:166810. [PMID: 33450251 DOI: 10.1016/j.jmb.2021.166810] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/26/2020] [Revised: 12/19/2020] [Accepted: 01/03/2021] [Indexed: 12/21/2022]
Abstract
The ELASPIC web server allows users to evaluate the effect of mutations on protein folding and protein-protein interaction on a proteome-wide scale. It uses homology models of proteins and protein-protein interactions, which have been precalculated for several proteomes, and machine learning models, which integrate structural information with sequence conservation scores, in order to make its predictions. Since the original publication of the ELASPIC web server, several advances have motivated a revisiting of the problem of mutation effect prediction. First, progress in neural network architectures and self-supervised pre-trained has resulted in models which provide more informative embeddings of protein sequence and structure than those used by the original version of ELASPIC. Second, the amount of training data has increased several-fold, largely driven by advances in deep mutation scanning and other multiplexed assays of variant effect. Here, we describe two machine learning models which leverage the recent advances in order to achieve superior accuracy in predicting the effect of mutation on protein folding and protein-protein interaction. The models incorporate features generated using pre-trained transformer- and graph convolution-based neural networks, and are trained to optimize a ranking objective function, which permits the use of heterogeneous training data. The outputs from the new models have been incorporated into the ELASPIC web server, available at http://elaspic.kimlab.org.
Collapse
Affiliation(s)
- Alexey Strokach
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Tian Yu Lu
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada
| | - Philip M Kim
- Department of Computer Science, University of Toronto, Toronto, ON M5S 3E1, Canada; Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON M5S 3E1, Canada; Department of Molecular Genetics, University of Toronto, Toronto, ON M5S 3E1, Canada.
| |
Collapse
|
35
|
Srinivasan S, Zoni V, Vanni S. Estimating the accuracy of the MARTINI model towards the investigation of peripheral protein–membrane interactions. Faraday Discuss 2021; 232:131-148. [DOI: 10.1039/d0fd00058b] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In this article, we investigate the ability of the MARTINI CG force field, specifically the 3 open-beta version, to reproduce known experimental observations regarding the membrane binding behavior of 12 peripheral membrane proteins and peptides.
Collapse
Affiliation(s)
| | - Valeria Zoni
- Department of Biology, University of Fribourg, Switzerland
| | - Stefano Vanni
- Department of Biology, University of Fribourg, Switzerland
| |
Collapse
|
36
|
Munro D, Singh M. DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction. Bioinformatics 2020; 36:5322-5329. [PMID: 33325500 PMCID: PMC8016454 DOI: 10.1093/bioinformatics/btaa1030] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 10/16/2020] [Accepted: 11/30/2020] [Indexed: 01/27/2023] Open
Abstract
Motivation Accurately predicting the quantitative impact of a substitution on a protein’s molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness. Results We introduce DeMaSk, an intuitive and interpretable method based only upon DMS datasets and sequence homologs that predicts the impact of missense mutations within any protein. DeMaSk first infers a directional amino acid substitution matrix from DMS datasets and then fits a linear model that combines these substitution scores with measures of per-position evolutionary conservation and variant frequency across homologs. Despite its simplicity, DeMaSk has state-of-the-art performance in predicting the impact of amino acid substitutions, and can easily and rapidly be applied to any protein sequence. Availability and implementation https://demask.princeton.edu generates fitness impact predictions and visualizations for any user-submitted protein sequence. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel Munro
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA
| | - Mona Singh
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA.,Department of Computer Science, Princeton University, Princeton, 08544, USA
| |
Collapse
|
37
|
Wagner A. Information Theory Can Help Quantify the Potential of New Phenotypes to Originate as Exaptations. Front Ecol Evol 2020. [DOI: 10.3389/fevo.2020.564071] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
Exaptations are adaptive traits that do not originate de novo but from other adaptive traits. They include complex macroscopic traits, such as the middle ear bones of mammals, which originated from reptile jaw bones, but also molecular traits, such as new binding sites of transcriptional regulators. What determines whether a trait originates de novo or as an exaptation is unknown. I here use simple information theoretic concepts to quantify a molecular phenotype’s potential to give rise to new phenotypes. These quantities rely on the amount of genetic information needed to encode a phenotype. I use these quantities to estimate the propensity of new transcription factor binding phenotypes to emerge de novo or exaptively, and do so for 187 mouse transcription factors. I also use them to quantify whether an organism’s viability in one of 10 different chemical environment is likely to arise exaptively. I show that informationally expensive traits are more likely to originate exaptively. Exaptive evolution is only sometimes favored for new transcription factor binding, but it is always favored for the informationally complex metabolic phenotypes I consider. As our ability to genotype evolving populations increases, so will our ability to understand how phenotypes of ever-increasing informational complexity originate in evolution.
Collapse
|
38
|
Ruggiero MJ, Malhotra S, Fenton AW, Swint-Kruse L, Karanicolas J, Hagenbuch B. A clinically relevant polymorphism in the Na +/taurocholate cotransporting polypeptide (NTCP) occurs at a rheostat position. J Biol Chem 2020; 296:100047. [PMID: 33168628 PMCID: PMC7948949 DOI: 10.1074/jbc.ra120.014889] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2020] [Revised: 10/22/2020] [Accepted: 11/09/2020] [Indexed: 12/28/2022] Open
Abstract
Conventionally, most amino acid substitutions at “important” protein positions are expected to abolish function. However, in several soluble-globular proteins, we identified a class of nonconserved positions for which various substitutions produced progressive functional changes; we consider these evolutionary “rheostats”. Here, we report a strong rheostat position in the integral membrane protein, Na+/taurocholate (TCA) cotransporting polypeptide, at the site of a pharmacologically relevant polymorphism (S267F). Functional studies were performed for all 20 substitutions (S267X) with three substrates (TCA, estrone-3-sulfate, and rosuvastatin). The S267X set showed strong rheostatic effects on overall transport, and individual substitutions showed varied effects on transport kinetics (Km and Vmax) and substrate specificity. To assess protein stability, we measured surface expression and used the Rosetta software (https://www.rosettacommons.org) suite to model structure and stability changes of S267X. Although buried near the substrate-binding site, S267X substitutions were easily accommodated in the Na+/TCA cotransporting polypeptide structure model. Across the modest range of changes, calculated stabilities correlated with surface-expression differences, but neither parameter correlated with altered transport. Thus, substitutions at rheostat position 267 had wide-ranging effects on the phenotype of this integral membrane protein. We further propose that polymorphic positions in other proteins might be locations of rheostat positions.
Collapse
Affiliation(s)
- Melissa J Ruggiero
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Shipra Malhotra
- Program in Molecular Therapeutics, Fox Chase Cancer Center, Philadelphia, Pennsylvania, USA; Center for Computational Biology, University of Kansas, Lawrence, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - John Karanicolas
- Program in Molecular Therapeutics, Fox Chase Cancer Center, Philadelphia, Pennsylvania, USA
| | - Bruno Hagenbuch
- Department of Pharmacology, Toxicology and Therapeutics, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| |
Collapse
|
39
|
Wang YS, Wu KP, Jiang HK, Kurkute P, Chen RH. Branched Ubiquitination: Detection Methods, Biological Functions and Chemical Synthesis. Molecules 2020; 25:E5200. [PMID: 33182242 PMCID: PMC7664869 DOI: 10.3390/molecules25215200] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/18/2020] [Revised: 11/05/2020] [Accepted: 11/07/2020] [Indexed: 11/16/2022] Open
Abstract
Ubiquitination is a versatile posttranslational modification that elicits signaling roles to impact on various cellular processes and disease states. The versatility is a result of the complexity of ubiquitin conjugates, ranging from a single ubiquitin monomer to polymers with different length and linkage types. Recent studies have revealed the abundant existence of branched ubiquitin chains in which one ubiquitin molecule is connected to two or more ubiquitin moieties in the same ubiquitin polymer. Compared to the homotypic ubiquitin chain, the branched chain is recognized or processed differently by readers and erasers of the ubiquitin system, respectively, resulting in a qualitative or quantitative alteration of the functional output. Furthermore, certain types of branched ubiquitination are induced by cellular stresses, implicating their important physiological role in stress adaption. In addition, the current chemical methodologies of solid phase peptide synthesis and expanding genetic code approach have been developed to synthesize different architectures of branched ubiquitin chains. The synthesized branched ubiquitin chains have shown their significance in understanding the topologies and binding partners of the branched chains. Here, we discuss the recent progresses on the detection, functional characterization and synthesis of branched ubiquitin chains as well as the future perspectives of this emerging field.
Collapse
Affiliation(s)
- Yane-Shih Wang
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; (H.-K.J.); (P.K.)
- Institute of Biochemical Sciences, College of Life Science, National Taiwan University, Taipei 10617, Taiwan
- Chemical Biology and Molecular Biophysics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 11529, Taiwan
| | - Kuen-Phon Wu
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; (H.-K.J.); (P.K.)
- Institute of Biochemical Sciences, College of Life Science, National Taiwan University, Taipei 10617, Taiwan
| | - Han-Kai Jiang
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; (H.-K.J.); (P.K.)
- Chemical Biology and Molecular Biophysics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 11529, Taiwan
- Department of Chemistry, National Tsing Hua University, Hsinchu 30044, Taiwan
| | - Prashant Kurkute
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; (H.-K.J.); (P.K.)
- Chemical Biology and Molecular Biophysics Program, Taiwan International Graduate Program, Academia Sinica, Taipei 11529, Taiwan
- Department of Chemistry, National Taiwan University, Taipei 10617, Taiwan
| | - Ruey-Hwa Chen
- Institute of Biological Chemistry, Academia Sinica, Taipei 11529, Taiwan; (H.-K.J.); (P.K.)
- Institute of Biochemical Sciences, College of Life Science, National Taiwan University, Taipei 10617, Taiwan
| |
Collapse
|
40
|
Pacheco-García JL, Cano-Muñoz M, Sánchez-Ramos I, Salido E, Pey AL. Naturally-Occurring Rare Mutations Cause Mild to Catastrophic Effects in the Multifunctional and Cancer-Associated NQO1 Protein. J Pers Med 2020; 10:E207. [PMID: 33153185 PMCID: PMC7711955 DOI: 10.3390/jpm10040207] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2020] [Revised: 10/27/2020] [Accepted: 11/02/2020] [Indexed: 12/13/2022] Open
Abstract
The functional and pathological implications of the enormous genetic diversity of the human genome are mostly unknown, primarily due to our unability to predict pathogenicity in a high-throughput manner. In this work, we characterized the phenotypic consequences of eight naturally-occurring missense variants on the multifunctional and disease-associated NQO1 protein using biophysical and structural analyses on several protein traits. Mutations found in both exome-sequencing initiatives and in cancer cell lines cause mild to catastrophic effects on NQO1 stability and function. Importantly, some mutations perturb functional features located structurally far from the mutated site. These effects are well rationalized by considering the nature of the mutation, its location in protein structure and the local stability of its environment. Using a set of 22 experimentally characterized mutations in NQO1, we generated experimental scores for pathogenicity that correlate reasonably well with bioinformatic scores derived from a set of commonly used algorithms, although the latter fail to semiquantitatively predict the phenotypic alterations caused by a significant fraction of mutations individually. These results provide insight into the propagation of mutational effects on multifunctional proteins, the implementation of in silico approaches for establishing genotype-phenotype correlations and the molecular determinants underlying loss-of-function in genetic diseases.
Collapse
Affiliation(s)
- Juan Luis Pacheco-García
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Mario Cano-Muñoz
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Isabel Sánchez-Ramos
- Departamento de Química Física, Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain; (J.L.P.-G.); (M.C.-M.); (I.S.-R.)
| | - Eduardo Salido
- Centre for Biomedical Research on Rare Diseases (CIBERER), Hospital Universitario de Canarias, 38320 Tenerife, Spain;
| | - Angel L. Pey
- Departamento de Química Física y Unidad de Excelencia de Química Aplicada a Biomedicina y Medioambiente (UEQ), Facultad de Ciencias, Universidad de Granada, 18071 Granada, Spain
| |
Collapse
|
41
|
Clausen L, Stein A, Grønbæk-Thygesen M, Nygaard L, Søltoft CL, Nielsen SV, Lisby M, Ravid T, Lindorff-Larsen K, Hartmann-Petersen R. Folliculin variants linked to Birt-Hogg-Dubé syndrome are targeted for proteasomal degradation. PLoS Genet 2020; 16:e1009187. [PMID: 33137092 PMCID: PMC7660926 DOI: 10.1371/journal.pgen.1009187] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2020] [Revised: 11/12/2020] [Accepted: 10/10/2020] [Indexed: 01/24/2023] Open
Abstract
Germline mutations in the folliculin (FLCN) tumor suppressor gene are linked to Birt-Hogg-Dubé (BHD) syndrome, a dominantly inherited genetic disease characterized by predisposition to fibrofolliculomas, lung cysts, and renal cancer. Most BHD-linked FLCN variants include large deletions and splice site aberrations predicted to cause loss of function. The mechanisms by which missense variants and short in-frame deletions in FLCN trigger disease are unknown. Here, we present an integrated computational and experimental study that reveals that the majority of such disease-causing FLCN variants cause loss of function due to proteasomal degradation of the encoded FLCN protein, rather than directly ablating FLCN function. Accordingly, several different single-site FLCN variants are present at strongly reduced levels in cells. In line with our finding that FLCN variants are protein quality control targets, several are also highly insoluble and fail to associate with the FLCN-binding partners FNIP1 and FNIP2. The lack of FLCN binding leads to rapid proteasomal degradation of FNIP1 and FNIP2. Half of the tested FLCN variants are mislocalized in cells, and one variant (ΔE510) forms perinuclear protein aggregates. A yeast-based stability screen revealed that the deubiquitylating enzyme Ubp15/USP7 and molecular chaperones regulate the turnover of the FLCN variants. Lowering the temperature led to a stabilization of two FLCN missense proteins, and for one (R362C), function was re-established at low temperature. In conclusion, we propose that most BHD-linked FLCN missense variants and small in-frame deletions operate by causing misfolding and degradation of the FLCN protein, and that stabilization and resulting restoration of function may hold therapeutic potential of certain disease-linked variants. Our computational saturation scan encompassing both missense variants and single site deletions in FLCN may allow classification of rare FLCN variants of uncertain clinical significance.
Collapse
Affiliation(s)
- Lene Clausen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Amelie Stein
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Martin Grønbæk-Thygesen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Lasse Nygaard
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Cecilie L. Søltoft
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Sofie V. Nielsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Michael Lisby
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Tommer Ravid
- Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Jerusalem, Israel
| | - Kresten Lindorff-Larsen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| | - Rasmus Hartmann-Petersen
- The Linderstrøm-Lang Centre for Protein Science, Department of Biology, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
42
|
Potter ZE, Lau HT, Chakraborty S, Fang L, Guttman M, Ong SE, Fowler DM, Maly DJ. Parallel Chemoselective Profiling for Mapping Protein Structure. Cell Chem Biol 2020; 27:1084-1096.e4. [PMID: 32649906 PMCID: PMC7484201 DOI: 10.1016/j.chembiol.2020.06.014] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2020] [Revised: 05/27/2020] [Accepted: 06/19/2020] [Indexed: 01/01/2023]
Abstract
Solution-based structural techniques complement high-resolution structural data by providing insight into the oft-missed links between protein structure and dynamics. Here, we present Parallel Chemoselective Profiling, a solution-based structural method for characterizing protein structure and dynamics. Our method utilizes deep mutational scanning saturation mutagenesis data to install amino acid residues with specific chemistries at defined positions on the solvent-exposed surface of a protein. Differences in the extent of labeling of installed mutant residues are quantified using targeted mass spectrometry, reporting on each residue's local environment and structural dynamics. Using our method, we studied how conformation-selective, ATP-competitive inhibitors affect the local and global structure and dynamics of full-length Src kinase. Our results highlight how parallel chemoselective profiling can be used to study a dynamic multi-domain protein, and suggest that our method will be a useful addition to the relatively small toolkit of existing protein footprinting techniques.
Collapse
Affiliation(s)
- Zachary E Potter
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Ho-Tak Lau
- Department of Pharmacology, University of Washington, Seattle, WA 98195, USA
| | - Sujata Chakraborty
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Linglan Fang
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Miklos Guttman
- Department of Medicinal Chemistry, University of Washington, Seattle, WA 98195, USA
| | - Shao-En Ong
- Department of Pharmacology, University of Washington, Seattle, WA 98195, USA
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Department of Bioengineering, University of Washington, Seattle, WA 98195, USA
| | - Dustin J Maly
- Department of Chemistry, University of Washington, Seattle, WA 98195, USA; Department of Biochemistry, University of Washington, Seattle, WA 98195, USA.
| |
Collapse
|
43
|
Thompson S, Zhang Y, Ingle C, Reynolds KA, Kortemme T. Altered expression of a quality control protease in E. coli reshapes the in vivo mutational landscape of a model enzyme. eLife 2020; 9:53476. [PMID: 32701056 PMCID: PMC7377907 DOI: 10.7554/elife.53476] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2019] [Accepted: 07/09/2020] [Indexed: 12/03/2022] Open
Abstract
Protein mutational landscapes are shaped by the cellular environment, but key factors and their quantitative effects are often unknown. Here we show that Lon, a quality control protease naturally absent in common E. coli expression strains, drastically reshapes the mutational landscape of the metabolic enzyme dihydrofolate reductase (DHFR). Selection under conditions that resolve highly active mutants reveals that 23.3% of all single point mutations in DHFR are advantageous in the absence of Lon, but advantageous mutations are largely suppressed when Lon is reintroduced. Protein stability measurements demonstrate extensive activity-stability tradeoffs for the advantageous mutants and provide a mechanistic explanation for Lon’s widespread impact. Our findings suggest possibilities for tuning mutational landscapes by modulating the cellular environment, with implications for protein design and combatting antibiotic resistance.
Collapse
Affiliation(s)
- Samuel Thompson
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, United States
| | - Yang Zhang
- Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, United States
| | - Christine Ingle
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States
| | - Kimberly A Reynolds
- The Green Center for Systems Biology, University of Texas Southwestern Medical Center, Dallas, United States.,Department of Biophysics, University of Texas Southwestern Medical Center, Dallas, United States
| | - Tanja Kortemme
- Graduate Group in Biophysics, University of California San Francisco, San Francisco, United States.,Department of Bioengineering and Therapeutic Sciences, University of California San Francisco, San Francisco, United States.,Chan Zuckerberg Biohub, San Francisco, United States
| |
Collapse
|
44
|
Martin TA, Wu T, Tang Q, Dougherty LL, Parente DJ, Swint-Kruse L, Fenton AW. Identification of biochemically neutral positions in liver pyruvate kinase. Proteins 2020; 88:1340-1350. [PMID: 32449829 DOI: 10.1002/prot.25953] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Revised: 03/10/2020] [Accepted: 05/16/2020] [Indexed: 01/08/2023]
Abstract
Understanding how each residue position contributes to protein function has been a long-standing goal in protein science. Substitution studies have historically focused on conserved protein positions. However, substitutions of nonconserved positions can also modify function. Indeed, we recently identified nonconserved positions that have large substitution effects in human liver pyruvate kinase (hLPYK), including altered allosteric coupling. To facilitate a comparison of which characteristics determine when a nonconserved position does vs does not contribute to function, the goal of the current work was to identify neutral positions in hLPYK. However, existing hLPYK data showed that three features commonly associated with neutral positions-high sequence entropy, high surface exposure, and alanine scanning-lacked the sensitivity needed to guide experimental studies. We used multiple evolutionary patterns identified in a sequence alignment of the PYK family to identify which positions were least patterned, reasoning that these were most likely to be neutral. Nine positions were tested with a total of 117 amino acid substitutions. Although exploring all potential functions is not feasible for any protein, five parameters associated with substrate/effector affinities and allosteric coupling were measured for hLPYK variants. For each position, the aggregate functional outcomes of all variants were used to quantify a "neutrality" score. Three positions showed perfect neutral scores for all five parameters. Furthermore, the nine positions showed larger neutral scores than 17 positions located near allosteric binding sites. Thus, our strategy successfully enriched the dataset for positions with neutral and modest substitutions.
Collapse
Affiliation(s)
- Tyler A Martin
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Tiffany Wu
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Qingling Tang
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Larissa L Dougherty
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Daniel J Parente
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.,Department of Family and Community Medicine, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
45
|
A novel polyubiquitin chain linkage formed by viral Ubiquitin is resistant to host deubiquitinating enzymes. Biochem J 2020; 477:2193-2219. [PMID: 32478812 DOI: 10.1042/bcj20200289] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2020] [Revised: 05/29/2020] [Accepted: 05/29/2020] [Indexed: 11/17/2022]
Abstract
The Baculoviridae family of viruses encode a viral Ubiquitin (vUb) gene. Though the vUb is homologous to the host eukaryotic Ubiquitin (Ub), its preservation in the viral genome indicates unique functions that are not compensated by the host Ub. We report the structural, biophysical, and biochemical properties of the vUb from Autographa californica multiple nucleo-polyhedrosis virus (AcMNPV). The packing of central helix α1 to the beta-sheet β1-β5 is different between vUb and Ub. Consequently, its stability is lower compared with Ub. However, the surface properties, ubiquitination activity, and the interaction with Ubiquitin-binding domains are similar between vUb and Ub. Interestingly, vUb forms atypical polyubiquitin chain linked by lysine at the 54th position (K54), and the deubiquitinating enzymes are ineffective against the K54-linked polyubiquitin chains. We propose that the modification of host/viral proteins with the K54-linked chains is an effective way selected by the virus to protect the vUb signal from host DeUbiquitinases.
Collapse
|
46
|
Chen JZ, Fowler DM, Tokuriki N. Comprehensive exploration of the translocation, stability and substrate recognition requirements in VIM-2 lactamase. eLife 2020; 9:56707. [PMID: 32510322 PMCID: PMC7308095 DOI: 10.7554/elife.56707] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2020] [Accepted: 06/06/2020] [Indexed: 12/12/2022] Open
Abstract
Metallo-β-lactamases (MBLs) degrade a broad spectrum of β-lactam antibiotics, and are a major disseminating source for multidrug resistant bacteria. Despite many biochemical studies in diverse MBLs, molecular understanding of the roles of residues in the enzyme’s stability and function, and especially substrate specificity, is lacking. Here, we employ deep mutational scanning (DMS) to generate comprehensive single amino acid variant data on a major clinical MBL, VIM-2, by measuring the effect of thousands of VIM-2 mutants on the degradation of three representative classes of β-lactams (ampicillin, cefotaxime, and meropenem) and at two different temperatures (25°C and 37°C). We revealed residues responsible for expression and translocation, and mutations that increase resistance and/or alter substrate specificity. The distribution of specificity-altering mutations unveiled distinct molecular recognition of the three substrates. Moreover, these function-altering mutations are frequently observed among naturally occurring variants, suggesting that the enzymes have continuously evolved to become more potent resistance genes.
Collapse
Affiliation(s)
- John Z Chen
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, United States.,Department of Bioengineering, University of Washington, Seattle, United States
| | - Nobuhiko Tokuriki
- Michael Smith Laboratories, University of British Columbia, Vancouver, Canada
| |
Collapse
|
47
|
Abstract
To achieve the full potential of pharmacogenomics, one must accurately predict the functional outcomes that arise from amino acid substitutions in proteins. Classically, researchers have focused on understanding the consequences of individual substitutions. However, literature surveys have shown that most substitutions were created at evolutionarily conserved positions. Awareness of this bias leads to a shift in perspective, from considering the outcomes of individual substitutions to understanding the roles of individual protein positions. Conserved positions tend to act as “toggle” switches, with most substitutions abolishing function. However, nonconserved positions have been found equally capable of affecting protein function. Indeed, many nonconserved positions act like functional dimmer switches (“rheostat” positions): this is revealed when multiple substitutions are made at a single position. Each substitution has a different functional outcome; the set of substitutions spans a range of outcomes. Finally, some nonconserved positions appear neutral, capable of accommodating all amino acid types without modifying function. This paper reviews the currently-known properties of rheostat positions, with examples shown for pyruvate kinase, organic anion transporting polypeptide 1B1, the beta-lactamase inhibitory protein, and angiotensin-converting enzyme 2. Outcomes observed for rheostat positions have implications for the rational design of drug analogs and allosteric drugs. Furthermore, this new framework—comprising three types of protein positions—provides a new approach to interpreting disease and population-based databases of amino acid changes. In conclusion, although a full understanding of substitution outcomes at rheostat positions poses a challenge, utilization of this new frame of reference will further advance the application of pharmacogenomics.
Collapse
|
48
|
Aharon L, Aharoni SL, Radisky ES, Papo N. Quantitative mapping of binding specificity landscapes for homologous targets by using a high-throughput method. Biochem J 2020; 477:1701-1719. [PMID: 32296833 PMCID: PMC7376575 DOI: 10.1042/bcj20200188] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2020] [Revised: 04/11/2020] [Accepted: 04/14/2020] [Indexed: 01/08/2023]
Abstract
To facilitate investigations of protein-protein interactions (PPIs), we developed a novel platform for quantitative mapping of protein binding specificity landscapes, which combines the multi-target screening of a mutagenesis library into high- and low-affinity populations with sophisticated next-generation sequencing analysis. Importantly, this method generates accurate models to predict affinity and specificity values for any mutation within a protein complex, and requires only a few experimental binding affinity measurements using purified proteins for calibration. We demonstrated the utility of the approach by mapping quantitative landscapes for interactions between the N-terminal domain of the tissue inhibitor of metalloproteinase 2 (N-TIMP2) and three matrix metalloproteinases (MMPs) having homologous structures but different affinities (MMP-1, MMP-3, and MMP-14). The binding landscapes for N-TIMP2/MMP-1 and N-TIMP2/MMP-3 showed the PPIs to be almost fully optimized, with most single mutations giving a loss of affinity. In contrast, the non-optimized PPI for N-TIMP2/MMP-14 was reflected in a wide range of binding affinities, where single mutations exhibited a far more attenuated effect on the PPI. Our new platform reliably and comprehensively identified not only hot- and cold-spot residues, but also specificity-switch mutations that shape target affinity and specificity. Thus, our approach provides a methodology giving an unprecedentedly rich quantitative analysis of the binding specificity landscape, which will broaden the understanding of the mechanisms and evolutionary origins of specific PPIs and facilitate the rational design of specific inhibitors for structurally similar target proteins.
Collapse
Affiliation(s)
- Lidan Aharon
- Department of Biotechnology Engineering and the National Institute of Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel
| | - Shay-Lee Aharoni
- Department of Biotechnology Engineering and the National Institute of Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel
| | - Evette S. Radisky
- Department of Cancer Biology, Mayo Clinic Comprehensive Cancer Center, Jacksonville 32224, Florida, USA
| | - Niv Papo
- Department of Biotechnology Engineering and the National Institute of Biotechnology in the Negev, Ben-Gurion University of the Negev, Beer-Sheva 84105, Israel
| |
Collapse
|
49
|
Abstract
The distribution of fitness effects of mutation plays a central role in constraining protein evolution. The underlying mechanisms by which mutations lead to fitness effects are typically attributed to changes in protein specific activity or abundance. Here, we reveal the importance of a mutation's collateral fitness effects, which we define as effects that do not derive from changes in the protein's ability to perform its physiological function. We comprehensively measured the collateral fitness effects of missense mutations in the Escherichia coli TEM-1 β-lactamase antibiotic resistance gene using growth competition experiments in the absence of antibiotic. At least 42% of missense mutations in TEM-1 were deleterious, indicating that for some proteins collateral fitness effects occur as frequently as effects on protein activity and abundance. Deleterious mutations caused improper posttranslational processing, incorrect disulfide-bond formation, protein aggregation, changes in gene expression, and pleiotropic effects on cell phenotype. Deleterious collateral fitness effects occurred more frequently in TEM-1 than deleterious effects on antibiotic resistance in environments with low concentrations of the antibiotic. The surprising prevalence of deleterious collateral fitness effects suggests they may play a role in constraining protein evolution, particularly for highly expressed proteins, for proteins under intermittent selection for their physiological function, and for proteins whose contribution to fitness is buffered against deleterious effects on protein activity and protein abundance.
Collapse
|
50
|
Abstract
Knowledge of the distribution of fitness effects (DFE) of mutations is critical to the understanding of protein evolution. Here, we describe methods for large-scale, systematic measurements of the DFE using growth competition and deep mutational scanning. We discuss techniques for producing comprehensive libraries of gene variants as well as provide necessary considerations for designing these experiments. Using these methods, we have constructed libraries containing over 18,000 variants, measured fitness effects of these mutations by deep mutational scanning, and verified the presence of fitness effects in individual variants. Our methods provide a high-throughput protocol for measuring biological fitness effects of mutations and the dependence of fitness effects on the environment.
Collapse
|