1
|
Lefèbre J, Falk T, Ning Y, Rademacher C. Secondary Sites of the C-type Lectin-Like Fold. Chemistry 2024; 30:e202400660. [PMID: 38527187 DOI: 10.1002/chem.202400660] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/18/2024] [Revised: 03/23/2024] [Accepted: 03/25/2024] [Indexed: 03/27/2024]
Abstract
C-type lectins are a large superfamily of proteins involved in a multitude of biological processes. In particular, their involvement in immunity and homeostasis has rendered them attractive targets for diverse therapeutic interventions. They share a characteristic C-type lectin-like domain whose adaptability enables them to bind a broad spectrum of ligands beyond the originally defined canonical Ca2+-dependent carbohydrate binding. Together with variable domain architecture and high-level conformational plasticity, this enables C-type lectins to meet diverse functional demands. Secondary sites provide another layer of regulation and are often intricately linked to functional diversity. Located remote from the canonical primary binding site, secondary sites can accommodate ligands with other physicochemical properties and alter protein dynamics, thus enhancing selectivity and enabling fine-tuning of the biological response. In this review, we outline the structural determinants allowing C-type lectins to perform a large variety of tasks and to accommodate the ligands associated with it. Using the six well-characterized Ca2+-dependent and Ca2+-independent C-type lectin receptors DC-SIGN, langerin, MGL, dectin-1, CLEC-2 and NKG2D as examples, we focus on the characteristics of non-canonical interactions and secondary sites and their potential use in drug discovery endeavors.
Collapse
Affiliation(s)
- Jonathan Lefèbre
- Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
- Vienna Doctoral School of Pharmaceutical, Nutritional and Sport, Sciences, University of Vienna, Vienna, Austria
- Department of Microbiology, Immunology and Genetics, University of Vienna, Max F. Perutz Labs, Vienna, Austria
| | - Torben Falk
- Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
- Vienna Doctoral School of Pharmaceutical, Nutritional and Sport, Sciences, University of Vienna, Vienna, Austria
- Department of Microbiology, Immunology and Genetics, University of Vienna, Max F. Perutz Labs, Vienna, Austria
| | - Yunzhan Ning
- Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
- Vienna Doctoral School of Pharmaceutical, Nutritional and Sport, Sciences, University of Vienna, Vienna, Austria
- Department of Microbiology, Immunology and Genetics, University of Vienna, Max F. Perutz Labs, Vienna, Austria
| | - Christoph Rademacher
- Department of Pharmaceutical Sciences, University of Vienna, Vienna, Austria
- Department of Microbiology, Immunology and Genetics, University of Vienna, Max F. Perutz Labs, Vienna, Austria
| |
Collapse
|
2
|
Omelchenko AA, Siwek JC, Chhibbar P, Arshad S, Nazarali I, Nazarali K, Rosengart A, Rahimikollu J, Tilstra J, Shlomchik MJ, Koes DR, Joglekar AV, Das J. Sliding Window INteraction Grammar (SWING): a generalized interaction language model for peptide and protein interactions. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.05.01.592062. [PMID: 38746274 PMCID: PMC11092674 DOI: 10.1101/2024.05.01.592062] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The explosion of sequence data has allowed the rapid growth of protein language models (pLMs). pLMs have now been employed in many frameworks including variant-effect and peptide-specificity prediction. Traditionally, for protein-protein or peptide-protein interactions (PPIs), corresponding sequences are either co-embedded followed by post-hoc integration or the sequences are concatenated prior to embedding. Interestingly, no method utilizes a language representation of the interaction itself. We developed an interaction LM (iLM), which uses a novel language to represent interactions between protein/peptide sequences. Sliding Window Interaction Grammar (SWING) leverages differences in amino acid properties to generate an interaction vocabulary. This vocabulary is the input into a LM followed by a supervised prediction step where the LM's representations are used as features. SWING was first applied to predicting peptide:MHC (pMHC) interactions. SWING was not only successful at generating Class I and Class II models that have comparable prediction to state-of-the-art approaches, but the unique Mixed Class model was also successful at jointly predicting both classes. Further, the SWING model trained only on Class I alleles was predictive for Class II, a complex prediction task not attempted by any existing approach. For de novo data, using only Class I or Class II data, SWING also accurately predicted Class II pMHC interactions in murine models of SLE (MRL/lpr model) and T1D (NOD model), that were validated experimentally. To further evaluate SWING's generalizability, we tested its ability to predict the disruption of specific protein-protein interactions by missense mutations. Although modern methods like AlphaMissense and ESM1b can predict interfaces and variant effects/pathogenicity per mutation, they are unable to predict interaction-specific disruptions. SWING was successful at accurately predicting the impact of both Mendelian mutations and population variants on PPIs. This is the first generalizable approach that can accurately predict interaction-specific disruptions by missense mutations with only sequence information. Overall, SWING is a first-in-class generalizable zero-shot iLM that learns the language of PPIs.
Collapse
Affiliation(s)
- Alisa A. Omelchenko
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
- The joint CMU-Pitt PhD program in computational biology, School of Medicine, University of Pittsburgh, PA, USA
| | - Jane C. Siwek
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
- The joint CMU-Pitt PhD program in computational biology, School of Medicine, University of Pittsburgh, PA, USA
| | - Prabal Chhibbar
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Integrative systems biology PhD program, School of Medicine, University of Pittsburgh, PA, USA
| | - Sanya Arshad
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Iliyan Nazarali
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Kiran Nazarali
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - AnnaElaine Rosengart
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - Javad Rahimikollu
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
- The joint CMU-Pitt PhD program in computational biology, School of Medicine, University of Pittsburgh, PA, USA
| | - Jeremy Tilstra
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Division of Rheumatology and Clinical Immunology, Department of Medicine, School of Medicine, University of Pittsburgh, PA, USA
| | - Mark J. Shlomchik
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
| | - David R. Koes
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
| | - Alok V. Joglekar
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
| | - Jishnu Das
- Center for Systems immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Immunology, School of Medicine, University of Pittsburgh, Pittsburgh, PA, USA
- Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, PA, USA
| |
Collapse
|
3
|
Klyshko E, Kim JSH, McGough L, Valeeva V, Lee E, Ranganathan R, Rauscher S. Functional protein dynamics in a crystal. Nat Commun 2024; 15:3244. [PMID: 38622111 PMCID: PMC11018856 DOI: 10.1038/s41467-024-47473-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2023] [Accepted: 04/02/2024] [Indexed: 04/17/2024] Open
Abstract
Proteins are molecular machines and to understand how they work, we need to understand how they move. New pump-probe time-resolved X-ray diffraction methods open up ways to initiate and observe protein motions with atomistic detail in crystals on biologically relevant timescales. However, practical limitations of these experiments demands parallel development of effective molecular dynamics approaches to accelerate progress and extract meaning. Here, we establish robust and accurate methods for simulating dynamics in protein crystals, a nontrivial process requiring careful attention to equilibration, environmental composition, and choice of force fields. With more than seven milliseconds of sampling of a single chain, we identify critical factors controlling agreement between simulation and experiments and show that simulated motions recapitulate ligand-induced conformational changes. This work enables a virtuous cycle between simulation and experiments for visualizing and understanding the basic functional motions of proteins.
Collapse
Affiliation(s)
- Eugene Klyshko
- Department of Physics, University of Toronto, Toronto, ON, Canada
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Justin Sung-Ho Kim
- Department of Physics, University of Toronto, Toronto, ON, Canada
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Lauren McGough
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Victoria Valeeva
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Ethan Lee
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Rama Ranganathan
- Center for Physics of Evolving Systems and Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
| | - Sarah Rauscher
- Department of Physics, University of Toronto, Toronto, ON, Canada.
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada.
- Department of Chemistry, University of Toronto, Toronto, ON, Canada.
| |
Collapse
|
4
|
Biswas A, Choudhuri I, Arnold E, Lyumkis D, Haldane A, Levy RM. Kinetic coevolutionary models predict the temporal emergence of HIV-1 resistance mutations under drug selection pressure. Proc Natl Acad Sci U S A 2024; 121:e2316662121. [PMID: 38557187 PMCID: PMC11009627 DOI: 10.1073/pnas.2316662121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/25/2023] [Accepted: 02/23/2024] [Indexed: 04/04/2024] Open
Abstract
Drug resistance in HIV type 1 (HIV-1) is a pervasive problem that affects the lives of millions of people worldwide. Although records of drug-resistant mutations (DRMs) have been extensively tabulated within public repositories, our understanding of the evolutionary kinetics of DRMs and how they evolve together remains limited. Epistasis, the interaction between a DRM and other residues in HIV-1 protein sequences, is key to the temporal evolution of drug resistance. We use a Potts sequence-covariation statistical-energy model of HIV-1 protein fitness under drug selection pressure, which captures epistatic interactions between all positions, combined with kinetic Monte-Carlo simulations of sequence evolutionary trajectories, to explore the acquisition of DRMs as they arise in an ensemble of drug-naive patient protein sequences. We follow the time course of 52 DRMs in the enzymes protease, RT, and integrase, the primary targets of antiretroviral therapy. The rates at which DRMs emerge are highly correlated with their observed acquisition rates reported in the literature when drug pressure is applied. This result highlights the central role of epistasis in determining the kinetics governing DRM emergence. Whereas rapidly acquired DRMs begin to accumulate as soon as drug pressure is applied, slowly acquired DRMs are contingent on accessory mutations that appear only after prolonged drug pressure. We provide a foundation for using computational methods to determine the temporal evolution of drug resistance using Potts statistical potentials, which can be used to gain mechanistic insights into drug resistance pathways in HIV-1 and other infectious agents.
Collapse
Affiliation(s)
- Avik Biswas
- Center for Biophysics and Computational Biology, College of Science and Technology, Temple University, Philadelphia, PA19122
- Laboratory of Genetics, The Salk Institute for Biological Studies, La Jolla, CA92037
- Department of Physics, University of California San Diego, La Jolla, CA92093
| | - Indrani Choudhuri
- Center for Biophysics and Computational Biology, College of Science and Technology, Temple University, Philadelphia, PA19122
- Department of Chemistry, Temple University, Philadelphia, PA19122
| | - Eddy Arnold
- Department of Chemistry and Chemical Biology, Center for Advanced Biotechnology and Medicine, Rutgers University, Piscataway, NJ08854
| | - Dmitry Lyumkis
- Laboratory of Genetics, The Salk Institute for Biological Studies, La Jolla, CA92037
- Graduate School of Biological Sciences, Department of Molecular Biology, University of California San Diego, La Jolla, CA92093
| | - Allan Haldane
- Center for Biophysics and Computational Biology, College of Science and Technology, Temple University, Philadelphia, PA19122
- Department of Physics, Temple University, Philadelphia, PA19122
| | - Ronald M. Levy
- Center for Biophysics and Computational Biology, College of Science and Technology, Temple University, Philadelphia, PA19122
- Department of Chemistry, Temple University, Philadelphia, PA19122
| |
Collapse
|
5
|
Ohler A, Taylor PE, Bledsoe JA, Iavarone AT, Gilbert NC, Offenbacher AR. Identification of the Thermal Activation Network in Human 15-Lipoxygenase-2: Divergence from Plant Orthologs and Its Relationship to Hydrogen Tunneling Activation Barriers. ACS Catal 2024; 14:5444-5457. [PMID: 38601784 PMCID: PMC11003420 DOI: 10.1021/acscatal.4c00439] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2024] [Revised: 03/05/2024] [Accepted: 03/13/2024] [Indexed: 04/12/2024]
Abstract
The oxidation of polyunsaturated fatty acids by lipoxygenases (LOXs) is initiated by a C-H cleavage step in which the hydrogen atom is transferred quantum mechanically (i.e., via tunneling). In these reactions, protein thermal motions facilitate the conversion of ground-state enzyme-substrate complexes to tunneling-ready configurations and are thus important for transferring energy from the solvent to the active site for the activation of catalysis. In this report, we employed temperature-dependent hydrogen-deuterium exchange mass spectrometry (TDHDX-MS) to identify catalytically linked, thermally activated peptides in a representative animal LOX, human epithelial 15-LOX-2. TDHDX-MS of wild-type 15-LOX-2 was compared to two active site mutations that retain structural stability but have increased activation energies (Ea) of catalysis. The Ea value of one variant, V427L, is implicated to arise from suboptimal substrate positioning by increased active-site side chain rotamer dynamics, as determined by X-ray crystallography and ensemble refinement. The resolved thermal network from the comparative Eas of TDHDX-MS between wild-type and V426A is localized along the front face of the 15-LOX-2 catalytic domain. The network contains a clustering of isoleucine, leucine, and valine side chains within the helical peptides. This thermal network of 15-LOX-2 is different in location, area, and backbone structure compared to a model plant lipoxygenase from soybean that exhibits a low Ea value of catalysis compared to the human ortholog. The presented data provide insights into the divergence of thermally activated protein motions in plant and animal LOXs and their relationships to the enthalpic barriers for facilitating hydrogen tunneling.
Collapse
Affiliation(s)
- Amanda Ohler
- Department
of Chemistry, East Carolina University, Greenville, North Carolina 27858, United States
| | - Paris E. Taylor
- Department
of Biological Sciences, Louisiana State
University, Baton
Rouge, Louisiana 70803, United States
| | - Jasmine A. Bledsoe
- Department
of Biological Sciences, Louisiana State
University, Baton
Rouge, Louisiana 70803, United States
| | - Anthony T. Iavarone
- QB3/Chemistry
Mass Spectrometry Facility, University of
California, Berkeley, Berkeley, California 94720, United States
| | - Nathaniel C. Gilbert
- Department
of Biological Sciences, Louisiana State
University, Baton
Rouge, Louisiana 70803, United States
| | - Adam R. Offenbacher
- Department
of Chemistry, East Carolina University, Greenville, North Carolina 27858, United States
| |
Collapse
|
6
|
Klyshko E, Sung-Ho Kim J, McGough L, Valeeva V, Lee E, Ranganathan R, Rauscher S. Functional Protein Dynamics in a Crystal. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.07.06.548023. [PMID: 37461732 PMCID: PMC10350071 DOI: 10.1101/2023.07.06.548023] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 07/27/2023]
Abstract
Proteins are molecular machines and to understand how they work, we need to understand how they move. New pump-probe time-resolved X-ray diffraction methods open up ways to initiate and observe protein motions with atomistic detail in crystals on biologically relevant timescales. However, practical limitations of these experiments demands parallel development of effective molecular dynamics approaches to accelerate progress and extract meaning. Here, we establish robust and accurate methods for simulating dynamics in protein crystals, a nontrivial process requiring careful attention to equilibration, environmental composition, and choice of force fields. With more than seven milliseconds of sampling of a single chain, we identify critical factors controlling agreement between simulation and experiments and show that simulated motions recapitulate ligand-induced conformational changes. This work enables a virtuous cycle between simulation and experiments for visualizing and understanding the basic functional motions of proteins.
Collapse
Affiliation(s)
- Eugene Klyshko
- Department of Physics, University of Toronto, Toronto, ON, Canada
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Justin Sung-Ho Kim
- Department of Physics, University of Toronto, Toronto, ON, Canada
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Lauren McGough
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, USA
| | - Victoria Valeeva
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
| | - Ethan Lee
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| | - Rama Ranganathan
- Center for Physics of Evolving Systems and Department of Biochemistry and Molecular Biology, University of Chicago, Chicago, IL, USA
- Pritzker School of Molecular Engineering, University of Chicago, Chicago, IL, USA
| | - Sarah Rauscher
- Department of Physics, University of Toronto, Toronto, ON, Canada
- Department of Chemical and Physical Sciences, University of Toronto Mississauga, Mississauga, ON, Canada
- Department of Chemistry, University of Toronto, Toronto, ON, Canada
| |
Collapse
|
7
|
Mabesoone MFJ, Leopold-Messer S, Minas HA, Chepkirui C, Chawengrum P, Reiter S, Meoded RA, Wolf S, Genz F, Magnus N, Piechulla B, Walker AS, Piel J. Evolution-guided engineering of trans-acyltransferase polyketide synthases. Science 2024; 383:1312-1317. [PMID: 38513027 DOI: 10.1126/science.adj7621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 02/13/2024] [Indexed: 03/23/2024]
Abstract
Bacterial multimodular polyketide synthases (PKSs) are giant enzymes that generate a wide range of therapeutically important but synthetically challenging natural products. Diversification of polyketide structures can be achieved by engineering these enzymes. However, notwithstanding successes made with textbook cis-acyltransferase (cis-AT) PKSs, tailoring such large assembly lines remains challenging. Unlike textbook PKSs, trans-AT PKSs feature an extraordinary diversity of PKS modules and commonly evolve to form hybrid PKSs. In this study, we analyzed amino acid coevolution to identify a common module site that yields functional PKSs. We used this site to insert and delete diverse PKS parts and create 22 engineered trans-AT PKSs from various pathways and in two bacterial producers. The high success rates of our engineering approach highlight the broader applicability to generate complex designer polyketides.
Collapse
Affiliation(s)
- Mathijs F J Mabesoone
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Stefan Leopold-Messer
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Hannah A Minas
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Clara Chepkirui
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Pornsuda Chawengrum
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
- Chemical Biology Program, Chulabhorn Graduate Institute, Chulabhorn Royal Academy, Bangkok 10210, Thailand
| | - Silke Reiter
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Roy A Meoded
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Sarah Wolf
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Ferdinand Genz
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| | - Nancy Magnus
- Institute for Biological Sciences, University of Rostock, Albert-Einstein-Straße 3, 18059 Rostock, Germany
| | - Birgit Piechulla
- Institute for Biological Sciences, University of Rostock, Albert-Einstein-Straße 3, 18059 Rostock, Germany
| | - Allison S Walker
- Department of Biological Chemistry and Molecular Pharmacology, Harvard Medical School, 240 Longwood Avenue, Boston, MA 02115, USA
- Department of Chemistry, Vanderbilt University, 1234 Stevenson Center Lane, Nashville, TN 37240, USA
- Department of Biological Sciences, Vanderbilt University, 465 21st Avenue S, Nashville, TN 37232, USA
| | - Jörn Piel
- Institute of Microbiology, Eidgenössische Technische Hochschule (ETH) Zürich, Vladimir-Prelog-Weg 4, 8093 Zürich, Switzerland
| |
Collapse
|
8
|
Wang X, Li A, Li X, Cui H. Empowering Protein Engineering through Recombination of Beneficial Substitutions. Chemistry 2024; 30:e202303889. [PMID: 38288640 DOI: 10.1002/chem.202303889] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2024] [Indexed: 02/24/2024]
Abstract
Directed evolution stands as a seminal technology for generating novel protein functionalities, a cornerstone in biocatalysis, metabolic engineering, and synthetic biology. Today, with the development of various mutagenesis methods and advanced analytical machines, the challenge of diversity generation and high-throughput screening platforms is largely solved, and one of the remaining challenges is: how to empower the potential of single beneficial substitutions with recombination to achieve the epistatic effect. This review overviews experimental and computer-assisted recombination methods in protein engineering campaigns. In addition, integrated and machine learning-guided strategies were highlighted to discuss how these recombination approaches contribute to generating the screening library with better diversity, coverage, and size. A decision tree was finally summarized to guide the further selection of proper recombination strategies in practice, which was beneficial for accelerating protein engineering.
Collapse
Affiliation(s)
- Xinyue Wang
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Anni Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Xiujuan Li
- School of Food Science and Pharmaceutical Engineering, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| | - Haiyang Cui
- School of Life Sciences, Nanjing Normal University, No. 2 Xuelin Road, Nanjing, 210097, China
| |
Collapse
|
9
|
Swint-Kruse L, Fenton AW. Rheostats, toggles, and neutrals, Oh my! A new framework for understanding how amino acid changes modulate protein function. J Biol Chem 2024; 300:105736. [PMID: 38336297 PMCID: PMC10914490 DOI: 10.1016/j.jbc.2024.105736] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2023] [Revised: 01/09/2024] [Accepted: 01/25/2024] [Indexed: 02/12/2024] Open
Abstract
Advances in personalized medicine and protein engineering require accurately predicting outcomes of amino acid substitutions. Many algorithms correctly predict that evolutionarily-conserved positions show "toggle" substitution phenotypes, which is defined when a few substitutions at that position retain function. In contrast, predictions often fail for substitutions at the less-studied "rheostat" positions, which are defined when different amino acid substitutions at a position sample at least half of the possible functional range. This review describes efforts to understand the impact and significance of rheostat positions: (1) They have been observed in globular soluble, integral membrane, and intrinsically disordered proteins; within single proteins, their prevalence can be up to 40%. (2) Substitutions at rheostat positions can have biological consequences and ∼10% of substitutions gain function. (3) Although both rheostat and "neutral" (defined when all substitutions exhibit wild-type function) positions are nonconserved, the two classes have different evolutionary signatures. (4) Some rheostat positions have pleiotropic effects on function, simultaneously modulating multiple parameters (e.g., altering both affinity and allosteric coupling). (5) In structural studies, substitutions at rheostat positions appear to cause only local perturbations; the overall conformations appear unchanged. (6) Measured functional changes show promising correlations with predicted changes in protein dynamics; the emergent properties of predicted, dynamically coupled amino acid networks might explain some of the complex functional outcomes observed when substituting rheostat positions. Overall, rheostat positions provide unique opportunities for using single substitutions to tune protein function. Future studies of these positions will yield important insights into the protein sequence/function relationship.
Collapse
Affiliation(s)
- Liskin Swint-Kruse
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA.
| | - Aron W Fenton
- Department of Biochemistry and Molecular Biology, The University of Kansas Medical Center, Kansas City, Kansas, USA
| |
Collapse
|
10
|
Nam K, Shao Y, Major DT, Wolf-Watz M. Perspectives on Computational Enzyme Modeling: From Mechanisms to Design and Drug Development. ACS OMEGA 2024; 9:7393-7412. [PMID: 38405524 PMCID: PMC10883025 DOI: 10.1021/acsomega.3c09084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Revised: 01/15/2024] [Accepted: 01/19/2024] [Indexed: 02/27/2024]
Abstract
Understanding enzyme mechanisms is essential for unraveling the complex molecular machinery of life. In this review, we survey the field of computational enzymology, highlighting key principles governing enzyme mechanisms and discussing ongoing challenges and promising advances. Over the years, computer simulations have become indispensable in the study of enzyme mechanisms, with the integration of experimental and computational exploration now established as a holistic approach to gain deep insights into enzymatic catalysis. Numerous studies have demonstrated the power of computer simulations in characterizing reaction pathways, transition states, substrate selectivity, product distribution, and dynamic conformational changes for various enzymes. Nevertheless, significant challenges remain in investigating the mechanisms of complex multistep reactions, large-scale conformational changes, and allosteric regulation. Beyond mechanistic studies, computational enzyme modeling has emerged as an essential tool for computer-aided enzyme design and the rational discovery of covalent drugs for targeted therapies. Overall, enzyme design/engineering and covalent drug development can greatly benefit from our understanding of the detailed mechanisms of enzymes, such as protein dynamics, entropy contributions, and allostery, as revealed by computational studies. Such a convergence of different research approaches is expected to continue, creating synergies in enzyme research. This review, by outlining the ever-expanding field of enzyme research, aims to provide guidance for future research directions and facilitate new developments in this important and evolving field.
Collapse
Affiliation(s)
- Kwangho Nam
- Department
of Chemistry and Biochemistry, University
of Texas at Arlington, Arlington, Texas 76019, United States
| | - Yihan Shao
- Department
of Chemistry and Biochemistry, University
of Oklahoma, Norman, Oklahoma 73019-5251, United States
| | - Dan T. Major
- Department
of Chemistry and Institute for Nanotechnology & Advanced Materials, Bar-Ilan University, Ramat-Gan 52900, Israel
| | | |
Collapse
|
11
|
Liu Z, Gillis T, Raman S, Cui Q. A parametrized two-domain thermodynamic model explains diverse mutational effects on protein allostery. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2023.08.06.552196. [PMID: 37662419 PMCID: PMC10473640 DOI: 10.1101/2023.08.06.552196] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 09/05/2023]
Abstract
New experimental findings continue to challenge our understanding of protein allostery. Recent deep mutational scanning study showed that allosteric hotspots in the tetracycline repressor (TetR) and its homologous transcriptional factors are broadly distributed rather than spanning well-defined structural pathways as often assumed. Moreover, hotspot mutation-induced allostery loss was rescued by distributed additional mutations in a degenerate fashion. Here, we develop a two-domain thermodynamic model for TetR, which readily rationalizes these intriguing observations. The model accurately captures the in vivo activities of various mutants with changes in physically transparent parameters, allowing the data-based quantification of mutational effects using statistical inference. Our analysis reveals the intrinsic connection of intra- and inter-domain properties for allosteric regulation and illustrate epistatic interactions that are consistent with structural features of the protein. The insights gained from this study into the nature of two-domain allostery are expected to have broader implications for other multidomain allosteric proteins.
Collapse
Affiliation(s)
- Zhuang Liu
- Department of Physics, Boston University, Boston, United States
| | - Thomas Gillis
- Department of Biochemistry, University of Wisconsin, Madison, United States
| | - Srivatsan Raman
- Department of Biochemistry, University of Wisconsin, Madison, United States
- Department of Chemistry, University of Wisconsin, Madison, United States
- Department of Bacteriology, University of Wisconsin, Madison, United States
| | - Qiang Cui
- Department of Physics, Boston University, Boston, United States
- Department of Chemistry, Boston University, Boston, United States
| |
Collapse
|
12
|
Demirtaş K, Erman B, Haliloğlu T. Dynamic correlations: exact and approximate methods for mutual information. Bioinformatics 2024; 40:btae076. [PMID: 38341647 PMCID: PMC10898342 DOI: 10.1093/bioinformatics/btae076] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 01/17/2024] [Accepted: 02/08/2024] [Indexed: 02/12/2024] Open
Abstract
MOTIVATION Proteins are dynamic entities that undergo conformational changes critical for their functions. Understanding the communication pathways and information transfer within proteins is crucial for elucidating allosteric interactions in their mechanisms. This study utilizes mutual information (MI) analysis to probe dynamic allostery. Using two cases, Ubiquitin and PLpro, we have evaluated the accuracy and limitations of different approximations including the exact anisotropic and isotropic models, multivariate Gaussian model, isotropic Gaussian model, and the Gaussian Network Model (GNM) in revealing allosteric interactions. RESULTS Our findings emphasize the required trajectory length for capturing accurate mutual information profiles. Long molecular dynamics trajectories, 1 ms for Ubiquitin and 100 µs for PLpro are used as benchmarks, assuming they represent the ground truth. Trajectory lengths of approximately 5 µs for Ubiquitin and 1 µs for PLpro marked the onset of convergence, while the multivariate Gaussian model accurately captured mutual information with trajectories of 5 ns for Ubiquitin and 350 ns for PLpro. However, the isotropic Gaussian model is less successful in representing the anisotropic nature of protein dynamics, particularly in the case of PLpro, highlighting its limitations. The GNM, however, provides reasonable approximations of long-range information exchange as a minimalist network model based on a single crystal structure. Overall, the optimum trajectory lengths for effective Gaussian approximations of long-time dynamic behavior depend on the inherent dynamics within the protein's topology. The GNM, by showcasing dynamics across relatively diverse time scales, can be used either as a standalone method or to gauge the adequacy of MD simulation lengths. AVAILABILITY AND IMPLEMENTATION Mutual information codes are available at https://github.com/kemaldemirtas/prc-MI.git.
Collapse
Affiliation(s)
- Kemal Demirtaş
- Department of Chemical Engineering, Bogazici University, 34342 Istanbul, Turkey
- Polymer Research Center, Bogazici University, 34342 Istanbul, Turkey
| | - Burak Erman
- Department of Chemical and Biological Engineering, Koc University, 34450 Istanbul, Turkey
| | - Türkan Haliloğlu
- Department of Chemical Engineering, Bogazici University, 34342 Istanbul, Turkey
- Polymer Research Center, Bogazici University, 34342 Istanbul, Turkey
| |
Collapse
|
13
|
Wu N, Barahona M, Yaliraki SN. Allosteric communication and signal transduction in proteins. Curr Opin Struct Biol 2024; 84:102737. [PMID: 38171189 DOI: 10.1016/j.sbi.2023.102737] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 11/06/2023] [Accepted: 11/07/2023] [Indexed: 01/05/2024]
Abstract
Allostery is one of the cornerstones of biological function, as it plays a fundamental role in regulating protein activity. The modelling of allostery has gradually moved from a conformation-based framework, linked to structural changes, to dynamics-based allostery, whereby the effects of ligand binding propagate via signal transduction from the allosteric site to other regions of the protein via inter-residue interactions. Characterising such allosteric signalling pathways, which do not necessarily lead to conformational changes, has been pursued experimentally and complemented by computational analysis of protein networks to detect subtle dynamic propagation paths. Considering allostery from the perspective of signal transduction broadens the understanding of allosteric mechanisms, underscores the importance of protein topology, and can provide insights into allosteric drug design.
Collapse
Affiliation(s)
- Nan Wu
- Department of Chemistry, Imperial College London, United Kingdom
| | - Mauricio Barahona
- Department of Mathematics, Imperial College London, United Kingdom. https://twitter.com/@CMPHImperial
| | | |
Collapse
|
14
|
Castelli M, Marchetti F, Osuna S, F. Oliveira AS, Mulholland AJ, Serapian SA, Colombo G. Decrypting Allostery in Membrane-Bound K-Ras4B Using Complementary In Silico Approaches Based on Unbiased Molecular Dynamics Simulations. J Am Chem Soc 2024; 146:901-919. [PMID: 38116743 PMCID: PMC10785808 DOI: 10.1021/jacs.3c11396] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2023] [Revised: 12/06/2023] [Accepted: 12/07/2023] [Indexed: 12/21/2023]
Abstract
Protein functions are dynamically regulated by allostery, which enables conformational communication even between faraway residues, and expresses itself in many forms, akin to different "languages": allosteric control pathways predominating in an unperturbed protein are often unintuitively reshaped whenever biochemical perturbations arise (e.g., mutations). To accurately model allostery, unbiased molecular dynamics (MD) simulations require integration with a reliable method able to, e.g., detect incipient allosteric changes or likely perturbation pathways; this is because allostery can operate at longer time scales than those accessible by plain MD. Such methods are typically applied singularly, but we here argue their joint application─as a "multilingual" approach─could work significantly better. We successfully prove this through unbiased MD simulations (∼100 μs) of the widely studied, allosterically active oncotarget K-Ras4B, solvated and embedded in a phospholipid membrane, from which we decrypt allostery using four showcase "languages": Distance Fluctuation analysis and the Shortest Path Map capture allosteric hotspots at equilibrium; Anisotropic Thermal Diffusion and Dynamical Non-Equilibrium MD simulations assess perturbations upon, respectively, either superheating or hydrolyzing the GTP that oncogenically activates K-Ras4B. Chosen "languages" work synergistically, providing an articulate, mutually coherent, experimentally consistent picture of K-Ras4B allostery, whereby distinct traits emerge at equilibrium and upon GTP cleavage. At equilibrium, combined evidence confirms prominent allosteric communication from the membrane-embedded hypervariable region, through a hub comprising helix α5 and sheet β5, and up to the active site, encompassing allosteric "switches" I and II (marginally), and two proposed pockets. Upon GTP cleavage, allosteric perturbations mostly accumulate on the switches and documented interfaces.
Collapse
Affiliation(s)
- Matteo Castelli
- Department
of Chemistry, University of Pavia, viale T. Taramelli 12, 27100 Pavia, Italy
| | - Filippo Marchetti
- Department
of Chemistry, University of Pavia, viale T. Taramelli 12, 27100 Pavia, Italy
- INSTM, via G. Giusti 9, 50121 Florence, Italy
- E4
Computer Engineering, via Martiri delle libertà 66, 42019 Scandiano (RE), Italy
| | - Sílvia Osuna
- Institut
de Química Computacional i Catàlisi (IQCC) and Departament
de Química, Universitat de Girona, Girona, Catalonia E-17071, Spain
- ICREA, Barcelona, Catalonia E-08010, Spain
| | - A. Sofia F. Oliveira
- Centre for
Computational Chemistry, School of Chemistry, University of Bristol, Bristol BS8 1TS, U.K.
| | - Adrian J. Mulholland
- Centre for
Computational Chemistry, School of Chemistry, University of Bristol, Bristol BS8 1TS, U.K.
| | - Stefano A. Serapian
- Department
of Chemistry, University of Pavia, viale T. Taramelli 12, 27100 Pavia, Italy
| | - Giorgio Colombo
- Department
of Chemistry, University of Pavia, viale T. Taramelli 12, 27100 Pavia, Italy
| |
Collapse
|
15
|
Chon NL, Tran S, Miller CS, Lin H, Knight JD. A conserved electrostatic membrane-binding surface in synaptotagmin-like proteins revealed using molecular phylogenetic analysis and homology modeling. Protein Sci 2024; 33:e4850. [PMID: 38038838 PMCID: PMC10731544 DOI: 10.1002/pro.4850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Revised: 10/29/2023] [Accepted: 11/28/2023] [Indexed: 12/02/2023]
Abstract
Protein structure prediction has emerged as a core technology for understanding biomolecules and their interactions. Here, we combine homology-based structure prediction with molecular phylogenetic analysis to study the evolution of electrostatic membrane binding among the vertebrate synaptotagmin-like protein (Slp) family. Slp family proteins play key roles in the membrane trafficking of large dense-core secretory vesicles. Our previous experimental and computational study found that the C2A domain of Slp-4 (also called granuphilin) binds with high affinity to anionic phospholipids in the cytoplasmic leaflet of the plasma membrane through a large positively charged protein surface centered on a cluster of phosphoinositide-binding lysine residues. Because this surface contributes greatly to Slp-4 C2A domain membrane binding, we hypothesized that the net charge on the surface might be evolutionarily conserved. To test this hypothesis, the known C2A sequences of Slp-4 among vertebrates were organized by class (from mammalia to pisces) using molecular phylogenetic analysis. Consensus sequences for each class were then identified and used to generate homology structures, from which Poisson-Boltzmann electrostatic potentials were calculated. For comparison, homology structures and electrostatic potentials were also calculated for the five human Slp protein family members. The results demonstrate that the charge on the membrane-binding surface is highly conserved throughout the evolution of Slp-4, and more highly conserved than many individual residues among the human Slp family paralogs. Such molecular phylogenetic-driven computational analysis can help to describe the evolution of electrostatic interactions between proteins and membranes which are crucial for their function.
Collapse
Affiliation(s)
- Nara L. Chon
- Department of ChemistryUniversity of Colorado DenverDenverColoradoUSA
| | - Sherleen Tran
- Department of ChemistryUniversity of Colorado DenverDenverColoradoUSA
| | | | - Hai Lin
- Department of ChemistryUniversity of Colorado DenverDenverColoradoUSA
| | | |
Collapse
|
16
|
Wayment-Steele HK, Ojoawo A, Otten R, Apitz JM, Pitsawong W, Hömberger M, Ovchinnikov S, Colwell L, Kern D. Predicting multiple conformations via sequence clustering and AlphaFold2. Nature 2024; 625:832-839. [PMID: 37956700 PMCID: PMC10808063 DOI: 10.1038/s41586-023-06832-9] [Citation(s) in RCA: 17] [Impact Index Per Article: 17.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2023] [Accepted: 11/03/2023] [Indexed: 11/15/2023]
Abstract
AlphaFold2 (ref. 1) has revolutionized structural biology by accurately predicting single structures of proteins. However, a protein's biological function often depends on multiple conformational substates2, and disease-causing point mutations often cause population changes within these substates3,4. We demonstrate that clustering a multiple-sequence alignment by sequence similarity enables AlphaFold2 to sample alternative states of known metamorphic proteins with high confidence. Using this method, named AF-Cluster, we investigated the evolutionary distribution of predicted structures for the metamorphic protein KaiB5 and found that predictions of both conformations were distributed in clusters across the KaiB family. We used nuclear magnetic resonance spectroscopy to confirm an AF-Cluster prediction: a cyanobacteria KaiB variant is stabilized in the opposite state compared with the more widely studied variant. To test AF-Cluster's sensitivity to point mutations, we designed and experimentally verified a set of three mutations predicted to flip KaiB from Rhodobacter sphaeroides from the ground to the fold-switched state. Finally, screening for alternative states in protein families without known fold switching identified a putative alternative state for the oxidoreductase Mpt53 in Mycobacterium tuberculosis. Further development of such bioinformatic methods in tandem with experiments will probably have a considerable impact on predicting protein energy landscapes, essential for illuminating biological function.
Collapse
Affiliation(s)
- Hannah K Wayment-Steele
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Adedolapo Ojoawo
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Renee Otten
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Treeline Biosciences, Watertown, MA, USA
| | - Julia M Apitz
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
| | - Warintra Pitsawong
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Biomolecular Discovery, Relay Therapeutics, Cambridge, MA, USA
| | - Marc Hömberger
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA
- Treeline Biosciences, Watertown, MA, USA
| | | | - Lucy Colwell
- Google Research, Cambridge, MA, USA
- Cambridge University, Cambridge, UK
| | - Dorothee Kern
- Department of Biochemistry, Brandeis University and Howard Hughes Medical Institute, Waltham, MA, USA.
| |
Collapse
|
17
|
Knight KM, Obarow EG, Wei W, Mani S, Esteller MI, Cui M, Ma N, Martin SA, Brinson E, Hewitt N, Soden GM, Logothetis DE, Vaidehi N, Dohlman HG. Molecular annotation of G protein variants in a neurological disorder. Cell Rep 2023; 42:113462. [PMID: 37980565 PMCID: PMC10872635 DOI: 10.1016/j.celrep.2023.113462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 10/04/2023] [Accepted: 11/02/2023] [Indexed: 11/21/2023] Open
Abstract
Heterotrimeric G proteins transduce extracellular chemical messages to generate appropriate intracellular responses. Point mutations in GNAO1, encoding the G protein αo subunit, have been implicated in a pathogenic condition characterized by seizures, movement disorders, intellectual disability, and developmental delay (GNAO1 disorder). However, the effects of these mutations on G protein structure and function are unclear. Here, we report the effects of 55 mutations on Gαo conformation, thermostability, nucleotide binding, and hydrolysis, as well as interaction with Gβγ subunits, receptors, and effectors. Our effort reveals four functionally distinct groups of mutants, including one group that sequesters receptors and another that sequesters Gβγ, both acting in a genetically dominant manner. These findings provide a more comprehensive understanding of disease-relevant mutations and reveal that GNAO1 disorder is likely composed of multiple mechanistically distinct disorders that will likely require multiple therapeutic strategies.
Collapse
Affiliation(s)
- Kevin M Knight
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Elizabeth G Obarow
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Wenyuan Wei
- Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| | - Sepehr Mani
- Department of Pharmaceutical Sciences, Northeastern University, Boston, MA 02115, USA
| | - Maria I Esteller
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Meng Cui
- Department of Pharmaceutical Sciences, Northeastern University, Boston, MA 02115, USA
| | - Ning Ma
- Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA
| | - Sarah A Martin
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Emily Brinson
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Natalie Hewitt
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Gaby M Soden
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA
| | - Diomedes E Logothetis
- Department of Pharmaceutical Sciences, Northeastern University, Boston, MA 02115, USA.
| | - Nagarajan Vaidehi
- Department of Computational and Quantitative Medicine, Beckman Research Institute of the City of Hope, Duarte, CA 91010, USA.
| | - Henrik G Dohlman
- Department of Pharmacology, University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA.
| |
Collapse
|
18
|
Musil M, Jezik A, Horackova J, Borko S, Kabourek P, Damborsky J, Bednar D. FireProt 2.0: web-based platform for the fully automated design of thermostable proteins. Brief Bioinform 2023; 25:bbad425. [PMID: 38018911 PMCID: PMC10685400 DOI: 10.1093/bib/bbad425] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Revised: 10/25/2023] [Accepted: 11/01/2023] [Indexed: 11/30/2023] Open
Abstract
Thermostable proteins find their use in numerous biomedical and biotechnological applications. However, the computational design of stable proteins often results in single-point mutations with a limited effect on protein stability. However, the construction of stable multiple-point mutants can prove difficult due to the possibility of antagonistic effects between individual mutations. FireProt protocol enables the automated computational design of highly stable multiple-point mutants. FireProt 2.0 builds on top of the previously published FireProt web, retaining the original functionality and expanding it with several new stabilization strategies. FireProt 2.0 integrates the AlphaFold database and the homology modeling for structure prediction, enabling calculations starting from a sequence. Multiple-point designs are constructed using the Bron-Kerbosch algorithm minimizing the antagonistic effect between the individual mutations. Users can newly limit the FireProt calculation to a set of user-defined mutations, run a saturation mutagenesis of the whole protein or select rigidifying mutations based on B-factors. Evolution-based back-to-consensus strategy is complemented by ancestral sequence reconstruction. FireProt 2.0 is significantly faster and a reworked graphical user interface broadens the tool's availability even to users with older hardware. FireProt 2.0 is freely available at http://loschmidt.chemi.muni.cz/fireprotweb.
Collapse
Affiliation(s)
- Milos Musil
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Andrej Jezik
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
| | - Jana Horackova
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
| | - Simeon Borko
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- Department of Information Systems, Faculty of Information Technology, Brno University of Technology, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Petr Kabourek
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - Jiri Damborsky
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| | - David Bednar
- Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Masaryk University, Brno, Czech Republic
- International Clinical Research Centre, St. Anne’s University Hospital Brno, Brno, Czech Republic
| |
Collapse
|
19
|
Fongang B, Wadop YN, Zhu Y, Wagner EJ, Kudlicki A, Rowicka M. Coevolution combined with molecular dynamics simulations provides structural and mechanistic insights into the interactions between the integrator complex subunits. Comput Struct Biotechnol J 2023; 21:5686-5697. [PMID: 38074468 PMCID: PMC10700540 DOI: 10.1016/j.csbj.2023.11.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2023] [Revised: 11/10/2023] [Accepted: 11/10/2023] [Indexed: 01/18/2024] Open
Abstract
Finding the 3D structure of large, multi-subunit complexes is difficult, despite recent advances in cryo-EM technology, due to remaining challenges to expressing and purifying subunits. Computational approaches that predict protein-protein interactions, including Direct Coupling Analysis (DCA), represent an attractive alternative for dissecting interactions within protein complexes. However, they are readily applicable only to small proteins due to high computational complexity and a high number of false positives. To solve this problem, we proposed a modified DCA approach, a powerful tool to predict the most likely interfaces of protein complexes. Since our modified approach cannot provide structural and mechanistic details of interacting peptides, we combine it with Molecular Dynamics (MD) simulations. To illustrate this novel approach, we predict interacting domains and structural details of interactions of two Integrator complex subunits, INTS9 and INTS11. Our predictions of interacting residues of INTS9/INTS11 are highly consistent with crystallographic structure. We then expand our procedure to two complexes whose structures are not well-studied: 1) The heterodimer formed by the Cleavage and Polyadenylation Specificity Factor 100-kD (CPSF100) and 73-kD (CPSF73); 2) The heterotrimer formed by INTS4/INTS9/INTS11. Experimental data supports our predictions of interactions within these two complexes, demonstrating that combining DCA and MD simulations is a powerful approach to revealing structural insights of large protein complexes.
Collapse
Affiliation(s)
- Bernard Fongang
- Glenn Biggs Institute for Alzheimer's & Neurodegenerative Diseases, The University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
- Department of Biochemistry and Structural Biology, The University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
- Department of Population Health Sciences, The University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
| | - Yannick N. Wadop
- Glenn Biggs Institute for Alzheimer's & Neurodegenerative Diseases, The University of Texas Health Science Center at San Antonio, San Antonio, TX, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
| | - Yingjie Zhu
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
| | - Eric J. Wagner
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX, United States
- Department of Biochemistry and Biophysics, The University of Rochester Medical Center, Rochester, NY, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
| | - Andrzej Kudlicki
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
- Informatics Service Center, The University of Texas Medical Branch, Galveston, TX, United States
| | - Maga Rowicka
- Department of Biochemistry and Molecular Biology, The University of Texas Medical Branch, Galveston, TX, United States
- Institute for Translational Sciences, The University of Texas Medical Branch, Galveston, TX, United States
| |
Collapse
|
20
|
Rix G, Williams RL, Spinner H, Hu VJ, Marks DS, Liu CC. Continuous evolution of user-defined genes at 1-million-times the genomic mutation rate. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.11.13.566922. [PMID: 38014077 PMCID: PMC10680746 DOI: 10.1101/2023.11.13.566922] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2023]
Abstract
When nature maintains or evolves a gene's function over millions of years at scale, it produces a diversity of homologous sequences whose patterns of conservation and change contain rich structural, functional, and historical information about the gene. However, natural gene diversity likely excludes vast regions of functional sequence space and includes phylogenetic and evolutionary eccentricities, limiting what information we can extract. We introduce an accessible experimental approach for compressing long-term gene evolution to laboratory timescales, allowing for the direct observation of extensive adaptation and divergence followed by inference of structural, functional, and environmental constraints for any selectable gene. To enable this approach, we developed a new orthogonal DNA replication (OrthoRep) system that durably hypermutates chosen genes at a rate of >10 -4 substitutions per base in vivo . When OrthoRep was used to evolve a conditionally essential maladapted enzyme, we obtained thousands of unique multi-mutation sequences with many pairs >60 amino acids apart (>15% divergence), revealing known and new factors influencing enzyme adaptation. The fitness of evolved sequences was not predictable by advanced machine learning models trained on natural variation. We suggest that OrthoRep supports the prospective and systematic discovery of constraints shaping gene evolution, uncovering of new regions in fitness landscapes, and general applications in biomolecular engineering.
Collapse
|
21
|
Godbole SS, Dokholyan NV. Allosteric regulation of kinase activity in living cells. eLife 2023; 12:RP90574. [PMID: 37943025 PMCID: PMC10635643 DOI: 10.7554/elife.90574] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2023] Open
Abstract
The dysregulation of protein kinases is associated with multiple diseases due to the kinases' involvement in a variety of cell signaling pathways. Manipulating protein kinase function, by controlling the active site, is a promising therapeutic and investigative strategy to mitigate and study diseases. Kinase active sites share structural similarities, making it difficult to specifically target one kinase, and allosteric control allows specific regulation and study of kinase function without directly targeting the active site. Allosteric sites are distal to the active site but coupled via a dynamic network of inter-atomic interactions between residues in the protein. Establishing an allosteric control over a kinase requires understanding the allosteric wiring of the protein. Computational techniques offer effective and inexpensive mapping of the allosteric sites on a protein. Here, we discuss the methods to map and regulate allosteric communications in proteins, and strategies to establish control over kinase functions in live cells and organisms. Protein molecules, or 'sensors,' are engineered to function as tools to control allosteric activity of the protein as these sensors have high spatiotemporal resolution and help in understanding cell phenotypes after immediate activation or inactivation of a kinase. Traditional methods used to study protein functions, such as knockout, knockdown, or mutation, cannot offer a sufficiently high spatiotemporal resolution. We discuss the modern repertoire of tools to regulate protein kinases as we enter a new era in deciphering cellular signaling and developing novel approaches to treat diseases associated with signal dysregulation.
Collapse
Affiliation(s)
| | - Nikolay V Dokholyan
- Department of Pharmacology, Penn State College of MedicineHersheyUnited States
- Department of Biomedical Engineering, Penn State University, University ParkHersheyUnited States
- Department of Engineering Science and Mechanics, Penn State University, University ParkHersheyUnited States
- Department of Biochemistry & Molecular Biology, Penn State College of MedicineHersheyUnited States
- Department of Chemistry, Penn State University, University ParkHersheyUnited States
| |
Collapse
|
22
|
Chon NL, Tran S, Miller CS, Lin H, Knight JD. A Conserved Electrostatic Membrane-Binding Surface in Synaptotagmin-Like Proteins Revealed Using Molecular Phylogenetic Analysis and Homology Modeling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.13.548768. [PMID: 37502952 PMCID: PMC10369986 DOI: 10.1101/2023.07.13.548768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
Protein structure prediction has emerged as a core technology for understanding biomolecules and their interactions. Here, we combine homology-based structure prediction with molecular phylogenetic analysis to study the evolution of electrostatic membrane binding among vertebrate synaptotagmin-like proteins (Slps). Slp family proteins play key roles in the membrane trafficking of large dense-core secretory vesicles. Our previous experimental and computational study found that the C2A domain of Slp-4 (also called granuphilin) binds with high affinity to anionic phospholipids in the cytoplasmic leaflet of the plasma membrane through a large positively charged protein surface centered on a cluster of phosphoinositide-binding lysine residues. Because this surface contributes greatly to Slp-4 C2A domain membrane binding, we hypothesized that the net charge on the surface might be evolutionarily conserved. To test this hypothesis, the known C2A sequences of Slp-4 among vertebrates were organized by class (from mammalia to pisces) using molecular phylogenetic analysis. Consensus sequences for each class were then identified and used to generate homology structures, from which Poisson-Boltzmann electrostatic potentials were calculated. For comparison, homology structures and electrostatic potentials were also calculated for the five human Slp protein family members. The results demonstrate that the charge on the membrane-binding surface is highly conserved throughout the evolution of Slp-4, and more highly conserved than many individual residues among the human Slp family paralogs. Such molecular phylogenetic-driven computational analysis can help to describe the evolution of electrostatic interactions between proteins and membranes which are crucial for their function.
Collapse
Affiliation(s)
- Nara L. Chon
- Department of Chemistry, University of Colorado Denver
| | - Sherleen Tran
- Department of Chemistry, University of Colorado Denver
| | | | - Hai Lin
- Department of Chemistry, University of Colorado Denver
| | | |
Collapse
|
23
|
Godbole S, Dokholyan NV. Allosteric regulation of kinase activity in living cells. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.07.19.549709. [PMID: 37503033 PMCID: PMC10370130 DOI: 10.1101/2023.07.19.549709] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/29/2023]
Abstract
The dysregulation of protein kinases is associated with multiple diseases due to the kinases' involvement in a variety of cell signaling pathways. Manipulating protein kinase function, by controlling the active site, is a promising therapeutic and investigative strategy to mitigate and study diseases. Kinase active sites share structural similarities making it difficult to specifically target one kinase, allosteric control allows specific regulation and study of kinase function without directly targeting the active site. Allosteric sites are distal to the active site but coupled via a dynamic network of inter-atomic interactions between residues in the protein. Establishing an allosteric control over a kinase requires understanding the allosteric wiring of the protein. Computational techniques offer effective and inexpensive mapping of the allosteric sites on a protein. Here, we discuss methods to map and regulate allosteric communications in proteins, and strategies to establish control over kinase functions in live cells and organisms. Protein molecules, or "sensors" are engineered to function as tools to control allosteric activity of the protein as these sensors have high spatiotemporal resolution and help in understanding cell phenotypes after immediate activation or inactivation of a kinase. Traditional methods used to study protein functions, such as knockout, knockdown, or mutation, cannot offer a sufficiently high spatiotemporal resolution. We discuss the modern repertoire of tools to regulate protein kinases as we enter a new era in deciphering cellular signaling and developing novel approaches to treat diseases associated with signal dysregulation.
Collapse
Affiliation(s)
- Shivani Godbole
- Department of Pharmacology, Penn State College of Medicine, Hershey, PA 17033-0850, USA
| | - Nikolay V. Dokholyan
- Department of Pharmacology, Penn State College of Medicine, Hershey, PA 17033-0850, USA
- Department of Biomedical Engineering, Penn State University, University Park, PA 16802, USA
- Department of Engineering Science and Mechanics, Penn State University, University Park, PA 16802, USA
- Department of Biochemistry & Molecular Biology, Penn State College of Medicine, Hershey, PA 17033-0850, USA
- Department of Chemistry, Penn State University, University Park, PA 16802, USA
| |
Collapse
|
24
|
Manley LJ, Lin MM. Kinetic and thermodynamic allostery in the Ras protein family. Biophys J 2023; 122:3882-3893. [PMID: 37598291 PMCID: PMC10560677 DOI: 10.1016/j.bpj.2023.08.010] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2023] [Revised: 07/20/2023] [Accepted: 08/14/2023] [Indexed: 08/21/2023] Open
Abstract
Allostery, the transfer of information between distant parts of a macromolecule, is a fundamental feature of protein function and regulation. However, allosteric mechanisms are usually not explained by protein structure, requiring information on correlated fluctuations uniquely accessible to molecular simulation. Existing work to extract allosteric pathways from molecular dynamics simulations has focused on thermodynamic correlations. Here, we show how kinetic correlations encode complementary information essential to explain observed variations in allosteric regulation. We applied kinetic and thermodynamic correlation analysis on atomistic simulations of H, K, and NRas isoforms in the apo, GTP, and GDP-bound states of Ras protein, with and without complexing to its downstream effector, Raf. We show that switch I and switch II are the primary components of thermodynamic and kinetic allosteric networks, consistent with the key roles of these two motifs. These networks connect the switches to an allosteric loop recently discovered from a crystal structure of HRas. This allosteric loop is inactive in KRas, but is coupled to the hydrolysis arm switch II in NRas and HRas. We find that the mechanism in the latter two isoforms are thermodynamic and kinetic, respectively. Binding of Raf-RBD further activates thermodynamic allostery in HRas and KRas but has limited effect on NRas. These results indicate that kinetic and thermodynamic correlations are both needed to explain protein function and allostery. These two distinct channels of allosteric regulation, and their combinatorial variability, may explain how subtle mutational differences can lead to diverse regulatory profiles among enzymatic proteins.
Collapse
Affiliation(s)
- Leigh J Manley
- Green Center for Systems Biology, Lyda Hill Department of Bioinformatics, Department of Biophysics, Center for Alzheimer's and Neurodegenerative Diseases, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Milo M Lin
- Green Center for Systems Biology, Lyda Hill Department of Bioinformatics, Department of Biophysics, Center for Alzheimer's and Neurodegenerative Diseases, University of Texas Southwestern Medical Center, Dallas, Texas.
| |
Collapse
|
25
|
Schafer JW, Porter LL. Evolutionary selection of proteins with two folds. Nat Commun 2023; 14:5478. [PMID: 37673981 PMCID: PMC10482954 DOI: 10.1038/s41467-023-41237-2] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 08/24/2023] [Indexed: 09/08/2023] Open
Abstract
Although most globular proteins fold into a single stable structure, an increasing number have been shown to remodel their secondary and tertiary structures in response to cellular stimuli. State-of-the-art algorithms predict that these fold-switching proteins adopt only one stable structure, missing their functionally critical alternative folds. Why these algorithms predict a single fold is unclear, but all of them infer protein structure from coevolved amino acid pairs. Here, we hypothesize that coevolutionary signatures are being missed. Suspecting that single-fold variants could be masking these signatures, we developed an approach, called Alternative Contact Enhancement (ACE), to search both highly diverse protein superfamilies-composed of single-fold and fold-switching variants-and protein subfamilies with more fold-switching variants. ACE successfully revealed coevolution of amino acid pairs uniquely corresponding to both conformations of 56/56 fold-switching proteins from distinct families. Then, we used ACE-derived contacts to (1) predict two experimentally consistent conformations of a candidate protein with unsolved structure and (2) develop a blind prediction pipeline for fold-switching proteins. The discovery of widespread dual-fold coevolution indicates that fold-switching sequences have been preserved by natural selection, implying that their functionalities provide evolutionary advantage and paving the way for predictions of diverse protein structures from single sequences.
Collapse
Affiliation(s)
- Joseph W Schafer
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA
| | - Lauren L Porter
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD, 20894, USA.
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD, 20892, USA.
| |
Collapse
|
26
|
Jones NH, Kapoor TM. Achieving the promise and avoiding the peril of chemical probes using genetics. Curr Opin Struct Biol 2023; 81:102628. [PMID: 37364429 PMCID: PMC10561518 DOI: 10.1016/j.sbi.2023.102628] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 05/17/2023] [Accepted: 05/17/2023] [Indexed: 06/28/2023]
Abstract
Chemical probes can be valuable tools for studying protein targets, but addressing concerns about a probe's cellular target or its specificity can be challenging. A reliable strategy is to use a mutation that does not alter a target's function but confers resistance (or sensitizes) to the inhibitor in both cellular and biochemical assays. However, challenges remain in finding such mutations. Here, we discuss structure- and cell-based approaches to identify resistance- and sensitivity-conferring mutations. Further, we describe how resistance-conferring mutations can help with compound design, and the use of saturation mutagenesis to characterize a compound binding site. We highlight how genetic approaches can ensure the proper use of chemical inhibitors to pursue mechanistic studies and test therapeutic hypotheses.
Collapse
Affiliation(s)
- Natalie H Jones
- Laboratory of Chemistry and Cell Biology, The Rockefeller University, New York, NY, USA; Tri-Institutional PhD Program in Chemical Biology, New York, NY, USA
| | - Tarun M Kapoor
- Laboratory of Chemistry and Cell Biology, The Rockefeller University, New York, NY, USA.
| |
Collapse
|
27
|
Yang A, Jude KM, Lai B, Minot M, Kocyla AM, Glassman CR, Nishimiya D, Kim YS, Reddy ST, Khan AA, Garcia KC. Deploying synthetic coevolution and machine learning to engineer protein-protein interactions. Science 2023; 381:eadh1720. [PMID: 37499032 PMCID: PMC10403280 DOI: 10.1126/science.adh1720] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Accepted: 06/16/2023] [Indexed: 07/29/2023]
Abstract
Fine-tuning of protein-protein interactions occurs naturally through coevolution, but this process is difficult to recapitulate in the laboratory. We describe a platform for synthetic protein-protein coevolution that can isolate matched pairs of interacting muteins from complex libraries. This large dataset of coevolved complexes drove a systems-level analysis of molecular recognition between Z domain-affibody pairs spanning a wide range of structures, affinities, cross-reactivities, and orthogonalities, and captured a broad spectrum of coevolutionary networks. Furthermore, we harnessed pretrained protein language models to expand, in silico, the amino acid diversity of our coevolution screen, predicting remodeled interfaces beyond the reach of the experimental library. The integration of these approaches provides a means of simulating protein coevolution and generating protein complexes with diverse molecular recognition properties for biotechnology and synthetic biology.
Collapse
Affiliation(s)
- Aerin Yang
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Kevin M. Jude
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Ben Lai
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
| | - Mason Minot
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - Anna M. Kocyla
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Caleb R. Glassman
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Daisuke Nishimiya
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Yoon Seok Kim
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
| | - Sai T. Reddy
- Department of Biosystems Science and Engineering, ETH Zurich, Basel, Switzerland
| | - Aly A. Khan
- Toyota Technological Institute at Chicago, Chicago, IL 60637, USA
- Departments of Pathology, and Family Medicine, University of Chicago, Chicago, IL 60637, USA
| | - K. Christopher Garcia
- Department of Molecular and Cellular Physiology, Stanford University School of Medicine, Stanford, CA 94305, USA
- Howard Hughes Medical Institute, Stanford University School of Medicine, Stanford, CA 94305, USA
- Department of Structural Biology, Stanford University School of Medicine, Stanford, CA 94305, USA
| |
Collapse
|
28
|
Konovalov KA, Wu CG, Qiu Y, Balakrishnan VK, Parihar PS, O’Connor MS, Xing Y, Huang X. Disease mutations and phosphorylation alter the allosteric pathways involved in autoinhibition of protein phosphatase 2A. J Chem Phys 2023; 158:215101. [PMID: 37260014 PMCID: PMC10238128 DOI: 10.1063/5.0150272] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2023] [Accepted: 05/16/2023] [Indexed: 06/02/2023] Open
Abstract
Mutations in protein phosphatase 2A (PP2A) are connected to intellectual disability and cancer. It has been hypothesized that these mutations might disrupt the autoinhibition and phosphorylation-induced activation of PP2A. Since they are located far from both the active and substrate binding sites, it is unclear how they exert their effect. We performed allosteric pathway analysis based on molecular dynamics simulations and combined it with biochemical experiments to investigate the autoinhibition of PP2A. In the wild type (WT), the C-arm of the regulatory subunit B56δ obstructs the active and substrate binding sites exerting a dual autoinhibition effect. We find that the disease mutant, E198K, severely weakens the allosteric pathways that stabilize the C-arm in the WT. Instead, the strongest allosteric pathways in E198K take a different route that promotes exposure of the substrate binding site. To facilitate the allosteric pathway analysis, we introduce a path clustering algorithm for lumping pathways into channels. We reveal remarkable similarities between the allosteric channels of E198K and those in phosphorylation-activated WT, suggesting that the autoinhibition can be alleviated through a conserved mechanism. In contrast, we find that another disease mutant, E200K, which is in spatial proximity of E198, does not repartition the allosteric pathways leading to the substrate binding site; however, it may still induce exposure of the active site. This finding agrees with our biochemical data, allowing us to predict the activity of PP2A with the phosphorylated B56δ and provide insight into how disease mutations in spatial proximity alter the enzymatic activity in surprisingly different mechanisms.
Collapse
Affiliation(s)
- Kirill A. Konovalov
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | | | - Yunrui Qiu
- Department of Chemistry, Theoretical Chemistry Institute, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Vijaya Kumar Balakrishnan
- McArdle Laboratory for Cancer Research, Department of Oncology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Pankaj Singh Parihar
- McArdle Laboratory for Cancer Research, Department of Oncology, School of Medicine and Public Health, University of Wisconsin-Madison, Madison, Wisconsin 53705, USA
| | - Michael S. O’Connor
- Biophysics Graduate Program, University of Wisconsin-Madison, Madison, Wisconsin 53706, USA
| | - Yongna Xing
- Authors to whom correspondence should be addressed: and
| | - Xuhui Huang
- Authors to whom correspondence should be addressed: and
| |
Collapse
|
29
|
Han Z, Wang X, Wu Z, Li C. Study of the Allosteric Mechanism of Human Mitochondrial Phenylalanyl-tRNA Synthetase by Transfer Entropy via an Improved Gaussian Network Model and Co-evolution Analyses. J Phys Chem Lett 2023; 14:3452-3460. [PMID: 37010935 DOI: 10.1021/acs.jpclett.3c00366] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/19/2023]
Abstract
We propose an improved transfer entropy approach called the dynamic version of the force constant fitted Gaussian network model based on molecular dynamics ensemble (dfcfGNMMD) to explore the allosteric mechanism of human mitochondrial phenylalanyl-tRNA synthetase (hmPheRS), one of the aminoacyl-tRNA synthetases that play a crucial role in translation of the genetic code. The dfcfGNMMD method can provide reliable estimates of the transfer entropy and give new insights into the role of the anticodon binding domain in driving the catalytic domain in aminoacylation activity and into the effects of tRNA binding and residue mutation on the enzyme activity, revealing the causal mechanism of the allosteric communication in hmPheRS. In addition, we incorporate the residue dynamic and co-evolutionary information to further investigate the key residues in hmPheRS allostery. This study sheds light on the mechanisms of hmPheRS allostery and can provide important information for related drug design.
Collapse
Affiliation(s)
- Zhongjie Han
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| | - Xiaoli Wang
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| | - Zhixiang Wu
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| | - Chunhua Li
- Faculty of Environmental and Life Sciences, Beijing University of Technology, Beijing 100124, China
| |
Collapse
|
30
|
Duan B, Qiu C, Sze SH, Kaplan C. Widespread epistasis shapes RNA Polymerase II active site function and evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.27.530048. [PMID: 36909581 PMCID: PMC10002619 DOI: 10.1101/2023.02.27.530048] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/04/2023]
Abstract
Multi-subunit RNA Polymerases (msRNAPs) are responsible for transcription in all kingdoms of life. At the heart of these msRNAPs is an ultra-conserved active site domain, the trigger loop (TL), coordinating transcription speed and fidelity by critical conformational changes impacting multiple steps in substrate selection, catalysis, and translocation. Previous studies have observed several different types of genetic interactions between eukaryotic RNA polymerase II (Pol II) TL residues, suggesting that the TL's function is shaped by functional interactions of residues within and around the TL. The extent of these interaction networks and how they control msRNAP function and evolution remain to be determined. Here we have dissected the Pol II TL interaction landscape by deep mutational scanning in Saccharomyces cerevisiae Pol II. Through analysis of over 15000 alleles, representing all single mutants, a rationally designed subset of double mutants, and evolutionarily observed TL haplotypes, we identify interaction networks controlling TL function. Substituting residues creates allele-specific networks and propagates epistatic effects across the Pol II active site. Furthermore, the interaction landscape further distinguishes alleles with similar growth phenotypes, suggesting increased resolution over the previously reported single mutant phenotypic landscape. Finally, co-evolutionary analyses reveal groups of co-evolving residues across Pol II converge onto the active site, where evolutionary constraints interface with pervasive epistasis. Our studies provide a powerful system to understand the plasticity of RNA polymerase mechanism and evolution, and provide the first example of pervasive epistatic landscape in a highly conserved and constrained domain within an essential enzyme.
Collapse
Affiliation(s)
- Bingbing Duan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260
| | - Chenxi Qiu
- Department of Genetics, Harvard Medical School, Boston, MA 02215
| | - Sing-Hoi Sze
- Department of Computer Science and Engineering, Texas A&M University, College Station, TX 77843
- Department of Biochemistry & Biophysics, Texas A&M University, College Station, TX 77843
| | - Craig Kaplan
- Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA 15260
| |
Collapse
|
31
|
Kelly MS, Macke AC, Kahawatte S, Stump JE, Miller AR, Dima RI. The quaternary question: Determining allostery in spastin through dynamics classification learning and bioinformatics. J Chem Phys 2023; 158:125102. [PMID: 37003743 DOI: 10.1063/5.0139273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2023] Open
Abstract
The nanomachine from the ATPases associated with various cellular activities superfamily, called spastin, severs microtubules during cellular processes. To characterize the functionally important allostery in spastin, we employed methods from evolutionary information, to graph-based networks, to machine learning applied to atomistic molecular dynamics simulations of spastin in its monomeric and the functional hexameric forms, in the presence or absence of ligands. Feature selection, using machine learning approaches, for transitions between spastin states recognizes all the regions that have been proposed as allosteric or functional in the literature. The analysis of the composition of the Markov State Model macrostates in the spastin monomer, and the analysis of the direction of change in the top machine learning features for the transitions, indicate that the monomer favors the binding of ATP, which primes the regions involved in the formation of the inter-protomer interfaces for binding to other protomer(s). Allosteric path analysis of graph networks, built based on the cross-correlations between residues in simulations, shows that perturbations to a hub specific for the pre-hydrolysis hexamer propagate throughout the structure by passing through two obligatory regions: the ATP binding pocket, and pore loop 3, which connects the substrate binding site to the ATP binding site. Our findings support a model where the changes in the terminal protomers due to the binding of ligands play an active role in the force generation in spastin. The secondary structures in spastin, which are found to be highly degenerative within the network paths, are also critical for feature transitions of the classification models, which can guide the design of allosteric effectors to enhance or block allosteric signaling.
Collapse
Affiliation(s)
- Maria S Kelly
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| | - Amanda C Macke
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| | - Shehani Kahawatte
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| | - Jacob E Stump
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| | - Abigail R Miller
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| | - Ruxandra I Dima
- Department of Chemistry, University of Cincinnati, Cincinnati, Ohio 45221, USA
| |
Collapse
|
32
|
Luan Y, Tang Z, He Y, Xie Z. Intra-Domain Residue Coevolution in Transcription Factors Contributes to DNA Binding Specificity. Microbiol Spectr 2023; 11:e0365122. [PMID: 36943132 PMCID: PMC10100741 DOI: 10.1128/spectrum.03651-22] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2022] [Accepted: 02/22/2023] [Indexed: 03/23/2023] Open
Abstract
Understanding the basis of the DNA-binding specificity of transcription factors (TFs) has been of long-standing interest. Despite extensive efforts to map millions of putative TF binding sequences, identifying the critical determinants for DNA binding specificity remains a major challenge. The coevolution of residues in proteins occurs due to a shared evolutionary history. However, it is unclear how coevolving residues in TFs contribute to DNA binding specificity. Here, we systematically collected publicly available data sets from multiple large-scale high-throughput TF-DNA interaction screening experiments for the major TF families with large numbers of TF members. These families included the Homeobox, HLH, bZIP_1, Ets, HMG_box, ZF-C4, and Zn_clus TFs. We detected TF subclass-determining sites (TSDSs) and showed that the TSDSs were more likely to coevolve with other TSDSs than with non-TSDSs, particularly for the Homeobox, HLH, Ets, bZIP_1, and HMG_box TF families. By in silico modeling, we showed that mutation of the highly coevolving residues could significantly reduce the stability of the TF-DNA complex. The distant residues from the DNA interface also contributed to TF-DNA binding activity. Overall, our study gave evidence that coevolved residues relate to transcriptional regulation and provided insights into the potential application of engineered DNA-binding domains and proteins. IMPORTANCE While unraveling DNA-binding specificity of TFs is the key to understanding the basis and molecular mechanism of gene expression regulation, identifying the critical determinants that contribute to DNA binding specificity remains a major challenge. In this study, we provided evidence showing that coevolving residues in TF domains contributed to DNA binding specificity. We demonstrated that the TSDSs were more likely to coevolve with other TSDSs than with non-TSDSs. Mutation of the coevolving residue pairs (CRPs) could significantly reduce the stability of THE TF-DNA complex, and even the distant residues from the DNA interface contribute to TF-DNA binding activity. Collectively, our study expands our knowledge of the interactions among coevolved residues in TFs, tertiary contacting, and functional importance in refined transcriptional regulation. Understanding the impact of coevolving residues in TFs will help understand the details of transcription of gene regulation and advance the application of engineered DNA-binding domains and protein.
Collapse
Affiliation(s)
- Yizhao Luan
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zehua Tang
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Yao He
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| | - Zhi Xie
- State Key Laboratory of Ophthalmology, Guangdong Provincial Key Laboratory of Ophthalmology and Visual Science, Zhongshan Ophthalmic Center, Sun Yat-sen University, Guangzhou, China
| |
Collapse
|
33
|
Alegre-Martí A, Jiménez-Panizo A, Martínez-Tébar A, Poulard C, Peralta-Moreno MN, Abella M, Antón R, Chiñas M, Eckhard U, Piulats JM, Rojas AM, Fernández-Recio J, Rubio-Martínez J, Le Romancer M, Aytes Á, Fuentes-Prior P, Estébanez-Perpiñá E. A hotspot for posttranslational modifications on the androgen receptor dimer interface drives pathology and anti-androgen resistance. SCIENCE ADVANCES 2023; 9:eade2175. [PMID: 36921044 PMCID: PMC10017050 DOI: 10.1126/sciadv.ade2175] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 08/02/2022] [Accepted: 02/15/2023] [Indexed: 06/18/2023]
Abstract
Mutations of the androgen receptor (AR) associated with prostate cancer and androgen insensitivity syndrome may profoundly influence its structure, protein interaction network, and binding to chromatin, resulting in altered transcription signatures and drug responses. Current structural information fails to explain the effect of pathological mutations on AR structure-function relationship. Here, we have thoroughly studied the effects of selected mutations that span the complete dimer interface of AR ligand-binding domain (AR-LBD) using x-ray crystallography in combination with in vitro, in silico, and cell-based assays. We show that these variants alter AR-dependent transcription and responses to anti-androgens by inducing a previously undescribed allosteric switch in the AR-LBD that increases exposure of a major methylation target, Arg761. We also corroborate the relevance of residues Arg761 and Tyr764 for AR dimerization and function. Together, our results reveal allosteric coupling of AR dimerization and posttranslational modifications as a disease mechanism with implications for precision medicine.
Collapse
Affiliation(s)
- Andrea Alegre-Martí
- Structural Biology of Nuclear Receptors, Department of Biochemistry and Molecular Biomedicine, Faculty of Biology, University of Barcelona (UB), 08028 Barcelona, Spain
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain
| | - Alba Jiménez-Panizo
- Structural Biology of Nuclear Receptors, Department of Biochemistry and Molecular Biomedicine, Faculty of Biology, University of Barcelona (UB), 08028 Barcelona, Spain
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain
| | - Adrián Martínez-Tébar
- Programs of Molecular Mechanisms and Experimental Therapeutics in Oncology (ONCOBell) and Cancer Therapeutics Resistance (ProCURE), Catalan Institute of Oncology, Bellvitge Institute for Biomedical Research, 08908 Barcelona, Spain
| | - Coralie Poulard
- Cancer Research Center of Lyon, CNRS UMR5286, Inserm U1502, University of Lyon, 69000 Lyon, France
| | - M. Núria Peralta-Moreno
- Department of Materials Science and Physical Chemistry, Faculty of Chemistry and Institut de Recerca en Química Teorica i Computacional (IQTCUB), 08028 Barcelona, Spain
| | - Montserrat Abella
- Structural Biology of Nuclear Receptors, Department of Biochemistry and Molecular Biomedicine, Faculty of Biology, University of Barcelona (UB), 08028 Barcelona, Spain
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain
| | - Rosa Antón
- Biomedical Research Institute Sant Pau (IIB Sant Pau), 08041 Barcelona, Spain
| | - Marcos Chiñas
- Programs of Molecular Mechanisms and Experimental Therapeutics in Oncology (ONCOBell) and Cancer Therapeutics Resistance (ProCURE), Catalan Institute of Oncology, Bellvitge Institute for Biomedical Research, 08908 Barcelona, Spain
- Universidad Nacional Autónoma de México, Centro de Ciencias Genómicas, Cuernavaca, 61740 Morelos, Mexico
| | - Ulrich Eckhard
- Department of Structural and Molecular Biology, Molecular Biology Institute of Barcelona (IBMB-CSIC), 08028 Barcelona, Spain
| | - Josep M. Piulats
- Programs of Molecular Mechanisms and Experimental Therapeutics in Oncology (ONCOBell) and Cancer Therapeutics Resistance (ProCURE), Catalan Institute of Oncology, Bellvitge Institute for Biomedical Research, 08908 Barcelona, Spain
| | - Ana M. Rojas
- Computational Biology and Bioinformatics, Andalusian Center for Developmental Biology (CABD-CSIC), 41013 Sevilla, Spain
| | - Juan Fernández-Recio
- Instituto de Ciencias de la Vid y del Vino (ICVV-CSIC), CSIC-UR-Gobierno de La Rioja, 26007 Logroño, Spain
| | - Jaime Rubio-Martínez
- Department of Materials Science and Physical Chemistry, Faculty of Chemistry and Institut de Recerca en Química Teorica i Computacional (IQTCUB), 08028 Barcelona, Spain
| | - Muriel Le Romancer
- Cancer Research Center of Lyon, CNRS UMR5286, Inserm U1502, University of Lyon, 69000 Lyon, France
| | - Álvaro Aytes
- Programs of Molecular Mechanisms and Experimental Therapeutics in Oncology (ONCOBell) and Cancer Therapeutics Resistance (ProCURE), Catalan Institute of Oncology, Bellvitge Institute for Biomedical Research, 08908 Barcelona, Spain
| | - Pablo Fuentes-Prior
- Biomedical Research Institute Sant Pau (IIB Sant Pau), 08041 Barcelona, Spain
| | - Eva Estébanez-Perpiñá
- Structural Biology of Nuclear Receptors, Department of Biochemistry and Molecular Biomedicine, Faculty of Biology, University of Barcelona (UB), 08028 Barcelona, Spain
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain
| |
Collapse
|
34
|
Mathy CJP, Mishra P, Flynn JM, Perica T, Mavor D, Bolon DNA, Kortemme T. A complete allosteric map of a GTPase switch in its native cellular network. Cell Syst 2023; 14:237-246.e7. [PMID: 36801015 PMCID: PMC10173951 DOI: 10.1016/j.cels.2023.01.003] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2022] [Revised: 11/08/2022] [Accepted: 01/06/2023] [Indexed: 02/19/2023]
Abstract
Allosteric regulation is central to protein function in cellular networks. A fundamental open question is whether cellular regulation of allosteric proteins occurs only at a few defined positions or at many sites distributed throughout the structure. Here, we probe the regulation of GTPases-protein switches that control signaling through regulated conformational cycling-at residue-level resolution by deep mutagenesis in the native biological network. For the GTPase Gsp1/Ran, we find that 28% of the 4,315 assayed mutations show pronounced gain-of-function responses. Twenty of the sixty positions enriched for gain-of-function mutations are outside the canonical GTPase active site switch regions. Kinetic analysis shows that these distal sites are allosterically coupled to the active site. We conclude that the GTPase switch mechanism is broadly sensitive to cellular allosteric regulation. Our systematic discovery of new regulatory sites provides a functional map to interrogate and target GTPases controlling many essential biological processes.
Collapse
Affiliation(s)
- Christopher J P Mathy
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, San Francisco, CA 94158, USA
| | - Parul Mishra
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA; School of Life Sciences, University of Hyderabad, Hyderabad, Telangana, India
| | - Julia M Flynn
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Tina Perica
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA
| | - David Mavor
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Daniel N A Bolon
- Department of Biochemistry and Molecular Biotechnology, University of Massachusetts Medical School, Worcester, MA 01605, USA.
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, San Francisco, CA 94158, USA; Quantitative Biosciences Institute, University of California, San Francisco, San Francisco, CA 94158, USA; The UC Berkeley-UCSF Graduate Program in Bioengineering, University of California, San Francisco, San Francisco, CA 94158, USA; Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
| |
Collapse
|
35
|
Rappoport D, Jinich A. Enzyme Substrate Prediction from Three-Dimensional Feature Representations Using Space-Filling Curves. J Chem Inf Model 2023; 63:1637-1648. [PMID: 36802628 DOI: 10.1021/acs.jcim.3c00005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Abstract
Compact and interpretable structural feature representations are required for accurately predicting properties and function of proteins. In this work, we construct and evaluate three-dimensional feature representations of protein structures based on space-filling curves (SFCs). We focus on the problem of enzyme substrate prediction, using two ubiquitous enzyme families as case studies: the short-chain dehydrogenase/reductases (SDRs) and the S-adenosylmethionine-dependent methyltransferases (SAM-MTases). Space-filling curves such as the Hilbert curve and the Morton curve generate a reversible mapping from discretized three-dimensional to one-dimensional representations and thus help to encode three-dimensional molecular structures in a system-independent way and with only a few adjustable parameters. Using three-dimensional structures of SDRs and SAM-MTases generated using AlphaFold2, we assess the performance of the SFC-based feature representations in predictions on a new benchmark database of enzyme classification tasks including their cofactor and substrate selectivity. Gradient-boosted tree classifiers yield binary prediction accuracy of 0.77-0.91 and area under curve (AUC) characteristics of 0.83-0.92 for the classification tasks. We investigate the effects of amino acid encoding, spatial orientation, and (the few) parameters of SFC-based encodings on the accuracy of the predictions. Our results suggest that geometry-based approaches such as SFCs are promising for generating protein structural representations and are complementary to the existing protein feature representations such as evolutionary scale modeling (ESM) sequence embeddings.
Collapse
Affiliation(s)
- Dmitrij Rappoport
- Department of Chemistry, University of California, Irvine, 1102 Natural Sciences 2, Irvine, California 92697, United States
| | - Adrian Jinich
- Weill Cornell Medicine, 1300 York Avenue, Box 65, New York, New York 10065, United States
| |
Collapse
|
36
|
Glasgow A, Hobbs HT, Perry ZR, Wells ML, Marqusee S, Kortemme T. Ligand-specific changes in conformational flexibility mediate long-range allostery in the lac repressor. Nat Commun 2023; 14:1179. [PMID: 36859492 PMCID: PMC9977783 DOI: 10.1038/s41467-023-36798-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 02/17/2023] [Indexed: 03/03/2023] Open
Abstract
Biological regulation ubiquitously depends on protein allostery, but the regulatory mechanisms are incompletely understood, especially in proteins that undergo ligand-induced allostery with few structural changes. Here we used hydrogen-deuterium exchange with mass spectrometry (HDX/MS) to map allosteric effects in a paradigm ligand-responsive transcription factor, the lac repressor (LacI), in different functional states (apo, or bound to inducer, anti-inducer, and/or DNA). Although X-ray crystal structures of the LacI core domain in these states are nearly indistinguishable, HDX/MS experiments reveal widespread differences in flexibility. We integrate these results with modeling of protein-ligand-solvent interactions to propose a revised model for allostery in LacI, where ligand binding allosterically shifts the conformational ensemble as a result of distinct changes in the rigidity of secondary structures in the different states. Our model provides a mechanistic basis for the altered function of distal mutations. More generally, our approach provides a platform for characterizing and engineering protein allostery.
Collapse
Affiliation(s)
- Anum Glasgow
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA.
- Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA.
| | - Helen T Hobbs
- Department of Chemistry, University of California, Berkeley, Berkeley, CA, 94720, USA
| | - Zion R Perry
- Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, CT, 06511, USA
| | - Malcolm L Wells
- Department of Physics, Columbia University, New York, NY, 10032, USA
| | - Susan Marqusee
- Department of Chemistry, University of California, Berkeley, Berkeley, CA, 94720, USA
- Department of Molecular & Cell Biology, University of California, Berkeley, Berkeley, CA, 94720, USA
| | - Tanja Kortemme
- Department of Bioengineering and Therapeutic Sciences, University of California, San Francisco, CA, 94158, USA
| |
Collapse
|
37
|
Spirov AV, Myasnikova EM. Problem of Domain/Building Block Preservation in the Evolution of Biological Macromolecules and Evolutionary Computation. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2023; 20:1345-1362. [PMID: 35594219 DOI: 10.1109/tcbb.2022.3175908] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/04/2023]
Abstract
Structurally and functionally isolated domains in biological macromolecular evolution, both natural and artificial, are largely similar to "schemata", building blocks (BBs), in evolutionary computation (EC). The problem of preserving in subsequent evolutionary searches the already found domains / BBs is well known and quite relevant in biology as well as in EC. Both biology and EC are seeing parallel and independent development of several approaches to identifying and preserving previously identified domains / BBs. First, we notice the similarity of DNA shuffling methods in synthetic biology and multi-parent recombination algorithms in EC. Furthermore, approaches to computer identification of domains in proteins that are being developed in biology can be aligned with BB identification methods in EC. Finally, approaches to chimeric protein libraries optimization in biology can be compared to evolutionary search methods based on probabilistic models in EC. We propose to validate the prospects of mutual exchange of ideas and transfer of algorithms and approaches between evolutionary systems biology and EC in these three principal directions. A crucial aim of this transfer is the design of new advanced experimental techniques capable of solving more complex problems of in vitro evolution.
Collapse
|
38
|
Chakravarty D, Schafer JW, Porter LL. Distinguishing features of fold-switching proteins. Protein Sci 2023; 32:e4596. [PMID: 36782353 PMCID: PMC9951197 DOI: 10.1002/pro.4596] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 01/30/2023] [Accepted: 02/09/2023] [Indexed: 02/15/2023]
Abstract
Though many folded proteins assume one stable structure that performs one function, a small-but-increasing number remodel their secondary and tertiary structures and change their functions in response to cellular stimuli. These fold-switching proteins regulate biological processes and are associated with autoimmune dysfunction, severe acute respiratory syndrome coronavirus-2 infection, and more. Despite their biological importance, it is difficult to computationally predict fold switching. With the aim of advancing computational prediction and experimental characterization of fold switchers, this review discusses several features that distinguish fold-switching proteins from their single-fold and intrinsically disordered counterparts. First, the isolated structures of fold switchers are less stable and more heterogeneous than single folders but more stable and less heterogeneous than intrinsically disordered proteins (IDPs). Second, the sequences of single fold, fold switching, and intrinsically disordered proteins can evolve at distinct rates. Third, proteins from these three classes are best predicted using different computational techniques. Finally, late-breaking results suggest that single folders, fold switchers, and IDPs have distinct patterns of residue-residue coevolution. The review closes by discussing high-throughput and medium-throughput experimental approaches that might be used to identify new fold-switching proteins.
Collapse
Affiliation(s)
- Devlina Chakravarty
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Joseph W. Schafer
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
| | - Lauren L. Porter
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of HealthBethesdaMarylandUSA
- Biochemistry and Biophysics Center, National Heart, Lung, and Blood Institute, National Institutes of HealthBethesdaMarylandUSA
| |
Collapse
|
39
|
Xie J, Zhang W, Zhu X, Deng M, Lai L. Coevolution-based prediction of key allosteric residues for protein function regulation. eLife 2023; 12:81850. [PMID: 36799896 PMCID: PMC9981151 DOI: 10.7554/elife.81850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2022] [Accepted: 02/16/2023] [Indexed: 02/18/2023] Open
Abstract
Allostery is fundamental to many biological processes. Due to the distant regulation nature, how allosteric mutations, modifications, and effector binding impact protein function is difficult to forecast. In protein engineering, remote mutations cannot be rationally designed without large-scale experimental screening. Allosteric drugs have raised much attention due to their high specificity and possibility of overcoming existing drug-resistant mutations. However, optimization of allosteric compounds remains challenging. Here, we developed a novel computational method KeyAlloSite to predict allosteric site and to identify key allosteric residues (allo-residues) based on the evolutionary coupling model. We found that protein allosteric sites are strongly coupled to orthosteric site compared to non-functional sites. We further inferred key allo-residues by pairwise comparing the difference of evolutionary coupling scores of each residue in the allosteric pocket with the functional site. Our predicted key allo-residues are in accordance with previous experimental studies for typical allosteric proteins like BCR-ABL1, Tar, and PDZ3, as well as key cancer mutations. We also showed that KeyAlloSite can be used to predict key allosteric residues distant from the catalytic site that are important for enzyme catalysis. Our study demonstrates that weak coevolutionary couplings contain important information of protein allosteric regulation function. KeyAlloSite can be applied in studying the evolution of protein allosteric regulation, designing and optimizing allosteric drugs, and performing functional protein design and enzyme engineering.
Collapse
Affiliation(s)
- Juan Xie
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking UniversityBeijingChina
| | - Weilin Zhang
- BNLMS, Peking-Tsinghua Center for Life Sciences at the College of Chemistry and Molecular Engineering, Peking UniversityBeijingChina
| | - Xiaolei Zhu
- School of Sciences, Anhui Agricultural UniversityHefeiChina
| | - Minghua Deng
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking UniversityBeijingChina
- School of Mathematical Sciences, Peking UniversityBeijingChina
- Center for Statistical Science, Peking UniversityBeijingChina
| | - Luhua Lai
- Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking UniversityBeijingChina
- BNLMS, Peking-Tsinghua Center for Life Sciences at the College of Chemistry and Molecular Engineering, Peking UniversityBeijingChina
- Research Unit of Drug Design Method, Chinese Academy of Medical Sciences (2021RU014)BeijingChina
| |
Collapse
|
40
|
Kiewhuo K, Priyadarsinee L, Sarma H, Sastry GN. Molecular dynamics simulations reveal the effect of mutations in the RING domains of BRCA1-BARD1 complex and its relevance to the prognosis of breast cancer. J Biomol Struct Dyn 2023; 41:12734-12752. [PMID: 36775657 DOI: 10.1080/07391102.2023.2175383] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2022] [Accepted: 01/05/2023] [Indexed: 02/14/2023]
Abstract
The N-terminal RING-RING domain of BRCA1-BARD1 is an E3 ubiquitin ligase complex that plays a critical role in tumor suppression through DNA double stranded repair mechanism. Mutations in the BRCA1-BARD1 heterodimer RING domains were found to have an association with breast and ovarian cancer by a way of hampering the E3 ubiquitin ligase activity. Herein, the molecular mechanism of interaction, conformational change due to the specific mutations on the BRCA1-BARD1 complex at atomic level has been examined by employing molecular modeling techniques. Sixteen mutations have been selected for the study. Molecular dynamics simulation results reveal that the mutant complexes have more local perturbation with a high residual fluctuation in the zinc binding sites and central helix. A few of the BRCA1 (V11A, I21V, I42V, R71G, I31M and L51W) mutants have been experimentally identified that do not impair E3 ligase activity, display an enhanced number of H-bonds and non-bonded contacts at the interacting interface as revealed by MD simulation. The mutation of BRCA1 (C61G, C64Y, C39Y and C24R) and BARD1 (C53W, C71Y and C83R) zinc binding residues displayed a smaller number of significant H-bonds, other interactions and also loss of some of the hotspot residues. Additionally, most of the mutant complexes display relatively lower electrostatic energy, H-bonding and total stabilizing energy as compared to wild-type. The current study attempts to unravel the role of BRCA1-BARD1 mutations and delineates the structural and conformational dynamics in the progression of breast cancer.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Kikrusenuo Kiewhuo
- Advanced Computation and Data Sciences Division, CSIR-North East Institute of Science and Technology, Jorhat, Assam, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Lipsa Priyadarsinee
- Advanced Computation and Data Sciences Division, CSIR-North East Institute of Science and Technology, Jorhat, Assam, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| | - Himakshi Sarma
- Advanced Computation and Data Sciences Division, CSIR-North East Institute of Science and Technology, Jorhat, Assam, India
| | - G Narahari Sastry
- Advanced Computation and Data Sciences Division, CSIR-North East Institute of Science and Technology, Jorhat, Assam, India
- Academy of Scientific and Innovative Research (AcSIR), Ghaziabad, India
| |
Collapse
|
41
|
Cowan B, Beveridge DL, Thayer KM. Allosteric Signaling in PDZ Energetic Networks: Embedding Error Analysis. J Phys Chem B 2023; 127:623-633. [PMID: 36626697 PMCID: PMC9884075 DOI: 10.1021/acs.jpcb.2c06546] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Revised: 12/23/2022] [Indexed: 01/12/2023]
Abstract
Allosteric signaling in proteins has been known for some half a century, yet how the signal traverses the protein remains an active area of research. Recently, the importance of electrostatics to achieve long-range signaling has become increasingly appreciated. Our laboratory has been working on developing network approaches to capture such interactions. In this study, we turn our attention to the well-studied allosteric model protein, PDZ. We study the allosteric dynamics on a per-residue basis in key constructs involving the PDZ domain, its allosteric effector, and its peptide ligand. We utilize molecular dynamics trajectories to create the networks for the constructs to explore the allosteric effect by plotting the heat kernel results onto axes defined by principal components. We introduce a new metric to quantitate the volume sampled by a residue in the latent space. We relate our findings to PDZ and the greater field of allostery.
Collapse
Affiliation(s)
- Benjamin
S. Cowan
- Department
of Computer Science, Wesleyan University, Middletown, Connecticut06457, United States
- College
of Integrative Sciences, Wesleyan University, Middletown, Connecticut06457, United States
| | - David L. Beveridge
- Molecular
Biophysics Program, Wesleyan University, Middletown, Connecticut06457, United States
- Department
of Chemistry, Wesleyan University, Middletown, Connecticut06457, United States
| | - Kelly M. Thayer
- Department
of Computer Science, Wesleyan University, Middletown, Connecticut06457, United States
- Molecular
Biophysics Program, Wesleyan University, Middletown, Connecticut06457, United States
- Department
of Chemistry, Wesleyan University, Middletown, Connecticut06457, United States
- College
of Integrative Sciences, Wesleyan University, Middletown, Connecticut06457, United States
| |
Collapse
|
42
|
Schafer JW, Porter LL. Evolutionary selection of proteins with two folds. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.18.524637. [PMID: 36789442 PMCID: PMC9928049 DOI: 10.1101/2023.01.18.524637] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Although most globular proteins fold into a single stable structure 1 , an increasing number have been shown to remodel their secondary and tertiary structures in response to cellular stimuli 2 . State-of-the-art algorithms 3-5 predict that these fold-switching proteins assume only one stable structure 6,7 , missing their functionally critical alternative folds. Why these algorithms predict a single fold is unclear, but all of them infer protein structure from coevolved amino acid pairs. Here, we hypothesize that coevolutionary signatures are being missed. Suspecting that over-represented single-fold sequences may be masking these signatures, we developed an approach to search both highly diverse protein superfamilies-composed of single-fold and fold-switching variants-and protein subfamilies with more fold-switching variants. This approach successfully revealed coevolution of amino acid pairs uniquely corresponding to both conformations of 56/58 fold-switching proteins from distinct families. Then, using a set of coevolved amino acid pairs predicted by our approach, we successfully biased AlphaFold2 5 to predict two experimentally consistent conformations of a candidate protein with unsolved structure. The discovery of widespread dual-fold coevolution indicates that fold-switching sequences have been preserved by natural selection, implying that their functionalities provide evolutionary advantage and paving the way for predictions of diverse protein structures from single sequences.
Collapse
Affiliation(s)
- Joseph W. Schafer
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
| | - Lauren L. Porter
- National Library of Medicine, National Center for Biotechnology Information, National Institutes of Health, Bethesda, MD 20894, USA
- National Heart, Lung, and Blood Institute, Biochemistry and Biophysics Center, National Institutes of Health, Bethesda, MD 20892, USA
| |
Collapse
|
43
|
Govindaraj RG, Thangapandian S, Schauperl M, Denny RA, Diller DJ. Recent applications of computational methods to allosteric drug discovery. Front Mol Biosci 2023; 9:1070328. [PMID: 36710877 PMCID: PMC9877542 DOI: 10.3389/fmolb.2022.1070328] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2022] [Accepted: 12/13/2022] [Indexed: 01/13/2023] Open
Abstract
Interest in exploiting allosteric sites for the development of new therapeutics has grown considerably over the last two decades. The chief driving force behind the interest in allostery for drug discovery stems from the fact that in comparison to orthosteric sites, allosteric sites are less conserved across a protein family, thereby offering greater opportunity for selectivity and ultimately tolerability. While there is significant overlap between structure-based drug design for orthosteric and allosteric sites, allosteric sites offer additional challenges mostly involving the need to better understand protein flexibility and its relationship to protein function. Here we examine the extent to which structure-based drug design is impacting allosteric drug design by highlighting several targets across a variety of target classes.
Collapse
Affiliation(s)
- Rajiv Gandhi Govindaraj
- Computational Chemistry, HotSpot Therapeutics Inc., Boston, MA, United States,*Correspondence: Rajiv Gandhi Govindaraj,
| | | | - Michael Schauperl
- Computational Chemistry, HotSpot Therapeutics Inc., Boston, MA, United States
| | | | - David J. Diller
- Computational Chemistry, HotSpot Therapeutics Inc., Boston, MA, United States
| |
Collapse
|
44
|
Thermodynamic architecture and conformational plasticity of GPCRs. Nat Commun 2023; 14:128. [PMID: 36624096 PMCID: PMC9829892 DOI: 10.1038/s41467-023-35790-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2022] [Accepted: 12/29/2022] [Indexed: 01/11/2023] Open
Abstract
G-protein-coupled receptors (GPCRs) are ubiquitous integral membrane proteins involved in diverse cellular signaling processes. Here, we carry out a large-scale ensemble thermodynamic study of 45 ligand-free GPCRs employing a structure-based statistical mechanical framework. We find that multiple partially structured states co-exist in the GPCR native ensemble, with the TM helices 1, 6 and 7 displaying varied folding status, and shaping the conformational landscape. Strongly coupled residues are anisotropically distributed, accounting for only 13% of the residues, illustrating that a large number of residues are inherently dynamic. Active-state GPCRs are characterized by reduced conformational heterogeneity with altered coupling-patterns distributed throughout the structural scaffold. In silico alanine-scanning mutagenesis reveals that extra- and intra-cellular faces of GPCRs are coupled thermodynamically, highlighting an exquisite structural specialization and the fluid nature of the intramolecular interaction network. The ensemble-based perturbation methodology presented here lays the foundation for understanding allosteric mechanisms and the effects of disease-causing mutations in GCPRs.
Collapse
|
45
|
Erman B. Mutual information analysis of mutation, nonlinearity, and triple interactions in proteins. Proteins 2023; 91:121-133. [PMID: 36000344 DOI: 10.1002/prot.26415] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Revised: 07/16/2022] [Accepted: 07/18/2022] [Indexed: 12/15/2022]
Abstract
Mutations are the cause of several diseases as well as the underlying force of evolution. A thorough understanding of their biophysical consequences is essential. We present a computational framework for evaluating different levels of mutual information (MI) and its dependence on mutation. We used molecular dynamics trajectories of the third PDZ domain and its different mutations. Nonlinear MI between all residue pairs are calculated by tensor Hermite polynomials up to the fifth order and compared with results from multivariate Gaussian distribution of joint probabilities. We show that MI is written as the sum of a Gaussian and a nonlinear component. Results for the PDZ domain show that the Gaussian term gives a sufficiently accurate representation of MI when compared with nonlinear terms up to the fifth order. Changes in MI between residue pairs show the characteristic patterns resulting from specific mutations. Emergence of new peaks in the MI versus residue index plots of mutated PDZ shows how mutation may change allosteric pathways. Triple correlations are characterized by evaluating MI between triplets of residues. We observed that certain triplets are strongly affected by mutation. Susceptibility of residues to perturbation is obtained by MI and discussed in terms of linear response theory.
Collapse
Affiliation(s)
- Burak Erman
- Department of Chemical and Biological Engineering, Koc University, Istanbul, Turkey
| |
Collapse
|
46
|
Shea A, Bartz J, Zhang L, Dong X. Predicting mutational function using machine learning. MUTATION RESEARCH. REVIEWS IN MUTATION RESEARCH 2023; 791:108457. [PMID: 36965820 PMCID: PMC10239318 DOI: 10.1016/j.mrrev.2023.108457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 03/11/2023] [Accepted: 03/20/2023] [Indexed: 03/27/2023]
Abstract
Genetic variations are one of the major causes of phenotypic variations between human individuals. Although beneficial as being the substrate of evolution, germline mutations may cause diseases, including Mendelian diseases and complex diseases such as diabetes and heart diseases. Mutations occurring in somatic cells are a main cause of cancer and likely cause age-related phenotypes and other age-related diseases. Because of the high abundance of genetic variations in the human genome, i.e., millions of germline variations per human subject and thousands of additional somatic mutations per cell, it is technically challenging to experimentally verify the function of every possible mutation and their interactions. Significant progress has been made to solve this problem using computational approaches, especially machine learning (ML). Here, we review the progress and achievements made in recent years in this field of research. We classify the computational models in two ways: one according to their prediction goals including protein structural alterations, gene expression changes, and disease risks, and the other according to their methodologies, including non-machine learning methods, classical machine learning methods, and deep neural network methods. For models in each category, we discuss their architecture, prediction accuracy, and potential limitations. This review provides new insights into the applications and future directions of computational approaches in understanding the role of mutations in aging and disease.
Collapse
Affiliation(s)
- Anthony Shea
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA
| | - Josh Bartz
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA; Bioinformatics and Computational Biology Program, University of Minnesota, Minneapolis, MN 55455, USA
| | - Lei Zhang
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Biochemistry, Molecular Biology and Biophysics, University of Minnesota, Minneapolis, MN 55455, USA
| | - Xiao Dong
- Institute on the Biology of Aging and Metabolism, University of Minnesota, Minneapolis, MN 55455, USA; Department of Genetics, Cell Biology and Development, University of Minnesota, Minneapolis, MN 55455, USA.
| |
Collapse
|
47
|
Llinares-López F, Berthet Q, Blondel M, Teboul O, Vert JP. Deep embedding and alignment of protein sequences. Nat Methods 2023; 20:104-111. [PMID: 36522501 DOI: 10.1038/s41592-022-01700-2] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2021] [Accepted: 10/24/2022] [Indexed: 12/23/2022]
Abstract
Protein sequence alignment is a key component of most bioinformatics pipelines to study the structures and functions of proteins. Aligning highly divergent sequences remains, however, a difficult task that current algorithms often fail to perform accurately, leaving many proteins or open reading frames poorly annotated. Here we leverage recent advances in deep learning for language modeling and differentiable programming to propose DEDAL (deep embedding and differentiable alignment), a flexible model to align protein sequences and detect homologs. DEDAL is a machine learning-based model that learns to align sequences by observing large datasets of raw protein sequences and of correct alignments. Once trained, we show that DEDAL improves by up to two- or threefold the alignment correctness over existing methods on remote homologs and better discriminates remote homologs from evolutionarily unrelated sequences, paving the way to improvements on many downstream tasks relying on sequence alignment in structural and functional genomics.
Collapse
|
48
|
Choudhuri I, Biswas A, Haldane A, Levy RM. Contingency and Entrenchment of Drug-Resistance Mutations in HIV Viral Proteins. J Phys Chem B 2022; 126:10622-10636. [PMID: 36493468 PMCID: PMC9841799 DOI: 10.1021/acs.jpcb.2c06123] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
The ability of HIV-1 to rapidly mutate leads to antiretroviral therapy (ART) failure among infected patients. Drug-resistance mutations (DRMs), which cause a fitness penalty to intrinsic viral fitness, are compensated by accessory mutations with favorable epistatic interactions which cause an evolutionary trapping effect, but the kinetics of this overall process has not been well characterized. Here, using a Potts Hamiltonian model describing epistasis combined with kinetic Monte Carlo simulations of evolutionary trajectories, we explore how epistasis modulates the evolutionary dynamics of HIV DRMs. We show how the occurrence of a drug-resistance mutation is contingent on favorable epistatic interactions with many other residues of the sequence background and that subsequent mutations entrench DRMs. We measure the time-autocorrelation of fluctuations in the likelihood of DRMs due to epistatic coupling with the sequence background, which reveals the presence of two evolutionary processes controlling DRM kinetics with two distinct time scales. Further analysis of waiting times for the evolutionary trapping effect to reverse reveals that the sequences which entrench (trap) a DRM are responsible for the slower time scale. We also quantify the overall strength of epistatic effects on the evolutionary kinetics for different mutations and show these are much larger for DRM positions than polymorphic positions, and we also show that trapping of a DRM is often caused by the collective effect of many accessory mutations, rather than a few strongly coupled ones, suggesting the importance of multiresidue sequence variations in HIV evolution. The analysis presented here provides a framework to explore the kinetic pathways through which viral proteins like HIV evolve under drug-selection pressure.
Collapse
Affiliation(s)
| | | | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122, United States; Department of Physics, Temple University, Philadelphia, Pennsylvania 19122-6008, United States
| | - Ronald M. Levy
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122, United States; Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122, United States
| |
Collapse
|
49
|
Kennedy EN, Foster CA, Barr SA, Bourret RB. General strategies for using amino acid sequence data to guide biochemical investigation of protein function. Biochem Soc Trans 2022; 50:1847-1858. [PMID: 36416676 PMCID: PMC10257402 DOI: 10.1042/bst20220849] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2022] [Revised: 11/04/2022] [Accepted: 11/09/2022] [Indexed: 11/24/2022]
Abstract
The rapid increase of '-omics' data warrants the reconsideration of experimental strategies to investigate general protein function. Studying individual members of a protein family is likely insufficient to provide a complete mechanistic understanding of family functions, especially for diverse families with thousands of known members. Strategies that exploit large amounts of available amino acid sequence data can inspire and guide biochemical experiments, generating broadly applicable insights into a given family. Here we review several methods that utilize abundant sequence data to focus experimental efforts and identify features truly representative of a protein family or domain. First, coevolutionary relationships between residues within primary sequences can be successfully exploited to identify structurally and/or functionally important positions for experimental investigation. Second, functionally important variable residue positions typically occupy a limited sequence space, a property useful for guiding biochemical characterization of the effects of the most physiologically and evolutionarily relevant amino acids. Third, amino acid sequence variation within domains shared between different protein families can be used to sort a particular domain into multiple subtypes, inspiring further experimental designs. Although generally applicable to any kind of protein domain because they depend solely on amino acid sequences, the second and third approaches are reviewed in detail because they appear to have been used infrequently and offer immediate opportunities for new advances. Finally, we speculate that future technologies capable of analyzing and manipulating conserved and variable aspects of the three-dimensional structures of a protein family could lead to broad insights not attainable by current methods.
Collapse
Affiliation(s)
- Emily N. Kennedy
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| | - Clay A. Foster
- Department of Pediatrics, Section Hematology/Oncology, University of Oklahoma Health Sciences Center, Oklahoma City, Oklahoma, United States of America
| | - Sarah A. Barr
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| | - Robert B. Bourret
- Department of Microbiology & Immunology, University of North Carolina, Chapel Hill, NC, United States of America
| |
Collapse
|
50
|
Jiménez-Panizo A, Alegre-Martí A, Tettey T, Fettweis G, Abella M, Antón R, Johnson T, Kim S, Schiltz R, Núñez-Barrios I, Font-Díaz J, Caelles C, Valledor A, Pérez P, Rojas A, Fernández-Recio J, Presman D, Hager G, Fuentes-Prior P, Estébanez-Perpiñá E. The multivalency of the glucocorticoid receptor ligand-binding domain explains its manifold physiological activities. Nucleic Acids Res 2022; 50:13063-13082. [PMID: 36464162 PMCID: PMC9825158 DOI: 10.1093/nar/gkac1119] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2022] [Revised: 10/28/2022] [Accepted: 11/08/2022] [Indexed: 12/05/2022] Open
Abstract
The glucocorticoid receptor (GR) is a ubiquitously expressed transcription factor that controls metabolic and homeostatic processes essential for life. Although numerous crystal structures of the GR ligand-binding domain (GR-LBD) have been reported, the functional oligomeric state of the full-length receptor, which is essential for its transcriptional activity, remains disputed. Here we present five new crystal structures of agonist-bound GR-LBD, along with a thorough analysis of previous structural work. We identify four distinct homodimerization interfaces on the GR-LBD surface, which can associate into 20 topologically different homodimers. Biologically relevant homodimers were identified by studying a battery of GR point mutants including crosslinking assays in solution, quantitative fluorescence microscopy in living cells, and transcriptomic analyses. Our results highlight the relevance of non-canonical dimerization modes for GR, especially of contacts made by loop L1-3 residues such as Tyr545. Our work illustrates the unique flexibility of GR's LBD and suggests different dimeric conformations within cells. In addition, we unveil pathophysiologically relevant quaternary assemblies of the receptor with important implications for glucocorticoid action and drug design.
Collapse
Affiliation(s)
| | | | | | - Gregory Fettweis
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5055, USA
| | - Montserrat Abella
- Department of Biochemistry and Molecular Biomedicine, Faculty of Biology, University of Barcelona (UB), 08028 Barcelona, Spain,Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain
| | - Rosa Antón
- Biomedical Research Institute Sant Pau (IIB Sant Pau), 08041 Barcelona, Spain
| | - Thomas A Johnson
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5055, USA
| | - Sohyoung Kim
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5055, USA
| | - R Louis Schiltz
- National Cancer Institute, National Institutes of Health, Bethesda, MD 20892-5055, USA
| | - Israel Núñez-Barrios
- Andalusian Center for Developmental Biology (CABD-CSIC). Campus Universitario Pablo de Olavide, 41013 Sevilla, Spain
| | - Joan Font-Díaz
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain,Department of Cell Biology, Physiology and Immunology, Faculty of Biology, University of Barcelona, 08028 Barcelona, Spain
| | - Carme Caelles
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain,Department of Biochemistry and Physiology, Faculty of Pharmacy and Food Sciences, University of Barcelona, Barcelona 08028, Spain
| | - Annabel F Valledor
- Institute of Biomedicine of the University of Barcelona (IBUB), University of Barcelona (UB), 08028 Barcelona, Spain,Department of Cell Biology, Physiology and Immunology, Faculty of Biology, University of Barcelona, 08028 Barcelona, Spain
| | - Paloma Pérez
- Instituto de Biomedicina de Valencia (IBV)-CSIC, 46010, Valencia, Spain
| | - Ana M Rojas
- Andalusian Center for Developmental Biology (CABD-CSIC). Campus Universitario Pablo de Olavide, 41013 Sevilla, Spain
| | - Juan Fernández-Recio
- Instituto de Ciencias de la Vid y del Vino (ICVV), CSIC - Universidad de La Rioja - Gobierno de La Rioja, 26007 Logroño, Spain
| | - Diego M Presman
- IFIBYNE, UBA-CONICET, Universidad de Buenos Aires, Facultad de Ciencias Exactas y Naturales, Buenos Aires C1428EGA, Argentina
| | - Gordon L Hager
- Correspondence may also be addressed to Gordon L. Hager. Tel: +1 240 760 6618;
| | | | | |
Collapse
|