1
|
Mohanty V, Shakhnovich EI. Biophysical fitness landscape design traps viral evolution. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.30.646233. [PMID: 40236159 PMCID: PMC11996392 DOI: 10.1101/2025.03.30.646233] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/17/2025]
Abstract
We introduce foundational principles for designing customizable fitness landscapes for proteins. We focus on crafting antibody ensembles to create evolutionary traps which restrict viral fitness enhancement. By deriving a fundamental relationship between a mutant protein's fitness and its binding affinities to host receptors and antibodies, we show that the fitnesses of different protein sequences are designable, meaning they can be independently tuned by careful choice of antibodies. Given a user-defined target fitness landscape, stochastic optimization can be performed to obtain such an ensemble of antibodies which force the protein to evolve according to the designed target fitness landscape. We conduct in silico serial dilution experiments using microscopic chemical reaction dynamics to simulate viral evolution and validate the fitness landscape design. We then apply the design protocol to control the relative fitnesses of two SARS-CoV-2 neutral genotype networks while ensuring absolute fitness reduction. Finally, we introduce an iterative design protocol which consistently discovers better vaccination target sequences, generating antibodies that restrict the post-vaccination fitness growth of escape variants while simultaneously suppressing wildtype fitness. Biophysical fitness landscape design thus opens the door to prescient vaccine, antibody, and peptide design, thinking several steps ahead of pathogen evolution.
Collapse
|
2
|
Maisuradze GG, Thakur A, Khatri K, Haldane A, Levy RM. Using AlphaFold2 to Predict the Conformations of Side Chains in Folded Proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.02.10.637534. [PMID: 39990457 PMCID: PMC11844428 DOI: 10.1101/2025.02.10.637534] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 02/25/2025]
Abstract
AlphaFold has revolutionized protein structure prediction by accurately creating 3D structures from just the amino acid sequence. However, even with extensive research validating its overall accuracy, a key question remains: Can AlphaFold predict the conformation of individual amino acid residue side chains within a folded protein? This is important for the field of molecular modeling, particularly when predicting the effects of mutations on protein stability and ligand binding. AlphaFold generates a set of atomic coordinates not just for the mutated side chain but also for potential rearrangements across the entire protein structure. In this study we investigate the ability of ColabFold, an online implementation of AlphaFold2 (AF2), to predict the conformations of residue side chains in folded proteins. We find that over a set of 10 benchmark proteins, the side chain conformation prediction error of ColabFold is on average ~ 14 % forχ 1 dihedral angles, and increases to ~ 48 % forχ 3 dihedral angles. The prediction error is smaller for non-polar side chains and is somewhat improved using structural templates. ColabFold demonstrates a bias towards the most prevalent rotamer states in the protein data bank (PDB), potentially limiting its ability to capture rare side chain conformations effectively. As an application of AlphaFold to explore the structural consequences of strongly cooperative mutations on side chain rearrangements, we employ a Potts sequence-based statistical energy model to perform large scale mutational scans of two proteins ABL1 and PIM1 kinase, searching for the most strongly cooperative mutational pairs, and then use ColabFold to predict the structural signatures of this cooperativity on the interacting side chains. Our results demonstrate that integration of the sequence-based Potts model with AlphaFold into a single pipeline provides a new tool that can be used to explore the fundamental relationship between protein mutations, cooperative changes in structure, and fitness.
Collapse
Affiliation(s)
- Gia G. Maisuradze
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
- Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY, USA
| | - Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Kisan Khatri
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Physics, Temple University, Philadelphia, PA, USA
| | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Physics, Temple University, Philadelphia, PA, USA
| | - Ronald M. Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| |
Collapse
|
3
|
Gu X, Aranganathan A, Tiwary P. Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE. eLife 2024; 13:RP99702. [PMID: 39240197 PMCID: PMC11379456 DOI: 10.7554/elife.99702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2024] Open
Abstract
Small-molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2-based framework combined with all-atom enhanced sampling molecular dynamics and Induced Fit docking, named AF2RAVE-Glide, to conduct computational model-based small-molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different mammalian protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here, we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of known type II kinase inhibitors with more than 50% success rates across docking calculations. We believe the protocol should be deployable for other kinases and more proteins generally.
Collapse
Affiliation(s)
- Xinyu Gu
- Institute for Physical Science and Technology, University of MarylandCollege ParkUnited States
- University of Maryland Institute for Health ComputingBethesdaUnited States
| | - Akashnathan Aranganathan
- Institute for Physical Science and Technology, University of MarylandCollege ParkUnited States
- Biophysics Program, University of MarylandCollege ParkUnited States
| | - Pratyush Tiwary
- Institute for Physical Science and Technology, University of MarylandCollege ParkUnited States
- University of Maryland Institute for Health ComputingBethesdaUnited States
- Department of Chemistry and Biochemistry, University of MarylandCollege ParkUnited States
| |
Collapse
|
4
|
Gizzio J, Thakur A, Haldane A, Post CB, Levy RM. Evolutionary sequence and structural basis for the distinct conformational landscapes of Tyr and Ser/Thr kinases. Nat Commun 2024; 15:6545. [PMID: 39095350 PMCID: PMC11297160 DOI: 10.1038/s41467-024-50812-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2024] [Accepted: 07/22/2024] [Indexed: 08/04/2024] Open
Abstract
Protein kinases are molecular machines with rich sequence variation that distinguishes the two main evolutionary branches - tyrosine kinases (TKs) from serine/threonine kinases (STKs). Using a sequence co-variation Potts statistical energy model we previously concluded that TK catalytic domains are more likely than STKs to adopt an inactive conformation with the activation loop in an autoinhibitory folded conformation, due to intrinsic sequence effects. Here we investigate the structural basis for this phenomenon by integrating the sequence-based model with structure-based molecular dynamics (MD) to determine the effects of mutations on the free energy difference between active and inactive conformations, using a thermodynamic cycle involving many (n = 108) protein-mutation free energy perturbation (FEP) simulations in the active and inactive conformations. The sequence and structure-based results are consistent and support the hypothesis that the inactive conformation DFG-out Activation Loop Folded, is a functional regulatory state that has been stabilized in TKs relative to STKs over the course of their evolution via the accumulation of residue substitutions in the activation loop and catalytic loop that facilitate distinct substrate binding modes in trans and additional modes of regulation in cis for TKs.
Collapse
Affiliation(s)
- Joan Gizzio
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Chemistry, Temple University, Philadelphia, PA, USA
| | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA
- Department of Physics, Temple University, Philadelphia, PA, USA
| | - Carol Beth Post
- Borch Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, IN, USA
| | - Ronald M Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, PA, USA.
- Department of Chemistry, Temple University, Philadelphia, PA, USA.
| |
Collapse
|
5
|
Gu X, Aranganathan A, Tiwary P. Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE. ARXIV 2024:arXiv:2404.07102v3. [PMID: 38659642 PMCID: PMC11042445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]
Abstract
Small molecule drug design hinges on obtaining co-crystallized ligand-protein structures. Despite AlphaFold2's strides in protein native structure prediction, its focus on apo structures overlooks ligands and associated holo structures. Moreover, designing selective drugs often benefits from the targeting of diverse metastable conformations. Therefore, direct application of AlphaFold2 models in virtual screening and drug discovery remains tentative. Here, we demonstrate an AlphaFold2 based framework combined with all-atom enhanced sampling molecular dynamics and induced fit docking, named AF2RAVE-Glide, to conduct computational model based small molecule binding of metastable protein kinase conformations, initiated from protein sequences. We demonstrate the AF2RAVE-Glide workflow on three different protein kinases and their type I and II inhibitors, with special emphasis on binding of known type II kinase inhibitors which target the metastable classical DFG-out state. These states are not easy to sample from AlphaFold2. Here we demonstrate how with AF2RAVE these metastable conformations can be sampled for different kinases with high enough accuracy to enable subsequent docking of known type II kinase inhibitors with more than 50% success rates across docking calculations. We believe the protocol should be deployable for other kinases and more proteins generally.
Collapse
Affiliation(s)
- Xinyu Gu
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- University of Maryland Institute for Health Computing, Bethesda, United States
| | - Akashnathan Aranganathan
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- Biophysics Program, University of Maryland, College Park 20742, USA
| | - Pratyush Tiwary
- Institute for Physical Science and Technology, University of Maryland, College Park, Maryland 20742, USA
- Department of Chemistry and Biochemistry, University of Maryland, College Park 20742, USA
- University of Maryland Institute for Health Computing, Bethesda, United States
| |
Collapse
|
6
|
Gizzio J, Thakur A, Haldane A, Levy RM. Evolutionary sequence and structural basis for the distinct conformational landscapes of Tyr and Ser/Thr kinases. RESEARCH SQUARE 2024:rs.3.rs-4048991. [PMID: 38746330 PMCID: PMC11092858 DOI: 10.21203/rs.3.rs-4048991/v1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Protein kinases are molecular machines with rich sequence variation that distinguishes the two main evolutionary branches - tyrosine kinases (TKs) from serine/threonine kinases (STKs). Using a sequence co-variation Potts statistical energy model we previously concluded that TK catalytic domains are more likely than STKs to adopt an inactive conformation with the activation loop in an autoinhibitory "folded" conformation, due to intrinsic sequence effects. Here we investigated the structural basis for this phenomenon by integrating the sequence-based model with structure-based molecular dynamics (MD) to determine the effects of mutations on the free energy difference between active and inactive conformations, using a novel thermodynamic cycle involving many (n=108) protein-mutation free energy perturbation (FEP) simulations in the active and inactive conformations. The sequence and structure-based results are consistent and support the hypothesis that the inactive conformation "DFG-out Activation Loop Folded", is a functional regulatory state that has been stabilized in TKs relative to STKs over the course of their evolution via the accumulation of residue substitutions in the activation loop and catalytic loop that facilitate distinct substrate binding modes in trans and additional modes of regulation in cis for TKs.
Collapse
Affiliation(s)
- Joan Gizzio
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| | - Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Physics, Temple University, Philadelphia, Pennsylvania 19122
| | - Ronald M. Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| |
Collapse
|
7
|
Gizzio J, Thakur A, Haldane A, Post CB, Levy RM. Evolutionary sequence and structural basis for the distinct conformational landscapes of Tyr and Ser/Thr kinases. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.03.08.584161. [PMID: 38559238 PMCID: PMC10979876 DOI: 10.1101/2024.03.08.584161] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/04/2024]
Abstract
Protein kinases are molecular machines with rich sequence variation that distinguishes the two main evolutionary branches - tyrosine kinases (TKs) from serine/threonine kinases (STKs). Using a sequence co-variation Potts statistical energy model we previously concluded that TK catalytic domains are more likely than STKs to adopt an inactive conformation with the activation loop in an autoinhibitory "folded" conformation, due to intrinsic sequence effects. Here we investigated the structural basis for this phenomenon by integrating the sequence-based model with structure-based molecular dynamics (MD) to determine the effects of mutations on the free energy difference between active and inactive conformations, using a novel thermodynamic cycle involving many (n=108) protein-mutation free energy perturbation (FEP) simulations in the active and inactive conformations. The sequence and structure-based results are consistent and support the hypothesis that the inactive conformation "DFG-out Activation Loop Folded", is a functional regulatory state that has been stabilized in TKs relative to STKs over the course of their evolution via the accumulation of residue substitutions in the activation loop and catalytic loop that facilitate distinct substrate binding modes in trans and additional modes of regulation in cis for TKs.
Collapse
Affiliation(s)
- Joan Gizzio
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| | - Abhishek Thakur
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| | - Allan Haldane
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Physics, Temple University, Philadelphia, Pennsylvania 19122
| | - Carol Beth Post
- Borch Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University, West Lafayette, Indiana 47907
| | - Ronald M. Levy
- Center for Biophysics and Computational Biology, Temple University, Philadelphia, Pennsylvania 19122
- Department of Chemistry, Temple University, Philadelphia, Pennsylvania 19122
| |
Collapse
|