Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Vani BP, Aranganathan A, Wang D, Tiwary P. AlphaFold2-RAVE: From Sequence to Boltzmann Ranking. J Chem Theory Comput 2023;19:4351-4354. [PMID: 37171364 PMCID: PMC10524496 DOI: 10.1021/acs.jctc.3c00290] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]

For:	Vani BP, Aranganathan A, Wang D, Tiwary P. AlphaFold2-RAVE: From Sequence to Boltzmann Ranking. J Chem Theory Comput 2023;19:4351-4354. [PMID: 37171364 PMCID: PMC10524496 DOI: 10.1021/acs.jctc.3c00290] [Citation(s) in RCA: 50] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/13/2023]

Number

Cited by Other Article(s)

Lazar T, Connor A, DeLisle CF, Burger V, Tompa P. Targeting protein disorder: the next hurdle in drug discovery. Nat Rev Drug Discov 2025:10.1038/s41573-025-01220-6. [PMID: 40490488 DOI: 10.1038/s41573-025-01220-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/08/2025] [Indexed: 06/11/2025]

Sun Q, Wang H, Xie J, Wang L, Mu J, Li J, Ren Y, Lai L. Computer-Aided Drug Discovery for Undruggable Targets. Chem Rev 2025. [PMID: 40423592 DOI: 10.1021/acs.chemrev.4c00969] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/28/2025]

Karatzas P, Brotzakis ZF, Sarimveis H. Small Molecules Targeting the Structural Dynamics of AR-V7 Partially Disordered Proteins Using Deep Ensemble Docking. J Chem Theory Comput 2025;21:4898-4909. [PMID: 40231860 PMCID: PMC12080126 DOI: 10.1021/acs.jctc.5c00171] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2025] [Revised: 04/05/2025] [Accepted: 04/07/2025] [Indexed: 04/16/2025]

Wankowicz SA, Fraser JS. Advances in uncovering the mechanisms of macromolecular conformational entropy. Nat Chem Biol 2025;21:623-634. [PMID: 40275100 PMCID: PMC12103944 DOI: 10.1038/s41589-025-01879-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2024] [Accepted: 03/10/2025] [Indexed: 04/26/2025]

Vargas-Rosales PA, Caflisch A. The physics-AI dialogue in drug design. RSC Med Chem 2025;16:1499-1515. [PMID: 39906313 PMCID: PMC11788922 DOI: 10.1039/d4md00869c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2024] [Accepted: 01/16/2025] [Indexed: 02/06/2025] Open

Sil S, Datta I, Basu S. Use of AI-methods over MD simulations in the sampling of conformational ensembles in IDPs. Front Mol Biosci 2025;12:1542267. [PMID: 40264953 PMCID: PMC12011600 DOI: 10.3389/fmolb.2025.1542267] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2024] [Accepted: 03/17/2025] [Indexed: 04/24/2025] Open

Kalakoti Y, Sanjeev A, Wallner B. Prediction of structural variation. Curr Opin Struct Biol 2025;91:103003. [PMID: 39983409 DOI: 10.1016/j.sbi.2025.103003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2024] [Revised: 01/15/2025] [Accepted: 01/26/2025] [Indexed: 02/23/2025]

Aranganathan A, Gu X, Wang D, Vani BP, Tiwary P. Modeling Boltzmann-weighted structural ensembles of proteins using artificial intelligence-based methods. Curr Opin Struct Biol 2025;91:103000. [PMID: 39923288 PMCID: PMC12011212 DOI: 10.1016/j.sbi.2025.103000] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2024] [Revised: 01/09/2025] [Accepted: 01/20/2025] [Indexed: 02/11/2025]

Schafer JW, Porter LL. AlphaFold2's training set powers its predictions of some fold-switched conformations. Protein Sci 2025;34:e70105. [PMID: 40130805 PMCID: PMC11934219 DOI: 10.1002/pro.70105] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2024] [Revised: 02/04/2025] [Accepted: 03/07/2025] [Indexed: 03/26/2025]

Abstract

AlphaFold2 (AF2), a deep-learning-based model that predicts protein structures from their amino acid sequences, has recently been used to predict multiple protein conformations. In some cases, AF2 has successfully predicted both dominant and alternative conformations of fold-switching proteins, which remodel their secondary and/or tertiary structures in response to cellular stimuli. Whether AF2 has learned enough protein folding principles to reliably predict alternative conformations outside of its training set is unclear. Previous work suggests that AF2 predicted these alternative conformations by memorizing them during training. Here, we use CFold-an implementation of the AF2 network trained on a more limited subset of experimentally determined protein structures-to directly test how well the AF2 architecture predicts alternative conformations of fold switchers outside of its training set. We tested CFold on eight fold switchers from six protein families. These proteins-whose secondary structures switch between α-helix and β-sheet and/or whose hydrogen bonding networks are reconfigured dramatically-had not been tested previously, and only one of their alternative conformations was in CFold's training set. Successful CFold predictions would indicate that the AF2 architecture can predict disparate alternative conformations of fold-switched conformations outside of its training set, while unsuccessful predictions would suggest that AF2 predictions of these alternative conformations likely arise from association with structures learned during training. Despite sampling 1300-4300 structures/protein with various sequence sampling techniques, CFold predicted only one alternative structure outside of its training set accurately and with high confidence while also generating experimentally inconsistent structures with higher confidence. Though these results indicate that AF2's current success in predicting alternative conformations of fold switchers stems largely from its training data, results from a sequence pruning technique suggest developments that could lead to a more reliable generative model in the future.

Collapse

Janson G, Jussupow A, Feig M. Deep generative modeling of temperature-dependent structural ensembles of proteins. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2025:2025.03.09.642148. [PMID: 40161645 PMCID: PMC11952339 DOI: 10.1101/2025.03.09.642148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/02/2025]

Brotzakis ZF, Zhang S, Murtada MH, Vendruscolo M. AlphaFold prediction of structural ensembles of disordered proteins. Nat Commun 2025;16:1632. [PMID: 39952928 PMCID: PMC11829000 DOI: 10.1038/s41467-025-56572-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/06/2023] [Accepted: 01/23/2025] [Indexed: 02/17/2025] Open

Bemelmans MP, Cournia Z, Damm-Ganamet KL, Gervasio FL, Pande V. Computational advances in discovering cryptic pockets for drug discovery. Curr Opin Struct Biol 2025;90:102975. [PMID: 39778412 DOI: 10.1016/j.sbi.2024.102975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2024] [Revised: 11/27/2024] [Accepted: 12/06/2024] [Indexed: 01/11/2025]

Montserrat-Canals M, Cordara G, Krengel U. Allostery. Q Rev Biophys 2025;58:e5. [PMID: 39849666 DOI: 10.1017/s0033583524000209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2025]

van Aalst EJ, Wylie BJ. An in silico framework to visualize how cancer-associated mutations influence structural plasticity of the chemokine receptor CCR3. Protein Sci 2025;34:e70013. [PMID: 39723881 DOI: 10.1002/pro.70013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2024] [Revised: 11/06/2024] [Accepted: 12/12/2024] [Indexed: 12/28/2024]

Abstract

G protein Coupled Receptors (GPCRs) are the largest family of cell surface receptors in humans. Somatic mutations in GPCRs are implicated in cancer progression and metastasis, but mechanisms are poorly understood. Emerging evidence implicates perturbation of intra-receptor activation pathway motifs whereby extracellular signals are transmitted intracellularly. Recently, sufficiently sensitive methodology was described to calculate structural strain as a function of missense mutations in AlphaFold-predicted model structures, which was extensively validated on experimental and predicted structural datasets. When paired with Molecular Dynamics (MD) simulations, these tools provide a facile approach to screen mutations in silico. We applied this framework to calculate the structural and dynamic effects of cancer-associated mutations in the chemokine receptor CCR3, a Class A GPCR involved in cancer and autoimmune disorders. Residue-residue contact scoring refined effective strain results, highlighting significant remodeling of inter- and intra-motif contacts along the highly conserved GPCR activation pathway network. We then integrated AlphaFold-derived predicted Local Distance Difference Test scores with per-residue Root Mean Square Fluctuations and activation pathway Contact Analysis (CONAN) from coarse grain MD simulations to identify statistically significant changes in receptor dynamics upon mutation. Finally, analysis of negative control mutants suggests false positive results in AlphaFold pipelines should be considered but can be mitigated with stricter control of statistical analysis. Our results indicate selected mutants influence structural plasticity of CCR3 related to ligand interaction, activation, and G protein coupling, using a framework that could be applicable to a wide range of biochemically relevant protein targets following further validation.

Collapse

O'Donnell TJ, Kanduri C, Isacchini G, Limenitakis JP, Brachman RA, Alvarez RA, Haff IH, Sandve GK, Greiff V. Reading the repertoire: Progress in adaptive immune receptor analysis using machine learning. Cell Syst 2024;15:1168-1189. [PMID: 39701034 DOI: 10.1016/j.cels.2024.11.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Revised: 08/16/2024] [Accepted: 11/14/2024] [Indexed: 12/21/2024]

Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning-Based Information Bottleneck. J Chem Theory Comput 2024;20:10371-10383. [PMID: 39589127 DOI: 10.1021/acs.jctc.4c00919] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2024]

Harding-Larsen D, Funk J, Madsen NG, Gharabli H, Acevedo-Rocha CG, Mazurenko S, Welner DH. Protein representations: Encoding biological information for machine learning in biocatalysis. Biotechnol Adv 2024;77:108459. [PMID: 39366493 DOI: 10.1016/j.biotechadv.2024.108459] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/18/2024] [Revised: 09/19/2024] [Accepted: 09/29/2024] [Indexed: 10/06/2024]

Abstract

Enzymes offer a more environmentally friendly and low-impact solution to conventional chemistry, but they often require additional engineering for their application in industrial settings, an endeavour that is challenging and laborious. To address this issue, the power of machine learning can be harnessed to produce predictive models that enable the in silico study and engineering of improved enzymatic properties. Such machine learning models, however, require the conversion of the complex biological information to a numerical input, also called protein representations. These inputs demand special attention to ensure the training of accurate and precise models, and, in this review, we therefore examine the critical step of encoding protein information to numeric representations for use in machine learning. We selected the most important approaches for encoding the three distinct biological protein representations - primary sequence, 3D structure, and dynamics - to explore their requirements for employment and inductive biases. Combined representations of proteins and substrates are also introduced as emergent tools in biocatalysis. We propose the division of fixed representations, a collection of rule-based encoding strategies, and learned representations extracted from the latent spaces of large neural networks. To select the most suitable protein representation, we propose two main factors to consider. The first one is the model setup, which is influenced by the size of the training dataset and the choice of architecture. The second factor is the model objectives such as consideration about the assayed property, the difference between wild-type models and mutant predictors, and requirements for explainability. This review is aimed at serving as a source of information and guidance for properly representing enzymes in future machine learning models for biocatalysis.

Collapse

Wang D, Tiwary P. Augmenting Human Expertise in Weighted Ensemble Simulations through Deep Learning based Information Bottleneck. ARXIV 2024:arXiv:2406.14839v2. [PMID: 38947925 PMCID: PMC11213147] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]

Riccabona JR, Spoendlin FC, Fischer ALM, Loeffler JR, Quoika PK, Jenkins TP, Ferguson JA, Smorodina E, Laustsen AH, Greiff V, Forli S, Ward AB, Deane CM, Fernández-Quintero ML. Assessing AF2's ability to predict structural ensembles of proteins. Structure 2024;32:2147-2159.e2. [PMID: 39332396 DOI: 10.1016/j.str.2024.09.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2024] [Revised: 08/07/2024] [Accepted: 09/02/2024] [Indexed: 09/29/2024]

Affiliation(s)

Jakob R Riccabona Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria
Fabian C Spoendlin Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK
Anna-Lena M Fischer Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria
Johannes R Loeffler Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
Patrick K Quoika Center for Functional Protein Assemblies, Technical University of Munich, Ernst-Otto-Fischer-Str. 8, 85748 Garching, Germany
Timothy P Jenkins Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
James A Ferguson Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
Eva Smorodina Department of Immunology, University of Oslo, Oslo, Norway
Andreas H Laustsen Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark
Victor Greiff Department of Immunology, University of Oslo, Oslo, Norway
Stefano Forli Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
Andrew B Ward Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.
Charlotte M Deane Oxford Protein Informatics Group, Department of Statistics, University of Oxford, Oxford OX1 3LB, UK.
Monica L Fernández-Quintero Center for Molecular Biosciences Innsbruck, Department of General, Inorganic and Theoretical Chemistry, University of Innsbruck, Innsbruck, Austria; Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA; Department of Biotechnology and Biomedicine, Technical University of Denmark, DK-2800 Kongens Lyngby, Denmark.

Collapse

Georgouli K, Stephany RR, Tempkin JOB, Santiago C, Aydin F, Heimann MA, Pottier L, Zhang X, Carpenter TS, Hsu T, Nissley DV, Streitz FH, Lightstone FC, Ingolfsson HI, Bremer PT. Generating Protein Structures for Pathway Discovery Using Deep Learning. J Chem Theory Comput 2024;20:8795-8806. [PMID: 39388723 PMCID: PMC11500303 DOI: 10.1021/acs.jctc.4c00816] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Revised: 09/27/2024] [Accepted: 09/30/2024] [Indexed: 10/12/2024]

Affiliation(s)

Konstantia Georgouli Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Robert R. Stephany Center for Applied Mathematics, Cornell University, Ithaca 14853, New York, United States
Jeremy O. B. Tempkin Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Claudio Santiago Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Fikret Aydin Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Mark A. Heimann Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Loïc Pottier Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Xiaohua Zhang Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Timothy S. Carpenter Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Tim Hsu Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Dwight V. Nissley RAS Initiative, The Cancer Research Technology Program, Frederick National Laboratory, Frederick 21701, Maryland, United States
Frederick H. Streitz Computing Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Felice C. Lightstone Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Helgi I. Ingolfsson Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore 94550, California, United States
Peer-Timo Bremer Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore 94550, California, United States

Collapse

Schafer JW, Porter LL. AlphaFold2's training set powers its predictions of fold-switched conformations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.10.11.617857. [PMID: 39803493 PMCID: PMC11722258 DOI: 10.1101/2024.10.11.617857] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/23/2025]

Benavides TL, Montelione GT. Integrative Modeling of Protein-Polypeptide Complexes by Bayesian Model Selection using AlphaFold and NMR Chemical Shift Perturbation Data. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.09.19.613999. [PMID: 39345459 PMCID: PMC11430059 DOI: 10.1101/2024.09.19.613999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 10/01/2024]

Gu X, Aranganathan A, Tiwary P. Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE. eLife 2024;13:RP99702. [PMID: 39240197 PMCID: PMC11379456 DOI: 10.7554/elife.99702] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/07/2024] Open

Liu ZH, Tsanai M, Zhang O, Forman-Kay J, Head-Gordon T. Computational Methods to Investigate Intrinsically Disordered Proteins and their Complexes. ARXIV 2024:arXiv:2409.02240v1. [PMID: 39279844 PMCID: PMC11398552] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 09/18/2024]

Vats S, Bobrovs R, Söderhjelm P, Bhakat S. AlphaFold-SFA: Accelerated sampling of cryptic pocket opening, protein-ligand binding and allostery by AlphaFold, slow feature analysis and metadynamics. PLoS One 2024;19:e0307226. [PMID: 39190764 DOI: 10.1371/journal.pone.0307226] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2024] [Accepted: 07/02/2024] [Indexed: 08/29/2024] Open

Abstract

Sampling rare events in proteins is crucial for comprehending complex phenomena like cryptic pocket opening, where transient structural changes expose new binding sites. Understanding these rare events also sheds light on protein-ligand binding and allosteric communications, where distant site interactions influence protein function. Traditional unbiased molecular dynamics simulations often fail to sample such rare events, as the free energy barrier between metastable states is large relative to the thermal energy. This renders these events inaccessible on the timescales typically simulated by unbiased molecular dynamics, limiting our understanding of these critical processes. In this paper, we proposed a novel unsupervised learning approach termed as slow feature analysis (SFA) which aims to extract slowly varying features from high-dimensional temporal data. SFA trained on small unbiased molecular dynamics simulations launched from AlphaFold generated conformational ensembles manages to capture rare events governing cryptic pocket opening, protein-ligand binding, and allosteric communications in a kinase. Metadynamics simulations using SFA as collective variables manage to sample 'deep' cryptic pocket opening within a few hundreds of nanoseconds which was beyond the reach of microsecond long unbiased molecular dynamics simulations. SFA augmented metadynamics also managed to capture conformational plasticity of protein upon ligand binding/unbinding and provided novel insights into allosteric communication in receptor-interacting protein kinase 2 (RIPK2) which dictates protein-protein interaction. Taken together, our results show how SFA acts as a dimensionality reduction tool which bridges the gap between AlphaFold, molecular dynamics simulation and metadynamics in context of capturing rare events in biomolecules, extending the scope of structure-based drug discovery in the era of AlphaFold.

Collapse

Zhou J, Huang M. Navigating the landscape of enzyme design: from molecular simulations to machine learning. Chem Soc Rev 2024;53:8202-8239. [PMID: 38990263 DOI: 10.1039/d4cs00196f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/12/2024]

Bowman GR. AlphaFold and Protein Folding: Not Dead Yet! The Frontier Is Conformational Ensembles. Annu Rev Biomed Data Sci 2024;7:51-57. [PMID: 38603560 PMCID: PMC11892350 DOI: 10.1146/annurev-biodatasci-102423-011435] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/13/2024]

Frasnetti E, Magni A, Castelli M, Serapian SA, Moroni E, Colombo G. Structures, dynamics, complexes, and functions: From classic computation to artificial intelligence. Curr Opin Struct Biol 2024;87:102835. [PMID: 38744148 DOI: 10.1016/j.sbi.2024.102835] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2024] [Revised: 04/14/2024] [Accepted: 04/22/2024] [Indexed: 05/16/2024]

Biriukov D, Vácha R. Pathways to a Shiny Future: Building the Foundation for Computational Physical Chemistry and Biophysics in 2050. ACS PHYSICAL CHEMISTRY AU 2024;4:302-313. [PMID: 39069976 PMCID: PMC11274290 DOI: 10.1021/acsphyschemau.4c00003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/07/2024] [Revised: 03/15/2024] [Accepted: 03/18/2024] [Indexed: 07/30/2024]

Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times through State Predictive Information Bottleneck Based Enhanced Sampling. J Chem Theory Comput 2024;20:6341-6349. [PMID: 38991145 PMCID: PMC11990086 DOI: 10.1021/acs.jctc.4c00503] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/13/2024]

Gu X, Aranganathan A, Tiwary P. Empowering AlphaFold2 for protein conformation selective drug discovery with AlphaFold2-RAVE. ARXIV 2024:arXiv:2404.07102v3. [PMID: 38659642 PMCID: PMC11042445] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]

Wang D, Qiu Y, Beyerle ER, Huang X, Tiwary P. Information Bottleneck Approach for Markov Model Construction. J Chem Theory Comput 2024;20:5352-5367. [PMID: 38859575 PMCID: PMC11199095 DOI: 10.1021/acs.jctc.4c00449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/12/2024]

Abstract

Markov state models (MSMs) have proven valuable in studying the dynamics of protein conformational changes via statistical analysis of molecular dynamics simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time necessitates defining states that circumvent significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process effectively coarse-grains time and space, integrating out rapid motions within metastable states. Thus, MSMs possess a multiresolution nature, where the granularity of states can be adjusted according to the time-resolution, offering flexibility in capturing system dynamics. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), a framework that unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of the VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multiresolution Markovian models. Through applications to well-validated mini-proteins, SPIB showcases unique advantages compared to competing methods. It autonomously and self-consistently adjusts the number of metastable states based on a specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. This contrasts with existing VAMP-based methods, which often emphasize slow dynamics at the expense of incorporating numerous sparsely populated states. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. With these benefits, we propose SPIB as an easy-to-implement methodology for end-to-end MSM construction.

Collapse

Wang D, Qiu Y, Beyerle ER, Huang X, Tiwary P. An Information Bottleneck Approach for Markov Model Construction. ARXIV 2024:arXiv:2404.02856v2. [PMID: 38947932 PMCID: PMC11213129] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 07/02/2024]

Abstract

Markov state models (MSMs) have proven valuable in studying dynamics of protein conformational changes via statistical analysis of molecular dynamics (MD) simulations. In MSMs, the complex configuration space is coarse-grained into conformational states, with dynamics modeled by a series of Markovian transitions among these states at discrete lag times. Constructing the Markovian model at a specific lag time necessitates defining states that circumvent significant internal energy barriers, enabling internal dynamics relaxation within the lag time. This process effectively coarse-grains time and space, integrating out rapid motions within metastable states. Thus, MSMs possess a multi-resolution nature, where the granularity of states can be adjusted according to the time-resolution, offering flexibility in capturing system dynamics. This work introduces a continuous embedding approach for molecular conformations using the state predictive information bottleneck (SPIB), a framework that unifies dimensionality reduction and state space partitioning via a continuous, machine learned basis set. Without explicit optimization of the VAMP-based scores, SPIB demonstrates state-of-the-art performance in identifying slow dynamical processes and constructing predictive multi-resolution Markovian models. Through applications to well-validated mini-proteins, SPIB showcases unique advantages compared to competing methods. It autonomously and self-consistently adjusts the number of metastable states based on specified minimal time resolution, eliminating the need for manual tuning. While maintaining efficacy in dynamical properties, SPIB excels in accurately distinguishing metastable states and capturing numerous well-populated macrostates. This contrasts with existing VAMP-based methods, which often emphasize slow dynamics at the expense of incorporating numerous sparsely populated states. Furthermore, SPIB's ability to learn a low-dimensional continuous embedding of the underlying MSMs enhances the interpretation of dynamic pathways. With these benefits, we propose SPIB as an easy-to-implement methodology for end-to-end MSMs construction.

Collapse

Mehdi S, Smith Z, Herron L, Zou Z, Tiwary P. Enhanced Sampling with Machine Learning. Annu Rev Phys Chem 2024;75:347-370. [PMID: 38382572 PMCID: PMC11213683 DOI: 10.1146/annurev-physchem-083122-125941] [Citation(s) in RCA: 39] [Impact Index Per Article: 39.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]

Lee S, Wang D, Seeliger MA, Tiwary P. Calculating Protein-Ligand Residence Times Through State Predictive Information Bottleneck based Enhanced Sampling. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.04.16.589710. [PMID: 38659748 PMCID: PMC11042289 DOI: 10.1101/2024.04.16.589710] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 04/26/2024]

Smith Z, Strobel M, Vani BP, Tiwary P. Graph Attention Site Prediction (GrASP): Identifying Druggable Binding Sites Using Graph Neural Networks with Attention. J Chem Inf Model 2024;64:2637-2644. [PMID: 38453912 PMCID: PMC11182664 DOI: 10.1021/acs.jcim.3c01698] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/09/2024]

Vani BP, Aranganathan A, Tiwary P. Exploring Kinase Asp-Phe-Gly (DFG) Loop Conformational Stability with AlphaFold2-RAVE. J Chem Inf Model 2024;64:2789-2797. [PMID: 37981824 PMCID: PMC11001530 DOI: 10.1021/acs.jcim.3c01436] [Citation(s) in RCA: 22] [Impact Index Per Article: 22.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2023]

Müllender L, Rizzi A, Parrinello M, Carloni P, Mandelli D. Effective data-driven collective variables for free energy calculations from metadynamics of paths. PNAS NEXUS 2024;3:pgae159. [PMID: 38665160 PMCID: PMC11044970 DOI: 10.1093/pnasnexus/pgae159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2023] [Accepted: 04/04/2024] [Indexed: 04/28/2024]

Monteiro da Silva G, Cui JY, Dalgarno DC, Lisi GP, Rubenstein BM. High-throughput prediction of protein conformational distributions with subsampled AlphaFold2. Nat Commun 2024;15:2464. [PMID: 38538622 PMCID: PMC10973385 DOI: 10.1038/s41467-024-46715-9] [Citation(s) in RCA: 56] [Impact Index Per Article: 56.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 02/28/2024] [Indexed: 04/12/2024] Open

Lotthammer JM, Ginell GM, Griffith D, Emenecker RJ, Holehouse AS. Direct prediction of intrinsically disordered protein conformational properties from sequence. Nat Methods 2024;21:465-476. [PMID: 38297184 PMCID: PMC10927563 DOI: 10.1038/s41592-023-02159-5] [Citation(s) in RCA: 66] [Impact Index Per Article: 66.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2023] [Accepted: 12/20/2023] [Indexed: 02/02/2024]

Meller A, Kelly D, Smith LG, Bowman GR. Toward physics-based precision medicine: Exploiting protein dynamics to design new therapeutics and interpret variants. Protein Sci 2024;33:e4902. [PMID: 38358129 PMCID: PMC10868452 DOI: 10.1002/pro.4902] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Revised: 12/01/2023] [Accepted: 01/04/2024] [Indexed: 02/16/2024]

Brown BP, Stein RA, Meiler J, Mchaourab HS. Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations. J Chem Theory Comput 2024;20:1434-1447. [PMID: 38215214 PMCID: PMC10867840 DOI: 10.1021/acs.jctc.3c01081] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2023] [Revised: 12/13/2023] [Accepted: 12/13/2023] [Indexed: 01/14/2024]

Miller EB, Hwang H, Shelley M, Placzek A, Rodrigues JPGLM, Suto RK, Wang L, Akinsanya K, Abel R. Enabling structure-based drug discovery utilizing predicted models. Cell 2024;187:521-525. [PMID: 38306979 DOI: 10.1016/j.cell.2023.12.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2023] [Revised: 12/28/2023] [Accepted: 12/29/2023] [Indexed: 02/04/2024]

Hoff SE, Zinke M, Izadi-Pruneyre N, Bonomi M. Bonds and bytes: The odyssey of structural biology. Curr Opin Struct Biol 2024;84:102746. [PMID: 38101027 DOI: 10.1016/j.sbi.2023.102746] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2023] [Revised: 11/20/2023] [Accepted: 11/24/2023] [Indexed: 12/17/2023]

Kobayashi H. Potential for artificial intelligence in medicine and its application to male infertility. Reprod Med Biol 2024;23:e12590. [PMID: 38948339 PMCID: PMC11211808 DOI: 10.1002/rmb2.12590] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/15/2024] [Accepted: 05/27/2024] [Indexed: 07/02/2024] Open

Kleiman DE, Nadeem H, Shukla D. Adaptive Sampling Methods for Molecular Dynamics in the Era of Machine Learning. J Phys Chem B 2023;127:10669-10681. [PMID: 38081185 DOI: 10.1021/acs.jpcb.3c04843] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2023]

Day EC, Chittari SS, Bogen MP, Knight AS. Navigating the Expansive Landscapes of Soft Materials: A User Guide for High-Throughput Workflows. ACS POLYMERS AU 2023;3:406-427. [PMID: 38107416 PMCID: PMC10722570 DOI: 10.1021/acspolymersau.3c00025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 11/02/2023] [Accepted: 11/07/2023] [Indexed: 12/19/2023]

Ramelot TA, Tejero R, Montelione GT. Representing structures of the multiple conformational states of proteins. Curr Opin Struct Biol 2023;83:102703. [PMID: 37776602 PMCID: PMC10841472 DOI: 10.1016/j.sbi.2023.102703] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/04/2023] [Revised: 08/18/2023] [Accepted: 08/23/2023] [Indexed: 10/02/2023]

Ahmed M, Maldonado AM, Durrant JD. From Byte to Bench to Bedside: Molecular Dynamics Simulations and Drug Discovery. ARXIV 2023:arXiv:2311.16946v1. [PMID: 38076508 PMCID: PMC10705576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 12/19/2023]

Sala D, Engelberger F, Mchaourab HS, Meiler J. Modeling conformational states of proteins with AlphaFold. Curr Opin Struct Biol 2023;81:102645. [PMID: 37392556 DOI: 10.1016/j.sbi.2023.102645] [Citation(s) in RCA: 76] [Impact Index Per Article: 38.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2023] [Revised: 05/16/2023] [Accepted: 06/01/2023] [Indexed: 07/03/2023]