1
|
Gu J, Agarwal PK, Bonomo RA, Haider S. Evolutionary Dynamics and Functional Differences in Clinically Relevant Pen β-Lactamases from Burkholderia spp. J Chem Inf Model 2025; 65:5086-5098. [PMID: 40314617 PMCID: PMC12117567 DOI: 10.1021/acs.jcim.5c00271] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2025] [Revised: 04/22/2025] [Accepted: 04/28/2025] [Indexed: 05/03/2025]
Abstract
Antimicrobial resistance (AMR) is a global threat, with Burkholderia species contributing significantly to difficult-to-treat infections. The Pen family of β-lactamases are produced by all Burkholderia spp., and their mutation or overproduction leads to the resistance of β-lactam antibiotics. Here we investigate the dynamic differences among four Pen β-lactamases (PenA, PenI, PenL and PenP) using machine learning driven enhanced sampling molecular dynamics simulations, Markov State Models (MSMs), convolutional variational autoencoder-based deep learning (CVAE) and the BindSiteS-CNN model. In spite of sharing the same catalytic mechanisms, these enzymes exhibit distinct dynamic features due to low sequence identity, resulting in different substrate profiles and catalytic turnover. The BindSiteS-CNN model further reveals local active site dynamics, offering insights into the Pen β-lactamase evolutionary adaptation. Our findings reported here identify critical mutations and propose new hot spots affecting Pen β-lactamase flexibility and function, which can be used to fight emerging resistance in these enzymes.
Collapse
Affiliation(s)
- Jing Gu
- UCL
School of Pharmacy, University College London, LondonWC1N 1AX, U.K.
| | - Pratul K. Agarwal
- High-Performance
Computing Center, Oklahoma State University, Stillwater, Oklahoma74078-1010, United
States
| | - Robert A. Bonomo
- Research
Service, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio44106, United States
- Department
of Molecular Biology and Microbiology, Case
Western Reserve University School of Medicine, Cleveland, Ohio44106, United States
- Department
of Medicine, Case Western Reserve University
School of Medicine, Cleveland, Ohio44106, United States
- Clinician
Scientist Investigator, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio44106, United States
- Departments
of Pharmacology, Biochemistry, and Proteomics and Bioinformatics, CaseWestern Reserve University School of Medicine,Cleveland, Ohio44106, United States
- CWRU-Cleveland
VAMC Centerfor Antimicrobial Resistance and Epidemiology (Case VA
CARES), Cleveland, Ohio44106, United States
| | - Shozeb Haider
- UCL
School of Pharmacy, University College London, LondonWC1N 1AX, U.K.
- University
of Tabuk (PFSCBR), Tabuk47512, Saudi Arabia
- UCL
Center for Advanced Research Computing, University College London, LondonWC1H 9RL, U.K.
| |
Collapse
|
2
|
Xia R, Li W, Cheng Y, Xie L, Xu X. Molecular surfaces modeling: Advancements in deep learning for molecular interactions and predictions. Biochem Biophys Res Commun 2025; 763:151799. [PMID: 40239539 DOI: 10.1016/j.bbrc.2025.151799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2025] [Revised: 03/20/2025] [Accepted: 04/10/2025] [Indexed: 04/18/2025]
Abstract
Molecular surface analysis can provide a high-dimensional, rich representation of molecular properties and interactions, which is crucial for enabling powerful predictive modeling and rational molecular design across diverse scientific and technological domains. With remarkable successes achieved by artificial intelligence (AI) in different fields such as computer vision and natural language processing, there is a growing imperative to harness AI's potential in accelerating molecular discovery and innovation. The integration of AI techniques with molecular surface analysis has opened up new frontiers, allowing researchers to uncover hidden patterns, relationships, and design principles that were previously elusive. By leveraging the complementary strengths of molecular surface representations and advanced AI algorithms, scientists can now explore chemical space more efficiently, optimize molecular properties with greater precision, and drive transformative advancements in areas like drug development, materials engineering, and catalysis. In this review, we aim to provide an overview of recent advancements in the field of molecular surface analysis and its integration with AI techniques. These AI-driven approaches have led to significant advancements in various downstream tasks, including interface site prediction, protein-protein interaction prediction, surface-centric molecular generation and design.
Collapse
Affiliation(s)
- Renjie Xia
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, 213001, China
| | - Wei Li
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, 213001, China
| | - Yi Cheng
- College of Engineering, Lishui University, Lishui, 323000, China
| | - Liangxu Xie
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, 213001, China.
| | - Xiaojun Xu
- Institute of Bioinformatics and Medical Engineering, Jiangsu University of Technology, Changzhou, 213001, China.
| |
Collapse
|
3
|
Vural O, Jololian L, Pan L. DeepLigType: Predicting Ligand Types of ProteinLigand Binding Sites Using a Deep Learning Model. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2024; PP:116-123. [PMID: 39509302 DOI: 10.1109/tcbb.2024.3493820] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/15/2024]
Abstract
The analysis of protein-ligand binding sites plays a crucial role in the initial stages of drug discovery. Accurately predicting the ligand types that are likely to bind to protein-ligand binding sites enables more informed decision making in drug design. Our study, DeepLigType, determines protein-ligand binding sites using Fpocket and then predicts the ligand type of these pockets with the deep learning model, Convolutional Block Attention Module (CBAM) with ResNet. CBAM-ResNet has been trained to accurately predict five distinct ligand types. We classified protein-ligand binding sites into five different categories according to the type of response ligands cause when they bind to their target proteins, which are antagonist, agonist, activator, inhibitor, and others. We created a novel dataset, referred to as LigType5, from the widely recognized PDBbind and scPDB dataset for training and testing our model. While the literature mostly focuses on the specificity and characteristic analysis of protein binding sites by experimental (laboratory-based) methods, we propose a computational method with the DeepLigType architecture. DeepLigType demonstrated an accuracy of 74.30% and an AUC of 0.83 in ligand type prediction on a novel test dataset using the CBAM-ResNet deep learning model.
Collapse
|
4
|
de Azevedo WF, Quiroga R, Villarreal MA, da Silveira NJF, Bitencourt-Ferreira G, da Silva AD, Veit-Acosta M, Oliveira PR, Tutone M, Biziukova N, Poroikov V, Tarasova O, Baud S. SAnDReS 2.0: Development of machine-learning models to explore the scoring function space. J Comput Chem 2024; 45:2333-2346. [PMID: 38900052 DOI: 10.1002/jcc.27449] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2024] [Revised: 05/04/2024] [Accepted: 06/02/2024] [Indexed: 06/21/2024]
Abstract
Classical scoring functions may exhibit low accuracy in determining ligand binding affinity for proteins. The availability of both protein-ligand structures and affinity data make it possible to develop machine-learning models focused on specific protein systems with superior predictive performance. Here, we report a new methodology named SAnDReS that combines AutoDock Vina 1.2 with 54 regression methods available in Scikit-Learn to calculate binding affinity based on protein-ligand structures. This approach allows exploration of the scoring function space. SAnDReS generates machine-learning models based on crystal, docked, and AlphaFold-generated structures. As a proof of concept, we examine the performance of SAnDReS-generated models in three case studies. For all three cases, our models outperformed classical scoring functions. Also, SAnDReS-generated models showed predictive performance close to or better than other machine-learning models such as KDEEP, CSM-lig, and ΔVinaRF20. SAnDReS 2.0 is available to download at https://github.com/azevedolab/sandres.
Collapse
Affiliation(s)
| | - Rodrigo Quiroga
- Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), CONICET-Departamento de Química Teórica y Computacional, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | - Marcos Ariel Villarreal
- Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), CONICET-Departamento de Química Teórica y Computacional, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | | | | | - Amauri Duarte da Silva
- Programa de Pós-Graduação em Tecnologias da Informação e Gestão em Saúde, Universidade Federal de Ciências da Saúde de Porto Alegre, Porto Alegre, Brazil
| | | | | | - Marco Tutone
- Dipartimento di Scienze e Tecnologie Biologiche Chimiche e Farmaceutiche (STEBICEF), Università di Palermo, Palermo, Italy
| | | | | | | | - Stéphaine Baud
- Laboratoire SiRMa, UMR CNRS/URCA 7369, UFR Sciences Exactes et Naturelles, Université de Reims Champagne-Ardenne, CNRS, MEDYC, Reims, France
| |
Collapse
|
5
|
Comajuncosa-Creus A, Jorba G, Barril X, Aloy P. Comprehensive detection and characterization of human druggable pockets through binding site descriptors. Nat Commun 2024; 15:7917. [PMID: 39256431 PMCID: PMC11387482 DOI: 10.1038/s41467-024-52146-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Accepted: 08/27/2024] [Indexed: 09/12/2024] Open
Abstract
Druggable pockets are protein regions that have the ability to bind organic small molecules, and their characterization is essential in target-based drug discovery. However, deriving pocket descriptors is challenging and existing strategies are often limited in applicability. We introduce PocketVec, an approach to generate pocket descriptors via inverse virtual screening of lead-like molecules. PocketVec performs comparably to leading methodologies while addressing key limitations. Additionally, we systematically search for druggable pockets in the human proteome, using experimentally determined structures and AlphaFold2 models, identifying over 32,000 binding sites across 20,000 protein domains. We then generate PocketVec descriptors for each site and conduct an extensive similarity search, exploring over 1.2 billion pairwise comparisons. Our results reveal druggable pocket similarities not detected by structure- or sequence-based methods, uncovering clusters of similar pockets in proteins lacking crystallized inhibitors and opening the door to strategies for prioritizing chemical probe development to explore the druggable space.
Collapse
Affiliation(s)
- Arnau Comajuncosa-Creus
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Guillem Jorba
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain
| | - Xavier Barril
- Facultat de Farmàcia and Institut de Biomedicina, Universitat de Barcelona, Barcelona, Catalonia, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia, Spain
| | - Patrick Aloy
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute of Science and Technology, Barcelona, Catalonia, Spain.
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Catalonia, Spain.
| |
Collapse
|
6
|
Zhu Y, Gu J, Zhao Z, Chan AWE, Mojica MF, Hujer AM, Bonomo RA, Haider S. Deciphering the Coevolutionary Dynamics of L2 β-Lactamases via Deep Learning. J Chem Inf Model 2024; 64:3706-3717. [PMID: 38687957 PMCID: PMC11094718 DOI: 10.1021/acs.jcim.4c00189] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/02/2024] [Revised: 03/10/2024] [Accepted: 04/09/2024] [Indexed: 05/02/2024]
Abstract
L2 β-lactamases, serine-based class A β-lactamases expressed by Stenotrophomonas maltophilia, play a pivotal role in antimicrobial resistance (AMR). However, limited studies have been conducted on these important enzymes. To understand the coevolutionary dynamics of L2 β-lactamase, innovative computational methodologies, including adaptive sampling molecular dynamics simulations, and deep learning methods (convolutional variational autoencoders and BindSiteS-CNN) explored conformational changes and correlations within the L2 β-lactamase family together with other representative class A enzymes including SME-1 and KPC-2. This work also investigated the potential role of hydrophobic nodes and binding site residues in facilitating the functional mechanisms. The convergence of analytical approaches utilized in this effort yielded comprehensive insights into the dynamic behavior of the β-lactamases, specifically from an evolutionary standpoint. In addition, this analysis presents a promising approach for understanding how the class A β-lactamases evolve in response to environmental pressure and establishes a theoretical foundation for forthcoming endeavors in drug development aimed at combating AMR.
Collapse
Affiliation(s)
- Yu Zhu
- Pharmaceutical
and Biological Chemistry, UCL School of
Pharmacy, London WC1N 1AX, U.K.
| | - Jing Gu
- Pharmaceutical
and Biological Chemistry, UCL School of
Pharmacy, London WC1N 1AX, U.K.
| | - Zhuoran Zhao
- Pharmaceutical
and Biological Chemistry, UCL School of
Pharmacy, London WC1N 1AX, U.K.
| | - A. W. Edith Chan
- Division
of Medicine, UCL School of Pharmacy, London WC1E 6BT, U.K.
| | - Maria F. Mojica
- Department
of Molecular Biology and Microbiology, Case
Western Reserve University School of Medicine, Cleveland, Ohio 44106-5029, United
States
- Research
Service, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio 44106-1702, United States
- CWRU-Cleveland
VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA
CARES), Cleveland, Ohio 44106-5029, United States
| | - Andrea M. Hujer
- Research
Service, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio 44106-1702, United States
- Department
of Medicine, Case Western Reserve University
School of Medicine, Cleveland, Ohio 44106-5029, United States
| | - Robert A. Bonomo
- Research
Service, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio 44106-1702, United States
- CWRU-Cleveland
VAMC Center for Antimicrobial Resistance and Epidemiology (Case VA
CARES), Cleveland, Ohio 44106-5029, United States
- Clinician
Scientist Investigator, Department of Veterans Affairs Medical Center, Louis Stokes Cleveland, Cleveland, Ohio 44106-1702, United States
- Departments
of Pharmacology, Biochemistry, and Proteomics and Bioinformatics, Case Western Reserve University School of Medicine, Cleveland, Ohio 44106-5029, United
States
- Departments
of Molecular Biology and Microbiology, Medicine, Case Western Reserve University School of Medicine, Cleveland, Ohio 44106-5029, United
States
| | - Shozeb Haider
- Pharmaceutical
and Biological Chemistry, UCL School of
Pharmacy, London WC1N 1AX, U.K.
- UCL
Centre for Advanced Research in Computing, University College London, London WC1H 9RL, U.K.
| |
Collapse
|
7
|
Reim T, Ehrt C, Graef J, Günther S, Meents A, Rarey M. SiteMine: Large-scale binding site similarity searching in protein structure databases. Arch Pharm (Weinheim) 2024; 357:e2300661. [PMID: 38335311 DOI: 10.1002/ardp.202300661] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/13/2023] [Revised: 01/10/2024] [Accepted: 01/16/2024] [Indexed: 02/12/2024]
Abstract
Drug discovery and design challenges, such as drug repurposing, analyzing protein-ligand and protein-protein complexes, ligand promiscuity studies, or function prediction, can be addressed by protein binding site similarity analysis. Although numerous tools exist, they all have individual strengths and drawbacks with regard to run time, provision of structure superpositions, and applicability to diverse application domains. Here, we introduce SiteMine, an all-in-one database-driven, alignment-providing binding site similarity search tool to tackle the most pressing challenges of binding site comparison. The performance of SiteMine is evaluated on the ProSPECCTs benchmark, showing a promising performance on most of the data sets. The method performs convincingly regarding all quality criteria for reliable binding site comparison, offering a novel state-of-the-art approach for structure-based molecular design based on binding site comparisons. In a SiteMine showcase, we discuss the high structural similarity between cathepsin L and calpain 1 binding sites and give an outlook on the impact of this finding on structure-based drug design. SiteMine is available at https://uhh.de/naomi.
Collapse
Affiliation(s)
- Thorben Reim
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| | - Christiane Ehrt
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| | - Joel Graef
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| | - Sebastian Günther
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany
| | - Alke Meents
- Center for Free-Electron Laser Science CFEL, Deutsches Elektronen-Synchrotron DESY, Hamburg, Germany
| | - Matthias Rarey
- ZBH - Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| |
Collapse
|
8
|
Bitencourt-Ferreira G, Villarreal MA, Quiroga R, Biziukova N, Poroikov V, Tarasova O, de Azevedo Junior WF. Exploring Scoring Function Space: Developing Computational Models for Drug Discovery. Curr Med Chem 2024; 31:2361-2377. [PMID: 36944627 DOI: 10.2174/0929867330666230321103731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2022] [Revised: 12/15/2022] [Accepted: 12/29/2022] [Indexed: 03/23/2023]
Abstract
BACKGROUND The idea of scoring function space established a systems-level approach to address the development of models to predict the affinity of drug molecules by those interested in drug discovery. OBJECTIVE Our goal here is to review the concept of scoring function space and how to explore it to develop machine learning models to address protein-ligand binding affinity. METHODS We searched the articles available in PubMed related to the scoring function space. We also utilized crystallographic structures found in the protein data bank (PDB) to represent the protein space. RESULTS The application of systems-level approaches to address receptor-drug interactions allows us to have a holistic view of the process of drug discovery. The scoring function space adds flexibility to the process since it makes it possible to see drug discovery as a relationship involving mathematical spaces. CONCLUSION The application of the concept of scoring function space has provided us with an integrated view of drug discovery methods. This concept is useful during drug discovery, where we see the process as a computational search of the scoring function space to find an adequate model to predict receptor-drug binding affinity.
Collapse
Affiliation(s)
| | - Marcos A Villarreal
- CONICET-Departamento de Matemática y Física, Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | - Rodrigo Quiroga
- CONICET-Departamento de Matemática y Física, Instituto de Investigaciones en Fisicoquímica de Córdoba (INFIQC), Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, Córdoba, Argentina
| | - Nadezhda Biziukova
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Vladimir Poroikov
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Olga Tarasova
- Institute of Biomedical Chemistry, Pogodinskaya Str., 10/8, Moscow, 119121, Russia
| | - Walter F de Azevedo Junior
- Pontifical Catholic University of Rio Grande do Sul - PUCRS, Porto Alegre-RS, Brazil
- Specialization Program in Bioinformatics, The Pontifical Catholic University of Rio Grande do Sul (PUCRS), Av. Ipiranga, 6681 Porto Alegre / RS 90619-900, Brazil
| |
Collapse
|
9
|
Lecca P, Lecca M. Graph embedding and geometric deep learning relevance to network biology and structural chemistry. Front Artif Intell 2023; 6:1256352. [PMID: 38035201 PMCID: PMC10687447 DOI: 10.3389/frai.2023.1256352] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2023] [Accepted: 10/16/2023] [Indexed: 12/02/2023] Open
Abstract
Graphs are used as a model of complex relationships among data in biological science since the advent of systems biology in the early 2000. In particular, graph data analysis and graph data mining play an important role in biology interaction networks, where recent techniques of artificial intelligence, usually employed in other type of networks (e.g., social, citations, and trademark networks) aim to implement various data mining tasks including classification, clustering, recommendation, anomaly detection, and link prediction. The commitment and efforts of artificial intelligence research in network biology are motivated by the fact that machine learning techniques are often prohibitively computational demanding, low parallelizable, and ultimately inapplicable, since biological network of realistic size is a large system, which is characterised by a high density of interactions and often with a non-linear dynamics and a non-Euclidean latent geometry. Currently, graph embedding emerges as the new learning paradigm that shifts the tasks of building complex models for classification, clustering, and link prediction to learning an informative representation of the graph data in a vector space so that many graph mining and learning tasks can be more easily performed by employing efficient non-iterative traditional models (e.g., a linear support vector machine for the classification task). The great potential of graph embedding is the main reason of the flourishing of studies in this area and, in particular, the artificial intelligence learning techniques. In this mini review, we give a comprehensive summary of the main graph embedding algorithms in light of the recent burgeoning interest in geometric deep learning.
Collapse
Affiliation(s)
- Paola Lecca
- Faculty of Engineering, Free University of Bozen-Bolzano, Bolzano, Italy
| | - Michela Lecca
- Fondazione Bruno Kessler, Digital Industry Center, Technologies of Vision, Trento, Italy
| |
Collapse
|