1
|
Carpenter KA, Altman RB. Databases of ligand-binding pockets and protein-ligand interactions. Comput Struct Biotechnol J 2024; 23:1320-1338. [PMID: 38585646 PMCID: PMC10997877 DOI: 10.1016/j.csbj.2024.03.015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2024] [Revised: 03/16/2024] [Accepted: 03/17/2024] [Indexed: 04/09/2024] Open
Abstract
Many research groups and institutions have created a variety of databases curating experimental and predicted data related to protein-ligand binding. The landscape of available databases is dynamic, with new databases emerging and established databases becoming defunct. Here, we review the current state of databases that contain binding pockets and protein-ligand binding interactions. We have compiled a list of such databases, fifty-three of which are currently available for use. We discuss variation in how binding pockets are defined and summarize pocket-finding methods. We organize the fifty-three databases into subgroups based on goals and contents, and describe standard use cases. We also illustrate that pockets within the same protein are characterized differently across different databases. Finally, we assess critical issues of sustainability, accessibility and redundancy.
Collapse
Affiliation(s)
- Kristy A. Carpenter
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
| | - Russ B. Altman
- Department of Biomedical Data Science, Stanford University, Stanford, CA 94305, USA
- Department of Bioengineering, Stanford University, Stanford, CA 94305, USA
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Department of Medicine, Stanford University, Stanford, CA 94305, USA
| |
Collapse
|
2
|
Ntie-Kang F, Eni DB, Telukunta KK, Osamor VC, Egieyeh SA, Duran-Frigola M, Mishra P, Shadrack DM, Paul L, Musyoka TM, Blin K, Farid MM, Chen Y, Djogang LK, Betow JY, Ibezim A, Joshi D, Edwin AT, Chama MA, Ongagna JM, Kemdoum Sinda PV, Metuge JA, Bekono BD, Isa MA, Medina-Franco JL, Weber T, Dorrestein PC, Janezic D, Bishop ÖT, Ludwig-Müller J. The workshops on computational applications in secondary metabolite discovery (CAiSMD). PHYSICAL SCIENCES REVIEWS 2024; 9:3289-3304. [PMID: 39478877 PMCID: PMC11519840 DOI: 10.1515/psr-2024-0015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2024] [Accepted: 02/03/2024] [Indexed: 11/02/2024]
Abstract
We report the outcomes of the second session of the free online open-access workshop "Computational Applications in Secondary Metabolite Discovery (CAiSMD) 2022" that took place from 09 to 11 March 2022. The first session was held from 08 to 10 March 2021 and drew the attention of many early career scientists from academia and industry. The 23 invited speakers of this year's workshop also came from academia and industry and 222 registered participants from five continents (Africa, Asia, Europe, South, and North America) took part in the workshop. The workshop highlighted the potential applications of computational methodologies in the search for secondary metabolites or natural products as drug candidates and drug leads. For three days, the participants of this online workshop discussed modern computer-based approaches for exploring NP discovery in the "omics" age. The invited experts gave keynote lectures, trained participants in hands-on sessions, and held round table discussions. These were followed by oral presentations during which much interaction between the speakers and the audience was observed. Selected applicants (early-career scientists) were offered the opportunity to give oral presentations (15 min) upon submission of an abstract. The final program available on the workshop website (https://indiayouth.info/index.php/caismd) comprised three keynote lectures, 14 oral presentations, two round table discussions, and four hands-on sessions. This meeting report also references internet resources for computational biology around secondary metabolites that are of use outside of the workshop areas and will constitute a long-term valuable source for the community.
Collapse
Affiliation(s)
- Fidele Ntie-Kang
- Center for Drug Discovery, University of Buea, P. O. Box 63Buea, Cameroon
- Department of Chemistry, University of Buea, P. O. Box 63Buea, Cameroon
- Institute of Pharmacy, Martin-Luther University of Halle-Wittenberg, Kurt-Mothes-Str. 3, 06120Halle, Germany
| | - Donatus B. Eni
- Center for Drug Discovery, University of Buea, P. O. Box 63Buea, Cameroon
- Department of Chemistry, University of Buea, P. O. Box 63Buea, Cameroon
| | - Kiran K. Telukunta
- Tarunavadaanenasaha Muktbharatonnayana Samstha Foundation, Hyderabad, India
| | | | - Samuel A. Egieyeh
- School of Pharmacy, University of the Western Cape, Cape Town, 7535South Africa
- South African Medical Research Council Bioinformatics Unit, South African National Bioinformatics Institute, University of the Western Cape, Cape Town7535, South Africa
| | | | - Pankaj Mishra
- Department of Education, Uresearcher Growth Labs Private Limited, 2/44-45, Wazir Hasan Road, Lucknow, Uttar Pradesh226001, India
| | - Daniel M. Shadrack
- Department of Chemistry, St. John’s University of Tanzania, P. O. Box 47Dodoma, Tanzania
| | - Lucas Paul
- Department of Chemistry, Dar es Salaam University College of Education (DUCE), P. O. Box 2329Dar es Salaam, Tanzania
| | - Thommas M. Musyoka
- Department of Biochemistry, Microbiology and Biotechnology, Kenyatta University, Nairobi, Kenya
| | - Kai Blin
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Mai M. Farid
- Department of Phytochemistry and Plant Systematics, Pharmaceutical and Drug Industries Research Institute, National Research Center, Cairo, Egypt
| | - Ya Chen
- Department of Pharmaceutical Sciences, Division of Pharmaceutical Chemistry, Faculty of Life Sciences, University of Vienna, 1090Vienna, Austria
| | | | - Jude Y. Betow
- Center for Drug Discovery, University of Buea, P. O. Box 63Buea, Cameroon
- Department of Chemistry, University of Buea, P. O. Box 63Buea, Cameroon
| | - Akachukwu Ibezim
- Department of Pharmaceutical and Medicinal Chemistry, Faculty of Pharmaceutical Sciences, University of Nigeria, Nsukka, Nigeria
| | - Darshana Joshi
- Tarunavadaanenasaha Muktbharatonnayana Samstha Foundation, Hyderabad, India
| | - Alanis T. Edwin
- Tarunavadaanenasaha Muktbharatonnayana Samstha Foundation, Hyderabad, India
| | - Mary A. Chama
- Department of Chemistry, School of Physical and Mathematical Sciences, University of Ghana, Accra, Ghana
| | | | | | - Jonathan A. Metuge
- Department of Natural Resources and Environmental Sciences, Alabama A & M University, Huntsville, USA
| | - Boris D. Bekono
- Department of Physics, Ecole Normale Supérieure, University of Yaoundé I, BP. 47, Yaoundé, Cameroon
| | - Mustafa A. Isa
- Bioinformatics and Computational Biology Lab, Department of Microbiology, Faculty of Sciences, University of Maiduguri, P.M.B. 1069, Maiduguri, Borno State, Nigeria
- School of Pharmacy, Taipei Medical University, Taipei, Taiwan
| | - José L. Medina-Franco
- DIFACQUIM Research Group, Department of Pharmacy, School of Chemistry, Universidad Nacional Autónoma de Mexico, Mexico City04510, Mexico
| | - Tilmann Weber
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Kongens Lyngby, Denmark
| | - Pieter C. Dorrestein
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, 9500 Gilman Drive, MC 0751, La Jolla, CA92093-0751, USA
| | - Dusanka Janezic
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000Koper, Slovenia
| | - Özlem Tastan Bishop
- Research Unit in Bioinformatics (RUBi), Department of Biochemistry and Microbiology, Rhodes University, Makhanda, 6140, South Africa
| | - Jutta Ludwig-Müller
- Faculty of Biology, Technische Universität Dresden, Zellescher Weg 20b, 01062Dresden, Germany
| |
Collapse
|
3
|
Saifi I, Bhat BA, Hamdani SS, Bhat UY, Lobato-Tapia CA, Mir MA, Dar TUH, Ganie SA. Artificial intelligence and cheminformatics tools: a contribution to the drug development and chemical science. J Biomol Struct Dyn 2024; 42:6523-6541. [PMID: 37434311 DOI: 10.1080/07391102.2023.2234039] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/12/2023] [Accepted: 07/03/2023] [Indexed: 07/13/2023]
Abstract
In the ever-evolving field of drug discovery, the integration of Artificial Intelligence (AI) and Machine Learning (ML) with cheminformatics has proven to be a powerful combination. Cheminformatics, which combines the principles of computer science and chemistry, is used to extract chemical information and search compound databases, while the application of AI and ML allows for the identification of potential hit compounds, optimization of synthesis routes, and prediction of drug efficacy and toxicity. This collaborative approach has led to the discovery, preclinical evaluations and approval of over 70 drugs in recent years. To aid researchers in the pursuit of new drugs, this article presents a comprehensive list of databases, datasets, predictive and generative models, scoring functions and web platforms that have been launched between 2021 and 2022. These resources provide a wealth of information and tools for computer-assisted drug development, and are a valuable asset for those working in the field of cheminformatics. Overall, the integration of AI, ML and cheminformatics has greatly advanced the drug discovery process and continues to hold great potential for the future. As new resources and technologies become available, we can expect to see even more groundbreaking discoveries and advancements in these fields.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Ifra Saifi
- Chaudhary Charan Singh University, Meerut, Uttar Pradesh, India
| | - Basharat Ahmad Bhat
- Department of Bioresources, School of Biological Sciences, University of Kashmir, Srinagar, J&K, India
| | - Syed Suhail Hamdani
- Department of Bioresources, School of Biological Sciences, University of Kashmir, Srinagar, J&K, India
| | - Umar Yousuf Bhat
- Department of Zoology, School of Biological Sciences, University of Kashmir, Srinagar, J&K, India
| | | | - Mushtaq Ahmad Mir
- Department of Clinical Laboratory Sciences, College of Applied Medical Science, King Khalid University, KSA, Saudi Arabia
| | - Tanvir Ul Hasan Dar
- Department of Biotechnology, School of Biosciences and Biotechnology, BGSB University, Rajouri, India
| | - Showkat Ahmad Ganie
- Department of Clinical Biochemistry, School of Biological Sciences, University of Kashmir, Srinagar, J&K, India
| |
Collapse
|
4
|
Liu JX, Zhang X, Huang YQ, Hao GF, Yang GF. Multi-level bioinformatics resources support drug target discovery of protein-protein interactions. Drug Discov Today 2024; 29:103979. [PMID: 38608830 DOI: 10.1016/j.drudis.2024.103979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2024] [Revised: 03/14/2024] [Accepted: 04/05/2024] [Indexed: 04/14/2024]
Abstract
Drug discovery often begins with a new target. Protein-protein interactions (PPIs) are crucial to multitudinous cellular processes and offer a promising avenue for drug-target discovery. PPIs are characterized by multi-level complexity: at the protein level, interaction networks can be used to identify potential targets, whereas at the residue level, the details of the interactions of individual PPIs can be used to examine a target's druggability. Much great progress has been made in target discovery through multi-level PPI-related computational approaches, but these resources have not been fully discussed. Here, we systematically survey bioinformatics tools for identifying and assessing potential drug targets, examining their characteristics, limitations and applications. This work will aid the integration of the broader protein-to-network context with the analysis of detailed binding mechanisms to support the discovery of drug targets.
Collapse
Affiliation(s)
- Jia-Xin Liu
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China
| | - Xiao Zhang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Yuan-Qin Huang
- State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China
| | - Ge-Fei Hao
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China; State Key Laboratory of Green Pesticide, Key Laboratory of Green Pesticide and Agricultural Bioengineering, Ministry of Education, Center for R&D of Fine Chemicals, Guizhou University, Guiyang 550025, PR China.
| | - Guang-Fu Yang
- National Key Laboratory of Green Pesticide, Key Laboratory of Pesticide & Chemical Biology, Ministry of Education, International Joint Research Center for Intelligent Biosensor Technology and Health, Central China Normal University, Wuhan 430079, PR China.
| |
Collapse
|
5
|
Wang G, Zhu ZM, Wang K. Identification of ROS and KEAP1-related genes and verified targets of α-hederin induce cell death for CRC. Drug Dev Res 2024; 85:e22200. [PMID: 38747107 DOI: 10.1002/ddr.22200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2023] [Revised: 04/18/2024] [Accepted: 05/06/2024] [Indexed: 06/05/2024]
Abstract
In this study, we analyzed and verified differentially expressed genes (DEGs) in ROS and KEAP1 crosstalk in oncogenic signatures using GEO data sets (GSE4107 and GSE41328). Multiple pathway enrichment analyses were finished based on DEGs. The genetic signature for colorectal adenocarcinoma (COAD) was identified by using the Cox regression analysis. Kaplan-Meier survival and receiver operating characteristic curve analysis were used to explore the prognosis value of specific genes in COAD. The potential immune signatures and drug sensitivity prediction were also analyzed. Promising small-molecule agents were identified and predicted targets of α-hederin in SuperPred were validated by molecular docking. Also, expression levels of genes and Western blot analysis were conducted. In total, 48 genes were identified as DEGs, and the hub genes such as COL1A1, CXCL12, COL1A2, FN1, CAV1, TIMP3, and IGFBP7 were identified. The ROS and KEAP1-associated gene signatures comprised of hub key genes were developed for predicting the prognosis and evaluating the immune cell responses and immune infiltration in COAD. α-hederin, a potential anti-colorectal cancer (CRC) agent, was found to enhance the sensitivity of HCT116 cells, regulate CAV1 and COL1A1, and decrease KEAP1, Nrf2, and HO-1 expression significantly. KEAP1-related genes could be an essential mediator of ROS in CRC, and KEAP1-associated genes were effective in predicting prognosis and evaluating individualized CRC treatment. Therefore, α-hederin may be an effective chemosensitizer for CRC treatments in clinical settings.
Collapse
Affiliation(s)
- Gang Wang
- Department of Pharmaceutics, Shanghai Eighth People's Hospital, Jiangsu University, Shanghai, China
| | - Zhi-Min Zhu
- Department of Pharmaceutics, Shanghai Eighth People's Hospital, Jiangsu University, Shanghai, China
| | - Kun Wang
- Department of Medicine, Jiangsu University, Zhenjiang, Jiangsu, China
| |
Collapse
|
6
|
Zeng X, Su GP, Li SJ, Lv SQ, Wen ML, Li Y. Drug-Online: an online platform for drug-target interaction, affinity, and binding sites identification using deep learning. BMC Bioinformatics 2024; 25:156. [PMID: 38641811 PMCID: PMC11031932 DOI: 10.1186/s12859-024-05783-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2024] [Accepted: 04/12/2024] [Indexed: 04/21/2024] Open
Abstract
BACKGROUND Accurately identifying drug-target interaction (DTI), affinity (DTA), and binding sites (DTS) is crucial for drug screening, repositioning, and design, as well as for understanding the functions of target. Although there are a few online platforms based on deep learning for drug-target interaction, affinity, and binding sites identification, there is currently no integrated online platforms for all three aspects. RESULTS Our solution, the novel integrated online platform Drug-Online, has been developed to facilitate drug screening, target identification, and understanding the functions of target in a progressive manner of "interaction-affinity-binding sites". Drug-Online platform consists of three parts: the first part uses the drug-target interaction identification method MGraphDTA, based on graph neural networks (GNN) and convolutional neural networks (CNN), to identify whether there is a drug-target interaction. If an interaction is identified, the second part employs the drug-target affinity identification method MMDTA, also based on GNN and CNN, to calculate the strength of drug-target interaction, i.e., affinity. Finally, the third part identifies drug-target binding sites, i.e., pockets. The method pt-lm-gnn used in this part is also based on GNN. CONCLUSIONS Drug-Online is a reliable online platform that integrates drug-target interaction, affinity, and binding sites identification. It is freely available via the Internet at http://39.106.7.26:8000/Drug-Online/ .
Collapse
Affiliation(s)
- Xin Zeng
- College of Mathematics and Computer Science, Dali University, Dali, 671003, China
| | - Guang-Peng Su
- College of Mathematics and Computer Science, Dali University, Dali, 671003, China
| | - Shu-Juan Li
- Yunnan Institute of Endemic Diseases Control and Prevention, Dali, 671000, China
| | - Shuang-Qing Lv
- Institute of Surveying and Information Engineering West, Yunnan University of Applied Science, Dali, 671000, China
| | - Meng-Liang Wen
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, 650000, China
| | - Yi Li
- College of Mathematics and Computer Science, Dali University, Dali, 671003, China.
| |
Collapse
|
7
|
Stevenson GA, Kirshner D, Bennion BJ, Yang Y, Zhang X, Zemla A, Torres MW, Epstein A, Jones D, Kim H, Bennett WFD, Wong SE, Allen JE, Lightstone FC. Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization Method. J Chem Inf Model 2023; 63:6655-6666. [PMID: 37847557 PMCID: PMC10647021 DOI: 10.1021/acs.jcim.3c00722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Indexed: 10/18/2023]
Abstract
Protein-ligand interactions are essential to drug discovery and drug development efforts. Desirable on-target or multitarget interactions are the first step in finding an effective therapeutic, while undesirable off-target interactions are the first step in assessing safety. In this work, we introduce a novel ligand-based featurization and mapping of human protein pockets to identify closely related protein targets and to project novel drugs into a hybrid protein-ligand feature space to identify their likely protein interactions. Using structure-based template matches from PDB, protein pockets are featured by the ligands that bind to their best co-complex template matches. The simplicity and interpretability of this approach provide a granular characterization of the human proteome at the protein-pocket level instead of the traditional protein-level characterization by family, function, or pathway. We demonstrate the power of this featurization method by clustering a subset of the human proteome and evaluating the predicted cluster associations of over 7000 compounds.
Collapse
Affiliation(s)
- Garrett A. Stevenson
- Computational
Engineering Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Dan Kirshner
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Brian J. Bennion
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Yue Yang
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Xiaohua Zhang
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Adam Zemla
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Marisa W. Torres
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Aidan Epstein
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Derek Jones
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
- Department
of Computer Science and Engineering, University
of California, San Diego, La Jolla, California 92093, United States
| | - Hyojin Kim
- Center
for Applied Scientific Computing, Lawrence
Livermore National Laboratory, Livermore, California 94550, United States
| | - W. F. Drew Bennett
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Sergio E. Wong
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Jonathan E. Allen
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Felice C. Lightstone
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| |
Collapse
|
8
|
Chemically inducible split protein regulators for mammalian cells. Nat Chem Biol 2023; 19:64-71. [PMID: 36163385 DOI: 10.1038/s41589-022-01136-x] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Accepted: 08/08/2022] [Indexed: 12/31/2022]
Abstract
Chemically inducible systems represent valuable synthetic biology tools that enable the external control of biological processes. However, their translation to therapeutic applications has been limited because of unfavorable ligand characteristics or the immunogenicity of xenogeneic protein domains. To address these issues, we present a strategy for engineering inducible split protein regulators (INSPIRE) in which ligand-binding proteins of human origin are split into two fragments that reassemble in the presence of a cognate physiological ligand or clinically approved drug. We show that the INSPIRE platform can be used for dynamic, orthogonal and multiplex control of gene expression in mammalian cells. Furthermore, we demonstrate the functionality of a glucocorticoid-responsive INSPIRE platform in vivo and apply it for perturbing an endogenous regulatory network. INSPIRE presents a generalizable approach toward designing small-molecule responsive systems that can be implemented for the construction of new sensors, regulatory networks and therapeutic applications.
Collapse
|
9
|
Konc J, Janežič D. Protein binding sites for drug design. Biophys Rev 2022; 14:1413-1421. [PMID: 36532870 PMCID: PMC9734416 DOI: 10.1007/s12551-022-01028-3] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/12/2022] [Accepted: 12/01/2022] [Indexed: 12/13/2022] Open
Abstract
Drug development is a lengthy and challenging process that can be accelerated at early stages by new mathematical approaches and modern computers. To address this important issue, we are developing new mathematical solutions for the detection and characterization of protein binding sites that are important for new drug development. In this review, we present algorithms based on graph theory combined with molecular dynamics simulations that we have developed for studying biological target proteins to provide important data for optimizing the early stages of new drug development. A particular focus is the development of new protein binding site prediction algorithms (ProBiS) and new web tools for modeling pharmaceutically interesting molecules-ProBiS Tools (algorithm, database, web server), which have evolved into a full-fledged graphical tool for studying proteins in the proteome. ProBiS differs from other structural algorithms in that it can align proteins with different folds without prior knowledge of the binding sites. It allows detection of similar binding sites and can predict molecular ligands of various types of pharmaceutical interest that could be advanced to drugs to treat a disease, based on the entire Protein Data Bank (PDB) and AlphaFold database, including proteins not yet in the PDB. All ProBiS Tools are freely available to the academic community at http://insilab.org and https://probis.nih.gov.
Collapse
Affiliation(s)
- Janez Konc
- Theory Department, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana, Slovenia
| | - Dušanka Janežič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000 Koper, Slovenia
| |
Collapse
|
10
|
Konc J, Janežič D. ProBiS-Fold Approach for Annotation of Human Structures from the AlphaFold Database with No Corresponding Structure in the PDB to Discover New Druggable Binding Sites. J Chem Inf Model 2022; 62:5821-5829. [PMID: 36269348 DOI: 10.1021/acs.jcim.2c00947] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
ProBiS (Protein Binding Sites), a local structure-based comparison algorithm, is used in the new ProBiS-Fold web server to annotate human structures from the AlphaFold database without a corresponding structure in the Protein Data Bank (PDB) to discover new druggable binding sites. The ProBiS algorithm is used to compare each query protein structure predicted by the AlphaFold approach with the protein structures from the PDB to identify similarities between known binding sites found in the PDB and yet unknown binding sites in the AlphaFold database. Ligands bound in these identified similar PDB sites are then transferred to each query protein from the AlphaFold database, and binding sites are identified as ligand clusters on an AlphaFold protein. Small molecule binding sites are assigned druggability scores based on the similarity of their ligands to known drugs, allowing them to be ranked according to their perceived and actual potential for drug development. ProBiS-Fold provides interactive and downloadable binding sites for the entire human structural proteome, including more than 3000 new druggable binding sites that have no corresponding structure in the PDB, taking into account AlphaFold's model quality, to enable protein structure-function relationship studies and pharmaceutical drug discovery research. The web server is freely accessible to academic users at http://probis-fold.insilab.org.
Collapse
Affiliation(s)
- Janez Konc
- Theory Department, National Institute of Chemistry, Hajdrihova 19, SI-1001 Ljubljana, Slovenia
| | - Dušanka Janežič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000 Koper, Slovenia
| |
Collapse
|
11
|
Sim J, Kwon S, Seok C. HProteome-BSite: predicted binding sites and ligands in human 3D proteome. Nucleic Acids Res 2022; 51:D403-D408. [PMID: 36243970 PMCID: PMC9825455 DOI: 10.1093/nar/gkac873] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 09/20/2022] [Accepted: 09/29/2022] [Indexed: 01/29/2023] Open
Abstract
Atomic-level knowledge of protein-ligand interactions allows a detailed understanding of protein functions and provides critical clues to discovering molecules regulating the functions. While recent innovative deep learning methods for protein structure prediction dramatically increased the structural coverage of the human proteome, molecular interactions remain largely unknown. A new database, HProteome-BSite, provides predictions of binding sites and ligands in the enlarged 3D human proteome. The model structures for human proteins from the AlphaFold Protein Structure Database were processed to structural domains of high confidence to maximize the coverage and reliability of interaction prediction. For ligand binding site prediction, an updated version of a template-based method GalaxySite was used. A high-level performance of the updated GalaxySite was confirmed. HProteome-BSite covers 80.74% of the UniProt entries in the AlphaFold human 3D proteome. Predicted binding sites and binding poses of potential ligands are provided for effective applications to further functional studies and drug discovery. The HProteome-BSite database is available at https://galaxy.seoklab.org/hproteome-bsite/database and is free and open to all users.
Collapse
Affiliation(s)
- Jiho Sim
- Department of Chemistry, Seoul National University, Seoul 08826, Republic of Korea
| | - Sohee Kwon
- Department of Chemistry, Seoul National University, Seoul 08826, Republic of Korea,Galux Inc, Gwanak-gu, Seoul 08738, Republic of Korea
| | - Chaok Seok
- To whom correspondence should be addressed. Tel: +82 2 880 9197; Fax: +82 2 889 1568;
| |
Collapse
|
12
|
CavitySpace: A Database of Potential Ligand Binding Sites in the Human Proteome. Biomolecules 2022; 12:biom12070967. [PMID: 35883523 PMCID: PMC9312471 DOI: 10.3390/biom12070967] [Citation(s) in RCA: 20] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 07/07/2022] [Accepted: 07/08/2022] [Indexed: 02/01/2023] Open
Abstract
Location and properties of ligand binding sites provide important information to uncover protein functions and to direct structure-based drug design approaches. However, as binding site detection depends on the three-dimensional (3D) structural data of proteins, functional analysis based on protein ligand binding sites is formidable for proteins without structural information. Recent developments in protein structure prediction and the 3D structures built by AlphaFold provide an unprecedented opportunity for analyzing ligand binding sites in human proteins. Here, we constructed the CavitySpace database, the first pocket library for all the proteins in the human proteome, using a widely-applied ligand binding site detection program CAVITY. Our analysis showed that known ligand binding sites could be well recovered. We grouped the predicted binding sites according to their similarity which can be used in protein function prediction and drug repurposing studies. Novel binding sites in highly reliable predicted structure regions provide new opportunities for drug discovery. Our CavitySpace is freely available and provides a valuable tool for drug discovery and protein function studies.
Collapse
|
13
|
Kores K, Kolenc Z, Furlan V, Bren U. Inverse Molecular Docking Elucidating the Anticarcinogenic Potential of the Hop Natural Product Xanthohumol and Its Metabolites. Foods 2022; 11:foods11091253. [PMID: 35563976 PMCID: PMC9104229 DOI: 10.3390/foods11091253] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2022] [Revised: 04/19/2022] [Accepted: 04/25/2022] [Indexed: 01/27/2023] Open
Abstract
Natural products from plants exert a promising potential to act as antioxidants, antimicrobials, anti-inflammatory, and anticarcinogenic agents. Xanthohumol, a natural compound from hops, is indeed known for its anticarcinogenic properties. Xanthohumol is converted into three metabolites: isoxanthohumol (non-enzymatically) as well as 8- and 6-prenylnaringenin (enzymatically). An inverse molecular docking approach was applied to xanthohumol and its three metabolites to discern their potential protein targets. The aim of our study was to disclose the potential protein targets of xanthohumol and its metabolites in order to expound on the potential anticarcinogenic mechanisms of xanthohumol based on the found target proteins. The investigated compounds were docked into the predicted binding sites of all human protein structures from the Protein Data Bank, and the best docking poses were examined. Top scoring human protein targets with successfully docked compounds were identified, and their experimental connection with the anticarcinogenic function or cancer was investigated. The obtained results were carefully checked against the existing experimental findings from the scientific literature as well as further validated using retrospective metrics. More than half of the human protein targets of xanthohumol with the highest docking scores have already been connected with the anticarcinogenic function, and four of them (including two important representatives of the matrix metalloproteinase family, MMP-2 and MMP-9) also have a known experimental correlation with xanthohumol. Another important protein target is acyl-protein thioesterase 2, to which xanthohumol, isoxanthohumol, and 6-prenylnaringenin were successfully docked with the lowest docking scores. Moreover, the results for the metabolites show that their most promising protein targets are connected with the anticarcinogenic function as well. We firmly believe that our study can help to elucidate the anticarcinogenic mechanisms of xanthohumol and its metabolites as after consumption, all four compounds can be simultaneously present in the organism.
Collapse
Affiliation(s)
- Katarina Kores
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty for Chemistry and Chemical Technology, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia; (K.K.); (Z.K.); (V.F.)
| | - Zala Kolenc
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty for Chemistry and Chemical Technology, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia; (K.K.); (Z.K.); (V.F.)
| | - Veronika Furlan
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty for Chemistry and Chemical Technology, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia; (K.K.); (Z.K.); (V.F.)
| | - Urban Bren
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty for Chemistry and Chemical Technology, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia; (K.K.); (Z.K.); (V.F.)
- Department of Applied Natural Sciences, Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000 Koper, Slovenia
- Correspondence: ; Tel.: +386-2-229-4421
| |
Collapse
|
14
|
Jukič M, Kores K, Janežič D, Bren U. Repurposing of Drugs for SARS-CoV-2 Using Inverse Docking Fingerprints. Front Chem 2021; 9:757826. [PMID: 35028304 PMCID: PMC8748264 DOI: 10.3389/fchem.2021.757826] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/12/2021] [Accepted: 11/12/2021] [Indexed: 01/08/2023] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 or SARS-CoV-2 is a virus that belongs to the Coronaviridae family. This group of viruses commonly causes colds but possesses a tremendous pathogenic potential. In humans, an outbreak of SARS caused by the SARS-CoV virus was first reported in 2003, followed by 2012 when the Middle East respiratory syndrome coronavirus (MERS-CoV) led to an outbreak of Middle East respiratory syndrome (MERS). Moreover, COVID-19 represents a serious socioeconomic and global health problem that has already claimed more than four million lives. To date, there are only a handful of therapeutic options to combat this disease, and only a single direct-acting antiviral, the conditionally approved remdesivir. Since there is an urgent need for active drugs against SARS-CoV-2, the strategy of drug repurposing represents one of the fastest ways to achieve this goal. An in silico drug repurposing study using two methods was conducted. A structure-based virtual screening of the FDA-approved drug database on SARS-CoV-2 main protease was performed, and the 11 highest-scoring compounds with known 3CLpro activity were identified while the methodology was used to report further 11 potential and completely novel 3CLpro inhibitors. Then, inverse molecular docking was performed on the entire viral protein database as well as on the Coronaviridae family protein subset to examine the hit compounds in detail. Instead of target fishing, inverse docking fingerprints were generated for each hit compound as well as for the five most frequently reported and direct-acting repurposed drugs that served as controls. In this way, the target-hitting space was examined and compared and we can support the further biological evaluation of all 11 newly reported hits on SARS-CoV-2 3CLpro as well as recommend further in-depth studies on antihelminthic class member compounds. The authors acknowledge the general usefulness of this approach for a full-fledged inverse docking fingerprint screening in the future.
Collapse
Affiliation(s)
- Marko Jukič
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| | - Katarina Kores
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
| | - Dušanka Janežič
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| | - Urban Bren
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Maribor, Slovenia
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Koper, Slovenia
| |
Collapse
|
15
|
Lešnik S, Bren U. Mechanistic Insights into Biological Activities of Polyphenolic Compounds from Rosemary Obtained by Inverse Molecular Docking. Foods 2021; 11:67. [PMID: 35010191 PMCID: PMC8750736 DOI: 10.3390/foods11010067] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Revised: 12/23/2021] [Accepted: 12/24/2021] [Indexed: 01/18/2023] Open
Abstract
Rosemary (Rosmarinus officinalis L.) represents a medicinal plant known for its various health-promoting properties. Its extracts and essential oils exhibit antioxidative, anti-inflammatory, anticarcinogenic, and antimicrobial activities. The main compounds responsible for these effects are the diterpenes carnosic acid, carnosol, and rosmanol, as well as the phenolic acid ester rosmarinic acid. However, surprisingly little is known about the molecular mechanisms responsible for the pharmacological activities of rosemary and its compounds. To discern these mechanisms, we performed a large-scale inverse molecular docking study to identify their potential protein targets. Listed compounds were separately docked into predicted binding sites of all non-redundant holo proteins from the Protein Data Bank and those with the top scores were further examined. We focused on proteins directly related to human health, including human and mammalian proteins as well as proteins from pathogenic bacteria, viruses, and parasites. The observed interactions of rosemary compounds indeed confirm the beforementioned activities, whereas we also identified their potential for anticoagulant and antiparasitic actions. The obtained results were carefully checked against the existing experimental findings from the scientific literature as well as further validated using both redocking procedures and retrospective metrics.
Collapse
Affiliation(s)
- Samo Lešnik
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia;
| | - Urban Bren
- Laboratory of Physical Chemistry and Chemical Thermodynamics, Faculty of Chemistry and Chemical Engineering, University of Maribor, Smetanova 17, SI-2000 Maribor, Slovenia;
- Faculty of Mathematics, Natural Sciences and Information Technologies, University of Primorska, Glagoljaška 8, SI-6000 Koper, Slovenia
| |
Collapse
|