For: Gao M, Skolnick J. APoc: large-scale identification of similar protein pockets. Bioinformatics 2013;29:597-604. [PMID: 23335017 DOI: 10.1093/bioinformatics/btt024] [Citation(s) in RCA: 91] [Impact Index Per Article: 7.6] [Indexed: 11/13/2022]

Cited by Other Article(s)

Skolnick J, Srinivasan B, Skolnick S, Edelman B, Zhou H. Entabolons: How Metabolites Modify the Biochemical Function of Proteins and Cause the Correlated Behavior of Proteins in Pathways. J Chem Inf Model 2025. [PMID: 40378093 DOI: 10.1021/acs.jcim.5c00462] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 05/18/2025]

Maity D, Qiao B. AlloBench: A Data Set Pipeline for the Development and Benchmarking of Allosteric Site Prediction Tools. ACS Omega 2025;10:17973-17982. [PMID: 40352555 PMCID: PMC12059942 DOI: 10.1021/acsomega.5c01263] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/10/2025] [Revised: 04/14/2025] [Accepted: 04/17/2025] [Indexed: 05/14/2025]

Zhang R, Chen Z, Li S, Lv H, Li J, Yang N, Dai S. Proteome-Wide Identification and Comparison of Drug Pockets for Discovering New Drug Indications and Side Effects. Molecules 2025;30:260. [PMID: 39860130 PMCID: PMC11767986 DOI: 10.3390/molecules30020260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/17/2024] [Revised: 01/04/2025] [Accepted: 01/08/2025] [Indexed: 01/27/2025]

Affiliation(s)

Renxin Zhang, Zhiyuan Chen, Shuhan Li, Haohao Lv, Jinjun Li, Naixue Yang, Shaoxing Dai: State Key Laboratory of Primate Biomedical Research, Institute of Primate Translational Medicine, Kunming University of Science and Technology, Kunming 650500, China; Yunnan Key Laboratory of Primate Biomedical Research, Kunming 650500, China

Zhang H, Gur M, Bahar I. Global hinge sites of proteins as target sites for drug binding. Proc Natl Acad Sci U S A 2024;121:e2414333121. [PMID: 39585988 PMCID: PMC11626116 DOI: 10.1073/pnas.2414333121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 07/17/2024] [Accepted: 10/17/2024] [Indexed: 11/27/2024]

Gao M, Skolnick J. Predicting protein interactions of the kinase Lck critical to T cell modulation. Structure 2024;32:2168-2179.e2. [PMID: 39368461 PMCID: PMC11560573 DOI: 10.1016/j.str.2024.09.010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/05/2024] [Revised: 08/19/2024] [Accepted: 09/10/2024] [Indexed: 10/07/2024]

Ugurlu SY, McDonald D, He S. MEF-AlloSite: an accurate and robust Multimodel Ensemble Feature selection for the Allosteric Site identification model. J Cheminform 2024;16:116. [PMID: 39444016 PMCID: PMC11515501 DOI: 10.1186/s13321-024-00882-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/24/2024] [Accepted: 07/09/2024] [Indexed: 10/25/2024]

Abstract

A crucial mechanism for controlling the actions of proteins is allostery. Allosteric modulators have the potential to provide many benefits compared to orthosteric ligands, such as increased selectivity and saturability of their effect. The identification of new allosteric sites presents prospects for the creation of innovative medications and enhances our comprehension of fundamental biological mechanisms. Allosteric sites are increasingly found in different protein families through various techniques, such as machine learning applications, which opens up possibilities for creating completely novel medications with a diverse variety of chemical structures. Machine learning methods, such as PASSer, exhibit limited efficacy in accurately finding allosteric binding sites when relying solely on 3D structural information.

Scientific Contribution: Prior to conducting feature selection for allosteric binding site identification, integration of supporting amino-acid-based information to 3D structural knowledge is advantageous. This approach can enhance performance by ensuring accuracy and robustness. Therefore, we have developed an accurate and robust model called Multimodel Ensemble Feature Selection for Allosteric Site Identification (MEF-AlloSite) after collecting 9460 relevant and diverse features from the literature to characterise pockets. The model employs an accurate and robust multimodal feature selection technique for the small training set size of only 90 proteins to improve predictive performance. This state-of-the-art technique increased the performance in allosteric binding site identification by selecting promising features from 9460 features. Also, the relationship between selected features and allosteric binding sites enlightened the understanding of complex allostery for proteins by analysing selected features.
MEF-AlloSite and state-of-the-art allosteric site identification methods such as PASSer2.0 and PASSerRank have been tested on three test cases 51 times with a different split of the training set. The Student's t test and Cohen's D value have been used to evaluate the average precision and ROC AUC score distribution. On three test cases, most of the p-values ( < 0.05 ) and the majority of Cohen's D values ( > 0.5 ) showed that MEF-AlloSite's 1-6% higher mean of average precision and ROC AUC than state-of-the-art allosteric site identification methods are statistically significant.
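
The effect-size comparison described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' evaluation code: it assumes an unpaired, equal-variance comparison (the abstract does not say whether the test was paired), and the function names cohens_d and student_t are hypothetical. Any scores passed in would be per-split metric values such as average precision; none are taken from the paper.

```python
import math
import statistics


def _pooled_variance(a, b):
    """Pooled sample variance of two groups (equal-variance assumption)."""
    na, nb = len(a), len(b)
    va, vb = statistics.variance(a), statistics.variance(b)  # sample variances
    return ((na - 1) * va + (nb - 1) * vb) / (na + nb - 2)


def cohens_d(a, b):
    """Cohen's d: difference of means divided by the pooled standard deviation."""
    return (statistics.mean(a) - statistics.mean(b)) / math.sqrt(_pooled_variance(a, b))


def student_t(a, b):
    """Two-sample Student's t statistic (unpaired, equal variances assumed)."""
    na, nb = len(a), len(b)
    standard_error = math.sqrt(_pooled_variance(a, b) * (1 / na + 1 / nb))
    return (statistics.mean(a) - statistics.mean(b)) / standard_error


if __name__ == "__main__":
    # Placeholder scores for two methods over three splits (made-up values).
    method_a = [0.80, 0.82, 0.84]
    method_b = [0.76, 0.78, 0.80]
    print("Cohen's d:", cohens_d(method_a, method_b))
    print("t statistic:", student_t(method_a, method_b))
```

Under the common rule of thumb, a Cohen's d above 0.5 is read as at least a medium effect, which matches the threshold quoted in the abstract. Converting the t statistic to a p-value needs the t distribution with na + nb - 2 degrees of freedom, which this sketch leaves to a statistics library.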

Reim T, Ehrt C, Graef J, Günther S, Meents A, Rarey M. SiteMine: Large-scale binding site similarity searching in protein structure databases. Arch Pharm (Weinheim) 2024;357:e2300661. [PMID: 38335311 DOI: 10.1002/ardp.202300661] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Received: 11/13/2023] [Revised: 01/10/2024] [Accepted: 01/16/2024] [Indexed: 02/12/2024]

Pallante L, Cannariato M, Androutsos L, Zizzi EA, Bompotas A, Hada X, Grasso G, Kalogeras A, Mavroudi S, Di Benedetto G, Theofilatos K, Deriu MA. VirtuousPocketome: a computational tool for screening protein-ligand complexes to identify similar binding sites. Sci Rep 2024;14:6296. [PMID: 38491261 PMCID: PMC10943019 DOI: 10.1038/s41598-024-56893-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/09/2024] [Accepted: 03/12/2024] [Indexed: 03/18/2024]

Abstract

Protein residues within binding pockets play a critical role in determining the range of ligands that can interact with a protein, influencing its structure and function. Identifying structural similarities in proteins offers valuable insights into their function and activation mechanisms, aiding in predicting protein-ligand interactions, anticipating off-target effects, and facilitating the development of therapeutic agents. Numerous computational methods assessing global or local similarity in protein cavities have emerged, but their utilization is impeded by complexity, impractical automation for amino acid pattern searches, and an inability to evaluate the dynamics of scrutinized protein-ligand systems. Here, we present a general, automatic and unbiased computational pipeline, named VirtuousPocketome, aimed at screening huge databases of proteins for similar binding pockets starting from an interested protein-ligand complex. We demonstrate the pipeline's potential by exploring a recently-solved human bitter taste receptor, i.e. the TAS2R46, complexed with strychnine. We pinpointed 145 proteins sharing similar binding sites compared to the analysed bitter taste receptor and the enrichment analysis highlighted the related biological processes, molecular functions and cellular components. This work represents the foundation for future studies aimed at understanding the effective role of tastants outside the gustatory system: this could pave the way towards the rationalization of the diet as a supplement to standard pharmacological treatments and the design of novel tastants-inspired compounds to target other proteins involved in specific diseases or disorders. The proposed pipeline is publicly accessible, can be applied to any protein-ligand complex, and could be expanded to screen any database of protein structures.

Shen Y, Parks JM, Smith JC. HLA-Clus: HLA class I clustering based on 3D structure. BMC Bioinformatics 2023;24:189. [PMID: 37161375 PMCID: PMC10169335 DOI: 10.1186/s12859-023-05297-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/16/2023] [Accepted: 04/18/2023] [Indexed: 05/11/2023]

Shen Y, Parks JM, Smith JC. HLA Class I Supertype Classification Based on Structural Similarity. J Immunol 2023;210:103-114. [PMID: 36453976 DOI: 10.4049/jimmunol.2200685] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/13/2022] [Accepted: 10/31/2022] [Indexed: 12/24/2022]

Chan L, Kumar R, Verdonk M, Poelking C. A multilevel generative framework with hierarchical self-contrasting for bias control and transparency in structure-based ligand design. Nat Mach Intell 2022. [DOI: 10.1038/s42256-022-00564-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 12/12/2022]

Guo Z, Liu J, Skolnick J, Cheng J. Prediction of inter-chain distance maps of protein complexes with 2D attention-based deep neural networks. Nat Commun 2022;13:6963. [PMID: 36379943 PMCID: PMC9666547 DOI: 10.1038/s41467-022-34600-2] [Citation(s) in RCA: 23] [Impact Index Per Article: 7.7] [Received: 06/08/2022] [Accepted: 10/24/2022] [Indexed: 11/16/2022]

Eguida M, Rognan D. Estimating the Similarity between Protein Pockets. Int J Mol Sci 2022;23:12462. [PMID: 36293316 PMCID: PMC9604425 DOI: 10.3390/ijms232012462] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Received: 10/06/2022] [Revised: 10/15/2022] [Accepted: 10/16/2022] [Indexed: 10/28/2023]

Skolnick J, Zhou H. Implications of the Essential Role of Small Molecule Ligand Binding Pockets in Protein-Protein Interactions. J Phys Chem B 2022;126:6853-6867. [PMID: 36044742 PMCID: PMC9484464 DOI: 10.1021/acs.jpcb.2c04525] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Received: 06/29/2022] [Revised: 08/18/2022] [Indexed: 11/28/2022]

D’Arrigo G, Autiero I, Gianquinto E, Siragusa L, Baroni M, Cruciani G, Spyrakis F. Exploring Ligand Binding Domain Dynamics in the NRs Superfamily. Int J Mol Sci 2022;23:8732. [PMID: 35955864 PMCID: PMC9369052 DOI: 10.3390/ijms23158732] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Received: 06/22/2022] [Revised: 07/29/2022] [Accepted: 08/04/2022] [Indexed: 11/16/2022]

Yang L, He W, Yun Y, Gao Y, Zhu Z, Teng M, Liang Z, Niu L. Defining A Global Map of Functional Group-based 3D Ligand-binding Motifs. Genomics Proteomics Bioinformatics 2022;20:765-779. [PMID: 35288344 PMCID: PMC9881048 DOI: 10.1016/j.gpb.2021.08.014] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 10/22/2020] [Revised: 06/30/2021] [Accepted: 09/27/2021] [Indexed: 01/31/2023]

Affiliation(s)

Liu Yang, Wei He, Yuehui Yun, Yongxiang Gao, Zhongliang Zhu, Maikun Teng, Zhi Liang, Liwen Niu: School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei 230026, China; Division of Molecular and Cellular Biophysics, Hefei National Laboratory for Physical Sciences at the Microscale, Hefei 230026, China

Dankwah KO, Mohl JE, Begum K, Leung MY. What Makes GPCRs from Different Families Bind to the Same Ligand? Biomolecules 2022;12:863. [PMID: 35883418 PMCID: PMC9313020 DOI: 10.3390/biom12070863] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 05/15/2022] [Revised: 06/09/2022] [Accepted: 06/19/2022] [Indexed: 12/10/2022]

Sankar S, Chandra N. SiteMotif: A graph-based algorithm for deriving structural motifs in Protein Ligand binding sites. PLoS Comput Biol 2022;18:e1009901. [PMID: 35202398 PMCID: PMC8903255 DOI: 10.1371/journal.pcbi.1009901] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 08/03/2021] [Revised: 03/08/2022] [Accepted: 02/07/2022] [Indexed: 12/03/2022]

Abstract

Studying similarities in protein molecules has become a fundamental activity in much of biology and biomedical research, for which methods such as multiple sequence alignments are widely used. Most methods available for such comparisons cater to studying proteins which have clearly recognizable evolutionary relationships but not to proteins that recognize the same or similar ligands but do not share similarities in their sequence or structural folds. In many cases, proteins in the latter class share structural similarities only in their binding sites. While several algorithms are available for comparing binding sites, there are none for deriving structural motifs of the binding sites, independent of the whole proteins. We report the development of SiteMotif, a new algorithm that compares binding sites from multiple proteins and derives sequence-order independent structural site motifs. We have tested the algorithm at multiple levels of complexity and demonstrate its performance in different scenarios. We have benchmarked against 3 current methods available for binding site comparison and demonstrate superior performance of our algorithm. We show that SiteMotif identifies new structural motifs of spatially conserved residues in proteins, even when there is no sequence or fold-level similarity. We expect SiteMotif to be useful for deriving key mechanistic insights into the mode of ligand interaction, predict the ligand type that a protein can bind and improve the sensitivity of functional annotation.

A large number of biological functions are orchestrated by proteins. The function of a protein is governed by its structure and its interacting ligand. However, it is known that not all residues are involved in ligand recognition. More specifically, residues that are located within 4.5 Å of ligand atoms are considered to be 'binding sites'. Here, we have developed an algorithm called SiteMotif that efficiently aligns multiple binding sites into a common frame. This process enables us to derive conservation among the binding site residues in a sequence-order-independent manner. The algorithm was validated extensively across five different levels, and binding site similarities were measured in each of them. Previous research has found multiple instances where different proteins have comparable binding sites and hence perform the same function. We present the ability of our method to detect such scenarios. Finally, as a use case, we applied SiteMotif to a set of glutathione binding proteins and derived a site-based sequence motif characteristic of all glutathione binding proteins.
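
The 4.5 Å definition of a binding site used in this summary can be sketched in a few lines. This is an illustrative sketch only, not SiteMotif code: the function name binding_site_residues, the (residue_id, coordinate) input layout, and the inclusive cutoff comparison are all assumptions made for the example. Parsing a real structure file is left out.

```python
import math

CUTOFF_ANGSTROM = 4.5  # distance threshold quoted in the abstract


def binding_site_residues(protein_atoms, ligand_atoms, cutoff=CUTOFF_ANGSTROM):
    """Return sorted residue ids that have at least one atom within `cutoff` of any ligand atom.

    protein_atoms: iterable of (residue_id, (x, y, z)) pairs, one per protein atom.
    ligand_atoms: iterable of (x, y, z) coordinates, one per ligand atom.
    """
    ligand = list(ligand_atoms)  # materialise so it can be scanned once per protein atom
    site = set()
    for residue_id, position in protein_atoms:
        if residue_id in site:
            continue  # residue already selected through another of its atoms
        if any(math.dist(position, atom) <= cutoff for atom in ligand):
            site.add(residue_id)
    return sorted(site)


if __name__ == "__main__":
    # Toy coordinates in angstroms, invented for the example.
    protein = [
        ("ALA1", (0.0, 0.0, 0.0)),
        ("GLY2", (10.0, 0.0, 0.0)),
        ("SER3", (3.0, 3.0, 0.0)),
    ]
    ligand = [(1.0, 1.0, 0.0)]
    print(binding_site_residues(protein, ligand))
```

The brute-force scan costs one distance evaluation per protein-ligand atom pair, which is acceptable for a single complex; a spatial index such as a k-d tree would be the usual choice for database-scale screening.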

Zhang W, Huang J. EViS: An Enhanced Virtual Screening Approach Based on Pocket-Ligand Similarity. J Chem Inf Model 2022;62:498-510. [PMID: 35084171 DOI: 10.1021/acs.jcim.1c00944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Indexed: 11/29/2022]

Rao L, Jia NX, Hu J, Yu DJ, Zhang GJ. ATPdock: a template-based method for ATP-specific protein-ligand docking. Bioinformatics 2022;38:556-558. [PMID: 34546290 DOI: 10.1093/bioinformatics/btab667] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Received: 01/24/2021] [Revised: 09/15/2021] [Accepted: 09/18/2021] [Indexed: 02/03/2023]

Gao M, Nakajima An D, Skolnick J. Deep learning-driven insights into super protein complexes for outer membrane protein biogenesis in bacteria. eLife 2022;11:82885. [PMID: 36576775 PMCID: PMC9797188 DOI: 10.7554/elife.82885] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Received: 08/22/2022] [Accepted: 11/28/2022] [Indexed: 12/29/2022]

Li J, Moumbock AFA, Qaseem A, Xu Q, Feng Y, Wang D, Günther S. AroCageDB: A Web-Based Resource for Aromatic Cage Binding Sites and Their Intrinsic Ligands. J Chem Inf Model 2021;61:5327-5330. [PMID: 34738791 DOI: 10.1021/acs.jcim.1c00927] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 11/28/2022]

Gao M, Lund-Andersen P, Morehead A, Mahmud S, Chen C, Chen X, Giri N, Roy RS, Quadir F, Effler TC, Prout R, Abraham S, Elwasif W, Haas NQ, Skolnick J, Cheng J, Sedova A. High-Performance Deep Learning Toolbox for Genome-Scale Prediction of Protein Structure and Function. Workshop on Machine Learning in HPC Environments 2021;2021:46-57. [PMID: 35112110 PMCID: PMC8802329 DOI: 10.1109/mlhpc54614.2021.00010] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Indexed: 12/30/2022]

Guterres H, Park SJ, Zhang H, Im W. CHARMM-GUI LBS Finder & Refiner for Ligand Binding Site Prediction and Refinement. J Chem Inf Model 2021;61:3744-3751. [PMID: 34296608 DOI: 10.1021/acs.jcim.1c00561] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Indexed: 12/25/2022]

Li S, Cai C, Gong J, Liu X, Li H. A fast protein binding site comparison algorithm for proteome-wide protein function prediction and drug repurposing. Proteins 2021;89:1541-1556. [PMID: 34245187 DOI: 10.1002/prot.26176] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Received: 02/09/2021] [Revised: 06/26/2021] [Accepted: 06/30/2021] [Indexed: 01/18/2023]

Gao M, Skolnick J. A novel sequence alignment algorithm based on deep learning of the protein folding code. Bioinformatics 2021;37:490-496. [PMID: 32960943 PMCID: PMC8599902 DOI: 10.1093/bioinformatics/btaa810] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Received: 06/24/2020] [Revised: 08/11/2020] [Accepted: 09/08/2020] [Indexed: 11/12/2022]

Gao M, Skolnick J. A General Framework to Learn Tertiary Structure for Protein Sequence Characterization. Front Bioinform 2021;1. [PMID: 34308415 PMCID: PMC8301223 DOI: 10.3389/fbinf.2021.689960] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 01/04/2023]

Bhadra A, Yeturu K. Site2Vec: a reference frame invariant algorithm for vector embedding of protein–ligand binding sites. Machine Learning: Science and Technology 2021. [DOI: 10.1088/2632-2153/abad88] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Indexed: 11/12/2022]

Abstract

Protein–ligand interactions are one of the fundamental types of molecular interactions in living systems. Ligands are small molecules that interact with protein molecules at specific regions on their surfaces called binding sites. Binding sites would also determine ADMET properties of a drug molecule. Tasks such as assessment of protein functional similarity and detection of side effects of drugs need identification of similar binding sites of disparate proteins across diverse pathways. To this end, methods for computing similarities between binding sites are still evolving, and this remains an active area of research. Machine learning methods for similarity assessment require feature descriptors of binding sites. Traditional methods based on hand-engineered motifs and atomic configurations are not scalable across several thousands of sites. In this regard, deep neural network algorithms are now deployed which can capture very complex input feature space. However, one fundamental challenge in applying deep learning to structures of binding sites is the input representation and the reference frame. We report here a novel algorithm, Site2Vec, that derives reference frame invariant vector embedding of a protein–ligand binding site. The method is based on pairwise distances between representative points and chemical compositions in terms of constituent amino acids of a site. The vector embedding serves as a locality sensitive hash function for proximity queries and determining similar sites. The method has been the top performer with more than 95% quality scores in extensive benchmarking studies carried over 10 data sets and against 23 other site comparison methods in the field. The algorithm serves for high throughput processing and has been evaluated for stability with respect to reference frame shifts, coordinate perturbations and residue mutations.
We also provide the method as a standalone executable and a web service hosted at (http://services.iittp.ac.in/bioinfo/home).

Predicting binding sites from unbound versus bound protein structures. Sci Rep 2020;10:15856. [PMID: 32985584 PMCID: PMC7522209 DOI: 10.1038/s41598-020-72906-7] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Received: 04/21/2020] [Accepted: 07/27/2020] [Indexed: 11/30/2022]

Chaudhari R, Fong LW, Tan Z, Huang B, Zhang S. An up-to-date overview of computational polypharmacology in modern drug discovery. Expert Opin Drug Discov 2020;15:1025-1044. [PMID: 32452701 PMCID: PMC7415563 DOI: 10.1080/17460441.2020.1767063] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Received: 01/31/2020] [Accepted: 05/06/2020] [Indexed: 12/30/2022]

Trigueiro-Louro J, Correia V, Figueiredo-Nunes I, Gíria M, Rebelo-de-Andrade H. Unlocking COVID therapeutic targets: A structure-based rationale against SARS-CoV-2, SARS-CoV and MERS-CoV Spike. Comput Struct Biotechnol J 2020;18:2117-2131. [PMID: 32913581 PMCID: PMC7452956 DOI: 10.1016/j.csbj.2020.07.017] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.6] [Received: 06/04/2020] [Revised: 07/20/2020] [Accepted: 07/22/2020] [Indexed: 12/11/2022]

Cao Y, Park SJ, Im W. A systematic analysis of protein-carbohydrate interactions in the Protein Data Bank. Glycobiology 2020;31:126-136. [PMID: 32614943 DOI: 10.1093/glycob/cwaa062] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Received: 06/10/2020] [Revised: 06/26/2020] [Accepted: 06/26/2020] [Indexed: 12/17/2022]

Katigbak J, Li H, Rooklin D, Zhang Y. AlphaSpace 2.0: Representing Concave Biomolecular Surfaces Using β-Clusters. J Chem Inf Model 2020;60:1494-1508. [PMID: 31995373 PMCID: PMC7093224 DOI: 10.1021/acs.jcim.9b00652] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Indexed: 12/26/2022]

Ribeiro VS, Santana CA, Fassio AV, Cerqueira FR, da Silveira CH, Romanelli JPR, Patarroyo-Vargas A, Oliveira MGA, Gonçalves-Almeida V, Izidoro SC, de Melo-Minardi RC, Silveira SDA. visGReMLIN: graph mining-based detection and visualization of conserved motifs at 3D protein-ligand interface at the atomic level. BMC Bioinformatics 2020;21:80. [PMID: 32164574 PMCID: PMC7068867 DOI: 10.1186/s12859-020-3347-7] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Indexed: 12/15/2022]

Simonovsky M, Meyers J. DeeplyTough: Learning Structural Comparison of Protein Binding Sites. J Chem Inf Model 2020;60:2356-2366. [DOI: 10.1021/acs.jcim.9b00554] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Indexed: 12/30/2022]

On the possible origin of protein homochirality, structure, and biochemical function. Proc Natl Acad Sci U S A 2019;116:26571-26579. [PMID: 31822617 DOI: 10.1073/pnas.1908241116] [Citation(s) in RCA: 28] [Impact Index Per Article: 4.7] [Indexed: 01/12/2023]

Naderi M, Lemoine JM, Govindaraj RG, Kana OZ, Feinstein WP, Brylinski M. Binding site matching in rational drug design: algorithms and applications. Brief Bioinform 2019;20:2167-2184. [PMID: 30169563 PMCID: PMC6954434 DOI: 10.1093/bib/bby078] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Received: 06/15/2018] [Revised: 07/18/2018] [Accepted: 07/29/2018] [Indexed: 01/06/2023]

Guterres H, Lee HS, Im W. Ligand-Binding-Site Structure Refinement Using Molecular Dynamics with Restraints Derived from Predicted Binding Site Templates. J Chem Theory Comput 2019;15:6524-6535. [PMID: 31557013 DOI: 10.1021/acs.jctc.9b00751] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Indexed: 12/18/2022]

Rifaioglu AS, Atas H, Martin MJ, Cetin-Atalay R, Atalay V, Doğan T. Recent applications of deep learning and machine intelligence on in silico drug discovery: methods, tools and databases. Brief Bioinform 2019;20:1878-1912. [PMID: 30084866 PMCID: PMC6917215 DOI: 10.1093/bib/bby061] [Citation(s) in RCA: 267] [Impact Index Per Article: 44.5] [Received: 01/25/2018] [Revised: 05/25/2018] [Indexed: 01/16/2023] Open

Abstract

The identification of interactions between drugs/compounds and their targets is crucial for the development of new drugs. In vitro screening experiments (i.e. bioassays) are frequently used for this purpose; however, experimental approaches are insufficient to explore novel drug-target interactions, mainly because of feasibility problems, as they are labour intensive, costly and time consuming. A computational field known as 'virtual screening' (VS) has emerged in the past decades to aid experimental drug discovery studies by statistically estimating unknown bio-interactions between compounds and biological targets. These methods use the physico-chemical and structural properties of compounds and/or target proteins along with the experimentally verified bio-interaction information to generate predictive models. Lately, sophisticated machine learning techniques are applied in VS to elevate the predictive performance. The objective of this study is to examine and discuss the recent applications of machine learning techniques in VS, including deep learning, which became highly popular after giving rise to epochal developments in the fields of computer vision and natural language processing. The past 3 years have witnessed an unprecedented amount of research studies considering the application of deep learning in biomedicine, including computational drug discovery. In this review, we first describe the main instruments of VS methods, including compound and protein features (i.e. representations and descriptors), frequently used libraries and toolkits for VS, bioactivity databases and gold-standard data sets for system training and benchmarking. We subsequently review recent VS studies with a strong emphasis on deep learning applications. Finally, we discuss the present state of the field, including the current challenges and suggest future directions. We believe that this survey will provide insight to the researchers working in the field of computational drug discovery in terms of comprehending and developing novel bio-prediction methods.

Cerisier N, Petitjean M, Regad L, Bayard Q, Réau M, Badel A, Camproux AC. High Impact: The Role of Promiscuous Binding Sites in Polypharmacology. Molecules 2019;24:2529. [PMID: 31295958 PMCID: PMC6680532 DOI: 10.3390/molecules24142529] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Received: 06/03/2019] [Revised: 06/27/2019] [Accepted: 06/27/2019] [Indexed: 02/06/2023] Open

Engineering brain activity patterns by neuromodulator polytherapy for treatment of disorders. Nat Commun 2019;10:2620. [PMID: 31197165 PMCID: PMC6565674 DOI: 10.1038/s41467-019-10541-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Received: 09/24/2018] [Accepted: 05/15/2019] [Indexed: 11/08/2022] Open

Ehrt C, Brinkjost T, Koch O. Binding site characterization - similarity, promiscuity, and druggability. MedChemComm 2019;10:1145-1159. [PMID: 31391887 DOI: 10.1039/c9md00102f] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Received: 02/19/2019] [Accepted: 05/31/2019] [Indexed: 12/19/2022]

Updates to Binding MOAD (Mother of All Databases): Polypharmacology Tools and Their Utility in Drug Repurposing. J Mol Biol 2019;431:2423-2433. [PMID: 31125569 DOI: 10.1016/j.jmb.2019.05.024] [Citation(s) in RCA: 47] [Impact Index Per Article: 7.8] [Received: 02/04/2019] [Revised: 05/13/2019] [Accepted: 05/14/2019] [Indexed: 01/02/2023]

Trigueiro-Louro JM, Correia V, Santos LA, Guedes RC, Brito RMM, Rebelo-de-Andrade H. To hit or not to hit: Large-scale sequence analysis and structure characterization of influenza A NS1 unlocks new antiviral target potential. Virology 2019;535:297-307. [PMID: 31104825 DOI: 10.1016/j.virol.2019.04.009] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Received: 02/19/2019] [Revised: 04/22/2019] [Accepted: 04/23/2019] [Indexed: 12/13/2022]

Kaiser F, Labudde D. Unsupervised Discovery of Geometrically Common Structural Motifs and Long-Range Contacts in Protein 3D Structures. IEEE/ACM Trans Comput Biol Bioinform 2019;16:671-680. [PMID: 29990265 DOI: 10.1109/tcbb.2017.2786250] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Indexed: 06/08/2023]

Bhagavat R, Sankar S, Srinivasan N, Chandra N. An Augmented Pocketome: Detection and Analysis of Small-Molecule Binding Pockets in Proteins of Known 3D Structure. Structure 2019. [PMID: 29514079 DOI: 10.1016/j.str.2018.02.001] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Indexed: 12/22/2022]

Yamaotsu N, Hirono S. In silico fragment-mapping method: a new tool for fragment-based/structure-based drug discovery. J Comput Aided Mol Des 2018;32:1229-1245. [PMID: 30196523 DOI: 10.1007/s10822-018-0160-8] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Received: 06/19/2018] [Accepted: 09/04/2018] [Indexed: 01/09/2023]

Correia V, Abecasis AB, Rebelo-de-Andrade H. Molecular footprints of selective pressure in the neuraminidase gene of currently circulating human influenza subtypes and lineages. Virology 2018;522:122-130. [PMID: 30029011 DOI: 10.1016/j.virol.2018.07.002] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 11/17/2017] [Revised: 07/03/2018] [Accepted: 07/04/2018] [Indexed: 12/20/2022]

Budowski-Tal I, Kolodny R, Mandel-Gutfreund Y. A Novel Geometry-Based Approach to Infer Protein Interface Similarity. Sci Rep 2018;8:8192. [PMID: 29844500 PMCID: PMC5974305 DOI: 10.1038/s41598-018-26497-z] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Received: 05/24/2017] [Accepted: 05/10/2018] [Indexed: 11/21/2022] Open

Govindaraj RG, Brylinski M. Comparative assessment of strategies to identify similar ligand-binding pockets in proteins. BMC Bioinformatics 2018. [PMID: 29523085 PMCID: PMC5845264 DOI: 10.1186/s12859-018-2109-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Indexed: 12/01/2022] Open