1
|
Stevenson GA, Kirshner D, Bennion BJ, Yang Y, Zhang X, Zemla A, Torres MW, Epstein A, Jones D, Kim H, Bennett WFD, Wong SE, Allen JE, Lightstone FC. Clustering Protein Binding Pockets and Identifying Potential Drug Interactions: A Novel Ligand-Based Featurization Method. J Chem Inf Model 2023; 63:6655-6666. [PMID: 37847557 PMCID: PMC10647021 DOI: 10.1021/acs.jcim.3c00722] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2023] [Indexed: 10/18/2023]
Abstract
Protein-ligand interactions are essential to drug discovery and drug development efforts. Desirable on-target or multitarget interactions are the first step in finding an effective therapeutic, while undesirable off-target interactions are the first step in assessing safety. In this work, we introduce a novel ligand-based featurization and mapping of human protein pockets to identify closely related protein targets and to project novel drugs into a hybrid protein-ligand feature space to identify their likely protein interactions. Using structure-based template matches from PDB, protein pockets are featured by the ligands that bind to their best co-complex template matches. The simplicity and interpretability of this approach provide a granular characterization of the human proteome at the protein-pocket level instead of the traditional protein-level characterization by family, function, or pathway. We demonstrate the power of this featurization method by clustering a subset of the human proteome and evaluating the predicted cluster associations of over 7000 compounds.
Collapse
Affiliation(s)
- Garrett A. Stevenson
- Computational
Engineering Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Dan Kirshner
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Brian J. Bennion
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Yue Yang
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Xiaohua Zhang
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Adam Zemla
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Marisa W. Torres
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Aidan Epstein
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Derek Jones
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
- Department
of Computer Science and Engineering, University
of California, San Diego, La Jolla, California 92093, United States
| | - Hyojin Kim
- Center
for Applied Scientific Computing, Lawrence
Livermore National Laboratory, Livermore, California 94550, United States
| | - W. F. Drew Bennett
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Sergio E. Wong
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| | - Jonathan E. Allen
- Global
Security Computing Applications Division, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
| | - Felice C. Lightstone
- Biosciences
and Biotechnology Division, Lawrence Livermore
National Laboratory, Livermore, California 94550, United States
| |
Collapse
|
2
|
Ehrt C, Brinkjost T, Koch O. Binding site characterization - similarity, promiscuity, and druggability. Medchemcomm 2019; 10:1145-1159. [PMID: 31391887 DOI: 10.1039/c9md00102f] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2019] [Accepted: 05/31/2019] [Indexed: 12/19/2022]
Abstract
The elucidation of non-obvious binding site similarities has provided useful indications for the establishment of polypharmacology, the identification of potential off-targets, or the repurposing of known drugs. The concept underlying all of these approaches is promiscuous binding which can be analyzed from a ligand-based or a binding site-based perspective. Herein, we applied methods for the automated analysis and comparison of protein binding sites to study promiscuous binding on a novel dataset of sites in complex with ligands sharing common shape and physicochemical properties. We show the suitability of this dataset for the benchmarking of novel binding site comparison methods. Our investigations also reveal promising directions for further in-depth analyses of promiscuity and druggability in a pocket-centered manner. Drawbacks concerning binding site similarity assessment and druggability prediction are outlined, enabling researchers to avoid the typical pitfalls of binding site analyses.
Collapse
Affiliation(s)
- Christiane Ehrt
- Faculty of Chemistry and Chemical Biology , TU Dortmund University , Dortmund , Germany
| | - Tobias Brinkjost
- Faculty of Chemistry and Chemical Biology , TU Dortmund University , Dortmund , Germany.,Department of Computer Science , TU Dortmund University , Dortmund , Germany
| | - Oliver Koch
- Faculty of Chemistry and Chemical Biology , TU Dortmund University , Dortmund , Germany
| |
Collapse
|
3
|
Friedrich NO, Simsir M, Kirchmair J. How Diverse Are the Protein-Bound Conformations of Small-Molecule Drugs and Cofactors? Front Chem 2018; 6:68. [PMID: 29637066 PMCID: PMC5880911 DOI: 10.3389/fchem.2018.00068] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/16/2018] [Accepted: 03/05/2018] [Indexed: 12/19/2022] Open
Abstract
Knowledge of the bioactive conformations of small molecules or the ability to predict them with theoretical methods is of key importance to the design of bioactive compounds such as drugs, agrochemicals, and cosmetics. Using an elaborate cheminformatics pipeline, which also evaluates the support of individual atom coordinates by the measured electron density, we compiled a complete set ("Sperrylite Dataset") of high-quality structures of protein-bound ligand conformations from the PDB. The Sperrylite Dataset consists of a total of 10,936 high-quality structures of 4,548 unique ligands. Based on this dataset, we assessed the variability of the bioactive conformations of 91 small molecules-each represented by a minimum of ten structures-and found it to be largely independent of the number of rotatable bonds. Sixty-nine molecules had at least two distinct conformations (defined by an RMSD greater than 1 Å). For a representative subset of 17 approved drugs and cofactors we observed a clear trend for the formation of few clusters of highly similar conformers. Even for proteins that share a very low sequence identity, ligands were regularly found to adopt similar conformations. For cofactors, a clear trend for extended conformations was measured, although in few cases also coiled conformers were observed. The Sperrylite Dataset is available for download from http://www.zbh.uni-hamburg.de/sperrylite_dataset.
Collapse
Affiliation(s)
- Nils-Ole Friedrich
- Department of Informatics, Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| | - Méliné Simsir
- Department of Informatics, Center for Bioinformatics, Universität Hamburg, Hamburg, Germany.,Molécules Thérapeutiques In Silico, Université Paris Diderot, Sorbonne Paris Cité, Paris, France
| | - Johannes Kirchmair
- Department of Informatics, Center for Bioinformatics, Universität Hamburg, Hamburg, Germany
| |
Collapse
|
4
|
Najmanovich RJ. Evolutionary studies of ligand binding sites in proteins. Curr Opin Struct Biol 2017; 45:85-90. [PMID: 27992825 DOI: 10.1016/j.sbi.2016.11.024] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2016] [Revised: 11/30/2016] [Accepted: 11/30/2016] [Indexed: 01/27/2023]
Abstract
Biological processes at their most fundamental molecular aspects are defined by molecular interactions with ligand-protein interactions in particular at the core of cellular functions such as metabolism and signalling. Divergent and convergent processes shape the evolution of ligand binding sites. The competition between similar ligands and binding sites across protein families create evolutionary pressures that affect the specificity and selectivity of interactions. This short review showcases recent studies of the evolution of ligand binding-sites and methods used to detect binding-site similarities.
Collapse
|
5
|
Abstract
The binding of drugs and reagents to off-targets is well-known. Whereas many off-targets are related to the primary target by sequence and fold, many ligands bind to unrelated pairs of proteins, and these are harder to anticipate. If the binding site in the off-target can be related to that of the primary target, this challenge resolves into aligning the two pockets. However, other cases are possible: the ligand might interact with entirely different residues and environments in the off-target, or wholly different ligand atoms may be implicated in the two complexes. To investigate these scenarios at atomic resolution, the structures of 59 ligands in 116 complexes (62 pairs in total), where the protein pairs were unrelated by fold but bound an identical ligand, were examined. In almost half of the pairs, the ligand interacted with unrelated residues in the two proteins (29 pairs), and in 14 of the pairs wholly different ligand moieties were implicated in each complex. Even in those 19 pairs of complexes that presented similar environments to the ligand, ligand superposition rarely resulted in the overlap of related residues. There appears to be no single pattern-matching "code" for identifying binding sites in unrelated proteins that bind identical ligands, though modeling suggests that there might be a limited number of different patterns that suffice to recognize different ligand functional groups.
Collapse
Affiliation(s)
- Sarah Barelier
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Teague Sterling
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Matthew J. O’Meara
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Brian K. Shoichet
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| |
Collapse
|
6
|
Zheng Z, Goncearenco A, Berezovsky IN. Nucleotide binding database NBDB--a collection of sequence motifs with specific protein-ligand interactions. Nucleic Acids Res 2015; 44:D301-7. [PMID: 26507856 PMCID: PMC4702817 DOI: 10.1093/nar/gkv1124] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2015] [Accepted: 10/14/2015] [Indexed: 11/14/2022] Open
Abstract
NBDB database describes protein motifs, elementary functional loops (EFLs) that are involved in binding of nucleotide-containing ligands and other biologically relevant cofactors/coenzymes, including ATP, AMP, ATP, GMP, GDP, GTP, CTP, PAP, PPS, FMN, FAD(H), NAD(H), NADP, cAMP, cGMP, c-di-AMP and c-di-GMP, ThPP, THD, F-420, ACO, CoA, PLP and SAM. The database is freely available online at http://nbdb.bii.a-star.edu.sg. In total, NBDB contains data on 249 motifs that work in interactions with 24 ligands. Sequence profiles of EFL motifs were derived de novo from nonredundant Uniprot proteome sequences. Conserved amino acid residues in the profiles interact specifically with distinct chemical parts of nucleotide-containing ligands, such as nitrogenous bases, phosphate groups, ribose, nicotinamide, and flavin moieties. Each EFL profile in the database is characterized by a pattern of corresponding ligand–protein interactions found in crystallized ligand–protein complexes. NBDB database helps to explore the determinants of nucleotide and cofactor binding in different protein folds and families. NBDB can also detect fragments that match to profiles of particular EFLs in the protein sequence provided by user. Comprehensive information on sequence, structures, and interactions of EFLs with ligands provides a foundation for experimental and computational efforts on design of required protein functions.
Collapse
Affiliation(s)
- Zejun Zheng
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore
| | | | - Igor N Berezovsky
- Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), 30 Biopolis Street, #07-01, Matrix, 138671, Singapore Department of Biological Sciences (DBS), National University of Singapore (NUS), 8 Medical Drive, 117579, Singapore
| |
Collapse
|
7
|
Park S, Yoo S, Kim J, An HT, Kang M, Ko J. 14-3-3β and γ differentially regulate peroxisome proliferator activated receptor γ2 transactivation and hepatic lipid metabolism. Biochim Biophys Acta. 2015;1849:1237-1247. [PMID: 26260846 DOI: 10.1016/j.bbagrm.2015.08.002] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Subscribe] [Scholar Register] [Received: 01/02/2015] [Revised: 08/04/2015] [Accepted: 08/06/2015] [Indexed: 02/06/2023]
Abstract
Peroxisome proliferator activated receptor (PPAR) γ2 plays important roles in glucose and lipid metabolism in hepatocytes. PPARγ2 is involved in metabolic disorders, including obesity, diabetes, and fatty liver disease. Although the 14-3-3 proteins participate in a variety of cell signal pathways, the roles of the 14-3-3 proteins in regulating PPARγ2 transactivation and hepatic lipid metabolism are unknown. We identified 14-3-3β and γ as PPARγ2 transcriptional regulators. We found that 14-3-3β and γ competitively interacted with the phosphorylated Ser273 of PPARγ2, which is important for regulating glucose and lipid metabolism. 14-3-3β increased the transcriptional activity of PPARγ2 and enhanced the expression levels of PPARγ2 target genes involved in lipogenesis and lipid transport. In contrast, 14-3-3γ decreased PPARγ2 transactivation and reduced the expression levels of PPARγ2 target genes. A high concentration of free fatty acids increased PPARγ2 expression and lipid accumulation. 14-3-3β enhanced hepatic lipogenesis, which is a major symptom of non-alcoholic fatty liver disease. However, 14-3-3γ suppressed hepatic lipid accumulation in the presence of high free fatty acids. These findings indicate that 14-3-3β and γ are novel PPARγ2 regulators and are involved in hepatic lipid metabolism. 14-3-3β and γ can be therapeutic target molecules to treat non-alcoholic fatty liver disease.
Collapse
|
8
|
Abstract
The comparison of protein binding sites is a prominent task in computational chemistry and has been studied in many different ways. For the automatic detection and comparison of putative binding cavities the Cavbase system has been developed which uses a coarse-grained set of pseudocenters to represent the physicochemical properties of a binding site and employs a graph-based procedure to calculate similarities between two binding sites. However, the comparison of two graphs is computationally quite demanding which makes large-scale studies such as the rapid screening of entire databases hardly feasible. In a recent work, we proposed the method Local Cliques (LC) for the efficient comparison of Cavbase binding sites. It employs a clique heuristic to detect the maximum common subgraph of two binding sites and an extended graph model to additionally compare the shape of individual surface patches. In this study, we present an alternative to further accelerate the LC method by partitioning the binding-site graphs into disjoint components prior to their comparisons. The pseudocenter sets are split with regard to their assigned phyiscochemical type, which leads to seven much smaller graphs than the original one. Applying this approach on the same test scenarios as in the former comprehensive way results in a significant speed-up without sacrificing accuracy.
Collapse
Affiliation(s)
- Timo Krotzky
- Department of Pharmaceutical Chemistry, Philipps-Universität, 35032 Marburg, Germany
| | - Gerhard Klebe
- Department of Pharmaceutical Chemistry, Philipps-Universität, 35032 Marburg, Germany.
| |
Collapse
|
9
|
Krotzky T, Grunwald C, Egerland U, Klebe G. Large-scale mining for similar protein binding pockets: with RAPMAD retrieval on the fly becomes real. J Chem Inf Model 2014; 55:165-79. [PMID: 25474400 DOI: 10.1021/ci5005898] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
Determination of structural similarities between protein binding pockets is an important challenge in in silico drug design. It can help to understand selectivity considerations, predict unexpected ligand cross-reactivity, and support the putative annotation of function to orphan proteins. To this end, Cavbase was developed as a tool for the automated detection, storage, and classification of putative protein binding sites. In this context, binding sites are characterized as sets of pseudocenters, which denote surface-exposed physicochemical properties, and can be used to enable mutual binding site comparisons. However, these comparisons tend to be computationally very demanding and often lead to very slow computations of the similarity measures. In this study, we propose RAPMAD (RApid Pocket MAtching using Distances), a new evaluation formalism for Cavbase entries that allows for ultrafast similarity comparisons. Protein binding sites are represented by sets of distance histograms that are both generated and compared with linear complexity. Attaining a speed of more than 20 000 comparisons per second, screenings across large data sets and even entire databases become easily feasible. We demonstrate the discriminative power and the short runtime by performing several classification and retrieval experiments. RAPMAD attains better success rates than the comparison formalism originally implemented into Cavbase or several alternative approaches developed in recent time, while requiring only a fraction of their runtime. The pratical use of our method is finally proven by a successful prospective virtual screening study that aims for the identification of novel inhibitors of the NMDA receptor.
Collapse
Affiliation(s)
- Timo Krotzky
- Department of Pharmaceutical Chemistry, Philipps-Universität Marburg , Marbacher Weg 6-10, 35032 Marburg, Germany
| | | | | | | |
Collapse
|
10
|
Kuppuraj G, Kruise D, Yura K. Conformational behavior of flavin adenine dinucleotide: conserved stereochemistry in bound and free states. J Phys Chem B 2014; 118:13486-97. [PMID: 25389798 DOI: 10.1021/jp507629n] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Metabolic enzymes utilize the cofactor flavin adenine dinucleotide (FAD) to catalyze essential biochemical reactions. Because these enzymes have been implicated in disease pathways, it will be necessary to target them via FAD-based structural analogues that can either activate/inhibit the enzymatic activity. To achieve this, it is important to explore the conformational space of FAD in the enzyme-bound and free states. Herein, we analyze X-ray crystallographic data of the enzyme-bound FAD conformations and sample conformations of the molecule in explicit water by molecular dynamics (MD) simulations. Enzyme-bound FAD conformations segregate into five distinct groups based on dihedral angle principal component analysis (PCA). A notable feature in the bound FADs is that the adenine base and isoalloxazine ring are oppositely oriented relative to the pyrophosphate axis characterized by near trans hypothetical dihedral angle "δV" values. Not surprisingly, MD simulations in water show final compact but not perfectly stacked ring structures in FAD. Simulation data did not reveal noticeable changes in overall conformational dynamics of the dinucleotide in reduced and oxidized forms and in the presence and/or absence of ions. During unfolding-folding dynamics, the riboflavin moiety is more flexible than the adenosine monophosphate group in the molecule. Conversely, the isoalloxazine ring is more stable than the variable adenine base. The pyrophosphate group depicts an unusually highly organized fluctuation illustrated by its dihedral angle distribution. Conformations sampled from enzymes and MD are quantified. The extent to which the protein shifts the distribution from the unbound state is discussed in terms of prevalent FAD shapes and dihedral angle population.
Collapse
Affiliation(s)
- Gopi Kuppuraj
- Center for Informational Biology, Ochanomizu University , 2-1-1 Otsuka, Bunkyo, Tokyo 112-8610, Japan
| | | | | |
Collapse
|
11
|
Krotzky T, Fober T, Hüllermeier E, Klebe G. Extended Graph-Based Models for Enhanced Similarity Search in Cavbase. IEEE/ACM Trans Comput Biol Bioinform 2014; 11:878-890. [PMID: 26356860 DOI: 10.1109/tcbb.2014.2325020] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
To calculate similarities between molecular structures, measures based on the maximum common subgraph are frequently applied. For the comparison of protein binding sites, these measures are not fully appropriate since graphs representing binding sites on a detailed atomic level tend to get very large. In combination with an NP-hard problem, a large graph leads to a computationally demanding task. Therefore, for the comparison of binding sites, a less detailed coarse graph model is used building upon so-called pseudocenters. Consistently, a loss of structural data is caused since many atoms are discarded and no information about the shape of the binding site is considered. This is usually resolved by performing subsequent calculations based on additional information. These steps are usually quite expensive, making the whole approach very slow. The main drawback of a graph-based model solely based on pseudocenters, however, is the loss of information about the shape of the protein surface. In this study, we propose a novel and efficient modeling formalism that does not increase the size of the graph model compared to the original approach, but leads to graphs containing considerably more information assigned to the nodes. More specifically, additional descriptors considering surface characteristics are extracted from the local surface and attributed to the pseudocenters stored in Cavbase. These properties are evaluated as additional node labels, which lead to a gain of information and allow for much faster but still very accurate comparisons between different structures.
Collapse
|
12
|
Giangreco I, Packer MJ. Pharmacophore binding motifs for nicotinamide adenine dinucleotide analogues across multiple protein families: a detailed contact-based analysis of the interaction between proteins and NAD(P) cofactors. J Med Chem 2013; 56:6175-89. [PMID: 23889609 DOI: 10.1021/jm400644z] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
Abstract
We have analyzed the protein-binding pharmacophore of NAD and its close analogues in all protein-ligand structures available in the RCSB database as of February 2012; this analysis has then been used to assess the novelty of structures emerging after that date. We show that proteins have evolved diverse pharmacophore motifs for binding the adenine moiety, fewer, but still diverse, motifs for nicotinamide, and a very limited set of motifs for binding the pyrophosphate linker. Our exhaustive analysis includes a pharmacophore contact analysis for over 1900 protein-ligand structures containing NAD analogues; we have benchmarked this set of contacts against nearly 27 000 protein-ligand structures to demonstrate that the diversity of interactions seen with NAD is very similar to that seen for all other ligands. Hence, variation in binding motifs for NAD is not distinct from that observed for other ligands and they show significant variation across protein families.
Collapse
Affiliation(s)
- Ilenia Giangreco
- AstraZeneca, Mereside, Alderley Park, Macclesfield SK10 4TG, UK.
| | | |
Collapse
|
13
|
Vulpetti A, Kalliokoski T, Milletti F. Chemogenomics in drug discovery: computational methods based on the comparison of binding sites. Future Med Chem 2012; 4:1971-9. [PMID: 23088277 DOI: 10.4155/fmc.12.147] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Novel computational methods for understanding relationships between ligands and all possible biological targets have emerged in recent years. Proteins are connected to each other based on the similarity of their ligands or based on the similarity of their binding sites. The assumption is that compounds sharing chemical similarity should share targets and that targets with a similar binding site should also share ligands. A large number of computational techniques have been developed to assess ligand and binding site similarity, which can be used to make quantitative predictions of the most probable biological target of a given compound. This review covers the recent advances in new computational methods for relating biological targets based on the similarity of their binding sites. Binding site comparisons are used for the prediction of their most likely ligands, their possible cross reactivity and selectivity. These comparisons can also be used to infer the function of novel uncharacterized proteins.
Collapse
|
14
|
Abstract
Although two binding sites might be dissimilar overall, they might still bind the same fragments if they share suitable subpockets. Information about shared subpockets can be therefore used in fragment-based drug design to suggest new fragments or to replace existing fragments within an already known compound. A novel computational method called SubCav is described which allows the similarity searching and alignment of subpockets from a PDB-wide database against a user-defined query. The method is based on pharmacophoric fingerprints combined with a subpocket alignment algorithm. SubCav was shown to be effective in producing reasonable alignments for subpockets with low sequence similarity and be able to retrieve relevant subpockets from a large database of structures including those with different folds. It can also be used to analyze subpockets inside a protein family to facilitate drug design and to rationalize compound selectivity.
Collapse
Affiliation(s)
- Tuomo Kalliokoski
- Novartis Institutes for Biomedical Research, Postfach, CH-4002 Basel, Switzerland
| | | | | |
Collapse
|