1
|
Yadav RS, Das C, Chakrabarti R. Dynamics of a spherical self-propelled tracer in a polymeric medium: interplay of self-propulsion, stickiness, and crowding. SOFT MATTER 2023; 19:689-700. [PMID: 36598025 DOI: 10.1039/d2sm01626e] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/17/2023]
Abstract
We employ computer simulations to study the dynamics of a self-propelled spherical tracer particle in a viscoelastic medium, made of a long polymer chain. Here, the interplay between viscoelasticity, stickiness, and activity (self-propulsion) brings additional complexity to the tracer dynamics. Our simulations show that on increasing the stickiness of the tracer particle to the polymer beads, the dynamics of the tracer particle slows down as it gets stuck to the polymer chain and moves along with it. But with increasing self-propulsion velocity, the dynamics gets enhanced. In the case of increasing stickiness as well as activity, the non-Gaussian parameter (NGP) exhibits non-monotonic behavior, which also shows up in the re-scaled self part of the van-Hove function. Non-Gaussianity results owing to the enhanced binding events and the sticky motion of the tracer along with the chain with increasing stickiness. On the other hand, with increasing activity, initially non-Gaussianity increases as the tracer moves through the heterogeneous polymeric environment but for higher activity, the tracer escapes resulting in a negative NGP. For higher values of stickiness, the trapping time distributions of the passive tracer particle broaden and have long tails. On the other hand, for a given stickiness with increasing self-propulsion force, the trapping time distributions become narrower and have short tails. We believe that our current simulation study will be helpful in elucidating the complex motion of activity-driven probes in viscoelastic media.
Collapse
Affiliation(s)
- Ramanand Singh Yadav
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai 400076, India.
| | - Chintu Das
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai 400076, India.
| | - Rajarshi Chakrabarti
- Department of Chemistry, Indian Institute of Technology Bombay, Mumbai 400076, India.
| |
Collapse
|
2
|
Ramírez-Velásquez I, Bedoya-Calle ÁH, Vélez E, Caro-Lopera FJ. Shape Theory Applied to Molecular Docking and Automatic Localization of Ligand Binding Pockets in Large Proteins. ACS OMEGA 2022; 7:45991-46002. [PMID: 36570297 PMCID: PMC9773186 DOI: 10.1021/acsomega.2c02227] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/10/2022] [Accepted: 10/11/2022] [Indexed: 06/17/2023]
Abstract
Automatic search of cavities and binding mode analysis between a ligand and a 3D protein receptor are challenging problems in drug design or repositioning. We propose a solution based on a shape theory theorem for an invariant coupled system of ligand-protein. The theorem provides a matrix representation with the exact formulas to be implemented in an algorithm. The method involves the following results: (1) exact formulae for the shape coordinates of a located-rotated invariant coupled system; (2) a parameterized search based on a suitable domain of van der Waals radii; (3) a scoring function for the discrimination of sites by measuring the distance between two invariant coupled systems including the atomic mass; (4) a matrix representation of the Lennard-Jones potential type 6-12 and 6-10 as the punctuation function of the algorithm for a molecular docking; and (5) the optimal molecular docking as a solution of an optimization problem based on the exploration of an exhaustive set of rotations. We apply the method in the xanthine oxidase protein with the following ligands: hypoxanthine, febuxostat, and chlorogenic acid. The results show automatic cavity detection and molecular docking not assisted by experts with meaningful amino acid interactions. The method finds better affinities than the expert software for known published cavities.
Collapse
Affiliation(s)
- Iliana Ramírez-Velásquez
- Faculty
of Exact and Applied Sciences, Instituto
Tecnológico Metropolitano ITM, Cll. 73 # 76A-354, Medellín050034, Colombia
- Doctorate
in Modeling and Scientific Computing, Faculty of Basic Sciences, University of Medellin, Medellin050026, Colombia
| | - Álvaro H. Bedoya-Calle
- Faculty
of Basic Sciences, University of Medellin, Cra. 87 # 30-65, Medellín050026, Colombia
| | - Ederley Vélez
- Faculty
of Basic Sciences, University of Medellin, Cra. 87 # 30-65, Medellín050026, Colombia
| | | |
Collapse
|
3
|
Scott O, Gu J, Chan AE. Classification of Protein-Binding Sites Using a Spherical Convolutional Neural Network. J Chem Inf Model 2022; 62:5383-5396. [PMID: 36341715 PMCID: PMC9709917 DOI: 10.1021/acs.jcim.2c00832] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The analysis and comparison of protein-binding sites aid various applications in the drug discovery process, e.g., hit finding, drug repurposing, and polypharmacology. Classification of binding sites has been a hot topic for the past 30 years, and many different methods have been published. The rapid development of machine learning computational algorithms, coupled with the large volume of publicly available protein-ligand 3D structures, makes it possible to apply deep learning techniques in binding site comparison. Our method uses a cutting-edge spherical convolutional neural network based on the DeepSphere architecture to learn global representations of protein-binding sites. The model was trained on TOUGH-C1 and TOUGH-M1 data and validated with the ProSPECCTs datasets. Our results show that our model can (1) perform well in protein-binding site similarity and classification tasks and (2) learn and separate the physicochemical properties of binding sites. Lastly, we tested the model on a set of kinases, where the results show that it is able to cluster the different kinase subfamilies effectively. This example demonstrates the method's promise for lead hopping within or outside a protein target, directly based on binding site information.
Collapse
|
4
|
Issa NT, Byers SW, Dakshanamurthy S. ES-Screen: A Novel Electrostatics-Driven Method for Drug Discovery Virtual Screening. Int J Mol Sci 2022; 23:ijms232314830. [PMID: 36499162 PMCID: PMC9736079 DOI: 10.3390/ijms232314830] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2022] [Revised: 11/23/2022] [Accepted: 11/24/2022] [Indexed: 12/02/2022] Open
Abstract
Electrostatic interactions drive biomolecular interactions and associations. Computational modeling of electrostatics in biomolecular systems, such as protein-ligand, protein-protein, and protein-DNA, has provided atomistic insights into the binding process. In drug discovery, finding biologically plausible ligand-protein target interactions is challenging as current virtual screening and adjuvant techniques such as docking methods do not provide optimal treatment of electrostatic interactions. This study describes a novel electrostatics-driven virtual screening method called 'ES-Screen' that performs well across diverse protein target systems. ES-Screen provides a unique treatment of electrostatic interaction energies independent of total electrostatic free energy, typically employed by current software. Importantly, ES-Screen uses initial ligand pose input obtained from a receptor-based pharmacophore, thus independent of molecular docking. ES-Screen integrates individual polar and nonpolar replacement energies, which are the energy costs of replacing the cognate ligand for a target with a query ligand from the screening. This uniquely optimizes thermodynamic stability in electrostatic and nonpolar interactions relative to an experimentally determined stable binding state. ES-Screen also integrates chemometrics through shape and other physicochemical properties to prioritize query ligands with the greatest physicochemical similarities to the cognate ligand. The applicability of ES-Screen is demonstrated with in vitro experiments by identifying novel targets for many drugs. The present version includes a combination of many other descriptor components that, in a future version, will be purely based on electrostatics. Therefore, ES-Screen is a first-in-class unique electrostatics-driven virtual screening method with a unique implementation of replacement electrostatic interaction energies with broad applicability in drug discovery.
Collapse
|
5
|
Scheck A, Rosset S, Defferrard M, Loukas A, Bonet J, Vandergheynst P, Correia BE. RosettaSurf—A surface-centric computational design approach. PLoS Comput Biol 2022; 18:e1009178. [PMID: 35294435 PMCID: PMC9015148 DOI: 10.1371/journal.pcbi.1009178] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2021] [Revised: 04/18/2022] [Accepted: 02/21/2022] [Indexed: 11/19/2022] Open
Abstract
Proteins are typically represented by discrete atomic coordinates providing an accessible framework to describe different conformations. However, in some fields proteins are more accurately represented as near-continuous surfaces, as these are imprinted with geometric (shape) and chemical (electrostatics) features of the underlying protein structure. Protein surfaces are dependent on their chemical composition and, ultimately determine protein function, acting as the interface that engages in interactions with other molecules. In the past, such representations were utilized to compare protein structures on global and local scales and have shed light on functional properties of proteins. Here we describe RosettaSurf, a surface-centric computational design protocol, that focuses on the molecular surface shape and electrostatic properties as means for protein engineering, offering a unique approach for the design of proteins and their functions. The RosettaSurf protocol combines the explicit optimization of molecular surface features with a global scoring function during the sequence design process, diverging from the typical design approaches that rely solely on an energy scoring function. With this computational approach, we attempt to address a fundamental problem in protein design related to the design of functional sites in proteins, even when structurally similar templates are absent in the characterized structural repertoire. Surface-centric design exploits the premise that molecular surfaces are, to a certain extent, independent of the underlying sequence and backbone configuration, meaning that different sequences in different proteins may present similar surfaces. We benchmarked RosettaSurf on various sequence recovery datasets and showcased its design capabilities by generating epitope mimics that were biochemically validated. Overall, our results indicate that the explicit optimization of surface features may lead to new routes for the design of functional proteins. Finely orchestrated protein-protein interactions are at the heart of virtually all fundamental cellular processes. Altering these processes or encoding new functions in proteins has been a long-standing goal in computational protein design. Protein design methods commonly rely on scoring functions that seek to identify amino acid sequences that optimize structural configurations of atoms while minimizing a variety of physics-based and statistical terms. The objectives of the large majority of computational design protocols have been focused on obtaining a predefined structural conformation. However, routinely introducing a functional aspect on designer proteins has been more challenging. Our results suggest that the molecular surface features can be a useful optimization parameter to guide the design process towards functional surfaces that mimic known protein binding sites and interact with their intended targets. Specifically, we demonstrate that our design method can optimize experimental libraries through computational screening, creating a basis for highly specific protein binders, as well as design a potent immunogen that engages with site-specific antibodies. The ability to create proteins with novel functions will be transformative for biomedical applications, providing many opportunities for the design of novel immunogens, protein components for synthetic biology, and other protein-based biotechnologies.
Collapse
Affiliation(s)
- Andreas Scheck
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Stéphane Rosset
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Michaël Defferrard
- Signal Processing Laboratory (LTS2), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Andreas Loukas
- Signal Processing Laboratory (LTS2), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Jaume Bonet
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
| | - Pierre Vandergheynst
- Signal Processing Laboratory (LTS2), École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
| | - Bruno E. Correia
- Institute of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland
- Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
- * E-mail:
| |
Collapse
|
6
|
Carrillo-Cabada H, Benson J, Razavi AM, Mulligan B, Cuendet MA, Weinstein H, Taufer M, Estrada T. A Graphic Encoding Method for Quantitative Classification of Protein Structure and Representation of Conformational Changes. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2021; 18:1336-1349. [PMID: 31603792 PMCID: PMC9119144 DOI: 10.1109/tcbb.2019.2945291] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
In order to successfully predict a proteins function throughout its trajectory, in addition to uncovering changes in its conformational state, it is necessary to employ techniques that maintain its 3D information while performing at scale. We extend a protein representation that encodes secondary and tertiary structure into fix-sized, color images, and a neural network architecture (called GEM-net) that leverages our encoded representation. We show the applicability of our method in two ways: (1) performing protein function prediction, hitting accuracy between 78 and 83 percent, and (2) visualizing and detecting conformational changes in protein trajectories during molecular dynamics simulations.
Collapse
|
7
|
Xu M, Ran T, Chen H. De Novo Molecule Design Through the Molecular Generative Model Conditioned by 3D Information of Protein Binding Sites. J Chem Inf Model 2021; 61:3240-3254. [PMID: 34197105 DOI: 10.1021/acs.jcim.0c01494] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
De novo molecule design through the molecular generative model has gained increasing attention in recent years. Here, a novel generative model was proposed by integrating the three-dimensional (3D) structural information of the protein binding pocket into the conditional RNN (cRNN) model to control the generation of drug-like molecules. In this model, the composition of the protein binding pocket is effectively characterized through a coarse-grain strategy and the 3D information of the pocket can be represented by the sorted eigenvalues of the Coulomb matrix (EGCM) of the coarse-grained atoms composing the binding pocket. In current work, we used our EGCM method and a previously reported binding pocket descriptor, DeeplyTough, to train cRNN models and evaluated their performance. It has been shown that the model trained with the constraint of protein environment information has a clear tendency on generating compounds with higher similarity to the original X-ray-bound ligand than the normal RNN model and also better docking scores. Our results demonstrate the potential application of the controlled generative model for the targeted molecule generation and guided exploration on the drug-like chemical space.
Collapse
Affiliation(s)
- Mingyuan Xu
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health-Guangdong Laboratory), Guangzhou 510530, P. R. China
| | - Ting Ran
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health-Guangdong Laboratory), Guangzhou 510530, P. R. China
| | - Hongming Chen
- Bioland Laboratory (Guangzhou Regenerative Medicine and Health-Guangdong Laboratory), Guangzhou 510530, P. R. China
| |
Collapse
|
8
|
Bhadra A, Yeturu K. Site2Vec: a reference frame invariant algorithm for vector embedding of protein–ligand binding sites. MACHINE LEARNING: SCIENCE AND TECHNOLOGY 2021. [DOI: 10.1088/2632-2153/abad88] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Abstract
Protein–ligand interactions are one of the fundamental types of molecular interactions in living systems. Ligands are small molecules that interact with protein molecules at specific regions on their surfaces called binding sites. Binding sites would also determine ADMET properties of a drug molecule. Tasks such as assessment of protein functional similarity and detection of side effects of drugs need identification of similar binding sites of disparate proteins across diverse pathways. To this end, methods for computing similarities between binding sites are still evolving and is an active area of research even today. Machine learning methods for similarity assessment require feature descriptors of binding sites. Traditional methods based on hand engineered motifs and atomic configurations are not scalable across several thousands of sites. In this regard, deep neural network algorithms are now deployed which can capture very complex input feature space. However, one fundamental challenge in applying deep learning to structures of binding sites is the input representation and the reference frame. We report here a novel algorithm, Site2Vec, that derives reference frame invariant vector embedding of a protein–ligand binding site. The method is based on pairwise distances between representative points and chemical compositions in terms of constituent amino acids of a site. The vector embedding serves as a locality sensitive hash function for proximity queries and determining similar sites. The method has been the top performer with more than 95% quality scores in extensive benchmarking studies carried over 10 data sets and against 23 other site comparison methods in the field. The algorithm serves for high throughput processing and has been evaluated for stability with respect to reference frame shifts, coordinate perturbations and residue mutations. We also provide the method as a standalone executable and a web service hosted at (http://services.iittp.ac.in/bioinfo/home).
Collapse
|
9
|
Bajardi F, Altucci L, Benedetti R, Capozziello S, Sorbo MRD, Franci G, Altucci C. DNA Mutations via Chern-Simons Currents. EUROPEAN PHYSICAL JOURNAL PLUS 2021; 136:1080. [PMID: 34725629 PMCID: PMC8551353 DOI: 10.1140/epjp/s13360-021-01960-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 09/13/2021] [Indexed: 05/04/2023]
Abstract
We test the validity of a possible schematization of DNA structure and dynamics based on the Chern-Simons theory, that is a topological field theory mostly considered in the context of effective gravity theories. By means of the expectation value of the Wilson Loop, derived from this analogue gravity approach, we find the point-like curvature of genomic strings in KRAS human gene and COVID-19 sequences, correlating this curvature with the genetic mutations. The point-like curvature profile, obtained by means of the Chern-Simons currents, can be used to infer the position of the given mutations within the genetic string. Generally, mutations take place in the highest Chern-Simons current gradient locations and subsequent mutated sequences appear to have a smoother curvature than the initial ones, in agreement with a free energy minimization argument.
Collapse
Affiliation(s)
- Francesco Bajardi
- Dipartimento di Fisica “Ettore Pancini”, Università degli Studi di Napoli“Federico II”, Compl. Univ. di Monte S. Angelo, Edificio G, Via Cinthia, 80126 Napoli, Italy
- INFN Sezione di Napoli, Compl. Univ. di Monte S. Angelo, Edificio G, Via Cinthia, 80126 Napoli, Italy
| | - Lucia Altucci
- Dipartimento di Medicina di Precisione, Università degli Studi della Campania “L. Vanvitelli”, Napoli, Italy
- Biogem “Istituto di Biologia molecolare e genetica”, 83031 Ariano Irpino, Italy
| | - Rosaria Benedetti
- Dipartimento di Medicina di Precisione, Università degli Studi della Campania “L. Vanvitelli”, Napoli, Italy
| | - Salvatore Capozziello
- Dipartimento di Fisica “Ettore Pancini”, Università degli Studi di Napoli“Federico II”, Compl. Univ. di Monte S. Angelo, Edificio G, Via Cinthia, 80126 Napoli, Italy
- INFN Sezione di Napoli, Compl. Univ. di Monte S. Angelo, Edificio G, Via Cinthia, 80126 Napoli, Italy
- Scuola Superiore Meridionale, Largo San Marcellino 10, 80138 Napoli, Italy
| | - Maria Rosaria Del Sorbo
- Istituto Statale d’Istruzione Superiore “Leonardo da Vinci”, via F. Turati Poggiomarino, Naples, Italy
- Dipartimento di Ingegneria Industriale, Università degli Studi di Napoli“Federico II”, Via Claudio n.21, 80125 Napoli, Italy
| | - Gianluigi Franci
- Department of Medicine, Surgery and Dentistry “Scuola Medica Salernitana”, University of Salerno, 84081 Baronissi, SA Italy
- Sezione Microbiologia Clinica, A.O.U. S. Giovanni di Dio e Ruggi D’Aragona, Largo Città di Ippocrate, 84131 Salerno, Italy
| | - Carlo Altucci
- INFN Sezione di Napoli, Compl. Univ. di Monte S. Angelo, Edificio G, Via Cinthia, 80126 Napoli, Italy
- Dipartimento di Scienze Biomediche Avanzate, Università degli Studi di Napoli “Federico II”, via Pansini 5, Napoli, Italy
| |
Collapse
|
10
|
Cao Y, Park SJ, Im W. A systematic analysis of protein-carbohydrate interactions in the Protein Data Bank. Glycobiology 2020; 31:126-136. [PMID: 32614943 DOI: 10.1093/glycob/cwaa062] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2020] [Revised: 06/26/2020] [Accepted: 06/26/2020] [Indexed: 12/17/2022] Open
Abstract
Protein-carbohydrate interactions underlie essential biological processes. Elucidating the mechanism of protein-carbohydrate recognition is a prerequisite for modeling and optimizing protein-carbohydrate interactions, which will help in discovery of carbohydrate-derived therapeutics. In this work, we present a survey of a curated database consisting of 6,402 protein-carbohydrate complexes in the Protein Data Bank (PDB). We performed an all-against-all comparison of a subset of nonredundant binding sites, and the result indicates that the interaction pattern similarity is not completely relevant to the binding site structural similarity. Investigation of both binding site and ligand promiscuities reveals that the geometry of chemical feature points is more important than local backbone structure in determining protein-carbohydrate interactions. A further analysis on the frequency and geometry of atomic interactions shows that carbohydrate functional groups are not equally involved in binding interactions. Finally, we discuss the usefulness of protein-carbohydrate complexes in the PDB with acknowledgement that the carbohydrates in many structures are incomplete.
Collapse
Affiliation(s)
- Yiwei Cao
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Sciences and Engineering, Lehigh University, Bethlehem, PA 18015, USA
| | - Sang-Jun Park
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Sciences and Engineering, Lehigh University, Bethlehem, PA 18015, USA
| | - Wonpil Im
- Departments of Biological Sciences, Chemistry, Bioengineering, and Computer Sciences and Engineering, Lehigh University, Bethlehem, PA 18015, USA.,School of Computational Sciences, Korea Institute for Advanced Study, Seoul 02455, Republic of Korea
| |
Collapse
|
11
|
Georgiev GD, Dodd KF, Chen BY. Precise parallel volumetric comparison of molecular surfaces and electrostatic isopotentials. Algorithms Mol Biol 2020; 15:11. [PMID: 32489400 PMCID: PMC7247173 DOI: 10.1186/s13015-020-00168-z] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2019] [Accepted: 04/15/2020] [Indexed: 11/10/2022] Open
Abstract
Geometric comparisons of binding sites and their electrostatic properties can identify subtle variations that select different binding partners and subtle similarities that accommodate similar partners. Because subtle features are central for explaining how proteins achieve specificity, algorithmic efficiency and geometric precision are central to algorithmic design. To address these concerns, this paper presents pClay, the first algorithm to perform parallel and arbitrarily precise comparisons of molecular surfaces and electrostatic isopotentials as geometric solids. pClay was presented at the 2019 Workshop on Algorithms in Bioinformatics (WABI 2019) and is described in expanded detail here, especially with regard to the comparison of electrostatic isopotentials. Earlier methods have generally used parallelism to enhance computational throughput, pClay is the first algorithm to use parallelism to make arbitrarily high precision comparisons practical. It is also the first method to demonstrate that high precision comparisons of geometric solids can yield more precise structural inferences than algorithms that use existing standards of precision. One advantage of added precision is that statistical models can be trained with more accurate data. Using structural data from an existing method, a model of steric variations between binding cavities can overlook 53% of authentic steric influences on specificity, whereas a model trained with data from pClay overlooks none. Our results also demonstrate the parallel performance of pClay on both workstation CPUs and a 61-core Xeon Phi. While slower on one core, additional processor cores rapidly outpaced single core performance and existing methods. Based on these results, it is clear that pClay has applications in the automatic explanation of binding mechanisms and in the rational design of protein binding preferences.
Collapse
Affiliation(s)
- Georgi D. Georgiev
- Department of Computer Science and Engineering, Lehigh University, 113 Research Drive, Bethlehem, PA USA
| | - Kevin F. Dodd
- Department of Computer Science and Engineering, Lehigh University, 113 Research Drive, Bethlehem, PA USA
| | - Brian Y. Chen
- Department of Computer Science and Engineering, Lehigh University, 113 Research Drive, Bethlehem, PA USA
| |
Collapse
|
12
|
Rey J, Rasolohery I, Tufféry P, Guyon F, Moroy G. PatchSearch: a web server for off-target protein identification. Nucleic Acids Res 2020; 47:W365-W372. [PMID: 31131411 PMCID: PMC6602448 DOI: 10.1093/nar/gkz478] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2019] [Revised: 04/26/2019] [Accepted: 05/21/2019] [Indexed: 01/17/2023] Open
Abstract
The large number of proteins found in the human body implies that a drug may interact with many proteins, called off-target proteins, besides its intended target. The PatchSearch web server provides an automated workflow that allows users to identify structurally conserved binding sites at the protein surfaces in a set of user-supplied protein structures. Thus, this web server may help to detect potential off-target protein. It takes as input a protein complexed with a ligand and identifies within user-defined or predefined collections of protein structures, those having a binding site compatible with this ligand in terms of geometry and physicochemical properties. It is based on a non-sequential local alignment of the patch over the entire protein surface. Then the PatchSearch web server proposes a ligand binding mode for the potential off-target, as well as an estimated affinity calculated by the Vinardo scoring function. This novel tool is able to efficiently detects potential interactions of ligands with distant off-target proteins. Furthermore, by facilitating the discovery of unexpected off-targets, PatchSearch could contribute to the repurposing of existing drugs. The server is freely available at http://bioserv.rpbs.univ-paris-diderot.fr/services/PatchSearch.
Collapse
Affiliation(s)
- Julien Rey
- Université Paris Diderot, Sorbonne Paris Cité, INSERM UMRS-973, Molécules Thérapeutiques in silico (MTi), F-75205 Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Inès Rasolohery
- Université Paris Diderot, Sorbonne Paris Cité, INSERM UMRS-973, Molécules Thérapeutiques in silico (MTi), F-75205 Paris, France
| | - Pierre Tufféry
- Université Paris Diderot, Sorbonne Paris Cité, INSERM UMRS-973, Molécules Thérapeutiques in silico (MTi), F-75205 Paris, France.,Ressource Parisienne en Bioinformatique Structurale (RPBS), Paris, France
| | - Frédéric Guyon
- Université Paris Diderot, Sorbonne Paris Cité, INSERM UMRS-973, Molécules Thérapeutiques in silico (MTi), F-75205 Paris, France
| | - Gautier Moroy
- Université Paris Diderot, Sorbonne Paris Cité, INSERM UMRS-973, Molécules Thérapeutiques in silico (MTi), F-75205 Paris, France
| |
Collapse
|
13
|
Medyukhina A, Blickensdorf M, Cseresnyés Z, Ruef N, Stein JV, Figge MT. Dynamic spherical harmonics approach for shape classification of migrating cells. Sci Rep 2020; 10:6072. [PMID: 32269257 PMCID: PMC7142146 DOI: 10.1038/s41598-020-62997-7] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2019] [Accepted: 03/24/2020] [Indexed: 11/19/2022] Open
Abstract
Cell migration involves dynamic changes in cell shape. Intricate patterns of cell shape can be analyzed and classified using advanced shape descriptors, including spherical harmonics (SPHARM). Though SPHARM have been used to analyze and classify migrating cells, such classification did not exploit SPHARM spectra in their dynamics. Here, we examine whether additional information from dynamic SPHARM improves classification of cell migration patterns. We combine the static and dynamic SPHARM approach with a support-vector-machine classifier and compare their classification accuracies. We demonstrate that the dynamic SPHARM analysis classifies cell migration patterns more accurately than the static one for both synthetic and experimental data. Furthermore, by comparing the computed accuracies with that of a naive classifier, we can identify the experimental conditions and model parameters that significantly affect cell shape. This capability should – in the future – help to pinpoint factors that play an essential role in cell migration.
Collapse
Affiliation(s)
- Anna Medyukhina
- Applied Systems Biology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute (HKI), Jena, Germany.,Center for Bioimage Informatics, St. Jude Children's Research Hospital, Memphis, TN, USA
| | - Marco Blickensdorf
- Applied Systems Biology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute (HKI), Jena, Germany
| | - Zoltán Cseresnyés
- Applied Systems Biology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute (HKI), Jena, Germany
| | - Nora Ruef
- Department of Oncology, Microbiology and Immunology, University of Fribourg, Fribourg, Switzerland
| | - Jens V Stein
- Department of Oncology, Microbiology and Immunology, University of Fribourg, Fribourg, Switzerland
| | - Marc Thilo Figge
- Applied Systems Biology, Leibniz Institute for Natural Product Research and Infection Biology - Hans Knöll Institute (HKI), Jena, Germany. .,Institute of Microbiology, Faculty of Biological Sciences, Friedrich Schiller University Jena, Jena, Germany. .,Center for Sepsis Control and Care (CSCC), Jena University Hospital, Jena, Germany.
| |
Collapse
|
14
|
Trosset JY, Cavé C. In Silico Target Druggability Assessment: From Structural to Systemic Approaches. Methods Mol Biol 2019; 1953:63-88. [PMID: 30912016 DOI: 10.1007/978-1-4939-9145-7_5] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Abstract
This chapter will focus on today's in silico direct and indirect approaches to assess therapeutic target druggability. The direct approach tries to infer from the 3D structure the capacity of the target protein to bind small molecule in order to modulate its biological function. Algorithms to recognize and characterize the quality of the ligand interaction sites whether within buried protein cavities or within large protein-protein interface will be reviewed in the first part of the paper. In the case a ligand-binding site is already identified, indirect aspects of target druggability can be assessed. These indirect approaches focus first on target promiscuity and the potential difficulties in developing specific drugs. It is based on large-scale comparison of protein-binding sites. The second aspect concerns the capacity of the target to induce resistant pathway once it is inhibited or activated by a drug. The emergence of drug-resistant pathways can be assessed through systemic analysis of biological networks implementing metabolism and/or cell regulation signaling.
Collapse
Affiliation(s)
| | - Christian Cavé
- BioCIS UFR Pharmacie UMR CNRS 8076, Université Paris Saclay, Orsay, France
| |
Collapse
|
15
|
Kumar A, Zhang KYJ. Advances in the Development of Shape Similarity Methods and Their Application in Drug Discovery. Front Chem 2018; 6:315. [PMID: 30090808 PMCID: PMC6068280 DOI: 10.3389/fchem.2018.00315] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/10/2018] [Accepted: 07/09/2018] [Indexed: 12/21/2022] Open
Abstract
Molecular similarity is a key concept in drug discovery. It is based on the assumption that structurally similar molecules frequently have similar properties. Assessment of similarity between small molecules has been highly effective in the discovery and development of various drugs. Especially, two-dimensional (2D) similarity approaches have been quite popular due to their simplicity, accuracy and efficiency. Recently, the focus has been shifted toward the development of methods involving the representation and comparison of three-dimensional (3D) conformation of small molecules. Among the 3D similarity methods, evaluation of shape similarity is now gaining attention for its application not only in virtual screening but also in molecular target prediction, drug repurposing and scaffold hopping. A wide range of methods have been developed to describe molecular shape and to determine the shape similarity between small molecules. The most widely used methods include atom distance-based methods, surface-based approaches such as spherical harmonics and 3D Zernike descriptors, atom-centered Gaussian overlay based representations. Several of these methods demonstrated excellent virtual screening performance not only retrospectively but also prospectively. In addition to methods assessing the similarity between small molecules, shape similarity approaches have been developed to compare shapes of protein structures and binding pockets. Additionally, shape comparisons between atomic models and 3D density maps allowed the fitting of atomic models into cryo-electron microscopy maps. This review aims to summarize the methodological advances in shape similarity assessment highlighting advantages, disadvantages and their application in drug discovery.
Collapse
Affiliation(s)
| | - Kam Y. J. Zhang
- Laboratory for Structural Bioinformatics, Center for Biosystems Dynamics Research, RIKEN, Yokohama, Japan
| |
Collapse
|
16
|
Melkikh AV, Meijer DK. On a generalized Levinthal's paradox: The role of long- and short range interactions in complex bio-molecular reactions, including protein and DNA folding. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2018; 132:57-79. [DOI: 10.1016/j.pbiomolbio.2017.09.018] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/18/2017] [Revised: 08/27/2017] [Accepted: 09/17/2017] [Indexed: 01/06/2023]
|
17
|
Rasolohery I, Moroy G, Guyon F. PatchSearch: A Fast Computational Method for Off-Target Detection. J Chem Inf Model 2017; 57:769-777. [PMID: 28282119 DOI: 10.1021/acs.jcim.6b00529] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Many therapeutic molecules are known to bind several proteins, which can be different from the initially targeted one. Such unexpected interactions with proteins called off-targets can lead to adverse effects. Potential off-target identification is important to predict to avoid drug side effects or to discover new targets for existing drugs. We propose a new program named PatchSearch that implements local nonsequential searching for similar binding sites on protein surfaces with a controlled amount of flexibility. It is based on detection of quasi-cliques in product graphs representing all the possible matchings between two compared structures. This method has been benchmarked on a large diversity of ligands and on five data sets ranging from 12 to more than 7000 protein structures. The experiments conducted in this study show that the PatchSearch method could be useful in the early identification of off-targets. The program and the benchmarks presented in this paper are available as an R package at https://github.com/MTiPatchSearch .
Collapse
Affiliation(s)
- Inès Rasolohery
- Molécules Thérapeutiques in Silico, UMRS 973, Université Paris Diderot, INSERM , F-75013 Paris, France
| | - Gautier Moroy
- Molécules Thérapeutiques in Silico, UMRS 973, Université Paris Diderot, INSERM , F-75013 Paris, France
| | - Frédéric Guyon
- Molécules Thérapeutiques in Silico, UMRS 973, Université Paris Diderot, INSERM , F-75013 Paris, France
| |
Collapse
|
18
|
Chen YC, Tolbert R, Aronov AM, McGaughey G, Walters WP, Meireles L. Prediction of Protein Pairs Sharing Common Active Ligands Using Protein Sequence, Structure, and Ligand Similarity. J Chem Inf Model 2016; 56:1734-45. [PMID: 27559831 DOI: 10.1021/acs.jcim.6b00118] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We benchmarked the ability of comparative computational approaches to correctly discriminate protein pairs sharing a common active ligand (positive protein pairs) from protein pairs with no common active ligands (negative protein pairs). Since the target and the off-targets of a drug share at least a common ligand, i.e., the drug itself, the prediction of positive protein pairs may help identify off-targets. We evaluated representative protein-centric and ligand-centric approaches, including (1) 2D and 3D ligand similarity, (2) several measures of protein sequence similarity in conjunction with different sequence sources (e.g., full protein sequence versus binding site residues), and (3) a newly described pocket shape similarity and alignment program called SiteHopper. While the sequence-based alignment of pocket residues achieved the best overall performance, SiteHopper outperformed sequence-based approaches for unrelated proteins with only 20-30% pocket residue identity. Analogously, among ligand-centric approaches, path-based fingerprints achieved the best overall performance, but ROCS-based ligand shape similarity outperformed path-based fingerprints for structurally dissimilar ligands (Tanimoto 25%-40%). A significant drop in recognition performance was observed for ligand-centric approaches when PDB ligands were used instead of ChEMBL ligands. Finally, we analyzed the relationship between pocket shape and ligand shape in our data set and found that similar ligands tend to bind to similar pockets while similar pockets may accept a range of different-shaped ligands.
Collapse
Affiliation(s)
- Yu-Chen Chen
- Vertex Pharmaceuticals Incorporated , 11010 Torreyana Road, San Diego, California 92121, United States
| | - Robert Tolbert
- OpenEye Scientific Software , 9 Bisbee Court, Suite D, Santa Fe, New Mexico 87508, United States
| | - Alex M Aronov
- Vertex Pharmaceuticals Incorporated , 50 Northern Avenue, Boston, Massachusetts 02210, United States
| | - Georgia McGaughey
- Vertex Pharmaceuticals Incorporated , 50 Northern Avenue, Boston, Massachusetts 02210, United States
| | - W Patrick Walters
- Vertex Pharmaceuticals Incorporated , 50 Northern Avenue, Boston, Massachusetts 02210, United States
| | - Lidio Meireles
- Vertex Pharmaceuticals Incorporated , 11010 Torreyana Road, San Diego, California 92121, United States
| |
Collapse
|
19
|
Axenopoulos A, Rafailidis D, Papadopoulos G, Houstis EN, Daras P. Similarity Search of Flexible 3D Molecules Combining Local and Global Shape Descriptors. IEEE/ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS 2016; 13:954-970. [PMID: 26561479 DOI: 10.1109/tcbb.2015.2498553] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
In this paper, a framework for shape-based similarity search of 3D molecular structures is presented. The proposed framework exploits simultaneously the discriminative capabilities of a global, a local, and a hybrid local-global shape feature to produce a geometric descriptor that achieves higher retrieval accuracy than each feature does separately. Global and hybrid features are extracted using pairwise computations of diffusion distances between the points of the molecular surface, while the local feature is based on accumulating pairwise relations among oriented surface points into local histograms. The local features are integrated into a global descriptor vector using the bag-of-features approach. Due to the intrinsic property of its constituting shape features to be invariant to articulations of the 3D objects, the framework is appropriate for similarity search of flexible 3D molecules, while at the same time it is also accurate in retrieving rigid 3D molecules. The proposed framework is evaluated in flexible and rigid shape matching of 3D protein structures as well as in shape-based virtual screening of large ligand databases with quite promising results.
Collapse
|
20
|
Issa NT, Peters OJ, Byers SW, Dakshanamurthy S. RepurposeVS: A Drug Repurposing-Focused Computational Method for Accurate Drug-Target Signature Predictions. Comb Chem High Throughput Screen 2016; 18:784-94. [PMID: 26234515 DOI: 10.2174/1386207318666150803130138] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2015] [Revised: 06/17/2015] [Accepted: 07/28/2015] [Indexed: 11/22/2022]
Abstract
We describe here RepurposeVS for the reliable prediction of drug-target signatures using X-ray protein crystal structures. RepurposeVS is a virtual screening method that incorporates docking, drug-centric and protein-centric 2D/3D fingerprints with a rigorous mathematical normalization procedure to account for the variability in units and provide high-resolution contextual information for drug-target binding. Validity was confirmed by the following: (1) providing the greatest enrichment of known drug binders for multiple protein targets in virtual screening experiments, (2) determining that similarly shaped protein target pockets are predicted to bind drugs of similar 3D shapes when RepurposeVS is applied to 2,335 human protein targets, and (3) determining true biological associations in vitro for mebendazole (MBZ) across many predicted kinase targets for potential cancer repurposing. Since RepurposeVS is a drug repurposing-focused method, benchmarking was conducted on a set of 3,671 FDA approved and experimental drugs rather than the Database of Useful Decoys (DUDE) so as to streamline downstream repurposing experiments. We further apply RepurposeVS to explore the overall potential drug repurposing space for currently approved drugs. RepurposeVS is not computationally intensive and increases performance accuracy, thus serving as an efficient and powerful in silico tool to predict drug-target associations in drug repurposing.
Collapse
|
21
|
Gao Z, Rostami R, Pang X, Fu Z, Yu Z. Mesh Generation and Flexible Shape Comparisons for Bio-Molecules. COMPUTATIONAL AND MATHEMATICAL BIOPHYSICS 2016. [DOI: 10.1515/mlbmb-2016-0001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
AbstractNovel approaches for generating and comparing flexible (non-rigid) molecular surface meshes are
developed. The mesh-generating method is fast and memory-efficient. The resulting meshes are smooth and
accurate, and possess high mesh quality. An isometric-invariant shape descriptor based on the Laplace-
Beltrami operator is then explored for mesh comparing. The new shape descriptor is more powerful in discriminating
different surface shapes but rely only on a small set of signature values. The shape descriptor is
applied to shape comparison between molecules with deformed structures. The proposed methods are implemented
into a program that can be used as a stand-alone software tool or as a plug-in to other existing
molecular modeling tools. Particularly, the code is encapsulated into a software toolkit with a user-friendly
graphical interface developed by the authors.
Collapse
Affiliation(s)
- Zhanheng Gao
- 2College of Computer Science and Technology, Jilin University, 2699 Qianjin Street, 130012 Changchun, China
| | - Reihaneh Rostami
- 1Department of Computer Science, University of Wisconsin at Milwaukee, 3200 North Cramer Street, 53211 Milwaukee, United States
| | - Xiaoli Pang
- 3The First Hospital of Jilin University 71 Xinmin Street, 130021 Changchun, China
| | - Zhicheng Fu
- 1Department of Computer Science, University of Wisconsin at Milwaukee, 3200 North Cramer Street, 53211 Milwaukee, United States
| | - Zeyun Yu
- 1Department of Computer Science, University of Wisconsin at Milwaukee, 3200 North Cramer Street, 53211 Milwaukee, United States
| |
Collapse
|
22
|
Yao B, Li Z, Ding M, Chen M. Three-dimensional protein model similarity analysis based on salient shape index. BMC Bioinformatics 2016; 17:131. [PMID: 26987968 PMCID: PMC4797110 DOI: 10.1186/s12859-016-0983-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 03/09/2016] [Indexed: 11/25/2022] Open
Abstract
Background Proteins play a special role in bioinformatics. The surface shape of a protein, which is an important characteristic of the protein, defines a geometric and biochemical domain where the protein interacts with other proteins. The similarity analysis among protein models has become an important topic of protein analysis, by which it can reveal the structure and the function of proteins. Results In this paper, a new protein similarity analysis method based on three-dimensional protein models is proposed. It constructs a feature matrix descriptor for each protein model combined by calculating the shape index (SI) and the related salient geometric feature (SGF), and then analyzes the protein model similarity by using this feature matrix and the extended grey relation analysis. Conclusions We compare our method to the Multi-resolution Reeb Graph (MRG) skeleton method, the L1-medial skeleton method and the local-diameter descriptor method. Experimental results show that our protein similarity analysis method is accurate and reliable while keeping the high computational efficiency.
Collapse
Affiliation(s)
- Bo Yao
- Departments of Mathematical Sciences, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Zhong Li
- Departments of Mathematical Sciences, Zhejiang Sci-Tech University, Hangzhou, 310018, China.
| | - Meng Ding
- Departments of Mathematical Sciences, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| | - Minhong Chen
- Departments of Mathematical Sciences, Zhejiang Sci-Tech University, Hangzhou, 310018, China
| |
Collapse
|
23
|
High Throughput Profiling of Molecular Shapes in Crystals. Sci Rep 2016; 6:22204. [PMID: 26908351 PMCID: PMC4764928 DOI: 10.1038/srep22204] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2015] [Accepted: 02/09/2016] [Indexed: 11/09/2022] Open
Abstract
Molecular shape is important in both crystallisation and supramolecular assembly, yet its role is not completely understood. We present a computationally efficient scheme to describe and classify the molecular shapes in crystals. The method involves rotation invariant description of Hirshfeld surfaces in terms of of spherical harmonic functions. Hirshfeld surfaces represent the boundaries of a molecule in the crystalline environment, and are widely used to visualise and interpret crystalline interactions. The spherical harmonic description of molecular shapes are compared and classified by means of principal component analysis and cluster analysis. When applied to a series of metals, the method results in a clear classification based on their lattice type. When applied to around 300 crystal structures comprising of series of substituted benzenes, naphthalenes and phenylbenzamide it shows the capacity to classify structures based on chemical scaffolds, chemical isosterism, and conformational similarity. The computational efficiency of the method is demonstrated with an application to over 14 thousand crystal structures. High throughput screening of molecular shapes and interaction surfaces in the Cambridge Structural Database (CSD) using this method has direct applications in drug discovery, supramolecular chemistry and materials design.
Collapse
|
24
|
Nakamura T, Tomii K. Protein ligand-binding site comparison by a reduced vector representation derived from multidimensional scaling of generalized description of binding sites. Methods 2016; 93:35-40. [DOI: 10.1016/j.ymeth.2015.08.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2015] [Revised: 07/25/2015] [Accepted: 08/10/2015] [Indexed: 11/25/2022] Open
|
25
|
Barelier S, Sterling T, O’Meara MJ, Shoichet BK. The Recognition of Identical Ligands by Unrelated Proteins. ACS Chem Biol 2015; 10:2772-84. [PMID: 26421501 DOI: 10.1021/acschembio.5b00683] [Citation(s) in RCA: 58] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The binding of drugs and reagents to off-targets is well-known. Whereas many off-targets are related to the primary target by sequence and fold, many ligands bind to unrelated pairs of proteins, and these are harder to anticipate. If the binding site in the off-target can be related to that of the primary target, this challenge resolves into aligning the two pockets. However, other cases are possible: the ligand might interact with entirely different residues and environments in the off-target, or wholly different ligand atoms may be implicated in the two complexes. To investigate these scenarios at atomic resolution, the structures of 59 ligands in 116 complexes (62 pairs in total), where the protein pairs were unrelated by fold but bound an identical ligand, were examined. In almost half of the pairs, the ligand interacted with unrelated residues in the two proteins (29 pairs), and in 14 of the pairs wholly different ligand moieties were implicated in each complex. Even in those 19 pairs of complexes that presented similar environments to the ligand, ligand superposition rarely resulted in the overlap of related residues. There appears to be no single pattern-matching "code" for identifying binding sites in unrelated proteins that bind identical ligands, though modeling suggests that there might be a limited number of different patterns that suffice to recognize different ligand functional groups.
Collapse
Affiliation(s)
- Sarah Barelier
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Teague Sterling
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Matthew J. O’Meara
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| | - Brian K. Shoichet
- Department of Pharmaceutical
Chemistry, University of California San Francisco, 1700 Fourth
Street, Byers Hall, San Francisco, California 94158, United States
| |
Collapse
|
26
|
Lee HC, Liao T, Zhang YJ, Yang G. Shape component analysis: structure-preserving dimension reduction on biological shape spaces. Bioinformatics 2015; 32:755-63. [PMID: 26543176 DOI: 10.1093/bioinformatics/btv648] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2015] [Accepted: 10/18/2015] [Indexed: 11/14/2022] Open
Abstract
MOTIVATION Quantitative shape analysis is required by a wide range of biological studies across diverse scales, ranging from molecules to cells and organisms. In particular, high-throughput and systems-level studies of biological structures and functions have started to produce large volumes of complex high-dimensional shape data. Analysis and understanding of high-dimensional biological shape data require dimension-reduction techniques. RESULTS We have developed a technique for non-linear dimension reduction of 2D and 3D biological shape representations on their Riemannian spaces. A key feature of this technique is that it preserves distances between different shapes in an embedded low-dimensional shape space. We demonstrate an application of this technique by combining it with non-linear mean-shift clustering on the Riemannian spaces for unsupervised clustering of shapes of cellular organelles and proteins. AVAILABILITY AND IMPLEMENTATION Source code and data for reproducing results of this article are freely available at https://github.com/ccdlcmu/shape_component_analysis_Matlab The implementation was made in MATLAB and supported on MS Windows, Linux and Mac OS. CONTACT geyang@andrew.cmu.edu.
Collapse
Affiliation(s)
| | - Tao Liao
- Department of Mechanical Engineering, and
| | | | - Ge Yang
- Department of Biomedical Engineering, Department of Computational Biology, Carnegie Mellon University, Pittsburgh, PA 15213, USA
| |
Collapse
|
27
|
Gowthaman R, Miller SA, Rogers S, Khowsathit J, Lan L, Bai N, Johnson DK, Liu C, Xu L, Anbanandam A, Aubé J, Roy A, Karanicolas J. DARC: Mapping Surface Topography by Ray-Casting for Effective Virtual Screening at Protein Interaction Sites. J Med Chem 2015; 59:4152-70. [PMID: 26126123 DOI: 10.1021/acs.jmedchem.5b00150] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Protein-protein interactions represent an exciting and challenging target class for therapeutic intervention using small molecules. Protein interaction sites are often devoid of the deep surface pockets presented by "traditional" drug targets, and crystal structures reveal that inhibitors typically engage these sites using very shallow binding modes. As a consequence, modern virtual screening tools developed to identify inhibitors of traditional drug targets do not perform as well when they are instead deployed at protein interaction sites. To address the need for novel inhibitors of important protein interactions, here we introduce an alternate docking strategy specifically designed for this regime. Our method, termed DARC (Docking Approach using Ray-Casting), matches the topography of a surface pocket "observed" from within the protein to the topography "observed" when viewing a potential ligand from the same vantage point. We applied DARC to carry out a virtual screen against the protein interaction site of human antiapoptotic protein Mcl-1 and found that four of the top-scoring 21 compounds showed clear inhibition in a biochemical assay. The Ki values for these compounds ranged from 1.2 to 21 μM, and each had ligand efficiency comparable to promising small-molecule inhibitors of other protein-protein interactions. These hit compounds do not resemble the natural (protein) binding partner of Mcl-1, nor do they resemble any known inhibitors of Mcl-1. Our results thus demonstrate the utility of DARC for identifying novel inhibitors of protein-protein interactions.
Collapse
Affiliation(s)
- Ragul Gowthaman
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Sven A Miller
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Steven Rogers
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Jittasak Khowsathit
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Lan Lan
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Nan Bai
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - David K Johnson
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Chunjing Liu
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Liang Xu
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Asokan Anbanandam
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Jeffrey Aubé
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - Anuradha Roy
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| | - John Karanicolas
- Center for Computational Biology, ‡Department of Molecular Biosciences, §Center of Biomedical Research Excellence, Center for Cancer Experimental Therapeutics, ∥Department of Radiation Oncology, ⊥Biomolecular NMR Laboratory, #Department of Medicinal Chemistry, and ∇High Throughput Screening Laboratory University of Kansas , 2030 Becker Drive, Lawrence, Kansas 66045-7534, United States
| |
Collapse
|
28
|
Martínez-Jiménez F, Marti-Renom MA. Ligand-target prediction by structural network biology using nAnnoLyze. PLoS Comput Biol 2015; 11:e1004157. [PMID: 25816344 PMCID: PMC4376866 DOI: 10.1371/journal.pcbi.1004157] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2014] [Accepted: 01/27/2015] [Indexed: 11/24/2022] Open
Abstract
Target identification is essential for drug design, drug-drug interaction prediction, dosage adjustment and side effect anticipation. Specifically, the knowledge of structural details is essential for understanding the mode of action of a compound on a target protein. Here, we present nAnnoLyze, a method for target identification that relies on the hypothesis that structurally similar binding sites bind similar ligands. nAnnoLyze integrates structural information into a bipartite network of interactions and similarities to predict structurally detailed compound-protein interactions at proteome scale. The method was benchmarked on a dataset of 6,282 pairs of known interacting ligand-target pairs reaching a 0.96 of area under the Receiver Operating Characteristic curve (AUC) when using the drug names as an input feature for the classifier, and a 0.70 of AUC for “anonymous” compounds or compounds not present in the training set. nAnnoLyze resulted in higher accuracies than its predecessor, AnnoLyze. We applied the method to predict interactions for all the compounds in the DrugBank database with each human protein structure and provide examples of target identification for known drugs against human diseases. The accuracy and applicability of our method to any compound indicate that a comparative docking approach such as nAnnoLyze enables large-scale annotation and analysis of compound–protein interactions and thus may benefit drug development. Description of the “mode-of-action” of a small chemical compound against a protein target is essential for the drug discovery process. Such description relies on three main steps: i) the identification of the target protein within the thousands of proteins in an organism, ii) the localization of the binding interaction site in the identified target protein, and iii) the molecular characterization of the compound’s binding mode in the binding site of the target protein. Here, we introduce a new computational method, called nAnnoLyze, which uses graph theory principles to relate compounds and target proteins based on comparative principles. nAnnoLyze aims at addressing two of the three previous steps, that is, target identification and binding site localization. Our results suggest that the nAnnoLyze accuracy and proteome-wide applicability enables the large-scale annotation and analysis of compound–protein interaction and thus may benefit drug development.
Collapse
Affiliation(s)
- Francisco Martínez-Jiménez
- Genome Biology Group, Centre Nacional d’Aanàlisi Genòmica (CNAG), Barcelona, Spain
- Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
| | - Marc A. Marti-Renom
- Genome Biology Group, Centre Nacional d’Aanàlisi Genòmica (CNAG), Barcelona, Spain
- Gene Regulation, Stem Cells and Cancer Program, Centre for Genomic Regulation (CRG), Barcelona, Spain
- Institució Catalana de Recerca i Estudis Avançats (ICREA), Barcelona, Spain
- * E-mail:
| |
Collapse
|
29
|
Zhu X, Xiong Y, Kihara D. Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0. ACTA ACUST UNITED AC 2014; 31:707-13. [PMID: 25359888 DOI: 10.1093/bioinformatics/btu724] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Abstract
MOTIVATION Ligand binding is a key aspect of the function of many proteins. Thus, binding ligand prediction provides important insight in understanding the biological function of proteins. Binding ligand prediction is also useful for drug design and examining potential drug side effects. RESULTS We present a computational method named Patch-Surfer2.0, which predicts binding ligands for a protein pocket. By representing and comparing pockets at the level of small local surface patches that characterize physicochemical properties of the local regions, the method can identify binding pockets of the same ligand even if they do not share globally similar shapes. Properties of local patches are represented by an efficient mathematical representation, 3D Zernike Descriptor. Patch-Surfer2.0 has significant technical improvements over our previous prototype, which includes a new feature that captures approximate patch position with a geodesic distance histogram. Moreover, we constructed a large comprehensive database of ligand binding pockets that will be searched against by a query. The benchmark shows better performance of Patch-Surfer2.0 over existing methods. AVAILABILITY AND IMPLEMENTATION http://kiharalab.org/patchsurfer2.0/ CONTACT: dkihara@purdue.edu SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Xiaolei Zhu
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Yi Xiong
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| | - Daisuke Kihara
- Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA Department of Biological Science, Purdue University, West Lafayette, IN 47906, USA and Department of Computer Science, Purdue University, West Lafayette, IN 47906, USA
| |
Collapse
|
30
|
Yock AD, Rao A, Dong L, Beadle BM, Garden AS, Kudchadker RJ, Court LE. Forecasting longitudinal changes in oropharyngeal tumor morphology throughout the course of head and neck radiation therapy. Med Phys 2014; 41:081708. [PMID: 25086518 DOI: 10.1118/1.4887815] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022] Open
Abstract
PURPOSE To create models that forecast longitudinal trends in changing tumor morphology and to evaluate and compare their predictive potential throughout the course of radiation therapy. METHODS Two morphology feature vectors were used to describe 35 gross tumor volumes (GTVs) throughout the course of intensity-modulated radiation therapy for oropharyngeal tumors. The feature vectors comprised the coordinates of the GTV centroids and a description of GTV shape using either interlandmark distances or a spherical harmonic decomposition of these distances. The change in the morphology feature vector observed at 33 time points throughout the course of treatment was described using static, linear, and mean models. Models were adjusted at 0, 1, 2, 3, or 5 different time points (adjustment points) to improve prediction accuracy. The potential of these models to forecast GTV morphology was evaluated using leave-one-out cross-validation, and the accuracy of the models was compared using Wilcoxon signed-rank tests. RESULTS Adding a single adjustment point to the static model without any adjustment points decreased the median error in forecasting the position of GTV surface landmarks by the largest amount (1.2 mm). Additional adjustment points further decreased the forecast error by about 0.4 mm each. Selection of the linear model decreased the forecast error for both the distance-based and spherical harmonic morphology descriptors (0.2 mm), while the mean model decreased the forecast error for the distance-based descriptor only (0.2 mm). The magnitude and statistical significance of these improvements decreased with each additional adjustment point, and the effect from model selection was not as large as that from adding the initial points. CONCLUSIONS The authors present models that anticipate longitudinal changes in tumor morphology using various models and model adjustment schemes. The accuracy of these models depended on their form, and the utility of these models includes the characterization of patient-specific response with implications for treatment management and research study design.
Collapse
Affiliation(s)
- Adam D Yock
- Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030 and The Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas 77030
| | - Arvind Rao
- Department of Bioinformatics and Computational Biology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030 and The Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas 77030
| | - Lei Dong
- Scripps Proton Therapy Center, San Diego, California 92121 and The Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas 77030
| | - Beth M Beadle
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030
| | - Adam S Garden
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030
| | - Rajat J Kudchadker
- Department of Radiation Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030 and The Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas 77030
| | - Laurence E Court
- Department of Radiation Physics and Department of Imaging Physics, The University of Texas MD Anderson Cancer Center, Houston, Texas 77030 and The Graduate School of Biomedical Sciences, The University of Texas Health Science Center at Houston, Houston, Texas 77030
| |
Collapse
|
31
|
Skolnick J, Gao M, Zhou H. On the role of physics and evolution in dictating protein structure and function. Isr J Chem 2014; 54:1176-1188. [PMID: 25484448 PMCID: PMC4255337 DOI: 10.1002/ijch.201400013] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
How many of the structural and functional properties of proteins are inherent? Computer simulations provide a powerful tool to address this question. A series of studies on QS, quasi-spherical, compact polypeptides which lack any secondary structure; ART, artificial, proteins comprised of compact homopolypeptides with protein-like secondary structure; and PDB, native, single domain proteins shows that essentially all native global folds, pockets and protein-protein interfaces are in the ART library. This suggests that many protein properties are inherent and that evolution is involved in fine-tuning. The completeness of the space of ligand binding pockets and protein-protein interfaces suggests that promiscuous interactions are intrinsic to proteins and that the capacity to perform the biochemistry of life at low level does not require evolution. If so, this has profound consequences for the origin of life.
Collapse
Affiliation(s)
- Jeffrey Skolnick
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318, USA
| | - Mu Gao
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318, USA
| | - Hongyi Zhou
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, 250 14th Street NW, Atlanta, GA 30318, USA
| |
Collapse
|
32
|
Yan X, Li J, Gu Q, Xu J. gWEGA: GPU-accelerated WEGA for molecular superposition and shape comparison. J Comput Chem 2014; 35:1122-30. [PMID: 24729358 DOI: 10.1002/jcc.23603] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2014] [Revised: 03/06/2014] [Accepted: 03/14/2014] [Indexed: 01/13/2023]
Affiliation(s)
- Xin Yan
- Research Center for Drug Discovery, School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, Guangdong, 510006, China
| | | | | | | |
Collapse
|
33
|
Abstract
The amount of known protein structures is continuously growing, exhibited in over 95,000 3D structures freely available via the PDB. Over the last decade, pharmaceutical research has sparked interest in computationally extracting information from this large data pool, resulting in a homology-driven knowledge transfer from annotated to new structures. Studying protein structures with respect to understanding and modulating their functional behavior means analyzing their centers of action. Therefore, the detection and description of potential binding sites on the protein surface is a major step towards protein classification and assessment. Subsequently, these representations can be incorporated to compare proteins, and to predict their druggability or function. Especially in the context of target identification and polypharmacology, automated tools for large-scale target comparisons are highly needed. In this article, developments for automated structure-based target assessment are reviewed and remaining challenges as well as future perspectives are discussed.
Collapse
|
34
|
Abstract
Drug action can be rationalized as interaction of a molecule with proteins in a regulatory network of targets from a specific biological system. Both drug and side effects are often governed by interaction of the drug molecule with many, often unrelated, targets. Accordingly, arrays of protein–ligand interaction data from numerous in vitro profiling assays today provide growing evidence of polypharmacological drug interactions, even for marketed drugs. In vitro off-target profiling has therefore become an important tool in early drug discovery to learn about potential off-target liabilities, which are sometimes beneficial, but more often safety relevant. The rapidly developing field of in silico profiling approaches is complementing in vitro profiling. These approaches capitalize from large amounts of biochemical data from multiple sources to be exploited for optimizing undesirable side effects in pharmaceutical research. Therefore, current in silico profiling models are nowadays perceived as valuable tools in drug discovery, and promise a platform to support optimally informed decisions.
Collapse
|
35
|
Yamanishi Y. Inferring Chemogenomic Features from Drug-Target Interaction Networks. Mol Inform 2013; 32:991-9. [DOI: 10.1002/minf.201300079] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 11/19/2013] [Indexed: 01/23/2023]
|
36
|
Gao M, Skolnick J. A comprehensive survey of small-molecule binding pockets in proteins. PLoS Comput Biol 2013; 9:e1003302. [PMID: 24204237 PMCID: PMC3812058 DOI: 10.1371/journal.pcbi.1003302] [Citation(s) in RCA: 91] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2013] [Accepted: 09/11/2013] [Indexed: 11/19/2022] Open
Abstract
Many biological activities originate from interactions between small-molecule ligands and their protein targets. A detailed structural and physico-chemical characterization of these interactions could significantly deepen our understanding of protein function and facilitate drug design. Here, we present a large-scale study on a non-redundant set of about 20,000 known ligand-binding sites, or pockets, of proteins. We find that the structural space of protein pockets is crowded, likely complete, and may be represented by about 1,000 pocket shapes. Correspondingly, the growth rate of novel pockets deposited in the Protein Data Bank has been decreasing steadily over the recent years. Moreover, many protein pockets are promiscuous and interact with ligands of diverse scaffolds. Conversely, many ligands are promiscuous and interact with structurally different pockets. Through a physico-chemical and structural analysis, we provide insights into understanding both pocket promiscuity and ligand promiscuity. Finally, we discuss the implications of our study for the prediction of protein-ligand interactions based on pocket comparison. The life of a living cell relies on many distinct proteins to carry out their functions. Most of these functions are rooted in interactions between the proteins and metabolites, small-molecules essential for life. By targeting specific proteins relevant to a disease, drug molecules may provide a cure. A deep understanding of the nature of interactions between proteins and small-molecules (or ligands) through analyzing their structures may help predict protein function or improve drug design. In this contribution, we present a large-scale analysis of a non-redundant set of over 20,000 experimental protein-ligand complex structures available in the current Protein Data Bank. We seek answers to several fundamental questions: How many representative pockets are there that serve as ligand-binding sites in proteins? To what extent can we infer a similar protein-ligand interaction by matching the structures of protein pockets? How different are the ligands found in the same pocket? For a promiscuous protein pocket, how does a pocket maintain favorable interactions with very different ligands? Conversely, how different are those pockets that interact with the same ligand? We find the structural space of protein pocket is small and that both protein promiscuity and ligand promiscuity are very common in Nature.
Collapse
Affiliation(s)
- Mu Gao
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
| | - Jeffrey Skolnick
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, Georgia, United States of America
- * E-mail:
| |
Collapse
|
37
|
3D time series analysis of cell shape using Laplacian approaches. BMC Bioinformatics 2013; 14:296. [PMID: 24090312 PMCID: PMC3871028 DOI: 10.1186/1471-2105-14-296] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 07/15/2013] [Indexed: 11/18/2022] Open
Abstract
Background Fundamental cellular processes such as cell movement, division or food uptake critically depend on cells being able to change shape. Fast acquisition of three-dimensional image time series has now become possible, but we lack efficient tools for analysing shape deformations in order to understand the real three-dimensional nature of shape changes. Results We present a framework for 3D+time cell shape analysis. The main contribution is three-fold: First, we develop a fast, automatic random walker method for cell segmentation. Second, a novel topology fixing method is proposed to fix segmented binary volumes without spherical topology. Third, we show that algorithms used for each individual step of the analysis pipeline (cell segmentation, topology fixing, spherical parameterization, and shape representation) are closely related to the Laplacian operator. The framework is applied to the shape analysis of neutrophil cells. Conclusions The method we propose for cell segmentation is faster than the traditional random walker method or the level set method, and performs better on 3D time-series of neutrophil cells, which are comparatively noisy as stacks have to be acquired fast enough to account for cell motion. Our method for topology fixing outperforms the tools provided by SPHARM-MAT and SPHARM-PDM in terms of their successful fixing rates. The different tasks in the presented pipeline for 3D+time shape analysis of cells can be solved using Laplacian approaches, opening the possibility of eventually combining individual steps in order to speed up computations.
Collapse
|
38
|
Cai C, Gong J, Liu X, Gao D, Li H. SimG: an alignment based method for evaluating the similarity of small molecules and binding sites. J Chem Inf Model 2013; 53:2103-15. [PMID: 23889471 DOI: 10.1021/ci400139j] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/02/2023]
Abstract
In this study, a Gaussian volume overlap and chemical feature based molecular similarity metric was devised, and a downhill simplex searching was carried out to evaluate the corresponding similarity. By representing the shapes of both the candidate small molecules and the binding site with chemical features and comparing the corresponding Gaussian volumes overlaps, the active compounds could be identified. These two aspects compose the proposed method named SimG which supports both structure-based and ligand-based strategies. The validity of the proposed method was examined by analyzing the similarity score variation between actives and decoys as well as correlation among distinct reference methods. A retrospective virtual screening test was carried out on DUD data sets, demonstrating that the performance of structure-based shape matching virtual screening in DUD data sets is substantially dependent on some physical properties, especially the solvent-exposure extent of the binding site: The enrichments of targets with less solvent-exposed binding sites generally exceeds that of the one with more solvent-exposed binding sites and even surpasses the corresponding ligand-based virtual screening.
Collapse
Affiliation(s)
- Chaoqian Cai
- School of Information Science and Engineering, Shanghai Key Laboratory of New Drug Design, School of Pharmacy, East China University of Science and Technology, 130 Mei Long Road, Shanghai 200237, China
| | | | | | | | | |
Collapse
|
39
|
Yan X, Li J, Liu Z, Zheng M, Ge H, Xu J. Enhancing Molecular Shape Comparison by Weighted Gaussian Functions. J Chem Inf Model 2013; 53:1967-78. [DOI: 10.1021/ci300601q] [Citation(s) in RCA: 64] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Affiliation(s)
- Xin Yan
- Research Center
for Drug Discovery,
School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, 510006, China
| | - Jiabo Li
- SciNet Technologies, 9943 Fieldthron Street, San Diego, California 92127, United States
- Accelrys, Inc., 10188 Telesis Court, San Diego, California 92121, United States
| | - Zhihong Liu
- Research Center
for Drug Discovery,
School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, 510006, China
| | - Minghao Zheng
- Research Center
for Drug Discovery,
School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, 510006, China
| | - Hu Ge
- Research Center
for Drug Discovery,
School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, 510006, China
| | - Jun Xu
- Research Center
for Drug Discovery,
School of Pharmaceutical Sciences, Sun Yat-sen University, 132 East Circle at University City, Guangzhou, 510006, China
| |
Collapse
|
40
|
Protein pocket and ligand shape comparison and its application in virtual screening. J Comput Aided Mol Des 2013; 27:511-24. [PMID: 23807262 DOI: 10.1007/s10822-013-9659-1] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 06/12/2013] [Indexed: 10/26/2022]
Abstract
Understanding molecular recognition is one major requirement for drug discovery and design. Physicochemical and shape complementarity between two binding partners is the driving force during complex formation. In this study, the impact of shape within this process is analyzed. Protein binding pockets and co-crystallized ligands are represented by normalized principal moments of inertia ratios (NPRs). The corresponding descriptor space is triangular, with its corners occupied by spherical, discoid, and elongated shapes. An analysis of a selected set of sc-PDB complexes suggests that pockets and bound ligands avoid spherical shapes, which are, however, prevalent in small unoccupied pockets. Furthermore, a direct shape comparison confirms previous studies that on average only one third of a pocket is filled by its bound ligand, supplemented by a 50 % subpocket coverage. In this study, we found that shape complementary is expressed by low pairwise shape distances in NPR space, short distances between the centers-of-mass, and small deviations in the angle between the first principal ellipsoid axes. Furthermore, it is assessed how different binding pocket parameters are related to bioactivity and binding efficiency of the co-crystallized ligand. In addition, the performance of different shape and size parameters of pockets and ligands is evaluated in a virtual screening scenario performed on four representative targets.
Collapse
|
41
|
Pérot S, Regad L, Reynès C, Spérandio O, Miteva MA, Villoutreix BO, Camproux AC. Insights into an original pocket-ligand pair classification: a promising tool for ligand profile prediction. PLoS One 2013; 8:e63730. [PMID: 23840299 PMCID: PMC3688729 DOI: 10.1371/journal.pone.0063730] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/05/2012] [Accepted: 04/05/2013] [Indexed: 11/18/2022] Open
Abstract
Pockets are today at the cornerstones of modern drug discovery projects and at the crossroad of several research fields, from structural biology to mathematical modeling. Being able to predict if a small molecule could bind to one or more protein targets or if a protein could bind to some given ligands is very useful for drug discovery endeavors, anticipation of binding to off- and anti-targets. To date, several studies explore such questions from chemogenomic approach to reverse docking methods. Most of these studies have been performed either from the viewpoint of ligands or targets. However it seems valuable to use information from both ligands and target binding pockets. Hence, we present a multivariate approach relating ligand properties with protein pocket properties from the analysis of known ligand-protein interactions. We explored and optimized the pocket-ligand pair space by combining pocket and ligand descriptors using Principal Component Analysis and developed a classification engine on this paired space, revealing five main clusters of pocket-ligand pairs sharing specific and similar structural or physico-chemical properties. These pocket-ligand pair clusters highlight correspondences between pocket and ligand topological and physico-chemical properties and capture relevant information with respect to protein-ligand interactions. Based on these pocket-ligand correspondences, a protocol of prediction of clusters sharing similarity in terms of recognition characteristics is developed for a given pocket-ligand complex and gives high performances. It is then extended to cluster prediction for a given pocket in order to acquire knowledge about its expected ligand profile or to cluster prediction for a given ligand in order to acquire knowledge about its expected pocket profile. This prediction approach shows promising results and could contribute to predict some ligand properties critical for binding to a given pocket, and conversely, some key pocket properties for ligand binding.
Collapse
Affiliation(s)
- Stéphanie Pérot
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Leslie Regad
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Christelle Reynès
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Olivier Spérandio
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Maria A. Miteva
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Bruno O. Villoutreix
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
| | - Anne-Claude Camproux
- INSERM, UMRS 973, MTi, Paris, France
- Univ Paris Diderot, Sorbonne Paris Cité, UMRS 973, MTi, Paris, France
- * E-mail:
| |
Collapse
|
42
|
Tu H, Shi T. Ligand Binding Site Similarity Identification Based on Chemical and Geometric Similarity. Protein J 2013; 32:373-85. [DOI: 10.1007/s10930-013-9494-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
|
43
|
Tabei Y, Pauwels E, Stoven V, Takemoto K, Yamanishi Y. Identification of chemogenomic features from drug-target interaction networks using interpretable classifiers. Bioinformatics 2013; 28:i487-i494. [PMID: 22962471 PMCID: PMC3436839 DOI: 10.1093/bioinformatics/bts412] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Motivation: Drug effects are mainly caused by the interactions between drug molecules and their target proteins including primary targets and off-targets. Identification of the molecular mechanisms behind overall drug–target interactions is crucial in the drug design process. Results: We develop a classifier-based approach to identify chemogenomic features (the underlying associations between drug chemical substructures and protein domains) that are involved in drug–target interaction networks. We propose a novel algorithm for extracting informative chemogenomic features by using L1 regularized classifiers over the tensor product space of possible drug–target pairs. It is shown that the proposed method can extract a very limited number of chemogenomic features without loosing the performance of predicting drug–target interactions and the extracted features are biologically meaningful. The extracted substructure–domain association network enables us to suggest ligand chemical fragments specific for each protein domain and ligand core substructures important for a wide range of protein families. Availability: Softwares are available at the supplemental website. Contact:yamanishi@bioreg.kyushu-u.ac.jp Supplementary Information: Datasets and all results are available at http://cbio.ensmp.fr/~yyamanishi/l1binary/
.
Collapse
Affiliation(s)
- Yasuo Tabei
- ERATO Minato Project, Japan Science and Technology Agency, Sapporo 060-0814, Japan
| | | | | | | | | |
Collapse
|
44
|
Benchmarking of HPCC: A novel 3D molecular representation combining shape and pharmacophoric descriptors for efficient molecular similarity assessments. J Mol Graph Model 2013; 41:20-30. [DOI: 10.1016/j.jmgm.2013.01.003] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2012] [Revised: 01/11/2013] [Accepted: 01/16/2013] [Indexed: 01/15/2023]
|
45
|
Dadgar S, Ramjan Z, Floriano WB. Paclitaxel is an inhibitor and its boron dipyrromethene derivative is a fluorescent recognition agent for botulinum neurotoxin subtype A. J Med Chem 2013; 56:2791-803. [PMID: 23484537 DOI: 10.1021/jm301829h] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
We have successfully identified one new inhibitor and one new fluorescent recognition agent for the botulinum neurotoxin subtype A (BoNT/A) using the virtual screening protocol "protein scanning with virtual ligand screening" (PSVLS). Hit selection used an in-house developed holistic binding scoring method. Selected hits were tested experimentally for inhibitory activity using fluorescence resonance energy transfer (FRET) assays against the light chain (catalytic domain) of BoNT/A. Ligand binding was determined against the light and heavy chain BoNT/A complex through either radiolabeled ligand binding assays (nonfluorescent ligands) or fluorescence intensity assays (fluorescent ligands). These experimental assays have confirmed one compound (paclitaxel) to inhibit BoNT/A's proteolytic activity experimentally with an IC50 of 5.2 μM. A fluorescent derivative was also confirmed to bind to the toxin and therefore is a suitable candidate for the rational design of new detection agents and for the development of fluorescence-based multiprobe detection assays.
Collapse
Affiliation(s)
- Saedeh Dadgar
- Department of Chemistry, Lakehead University and Thunder Bay Regional Research Institute, Thunder Bay, Ontario P7B 5E1, Canada
| | | | | |
Collapse
|
46
|
Pires DEV, de Melo-Minardi RC, da Silveira CH, Campos FF, Meira W. aCSM: noise-free graph-based signatures to large-scale receptor-based ligand prediction. ACTA ACUST UNITED AC 2013; 29:855-61. [PMID: 23396119 DOI: 10.1093/bioinformatics/btt058] [Citation(s) in RCA: 47] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022]
Abstract
MOTIVATION Receptor-ligand interactions are a central phenomenon in most biological systems. They are characterized by molecular recognition, a complex process mainly driven by physicochemical and structural properties of both receptor and ligand. Understanding and predicting these interactions are major steps towards protein ligand prediction, target identification, lead discovery and drug design. RESULTS We propose a novel graph-based-binding pocket signature called aCSM, which proved to be efficient and effective in handling large-scale protein ligand prediction tasks. We compare our results with those described in the literature and demonstrate that our algorithm overcomes the competitor's techniques. Finally, we predict novel ligands for proteins from Trypanosoma cruzi, the parasite responsible for Chagas disease, and validate them in silico via a docking protocol, showing the applicability of the method in suggesting ligands for pockets in a real-world scenario. AVAILABILITY AND IMPLEMENTATION Datasets and the source code are available at http://www.dcc.ufmg.br/∼dpires/acsm. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Douglas E V Pires
- Department of Computer Science, Universidade Federal de Minas Gerais, Av. Antônio Carlos, 6627, Pampulha Belo Horizonte - MG, 31270-901, Brazil.
| | | | | | | | | |
Collapse
|
47
|
von Behren MM, Volkamer A, Henzler AM, Schomburg KT, Urbaczek S, Rarey M. Fast protein binding site comparison via an index-based screening technology. J Chem Inf Model 2013; 53:411-22. [PMID: 23390978 DOI: 10.1021/ci300469h] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/25/2023]
Abstract
We present TrixP, a new index-based method for fast protein binding site comparison and function prediction. TrixP determines binding site similarities based on the comparison of descriptors that encode pharmacophoric and spatial features. Therefore, it adopts the efficient core components of TrixX, a structure-based virtual screening technology for large compound libraries. TrixP expands this technology by new components in order to allow a screening of protein libraries. TrixP accounts for the inherent flexibility of proteins employing a partial shape matching routine. After the identification of structures with matching pharmacophoric features and geometric shape, TrixP superimposes the binding sites and, finally, assesses their similarity according to the fit of pharmacophoric properties. TrixP is able to find analogies between closely and distantly related binding sites. Recovery rates of 81.8% for similar binding site pairs, assisted by rejecting rates of 99.5% for dissimilar pairs on a test data set containing 1331 pairs, confirm this ability. TrixP exclusively identifies members of the same protein family on top ranking positions out of a library consisting of 9802 binding sites. Furthermore, 30 predicted kinase binding sites can almost perfectly be classified into their known subfamilies.
Collapse
Affiliation(s)
- Mathias M von Behren
- Center for Bioinformatics, University of Hamburg, Bundesstrasse 43, 20146 Hamburg, Germany
| | | | | | | | | | | |
Collapse
|
48
|
Predicting PDZ domain mediated protein interactions from structure. BMC Bioinformatics 2013; 14:27. [PMID: 23336252 PMCID: PMC3602153 DOI: 10.1186/1471-2105-14-27] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2012] [Accepted: 12/19/2012] [Indexed: 12/03/2022] Open
Abstract
Background PDZ domains are structural protein domains that recognize simple linear amino acid motifs, often at protein C-termini, and mediate protein-protein interactions (PPIs) in important biological processes, such as ion channel regulation, cell polarity and neural development. PDZ domain-peptide interaction predictors have been developed based on domain and peptide sequence information. Since domain structure is known to influence binding specificity, we hypothesized that structural information could be used to predict new interactions compared to sequence-based predictors. Results We developed a novel computational predictor of PDZ domain and C-terminal peptide interactions using a support vector machine trained with PDZ domain structure and peptide sequence information. Performance was estimated using extensive cross validation testing. We used the structure-based predictor to scan the human proteome for ligands of 218 PDZ domains and show that the predictions correspond to known PDZ domain-peptide interactions and PPIs in curated databases. The structure-based predictor is complementary to the sequence-based predictor, finding unique known and novel PPIs, and is less dependent on training–testing domain sequence similarity. We used a functional enrichment analysis of our hits to create a predicted map of PDZ domain biology. This map highlights PDZ domain involvement in diverse biological processes, some only found by the structure-based predictor. Based on this analysis, we predict novel PDZ domain involvement in xenobiotic metabolism and suggest new interactions for other processes including wound healing and Wnt signalling. Conclusions We built a structure-based predictor of PDZ domain-peptide interactions, which can be used to scan C-terminal proteomes for PDZ interactions. We also show that the structure-based predictor finds many known PDZ mediated PPIs in human that were not found by our previous sequence-based predictor and is less dependent on training–testing domain sequence similarity. Using both predictors, we defined a functional map of human PDZ domain biology and predict novel PDZ domain function. Users may access our structure-based and previous sequence-based predictors at
http://webservice.baderlab.org/domains/POW.
Collapse
|
49
|
Abstract
MOTIVATION Most proteins interact with small-molecule ligands such as metabolites or drug compounds. Over the past several decades, many of these interactions have been captured in high-resolution atomic structures. From a geometric point of view, most interaction sites for grasping these small-molecule ligands, as revealed in these structures, form concave shapes, or 'pockets', on the protein's surface. An efficient method for comparing these pockets could greatly assist the classification of ligand-binding sites, prediction of protein molecular function and design of novel drug compounds. RESULTS We introduce a computational method, APoc (Alignment of Pockets), for the large-scale, sequence order-independent, structural comparison of protein pockets. A scoring function, the Pocket Similarity Score (PS-score), is derived to measure the level of similarity between pockets. Statistical models are used to estimate the significance of the PS-score based on millions of comparisons of randomly related pockets. APoc is a general robust method that may be applied to pockets identified by various approaches, such as ligand-binding sites as observed in experimental complex structures, or predicted pockets identified by a pocket-detection method. Finally, we curate large benchmark datasets to evaluate the performance of APoc and present interesting examples to demonstrate the usefulness of the method. We also demonstrate that APoc has better performance than the geometric hashing-based method SiteEngine. AVAILABILITY AND IMPLEMENTATION The APoc software package including the source code is freely available at http://cssb.biology.gatech.edu/APoc.
Collapse
Affiliation(s)
- Mu Gao
- Center for the Study of Systems Biology, School of Biology, Georgia Institute of Technology, Atlanta, GA 30076, USA
| | | |
Collapse
|
50
|
Abstract
The focus of this chapter is on the important concepts behind the in silico techniques that are used today to assess target druggability. The first step of the assessment consists of finding cavity space in the protein using 2D and/or 3D topological concepts. These concepts underlie the geometry and energy-based pocketfinder algorithms. Analysis pursues on the physico-chemical complementarity between the binding site and the drug like molecule. Geometrical and molecular flexibility aspect are also included in this assessment. The presence of hot interaction spots are shown to be particularly important for targeting protein-protein interactions. Finally, binding site promiscuity can be assessed by large scale structural comparison with other targets. Common chemical features amongst protein cavities can predict potential cross-reactivity with unwanted targets.
Collapse
|