1
|
Sandholtz SH, Drocco JA, Zemla AT, Torres MW, Silva MS, Allen JE. A Computational Pipeline to Identify and Characterize Binding Sites and Interacting Chemotypes in SARS-CoV-2. ACS OMEGA 2023; 8:21871-21884. [PMID: 37309388 PMCID: PMC10254058 DOI: 10.1021/acsomega.3c01621] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 03/09/2023] [Accepted: 05/17/2023] [Indexed: 06/14/2023]
Abstract
Minimizing the human and economic costs of the COVID-19 pandemic and future pandemics requires the ability to develop and deploy effective treatments for novel pathogens as soon as possible after they emerge. To this end, we introduce a new computational pipeline for the rapid identification and characterization of binding sites in viral proteins along with the key chemical features, which we call chemotypes, of the compounds predicted to interact with those same sites. The composition of source organisms for the structural models associated with an individual binding site is used to assess the site's degree of structural conservation across different species, including other viruses and humans. We propose a search strategy for novel therapeutics that involves the selection of molecules preferentially containing the most structurally rich chemotypes identified by our algorithm. While we demonstrate the pipeline on SARS-CoV-2, it is generalizable to any new virus, as long as either experimentally solved structures for its proteins are available or sufficiently accurate predicted structures can be constructed.
Collapse
Affiliation(s)
- Sarah H. Sandholtz
- Biosciences
and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| | - Jeffrey A. Drocco
- Biosciences
and Biotechnology Division, Physical and Life Sciences Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| | - Adam T. Zemla
- Global
Security Computing Applications Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| | - Marisa W. Torres
- Global
Security Computing Applications Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| | - Mary S. Silva
- Global
Security Computing Applications Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| | - Jonathan E. Allen
- Global
Security Computing Applications Division, Computing Directorate, Lawrence Livermore National Laboratory, Livermore, California 94550, United States
of America
| |
Collapse
|
2
|
Du Y, Shi T. Ligand cluster-based protein network and ePlatton, a multi-target ligand finder. J Cheminform 2016; 8:23. [PMID: 27143991 PMCID: PMC4853874 DOI: 10.1186/s13321-016-0135-5] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Accepted: 04/18/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Small molecules are information carriers that make cells aware of external changes and couple internal metabolic and signalling pathway systems with each other. In some specific physiological status, natural or artificial molecules are used to interact with selective biological targets to activate or inhibit their functions to achieve expected biological and physiological output. Millions of years of evolution have optimized biological processes and pathways and now the endocrine and immune system cannot work properly without some key small molecules. In the past thousands of years, the human race has managed to find many medicines against diseases by trail-and-error experience. In the recent decades, with the deepening understanding of life and the progress of molecular biology, researchers spare no effort to design molecules targeting one or two key enzymes and receptors related to corresponding diseases. But recent studies in pharmacogenomics have shown that polypharmacology may be necessary for the effects of drugs, which challenge the paradigm, 'one drug, one target, one disease'. Nowadays, cheminformatics and structural biology can help us reasonably take advantage of the polypharmacology to design next-generation promiscuous drugs and drug combination therapies. RESULTS 234,591 protein-ligand interactions were extracted from ChEMBL. By the 2D structure similarity, 13,769 ligand emerged from 156,151 distinct ligands which were recognized by 1477 proteins. Ligand cluster- and sequence-based protein networks (LCBN, SBN) were constructed, compared and analysed. For assisting compound designing, exploring polypharmacology and finding possible drug combination, we integrated the pathway, disease, drug adverse reaction and the relationship of targets and ligand clusters into the web platform, ePlatton, which is available at http://www.megabionet.org/eplatton. CONCLUSIONS Although there were some disagreements between the LCBN and SBN, communities in both networks were largely the same with normalized mutual information at 0.9. The study of target and ligand cluster promiscuity underlying the LCBN showed that light ligand clusters were more promiscuous than the heavy one and that highly connected nodes tended to be protein kinases and involved in phosphorylation. ePlatton considerably reduced the redundancy of the ligand set of targets and made it easy to deduce the possible relationship between compounds and targets, pathways and side effects. ePlatton behaved reliably in validation experiments and also fast in virtual screening and information retrieval.Graphical abstractCluster exemplars and ePlatton's mechanism.
Collapse
Affiliation(s)
- Yu Du
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241 China
| | - Tieliu Shi
- Center for Bioinformatics and Computational Biology, Shanghai Key Laboratory of Regulatory Biology, Institute of Biomedical Sciences and School of Life Sciences, East China Normal University, Shanghai, 200241 China
| |
Collapse
|
3
|
Fang Y. Compound annotation with real time cellular activity profiles to improve drug discovery. Expert Opin Drug Discov 2016; 11:269-80. [PMID: 26787137 DOI: 10.1517/17460441.2016.1143460] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
INTRODUCTION In the past decade, a range of innovative strategies have been developed to improve the productivity of pharmaceutical research and development. In particular, compound annotation, combined with informatics, has provided unprecedented opportunities for drug discovery. AREAS COVERED In this review, a literature search from 2000 to 2015 was conducted to provide an overview of the compound annotation approaches currently used in drug discovery. Based on this, a framework related to a compound annotation approach using real-time cellular activity profiles for probe, drug, and biology discovery is proposed. EXPERT OPINION Compound annotation with chemical structure, drug-like properties, bioactivities, genome-wide effects, clinical phenotypes, and textural abstracts has received significant attention in early drug discovery. However, these annotations are mostly associated with endpoint results. Advances in assay techniques have made it possible to obtain real-time cellular activity profiles of drug molecules under different phenotypes, so it is possible to generate compound annotation with real-time cellular activity profiles. Combining compound annotation with informatics, such as similarity analysis, presents a good opportunity to improve the rate of discovery of novel drugs and probes, and enhance our understanding of the underlying biology.
Collapse
Affiliation(s)
- Ye Fang
- a Biochemical Technologies, Science and Technology Division , Corning Incorporated , Corning , NY , USA
| |
Collapse
|
4
|
Nicola G, Berthold MR, Hedrick MP, Gilson MK. Connecting proteins with drug-like compounds: Open source drug discovery workflows with BindingDB and KNIME. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION 2015; 2015:bav087. [PMID: 26384374 PMCID: PMC4572361 DOI: 10.1093/database/bav087] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/09/2015] [Accepted: 08/17/2015] [Indexed: 12/24/2022]
Abstract
Today's large, public databases of protein-small molecule interaction data are creating important new opportunities for data mining and integration. At the same time, new graphical user interface-based workflow tools offer facile alternatives to custom scripting for informatics and data analysis. Here, we illustrate how the large protein-ligand database BindingDB may be incorporated into KNIME workflows as a step toward the integration of pharmacological data with broader biomolecular analyses. Thus, we describe a collection of KNIME workflows that access BindingDB data via RESTful webservices and, for more intensive queries, via a local distillation of the full BindingDB dataset. We focus in particular on the KNIME implementation of knowledge-based tools to generate informed hypotheses regarding protein targets of bioactive compounds, based on notions of chemical similarity. A number of variants of this basic approach are tested for seven existing drugs with relatively ill-defined therapeutic targets, leading to replication of some previously confirmed results and discovery of new, high-quality hits. Implications for future development are discussed. Database URL: www.bindingdb.org.
Collapse
Affiliation(s)
- George Nicola
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA,
| | - Michael R Berthold
- Department of Computer and Information Science, Konstanz University, 78457 Konstanz, Germany, and
| | | | - Michael K Gilson
- Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, 9500 Gilman Drive, La Jolla, CA 92093, USA,
| |
Collapse
|
5
|
Fang Y. Combining label-free cell phenotypic profiling with computational approaches for novel drug discovery. Expert Opin Drug Discov 2015; 10:331-343. [PMID: 25727255 DOI: 10.1517/17460441.2015.1020788] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023]
Affiliation(s)
- Ye Fang
- Corning Inc., Biochemical Technologies, Science and Technology Division, Corning, NY 14831, USA
| |
Collapse
|
6
|
Cao DS, Xu QS, Hu QN, Liang YZ. ChemoPy: freely available python package for computational biology and chemoinformatics. ACTA ACUST UNITED AC 2013; 29:1092-4. [PMID: 23493324 DOI: 10.1093/bioinformatics/btt105] [Citation(s) in RCA: 136] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
MOTIVATION Molecular representation for small molecules has been routinely used in QSAR/SAR, virtual screening, database search, ranking, drug ADME/T prediction and other drug discovery processes. To facilitate extensive studies of drug molecules, we developed a freely available, open-source python package called chemoinformatics in python (ChemoPy) for calculating the commonly used structural and physicochemical features. It computes 16 drug feature groups composed of 19 descriptors that include 1135 descriptor values. In addition, it provides seven types of molecular fingerprint systems for drug molecules, including topological fingerprints, electro-topological state (E-state) fingerprints, MACCS keys, FP4 keys, atom pairs fingerprints, topological torsion fingerprints and Morgan/circular fingerprints. By applying a semi-empirical quantum chemistry program MOPAC, ChemoPy can also compute a large number of 3D molecular descriptors conveniently. AVAILABILITY The python package, ChemoPy, is freely available via http://code.google.com/p/pychem/downloads/list, and it runs on Linux and MS-Windows. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- Research Center of Modernization of Traditional Chinese Medicines, Central South University, Changsha, P. R. China
| | | | | | | |
Collapse
|
7
|
Volkamer A, Kuhn D, Rippmann F, Rarey M. Predicting enzymatic function from global binding site descriptors. Proteins 2012; 81:479-89. [DOI: 10.1002/prot.24205] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2012] [Revised: 09/21/2012] [Accepted: 10/11/2012] [Indexed: 11/09/2022]
|
8
|
|
9
|
Mohammed A, Guda C. Computational Approaches for Automated Classification of Enzyme Sequences. ACTA ACUST UNITED AC 2011; 4:147-152. [PMID: 22114367 DOI: 10.4172/jpb.1000183] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Determining the functional role(s) of enzymes is very important to build the metabolic blueprint of an organism and to identify the potential roles enzymes may play in metabolic and disease pathways. With exponential growth in gene and protein sequence data, it is not feasible to experimentally characterize the function(s) of all enzymes. Alternatively, computational methods can be used to annotate the enormous amount of unannotated enzyme sequences. For function prediction and classification of enzymes, features based on amino acid composition, sequence and structural properties, domain composition and specific peptide information have been widely used by different computational approaches. Each feature space has its own merits and limitations on the overall prediction accuracy. Prediction accuracy improves when machine-learning methods are used to classify enzymes. Given the incomplete and unbalanced nature of annotations in biological databases, ensemble methods or methods that bank on a combination of orthogonal feature are more desirable for achieving higher accuracy and coverage in enzyme classification. In this review article, we systematically describe all the features and methods used thus far for enzyme class prediction. To the authors' knowledge, this review represents the most exhaustive description of methods used for computational prediction of enzyme classes.
Collapse
Affiliation(s)
- Akram Mohammed
- Department of Genetics, Cell Biology and Anatomy, University of Nebraska Medical Center, NE, USA
| | | |
Collapse
|
10
|
Abstract
Despite the dramatic increase of global spending on drug discovery and development, the approval rate for new drugs is declining, due chiefly to toxicity and undesirable side effects. Simultaneously, the growth of available biomedical data in the postgenomic era has provided fresh insight into the nature of redundant and compensatory drug-target pathways. This stagnation in drug approval can be overcome by the novel concept of polypharmacology, which is built on the fundamental concept that drugs modulate multiple targets. Polypharmacology can be studied with molecular networks that integrate multidisciplinary concepts including cheminformatics, bioinformatics, and systems biology. In silico techniques such as structure- and ligand-based approaches can be employed to study molecular networks and reduce costs by predicting adverse drug reactions and toxicity in the early stage of drug development. By amalgamating strides in this informatics-driven era, designing polypharmacological drugs with molecular network technology exemplifies the next generation of therapeutics with less of-target properties and toxicity. In this review, we will first describe the challenges in drug discovery, and showcase successes using multitarget drugs toward diseases such as cancer and mood disorders. We will then focus on recent development of in silico polypharmacology predictions. Finally, our technologies in molecular network analysis will be presented.
Collapse
Affiliation(s)
- John Kenneth Morrow
- The Integrated Molecular Discovery Laboratory, Department of Experimental Therapeutics, The University of Texas MD Anderson Cancer Center, 1515 Holcombe Boulevard, Houston, TX 77030, USA
| | | | | |
Collapse
|
11
|
Chen L, Qian Z, Fen K, Cai Y. Prediction of interactiveness between small molecules and enzymes by combining gene ontology and compound similarity. J Comput Chem 2010; 31:1766-76. [PMID: 20033913 DOI: 10.1002/jcc.21467] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Determination of whether a small organic molecule interacts with an enzyme can help to understand the molecular and cellular functions of organisms, and the metabolic pathways. In this research, we present a prediction model, by combining compound similarity and enzyme similarity, to predict the interactiveness between small molecules and enzymes. A dataset consisting of 2859 positive couples of small molecule and enzyme and 286,056 negative couples was employed. Compound similarity is a measurement of how similar two small molecules are, proposed by Hattori et al., J Am Chem Soc 2003, 125, 11853 which can be availed at http://www.genome.jp/ligand-bin/search_compound, while enzyme similarity was obtained by three ways, they are blast method, using gene ontology items and functional domain composition. Then a new distance between a pair of couples was established and nearest neighbor algorithm (NNA) was employed to predict the interactiveness of enzymes and small molecules. A data distribution strategy was adopted to get a better data balance between the positive samples and the negative samples during training the prediction model, by singling out one-fourth couples as testing samples and dividing the rest data into seven training datasets-the rest positive samples were added into each training dataset while only the negative samples were divided. In this way, seven NNAs were built. Finally, simple majority voting system was applied to integrate these seven models to predict the testing dataset, which was demonstrated to have better prediction results than using any single prediction model. As a result, the highest overall prediction accuracy achieved 97.30%.
Collapse
Affiliation(s)
- Lei Chen
- Shanghai Key Laboratory of Trustworthy Computing, East China Normal University, Shanghai 200062, People's Republic of China
| | | | | | | |
Collapse
|
12
|
A multidimensional strategy to detect polypharmacological targets in the absence of structural and sequence homology. PLoS Comput Biol 2010; 6:e1000648. [PMID: 20098496 PMCID: PMC2799658 DOI: 10.1371/journal.pcbi.1000648] [Citation(s) in RCA: 57] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2009] [Accepted: 12/16/2009] [Indexed: 01/18/2023] Open
Abstract
Conventional drug design embraces the “one gene, one drug, one disease” philosophy. Polypharmacology, which focuses on multi-target drugs, has emerged as a new paradigm in drug discovery. The rational design of drugs that act via polypharmacological mechanisms can produce compounds that exhibit increased therapeutic potency and against which resistance is less likely to develop. Additionally, identifying multiple protein targets is also critical for side-effect prediction. One third of potential therapeutic compounds fail in clinical trials or are later removed from the market due to unacceptable side effects often caused by off-target binding. In the current work, we introduce a multidimensional strategy for the identification of secondary targets of known small-molecule inhibitors in the absence of global structural and sequence homology with the primary target protein. To demonstrate the utility of the strategy, we identify several targets of 4,5-dihydroxy-3-(1-naphthyldiazenyl)-2,7-naphthalenedisulfonic acid, a known micromolar inhibitor of Trypanosoma brucei RNA editing ligase 1. As it is capable of identifying potential secondary targets, the strategy described here may play a useful role in future efforts to reduce drug side effects and/or to increase polypharmacology. Proteins play a critical role in human disease; bacteria, viruses, and parasites have unique proteins that can interfere with human health, and dysfunctional human proteins can likewise lead to illness. In order to find cures, scientists often try to identify small molecules (drugs) that can inhibit disease-causing proteins. The goal is to identify a molecule that can fit snugly into the pockets and grooves, or “active sites,” on the protein's surface. Unfortunately, drugs that inhibit a single disease-causing protein are problematic. A single protein can evolve to evade drug action. Additionally, when only one protein is targeted, drug potency is often diminished. Single drugs that simultaneously target multiple disease-causing proteins are much more effective. On the other hand, if scientists are not careful, the drugs they design might inhibit essential human proteins in addition to inhibiting their intended targets, leading to unexpected side effects. In our current work, we have developed a computer-based procedure that can be used to identify proteins with similar active sites. Once unexpected protein targets have been identified, scientists can modify drugs under development in order to increase the simultaneous inhibition of multiple disease-causing proteins while avoiding potential side effects by decreasing the inhibition of useful human proteins.
Collapse
|
13
|
Sacher O, Reitz M, Gasteiger J. Investigations of Enzyme-Catalyzed Reactions Based on Physicochemical Descriptors Applied to Hydrolases. J Chem Inf Model 2009; 49:1525-34. [DOI: 10.1021/ci800277f] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Oliver Sacher
- Molecular Networks GmbH, Henkestrasse 91, D-91052 Erlangen, Germany, and Universität Erlangen-Nürnberg, Computer-Chemie-Centrum and Institute of Organic Chemistry, Nägelsbachstrasse 25, D-91052 Erlangen, Germany
| | - Martin Reitz
- Molecular Networks GmbH, Henkestrasse 91, D-91052 Erlangen, Germany, and Universität Erlangen-Nürnberg, Computer-Chemie-Centrum and Institute of Organic Chemistry, Nägelsbachstrasse 25, D-91052 Erlangen, Germany
| | - Johann Gasteiger
- Molecular Networks GmbH, Henkestrasse 91, D-91052 Erlangen, Germany, and Universität Erlangen-Nürnberg, Computer-Chemie-Centrum and Institute of Organic Chemistry, Nägelsbachstrasse 25, D-91052 Erlangen, Germany
| |
Collapse
|
14
|
Abstract
Chemically similar drugs often bind biologically diverse protein targets, and proteins with similar sequences or structures do not always recognize the same ligands. How can we uncover the pharmacological relationships among proteins, when drugs may bind them in defiance of bioinformatic criteria? Here we consider a technique that quantitatively relates proteins based on the chemical similarity of their ligands. Starting with tens of thousands of ligands organized into sets for hundreds of drug targets, we calculated the similarity among sets using ligand topology. We developed a statistical model to rank the resulting scores, which were then expressed in minimum spanning trees. We have shown that biologically sensible groups of targets emerged from these maps, as well as experimentally validated predictions of drug off-target effects.
Collapse
Affiliation(s)
- Michael J Keiser
- Department of Pharmaceutical Chemistry, University of California San Francisco, San Francisco, CA, USA
| | | |
Collapse
|
15
|
Sutherland JJ, Higgs RE, Watson I, Vieth M. Chemical Fragments as Foundations for Understanding Target Space and Activity Prediction. J Med Chem 2008; 51:2689-700. [DOI: 10.1021/jm701399f] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Jeffrey J. Sutherland
- Discovery Informatics, Discovery Statistics, and Discovery Chemistry of Lilly Research Laboratories, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, Indiana 46285
| | - Richard E. Higgs
- Discovery Informatics, Discovery Statistics, and Discovery Chemistry of Lilly Research Laboratories, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, Indiana 46285
| | - Ian Watson
- Discovery Informatics, Discovery Statistics, and Discovery Chemistry of Lilly Research Laboratories, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, Indiana 46285
| | - Michal Vieth
- Discovery Informatics, Discovery Statistics, and Discovery Chemistry of Lilly Research Laboratories, Eli Lilly and Company, Lilly Corporate Center, Indianapolis, Indiana 46285
| |
Collapse
|
16
|
Hert J, Keiser MJ, Irwin JJ, Oprea TI, Shoichet BK. Quantifying the relationships among drug classes. J Chem Inf Model 2008; 48:755-65. [PMID: 18335977 PMCID: PMC2722950 DOI: 10.1021/ci8000259] [Citation(s) in RCA: 115] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
Abstract
The similarity of drug targets is typically measured using sequence or structural information. Here, we consider chemo-centric approaches that measure target similarity on the basis of their ligands, asking how chemoinformatics similarities differ from those derived bioinformatically, how stable the ligand networks are to changes in chemoinformatics metrics, and which network is the most reliable for prediction of pharmacology. We calculated the similarities between hundreds of drug targets and their ligands and mapped the relationship between them in a formal network. Bioinformatics networks were based on the BLAST similarity between sequences, while chemoinformatics networks were based on the ligand-set similarities calculated with either the Similarity Ensemble Approach (SEA) or a method derived from Bayesian statistics. By multiple criteria, bioinformatics and chemoinformatics networks differed substantially, and only occasionally did a high sequence similarity correspond to a high ligand-set similarity. In contrast, the chemoinformatics networks were stable to the method used to calculate the ligand-set similarities and to the chemical representation of the ligands. Also, the chemoinformatics networks were more natural and more organized, by network theory, than their bioinformatics counterparts: ligand-based networks were found to be small-world and broad-scale.
Collapse
Affiliation(s)
- Jérôme Hert
- Department of Pharmaceutical Chemistry, University of California—San Francisco, 1700 4th St., San Francisco, California 94143-2550
| | - Michael J. Keiser
- Department of Pharmaceutical Chemistry, University of California—San Francisco, 1700 4th St., San Francisco, California 94143-2550
| | - John J. Irwin
- Department of Pharmaceutical Chemistry, University of California—San Francisco, 1700 4th St., San Francisco, California 94143-2550
| | - Tudor I. Oprea
- Division of Biocomputing, MSC11 6145, University of New Mexico School of Medicine, 2703 Frontier NE, Albuquerque, New Mexico 87131
| | - Brian K. Shoichet
- Department of Pharmaceutical Chemistry, University of California—San Francisco, 1700 4th St., San Francisco, California 94143-2550
| |
Collapse
|
17
|
Predicting Selectivity and Druggability in Drug Discovery. ANNUAL REPORTS IN COMPUTATIONAL CHEMISTRY 2008. [DOI: 10.1016/s1574-1400(08)00002-9] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
18
|
Keiser MJ, Roth BL, Armbruster BN, Ernsberger P, Irwin JJ, Shoichet BK. Relating protein pharmacology by ligand chemistry. Nat Biotechnol 2007; 25:197-206. [PMID: 17287757 DOI: 10.1038/nbt1284] [Citation(s) in RCA: 1495] [Impact Index Per Article: 83.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
The identification of protein function based on biological information is an area of intense research. Here we consider a complementary technique that quantitatively groups and relates proteins based on the chemical similarity of their ligands. We began with 65,000 ligands annotated into sets for hundreds of drug targets. The similarity score between each set was calculated using ligand topology. A statistical model was developed to rank the significance of the resulting similarity scores, which are expressed as a minimum spanning tree to map the sets together. Although these maps are connected solely by chemical similarity, biologically sensible clusters nevertheless emerged. Links among unexpected targets also emerged, among them that methadone, emetine and loperamide (Imodium) may antagonize muscarinic M3, alpha2 adrenergic and neurokinin NK2 receptors, respectively. These predictions were subsequently confirmed experimentally. Relating receptors by ligand chemistry organizes biology to reveal unexpected relationships that may be assayed using the ligands themselves.
Collapse
Affiliation(s)
- Michael J Keiser
- Department of Pharmaceutical Chemistry, University of California San Francisco, 1700 4th St, San Francisco California 94143-2550, USA
| | | | | | | | | | | |
Collapse
|
19
|
Cummings MD, Farnum MA, Nelen MI. Universal screening methods and applications of ThermoFluor. ACTA ACUST UNITED AC 2006; 11:854-63. [PMID: 16943390 DOI: 10.1177/1087057106292746] [Citation(s) in RCA: 141] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
Abstract
The genomics revolution has unveiled a wealth of poorly characterized proteins. Scientists are often able to produce milligram quantities of proteins for which function is unknown or hypothetical, based only on very distant sequence homology. Broadly applicable tools for functional characterization are essential to the illumination of these orphan proteins. An additional challenge is the direct detection of inhibitors of protein-protein interactions (and allosteric effectors). Both of these research problems are relevant to, among other things, the challenge of finding and validating new protein targets for drug action. Screening collections of small molecules has long been used in the pharmaceutical industry as 1 method of discovering drug leads. Screening in this context typically involves a function-based assay. Given a sufficient quantity of a protein of interest, significant effort may still be required for functional characterization, assay development, and assay configuration for screening. Increasingly, techniques are being reported that facilitate screening for specific ligands for a protein of unknown function. Such techniques also allow for function-independent screening with better characterized proteins. ThermoFluor, a screening instrument based on monitoring ligand effects on temperature-dependent protein unfolding, can be applied when protein function is unknown. This technology has proven useful in the decryption of an essential bacterial enzyme and in the discovery of a series of inhibitors of a cancer-related, protein-protein interaction. The authors review some of the tools relevant to these research problems in drug discovery, and describe our experiences with 2 different proteins.
Collapse
Affiliation(s)
- Maxwell D Cummings
- Johnson & Johnson Pharmaceutical Research & Development, L.L.C., Exton, PA 19341, USA.
| | | | | |
Collapse
|