Reference Citation Analysis: Find an Article, Find a Category, Find a Journal, Find a Scholar

For: Good AC, Hermsmeier MA, Hindle SA. Measuring CAMD technique performance: a virtual screening case study in the design of validation experiments. J Comput Aided Mol Des 2005;18:529-36. [PMID: 15729852 DOI: 10.1007/s10822-004-4067-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

For:	Good AC, Hermsmeier MA, Hindle SA. Measuring CAMD technique performance: a virtual screening case study in the design of validation experiments. J Comput Aided Mol Des 2005;18:529-36. [PMID: 15729852 DOI: 10.1007/s10822-004-4067-1] [Citation(s) in RCA: 44] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]

Number

Cited by Other Article(s)

Sciabola S, Torella R, Nagata A, Boehm M. Critical Assessment of State‐of‐the‐Art Ligand‐Based Virtual Screening Methods. Mol Inform 2022;41:e2200103. [DOI: 10.1002/minf.202200103] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2022] [Accepted: 07/24/2022] [Indexed: 11/10/2022]

Li M, Hu J, Wang Y, Li Y, Zhang L, Liu Z. Challenging Reverse Screening: A Benchmark Study for Comprehensive Evaluation. Mol Inform 2021;41:e2100063. [PMID: 34787366 DOI: 10.1002/minf.202100063] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/16/2021] [Accepted: 10/15/2021] [Indexed: 11/08/2022]

Šribar D, Grabowski M, Murgueitio MS, Bermudez M, Weindl G, Wolber G. Identification and characterization of a novel chemotype for human TLR8 inhibitors. Eur J Med Chem 2019;179:744-752. [DOI: 10.1016/j.ejmech.2019.06.084] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2019] [Revised: 06/27/2019] [Accepted: 06/28/2019] [Indexed: 10/26/2022]

Evaluation of different virtual screening strategies on the basis of compound sets with characteristic core distributions and dissimilarity relationships. J Comput Aided Mol Des 2019;33:729-743. [DOI: 10.1007/s10822-019-00218-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 08/13/2019] [Indexed: 02/07/2023]

Tutone M, Perricone U, Almerico AM. Conf-VLKA: A structure-based revisitation of the Virtual Lock-and-key Approach. J Mol Graph Model 2016;71:50-57. [PMID: 27842227 DOI: 10.1016/j.jmgm.2016.11.006] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2016] [Revised: 11/03/2016] [Accepted: 11/07/2016] [Indexed: 02/02/2023]

Kim S. Getting the most out of PubChem for virtual screening. Expert Opin Drug Discov 2016;11:843-55. [PMID: 27454129 PMCID: PMC5045798 DOI: 10.1080/17460441.2016.1216967] [Citation(s) in RCA: 86] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023]

Ibrahim TM, Bauer MR, Dörr A, Veyisoglu E, Boeckler FM. pROC-Chemotype Plots Enhance the Interpretability of Benchmarking Results in Structure-Based Virtual Screening. J Chem Inf Model 2015;55:2297-307. [DOI: 10.1021/acs.jcim.5b00475] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Lagarde N, Zagury JF, Montes M. Benchmarking Data Sets for the Evaluation of Virtual Ligand Screening Methods: Review and Perspectives. J Chem Inf Model 2015;55:1297-307. [PMID: 26038804 DOI: 10.1021/acs.jcim.5b00090] [Citation(s) in RCA: 59] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]

Lindh M, Svensson F, Schaal W, Zhang J, Sköld C, Brandt P, Karlén A. Toward a Benchmarking Data Set Able to Evaluate Ligand- and Structure-based Virtual Screening Using Public HTS Data. J Chem Inf Model 2015;55:343-53. [DOI: 10.1021/ci5005465] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]

Hamza A, Wagner JM, Wei NN, Kwiatkowski S, Zhan CG, Watt DS, Korotkov KV. Application of the 4D fingerprint method with a robust scoring function for scaffold-hopping and drug repurposing strategies. J Chem Inf Model 2014;54:2834-45. [PMID: 25229183 PMCID: PMC4210175 DOI: 10.1021/ci5003872] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Rosenbaum L, Jahn A, Dörr A, Zell A. Optimization and visualization of the edge weights in optimal assignment methods for virtual screening. BioData Min 2013;6:7. [PMID: 23531368 PMCID: PMC3639874 DOI: 10.1186/1756-0381-6-7] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2012] [Accepted: 03/10/2013] [Indexed: 11/17/2022] Open

Abstract

BACKGROUND

Ligand-based virtual screening plays a fundamental part in the early drug discovery stage. In a virtual screening, a chemical library is searched for molecules with similar properties to a query molecule by means of a similarity function. The optimal assignment of chemical graphs has proven to be a valuable similarity function for many cheminformatic tasks, such as virtual screening. The optimal assignment assumes all atoms of a query molecule to be equally important, which is not realistic depending on the binding mode of a ligand. The importance of a query molecule's atoms can be integrated in the optimal assignment by weighting the assignment edges. We optimized the edge weights with respect to the virtual screening performance by means of evolutionary algorithms. Furthermore, we propose a visualization approach for the interpretation of the edge weights.

RESULTS

We evaluated two different evolutionary algorithms, differential evolution and particle swarm optimization, for their suitability for optimizing the assignment edge weights. The results showed that both optimization methods are suited to optimize the edge weights. Furthermore, we compared our approach to the optimal assignment with equal edge weights and two literature similarity functions on a subset of the Directory of Useful Decoys using sophisticated virtual screening performance metrics. Our approach achieved a considerably better overall and early enrichment performance. The visualization of the edge weights enables the identification of substructures that are important for a good retrieval of ligands and for the binding to the protein target.

CONCLUSIONS

The optimization of the edge weights in optimal assignment methods is a valuable approach for ligand-based virtual screening experiments. The approach can be applied to any similarity function that employs the optimal assignment method, which includes a variety of similarity measures that have proven to be valuable in various cheminformatic tasks. The proposed visualization helps to get a better understanding of the binding mode of the analyzed query molecule.

Collapse

Kim S, Bolton EE, Bryant SH. Effects of multiple conformers per compound upon 3-D similarity search and bioassay data analysis. J Cheminform 2012;4:28. [PMID: 23134593 PMCID: PMC3537644 DOI: 10.1186/1758-2946-4-28] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2012] [Accepted: 10/03/2012] [Indexed: 01/08/2023] Open

Abstract

Background

To improve the utility of PubChem, a public repository containing biological activities of small molecules, the PubChem3D project adds computationally-derived three-dimensional (3-D) descriptions to the small-molecule records contained in the PubChem Compound database and provides various search and analysis tools that exploit 3-D molecular similarity. Therefore, the efficient use of PubChem3D resources requires an understanding of the statistical and biological meaning of computed 3-D molecular similarity scores between molecules.

Results

The present study investigated effects of employing multiple conformers per compound upon the 3-D similarity scores between ten thousand randomly selected biologically-tested compounds (10-K set) and between non-inactive compounds in a given biological assay (156-K set). When the “best-conformer-pair” approach, in which a 3-D similarity score between two compounds is represented by the greatest similarity score among all possible conformer pairs arising from a compound pair, was employed with ten diverse conformers per compound, the average 3-D similarity scores for the 10-K set increased by 0.11, 0.09, 0.15, 0.16, 0.07, and 0.18 for ST^ST-opt, CT^ST-opt, ComboT^ST-opt, ST^CT-opt, CT^CT-opt, and ComboT^CT-opt, respectively, relative to the corresponding averages computed using a single conformer per compound. Interestingly, the best-conformer-pair approach also increased the average 3-D similarity scores for the non-inactive–non-inactive (NN) pairs for a given assay, by comparable amounts to those for the random compound pairs, although some assays showed a pronounced increase in the per-assay NN-pair 3-D similarity scores, compared to the average increase for the random compound pairs.

Conclusion

These results suggest that the use of ten diverse conformers per compound in PubChem bioassay data analysis using 3-D molecular similarity is not expected to increase the separation of non-inactive from random and inactive spaces “on average”, although some assays show a noticeable separation between the non-inactive and random spaces when multiple conformers are used for each compound. The present study is a critical next step to understand effects of conformational diversity of the molecules upon the 3-D molecular similarity and its application to biological activity data analysis in PubChem. The results of this study may be helpful to build search and analysis tools that exploit 3-D molecular similarity between compounds archived in PubChem and other molecular libraries in a more efficient way.

Collapse

Ripphausen P, Wassermann AM, Bajorath J. REPROVIS-DB: A Benchmark System for Ligand-Based Virtual Screening Derived from Reproducible Prospective Applications. J Chem Inf Model 2011;51:2467-73. [DOI: 10.1021/ci200309j] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Vogel SM, Bauer MR, Boeckler FM. DEKOIS: Demanding Evaluation Kits for Objective in Silico Screening — A Versatile Tool for Benchmarking Docking Programs and Scoring Functions. J Chem Inf Model 2011;51:2650-65. [DOI: 10.1021/ci2001549] [Citation(s) in RCA: 54] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Jahn A, Rosenbaum L, Hinselmann G, Zell A. 4D Flexible Atom-Pairs: An efficient probabilistic conformational space comparison for ligand-based virtual screening. J Cheminform 2011;3:23. [PMID: 21733172 PMCID: PMC3156737 DOI: 10.1186/1758-2946-3-23] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/29/2011] [Accepted: 07/06/2011] [Indexed: 01/28/2023] Open

Koeppen H, Kriegl J, Lessel U, Tautermann CS, Wellenzohn B. Ligand-Based Virtual Screening. METHODS AND PRINCIPLES IN MEDICINAL CHEMISTRY 2011. [DOI: 10.1002/9783527633326.ch3] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]

Willett P. Similarity methods in chemoinformatics. ACTA ACUST UNITED AC 2011. [DOI: 10.1002/aris.2009.1440430108] [Citation(s) in RCA: 53] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]

Khanna V, Ranganathan S. Molecular similarity and diversity approaches in chemoinformatics. Drug Dev Res 2010. [DOI: 10.1002/ddr.20404] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]

Giganti D, Guillemain H, Spadoni JL, Nilges M, Zagury JF, Montes M. Comparative evaluation of 3D virtual ligand screening methods: impact of the molecular alignment on enrichment. J Chem Inf Model 2010;50:992-1004. [PMID: 20527883 DOI: 10.1021/ci900507g] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Bender A. How similar are those molecules after all? Use two descriptors and you will have three different answers. Expert Opin Drug Discov 2010;5:1141-51. [DOI: 10.1517/17460441.2010.517832] [Citation(s) in RCA: 38] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]

Murata K, Nagata N, Nakanishi I, Kitaura K. SDOVS: A solvent dipole ordering-based method for virtual screening. J Comput Chem 2010;31:2714-22. [DOI: 10.1002/jcc.21565] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]

Geppert H, Vogt M, Bajorath J. Current trends in ligand-based virtual screening: molecular representations, data mining methods, new application areas, and performance evaluation. J Chem Inf Model 2010;50:205-16. [PMID: 20088575 DOI: 10.1021/ci900419k] [Citation(s) in RCA: 231] [Impact Index Per Article: 16.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]

Krüger DM, Evers A. Comparison of structure- and ligand-based virtual screening protocols considering hit list complementarity and enrichment factors. ChemMedChem 2010;5:148-58. [PMID: 19908272 DOI: 10.1002/cmdc.200900314] [Citation(s) in RCA: 87] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]

Fechner N, Jahn A, Hinselmann G, Zell A. Estimation of the applicability domain of kernel-based machine learning models for virtual screening. J Cheminform 2010;2:2. [PMID: 20222949 PMCID: PMC2851576 DOI: 10.1186/1758-2946-2-2] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/15/2009] [Accepted: 03/11/2010] [Indexed: 11/10/2022] Open

Abstract

BACKGROUND

The virtual screening of large compound databases is an important application of structural-activity relationship models. Due to the high structural diversity of these data sets, it is impossible for machine learning based QSAR models, which rely on a specific training set, to give reliable results for all compounds. Thus, it is important to consider the subset of the chemical space in which the model is applicable. The approaches to this problem that have been published so far mostly use vectorial descriptor representations to define this domain of applicability of the model. Unfortunately, these cannot be extended easily to structured kernel-based machine learning models. For this reason, we propose three approaches to estimate the domain of applicability of a kernel-based QSAR model.

RESULTS

We evaluated three kernel-based applicability domain estimations using three different structured kernels on three virtual screening tasks. Each experiment consisted of the training of a kernel-based QSAR model using support vector regression and the ranking of a disjoint screening data set according to the predicted activity. For each prediction, the applicability of the model for the respective compound is quantitatively described using a score obtained by an applicability domain formulation. The suitability of the applicability domain estimation is evaluated by comparing the model performance on the subsets of the screening data sets obtained by different thresholds for the applicability scores. This comparison indicates that it is possible to separate the part of the chemspace, in which the model gives reliable predictions, from the part consisting of structures too dissimilar to the training set to apply the model successfully. A closer inspection reveals that the virtual screening performance of the model is considerably improved if half of the molecules, those with the lowest applicability scores, are omitted from the screening.

CONCLUSION

The proposed applicability domain formulations for kernel-based QSAR models can successfully identify compounds for which no reliable predictions can be expected from the model. The resulting reduction of the search space and the elimination of some of the active compounds should not be considered as a drawback, because the results indicate that, in most cases, these omitted ligands would not be found by the model anyway.

Collapse

Leach AR, Gillet VJ, Lewis RA, Taylor R. Three-dimensional pharmacophore methods in drug discovery. J Med Chem 2010;53:539-58. [PMID: 19831387 DOI: 10.1021/jm900817u] [Citation(s) in RCA: 264] [Impact Index Per Article: 18.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]

Hessler G, Baringhaus KH. The scaffold hopping potential of pharmacophores. DRUG DISCOVERY TODAY. TECHNOLOGIES 2010;7:e203-e270. [PMID: 24103802 DOI: 10.1016/j.ddtec.2010.09.001] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/02/2023]

Willett P. Similarity searching using 2D structural fingerprints. Methods Mol Biol 2010;672:133-58. [PMID: 20838967 DOI: 10.1007/978-1-60761-839-3_5] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]

Medina-Franco J, MartÃnez-Mayorga K, Bender A, Scior T. Scaffold Diversity Analysis of Compound Data Sets Using an Entropy-Based Measure. ACTA ACUST UNITED AC 2009. [DOI: 10.1002/qsar.200960069] [Citation(s) in RCA: 69] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023]

Jahn A, Hinselmann G, Fechner N, Zell A. Optimal assignment methods for ligand-based virtual screening. J Cheminform 2009;1:14. [PMID: 20150995 PMCID: PMC2820492 DOI: 10.1186/1758-2946-1-14] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2009] [Accepted: 08/25/2009] [Indexed: 11/10/2022] Open

Hammerling U, Tallsjö A, Grafström R, Ilbäck NG. Comparative Hazard Characterization in Food Toxicology. Crit Rev Food Sci Nutr 2009;49:626-69. [DOI: 10.1080/10408390802145617] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]

Rohrer SG, Baumann K. Maximum unbiased validation (MUV) data sets for virtual screening based on PubChem bioactivity data. J Chem Inf Model 2009;49:169-84. [PMID: 19434821 DOI: 10.1021/ci8002649] [Citation(s) in RCA: 223] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Wong WW, Burkowski FJ. A constructive approach for discovering new drug leads: Using a kernel methodology for the inverse-QSAR problem. J Cheminform 2009;1:4. [PMID: 20142987 PMCID: PMC2816860 DOI: 10.1186/1758-2946-1-4] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2009] [Accepted: 04/28/2009] [Indexed: 12/04/2022] Open

Abstract

Background

The inverse-QSAR problem seeks to find a new molecular descriptor from which one can recover the structure of a molecule that possess a desired activity or property. Surprisingly, there are very few papers providing solutions to this problem. It is a difficult problem because the molecular descriptors involved with the inverse-QSAR algorithm must adequately address the forward QSAR problem for a given biological activity if the subsequent recovery phase is to be meaningful. In addition, one should be able to construct a feasible molecule from such a descriptor. The difficulty of recovering the molecule from its descriptor is the major limitation of most inverse-QSAR methods.

Results

In this paper, we describe the reversibility of our previously reported descriptor, the vector space model molecular descriptor (VSMMD) based on a vector space model that is suitable for kernel studies in QSAR modeling. Our inverse-QSAR approach can be described using five steps: (1) generate the VSMMD for the compounds in the training set; (2) map the VSMMD in the input space to the kernel feature space using an appropriate kernel function; (3) design or generate a new point in the kernel feature space using a kernel feature space algorithm; (4) map the feature space point back to the input space of descriptors using a pre-image approximation algorithm; (5) build the molecular structure template using our VSMMD molecule recovery algorithm.

Conclusion

The empirical results reported in this paper show that our strategy of using kernel methodology for an inverse-Quantitative Structure-Activity Relationship is sufficiently powerful to find a meaningful solution for practical problems.

Electronic supplementary material

The online version of this article (doi:10.1186/1758-2946-1-4) contains supplementary material, which is available to authorized users.

Collapse

Mackey MD, Melville JL. Better than Random? The Chemotype Enrichment Problem. J Chem Inf Model 2009;49:1154-62. [DOI: 10.1021/ci8003978] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

von Korff M, Freyss J, Sander T. Comparison of Ligand- and Structure-Based Virtual Screening on the DUD Data Set. J Chem Inf Model 2009;49:209-31. [DOI: 10.1021/ci800303k] [Citation(s) in RCA: 73] [Impact Index Per Article: 4.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Lessel U, Wellenzohn B, Lilienthal M, Claussen H. Searching Fragment Spaces with Feature Trees. J Chem Inf Model 2009;49:270-9. [DOI: 10.1021/ci800272a] [Citation(s) in RCA: 66] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Cheeseright TJ, Mackey MD, Melville JL, Vinter JG. FieldScreen: Virtual Screening Using Molecular Fields. Application to the DUD Data Set. J Chem Inf Model 2008;48:2108-17. [DOI: 10.1021/ci800110p] [Citation(s) in RCA: 103] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]

Boehm M, Wu TY, Claussen H, Lemmen C. Similarity Searching and Scaffold Hopping in Synthetically Accessible Combinatorial Chemistry Spaces. J Med Chem 2008;51:2468-80. [DOI: 10.1021/jm0707727] [Citation(s) in RCA: 76] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]

Rohrer SG, Baumann K. Impact of Benchmark Data Set Topology on the Validation of Virtual Screening Methods: Exploration and Quantification by Spatial Statistics. J Chem Inf Model 2008;48:704-18. [DOI: 10.1021/ci700099u] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]

Clark RD, Webster-Clark DJ. Managing bias in ROC curves. J Comput Aided Mol Des 2008;22:141-6. [PMID: 18256892 DOI: 10.1007/s10822-008-9181-z] [Citation(s) in RCA: 77] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2007] [Accepted: 01/14/2008] [Indexed: 10/22/2022]

Good AC, Oprea TI. Optimization of CAMD techniques 3. Virtual screening enrichment studies: a help or hindrance in tool selection? J Comput Aided Mol Des 2008;22:169-78. [DOI: 10.1007/s10822-007-9167-2] [Citation(s) in RCA: 153] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/30/2007] [Accepted: 12/19/2007] [Indexed: 11/28/2022]

Lin HH, Han LY, Yap CW, Xue Y, Liu XH, Zhu F, Chen YZ. Prediction of factor Xa inhibitors by machine learning methods. J Mol Graph Model 2007;26:505-18. [PMID: 17418603 DOI: 10.1016/j.jmgm.2007.03.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/22/2006] [Revised: 02/04/2007] [Accepted: 03/07/2007] [Indexed: 01/04/2023]

Fontaine F, Bolton E, Borodina Y, Bryant SH. Fast 3D shape screening of large chemical databases through alignment-recycling. Chem Cent J 2007;1:12. [PMID: 17880744 PMCID: PMC1994057 DOI: 10.1186/1752-153x-1-12] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2007] [Accepted: 06/06/2007] [Indexed: 11/25/2022] Open

Abstract

BACKGROUND

Large chemical databases require fast, efficient, and simple ways of looking for similar structures. Although such tasks are now fairly well resolved for graph-based similarity queries, they remain an issue for 3D approaches, particularly for those based on 3D shape overlays. Inspired by a recent technique developed to compare molecular shapes, we designed a hybrid methodology, alignment-recycling, that enables efficient retrieval and alignment of structures with similar 3D shapes.

RESULTS

Using a dataset of more than one million PubChem compounds of limited size (< 28 heavy atoms) and flexibility (< 6 rotatable bonds), we obtained a set of a few thousand diverse structures covering entirely the 3D shape space of the conformers of the dataset. Transformation matrices gathered from the overlays between these diverse structures and the 3D conformer dataset allowed us to drastically (100-fold) reduce the CPU time required for shape overlay. The alignment-recycling heuristic produces results consistent with de novo alignment calculation, with better than 80% hit list overlap on average.

CONCLUSION

Overlay-based 3D methods are computationally demanding when searching large databases. Alignment-recycling reduces the CPU time to perform shape similarity searches by breaking the alignment problem into three steps: selection of diverse shapes to describe the database shape-space; overlay of the database conformers to the diverse shapes; and non-optimized overlay of query and database conformers using common reference shapes. The precomputation, required by the first two steps, is a significant cost of the method; however, once performed, querying is two orders of magnitude faster. Extensions and variations of this methodology, for example, to handle more flexible and larger small-molecules are discussed.

Collapse

Perekhodtsev G. Neighborhood Behavior: Validation of Two-Dimensional Molecular Similarity as a Predictor of Similar Biological Activities and Docking Scores. ACTA ACUST UNITED AC 2007. [DOI: 10.1002/qsar.200610052] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]

Williams C. Reverse fingerprinting, similarity searching by group fusion and fingerprint bit importance. Mol Divers 2006;10:311-32. [PMID: 17031535 DOI: 10.1007/s11030-006-9039-z] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2005] [Accepted: 01/25/2006] [Indexed: 11/29/2022]

Davies JW, Glick M, Jenkins JL. Streamlining lead discovery by aligning in silico and high-throughput screening. Curr Opin Chem Biol 2006;10:343-51. [PMID: 16822701 DOI: 10.1016/j.cbpa.2006.06.022] [Citation(s) in RCA: 60] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2006] [Accepted: 06/21/2006] [Indexed: 12/01/2022]