1
|
Positioning of an unprecedented spiro[5.5]undeca ring system into kinase inhibitor space. Sci Rep 2020; 10:21265. [PMID: 33277542 PMCID: PMC7719162 DOI: 10.1038/s41598-020-78158-9] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2020] [Accepted: 11/20/2020] [Indexed: 11/09/2022] Open
Abstract
In-house 1,5-oxaza spiroquinone 1, with spiro[5.5]undeca ring system, was announced as an unprecedented anti-inflammatory scaffold through chemistry-oriented synthesis (ChOS), a chemocentric approach. Herein, we studied how to best position the spiro[5.5]undeca ring system in kinase inhibitor space. Notably, late-stage modification of the scaffold 1 into compounds 2a-r enhanced kinase-likeness of the scaffold 1. The improvement could be depicted with (1) selectivity with target shift (from JNK-1 into GSK-3) and (2) potency (> 20-fold). In addition, ATP independent IC50 of compound 2j suggested a unique binding mode of this scaffold between ATP site and substrate site, which was explained by docking based optimal site selection and molecular dynamic simulations of the optimal binding site. Despite the shift of kinase profiling, the anti-inflammatory activity of compounds 2a-r could be retained in hyperactivated microglial cells.
Collapse
|
2
|
Repurposing of FDA approved ring systems through bi-directional target-ring system dual screening. Sci Rep 2020; 10:21133. [PMID: 33273509 PMCID: PMC7713353 DOI: 10.1038/s41598-020-78077-9] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Accepted: 10/28/2020] [Indexed: 12/13/2022] Open
Abstract
In drug repurposing approaches, the chemically diverse and potentially safe molecules can be explored as therapeutic potential beyond those originally targeted indications. However, accessible information on a limited number of drug pipelines can lead to competitive over-heating issues, and intellectual property rights also restrict the free investigation in chemical space. As a complementary approach to the drawbacks, ring systems of approved drugs (instead of clinical drugs) can be optimized and used for repurposing purposes. In this study, bi-directional target (T) and ring system (R) dual screening (TR screening) was developed for the repurposing of their rarely used ring systems from FDA approved drugs. The TR screening suggested RAR β and cyproheptadine as the best pair of target and ring system to escape a saddle point. The selected ring system was virtually grown and elaborated with the defined criteria: synthesizability, drug-likeness, and docking pose showing the top scores. The achieved compounds were synthesized and biologically tested with an acceptable ADME/T profile.
Collapse
|
3
|
Comparing a Query Compound with Drug Target Classes Using 3D-Chemical Similarity. Int J Mol Sci 2020; 21:ijms21124208. [PMID: 32545691 PMCID: PMC7352980 DOI: 10.3390/ijms21124208] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2020] [Revised: 06/01/2020] [Accepted: 06/11/2020] [Indexed: 12/16/2022] Open
Abstract
3D similarity is useful in predicting the profiles of unprecedented molecular frameworks that are 2D dissimilar to known compounds. When comparing pairs of compounds, 3D similarity of the pairs depends on conformational sampling, the alignment method, the chosen descriptors, and the similarity coefficients. In addition to these four factors, 3D chemocentric target prediction of an unknown compound requires compound-target associations, which replace compound-to-compound comparisons with compound-to-target comparisons. In this study, quantitative comparison of query compounds to target classes (one-to-group) was achieved via two types of 3D similarity distributions for the respective target class with parameter optimization for the fitting models: (1) maximum likelihood (ML) estimation of queries, and (2) the Gaussian mixture model (GMM) of target classes. While Jaccard-Tanimoto similarity of query-to-ligand pairs with 3D structures (sampled multi-conformers) can be transformed into query distribution using ML estimation, the ligand pair similarity within each target class can be transformed into a representative distribution of a target class through GMM, which is hyperparameterized via the expectation-maximization (EM) algorithm. To quantify the discriminativeness of a query ligand against target classes, the Kullback-Leibler (K-L) divergence of each query was calculated and compared between targets. 3D similarity-based K-L divergence together with the probability and the feasibility index, (Fm), showed discriminative power with regard to some query-class associations. The K-L divergence of 3D similarity distributions can be an additional method for (1) the rank of the 3D similarity score or (2) the p-value of one 3D similarity distribution to predict the target of unprecedented drug scaffolds.
Collapse
|
4
|
Prosser K, Stokes RW, Cohen SM. Evaluation of 3-Dimensionality in Approved and Experimental Drug Space. ACS Med Chem Lett 2020; 11:1292-1298. [PMID: 32551014 PMCID: PMC7294711 DOI: 10.1021/acsmedchemlett.0c00121] [Citation(s) in RCA: 42] [Impact Index Per Article: 8.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 05/18/2020] [Indexed: 11/28/2022] Open
Abstract
The 3-dimensional (3D) structure of therapeutics and other bioactive molecules is an important factor in determining the strength and selectivity of their protein-ligand interactions. Previous efforts have considered the strain introduced and tolerated through conformational changes induced upon protein binding. Herein, we present an analysis of 3-dimentionality for energy-minimized structures from the DrugBank and ligands bound to proteins identified in the Protein Data Bank (PDB). This analysis reveals that the majority of molecules found in both the DrugBank and the PDB tend toward linearity and planarity, with few molecules having highly 3D conformations. Decidedly 3D geometries have been historically difficult to achieve, likely due to the synthetic challenge of making 3D organic molecules, and other considerations, such as adherence to the 'rule-of-five'. This has resulted in the dominance of planar and/or linear topologies of the molecules described here. Strategies to address the generally flat nature of these data sets are explored, including the use of 3D organic fragments and inorganic scaffolds as a means of accessing privileged 3D space. This work highlights the potential utility of libraries with greater 3D topological diversity so that the importance of molecular shape to biological behavior can be more fully understood in drug discovery campaigns.
Collapse
Affiliation(s)
- Kathleen
E. Prosser
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093, United States
| | - Ryjul W. Stokes
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093, United States
| | - Seth M. Cohen
- Department of Chemistry and Biochemistry, University of California, San Diego, La Jolla, California 92093, United States
| |
Collapse
|
5
|
Teli MK, Kumar S, Yadav DK, Kim MH. In silico identification of hydantoin derivatives: a novel natural prolyl hydroxylase inhibitor. J Biomol Struct Dyn 2020; 39:703-717. [DOI: 10.1080/07391102.2020.1714480] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Affiliation(s)
- Mahesh Kumar Teli
- Gachon Institute of Pharmaceutical Science & College of Pharmacy, Gachon University, Incheon, Korea
| | - Surendra Kumar
- Gachon Institute of Pharmaceutical Science & College of Pharmacy, Gachon University, Incheon, Korea
| | - Dharmendra Kumar Yadav
- Gachon Institute of Pharmaceutical Science & College of Pharmacy, Gachon University, Incheon, Korea
| | - Mi-hyun Kim
- Gachon Institute of Pharmaceutical Science & College of Pharmacy, Gachon University, Incheon, Korea
| |
Collapse
|
6
|
Balupuri A, Balasubramanian PK, Cho SJ. 3D-QSAR, docking, molecular dynamics simulation and free energy calculation studies of some pyrimidine derivatives as novel JAK3 inhibitors. ARAB J CHEM 2020. [DOI: 10.1016/j.arabjc.2017.09.009] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022] Open
|
7
|
Lee J, Kumar S, Lee SY, Park SJ, Kim MH. Development of Predictive Models for Identifying Potential S100A9 Inhibitors Based on Machine Learning Methods. Front Chem 2019; 7:779. [PMID: 31824919 PMCID: PMC6886474 DOI: 10.3389/fchem.2019.00779] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2019] [Accepted: 10/29/2019] [Indexed: 01/05/2023] Open
Abstract
S100A9 is a potential therapeutic target for various disease including prostate cancer, colorectal cancer, and Alzheimer's disease. However, the sparsity of atomic level data, such as protein-protein interaction of S100A9 with RAGE, TLR4/MD2, or CD147 (EMMPRIN) hinders the rational drug design of S100A9 inhibitors. Herein we first report predictive models of S100A9 inhibitory effect by applying machine learning classifiers on 2D-molecular descriptors. The models were optimized through feature selectors as well as classifiers to produce the top eight random forest models with robust predictability and high cost-effectiveness. Notably, optimal feature sets were obtained after the reduction of 2,798 features into dozens of features with the chopping of fingerprint bits. Moreover, the high efficiency of compact feature sets allowed us to further screen a large-scale dataset (over 6,000,000 compounds) within a week. Through a consensus vote of the top models, 46 hits (hit rate = 0.000713%) were identified as potential S100A9 inhibitors. We expect that our models will facilitate the drug discovery process by providing high predictive power as well as cost-reduction ability and give insights into designing novel drugs targeting S100A9.
Collapse
Affiliation(s)
- Jihyeun Lee
- Department of Pharmacy, Gachon Institute of Pharmaceutical Science, College of Pharmacy, Gachon University, Incheon, South Korea
| | - Surendra Kumar
- Department of Pharmacy, Gachon Institute of Pharmaceutical Science, College of Pharmacy, Gachon University, Incheon, South Korea
| | - Sang-Yoon Lee
- Gachon Advanced Institute for Health Science and Technology, Graduate School and Neuroscience Research Institute, Gachon University, Incheon, South Korea
| | - Sung Jean Park
- Department of Pharmacy, Gachon Institute of Pharmaceutical Science, College of Pharmacy, Gachon University, Incheon, South Korea
| | - Mi-hyun Kim
- Department of Pharmacy, Gachon Institute of Pharmaceutical Science, College of Pharmacy, Gachon University, Incheon, South Korea
| |
Collapse
|
8
|
Zeb A, Son M, Yoon S, Kim JH, Park SJ, Lee KW. Computational Simulations Identified Two Candidate Inhibitors of Cdk5/p25 to Abrogate Tau-associated Neurological Disorders. Comput Struct Biotechnol J 2019; 17:579-590. [PMID: 31073393 PMCID: PMC6495220 DOI: 10.1016/j.csbj.2019.04.010] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/23/2019] [Revised: 04/14/2019] [Accepted: 04/17/2019] [Indexed: 01/26/2023] Open
Abstract
Deregulation of Cdk5 is a hallmark in neurodegenerative diseases and its complex with p25 forms Cdk5/p25, thereby causes severe neuropathological insults. Cdk5/p25 abnormally phosphorylates tau protein, and induces tau-associated neurofibrillary tangles in neurological disorders. Therefore, the pharmacological inhibition of Cdk5/p25 alleviates tau-associated neurological disorders. Herein, computational simulations probed two candidate inhibitors of Cdk5/p25. Structure-based pharmacophore investigated the essential complementary chemical features of ATP-binding site of Cdk5 in complex with roscovitine. Resultant pharmacophore harbored polar interactions with Cys83 and Asp86 residues and non-polar interactions with Ile10, Phe80, and Lys133 residues of Cdk5. The chemical space of selected pharmacophore was comprised of two hydrogen bond donors, one hydrogen bond acceptor, and three hydrophobic features. Decoy test validation of pharmacophore obtained highest Guner-Henry score (0.88) and enrichment factor score (7.23). The screening of natural product drug-like databases by validated pharmacophore retrieved 1126 compounds as candidate inhibitors of Cdk5/p25. The docking of candidate inhibitors filtered 10 molecules with docking score >80.00 and established polar and non-polar interactions with the ATP-binding site residues of Cdk5/p25. Finally, molecular dynamics simulation and binding free energy analyses identified two candidate inhibitors of Cdk5/p25. During 30 ns simulation, the candidate inhibitors established <3.0 Å root mean square deviation and stable hydrogen bond interactions with the ATP-binding site residues of Cdk5/p25. The final candidate inhibitors obtained lowest binding free energies of -122.18 kJ/mol and - 117.26 kJ/mol with Cdk5/p25. Overall, we recommend two natural product candidate inhibitors to target the pharmacological inhibition of Cdk5/p25 in tau-associated neurological disorders.
Collapse
Key Words
- 2D, Two-dimentional
- 3D, Three-dimentional
- AD, Alzheimer's disease
- ADMET, Absorption, distribution, metabolism, excretion, and toxicity
- ASP, Astex statistical potential
- Aβ, Amyloid beta
- BBB, Blood-brain barrier
- CGMC, Cyclin-dependent kinases, mitogen-activated protein kinases, glycogen synthase kinases, and Cdk-like kinases
- Cdk5, Cyclin-dependent kinase 5
- Cdk5/p25 inhibitors
- Cdks, Cyclin-dependent kinases
- DS, Discovery Studio
- EF, Enrichment factor
- GA, Genetic algorithm
- GFA, Genetic Function Approximation
- GH, Guner-Henry
- GOLD, Genetic optimization of ligand docking
- GROMACS, Groningen Machine for Chemical Simulation
- H-bond, Hydrogen bond
- HBA, Hydrogen bond acceptor
- HBD, Hydrogen bond donor
- HD, Hungtington's disease
- HYP, Hydrophobic
- IBS, InterBioScreen
- K, kelvin
- MD, Molecular dynamics
- MPTP, 1-methyl-4-phenyl-1,2,3,6-tetrahydropyridine
- Molecular docking
- Molecular dynamics simulation
- NPT, Number particle, pressure, and temperature
- NVT, Number of particles, volume, and temperature
- P5, A 24-residues mimetic peptide of p35
- PD, Parkinson's disease
- PDB, Protein databank
- PLP, Piecewise linear potential
- PME, Particle mesh ewald
- RMSD, Root mean square deviation
- ROF, Rule of five
- Structure-based pharmacophore modeling
- TAT, Twin-arginine targeting
- TIP3P, Transferable intermolecular potential with 3 points
- Tau-pathogenesis
- ZNPD, Zinc Natural Product Database
Collapse
Affiliation(s)
- Amir Zeb
- Division of Life Science, Division of Applied Life Sciences (BK21 Plus), Research Institute of Natural Sciences (RINS), Gyeongsang National University (GNU), 501 Jinju-daero, Jinju 52828, Gyeongnam, Republic of Korea
| | - Minky Son
- Division of Life Science, Division of Applied Life Sciences (BK21 Plus), Research Institute of Natural Sciences (RINS), Gyeongsang National University (GNU), 501 Jinju-daero, Jinju 52828, Gyeongnam, Republic of Korea
| | - Sanghwa Yoon
- Division of Life Science, Division of Applied Life Sciences (BK21 Plus), Research Institute of Natural Sciences (RINS), Gyeongsang National University (GNU), 501 Jinju-daero, Jinju 52828, Gyeongnam, Republic of Korea
| | - Ju Hyun Kim
- Department of Chemistry (BK21 Plus), Research Institute of Natural Science (RINS), Geyongsang National University (GNU), 501 Jinju-daero, Jinju 52828, Gyeongnam, Republic of Korea
| | - Seok Ju Park
- Department of Internal Medicine, College of Medicine, Busan Paik Hospital, Inje University, Busan 47392, Republic of Korea
| | - Keun Woo Lee
- Division of Life Science, Division of Applied Life Sciences (BK21 Plus), Research Institute of Natural Sciences (RINS), Gyeongsang National University (GNU), 501 Jinju-daero, Jinju 52828, Gyeongnam, Republic of Korea
| |
Collapse
|
9
|
Jang C, Yadav DK, Subedi L, Venkatesan R, Venkanna A, Afzal S, Lee E, Yoo J, Ji E, Kim SY, Kim MH. Identification of novel acetylcholinesterase inhibitors designed by pharmacophore-based virtual screening, molecular docking and bioassay. Sci Rep 2018; 8:14921. [PMID: 30297729 PMCID: PMC6175823 DOI: 10.1038/s41598-018-33354-6] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/27/2017] [Accepted: 09/06/2018] [Indexed: 11/30/2022] Open
Abstract
In this study, pharmacophore based 3D QSAR models for human acetylcholinesterase (AChE) inhibitors were generated, with good significance, statistical values (r2training = 0.73) and predictability (q2training = 0.67). It was further validated by three methods (Fischer's test, decoy set and Güner-Henry scoring method) to show that the models can be used to predict the biological activities of compounds without costly and time-consuming synthesis. The criteria for virtual screening were also validated by testing the selective AChE inhibitors. Virtual screening experiments and subsequent in vitro evaluation of promising hits revealed a novel and selective AChE inhibitor. Thus, the findings reported herein may provide a new strategy for the discovery of selective AChE inhibitors. The IC50 value of compounds 5c and 6a presented selective inhibition of AChE without inhibiting butyrylcholinesterase (BChE) at uM level. Molecular docking studies were performed to explain the potent AChE inhibition of the target compounds studies to explain high affinity.
Collapse
Affiliation(s)
- Cheongyun Jang
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Dharmendra K Yadav
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Lalita Subedi
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Ramu Venkatesan
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Arramshetti Venkanna
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Sualiha Afzal
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Eunhee Lee
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Jaewook Yoo
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Eunhee Ji
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Sun Yeou Kim
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea
| | - Mi-Hyun Kim
- Gachon Institute of Pharmaceutical Science and Department of Pharmacy, College of Pharmacy, Gachon University, Yeonsu-gu, Incheon, Republic of Korea.
| |
Collapse
|
10
|
Lee JH, Cho SJ, Kim MH. Discovery of CNS-Like D3R-Selective Antagonists Using 3D Pharmacophore Guided Virtual Screening. Molecules 2018; 23:E2452. [PMID: 30257450 PMCID: PMC6222863 DOI: 10.3390/molecules23102452] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2018] [Revised: 09/14/2018] [Accepted: 09/21/2018] [Indexed: 01/06/2023] Open
Abstract
The dopamine D3 receptor is an important CNS target for the treatment of a variety of neurological diseases. Selective dopamine D3 receptor antagonists modulate the improvement of psychostimulant addiction and relapse. In this study, five and six featured pharmacophore models of D3R antagonists were generated and evaluated with the post-hoc score combining two survival scores of active and inactive. Among the Top 10 models, APRRR215 and AHPRRR104 were chosen based on the coefficient of determination (APRRR215: R²training = 0.80; AHPRRR104: R²training = 0.82) and predictability (APRRR215: Q²test = 0.73, R²predictive = 0.82; AHPRRR104: Q²test = 0.86, R²predictive = 0.74) of their 3D-quantitative structure⁻activity relationship models. Pharmacophore-based virtual screening of a large compound library from eMolecules (>3 million compounds) using two optimal models expedited the search process by a 100-fold speed increase compared to the docking-based screening (HTVS scoring function in Glide) and identified a series of hit compounds having promising novel scaffolds. After the screening, docking scores, as an adjuvant predictor, were added to two fitness scores (from the pharmacophore models) and predicted Ki (from PLSs of the QSAR models) to improve accuracy. Final selection of the most promising hit compounds were also evaluated for CNS-like properties as well as expected D3R antagonism.
Collapse
Affiliation(s)
- June Hyeong Lee
- Gachon Institute of Pharmaceutical Science & Department of Pharmacy, College of Pharmacy, Gachon University, 191 Hambakmoeiro, Yeonsu-gu, Incheon 21936, Korea.
| | - Sung Jin Cho
- CimplSoft, Thousand Oaks, CA 91320, USA.
- CimplRx, Euni-ro, Seoul 06764, Korea.
| | - Mi-Hyun Kim
- Gachon Institute of Pharmaceutical Science & Department of Pharmacy, College of Pharmacy, Gachon University, 191 Hambakmoeiro, Yeonsu-gu, Incheon 21936, Korea.
| |
Collapse
|
11
|
Kim H, Jang C, Yadav DK, Kim MH. The comparison of automated clustering algorithms for resampling representative conformer ensembles with RMSD matrix. J Cheminform 2017; 9:21. [PMID: 29086188 PMCID: PMC5364127 DOI: 10.1186/s13321-017-0208-0] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2017] [Accepted: 03/15/2017] [Indexed: 12/01/2022] Open
Abstract
Background The accuracy of any 3D-QSAR, Pharmacophore and 3D-similarity based chemometric target fishing models are highly dependent on a reasonable sample of active conformations. Since a number of diverse conformational sampling algorithm exist, which exhaustively generate enough conformers, however model building methods relies on explicit number of common conformers. Results In this work, we have attempted to make clustering algorithms, which could find reasonable number of representative conformer ensembles automatically with asymmetric dissimilarity matrix generated from openeye tool kit. RMSD was the important descriptor (variable) of each column of the N × N matrix considered as N variables describing the relationship (network) between the conformer (in a row) and the other N conformers. This approach used to evaluate the performance of the well-known clustering algorithms by comparison in terms of generating representative conformer ensembles and test them over different matrix transformation functions considering the stability. In the network, the representative conformer group could be resampled for four kinds of algorithms with implicit parameters. The directed dissimilarity matrix becomes the only input to the clustering algorithms. Conclusions Dunn index, Davies–Bouldin index, Eta-squared values and omega-squared values were used to evaluate the clustering algorithms with respect to the compactness and the explanatory power. The evaluation includes the reduction (abstraction) rate of the data, correlation between the sizes of the population and the samples, the computational complexity and the memory usage as well. Every algorithm could find representative conformers automatically without any user intervention, and they reduced the data to 14–19% of the original values within 1.13 s per sample at the most. The clustering methods are simple and practical as they are fast and do not ask for any explicit parameters. RCDTC presented the maximum Dunn and omega-squared values of the four algorithms in addition to consistent reduction rate between the population size and the sample size. The performance of the clustering algorithms was consistent over different transformation functions. Moreover, the clustering method can also be applied to molecular dynamics sampling simulation results. Electronic supplementary material The online version of this article (doi:10.1186/s13321-017-0208-0) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Hyoungrae Kim
- Department of Data Management, KEIS, 56 Mullae-ro 20-gil, Yeongdeungpo-gu, Seoul, Republic of Korea.
| | - Cheongyun Jang
- Department of Pharmacy, College of Pharmacy, Yeonsu-gu, Incheon, Republic of Korea
| | - Dharmendra K Yadav
- Department of Pharmacy, College of Pharmacy, Yeonsu-gu, Incheon, Republic of Korea
| | - Mi-Hyun Kim
- Department of Pharmacy, College of Pharmacy, Yeonsu-gu, Incheon, Republic of Korea. .,Gachon Institute of Pharmaceutical Science, Gachon University, Yeonsu-gu, Incheon, Republic of Korea.
| |
Collapse
|