101
|
A methodology for evaluating multi-objective evolutionary feature selection for classification in the context of virtual screening. Soft comput 2018. [DOI: 10.1007/s00500-018-3479-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/28/2022]
|
102
|
Wang NN, Dong J, Zhang L, Ouyang D, Cheng Y, Chen AF, Lu AP, Cao DS. HAMdb: a database of human autophagy modulators with specific pathway and disease information. J Cheminform 2018; 10:34. [PMID: 30066211 PMCID: PMC6068059 DOI: 10.1186/s13321-018-0289-4] [Citation(s) in RCA: 54] [Impact Index Per Article: 7.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2018] [Accepted: 07/17/2018] [Indexed: 01/07/2023] Open
Abstract
Autophagy is an important homeostatic cellular recycling mechanism responsible for degrading unnecessary or dysfunctional cellular organelles and proteins in all living cells. In addition to its vital homeostatic role, this degradation pathway also involves in various human disorders, including metabolic conditions, neurodegenerative diseases, cancers and infectious diseases. Therefore, the comprehensive understanding of autophagy process, autophagy-related modulators and corresponding pathway and disease information will be of great help for identifying the new autophagy modulators, potential drug candidates, new diagnostic and therapeutic targets. In recent years, some autophagy databases providing structural and functional information were developed, but the specific databases covering autophagy modulator (proteins, chemicals and microRNAs)-related target, pathway and disease information do not exist. Hence, we developed an online resource, Human Autophagy Modulator Database (HAMdb, http://hamdb.scbdd.com), to provide researchers related pathway and disease information as many as possible. HAMdb contains 796 proteins, 841 chemicals and 132 microRNAs. Their specific effects on autophagy, physicochemical information, biological information and disease information were manually collected and compiled. Additionally, lots of external links were available for more information covering extensive biomedical knowledge. HAMdb provides a user-friendly interface to query, search, browse autophagy modulators and their comprehensive related information. HAMdb will help researchers understand the whole autophagy process and provide detailed information about related diseases. Furthermore, it can give hints for the identification of new diagnostic and therapeutic targets and the discovery of new autophagy modulators. In a word, we hope that HAMdb has the potential to promote the autophagy research in pharmacological and pathophysiological area.
Collapse
Affiliation(s)
- Ning-Ning Wang
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Jie Dong
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,Hunan Key Laboratory of Grain-Oil Deep Process and Quality Control, Hunan Key Laboratory of Processed Food for Special Medical Purpose, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, People's Republic of China
| | - Lin Zhang
- Hunan Key Laboratory of Grain-Oil Deep Process and Quality Control, Hunan Key Laboratory of Processed Food for Special Medical Purpose, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, People's Republic of China
| | - Defang Ouyang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, People's Republic of China
| | - Yan Cheng
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Alex F Chen
- Center for Vascular Disease and Translational Medicine, The Third Xiangya Hospital of Central South University, Changsha, People's Republic of China
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR, People's Republic of China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China. .,Center for Vascular Disease and Translational Medicine, The Third Xiangya Hospital of Central South University, Changsha, People's Republic of China. .,Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR, People's Republic of China.
| |
Collapse
|
103
|
Okamoto Y, Kubo Y. Ab Initio Calculations of the Redox Potentials of Additives for Lithium-Ion Batteries and Their Prediction through Machine Learning. ACS OMEGA 2018; 3:7868-7874. [PMID: 31458929 PMCID: PMC6644342 DOI: 10.1021/acsomega.8b00576] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/26/2018] [Accepted: 06/08/2018] [Indexed: 05/11/2023]
Abstract
Ab initio molecular orbital calculations were carried out to examine the redox potentials of 149 electrolyte additives for lithium-ion batteries. These potentials were employed to construct regression models using a machine learning approach. We chose simple descriptors based on the chemical structure of the additive molecules. The descriptors predicted the redox potentials well, in particular, the oxidation potentials. We found that the redox potentials can be explained by a small number of features, which improve the interpretability of the results and seem to be related to the amplitude of the eigenstate of the frontier orbitals.
Collapse
Affiliation(s)
- Yasuharu Okamoto
- Data
Platform Center, National Institute for
Materials Science, 1-2-1
Sengen, Tsukuba, Ibaraki 305-0047, Japan
- E-mail: (Y.O.)
| | - Yoshimi Kubo
- GREEN,
National Institute for Materials Science, 1-1 Namiki, Tsukuba, Ibaraki 305-0044, Japan
| |
Collapse
|
104
|
Artificial intelligence in drug design. SCIENCE CHINA-LIFE SCIENCES 2018; 61:1191-1204. [PMID: 30054833 DOI: 10.1007/s11427-018-9342-2] [Citation(s) in RCA: 105] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/24/2018] [Accepted: 05/22/2018] [Indexed: 12/27/2022]
Abstract
Thanks to the fast improvement of the computing power and the rapid development of the computational chemistry and biology, the computer-aided drug design techniques have been successfully applied in almost every stage of the drug discovery and development pipeline to speed up the process of research and reduce the cost and risk related to preclinical and clinical trials. Owing to the development of machine learning theory and the accumulation of pharmacological data, the artificial intelligence (AI) technology, as a powerful data mining tool, has cut a figure in various fields of the drug design, such as virtual screening, activity scoring, quantitative structure-activity relationship (QSAR) analysis, de novo drug design, and in silico evaluation of absorption, distribution, metabolism, excretion and toxicity (ADME/T) properties. Although it is still challenging to provide a physical explanation of the AI-based models, it indeed has been acting as a great power to help manipulating the drug discovery through the versatile frameworks. Recently, due to the strong generalization ability and powerful feature extraction capability, deep learning methods have been employed in predicting the molecular properties as well as generating the desired molecules, which will further promote the application of AI technologies in the field of drug design.
Collapse
|
105
|
Mayr A, Klambauer G, Unterthiner T, Steijaert M, Wegner JK, Ceulemans H, Clevert DA, Hochreiter S. Large-scale comparison of machine learning methods for drug target prediction on ChEMBL. Chem Sci 2018; 9:5441-5451. [PMID: 30155234 PMCID: PMC6011237 DOI: 10.1039/c8sc00148k] [Citation(s) in RCA: 277] [Impact Index Per Article: 39.6] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 05/16/2018] [Indexed: 12/24/2022] Open
Abstract
Deep learning is currently the most successful machine learning technique in a wide range of application areas and has recently been applied successfully in drug discovery research to predict potential drug targets and to screen for active molecules. However, due to (1) the lack of large-scale studies, (2) the compound series bias that is characteristic of drug discovery datasets and (3) the hyperparameter selection bias that comes with the high number of potential deep learning architectures, it remains unclear whether deep learning can indeed outperform existing computational methods in drug discovery tasks. We therefore assessed the performance of several deep learning methods on a large-scale drug discovery dataset and compared the results with those of other machine learning and target prediction methods. To avoid potential biases from hyperparameter selection or compound series, we used a nested cluster-cross-validation strategy. We found (1) that deep learning methods significantly outperform all competing methods and (2) that the predictive performance of deep learning is in many cases comparable to that of tests performed in wet labs (i.e., in vitro assays).
Collapse
Affiliation(s)
- Andreas Mayr
- LIT AI Lab and Institute of Bioinformatics , Johannes Kepler University Linz , Austria . ; ; Tel: +43-732-2468-4521
| | - Günter Klambauer
- LIT AI Lab and Institute of Bioinformatics , Johannes Kepler University Linz , Austria . ; ; Tel: +43-732-2468-4521
| | - Thomas Unterthiner
- LIT AI Lab and Institute of Bioinformatics , Johannes Kepler University Linz , Austria . ; ; Tel: +43-732-2468-4521
| | | | | | | | | | - Sepp Hochreiter
- LIT AI Lab and Institute of Bioinformatics , Johannes Kepler University Linz , Austria . ; ; Tel: +43-732-2468-4521
| |
Collapse
|
106
|
Dong J, Wang NN, Yao ZJ, Zhang L, Cheng Y, Ouyang D, Lu AP, Cao DS. ADMETlab: a platform for systematic ADMET evaluation based on a comprehensively collected ADMET database. J Cheminform 2018; 10:29. [PMID: 29943074 PMCID: PMC6020094 DOI: 10.1186/s13321-018-0283-x] [Citation(s) in RCA: 416] [Impact Index Per Article: 59.4] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2018] [Accepted: 06/16/2018] [Indexed: 01/24/2023] Open
Abstract
Current pharmaceutical research and development (R&D) is a high-risk investment which is usually faced with some unexpected even disastrous failures in different stages of drug discovery. One main reason for R&D failures is the efficacy and safety deficiencies which are related largely to absorption, distribution, metabolism and excretion (ADME) properties and various toxicities (T). Therefore, rapid ADMET evaluation is urgently needed to minimize failures in the drug discovery process. Here, we developed a web-based platform called ADMETlab for systematic ADMET evaluation of chemicals based on a comprehensively collected ADMET database consisting of 288,967 entries. Four function modules in the platform enable users to conveniently perform six types of drug-likeness analysis (five rules and one prediction model), 31 ADMET endpoints prediction (basic property: 3, absorption: 6, distribution: 3, metabolism: 10, elimination: 2, toxicity: 7), systematic evaluation and database/similarity searching. We believe that this web platform will hopefully facilitate the drug discovery process by enabling early drug-likeness evaluation, rapid ADMET virtual screening or filtering and prioritization of chemical structures. The ADMETlab web platform is designed based on the Django framework in Python, and is freely accessible at http://admet.scbdd.com/ .
Collapse
Affiliation(s)
- Jie Dong
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
- Hunan Key Laboratory of Grain-oil Deep Process and Quality Control, College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, People's Republic of China
- Hunan Key Laboratory of Processed Food for Special Medical Purpose, Central South University of Forestry and Technology, Changsha, People's Republic of China
| | - Ning-Ning Wang
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Zhi-Jiang Yao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Lin Zhang
- Hunan Key Laboratory of Processed Food for Special Medical Purpose, Central South University of Forestry and Technology, Changsha, People's Republic of China
| | - Yan Cheng
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Defang Ouyang
- State Key Laboratory of Quality Research in Chinese Medicine, Institute of Chinese Medical Sciences (ICMS), University of Macau, Macau, People's Republic of China
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, People's Republic of China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.
- Institute for Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, People's Republic of China.
| |
Collapse
|
107
|
Preuer K, Lewis RPI, Hochreiter S, Bender A, Bulusu KC, Klambauer G. DeepSynergy: predicting anti-cancer drug synergy with Deep Learning. Bioinformatics 2018; 34:1538-1546. [PMID: 29253077 PMCID: PMC5925774 DOI: 10.1093/bioinformatics/btx806] [Citation(s) in RCA: 295] [Impact Index Per Article: 42.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2017] [Revised: 12/06/2017] [Accepted: 12/14/2017] [Indexed: 12/29/2022] Open
Abstract
Motivation While drug combination therapies are a well-established concept in cancer treatment, identifying novel synergistic combinations is challenging due to the size of combinatorial space. However, computational approaches have emerged as a time- and cost-efficient way to prioritize combinations to test, based on recently available large-scale combination screening data. Recently, Deep Learning has had an impact in many research areas by achieving new state-of-the-art model performance. However, Deep Learning has not yet been applied to drug synergy prediction, which is the approach we present here, termed DeepSynergy. DeepSynergy uses chemical and genomic information as input information, a normalization strategy to account for input data heterogeneity, and conical layers to model drug synergies. Results DeepSynergy was compared to other machine learning methods such as Gradient Boosting Machines, Random Forests, Support Vector Machines and Elastic Nets on the largest publicly available synergy dataset with respect to mean squared error. DeepSynergy significantly outperformed the other methods with an improvement of 7.2% over the second best method at the prediction of novel drug combinations within the space of explored drugs and cell lines. At this task, the mean Pearson correlation coefficient between the measured and the predicted values of DeepSynergy was 0.73. Applying DeepSynergy for classification of these novel drug combinations resulted in a high predictive performance of an AUC of 0.90. Furthermore, we found that all compared methods exhibit low predictive performance when extrapolating to unexplored drugs or cell lines, which we suggest is due to limitations in the size and diversity of the dataset. We envision that DeepSynergy could be a valuable tool for selecting novel synergistic drug combinations. Availability and implementation DeepSynergy is available via www.bioinf.jku.at/software/DeepSynergy. Contact klambauer@bioinf.jku.at. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Kristina Preuer
- Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
| | - Richard P I Lewis
- Department of Chemistry, Centre for Molecular Science Informatics, University of Cambridge, Cambridge, UK
| | - Sepp Hochreiter
- Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Science Informatics, University of Cambridge, Cambridge, UK
| | - Krishna C Bulusu
- Department of Chemistry, Centre for Molecular Science Informatics, University of Cambridge, Cambridge, UK
- Oncology Innovative Medicines and Early Development, AstraZeneca, Hodgkin Building, Chesterford Research Campus, Saffron Walden, Cambs, UK
| | - Günter Klambauer
- Institute of Bioinformatics, Johannes Kepler University, Linz, Austria
| |
Collapse
|
108
|
Mangiatordi GF, Trisciuzzi D, Iacobazzi R, Denora N, Pisani L, Catto M, Leonetti F, Alberga D, Nicolotti O. Automated identification of structurally heterogeneous and patentable antiproliferative hits as potential tubulin inhibitors. Chem Biol Drug Des 2018; 92:1161-1170. [PMID: 29633572 DOI: 10.1111/cbdd.13200] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/31/2017] [Revised: 02/05/2018] [Accepted: 03/03/2018] [Indexed: 12/27/2022]
Abstract
By employing a recently developed hierarchical computational platform, we identified 37 novel and structurally diverse tubulin targeting compounds. In particular, hierarchical molecular filters, based on molecular shape similarity, structure-based pharmacophore, and molecular docking, were applied on a large chemical collection of commercial compounds to identify unexplored and patentable microtubule-destabilizing candidates. The herein proposed 37 novel hits, showing new molecular scaffolds (such as 1,3,3a,4-tetraaza-1,2,3,4,5,6,7,7a-octahydroindene or dihydropyrrolidin-2-one fused to a chromen-4-one), are provided with antiproliferative activity in the μm range toward MCF-7 (human breast cancer lines). Importantly, there is a likely causative relationship between cytotoxicity and the inhibition of tubulin polymerization at the colchicine binding site, assessed through fluorescence polymerization assays.
Collapse
Affiliation(s)
| | - Daniela Trisciuzzi
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | | | - Nunzio Denora
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | - Leonardo Pisani
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | - Marco Catto
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | - Francesco Leonetti
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | - Domenico Alberga
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| | - Orazio Nicolotti
- Dipartimento di Farmacia-Scienze del Farmaco, Università di Bari 'Aldo Moro', Bari, Italy
| |
Collapse
|
109
|
TopP-S: Persistent homology-based multi-task deep neural networks for simultaneous predictions of partition coefficient and aqueous solubility. J Comput Chem 2018; 39:1444-1454. [DOI: 10.1002/jcc.25213] [Citation(s) in RCA: 39] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2017] [Revised: 01/15/2018] [Accepted: 02/25/2018] [Indexed: 01/09/2023]
|
110
|
Dong J, Yao ZJ, Zhang L, Luo F, Lin Q, Lu AP, Chen AF, Cao DS. PyBioMed: a python library for various molecular representations of chemicals, proteins and DNAs and their interactions. J Cheminform 2018; 10:16. [PMID: 29556758 PMCID: PMC5861255 DOI: 10.1186/s13321-018-0270-2] [Citation(s) in RCA: 82] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2017] [Accepted: 03/12/2018] [Indexed: 11/15/2022] Open
Abstract
Background
With the increasing development of biotechnology and informatics technology, publicly available data in chemistry and biology are undergoing explosive growth. Such wealthy information in these data needs to be extracted and transformed to useful knowledge by various data mining methods. Considering the amazing rate at which data are accumulated in chemistry and biology fields, new tools that process and interpret large and complex interaction data are increasingly important. So far, there are no suitable toolkits that can effectively link the chemical and biological space in view of molecular representation. To further explore these complex data, an integrated toolkit for various molecular representation is urgently needed which could be easily integrated with data mining algorithms to start a full data analysis pipeline. Results Herein, the python library PyBioMed is presented, which comprises functionalities for online download for various molecular objects by providing different IDs, the pretreatment of molecular structures, the computation of various molecular descriptors for chemicals, proteins, DNAs and their interactions. PyBioMed is a feature-rich and highly customized python library used for the characterization of various complex chemical and biological molecules and interaction samples. The current version of PyBioMed could calculate 775 chemical descriptors and 19 kinds of chemical fingerprints, 9920 protein descriptors based on protein sequences, more than 6000 DNA descriptors from nucleotide sequences, and interaction descriptors from pairwise samples using three different combining strategies. Several examples and five real-life applications were provided to clearly guide the users how to use PyBioMed as an integral part of data analysis projects. By using PyBioMed, users are able to start a full pipelining from getting molecular data, pretreating molecules, molecular representation to constructing machine learning models conveniently. Conclusion PyBioMed provides various user-friendly and highly customized APIs to calculate various features of biological molecules and complex interaction samples conveniently, which aims at building integrated analysis pipelines from data acquisition, data checking, and descriptor calculation to modeling. PyBioMed is freely available at http://projects.scbdd.com/pybiomed.html.![]()
Collapse
Affiliation(s)
- Jie Dong
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
| | - Zhi-Jiang Yao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Lin Zhang
- College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
| | - Feijun Luo
- College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
| | - Qinlu Lin
- College of Food Science and Engineering, National Engineering Laboratory for Deep Processing of Rice and Byproducts, Central South University of Forestry and Technology, Changsha, China
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China
| | - Alex F Chen
- Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China. .,Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong SAR, China. .,Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South University, Changsha, People's Republic of China.
| |
Collapse
|
111
|
Fan D, Yang H, Li F, Sun L, Di P, Li W, Tang Y, Liu G. In silico prediction of chemical genotoxicity using machine learning methods and structural alerts. Toxicol Res (Camb) 2018; 7:211-220. [PMID: 30090576 PMCID: PMC6062245 DOI: 10.1039/c7tx00259a] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2017] [Accepted: 12/14/2017] [Indexed: 01/19/2023] Open
Abstract
Genotoxicity tests can detect compounds that have an adverse effect on the process of heredity. The in vivo micronucleus assay, a genotoxicity test method, has been widely used to evaluate the presence and extent of chromosomal damage in human beings. Due to the high cost and laboriousness of experimental tests, computational approaches for predicting genotoxicity based on chemical structures and properties are recognized as an alternative. In this study, a dataset containing 641 diverse chemicals was collected and the molecules were represented by both fingerprints and molecular descriptors. Then classification models were constructed by six machine learning methods, including the support vector machine (SVM), naïve Bayes (NB), k-nearest neighbor (kNN), C4.5 decision tree (DT), random forest (RF) and artificial neural network (ANN). The performance of the models was estimated by five-fold cross-validation and an external validation set. The top ten models showed excellent performance for the external validation with accuracies ranging from 0.846 to 0.938, among which models Pubchem_SVM and MACCS_RF showed a more reliable predictive ability. The applicability domain was also defined to distinguish favorable predictions from unfavorable ones. Finally, ten structural fragments which can be used to assess the genotoxicity potential of a chemical were identified by using information gain and structural fragment frequency analysis. Our models might be helpful for the initial screening of potential genotoxic compounds.
Collapse
Affiliation(s)
- Defang Fan
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Hongbin Yang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Fuxing Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Lixia Sun
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Peiwen Di
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Weihua Li
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Yun Tang
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| | - Guixia Liu
- Shanghai Key Laboratory of New Drug Design , School of Pharmacy , East China University of Science and Technology , Shanghai 200237 , China . ; ; ; Tel: +86-21-64250811
| |
Collapse
|
112
|
Moriwaki H, Tian YS, Kawashita N, Takagi T. Mordred: a molecular descriptor calculator. J Cheminform 2018; 10:4. [PMID: 29411163 PMCID: PMC5801138 DOI: 10.1186/s13321-018-0258-y] [Citation(s) in RCA: 581] [Impact Index Per Article: 83.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 01/23/2018] [Indexed: 01/05/2023] Open
Abstract
Molecular descriptors are widely employed to present molecular characteristics in cheminformatics. Various molecular-descriptor-calculation software programs have been developed. However, users of those programs must contend with several issues, including software bugs, insufficient update frequencies, and software licensing constraints. To address these issues, we propose Mordred, a developed descriptor-calculation software application that can calculate more than 1800 two- and three-dimensional descriptors. It is freely available via GitHub. Mordred can be easily installed and used in the command line interface, as a web application, or as a high-flexibility Python package on all major platforms (Windows, Linux, and macOS). Performance benchmark results show that Mordred is at least twice as fast as the well-known PaDEL-Descriptor and it can calculate descriptors for large molecules, which cannot be accomplished by other software. Owing to its good performance, convenience, number of descriptors, and a lax licensing constraint, Mordred is a promising choice of molecular descriptor calculation software that can be utilized for cheminformatics studies, such as those on quantitative structure-property relationships.
Collapse
Affiliation(s)
- Hirotomo Moriwaki
- Graduate School of Pharmaceutical Sciences, Osaka University, 1-6 Yamadaoka, Suita City, Osaka 565-0871 Japan
| | - Yu-Shi Tian
- Graduate School of Pharmaceutical Sciences, Osaka University, 1-6 Yamadaoka, Suita City, Osaka 565-0871 Japan
| | - Norihito Kawashita
- Faculty of Sciences and Engineering, Kindai University, 3-4-1 Kowakae, Higashiosaka City, Osaka 577-8502 Japan
| | - Tatsuya Takagi
- Graduate School of Pharmaceutical Sciences, Osaka University, 1-6 Yamadaoka, Suita City, Osaka 565-0871 Japan
| |
Collapse
|
113
|
Nolte TM, Pinto-Gil K, Hendriks AJ, Ragas AMJ, Pastor M. Quantitative structure-activity relationships for primary aerobic biodegradation of organic chemicals in pristine surface waters: starting points for predicting biodegradation under acclimatization. ENVIRONMENTAL SCIENCE. PROCESSES & IMPACTS 2018; 20:157-170. [PMID: 29192704 DOI: 10.1039/c7em00375g] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Microbial biomass and acclimation can affect the removal of organic chemicals in natural surface waters. In order to account for these effects and develop more robust models for biodegradation, we have compiled and curated removal data for un-acclimated (pristine) surface waters on which we developed quantitative structure-activity relationships (QSARs). Global analysis of the very heterogeneous dataset including neutral, anionic, cationic and zwitterionic chemicals (N = 233) using a random forest algorithm showed that useful predictions were possible (Qext2 = 0.4-0.5) though relatively large standard errors were associated (SDEP ∼0.7). Classification of the chemicals based on speciation state and metabolic pathway showed that biodegradation is influenced by the two, and that the dependence of biodegradation on chemical characteristics is non-linear. Class-specific QSAR analysis indicated that shape and charge distribution determine the biodegradation of neutral chemicals (R2 ∼ 0.6), e.g. through membrane permeation or binding to P450 enzymes, whereas the average biodegradation of charged chemicals is 1 to 2 orders of magnitude lower, for which degradation depends more directly on cellular uptake (R2 ∼ 0.6). Further analysis showed that specific chemical classes such as peptides and organic halogens are relatively less biodegradable in pristine surface waters, resulting in the need for the microbial consortia to acclimate. Additional literature data was used to verify an acclimation model (based on Monod-type kinetics) capable of extrapolating QSAR predictions to acclimating conditions such as in water treatment, downstream lakes and large rivers under μg L-1 to mg L-1 concentrations. The framework developed, despite being based on multiple assumptions, is promising and needs further validation using experimentation with more standardised and homogenised conditions as well as adequate characterization of the inoculum used.
Collapse
Affiliation(s)
- Tom M Nolte
- Department of Environmental Science, Institute for Water and Wetland Research, Radboud University Nijmegen, P. O. Box 9010, 6500 GL Nijmegen, The Netherlands.
| | | | | | | | | |
Collapse
|
114
|
Phanus-umporn C, Shoombuatong W, Prachayasittikul V, Anuwongcharoen N, Nantasenamat C. Privileged substructures for anti-sickling activity via cheminformatic analysis. RSC Adv 2018; 8:5920-5935. [PMID: 35539618 PMCID: PMC9078244 DOI: 10.1039/c7ra12079f] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2017] [Revised: 02/21/2018] [Accepted: 01/12/2018] [Indexed: 11/21/2022] Open
Abstract
Sickle cell disease (SCD), an autosomal recessive genetic disorder, has been recognized by the World Health Organization (WHO) as a major public health problem as it affects 300 000 individuals worldwide. Complications arising from SCD include anemia, microvascular occlusion, severe pain, stokes, renal dysfunction and infections. A lucrative therapeutic strategy is to employ anti-sickling agents that can disrupt the formation of the HbS polymer. This study therefore employed cheminformatic approaches, encompassing classification structure–activity relationship (CSAR) modeling, to deduce the privileged substructures giving rise to the anti-sickling activity of an investigated set of 115 compounds, followed by substructure analysis. Briefly, the compiled compounds were described by fingerprint descriptors and used in the construction of CSAR models via several machine learning algorithms. The modelability of the data set, as exemplified by the MODI index, was determined to be in the range of 0.70–0.84. The predictive performance was deduced by the accuracy, sensitivity, specificity and Matthews correlation coefficient, which was found to be statistically robust, whereby the former three parameters afforded values in excess of 0.7 while the latter statistical parameter provided a value greater than 0.5. An analysis of the top 20 important substructure descriptors for anti-sickling activity revealed that 10 important features were significant in the differentiation of actives from inactives, as illustrated by aromaticity/conjugation (e.g. SubFPC287, SubFPC171 and SubFPC5), carbonyl groups (e.g. SubFPC137, SubFPC139, SubFPC49 and SubFPC135) and miscellaneous groups (e.g. SubFPC303, SubFPC302 and SubFPC275). Furthermore, an analysis of the structure–activity relationship revealed that the length of alkyl chains, choice of functional moiety and position of substitution on the benzene ring may affect the anti-sickling activity of these compounds. Thus, this knowledge is anticipated to be useful for guiding the design of robust compounds against the gelling activity of HbS, as preliminarily demonstrated in the data-driven compound design presented herein. Cheminformatic approaches (classification structure–activity relationship models based on 12 fingerprint classes) were employed for deducing privileged substructures giving rise to the anti-sickling activity of an investigated set of 115 compounds.![]()
Collapse
Affiliation(s)
- Chuleeporn Phanus-umporn
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| | - Watshara Shoombuatong
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| | - Veda Prachayasittikul
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| | - Nuttapat Anuwongcharoen
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| | - Chanin Nantasenamat
- Center of Data Mining and Biomedical Informatics
- Faculty of Medical Technology
- Mahidol University
- Bangkok 10700
- Thailand
| |
Collapse
|
115
|
Deng YH, Wang NN, Zou ZX, Zhang L, Xu KP, Chen AF, Cao DS, Tan GS. Multi-Target Screening and Experimental Validation of Natural Products from Selaginella Plants against Alzheimer's Disease. Front Pharmacol 2017; 8:539. [PMID: 28890698 PMCID: PMC5574911 DOI: 10.3389/fphar.2017.00539] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2017] [Accepted: 08/03/2017] [Indexed: 11/30/2022] Open
Abstract
Alzheimer's disease (AD) is a progressive and irreversible neurodegenerative disorder which is considered to be the most common cause of dementia. It has a greater impact not only on the learning and memory disturbances but also on social and economy. Currently, there are mainly single-target drugs for AD treatment but the complexity and multiple etiologies of AD make them difficult to obtain desirable therapeutic effects. Therefore, the choice of multi-target drugs will be a potential effective strategy inAD treatment. To find multi-target active ingredients for AD treatment from Selaginella plants, we firstly explored the behaviors effects on AD mice of total extracts (TE) from Selaginella doederleinii on by Morris water maze test and found that TE has a remarkable improvement on learning and memory function for AD mice. And then, multi-target SAR models associated with AD-related proteins were built based on Random Forest (RF) and different descriptors to preliminarily screen potential active ingredients from Selaginella. Considering the prediction outputs and the quantity of existing compounds in our laboratory, 13 compounds were chosen to carry out the in vitro enzyme inhibitory experiments and 4 compounds with BACE1/MAO-B dual inhibitory activity were determined. Finally, the molecular docking was applied to verify the prediction results and enzyme inhibitory experiments. Based on these study and validation processes, we explored a new strategy to improve the efficiency of active ingredients screening based on trace amount of natural product and numbers of targets and found some multi-target compounds with biological activity for the development of novel drugs for AD treatment.
Collapse
Affiliation(s)
- Yin-Hua Deng
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China
| | - Ning-Ning Wang
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China
| | - Zhen-Xing Zou
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China.,Pharmacy Department, Xiangya Hospital, Central South UniversityChangsha, China
| | - Lin Zhang
- College of Food Science and Technology, Central South University of Forestry and TechnologyChangsha, China
| | - Kang-Ping Xu
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China
| | - Alex F Chen
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China.,Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South UniversityChangsha, China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China.,Center for Vascular Disease and Translational Medicine, Third Xiangya Hospital, Central South UniversityChangsha, China
| | - Gui-Shan Tan
- Xiangya School of Pharmaceutical Sciences, Central South UniversityChangsha, China.,Pharmacy Department, Xiangya Hospital, Central South UniversityChangsha, China
| |
Collapse
|
116
|
Cysewski P, Przybyłek M. Selection of effective cocrystals former for dissolution rate improvement of active pharmaceutical ingredients based on lipoaffinity index. Eur J Pharm Sci 2017; 107:87-96. [PMID: 28687528 DOI: 10.1016/j.ejps.2017.07.004] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/11/2017] [Revised: 06/06/2017] [Accepted: 07/03/2017] [Indexed: 10/19/2022]
Abstract
New theoretical screening procedure was proposed for appropriate selection of potential cocrystal formers possessing the ability of enhancing dissolution rates of drugs. The procedure relies on the training set comprising 102 positive and 17 negative cases of cocrystals found in the literature. Despite the fact that the only available data were of qualitative character, performed statistical analysis using binary classification allowed to formulate quantitative criterions. Among considered 3679 molecular descriptors the relative value of lipoaffinity index, expressed as the difference between values calculated for active compound and excipient, has been found as the most appropriate measure suited for discrimination of positive and negative cases. Assuming 5% precision, the applied classification criterion led to inclusion of 70% positive cases in the final prediction. Since lipoaffinity index is a molecular descriptor computed using only 2D information about a chemical structure, its estimation is straightforward and computationally inexpensive. The inclusion of an additional criterion quantifying the cocrystallization probability leads to the following conjunction criterions Hmix<-0.18 and ΔLA>3.61, allowing for identification of dissolution rate enhancers. The screening procedure was applied for finding the most promising coformers of such drugs as Iloperidone, Ritonavir, Carbamazepine and Enthenzamide.
Collapse
Affiliation(s)
- Piotr Cysewski
- Chair and Department of Physical Chemistry, Pharmacy Faculty, Collegium Medicum of Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-950 Bydgoszcz, Poland.
| | - Maciej Przybyłek
- Chair and Department of Physical Chemistry, Pharmacy Faculty, Collegium Medicum of Bydgoszcz, Nicolaus Copernicus University in Toruń, Kurpińskiego 5, 85-950 Bydgoszcz, Poland
| |
Collapse
|
117
|
Valdés-Martiní JR, Marrero-Ponce Y, García-Jacas CR, Martinez-Mayorga K, Barigye SJ, Vaz d'Almeida YS, Pham-The H, Pérez-Giménez F, Morell CA. QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations. J Cheminform 2017; 9:35. [PMID: 29086120 PMCID: PMC5462671 DOI: 10.1186/s13321-017-0211-5] [Citation(s) in RCA: 50] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2016] [Accepted: 04/07/2017] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND In previous reports, Marrero-Ponce et al. proposed algebraic formalisms for characterizing topological (2D) and chiral (2.5D) molecular features through atom- and bond-based ToMoCoMD-CARDD (acronym for Topological Molecular Computational Design-Computer Aided Rational Drug Design) molecular descriptors. These MDs codify molecular information based on the bilinear, quadratic and linear algebraic forms and the graph-theoretical electronic-density and edge-adjacency matrices in order to consider atom- and bond-based relations, respectively. These MDs have been successfully applied in the screening of chemical compounds of different therapeutic applications ranging from antimalarials, antibacterials, tyrosinase inhibitors and so on. To compute these MDs, a computational program with the same name was initially developed. However, this in house software barely offered the functionalities required in contemporary molecular modeling tasks, in addition to the inherent limitations that made its usability impractical. Therefore, the present manuscript introduces the QuBiLS-MAS (acronym for Quadratic, Bilinear and N-Linear mapS based on graph-theoretic electronic-density Matrices and Atomic weightingS) software designed to compute topological (0-2.5D) molecular descriptors based on bilinear, quadratic and linear algebraic forms for atom- and bond-based relations. RESULTS The QuBiLS-MAS module was designed as standalone software, in which extensions and generalizations of the former ToMoCoMD-CARDD 2D-algebraic indices are implemented, considering the following aspects: (a) two new matrix normalization approaches based on double-stochastic and mutual probability formalisms; (b) topological constraints (cut-offs) to take into account particular inter-atomic relations; (c) six additional atomic properties to be used as weighting schemes in the calculation of the molecular vectors; (d) four new local-fragments to consider molecular regions of interest; (e) number of lone-pair electrons in chemical structure defined by diagonal coefficients in matrix representations; and (f) several aggregation operators (invariants) applied over atom/bond-level descriptors in order to compute global indices. This software permits the parallel computation of the indices, contains a batch processing module and data curation functionalities. This program was developed in Java v1.7 using the Chemistry Development Kit library (version 1.4.19). The QuBiLS-MAS software consists of two components: a desktop interface (GUI) and an API library allowing for the easy integration of the latter in chemoinformatics applications. The relevance of the novel extensions and generalizations implemented in this software is demonstrated through three studies. Firstly, a comparative Shannon's entropy based variability study for the proposed QuBiLS-MAS and the DRAGON indices demonstrates superior performance for the former. A principal component analysis reveals that the QuBiLS-MAS approach captures chemical information orthogonal to that codified by the DRAGON descriptors. Lastly, a QSAR study for the binding affinity to the corticosteroid-binding globulin using Cramer's steroid dataset is carried out. CONCLUSIONS From these analyses, it is revealed that the QuBiLS-MAS approach for atom-pair relations yields similar-to-superior performance with regard to other QSAR methodologies reported in the literature. Therefore, the QuBiLS-MAS approach constitutes a useful tool for the diversity analysis of chemical compound datasets and high-throughput screening of structure-activity data.
Collapse
Affiliation(s)
- José R Valdés-Martiní
- StreelBridge Laboratories, SteelBridge Consulting Technology Solutions, Miami, FL, USA
| | - Yovani Marrero-Ponce
- Universidad San Francisco de Quito (USFQ), Grupo de Medicina Molecular y Traslacional (MeM&T), Colegio de Ciencias de la Salud (COCSA), Escuela de Medicina, Edificio de Especialidades Médicas, Quito, Ecuador. .,Universidad San Francisco de Quito (USFQ), Instituto de Simulación Computacional (ISC-USFQ), Diego de Robles y vía Interoceánica, 170157, Quito, Pichincha, Ecuador. .,Computer-Aided Molecular "Biosilico" Discovery and Bioinformatics Research International Network (CAMD-BIR IN), Cumbayá, Quito, Ecuador. .,Grupo de Investigación Ambiental (GIA), Fundación Universitaria Tecnológico de Comfenalco, Facultad de Ingenierías, Programa de Ingeniería de Procesos, Cartagena de Indias, Bolívar, Colombia. .,Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia, Spain.
| | - César R García-Jacas
- Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México.,Escuela de Sistemas y Computación, Pontificia Universidad Católica del Ecuador Sede Esmeraldas (PUCESE), Esmeraldas, Ecuador.,Grupo de Investigación de Bioinformática, Universidad de las Ciencias Informáticas (UCI), Havana, Cuba
| | - Karina Martinez-Mayorga
- Instituto de Química, Universidad Nacional Autónoma de México (UNAM), Ciudad de México, México
| | - Stephen J Barigye
- Facultad de Medicina, Universidad de Las Américas, Quito, Pichincha, Ecuador
| | | | - Hai Pham-The
- Department of Pharmaceutical Chemistry, Hanoi University of Pharmacy, 13-15 Le Thanh Tong, Hoan Kiem, Hanoi, Vietnam
| | - Facundo Pérez-Giménez
- Unidad de Investigación de Diseño de Fármacos y Conectividad Molecular, Departamento de Química Física, Facultad de Farmacia, Universitat de València, Valencia, Spain
| | - Carlos A Morell
- Laboratorio de Inteligencia Artificial, Centro de Estudios de Informática (CEI), Facultad de Matemática, Física y Computación, Universidad Central "Marta Abreu" de Las Villas, Santa Clara, Villa Clara, Cuba
| |
Collapse
|
118
|
Dong J, Yao ZJ, Zhu MF, Wang NN, Lu B, Chen AF, Lu AP, Miao H, Zeng WB, Cao DS. ChemSAR: an online pipelining platform for molecular SAR modeling. J Cheminform 2017; 9:27. [PMID: 29086046 PMCID: PMC5418185 DOI: 10.1186/s13321-017-0215-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2016] [Accepted: 04/24/2017] [Indexed: 12/31/2022] Open
Abstract
Background In recent years, predictive models based on machine learning techniques have proven to be feasible and effective in drug discovery. However, to develop such a model, researchers usually have to combine multiple tools and undergo several different steps (e.g., RDKit or ChemoPy package for molecular descriptor calculation, ChemAxon Standardizer for structure preprocessing, scikit-learn package for model building, and ggplot2 package for statistical analysis and visualization, etc.). In addition, it may require strong programming skills to accomplish these jobs, which poses severe challenges for users without advanced training in computer programming. Therefore, an online pipelining platform that integrates a number of selected tools is a valuable and efficient solution that can meet the needs of related researchers. Results This work presents a web-based pipelining platform, called ChemSAR, for generating SAR classification models of small molecules. The capabilities of ChemSAR include the validation and standardization of chemical structure representation, the computation of 783 1D/2D molecular descriptors and ten types of widely-used fingerprints for small molecules, the filtering methods for feature selection, the generation of predictive models via a step-by-step job submission process, model interpretation in terms of feature importance and tree visualization, as well as a helpful report generation system. The results can be visualized as high-quality plots and downloaded as local files. Conclusion ChemSAR provides an integrated web-based platform for generating SAR classification models that will benefit cheminformatics and other biomedical users. It is freely available at: http://chemsar.scbdd.com.. ![]() Electronic supplementary material The online version of this article (doi:10.1186/s13321-017-0215-1) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Jie Dong
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Zhi-Jiang Yao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,The Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
| | - Min-Feng Zhu
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,The Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
| | - Ning-Ning Wang
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Ben Lu
- The Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
| | - Alex F Chen
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China.,The Third Xiangya Hospital, Central South University, Changsha, People's Republic of China
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong SAR, People's Republic of China
| | - Hongyu Miao
- Department of Biostatistics, School of Public Health, University of Texas Health Science Center, Houston, TX, 77030, USA
| | - Wen-Bin Zeng
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China
| | - Dong-Sheng Cao
- Xiangya School of Pharmaceutical Sciences, Central South University, No. 172, Tongzipo Road, Yuelu District, Changsha, People's Republic of China. .,Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong SAR, People's Republic of China.
| |
Collapse
|
119
|
Ying S, Du X, Fu W, Yun D, Chen L, Cai Y, Xu Q, Wu J, Li W, Liang G. Synthesis, biological evaluation, QSAR and molecular dynamics simulation studies of potential fibroblast growth factor receptor 1 inhibitors for the treatment of gastric cancer. Eur J Med Chem 2017; 127:885-899. [DOI: 10.1016/j.ejmech.2016.10.066] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2016] [Revised: 10/12/2016] [Accepted: 10/31/2016] [Indexed: 01/06/2023]
|
120
|
Shoombuatong W, Prathipati P, Owasirikul W, Worachartcheewan A, Simeon S, Anuwongcharoen N, Wikberg JES, Nantasenamat C. Towards the Revival of Interpretable QSAR Models. CHALLENGES AND ADVANCES IN COMPUTATIONAL CHEMISTRY AND PHYSICS 2017. [DOI: 10.1007/978-3-319-56850-8_1] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
|
121
|
Pan J, Xu T, Xu F, Zhang Y, Liu Z, Chen W, Fu W, Dai Y, Zhao Y, Feng J, Liang G. Development of resveratrol-curcumin hybrids as potential therapeutic agents for inflammatory lung diseases. Eur J Med Chem 2016; 125:478-491. [PMID: 27689730 DOI: 10.1016/j.ejmech.2016.09.033] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2016] [Revised: 09/08/2016] [Accepted: 09/09/2016] [Indexed: 01/20/2023]
Abstract
Acute lung injury (ALI) is a major cause of acute respiratory failure in critically-ill patients. Resveratrol and curcumin are proven to have potent anti-inflammatory efficacy, but their clinical application is limited by their metabolic instability. Here, a series of resveratrol and the Mono-carbonyl analogs of curcumin (MCAs) hybrids were designed and synthesized by efficient aldol construction strategy, and then screened for anti-inflammatory activities in vitro and in vivo. The results showed that the majority of analogs effectively inhibited the LPS-induced production of IL-6 and TNF-α. Five analogs, a9, a18, a19, a20 and a24 exhibited excellent anti-inflammatory activity in a dose-dependent manner along with low toxicity in vitro. Structure activity relationship study revealed that the electron-withdrawing groups at meta-position and methoxyl group (OCH3) at the para position of the phenyl ring were important for anti-inflammatory activities. The most promising compound a18 decreased LPS induced TNF-α, IL-6, IL-12, and IL-33 mRNA expression. Additionally, a18 significantly protected against LPS-induced acute lung injury in the in vivo mouse model. The research of resveratrol and MCAs hybrids could bring insight into the treatment of inflammatory diseases and compound a18 may serve as a lead compound for the development of anti-ALI agents.
Collapse
Affiliation(s)
- Jialing Pan
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| | - Tingting Xu
- Department of Respiration, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
| | - Fengli Xu
- Department of Respiration, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
| | - Yali Zhang
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| | - Zhiguo Liu
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| | - Wenbo Chen
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| | - Weitao Fu
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| | - Yuanrong Dai
- Department of Respiration, The Second Affiliated Hospital of Wenzhou Medical University, Wenzhou, Zhejiang 325027, China
| | - Yunjie Zhao
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China.
| | - Jianpeng Feng
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China; Wenzhou University, 1210 University Town, Wenzhou, Zhejiang 325035, China.
| | - Guang Liang
- Chemical Biology Research Center at School of Pharmaceutical Sciences, Wenzhou Medical University, 1210 University Town, Wenzhou, Zhejiang 325035, China
| |
Collapse
|
122
|
Basant N, Gupta S, Singh KP. QSAR modeling for predicting reproductive toxicity of chemicals in rats for regulatory purposes. Toxicol Res (Camb) 2016; 5:1029-1038. [PMID: 30090410 PMCID: PMC6062388 DOI: 10.1039/c6tx00083e] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/02/2016] [Accepted: 04/07/2016] [Indexed: 01/08/2023] Open
Abstract
The experimental determination of multi-generation reproductive toxicity of chemicals involves high costs and a large number of animal studies over a long period of time. Computational toxicology offers possibilities to overcome such difficulties. In this study, we have established ensemble machine learning (EML) based quantitative structure-activity relationship models for predicting the reproductive toxicity potential (LOAEL) of structurally diverse chemicals in accordance with the OECD guidelines. Accordingly, decision tree forest (DTF) and decision tree boost (DTB) QSAR models were developed using a novel dataset composed of the toxicity endpoints for 334 chemicals. Relevant structural features of chemicals responsible for toxicity potential were identified and used in QSAR modeling. The generalization and prediction abilities of the constructed QSAR models were evaluated by internal and external validation procedures and by deriving several stringent statistical criteria parameters. In the test set, the two models (DTF and DTB) yielded R2 of 0.856 and 0.945, between the experimental and predicted endpoint toxicity values. The models were also evaluated for predictive use through the most recent criteria based on root mean squared error (RMSE) and mean absolute error (MAE). The values of various statistical validation coefficients derived for the test data were above their respective threshold limits and thus put a high confidence in this analysis. The applicability domains of the constructed QSAR models were defined using the leverage and standardization approaches. The results suggest that the proposed QSAR models can reliably predict the reproductive toxicity potential of diverse chemicals and can be useful tools for screening new chemicals for safety assessment.
Collapse
Affiliation(s)
| | - Shikha Gupta
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| | - Kunwar P Singh
- Environmental Chemistry Division , CSIR-Indian Institute of Toxicology Research , Post Box 80 , Mahatma Gandhi Marg , Lucknow-226 001 , India . ;
| |
Collapse
|
123
|
Roles of d-Amino Acids on the Bioactivity of Host Defense Peptides. Int J Mol Sci 2016; 17:ijms17071023. [PMID: 27376281 PMCID: PMC4964399 DOI: 10.3390/ijms17071023] [Citation(s) in RCA: 52] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2016] [Revised: 06/20/2016] [Accepted: 06/21/2016] [Indexed: 12/15/2022] Open
Abstract
Host defense peptides (HDPs) are positively-charged and amphipathic components of the innate immune system that have demonstrated great potential to become the next generation of broad spectrum therapeutic agents effective against a vast array of pathogens and tumor. As such, many approaches have been taken to improve the therapeutic efficacy of HDPs. Amongst these methods, the incorporation of d-amino acids (d-AA) is an approach that has demonstrated consistent success in improving HDPs. Although, virtually all HDP review articles briefly mentioned about the role of d-AA, however it is rather surprising that no systematic review specifically dedicated to this topic exists. Given the impact that d-AA incorporation has on HDPs, this review aims to fill that void with a systematic discussion of the impact of d-AA on HDPs.
Collapse
|
124
|
Dong J, Yao ZJ, Wen M, Zhu MF, Wang NN, Miao HY, Lu AP, Zeng WB, Cao DS. BioTriangle: a web-accessible platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions. J Cheminform 2016; 8:34. [PMID: 27330567 PMCID: PMC4915156 DOI: 10.1186/s13321-016-0146-2] [Citation(s) in RCA: 35] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 06/14/2016] [Indexed: 12/18/2022] Open
Abstract
BACKGROUND More and more evidences from network biology indicate that most cellular components exert their functions through interactions with other cellular components, such as proteins, DNAs, RNAs and small molecules. The rapidly increasing amount of publicly available data in biology and chemistry enables researchers to revisit interaction problems by systematic integration and analysis of heterogeneous data. Currently, some tools have been developed to represent these components. However, they have some limitations and only focus on the analysis of either small molecules or proteins or DNAs/RNAs. To the best of our knowledge, there is still a lack of freely-available, easy-to-use and integrated platforms for generating molecular descriptors of DNAs/RNAs, proteins, small molecules and their interactions. RESULTS Herein, we developed a comprehensive molecular representation platform, called BioTriangle, to emphasize the integration of cheminformatics and bioinformatics into a molecular informatics platform for computational biology study. It contains a feature-rich toolkit used for the characterization of various biological molecules and complex interaction samples including chemicals, proteins, DNAs/RNAs and even their interactions. By using BioTriangle, users are able to start a full pipelining from getting molecular data, molecular representation to constructing machine learning models conveniently. CONCLUSION BioTriangle provides a user-friendly interface to calculate various features of biological molecules and complex interaction samples conveniently. The computing tasks can be submitted and performed simply in a browser without any sophisticated installation and configuration process. BioTriangle is freely available at http://biotriangle.scbdd.com.Graphical abstractAn overview of BioTriangle. A platform for generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions.
Collapse
Affiliation(s)
- Jie Dong
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Zhi-Jiang Yao
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Ming Wen
- College of Chemistry and Chemical Engineering, Central South University, Changsha, People's Republic of China
| | - Min-Feng Zhu
- School of Mathematics and Statistics, Central South University, Changsha, People's Republic of China
| | - Ning-Ning Wang
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Hong-Yu Miao
- School of Public Health, University of Texas Health Science Center, Houston, TX USA
| | - Ai-Ping Lu
- Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR People's Republic of China
| | - Wen-Bin Zeng
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China
| | - Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha, People's Republic of China ; Institute for Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR People's Republic of China
| |
Collapse
|
125
|
Yao ZJ, Dong J, Che YJ, Zhu MF, Wen M, Wang NN, Wang S, Lu AP, Cao DS. TargetNet: a web service for predicting potential drug-target interaction profiling via multi-target SAR models. J Comput Aided Mol Des 2016; 30:413-24. [PMID: 27167132 DOI: 10.1007/s10822-016-9915-2] [Citation(s) in RCA: 261] [Impact Index Per Article: 29.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2016] [Accepted: 05/06/2016] [Indexed: 02/01/2023]
Abstract
Drug-target interactions (DTIs) are central to current drug discovery processes and public health fields. Analyzing the DTI profiling of the drugs helps to infer drug indications, adverse drug reactions, drug-drug interactions, and drug mode of actions. Therefore, it is of high importance to reliably and fast predict DTI profiling of the drugs on a genome-scale level. Here, we develop the TargetNet server, which can make real-time DTI predictions based only on molecular structures, following the spirit of multi-target SAR methodology. Naïve Bayes models together with various molecular fingerprints were employed to construct prediction models. Ensemble learning from these fingerprints was also provided to improve the prediction ability. When the user submits a molecule, the server will predict the activity of the user's molecule across 623 human proteins by the established high quality SAR model, thus generating a DTI profiling that can be used as a feature vector of chemicals for wide applications. The 623 SAR models related to 623 human proteins were strictly evaluated and validated by several model validation strategies, resulting in the AUC scores of 75-100 %. We applied the generated DTI profiling to successfully predict potential targets, toxicity classification, drug-drug interactions, and drug mode of action, which sufficiently demonstrated the wide application value of the potential DTI profiling. The TargetNet webserver is designed based on the Django framework in Python, and is freely accessible at http://targetnet.scbdd.com .
Collapse
Affiliation(s)
- Zhi-Jiang Yao
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, People's Republic of China
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, People's Republic of China
| | - Jie Dong
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, People's Republic of China
| | - Yu-Jing Che
- School of Mathematics and Statistics, Central South University, Changsha, 410083, People's Republic of China
| | - Min-Feng Zhu
- School of Mathematics and Statistics, Central South University, Changsha, 410083, People's Republic of China
| | - Ming Wen
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, People's Republic of China
| | - Ning-Ning Wang
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, People's Republic of China
| | - Shan Wang
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, People's Republic of China
| | - Ai-Ping Lu
- Institute of Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR, People's Republic of China
| | - Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013, People's Republic of China.
- Institute of Advancing Translational Medicine in Bone and Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Hong Kong, SAR, People's Republic of China.
| |
Collapse
|
126
|
Venkatraman V, Alsberg BK. KRAKENX: software for the generation of alignment-independent 3D descriptors. J Mol Model 2016; 22:93. [DOI: 10.1007/s00894-016-2957-5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2015] [Accepted: 03/07/2016] [Indexed: 10/22/2022]
|
127
|
Gupta S, Basant N, Singh KP. Three-Tier Strategy for Screening High-Energy Molecules Using Structure–Property Relationship Modeling Approaches. Ind Eng Chem Res 2016. [DOI: 10.1021/acs.iecr.5b03575] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Affiliation(s)
- Shikha Gupta
- Environmental
Chemistry Division, CSIR-Indian Institute of Toxicology Research, Post Box 80, Mahatma Gandhi Marg, Lucknow 226 001, India
| | | | - Kunwar P. Singh
- Environmental
Chemistry Division, CSIR-Indian Institute of Toxicology Research, Post Box 80, Mahatma Gandhi Marg, Lucknow 226 001, India
| |
Collapse
|
128
|
Žic M. An alternative approach to solve complex nonlinear least-squares problems. J Electroanal Chem (Lausanne) 2016. [DOI: 10.1016/j.jelechem.2015.11.015] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022]
|
129
|
Dong J, Cao DS, Miao HY, Liu S, Deng BC, Yun YH, Wang NN, Lu AP, Zeng WB, Chen AF. ChemDes: an integrated web-based platform for molecular descriptor and fingerprint computation. J Cheminform 2015; 7:60. [PMID: 26664458 PMCID: PMC4674923 DOI: 10.1186/s13321-015-0109-z] [Citation(s) in RCA: 194] [Impact Index Per Article: 19.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2015] [Accepted: 11/26/2015] [Indexed: 12/13/2022] Open
Abstract
BACKGROUND Molecular descriptors and fingerprints have been routinely used in QSAR/SAR analysis, virtual drug screening, compound search/ranking, drug ADME/T prediction and other drug discovery processes. Since the calculation of such quantitative representations of molecules may require substantial computational skills and efforts, several tools have been previously developed to make an attempt to ease the process. However, there are still several hurdles for users to overcome to fully harness the power of these tools. First, most of the tools are distributed as standalone software or packages that require necessary configuration or programming efforts of users. Second, many of the tools can only calculate a subset of molecular descriptors, and the results from multiple tools need to be manually merged to generate a comprehensive set of descriptors. Third, some packages only provide application programming interfaces and are implemented in different computer languages, which pose additional challenges to the integration of these tools. RESULTS A freely available web-based platform, named ChemDes, is developed in this study. It integrates multiple state-of-the-art packages (i.e., Pybel, CDK, RDKit, BlueDesc, Chemopy, PaDEL and jCompoundMapper) for computing molecular descriptors and fingerprints. ChemDes not only provides friendly web interfaces to relieve users from burdensome programming work, but also offers three useful and convenient auxiliary tools for format converting, MOPAC optimization and fingerprint similarity calculation. Currently, ChemDes has the capability of computing 3679 molecular descriptors and 59 types of molecular fingerprints. CONCLUSION ChemDes provides users an integrated and friendly tool to calculate various molecular descriptors and fingerprints. It is freely available at http://www.scbdd.com/chemdes. The source code of the project is also available as a supplementary file. Graphical abstract:An overview of ChemDes. A platform for computing various molecular descriptors and fingerprints.
Collapse
Affiliation(s)
- Jie Dong
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013 Hunan People's Republic of China
| | - Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013 Hunan People's Republic of China
| | - Hong-Yu Miao
- Department of Biostatistics, School of Public Health, University of Texas Health Science Center at Houston, Houston, USA
| | - Shao Liu
- Xiangya Hospital, Central South University, Changsha, 410008 Hunan People's Republic of China
| | - Bai-Chuan Deng
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083 People's Republic of China
| | - Yong-Huan Yun
- College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083 People's Republic of China
| | - Ning-Ning Wang
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013 Hunan People's Republic of China
| | - Ai-Ping Lu
- Institute of Advancing Translational Medicine in Bone & Joint Diseases, School of Chinese Medicine, Hong Kong Baptist University, Kowloon Tong, Hong Kong SAR People's Republic of China
| | - Wen-Bin Zeng
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013 Hunan People's Republic of China
| | - Alex F Chen
- School of Pharmaceutical Sciences, Central South University, Changsha, 410013 Hunan People's Republic of China
| |
Collapse
|
130
|
3D-MoRSE descriptors explained. J Mol Graph Model 2014; 54:194-203. [PMID: 25459771 DOI: 10.1016/j.jmgm.2014.10.006] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2014] [Revised: 09/08/2014] [Accepted: 10/08/2014] [Indexed: 11/23/2022]
Abstract
3D-MoRSE is a very flexible 3D structure encoding framework for chemoinformatics and QSAR purposes due to the range of scattering parameter values and variety of weighting schemes used. While arising in many QSAR studies, up to this time they were considered as hardly interpreted and were treated like a "black box". This study is intended to lift the veil of mystery, providing a comprehensible way to the interpretation of 3D-MoRSE descriptors in QSAR/QSPR studies. The values of these descriptors are calculated with rather simple equation, but may vary when using differing starting geometries as optimization input. This variation increases with scattering parameter and also is higher for electronegativity weighted and unweighted descriptors. Though each 3D-MoRSE descriptor incorporates the information about the whole molecule structure, its final value is derived mostly from short-distance (up to 3Å) atomic pairs. And, if a QSAR study covers structurally similar set of compounds, then the role of 3D-MoRSE descriptor in a model can be interpreted using just several pairs of neighbor atoms. The guide to interpretation process is discussed and illustrated with a case study. Realizing the mathematical concept behind 3D-descriptors and knowing their properties it is easy not only to interpret, but also to predict the importance of 3D-MoRSE descriptors in a QSAR study. The process of prediction is described on the practical example and its accuracy is confirmed with further QSAR modeling.
Collapse
|
131
|
Cao DS, Xiao N, Xu QS, Chen AF. Rcpi: R/Bioconductor package to generate various descriptors of proteins, compounds and their interactions. Bioinformatics 2014; 31:279-81. [DOI: 10.1093/bioinformatics/btu624] [Citation(s) in RCA: 82] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
|
132
|
Abstract
A great deal of research over the last several years has focused on how the inherent randomness in movements and reactivity of biomolecules can give rise to unexpected large-scale differences in the behavior of otherwise identical cells. Our own research has approached this problem from two vantage points - a microscopic kinetic view of the individual molecules (nucleic acids, proteins, etc.) diffusing and interacting in a crowded cellular environment; and a broader systems-level view of how enzyme variability can give rise to well-defined metabolic phenotypes. The former led to the development of the Lattice Microbes software - a GPU-accelerated stochastic simulator for reaction-diffusion processes in models of whole cells; the latter to the development of a method we call population flux balance analysis (FBA). The first part of this article reviews the Lattice Microbes methodology, and two recent technical advances that extend the capabilities of Lattice Microbes to enable simulations of larger organisms and colonies. The second part of this article focuses on our recent population FBA study of Escherichia coli, which predicted variability in the usage of different metabolic pathways resulting from heterogeneity in protein expression. Finally, we discuss exciting early work using a new hybrid methodology that integrates FBA with spatially resolved kinetic simulations to study how cells compete and cooperate within dense colonies and consortia.
Collapse
Affiliation(s)
- John A Cole
- Department of Physics, University of Illinois, Urbana-Champaign, 1110 West Green Street, Urbana, IL 61801 (USA)
| | - Zaida Luthey-Schulten
- Department of Chemistry, University of Illinois, Urbana-Champaign, 505 South Mathews Avenue, Urbana, IL 61801 (USA)
| |
Collapse
|
133
|
Pérez-Sánchez H, Cano G, García-Rodríguez J. Improving drug discovery using hybrid softcomputing methods. Appl Soft Comput 2014. [DOI: 10.1016/j.asoc.2013.10.033] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
134
|
Zhang Q, Zhang W, Li Y, Wang J, Zhang J, Hou T. MORT: a powerful foundational library for computational biology and CADD. J Cheminform 2014. [PMCID: PMC4085231 DOI: 10.1186/1758-2946-6-36] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Background A foundational library called MORT (Molecular Objects and Relevant Templates) for the development of new software packages and tools employed in computational biology and computer-aided drug design (CADD) is described here. Results MORT contains several advantages compared with the other libraries. Firstly, MORT written in C++ natively supports the paradigm of object-oriented design, and thus it can be understood and extended easily. Secondly, MORT employs the relational model to represent a molecule, and it is more convenient and flexible than the traditional hierarchical model employed by many other libraries. Thirdly, a lot of functions have been included in this library, and a molecule can be manipulated easily at different levels. For example, it can parse a variety of popular molecular formats (MOL/SDF, MOL2, PDB/ENT, SMILES/SMARTS, etc.), create the topology and coordinate files for the simulations supported by AMBER, calculate the energy of a specific molecule based on the AMBER force fields, etc. Conclusions We believe that MORT can be used as a foundational library for programmers to develop new programs and applications for computational biology and CADD. Source code of MORT is available at
http://cadd.suda.edu.cn/MORT/index.htm.
Collapse
|
135
|
Recent Advances in the Open Access Cheminformatics Toolkits, Software Tools, Workflow Environments, and Databases. METHODS IN PHARMACOLOGY AND TOXICOLOGY 2014. [DOI: 10.1007/7653_2014_35] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/30/2023]
|
136
|
Cao DS, Zhou GH, Liu S, Zhang LX, Xu QS, He M, Liang YZ. Large-scale prediction of human kinase-inhibitor interactions using protein sequences and molecular topological structures. Anal Chim Acta 2013; 792:10-8. [PMID: 23910962 DOI: 10.1016/j.aca.2013.07.003] [Citation(s) in RCA: 23] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/02/2013] [Revised: 07/03/2013] [Accepted: 07/04/2013] [Indexed: 01/28/2023]
Abstract
The kinase family is one of the largest target families in the human genome. The family's key function in signal transduction for all organisms makes it a very attractive target class for the therapeutic interventions in many diseases states such as cancer, diabetes, inflammation and arthritis. A first step toward accelerating kinase drug discovery process is to fast identify whether a chemical and a kinase interact or not. Experimentally, these interactions can be identified by in vitro binding assay - an expensive and laborious procedure that is not applicable on a large scale. Therefore, there is an urgent need to develop statistically efficient approaches for identifying kinase-inhibitor interactions. For the first time, the quantitative binding affinities of kinase-inhibitor pairs are differentiated as a measurement to define if an inhibitor interacts with a kinase, and then a chemogenomics framework using an unbiased set of general integrated features (drug descriptors and protein descriptors) and random forest (RF) is employed to construct a predictive model which can accurately classify kinase-inhibitor pairs. Our results show that RF with integrated features gave prediction accuracy of 93.76%, sensitivity of 92.26%, and specificity of 95.27%, respectively. The results are superior to those by only considering two separated spaces (chemical space and protein space), demonstrating that these integrated features contribute cooperatively. Based on the constructed model, we provided a high confidence list of drug-target associations for subsequent experimental investigation guidance at a low false discovery rate.
Collapse
Affiliation(s)
- Dong-Sheng Cao
- School of Pharmaceutical Sciences, Central South University, Changsha 410013, PR China.
| | | | | | | | | | | | | |
Collapse
|