1
|
Liu M, Zhang L, Li S, Yang T, Liu L, Zhao J, Liu H. Prediction of hERG potassium channel blockage using ensemble learning methods and molecular fingerprints. Toxicol Lett 2020; 332:88-96. [PMID: 32629073 DOI: 10.1016/j.toxlet.2020.07.003] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2020] [Revised: 06/16/2020] [Accepted: 07/02/2020] [Indexed: 11/30/2022]
Abstract
The human ether-a-go-go-related gene (hERG) encodes a tetrameric potassium channel called Kv11.1. This channel can be blocked by certain drugs, which leads to long QT syndrome, causing cardiotoxicity. This is a significant problem during drug development. Using computer models to predict compound cardiotoxicity during the early stages of drug design will help to solve this problem. In this study, we used a dataset of 1865 compounds exhibiting known hERG inhibitory activities as a training set. Thirty cardiotoxicity classification models were established using three machine learning algorithms based on molecular fingerprints and molecular descriptors. Through using these models as the base classifier, a new cardiotoxicity classification model with better predictive performance was developed using ensemble learning method. The accuracy of the best base classifier, which was generated using the XGBoost method with molecular descriptors, was 84.8 %, and the area under the receiver-operating characteristic curve (AUC) was 0.876 in the five fold cross-validation. However, all of the ensemble models that we developed had higher predictive performance than the base classifiers in the five fold cross-validation. The best predictive performance was achieved by the Ensemble-Top7 model, with accuracy of 84.9 % and AUC of 0.887. We also tested the ensemble model using external validation data and achieved accuracy of 85.0 % and AUC of 0.786. Furthermore, we identified several hERG-related substructures, which provide valuable information for designing drug candidates.
Collapse
Affiliation(s)
- Miao Liu
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Li Zhang
- School of Life Science, Liaoning University, Shenyang, 110036, China; Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Liaoning University, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China
| | - Shimeng Li
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Tianzhou Yang
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Lili Liu
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Jian Zhao
- School of Life Science, Liaoning University, Shenyang, 110036, China
| | - Hongsheng Liu
- School of Life Science, Liaoning University, Shenyang, 110036, China; Research Center for Computer Simulating and Information Processing of Bio-macromolecules of Shenyang, Liaoning University, Shenyang, 110036, China; Engineering Laboratory for Molecular Simulation and Designing of Drug Molecules of Liaoning, Liaoning University, Shenyang, 110036, China.
| |
Collapse
|
2
|
Mayr F, Vieider C, Temml V, Stuppner H, Schuster D. Open-Access Activity Prediction Tools for Natural Products. Case Study: hERG Blockers. PROGRESS IN THE CHEMISTRY OF ORGANIC NATURAL PRODUCTS 2019; 110:177-238. [PMID: 31621014 DOI: 10.1007/978-3-030-14632-0_6] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Abstract
Interference with the hERG potassium ion channel may cause cardiac arrhythmia and can even lead to death. Over the last few decades, several drugs, already on the market, and many more investigational drugs in various development stages, have had to be discontinued because of their hERG-associated toxicity. To recognize potential hERG activity in the early stages of drug development, a wide array of computational tools, based on different principles, such as 3D QSAR, 2D and 3D similarity, and machine learning, have been developed and are reviewed in this chapter. The various available prediction tools Similarity Ensemble Approach, SuperPred, SwissTargetPrediction, HitPick, admetSAR, PASSonline, Pred-hERG, and VirtualToxLab™ were used to screen a dataset of known hERG synthetic and natural product actives and inactives to quantify and compare their predictive power. This contribution will allow the reader to evaluate the suitability of these computational methods for their own related projects. There is an unmet need for natural product-specific prediction tools in this field.
Collapse
Affiliation(s)
- Fabian Mayr
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Christian Vieider
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria
| | - Veronika Temml
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
| | - Hermann Stuppner
- Institute of Pharmacy/Pharmacognosy, University of Innsbruck, Innsbruck, Austria
| | - Daniela Schuster
- Institute of Pharmacy/Pharmaceutical Chemistry, University of Innsbruck, Innsbruck, Austria.
- Department of Pharmaceutical and Medicinal Chemistry, Institute of Pharmacy, Paracelsus Medical University Salzburg, Salzburg, Austria.
| |
Collapse
|
3
|
Munawar S, Windley MJ, Tse EG, Todd MH, Hill AP, Vandenberg JI, Jabeen I. Experimentally Validated Pharmacoinformatics Approach to Predict hERG Inhibition Potential of New Chemical Entities. Front Pharmacol 2018; 9:1035. [PMID: 30333745 PMCID: PMC6176658 DOI: 10.3389/fphar.2018.01035] [Citation(s) in RCA: 31] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2018] [Accepted: 08/27/2018] [Indexed: 12/17/2022] Open
Abstract
The hERG (human ether-a-go-go-related gene) encoded potassium ion (K+) channel plays a major role in cardiac repolarization. Drug-induced blockade of hERG has been a major cause of potentially lethal ventricular tachycardia termed Torsades de Pointes (TdPs). Therefore, we presented a pharmacoinformatics strategy using combined ligand and structure based models for the prediction of hERG inhibition potential (IC50) of new chemical entities (NCEs) during early stages of drug design and development. Integrated GRid-INdependent Descriptor (GRIND) models, and lipophilic efficiency (LipE), ligand efficiency (LE) guided template selection for the structure based pharmacophore models have been used for virtual screening and subsequent hERG activity (pIC50) prediction of identified hits. Finally selected two hits were experimentally evaluated for hERG inhibition potential (pIC50) using whole cell patch clamp assay. Overall, our results demonstrate a difference of less than ±1.6 log unit between experimentally determined and predicted hERG inhibition potential (IC50) of the selected hits. This revealed predictive ability and robustness of our models and could help in correctly rank the potency order (lower μM to higher nM range) against hERG.
Collapse
Affiliation(s)
- Saba Munawar
- Research Center for Modeling and Simulation, National University of Science and Technology, Islamabad, Pakistan.,Victor Chang Cardiac Research Institute, Sydney, NSW, Australia
| | | | - Edwin G Tse
- School of Chemistry, The University of Sydney, Sydney, NSW, Australia
| | - Matthew H Todd
- School of Chemistry, The University of Sydney, Sydney, NSW, Australia
| | - Adam P Hill
- Victor Chang Cardiac Research Institute, Sydney, NSW, Australia
| | | | - Ishrat Jabeen
- Research Center for Modeling and Simulation, National University of Science and Technology, Islamabad, Pakistan
| |
Collapse
|
4
|
Abstract
Various methods of machine learning, supervised and unsupervised, linear and nonlinear, classification and regression, in combination with various types of molecular descriptors, both "handcrafted" and "data-driven," are considered in the context of their use in computational toxicology. The use of multiple linear regression, variants of naïve Bayes classifier, k-nearest neighbors, support vector machine, decision trees, ensemble learning, random forest, several types of neural networks, and deep learning is the focus of attention of this review. The role of fragment descriptors, graph mining, and graph kernels is highlighted. The application of unsupervised methods, such as Kohonen's self-organizing maps and related approaches, which allow for combining predictions with data analysis and visualization, is also considered. The necessity of applying a wide range of machine learning methods in computational toxicology is underlined.
Collapse
Affiliation(s)
- Igor I Baskin
- Faculty of Physics, M.V. Lomonosov Moscow State University, Moscow, Russian Federation.
- Butlerov Institute of Chemistry, Kazan Federal University, Kazan, Russian Federation.
| |
Collapse
|
5
|
Miyao T, Kaneko H, Funatsu K. Inverse QSPR/QSAR Analysis for Chemical Structure Generation (from y to x). J Chem Inf Model 2016; 56:286-99. [PMID: 26818135 DOI: 10.1021/acs.jcim.5b00628] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Retrieving descriptor information (x information) from a value of an objective variable (y) is a fundamental problem in inverse quantitative structure-property relationship (inverse-QSPR) analysis but challenging because of the complexity of the preimage function. Herewith, we propose using a cluster-wise multiple linear regression (cMLR) model as a QSPR model for inverse-QSPR analysis. x information is acquired as a probability density function by combining cMLR and the prior distribution modeled with a mixture of Gaussians (GMMs). Three case studies were conducted to demonstrate various aspects of the potential of cMLR. It was found that the predictive power of cMLR was superior to that of MLR, especially for data with nonlinearity. Moreover, it turned out that the applicability domain could be considered since the posterior distribution inherits the prior distribution's feature (i.e., training data feature) and represents the possibility of having the desired property. Finally, a series of inverse analyses with the GMMs/cMLR was demonstrated with the aim to generate de novo structures having specific aqueous solubility.
Collapse
Affiliation(s)
- Tomoyuki Miyao
- Department of Chemical System Engineering, The University of Tokyo , 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Hiromasa Kaneko
- Department of Chemical System Engineering, The University of Tokyo , 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| | - Kimito Funatsu
- Department of Chemical System Engineering, The University of Tokyo , 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan
| |
Collapse
|
6
|
Gaspar HA, Baskin II, Marcou G, Horvath D, Varnek A. Stargate GTM: Bridging Descriptor and Activity Spaces. J Chem Inf Model 2015; 55:2403-10. [DOI: 10.1021/acs.jcim.5b00398] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Héléna A. Gaspar
- Laboratoire
de Chemoinformatique, UMR 7140, Université de Strasbourg, 1 rue
Blaise Pascal, Strasbourg 67000, France
| | - Igor I. Baskin
- Faculty
of Physics, M.V. Lomonosov Moscow State University, Leninskie Gory, Moscow 119991, Russia
- Laboratory
of Chemoinformatics, Butlerov Institute of Chemistry, Kazan Federal University, Kazan 420008, Russia
| | - Gilles Marcou
- Laboratoire
de Chemoinformatique, UMR 7140, Université de Strasbourg, 1 rue
Blaise Pascal, Strasbourg 67000, France
| | - Dragos Horvath
- Laboratoire
de Chemoinformatique, UMR 7140, Université de Strasbourg, 1 rue
Blaise Pascal, Strasbourg 67000, France
| | - Alexandre Varnek
- Laboratoire
de Chemoinformatique, UMR 7140, Université de Strasbourg, 1 rue
Blaise Pascal, Strasbourg 67000, France
- Laboratory
of Chemoinformatics, Butlerov Institute of Chemistry, Kazan Federal University, Kazan 420008, Russia
| |
Collapse
|
7
|
Computational investigations of hERG channel blockers: New insights and current predictive models. Adv Drug Deliv Rev 2015; 86:72-82. [PMID: 25770776 DOI: 10.1016/j.addr.2015.03.003] [Citation(s) in RCA: 73] [Impact Index Per Article: 8.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 01/13/2015] [Accepted: 03/04/2015] [Indexed: 01/08/2023]
Abstract
Identification of potential human Ether-a-go-go Related-Gene (hERG) potassium channel blockers is an essential part of the drug development and drug safety process in pharmaceutical industries or academic drug discovery centers, as they may lead to drug-induced QT prolongation, arrhythmia and Torsade de Pointes. Recent reports also suggest starting to address such issues at the hit selection stage. In order to prioritize molecules during the early drug discovery phase and to reduce the risk of drug attrition due to cardiotoxicity during pre-clinical and clinical stages, computational approaches have been developed to predict the potential hERG blockage of new drug candidates. In this review, we will describe the current in silico methods developed and applied to predict and to understand the mechanism of actions of hERG blockers, including ligand-based and structure-based approaches. We then discuss ongoing research on other ion channels and hERG polymorphism susceptible to be involved in LQTS and how systemic approaches can help in the drug safety decision.
Collapse
|
8
|
Gaspar HA, Baskin II, Marcou G, Horvath D, Varnek A. GTM-Based QSAR Models and Their Applicability Domains. Mol Inform 2015; 34:348-56. [DOI: 10.1002/minf.201400153] [Citation(s) in RCA: 45] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2014] [Accepted: 11/28/2014] [Indexed: 11/06/2022]
|
9
|
Wiśniowska B, Mendyk A, Szlęk J, Kołaczkowski M, Polak S. Enhanced QSAR models for drug-triggered inhibition of the main cardiac ion currents. J Appl Toxicol 2015; 35:1030-9. [DOI: 10.1002/jat.3095] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/23/2014] [Revised: 10/27/2014] [Accepted: 10/31/2014] [Indexed: 11/06/2022]
Affiliation(s)
- Barbara Wiśniowska
- Unit of Pharmacoepidemiology and Pharmacoeconomics, Faculty of Pharmacy, Medical College; Jagiellonian University; Krakow Poland
| | - Aleksander Mendyk
- Department of Pharmaceutical Technology and Biopharmaceutics, Faculty of Pharmacy, Medical College; Jagiellonian University; Krakow Poland
| | - Jakub Szlęk
- Department of Pharmaceutical Technology and Biopharmaceutics, Faculty of Pharmacy, Medical College; Jagiellonian University; Krakow Poland
| | - Michał Kołaczkowski
- Building and Structure Physics Division, Institute of Building Materials and Structures, Faculty of Civil Engineering; Cracow University of Technology; Krakow Poland
| | - Sebastian Polak
- Unit of Pharmacoepidemiology and Pharmacoeconomics, Faculty of Pharmacy, Medical College; Jagiellonian University; Krakow Poland
| |
Collapse
|
10
|
Devillers J, Lagneau C, Lattes A, Garrigues J, Clémenté M, Yébakima A. In silico models for predicting vector control chemicals targeting Aedes aegypti. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2014; 25:805-835. [PMID: 25275884 PMCID: PMC4200584 DOI: 10.1080/1062936x.2014.958291] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/20/2014] [Accepted: 07/30/2014] [Indexed: 05/31/2023]
Abstract
Human arboviral diseases have emerged or re-emerged in numerous countries worldwide due to a number of factors including the lack of progress in vaccine development, lack of drugs, insecticide resistance in mosquitoes, climate changes, societal behaviours, and economical constraints. Thus, Aedes aegypti is the main vector of the yellow fever and dengue fever flaviviruses and is also responsible for several recent outbreaks of the chikungunya alphavirus. As for the other mosquito species, the A. aegypti control relies heavily on the use of insecticides. However, because of increasing resistance to the different families of insecticides, reduction of Aedes populations is becoming increasingly difficult. Despite the unquestionable utility of insecticides in fighting mosquito populations, there are very few new insecticides developed and commercialized for vector control. This is because the high cost of the discovery of an insecticide is not counterbalanced by the 'low profitability' of the vector control market. Fortunately, the use of quantitative structure-activity relationship (QSAR) modelling allows the reduction of time and cost in the discovery of new chemical structures potentially active against mosquitoes. In this context, the goal of the present study was to review all the existing QSAR models on A. aegypti. The homology and pharmacophore models were also reviewed. Specific attention was paid to show the variety of targets investigated in Aedes in relation to the physiology and ecology of the mosquito as well as the diversity of the chemical structures which have been proposed, encompassing man-made and natural substances.
Collapse
Affiliation(s)
| | | | - A. Lattes
- Laboratoire I.M.R.C.P., Université Paul Sabatier, Toulouse, France
| | - J.C. Garrigues
- Laboratoire I.M.R.C.P., Université Paul Sabatier, Toulouse, France
| | - M.M. Clémenté
- Centre de Démoustication/LAV (ARS-Conseil Général) de la Martinique, Martinique, France
| | - A. Yébakima
- Centre de Démoustication/LAV (ARS-Conseil Général) de la Martinique, Martinique, France
| |
Collapse
|
11
|
Ovchinnikova SI, Bykov AA, Tsivadze AY, Dyachkov EP, Kireeva NV. Supervised extensions of chemography approaches: case studies of chemical liabilities assessment. J Cheminform 2014; 6:20. [PMID: 24868246 PMCID: PMC4018504 DOI: 10.1186/1758-2946-6-20] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2013] [Accepted: 04/28/2014] [Indexed: 12/04/2022] Open
Abstract
Chemical liabilities, such as adverse effects and toxicity, play a significant role in modern drug discovery process. In silico assessment of chemical liabilities is an important step aimed to reduce costs and animal testing by complementing or replacing in vitro and in vivo experiments. Herein, we propose an approach combining several classification and chemography methods to be able to predict chemical liabilities and to interpret obtained results in the context of impact of structural changes of compounds on their pharmacological profile. To our knowledge for the first time, the supervised extension of Generative Topographic Mapping is proposed as an effective new chemography method. New approach for mapping new data using supervised Isomap without re-building models from the scratch has been proposed. Two approaches for estimation of model's applicability domain are used in our study to our knowledge for the first time in chemoinformatics. The structural alerts responsible for the negative characteristics of pharmacological profile of chemical compounds has been found as a result of model interpretation.
Collapse
Affiliation(s)
- Svetlana I Ovchinnikova
- Frumkin Institute of Physical Chemistry and Electrochemistry RAS, Leninsky pr-t 31-4, 119071 Moscow, Russia
- Moscow Institute of Physics and Technology, Institutsky per., 9, 141700 Dolgoprudny, Russia
| | - Arseniy A Bykov
- Frumkin Institute of Physical Chemistry and Electrochemistry RAS, Leninsky pr-t 31-4, 119071 Moscow, Russia
- Moscow Institute of Physics and Technology, Institutsky per., 9, 141700 Dolgoprudny, Russia
| | - Aslan Yu Tsivadze
- Frumkin Institute of Physical Chemistry and Electrochemistry RAS, Leninsky pr-t 31-4, 119071 Moscow, Russia
| | - Evgeny P Dyachkov
- Kurnakov Institute of General and Inorganic Chemistry RAS, Leninsky pr-t 31, 119071 Moscow, Russia
| | - Natalia V Kireeva
- Frumkin Institute of Physical Chemistry and Electrochemistry RAS, Leninsky pr-t 31-4, 119071 Moscow, Russia
- Moscow Institute of Physics and Technology, Institutsky per., 9, 141700 Dolgoprudny, Russia
| |
Collapse
|
12
|
Nonlinear Dimensionality Reduction for Visualizing Toxicity Data: Distance-Based Versus Topology-Based Approaches. ChemMedChem 2014; 9:1047-59. [DOI: 10.1002/cmdc.201400027] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2014] [Indexed: 01/11/2023]
|
13
|
Impact of distance-based metric learning on classification and visualization model performance and structure–activity landscapes. J Comput Aided Mol Des 2014; 28:61-73. [DOI: 10.1007/s10822-014-9719-1] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2013] [Accepted: 01/24/2014] [Indexed: 10/25/2022]
|
14
|
Rampe D, Brown AM. A history of the role of the hERG channel in cardiac risk assessment. J Pharmacol Toxicol Methods 2013; 68:13-22. [DOI: 10.1016/j.vascn.2013.03.005] [Citation(s) in RCA: 56] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2013] [Revised: 03/14/2013] [Accepted: 03/14/2013] [Indexed: 01/25/2023]
|