1
|
Puerta A, González-Bakker A, Santos G, Padrón JM. Early Pharmacological Profiling of Antiproliferative Compounds by Live Cell Imaging. Molecules 2022; 27:molecules27165261. [PMID: 36014500 PMCID: PMC9415461 DOI: 10.3390/molecules27165261] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 08/11/2022] [Accepted: 08/16/2022] [Indexed: 12/04/2022] Open
Abstract
Natural products represent an excellent source of unprecedented anticancer compounds. However, the identification of the mechanism of action remains a major challenge. Several techniques and methodologies have been considered, but with limited success. In this work, we explored the combination of live cell imaging and machine learning techniques as a promising tool to depict in a fast and affordable test the mode of action of natural compounds with antiproliferative activity. To develop the model, we selected the non-small cell lung cancer cell line SW1573, which was exposed to the known antimitotic drugs paclitaxel, colchicine and vinblastine. The novelty of our methodology focuses on two main features with the highest relevance, (a) meaningful phenotypic metrics, and (b) fast Fourier transform (FFT) of the time series of the phenotypic parameters into their corresponding amplitudes and phases. The resulting algorithm was able to cluster the microtubule disruptors, and meanwhile showed a negative correlation between paclitaxel and the other treatments. The FFT approach was able to group the samples as efficiently as checking by eye. This methodology could easily scale to group a large amount of data without visual supervision.
Collapse
Affiliation(s)
- Adrián Puerta
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
| | - Aday González-Bakker
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
| | - Guido Santos
- Departament of Biochemistry, Microbiology, Cell Biology and Genetics, Faculty of Sciences, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez s/n, 38206 La Laguna, Spain
| | - José M. Padrón
- BioLab, Instituto Universitario de Bio-Orgánica “Antonio González”, Universidad de La Laguna, Avenida Astrofísico Francisco Sánchez 2, 38206 La Laguna, Spain
- Correspondence: ; Tel.: +34-922-316-502 (ext. 6126)
| |
Collapse
|
2
|
Trapotsi MA, Hosseini-Gerami L, Bender A. Computational analyses of mechanism of action (MoA): data, methods and integration. RSC Chem Biol 2022; 3:170-200. [PMID: 35360890 PMCID: PMC8827085 DOI: 10.1039/d1cb00069a] [Citation(s) in RCA: 26] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2021] [Accepted: 12/09/2021] [Indexed: 12/15/2022] Open
Abstract
The elucidation of a compound's Mechanism of Action (MoA) is a challenging task in the drug discovery process, but it is important in order to rationalise phenotypic findings and to anticipate potential side-effects. Bioinformatic approaches, advances in machine learning techniques and the increasing deposition of high-throughput data in public databases have significantly contributed to recent advances in the field, but it is not straightforward to decide which data and methods are most suitable to use in a given case. In this review, we focus on these methods and data and their applications in generating MoA hypotheses for subsequent experimental validation. We discuss compound-specific data such as -omics, cell morphology and bioactivity data, as well as commonly used supplementary prior knowledge such as network and pathway data, and provide information on databases where this data can be accessed. In terms of methodologies, we discuss both well-established methods (connectivity mapping, pathway enrichment) as well as more developing methods (neural networks and multi-omics integration). Finally, we review case studies where the MoA of a compound was successfully suggested from computational analysis by incorporating multiple data modalities and/or methodologies. Our aim for this review is to provide researchers with insights into the benefits and drawbacks of both the data and methods in terms of level of understanding, biases and interpretation - and to highlight future avenues of investigation which we foresee will improve the field of MoA elucidation, including greater public access to -omics data and methodologies which are capable of data integration.
Collapse
Affiliation(s)
- Maria-Anna Trapotsi
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Layla Hosseini-Gerami
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| | - Andreas Bender
- Centre for Molecular Informatics, Yusuf Hamied Department of Chemistry, University of Cambridge UK
| |
Collapse
|
3
|
Hughes RE, Elliott RJR, Dawson JC, Carragher NO. High-content phenotypic and pathway profiling to advance drug discovery in diseases of unmet need. Cell Chem Biol 2021; 28:338-355. [PMID: 33740435 DOI: 10.1016/j.chembiol.2021.02.015] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/29/2020] [Revised: 12/10/2020] [Accepted: 02/18/2021] [Indexed: 02/07/2023]
Abstract
Conventional thinking in modern drug discovery postulates that the design of highly selective molecules which act on a single disease-associated target will yield safer and more effective drugs. However, high clinical attrition rates and the lack of progress in developing new effective treatments for many important diseases of unmet therapeutic need challenge this hypothesis. This assumption also impinges upon the efficiency of target agnostic phenotypic drug discovery strategies, where early target deconvolution is seen as a critical step to progress phenotypic hits. In this review we provide an overview of how emerging phenotypic and pathway-profiling technologies integrate to deconvolute the mechanism-of-action of phenotypic hits. We propose that such in-depth mechanistic profiling may support more efficient phenotypic drug discovery strategies that are designed to more appropriately address complex heterogeneous diseases of unmet need.
Collapse
Affiliation(s)
- Rebecca E Hughes
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XR, UK
| | - Richard J R Elliott
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XR, UK
| | - John C Dawson
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XR, UK
| | - Neil O Carragher
- Cancer Research UK Edinburgh Centre, MRC Institute of Genetics and Molecular Medicine, University of Edinburgh, Edinburgh EH4 2XR, UK.
| |
Collapse
|
4
|
Drakakis G, Cortés-Ciriano I, Alexander-Dann B, Bender A. Elucidating Compound Mechanism of Action and Predicting Cytotoxicity Using Machine Learning Approaches, Taking Prediction Confidence into Account. ACTA ACUST UNITED AC 2020; 11:e73. [PMID: 31483099 DOI: 10.1002/cpch.73] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
The modes of action (MoAs) of drugs frequently are unknown, because many are small molecules initially identified from phenotypic screens, giving rise to the need to elucidate their MoAs. In addition, the high attrition rate for candidate drugs in preclinical studies due to intolerable toxicity has motivated the development of computational approaches to predict drug candidate (cyto)toxicity as early as possible in the drug-discovery process. Here, we provide detailed instructions for capitalizing on bioactivity predictions to elucidate the MoAs of small molecules and infer their underlying phenotypic effects. We illustrate how these predictions can be used to infer the underlying antidepressive effects of marketed drugs. We also provide the necessary functionalities to model cytotoxicity data using single and ensemble machine-learning algorithms. Finally, we give detailed instructions on how to calculate confidence intervals for individual predictions using the conformal prediction framework. © 2019 by John Wiley & Sons, Inc.
Collapse
Affiliation(s)
- Georgios Drakakis
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Isidro Cortés-Ciriano
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Ben Alexander-Dann
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
5
|
Baldo F. Prediction of modes of action of components of traditional medicinal preparations. PHYSICAL SCIENCES REVIEWS 2020. [DOI: 10.1515/psr-2018-0115] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
AbstractTraditional medicine preparations are used to treat many ailments in multiple regions across the world. Despite their widespread use, the mode of action of these preparations and their constituents are not fully understood. Traditional methods of elucidating the modes of action of these natural products (NPs) can be expensive and time consuming e. g. biochemical methods, bioactivity guided fractionation, etc. In this review, we discuss some methods for the prediction of the modes of action of traditional medicine preparations, both in mixtures and as isolated NPs. These methods are useful to predict targets of NPs before they are experimentally validated. Case studies of the applications of these methods are also provided herein.
Collapse
|
6
|
Allen CHG, Mervin LH, Mahmoud SY, Bender A. Leveraging heterogeneous data from GHS toxicity annotations, molecular and protein target descriptors and Tox21 assay readouts to predict and rationalise acute toxicity. J Cheminform 2019; 11:36. [PMID: 31152262 PMCID: PMC6544914 DOI: 10.1186/s13321-019-0356-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2018] [Accepted: 05/15/2019] [Indexed: 01/06/2023] Open
Abstract
Despite the increasing knowledge in both the chemical and biological domains the assimilation and exploration of heterogeneous datasets, encoding information about the chemical, bioactivity and phenotypic properties of compounds, remains a challenge due to requirement for overlap between chemicals assayed across the spaces. Here, we have constructed a novel dataset, larger than we have used in prior work, comprising 579 acute oral toxic compounds and 1427 non-toxic compounds derived from regulatory GHS information, along with their corresponding molecular and protein target descriptors and qHTS in vitro assay readouts from the Tox21 project. We found no clear association between the results of a FAFDrugs4 toxicophore screen and the acute oral toxicity classifications for our compound set; and a screen using a subset of the ToxAlerts toxicophores was also of limited utility, with only slight enrichment toward the toxic set (odds ratio of 1.48). We then investigated to what degree toxic and non-toxic compounds could be separated in each of the spaces, to compare their potential contribution to further analyses. Using an LDA projection, we found the largest degree of separation using chemical descriptors (Cohen’s d of 1.95) and the lowest degree of separation between toxicity classes using qHTS descriptors (Cohen’s d of 0.67). To compare the predictivity of the feature spaces for the toxicity endpoint, we next trained Random Forest (RF) acute oral toxicity classifiers on either molecular, protein target and qHTS descriptors. RFs trained on molecular and protein target descriptors were most predictive, with ROC AUC values of 0.80–0.92 and 0.70–0.85, respectively, across three test sets. RFs trained on both chemical and protein target descriptors combined exhibited similar predictive performance to the single-domain models (ROC AUC of 0.80–0.91). Model interpretability was improved by the inclusion of protein target descriptors, which allow the identification of specific targets (e.g. Retinal dehydrogenase) with literature links to toxic modes of action (e.g. oxidative stress). The dataset compiled in this study has been made available for future application.
Collapse
Affiliation(s)
- Chad H G Allen
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Lewis H Mervin
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Samar Y Mahmoud
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK
| | - Andreas Bender
- Department of Chemistry, Centre for Molecular Informatics, Lensfield Road, Cambridge, CB2 1EW, UK.
| |
Collapse
|
7
|
Abstract
Drugs modulate disease states through their actions on targets in the body. Determining these targets aids the focused development of new treatments, and helps to better characterize those already employed. One means of accomplishing this is through the deployment of in silico methodologies, harnessing computational analytical and predictive power to produce educated hypotheses for experimental verification. Here, we provide an overview of the current state of the art, describe some of the well-established methods in detail, and reflect on how they, and emerging technologies promoting the incorporation of complex and heterogeneous data-sets, can be employed to improve our understanding of (poly)pharmacology.
Collapse
Affiliation(s)
- Ryan Byrne
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Zurich, Switzerland.
| |
Collapse
|
8
|
Mervin LH, Afzal AM, Brive L, Engkvist O, Bender A. Extending in Silico Protein Target Prediction Models to Include Functional Effects. Front Pharmacol 2018; 9:613. [PMID: 29942259 PMCID: PMC6004408 DOI: 10.3389/fphar.2018.00613] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2018] [Accepted: 05/22/2018] [Indexed: 12/31/2022] Open
Abstract
In silico protein target deconvolution is frequently used for mechanism-of-action investigations; however existing protocols usually do not predict compound functional effects, such as activation or inhibition, upon binding to their protein counterparts. This study is hence concerned with including functional effects in target prediction. To this end, we assimilated a bioactivity training set for 332 targets, comprising 817,239 active data points with unknown functional effect (binding data) and 20,761,260 inactive compounds, along with 226,045 activating and 1,032,439 inhibiting data points from functional screens. Chemical space analysis of the data first showed some separation between compound sets (binding and inhibiting compounds were more similar to each other than both binding and activating or activating and inhibiting compounds), providing a rationale for implementing functional prediction models. We employed three different architectures to predict functional response, ranging from simplistic random forest models ('Arch1') to cascaded models which use separate binding and functional effect classification steps ('Arch2' and 'Arch3'), differing in the way training sets were generated. Fivefold stratified cross-validation outlined cascading predictions provides superior precision and recall based on an internal test set. We next prospectively validated the architectures using a temporal set of 153,467 of in-house data points (after a 4-month interim from initial data extraction). Results outlined Arch3 performed with the highest target class averaged precision and recall scores of 71% and 53%, which we attribute to the use of inactive background sets. Distance-based applicability domain (AD) analysis outlined that Arch3 provides superior extrapolation into novel areas of chemical space, and thus based on the results presented here, propose as the most suitable architecture for the functional effect prediction of small molecules. We finally conclude including functional effects could provide vital insight in future studies, to annotate cases of unanticipated functional changeover, as outlined by our CHRM1 case study.
Collapse
Affiliation(s)
- Lewis H Mervin
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Avid M Afzal
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | | | - Ola Engkvist
- Hit Discovery, Discovery Sciences, IMED Biotech Unit, AstraZeneca, Gothenburg, Sweden
| | - Andreas Bender
- Centre for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
9
|
Cruz-Monteagudo M, Schürer S, Tejera E, Pérez-Castillo Y, Medina-Franco JL, Sánchez-Rodríguez A, Borges F. Systemic QSAR and phenotypic virtual screening: chasing butterflies in drug discovery. Drug Discov Today 2017; 22:994-1007. [PMID: 28274840 PMCID: PMC5487293 DOI: 10.1016/j.drudis.2017.02.004] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2016] [Revised: 02/02/2017] [Accepted: 02/27/2017] [Indexed: 12/20/2022]
Abstract
Current advances in systems biology suggest a new change of paradigm reinforcing the holistic nature of the drug discovery process. According to the principles of systems biology, a simple drug perturbing a network of targets can trigger complex reactions. Therefore, it is possible to connect initial events with final outcomes and consequently prioritize those events, leading to a desired effect. Here, we introduce a new concept, 'Systemic Chemogenomics/Quantitative Structure-Activity Relationship (QSAR)'. To elaborate on the concept, relevant information surrounding it is addressed. The concept is challenged by implementing a systemic QSAR approach for phenotypic virtual screening (VS) of candidate ligands acting as neuroprotective agents in Parkinson's disease (PD). The results support the suitability of the approach for the phenotypic prioritization of drug candidates.
Collapse
Affiliation(s)
- Maykel Cruz-Monteagudo
- CIQUP/Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto 4169-007, Portugal.
| | - Stephan Schürer
- Department of Pharmacology, Miller School of Medicine and Center for Computational Science, University of Miami, Miami, FL 33136, USA
| | - Eduardo Tejera
- Instituto de Investigaciones Biomédicas (IIB), Universidad de Las Américas, 170513 Quito, Ecuador
| | - Yunierkis Pérez-Castillo
- Sección Físico Química y Matemáticas, Departamento de Química, Universidad Técnica Particular de Loja, San Cayetano Alto S/N, EC1101608 Loja, Ecuador
| | - José L Medina-Franco
- Universidad Nacional Autónoma de México, Departamento de Farmacia, Facultad de Química, Avenida Universidad 3000, Mexico City, 04510, Mexico
| | - Aminael Sánchez-Rodríguez
- Departamento de Ciencias Naturales, Universidad Técnica Particular de Loja, Calle París S/N, EC1101608 Loja, Ecuador
| | - Fernanda Borges
- CIQUP/Departamento de Química e Bioquímica, Faculdade de Ciências, Universidade do Porto, Porto 4169-007, Portugal.
| |
Collapse
|
10
|
Kinaret P, Marwah V, Fortino V, Ilves M, Wolff H, Ruokolainen L, Auvinen P, Savolainen K, Alenius H, Greco D. Network Analysis Reveals Similar Transcriptomic Responses to Intrinsic Properties of Carbon Nanomaterials in Vitro and in Vivo. ACS NANO 2017; 11:3786-3796. [PMID: 28380293 DOI: 10.1021/acsnano.6b08650] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/07/2023]
Abstract
Understanding the complex molecular alterations related to engineered nanomaterial (ENM) exposure is essential for carrying out toxicity assessment. Current experimental paradigms rely on both in vitro and in vivo exposure setups that often are difficult to compare, resulting in questioning the real efficacy of cell models to mimic more complex exposure scenarios at the organism level. Here, we have systematically investigated transcriptomic responses of the THP-1 macrophage cell line and lung tissues of mice, after exposure to several carbon nanomaterials (CNMs). Under the assumption that the CNM exposure related molecular alterations are mixtures of signals related to their intrinsic properties, we inferred networks of responding genes, whose expression levels are coordinately altered in response to specific CNM intrinsic properties. We observed only a minute overlap between the sets of intrinsic property-correlated genes at different exposure scenarios, suggesting specific transcriptional programs working in different exposure scenarios. However, when the effects of the CNM were investigated at the level of significantly altered molecular functions, a broader picture of substantial commonality emerged. Our results imply that in vitro exposures can efficiently recapitulate the complex molecular functions altered in vivo. In this study, altered molecular pathways in response to specific CNM intrinsic properties have been systematically characterized from transcriptomic data generated from multiple exposure setups. Our computational approach to the analysis of network response modules further revealed similarities between in vitro and in vivo exposures that could not be detected by traditional analysis of transcriptomics data. Our analytical strategy also opens a possibility to look for pathways of toxicity and understanding the molecular and cellular responses identified across predefined biological themes.
Collapse
Affiliation(s)
| | | | | | | | - Henrik Wolff
- Finnish Institute of Occupational Health , Helsinki, Finland 00251
| | | | | | - Kai Savolainen
- Finnish Institute of Occupational Health , Helsinki, Finland 00251
| | - Harri Alenius
- Institute of Environmental Medicine (IMM), Karolinska Institutet , 171 77 Stockholm, Sweden
| | | |
Collapse
|
11
|
Fu X, Mervin LH, Li X, Yu H, Li J, Mohamad Zobir SZ, Zoufir A, Zhou Y, Song Y, Wang Z, Bender A. Toward Understanding the Cold, Hot, and Neutral Nature of Chinese Medicines Using in Silico Mode-of-Action Analysis. J Chem Inf Model 2017; 57:468-483. [DOI: 10.1021/acs.jcim.6b00725] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Xianjun Fu
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
- Centre
for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Lewis H. Mervin
- Centre
for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Xuebo Li
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Huayun Yu
- College
of TCM, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Jiaoyang Li
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Siti Zuraidah Mohamad Zobir
- Centre
for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Azedine Zoufir
- Centre
for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| | - Yang Zhou
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Yongmei Song
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Zhenguo Wang
- School
of Information Management, Shandong University of Traditional Chinese Medicine, 250355 Jinan, China
| | - Andreas Bender
- Centre
for Molecular Science Informatics, Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, United Kingdom
| |
Collapse
|
12
|
Mervin LH, Cao Q, Barrett IP, Firth MA, Murray D, McWilliams L, Haddrick M, Wigglesworth M, Engkvist O, Bender A. Understanding Cytotoxicity and Cytostaticity in a High-Throughput Screening Collection. ACS Chem Biol 2016; 11:3007-3023. [PMID: 27571164 DOI: 10.1021/acschembio.6b00538] [Citation(s) in RCA: 29] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
While mechanisms of cytotoxicity and cytostaticity have been studied extensively from the biological side, relatively little is currently understood regarding areas of chemical space leading to cytotoxicity and cytostasis in large compound collections. Predicting and rationalizing potential adverse mechanism-of-actions (MoAs) of small molecules is however crucial for screening library design, given the link of even low level cytotoxicity and adverse events observed in man. In this study, we analyzed results from a cell-based cytotoxicity screening cascade, comprising 296 970 nontoxic, 5784 cytotoxic and cytostatic, and 2327 cytostatic-only compounds evaluated on the THP-1 cell-line. We employed an in silico MoA analysis protocol, utilizing 9.5 million active and 602 million inactive bioactivity points to generate target predictions, annotate predicted targets with pathways, and calculate enrichment metrics to highlight targets and pathways. Predictions identify known mechanisms for the top ranking targets and pathways for both phenotypes after review and indicate that while processes involved in cytotoxicity versus cytostaticity seem to overlap, differences between both phenotypes seem to exist to some extent. Cytotoxic predictions highlight many kinases, including the potentially novel cytotoxicity-related target STK32C, while cytostatic predictions outline targets linked with response to DNA damage, metabolism, and cytoskeletal machinery. Fragment analysis was also employed to generate a library of toxicophores to improve general understanding of the chemical features driving toxicity. We highlight substructures with potential kinase-dependent and kinase-independent mechanisms of toxicity. We also trained a cytotoxic classification model on proprietary and public compound readouts, and prospectively validated these on 988 novel compounds comprising difficult and trivial testing instances, to establish the applicability domain of models. The proprietary model performed with precision and recall scores of 77.9% and 83.8%, respectively. The MoA results and top ranking substructures with accompanying MoA predictions are available as a platform to assess screening collections.
Collapse
Affiliation(s)
- Lewis H. Mervin
- Centre
for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| | - Qing Cao
- Discovery Sciences, AstraZeneca R&D, Waltham, United States
| | - Ian P. Barrett
- Discovery Sciences, AstraZeneca R&D, Cambridge Science Park, Cambridge, United Kingdom
| | - Mike A. Firth
- Discovery Sciences, AstraZeneca R&D, Cambridge Science Park, Cambridge, United Kingdom
| | - David Murray
- Discovery Sciences, AstraZeneca R&D, Alderley Park, Macclesfield, United Kingdom
| | - Lisa McWilliams
- Discovery Sciences, AstraZeneca R&D, Alderley Park, Macclesfield, United Kingdom
| | - Malcolm Haddrick
- Discovery Sciences, AstraZeneca R&D, Alderley Park, Macclesfield, United Kingdom
| | - Mark Wigglesworth
- Discovery Sciences, AstraZeneca R&D, Alderley Park, Macclesfield, United Kingdom
| | - Ola Engkvist
- Discovery Sciences, AstraZeneca R&D, Mölndal, Sweden
| | - Andreas Bender
- Centre
for Molecular Informatics, Department of Chemistry, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
13
|
Liu X, Baarsma H, Thiam C, Montrone C, Brauner B, Fobo G, Heier JS, Duscha S, Königshoff M, Angeli V, Ruepp A, Campillos M. Systematic Identification of Pharmacological Targets from Small-Molecule Phenotypic Screens. Cell Chem Biol 2016; 23:1302-1313. [DOI: 10.1016/j.chembiol.2016.08.011] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2016] [Revised: 06/10/2016] [Accepted: 08/05/2016] [Indexed: 01/29/2023]
|
14
|
Global Mapping of Traditional Chinese Medicine into Bioactivity Space and Pathways Annotation Improves Mechanistic Understanding and Discovers Relationships between Therapeutic Action (Sub)classes. EVIDENCE-BASED COMPLEMENTARY AND ALTERNATIVE MEDICINE 2016; 2016:2106465. [PMID: 26989424 PMCID: PMC4775820 DOI: 10.1155/2016/2106465] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/14/2015] [Accepted: 12/03/2015] [Indexed: 02/08/2023]
Abstract
Traditional Chinese medicine (TCM) still needs more scientific rationale to be proven for it to be accepted further in the West. We are now in the position to propose computational hypotheses for the mode-of-actions (MOAs) of 45 TCM therapeutic action (sub)classes from in silico target prediction algorithms, whose target was later annotated with Kyoto Encyclopedia of Genes and Genomes pathway, and to discover the relationship between them by generating a hierarchical clustering. The results of 10,749 TCM compounds showed 183 enriched targets and 99 enriched pathways from Estimation Score ≤ 0 and ≥ 5% of compounds/targets in a (sub)class. The MOA of a (sub)class was established from supporting literature. Overall, the most frequent top three enriched targets/pathways were immune-related targets such as tyrosine-protein phosphatase nonreceptor type 2 (PTPN2) and digestive system such as mineral absorption. We found two major protein families, G-protein coupled receptor (GPCR), and protein kinase family contributed to the diversity of the bioactivity space, while digestive system was consistently annotated pathway motif, which agreed with the important treatment principle of TCM, “the foundation of acquired constitution” that includes spleen and stomach. In short, the TCM (sub)classes, in many cases share similar targets/pathways despite having different indications.
Collapse
|
15
|
Mervin LH, Afzal AM, Drakakis G, Lewis R, Engkvist O, Bender A. Target prediction utilising negative bioactivity data covering large chemical space. J Cheminform 2015; 7:51. [PMID: 26500705 PMCID: PMC4619454 DOI: 10.1186/s13321-015-0098-y] [Citation(s) in RCA: 85] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2015] [Accepted: 09/29/2015] [Indexed: 02/25/2023] Open
Abstract
BACKGROUND In silico analyses are increasingly being used to support mode-of-action investigations; however many such approaches do not utilise the large amounts of inactive data held in chemogenomic repositories. The objective of this work is concerned with the integration of such bioactivity data in the target prediction of orphan compounds to produce the probability of activity and inactivity for a range of targets. To this end, a novel human bioactivity data set was constructed through the assimilation of over 195 million bioactivity data points deposited in the ChEMBL and PubChem repositories, and the subsequent application of a sphere-exclusion selection algorithm to oversample presumed inactive compounds. RESULTS A Bernoulli Naïve Bayes algorithm was trained using the data and evaluated using fivefold cross-validation, achieving a mean recall and precision of 67.7 and 63.8 % for active compounds and 99.6 and 99.7 % for inactive compounds, respectively. We show the performances of the models are considerably influenced by the underlying intraclass training similarity, the size of a given class of compounds, and the degree of additional oversampling. The method was also validated using compounds extracted from WOMBAT producing average precision-recall AUC and BEDROC scores of 0.56 and 0.85, respectively. Inactive data points used for this test are based on presumed inactivity, producing an approximated indication of the true extrapolative ability of the models. A distance-based applicability domain analysis was also conducted; indicating an average Tanimoto Coefficient distance of 0.3 or greater between a test and training set can be used to give a global measure of confidence in model predictions. A final comparison to a method trained solely on active data from ChEMBL performed with precision-recall AUC and BEDROC scores of 0.45 and 0.76. CONCLUSIONS The inclusion of inactive data for model training produces models with superior AUC and improved early recognition capabilities, although the results from internal and external validation of the models show differing performance between the breadth of models. The realised target prediction protocol is available at https://github.com/lhm30/PIDGIN.Graphical abstractThe inclusion of large scale negative training data for in silico target prediction improves the precision and recall AUC and BEDROC scores for target models.
Collapse
Affiliation(s)
- Lewis H. Mervin
- />Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW UK
| | - Avid M. Afzal
- />Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW UK
| | - Georgios Drakakis
- />Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW UK
| | - Richard Lewis
- />Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW UK
| | - Ola Engkvist
- />Discovery Sciences, Chemistry Innovation Centre, AstraZeneca R&D, 43183 Mölndal, Sweden
| | - Andreas Bender
- />Department of Chemistry, Centre for Molecular Informatics, University of Cambridge, Lensfield Road, Cambridge, CB2 1EW UK
| |
Collapse
|