1
|
Alagawani S, Vasilyev V, Clayton AHA, Wang F. Insights into Halogen-Induced Changes in 4-Anilinoquinazoline EGFR Inhibitors: A Computational Spectroscopic Study. Molecules 2024; 29:2800. [PMID: 38930865 PMCID: PMC11206398 DOI: 10.3390/molecules29122800] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2024] [Revised: 06/03/2024] [Accepted: 06/08/2024] [Indexed: 06/28/2024] Open
Abstract
The epidermal growth factor receptor (EGFR) is a pivotal target in cancer therapy due to its significance within the tyrosine kinase family. EGFR inhibitors like AG-1478 and PD153035, featuring a 4-anilinoquinazoline moiety, have garnered global attention for their potent therapeutic activities. While pre-clinical studies have highlighted the significant impact of halogen substitution at the C3'-anilino position on drug potency, the underlying mechanism remains unclear. This study investigates the influence of halogen substitution (X = H, F, Cl, Br, I) on the structure, properties, and spectroscopy of halogen-substituted 4-anilinoquinazoline tyrosine kinase inhibitors (TKIs) using time-dependent density functional methods (TD-DFT) with the B3LYP functional. Our calculations revealed that halogen substitution did not induce significant changes in the three-dimensional conformation of the TKIs but led to noticeable alterations in electronic properties, such as dipole moment and spatial extent, impacting interactions at the EGFR binding site. The UV-visible spectra show that more potent TKI-X compounds typically have shorter wavelengths, with bromine's peak wavelength at 326.71 nm and hydrogen, with the lowest IC50 nM, shifting its lambda max to 333.17 nm, indicating a correlation between potency and spectral characteristics. Further analysis of the four lowest-lying conformers of each TKI-X, along with their crystal structures from the EGFR database, confirms that the most potent conformer is often not the global minimum structure but one of the low-lying conformers. The more potent TKI-Cl and TKI-Br exhibit larger deviations (RMSD > 0.65 Å) from their global minimum structures compared to other TKI-X (RMSD < 0.15 Å), indicating that potency is associated with greater flexibility. Dipole moments of TKI-X correlate with drug potency (ln(IC50 nM)), with TKI-Cl and TKI-Br showing significantly higher dipole moments (>8.0 Debye) in both their global minimum and crystal structures. Additionally, optical spectral shifts correlate with potency, as TKI-Cl and TKI-Br exhibit blue shifts from their global minimum structures, in contrast to other TKI-X. This suggests that optical reporting can effectively probe drug potency and conformation changes.
Collapse
Affiliation(s)
- Sallam Alagawani
- Department of Chemistry and Biotechnology, School of Science, Computing and Engineering Technologies, Swinburne University of Technology, Melbourne, VIC 3122, Australia;
| | - Vladislav Vasilyev
- National Computational Infrastructure, Australian National University, Canberra, ACT 0200, Australia;
| | - Andrew H. A. Clayton
- Optical Sciences Centre, Department of Physics and Astronomy, School of Science, Computing and Engineering Technologies, Swinburne University of Technology, Melbourne, VIC 3122, Australia
| | - Feng Wang
- Department of Chemistry and Biotechnology, School of Science, Computing and Engineering Technologies, Swinburne University of Technology, Melbourne, VIC 3122, Australia;
| |
Collapse
|
2
|
Das AP, Nandekar P, Mathur P, Agarwal SM. A systematic pipeline of protein structure selection for computer-aided drug discovery: A case study on T790M/L858R mutant EGFR structures. Protein Sci 2023; 32:e4740. [PMID: 37515373 PMCID: PMC10443354 DOI: 10.1002/pro.4740] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2023] [Revised: 07/24/2023] [Accepted: 07/26/2023] [Indexed: 07/30/2023]
Abstract
Virtual screening (VS) is a routine method to evaluate chemical libraries for lead identification. Therefore, the selection of appropriate protein structures for VS is an essential prerequisite to identify true actives during docking. But the presence of several crystal structures of the same protein makes it difficult to select one or few structures rationally for screening. Therefore, a computational prioritization protocol has been developed for shortlisting crystal structures that identify true active molecules with better efficiency. As identification of small-molecule inhibitors is an important clinical requirement for the T790M/L858R (TMLR) EGFR mutant, it has been selected as a case study. The approach involves cross-docking of 21 co-crystal ligands with all the structures of the same protein to select structures that dock non-native ligands with lower RMSD. The cross docking performance was then correlated with ligand similarity and binding-site conformational similarity. Eventually, structures were shortlisted by integrating cross-docking performance, and ligand and binding-site similarity. Thereafter, binding pose metadynamics was employed to identify structures having stable co-crystal ligands in their respective binding pockets. Finally, different enrichment metrics like BEDROC, RIE, AUAC, and EF1% were evaluated leading to the identification of five TMLR structures (5HCX, 5CAN, 5CAP, 5CAS, and 5CAO). These structures docked a number of non-native ligands with low RMSD, contain structurally dissimilar ligands, have conformationally dissimilar binding sites, harbor stable co-crystal ligands, and also identify true actives early. The present approach can be implemented for shortlisting protein targets of any other important therapeutic kinases.
Collapse
Affiliation(s)
- Agneesh Pratim Das
- Bioinformatics Division, ICMR—National Institute of Cancer Prevention and ResearchNoidaUttar PradeshIndia
- Amity Institute of BiotechnologyAmity University Uttar PradeshNoidaUttar PradeshIndia
| | | | - Puniti Mathur
- Amity Institute of BiotechnologyAmity University Uttar PradeshNoidaUttar PradeshIndia
| | - Subhash M. Agarwal
- Bioinformatics Division, ICMR—National Institute of Cancer Prevention and ResearchNoidaUttar PradeshIndia
| |
Collapse
|
3
|
Saini R, Kumari S, Bhatnagar A, Singh A, Mishra A. Discovery of the allosteric inhibitor from actinomyces metabolites to target EGFR CSTMLR mutant protein: molecular modeling and free energy approach. Sci Rep 2023; 13:8885. [PMID: 37264083 DOI: 10.1038/s41598-023-33065-7] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 04/06/2023] [Indexed: 06/03/2023] Open
Abstract
EGFR (epidermal growth factor receptor), a surface protein on the cell, belongs to the tyrosine kinase family, responsible for cell growth and proliferation. Overexpression or mutation in the EGFR gene leads to various types of cancer, i.e., non-small cell lung cancer, breast, and pancreatic cancer. Bioactive molecules identified in this genre were also an essential source of encouragement for researchers who accomplished the design and synthesis of novel compounds with anticancer properties. World Health Organization (WHO) report states that antibiotic resistance is one of the most severe risks to global well-being, food safety, and development. The world needs to take steps to lessen this danger, such as developing new antibiotics and regulating their use. In this study, 6524 compounds derived from Streptomyces sp. were subjected to drug-likeness filters, molecular docking, and molecular dynamic simulation for 1000 ns to find new triple mutant EGFRCSTMLR (EGFR-L858R/T790M/C797S) inhibitors. Docking outcomes revealed that five compounds showed better binding affinity (- 9.074 to - 9.3 kcal/mol) than both reference drug CH7233163 (- 6.11 kcal/mol) and co-crystallized ligand Osimertinib (- 8.07 kcal/mol). Further, molecular dynamic simulation confirmed that ligand C_42 exhibited the best interaction at the active site of EGFR protein and comprised a better average radius of gyration (3.87 Å) and average SASA (Solvent Accessible Surface Area) (82.91 Å2) value than co-crystallized ligand (4.49 Å, 222.38 Å2). Additionally, its average RMSD (Root Mean Square Deviation) (3.25 Å) and RMSF (Root Mean Square Fluctuation) (1.54 Å) values were highly similar to co-crystallized ligand (3.07 Å, 1.54 Å). Compared to the reference ligand, it also demonstrated conserved H-bond interactions with the residues MET_793 and GLN_791 with strong interaction probability. In conclusion, we have found a potential drug with no violation of the rule of three, Lipinski's rule of five, and 26 other vital parameters having great potential in medicinal and pharmaceutical industries applications and can overcome synthetic drug issues.
Collapse
Affiliation(s)
- Ravi Saini
- School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| | - Sonali Kumari
- School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| | - Aditi Bhatnagar
- School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India
| | - Amit Singh
- Department of Pharmacology, Institute of Medical Sciences, Banaras Hindu University, Varanasi, 221005, India.
| | - Abha Mishra
- School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi, 221005, India.
| |
Collapse
|
4
|
Agarwal SM, Nandekar P, Saini R. Computational identification of natural product inhibitors against EGFR double mutant (T790M/L858R) by integrating ADMET, machine learning, molecular docking and a dynamics approach. RSC Adv 2022; 12:16779-16789. [PMID: 35754875 PMCID: PMC9170516 DOI: 10.1039/d2ra00373b] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2022] [Accepted: 05/13/2022] [Indexed: 11/21/2022] Open
Abstract
Double mutated epidermal growth factor receptor is a clinically important target for addressing drug resistance in lung cancer treatment. Therefore, discovering new inhibitors against the T790M/L858R (TMLR) resistant mutation is ongoing globally. In the present study, nearly 150 000 molecules from various natural product libraries were screened by employing different ligand and structure-based techniques. Initially, the library was filtered to identify drug-like molecules, which were subjected to a machine learning based classification model to identify molecules with a higher probability of having anti-cancer activity. Simultaneously, rules for constrained docking were derived from three-dimensional protein-ligand complexes and thereafter, constrained docking was undertaken, followed by HYDE binding affinity assessment. As a result, three molecules that resemble interactions similar to the co-crystallized complex were selected and subjected to 100 ns molecular dynamics simulation for stability analysis. The interaction analysis for the 100 ns simulation period showed that the leads exhibit the conserved hydrogen bond interaction with Gln791 and Met793 as in the co-crystal ligand. Also, the study indicated that Y-shaped molecules are preferred in the binding pocket as it enables them to occupy both pockets. The MMGBSA binding energy calculations revealed that the molecules have comparable binding energy to the native ligand. The present study has enabled the identification of a few ADMET adherent leads from natural products that exhibit the potential to inhibit the double mutated drug-resistant EGFR.
Collapse
Affiliation(s)
- Subhash M Agarwal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research I-7, Sector-39 Noida-201301 India
| | - Prajwal Nandekar
- Molecular and Cellular Modeling Group, Heidelberg Institute for Theoretical Studies (HITS) Schloss-Wolfsbrunnenweg 35 69118 Heidelberg Germany
| | - Ravi Saini
- School of Biochemical Engineering, Indian Institute of Technology (BHU) Uttar Pradesh Varanasi 221 005 India
| |
Collapse
|
5
|
Nguyen L, Nguyen Vo TH, Trinh QH, Nguyen BH, Nguyen-Hoang PU, Le L, Nguyen BP. iANP-EC: Identifying Anticancer Natural Products Using Ensemble Learning Incorporated with Evolutionary Computation. J Chem Inf Model 2022; 62:5080-5089. [PMID: 35157472 DOI: 10.1021/acs.jcim.1c00920] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Cancer is one of the most deadly diseases that annually kills millions of people worldwide. The investigation on anticancer medicines has never ceased to seek better and more adaptive agents with fewer side effects. Besides chemically synthetic anticancer compounds, natural products are scientifically proved as a highly potential alternative source for anticancer drug discovery. Along with experimental approaches being used to find anticancer drug candidates, computational approaches have been developed to virtually screen for potential anticancer compounds. In this study, we construct an ensemble computational framework, called iANP-EC, using machine learning approaches incorporated with evolutionary computation. Four learning algorithms (k-NN, SVM, RF, and XGB) and four molecular representation schemes are used to build a set of classifiers, among which the top-four best-performing classifiers are selected to form an ensemble classifier. Particle swarm optimization (PSO) is used to optimise the weights used to combined the four top classifiers. The models are developed by a set of curated 997 compounds which are collected from the NPACT and CancerHSP databases. The results show that iANP-EC is a stable, robust, and effective framework that achieves an AUC-ROC value of 0.9193 and an AUC-PR value of 0.8366. The comparative analysis of molecular substructures between natural anticarcinogens and nonanticarcinogens partially unveils several key substructures that drive anticancerous activities. We also deploy the proposed ensemble model as an online web server with a user-friendly interface to support the research community in identifying natural products with anticancer activities.
Collapse
Affiliation(s)
- Loc Nguyen
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Thanh-Hoang Nguyen Vo
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Quang H Trinh
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam.,School of Information and Communication Technology, Hanoi University of Science and Technology, Hanoi 100000, Vietnam
| | - Bach Hoai Nguyen
- School of Engineering and Computer Science, Victoria University of Wellington, Wellington 6140, New Zealand
| | - Phuong-Uyen Nguyen-Hoang
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam
| | - Ly Le
- Computational Biology Center, International University - VNU HCMC, Ho Chi Minh City 700000, Vietnam.,Vingroup Big Data Institute, Ha Noi 100000, Vietnam
| | - Binh P Nguyen
- School of Mathematics and Statistics, Victoria University of Wellington, Wellington 6140, New Zealand
| |
Collapse
|
6
|
EGFRisopred: a machine learning-based classification model for identifying isoform-specific inhibitors against EGFR and HER2. Mol Divers 2021; 26:1531-1543. [PMID: 34345964 DOI: 10.1007/s11030-021-10284-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2021] [Accepted: 07/21/2021] [Indexed: 10/20/2022]
Abstract
The EGFR kinase pathway is one of the most frequently activated signaling pathways in human cancers. EGFR and HER2 are the two significant members of this pathway, which are attractive drug targets of clinical relevance in lung and breast cancer. Therefore, identifying EGFR- and HER2-specific inhibitors is one of the important challenges in cancer drug discovery. To address this issue, a dataset of 519 compounds having inhibitory activity against both the isoforms, i.e., EGFR and HER2, was collected from the literature and developed a knowledge-based computational classification model for predicting the specificity of a molecule for an isoform (EGFR/HER2) with precision. A total of seventy-two classification models using nine fingerprint types, four classifiers (IBK, NB, SMO and RF) and two different datasets (EGFR and HER2 isoform specific) were developed. It was observed that the models developed using random forest and IBK performed better for EGFR- and HER2-specific datasets, respectively. Scaffold and functional group analysis led to the identification of prevalent core and fragments in each of the datasets. The accuracy of the selected best performing models was also evaluated using the decoy dataset. We have also developed an application EGFRisopred, which integrates the best performing models and permits the user to predict the specificity of a compound as an EGFR-/HER2-specific anticancer agent. It is expected that the tool's availability as a free utility will allow researchers to identify new inhibitors against these targets important in cancer.
Collapse
|
7
|
Saini R, Fatima S, Agarwal SM. TMLRpred: A machine learning classification model to distinguish reversible EGFR double mutant inhibitors. Chem Biol Drug Des 2021; 96:921-930. [PMID: 33058464 DOI: 10.1111/cbdd.13697] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2019] [Revised: 03/26/2020] [Accepted: 04/03/2020] [Indexed: 12/26/2022]
Abstract
The EGFR is a clinically important therapeutic drug target in lung cancer. The first-generation tyrosine kinase inhibitors used in clinics are effective against L858R-mutated EGFR. However, relapse of the disease due to the presence of resistant mutation (T790M) makes these inhibitors ineffective. This has necessitated the need to identify new potent EGFR inhibitors against the resistant double mutants. Therefore, various machine learning techniques ((instance-based learner (IBK), naïve Bayesian (NB), sequential minimal optimization (SMO), and random forest (RF)) were employed to develop twelve classification models on three different datasets (high, moderate, and weakly active inhibitors). The models were validated using fivefold cross-validation and independent validation datasets. It was observed that the random forest-based models showed best performance. Also, functional groups, PubChem fingerprints, and substructure of highly active inhibitors were compared to inactive to identify structural features which are important for activity. To promote open-source drug discovery, a tool has been developed, which incorporates the best performing models and allows users to predict the potential of chemical molecules as anti-TMLR inhibitor. It is expected that the machine learning classification models developed in this study will pave way for identifying novel inhibitors against the resistant EGFR double mutants.
Collapse
Affiliation(s)
- Ravi Saini
- School of Biochemical Engineering, Indian Institute of Technology (BHU), Varanasi, India
| | - Shehnaz Fatima
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| | - Subhash Mohan Agarwal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| |
Collapse
|
8
|
Fatima S, Agarwal SM. Exploring structural features of EGFR-HER2 dual inhibitors as anti-cancer agents using G-QSAR approach. J Recept Signal Transduct Res 2020; 39:243-252. [PMID: 31538848 DOI: 10.1080/10799893.2019.1660896] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/26/2022]
Abstract
Simultaneous inhibition of EGFR and HER2 by dual-targeting inhibitors is an established anti-cancer strategy. Therefore, a recent trend in drug discovery involves understanding the features of such dual inhibitors. In this study, three different G-QSAR models were developed corresponding to individual EGFR, HER2 and the dual-model for both receptors. The dual-model provided site-specific information wherein (i) increasing electronegative character and higher index of saturated carbon at R4 position; (ii) presence of chlorine atom at R2 position; (iii) decreasing alpha modified shape index at R1 and R3 positions; and (iv) less electronegativity at R2 position; were found important for enhancing the dual activity. Also, comparison of dual-model with the EGFR/HER2 individual models revealed that it incorporates the properties of both models and, thus, represents a combination of EGFR/HER2. Further, fragment analysis revealed that R2 and R4 are important for imparting high potency while specificity is decided by R1/R3 fragment. We also checked the predictive ability of the dual-model by determining applicability domain using William's plot. Also, analysis of active molecules showed they show favorable substitutions that agree with the constructed dual-model. Thus, we have been successful in developing a single dual-response QSAR model to get an insight into various structural features influencing EGFR/HER2 activity.
Collapse
Affiliation(s)
- Shehnaz Fatima
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research , Noida , India
| | - Subhash Mohan Agarwal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research , Noida , India
| |
Collapse
|
9
|
ADMET profiling of geographically diverse phytochemical using chemoinformatic tools. Future Med Chem 2019; 12:69-87. [PMID: 31793338 DOI: 10.4155/fmc-2019-0206] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022] Open
Abstract
Aim: Phytocompounds are important due to their uniqueness, however, only few reach the development phase due to their poor pharmacokinetics. Therefore, preassessing the absorption, distribution, metabolism, excretion and toxicity (ADMET) properties is essential in drug discovery. Methodology: Biologically diverse databases (Phytochemica, SerpentinaDB, SANCDB and NuBBEDB) covering the region of India, Brazil and South Africa were considered to predict the ADMET using chemoinformatic tools (Qikprop, pkCSM and DataWarrior). Results: Screening through each of pharmacokinetic criteria resulted in identification of 24 compounds that adhere to all the ADMET properties. Furthermore, assessment revealed that five have potent anticancer biological activity against cancer cell lines. Conclusion: We have established an open-access database (ADMET-BIS) to enable identification of promising molecules that follow ADMET properties and can be considered for drug development.
Collapse
|
10
|
Fatima S, Pal D, Agarwal SM. QSAR of clinically important EGFR mutant L858R/T790M pyridinylimidazole inhibitors. Chem Biol Drug Des 2019; 94:1306-1315. [PMID: 30811850 DOI: 10.1111/cbdd.13505] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/04/2018] [Revised: 01/05/2019] [Accepted: 01/31/2019] [Indexed: 11/29/2022]
Abstract
EGFR is a well-established therapeutic target of clinical relevance in cancer. However, acquisition of secondary mutation (T790M) makes first-generation inhibitors ineffective. Therefore, to circumvent the problem of resistance, new T790M/L858R (TMLR) double mutant inhibitors are required. In this study, fragment-based QSAR models (GQSAR) were generated for pyridinylimidazole derivatives having biological activity against TMLR mutants. The GQSAR model developed using partial least squares regression via stepwise forward-backward variable selection technique showed best results as judged using statistical parameters (r2 , q2 , and pred_r2 ). Additionally, applicability domain of the model was verified using Williams plot, which indicated that the predicted data are reliable. The GQSAR provided site-specific clues wherein modifications related to decreasing lipophilic character and rotatable bonds and increasing SaaCHE-index are required for improving inhibitory activity. Overall, the study indicated that the presence of acrylamide at R5 is essential for covalent bond formation with Cys797 and occurrence of aromatic residue at R2 is required for occupying hydrophobic region next to Met790 gatekeeper residue. Based on this information, new derivatives were designed that show better inhibitory activity than the experimentally reported most active molecules. Thus, the model developed can be used to design new pyridinylimidazole derivatives with improved TMLR bioactivity.
Collapse
Affiliation(s)
- Shehnaz Fatima
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| | - Divyani Pal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| | - Subhash Mohan Agarwal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| |
Collapse
|
11
|
Fatima S, Agarwal SM. Unraveling structural requirements of amino-pyrimidine T790M/L858R double mutant EGFR inhibitors: 2D and 3D QSAR study. J Recept Signal Transduct Res 2018; 38:299-306. [DOI: 10.1080/10799893.2018.1494740] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023]
Affiliation(s)
- Shehnaz Fatima
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| | - Subhash Mohan Agarwal
- Bioinformatics Division, ICMR-National Institute of Cancer Prevention and Research, Noida, India
| |
Collapse
|
12
|
Singh H, Srivastava HK, Raghava GPS. A web server for analysis, comparison and prediction of protein ligand binding sites. Biol Direct 2016; 11:14. [PMID: 27016210 PMCID: PMC4807588 DOI: 10.1186/s13062-016-0118-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2015] [Accepted: 03/22/2016] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND One of the major challenges in the field of system biology is to understand the interaction between a wide range of proteins and ligands. In the past, methods have been developed for predicting binding sites in a protein for a limited number of ligands. RESULTS In order to address this problem, we developed a web server named 'LPIcom' to facilitate users in understanding protein-ligand interaction. Analysis, comparison and prediction modules are available in the "LPIcom' server to predict protein-ligand interacting residues for 824 ligands. Each ligand must have at least 30 protein binding sites in PDB. Analysis module of the server can identify residues preferred in interaction and binding motif for a given ligand; for example residues glycine, lysine and arginine are preferred in ATP binding sites. Comparison module of the server allows comparing protein-binding sites of multiple ligands to understand the similarity between ligands based on their binding site. This module indicates that ATP, ADP and GTP ligands are in the same cluster and thus their binding sites or interacting residues exhibit a high level of similarity. Propensity-based prediction module has been developed for predicting ligand-interacting residues in a protein for more than 800 ligands. In addition, a number of web-based tools have been integrated to facilitate users in creating web logo and two-sample between ligand interacting and non-interacting residues. CONCLUSIONS In summary, this manuscript presents a web-server for analysis of ligand interacting residue. This server is available for public use from URL http://crdd.osdd.net/raghava/lpicom .
Collapse
Affiliation(s)
- Harinder Singh
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, 160036, India
| | | | - Gajendra P S Raghava
- Bioinformatics Centre, CSIR-Institute of Microbial Technology, Chandigarh, 160036, India. .,, .
| |
Collapse
|
13
|
Singh H, Kumar R, Singh S, Chaudhary K, Gautam A, Raghava GPS. Prediction of anticancer molecules using hybrid model developed on molecules screened against NCI-60 cancer cell lines. BMC Cancer 2016; 16:77. [PMID: 26860193 PMCID: PMC4748564 DOI: 10.1186/s12885-016-2082-y] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2015] [Accepted: 01/21/2016] [Indexed: 11/16/2022] Open
Abstract
Background In past, numerous quantitative structure-activity relationship (QSAR) based models have been developed for predicting anticancer activity for a specific class of molecules against different cancer drug targets. In contrast, limited attempt have been made to predict the anticancer activity of a diverse class of chemicals against a wide variety of cancer cell lines. In this study, we described a hybrid method developed on thousands of anticancer and non-anticancer molecules tested against National Cancer Institute (NCI) 60 cancer cell lines. Results Our analysis of anticancer molecules revealed that majority of anticancer molecules contains 18–24 carbon atoms and are dominated by functional groups like R2NH, R3N, ROH, RCOR, and ROR. It was also observed that certain substructures (e.g., 1-methoxy-4-methylbenzene, 1-methoxy benzene, Nitrobenzene, Indole, Propenyl benzene) are more abundant in anticancer molecules. Next, we developed anticancer molecule prediction models using various machine-learning techniques and achieved maximum matthews correlation coefficient (MCC) of 0.81 with 90.40 % accuracy using support vector machine (SVM) based models. In another approach, a novel similarity or potency score based method has been developed using selected fragments/fingerprints and achieved maximum MCC of 0.82 with 90.65 % accuracy. Finally, we combined the strength of above methods and developed a hybrid method with maximum MCC of 0.85 with 92.47 % accuracy. Conclusions We developed a hybrid method utilizing the best of machine learning and potency score based method. The highly accurate hybrid method can be used for classification of anticancer and non-anticancer molecules. In order to facilitate scientific community working in the field of anticancer drug discovery, we integrate hybrid and potency method in a web server CancerIN. This server provides various facilities that includes; virtual screening of anticancer molecules, analog based drug design, and similarity with known anticancer molecules (http://crdd.osdd.net/oscadd/cancerin). Electronic supplementary material The online version of this article (doi:10.1186/s12885-016-2082-y) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Harinder Singh
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Rahul Kumar
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Sandeep Singh
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Kumardeep Chaudhary
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Ankur Gautam
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Gajendra P S Raghava
- Bioinformatics Centre, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| |
Collapse
|
14
|
Abstract
Volatile organic compounds in cancer database (VOCC) has been developed, which provides comprehensive information of VOCs distinctly observed in cancer vs. normal from various malignancies and different sources.
Collapse
Affiliation(s)
- Subhash Mohan Agarwal
- Bioinformatics Division
- National Institute of Cancer Prevention and Research (NICPR-ICMR)
- Noida – 201301
- India
| | - Mansi Sharma
- Bioinformatics Division
- National Institute of Cancer Prevention and Research (NICPR-ICMR)
- Noida – 201301
- India
| | - Shehnaz Fatima
- Bioinformatics Division
- National Institute of Cancer Prevention and Research (NICPR-ICMR)
- Noida – 201301
- India
| |
Collapse
|
15
|
Sharma VK, Nandekar PP, Sangamwar A, Pérez-Sánchez H, Agarwal SM. Structure guided design and binding analysis of EGFR inhibiting analogues of erlotinib and AEE788 using ensemble docking, molecular dynamics and MM-GBSA. RSC Adv 2016. [DOI: 10.1039/c6ra08517b] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
The study uncovers an essential pharmacophoric requirement for design of new EGFR inhibitors. Docking and MD simulation confirmed that the occupancy of an additional sub-pocket in the EGFR active site is important for tight EGFR-inhibitor binding.
Collapse
Affiliation(s)
- Vishnu K. Sharma
- Department of Pharmacoinformatics
- National Institute of Pharmaceutical and Education Research (NIPER)
- India
| | - Prajwal P. Nandekar
- Department of Pharmacoinformatics
- National Institute of Pharmaceutical and Education Research (NIPER)
- India
| | - Abhay Sangamwar
- Department of Pharmacoinformatics
- National Institute of Pharmaceutical and Education Research (NIPER)
- India
| | - Horacio Pérez-Sánchez
- Bioinformatics and High Performance Computing Research Group (BIO-HPC)
- Universidad Católica San Antonio de Murcia (UCAM)
- Spain
| | - Subhash Mohan Agarwal
- Bioinformatics Division
- Institute of Cytology and Preventive Oncology
- Noida-201301
- India
| |
Collapse
|
16
|
Warr WA. Many InChIs and quite some feat. J Comput Aided Mol Des 2015; 29:681-94. [PMID: 26081259 DOI: 10.1007/s10822-015-9854-3] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2015] [Accepted: 06/10/2015] [Indexed: 12/14/2022]
Affiliation(s)
- Wendy A Warr
- Wendy Warr & Associates, Holmes Chapel, Crewe, Cheshire, CW4 7HZ, UK,
| |
Collapse
|
17
|
Singh H, Singh S, Singla D, Agarwal SM, Raghava GPS. QSAR based model for discriminating EGFR inhibitors and non-inhibitors using Random forest. Biol Direct 2015; 10:10. [PMID: 25880749 PMCID: PMC4372225 DOI: 10.1186/s13062-015-0046-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2014] [Accepted: 03/06/2015] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Epidermal Growth Factor Receptor (EGFR) is a well-characterized cancer drug target. In the past, several QSAR models have been developed for predicting inhibition activity of molecules against EGFR. These models are useful to a limited set of molecules for a particular class like quinazoline-derivatives. In this study, an attempt has been made to develop prediction models on a large set of molecules (~3500 molecules) that include diverse scaffolds like quinazoline, pyrimidine, quinoline and indole. RESULTS We train, test and validate our classification models on a dataset called EGFR10 that contains 508 inhibitors (having inhibition activity IC50 less than 10 nM) and 2997 non-inhibitors. Our Random forest based model achieved maximum MCC 0.49 with accuracy 83.7% on a validation set using 881 PubChem fingerprints. In this study, frequency-based feature selection technique has been used to identify best fingerprints. It was observed that PubChem fingerprints FP380 (C(~O) (~O)), FP579 (O = C-C-C-C), FP388 (C(:C) (:N) (:N)) and FP 816 (ClC1CC(Br)CCC1) are more frequent in the inhibitors in comparison to non-inhibitors. In addition, we created different datasets namely EGFR100 containing inhibitors having IC50 < 100 nM and EGFR1000 containing inhibitors having IC50 < 1000 nM. We trained, test and validate our models on datasets EGFR100 and EGFR1000 datasets and achieved and maximum MCC 0.58 and 0.71 respectively. In addition, models were developed for predicting quinazoline and pyrimidine based EGFR inhibitors. CONCLUSIONS In summary, models have been developed on a large set of molecules of various classes for discriminating EGFR inhibitors and non-inhibitors. These highly accurate prediction models can be used to design and discover novel EGFR inhibitors. In order to provide service to the scientific community, a web server/standalone EGFRpred also has been developed ( http://crdd.osdd.net/oscadd/egfrpred/ ).
Collapse
Affiliation(s)
- Harinder Singh
- Bioinformatics Center, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Sandeep Singh
- Bioinformatics Center, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Deepak Singla
- Bioinformatics Center, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| | - Subhash M Agarwal
- Institute of Cytology and Preventive Oncology, Sector 39, Noida, 201301, Uttar Pradesh, India.
| | - Gajendra P S Raghava
- Bioinformatics Center, Institute of Microbial Technology, Sector 39-A, Chandigarh, India.
| |
Collapse
|
18
|
Ma L, Wang DD, Huang Y, Yan H, Wong MP, Lee VHF. EGFR Mutant Structural Database: computationally predicted 3D structures and the corresponding binding free energies with gefitinib and erlotinib. BMC Bioinformatics 2015; 16:85. [PMID: 25886721 PMCID: PMC4364680 DOI: 10.1186/s12859-015-0522-3] [Citation(s) in RCA: 28] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2014] [Accepted: 02/27/2015] [Indexed: 01/18/2023] Open
Abstract
Background Epidermal growth factor receptor (EGFR) mutation-induced drug resistance has caused great difficulties in the treatment of non-small-cell lung cancer (NSCLC). However, structural information is available for just a few EGFR mutants. In this study, we created an EGFR Mutant Structural Database (freely available at http://bcc.ee.cityu.edu.hk/data/EGFR.html), including the 3D EGFR mutant structures and their corresponding binding free energies with two commonly used inhibitors (gefitinib and erlotinib). Results We collected the information of 942 NSCLC patients belonging to 112 mutation types. These mutation types are divided into five groups (insertion, deletion, duplication, modification and substitution), and substitution accounts for 61.61% of the mutation types and 54.14% of all the patients. Among all the 942 patients, 388 cases experienced a mutation at residue site 858 with leucine replaced by arginine (L858R), making it the most common mutation type. Moreover, 36 (32.14%) mutation types occur at exon 19, and 419 (44.48%) patients carried a mutation at exon 21. In this study, we predicted the EGFR mutant structures using Rosetta with the collected mutation types. In addition, Amber was employed to refine the structures followed by calculating the binding free energies of mutant-drug complexes. Conclusions The EGFR Mutant Structural Database provides resources of 3D structures and the binding affinity with inhibitors, which can be used by other researchers to study NSCLC further and by medical doctors as reference for NSCLC treatment.
Collapse
Affiliation(s)
- Lichun Ma
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong.
| | - Debby D Wang
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong.
| | - Yiqing Huang
- School of Computer Science and Technology, Soochow University, Suzhou, China.
| | - Hong Yan
- Department of Electronic Engineering, City University of Hong Kong, Kowloon, Hong Kong.
| | - Maria P Wong
- Li Ka Sing Faculty of Medicne, University of Hong Kong, Pokfulam, Hong Kong.
| | - Victor H F Lee
- Li Ka Sing Faculty of Medicne, University of Hong Kong, Pokfulam, Hong Kong.
| |
Collapse
|