1
|
Wang X, Wang S, Wang J, Yin S. Reverse Designing the Wavelength-Specific Thermally Activation Delayed Fluorescent Molecules Using a Genetic Algorithm Coupled with Cheap QM Methods. J Phys Chem A 2023. [PMID: 37418642 DOI: 10.1021/acs.jpca.3c01714] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 07/09/2023]
Abstract
Genetic algorithm (GA) optimization coupled with the semiempirical intermediate neglect of differential overlap (INDO)/CIS method is presented to inversely design the red thermally activation delayed fluorescent (TADF) molecules. According to the predefined donor-acceptor (DA) library to build an ADn-type TADF candidate, we utilized the chemical notation language SMILES code to generate a TADF molecule and apply the RDKit program to produce the initial 3D molecular structure. A combined fitness function is proposed to evaluate the performance of the functional-lead TADF molecule. The fitness function includes three key parameters, i.e., the emission wavelength, the energy gap (ΔEST) between the lowest singlet (S1)- and triplet (T1)-excited states, and the oscillator strengths for electron transition from S0 and S1. A cheap QM method, i.e., INDO/CIS, on the basis of an xTB-optimized molecular geometry is applied to quickly calculate the fitness function. Finally, the GA approach is utilized to globally search for the wavelength-specific TADF molecules under our predefined DA library, and the optimum 630 nm red and 660 nm deep red TADF molecules are inversely designed according to the evolution of molecular fitness functions.
Collapse
Affiliation(s)
- Xubin Wang
- School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xian 710119, China
| | - Shiqi Wang
- School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xian 710119, China
| | - Jingwen Wang
- School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xian 710119, China
| | - Shiwei Yin
- School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xian 710119, China
| |
Collapse
|
2
|
Prentis LE, Singleton CD, Bickel JD, Allen WJ, Rizzo RC. A molecular evolution algorithm for ligand design in DOCK. J Comput Chem 2022; 43:1942-1963. [PMID: 36073674 PMCID: PMC9623574 DOI: 10.1002/jcc.26993] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2022] [Revised: 06/13/2022] [Accepted: 08/03/2022] [Indexed: 01/11/2023]
Abstract
As a complement to virtual screening, de novo design of small molecules is an alternative approach for identifying potential drug candidates. Here, we present a new 3D genetic algorithm to evolve molecules through breeding, mutation, fitness pressure, and selection. The method, termed DOCK_GA, builds upon and leverages powerful sampling, scoring, and searching routines previously implemented into DOCK6. Three primary experiments were used during development: Single-molecule evolution evaluated three selection methods (elitism, tournament, and roulette), in four clinically relevant systems, in terms of mutation type and crossover success, chemical properties, ensemble diversity, and fitness convergence, among others. Large scale benchmarking assessed performance across 651 different protein-ligand systems. Ensemble-based evolution demonstrated using multiple inhibitors simultaneously to seed growth in a SARS-CoV-2 target. Key takeaways include: (1) The algorithm is robust as demonstrated by the successful evolution of molecules across a large diverse dataset. (2) Users have flexibility with regards to parent input, selection method, fitness function, and molecular descriptors. (3) The program is straightforward to run and only requires a single executable and input file at run-time. (4) The elitism selection method yields more tightly clustered molecules in terms of 2D/3D similarity, with more favorable fitness, followed by tournament and roulette.
Collapse
Affiliation(s)
- Lauren E. Prentis
- Department of Biochemistry & Cell BiologyStony Brook UniversityStony BrookNew YorkUSA
| | | | - John D. Bickel
- Department of ChemistryStony Brook UniversityStony BrookNew YorkUSA
| | - William J. Allen
- Department of Applied Mathematics & StatisticsStony Brook UniversityStony BrookNew YorkUSA
| | - Robert C. Rizzo
- Department of Applied Mathematics & StatisticsStony Brook UniversityStony BrookNew YorkUSA
- Institute of Chemical Biology & Drug DiscoveryStony Brook UniversityStony BrookNew YorkUSA
- Laufer Center for Physical & Quantitative BiologyStony Brook UniversityStony BrookNew YorkUSA
| |
Collapse
|
3
|
Coley CW, Eyke NS, Jensen KF. Autonomous Discovery in the Chemical Sciences Part I: Progress. Angew Chem Int Ed Engl 2020; 59:22858-22893. [DOI: 10.1002/anie.201909987] [Citation(s) in RCA: 100] [Impact Index Per Article: 20.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2019] [Indexed: 01/05/2023]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
4
|
Coley CW, Eyke NS, Jensen KF. Autonome Entdeckung in den chemischen Wissenschaften, Teil I: Fortschritt. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201909987] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Connor W. Coley
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Natalie S. Eyke
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| | - Klavs F. Jensen
- Department of Chemical Engineering Massachusetts Institute of Technology Cambridge MA 02139 USA
| |
Collapse
|
5
|
Hsu HH, Huang CH, Lin ST. New Data Structure for Computational Molecular Design with Atomic or Fragment Resolution. J Chem Inf Model 2019; 59:3703-3713. [PMID: 31393721 DOI: 10.1021/acs.jcim.9b00478] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
A new molecular data structure and molecular structure operation algorithms are proposed for general purpose molecular design. The data structure allows for a variety of molecular operations for creating new molecules. Two types of molecular operations were developed, unimolecular and bimolecular operations. In unimolecular operations, a child molecule can be created from a parent via addition of a functional group, deletion of a fragment, mutation of an atom, etc. In bimolecular operations, children molecules are generated from two parent molecules through combination or crossover (hybridization). These molecular operations are essential for the creation and modification of molecules for the purpose of molecular design. The data structure is capable of representing linear, branched, multifunctional, and multivalent compounds. Algorithms are developed for deriving the molecular data structure of a molecule from its atomic coordinates and vice versa. We show that this new molecular data structure and the developed algorithms, referred to as Molecular Assembling and Representation Suite, allow one to generate a comprehensive library of new molecules via performing every possible molecular structure modification.
Collapse
Affiliation(s)
- Hsuan-Hao Hsu
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| | - Chen-Hsuan Huang
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| | - Shiang-Tai Lin
- Department of Chemical Engineering , National Taiwan University , Taipei 10617 , Taiwan
| |
Collapse
|
6
|
Hassanzadeh P, Atyabi F, Dinarvand R. Linkers: The key elements for the creation of efficient nanotherapeutics. J Control Release 2018; 270:260-267. [DOI: 10.1016/j.jconrel.2017.12.007] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2017] [Revised: 12/09/2017] [Accepted: 12/11/2017] [Indexed: 01/16/2023]
|
7
|
Recent Advances of Microfluidics Technologies in the Field of Medicinal Chemistry. ANNUAL REPORTS IN MEDICINAL CHEMISTRY 2017. [DOI: 10.1016/bs.armc.2017.09.001] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
|
8
|
Reker D, Schneider G. Active-learning strategies in computer-assisted drug discovery. Drug Discov Today 2015; 20:458-65. [DOI: 10.1016/j.drudis.2014.12.004] [Citation(s) in RCA: 100] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2014] [Revised: 11/13/2014] [Accepted: 12/02/2014] [Indexed: 12/20/2022]
|
9
|
Mishima K, Kaneko H, Funatsu K. Development of a New De Novo Design Algorithm for Exploring Chemical Space. Mol Inform 2014; 33:779-89. [PMID: 27485424 DOI: 10.1002/minf.201400056] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2014] [Accepted: 07/29/2014] [Indexed: 01/10/2023]
Abstract
In the first stage of development of new drugs, various lead compounds with high activity are required. To design such compounds, we focus on chemical space defined by structural descriptors. New compounds close to areas where highly active compounds exist will show the same degree of activity. We have developed a new de novo design system to search a target area in chemical space. First, highly active compounds are manually selected as initial seeds. Then, the seeds are entered into our system, and structures slightly different from the seeds are generated and pooled. Next, seeds are selected from the new structure pool based on the distance from target coordinates on the map. To test the algorithm, we used two datasets of ligand binding affinity and showed that the proposed generator could produce diverse virtual compounds that had high activity in docking simulations.
Collapse
Affiliation(s)
- Kazuaki Mishima
- Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan tel:(+81) 03-5841-7751
| | - Hiromasa Kaneko
- Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan tel:(+81) 03-5841-7751
| | - Kimito Funatsu
- Department of Chemical System Engineering, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8656, Japan tel:(+81) 03-5841-7751.
| |
Collapse
|
10
|
Samudra AP, Sahinidis NV. Optimization-based framework for computer-aided molecular design. AIChE J 2013. [DOI: 10.1002/aic.14112] [Citation(s) in RCA: 86] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Apurva P. Samudra
- Dept. of Chemical Engineering; Carnegie Mellon University; Pittsburgh, PA 15213
| | | |
Collapse
|
11
|
Zhang J, Liu S, Shang Z, Shi L, Yun J. Analysis of the relationship between end-to-end distance and activity of single-chain antibody against colorectal carcinoma. Theor Biol Med Model 2012; 9:38. [PMID: 22913623 PMCID: PMC3582594 DOI: 10.1186/1742-4682-9-38] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2012] [Accepted: 08/17/2012] [Indexed: 01/23/2023] Open
Abstract
We investigated the relationship of End-to-end distance between VH and VL with different peptide linkers and the activity of single-chain antibodies by computer-aided simulation. First, we developed (G4S)n (where n = 1-9) as the linker to connect VH and VL, and estimated the 3D structure of single-chain Fv antibody (scFv) by homologous modeling. After molecular models were evaluated and optimized, the coordinate system of every protein was built and unified into one coordinate system, and End-to-end distances calculated using 3D space coordinates. After expression and purification of scFv-n with (G4S)n as n = 1, 3, 5, 7 or 9, the immunoreactivity of purified ND-1 scFv-n was determined by ELISA. A multi-factorial relationship model was employed to analyze the structural factors affecting scFv: rn=ABn-ABO2+CDn-CDO2+BCn-BCst2. The relationship between immunoreactivity and r-values revealed that fusion protein structure approached the desired state when the r-value = 3. The immunoreactivity declined as the r-value increased, but when the r-value exceeded a certain threshold, it stabilized. We used a linear relationship to analyze structural factors affecting scFv immunoreactivity.
Collapse
Affiliation(s)
- Jianhua Zhang
- Faculty of Biomedical Engineering of Zhengzhou University, Zhengzhou, 450001, Henan Province, People's Republic of China
| | | | | | | | | |
Collapse
|
12
|
Bandholtz S, Wichard J, Kühne R, Grötzinger C. Molecular evolution of a peptide GPCR ligand driven by artificial neural networks. PLoS One 2012; 7:e36948. [PMID: 22606313 PMCID: PMC3351444 DOI: 10.1371/journal.pone.0036948] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2011] [Accepted: 04/13/2012] [Indexed: 11/18/2022] Open
Abstract
Peptide ligands of G protein-coupled receptors constitute valuable natural lead structures for the development of highly selective drugs and high-affinity tools to probe ligand-receptor interaction. Currently, pharmacological and metabolic modification of natural peptides involves either an iterative trial-and-error process based on structure-activity relationships or screening of peptide libraries that contain many structural variants of the native molecule. Here, we present a novel neural network architecture for the improvement of metabolic stability without loss of bioactivity. In this approach the peptide sequence determines the topology of the neural network and each cell corresponds one-to-one to a single amino acid of the peptide chain. Using a training set, the learning algorithm calculated weights for each cell. The resulting network calculated the fitness function in a genetic algorithm to explore the virtual space of all possible peptides. The network training was based on gradient descent techniques which rely on the efficient calculation of the gradient by back-propagation. After three consecutive cycles of sequence design by the neural network, peptide synthesis and bioassay this new approach yielded a ligand with 70fold higher metabolic stability compared to the wild type peptide without loss of the subnanomolar activity in the biological assay. Combining specialized neural networks with an exploration of the combinatorial amino acid sequence space by genetic algorithms represents a novel rational strategy for peptide design and optimization.
Collapse
Affiliation(s)
- Sebastian Bandholtz
- Charité – Universitätsmedizin Berlin, Campus Virchow-Klinikum, Department of Hepatology and Gastroenterology and Molecular Cancer Research Center (MKFZ), Tumor Targeting Lab, Berlin, Germany
| | - Jörg Wichard
- Leibnitz-Institut für Molekulare Pharmakologie (fmp), Berlin, Germany
| | - Ronald Kühne
- Leibnitz-Institut für Molekulare Pharmakologie (fmp), Berlin, Germany
| | - Carsten Grötzinger
- Charité – Universitätsmedizin Berlin, Campus Virchow-Klinikum, Department of Hepatology and Gastroenterology and Molecular Cancer Research Center (MKFZ), Tumor Targeting Lab, Berlin, Germany
- * E-mail:
| |
Collapse
|
13
|
Bandyopadhyay S, Bagchi A, Maulik U. ACTIVE SITE DRIVEN LIGAND DESIGN: AN EVOLUTIONARY APPROACH. J Bioinform Comput Biol 2011; 3:1053-70. [PMID: 16278947 DOI: 10.1142/s021972000500148x] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2004] [Accepted: 04/11/2005] [Indexed: 11/18/2022]
Abstract
An evolutionary approach for designing a ligand molecule that can bind to the active site of a target protein is described in this article. An earlier attempt in this regard assumed a fixed tree structure of the ligand on both sides of the pharmacophore, and used a genetic algorithm for optimizing the van der Waals energy. However, it is evident that knowledge about the size of the tree is difficult to obtain an a priori. Moreover, it will also change from one active site to another. This limitation is overcome in the present article by using variable string length genetic algorithm (VGA) for evolving an appropriate arrangement of the basic functional units of the molecule to be designed, whose size may now vary. The crossover and mutation operators are appropriately redesigned in order to tackle the concept of variable length chromosomes. Once the geometry of the molecule is obtained, the possible three-dimensional structure and its docking energy is determined. Results are demonstrated for five different target proteins both numerically and pictorially. It is found that not only does the molecule designed using variable length representation, in general, have lower energy values, the docking energies are also lower, as compared to the molecule evolved using fixed size representation.
Collapse
|
14
|
Shang GG, Zhang JH, Lü YG, Yun J. Bioinformatics-led design of single-chain antibody molecules targeting DNA sequences for retinoblastoma. Int J Ophthalmol 2011; 4:8-13. [PMID: 22553599 DOI: 10.3980/j.issn.2222-3959.2011.01.02] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2010] [Accepted: 01/20/2011] [Indexed: 02/01/2023] Open
Abstract
AIM To analyze the relationship between the structure and function of single-chain Fv antibody (scFv) with bioinformatics methods, so as to provide theoretical basis for retinoblastoma targeted therapy. METHODS Single-chain antibodies are reconstructed for cancer-targeted therapy to provide good penetration into tumor tissue and to improve their pharmacokinetics in vivo, offering a clinically valuable application. The relationship needs to be analyzed that there may be some variations between the structure and function of the fusion proteins, and the relationship between the structure and function of protein molecules was obtained through analyzing relevant literature at home and abroad as well as modeling analysis. RESULTS Through our analysis of the interaction region between the antibody and the antigen, and of the binding sites for molecular conformation, it is clear that existing antibodies need to be modified at the DNA sequence level, enhancing the biological activity of the antibodies. Based on the view that bio-molecular computer models are closely integrated with biological experiments, a bio-molecular structure-activity relationship model can be established in terms of molecular conformation, physical and chemical properties and the biological activity of single-chain antibodies. Two enlightenments are obtained from our analysis. On the one hand, the structure-activity relationship is clear for new immune molecules at the gene expression level. On the other hand, a single-chain antibody molecule can be designed and optimized for the cancer-oriented treatment. CONCLUSION In this article, we provide the theoretical and experimental basis for the development of single-chain antibodies appropriate for retinoblastoma therapy.
Collapse
Affiliation(s)
- Guo-Gang Shang
- Department of Radiotherapy, Zhengzhou People's Hospital, Zhengzhou 450052, Henan Province, China
| | | | | | | |
Collapse
|
15
|
Pickett SD, Green DVS, Hunt DL, Pardoe DA, Hughes I. Automated Lead Optimization of MMP-12 Inhibitors Using a Genetic Algorithm. ACS Med Chem Lett 2011; 2:28-33. [PMID: 24900251 DOI: 10.1021/ml100191f] [Citation(s) in RCA: 41] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2010] [Accepted: 10/06/2010] [Indexed: 11/29/2022] Open
Abstract
Traditional lead optimization projects involve long synthesis and testing cycles, favoring extensive structure-activity relationship (SAR) analysis and molecular design steps, in an attempt to limit the number of cycles that a project must run to optimize a development candidate. Microfluidic-based chemistry and biology platforms, with cycle times of minutes rather than weeks, lend themselves to unattended autonomous operation. The bottleneck in the lead optimization process is therefore shifted from synthesis or test to SAR analysis and design. As such, the way is open to an algorithm-directed process, without the need for detailed user data analysis. Here, we present results of two synthesis and screening experiments, undertaken using traditional methodology, to validate a genetic algorithm optimization process for future application to a microfluidic system. The algorithm has several novel features that are important for the intended application. For example, it is robust to missing data and can suggest compounds for retest to ensure reliability of optimization. The algorithm is first validated on a retrospective analysis of an in-house library embedded in a larger virtual array of presumed inactive compounds. In a second, prospective experiment with MMP-12 as the target protein, 140 compounds are submitted for synthesis over 10 cycles of optimization. Comparison is made to the results from the full combinatorial library that was synthesized manually and tested independently. The results show that compounds selected by the algorithm are heavily biased toward the more active regions of the library, while the algorithm is robust to both missing data (compounds where synthesis failed) and inactive compounds. This publication places the full combinatorial library and biological data into the public domain with the intention of advancing research into algorithm-directed lead optimization methods.
Collapse
Affiliation(s)
- Stephen D. Pickett
- GlaxoSmithKline Research and Development, Stevenage, Herts, SG1 2NY, United Kingdom
| | - Darren V. S. Green
- GlaxoSmithKline Research and Development, Stevenage, Herts, SG1 2NY, United Kingdom
| | - David L. Hunt
- Tessella plc, Stevenage, Herts, SG1 2EF, United Kingdom
| | - David A. Pardoe
- GlaxoSmithKline Research and Development, Harlow, Essex, CM19 5AW, United Kingdom
| | - Ian Hughes
- GlaxoSmithKline Research and Development, Harlow, Essex, CM19 5AW, United Kingdom
| |
Collapse
|
16
|
Kutchukian PS, Shakhnovich EI. De novo design: balancing novelty and confined chemical space. Expert Opin Drug Discov 2010; 5:789-812. [PMID: 22827800 DOI: 10.1517/17460441.2010.497534] [Citation(s) in RCA: 62] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
Abstract
IMPORTANCE OF THE FIELD De novo drug design serves as a tool for the discovery of new ligands for macromolecular targets as well as optimization of known ligands. Recently developed tools aim to address the multi-objective nature of drug design in an unprecedented manner. AREAS COVERED IN THIS REVIEW This article discusses recent advances in de novo drug design programs and accessory programs used to evaluate compounds post-generation. WHAT THE READER WILL GAIN The reader is introduced to the challenges inherent in de novo drug design and will become familiar with current trends in de novo design. Furthermore, the reader will be better prepared to assess the value of a tool, and be equipped to design more elegant tools in the future. TAKE HOME MESSAGE De novo drug design can assist in the efficient discovery of new compounds with a high affinity for a given target. The inclusion of existing chemoinformatic methods with current structure-based de novo design tools provides a means of enhancing the therapeutic value of these generated compounds.
Collapse
Affiliation(s)
- Peter S Kutchukian
- Harvard University, Chemistry and Chemical Biology Department, 12 Oxford Street, Cambridge, MA 02138, USA
| | | |
Collapse
|
17
|
Unal EB, Gursoy A, Erman B. VitAL: Viterbi algorithm for de novo peptide design. PLoS One 2010; 5:e10926. [PMID: 20532195 PMCID: PMC2880006 DOI: 10.1371/journal.pone.0010926] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2010] [Accepted: 05/07/2010] [Indexed: 01/18/2023] Open
Abstract
Background Drug design against proteins to cure various diseases has been studied for several years. Numerous design techniques were discovered for small organic molecules for specific protein targets. The specificity, toxicity and selectivity of small molecules are hard problems to solve. The use of peptide drugs enables a partial solution to the toxicity problem. There has been a wide interest in peptide design, but the design techniques of a specific and selective peptide inhibitor against a protein target have not yet been established. Methodology/Principal Findings A novel de novo peptide design approach is developed to block activities of disease related protein targets. No prior training, based on known peptides, is necessary. The method sequentially generates the peptide by docking its residues pair by pair along a chosen path on a protein. The binding site on the protein is determined via the coarse grained Gaussian Network Model. A binding path is determined. The best fitting peptide is constructed by generating all possible peptide pairs at each point along the path and determining the binding energies between these pairs and the specific location on the protein using AutoDock. The Markov based partition function for all possible choices of the peptides along the path is generated by a matrix multiplication scheme. The best fitting peptide for the given surface is obtained by a Hidden Markov model using Viterbi decoding. The suitability of the conformations of the peptides that result upon binding on the surface are included in the algorithm by considering the intrinsic Ramachandran potentials. Conclusions/Significance The model is tested on known protein-peptide inhibitor complexes. The present algorithm predicts peptides that have better binding energies than those of the existing ones. Finally, a heptapeptide is designed for a protein that has excellent binding affinity according to AutoDock results.
Collapse
Affiliation(s)
- E. Besray Unal
- Center for Computational Biology and Bioinformatics, Koc University, Istanbul, Turkey
| | - Attila Gursoy
- Center for Computational Biology and Bioinformatics, Koc University, Istanbul, Turkey
| | - Burak Erman
- Center for Computational Biology and Bioinformatics, Koc University, Istanbul, Turkey
- * E-mail:
| |
Collapse
|
18
|
Fernandez M, Caballero J, Fernandez L, Sarai A. Genetic algorithm optimization in drug design QSAR: Bayesian-regularized genetic neural networks (BRGNN) and genetic algorithm-optimized support vectors machines (GA-SVM). Mol Divers 2010; 15:269-89. [PMID: 20306130 DOI: 10.1007/s11030-010-9234-9] [Citation(s) in RCA: 48] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2009] [Accepted: 01/25/2010] [Indexed: 10/19/2022]
Abstract
Many articles in "in silico" drug design implemented genetic algorithm (GA) for feature selection, model optimization, conformational search, or docking studies. Some of these articles described GA applications to quantitative structure-activity relationships (QSAR) modeling in combination with regression and/or classification techniques. We reviewed the implementation of GA in drug design QSAR and specifically its performance in the optimization of robust mathematical models such as Bayesian-regularized artificial neural networks (BRANNs) and support vector machines (SVMs) on different drug design problems. Modeled data sets encompassed ADMET and solubility properties, cancer target inhibitors, acetylcholinesterase inhibitors, HIV-1 protease inhibitors, ion-channel and calcium entry blockers, and antiprotozoan compounds as well as protein classes, functional, and conformational stability data. The GA-optimized predictors were often more accurate and robust than previous published models on the same data sets and explained more than 65% of data variances in validation experiments. In addition, feature selection over large pools of molecular descriptors provided insights into the structural and atomic properties ruling ligand-target interactions.
Collapse
Affiliation(s)
- Michael Fernandez
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology (KIT), 680-4 Kawazu, Iizuka, 820-8502, Japan.
| | | | | | | |
Collapse
|
19
|
|
20
|
Friedrich R, Riester D, Göttig P, Thürk M, Schwienhorst A, Bode W. Structure of a novel thrombin inhibitor with an uncharged D-amino acid as P1 residue. Eur J Med Chem 2007; 43:1330-5. [PMID: 17950494 DOI: 10.1016/j.ejmech.2007.07.019] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2005] [Revised: 07/22/2007] [Accepted: 07/26/2007] [Indexed: 11/17/2022]
Abstract
Thrombin, the ultimate proteinase of the coagulation cascade, is an attractive target for the treatment of a variety of cardiovascular diseases. Previously, a series of novel thrombin inhibitors, discovered by employing a powerful and new computer-assisted multiparameter optimization process (CADDIS), have been synthesized. We have now crystallized the complex of human alpha-thrombin with the most potent of these inhibitors, 8-5 (K(i)=3 nM), and have determined its 2.3A X-ray crystal structure. The Fourier map displayed clear electron density for the inhibitor. The central part of the inhibitor binds in an improved melagatran-like mode, while the structure identifies a d-tyrosine as P1 residue which forms a charged hydrogen bond with Asp 189 of thrombin. This is the first crystal structure of a thrombin-inhibitor complex, where an uncharged inhibitor residue makes hydrogen bonds within the thrombin S1 pocket. Additionally, novel favourable intermolecular hydrogen bonds of the inhibitor with the thrombin backbone become possible due to the d-configuration of the P1 residue. Two flanking voluminous side chains increase the strength of the subjacent hydrogen bonding system by shielding it from the bulk solvent.
Collapse
Affiliation(s)
- Rainer Friedrich
- Arbeitsgruppe Proteinaseforschung, Max-Planck-Institut für Biochemie, Am Klopferspitz 18, 82152 Martinsried, Germany
| | | | | | | | | | | |
Collapse
|
21
|
Hilpert K, Winkler DFH, Hancock REW. Cellulose-bound Peptide Arrays: Preparation and Applications. Biotechnol Genet Eng Rev 2007; 24:31-106. [DOI: 10.1080/02648725.2007.10648093] [Citation(s) in RCA: 17] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/27/2022]
|
22
|
Belda I, Madurga S, Tarragó T, Llorà X, Giralt E. Evolutionary computation and multimodal search: a good combination to tackle molecular diversity in the field of peptide design. Mol Divers 2006; 11:7-21. [PMID: 17165156 DOI: 10.1007/s11030-006-9053-1] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2006] [Accepted: 09/24/2006] [Indexed: 10/23/2022]
Abstract
The awesome degree of structural diversity accessible in peptide design has created a demand for computational resources that can evaluate a multitude of candidate structures. In our specific case, we translate the peptide design problem to an optimization problem, and use evolutionary computation (EC) in tandem with docking to carry out a combinatorial search. However, the use of EC in huge search spaces with different optima may pose certain drawbacks. For example, EC is prone to focus a search in the first good region found. This is a problem not only because of the undesirable and automatic rejection of potentially good search space regions, but also because the found solution may be extremely difficult to synthesize chemically or may even be a false docking positive. In order to avoid rejecting potentially good solutions and to maximize the molecular diversity of the search, we have implemented evolutionary multimodal search techniques, as well as the molecular diversity metric needed by the multimodal algorithms to measure differences between various regions of the search space.
Collapse
Affiliation(s)
- Ignasi Belda
- Institut de Recerca Biomèdica, Parc Científic de Barcelona, Universitat de Barcelona, Josep Samitier, Barcelona, Spain
| | | | | | | | | |
Collapse
|
23
|
Brown N, McKay B, Gasteiger J. A novel workflow for the inverse QSPR problem using multiobjective optimization. J Comput Aided Mol Des 2006; 20:333-41. [PMID: 17031542 DOI: 10.1007/s10822-006-9063-1] [Citation(s) in RCA: 34] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/03/2006] [Accepted: 08/03/2006] [Indexed: 11/25/2022]
Abstract
A workflow for the inverse quantitative structure-property relationship (QSPR) problem is reported in this paper for the de novo design of novel chemical entities (NCE) in silico through the application of existing QSPR models to calculate multiple objectives, including prediction confidence measures, to be optimized during the de novo design process. Two physical property datasets are applied as case studies of the inverse QSPR workflow (IQW): mean molecular polarizability and aqueous solubility. The case studies demonstrate the optimization of molecular structures to within a property range of interest; the optimized structures are then validated against QSPR models that are generated from sets of alternative descriptors to those used in the IQW. The paper concludes with a discussion of the results from the case studies.
Collapse
Affiliation(s)
- Nathan Brown
- Avantium Technologies B.V., P.O. Box 2915, 1000 CX Amsterdam, The Netherlands.
| | | | | |
Collapse
|
24
|
Zhang W, Yano K, Karube I. Improving the efficiency of evolutionary de novo peptide design: strategies for probing configuration and parameter settings. Biosystems 2006; 88:35-55. [PMID: 16870325 DOI: 10.1016/j.biosystems.2006.04.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2004] [Revised: 11/30/2005] [Accepted: 04/11/2006] [Indexed: 11/28/2022]
Abstract
Evolutionary molecular design based on genetic algorithms (GAs) has been demonstrated to be a flexible and efficient optimization approach with potential for locating global optima. Its efficacy and efficiency are largely dependent on the operations and control parameters of the GAs. Accordingly, we have explored new operations and probed good parameter setting through simulations. The findings have been evaluated in a helical peptide design according to "Parameter setting by analogy" strategy; highly helical peptides have been successfully obtained with a population of only 16 peptides and 5 iterative cycles. The results indicate that new operations such as multi-step crossover-mutation are able to improve the explorative efficiency and to reduce the sensitivity to crossover and mutation rates (CR-MR). The efficiency of the peptide design has been furthermore improved by setting the GAs at the good CR-MR setting determined through simulation. These results suggest that probing the operations and parameter settings through simulation in combination with "Parameter setting by analogy" strategy provides an effective framework for improving the efficiency of the approach. Consequently, we conclude that this framework will be useful for contributing to practical peptide design, and gaining a better understanding of evolutionary molecular design.
Collapse
Affiliation(s)
- Wuming Zhang
- Research Center for Advanced Science and Technology, The University of Tokyo, 4-6-1 Komaba, Tokyo 153-8904, Japan
| | | | | |
Collapse
|
25
|
Paszkowicz W. Properties of a genetic algorithm extended by a random self-learning operator and asymmetric mutations: A convergence study for a task of powder-pattern indexing. Anal Chim Acta 2006. [DOI: 10.1016/j.aca.2006.02.055] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
26
|
Lameijer EW, Kok JN, Bäck T, Ijzerman AP. The Molecule Evoluator. An Interactive Evolutionary Algorithm for the Design of Drug-Like Molecules. J Chem Inf Model 2006; 46:545-52. [PMID: 16562982 DOI: 10.1021/ci050369d] [Citation(s) in RCA: 56] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
We developed a software tool to design drug-like molecules, the "Molecule Evoluator", which we introduce and describe here. An atom-based evolutionary approach was used allowing both several types of mutation and crossover to occur. The novelty, we claim, is the unprecedented interactive evolution, in which the user acts as a fitness function. This brings a human being's creativity, implicit knowledge, and imagination into the design process, next to the more standard chemical rules. Proof-of-concept was demonstrated in a number of ways, both computationally and in the lab. Thus, we synthesized a number of compounds designed with the aid of the Molecule Evoluator. One of these is described here, a new chemical entity with activity on alpha-adrenergic receptors.
Collapse
Affiliation(s)
- Eric-Wubbo Lameijer
- Leiden/Amsterdam Center for Drug Research, Division of Medicinal Chemistry, P.O. Box 9502, 2300RA Leiden, The Netherlands
| | | | | | | |
Collapse
|
27
|
El Oualid F, van den Elst H, Leroy IM, Pieterman E, Cohen LH, Burm BEA, Overkleeft HS, van der Marel GA, Overhand M. A combinatorial approach toward the generation of ambiphilic peptide-based inhibitors of protein:geranylgeranyl transferase-1. ACTA ACUST UNITED AC 2005; 7:703-13. [PMID: 16153065 DOI: 10.1021/cc0500203] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
A combinatorial synthesis of oligopeptide analogues and their evaluation as protein:geranylgeranyl transferase inhibitors is presented. The combinatorial strategy is based on the random mutation, in each new generation, of one of any of the four amino acid building blocks of which the most effective compounds of the previous generation are assembled. In this way, a progressive improvement of the average inhibitory activity was observed until the fifth generation. The most active inhibitors were found to inhibit PGGT-1 in the low micromolar range (IC(50): 3.8-8.1 microM).
Collapse
Affiliation(s)
- Farid El Oualid
- Leiden Institute of Chemistry, Gorlaeus Laboratories, Leiden University, P.O. Box 9502, 2300 RA 2300 RA Leiden, The Netherlands
| | | | | | | | | | | | | | | | | |
Collapse
|
28
|
Belda I, Madurga S, Llorà X, Martinell M, Tarragó T, Piqueras MG, Nicolás E, Giralt E. ENPDA: an evolutionary structure-based de novo peptide design algorithm. J Comput Aided Mol Des 2005; 19:585-601. [PMID: 16267689 DOI: 10.1007/s10822-005-9015-1] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2005] [Accepted: 08/14/2005] [Indexed: 10/25/2022]
Abstract
One of the goals of computational chemists is to automate the de novo design of bioactive molecules. Despite significant advances in computational approaches to ligand design and binding energy evaluation, novel procedures for ligand design are required. Evolutionary computation provides a new approach to this design endeavor. We propose an evolutionary tool for de novo peptide design, based on the evaluation of energies for peptide binding to a user-defined protein surface patch. Special emphasis has been placed on the evaluation of the proposed peptides, leading to two different evaluation heuristics. The software developed was successfully tested on the design of ligands for the proteins prolyl oligopeptidase, p53, and DNA gyrase.
Collapse
Affiliation(s)
- Ignasi Belda
- Institut de Recerca Biomèdica de Barcelona, Parc Científic de Barcelona, Universitat de Barcelona, Josep Samitier, 1-5, Barcelona, E 08028, Spain
| | | | | | | | | | | | | | | |
Collapse
|
29
|
Riester D, Wirsching F, Salinas G, Keller M, Gebinoga M, Kamphausen S, Merkwirth C, Goetz R, Wiesenfeldt M, Stürzebecher J, Bode W, Friedrich R, Thürk M, Schwienhorst A. Thrombin inhibitors identified by computer-assisted multiparameter design. Proc Natl Acad Sci U S A 2005; 102:8597-602. [PMID: 15937115 PMCID: PMC1150832 DOI: 10.1073/pnas.0501983102] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2004] [Indexed: 11/18/2022] Open
Abstract
Here, we present a series of thrombin inhibitors that were generated by using powerful computer-assisted multiparameter optimization process. The process was organized in design cycles, starting with a set of randomly chosen molecules. Each cycle combined combinatorial synthesis, multiparameter characterization of compounds in a variety of bioassays, and algorithmic processing of the data to devise a set of compounds to be synthesized in the next cycle. The identified lead compounds exhibited thrombin inhibitory constants in the lower nanomolar range. They are by far the most selective synthetic thrombin inhibitors, with selectivities of >100,000-fold toward other proteases such as Factor Xa, Factor XIIa, urokinase, plasmin, and Plasma kallikrein. Furthermore, these compounds exhibit a favorable profile, comprising nontoxicity, high metabolic stability, low serum protein binding, good solubility, high anticoagulant activity, and a slow and exclusively renal elimination from the circulation in a rat model. Finally, x-ray crystallographic analysis of a thrombin-inhibitor complex revealed a binding mode with a neutral moiety in the S1 pocket of thrombin.
Collapse
Affiliation(s)
- Daniel Riester
- Abteilung für Molekulare Genetik und Präparative Molekularbiologie, Institut für Mikrobiologie und Genetik, Grisebachstrasse 8, 37077 Göttingen, Germany
| | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
30
|
Brown N, McKay B, Gilardoni F, Gasteiger J. A Graph-Based Genetic Algorithm and Its Application to the Multiobjective Evolution of Median Molecules. ACTA ACUST UNITED AC 2004; 44:1079-87. [PMID: 15154776 DOI: 10.1021/ci034290p] [Citation(s) in RCA: 106] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
In this paper we propose a novel graph-based genetic algorithm for the evolution of novel molecular graphs from a predefined set of elements or molecular fragments with an external objective function. A brief overview of existing genetic algorithm approaches in molecular design is provided followed by a description of our approach. The paper continues to suggest a novel application of this program to the multiobjective evolution of median molecules that are structurally representative of a set of objective molecules. We conclude with a summary of our initial results along with a discussion of a variety of improvements and applications of our approach.
Collapse
Affiliation(s)
- Nathan Brown
- Avantium Technologies B.V., P.O. Box 2915, 1000 CX Amsterdam, The Netherlands.
| | | | | | | |
Collapse
|
31
|
Bakır GH, Zien A, Tsuda K. Learning to Find Graph Pre-images. LECTURE NOTES IN COMPUTER SCIENCE 2004. [DOI: 10.1007/978-3-540-28649-3_31] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
|