1
|
Kunz RK, Rojnuckarin A, Schmidt CM, Miranda LP. Development of human-machine language interfaces for the visual analysis of complex biologics and RNA modalities and associated experimental data. AAPS OPEN 2023. [DOI: 10.1186/s41120-023-00073-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/06/2023] Open
Abstract
AbstractThe advent of recombinant protein-based therapeutic agents in the 1980s and subsequent waves of innovation in molecular biology and engineering of biologics has permitted the production of an increasingly broad array of complex, high molecular weight constructs. While this has opened a powerful new toolbox of molecular scaffolds with which to probe and interdict biological processes, it also makes deciphering the architectural nuances between individual constructs intuitively difficult. Key to downstream data processes for the detection of data trends is the ability to unambiguously identify, compare, and communicate the nature of molecular compositions. Existing small molecule orientated software tools are not intended for structures such as peptides, proteins, antibodies, and RNA, and do not contain adequate atomistic or domain-level detail to appropriately convey their higher structural complexity. Similarly, there is a paucity of large molecule-focused data analysis and visualization tools. This article will describe four new approaches we developed for the graphical representation and analysis of complex large molecules and experimental data. These tools help fulfill key needs in scientific communication and structure-property analysis of complex biologics and modified oligonucleotide-based drug candidates.
Collapse
|
2
|
Fox T, Bieler M, Haebel P, Ochoa R, Peters S, Weber A. BILN: A Human-Readable Line Notation for Complex Peptides. J Chem Inf Model 2022; 62:3942-3947. [PMID: 35984937 DOI: 10.1021/acs.jcim.2c00703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
We present an easy, human-readable line notation to describe even complex peptides.
Collapse
Affiliation(s)
- Thomas Fox
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Michael Bieler
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Peter Haebel
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Rodrigo Ochoa
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Stefan Peters
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| | - Alexander Weber
- Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397 Biberach/Riss, Germany
| |
Collapse
|
3
|
Minkiewicz P, Darewicz M, Iwaniak A, Turło M. Proposal of the Annotation of Phosphorylated Amino Acids and Peptides Using Biological and Chemical Codes. Molecules 2021; 26:molecules26030712. [PMID: 33573096 PMCID: PMC7866520 DOI: 10.3390/molecules26030712] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2020] [Revised: 01/21/2021] [Accepted: 01/26/2021] [Indexed: 01/04/2023] Open
Abstract
Phosphorylation represents one of the most important modifications of amino acids, peptides, and proteins. By modifying the latter, it is useful in improving the functional properties of foods. Although all these substances are broadly annotated in internet databases, there is no unified code for their annotation. The present publication aims to describe a simple code for the annotation of phosphopeptide sequences. The proposed code describes the location of phosphate residues in amino acid side chains (including new rules of atom numbering in amino acids) and the diversity of phosphate residues (e.g., di- and triphosphate residues and phosphate amidation). This article also includes translating the proposed biological code into SMILES, being the most commonly used chemical code. Finally, it discusses possible errors associated with applying the proposed code and in the resulting SMILES representations of phosphopeptides. The proposed code can be extended to describe other modifications in the future.
Collapse
|
4
|
David L, Thakkar A, Mercado R, Engkvist O. Molecular representations in AI-driven drug discovery: a review and practical guide. J Cheminform 2020; 12:56. [PMID: 33431035 PMCID: PMC7495975 DOI: 10.1186/s13321-020-00460-5] [Citation(s) in RCA: 150] [Impact Index Per Article: 37.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2020] [Accepted: 09/05/2020] [Indexed: 02/08/2023] Open
Abstract
The technological advances of the past century, marked by the computer revolution and the advent of high-throughput screening technologies in drug discovery, opened the path to the computational analysis and visualization of bioactive molecules. For this purpose, it became necessary to represent molecules in a syntax that would be readable by computers and understandable by scientists of various fields. A large number of chemical representations have been developed over the years, their numerosity being due to the fast development of computers and the complexity of producing a representation that encompasses all structural and chemical characteristics. We present here some of the most popular electronic molecular and macromolecular representations used in drug discovery, many of which are based on graph representations. Furthermore, we describe applications of these representations in AI-driven drug discovery. Our aim is to provide a brief guide on structural representations that are essential to the practice of AI in drug discovery. This review serves as a guide for researchers who have little experience with the handling of chemical representations and plan to work on applications at the interface of these fields.
Collapse
Affiliation(s)
- Laurianne David
- Hit Discovery, Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca Gothenburg, Sweden.
| | - Amol Thakkar
- Hit Discovery, Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca Gothenburg, Sweden
- Department of Chemistry and Biochemistry, University of Bern, Bern, Switzerland
| | - Rocío Mercado
- Hit Discovery, Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca Gothenburg, Sweden
| | - Ola Engkvist
- Hit Discovery, Discovery Sciences, BioPharmaceuticals R&D, Astrazeneca Gothenburg, Sweden
| |
Collapse
|
5
|
Toropova AP, Toropov AA. Application of the Monte Carlo Method for the Prediction of Behavior of Peptides. Curr Protein Pept Sci 2019; 20:1151-1157. [DOI: 10.2174/1389203720666190123163907] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/23/2018] [Revised: 12/17/2018] [Accepted: 12/20/2018] [Indexed: 12/26/2022]
Abstract
Prediction of physicochemical and biochemical behavior of peptides is an important and attractive
task of the modern natural sciences, since these substances have a key role in life processes. The
Monte Carlo technique is a possible way to solve the above task. The Monte Carlo method is a tool with
different applications relative to the study of peptides: (i) analysis of the 3D configurations (conformers);
(ii) establishment of quantitative structure – property / activity relationships (QSPRs/QSARs); and (iii)
development of databases on the biopolymers. Current ideas related to application of the Monte Carlo
technique for studying peptides and biopolymers have been discussed in this review.
Collapse
Affiliation(s)
- Alla P. Toropova
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milano, Italy
| | - Andrey A. Toropov
- Istituto di Ricerche Farmacologiche Mario Negri IRCCS, Via La Masa 19, 20156 Milano, Italy
| |
Collapse
|
6
|
BIOPEP-UWM Database of Bioactive Peptides: Current Opportunities. Int J Mol Sci 2019; 20:ijms20235978. [PMID: 31783634 PMCID: PMC6928608 DOI: 10.3390/ijms20235978] [Citation(s) in RCA: 363] [Impact Index Per Article: 72.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2019] [Revised: 11/21/2019] [Accepted: 11/25/2019] [Indexed: 12/11/2022] Open
Abstract
The BIOPEP-UWM™ database of bioactive peptides (formerly BIOPEP) has recently become a popular tool in the research on bioactive peptides, especially on these derived from foods and being constituents of diets that prevent development of chronic diseases. The database is continuously updated and modified. The addition of new peptides and the introduction of new information about the existing ones (e.g., chemical codes and references to other databases) is in progress. New opportunities include the possibility of annotating peptides containing D-enantiomers of amino acids, batch processing option, converting amino acid sequences into SMILES code, new quantitative parameters characterizing the presence of bioactive fragments in protein sequences, and finding proteinases that release particular peptides.
Collapse
|
7
|
Schutte RJ, Zhang X, An N, Ostrov DA, Vukmanović S. Molecular docking predictions of fragrance binding to human leukocyte antigen molecules. Contact Dermatitis 2019; 81:174-183. [PMID: 30957232 DOI: 10.1111/cod.13283] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2018] [Revised: 03/29/2019] [Accepted: 04/03/2019] [Indexed: 11/27/2022]
Abstract
BACKGROUND Over 4000 small chemicals have been identified as allergens capable of inducing skin sensitization. Many sensitizers are hypothesized to act as haptens producing novel antigens, which can be presented to T cells by human leukocyte antigens (HLAs). Recent studies suggest that some chemical allergens use hapten-independent mechanisms. OBJECTIVE To determine whether molecular docking can identify HLA molecules that bind skin-sensitizing chemical allergens. METHODS Structural models of HLA molecules were used as the basis for molecular docking of 22 chemical allergens. Allergens predicted to bind HLA-B*57:01 were tested for their ability to stimulate T cells by the use of proliferation and interferon-gamma enzyme-linked immunospot assays. RESULTS Chemical allergens that did not satisfy the criteria for hapten activity in vitro were predicted to bind more strongly to common HLA isoforms than those with known hapten activity. HLA-B*57:01, which is an HLA allele required for drug hypersensitivity reactions, was predicted to bind several allergens, including benzyl benzoate, benzyl cinnamate, and benzyl salicylate. In in vitro T cell stimulation assays, benzyl salicylate and benzyl cinnamate were found to stimulate T cell responses from HLA-B*57:01 carriers. CONCLUSIONS These data suggest that small-molecule skin sensitizers have the potential to interact with HLA, and show that T cell-based in vitro assays may be used to evaluate the immunogenicity of skin-sensitizing chemicals.
Collapse
Affiliation(s)
- Ryan J Schutte
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida College of Medicine, Gainesville, Florida
| | - Xiaojuan Zhang
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida College of Medicine, Gainesville, Florida
| | - Nan An
- Division of Cosmetics, Office of Cosmetics and Colors, CFSAN, FDA, College Park, Maryland
| | - David A Ostrov
- Department of Pathology, Immunology, and Laboratory Medicine, University of Florida College of Medicine, Gainesville, Florida
| | - Stanislav Vukmanović
- Division of Cosmetics, Office of Cosmetics and Colors, CFSAN, FDA, College Park, Maryland
| |
Collapse
|
8
|
Ricart E, Leclère V, Flissi A, Mueller M, Pupin M, Lisacek F. rBAN: retro-biosynthetic analysis of nonribosomal peptides. J Cheminform 2019; 11:13. [PMID: 30737579 PMCID: PMC6689883 DOI: 10.1186/s13321-019-0335-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2018] [Accepted: 01/31/2019] [Indexed: 12/19/2022] Open
Abstract
Proteinogenic and non-proteinogenic amino acids, fatty acids or glycans are some of the main building blocks of nonribsosomal peptides (NRPs) and as such may give insight into the origin, biosynthesis and bioactivities of their constitutive peptides. Hence, the structural representation of NRPs using monomers provides a biologically interesting skeleton of these secondary metabolites. Databases dedicated to NRPs such as Norine, already integrate monomer-based annotations in order to facilitate the development of structural analysis tools. In this paper, we present rBAN (retro-biosynthetic analysis of nonribosomal peptides), a new computational tool designed to predict the monomeric graph of NRPs from their atomic structure in SMILES format. This prediction is achieved through the "in silico" fragmentation of a chemical structure and matching the resulting fragments against the monomers of Norine for identification. Structures containing monomers not yet recorded in Norine, are processed in a "discovery mode" that uses the RESTful service from PubChem to search the unidentified substructures and suggest new monomers. rBAN was integrated in a pipeline for the curation of Norine data in which it was used to check the correspondence between the monomeric graphs annotated in Norine and SMILES-predicted graphs. The process concluded with the validation of the 97.26% of the records in Norine, a two-fold extension of its SMILES data and the introduction of 11 new monomers suggested in the discovery mode. The accuracy, robustness and high-performance of rBAN were demonstrated in benchmarking it against other tools with the same functionality: Smiles2Monomers and GRAPE.
Collapse
Affiliation(s)
- Emma Ricart
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211, Geneva, Switzerland. .,Computer Science Department, University of Geneva, Geneva, Switzerland.
| | - Valérie Leclère
- EA 7394-ICV- Institut Charles Viollette, University of Lille, INRA, ISA, University of Artois, Univ. Littoral Côte d'Opale, 59000, Lille, France
| | - Areski Flissi
- UMR 9189- CRIStAL- Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille, CNRS, Centrale Lille, 59000, Lille, France.,Bonsai Team, Inria-Lille Nord Europe, 9655, Villeneuve d'Ascq Cedex, France
| | - Markus Mueller
- Vital-IT Group, SIB Swiss Institute of Bioinformatics, Amphipole Building, Quartier Sorge, 1015, Lausanne, Switzerland
| | - Maude Pupin
- UMR 9189- CRIStAL- Centre de Recherche en Informatique Signal et Automatique de Lille, University of Lille, CNRS, Centrale Lille, 59000, Lille, France.,Bonsai Team, Inria-Lille Nord Europe, 9655, Villeneuve d'Ascq Cedex, France
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211, Geneva, Switzerland.,Computer Science Department, University of Geneva, Geneva, Switzerland.,Section of Biology, University of Geneva, Geneva, Switzerland
| |
Collapse
|
9
|
van den Broek K, Daniel M, Epple M, Kuhn H, Schaub J, Zielesny A. SPICES: a particle-based molecular structure line notation and support library for mesoscopic simulation. J Cheminform 2018; 10:35. [PMID: 30094683 PMCID: PMC6085218 DOI: 10.1186/s13321-018-0294-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/05/2018] [Accepted: 08/03/2018] [Indexed: 12/18/2022] Open
Abstract
Simplified Particle Input ConnEction Specification (SPICES) is a particle-based molecular structure representation derived from straightforward simplifications of the atom-based SMILES line notation. It aims at supporting tedious and error-prone molecular structure definitions for particle-based mesoscopic simulation techniques like Dissipative Particle Dynamics by allowing for an interplay of different molecular encoding levels that range from topological line notations and corresponding particle-graph visualizations to 3D structures with support of their spatial mapping into a simulation box. An open Java library for SPICES structure handling and mesoscopic simulation support in combination with an open Java Graphical User Interface viewer application for visual topological inspection of SPICES definitions are provided.![]()
Collapse
Affiliation(s)
- Karina van den Broek
- Inorganic Chemistry and Center for Nanointegration Duisburg-Essen (CeNIDE), University of Duisburg-Essen, Essen, Germany.,Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665, Recklinghausen, Germany
| | - Mirco Daniel
- Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665, Recklinghausen, Germany
| | - Matthias Epple
- Inorganic Chemistry and Center for Nanointegration Duisburg-Essen (CeNIDE), University of Duisburg-Essen, Essen, Germany
| | | | - Jonas Schaub
- Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665, Recklinghausen, Germany
| | - Achim Zielesny
- Institute for Bioinformatics and Chemoinformatics, Westphalian University of Applied Sciences, August-Schmidt-Ring 10, 45665, Recklinghausen, Germany.
| |
Collapse
|
10
|
Minkiewicz P, Iwaniak A, Darewicz M. Annotation of Peptide Structures Using SMILES and Other Chemical Codes-Practical Solutions. Molecules 2017; 22:molecules22122075. [PMID: 29186902 PMCID: PMC6149970 DOI: 10.3390/molecules22122075] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/16/2017] [Revised: 11/15/2017] [Accepted: 11/25/2017] [Indexed: 12/20/2022] Open
Abstract
Contemporary peptide science exploits methods and tools of bioinformatics, and cheminformatics. These approaches use different languages to describe peptide structures—amino acid sequences and chemical codes (especially SMILES), respectively. The latter may be applied, e.g., in comparative studies involving structures and properties of peptides and peptidomimetics. Progress in peptide science “in silico” may be achieved via better communication between biologists and chemists, involving the translation of peptide representation from amino acid sequence into SMILES code. Recent recommendations concerning good practice in chemical information include careful verification of data and their annotation. This publication discusses the generation of SMILES representations of peptides using existing software. Construction of peptide structures containing unnatural and modified amino acids (with special attention paid on glycosylated peptides) is also included. Special attention is paid to the detection and correction of typical errors occurring in SMILES representations of peptides and their correction using molecular editors. Brief recommendations for training of staff working on peptide annotations, are discussed as well.
Collapse
Affiliation(s)
- Piotr Minkiewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Plac Cieszyński 1, 10-726 Olsztyn-Kortowo, Poland.
| | - Anna Iwaniak
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Plac Cieszyński 1, 10-726 Olsztyn-Kortowo, Poland.
| | - Małgorzata Darewicz
- Chair of Food Biochemistry, Faculty of Food Science, University of Warmia and Mazury in Olsztyn, Plac Cieszyński 1, 10-726 Olsztyn-Kortowo, Poland.
| |
Collapse
|
11
|
|
12
|
Martin YC. Challenges and prospects for computational aids to molecular diversity. ACTA ACUST UNITED AC 2017. [DOI: 10.1007/bf03380186] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
|
13
|
|
14
|
Aranzamendi E, Arrasate S, Sotomayor N, González‐Díaz H, Lete E. Chiral Brønsted Acid-Catalyzed Enantioselective α-Amidoalkylation Reactions: A Joint Experimental and Predictive Study. ChemistryOpen 2016; 5:540-549. [PMID: 28032023 PMCID: PMC5167290 DOI: 10.1002/open.201600120] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/30/2016] [Indexed: 01/01/2023] Open
Abstract
Enamides with a free NH group have been evaluated as nucleophiles in chiral Brønsted acid-catalyzed enantioselective α-amidoalkylation reactions of bicyclic hydroxylactams for the generation of quaternary stereocenters. A quantitative structure-reactivity relationship (QSRR) method has been developed to find a useful tool to rationalize the enantioselectivity in this and related processes and to orient the catalyst choice. This correlative perturbation theory (PT)-QSRR approach has been used to predict the effect of the structure of the substrate, nucleophile, and catalyst, as well as the experimental conditions, on the enantioselectivity. In this way, trends to improve the experimental results could be found without engaging in a long-term empirical investigation.
Collapse
Affiliation(s)
- Eider Aranzamendi
- Departamento de Química Orgánica IIFacultad de Ciencia y TecnologíaUniversidad del País Vasco/Euskal Herriko Unibertsitatea (UPV/EHU)Apdo. 64448080BilbaoSpain
| | - Sonia Arrasate
- Departamento de Química Orgánica IIFacultad de Ciencia y TecnologíaUniversidad del País Vasco/Euskal Herriko Unibertsitatea (UPV/EHU)Apdo. 64448080BilbaoSpain
| | - Nuria Sotomayor
- Departamento de Química Orgánica IIFacultad de Ciencia y TecnologíaUniversidad del País Vasco/Euskal Herriko Unibertsitatea (UPV/EHU)Apdo. 64448080BilbaoSpain
| | - Humberto González‐Díaz
- Departamento de Química Orgánica IIFacultad de Ciencia y TecnologíaUniversidad del País Vasco/Euskal Herriko Unibertsitatea (UPV/EHU)Apdo. 64448080BilbaoSpain
- IKERBASQUEBasque Foundation for Science48080BilbaoSpain
| | - Esther Lete
- Departamento de Química Orgánica IIFacultad de Ciencia y TecnologíaUniversidad del País Vasco/Euskal Herriko Unibertsitatea (UPV/EHU)Apdo. 64448080BilbaoSpain
| |
Collapse
|
15
|
Hanson RM. Jmol SMILES and Jmol SMARTS: specifications and applications. J Cheminform 2016; 8:50. [PMID: 28316648 PMCID: PMC5037863 DOI: 10.1186/s13321-016-0160-4] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2016] [Accepted: 09/02/2016] [Indexed: 11/22/2022] Open
Abstract
Background SMILES and SMARTS are two well-defined structure matching languages that have gained wide use in cheminformatics. Jmol is a widely used open-source molecular visualization and analysis tool written in Java and implemented in both Java and JavaScript. Over the past 10 years, from 2007 to 2016, work on Jmol has included the development of dialects of SMILES and SMARTS that incorporate novel aspects that allow new and powerful applications. Results The specifications of “Jmol SMILES” and “Jmol SMARTS” are described. The dialects most closely resemble OpenSMILES and OpenSMARTS. Jmol SMILES is a superset of OpenSMILES, allowing a freer format, including whitespace and comments, the addition of “processing directives” that modify the meaning of certain aspects of SMILES processing such as aromaticity and stereochemistry, a more extensive treatment of stereochemistry, and several minor additions. Jmol SMARTS similarly adds these same modifications to OpenSMARTS, but also adds a number of additional “primitives” and elements of syntax tuned to matching 3D molecular structures and selecting their atoms. The result is an expansion of the capabilities of SMILES and SMARTS primarily for use in 3D molecular analysis, allowing a broader range of matching involving any combination of 3D molecular structures, SMILES strings, and SMARTS patterns. While developed specifically for Jmol, these dialects of SMILES and SMARTS are independent of the Jmol application itself. Conclusions Jmol SMILES and Jmol SMARTS add value to standard SMILES and SMARTS. Together they have proven exceptionally capable in extracting valuable information from 3D structural models, as demonstrated in Jmol. Capabilities in Jmol enabled by Jmol SMILES and Jmol SMARTS include efficient MMFF94 atom typing, conformational identification, SMILES comparisons without canonicalization, identification of stereochemical relationships, quantitative comparison of 3D structures from different sources (including differences in Kekulization), conformational flexible fitting, and atom mapping used to synchronize interactive displays of 2D structures, 3D structures, and spectral correlations, where data are being drawn from multiple sources. Electronic supplementary material The online version of this article (doi:10.1186/s13321-016-0160-4) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Robert M Hanson
- Department of Chemistry, St. Olaf College, 1520 St. Olaf Ave., Northfield, MN USA
| |
Collapse
|
16
|
Blázquez-Barbadillo C, Aranzamendi E, Coya E, Lete E, Sotomayor N, González-Díaz H. Perturbation theory model of reactivity and enantioselectivity of palladium-catalyzed Heck–Heck cascade reactions. RSC Adv 2016. [DOI: 10.1039/c6ra08751e] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023] Open
Abstract
A new multi-output PT-QSRR model to correlate and predict the enantioselectivity and yield of Heck–Heck cascade reactions has been developed.
Collapse
Affiliation(s)
- C. Blázquez-Barbadillo
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| | - E. Aranzamendi
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| | - E. Coya
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| | - E. Lete
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| | - N. Sotomayor
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| | - H. González-Díaz
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- 48080 Bilbao
- Spain
| |
Collapse
|
17
|
Dufresne Y, Noé L, Leclère V, Pupin M. Smiles2Monomers: a link between chemical and biological structures for polymers. J Cheminform 2015; 7:62. [PMID: 26715946 PMCID: PMC4693424 DOI: 10.1186/s13321-015-0111-5] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2015] [Accepted: 12/06/2015] [Indexed: 12/17/2022] Open
Abstract
Background The monomeric composition of polymers is powerful for structure comparison and synthetic biology, among others. Many databases give access to the atomic structure of compounds but the monomeric structure of polymers is often lacking. We have designed a smart algorithm, implemented in the tool Smiles2Monomers (s2m), to infer efficiently and accurately the monomeric structure of a polymer from its chemical structure. Results Our strategy is divided into two steps: first, monomers are mapped on the atomic structure by an efficient subgraph-isomorphism algorithm ; second, the best tiling is computed so that non-overlapping monomers cover all the structure of the target polymer. The mapping is based on a Markovian index built by a dynamic programming algorithm. The index enables s2m to search quickly all the given monomers on a target polymer. After, a greedy algorithm combines the mapped monomers into a consistent monomeric structure. Finally, a local branch and cut algorithm refines the structure. We tested this method on two manually annotated databases of polymers and reconstructed the structures de novo with a sensitivity over 90 %. The average computation time per polymer is 2 s. Conclusion s2m automatically creates de novo monomeric annotations for polymers, efficiently in terms of time computation and sensitivity. s2m allowed us to detect annotation errors in the tested databases and to easily find the accurate structures. So, s2m could be integrated into the curation process of databases of small compounds to verify the current entries and accelerate the annotation of new polymers. The full method can be downloaded or accessed via a website for peptide-like polymers at http://bioinfo.lifl.fr/norine/smiles2monomers.jsp.. ![]() Electronic supplementary material The online version of this article (doi:10.1186/s13321-015-0111-5) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Yoann Dufresne
- Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France
| | - Laurent Noé
- Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France
| | - Valérie Leclère
- Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France ; Univ. Lille, INRA, ISA, Univ. Artois, Univ. Littoral Côte d'Opale, EA 7394 - ICV - Institut Charles Viollette, 59000 Lille, France
| | - Maude Pupin
- Univ. Lille, CNRS, Centrale Lille, UMR 9189-CRIStAL-Centre de Recherche en Informatique Signal et Automatique de Lille, 59000 Lille, France ; Inria Lille Nord Europe, Bonsai team, Parc scientifique de la Haute Borne, 40 avenue Halley, 59650 Villeneuve d'Ascq, France
| |
Collapse
|
18
|
Messina PV, Besada-Porto JM, González-Díaz H, Ruso JM. Self-Assembled Binary Nanoscale Systems: Multioutput Model with LFER-Covariance Perturbation Theory and an Experimental-Computational Study of NaGDC-DDAB Micelles. LANGMUIR : THE ACS JOURNAL OF SURFACES AND COLLOIDS 2015; 31:12009-12018. [PMID: 26484726 DOI: 10.1021/acs.langmuir.5b03074] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/05/2023]
Abstract
Studies of the self-aggregation of binary systems are of both theoretical and practical importance. They provide an opportunity to investigate the influence of the molecular structure of the hydrophobe on the nonideality of mixing. On the other hand, linear free energy relationship (LFER) models, such as Hansch's equations, may be used to predict the properties of chemical compounds such as drugs or surfactants. However, the task becomes more difficult once we want to predict simultaneaously the effect over multiple output properties of binary systems of perturbations under multiple input experimental boundary conditions (b(j)). As a consequence, we need computational chemistry or chemoinformatics models that may help us to predict different properties of the autoaggregation process of mixed surfactants under multiple conditions. In this work, we have developed the first model that combines perturbation theory (PT) and LFER ideas. The model uses as input covariance PT operators (CPTOs). CPTOs are calculated as the difference between covariance ΔCov((i)μ(k)) functions before and after multiple perturbations in the binary system. In turn, covariances calculated as the product of two Box-Jenkins operators (BJO) operators. BJOs are used to measure the deviation of the structure of different chemical compounds from a set of molecules measured under a given subset of experimental conditions. The best CPT-LFER model found predicted the effects of 25,000 perturbations over 9 different properties of binary systems. We also reported experimental studies of different experimental properties of the binary system formed by sodium glycodeoxycholate and didodecyldimethylammonium bromide (NaGDC-DDAB). Last, we used our CPT-LFER model to carry out a 1000 data point simulation of the properties of the NaGDC-DDAB system under different conditions not studied experimentally.
Collapse
Affiliation(s)
- Paula V Messina
- Department of Chemistry, INQUISUR-CONICET, Universidad Nacional del Sur , 8000 Bahía Blanca, Argentina
| | - Jose Miguel Besada-Porto
- Soft Matter and Molecular Biophysics Group, Department of Applied Physics, University of Santiago de Compostela , Santiago de Compostela E-15782, Spain
| | - Humberto González-Díaz
- Department of Organic Chemistry II, Faculty of Science and Technology, University of the Basque Country UPV/EHU , 48940 Leioa, Spain
- IKERBASQUE, Basque Foundation for Science, 48011 Bilbao, Spain
| | - Juan M Ruso
- Soft Matter and Molecular Biophysics Group, Department of Applied Physics, University of Santiago de Compostela , Santiago de Compostela E-15782, Spain
| |
Collapse
|
19
|
Liu Y, Buendía-Rodríguez G, Peñuelas-Rívas CG, Tan Z, Rívas-Guevara M, Tenorio-Borroto E, Munteanu CR, Pazos A, González-Díaz H. Experimental and computational studies of fatty acid distribution networks. MOLECULAR BIOSYSTEMS 2015; 11:2964-77. [DOI: 10.1039/c5mb00325c] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
A new PT-LFER model is useful for predicting a distribution network in terms of specific fatty acid distribution.
Collapse
Affiliation(s)
- Yong Liu
- Faculty of Veterinary Medicine and Animal Science
- Autonomous University of the State of Mexico
- Toluca
- Mexico
- Key Laboratory of Subtropical Agro-ecological Engineering
| | - Germán Buendía-Rodríguez
- National Center for Disciplinary Research on Animal Physiology and Breeding
- National Institute of Forestry
- Agriculture and Livestock Research
- Queretaro
- Mexico
| | | | - Zhiliang Tan
- Key Laboratory of Subtropical Agro-ecological Engineering
- Institute of Subtropical Agriculture, the Chinese Academy of Sciences
- Changsha
- P. R. China
| | - María Rívas-Guevara
- Ethnobiology and Biodiversity Research Center
- Chapingo Autonomous University
- Texcoco
- Mexico
| | - Esvieta Tenorio-Borroto
- Faculty of Veterinary Medicine and Animal Science
- Autonomous University of the State of Mexico
- Toluca
- Mexico
| | | | | | - Humberto González-Díaz
- Department of Organic Chemistry II
- Faculty of Science and Technology
- University of the Basque Country UPV/EHU
- Leioa
- Spain
| |
Collapse
|
20
|
Hansen MR, Villar HO, Feyfant E. Development of an Informatics Platform for Therapeutic Protein and Peptide Analytics. J Chem Inf Model 2013; 53:2774-9. [DOI: 10.1021/ci400333x] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Affiliation(s)
- Mark R. Hansen
- Altoris, Inc., 7770 Regents Rd
#557, San Diego, California 92122, United States
| | - Hugo O. Villar
- Altoris, Inc., 7770 Regents Rd
#557, San Diego, California 92122, United States
| | - Eric Feyfant
- Aileron Therapeutics, 281 Albany
Street, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
21
|
Zhang T, Li H, Xi H, Stanton RV, Rotstein SH. HELM: a hierarchical notation language for complex biomolecule structure representation. J Chem Inf Model 2012; 52:2796-806. [PMID: 22947017 DOI: 10.1021/ci3001925] [Citation(s) in RCA: 35] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
When biological macromolecules are used as therapeutic agents, it is often necessary to introduce non-natural chemical modifications to improve their pharmaceutical properties. The final products are complex structures where entities such as proteins, peptides, oligonucleotides, and small molecule drugs may be covalently linked to each other, or may include chemically modified biological moieties. An accurate in silico representation of these complex structures is essential, as it forms the basis for their electronic registration, storage, analysis, and visualization. The size of these molecules (henceforth referred to as "biomolecules") often makes them too unwieldy and impractical to represent at the atomic level, while the presence of non-natural chemical modifications makes it impossible to represent them by sequence alone. Here we describe the Hierarchical Editing Language for Macromolecules ("HELM") and demonstrate its utility in the representation of structures such as antisense oligonucleotides, short interference RNAs, peptides, proteins, and antibody drug conjugates.
Collapse
Affiliation(s)
- Tianhong Zhang
- Pfizer Inc., 35 Cambridge Park Drive, Cambridge, Massachusetts 02140, USA.
| | | | | | | | | |
Collapse
|
22
|
Drefahl A. CurlySMILES: a chemical language to customize and annotate encodings of molecular and nanodevice structures. J Cheminform 2011; 3:1. [PMID: 21214931 PMCID: PMC3027187 DOI: 10.1186/1758-2946-3-1] [Citation(s) in RCA: 230] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2010] [Accepted: 01/07/2011] [Indexed: 11/17/2022] Open
Abstract
CurlySMILES is a chemical line notation which extends SMILES with annotations for storage, retrieval and modeling of interlinked, coordinated, assembled and adsorbed molecules in supramolecular structures and nanodevices. Annotations are enclosed in curly braces and anchored to an atomic node or at the end of the molecular graph depending on the annotation type. CurlySMILES includes predefined annotations for stereogenicity, electron delocalization charges, extra-molecular interactions and connectivity, surface attachment, solutions, and crystal structures and allows extensions for domain-specific annotations. CurlySMILES provides a shorthand format to encode molecules with repetitive substructural parts or motifs such as monomer units in macromolecules and amino acids in peptide chains. CurlySMILES further accommodates special formats for non-molecular materials that are commonly denoted by composition of atoms or substructures rather than complete atom connectivity.
Collapse
Affiliation(s)
- Axel Drefahl
- Axeleratio, 4330 Tuscany Circle, Reno, Nevada 89523, USA.
| |
Collapse
|
23
|
Jensen JH, Hoeg-Jensen T, Padkjær SB. Building a BioChemformatics Database. J Chem Inf Model 2008; 48:2404-13. [DOI: 10.1021/ci800128b] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Jan H. Jensen
- Scientific Computing, Novo Nordisk Park A2P, Novo Nordisk A/S, DK-2760 Maaloev, Denmark, Diabetes Protein and Peptide Chemistry, Novo Nordisk Park D6.1.142, Novo Nordisk A/S, DK-2760 Maeloev, Denmark, and Protein Structure and Biophysics, Novo Nordisk Park G8.2.78, Novo Nordisk A/S, DK-2760 Maeloev, Denmark
| | - Thomas Hoeg-Jensen
- Scientific Computing, Novo Nordisk Park A2P, Novo Nordisk A/S, DK-2760 Maaloev, Denmark, Diabetes Protein and Peptide Chemistry, Novo Nordisk Park D6.1.142, Novo Nordisk A/S, DK-2760 Maeloev, Denmark, and Protein Structure and Biophysics, Novo Nordisk Park G8.2.78, Novo Nordisk A/S, DK-2760 Maeloev, Denmark
| | - Søren B. Padkjær
- Scientific Computing, Novo Nordisk Park A2P, Novo Nordisk A/S, DK-2760 Maaloev, Denmark, Diabetes Protein and Peptide Chemistry, Novo Nordisk Park D6.1.142, Novo Nordisk A/S, DK-2760 Maeloev, Denmark, and Protein Structure and Biophysics, Novo Nordisk Park G8.2.78, Novo Nordisk A/S, DK-2760 Maeloev, Denmark
| |
Collapse
|
24
|
Maclean D, Martin EJ. On the Representation of Combinatorial Libraries. ACTA ACUST UNITED AC 2003; 6:1-11. [PMID: 14714976 DOI: 10.1021/cc0340325] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Derek Maclean
- PETNET Pharmaceuticals, 6140 Bristol Parkway, Culver City, California 90230, USA.
| | | |
Collapse
|
25
|
McSparron H, Blythe MJ, Zygouri C, Doytchinova IA, Flower DR. JenPep: a novel computational information resource for immunobiology and vaccinology. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2003; 43:1276-87. [PMID: 12870921 DOI: 10.1021/ci030461e] [Citation(s) in RCA: 60] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
JenPep is a relational database containing a compendium of thermodynamic binding data for the interaction of peptides with a range of important immunological molecules: the major histocompatibility complex, TAP transporter, and T cell receptor. The database also includes annotated lists of B cell and T cell epitopes. Version 2.0 of the database is implemented in a bespoke postgreSQL database system and is fully searchable online via a perl/HTML interface (URL: http://www.jenner.ac.uk/JenPep).
Collapse
Affiliation(s)
- Helen McSparron
- Edward Jenner Institute for Vaccine Research, Compton, Berkshire, UK RG20 7NN
| | | | | | | | | |
Collapse
|
26
|
Siani MA, Skillman AG, Carreras CW, Ashley G, Kuntz ID, Santi DV. Development and screening of a polyketide virtual library for drug leads against a motilide pharmacophore. J Mol Graph Model 2000; 18:497-511, 539-40. [PMID: 11143565 DOI: 10.1016/s1093-3263(00)00070-x] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
Abstract
A virtual library of macrocyclic polyketide molecules was generated and screened to identify novel, conformationally constrained potential motilin receptor agonists ("motilides"). A motilide pharmacophore model was generated from the potent 6,9-enol ether erythromycin and known derivatives from the literature. The pharmacophore for each molecular conformation was a point in a distance-volume space based on presentation of the putative binding moieties. Two methods, one fragment based method and the other reaction based, were explored for constructing the polyketide virtual library. First, a virtual library was assembled from monomeric fragments using the CHORTLES language. Second, the virtual library was assembled by the in silico application of all possible polyketide synthase enzyme reactions to generate the product library. Each library was converted to low-energy 3D conformations by distance geometry and standard minimization methods. The distance-volume metric was calculated for low-energy conformations of the members of the virtual polyketide library and screened against the enol ether pharmacophore. The goal was to identify novel macrocycles that satisfy the pharmacophore. We identified three conformationally constrained, novel polyketide series that have low-energy conformations satisfying the distance-volume constraints of the motilide pharmacophore.
Collapse
Affiliation(s)
- M A Siani
- Kosan Biosciences, 3832 Bay Center Place, Hayward, CA, USA.
| | | | | | | | | | | |
Collapse
|
27
|
Bone RGA, Firth MA, Sykes RA. SMILES Extensions for Pattern Matching and Molecular Transformations: Applications in Chemoinformatics. ACTA ACUST UNITED AC 1999. [DOI: 10.1021/ci990422w] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Richard G. A. Bone
- Proteus Molecular Design Ltd., Beechfield House, Lyme Green Business Park, Macclesfield, Cheshire, SK11 0JL, United Kingdom
| | - Michael A. Firth
- Proteus Molecular Design Ltd., Beechfield House, Lyme Green Business Park, Macclesfield, Cheshire, SK11 0JL, United Kingdom
| | - Richard A. Sykes
- Proteus Molecular Design Ltd., Beechfield House, Lyme Green Business Park, Macclesfield, Cheshire, SK11 0JL, United Kingdom
| |
Collapse
|
28
|
Lustig B, Arora S, Jernigan RL. RNA base-amino acid interaction strengths derived from structures and sequences. Nucleic Acids Res 1997; 25:2562-5. [PMID: 9185564 PMCID: PMC146793 DOI: 10.1093/nar/25.13.2562] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/04/2023] Open
Abstract
We investigate RNA base-amino acid interactions by counting their contacts in structures and their implicit contacts in various functional sequences where the structures can be assumed to be preserved. These frequencies are cast into equations to extract relative interaction energetics. Previously we used this approach in considering the major groove interactions of DNA, and here we apply it to the more diverse interactions observed in RNA. Structures considered are the three different tRNA synthetase complexes, the U1A spliceosomal protein with an RNA hairpin and the BIV TAR-Tat complex. We use binding data for the base frequencies for the seryl, aspartyl and glutaminyl tRNA-synthetase and U1 RNA-protein complexes. We compare with the previously reported DNA major groove peptide contacts the results for atoms of RNA bases, usually in the major groove. There are strong similarities between the rank orders of interacting bases in the DNA and the RNA cases. The apparent strongest RNA interaction observed is between arginine and guanine which was also one of the strongest DNA interactions. The similar data for base atomic interactions, whether base paired or not, support the importance of strong atomic interactions over local structure considerations, such as groove width and alpha-helicity.
Collapse
Affiliation(s)
- B Lustig
- Department of Chemistry, San Jose State University, San Jose, CA 95192, USA.
| | | | | |
Collapse
|
29
|
Felder ER, Poppinger D. Combinatorial compound libraries for enhanced drug discovery approaches. ADVANCES IN DRUG RESEARCH 1997. [DOI: 10.1016/s0065-2490(97)80006-8] [Citation(s) in RCA: 13] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/28/2023]
|
30
|
Shemetulskis NE, Weininger D, Blankley CJ, Yang JJ, Humblet C. Stigmata: an algorithm to determine structural commonalities in diverse datasets. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 1996; 36:862-71. [PMID: 8768771 DOI: 10.1021/ci950169+] [Citation(s) in RCA: 87] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/02/2023]
Abstract
An algorithm, Stigmata, is described, which extracts structural commonalities from chemical datasets. It is discussed using several illustrative examples and a pharmaceutically interesting set of dopamine D2 agonists. The commonalities are determined using two-dimensional topological chemical descriptions and are incorporated into the key feature of the algorithm, the modal fingerprint. Flexibility is built into the algorithm by means of a user-defined threshold value, which affects the information content of the modal fingerprint. The use of the modal fingerprint as a diversity assessment tool, as a database similarity query, and as a basis for color mapping the determined commonalities back onto the chemical structures is demonstrated.
Collapse
Affiliation(s)
- N E Shemetulskis
- Parke-Davis Pharmaceutical Research Division, Warner-Lambert Company, Ann Arbor, Michigan 48105, USA
| | | | | | | | | |
Collapse
|
31
|
Cosgrove DA, Kenny PW. BOOMSLANG: a program for combinatorial structure generation. JOURNAL OF MOLECULAR GRAPHICS 1996; 14:1-5, 23. [PMID: 8744566 DOI: 10.1016/0263-7855(95)00040-2] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
An approach to exploiting pharmacophore models is described. Structures are assembled combinatorially from user-defined fragments and flexibly overlaid into the reference frame of the pharmacophore using distance geometry and molecular mechanics. The match with the pharmacophore is quantified by conformational energy and volume of overlap.
Collapse
Affiliation(s)
- D A Cosgrove
- Zeneca Pharmaceuticals, Macclesfield, Cheshire, England
| | | |
Collapse
|
32
|
Murray CW, Clark DE, Byrne DG. PRO_LIGAND: an approach to de novo molecular design. 6. Flexible fitting in the design of peptides. J Comput Aided Mol Des 1995; 9:381-95. [PMID: 8594156 DOI: 10.1007/bf00123996] [Citation(s) in RCA: 15] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Abstract
This paper describes the further development of the functionality of our in-house de novo design program, PRO_LIGAND. In particular, attention is focused on the implementation and validation of the 'direct tweak' method for the construction of conformationally flexible molecules, such as peptides, from molecular fragments. This flexible fitting method is compared to the original method based on libraries of prestored conformations for each fragment. It is shown that the directed tweak method produces results of comparable quality, with significant time savings. By removing the need to generate a set of representative conformers for any new library fragment, the flexible fitting method increases the speed and simplicity with which new fragments can be included in a fragment library and also reduces the disk space required for library storage. A further improvement to the molecular construction process within PRO_LIGAND is the inclusion of a constrained minimisation procedure which relaxes fragments onto the design model and can be used to reject highly strained structures during the structure generation phase. This relaxation is shown to be very useful in simple test cases, but restricts diversity for more realistic examples. The advantages and disadvantages of these additions to the PRO_LIGAND methodology are illustrated by three examples: similar design to an alpha helix region of dihydrofolate reductase, complementary design to the active site of HIV-1 protease and similar design to an epitope region of lysozyme.
Collapse
Affiliation(s)
- C W Murray
- Proteus Molecular Design Ltd., Lyme Green Business Park, Macclesfield, Cheshire, U.K
| | | | | |
Collapse
|