1
|
Austin BK, Firooz A, Valafar H, Blenda AV. An Updated Overview of Existing Cancer Databases and Identified Needs. BIOLOGY 2023; 12:1152. [PMID: 37627037 PMCID: PMC10452211 DOI: 10.3390/biology12081152] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/30/2023] [Revised: 07/26/2023] [Accepted: 08/14/2023] [Indexed: 08/27/2023]
Abstract
Our search of existing cancer databases aimed to assess the current landscape and identify key needs. We analyzed 71 databases, focusing on genomics, proteomics, lipidomics, and glycomics. We found a lack of cancer-related lipidomic and glycomic databases, indicating a need for further development in these areas. Proteomic databases dedicated to cancer research were also limited. To assess overall progress, we included human non-cancer databases in proteomics, lipidomics, and glycomics for comparison. This provided insights into advancements in these fields over the past eight years. We also analyzed other types of cancer databases, such as clinical trial databases and web servers. Evaluating user-friendliness, we used the FAIRness principle to assess findability, accessibility, interoperability, and reusability. This ensured databases were easily accessible and usable. Our search summary highlights significant growth in cancer databases while identifying gaps and needs. These insights are valuable for researchers, clinicians, and database developers, guiding efforts to enhance accessibility, integration, and usability. Addressing these needs will support advancements in cancer research and benefit the wider cancer community.
Collapse
Affiliation(s)
- Brittany K. Austin
- Department of Biomedical Sciences, School of Medicine Greenville, University of South Carolina, Greenville, SC 29605, USA;
| | - Ali Firooz
- Department of Computer Science and Engineering, College of Engineering and Computing, University of South Carolina, Columbia, SC 29208, USA;
| | - Homayoun Valafar
- Department of Computer Science and Engineering, College of Engineering and Computing, University of South Carolina, Columbia, SC 29208, USA;
| | - Anna V. Blenda
- Department of Biomedical Sciences, School of Medicine Greenville, University of South Carolina, Greenville, SC 29605, USA;
- Prisma Health Cancer Institute, Prisma Health, Greenville, SC 29605, USA
| |
Collapse
|
2
|
Huang J, Jiang B, Liu M, Yang P, Cao W. gQuant, an Automated Tool for Quantitative Glycomic Data Analysis. Front Chem 2021; 9:707738. [PMID: 34395380 PMCID: PMC8355585 DOI: 10.3389/fchem.2021.707738] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2021] [Accepted: 07/15/2021] [Indexed: 11/25/2022] Open
Abstract
MALDI-MS-based glycan isotope labeling methods have been effectively and widely used for quantitative glycomics. However, interpretation of the data produced by MALDI-MS is inaccurate and tedious because the bioinformatic tools are inadequate. In this work, we present gQuant, an automated tool for MALDI-MS-based glycan isotope labeling data processing. gQuant was designed with a set of dedicated algorithms to improve the efficiency, accuracy and convenience of quantitation data processing. When tested on the reference data set, gQuant showed a fast processing speed, as it was able to search the glycan data of model glycoproteins in a few minutes and reported more results than the manual analysis did. The reported quantitation ratios matched well with the experimental glycan mixture ratios ranging from 1:10 to 10:1. In addition, gQuant is fully open-source and is coded in Python, which is supported by most operating systems, and it has a user-friendly interface. gQuant can be easily adapted by users for specific experimental designs, such as specific glycan databases, different derivatization types and relative quantitation designs and can thus facilitate fast glycomic quantitation for clinical sample analysis using MALDI-MS-based stable isotope labeling.
Collapse
Affiliation(s)
- Jiangming Huang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,Department of Chemistry, Fudan University, Shanghai, China
| | - Biyun Jiang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Mingqi Liu
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China
| | - Pengyuan Yang
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,Department of Chemistry, Fudan University, Shanghai, China
| | - Weiqian Cao
- The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, The International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Institutes of Biomedical Sciences, Fudan University, Shanghai, China.,NHC Key Laboratory of Glycoconjugates Research, Fudan University, Shanghai, China
| |
Collapse
|
3
|
Insights into Bioinformatic Applications for Glycosylation: Instigating an Awakening towards Applying Glycoinformatic Resources for Cancer Diagnosis and Therapy. Int J Mol Sci 2020; 21:ijms21249336. [PMID: 33302373 PMCID: PMC7762546 DOI: 10.3390/ijms21249336] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 11/26/2020] [Accepted: 12/01/2020] [Indexed: 01/10/2023] Open
Abstract
Glycosylation plays a crucial role in various diseases and their etiology. This has led to a clear understanding on the functions of carbohydrates in cell communication, which eventually will result in novel therapeutic approaches for treatment of various disease. Glycomics has now become one among the top ten technologies that will change the future. The direct implication of glycosylation as a hallmark of cancer and for cancer therapy is well established. As in proteomics, where bioinformatics tools have led to revolutionary achievements, bioinformatics resources for glycosylation have improved its practical implication. Bioinformatics tools, algorithms and databases are a mandatory requirement to manage and successfully analyze large amount of glycobiological data generated from glycosylation studies. This review consolidates all the available tools and their applications in glycosylation research. The achievements made through the use of bioinformatics into glycosylation studies are also presented. The importance of glycosylation in cancer diagnosis and therapy is discussed and the gap in the application of widely available glyco-informatic tools for cancer research is highlighted. This review is expected to bring an awakening amongst glyco-informaticians as well as cancer biologists to bridge this gap, to exploit the available glyco-informatic tools for cancer.
Collapse
|
4
|
Scherbinina SI, Toukach PV. Three-Dimensional Structures of Carbohydrates and Where to Find Them. Int J Mol Sci 2020; 21:E7702. [PMID: 33081008 PMCID: PMC7593929 DOI: 10.3390/ijms21207702] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023] Open
Abstract
Analysis and systematization of accumulated data on carbohydrate structural diversity is a subject of great interest for structural glycobiology. Despite being a challenging task, development of computational methods for efficient treatment and management of spatial (3D) structural features of carbohydrates breaks new ground in modern glycoscience. This review is dedicated to approaches of chemo- and glyco-informatics towards 3D structural data generation, deposition and processing in regard to carbohydrates and their derivatives. Databases, molecular modeling and experimental data validation services, and structure visualization facilities developed for last five years are reviewed.
Collapse
Affiliation(s)
- Sofya I. Scherbinina
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
- Higher Chemical College, D. Mendeleev University of Chemical Technology of Russia, Miusskaya Square 9, 125047 Moscow, Russia
| | - Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
| |
Collapse
|
5
|
Adua E, Russell A, Roberts P, Wang Y, Song M, Wang W. Innovation Analysis on Postgenomic Biomarkers: Glycomics for Chronic Diseases. ACTA ACUST UNITED AC 2017; 21:183-196. [DOI: 10.1089/omi.2017.0035] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/20/2023]
Affiliation(s)
- Eric Adua
- School of Medical and Health Sciences, Edith Cowan University, Perth, Australia
| | - Alyce Russell
- School of Medical and Health Sciences, Edith Cowan University, Perth, Australia
| | - Peter Roberts
- School of Medical and Health Sciences, Edith Cowan University, Perth, Australia
| | - Youxin Wang
- Beijing Key Laboratory of Clinical Epidemiology, School of Public Health, Capital Medical University, Beijing, China
| | - Manshu Song
- Beijing Key Laboratory of Clinical Epidemiology, School of Public Health, Capital Medical University, Beijing, China
| | - Wei Wang
- School of Medical and Health Sciences, Edith Cowan University, Perth, Australia
| |
Collapse
|
6
|
Ricard-Blum S, Lisacek F. Glycosaminoglycanomics: where we are. Glycoconj J 2016; 34:339-349. [PMID: 27900575 DOI: 10.1007/s10719-016-9747-2] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2016] [Revised: 10/28/2016] [Accepted: 11/01/2016] [Indexed: 01/21/2023]
Abstract
Glycosaminoglycans regulate numerous physiopathological processes such as development, angiogenesis, innate immunity, cancer and neurodegenerative diseases. Cell surface GAGs are involved in cell-cell and cell-matrix interactions, cell adhesion and signaling, and host-pathogen interactions. GAGs contribute to the assembly of the extracellular matrix and heparan sulfate chains are able to sequester growth factors in the ECM. Their biological activities are regulated by their interactions with proteins. The structural heterogeneity of GAGs, mostly due to chemical modifications occurring during and after their synthesis, makes the development of analytical techniques for their profiling in cells, tissues, and biological fluids, and of computational tools for mining GAG-protein interaction data very challenging. We give here an overview of the experimental approaches used in glycosaminoglycomics, of the major GAG-protein interactomes characterized so far, and of the computational tools and databases available to analyze and store GAG structures and interactions.
Collapse
Affiliation(s)
- Sylvie Ricard-Blum
- Institut de Chimie et Biochimie Moléculaires et Supramoléculaires, UMR 5246 CNRS - Université Lyon 1, INSA Lyon, CPE Lyon, 69622, Villeurbanne Cedex, France.
| | - Frédérique Lisacek
- SIB Swiss Institute of Bioinformatics, 1 Rue Michel-Servet, 1211, Geneva, Switzerland.,Computer Science Department, University of Geneva, Geneva, Switzerland
| |
Collapse
|
7
|
Arroyuelo A, Vila JA, Martin OA. Azahar: a PyMOL plugin for construction, visualization and analysis of glycan molecules. J Comput Aided Mol Des 2016; 30:619-24. [DOI: 10.1007/s10822-016-9944-x] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2016] [Accepted: 08/17/2016] [Indexed: 10/21/2022]
|
8
|
|
9
|
A review of methods for interpretation of glycopeptide tandem mass spectral data. Glycoconj J 2015; 33:285-96. [PMID: 26612686 DOI: 10.1007/s10719-015-9633-3] [Citation(s) in RCA: 62] [Impact Index Per Article: 6.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Revised: 10/13/2015] [Accepted: 10/21/2015] [Indexed: 12/25/2022]
Abstract
Despite the publication of several software tools for analysis of glycopeptide tandem mass spectra, there remains a lack of consensus regarding the most effective and appropriate methods. In part, this reflects problems with applying standard methods for proteomics database searching and false discovery rate calculation. While the analysis of small post-translational modifications (PTMs) may be regarded as an extension of proteomics database searching, glycosylation requires specialized approaches. This is because glycans are large and heterogeneous by nature, causing glycopeptides to exist as multiple glycosylated variants. Thus, the mass of the peptide cannot be calculated directly from that of the intact glycopeptide. In addition, the chemical nature of the glycan strongly influences product ion patterns observed for glycopeptides. As a result, glycopeptidomics requires specialized bioinformatics methods. We summarize the recent progress towards a consensus for effective glycopeptide tandem mass spectrometric analysis.
Collapse
|
10
|
Abstract
Over the last two decades, several carbohydrate structure databases have been developed and made publicly available by different research groups around the world. This led to the fragmentation of information about carbohydrate structures into different resources that have no or only weak interaction with each other. GlycomeDB was developed to integrate the carbohydrate structures from different resources by generating a single-indexed catalog of these structures that associates each structure with its reference in the original resources. GlycomeDB facilitates searching for carbohydrate structures in all the integrated resources by eliminating the need to use several different search interfaces and manually integrating the results. References provided by GlycomeDB make it possible to retrieve information that is beyond the scope of GlycomeDB but present in the integrated databases. This chapter illustrates the use of the GlycomeDB search interfaces and web services by way of three example cases.
Collapse
Affiliation(s)
- René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA, 30602-4712, USA,
| | | |
Collapse
|
11
|
Struwe WB, Pagel K, Benesch JLP, Harvey DJ, Campbell MP. GlycoMob: an ion mobility-mass spectrometry collision cross section database for glycomics. Glycoconj J 2015; 33:399-404. [PMID: 26314736 DOI: 10.1007/s10719-015-9613-7] [Citation(s) in RCA: 61] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2015] [Revised: 07/21/2015] [Accepted: 07/27/2015] [Indexed: 12/29/2022]
Abstract
Ion mobility mass spectrometry (IM-MS) is a promising analytical technique for glycomics that separates glycan ions based on their collision cross section (CCS) and provides glycan precursor and fragment masses. It has been shown that isomeric oligosaccharide species can be separated by IM and identified on basis of their CCS and fragmentation. These results indicate that adding CCSs information for glycans and glycan fragments to searchable databases and analysis pipelines will increase identification confidence and accuracy. We have developed a freely accessible database, GlycoMob ( http://www.glycomob.org ), containing over 900 CCSs values of glycans, oligosaccharide standards and their fragments that will be continually updated. We have measured the absolute CCSs of calibration standards, biologically derived and synthetic N-glycans ionized with various adducts in positive and negative mode or as protonated (positive ion) and deprotonated (negative ion) ions.
Collapse
Affiliation(s)
- Weston B Struwe
- Department of Chemistry, University of Oxford, Oxford, OX1 3QZ, UK.
| | - Kevin Pagel
- Free University Chemistry, Institute of Chemistry and Biochemistry, Takustrasse 3, 14195, Berlin, Germany.,Department of Molecular Physics, Fritz Haber Institute of the Max Planck Society, 14195, Berlin, Germany
| | | | - David J Harvey
- Oxford Glycobiology Institute, Department of Biochemistry, University of Oxford, Oxford, OX1 3QU, UK
| | - Matthew P Campbell
- Biomolecular Frontiers Research Centre, Macquarie University, Sydney, NSW 2109, Australia.
| |
Collapse
|
12
|
Harvey DJ. Analysis of carbohydrates and glycoconjugates by matrix-assisted laser desorption/ionization mass spectrometry: an update for 2009-2010. MASS SPECTROMETRY REVIEWS 2015; 34:268-422. [PMID: 24863367 PMCID: PMC7168572 DOI: 10.1002/mas.21411] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 03/12/2013] [Revised: 07/16/2013] [Accepted: 07/16/2013] [Indexed: 05/07/2023]
Abstract
This review is the sixth update of the original article published in 1999 on the application of MALDI mass spectrometry to the analysis of carbohydrates and glycoconjugates and brings coverage of the literature to the end of 2010. General aspects such as theory of the MALDI process, matrices, derivatization, MALDI imaging, arrays and fragmentation are covered in the first part of the review and applications to various structural typed constitutes the remainder. The main groups of compound that are discussed in this section are oligo and polysaccharides, glycoproteins, glycolipids, glycosides and biopharmaceuticals. Many of these applications are presented in tabular form. Also discussed are medical and industrial applications of the technique, studies of enzyme reactions and applications to chemical synthesis.
Collapse
Affiliation(s)
- David J. Harvey
- Department of BiochemistryOxford Glycobiology InstituteUniversity of OxfordOxfordOX1 3QUUK
| |
Collapse
|
13
|
Ranzinger R, Aoki-Kinoshita KF, Campbell MP, Kawano S, Lütteke T, Okuda S, Shinmachi D, Shikanai T, Sawaki H, Toukach P, Matsubara M, Yamada I, Narimatsu H. GlycoRDF: an ontology to standardize glycomics data in RDF. Bioinformatics 2015; 31:919-25. [PMID: 25388145 PMCID: PMC4380026 DOI: 10.1093/bioinformatics/btu732] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Revised: 10/12/2014] [Accepted: 10/28/2014] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).
Collapse
Affiliation(s)
- Rene Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Kiyoko F Aoki-Kinoshita
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Matthew P Campbell
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Shin Kawano
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Thomas Lütteke
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Shujiro Okuda
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Daisuke Shinmachi
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Toshihide Shikanai
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Hiromichi Sawaki
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Philip Toukach
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Masaaki Matsubara
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Issaku Yamada
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Hisashi Narimatsu
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| |
Collapse
|
14
|
Toukach PV, Egorova KS. Bacterial, plant, and fungal carbohydrate structure databases: daily usage. Methods Mol Biol 2015; 1273:55-85. [PMID: 25753703 DOI: 10.1007/978-1-4939-2343-4_5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Natural carbohydrates play important roles in living systems and therefore are used as diagnostic and therapeutic targets. The main goal of glycomics is systematization of carbohydrates and elucidation of their role in human health and disease. The amount of information on natural carbohydrates accumulates rapidly, but scientists still lack databases and computer-assisted tools needed for orientation in the glycomic information space. Therefore, freely available, regularly updated, and cross-linked databases are demanded. Bacterial Carbohydrate Structure Database (Bacterial CSDB) was developed for provision of structural, bibliographic, taxonomic, NMR spectroscopic, and other related information on bacterial and archaeal carbohydrate structures. Its main features are (1) coverage above 90%, (2) high data consistence (above 90% of error-free records), and (3) presence of manually verified bibliographic, NMR spectroscopic, and taxonomic annotations. Recently, CSDB has been expanded to cover carbohydrates of plant and fungal origin. The achievement of full coverage in the plant and fungal domains is expected in the future. CSDB is freely available on the Internet as a web service at http://csdb.glycoscience.ru. This chapter aims at showing how to use CSDB in your daily scientific practice.
Collapse
Affiliation(s)
- Philip V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow, 119991, Russia,
| | | |
Collapse
|
15
|
Venkataraman M, Sasisekharan R, Raman R. Glycan array data management at Consortium for Functional Glycomics. Methods Mol Biol 2015; 1273:181-90. [PMID: 25753711 DOI: 10.1007/978-1-4939-2343-4_13] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/01/2023]
Abstract
Glycomics or the study of structure-function relationships of complex glycans has reshaped post-genomics biology. Glycans mediate fundamental biological functions via their specific interactions with a variety of proteins. Recognizing the importance of glycomics, large-scale research initiatives such as the Consortium for Functional Glycomics (CFG) were established to address these challenges. Over the past decade, the Consortium for Functional Glycomics (CFG) has generated novel reagents and technologies for glycomics analyses, which in turn have led to generation of diverse datasets. These datasets have contributed to understanding glycan diversity and structure-function relationships at molecular (glycan-protein interactions), cellular (gene expression and glycan analysis), and whole organism (mouse phenotyping) levels. Among these analyses and datasets, screening of glycan-protein interactions on glycan array platforms has gained much prominence and has contributed to cross-disciplinary realization of the importance of glycomics in areas such as immunology, infectious diseases, cancer biomarkers, etc. This manuscript outlines methodologies for capturing data from glycan array experiments and online tools to access and visualize glycan array data implemented at the CFG.
Collapse
Affiliation(s)
- Maha Venkataraman
- Department of Biological Engineering, Koch Institute of Integrative Cancer Research, Massachusetts Institute of Technology, MIT Building 76, Room 158, 500 Main Street, Cambridge, MA, 02139, USA
| | | | | |
Collapse
|
16
|
Maeda M, Fujita N, Suzuki Y, Sawaki H, Shikanai T, Narimatsu H. JCGGDB: Japan Consortium for Glycobiology and Glycotechnology Database. Methods Mol Biol 2015; 1273:161-179. [PMID: 25753710 DOI: 10.1007/978-1-4939-2343-4_12] [Citation(s) in RCA: 22] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The biological significance of glycans has been widely studied and reported in the past. However, most achievements of our predecessors are not readily available in existing databases. JCGGDB is a meta-database involving 15 original databases in AIST and 5 cooperative databases in alliance with JCGG: Japan Consortium for Glycobiology and Glycotechnology. It centers on a glycan structure database and accumulates information such as glycan preferences of lectins, glycosylation sites in proteins, and genes related to glycan syntheses from glycoscience and related fields. This chapter illustrates how to use three major search interfaces (Keyword Search, Structure Search, and GlycoChem Explorer) available in JCGGDB to search across multiple databases.
Collapse
Affiliation(s)
- Masako Maeda
- Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology (AIST), Central 2, 1-1-1 Umezono, Tsukuba, Ibaraki, 305-8568, Japan
| | | | | | | | | | | |
Collapse
|
17
|
Eavenson M, Kochut KJ, Miller JA, Ranzinger R, Tiemeyer M, Aoki K, York WS. Qrator: a web-based curation tool for glycan structures. Glycobiology 2014; 25:66-73. [PMID: 25165068 DOI: 10.1093/glycob/cwu090] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
Most currently available glycan structure databases use their own proprietary structure representation schema and contain numerous annotation errors. These cause problems when glycan databases are used for the annotation or mining of data generated in the laboratory. Due to the complexity of glycan structures, curating these databases is often a tedious and labor-intensive process. However, rigorously validating glycan structures can be made easier with a curation workflow that incorporates a structure-matching algorithm that compares candidate glycans to a canonical tree that embodies structural features consistent with established mechanisms for the biosynthesis of a particular class of glycans. To this end, we have implemented Qrator, a web-based application that uses a combination of external literature and database references, user annotations and canonical trees to assist and guide researchers in making informed decisions while curating glycans. Using this application, we have started the curation of large numbers of N-glycans, O-glycans and glycosphingolipids. Our curation workflow allows creating and extending canonical trees for these classes of glycans, which have subsequently been used to improve the curation workflow.
Collapse
Affiliation(s)
| | | | | | - René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602-7404, USA
| | - Michael Tiemeyer
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602-7404, USA
| | - Kazuhiro Aoki
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602-7404, USA
| | - William S York
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA 30602-7404, USA
| |
Collapse
|
18
|
Chandler KB, Brnakova Z, Sanda M, Wang S, Stalnaker SH, Bridger R, Zhao P, Wells L, Edwards NJ, Goldman R. Site-specific glycan microheterogeneity of inter-alpha-trypsin inhibitor heavy chain H4. J Proteome Res 2014; 13:3314-29. [PMID: 24884609 PMCID: PMC4084840 DOI: 10.1021/pr500394z] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
Inter-alpha-trypsin inhibitor heavy chain H4 (ITIH4) is a 120 kDa acute-phase glycoprotein produced primarily in the liver, secreted into the blood, and identified in serum. ITIH4 is involved in liver development and stabilization of the extracellular matrix (ECM), and its expression is altered in liver disease. In this study, we aimed to characterize glycosylation of recombinant and serum-derived ITIH4 using analytical mass spectrometry. Recombinant ITIH4 was analyzed to optimize glycopeptide analyses, followed by serum-derived ITIH4. First, we confirmed that the four ITIH4 N-X-S/T sequons (N81, N207, N517, and N577) were glycosylated by treating ITIH4 tryptic/GluC glycopeptides with PNGaseF in the presence of (18)O water. Next, we performed glycosidase-assisted LC-MS/MS analysis of ITIH4 trypsin-GluC glycopeptides enriched via hydrophilic interaction liquid chromatography to characterize ITIH4 N-glycoforms. While microheterogeneity of N-glycoforms differed between ITIH4 protein expressed in HEK293 cells and protein isolated from serum, occupancy of N-glycosylation sites did not differ. A fifth N-glycosylation site was discovered at N274 with the rare nonconsensus NVV motif. Site N274 contained high-mannose N-linked glycans in both serum and recombinant ITIH4. We also identified isoform-specific ITIH4 O-glycoforms and documented that utilization of O-glycosylation sites on ITIH4 differed between the cell line and serum.
Collapse
Affiliation(s)
- Kevin Brown Chandler
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University , Washington, D.C. 20057, United States
| | | | | | | | | | | | | | | | | | | |
Collapse
|
19
|
Katayama T, Wilkinson MD, Aoki-Kinoshita KF, Kawashima S, Yamamoto Y, Yamaguchi A, Okamoto S, Kawano S, Kim JD, Wang Y, Wu H, Kano Y, Ono H, Bono H, Kocbek S, Aerts J, Akune Y, Antezana E, Arakawa K, Aranda B, Baran J, Bolleman J, Bonnal RJ, Buttigieg PL, Campbell MP, Chen YA, Chiba H, Cock PJ, Cohen KB, Constantin A, Duck G, Dumontier M, Fujisawa T, Fujiwara T, Goto N, Hoehndorf R, Igarashi Y, Itaya H, Ito M, Iwasaki W, Kalaš M, Katoda T, Kim T, Kokubu A, Komiyama Y, Kotera M, Laibe C, Lapp H, Lütteke T, Marshall MS, Mori T, Mori H, Morita M, Murakami K, Nakao M, Narimatsu H, Nishide H, Nishimura Y, Nystrom-Persson J, Ogishima S, Okamura Y, Okuda S, Oshita K, Packer NH, Prins P, Ranzinger R, Rocca-Serra P, Sansone S, Sawaki H, Shin SH, Splendiani A, Strozzi F, Tadaka S, Toukach P, Uchiyama I, Umezaki M, Vos R, Whetzel PL, Yamada I, Yamasaki C, Yamashita R, York WS, Zmasek CM, Kawamoto S, Takagi T. BioHackathon series in 2011 and 2012: penetration of ontology and linked data in life science domains. J Biomed Semantics 2014; 5:5. [PMID: 24495517 PMCID: PMC3978116 DOI: 10.1186/2041-1480-5-5] [Citation(s) in RCA: 43] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2013] [Accepted: 11/26/2013] [Indexed: 01/24/2023] Open
Abstract
The application of semantic technologies to the integration of biological data and the interoperability of bioinformatics analysis and visualization tools has been the common theme of a series of annual BioHackathons hosted in Japan for the past five years. Here we provide a review of the activities and outcomes from the BioHackathons held in 2011 in Kyoto and 2012 in Toyama. In order to efficiently implement semantic technologies in the life sciences, participants formed various sub-groups and worked on the following topics: Resource Description Framework (RDF) models for specific domains, text mining of the literature, ontology development, essential metadata for biological databases, platforms to enable efficient Semantic Web technology development and interoperability, and the development of applications for Semantic Web data. In this review, we briefly introduce the themes covered by these sub-groups. The observations made, conclusions drawn, and software development projects that emerged from these activities are discussed.
Collapse
Affiliation(s)
- Toshiaki Katayama
- Database Center for Life Science, Research Organization of Information and Systems, 2-11-16, Yayoi, Bunkyo-ku, Tokyo 113-0032, Japan.
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
20
|
Campbell MP, Ranzinger R, Lütteke T, Mariethoz J, Hayes CA, Zhang J, Akune Y, Aoki-Kinoshita KF, Damerell D, Carta G, York WS, Haslam SM, Narimatsu H, Rudd PM, Karlsson NG, Packer NH, Lisacek F. Toolboxes for a standardised and systematic study of glycans. BMC Bioinformatics 2014; 15 Suppl 1:S9. [PMID: 24564482 PMCID: PMC4016020 DOI: 10.1186/1471-2105-15-s1-s9] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Background Recent progress in method development for characterising the branched structures of complex carbohydrates has now enabled higher throughput technology. Automation of structure analysis then calls for software development since adding meaning to large data collections in reasonable time requires corresponding bioinformatics methods and tools. Current glycobioinformatics resources do cover information on the structure and function of glycans, their interaction with proteins or their enzymatic synthesis. However, this information is partial, scattered and often difficult to find to for non-glycobiologists. Methods Following our diagnosis of the causes of the slow development of glycobioinformatics, we review the "objective" difficulties encountered in defining adequate formats for representing complex entities and developing efficient analysis software. Results Various solutions already implemented and strategies defined to bridge glycobiology with different fields and integrate the heterogeneous glyco-related information are presented. Conclusions Despite the initial stage of our integrative efforts, this paper highlights the rapid expansion of glycomics, the validity of existing resources and the bright future of glycobioinformatics.
Collapse
|
21
|
Wang Z, Chinoy ZS, Ambre SG, Peng W, McBride R, de Vries RP, Glushka J, Paulson JC, Boons GJ. A general strategy for the chemoenzymatic synthesis of asymmetrically branched N-glycans. Science 2013; 341:379-83. [PMID: 23888036 DOI: 10.1126/science.1236231] [Citation(s) in RCA: 257] [Impact Index Per Article: 23.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
A systematic, efficient means of producing diverse libraries of asymmetrically branched N-glycans is needed to investigate the specificities and biology of glycan-binding proteins. To that end, we describe a core pentasaccharide that at potential branching positions is modified by orthogonal protecting groups to allow selective attachment of specific saccharide moieties by chemical glycosylation. The appendages were selected so that the antenna of the resulting deprotected compounds could be selectively extended by glycosyltransferases to give libraries of asymmetrical multi-antennary glycans. The power of the methodology was demonstrated by the preparation of a series of complex oligosaccharides that were printed as microarrays and screened for binding to lectins and influenza-virus hemagglutinins, which showed that recognition is modulated by presentation of minimal epitopes in the context of complex N-glycans.
Collapse
Affiliation(s)
- Zhen Wang
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA 30602, USA
| | | | | | | | | | | | | | | | | |
Collapse
|
22
|
Chandler KB, Pompach P, Goldman R, Edwards N. Exploring site-specific N-glycosylation microheterogeneity of haptoglobin using glycopeptide CID tandem mass spectra and glycan database search. J Proteome Res 2013; 12:3652-66. [PMID: 23829323 DOI: 10.1021/pr400196s] [Citation(s) in RCA: 90] [Impact Index Per Article: 8.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Glycosylation is a common protein modification with a significant role in many vital cellular processes and human diseases, making the characterization of protein-attached glycan structures important for understanding cell biology and disease processes. Direct analysis of protein N-glycosylation by tandem mass spectrometry of glycopeptides promises site-specific elucidation of N-glycan microheterogeneity, something that detached N-glycan and deglycosylated peptide analyses cannot provide. However, successful implementation of direct N-glycopeptide analysis by tandem mass spectrometry remains a challenge. In this work, we consider algorithmic techniques for the analysis of LC-MS/MS data acquired from glycopeptide-enriched fractions of enzymatic digests of purified proteins. We implement a computational strategy that takes advantage of the properties of CID fragmentation spectra of N-glycopeptides, matching the MS/MS spectra to peptide-glycan pairs from protein sequences and glycan structure databases. Significantly, we also propose a novel false discovery rate estimation technique to estimate and manage the number of false identifications. We use a human glycoprotein standard, haptoglobin, digested with trypsin and GluC, enriched for glycopeptides using HILIC chromatography, and analyzed by LC-MS/MS to demonstrate our algorithmic strategy and evaluate its performance. Our software, GlycoPeptideSearch (GPS), assigned glycopeptide identifications to 246 of the spectra at a false discovery rate of 5.58%, identifying 42 distinct haptoglobin peptide-glycan pairs at each of the four haptoglobin N-linked glycosylation sites. We further demonstrate the effectiveness of this approach by analyzing plasma-derived haptoglobin, identifying 136 N-linked glycopeptide spectra at a false discovery rate of 0.4%, representing 15 distinct glycopeptides on at least three of the four N-linked glycosylation sites. The software, GlycoPeptideSearch, is available for download from http://edwardslab.bmcb.georgetown.edu/GPS .
Collapse
Affiliation(s)
- Kevin Brown Chandler
- Department of Biochemistry and Molecular & Cellular Biology, Georgetown University Medical Center, Washington, DC 20007, USA
| | | | | | | |
Collapse
|
23
|
Aoki-Kinoshita KF, Sawaki H, An HJ, Cho JW, Hsu D, Kato M, Kawano S, Kawasaki T, Khoo KH, Kim J, Kim JD, Li X, Lütteke T, Okuda S, Packer NH, Paulson JC, Raman R, Ranzinger R, Shen H, Shikanai T, Yamada I, Yang P, Yamaguchi Y, Ying W, Yoo JS, Zhang Y, Narimatsu H. The Third ACGG-DB Meeting Report: Towards an international collaborative infrastructure for glycobioinformatics. Glycobiology 2013; 23:144-6. [PMID: 23271684 DOI: 10.1093/glycob/cws167] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
|
24
|
Abstract
Protein glycosylation is a highly complex and regulated posttranslational modification. In this process several glycosyltransferase families are involved. In cancer this delicate equilibrium is disrupted leading to glycosylation changes on glycoconjugates, namely, glycoproteins. One of the major consequences is the increase of sialylated oligosaccharide chains in glycoproteins. Here we describe an experimental methodology focused in the enrichment and characterization of sialic acid containing glycopeptides by MALDI mass spectrometry and the subsequent data analysis.
Collapse
Affiliation(s)
- Hugo Osório
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| | | |
Collapse
|
25
|
Lütteke T. The use of glycoinformatics in glycochemistry. Beilstein J Org Chem 2012; 8:915-29. [PMID: 23015842 PMCID: PMC3388882 DOI: 10.3762/bjoc.8.104] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/29/2012] [Indexed: 01/10/2023] Open
Abstract
Glycoinformatics is a small but growing branch of bioinformatics and chemoinformatics. Various resources are now available that can be of use to glycobiologists, but also to chemists who work on the synthesis or analysis of carbohydrates. This article gives an overview of existing glyco-specific databases and tools, with a focus on their application to glycochemistry: Databases can provide information on candidate glycan structures for synthesis, or on glyco-enzymes that can be used to synthesize carbohydrates. Statistical analyses of glycan databases help to plan glycan synthesis experiments. 3D-Structural data of protein-carbohydrate complexes are used in targeted drug design, and tools to support glycan structure analysis aid with quality control. Specific problems of glycoinformatics compared to bioinformatics for genomics or proteomics, especially concerning integration and long-term maintenance of the existing glycan databases, are also discussed.
Collapse
Affiliation(s)
- Thomas Lütteke
- Justus-Liebig-University Gießen, Institute of Veterinary Physiology and Biochemistry, Frankfurter Str. 100, 35392 Gießen, Germany
| |
Collapse
|
26
|
Abstract
Carbohydrate libraries printed in glycan micorarray format have had a great impact on the high-throughput analysis of the specificity of a wide range of mammalian, plant, and bacterial lectins. Chemical and chemo-enzymatic synthesis allows the construction of diverse glycan libraries but requires substantial effort and resources. To leverage the synthetic effort, the ideal library would be a minimal subset of all structures that provides optimal diversity. Therefore, a measure of library diversity is needed. To this end, we developed a linear representation of glycans using standard chemoinformatic tools. This representation was applied to measure pairwise similarity and consequently diversity of glycan libraries in a single value. The diversities of four existing sialoside glycan arrays were compared. More diverse arrays are proposed reducing the number of glycans. This algorithm can be applied to diverse aspects of library design from target structure selection to the choice of building blocks for their synthesis.
Collapse
Affiliation(s)
- Christoph Rademacher
- Department of Chemical Physiology, The Scripps Research Institute , 10550 North Torrey Pines Road, La Jolla, CA 92037, USA.
| | | |
Collapse
|
27
|
Recent advances in the analysis of carbohydrates for biomedical use. J Pharm Biomed Anal 2011; 55:702-27. [DOI: 10.1016/j.jpba.2011.02.003] [Citation(s) in RCA: 30] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2010] [Revised: 02/03/2011] [Accepted: 02/04/2011] [Indexed: 02/06/2023]
|
28
|
Visvanathan M, Siddam SR, Lee IH, Lushington GH, Bousfield GR. GlycomicsDB - A Data Integration Platform for Glycans and their Strucutres. Open Med Inform J 2011; 5:9-16. [PMID: 21603090 PMCID: PMC3098536 DOI: 10.2174/1874431101105010009] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2010] [Revised: 03/11/2011] [Accepted: 03/14/2011] [Indexed: 12/05/2022] Open
Abstract
Glycomics is a discipline of biology that deals with the structure and function of glycans (or carbohydrates). Analytical techniques such as mass spectrometry (MS) and nuclear magnetic resonance (NMR) are having a significant impact on the field of glycomics. However, effective progress in glycomics research requires collaboration between laboratories to share experimental data, structural information of glycans, and simulation results. Herein we report the development of a web-based data management system that can incorporate large volumes of data from disparate sources and organize them into a uniform format for users to store and access. This system enables participating laboratories to set up a shared data repository which members of interdisciplinary teams can access. The system is able to manage and share raw MS data and structural information of glycans. The database is available at http://www.glycomics.bcf.ku.edu
Collapse
Affiliation(s)
- Mahesh Visvanathan
- Bioinformatics Core Facility, University of Kansas, Lawrence, KS 66047, USA
| | | | | | | | | |
Collapse
|
29
|
von der Lieth CW, Freire AA, Blank D, Campbell MP, Ceroni A, Damerell DR, Dell A, Dwek RA, Ernst B, Fogh R, Frank M, Geyer H, Geyer R, Harrison MJ, Henrick K, Herget S, Hull WE, Ionides J, Joshi HJ, Kamerling JP, Leeflang BR, Lütteke T, Lundborg M, Maass K, Merry A, Ranzinger R, Rosen J, Royle L, Rudd PM, Schloissnig S, Stenutz R, Vranken WF, Widmalm G, Haslam SM. EUROCarbDB: An open-access platform for glycoinformatics. Glycobiology 2011; 21:493-502. [PMID: 21106561 PMCID: PMC3055595 DOI: 10.1093/glycob/cwq188] [Citation(s) in RCA: 104] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2010] [Revised: 11/03/2010] [Accepted: 11/03/2010] [Indexed: 01/03/2023] Open
Abstract
The EUROCarbDB project is a design study for a technical framework, which provides sophisticated, freely accessible, open-source informatics tools and databases to support glycobiology and glycomic research. EUROCarbDB is a relational database containing glycan structures, their biological context and, when available, primary and interpreted analytical data from high-performance liquid chromatography, mass spectrometry and nuclear magnetic resonance experiments. Database content can be accessed via a web-based user interface. The database is complemented by a suite of glycoinformatics tools, specifically designed to assist the elucidation and submission of glycan structure and experimental data when used in conjunction with contemporary carbohydrate research workflows. All software tools and source code are licensed under the terms of the Lesser General Public License, and publicly contributed structures and data are freely accessible. The public test version of the web interface to the EUROCarbDB can be found at http://www.ebi.ac.uk/eurocarb.
Collapse
Affiliation(s)
| | - Ana Ardá Freire
- Bijvoet-Center for Biomolecular Research, University of Utrecht, Utrecht, The Netherlands
| | - Dennis Blank
- Institute of Biochemistry, Faculty of Medicine, Justus, Liebig University, Giessen, Germany
| | - Matthew P Campbell
- Dublin-Oxford Glycobiology Laboratory, National Institute for Bioprocessing Research and Training (NIBRT), Conway Institute, University College Dublin, Dublin, Ireland
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, UK
| | - Alessio Ceroni
- Division of Molecular Biosciences, Faculty of Natural Sciences, Biochemistry Building, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - David R Damerell
- Division of Molecular Biosciences, Faculty of Natural Sciences, Biochemistry Building, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Anne Dell
- Division of Molecular Biosciences, Faculty of Natural Sciences, Biochemistry Building, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| | - Raymond A Dwek
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, UK
| | - Beat Ernst
- Department of Pharmaceutical Science, University of Basel, BaselSwitzerland
| | - Rasmus Fogh
- European Bioinformatics Institute, Hinxton, UK
| | - Martin Frank
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
| | - Hildegard Geyer
- Institute of Biochemistry, Faculty of Medicine, Justus, Liebig University, Giessen, Germany
| | - Rudolf Geyer
- Institute of Biochemistry, Faculty of Medicine, Justus, Liebig University, Giessen, Germany
| | | | - Kim Henrick
- European Bioinformatics Institute, Hinxton, UK
| | - Stefan Herget
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
| | - William E Hull
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
| | | | - Hiren J Joshi
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
- European Bioinformatics Institute, Hinxton, UK
| | - Johannis P Kamerling
- Bijvoet-Center for Biomolecular Research, University of Utrecht, Utrecht, The Netherlands
| | - Bas R Leeflang
- Bijvoet-Center for Biomolecular Research, University of Utrecht, Utrecht, The Netherlands
| | - Thomas Lütteke
- Bijvoet-Center for Biomolecular Research, University of Utrecht, Utrecht, The Netherlands
| | | | - Kai Maass
- Institute of Biochemistry, Faculty of Medicine, Justus, Liebig University, Giessen, Germany
| | | | - René Ranzinger
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
| | - Jimmy Rosen
- Bijvoet-Center for Biomolecular Research, University of Utrecht, Utrecht, The Netherlands
| | - Louise Royle
- Dublin-Oxford Glycobiology Laboratory, National Institute for Bioprocessing Research and Training (NIBRT), Conway Institute, University College Dublin, Dublin, Ireland
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, UK
| | - Pauline M Rudd
- Dublin-Oxford Glycobiology Laboratory, National Institute for Bioprocessing Research and Training (NIBRT), Conway Institute, University College Dublin, Dublin, Ireland
- Department of Biochemistry, Oxford Glycobiology Institute, University of Oxford, UK
| | - Siegfried Schloissnig
- Core Facility, Molecular Structure Analysis, German Cancer Research Center, Heidelberg, Germany
| | - Roland Stenutz
- Organic Chemistry, Stockholm University, Stockholm, Sweden
| | | | - Göran Widmalm
- Organic Chemistry, Stockholm University, Stockholm, Sweden
| | - Stuart M Haslam
- Division of Molecular Biosciences, Faculty of Natural Sciences, Biochemistry Building, Imperial College London, South Kensington Campus, London SW7 2AZ, UK
| |
Collapse
|
30
|
Neelamegham S, Liu G. Systems glycobiology: biochemical reaction networks regulating glycan structure and function. Glycobiology 2011; 21:1541-53. [PMID: 21436236 DOI: 10.1093/glycob/cwr036] [Citation(s) in RCA: 34] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
There is a growing use of bioinformatics based methods in the field of Glycobiology. These have been used largely to curate glycan structures, organize array-based experimental data and display existing knowledge of glycosylation-related pathways in silico. Although the cataloging of vast amounts of data is beneficial, it is often a challenge to gain meaningful mechanistic insight from this exercise alone. The development of specific analysis tools to query the database is necessary. If these queries can integrate existing knowledge of glycobiology, new insights may be gained. Such queries that couple biochemical knowledge and mathematics have been developed in the field of Systems Biology. The current review summarizes the current state of the art in the application of computational modeling in the field of Glycobiology. It provides (i) an overview of experimental and online resources that can be used to construct glycosylation reaction networks, (ii) mathematical methods to formulate the problem including a description of ordinary differential equation and logic-based reaction networks, (iii) optimization techniques that can be applied to fit experimental data for the purpose of model reconstruction and for evaluating unknown model parameters, (iv) post-simulation analysis methods that yield experimentally testable hypotheses and (v) a summary of available software tools that can be used by non-specialists to perform many of the above functions.
Collapse
Affiliation(s)
- Sriram Neelamegham
- Department of Chemical and Biological Engineering, and The NY State Center for Excellence in Bioinformatics and Life Sciences, State University of New York, Buffalo, NY 14260, USA.
| | | |
Collapse
|
31
|
Dall'Olio GM, Jassal B, Montanucci L, Gagneux P, Bertranpetit J, Laayouni H. The annotation of the asparagine N-linked glycosylation pathway in the Reactome database. Glycobiology 2011; 21:1395-400. [PMID: 21199820 DOI: 10.1093/glycob/cwq215] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Asparagine N-linked glycosylation is one of the most important forms of protein post-translational modification in eukaryotes and is one of the first metabolic pathways described at a biochemical level. Here, we report a new annotation of this pathway for the Human species, published after passing a peer-review process in Reactome. The new annotation presented here offers a high level of detail and provides references and descriptions for each reaction, along with integration with GeneOntology and other databases. The open-source approach of Reactome toward annotation encourages feedback from its users, making it easier to keep the annotation of this pathway updated with future knowledge. Reactome's web interface allows easy navigation between steps involved in the pathway to compare it with other pathways and resources in other scientific databases and to export it to BioPax and SBML formats, making it accessible for computational studies. This new entry in Reactome expands and complements the annotations already published in databases for biological pathways and provides a common reference to researchers interested in studying this important pathway in the human species. Finally, we discuss the status of the annotation of this pathway and point out which steps are worth further investigation or need better experimental validation.
Collapse
|
32
|
Toukach PV. Bacterial Carbohydrate Structure Database 3: Principles and Realization. J Chem Inf Model 2010; 51:159-70. [DOI: 10.1021/ci100150d] [Citation(s) in RCA: 71] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/17/2023]
Affiliation(s)
- Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Leninsky prospekt 47, 119991 Moscow, Russian Federation
| |
Collapse
|
33
|
Ranzinger R, Herget S, von der Lieth CW, Frank M. GlycomeDB--a unified database for carbohydrate structures. Nucleic Acids Res 2010; 39:D373-6. [PMID: 21045056 PMCID: PMC3013643 DOI: 10.1093/nar/gkq1014] [Citation(s) in RCA: 89] [Impact Index Per Article: 6.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
GlycomeDB integrates the structural and taxonomic data of all major public carbohydrate databases, as well as carbohydrates contained in the Protein Data Bank, which renders the database currently the most comprehensive and unified resource for carbohydrate structures worldwide. GlycomeDB retains the links to the original databases and is updated at weekly intervals with the newest structures available from the source databases. The complete database can be downloaded freely or accessed through a Web-interface (www.glycome-db.org) that provides flexible and powerful search functionalities.
Collapse
Affiliation(s)
- René Ranzinger
- German Cancer Research Center, DKFZ, Core Facility, Molecular Structure Analysis, W160, Im Neuenheimer Feld 280, 69120 Heidelberg, Germany.
| | | | | | | |
Collapse
|
34
|
Ahmed J, Preissner S, Dunkel M, Worth CL, Eckert A, Preissner R. SuperSweet--a resource on natural and artificial sweetening agents. Nucleic Acids Res 2010; 39:D377-82. [PMID: 20952410 PMCID: PMC3013782 DOI: 10.1093/nar/gkq917] [Citation(s) in RCA: 54] [Impact Index Per Article: 3.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
A vast number of sweet tasting molecules are known, encompassing small compounds, carbohydrates, d-amino acids and large proteins. Carbohydrates play a particularly big role in human diet. The replacement of sugars in food with artificial sweeteners is common and is a general approach to prevent cavities, obesity and associated diseases such as diabetes and hyperlipidemia. Knowledge about the molecular basis of taste may reveal new strategies to overcome diet-induced diseases. In this context, the design of safe, low-calorie sweeteners is particularly important. Here, we provide a comprehensive collection of carbohydrates, artificial sweeteners and other sweet tasting agents like proteins and peptides. Additionally, structural information and properties such as number of calories, therapeutic annotations and a sweetness-index are stored in SuperSweet. Currently, the database consists of more than 8000 sweet molecules. Moreover, the database provides a modeled 3D structure of the sweet taste receptor and binding poses of the small sweet molecules. These binding poses provide hints for the design of new sweeteners. A user-friendly graphical interface allows similarity searching, visualization of docked sweeteners into the receptor etc. A sweetener classification tree and browsing features allow quick requests to be made to the database. The database is freely available at: http://bioinformatics.charite.de/sweet/.
Collapse
Affiliation(s)
- Jessica Ahmed
- Charité - Universitätsmedizin Berlin, Institute of Physiology, Structural Bioinformatics Group, Lindenberger Weg 80, 13125 Berlin, Germany
| | | | | | | | | | | |
Collapse
|
35
|
Frank M, Schloissnig S. Bioinformatics and molecular modeling in glycobiology. Cell Mol Life Sci 2010; 67:2749-72. [PMID: 20364395 PMCID: PMC2912727 DOI: 10.1007/s00018-010-0352-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 03/08/2010] [Accepted: 03/11/2010] [Indexed: 12/11/2022]
Abstract
The field of glycobiology is concerned with the study of the structure, properties, and biological functions of the family of biomolecules called carbohydrates. Bioinformatics for glycobiology is a particularly challenging field, because carbohydrates exhibit a high structural diversity and their chains are often branched. Significant improvements in experimental analytical methods over recent years have led to a tremendous increase in the amount of carbohydrate structure data generated. Consequently, the availability of databases and tools to store, retrieve and analyze these data in an efficient way is of fundamental importance to progress in glycobiology. In this review, the various graphical representations and sequence formats of carbohydrates are introduced, and an overview of newly developed databases, the latest developments in sequence alignment and data mining, and tools to support experimental glycan analysis are presented. Finally, the field of structural glycoinformatics and molecular modeling of carbohydrates, glycoproteins, and protein-carbohydrate interaction are reviewed.
Collapse
Affiliation(s)
- Martin Frank
- Molecular Structure Analysis Core Facility-W160, Deutsches Krebsforschungszentrum (German Cancer Research Centre), 69120 Heidelberg, Germany.
| | | |
Collapse
|