1
|
Liu Y, Huang Y, Zhu R, Farag MA, Capanoglu E, Zhao C. Structural elucidation approaches in carbohydrates: A comprehensive review on techniques and future trends. Food Chem 2023; 400:134118. [DOI: 10.1016/j.foodchem.2022.134118] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 09/01/2022] [Indexed: 10/14/2022]
|
2
|
Toukach PV, Shirkovskaya AI. Carbohydrate Structure Database and Other Glycan Databases as an Important Element of Glycoinformatics. RUSSIAN JOURNAL OF BIOORGANIC CHEMISTRY 2022. [DOI: 10.1134/s1068162022030190] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
3
|
Glycan-mediated molecular interactions in bacterial pathogenesis. Trends Microbiol 2022; 30:254-267. [PMID: 34274195 PMCID: PMC8758796 DOI: 10.1016/j.tim.2021.06.011] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2021] [Revised: 06/10/2021] [Accepted: 06/22/2021] [Indexed: 02/06/2023]
Abstract
Glycans are expressed on the surface of nearly all host and bacterial cells. Not surprisingly, glycan-mediated molecular interactions play a vital role in bacterial pathogenesis and host responses against pathogens. Glycan-mediated host-pathogen interactions can benefit the pathogen, host, or both. Here, we discuss (i) bacterial glycans that play a critical role in bacterial colonization and/or immune evasion, (ii) host glycans that are utilized by bacteria for pathogenesis, and (iii) bacterial and host glycans involved in immune responses against pathogens. We further discuss (iv) opportunities and challenges for transforming these research findings into more effective antibacterial strategies, and (v) technological advances in glycoscience that have helped to accelerate progress in research. These studies collectively offer valuable insights into new perspectives on antibacterial strategies that may effectively tackle the drug-resistant pathogens that are rapidly spreading globally.
Collapse
|
4
|
Mariethoz J, Alocci D, Karlsson NG, Packer NH, Lisacek F. An Interactive View of Glycosylation. Methods Mol Biol 2022; 2370:41-65. [PMID: 34611864 DOI: 10.1007/978-1-0716-1685-7_3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
The present chapter focuses on the interactive and explorative aspects of bioinformatics resources that have been recently released in glycobiology. The comparative analysis of data in a field where knowledge is scattered, incomplete, and disconnected from main biology requires efficient visualization, integration, and interactive tools that are currently only partially implemented. This overview highlights converging efforts toward building a consistent picture of protein glycosylation.
Collapse
Affiliation(s)
- Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Niclas G Karlsson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Nicolle H Packer
- Department of Molecular Sciences and ARC Centre of Excellence for Nanoscale Biophotonics, Macquarie University, Sydney, NSW, Australia
- Institute for Glycomics, Griffith University, Gold Coast, QLD, Australia
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, University of Geneva, Geneva, Switzerland.
| |
Collapse
|
5
|
OUP accepted manuscript. Glycobiology 2022; 32:646-650. [DOI: 10.1093/glycob/cwac025] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2022] [Revised: 04/06/2022] [Indexed: 11/14/2022] Open
|
6
|
Gong Y, Qin S, Dai L, Tian Z. The glycosylation in SARS-CoV-2 and its receptor ACE2. Signal Transduct Target Ther 2021; 6:396. [PMID: 34782609 PMCID: PMC8591162 DOI: 10.1038/s41392-021-00809-8] [Citation(s) in RCA: 96] [Impact Index Per Article: 32.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2021] [Revised: 10/10/2021] [Accepted: 10/24/2021] [Indexed: 02/05/2023] Open
Abstract
Coronavirus disease 2019 (COVID-19), a highly infectious disease caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has infected more than 235 million individuals and led to more than 4.8 million deaths worldwide as of October 5 2021. Cryo-electron microscopy and topology show that the SARS-CoV-2 genome encodes lots of highly glycosylated proteins, such as spike (S), envelope (E), membrane (M), and ORF3a proteins, which are responsible for host recognition, penetration, binding, recycling and pathogenesis. Here we reviewed the detections, substrates, biological functions of the glycosylation in SARS-CoV-2 proteins as well as the human receptor ACE2, and also summarized the approved and undergoing SARS-CoV-2 therapeutics associated with glycosylation. This review may not only broad the understanding of viral glycobiology, but also provide key clues for the development of new preventive and therapeutic methodologies against SARS-CoV-2 and its variants.
Collapse
Affiliation(s)
- Yanqiu Gong
- National Clinical Research Center for Geriatrics and Department of General Practice, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, and Collaborative Innovation Center of Biotherapy, 610041, Chengdu, China
| | - Suideng Qin
- School of Chemical Science & Engineering, Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, 200092, Shanghai, China
| | - Lunzhi Dai
- National Clinical Research Center for Geriatrics and Department of General Practice, State Key Laboratory of Biotherapy, West China Hospital, Sichuan University, and Collaborative Innovation Center of Biotherapy, 610041, Chengdu, China.
| | - Zhixin Tian
- School of Chemical Science & Engineering, Shanghai Key Laboratory of Chemical Assessment and Sustainability, Tongji University, 200092, Shanghai, China.
| |
Collapse
|
7
|
Paton B, Suarez M, Herrero P, Canela N. Glycosylation Biomarkers Associated with Age-Related Diseases and Current Methods for Glycan Analysis. Int J Mol Sci 2021; 22:ijms22115788. [PMID: 34071388 PMCID: PMC8198018 DOI: 10.3390/ijms22115788] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2021] [Revised: 05/22/2021] [Accepted: 05/25/2021] [Indexed: 12/23/2022] Open
Abstract
Ageing is a complex process which implies the accumulation of molecular, cellular and organ damage, leading to an increased vulnerability to disease. In Western societies, the increase in the elderly population, which is accompanied by ageing-associated pathologies such as cardiovascular and mental diseases, is becoming an increasing economic and social burden for governments. In order to prevent, treat and determine which subjects are more likely to develop these age-related diseases, predictive biomarkers are required. In this sense, some studies suggest that glycans have a potential role as disease biomarkers, as they modify the functions of proteins and take part in intra- and intercellular biological processes. As the glycome reflects the real-time status of these interactions, its characterisation can provide potential diagnostic and prognostic biomarkers for multifactorial diseases. This review gathers the alterations in protein glycosylation profiles that are associated with ageing and age-related diseases, such as cancer, type 2 diabetes mellitus, metabolic syndrome and several chronic inflammatory diseases. Furthermore, the review includes the available techniques for the determination and characterisation of glycans, such as liquid chromatography, electrophoresis, nuclear magnetic resonance and mass spectrometry.
Collapse
Affiliation(s)
- Beatrix Paton
- Eurecat, Centre Tecnològic de Catalunya, Centre for Omic Sciences, Joint Unit Eurecat-Universitat Rovira i Virgili, Unique Scientific and Technical Infrastructure (ICTS), 43204 Reus, Spain; (B.P.); (N.C.)
| | - Manuel Suarez
- Nutrigenomics Research Group, Departament de Bioquímica i Biotecnologia, Universitat Rovira i Virgili, 43007 Tarragona, Spain
- Correspondence: (M.S.); (P.H.)
| | - Pol Herrero
- Eurecat, Centre Tecnològic de Catalunya, Centre for Omic Sciences, Joint Unit Eurecat-Universitat Rovira i Virgili, Unique Scientific and Technical Infrastructure (ICTS), 43204 Reus, Spain; (B.P.); (N.C.)
- Correspondence: (M.S.); (P.H.)
| | - Núria Canela
- Eurecat, Centre Tecnològic de Catalunya, Centre for Omic Sciences, Joint Unit Eurecat-Universitat Rovira i Virgili, Unique Scientific and Technical Infrastructure (ICTS), 43204 Reus, Spain; (B.P.); (N.C.)
| |
Collapse
|
8
|
Yamada I, Campbell MP, Edwards N, Castro LJ, Lisacek F, Mariethoz J, Ono T, Ranzinger R, Shinmachi D, Aoki-Kinoshita KF. The glycoconjugate ontology (GlycoCoO) for standardizing the annotation of glycoconjugate data and its application. Glycobiology 2021; 31:741-750. [PMID: 33677548 PMCID: PMC8351504 DOI: 10.1093/glycob/cwab013] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2020] [Revised: 12/31/2020] [Accepted: 01/01/2021] [Indexed: 01/19/2023] Open
Abstract
Recent years have seen great advances in the development of glycoproteomics protocols and methods resulting in a sustainable increase in the reporting proteins, their attached glycans and glycosylation sites. However, only very few of these reports find their way into databases or data repositories. One of the major reasons is the absence of digital standard to represent glycoproteins and the challenging annotations with glycans. Depending on the experimental method, such a standard must be able to represent glycans as complete structures or as compositions, store not just single glycans but also represent glycoforms on a specific glycosylation side, deal with partially missing site information if no site mapping was performed, and store abundances or ratios of glycans within a glycoform of a specific site. To support the above, we have developed the GlycoConjugate Ontology (GlycoCoO) as a standard semantic framework to describe and represent glycoproteomics data. GlycoCoO can be used to represent glycoproteomics data in triplestores and can serve as a basis for data exchange formats. The ontology, database providers and supporting documentation are available online (https://github.com/glycoinfo/GlycoCoO).
Collapse
Affiliation(s)
- Issaku Yamada
- Research Department, The Noguchi Institute, 1-9-7 Kaga, Itabashi, Tokyo 173-0003, Japan
| | - Matthew P Campbell
- Institute for Glycomics, Griffith University at Gold Coast, Southport, QLD 4215, Australia
| | - Nathan Edwards
- Department of Biochemistry, Molecular and Cellular Biology, Georgetown University Medical Center, Washington, D.C. 20007, USA
| | - Leyla Jael Castro
- ZB MED Information Centre for Life Sciences, Gleueler Str. 60, 50931 Cologne, Germany
| | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Computer Science Department, University of Geneva, route de Drize 7, CH - 1227 Geneva Switzerland, and also Section of Biology, University of Geneva, Geneva, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, 7 Route de Drize, 1227 Geneva, Switzerland
| | - Tamiko Ono
- Faculty of Science and Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| | - Rene Ranzinger
- Complex Carbohydrate Research Center, The University of Georgia, 315 Riverbend Rd, Athens, Georgia 30602, USA
| | - Daisuke Shinmachi
- R&D Department, SparqLite LLC., 1615-22 Ishikawamachi, Hachioji, Tokyo 192-0032, Japan
| | - Kiyoko F Aoki-Kinoshita
- Glycan & Life Science Integration Center (GaLSIC), Faculty of Science and Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo 192-8577, Japan
| |
Collapse
|
9
|
Huang J, Wu M, Zhang Y, Kong S, Liu M, Jiang B, Yang P, Cao W. OGP: A Repository of Experimentally Characterized O-Glycoproteins to Facilitate Studies on O-Glycosylation. GENOMICS PROTEOMICS & BIOINFORMATICS 2021; 19:611-618. [PMID: 33581334 PMCID: PMC9039567 DOI: 10.1016/j.gpb.2020.05.003] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/14/2019] [Revised: 03/17/2020] [Accepted: 05/31/2020] [Indexed: 11/16/2022]
Abstract
Numerous studies on cancers, biopharmaceuticals, and clinical trials have necessitated comprehensive and precise analysis of protein O-glycosylation. However, the lack of updated and convenient databases deters the storage of and reference to emerging O-glycoprotein data. To resolve this issue, an O-glycoprotein repository named OGP was established in this work. It was constructed with a collection of O-glycoprotein data from different sources. OGP contains 9354 O-glycosylation sites and 11,633 site-specific O-glycans mapping to 2133 O-glycoproteins, and it is the largest O-glycoprotein repository thus far. Based on the recorded O-glycosylation sites, an O-glycosylation site prediction tool was developed. Moreover, an OGP-based website is already available (https://www.oglyp.org/). The website comprises four specially designed and user-friendly modules: statistical analysis, database search, site prediction, and data submission. The first version of OGP repository and the website allow users to obtain various O-glycoprotein-related information, such as protein accession Nos., O-glycosylation sites, O-glycopeptide sequences, site-specific O-glycan structures, experimental methods, and potential O-glycosylation sites. O-glycosylation data mining can be performed efficiently on this website, which will greatly facilitate related studies. In addition, the database is accessible from OGP website (https://www.oglyp.org/download.php).
Collapse
Affiliation(s)
- Jiangming Huang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Mengxi Wu
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Yang Zhang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Siyuan Kong
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Mingqi Liu
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China
| | - Biyun Jiang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China
| | - Pengyuan Yang
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China; NHC Key Laboratory of Glycoconjugates Research (Fudan University), Shanghai 200032, China.
| | - Weiqian Cao
- Department of Chemistry and Institutes of Biomedical Sciences, Fudan University, Shanghai 200032, China; The Fifth People's Hospital, Fudan University, and the Shanghai Key Laboratory of Medical Epigenetics, the International Co-laboratory of Medical Epigenetics and Metabolism, Ministry of Science and Technology, Fudan University, Shanghai 200032, China; NHC Key Laboratory of Glycoconjugates Research (Fudan University), Shanghai 200032, China.
| |
Collapse
|
10
|
Insights into Bioinformatic Applications for Glycosylation: Instigating an Awakening towards Applying Glycoinformatic Resources for Cancer Diagnosis and Therapy. Int J Mol Sci 2020; 21:ijms21249336. [PMID: 33302373 PMCID: PMC7762546 DOI: 10.3390/ijms21249336] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Revised: 11/26/2020] [Accepted: 12/01/2020] [Indexed: 01/10/2023] Open
Abstract
Glycosylation plays a crucial role in various diseases and their etiology. This has led to a clear understanding on the functions of carbohydrates in cell communication, which eventually will result in novel therapeutic approaches for treatment of various disease. Glycomics has now become one among the top ten technologies that will change the future. The direct implication of glycosylation as a hallmark of cancer and for cancer therapy is well established. As in proteomics, where bioinformatics tools have led to revolutionary achievements, bioinformatics resources for glycosylation have improved its practical implication. Bioinformatics tools, algorithms and databases are a mandatory requirement to manage and successfully analyze large amount of glycobiological data generated from glycosylation studies. This review consolidates all the available tools and their applications in glycosylation research. The achievements made through the use of bioinformatics into glycosylation studies are also presented. The importance of glycosylation in cancer diagnosis and therapy is discussed and the gap in the application of widely available glyco-informatic tools for cancer research is highlighted. This review is expected to bring an awakening amongst glyco-informaticians as well as cancer biologists to bridge this gap, to exploit the available glyco-informatic tools for cancer.
Collapse
|
11
|
Scherbinina SI, Toukach PV. Three-Dimensional Structures of Carbohydrates and Where to Find Them. Int J Mol Sci 2020; 21:E7702. [PMID: 33081008 PMCID: PMC7593929 DOI: 10.3390/ijms21207702] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2020] [Revised: 10/15/2020] [Accepted: 10/16/2020] [Indexed: 02/06/2023] Open
Abstract
Analysis and systematization of accumulated data on carbohydrate structural diversity is a subject of great interest for structural glycobiology. Despite being a challenging task, development of computational methods for efficient treatment and management of spatial (3D) structural features of carbohydrates breaks new ground in modern glycoscience. This review is dedicated to approaches of chemo- and glyco-informatics towards 3D structural data generation, deposition and processing in regard to carbohydrates and their derivatives. Databases, molecular modeling and experimental data validation services, and structure visualization facilities developed for last five years are reviewed.
Collapse
Affiliation(s)
- Sofya I. Scherbinina
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
- Higher Chemical College, D. Mendeleev University of Chemical Technology of Russia, Miusskaya Square 9, 125047 Moscow, Russia
| | - Philip V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Science, Leninsky prospect 47, 119991 Moscow, Russia
| |
Collapse
|
12
|
Robin T, Mariethoz J, Lisacek F. Examining and Fine-tuning the Selection of Glycan Compositions with GlyConnect Compozitor. Mol Cell Proteomics 2020; 19:1602-1618. [PMID: 32636234 PMCID: PMC8014996 DOI: 10.1074/mcp.ra120.002041] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2020] [Revised: 07/01/2020] [Indexed: 01/22/2023] Open
Abstract
A key point in achieving accurate intact glycopeptide identification is the definition of the glycan composition file that is used to match experimental with theoretical masses by a glycoproteomics search engine. At present, these files are mainly built from searching the literature and/or querying data sources focused on posttranslational modifications. Most glycoproteomics search engines include a default composition file that is readily used when processing MS data. We introduce here a glycan composition visualizing and comparative tool associated with the GlyConnect database and called GlyConnect Compozitor. It offers a web interface through which the database can be queried to bring out contextual information relative to a set of glycan compositions. The tool takes advantage of compositions being related to one another through shared monosaccharide counts and outputs interactive graphs summarizing information searched in the database. These results provide a guide for selecting or deselecting compositions in a file in order to reflect the context of a study as closely as possible. They also confirm the consistency of a set of compositions based on the content of the GlyConnect database. As part of the tool collection of the Glycomics@ExPASy initiative, Compozitor is hosted at https://glyconnect.expasy.org/compozitor/ where it can be run as a web application. It is also directly accessible from the GlyConnect database.
Collapse
Affiliation(s)
- Thibault Robin
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Geneva, Switzerland; Computer Science Dept., Faculty of Science, University of Geneva, Switzerland; CALIPHO Group, SIB Swiss Institute of BioinformaticsCMU, Geneva, Switzerland; Microbiology and Molecular Medicine Dept., Faculty of Medicine, University of Geneva, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Geneva, Switzerland; Computer Science Dept., Faculty of Science, University of Geneva, Switzerland
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Geneva, Switzerland; Computer Science Dept., Faculty of Science, University of Geneva, Switzerland; Section of Biology, Faculty of Science, University of Geneva, Switzerland.
| |
Collapse
|
13
|
Databases and Bioinformatic Tools for Glycobiology and Glycoproteomics. Int J Mol Sci 2020; 21:ijms21186727. [PMID: 32937895 PMCID: PMC7556027 DOI: 10.3390/ijms21186727] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Revised: 09/03/2020] [Accepted: 09/11/2020] [Indexed: 02/07/2023] Open
Abstract
Glycosylation plays critical roles in various biological processes and is closely related to diseases. Deciphering the glycocode in diverse cells and tissues offers opportunities to develop new disease biomarkers and more effective recombinant therapeutics. In the past few decades, with the development of glycobiology, glycomics, and glycoproteomics technologies, a large amount of glycoscience data has been generated. Subsequently, a number of glycobiology databases covering glycan structure, the glycosylation sites, the protein scaffolds, and related glycogenes have been developed to store, analyze, and integrate these data. However, these databases and tools are not well known or widely used by the public, including clinicians and other researchers who are not in the field of glycobiology, but are interested in glycoproteins. In this study, the representative databases of glycan structure, glycoprotein, glycan-protein interactions, glycogenes, and the newly developed bioinformatic tools and integrated portal for glycoproteomics are reviewed. We hope this overview could assist readers in searching for information on glycoproteins of interest, and promote further clinical application of glycobiology.
Collapse
|
14
|
Alocci D, Mariethoz J, Gastaldello A, Gasteiger E, Karlsson NG, Kolarich D, Packer NH, Lisacek F. GlyConnect: Glycoproteomics Goes Visual, Interactive, and Analytical. J Proteome Res 2018; 18:664-677. [DOI: 10.1021/acs.jproteome.8b00766] [Citation(s) in RCA: 62] [Impact Index Per Article: 10.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Affiliation(s)
- Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Alessandra Gastaldello
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
| | - Elisabeth Gasteiger
- Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, CH-1211 Geneva, Switzerland
| | - Niclas G. Karlsson
- Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, University of Gothenburg, 40530 Gothenburg, Sweden
| | - Daniel Kolarich
- Institute for Glycomics, Griffith University, Southport, Queensland 4215, Australia
- ARC Centre for Nanoscale BioPhotonics, Macquarie University and Griffith University, Sydney, New South Wales 2109, Australia
| | - Nicolle H. Packer
- Institute for Glycomics, Griffith University, Southport, Queensland 4215, Australia
- ARC Centre for Nanoscale BioPhotonics, Macquarie University and Griffith University, Sydney, New South Wales 2109, Australia
- Department of Molecular Sciences, Macquarie University, Sydney, New South Wales 2109, Australia
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Rue Michel-Servet 1, CH-1211 Geneva, Switzerland
- Computer Science Department, University of Geneva, CH-1227 Geneva, Switzerland
- Section of Biology, University of Geneva, CH-1211 Geneva, Switzerland
| |
Collapse
|
15
|
Alocci D, Suchánková P, Costa R, Hory N, Mariethoz J, Vařeková RS, Toukach P, Lisacek F. SugarSketcher: Quick and Intuitive Online Glycan Drawing. Molecules 2018; 23:E3206. [PMID: 30563078 PMCID: PMC6320881 DOI: 10.3390/molecules23123206] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2018] [Revised: 11/23/2018] [Accepted: 11/29/2018] [Indexed: 01/24/2023] Open
Abstract
SugarSketcher is an intuitive and fast JavaScript interface module for online drawing of glycan structures in the popular Symbol Nomenclature for Glycans (SNFG) notation and exporting them to various commonly used formats encoding carbohydrate sequences (e.g., GlycoCT) or quality images (e.g., svg). It does not require a backend server or any specific browser plugins and can be integrated in any web glycoinformatics project. SugarSketcher allows drawing glycans both for glycobiologists and non-expert users. The "quick mode" allows a newcomer to build up a glycan structure having only a limited knowledge in carbohydrate chemistry. The "normal mode" integrates advanced options which enable glycobiologists to tailor complex carbohydrate structures. The source code is freely available on GitHub and glycoinformaticians are encouraged to participate in the development process while users are invited to test a prototype available on the ExPASY web-site and send feedback.
Collapse
Affiliation(s)
- Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland.
- Computer Science Department, University of Geneva, 1211 Geneva, Switzerland.
| | - Pavla Suchánková
- CEITEC⁻Central European Institute of Technology, Masaryk University Brno, 625 00 Brno-Bohunice, Czech Republic.
- National Centre for Biomolecular Research, Faculty of Science, 625 00 Brno-Bohunice, Czech Republic.
| | - Renaud Costa
- Polytech Nice Sophia, Campus SophiaTech, 06903 Sophia-Antipolis, France.
| | - Nicolas Hory
- Polytech Nice Sophia, Campus SophiaTech, 06903 Sophia-Antipolis, France.
| | - Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland.
- Computer Science Department, University of Geneva, 1211 Geneva, Switzerland.
| | - Radka Svobodová Vařeková
- CEITEC⁻Central European Institute of Technology, Masaryk University Brno, 625 00 Brno-Bohunice, Czech Republic.
- National Centre for Biomolecular Research, Faculty of Science, 625 00 Brno-Bohunice, Czech Republic.
| | - Philip Toukach
- Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Laboratory of Carbohydrate Chemistry, 119991 Moscow, Russia.
| | - Frédérique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, 1211 Geneva, Switzerland.
- Computer Science Department, University of Geneva, 1211 Geneva, Switzerland.
- Section of Biology, University of Geneva, 1211 Geneva, Switzerland.
| |
Collapse
|
16
|
Egorova KS, Toukach PV. Glykoinformatik: Brücken zwischen isolierten Inseln im Datenmeer. Angew Chem Int Ed Engl 2018. [DOI: 10.1002/ange.201803576] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Ksenia S. Egorova
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russland
| | - Philip V. Toukach
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russland
| |
Collapse
|
17
|
Egorova KS, Toukach PV. Glycoinformatics: Bridging Isolated Islands in the Sea of Data. Angew Chem Int Ed Engl 2018; 57:14986-14990. [DOI: 10.1002/anie.201803576] [Citation(s) in RCA: 25] [Impact Index Per Article: 4.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2018] [Indexed: 11/07/2022]
Affiliation(s)
- Ksenia S. Egorova
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russia
| | - Philip V. Toukach
- Zelinsky Institute of Organic ChemistryRussian Academy of Sciences Leninsky Prospect 47 Moscow 119991 Russia
| |
Collapse
|
18
|
Mariethoz J, Alocci D, Gastaldello A, Horlacher O, Gasteiger E, Rojas-Macias M, Karlsson NG, Packer NH, Lisacek F. Glycomics@ExPASy: Bridging the Gap. Mol Cell Proteomics 2018; 17:2164-2176. [PMID: 30097532 PMCID: PMC6210229 DOI: 10.1074/mcp.ra118.000799] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 07/15/2018] [Indexed: 12/28/2022] Open
Abstract
Glycomics@ExPASy (https://www.expasy.org/glycomics) is the glycomics tab of ExPASy, the server of SIB Swiss Institute of Bioinformatics. It was created in 2016 to centralize web-based glycoinformatics resources developed within an international network of glycoscientists. The hosted collection currently includes mainly databases and tools created and maintained at SIB but also links to a range of reference resources popular in the glycomics community. The philosophy of our toolbox is that it should be {glycoscientist AND protein scientist}-friendly with the aim of (1) popularizing the use of bioinformatics in glycobiology and (2) emphasizing the relationship between glycobiology and protein-oriented bioinformatics resources. The scarcity of data bridging these two disciplines led us to design tools as interactive as possible based on database connectivity to facilitate data exploration and support hypothesis building. Glycomics@ExPASy was designed, and is developed, with a long-term vision in close collaboration with glycoscientists to meet as closely as possible the growing needs of the community for glycoinformatics.
Collapse
Affiliation(s)
- Julien Mariethoz
- From the ‡Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- §Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Davide Alocci
- From the ‡Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- §Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Alessandra Gastaldello
- From the ‡Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
- §Computer Science Department, University of Geneva, Geneva, Switzerland
| | - Oliver Horlacher
- From the ‡Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Elisabeth Gasteiger
- ¶Swiss-Prot Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | - Miguel Rojas-Macias
- ‖Glyco Inflammatory Group, Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Niclas G Karlsson
- ‖Glyco Inflammatory Group, Department of Medical Biochemistry and Cell Biology, Institute of Biomedicine, Sahlgrenska Academy, University of Gothenburg, Gothenburg, Sweden
| | - Nicolle H Packer
- **Institute for Glycomics, Gold Coast Campus, Griffith University, Southport, QLD, Australia
- ‡‡Biomolecular Discovery & Design Research Centre, Macquarie University, North Ryde, NSW, Australia
| | - Frédérique Lisacek
- From the ‡Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland;
- §Computer Science Department, University of Geneva, Geneva, Switzerland
- §§Section of Biology, University of Geneva, Geneva, Switzerland
| |
Collapse
|
19
|
Sun W, Liu Y, Zhang K. An approach for N-linked glycan identification from MS/MS spectra by target-decoy strategy. Comput Biol Chem 2018; 74:391-398. [PMID: 29580737 DOI: 10.1016/j.compbiolchem.2018.03.014] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2018] [Accepted: 03/13/2018] [Indexed: 12/28/2022]
Abstract
Glycan structure determination serves as an essential step for the thorough investigation of the structure and function of protein. Currently, appropriate sample preparation followed by tandem mass spectrometry has emerged as the dominant technique for the characterization of glycans and glycopeptides. Although extensive efforts have been made to the development of computational approaches for the automated interpretation of glycopeptide spectra, the previously appeared methods lack a reasonable quality control strategy for the statistical validation of reported results. In this manuscript, we introduced a novel method that constructed a decoy glycan database based on the glycan structures in the target database, and searched the experimental spectra against both the target and decoy databases to find the best matched glycans. Specifically, a two-layer scoring scheme for calculating a normalized matching score is applied in the search procedure which enables the unbiased ranking of the matched glycans. Experimental analysis showed that our proposed method can report more structures with high confidence compared with previous approaches.
Collapse
Affiliation(s)
- Weiping Sun
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada.
| | - Yi Liu
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada
| | - Kaizhong Zhang
- Department of Computer Science, University of Western Ontario, London, ON N6A5B7, Canada
| |
Collapse
|
20
|
Abstract
Chemical tools have accelerated progress in glycoscience, reducing experimental barriers to studying protein glycosylation, the most widespread and complex form of posttranslational modification. For example, chemical glycoproteomics technologies have enabled the identification of specific glycosylation sites and glycan structures that modulate protein function in a number of biological processes. This field is now entering a stage of logarithmic growth, during which chemical innovations combined with mass spectrometry advances could make it possible to fully characterize the human glycoproteome. In this review, we describe the important role that chemical glycoproteomics methods are playing in such efforts. We summarize developments in four key areas: enrichment of glycoproteins and glycopeptides from complex mixtures, emphasizing methods that exploit unique chemical properties of glycans or introduce unnatural functional groups through metabolic labeling and chemoenzymatic tagging; identification of sites of protein glycosylation; targeted glycoproteomics; and functional glycoproteomics, with a focus on probing interactions between glycoproteins and glycan-binding proteins. Our goal with this survey is to provide a foundation on which continued technological advancements can be made to promote further explorations of protein glycosylation.
Collapse
Affiliation(s)
- Krishnan K. Palaniappan
- Verily Life Sciences, 269 East Grand Ave., South San Francisco, California 94080, United States
| | - Carolyn R. Bertozzi
- Department of Chemistry, Stanford University, Stanford, California 94305, United States
- Howard Hughes Medical Institute, Stanford University, Stanford, California 94305, United States
| |
Collapse
|
21
|
Gastaldello A, Alocci D, Baeriswyl JL, Mariethoz J, Lisacek F. GlycoSiteAlign: Glycosite Alignment Based on Glycan Structure. J Proteome Res 2016; 15:3916-3928. [DOI: 10.1021/acs.jproteome.6b00481] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/04/2023]
Affiliation(s)
- Alessandra Gastaldello
- Proteome
Informatics Group, SIB Swiss Institute of Bioinformatics, 7 route
de Drize, 1227 Geneva, Switzerland
- Computer
Science Department CUI, University of Geneva, 1227 Geneva, Switzerland
| | - Davide Alocci
- Proteome
Informatics Group, SIB Swiss Institute of Bioinformatics, 7 route
de Drize, 1227 Geneva, Switzerland
- Computer
Science Department CUI, University of Geneva, 1227 Geneva, Switzerland
| | - Jean-Luc Baeriswyl
- Proteome
Informatics Group, SIB Swiss Institute of Bioinformatics, 7 route
de Drize, 1227 Geneva, Switzerland
- Section
of Biology, Faculty of Sciences, University of Geneva, 1211 Geneva, Switzerland
| | - Julien Mariethoz
- Proteome
Informatics Group, SIB Swiss Institute of Bioinformatics, 7 route
de Drize, 1227 Geneva, Switzerland
- Computer
Science Department CUI, University of Geneva, 1227 Geneva, Switzerland
| | - Frederique Lisacek
- Proteome
Informatics Group, SIB Swiss Institute of Bioinformatics, 7 route
de Drize, 1227 Geneva, Switzerland
- Computer
Science Department CUI, University of Geneva, 1227 Geneva, Switzerland
- Section
of Biology, Faculty of Sciences, University of Geneva, 1211 Geneva, Switzerland
| |
Collapse
|
22
|
Mariethoz J, Khatib K, Alocci D, Campbell MP, Karlsson NG, Packer NH, Mullen EH, Lisacek F. SugarBindDB, a resource of glycan-mediated host-pathogen interactions. Nucleic Acids Res 2016; 44:D1243-50. [PMID: 26578555 PMCID: PMC4702881 DOI: 10.1093/nar/gkv1247] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Revised: 10/22/2015] [Accepted: 10/31/2015] [Indexed: 12/16/2022] Open
Abstract
The SugarBind Database (SugarBindDB) covers knowledge of glycan binding of human pathogen lectins and adhesins. It is a curated database; each glycan-protein binding pair is associated with at least one published reference. The core data element of SugarBindDB is a set of three inseparable components: the pathogenic agent, a lectin/adhesin and a glycan ligand. Each entity (agent, lectin or ligand) is described by a range of properties that are summarized in an entity-dedicated page. Several search, navigation and visualisation tools are implemented to investigate the functional role of glycans in pathogen binding. The database is cross-linked to protein and glycan-relaled resources such as UniProtKB and UniCarbKB. It is tightly bound to the latter via a substructure search tool that maps each ligand to full structures where it occurs. Thus, a glycan-lectin binding pair of SugarBindDB can lead to the identification of a glycan-mediated protein-protein interaction, that is, a lectin-glycoprotein interaction, via substructure search and the knowledge of site-specific glycosylation stored in UniCarbKB. SugarBindDB is accessible at: http://sugarbind.expasy.org.
Collapse
Affiliation(s)
- Julien Mariethoz
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland
| | | | - Davide Alocci
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland Department of Computer Science, University of Geneva, Geneva, Switzerland
| | - Matthew P Campbell
- Biomolecular Frontiers Research Centre, Macquarie University, North Ryde, NSW, Australia
| | - Niclas G Karlsson
- University of Gothenburg, Sahlgrenska Academy, Institute of Biomedicine, Department of Medical Biochemistry and Cell Biology, Gothenburg, Sweden
| | - Nicolle H Packer
- Biomolecular Frontiers Research Centre, Macquarie University, North Ryde, NSW, Australia
| | | | - Frederique Lisacek
- Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, Geneva, Switzerland Department of Computer Science, University of Geneva, Geneva, Switzerland
| |
Collapse
|
23
|
The SIB Swiss Institute of Bioinformatics' resources: focus on curated databases. Nucleic Acids Res 2015; 44:D27-37. [PMID: 26615188 PMCID: PMC4702916 DOI: 10.1093/nar/gkv1310] [Citation(s) in RCA: 46] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2015] [Accepted: 11/09/2015] [Indexed: 12/15/2022] Open
Abstract
The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) provides world-class bioinformatics databases, software tools, services and training to the international life science community in academia and industry. These solutions allow life scientists to turn the exponentially growing amount of data into knowledge. Here, we provide an overview of SIB's resources and competence areas, with a strong focus on curated databases and SIB's most popular and widely used resources. In particular, SIB's Bioinformatics resource portal ExPASy features over 150 resources, including UniProtKB/Swiss-Prot, ENZYME, PROSITE, neXtProt, STRING, UniCarbKB, SugarBindDB, SwissRegulon, EPD, arrayMap, Bgee, SWISS-MODEL Repository, OMA, OrthoDB and other databases, which are briefly described in this article.
Collapse
|
24
|
Abstract
Over the last two decades, several carbohydrate structure databases have been developed and made publicly available by different research groups around the world. This led to the fragmentation of information about carbohydrate structures into different resources that have no or only weak interaction with each other. GlycomeDB was developed to integrate the carbohydrate structures from different resources by generating a single-indexed catalog of these structures that associates each structure with its reference in the original resources. GlycomeDB facilitates searching for carbohydrate structures in all the integrated resources by eliminating the need to use several different search interfaces and manually integrating the results. References provided by GlycomeDB make it possible to retrieve information that is beyond the scope of GlycomeDB but present in the integrated databases. This chapter illustrates the use of the GlycomeDB search interfaces and web services by way of three example cases.
Collapse
Affiliation(s)
- René Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, 315 Riverbend Road, Athens, GA, 30602-4712, USA,
| | | |
Collapse
|
25
|
Toukach PV, Egorova KS. Carbohydrate structure database merged from bacterial, archaeal, plant and fungal parts. Nucleic Acids Res 2015; 44:D1229-36. [PMID: 26286194 PMCID: PMC4702937 DOI: 10.1093/nar/gkv840] [Citation(s) in RCA: 134] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2015] [Accepted: 08/07/2015] [Indexed: 12/31/2022] Open
Abstract
The Carbohydrate Structure Databases (CSDBs, http://csdb.glycoscience.ru) store structural, bibliographic, taxonomic, NMR spectroscopic, and other data on natural carbohydrates and their derivatives published in the scientific literature. The CSDB project was launched in 2005 for bacterial saccharides (as BCSDB). Currently, it includes two parts, the Bacterial CSDB and the Plant&Fungal CSDB. In March 2015, these databases were merged to the single CSDB. The combined CSDB includes information on bacterial and archaeal glycans and derivatives (the coverage is close to complete), as well as on plant and fungal glycans and glycoconjugates (almost all structures published up to 1998). CSDB is regularly updated via manual expert annotation of original publications. Both newly annotated data and data imported from other databases are manually curated. The CSDB data are exportable in a number of modern formats, such as GlycoRDF. CSDB provides additional services for simulation of (1)H, (13)C and 2D NMR spectra of saccharides, NMR-based structure prediction, glycan-based taxon clustering and other.
Collapse
Affiliation(s)
- Philip V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow 119991, Russia
| | - Ksenia S Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow 119991, Russia
| |
Collapse
|
26
|
Liu G, Neelamegham S. Integration of systems glycobiology with bioinformatics toolboxes, glycoinformatics resources, and glycoproteomics data. WILEY INTERDISCIPLINARY REVIEWS-SYSTEMS BIOLOGY AND MEDICINE 2015; 7:163-81. [PMID: 25871730 DOI: 10.1002/wsbm.1296] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/24/2014] [Revised: 02/08/2015] [Accepted: 03/04/2015] [Indexed: 12/22/2022]
Abstract
The glycome constitutes the entire complement of free carbohydrates and glycoconjugates expressed on whole cells or tissues. 'Systems Glycobiology' is an emerging discipline that aims to quantitatively describe and analyse the glycome. Here, instead of developing a detailed understanding of single biochemical processes, a combination of computational and experimental tools are used to seek an integrated or 'systems-level' view. This can explain how multiple biochemical reactions and transport processes interact with each other to control glycome biosynthesis and function. Computational methods in this field commonly build in silico reaction network models to describe experimental data derived from structural studies that measure cell-surface glycan distribution. While considerable progress has been made, several challenges remain due to the complex and heterogeneous nature of this post-translational modification. First, for the in silico models to be standardized and shared among laboratories, it is necessary to integrate glycan structure information and glycosylation-related enzyme definitions into the mathematical models. Second, as glycoinformatics resources grow, it would be attractive to utilize 'Big Data' stored in these repositories for model construction and validation. Third, while the technology for profiling the glycome at the whole-cell level has been standardized, there is a need to integrate mass spectrometry derived site-specific glycosylation data into the models. The current review discusses progress that is being made to resolve the above bottlenecks. The focus is on how computational models can bridge the gap between 'data' generated in wet-laboratory studies with 'knowledge' that can enhance our understanding of the glycome.
Collapse
Affiliation(s)
- Gang Liu
- Department of Chemical and Biological Engineering, State University of New York, Buffalo, NY, USA
| | - Sriram Neelamegham
- Department of Chemical and Biological Engineering, State University of New York, Buffalo, NY, USA
| |
Collapse
|
27
|
Ranzinger R, Aoki-Kinoshita KF, Campbell MP, Kawano S, Lütteke T, Okuda S, Shinmachi D, Shikanai T, Sawaki H, Toukach P, Matsubara M, Yamada I, Narimatsu H. GlycoRDF: an ontology to standardize glycomics data in RDF. Bioinformatics 2015; 31:919-25. [PMID: 25388145 PMCID: PMC4380026 DOI: 10.1093/bioinformatics/btu732] [Citation(s) in RCA: 32] [Impact Index Per Article: 3.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2014] [Revised: 10/12/2014] [Accepted: 10/28/2014] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION Over the last decades several glycomics-based bioinformatics resources and databases have been created and released to the public. Unfortunately, there is no common standard in the representation of the stored information or a common machine-readable interface allowing bioinformatics groups to easily extract and cross-reference the stored information. RESULTS An international group of bioinformatics experts in the field of glycomics have worked together to create a standard Resource Description Framework (RDF) representation for glycomics data, focused on glycan sequences and related biological source, publications and experimental data. This RDF standard is defined by the GlycoRDF ontology and will be used by database providers to generate common machine-readable exports of the data stored in their databases. AVAILABILITY AND IMPLEMENTATION The ontology, supporting documentation and source code used by database providers to generate standardized RDF are available online (http://www.glycoinfo.org/GlycoRDF/).
Collapse
Affiliation(s)
- Rene Ranzinger
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Kiyoko F Aoki-Kinoshita
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Matthew P Campbell
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Shin Kawano
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Thomas Lütteke
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Shujiro Okuda
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Daisuke Shinmachi
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Toshihide Shikanai
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Hiromichi Sawaki
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Philip Toukach
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Masaaki Matsubara
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Issaku Yamada
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| | - Hisashi Narimatsu
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA, Research Center for Medical Glycoscience, National Institute of Advanced Industrial Science and Technology, Tsukuba, Japan, Faculty of Engineering, Soka University, Tokyo, Japan, Biomolecular Frontiers Research Centre, Macquarie University, Sydney, Australia, Database Center for Life Science, Research Organization of Information and Systems, Chiba, Japan, Institute of Veterinary Physiology and Biochemistry, Justus-Liebig-University Giessen, Giessen, Germany, Graduate School of Medical and Dental Sciences, Niigata University, Niigata, Japan, N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Moscow, Russia and Laboratory of Glyco-organic Chemistry, The Noguchi Institute, Tokyo, Japan
| |
Collapse
|
28
|
Toukach PV, Egorova KS. Bacterial, plant, and fungal carbohydrate structure databases: daily usage. Methods Mol Biol 2015; 1273:55-85. [PMID: 25753703 DOI: 10.1007/978-1-4939-2343-4_5] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Natural carbohydrates play important roles in living systems and therefore are used as diagnostic and therapeutic targets. The main goal of glycomics is systematization of carbohydrates and elucidation of their role in human health and disease. The amount of information on natural carbohydrates accumulates rapidly, but scientists still lack databases and computer-assisted tools needed for orientation in the glycomic information space. Therefore, freely available, regularly updated, and cross-linked databases are demanded. Bacterial Carbohydrate Structure Database (Bacterial CSDB) was developed for provision of structural, bibliographic, taxonomic, NMR spectroscopic, and other related information on bacterial and archaeal carbohydrate structures. Its main features are (1) coverage above 90%, (2) high data consistence (above 90% of error-free records), and (3) presence of manually verified bibliographic, NMR spectroscopic, and taxonomic annotations. Recently, CSDB has been expanded to cover carbohydrates of plant and fungal origin. The achievement of full coverage in the plant and fungal domains is expected in the future. CSDB is freely available on the Internet as a web service at http://csdb.glycoscience.ru. This chapter aims at showing how to use CSDB in your daily scientific practice.
Collapse
Affiliation(s)
- Philip V Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Russian Academy of Sciences, Leninsky Prospekt 47, Moscow, 119991, Russia,
| | | |
Collapse
|
29
|
Miura N, Okada T, Murayama D, Hirose K, Sato T, Hashimoto R, Fukushima N. Functional network in posttranslational modifications: Glyco-Net in Glycoconjugate Data Bank. Methods Mol Biol 2015; 1273:149-157. [PMID: 25753709 DOI: 10.1007/978-1-4939-2343-4_11] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
Elucidating pathways related to posttranslational modifications (PTMs) such as glycosylation is of growing importance in post-genome science and technology. Graphical networks describing the relationships among glycan-related molecules, including genes, proteins, lipids, and various biological events, are considered extremely valuable and convenient tools for the systematic investigation of PTMs. Glyco-Net (http://bibi.sci.hokudai.ac.jp/functions/) can dynamically make network figures among various biological molecules and biological events. A certain molecule or event is expressed with a node, and the relationship between the molecule and the event is indicated by arrows in the network figures. In this chapter, we mention the features and current status of the Glyco-Net and a simple example of the search with the Glyco-Net.
Collapse
Affiliation(s)
- Nobuaki Miura
- Graduate School of Life Sciences, Hokkaido University, 1-204, Open Laboratory in Central Campus, Sapporo, 060-0812, Japan,
| | | | | | | | | | | | | |
Collapse
|
30
|
Pérez S, Sarkar A, Rivet A, Breton C, Imberty A. Glyco3D: a portal for structural glycosciences. Methods Mol Biol 2015; 1273:241-258. [PMID: 25753716 DOI: 10.1007/978-1-4939-2343-4_18] [Citation(s) in RCA: 63] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/04/2023]
Abstract
The present work describes, in a detailed way, a family of databases covering the three-dimensional features of monosaccharides, disaccharides, oligosaccharides, polysaccharides, glycosyltransferases, lectins, monoclonal antibodies against carbohydrates, and glycosaminoglycan-binding proteins. These databases have been developed with non-proprietary software, and they are open freely to the scientific community. They are accessible through the common portal called "Glyco3D" http://www.glyco3d.cermav.cnrs.fr. The databases are accompanied by a user-friendly graphical user interface (GUI) which offers several search options. All three-dimensional structures are available for visual consultations (with basic measurements possibilities) and can be downloaded in commonly used formats for further uses.
Collapse
Affiliation(s)
- Serge Pérez
- Centre de Recherches sur les Macromolécules Végétales, UPR5301, CNRS - Université Grenoble Alpes, BP53, 38041, Grenoble cédex 09, France
| | | | | | | | | |
Collapse
|
31
|
Stockinger H, Altenhoff AM, Arnold K, Bairoch A, Bastian F, Bergmann S, Bougueleret L, Bucher P, Delorenzi M, Lane L, Le Mercier P, Lisacek F, Michielin O, Palagi PM, Rougemont J, Schwede T, von Mering C, van Nimwegen E, Walther D, Xenarios I, Zavolan M, Zdobnov EM, Zoete V, Appel RD. Fifteen years SIB Swiss Institute of Bioinformatics: life science databases, tools and support. Nucleic Acids Res 2014; 42:W436-41. [PMID: 24792157 PMCID: PMC4086091 DOI: 10.1093/nar/gku380] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The SIB Swiss Institute of Bioinformatics (www.isb-sib.ch) was created in 1998 as an institution to foster excellence in bioinformatics. It is renowned worldwide for its databases and software tools, such as UniProtKB/Swiss-Prot, PROSITE, SWISS-MODEL, STRING, etc, that are all accessible on ExPASy.org, SIB's Bioinformatics Resource Portal. This article provides an overview of the scientific and training resources SIB has consistently been offering to the life science community for more than 15 years.
Collapse
Affiliation(s)
- Heinz Stockinger
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland
| | - Adrian M Altenhoff
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland ETH Zurich, Universitätstr. 6, CH-8092 Zurich, Switzerland
| | - Konstantin Arnold
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Basel, CH-4056 Basel, Switzerland
| | - Amos Bairoch
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland University of Geneva, CH-1211 Geneva 4, Switzerland
| | - Frederic Bastian
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Lausanne, CH-1015 Lausanne, Switzerland
| | - Sven Bergmann
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Lausanne, CH-1015 Lausanne, Switzerland
| | - Lydie Bougueleret
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland
| | - Philipp Bucher
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland EPFL, CH-1015 Lausanne, Switzerland
| | - Mauro Delorenzi
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Lausanne, CH-1015 Lausanne, Switzerland
| | - Lydie Lane
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland University of Geneva, CH-1211 Geneva 4, Switzerland
| | | | - Frédérique Lisacek
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland University of Geneva, CH-1211 Geneva 4, Switzerland
| | - Olivier Michielin
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland CHUV, CH-1011 Lausanne, Switzerland
| | - Patricia M Palagi
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland
| | - Jacques Rougemont
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland EPFL, CH-1015 Lausanne, Switzerland
| | - Torsten Schwede
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Basel, CH-4056 Basel, Switzerland
| | - Christian von Mering
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Zurich, CH-8057 Zurich, Switzerland
| | - Erik van Nimwegen
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Basel, CH-4056 Basel, Switzerland
| | - Daniel Walther
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland
| | - Ioannis Xenarios
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland University of Lausanne, CH-1015 Lausanne, Switzerland
| | - Mihaela Zavolan
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Basel, CH-4056 Basel, Switzerland
| | - Evgeny M Zdobnov
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland University of Geneva, CH-1211 Geneva 4, Switzerland
| | - Vincent Zoete
- SIB Swiss Institute of Bioinformatics, CH-1211 Geneva 4, Switzerland
| | - Ron D Appel
- SIB Swiss Institute of Bioinformatics, CH-1015 Lausanne, Switzerland University of Geneva, CH-1211 Geneva 4, Switzerland
| |
Collapse
|
32
|
Baycin Hizal D, Wolozny D, Colao J, Jacobson E, Tian Y, Krag SS, Betenbaugh MJ, Zhang H. Glycoproteomic and glycomic databases. Clin Proteomics 2014; 11:15. [PMID: 24725457 PMCID: PMC3996109 DOI: 10.1186/1559-0275-11-15] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/01/2013] [Accepted: 02/20/2014] [Indexed: 11/17/2022] Open
Abstract
Protein glycosylation serves critical roles in the cellular and biological processes of many organisms. Aberrant glycosylation has been associated with many illnesses such as hereditary and chronic diseases like cancer, cardiovascular diseases, neurological disorders, and immunological disorders. Emerging mass spectrometry (MS) technologies that enable the high-throughput identification of glycoproteins and glycans have accelerated the analysis and made possible the creation of dynamic and expanding databases. Although glycosylation-related databases have been established by many laboratories and institutions, they are not yet widely known in the community. Our study reviews 15 different publicly available databases and identifies their key elements so that users can identify the most applicable platform for their analytical needs. These databases include biological information on the experimentally identified glycans and glycopeptides from various cells and organisms such as human, rat, mouse, fly and zebrafish. The features of these databases - 7 for glycoproteomic data, 6 for glycomic data, and 2 for glycan binding proteins are summarized including the enrichment techniques that are used for glycoproteome and glycan identification. Furthermore databases such as Unipep, GlycoFly, GlycoFish recently established by our group are introduced. The unique features of each database, such as the analytical methods used and bioinformatical tools available are summarized. This information will be a valuable resource for the glycobiology community as it presents the analytical methods and glycosylation related databases together in one compendium. It will also represent a step towards the desired long term goal of integrating the different databases of glycosylation in order to characterize and categorize glycoproteins and glycans better for biomedical research.
Collapse
Affiliation(s)
- Deniz Baycin Hizal
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Daniel Wolozny
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Joseph Colao
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Elena Jacobson
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Yuan Tian
- Department of Pathology, Johns Hopkins University, Baltimore, MD, USA
| | - Sharon S Krag
- Department of Biochemistry and Molecular Biology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, USA
| | - Michael J Betenbaugh
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD, USA
| | - Hui Zhang
- Department of Pathology, Johns Hopkins University, Baltimore, MD, USA
| |
Collapse
|
33
|
Tang H, Mayampurath A, Yu C, Mechref Y. Bioinformatics Protocols in Glycomics and Glycoproteomics. ACTA ACUST UNITED AC 2014; 76:2.15.1-2.15.7. [DOI: 10.1002/0471140864.ps0215s76] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Affiliation(s)
- Haixu Tang
- School of Informatics and Computing, Indiana University Bloomington Indiana
| | - Anoop Mayampurath
- School of Informatics and Computing, Indiana University Bloomington Indiana
| | - Chuan‐Yih Yu
- School of Informatics and Computing, Indiana University Bloomington Indiana
| | - Yehia Mechref
- Department of Chemistry and Biochemistry, Texas Tech University Lubbock Texas
| |
Collapse
|
34
|
Sobolev BN, Veselovsky AV, Poroikov VV. Prediction of protein post-translational modifications: main trends and methods. RUSSIAN CHEMICAL REVIEWS 2014. [DOI: 10.1070/rc2014v083n02abeh004377] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/08/2022]
|
35
|
Campbell MP, Ranzinger R, Lütteke T, Mariethoz J, Hayes CA, Zhang J, Akune Y, Aoki-Kinoshita KF, Damerell D, Carta G, York WS, Haslam SM, Narimatsu H, Rudd PM, Karlsson NG, Packer NH, Lisacek F. Toolboxes for a standardised and systematic study of glycans. BMC Bioinformatics 2014; 15 Suppl 1:S9. [PMID: 24564482 PMCID: PMC4016020 DOI: 10.1186/1471-2105-15-s1-s9] [Citation(s) in RCA: 58] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023] Open
Abstract
Background Recent progress in method development for characterising the branched structures of complex carbohydrates has now enabled higher throughput technology. Automation of structure analysis then calls for software development since adding meaning to large data collections in reasonable time requires corresponding bioinformatics methods and tools. Current glycobioinformatics resources do cover information on the structure and function of glycans, their interaction with proteins or their enzymatic synthesis. However, this information is partial, scattered and often difficult to find to for non-glycobiologists. Methods Following our diagnosis of the causes of the slow development of glycobioinformatics, we review the "objective" difficulties encountered in defining adequate formats for representing complex entities and developing efficient analysis software. Results Various solutions already implemented and strategies defined to bridge glycobiology with different fields and integrate the heterogeneous glyco-related information are presented. Conclusions Despite the initial stage of our integrative efforts, this paper highlights the rapid expansion of glycomics, the validity of existing resources and the bright future of glycobioinformatics.
Collapse
|
36
|
Abstract
Proteomics has rapidly become an important tool for life science research, allowing the integrated analysis of global protein expression from a single experiment. To accommodate the complexity and dynamic nature of any proteome, researchers must use a combination of disparate protein biochemistry techniques, often a highly involved and time-consuming process. Whilst highly sophisticated, individual technologies for each step in studying a proteome are available, true high-throughput proteomics that provides a high degree of reproducibility and sensitivity has been difficult to achieve. The development of high-throughput proteomic platforms, encompassing all aspects of proteome analysis and integrated with genomics and bioinformatics technology, therefore represents a crucial step for the advancement of proteomics research. ProteomIQ (Proteome Systems) is the first fully integrated, start-to-finish proteomics platform to enter the market. Sample preparation and tracking, centralized data acquisition and instrument control, and direct interfacing with genomics and bioinformatics databases are combined into a single suite of integrated hardware and software tools, facilitating high reproducibility and rapid turnaround times. This review will highlight some features of ProteomIQ, with particular emphasis on the analysis of proteins separated by 2D polyacrylamide gel electrophoresis.
Collapse
Affiliation(s)
- Andrew N Stephens
- University of Sydney, Department of Molecular & Microbial Biosciences, NSW, Australia.
| | | | | |
Collapse
|
37
|
Lichti CF, Wildburger NC, Emmett MR, Mostovenko E, Shavkunov AS, Strain SK, Nilsson CL. Post-translational Modifications in the Human Proteome. TRANSLATIONAL BIOINFORMATICS 2014. [DOI: 10.1007/978-94-017-9202-8_6] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
38
|
Mayampurath A, Yu CY, Song E, Balan J, Mechref Y, Tang H. Computational framework for identification of intact glycopeptides in complex samples. Anal Chem 2013; 86:453-63. [PMID: 24279413 DOI: 10.1021/ac402338u] [Citation(s) in RCA: 79] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Glycosylation is an important protein modification that involves enzymatic attachment of sugars to amino acid residues. Understanding the structure of these sugars and the effects of glycosylation are vital for developing indicators of disease development and progression. Although computational methods based on mass spectrometric data have proven to be effective in monitoring changes in the glycome, developing such methods for the glycoproteome are challenging, largely due to the inherent complexity in simultaneously studying glycan structures with their corresponding glycosylation sites. This paper introduces a computational framework for identifying intact N-linked glycopeptides, i.e. glycopeptides with N-linked glycans attached to their glycosylation sites, in complex proteome samples. Scoring algorithms are presented for tandem mass spectra of glycopeptides resulting from collision-induced dissociation (CID), higher-energy C-trap dissociation (HCD), and electron transfer dissociation (ETD) fragmentation modes. An empirical false-discovery rate estimation method, based on a target-decoy search approach, is derived for assigning confidence. The power of our method is further enhanced when multiple data sets are pooled together to increase identification confidence. Using this framework, 103 highly confident N-linked glycopeptides from 53 sites across 33 glycoproteins were identified in complex human serum proteome samples using conventional proteomic platforms with standard depletion of the 7-most abundant proteins. These results indicate that our method is ready to be used for characterizing site-specific protein glycosylation in complex samples.
Collapse
Affiliation(s)
- Anoop Mayampurath
- School of Informatics & Computing, Indiana University , Bloomington, Indiana 47408, United States
| | | | | | | | | | | |
Collapse
|
39
|
Gusakov AV, Sinitsyna OA, Rozhkova AM, Sinitsyn AP. N-Glycosylation patterns in two α-l-arabinofuranosidases from Penicillium canescens belonging to the glycoside hydrolase families 51 and 54. Carbohydr Res 2013; 382:71-6. [DOI: 10.1016/j.carres.2013.08.026] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/22/2013] [Revised: 08/15/2013] [Accepted: 08/28/2013] [Indexed: 10/26/2022]
|
40
|
Park HM, Park JH, Kim YW, Kim KJ, Jeong HJ, Jang KS, Kim BG, Kim YG. The Xeno-glycomics database (XDB): a relational database of qualitative and quantitative pig glycome repertoire. Bioinformatics 2013; 29:2950-2. [PMID: 24013926 DOI: 10.1093/bioinformatics/btt504] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
SUMMARY In recent years, the improvement of mass spectrometry-based glycomics techniques (i.e. highly sensitive, quantitative and high-throughput analytical tools) has enabled us to obtain a large dataset of glycans. Here we present a database named Xeno-glycomics database (XDB) that contains cell- or tissue-specific pig glycomes analyzed with mass spectrometry-based techniques, including a comprehensive pig glycan information on chemical structures, mass values, types and relative quantities. It was designed as a user-friendly web-based interface that allows users to query the database according to pig tissue/cell types or glycan masses. This database will contribute in providing qualitative and quantitative information on glycomes characterized from various pig cells/organs in xenotransplantation and might eventually provide new targets in the α1,3-galactosyltransferase gene-knock out pigs era. AVAILABILITY The database can be accessed on the web at http://bioinformatics.snu.ac.kr/xdb.
Collapse
Affiliation(s)
- Hae-Min Park
- School of Chemical and Biological Engineering and Department of Computer Science and Engineering, Seoul National University, Seoul 151-742, Korea Department of Chemical Engineering, Soongsil University, Seoul 156-743, Korea Institute of Molecular Biology and Genetics and Institute of Bioengineering, Seoul National University, Seoul 151-742, Korea
| | | | | | | | | | | | | | | |
Collapse
|
41
|
Campbell MP, Peterson R, Mariethoz J, Gasteiger E, Akune Y, Aoki-Kinoshita KF, Lisacek F, Packer NH. UniCarbKB: building a knowledge platform for glycoproteomics. Nucleic Acids Res 2013; 42:D215-21. [PMID: 24234447 PMCID: PMC3964942 DOI: 10.1093/nar/gkt1128] [Citation(s) in RCA: 129] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
Abstract
The UniCarb KnowledgeBase (UniCarbKB; http://unicarbkb.org) offers public access to a growing, curated database of information on the glycan structures of glycoproteins. UniCarbKB is an international effort that aims to further our understanding of structures, pathways and networks involved in glycosylation and glyco-mediated processes by integrating structural, experimental and functional glycoscience information. This initiative builds upon the success of the glycan structure database GlycoSuiteDB, together with the informatic standards introduced by EUROCarbDB, to provide a high-quality and updated resource to support glycomics and glycoproteomics research. UniCarbKB provides comprehensive information concerning glycan structures, and published glycoprotein information including global and site-specific attachment information. For the first release over 890 references, 3740 glycan structure entries and 400 glycoproteins have been curated. Further, 598 protein glycosylation sites have been annotated with experimentally confirmed glycan structures from the literature. Among these are 35 glycoproteins, 502 structures and 60 publications previously not included in GlycoSuiteDB. This article provides an update on the transformation of GlycoSuiteDB (featured in previous NAR Database issues and hosted by ExPASy since 2009) to UniCarbKB and its integration with UniProtKB and GlycoMod. Here, we introduce a refactored database, supported by substantial new curated data collections and intuitive user-interfaces that improve database searching.
Collapse
Affiliation(s)
- Matthew P Campbell
- Biomolecular Frontiers Research Centre, Macquarie University, North Ryde, NSW 2109, Australia, Proteome Informatics Group, Swiss Institute of Bioinformatics, Geneva, Switzerland, Swiss-Prot Group, Swiss Institute of Bioinformatics, Geneva, Switzerland, Department of Bioinformatics, Faculty of Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo, Japan and Section of Biology, Faculty of Sciences, University of Geneva, Switzerland
| | | | | | | | | | | | | | | |
Collapse
|
42
|
An Y, Rininger JA, Jarvis DL, Jing X, Ye Z, Aumiller JJ, Eichelberger M, Cipollo JF. Comparative glycomics analysis of influenza Hemagglutinin (H5N1) produced in vaccine relevant cell platforms. J Proteome Res 2013; 12:3707-20. [PMID: 23848607 DOI: 10.1021/pr400329k] [Citation(s) in RCA: 51] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
Hemagglutinin (HA) is the major antigen in influenza vaccines, and glycosylation is known to influence its antigenicity. Embryonated hen eggs are traditionally used for influenza vaccine production, but vaccines produced in mammalian and insect cells were recently licensed. This raises the concern that vaccines produced with different cell systems might not be equivalent due to differences in their glycosylation patterns. Thus, we developed an analytical method to monitor vaccine glycosylation through a combination of nanoLC/MS(E) and quantitative MALDI-TOF MS permethylation profiling. We then used this method to examine glycosylation of HAs from two different influenza H5N1 strains produced in five different platforms, including hen eggs, three different insect cell lines (High Five, expresSF+ and glycoengineered expresSF+), and a human cell line (HEK293). Our results demonstrated that (1) sequon utilization is not necessarily equivalent in different cell types, (2) there are quantitative and qualitative differences in the overall N-glycosylation patterns and structures produced by different cell types, (3) ∼20% of the N-glycans on the HAs produced by High Five cells are core α1,3-fucosylated structures, which may be allergenic in humans, and (4) our method can be used to monitor differences in glycosylation during the cellular glycoengineering stages of vaccine development.
Collapse
Affiliation(s)
- Yanming An
- Center for Biologics Evaluation and Research, Food and Drug Administration, Bethesda, Maryland 20892, USA
| | | | | | | | | | | | | | | |
Collapse
|
43
|
Shakhsheer B, Anderson M, Khatib K, Tadoori L, Joshi L, Lisacek F, Hirschman L, Mullen E. SugarBind database (SugarBindDB): a resource of pathogen lectins and corresponding glycan targets. J Mol Recognit 2013; 26:426-31. [DOI: 10.1002/jmr.2285] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2013] [Revised: 05/03/2013] [Accepted: 05/05/2013] [Indexed: 11/05/2022]
Affiliation(s)
- Baddr Shakhsheer
- Department of Surgery; University of Chicago Medical Center; Chicago; IL; USA
| | | | | | | | | | | | | | | |
Collapse
|
44
|
Aoki-Kinoshita KF. Using databases and web resources for glycomics research. Mol Cell Proteomics 2013; 12:1036-45. [PMID: 23325765 DOI: 10.1074/mcp.r112.026252] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Many databases of carbohydrate structures and related information can be found on the World Wide Web. This review covers the major carbohydrate databases that have potential utility for glycoscientists and researchers entering the glycosciences. The first half provides a brief overview of carbohydrate databases and web resources (including a history of carbohydrate databases and carbohydrate notations used in these databases), and the second half provides a guide that can be used as an index to determine which resources provide the data of most interest to the user.
Collapse
Affiliation(s)
- Kiyoko F Aoki-Kinoshita
- Department of Bioinformatics, Faculty of Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo, Japan.
| |
Collapse
|
45
|
Kronewitter SR, De Leoz MLA, Strum JS, An HJ, Dimapasoc LM, Guerrero A, Miyamoto S, Lebrilla CB, Leiserowitz GS. The glycolyzer: automated glycan annotation software for high performance mass spectrometry and its application to ovarian cancer glycan biomarker discovery. Proteomics 2012; 12:2523-38. [PMID: 22903841 DOI: 10.1002/pmic.201100273] [Citation(s) in RCA: 42] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/25/2022]
Abstract
Human serum glycomics is a promising method for finding cancer biomarkers but often lacks the tools for streamlined data analysis. The Glycolyzer software incorporates a suite of analytic tools capable of identifying informative glycan peaks out of raw mass spectrometry data. As a demonstration of its utility, the program was used to identify putative biomarkers for epithelial ovarian cancer from a human serum sample set. A randomized, blocked, and blinded experimental design was used on a discovery set consisting of 46 cases and 48 controls. Retrosynthetic glycan libraries were used for data analysis and several significant candidate glycan biomarkers were discovered via hypothesis testing. The significant glycans were attributed to a glycan family based on glycan composition relationships and incorporated into a linear classifier motif test. The motif test was then applied to the discovery set to evaluate the disease state discrimination performance. The test provided strongly predictive results based on receiver operator characteristic curve analysis. The area under the receiver operator characteristic curve was 0.93. Using the Glycolyzer software, we were able to identify a set of glycan biomarkers that highly discriminate between cases and controls, and are ready to be formally validated in subsequent studies.
Collapse
|
46
|
Bräutigam J, Scheidig AJ, Egge-Jacobsen W. Mass spectrometric analysis of hepatitis C viral envelope protein E2 reveals extended microheterogeneity of mucin-type O-linked glycosylation. Glycobiology 2012; 23:453-74. [PMID: 23242014 DOI: 10.1093/glycob/cws171] [Citation(s) in RCA: 24] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The infectious liver disease hepatitis C is caused by the small, enveloped, positive single-strand RNA hepatitis C virus (HCV). The HCV genome encodes for a single polyprotein precursor of ∼3010 amino acid residues. Host and cellular proteases co- and posttranslational process the precursor creating six nonstructural (NS) proteins and four structural components. Properly folded forms of the envelope proteins E1 and E2 form the associated E1-E2 complex. This complex represents a significant antigenic component at the viral surface that can interact with several target cell receptors. Extent and type of glycosylation is an important factor for virulence and escape from the immune system. Detailed characterization of the glycosylated sites is helpful for the understanding of different phenotypes as well as for the development of E1/E2-related treatments of HCV infection. Here, we have investigated in detail the O-linked glycosylation of the HCV envelope protein E2 expressed in and isolated from human embryonic kidney (HEK 293) cells. Using nano-liquid chromatography and tandem mass spectrometry approaches, we clearly have identified six residues for O-linked glycosylation within isolated glycopeptides (Ser393, Thr396, Ser401, Ser404, Thr473 and Thr518), carrying mainly Core 1 and Core 2 mucin-type structures. Based on our data, Thr385 is probably glycosylated as well. In addition, we could show that Ser479 within the hyper variable region (HVR) I is not O-glycosylated. For most of these sites, different degrees of microheterogeneity could be verified. Concerning HCV E2, this is the first case of experimentally proven O-linked glycosylation in detail via mass spectrometry.
Collapse
Affiliation(s)
- Joachim Bräutigam
- Department of Structural Biology, Centre for Biochemistry and Molecular Biology, Christian-Albrechts Universität, 24118 Kiel, Germany
| | | | | |
Collapse
|
47
|
Affiliation(s)
- K. S. Egorova
- N.D. Zelinsky Institute of Organic Chemistry, Leninsky prospekt 47, 119991 Moscow,
Russian Federation
| | - Ph. V. Toukach
- N.D. Zelinsky Institute of Organic Chemistry, Leninsky prospekt 47, 119991 Moscow,
Russian Federation
| |
Collapse
|
48
|
Xu G, Liu X, Liu QY, Zhou Y, Li J. Improve accuracy and sensibility in glycan structure prediction by matching glycan isotope abundance. Anal Chim Acta 2012; 743:80-9. [DOI: 10.1016/j.aca.2012.07.009] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/19/2012] [Revised: 06/13/2012] [Accepted: 07/10/2012] [Indexed: 11/30/2022]
|
49
|
Lütteke T. The use of glycoinformatics in glycochemistry. Beilstein J Org Chem 2012; 8:915-29. [PMID: 23015842 PMCID: PMC3388882 DOI: 10.3762/bjoc.8.104] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/29/2012] [Indexed: 01/10/2023] Open
Abstract
Glycoinformatics is a small but growing branch of bioinformatics and chemoinformatics. Various resources are now available that can be of use to glycobiologists, but also to chemists who work on the synthesis or analysis of carbohydrates. This article gives an overview of existing glyco-specific databases and tools, with a focus on their application to glycochemistry: Databases can provide information on candidate glycan structures for synthesis, or on glyco-enzymes that can be used to synthesize carbohydrates. Statistical analyses of glycan databases help to plan glycan synthesis experiments. 3D-Structural data of protein-carbohydrate complexes are used in targeted drug design, and tools to support glycan structure analysis aid with quality control. Specific problems of glycoinformatics compared to bioinformatics for genomics or proteomics, especially concerning integration and long-term maintenance of the existing glycan databases, are also discussed.
Collapse
Affiliation(s)
- Thomas Lütteke
- Justus-Liebig-University Gießen, Institute of Veterinary Physiology and Biochemistry, Frankfurter Str. 100, 35392 Gießen, Germany
| |
Collapse
|
50
|
Hart-Smith G, Raftery MJ. Detection and characterization of low abundance glycopeptides via higher-energy C-trap dissociation and orbitrap mass analysis. JOURNAL OF THE AMERICAN SOCIETY FOR MASS SPECTROMETRY 2012; 23:124-140. [PMID: 22083589 DOI: 10.1007/s13361-011-0273-y] [Citation(s) in RCA: 65] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/27/2011] [Revised: 10/05/2011] [Accepted: 10/06/2011] [Indexed: 05/31/2023]
Abstract
Broad-scale mass spectrometric analyses of glycopeptides are constrained by the considerable complexity inherent to glycoproteomics, and techniques are still being actively developed to address the associated analytical difficulties. Here we apply Orbitrap mass analysis and higher-energy C-trap dissociation (HCD) to facilitate detailed insights into the compositions and heterogeneity of complex mixtures of low abundance glycopeptides. By generating diagnostic oxonium product ions at mass measurement errors of <5 ppm, highly selective glycopeptide precursor ion detections are made at sub-fmol limits of detection: analyses of proteolytic digests of a hen egg glycoprotein mixture detect 88 previously uncharacterized glycopeptides from 666 precursor ions selected for MS/MS, with only one false positive due to co-fragmentation of a non-glycosylated peptide with a glycopeptide. We also demonstrate that by (1) identifying multiple series of glycoforms using high mass accuracy single stage MS spectra, and (2) performing product ion scans at optimized HCD collision energies, the identification of peptide + N-acetylhexosamine (HexNAc) ions (Y1 ions) can be readily achieved at <5 ppm mass measurement errors. These data allow base peptide sequences and glycan compositional information to be attained with high confidence, even for glycopeptides that produce weak precursor ion signals and/or low quality MS/MS spectra. The glycopeptides characterized from low fmol abundances using these methods allow two previously unreported glycosylation sites on the Gallus gallus protein ovoglycoprotein (amino acids 82 and 90) to be confirmed; considerable glycan heterogeneities at amino acid 90 of ovoglycoprotein, and amino acids 34 and 77 of Gallus gallus ovomucoid are also revealed.
Collapse
Affiliation(s)
- Gene Hart-Smith
- NSW Systems Biology Initiative, University of New South Wales, Sydney, New South Wales 2052, Australia.
| | | |
Collapse
|