1
|
Akune-Taylor Y, Kon A, Aoki-Kinoshita KF. In silico simulation of glycosylation and related pathways. Anal Bioanal Chem 2024; 416:3687-3696. [PMID: 38748247 PMCID: PMC11180631 DOI: 10.1007/s00216-024-05331-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2023] [Revised: 04/30/2024] [Accepted: 05/02/2024] [Indexed: 06/18/2024]
Abstract
Glycans participate in a vast number of recognition systems in diverse organisms in health and in disease. However, glycans cannot be sequenced because there is no sequencer technology that can fully characterize them. There is no "template" for replicating glycans as there are for amino acids and nucleic acids. Instead, glycans are synthesized by a complicated orchestration of multitudes of glycosyltransferases and glycosidases. Thus glycans can vary greatly in structure, but they are not genetically reproducible and are usually isolated in minute amounts. To characterize (sequence) the glycome (defined as the glycans in a particular organism, tissue, cell, or protein), glycosylation pathway prediction using in silico methods based on glycogene expression data, and glycosylation simulations have been attempted. Since many of the mammalian glycogenes have been identified and cloned, it has become possible to predict the glycan biosynthesis pathway in these systems. By then incorporating systems biology and bioprocessing technologies to these pathway models, given the right enzymatic parameters including enzyme and substrate concentrations and kinetic reaction parameters, it is possible to predict the potentially synthesized glycans in the pathway. This review presents information on the data resources that are currently available to enable in silico simulations of glycosylation and related pathways. Then some of the software tools that have been developed in the past to simulate and analyze glycosylation pathways will be described, followed by a summary and vision for the future developments and research directions in this area.
Collapse
Affiliation(s)
- Yukie Akune-Taylor
- Glycan and Life Systems Integration Center, Soka University, Tokyo, Japan
| | - Akane Kon
- Graduate School of Science and Engineering, Soka University, Tokyo, Japan
| | - Kiyoko F Aoki-Kinoshita
- Glycan and Life Systems Integration Center, Soka University, Tokyo, Japan.
- Graduate School of Science and Engineering, Soka University, Tokyo, Japan.
- iGCORE, Nagoya University, Nagoya, Japan.
| |
Collapse
|
2
|
Liang C, Chiang AWT, Lewis NE. GlycoMME, a Markov modeling platform for studying N-glycosylation biosynthesis from glycomics data. STAR Protoc 2023; 4:102244. [PMID: 37086409 PMCID: PMC10160804 DOI: 10.1016/j.xpro.2023.102244] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/26/2023] [Revised: 03/07/2023] [Accepted: 03/24/2023] [Indexed: 04/23/2023] Open
Abstract
Variations in N-glycosylation, which is crucial to glycoprotein functions, impact many diseases and the safety and efficacy of biotherapeutic drugs. Here, we present a protocol for using GlycoMME (Glycosylation Markov Model Evaluator) to study N-glycosylation biosynthesis from glycomics data. We describe steps for annotating glycomics data and quantifying perturbations to N-glycan biosynthesis with interpretable models. We then detail procedures to predict the impact of mutations in disease or potential glycoengineering strategies in drug development. For complete details on the use and execution of this protocol, please refer to Liang et al. (2020).1.
Collapse
Affiliation(s)
- Chenguang Liang
- Department of Pediatrics, University of California, San Diego, La Jolla, San Diego, CA 92130, USA; Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92130, USA
| | - Austin W T Chiang
- Department of Pediatrics, University of California, San Diego, La Jolla, San Diego, CA 92130, USA.
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, San Diego, CA 92130, USA; Department of Bioengineering, University of California, San Diego, La Jolla, San Diego, CA 92130, USA.
| |
Collapse
|
3
|
Groth T, Diehl AD, Gunawan R, Neelamegham S. GlycoEnzOnto: a GlycoEnzyme pathway and molecular function ontology. Bioinformatics 2022; 38:5413-5420. [PMID: 36282863 PMCID: PMC9750110 DOI: 10.1093/bioinformatics/btac704] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/24/2022] [Revised: 08/22/2022] [Accepted: 10/24/2022] [Indexed: 12/25/2022] Open
Abstract
MOTIVATION The 'glycoEnzymes' include a set of proteins having related enzymatic, metabolic, transport, structural and cofactor functions. Currently, there is no established ontology to describe glycoEnzyme properties and to relate them to glycan biosynthesis pathways. RESULTS We present GlycoEnzOnto, an ontology describing 403 human glycoEnzymes curated along 139 glycosylation pathways, 134 molecular functions and 22 cellular compartments. The pathways described regulate nucleotide-sugar metabolism, glycosyl-substrate/donor transport, glycan biosynthesis and degradation. The role of each enzyme in the glycosylation initiation, elongation/branching and capping/termination phases is described. IUPAC linear strings present systematic human/machine-readable descriptions of individual reaction steps and enable automated knowledge-based curation of biochemical networks. All GlycoEnzOnto knowledge is integrated with the Gene Ontology biological processes. GlycoEnzOnto enables improved transcript overrepresentation analyses and glycosylation pathway identification compared to other available schema, e.g. KEGG and Reactome. Overall, GlycoEnzOnto represents a holistic glycoinformatics resource for systems-level analyses. AVAILABILITY AND IMPLEMENTATION https://github.com/neel-lab/GlycoEnzOnto. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Theodore Groth
- Department of Chemical and Biological Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
| | - Alexander D Diehl
- Department of Biomedical Informatics, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
| | - Rudiyanto Gunawan
- Department of Chemical and Biological Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
| | - Sriram Neelamegham
- Department of Chemical and Biological Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
- Department of Biomedical Engineering, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
- Department of Medicine, University at Buffalo, State University of New York, Buffalo, NY 14260, USA
| |
Collapse
|
4
|
Sasmal A, Khan N, Khedri Z, Kellman BP, Srivastava S, Verhagen A, Yu H, Bruntse AB, Diaz S, Varki N, Beddoe T, Paton AW, Paton JC, Chen X, Lewis NE, Varki A. Simple and practical sialoglycan encoding system reveals vast diversity in nature and identifies a universal sialoglycan-recognizing probe derived from AB5 toxin B subunits. Glycobiology 2022; 32:1101-1115. [PMID: 36048714 PMCID: PMC9680115 DOI: 10.1093/glycob/cwac057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2022] [Revised: 08/20/2022] [Accepted: 08/23/2022] [Indexed: 01/07/2023] Open
Abstract
Vertebrate sialic acids (Sias) display much diversity in modifications, linkages, and underlying glycans. Slide microarrays allow high-throughput explorations of sialoglycan-protein interactions. A microarray presenting ~150 structurally defined sialyltrisaccharides with various Sias linkages and modifications still poses challenges in planning, data sorting, visualization, and analysis. To address these issues, we devised a simple 9-digit code for sialyltrisaccharides with terminal Sias and underlying two monosaccharides assigned from the nonreducing end, with 3 digits assigning a monosaccharide, its modifications, and linkage. Calculations based on the encoding system reveal >113,000 likely linear sialyltrisaccharides in nature. Notably, a biantennary N-glycan with 2 terminal sialyltrisaccharides could thus have >1010 potential combinations and a triantennary N-glycan with 3 terminal sequences, >1015 potential combinations. While all possibilities likely do not exist in nature, sialoglycans encode enormous diversity. While glycomic approaches are used to probe such diverse sialomes, naturally occurring bacterial AB5 toxin B subunits are simpler tools to track the dynamic sialome in biological systems. Sialoglycan microarray was utilized to compare sialoglycan-recognizing bacterial toxin B subunits. Unlike the poor correlation between B subunits and species phylogeny, there is stronger correlation with Sia-epitope preferences. Further supporting this pattern, we report a B subunit (YenB) from Yersinia enterocolitica (broad host range) recognizing almost all sialoglycans in the microarray, including 4-O-acetylated-Sias not recognized by a Yersinia pestis orthologue (YpeB). Differential Sia-binding patterns were also observed with phylogenetically related B subunits from Escherichia coli (SubB), Salmonella Typhi (PltB), Salmonella Typhimurium (ArtB), extra-intestinal E.coli (EcPltB), Vibrio cholera (CtxB), and cholera family homologue of E. coli (EcxB).
Collapse
Affiliation(s)
- Aniruddha Sasmal
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Naazneen Khan
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Zahra Khedri
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Benjamin P Kellman
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Saurabh Srivastava
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Andrea Verhagen
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Hai Yu
- Department of Chemistry, University of California Davis, CA 95616, USA
| | - Anders Bech Bruntse
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Sandra Diaz
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Nissi Varki
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Travis Beddoe
- Department of Animal, Plant and Soil Science and Centre for AgriBioscience, La Trobe University, Bundoora, Victoria 3086, Australia
| | - Adrienne W Paton
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
| | - James C Paton
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
| | - Xi Chen
- Department of Chemistry, University of California Davis, CA 95616, USA
| | - Nathan E Lewis
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Ajit Varki
- Glycobiology Research and Training Center, University of California San Diego, La Jolla, CA 92093, USA
- Department of Cellular and Molecular Medicine, University of California San Diego, La Jolla, CA 92093, USA
- Research Centre for Infectious Diseases, Department of Molecular and Biomedical Science, University of Adelaide, Adelaide, SA 5005, Australia
- Department of Medicine, University of California at San Diego, La Jolla, CA 92093, USA
| |
Collapse
|
5
|
Kellman BP, Richelle A, Yang JY, Chapla D, Chiang AWT, Najera JA, Liang C, Fürst A, Bao B, Koga N, Mohammad MA, Bruntse AB, Haymond MW, Moremen KW, Bode L, Lewis NE. Elucidating Human Milk Oligosaccharide biosynthetic genes through network-based multi-omics integration. Nat Commun 2022; 13:2455. [PMID: 35508452 PMCID: PMC9068700 DOI: 10.1038/s41467-022-29867-4] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/11/2020] [Accepted: 04/04/2022] [Indexed: 12/18/2022] Open
Abstract
Human Milk Oligosaccharides (HMOs) are abundant carbohydrates fundamental to infant health and development. Although these oligosaccharides were discovered more than half a century ago, their biosynthesis in the mammary gland remains largely uncharacterized. Here, we use a systems biology framework that integrates glycan and RNA expression data to construct an HMO biosynthetic network and predict glycosyltransferases involved. To accomplish this, we construct models describing the most likely pathways for the synthesis of the oligosaccharides accounting for >95% of the HMO content in human milk. Through our models, we propose candidate genes for elongation, branching, fucosylation, and sialylation of HMOs. Our model aggregation approach recovers 2 of 2 previously known gene-enzyme relations and 2 of 3 empirically confirmed gene-enzyme relations. The top genes we propose for the remaining 5 linkage reactions are consistent with previously published literature. These results provide the molecular basis of HMO biosynthesis necessary to guide progress in HMO research and application with the goal of understanding and improving infant health and development.
Collapse
Affiliation(s)
- Benjamin P Kellman
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Anne Richelle
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Jeong-Yeh Yang
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Digantkumar Chapla
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Austin W T Chiang
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Julia A Najera
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Chenguang Liang
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Annalee Fürst
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Bokan Bao
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
- Bioinformatics and Systems Biology Graduate Program, University of California, San Diego, La Jolla, CA, 92093, USA
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Natalia Koga
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Mahmoud A Mohammad
- Department of Pediatrics, Children's Nutrition Research Center, US Department of Agriculture/Agricultural Research Service, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Anders Bech Bruntse
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Morey W Haymond
- Department of Pediatrics, Children's Nutrition Research Center, US Department of Agriculture/Agricultural Research Service, Baylor College of Medicine, Houston, TX, 77030, USA
| | - Kelley W Moremen
- Complex Carbohydrate Research Center, University of Georgia, Athens, GA, USA
| | - Lars Bode
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA
- Larsson-Rosenquist Foundation Mother-Milk-Infant Center of Research Excellence (MOMI CORE), University of California, San Diego, La Jolla, CA, 92093, USA
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA, 92093, USA.
- Department of Bioengineering, University of California, San Diego, La Jolla, CA, 92093, USA.
| |
Collapse
|
6
|
Aoki-Kinoshita KF. Functions of Glycosylation and Related Web Resources for Its Prediction. Methods Mol Biol 2022; 2499:135-144. [PMID: 35696078 DOI: 10.1007/978-1-0716-2317-6_6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/15/2023]
Abstract
Glycosylation involves the attachment of carbohydrate sugar chains, or glycans, onto an amino acid residue of a protein. These glycans are often branched structures and serve to modulate the function of proteins. Glycans are synthesized through a complex process of enzymatic reactions that occur in the Golgi apparatus in mammalian systems. Because there is currently no sequencer for glycans, technologies such as mass spectrometry is used to characterize glycans in a biological sample to ascertain its glycome. This is a tedious process that requires high levels of expertise and equipment. Thus, the enzymes that work on glycans, called glycogenes or glycoenzymes, have been studied to better understand glycan function. With the development of glycan-related databases and a glycan repository, bioinformatics approaches have attempted to predict the glycosylation pathway and the glycosylation sites on proteins. This chapter introduces these methods and related Web resources for understanding glycan function.
Collapse
|
7
|
Aoki-Kinoshita KF, Lisacek F, Karlsson N, Kolarich D, Packer NH. GlycoBioinformatics. Beilstein J Org Chem 2021; 17:2726-2728. [PMID: 34858527 PMCID: PMC8593694 DOI: 10.3762/bjoc.17.184] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/19/2021] [Accepted: 10/27/2021] [Indexed: 11/23/2022] Open
Affiliation(s)
- Kiyoko F Aoki-Kinoshita
- Faculty of Science and Engineering, Soka University, 1-236 Tangi-machi, Hachioji-shi, Tokyo, Japan
| | - Frédérique Lisacek
- University of Geneva and Swiss Institute of Bioinformatics, CUI - 7, route de Drize, 1211 Geneva, Switzerland
| | - Niclas Karlsson
- Department of Medical Biochemistry and Cell Biology, University of Gothenburg, Box 440, 40530 Gothenburg, Sweden.,Faculty of Health Sciences, Department of Life Sciences and Health, Pharmacy, Oslo Metropolitan University, 0167 Oslo, Norway
| | - Daniel Kolarich
- Griffith University, Gold Coast Campus, Southport, Queensland 4222, Australia
| | - Nicolle H Packer
- Department of Molecular Sciences, Macquarie University, Sydney, New South Wales, Australia
| |
Collapse
|
8
|
Aoki-Kinoshita KF. Glycome informatics: using systems biology to gain mechanistic insights into glycan biosynthesis. Curr Opin Chem Eng 2021. [DOI: 10.1016/j.coche.2021.100683] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
|
9
|
Kellman BP, Lewis NE. Big-Data Glycomics: Tools to Connect Glycan Biosynthesis to Extracellular Communication. Trends Biochem Sci 2021; 46:284-300. [PMID: 33349503 PMCID: PMC7954846 DOI: 10.1016/j.tibs.2020.10.004] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/04/2020] [Revised: 10/05/2020] [Accepted: 10/22/2020] [Indexed: 12/12/2022]
Abstract
Characteristically, cells must sense and respond to environmental cues. Despite the importance of cell-cell communication, our understanding remains limited and often lacks glycans. Glycans decorate proteins and cell membranes at the cell-environment interface, and modulate intercellular communication, from development to pathogenesis. Providing further challenges, glycan biosynthesis and cellular behavior are co-regulating systems. Here, we discuss how glycosylation contributes to extracellular responses and signaling. We further organize approaches for disentangling the roles of glycans in multicellular interactions using newly available datasets and tools, including glycan biosynthesis models, omics datasets, and systems-level analyses. Thus, emerging tools in big data analytics and systems biology are facilitating novel insights on glycans and their relationship with multicellular behavior.
Collapse
Affiliation(s)
- Benjamin P Kellman
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA; Department of Bioengineering, University of California San Diego School of Medicine, La Jolla, CA, USA; Bioinformatics and Systems Biology Program, University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Nathan E Lewis
- Department of Pediatrics, University of California San Diego School of Medicine, La Jolla, CA, USA; Department of Bioengineering, University of California San Diego School of Medicine, La Jolla, CA, USA; Bioinformatics and Systems Biology Program, University of California San Diego School of Medicine, La Jolla, CA, USA; Novo Nordisk Foundation Center for Biosustainability at the University of California San Diego School of Medicine, La Jolla, CA, USA.
| |
Collapse
|
10
|
McDonald AG, Davey GP. Simulating the enzymes of ganglioside biosynthesis with Glycologue. Beilstein J Org Chem 2021; 17:739-748. [PMID: 33828618 PMCID: PMC8008095 DOI: 10.3762/bjoc.17.64] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2020] [Accepted: 03/12/2021] [Indexed: 02/03/2023] Open
Abstract
Gangliosides are an important class of sialylated glycosphingolipids linked to ceramide that are a component of the mammalian cell surface, especially those of the central nervous system, where they function in intercellular recognition and communication. We describe an in silico method for determining the metabolic pathways leading to the most common gangliosides, based on the known enzymes of their biosynthesis. A network of 41 glycolipids is produced by the actions of the 10 enzymes included in the model. The different ganglioside nomenclature systems in common use are compared and a systematic variant of the widely used Svennerholm nomenclature is described. Knockouts of specific enzyme activities are used to simulate congenital defects in ganglioside biosynthesis, and altered ganglioside status in cancer, and the effects on network structure are predicted. The simulator is available at the Glycologue website, https://glycologue.org/.
Collapse
Affiliation(s)
- Andrew G McDonald
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
| | - Gavin P Davey
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin 2, Ireland
| |
Collapse
|