1
|
Alvarez MRS, Holmes XA, Oloumi A, Grijaldo-Alvarez SJ, Schindler R, Zhou Q, Yadlapati A, Silsirivanit A, Lebrilla CB. Integration of RNAseq transcriptomics and N-glycomics reveal biosynthetic pathways and predict structure-specific N-glycan expression. Chem Sci 2025; 16:7155-7172. [PMID: 40191131 PMCID: PMC11970275 DOI: 10.1039/d5sc00467e] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2025] [Accepted: 03/20/2025] [Indexed: 04/09/2025] Open
Abstract
The processes involved in protein N-glycosylation represent new therapeutic targets for diseases but their stepwise and overlapping biosynthetic processes make it challenging to identify the specific glycogenes involved. In this work, we aimed to elucidate the interactions between glycogene expression and N-glycan abundance by constructing supervised machine-learning models for each N-glycan composition. Regression models were trained to predict N-glycan abundance (response variable) from glycogene expression (predictors) using paired LC-MS/MS N-glycomic and 3'-TagSeq transcriptomic datasets from cells derived from multiple tissue origins and treatment conditions. The datasets include cells from several tissue origins - B cell, brain, colon, lung, muscle, prostate - encompassing nearly 400 N-glycan compounds and over 160 glycogenes filtered from an 18 000-gene transcriptome. Accurate models (validation R 2 > 0.8) predicted N-glycan abundance across cell types, including GLC01 (lung cancer), CCD19-Lu (lung fibroblast), and Tib-190 (B cell). Model importance scores ranked glycogene contributions to N-glycan predictions, revealing significant glycogene associations with specific N-glycan types. The predictions were consistent across input cell quantities, unlike LC-MS/MS glycomics which showed inconsistent results. This suggests that the models can reliably predict N-glycosylation even in samples with low cell amounts and by extension, single-cell samples. These findings can provide insights into cellular N-glycosylation machinery, offering potential therapeutic strategies for diseases linked to aberrant glycosylation, such as cancer, and neurodegenerative and autoimmune disorders.
Collapse
Affiliation(s)
| | - Xavier A Holmes
- Department of Chemistry, University of California, Davis Davis California USA
| | - Armin Oloumi
- Department of Chemistry, University of California, Davis Davis California USA
| | | | - Ryan Schindler
- Department of Chemistry, University of California, Davis Davis California USA
| | - Qingwen Zhou
- Department of Chemistry, University of California, Davis Davis California USA
| | - Anirudh Yadlapati
- Department of Chemistry, University of California, Davis Davis California USA
| | - Atit Silsirivanit
- Department of Biochemistry, Faculty of Medicine, Khon Kaen University Khon Kaen Thailand
| | - Carlito B Lebrilla
- Department of Chemistry, University of California, Davis Davis California USA
- Department of Chemistry, Biochemistry, Molecular, Cellular and Developmental Biology Graduate Group, University of California, Davis Davis California USA
| |
Collapse
|
2
|
Dworkin LA, Clausen H, Joshi HJ. Applying transcriptomics to studyglycosylation at the cell type level. iScience 2022; 25:104419. [PMID: 35663018 PMCID: PMC9156939 DOI: 10.1016/j.isci.2022.104419] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/28/2021] [Revised: 03/30/2022] [Accepted: 05/12/2022] [Indexed: 11/22/2022] Open
Abstract
The complex multi-step process of glycosylation occurs in a single cell, yet current analytics generally cannot measure the output (the glycome) of a single cell. Here, we addressed this discordance by investigating how single cell RNA-seq data can be used to characterize the state of the glycosylation machinery and metabolic network in a single cell. The metabolic network involves 214 glycosylation and modification enzymes outlined in our previously built atlas of cellular glycosylation pathways. We studied differential mRNA regulation of enzymes at the organ and single cell level, finding that most of the general protein and lipid oligosaccharide scaffolds are produced by enzymes exhibiting limited transcriptional regulation among cells. We predict key enzymes within different glycosylation pathways to be highly transcriptionally regulated as regulatable hotspots of the cellular glycome. We designed the Glycopacity software that enables investigators to extract and interpret glycosylation information from transcriptome data and define hotspots of regulation. RNA-seq can provide information on the glycosylation metabolic network state It is possible to readout glycosylation capacity from single cell RNA-seq data Genes regulating the biosynthesis of common glycan scaffolds show little regulation Key enzymes in the glycosylation network are predicted to be regulatable hotspots
Collapse
Affiliation(s)
- Leo Alexander Dworkin
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
| | - Henrik Clausen
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
| | - Hiren Jitendra Joshi
- Copenhagen Center for Glycomics, Department of Cellular and Molecular Medicine, Faculty of Health Sciences, University of Copenhagen, Blegdamsvej 3, 2200 Copenhagen N, Denmark
- Corresponding author
| |
Collapse
|
3
|
Frey LJ. Informatics Ecosystems to Advance the Biology of Glycans. Methods Mol Biol 2022; 2303:655-673. [PMID: 34626414 DOI: 10.1007/978-1-0716-1398-6_50] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/13/2023]
Abstract
Glycomics researchers have identified the need for integrated database systems for collecting glycomics information in a consistent format. The goal is to create a resource for knowledge discovery and dissemination to wider research communities. This has the potential and has exhibited initial success, to extend the research community to include biologists, clinicians, chemists, and computer scientists. This chapter discusses the technology and approach needed to create integrated data resources and informatics ecosystems to empower the broader community to leverage extant glycomics data. The focus is on glycosaminoglycan (GAGs) and proteoglycan research, but the approach can be generalized. The methods described span the development of glycomics standards from CarbBank to Glyco Connection Tables. Integrated data sets provide a foundation for novel methods of analysis such as machine learning and deep learning for knowledge discovery. The implications of predictive analysis are examined in relation to disease biomarker to expand the target audience of GAG and proteoglycan research.
Collapse
Affiliation(s)
- Lewis J Frey
- Department of Public Health Sciences, College of Medicine, Medical University of South Carolina, Charleston, SC, USA.
| |
Collapse
|
4
|
Abstract
Glycobiology is a glycan-based field of study that focuses on the structure, function, and biology of carbohydrates, and glycomics is a sub-study of the field of glycobiology that aims to define structure/function of glycans in living organisms. With the popularity of the glycobiology and glycomics, application of computational modeling expanded in the scientific area of glycobiology over the last decades. The recent availability of progressive Wet-Lab methods in the field of glycobiology and glycomics is promising for the impact of systems biology on the research area of the glycome, an emerging field that is termed “systems glycobiology.” This chapter will summarize the up-to-date leading edge in the use of bioinformatics tools in the field of glycobiology. The chapter provides basic knowledge both for glycobiologists interested in the application of bioinformatics tools and scientists of computational biology interested in studying the glycome.
Collapse
|
5
|
Affiliation(s)
- Hayden Wilkinson
- NIBRT GlycoScience Group, National Institute for Bioprocessing, Research and Training, Blackrock, Dublin, Ireland
- CÚRAM, SFI Research Centre for Medical Devices, National University of Ireland, Galway, Ireland
- UCD School of Medicine, College of Health and Agricultural Science, University College Dublin, Dublin, Ireland
| | - Radka Saldova
- NIBRT GlycoScience Group, National Institute for Bioprocessing, Research and Training, Blackrock, Dublin, Ireland
- CÚRAM, SFI Research Centre for Medical Devices, National University of Ireland, Galway, Ireland
- UCD School of Medicine, College of Health and Agricultural Science, University College Dublin, Dublin, Ireland
| |
Collapse
|
6
|
Ahmad Izaham AR, Scott NE. Open Database Searching Enables the Identification and Comparison of Bacterial Glycoproteomes without Defining Glycan Compositions Prior to Searching. Mol Cell Proteomics 2020. [PMID: 32576591 DOI: 10.1101/2020.04.21.052845] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/15/2023] Open
Abstract
Mass spectrometry has become an indispensable tool for the characterization of glycosylation across biological systems. Our ability to generate rich fragmentation of glycopeptides has dramatically improved over the last decade yet our informatic approaches still lag behind. Although glycoproteomic informatics approaches using glycan databases have attracted considerable attention, database independent approaches have not. This has significantly limited high throughput studies of unusual or atypical glycosylation events such as those observed in bacteria. As such, computational approaches to examine bacterial glycosylation and identify chemically diverse glycans are desperately needed. Here we describe the use of wide-tolerance (up to 2000 Da) open searching as a means to rapidly examine bacterial glycoproteomes. We benchmarked this approach using N-linked glycopeptides of Campylobacter fetus subsp. fetus as well as O-linked glycopeptides of Acinetobacter baumannii and Burkholderia cenocepacia revealing glycopeptides modified with a range of glycans can be readily identified without defining the glycan masses before database searching. Using this approach, we demonstrate how wide tolerance searching can be used to compare glycan use across bacterial species by examining the glycoproteomes of eight Burkholderia species (B. pseudomallei; B. multivorans; B. dolosa; B. humptydooensis; B. ubonensis, B. anthina; B. diffusa; B. pseudomultivorans). Finally, we demonstrate how open searching enables the identification of low frequency glycoforms based on shared modified peptides sequences. Combined, these results show that open searching is a robust computational approach for the determination of glycan diversity within bacterial proteomes.
Collapse
Affiliation(s)
- Ameera Raudah Ahmad Izaham
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Nichollas E Scott
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia.
| |
Collapse
|
7
|
Ahmad Izaham AR, Scott NE. Open Database Searching Enables the Identification and Comparison of Bacterial Glycoproteomes without Defining Glycan Compositions Prior to Searching. Mol Cell Proteomics 2020; 19:1561-1574. [PMID: 32576591 PMCID: PMC8143609 DOI: 10.1074/mcp.tir120.002100] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Revised: 05/27/2020] [Indexed: 12/23/2022] Open
Abstract
Mass spectrometry has become an indispensable tool for the characterization of glycosylation across biological systems. Our ability to generate rich fragmentation of glycopeptides has dramatically improved over the last decade yet our informatic approaches still lag behind. Although glycoproteomic informatics approaches using glycan databases have attracted considerable attention, database independent approaches have not. This has significantly limited high throughput studies of unusual or atypical glycosylation events such as those observed in bacteria. As such, computational approaches to examine bacterial glycosylation and identify chemically diverse glycans are desperately needed. Here we describe the use of wide-tolerance (up to 2000 Da) open searching as a means to rapidly examine bacterial glycoproteomes. We benchmarked this approach using N-linked glycopeptides of Campylobacter fetus subsp. fetus as well as O-linked glycopeptides of Acinetobacter baumannii and Burkholderia cenocepacia revealing glycopeptides modified with a range of glycans can be readily identified without defining the glycan masses before database searching. Using this approach, we demonstrate how wide tolerance searching can be used to compare glycan use across bacterial species by examining the glycoproteomes of eight Burkholderia species (B. pseudomallei; B. multivorans; B. dolosa; B. humptydooensis; B. ubonensis, B. anthina; B. diffusa; B. pseudomultivorans). Finally, we demonstrate how open searching enables the identification of low frequency glycoforms based on shared modified peptides sequences. Combined, these results show that open searching is a robust computational approach for the determination of glycan diversity within bacterial proteomes.
Collapse
Affiliation(s)
- Ameera Raudah Ahmad Izaham
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Nichollas E Scott
- Department of Microbiology and Immunology, University of Melbourne at the Peter Doherty Institute for Infection and Immunity, Melbourne, Australia.
| |
Collapse
|
8
|
A Markov model of glycosylation elucidates isozyme specificity and glycosyltransferase interactions for glycoengineering. CURRENT RESEARCH IN BIOTECHNOLOGY 2020; 2:22-36. [PMID: 32285041 DOI: 10.1016/j.crbiot.2020.01.001] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Glycosylated biopharmaceuticals are important in the global pharmaceutical market. Despite the importance of their glycan structures, our limited knowledge of the glycosylation machinery still hinders controllability of this critical quality attribute. To facilitate discovery of glycosyltransferase specificity and predict glycoengineering efforts, here we extend the approach to model N-linked protein glycosylation as a Markov process. Our model leverages putative glycosyltransferase (GT) specificity to define the biosynthetic pathways for all measured glycans, and the Markov chain modelling is used to learn glycosyltransferase isoform activities and predict glycosylation following glycosyltransferase knock-in/knockout. We apply our methodology to four different glycoengineered therapeutics (i.e., Rituximab, erythropoietin, Enbrel, and alpha-1 antitrypsin) produced in CHO cells. Our model accurately predicted N-linked glycosylation following glycoengineering and further quantified the impact of glycosyltransferase mutations on reactions catalyzed by other glycosyltransferases. By applying these learned GT-GT interaction rules identified from single glycosyltransferase mutants, our model further predicts the outcome of multi-gene glycosyltransferase mutations on the diverse biotherapeutics. Thus, this modeling approach enables rational glycoengineering and the elucidation of relationships between glycosyltransferases, thereby facilitating biopharmaceutical research and aiding the broader study of glycosylation to elucidate the genetic basis of complex changes in glycosylation.
Collapse
|
9
|
Akiyoshi S, Iwata M, Berenger F, Yamanishi Y. Omics-based Identification of Glycan Structures as Biomarkers for a Variety of Diseases. Mol Inform 2019; 39:e1900112. [PMID: 31622036 DOI: 10.1002/minf.201900112] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Accepted: 09/24/2019] [Indexed: 12/11/2022]
Abstract
Glycans play important roles in cell communication, protein interaction, and immunity, and structural changes in glycans are associated with the regulation of a range of biological pathways involved in disease. However, our understanding of the detailed relationships between specific diseases and glycans is very limited. In this study, we proposed an omics-based method to investigate the correlations between glycans and a wide range of human diseases. We analyzed the gene expression patterns of glycogenes (glycosyltransferases and glycosidases) for 79 different diseases. A biological pathway-based glycogene signature was constructed to identify the alteration in glycan biosynthesis and the associated glycan structures for each disease state. The degradation of N-glycan and keratan sulfate, for example, may promote the growth or metastasis of multiple types of cancer, including endometrial, gastric, and nasopharyngeal. Our results also revealed that commonalities between diseases can be interpreted using glycogene expression patterns, as well as the associated glycan structure patterns at the level of the affected pathway. The proposed method is expected to be useful for understanding the relationships between glycans, glycogenes, and disease and identifying disease-specific glycan biomarkers.
Collapse
Affiliation(s)
- Sayaka Akiyoshi
- Medical Institute of Bioregulation, Kyushu University, 3-1-1 Maidashi, Higashi-ku, Fukuoka, Fukuoka, 812-8582, Japan
| | - Michio Iwata
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Francois Berenger
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| | - Yoshihiro Yamanishi
- Department of Bioscience and Bioinformatics, Faculty of Computer Science and Systems Engineering, Kyushu Institute of Technology, 680-4 Kawazu, Iizuka, Fukuoka, 820-8502, Japan
| |
Collapse
|
10
|
Le T, O’Brien C, Gupta U, Sousa G, Daoutidis P, Hu W. An integrated platform for mucin‐type
O
‐glycosylation network generation and visualization. Biotechnol Bioeng 2019; 116:1341-1354. [DOI: 10.1002/bit.26952] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2018] [Revised: 12/19/2018] [Accepted: 02/08/2019] [Indexed: 01/28/2023]
Affiliation(s)
- Tung Le
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| | - Conor O’Brien
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| | - Udit Gupta
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| | - Guilherme Sousa
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| | - Prodromos Daoutidis
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| | - Wei‐Shou Hu
- Department of Chemical Engineering and Materials ScienceUniversity of MinnesotaMinneapolis Minnesota
| |
Collapse
|
11
|
Drake RR, West CA, Mehta AS, Angel PM. MALDI Mass Spectrometry Imaging of N-Linked Glycans in Tissues. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2018; 1104:59-76. [PMID: 30484244 DOI: 10.1007/978-981-13-2158-0_4] [Citation(s) in RCA: 43] [Impact Index Per Article: 6.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/19/2022]
Abstract
Matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) has been used for two decades to profile the glycan constituents of biological samples. An adaptation of the method to tissues, MALDI mass spectrometry imaging (MALDI-MSI), allows high-throughput spatial profiling of hundreds to thousands of molecules within a single thin tissue section. The ability to profile N-glycans within tissues using MALDI-MSI is a recently developed method that allows identification and localization of 40 or more N-glycans. The key component is to apply a molecular coating of peptide-N-glycosidase to tissues, an enzyme that releases N-glycans from their protein carrier. In this chapter, the methods and approaches to robustly and reproducibly generate two-dimensional N-glycan tissue maps by MALDI-MSI workflows are summarized. Current strengths and limitations of the approach are discussed, as well as potential future applications of the method.
Collapse
Affiliation(s)
- Richard R Drake
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, SC, USA.
| | - Connor A West
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, SC, USA
| | - Anand S Mehta
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, SC, USA
| | - Peggi M Angel
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics, Medical University of South Carolina, Charleston, SC, USA
| |
Collapse
|
12
|
Aghamohseni H, Spearman M, Ohadi K, Braasch K, Moo-Young M, Butler M, Budman HM. A semi-empirical glycosylation model of a camelid monoclonal antibody under hypothermia cell culture conditions. J Ind Microbiol Biotechnol 2017; 44:1005-1020. [PMID: 28285402 DOI: 10.1007/s10295-017-1926-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2016] [Accepted: 02/22/2017] [Indexed: 01/29/2023]
Abstract
The impact of cell culture environment on the glycan distribution of a monoclonal antibody (mAb) has been investigated through a combination of experiments and modeling. A newly developed CHO DUXB cell line was cultivated at two levels of initial Glutamine (Gln) concentrations (0, 4 mM) and incubation temperatures of (33 and 37 °C) in batch operation mode. Hypothermia was applied either through the entire culture duration or only during the post-exponential phase. Beyond reducing cell growth and increasing productivity, hypothermia significantly altered the galactosylation index profiles as compared to control conditions. A novel semi-empirical dynamic model was proposed for elucidating the connections between the extracellular cell culture conditions to galactosylation index. The developed model is based on a simplified balance of nucleotides sugars and on the correlation between sugars' levels to the galactosylation index (GI). The model predictions were found to be in a good agreement with the experimental data. The proposed empirical model is expected to be useful for controlling the glycoprofiles by manipulating culture conditions.
Collapse
Affiliation(s)
- Hengameh Aghamohseni
- Faculty of Engineering, Chemical Engineering Department, University of Waterloo, E6 Building, Room 3012, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada
| | - Maureen Spearman
- Microbiology Department, University of Manitoba, Winnipeg, MB, Canada
| | - Kaveh Ohadi
- Faculty of Engineering, Chemical Engineering Department, University of Waterloo, E6 Building, Room 3012, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada
| | - Katrin Braasch
- Microbiology Department, University of Manitoba, Winnipeg, MB, Canada
| | - Murray Moo-Young
- Faculty of Engineering, Chemical Engineering Department, University of Waterloo, E6 Building, Room 3012, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada
| | - Michael Butler
- Microbiology Department, University of Manitoba, Winnipeg, MB, Canada
| | - Hector M Budman
- Faculty of Engineering, Chemical Engineering Department, University of Waterloo, E6 Building, Room 3012, 200 University Avenue West, Waterloo, ON, N2L 3G1, Canada.
| |
Collapse
|
13
|
Galleguillos SN, Ruckerbauer D, Gerstl MP, Borth N, Hanscho M, Zanghellini J. What can mathematical modelling say about CHO metabolism and protein glycosylation? Comput Struct Biotechnol J 2017; 15:212-221. [PMID: 28228925 PMCID: PMC5310201 DOI: 10.1016/j.csbj.2017.01.005] [Citation(s) in RCA: 34] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2016] [Revised: 01/09/2017] [Accepted: 01/12/2017] [Indexed: 11/15/2022] Open
Abstract
Chinese hamster ovary cells have been in the spotlight for process optimization in recent years, due to being the major, long established cell factory for the production of recombinant proteins. A deep, quantitative understanding of CHO metabolism and mechanisms involved in protein glycosylation has proven to be attainable through the development of high throughput technologies. Here we review the most notable accomplishments in the field of modelling CHO metabolism and protein glycosylation.
Collapse
Affiliation(s)
- Sarah N Galleguillos
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - David Ruckerbauer
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - Matthias P Gerstl
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - Nicole Borth
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - Michael Hanscho
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| | - Jürgen Zanghellini
- Department of Biotechnology, University of Natural Resources and Life Sciences, Vienna, Austria; Austrian Centre of Industrial Biotechnology, Vienna, Austria
| |
Collapse
|
14
|
Drake RR, Powers TW, Jones EE, Bruner E, Mehta AS, Angel PM. MALDI Mass Spectrometry Imaging of N-Linked Glycans in Cancer Tissues. Adv Cancer Res 2016; 134:85-116. [PMID: 28110657 DOI: 10.1016/bs.acr.2016.11.009] [Citation(s) in RCA: 107] [Impact Index Per Article: 11.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/15/2023]
Abstract
Glycosylated proteins account for a majority of the posttranslation modifications of cell surface, secreted, and circulating proteins. Within the tumor microenvironment, the presence of immune cells, extracellular matrix proteins, cell surface receptors, and interactions between stroma and tumor cells are all processes mediated by glycan binding and recognition reactions. Changes in glycosylation during tumorigenesis are well documented to occur and affect all of these associated adhesion and regulatory functions. A MALDI imaging mass spectrometry (MALDI-IMS) workflow for profiling N-linked glycan distributions in fresh/frozen tissues and formalin-fixed paraffin-embedded tissues has recently been developed. The key to the approach is the application of a molecular coating of peptide-N-glycosidase to tissues, an enzyme that cleaves asparagine-linked glycans from their protein carrier. The released N-linked glycans can then be analyzed by MALDI-IMS directly on tissue. Generally 40 or more individual glycan structures are routinely detected, and when combined with histopathology localizations, tumor-specific glycans are readily grouped relative to nontumor regions and other structural features. This technique is a recent development and new approach in glycobiology and mass spectrometry imaging research methodology; thus, potential uses such as tumor-specific glycan biomarker panels and other applications are discussed.
Collapse
Affiliation(s)
- R R Drake
- Medical University of South Carolina, Charleston, SC, United States.
| | - T W Powers
- Medical University of South Carolina, Charleston, SC, United States
| | - E E Jones
- Medical University of South Carolina, Charleston, SC, United States
| | - E Bruner
- Medical University of South Carolina, Charleston, SC, United States
| | - A S Mehta
- Medical University of South Carolina, Charleston, SC, United States
| | - P M Angel
- Medical University of South Carolina, Charleston, SC, United States
| |
Collapse
|
15
|
Bennun SV, Hizal DB, Heffner K, Can O, Zhang H, Betenbaugh MJ. Systems Glycobiology: Integrating Glycogenomics, Glycoproteomics, Glycomics, and Other ‘Omics Data Sets to Characterize Cellular Glycosylation Processes. J Mol Biol 2016; 428:3337-3352. [PMID: 27423401 DOI: 10.1016/j.jmb.2016.07.005] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/06/2016] [Revised: 07/05/2016] [Accepted: 07/07/2016] [Indexed: 12/17/2022]
|
16
|
Hou W, Qiu Y, Hashimoto N, Ching WK, Aoki-Kinoshita KF. A systematic framework to derive N-glycan biosynthesis process and the automated construction of glycosylation networks. BMC Bioinformatics 2016; 17 Suppl 7:240. [PMID: 27454116 PMCID: PMC4965717 DOI: 10.1186/s12859-016-1094-6] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Background Abnormalities in glycan biosynthesis have been conclusively related to various diseases, whereas the complexity of the glycosylation process has impeded the quantitative analysis of biochemical experimental data for the identification of glycoforms contributing to disease. To overcome this limitation, the automatic construction of glycosylation reaction networks in silico is a critical step. Results In this paper, a framework K2014 is developed to automatically construct N-glycosylation networks in MATLAB with the involvement of the 27 most-known enzyme reaction rules of 22 enzymes, as an extension of previous model KB2005. A toolbox named Glycosylation Network Analysis Toolbox (GNAT) is applied to define network properties systematically, including linkages, stereochemical specificity and reaction conditions of enzymes. Our network shows a strong ability to predict a wider range of glycans produced by the enzymes encountered in the Golgi Apparatus in human cell expression systems. Conclusions Our results demonstrate a better understanding of the underlying glycosylation process and the potential of systems glycobiology tools for analyzing conventional biochemical or mass spectrometry-based experimental data quantitatively in a more realistic and practical way.
Collapse
Affiliation(s)
- Wenpin Hou
- Department of Mathematics, The University of Hong Kong, Hong Kong, 999077, China.
| | - Yushan Qiu
- Hematology Oncology Division, Northwestern University, Evanston, IL 60208, USA
| | - Nobuyuki Hashimoto
- Faculty of Science and Engineering, Soka University, Tokyo, 192-8577, Japan
| | - Wai-Ki Ching
- Department of Mathematics, The University of Hong Kong, Hong Kong, 999077, China
| | | |
Collapse
|
17
|
McDonald AG, Tipton KF, Davey GP. A Knowledge-Based System for Display and Prediction of O-Glycosylation Network Behaviour in Response to Enzyme Knockouts. PLoS Comput Biol 2016; 12:e1004844. [PMID: 27054587 PMCID: PMC4824424 DOI: 10.1371/journal.pcbi.1004844] [Citation(s) in RCA: 30] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2015] [Accepted: 03/02/2016] [Indexed: 12/22/2022] Open
Abstract
O-linked glycosylation is an important post-translational modification of mucin-type protein, changes to which are important biomarkers of cancer. For this study of the enzymes of O-glycosylation, we developed a shorthand notation for representing GalNAc-linked oligosaccharides, a method for their graphical interpretation, and a pattern-matching algorithm that generates networks of enzyme-catalysed reactions. Software for generating glycans from the enzyme activities is presented, and is also available online. The degree distributions of the resulting enzyme-reaction networks were found to be Poisson in nature. Simple graph-theoretic measures were used to characterise the resulting reaction networks. From a study of in-silico single-enzyme knockouts of each of 25 enzymes known to be involved in mucin O-glycan biosynthesis, six of them, β-1,4-galactosyltransferase (β4Gal-T4), four glycosyltransferases and one sulfotransferase, play the dominant role in determining O-glycan heterogeneity. In the absence of β4Gal-T4, all Lewis X, sialyl-Lewis X, Lewis Y and Sda/Cad glycoforms were eliminated, in contrast to knockouts of the N-acetylglucosaminyltransferases, which did not affect the relative abundances of O-glycans expressing these epitopes. A set of 244 experimentally determined mucin-type O-glycans obtained from the literature was used to validate the method, which was able to predict up to 98% of the most common structures obtained from human and engineered CHO cell glycoforms.
Collapse
Affiliation(s)
- Andrew G. McDonald
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin, Ireland
| | - Keith F. Tipton
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin, Ireland
| | - Gavin P. Davey
- School of Biochemistry and Immunology, Trinity College Dublin, Dublin, Ireland
| |
Collapse
|
18
|
Spahn PN, Hansen AH, Hansen HG, Arnsdorf J, Kildegaard HF, Lewis NE. A Markov chain model for N-linked protein glycosylation--towards a low-parameter tool for model-driven glycoengineering. Metab Eng 2016; 33:52-66. [PMID: 26537759 PMCID: PMC5031499 DOI: 10.1016/j.ymben.2015.10.007] [Citation(s) in RCA: 67] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2015] [Revised: 09/09/2015] [Accepted: 10/20/2015] [Indexed: 11/19/2022]
Abstract
Glycosylation is a critical quality attribute of most recombinant biotherapeutics. Consequently, drug development requires careful control of glycoforms to meet bioactivity and biosafety requirements. However, glycoengineering can be extraordinarily difficult given the complex reaction networks underlying glycosylation and the vast number of different glycans that can be synthesized in a host cell. Computational modeling offers an intriguing option to rationally guide glycoengineering, but the high parametric demands of current modeling approaches pose challenges to their application. Here we present a novel low-parameter approach to describe glycosylation using flux-balance and Markov chain modeling. The model recapitulates the biological complexity of glycosylation, but does not require user-provided kinetic information. We use this method to predict and experimentally validate glycoprofiles on EPO, IgG as well as the endogenous secretome following glycosyltransferase knock-out in different Chinese hamster ovary (CHO) cell lines. Our approach offers a flexible and user-friendly platform that can serve as a basis for powerful computational engineering efforts in mammalian cell factories for biopharmaceutical production.
Collapse
Affiliation(s)
- Philipp N Spahn
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, United States; The Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego, La Jolla, CA 92093, United States
| | - Anders H Hansen
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, Denmark
| | - Henning G Hansen
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, Denmark
| | - Johnny Arnsdorf
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, Denmark
| | - Helene F Kildegaard
- The Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Hørsholm, Denmark
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, School of Medicine, La Jolla, CA 92093, United States; The Novo Nordisk Foundation Center for Biosustainability at the University of California, San Diego, La Jolla, CA 92093, United States.
| |
Collapse
|
19
|
Powers TW, Holst S, Wuhrer M, Mehta AS, Drake RR. Two-Dimensional N-Glycan Distribution Mapping of Hepatocellular Carcinoma Tissues by MALDI-Imaging Mass Spectrometry. Biomolecules 2015; 5:2554-72. [PMID: 26501333 PMCID: PMC4693247 DOI: 10.3390/biom5042554] [Citation(s) in RCA: 80] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/16/2015] [Revised: 09/18/2015] [Accepted: 09/28/2015] [Indexed: 01/28/2023] Open
Abstract
A new mass spectrometry imaging approach to simultaneously map the two-dimensional distribution of N-glycans in tissues has been recently developed. The method uses Matrix Assisted Laser Desorption Ionization Imaging Mass Spectrometry (MALDI-IMS) to spatially profile the location and distribution of multiple N-linked glycan species released by peptide N-glycosidase F in frozen or formalin-fixed tissues. Multiple formalin-fixed human hepatocellular carcinoma tissues were evaluated with this method, resulting in a panel of over 30 N-glycans detected. An ethylation reaction of extracted N-glycans released from adjacent slides was done to stabilize sialic acid containing glycans, and these structures were compared to N-glycans detected directly from tissue profiling. In addition, the distribution of singly fucosylated N-glycans detected in tumor tissue microarray cores were compared to the histochemistry staining pattern of a core fucose binding lectin. As this MALDI-IMS workflow has the potential to be applied to any formalin-fixed tissue block or tissue microarray, the advantages and limitations of the technique in context with other glycomic methods are also summarized.
Collapse
Affiliation(s)
- Thomas W Powers
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics and MUSC Proteomics Center, Medical University of South Carolina, 173 Ashley Avenue, Charleston, SC 29425, USA.
| | - Stephanie Holst
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden 2333ZA, The Netherlands.
| | - Manfred Wuhrer
- Center for Proteomics and Metabolomics, Leiden University Medical Center, Leiden 2333ZA, The Netherlands.
- Division of BioAnalytical Chemistry, VU University, Amsterdam 1081HV, The Netherlands.
- Department of Molecular Cell Biology and Immunology, VU University Medical Center, Amsterdam 1007MB, The Netherlands.
| | - Anand S Mehta
- Department of Microbiology and Immunology, College of Medicine, Drexel University, 2900 W. Queen Lane, Philadelphia, PA 19129, USA.
| | - Richard R Drake
- Department of Cell and Molecular Pharmacology and Experimental Therapeutics and MUSC Proteomics Center, Medical University of South Carolina, 173 Ashley Avenue, Charleston, SC 29425, USA.
| |
Collapse
|
20
|
Abstract
This chapter describes the KEGG GLYCAN database of the KEGG resource, including descriptions of links to the other databases in KEGG. In particular, KEGG GLYCAN consists of glycan structures, with links to glycogenes, orthologs, reactions, pathways, drugs, diseases, and others, all within the KEGG resources. A number of analytical tools are also available, including the composite structure map (CSM), KegDraw, KCam, and GECS. These databases and tools will be described along with simple examples of their usage.
Collapse
|
21
|
Abstract
Glycomics researchers have identified the need for integrated database systems for collecting glycomics information in a consistent format. The goal is to create a resource for knowledge discovery and dissemination to wider research communities. This has the potential to extend the research community to include biologists, clinicians, chemists, and computer scientists. This chapter discusses the technology and approach needed to create integrated data resources to empower the broader community to leverage extant glycomics data. The focus is on glycosaminoglycan (GAGs) and proteoglycan research, but the approach can be generalized. The methods described span the development of glycomics standards from CarbBank to Glyco Connection Tables. The existence of integrated data sets provides a foundation for novel methods of analysis such as machine learning for knowledge discovery. The implications of predictive analysis are examined in relation to disease biomarker to expand the target audience of GAG and proteoglycan research.
Collapse
|
22
|
Spahn PN, Lewis NE. Systems glycobiology for glycoengineering. Curr Opin Biotechnol 2014; 30:218-24. [PMID: 25202878 DOI: 10.1016/j.copbio.2014.08.004] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2014] [Revised: 08/14/2014] [Accepted: 08/15/2014] [Indexed: 12/21/2022]
Abstract
Glycosylation serves essential functions on many proteins produced in biopharmaceutical manufacturing, making it mandatory to thoroughly consider its biogenesis during the production process. Glycoengineering efforts involve the rational design of glycosylation through adjustments in culturing conditions or genetic modifications. Computational models have been developed to aid this process, aiming to offer cheaper and faster alternatives to costly screening strategies. Recently, these models have been successfully utilized to predict glycosylation of products of industrial relevance. Furthermore, systems-level analyses of glycan diversity are elucidating deeper insights into the mechanisms underlying glycosylation. As computational models of glycosylation continue to be expanded, refined, and leveraged for detailed analysis of glycomics data, they will become invaluable resources for cell line development and glycoengineering.
Collapse
Affiliation(s)
- Philipp N Spahn
- Department of Bioengineering, University of California, San Diego, La Jolla, CA 92093, United States
| | - Nathan E Lewis
- Department of Pediatrics, University of California, San Diego, La Jolla, CA 92093, United States.
| |
Collapse
|
23
|
Mickum ML, Prasanphanich NS, Heimburg-Molinaro J, Leon KE, Cummings RD. Deciphering the glycogenome of schistosomes. Front Genet 2014; 5:262. [PMID: 25147556 PMCID: PMC4122909 DOI: 10.3389/fgene.2014.00262] [Citation(s) in RCA: 29] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2014] [Accepted: 07/15/2014] [Indexed: 11/16/2022] Open
Abstract
Schistosoma mansoni and other Schistosoma sp. are multicellular parasitic helminths (worms) that infect humans and mammals worldwide. Infection by these parasites, which results in developmental maturation and sexual differentiation of the worms over a period of 5–6 weeks, induces antibodies to glycan antigens expressed in surface and secreted glycoproteins and glycolipids. There is growing interest in defining these unusual parasite-synthesized glycan antigens and using them to understand immune responses, their roles in immunomodulation, and in using glycan antigens as potential vaccine targets. A key problem in this area, however, has been the lack of information about the enzymes involved in elaborating the complex repertoire of glycans represented by the schistosome glycome. Recent availability of the nuclear genome sequences for Schistosoma sp. has created the opportunity to define the glycogenome, which represents the specific genes and cognate enzymes that generate the glycome. Here we describe the current state of information in regard to the schistosome glycogenome and glycome and highlight the important classes of glycans and glycogenes that may be important in their generation.
Collapse
Affiliation(s)
- Megan L Mickum
- Department of Biochemistry, Emory University School of Medicine Atlanta, GA, USA
| | - Nina S Prasanphanich
- Department of Biochemistry, Emory University School of Medicine Atlanta, GA, USA
| | | | - Kristoffer E Leon
- Department of Biochemistry, Emory University School of Medicine Atlanta, GA, USA
| | - Richard D Cummings
- Department of Biochemistry, Emory University School of Medicine Atlanta, GA, USA
| |
Collapse
|
24
|
Kotera M, Goto S, Kanehisa M. Predictive genomic and metabolomic analysis for the standardization of enzyme data. ACTA ACUST UNITED AC 2014. [DOI: 10.1016/j.pisc.2014.02.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
25
|
Jang KS, Kim YG, Adhya M, Park HM, Kim BG. The sweets standing at the borderline between allo- and xenotransplantation. Xenotransplantation 2013; 20:199-208. [PMID: 23551837 DOI: 10.1111/xen.12030] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2012] [Accepted: 02/28/2013] [Indexed: 01/06/2023]
Abstract
Animal cells are densely covered with glycoconjugates, such as N-glycan, O-glycan, and glycosphingolipids, which are important for various biological and immunological events at the cell surface and in the extracellular matrix. Endothelial α-Gal carbohydrate epitopes (Galα3Gal-R) expressed on porcine tissue or cell surfaces are such glycoconjugates and directly mediate hyperacute immunological rejection in pig-to-human xenotransplantation. Although researchers have been able to develop α1,3-galactosyltransferase (GalT) gene knockout (KO) pigs, there remain unclarified non-Gal antigens that prevent xenotransplantation. Based on our expertise in the structural analysis of xenoantigenic carbohydrates, we describe the immunologically significant non-human carbohydrate antigens, including α-Gal antigens, analyzed as part of efforts to assess the antigens responsible for hyperacute immunological rejection in pig-to-human xenotransplantation. The importance of studying human, pig, and GalT-KO pig glycoprofiles, and of developing adequate pig-to-human glycan databases, is also discussed.
Collapse
Affiliation(s)
- Kyoung-Soon Jang
- Institute of Molecular Biology and Genetics, Interdisciplinary Program for Biochemical Engineering and Biotechnology, Seoul National University, Seoul, Korea
| | | | | | | | | |
Collapse
|
26
|
Hattori M, Kotera M. Chemoinformatics on Metabolic Pathways. Bioinformatics 2013. [DOI: 10.4018/978-1-4666-3604-0.ch053] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022] Open
Abstract
Chemical genomics is one of the cutting-edge research areas in the post-genomic era, which requires a sophisticated integration of heterogeneous information, i.e., genomic and chemical information. Enzymes play key roles for dynamic behavior of living organisms, linking information in the chemical space and genomic space. In this chapter, the authors report our recent efforts in this area, including the development of a similarity measure between two chemical compounds, a prediction system of a plausible enzyme for a given substrate and product pair, and two different approaches to predict the fate of a given compound in a metabolic pathway. General problems and possible future directions are also discussed, in hope to attract more activities from many researchers in this research area.
Collapse
|
27
|
Aoki-Kinoshita KF. Using databases and web resources for glycomics research. Mol Cell Proteomics 2013; 12:1036-45. [PMID: 23325765 DOI: 10.1074/mcp.r112.026252] [Citation(s) in RCA: 37] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023] Open
Abstract
Many databases of carbohydrate structures and related information can be found on the World Wide Web. This review covers the major carbohydrate databases that have potential utility for glycoscientists and researchers entering the glycosciences. The first half provides a brief overview of carbohydrate databases and web resources (including a history of carbohydrate databases and carbohydrate notations used in these databases), and the second half provides a guide that can be used as an index to determine which resources provide the data of most interest to the user.
Collapse
Affiliation(s)
- Kiyoko F Aoki-Kinoshita
- Department of Bioinformatics, Faculty of Engineering, Soka University, 1-236 Tangi-machi, Hachioji, Tokyo, Japan.
| |
Collapse
|
28
|
Bennun SV, Yarema KJ, Betenbaugh MJ, Krambeck FJ. Integration of the transcriptome and glycome for identification of glycan cell signatures. PLoS Comput Biol 2013; 9:e1002813. [PMID: 23326219 PMCID: PMC3542073 DOI: 10.1371/journal.pcbi.1002813] [Citation(s) in RCA: 52] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2012] [Accepted: 10/19/2012] [Indexed: 11/24/2022] Open
Abstract
Abnormalities in glycan biosynthesis have been conclusively linked to many diseases but the complexity of glycosylation has hindered the analysis of glycan data in order to identify glycoforms contributing to disease. To overcome this limitation, we developed a quantitative N-glycosylation model that interprets and integrates mass spectral and transcriptomic data by incorporating key glycosylation enzyme activities. Using the cancer progression model of androgen-dependent to androgen-independent Lymph Node Carcinoma of the Prostate (LNCaP) cells, the N-glycosylation model identified and quantified glycan structural details not typically derived from single-stage mass spectral or gene expression data. Differences between the cell types uncovered include increases in H(II) and Le(y) epitopes, corresponding to greater activity of α2-Fuc-transferase (FUT1) in the androgen-independent cells. The model further elucidated limitations in the two analytical platforms including a defect in the microarray for detecting the GnTV (MGAT5) enzyme. Our results demonstrate the potential of systems glycobiology tools for elucidating key glycan biomarkers and potential therapeutic targets. The integration of multiple data sets represents an important application of systems biology for understanding complex cellular processes.
Collapse
Affiliation(s)
- Sandra V Bennun
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, Maryland, United States of America.
| | | | | | | |
Collapse
|
29
|
Lütteke T. The use of glycoinformatics in glycochemistry. Beilstein J Org Chem 2012; 8:915-29. [PMID: 23015842 PMCID: PMC3388882 DOI: 10.3762/bjoc.8.104] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2012] [Accepted: 05/29/2012] [Indexed: 01/10/2023] Open
Abstract
Glycoinformatics is a small but growing branch of bioinformatics and chemoinformatics. Various resources are now available that can be of use to glycobiologists, but also to chemists who work on the synthesis or analysis of carbohydrates. This article gives an overview of existing glyco-specific databases and tools, with a focus on their application to glycochemistry: Databases can provide information on candidate glycan structures for synthesis, or on glyco-enzymes that can be used to synthesize carbohydrates. Statistical analyses of glycan databases help to plan glycan synthesis experiments. 3D-Structural data of protein-carbohydrate complexes are used in targeted drug design, and tools to support glycan structure analysis aid with quality control. Specific problems of glycoinformatics compared to bioinformatics for genomics or proteomics, especially concerning integration and long-term maintenance of the existing glycan databases, are also discussed.
Collapse
Affiliation(s)
- Thomas Lütteke
- Justus-Liebig-University Gießen, Institute of Veterinary Physiology and Biochemistry, Frankfurter Str. 100, 35392 Gießen, Germany
| |
Collapse
|
30
|
Merrill AH. Sphingolipid and glycosphingolipid metabolic pathways in the era of sphingolipidomics. Chem Rev 2011; 111:6387-422. [PMID: 21942574 PMCID: PMC3191729 DOI: 10.1021/cr2002917] [Citation(s) in RCA: 589] [Impact Index Per Article: 42.1] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2011] [Indexed: 12/15/2022]
Affiliation(s)
- Alfred H Merrill
- School of Biology, and the Petit Institute for Bioengineering and Biosciences, Georgia Institute of Technology, Atlanta, Georgia 30332-0230, USA.
| |
Collapse
|
31
|
Frank M, Schloissnig S. Bioinformatics and molecular modeling in glycobiology. Cell Mol Life Sci 2010; 67:2749-72. [PMID: 20364395 PMCID: PMC2912727 DOI: 10.1007/s00018-010-0352-4] [Citation(s) in RCA: 53] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2009] [Revised: 03/08/2010] [Accepted: 03/11/2010] [Indexed: 12/11/2022]
Abstract
The field of glycobiology is concerned with the study of the structure, properties, and biological functions of the family of biomolecules called carbohydrates. Bioinformatics for glycobiology is a particularly challenging field, because carbohydrates exhibit a high structural diversity and their chains are often branched. Significant improvements in experimental analytical methods over recent years have led to a tremendous increase in the amount of carbohydrate structure data generated. Consequently, the availability of databases and tools to store, retrieve and analyze these data in an efficient way is of fundamental importance to progress in glycobiology. In this review, the various graphical representations and sequence formats of carbohydrates are introduced, and an overview of newly developed databases, the latest developments in sequence alignment and data mining, and tools to support experimental glycan analysis are presented. Finally, the field of structural glycoinformatics and molecular modeling of carbohydrates, glycoproteins, and protein-carbohydrate interaction are reviewed.
Collapse
Affiliation(s)
- Martin Frank
- Molecular Structure Analysis Core Facility-W160, Deutsches Krebsforschungszentrum (German Cancer Research Centre), 69120 Heidelberg, Germany.
| | | |
Collapse
|
32
|
Janik ME, Lityńska A, Vereecken P. Cell migration-the role of integrin glycosylation. Biochim Biophys Acta Gen Subj 2010; 1800:545-55. [PMID: 20332015 DOI: 10.1016/j.bbagen.2010.03.013] [Citation(s) in RCA: 114] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2010] [Revised: 03/11/2010] [Accepted: 03/17/2010] [Indexed: 12/25/2022]
Abstract
BACKGROUND Cell migration is an essential process in organ homeostasis, in inflammation, and also in metastasis, the main cause of death from cancer. The extracellular matrix (ECM) serves as the molecular scaffold for cell adhesion and migration; in the first phase of migration, adhesion of cells to the ECM is critical. Engagement of integrin receptors with ECM ligands gives rise to the formation of complex multiprotein structures which link the ECM to the cytoplasmic actin skeleton. Both ECM proteins and the adhesion receptors are glycoproteins, and it is well accepted that N-glycans modulate their conformation and activity, thereby affecting cell-ECM interactions. Likely targets for glycosylation are the integrins, whose ability to form functional dimers depends upon the presence of N-linked oligosaccharides. Cell migratory behavior may depend on the level of expression of adhesion proteins, and their N-glycosylation that affect receptor-ligand binding. SCOPE OF REVIEW The mechanism underlying the effect of integrin glycosylation on migration is still unknown, but results gained from integrins with artificial or mutated N-glycosylation sites provide evidence that integrin function can be regulated by changes in glycosylation. GENERAL SIGNIFICANCE A better understanding of the molecular mechanism of cell migration processes could lead to novel diagnostic and therapeutic approaches and applications. For this, the proteins and oligosaccharides involved in these events need to be characterized.
Collapse
Affiliation(s)
- Marcelina E Janik
- Department of Glycoconjugate Biochemistry, Institute of Zoology, Jagiellonian University, Krakow, Poland.
| | | | | |
Collapse
|
33
|
|
34
|
Krambeck FJ, Bennun SV, Narang S, Choi S, Yarema KJ, Betenbaugh MJ. A mathematical model to derive N-glycan structures and cellular enzyme activities from mass spectrometric data. Glycobiology 2009; 19:1163-75. [PMID: 19506293 PMCID: PMC2757573 DOI: 10.1093/glycob/cwp081] [Citation(s) in RCA: 71] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2009] [Revised: 06/03/2009] [Accepted: 06/03/2009] [Indexed: 01/22/2023] Open
Abstract
Effective representation and characterization of biosynthetic pathways of glycosylation can be facilitated by mathematical modeling. This paper describes the expansion of a previously developed detailed model for N-linked glycosylation with the further application of the model to analyze MALDI-TOF mass spectra of human N-glycans in terms of underlying cellular enzyme activities. The glycosylation reaction network is automatically generated by the model, based on the reaction specificities of the glycosylation enzymes. The use of a molecular mass cutoff and a network pruning method typically limits the model size to about 10,000 glycan structures. This allows prediction of the complete glycan profile and its abundances for any set of assumed enzyme concentrations and reaction rate parameters. A synthetic mass spectrum from model-calculated glycan profiles is obtained and enzyme concentrations are adjusted to bring the theoretically calculated mass spectrum into agreement with experiment. The result of this process is a complete characterization of a measured glycan mass spectrum containing hundreds of masses in terms of the activities of 19 enzymes. In addition, a complete annotation of the mass spectrum in terms of glycan structure is produced, including the proportions of isomers within each peak. The method was applied to mass spectrometric data of normal human monocytes and monocytic leukemia (THP1) cells to derive glycosyltransferase activity changes underlying the differences in glycan structure between the normal and diseased cells. Model predictions could lead to a better understanding of the changes associated with disease states, identification of disease-associated biomarkers, and bioengineered glycan modifications.
Collapse
Affiliation(s)
- Frederick J Krambeck
- Department of Chemical and Biomolecular Engineering, Johns Hopkins University, Baltimore, MD 21218, USA.
| | | | | | | | | | | |
Collapse
|
35
|
Ranzinger R, Frank M, von der Lieth CW, Herget S. Glycome-DB.org: a portal for querying across the digital world of carbohydrate sequences. Glycobiology 2009; 19:1563-7. [PMID: 19759275 DOI: 10.1093/glycob/cwp137] [Citation(s) in RCA: 40] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Despite ongoing harmonization efforts, the major carbohydrate sequence databases following the first initiative in this field, CarbBank, are still isolated islands, with mechanisms for automatic structure exchange and comparison largely missing. This unfavorable situation has been overcome with a systematic data integration effort, resulting in the GlycomeDB, a meta-database for public carbohydrate sequences. It contains at present 35,056 unique structures in GlycoCT encoding, referencing more than 100,000 external records from 1845 different taxonomic sources. We have created a user-friendly, web-based graphical interface which allows taxonomic and structural data to be entered and searched for. The structural search possibilities include substructure search, similarity search, and maximum common substructure. A novel search refinement mechanism allows the assembly of complex queries. With GlycomeDB (www.glycome-db.org), it is now possible to use a single portal to access all digitally encoded, public structural data in glycomics and to perform complex queries with the help of a web-based user interface.
Collapse
Affiliation(s)
- René Ranzinger
- German Cancer Research Center, Molecular Structure Analysis (W160), Molecular Modeling Group, Im Neuenheimer Feld 280, D-69120 Heidelberg, Germany.
| | | | | | | |
Collapse
|
36
|
Hashimoto K, Tokimatsu T, Kawano S, Yoshizawa AC, Okuda S, Goto S, Kanehisa M. Comprehensive analysis of glycosyltransferases in eukaryotic genomes for structural and functional characterization of glycans. Carbohydr Res 2009; 344:881-7. [DOI: 10.1016/j.carres.2009.03.001] [Citation(s) in RCA: 29] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/02/2008] [Revised: 02/24/2009] [Accepted: 03/03/2009] [Indexed: 11/25/2022]
|
37
|
Affiliation(s)
- Thomas Lütteke
- CMBI, NCMLS, Radboud University Nijmegen, P. O. Box 9010, 6500 GL Nijmegen (The Netherlands).
| |
Collapse
|
38
|
Marathe DD, Chandrasekaran EV, Lau JTY, Matta KL, Neelamegham S. Systems-level studies of glycosyltransferase gene expression and enzyme activity that are associated with the selectin binding function of human leukocytes. FASEB J 2008; 22:4154-67. [PMID: 18716032 DOI: 10.1096/fj.07-104257] [Citation(s) in RCA: 37] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/31/2022]
Abstract
The application of systems biology methods in the emerging field of glycomics requires the collection and integration of glycosyltransferase data at the gene and enzyme level for the purpose of hypothesis generation. We systematically examined the relationship between gene expression, glycosyltransferase activity, glycan expression, and selectin-binding function in different systems, including human neutrophils, undifferentiated HL-60 (human promyelocytic cells), differentiated HL-60, and HL-60 synchronized in specific growth phases. Results demonstrate that 1) the sLe(X) (sialyl-Lewis-X) epitope is expressed in P-selectin glycoprotein ligand-1 (PSGL-1) from neutrophils at higher levels compared with HL-60. This variation may be due to differences in the relative activities of alpha1,3-fucosyltransferases and alpha2,3-sialyltransferases in these two cell types. 2) HL-60 cell differentiation along granulocyte lineage increased the activity of beta1,4GalT and beta1,3GlcNAcT by 1.6- to 3.2-fold. This may contribute to LacNAc chain extension as evidenced by the 1.7-fold increase in DSA-lectin (lectin recognizing LacNAc) binding to cells after differentiation. 3) The activity of enzymes contributing to sLe(X) formation in leukocytes likely varies as ST3[Galbeta1,4GlcNAc] < or = alpha1,3FT[sialyl-LacNAc] < beta1,3GlcNAcT. 4) O-glycan specific glycosyltransferase activity does not undergo periodic variation with cell cycle phases. Overall, gene expression and enzyme activity data combined with knowledge of biochemistry can predict the resulting glycan structures and yield viable experimentally testable hypothesis.
Collapse
Affiliation(s)
- Dhananjay D Marathe
- Chemical and Biological Engineering State University of New York at Buffalo, Buffalo, NY 14260, USA
| | | | | | | | | |
Collapse
|
39
|
Herget S, Toukach PV, Ranzinger R, Hull WE, Knirel YA, von der Lieth CW. Statistical analysis of the Bacterial Carbohydrate Structure Data Base (BCSDB): characteristics and diversity of bacterial carbohydrates in comparison with mammalian glycans. BMC STRUCTURAL BIOLOGY 2008; 8:35. [PMID: 18694500 PMCID: PMC2543016 DOI: 10.1186/1472-6807-8-35] [Citation(s) in RCA: 106] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/26/2008] [Accepted: 08/11/2008] [Indexed: 11/24/2022]
Abstract
Background There are considerable differences between bacterial and mammalian glycans. In contrast to most eukaryotic carbohydrates, bacterial glycans are often composed of repeating units with diverse functions ranging from structural reinforcement to adhesion, colonization and camouflage. Since bacterial glycans are typically displayed at the cell surface, they can interact with the environment and, therefore, have significant biomedical importance. Results The sequence characteristics of glycans (monosaccharide composition, modifications, and linkage patterns) for the higher bacterial taxonomic classes have been examined and compared with the data for mammals, with both similarities and unique features becoming evident. Compared to mammalian glycans, the bacterial glycans deposited in the current databases have a more than ten-fold greater diversity at the monosaccharide level, and the disaccharide pattern space is approximately nine times larger. Specific bacterial subclasses exhibit characteristic glycans which can be distinguished on the basis of distinctive structural features or sequence properties. Conclusion For the first time a systematic database analysis of the bacterial glycome has been performed. This study summarizes the current knowledge of bacterial glycan architecture and diversity and reveals putative targets for the rational design and development of therapeutic intervention strategies by comparing bacterial and mammalian glycans.
Collapse
Affiliation(s)
- Stephan Herget
- Core Facility: Molecular Structure Analysis (W160), German Cancer Research Center, Heidelberg, Germany.
| | | | | | | | | | | |
Collapse
|
40
|
van Dijk ADJ, Bosch D, ter Braak CJF, van der Krol AR, van Ham RCHJ. Predicting sub-Golgi localization of type II membrane proteins. ACTA ACUST UNITED AC 2008; 24:1779-86. [PMID: 18562268 PMCID: PMC7110242 DOI: 10.1093/bioinformatics/btn309] [Citation(s) in RCA: 30] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2022]
Abstract
Motivation: Recent research underlines the importance of finegrained knowledge on protein localization. In particular, subcompartmental localization in the Golgi apparatus is important, for example, for the order of reactions performed in glycosylation pathways or the sorting functions of SNAREs, but is currently poorly understood. Results: We assemble a dataset of type II transmembrane proteins with experimentally determined sub-Golgi localizations and use this information to develop a predictor based on the transmembrane domain of these proteins, making use of a dedicated proteinstructure based kernel in an SVM. Various applications demonstrate the power of our approach. In particular, comparison with a large set of glycan structures illustrates the applicability of our predictions on a ‘glycomic’ scale and demonstrates a significant correlation between sub-Golgi localization and the ordering of different steps in glycan biosynthesis. Contact:roeland.vanham@wur.nl Supplementary information:Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- A D J van Dijk
- Applied Bioinformatics, PRI, Wageningen UR, Wageningen, The Netherlands
| | | | | | | | | |
Collapse
|
41
|
|
42
|
(Glyco)sphingolipidology: an amazing challenge and opportunity for systems biology. Trends Biochem Sci 2008; 32:457-68. [PMID: 17928229 DOI: 10.1016/j.tibs.2007.09.004] [Citation(s) in RCA: 98] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/01/2007] [Revised: 07/19/2007] [Accepted: 09/24/2007] [Indexed: 01/08/2023]
Abstract
Sphingolipids are found in essentially all eukaryotes and in some prokaryotes and viruses, where they influence cell structure, signaling and interactions with the extracellular environment. Because of the combinatorial nature of their biosynthesis, the sphingolipidome comprises untold thousands of species that encompass bioactive backbones and complex phospho- and glycolipids. Mass spectrometry is able to analyze a growing fraction of the sphingolipidome and is beginning to provide information about localization. Use of these structure specific, quantitative methods is producing insights, and surprises, regarding sphingolipid structure, metabolism, function and disease. Dealing with such large data sets poses special challenges for systems biology, but the intrinsic and elegant interrelationships among these compounds might provide a key to dealing with the complexity of the sphingolipidome.
Collapse
|
43
|
Hashimoto K, Yoshizawa AC, Okuda S, Kuma K, Goto S, Kanehisa M. The repertoire of desaturases and elongases reveals fatty acid variations in 56 eukaryotic genomes. J Lipid Res 2008; 49:183-91. [DOI: 10.1194/jlr.m700377-jlr200] [Citation(s) in RCA: 132] [Impact Index Per Article: 7.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
|
44
|
Kanehisa M, Araki M, Goto S, Hattori M, Hirakawa M, Itoh M, Katayama T, Kawashima S, Okuda S, Tokimatsu T, Yamanishi Y. KEGG for linking genomes to life and the environment. Nucleic Acids Res 2007; 36:D480-4. [PMID: 18077471 PMCID: PMC2238879 DOI: 10.1093/nar/gkm882] [Citation(s) in RCA: 4453] [Impact Index Per Article: 247.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
KEGG (http://www.genome.jp/kegg/) is a database of biological systems that integrates genomic, chemical and systemic functional information. KEGG provides a reference knowledge base for linking genomes to life through the process of PATHWAY mapping, which is to map, for example, a genomic or transcriptomic content of genes to KEGG reference pathways to infer systemic behaviors of the cell or the organism. In addition, KEGG provides a reference knowledge base for linking genomes to the environment, such as for the analysis of drug-target relationships, through the process of BRITE mapping. KEGG BRITE is an ontology database representing functional hierarchies of various biological objects, including molecules, cells, organisms, diseases and drugs, as well as relationships among them. KEGG PATHWAY is now supplemented with a new global map of metabolic pathways, which is essentially a combined map of about 120 existing pathway maps. In addition, smaller pathway modules are defined and stored in KEGG MODULE that also contains other functional units and complexes. The KEGG resource is being expanded to suit the needs for practical applications. KEGG DRUG contains all approved drugs in the US and Japan, and KEGG DISEASE is a new database linking disease genes, pathways, drugs and diagnostic markers.
Collapse
Affiliation(s)
- Minoru Kanehisa
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto, Japan
| | | | | | | | | | | | | | | | | | | | | |
Collapse
|
45
|
Yamamoto H, Takematsu H, Fujinawa R, Naito Y, Okuno Y, Tsujimoto G, Suzuki A, Kozutsumi Y. Correlation index-based responsible-enzyme gene screening (CIRES), a novel DNA microarray-based method for enzyme gene involved in glycan biosynthesis. PLoS One 2007; 2:e1232. [PMID: 18043739 PMCID: PMC2077928 DOI: 10.1371/journal.pone.0001232] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/19/2007] [Accepted: 11/04/2007] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND Glycan biosynthesis occurs though a multi-step process that requires a variety of enzymes ranging from glycosyltransferases to those involved in cytosolic sugar metabolism. In many cases, glycan biosynthesis follows a glycan-specific, linear pathway. As glycosyltransferases are generally regulated at the level of transcription, assessing the overall transcriptional profile for glycan biosynthesis genes seems warranted. However, a systematic approach for assessing the correlation between glycan expression and glycan-related gene expression has not been reported previously. METHODOLOGY To facilitate genetic analysis of glycan biosynthesis, we sought to correlate the expression of genes involved in cell-surface glycan formation with the expression of the glycans, as detected by glycan-recognizing probes. We performed cross-sample comparisons of gene expression profiles using a newly developed, glycan-focused cDNA microarray. Cell-surface glycan expression profiles were obtained using flow cytometry of cells stained with plant lectins. Pearson's correlation coefficients were calculated for these profiles and were used to identify enzyme genes correlated with glycan biosynthesis. CONCLUSIONS This method, designated correlation index-based responsible-enzyme gene screening (CIRES), successfully identified genes already known to be involved in the biosynthesis of certain glycans. Our evaluation of CIRES indicates that it is useful for identifying genes involved in the biosynthesis of glycan chains that can be probed with lectins using flow cytometry.
Collapse
Affiliation(s)
- Harumi Yamamoto
- Laboratory of Membrane Biochemistry and Biophysics, Graduate School of Biostudies, Kyoto University, Sakyo, Kyoto, Japan
- Supra-Biomolecular System Research Group, RIKEN Frontier Research System, RIKEN, Wako, Saitama, Japan
| | - Hiromu Takematsu
- Laboratory of Membrane Biochemistry and Biophysics, Graduate School of Biostudies, Kyoto University, Sakyo, Kyoto, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Corporation (JST), Kawaguchi, Saitama, Japan
- * To whom correspondence should be addressed. E-mail:
| | - Reiko Fujinawa
- Supra-Biomolecular System Research Group, RIKEN Frontier Research System, RIKEN, Wako, Saitama, Japan
| | - Yuko Naito
- Laboratory of Membrane Biochemistry and Biophysics, Graduate School of Biostudies, Kyoto University, Sakyo, Kyoto, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Corporation (JST), Kawaguchi, Saitama, Japan
| | - Yasushi Okuno
- Department of PharmacoInformatics, Graduate School of Pharmaceutical Sciences, Kyoto University, Sakyo, Kyoto, Japan
| | - Gozoh Tsujimoto
- Department of Genomic Drug Discovery, Graduate School of Pharmaceutical Sciences, Kyoto University, Sakyo, Kyoto, Japan
| | - Akemi Suzuki
- Supra-Biomolecular System Research Group, RIKEN Frontier Research System, RIKEN, Wako, Saitama, Japan
| | - Yasunori Kozutsumi
- Laboratory of Membrane Biochemistry and Biophysics, Graduate School of Biostudies, Kyoto University, Sakyo, Kyoto, Japan
- Supra-Biomolecular System Research Group, RIKEN Frontier Research System, RIKEN, Wako, Saitama, Japan
- Core Research for Evolutional Science and Technology (CREST), Japan Science and Technology Corporation (JST), Kawaguchi, Saitama, Japan
| |
Collapse
|
46
|
Aanensen DM, Mavroidi A, Bentley SD, Reeves PR, Spratt BG. Predicted functions and linkage specificities of the products of the Streptococcus pneumoniae capsular biosynthetic loci. J Bacteriol 2007; 189:7856-76. [PMID: 17766420 PMCID: PMC2168755 DOI: 10.1128/jb.00837-07] [Citation(s) in RCA: 100] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The sequences of the capsular biosynthetic (cps) loci of 90 serotypes of Streptococcus pneumoniae have recently been determined. Bioinformatic procedures were used to predict the general functions of 1,973 of the 1,999 gene products and to identify proteins within the same homology group, Pfam family, and CAZy glycosyltransferase family. Correlating cps gene content with the 54 known capsular polysaccharide (CPS) structures provided tentative assignments of the specific functions of the different homology groups of each functional class (regulatory proteins, enzymes for synthesis of CPS constituents, polymerases, flippases, initial sugar transferases, glycosyltransferases [GTs], phosphotransferases, acetyltransferases, and pyruvyltransferases). Assignment of the glycosidic linkages catalyzed by the 342 GTs (92 homology groups) is problematic, but tentative assignments could be made by using this large set of cps loci and CPS structures to correlate the presence of particular GTs with specific glycosidic linkages, by correlating inverting or retaining linkages in CPS repeat units with the inverting or retaining mechanisms of the GTs predicted from their CAZy family membership, and by comparing the CPS structures of serotypes that have very similar cps gene contents. These large-scale comparisons between structure and gene content assigned the linkages catalyzed by 72% of the GTs, and all linkages were assigned in 32 of the serotypes with known repeat unit structures. Clear examples where very similar initial sugar transferases or glycosyltransferases catalyze different linkages in different serotypes were also identified. These assignments should provide a stimulus for biochemical studies to evaluate the reactions that are proposed.
Collapse
Affiliation(s)
- David M Aanensen
- Department of Infectious Disease Epidemiology, Imperial College London, Room G22, Old Medical School Building, St. Mary's Hospital, Norfolk Place, London W2 1PG, United Kingdom
| | | | | | | | | |
Collapse
|
47
|
Oh M, Yamada T, Hattori M, Goto S, Kanehisa M. Systematic analysis of enzyme-catalyzed reaction patterns and prediction of microbial biodegradation pathways. J Chem Inf Model 2007; 47:1702-12. [PMID: 17516640 DOI: 10.1021/ci700006f] [Citation(s) in RCA: 67] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
The roles of chemical compounds in biological systems are now systematically analyzed by high-throughput experimental technologies. To automate the processing and interpretation of large-scale data it is necessary to develop bioinformatics methods to extract information from the chemical structures of these small molecules by considering the interactions and reactions involving proteins and other biological macromolecules. Here we focus on metabolic compounds and present a knowledge-based approach for understanding reactivity and metabolic fate in enzyme-catalyzed reactions in a given organism or group. We first constructed the KEGG RPAIR database containing chemical structure alignments and structure transformation patterns, called RDM patterns, for 7091 reactant pairs (substrate-product pairs) in 5734 known enzyme-catalyzed reactions. A total of 2205 RDM patterns were then categorized based on the KEGG PATHWAY database. The majority of RDM patterns were uniquely or preferentially found in specific classes of pathways, although some RDM patterns, such as those involving phosphorylation, were ubiquitous. The xenobiotics biodegradation pathways contained the most distinct RDM patterns, and we developed a scheme for predicting bacterial biodegradation pathways given chemical structures of, for example, environmental compounds.
Collapse
Affiliation(s)
- Mina Oh
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Uji, Kyoto 611-0011, Japan
| | | | | | | | | |
Collapse
|
48
|
Mueller M, Martens L, Apweiler R. Annotating the human proteome: Beyond establishing a parts list. BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS 2007; 1774:175-91. [PMID: 17223395 DOI: 10.1016/j.bbapap.2006.11.011] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/03/2006] [Revised: 11/16/2006] [Accepted: 11/21/2006] [Indexed: 12/31/2022]
Abstract
The completion of the human genome has shifted the attention from deciphering the sequence to the identification and characterisation of the functional components, including genes. Improved gene prediction algorithms, together with the existing transcript and protein information, have enabled the identification of most exons in a genome. Availability of the 'parts list' has fostered the development of experimental approaches to systematically interrogate gene function on the genome, transcriptome and proteome level. Studying gene function at the protein level is vital to the understanding of how cells perform their functions as variations in protein isoforms and protein quantity which may underlie a change in phenotype can often not be deduced from sequence or transcript level genomics experiments alone. Recent advancements in proteomics have afforded technologies capable of measuring protein expression, post-translational modifications of these proteins, their subcellular localisation and assembly into complexes and pathways. Although an enormous amount of data already exists on the function of many human proteins, much of it is scattered over multiple resources. Public domain databases are therefore required to manage and collate this information and present it to the user community in both a human and machine readable manner. Of special importance here is the integration of heterogeneous data to facilitate the creation of resources that go beyond a mere parts list.
Collapse
Affiliation(s)
- Michael Mueller
- EMBL Outstation, The European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, UK
| | | | | |
Collapse
|
49
|
Comelli EM, Sutton-Smith M, Yan Q, Amado M, Panico M, Gilmartin T, Whisenant T, Lanigan CM, Head SR, Goldberg D, Morris HR, Dell A, Paulson JC. Activation of murine CD4+ and CD8+ T lymphocytes leads to dramatic remodeling of N-linked glycans. THE JOURNAL OF IMMUNOLOGY 2006; 177:2431-40. [PMID: 16888005 DOI: 10.4049/jimmunol.177.4.2431] [Citation(s) in RCA: 96] [Impact Index Per Article: 5.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Abstract
Differentiation and activation of lymphocytes are documented to result in changes in glycosylation associated with biologically important consequences. In this report, we have systematically examined global changes in N-linked glycosylation following activation of murine CD4 T cells, CD8 T cells, and B cells by MALDI-TOF mass spectrometry profiling, and investigated the molecular basis for those changes by assessing alterations in the expression of glycan transferase genes. Surprisingly, the major change observed in activated CD4 and CD8 T cells was a dramatic reduction of sialylated biantennary N-glycans carrying the terminal NeuGcalpha2-6Gal sequence, and a corresponding increase in glycans carrying the Galalpha1-3Gal sequence. This change was accounted for by a decrease in the expression of the sialyltransferase ST6Gal I, and an increase in the expression of the galactosyltransferase, alpha1-3GalT. Conversely, in B cells no change in terminal sialylation of N-linked glycans was evident, and the expression of the same two glycosyltransferases was increased and decreased, respectively. The results have implications for differential recognition of activated and unactivated T cells by dendritic cells and B cells expressing glycan-binding proteins that recognize terminal sequences of N-linked glycans.
Collapse
Affiliation(s)
- Elena M Comelli
- Departments of Molecular Biology and Molecular and Experimental Medicine, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA 92037, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
50
|
Raman R, Venkataraman M, Ramakrishnan S, Lang W, Raguram S, Sasisekharan R. Advancing glycomics: implementation strategies at the consortium for functional glycomics. Glycobiology 2006; 16:82R-90R. [PMID: 16478800 DOI: 10.1093/glycob/cwj080] [Citation(s) in RCA: 180] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/19/2022] Open
Abstract
Glycomics-an integrated approach to study structure-function relationships of complex carbohydrates (or glycans)-is an emerging field in this age of post-genomics. Realizing the importance of glycomics, many large scale research initiatives have been established to generate novel resources and technologies to advance glycomics. These initiatives are generating and cataloging diverse data sets necessitating the development of bioinformatic platforms to acquire, integrate, and disseminate these data sets in a meaningful fashion. With the consortium for functional glycomics (CFG) as the model system, this review discusses databases and the bioinformatics platform developed by this consortium to advance glycomics.
Collapse
Affiliation(s)
- Rahul Raman
- Center for Biomedical Engineering, Massachusetts Institute of Technology; Harvard-Massachusetts Institute of Technology Division of Health Sciences and Technology, 77 Massachusetts Avenue, Cambridge, MA 02139, USA
| | | | | | | | | | | |
Collapse
|